P-Value Calculator (Z-Test)
Calculate the p-value from a Z-score (or standard normal test statistic) for one-tailed or two-tailed hypothesis tests. Furthermore, results are automatically evaluated at α = 0.05, 0.01, and 0.001 significance thresholds with a plain-English interpretation.
How to use the P-Value Calculator (Z-Test)
Type the standardised test statistic. Furthermore, for a one-sample Z-test: Z = (x̄ − µ₀)/(σ/√n). For comparing proportions: Z = (p̂₁ − p̂₂)/SE. Large |Z| values give small p-values.
Two-tailed (H₁: µ ≠ µ₀): p = 2 × P(Z > |z|). One-tailed upper (H₁: µ > µ₀): p = P(Z > z). Furthermore, one-tailed lower: p = P(Z < z). Choose the tail based on your alternative hypothesis, not on what looks significant.
p-value appears alongside significance at three standard alpha thresholds. Moreover, the insight explains the correct interpretation of p — it is not the probability that H₀ is true.
p < 0.05: reject H₀ at 5% level (statistically significant). p < 0.01: highly significant. Furthermore, p ≥ 0.05 means insufficient evidence to reject H₀, not that H₀ is true.
Z = (x̄ − µ₀)/(σ/√n) requires known population σ. Furthermore, for unknown σ, use the t-distribution with n−1 degrees of freedom — the t-distribution approaches Z for large n.
Variants, options and when to use each
| Z-score | Two-tailed p | Interpretation |
|---|---|---|
| 1.645 | 0.100 | Significant at α = 0.10 only |
| 1.960 | 0.050 | Critical value for α = 0.05 |
| 2.326 | 0.020 | Significant at α = 0.02 |
| 2.576 | 0.010 | Critical value for α = 0.01 |
| 3.291 | 0.001 | Critical value for α = 0.001 |
The formula explained
Φ(Z) = standard normal CDF = P(Z ≤ z)
Z = standardised test statistic
α = significance level (threshold for rejecting H₀)
The p-value is the probability of obtaining a test statistic as extreme or more extreme than the observed value, assuming H₀ is true. Furthermore, for a two-tailed test, both tails are included: p = 2 × P(Z > |z|). For a one-tailed test, only the relevant tail is used. Moreover, the p-value does NOT measure the probability that H₀ is true — this is one of the most common misinterpretations in statistics.
Worked example — Z = 2.5, two-tailed test
| Step | Calculation | Result |
|---|---|---|
| Φ(2.5) | P(Z ≤ 2.5) | 0.9938 |
| P(Z > 2.5) | 1 − 0.9938 | 0.00621 |
| Two-tailed p | 2 × 0.00621 | p = 0.01242 |
| Significant at α=0.05? | 0.01242 < 0.05 | Yes |
What is a p-value in statistics?
The p-value is the probability, under the null hypothesis H₀, of observing a test statistic as extreme or more extreme than the one calculated from the data. Furthermore, a small p-value means the observed data is unlikely under H₀ — providing evidence against H₀. The conventional significance level α = 0.05 means we reject H₀ when p < 0.05.P-values are derived from the null distribution of the test statistic — in this calculator, the standard normal (Z) distribution. Moreover, the Z-test applies when the population standard deviation σ is known and the sample size is large (n > 30). For unknown σ and small samples, the t-distribution should be used instead.
The p-value does not equal the probability that H₀ is true. Additionally, this is the most common misinterpretation. A p-value is a conditional probability — conditioned on H₀ being true. Bayesian posterior probabilities P(H₀|data) require a prior P(H₀) and are mathematically distinct from p-values.
Who uses this calculator?
Scientists and researchers use p-values in hypothesis testing across all quantitative fields — clinical trials, psychology experiments, biology, and engineering. Furthermore, regulatory bodies (FDA, EMA) require p < 0.05 as a minimum threshold for drug efficacy claims. Quality engineers use p-values in process capability testing and SPC (statistical process control). Moreover, data scientists use p-values in A/B testing for website and product optimisation.
Historical context and related concepts
The p-value was introduced by Karl Pearson in the early 1900s and formalised by Ronald Fisher in "Statistical Methods for Research Workers" (1925). Furthermore, Fisher proposed p < 0.05 as a conventional threshold for significance. Neyman and Pearson (1933) formalised the complementary framework of Type I and Type II errors. Moreover, the p < 0.05 threshold has been debated extensively — the American Statistical Association issued a statement (2016) cautioning against over-reliance on p-values alone.
Why p-value calculation is central to scientific inference and drug approval
Regulatory approval of new drugs requires randomised controlled trials with p < 0.05 (often with multiple hypothesis correction). Furthermore, a new drug showing efficacy with p = 0.01 in a Phase III trial provides strong evidence for approval. Moreover, p-values guide resource allocation in research — statistically significant findings receive further investigation and publication.P-values and the replication crisis in science
Many scientific fields have experienced a "replication crisis" — published findings with p < 0.05 often fail to replicate. Furthermore, this arises from p-hacking (selectively reporting significant results), small sample sizes (underpowered studies), and publication bias. Moreover, the solution includes pre-registration of hypotheses, larger samples, reporting effect sizes alongside p-values, and raising significance thresholds (some researchers advocate p < 0.005 as the new standard for discovery claims).
Frequently asked questions
Related tools
Z-Score Calculator
Calculate Z-score from a value, mean, and standard deviation. Furthermore, Z-score is the input to this p-value calculator.
→Significant Figures Calculator
Round p-values appropriately — report to 2–3 significant figures. Moreover, exact p-values are more informative than dichotomous "significant/not significant".
→Bacteria Generation Time Calculator
Statistical analysis of growth data. Furthermore, biological experiments use p-values to test treatment effects on growth rates.
→Log Reduction Calculator
Statistical validation of disinfection log reductions. Moreover, disinfectant efficacy claims require statistically significant p-values.
→Normal Distribution Calculator
Convert Z to percentile and p-value. Furthermore, p-value = 1 − Φ(Z) for one-tailed upper test.
→Scientific Notation Converter
Very small p-values (p = 1.2×10⁻⁸) require scientific notation. Moreover, genomic association studies regularly report p-values below 10⁻⁷.
→