Skip to content
Statistics

F-Statistic Calculator

Enter two sample standard deviations and sizes to test whether their underlying variances are equal, or switch to regression mode and supply the sum-of-squared residuals for your full and restricted models. The calculator returns the F-statistic, degrees of freedom, exact p-value, critical value, and a plain-English conclusion. A step-by-step panel shows every arithmetic operation so you can check the working in detail.

Your details

Variance ratio: compare two sample variances. Regression: test joint significance of excluded coefficients. F to p-value: enter an F-statistic directly.
The probability threshold for rejecting the null hypothesis, commonly 0.05 or 0.01.
One-tailed (right) is standard for ANOVA and regression. Two-tailed is used when testing whether variances differ in either direction.
The sample standard deviation of the first group (square root of the sample variance).
Number of observations in the first group. Minimum 2.
The sample standard deviation of the second group.
Number of observations in the second group. Minimum 2.
F-statisticNot significant
1.601

Ratio of the two mean squared quantities under test

Numerator df24
Denominator df29
p-value0.1131
Critical F-value1.9005
DecisionFail to reject H₀: insufficient evidence that variances differ (p ≥ α).
1.60127.4% above · F

F = 1.601: not statistically significant at α = 0.05

  • The F-statistic follows an F(24, 29) distribution under the null hypothesis.
  • The p-value is 0.1131: the probability of this result (or more extreme) if H₀ is true.
  • Sample 1 variance (153.7600) is 1.60x the Sample 2 variance (96.0400).

Next stepNo significant variance difference: a pooled t-test (equal-variance assumption) is acceptable for means comparison.

What is the F-statistic?

The F-statistic is the ratio of two mean-squared quantities, each following a chi-squared distribution scaled by its degrees of freedom. Under the null hypothesis, the ratio follows an F-distribution parametrised by two degrees-of-freedom values: the numerator df (df1) and the denominator df (df2). A large F-value means the numerator is much larger than expected by chance, which is evidence against the null hypothesis. The F-distribution is right-skewed and always non-negative, so large positive values are the ones that challenge H0.

Two-sample variance test (Snedecor F-test)

The most direct application of the F-statistic is testing whether two populations have the same variance. You compute S1-squared / S2-squared where S1 and S2 are the two sample standard deviations. Under H0 (equal variances) this ratio follows an F(n1-1, n2-1) distribution. A p-value below your significance level (commonly 0.05) means the evidence favours unequal variances. This test is a standard pre-check before running a two-sample t-test: if variances are unequal, use Welch's t-test rather than the pooled version. Assumptions: both samples are drawn from normally distributed populations and observations are independent.

Regression and Wald F-test

In linear regression, the F-statistic tests whether a set of J coefficient restrictions holds jointly. You fit the unrestricted (full) model and the restricted model, record their sums of squared residuals (SSR_F and SSR_R), and form F = [(SSR_R - SSR_F) / J] / [SSR_F / (N - K)]. The numerator is the average increase in residuals per restriction; the denominator is the baseline mean-squared error of the full model. A significant F means at least one of the J restrictions is false, so the restricted model is misspecified. A common special case is testing whether all slope coefficients are zero, which is the "overall model F" reported by most regression packages.

Interpreting the p-value and critical value

Two decision rules produce the same conclusion. Rule 1: compare the p-value to alpha. If p < alpha, reject H0. Rule 2: compare the computed F to the critical value F_crit. If F > F_crit, reject H0. The critical value is the F-quantile that cuts off an area equal to alpha in the right tail of the distribution, so the two rules are mathematically equivalent. Reporting the exact p-value is generally preferred over a binary reject/fail-to-reject statement because it lets readers apply their own significance threshold. Common alpha levels are 0.10 (marginal), 0.05 (conventional), and 0.01 (stringent).

Approximate critical F-values (right-tailed, α = 0.05)

df₁ \ df₂10203060120
14.964.354.174.003.92
24.103.493.323.153.07
33.713.102.922.762.68
43.482.872.692.532.45
53.332.712.532.372.29
102.982.352.161.991.91

Rows = numerator df (df₁), columns = denominator df (df₂). Reject H₀ if F > critical value.

Frequently asked questions

What does a large F-statistic mean?

A large F-statistic means the variance in the numerator (between-group variation, or the reduction in residuals from removing the restrictions) is much larger than the variance in the denominator (within-group noise, or the baseline residual variance). The larger F is, the more evidence there is against the null hypothesis. Whether a given value counts as "large" depends on the degrees of freedom, which is why you always need the p-value or critical value to make a decision.

What is the difference between F and t statistics?

For a two-tailed t-test with df degrees of freedom, the square of the t-statistic equals an F(1, df) statistic. The F-test generalises the t-test by testing multiple restrictions simultaneously. When you have just one restriction (for example, testing one regression coefficient), F equals t-squared and the two tests give the same p-value. For two or more restrictions you need the F-test because t cannot handle joint hypotheses.

Does the variance F-test require normally distributed data?

Yes, the Snedecor F-test for equality of variances is sensitive to non-normality. With moderate departures from normality the p-value can be misleading. For non-normal data, Levene's test or the Brown-Forsythe test are more robust alternatives. The regression F-test is less sensitive to non-normality in large samples because of the central limit theorem, but small-sample regression F-tests also assume normally distributed errors.

What is the null hypothesis of the F-test?

For the variance test, H0 is that both populations have the same variance (sigma-1-squared = sigma-2-squared), so the ratio equals 1. For the regression Wald test, H0 is that all J excluded coefficients are jointly zero, meaning the restrictions hold in the true model. In ANOVA, H0 is that all group means are equal, which is equivalent to saying the between-group variance is no larger than expected by chance.

Can the F-statistic be less than 1?

Yes. If the sample variance of group 1 is smaller than that of group 2, the ratio S1-squared / S2-squared is less than 1. This is valid: small F-values favour the null hypothesis. Some textbooks and software always place the larger variance in the numerator to keep F >= 1, but this changes the hypothesis and can be misleading. This calculator uses sample 1 as the numerator and sample 2 as the denominator, exactly as you enter them.

Sources

Written by Dr. Hannah Brandt, PhD Statistician · Munich, Germany

Applied statistician translating rigorous probability theory into clear, accurate tools for researchers and practitioners.

Search 3,500+ calculators

Loading search…