Q: What is a paired t-test and when should I use it?

A paired t-test is a one-sample test applied to the within-subject differences. You compute the difference for each matched pair, then test whether the mean of those differences differs significantly from zero (or another null value). Use it when observations come in natural pairs: the same person measured twice, twin studies, left-eye versus right-eye, or matched case-control designs. Because pairing removes between-subject variability from the error, a paired test is more sensitive than an independent two-sample test when the pairing is genuine.

Q: Why does the calculator need at least two observations?

Degrees of freedom equal n minus 1, so with only one observation there would be zero degrees of freedom and no way to estimate variability. A meaningful sample standard deviation also requires at least two values. With n below 2 the t-test is undefined, so the calculator returns no result until you enter a valid sample size.

Q: What is the p-value and what does it not tell me?

The p-value is the probability of obtaining a test statistic at least as extreme as the one computed, assuming the null hypothesis is exactly true. It is not the probability that the null hypothesis is true, nor the probability of making an error if you reject it. A p-value below your alpha threshold is evidence against the null, but it says nothing about the size or practical importance of the effect. Always pair the p-value with an effect size and confidence interval for a complete report.

Question 1

What is the difference between a one-tailed and two-tailed t-test?

Accepted Answer

A two-tailed test asks whether the mean is different from the reference value in either direction and splits the significance level equally between both tails of the distribution. A one-tailed test focuses the entire alpha on one direction: left-tailed tests whether the mean is significantly less than the reference, right-tailed tests whether it is significantly greater. One-tailed tests are statistically more powerful in the hypothesized direction but inappropriate unless you have a strong prior reason to rule out differences in the opposite direction. If in doubt, use two-tailed.

Question 2

Why does this calculator use the Welch correction by default for two-sample tests?

Accepted Answer

The classic Student's two-sample t-test assumes both groups have the same population variance. When that assumption is violated, the test can have an inflated false-positive rate. The Welch correction adjusts the degrees of freedom using the Welch-Satterthwaite equation, which accounts for differing variances without requiring you to assume they are equal. Simulation studies show that the Welch test performs at least as well as the pooled test even when variances are equal, and much better when they are not, making it the recommended default in most statistical guidelines including those of the American Psychological Association.

Question 3

How do I interpret Cohen's d?

Accepted Answer

Cohen's d is the mean difference expressed in standard deviation units. A value of 0.5 means the two means are half a standard deviation apart. Jacob Cohen's original benchmarks (small 0.2, medium 0.5, large 0.8) are rough guides: real-world interpretation should account for the domain, measurement precision, and practical consequences of the effect. Effect sizes matter most when sample sizes are large enough to detect trivially small differences at p < 0.05, or when comparing studies with different sample sizes.

Question 4

What is a paired t-test and when should I use it?

Accepted Answer

A paired t-test is a one-sample test applied to the within-subject differences. You compute the difference for each matched pair, then test whether the mean of those differences differs significantly from zero (or another null value). Use it when observations come in natural pairs: the same person measured twice, twin studies, left-eye versus right-eye, or matched case-control designs. Because pairing removes between-subject variability from the error, a paired test is more sensitive than an independent two-sample test when the pairing is genuine.

Question 5

Why does the calculator need at least two observations?

Accepted Answer

Degrees of freedom equal n minus 1, so with only one observation there would be zero degrees of freedom and no way to estimate variability. A meaningful sample standard deviation also requires at least two values. With n below 2 the t-test is undefined, so the calculator returns no result until you enter a valid sample size.

Question 6

What is the p-value and what does it not tell me?

Accepted Answer

The p-value is the probability of obtaining a test statistic at least as extreme as the one computed, assuming the null hypothesis is exactly true. It is not the probability that the null hypothesis is true, nor the probability of making an error if you reject it. A p-value below your alpha threshold is evidence against the null, but it says nothing about the size or practical importance of the effect. Always pair the p-value with an effect size and confidence interval for a complete report.

df	alpha = 0.10	alpha = 0.05	alpha = 0.01	alpha = 0.001
1	6.314	12.706	63.657	636.619
2	2.92	4.303	9.925	31.599
5	2.015	2.571	4.032	6.869
10	1.812	2.228	3.169	4.587
20	1.725	2.086	2.845	3.85
30	1.697	2.042	2.75	3.646
60	1.671	2	2.66	3.46
inf (z)	1.645	1.96	2.576	3.291

T-Test Calculator

Your details

Formula

Worked example

Which t-test should you use?

Reading the t statistic and p-value

Effect size and confidence intervals

Assumptions and when to check them

Common critical t values (two-tailed)

Frequently asked questions

Sources