Question 1

What does accuracy mean in statistics?

Accepted Answer

In statistics, accuracy is how close a result is to the true value. In classification (machine learning, diagnostic testing), it is the proportion of all predictions that are correct: (TP + TN) / (TP + TN + FP + FN), where TP, TN, FP, and FN are the counts of true positives, true negatives, false positives, and false negatives. In experimental science, accuracy is quantified as 100% minus the percent error between a measured value and the accepted true value.
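
As a minimal sketch of the classification formula, the function below computes accuracy from the four confusion-matrix counts. The function name and the example counts are hypothetical, chosen only for illustration.

```python
def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Proportion of correct predictions: (TP + TN) / (TP + TN + FP + FN)."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("at least one prediction is required")
    return (tp + tn) / total


# Hypothetical counts: 85 of 100 predictions are correct.
example = accuracy(tp=40, tn=45, fp=5, fn=10)
print(f"{example:.1%}")  # 85.0%
```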

Question 2

What is a good accuracy for a classification model?

Accepted Answer

There is no universal threshold: it depends on the application, the class balance, and the cost of errors. As a rough guide, 95% and above is often considered excellent, 85-94% good, 70-84% fair, and below 70% usually signals a model that needs improvement. For imbalanced datasets (e.g. fraud, rare disease), accuracy alone is misleading; look at the F1 score or precision-recall AUC instead.

Question 3

What is the difference between precision and recall?

Accepted Answer

Precision measures how trustworthy positive predictions are: TP / (TP + FP). Recall measures how complete positive detection is: TP / (TP + FN). A model tuned for high precision avoids false alarms but may miss real positives. A model tuned for high recall catches most positives but may raise more false alarms. The F1 score is the harmonic mean of precision and recall; it always lies between the two and sits closer to the lower one.
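
The trade-off can be seen by scoring the same examples at two decision thresholds. This is only a sketch: the labels, scores, and threshold values are made up for illustration, and the helper name is hypothetical.

```python
def precision_recall(labels, scores, threshold):
    """Return (precision, recall) when scores >= threshold count as positive."""
    preds = [s >= threshold for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and not p)
    # Precision is undefined when nothing is predicted positive;
    # recall is undefined when there are no actual positives.
    precision = tp / (tp + fp) if (tp + fp) else float("nan")
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    return precision, recall


# Hypothetical model scores and true labels (1 = positive).
scores = [0.95, 0.90, 0.80, 0.60, 0.55, 0.40, 0.30, 0.20]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

strict = precision_recall(labels, scores, threshold=0.85)
lenient = precision_recall(labels, scores, threshold=0.35)
print(f"strict:  precision={strict[0]:.2f}, recall={strict[1]:.2f}")
print(f"lenient: precision={lenient[0]:.2f}, recall={lenient[1]:.2f}")
```

With these made-up numbers, the strict threshold raises no false alarms but finds only half of the positives, while the lenient one finds all of them at the cost of two false alarms.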

Question 4

Why is accuracy misleading for imbalanced data?

Accepted Answer

If 99% of your data is "negative", a model that always predicts "negative" achieves 99% accuracy without being useful at all. In that case, recall exposes the real situation: the model has 0% recall, meaning it never detects any positive case. Its precision is undefined because it makes no positive predictions, and its F1 score is reported as 0. Always check precision and recall alongside accuracy, especially when one class is rare.
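
The sketch below reproduces that scenario with a made-up dataset of 1,000 examples, 10 of which are positive, and a "model" that always predicts negative. The helper name and the dataset size are assumptions for illustration.

```python
def confusion_counts(labels, preds):
    """Return (tp, tn, fp, fn) for binary labels and predictions (1 = positive)."""
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    return tp, tn, fp, fn


# Hypothetical imbalanced dataset: 1% positive, 99% negative.
labels = [1] * 10 + [0] * 990
preds = [0] * len(labels)  # always predict "negative"

tp, tn, fp, fn = confusion_counts(labels, preds)
acc = (tp + tn) / len(labels)
rec = tp / (tp + fn)

print(f"accuracy = {acc:.1%}")  # 99.0%
print(f"recall   = {rec:.1%}")  # 0.0%
```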

Question 5

What is the prevalence-adjusted accuracy formula?

Accepted Answer

When a study sample does not reflect the real proportion of positive cases in the population, raw accuracy from the sample is biased. The prevalence-adjusted formula corrects this: Accuracy = (Sensitivity x Prevalence) + (Specificity x (1 - Prevalence)). For example, a test with 75% sensitivity and 90% specificity applied to a population with 10% prevalence gives an adjusted accuracy of 0.75 x 0.10 + 0.90 x 0.90 = 0.075 + 0.81 = 0.885, or 88.5%.
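
A minimal sketch of the same calculation, using the sensitivity, specificity, and prevalence values from the example above. The function name and the input validation are assumptions, not part of the formula.

```python
def prevalence_adjusted_accuracy(sensitivity: float,
                                 specificity: float,
                                 prevalence: float) -> float:
    """Sensitivity x Prevalence + Specificity x (1 - Prevalence).

    All three inputs are expected as proportions between 0 and 1.
    """
    for name, value in (("sensitivity", sensitivity),
                        ("specificity", specificity),
                        ("prevalence", prevalence)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")
    return sensitivity * prevalence + specificity * (1.0 - prevalence)


adjusted = prevalence_adjusted_accuracy(0.75, 0.90, 0.10)
print(f"{adjusted:.1%}")  # 88.5%
```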

Question 6

How do I calculate percent error and percent accuracy?

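Accepted Answer

Percent error = |measured value - true value| / |true value| x 100. Percent accuracy = 100% - percent error. For example, a measurement of 48 against an accepted value of 50 has a percent error of 2 / 50 x 100 = 4% and a percent accuracy of 96%.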
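
A minimal sketch of the percent error calculation, assuming the common definition (absolute difference between the measured and true values, divided by the absolute true value, times 100) and taking percent accuracy as 100% minus that error. The function names and the measurement values are hypothetical.

```python
def percent_error(measured: float, true_value: float) -> float:
    """|measured - true| / |true| x 100 (assumed definition)."""
    if true_value == 0:
        raise ValueError("percent error is undefined when the true value is 0")
    return abs(measured - true_value) / abs(true_value) * 100.0


def percent_accuracy(measured: float, true_value: float) -> float:
    """100% minus the percent error."""
    return 100.0 - percent_error(measured, true_value)


# Hypothetical measurement: 48 against an accepted value of 50.
print(f"percent error    = {percent_error(48, 50):.1f}%")     # 4.0%
print(f"percent accuracy = {percent_accuracy(48, 50):.1f}%")  # 96.0%
```

Under this definition, percent accuracy becomes negative once the error exceeds 100%, so it is most meaningful for measurements reasonably close to the true value.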

Question 7

What is the F1 score and when should I use it?

Accepted Answer

The F1 score is the harmonic mean of precision and recall: 2 x (Precision x Recall) / (Precision + Recall). It ranges from 0% to 100% and is highest when both precision and recall are high. Use F1 when you want a single metric that balances both, particularly when the dataset is imbalanced or when false positives and false negatives are both costly.
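
To illustrate why F1 is highest only when both inputs are high, the sketch below compares it with a plain arithmetic mean. The precision and recall values are made up, and returning 0 when both inputs are 0 is a convention assumed here.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0  # assumed convention: F1 is 0 when both inputs are 0
    return 2 * precision * recall / (precision + recall)


# Hypothetical models: one balanced, one with high precision but poor recall.
balanced = f1_score(0.80, 0.80)
lopsided = f1_score(0.95, 0.10)
lopsided_mean = (0.95 + 0.10) / 2

print(f"balanced F1 = {balanced:.3f}")        # 0.800
print(f"lopsided F1 = {lopsided:.3f}")        # 0.181
print(f"lopsided arithmetic mean = {lopsided_mean:.3f}")  # 0.525
```

The lopsided model looks mediocre under an arithmetic mean but clearly poor under F1, because the harmonic mean is pulled toward the lower of the two values.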

Classification performance thresholds (general guidance)

Accuracy range	Interpretation	Typical use cases
95% - 100%	Excellent	Production ML models, clinical diagnostics
85% - 94%	Good	Most business applications, research models
70% - 84%	Fair	Baseline models, early prototypes
Below 70%	Poor	Needs improvement; check for class imbalance
