Question 1

What does the parallelizable fraction mean?

Accepted Answer

The parallelizable fraction (p) is the proportion of your total workload that can be divided across multiple processors and run simultaneously. The remaining fraction (1 - p) must run serially: typically initialization, synchronization barriers, file I/O that cannot be overlapped, and any code that depends on the output of a previous step. A fraction of 0.9 means 90% of the work can run in parallel and 10% must run sequentially. You can estimate 1 - p by profiling your application, adding up the time spent in sections that cannot be parallelized, and dividing by the total runtime.

Question 2

What is the maximum speedup, and why can't I exceed it?

Accepted Answer

The maximum speedup is 1 / (1 - p), the limit of the Amdahl speedup as the number of processors approaches infinity. With p = 0.8, the maximum is 1 / 0.2 = 5x, no matter how many cores you use. This hard ceiling exists because the serial portion always takes the same absolute time: more cores can shrink the parallel portion's contribution toward zero, but the serial time remains. This is why reducing the serial fraction is usually more valuable than buying more hardware.
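A minimal Python sketch of the arithmetic above. It assumes the standard Amdahl formula S(n) = 1 / ((1 - p) + p / n), which the answer implies but does not write out; the function names are illustrative, not part of any library.

```python
def amdahl_speedup(p: float, n: int) -> float:
    """Speedup on n processors when a fraction p of the work is parallelizable."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be between 0 and 1")
    if n < 1:
        raise ValueError("n must be at least 1")
    # Serial part (1 - p) is unchanged; parallel part p is divided across n.
    return 1.0 / ((1.0 - p) + p / n)


def max_speedup(p: float) -> float:
    """Ceiling on speedup as n grows without bound: 1 / (1 - p)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1) for a finite ceiling")
    return 1.0 / (1.0 - p)


p = 0.8
for n in (1, 8, 64, 1024):
    print(f"n={n:5d}  speedup={amdahl_speedup(p, n):.2f}x")
print(f"ceiling for p={p}: {max_speedup(p):.2f}x")
```

Even at 1024 processors the p = 0.8 case reaches only about 4.98x, still under the 5x ceiling.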

Question 3

What is parallel efficiency and what is a good value?

Accepted Answer

Parallel efficiency is speedup divided by the number of processors, usually expressed as a percentage. It tells you how productively each core is being used. An efficiency of 100% means perfect linear scaling: doubling cores exactly doubles speed. As a rule of thumb, values above 85% are excellent, 60-85% are good, 35-60% are moderate, and below 35% means most of the added capacity sits idle while the serial section runs. Under Amdahl's model, efficiency falls with every core you add whenever p < 1, so there is a practical sweet spot between raw speedup and cost.
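The definition and the rule-of-thumb bands can be sketched in Python as follows. The speedup function assumes the standard Amdahl formula; the `rating` helper, the label "poor" for the lowest band, and the treatment of exact boundary values are illustrative assumptions.

```python
def amdahl_speedup(p: float, n: int) -> float:
    """Standard Amdahl speedup for parallel fraction p on n processors."""
    return 1.0 / ((1.0 - p) + p / n)


def parallel_efficiency(p: float, n: int) -> float:
    """Speedup per processor, as a fraction between 0 and 1."""
    return amdahl_speedup(p, n) / n


def rating(efficiency: float) -> str:
    """Map an efficiency (0..1) onto the rule-of-thumb bands.

    The label "poor" and the handling of exact boundaries are assumptions.
    """
    if efficiency > 0.85:
        return "excellent"
    if efficiency >= 0.60:
        return "good"
    if efficiency >= 0.35:
        return "moderate"
    return "poor"


p = 0.9
for n in (2, 4, 8, 16, 32):
    eff = parallel_efficiency(p, n)
    print(f"n={n:2d}  speedup={amdahl_speedup(p, n):5.2f}x  "
          f"efficiency={eff:6.1%}  ({rating(eff)})")
```

For p = 0.9, efficiency drops from about 91% at 2 cores to 40% at 16 cores, which is the diminishing return the answer describes.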

Question 4

How do I find my workload's parallelizable fraction?

Accepted Answer

Profile your application using a performance profiler such as perf (Linux), Instruments (macOS), or VTune (Intel). Identify the sections that must run strictly sequentially and sum their execution times. Divide that serial time by the total runtime of the same single-core run to get 1 - p; subtract from 1 to get p. Alternatively, run the workload on 1, 2, 4, and 8 cores and fit the measured speedups to the Amdahl formula to back out p empirically.
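Both estimation routes can be sketched in a few lines of Python. This assumes the standard Amdahl formula, rearranged as 1 - 1/S = p * (1 - 1/n), so that p is the slope of a line through the origin. The function names and the sample measurements are hypothetical, and a least-squares fit in this transformed space is only one simple choice among several.

```python
def p_from_profile(serial_seconds: float, total_seconds: float) -> float:
    """Route 1: p = 1 - (serial time / total single-core runtime)."""
    if total_seconds <= 0 or not 0 <= serial_seconds <= total_seconds:
        raise ValueError("need 0 <= serial_seconds <= total_seconds, total > 0")
    return 1.0 - serial_seconds / total_seconds


def p_from_scaling(speedups: dict[int, float]) -> float:
    """Route 2: least-squares fit of p from measured speedups.

    speedups maps core count n to measured speedup S(n) relative to 1 core.
    Uses y = p * x with x = 1 - 1/n and y = 1 - 1/S (Amdahl rearranged).
    """
    num = 0.0
    den = 0.0
    for n, s in speedups.items():
        x = 1.0 - 1.0 / n
        y = 1.0 - 1.0 / s
        num += x * y
        den += x * x
    if den == 0.0:
        raise ValueError("need at least one measurement with n > 1")
    return num / den


# Hypothetical numbers, for illustration only.
print(f"from profile: p = {p_from_profile(12.0, 120.0):.3f}")
measured = {2: 1.8, 4: 3.0, 8: 4.6}
print(f"from scaling: p = {p_from_scaling(measured):.3f}")
```

Real measurements include communication and scheduling overhead that Amdahl's model ignores, so a fitted p is best read as an effective value for the core counts you tested.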

Question 5

What is the difference between Amdahl's Law and Gustafson's Law?

Accepted Answer

Amdahl's Law assumes a fixed problem size and asks how much faster you can finish it with more processors. Gustafson's Law assumes fixed wall-clock time and asks how much more work you can do. For latency-critical applications (serving a web request, rendering a single frame), Amdahl is the right model. For throughput-oriented or scientific workloads where you scale the problem with the hardware (larger simulation grids, more Monte Carlo trials), Gustafson's Law gives a more optimistic and often more accurate picture.
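To make the contrast concrete, here is a small Python comparison. It assumes the usual textbook forms: Amdahl S(n) = 1 / ((1 - p) + p / n) and Gustafson's scaled speedup S(n) = (1 - p) + p * n. Note that in Gustafson's form p is the parallel fraction of the time measured on the n-processor run, so using the same numeric p in both formulas is a simplification for illustration.

```python
def amdahl_speedup(p: float, n: int) -> float:
    """Fixed problem size: how much faster does the same job finish?"""
    return 1.0 / ((1.0 - p) + p / n)


def gustafson_speedup(p: float, n: int) -> float:
    """Fixed wall-clock time: how much more work fits in the same time?

    Here p is the parallel fraction of the runtime on the n-processor system.
    """
    return (1.0 - p) + p * n


p = 0.9
for n in (8, 64, 512):
    print(f"n={n:3d}  Amdahl={amdahl_speedup(p, n):6.2f}x  "
          f"Gustafson={gustafson_speedup(p, n):7.2f}x")
```

The Amdahl column flattens out below 1 / (1 - p), while the Gustafson column keeps growing roughly linearly with n, which is why it paints the more optimistic picture for workloads that grow with the hardware.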

Question 6

Can the speedup ever exceed the maximum?

Accepted Answer

No, not within the Amdahl model. The formula guarantees S <= 1 / (1 - p) for any finite number of processors. In real systems a phenomenon called super-linear speedup can occasionally occur when more processors mean more total cache, reducing cache misses and making the parallel version faster per operation than the serial baseline. This is outside the scope of Amdahl's model, which assumes the only gain comes from parallel execution.

Amdahl speedup limits by parallel fraction

Parallel fraction (p)	Max speedup (infinite cores)	Speedup at 8 cores	Speedup at 64 cores
50% (0.50)	2.00x	1.78x	1.97x
75% (0.75)	4.00x	2.91x	3.82x
80% (0.80)	5.00x	3.33x	4.71x
90% (0.90)	10.00x	4.71x	8.77x
95% (0.95)	20.00x	5.93x	15.42x
99% (0.99)	100.00x	7.48x	39.26x
