Why would classical correlation in Bell's experiment be a linear function of angle?

I think you misunderstood the significance of could for a classical theory. The text below the picture you took from Wikipedia says: "Many other possibilities exist for the classical correlation subject to these side conditions", so classicality does not imply linearity. It does, however, rule out the cosine, by the following (slightly heuristic) argument:

Classical means heuristically "all measurement results exist, whether there is measurement or not".

Take a polarizer at an angle $\theta$. Classical/local hidden theories insist that the probabilities $P(A_\theta = A_\phi)$ that the photon that passed through at angle $\theta$ would have passed through at an angle $\phi$ through the same polarizer exist all at the same time. Note that it is important that this is the probability to detect the quantum particle - if we were just talking about continuous field strength, as your projection argument would imply, the following probabilistic argument would not work. It is, however, experimentally shown that you indeed measure single incident photons.

Now, a basic probabilistic law says that

$$ P(x = z) \ge P(x = y) + P(y = z) - 1$$

If we now divide $[\theta_0,\theta_N]$ into $N$ equally big intervals of length $\Delta\theta = \frac{\theta_N - \theta_0}{N}$ with angles $\theta_0,\dots,\theta_{N+1}$, we get:

$$ P(A_{\theta_0} = A_{\theta_N}) \ge \sum_{i = 0}^N P(A_{\theta_i} = A_{\theta_{i+1}}) - N$$

But the cosine probability $P(A_{\theta_i} = A_{\theta_{i+1}})$ does not depend on the absolute value of these angles, so every summand is $P(A_0 = A_{\Delta\theta}) = \mathrm{cos}^2(\Delta\theta)$ and we have that a local hidden theory demands:

$$ \mathrm{cos}^2(\theta_0 - \theta_N) \ge (N + 1)\mathrm{cos}^2(\Delta\theta) - N$$

Take a total angle difference of $\theta_0 - \theta_N = 90°$ and $N = 89$, and you get that

$$ 0 \ge 90\mathrm{cos}^2(1°) - 89$$

which anyone with a calculator can prove false. Therefore, the assumption that all $P(A_{\theta_i} = A_{\theta_{i+1}})$ exist without making the measurement is false, since the cosine is what we measure.


Bell's argument makes very weak assumptions about the behavior of the two particles (which is why it's interesting). In effect, the particles are black boxes that take an angle as input and produce a spin direction as output. There is no restriction on how they choose the spin direction; there could be a source of true randomness in there, or a human being who makes the decision. The only limitations are that neither box is told what angle was given to the other box, and if both boxes are given the same angle, they must return opposite results.

Each box could have a secret "real spin axis" in it (pointing opposite to the other box's axis) and upon being told the measurement axis it could compute the $\cos^2$ of the angle between those axes. However, it can't return that as the result, because the result has to be either "up" or "down". It could return "up" with probability equal to the cosine squared, and "down" otherwise. But then if both boxes were given the same measurement axis, but it was not the "real" axis, there would be a nonzero probability that they would return the same answer, which violates the requirement that they always return opposite answers in that case.

If you think about it, there is no alternative but to pre-decide the result each box will produce for each angle, since there is no other way to ensure they will always match. So "measurement results are predetermined" is not an assumption of the theorem, it is just the only apparent way to comply with the requirements given some seemingly self-evident assumptions about reality.

Bell proved an over-general result that is unnecessarily hard to understand. You don't need a continuum of measurement angles to get a nonclassical result, just three. With three angles, the argument above shows that there are only $2^3=8$ possible "answer strategies" for the boxes, which we can write UUU, UUD, UDU, ..., DDD (where U means the first box says "up" and the second "down", and D is the reverse). Two of those, UUU and DDD, lead to the boxes always disagreeing. The other six are all equivalent under permutations and exchange of U and D, and they lead to the boxes agreeing 2/3 of the time when the angles are different. So 2/3 agreement is the highest possible in a classical world. But in a quantum world, measuring Bell-pair electrons along axes of 0°, 120° and 240° gives agreement 3/4 of the time.