If the gambler's fallacy is false, how do notions of "expected number" of events work?

But we do expect a 5 on any single spin. Or, at least, we expect $1\%$ of a 5 (if that even makes sense).

More seriously, though, on any single spin the probability of getting a 5 is low. However, that is exactly outweighed by how much smaller the wait until the next 5 becomes if you get a 5 next.

In $99\%$ of cases, you will get a not-5 on the first spin, and in those cases you are expected to spin a total of 101 times before you see your first 5 (including the first failed spin that just failed). However, in $1\%$ of cases, you spin a 5, and in those cases you are expected to spin 1 time before you get your first 5. These cancel out to give a total of 100 expected spins.

As for question 2, that's because you can have two or more 5's appear. The possibility of two or more 5's within the first 50 spins, but still an expected number of 0.5 5's means the probability of any 5 at all must be less than $0.5$: $$ 0.5=\text{Expected number of 5's}=1\cdot P(\text{one 5})+2\cdot P(\text{two 5's})+\cdots\\ \implies P(\text{at least one 5})=P(\text{one 5})+P(\text{two 5's})+\cdots<0.5 $$


It's worth noting that, as you flip the spinner, you have more information than you did before - which changes your expectations. A simpler example, without expectation, would be that if you flip a fair coin twice, there is a $1/4$ chance that both flips are heads. However, we can think about what happens after the first flip:

If our first flip lands heads, then there is now a $1/2$ chance that both flips will be. If not, there is a $0$ chance of that. Prior to the first flip, we know that there is a $1/2$ chance of landing in either of these two cases, so the total probability is $1/2\cdot 1/2 + 1/2\cdot 0=1/4$.

A similar thing happens with your example: suppose we let $X$ be the number of times $5$ comes up in $100$ spins of the spinner. Most of the time - $99/100$ times to be precise, the first spin is not $5$, and, given this, we now only have $99$ spins left, so expect $X$ to be $99/100$. On the other hand, however, if $5$ does come up, we now expect $X$ to be $1+99/100$ - and averaging these two cases with their probabilities does indeed show that we expect $X$ to be $1$ overall.

Basically, you see that if you fail to get a $5$ on the first round, then your odds of seeing a $5$ have shifted downwards - but this is perfectly balanced by the less likely event that you do see a $5$. This is the same as in your second example with probabilities - yes, as soon as we see that we didn't get a $5$, we still think we need the same number of further spins, but if we get a $5$, we only used $1$ spin which is way below what we thought we'd need - and balances things.

It's worth noting that expectation is a precise mathematical term that may not perfectly align to what you'd like it to mean intuitively. It does not say anything about the most likely event - for instance, if you flipped a fair coin, the expected number of heads is $1/2$, but that's not even a possible outcome. Expectation just says "look at this value over all possible ways things could play out. Average them, weighted according to probability."

This also tells us why the probabilities are not the expectations: if we make $50$ trials, the expected number of $5$'s being $1/2$ could equally well mean "There is a $99/100$ chance that there were no $5$'s, but there's a $1/100$ chance that there were $200$ instances of $5$" or "There is a $1/2$ chance that there were no $5$'s and a $1/2$ chance that there was one five" - with the truth in this case lying in between those two somewhat absurd cases. Basically, cases where there are lots of $5$'s get counted disproportionately, where probability would count them equally to the case where there is just one $5$.

As for the paradox that no number is likely, but some number always exists, this is the same deal for probability: here are two variants of a game you might play:

Guess a number. Spin the wheel. You win if they are equal.

In this game, you will only win with probability $1/100$ because you have no information. The low probability measures this game. A related game is the following:

Spin the wheel. Guess a number. You win if they are equal.

This game you can always win because you just read off what number was spun! The relevant probability here is more like "What's the probability you spun a $5$, given that you spun a $5$" - which is $1$. You just need to be careful about exactly what you already know if you're dealing with probabilities - otherwise seemingly paradoxical results start to appear.


Before the first spin, the expected number of occurrences of $5$ in the first $100$ spins is $1$.

Suppose the first spin yields a value not equal to $5$.

If we are given that information, then:

  • The expected number of occurrences of $5$ in the first $100$ spins (spins $1$ through $100$) is now less than $1$ (more precisely, it's equal to ${\large{\frac{99}{100}}}$).$\\[4pt]$
  • However the expected number of occurrences of $5$ in the next $100$ spins (spins $2$ through $101$) is equal to $1$.

Extra information can change the probability distribution of a random variable, hence can change its expectation.

Regarding the second question . . .

Assume independent spins, each yielding a random element of $\{1,...,n\}$, with all values equally likely.

Let $X$ be the number of spins until the occurrence of a given value, say $1$.

For each positive integer $k$, let $x_k=P(X=k)$.

Letting $p={\large{\frac{1}{n}}}$, we get $$ x_k=(1-p)^{k-1}p \qquad\;\;\;\;\; $$ and the mean of $X$ is given by \begin{align*} E(X)& =1x_1+2x_2+ 3x_3+\cdots\\[1pt] &=\sum_{k=1}^\infty kx_k\\[1pt] &=\sum_{k=1}^\infty k(1-p)^{k-1}p\\[1pt] &=p\sum_{k=1}^\infty k(1-p)^{k-1}\\[1pt] &=p{\,\cdot\,}\frac{1}{p^2}\\[1pt] &=\frac{1}{p}\\[3pt] &=n\\[1pt] \end{align*} If the distribution of $X$ is was symmetrical, the median would be equal to the mean (but not half of the mean).

However the distribution of $X$ is not symmetrical, so we can't infer the median of $X$ just from the knowledge that the mean of $X$ is $n$.

For the case $n=100$,

enter image description here

the mean of $X$ is $100$, whereas the median of $X$ is $69$ which is less than the mean.

But in any case, there's no good reason to expect the median of $X$ to be exactly half of the mean.