Uniformly choose two derangements $\sigma_i,\sigma_j$. What is the distribution of $\sigma_i\circ \sigma_j$?

Incomplete proof (that the conjecture is true).

Let $\sigma$ be some target permutation, and we seek $p(\sigma) = P(\sigma_i \circ \sigma_j = \sigma)$. The event is equivalent to $\sigma_i^{-1} \circ \sigma = \sigma_j$. So the question becomes two-part:

Is $\sigma_i^{-1} \circ \sigma$ a derangement?
Conditioned on $\sigma_i^{-1} \circ \sigma$ being a derangement, what is the probability that $\sigma_j$ equals that derangement? The answer to the 2nd question is simply $1/|D_n|$. I.e.

$$p(\sigma) = P(\sigma_i^{-1} \circ \sigma \in D_n) \times \frac1{|D_n|}$$

This explains why the identity $\sigma^*$ is favored: $P(\sigma_i^{-1} \circ \sigma^* \in D_n) = 1$. It has in fact the maximum $p(\sigma)$ among all target $\sigma$'s.

Now, $\sigma_1^{-1}$ is just a random derangement, so the question becomes calculating, for any given target $\sigma$, the probability $f(\sigma) = P(\pi \circ \sigma \in D_n)$ where $\pi$ is a random derangement, and we will have $p(\sigma) = f(\sigma) / |D_n|$.

The rest of this answer is non-rigorous. I would imagine the main thing (the only thing?) that matters is the number of fixed points of $\sigma$. The more fixed points, the higher $f(\sigma)$, because:

Any fixed point of $\sigma$, e.g. $\sigma(i) = i$, will certainly not be fixed any more in $\pi \circ \sigma$, i.e. $\pi(\sigma(i)) \neq \sigma(i) = i$.
However, any non-fixed point of $\sigma$, e.g. $\sigma(i) = j \neq i$, might unluckily become fixed in $\pi \circ \sigma$, i.e. $\pi(\sigma(i)) = i$, if $\pi$ happens to "undo" $\sigma$ at that point, i.e. $\pi(j) = i$.
And of course if $\pi \circ \sigma$ has a fixed point then it is $\notin D_n$. I.e. every non-fixed point of $\sigma$ is a sort of "potential reason" to violate $\pi \circ \sigma \in D_n$, thereby lowering $f(\sigma)$.

Anyway, based on this non-rigorous argument, the lowest $f(\sigma)$ would happen if $\sigma$ has no fixed points, i.e. it is itself a derangement. Without loss, any derangement would do for $\sigma$ in this case, and $f(\sigma)$ becomes the probability that two derangements compose to make a third derangement -- which is exactly addressed in this answer. According to it, the asymptotic value of the probability is $1/e$.

I.e., if you believe that answer (it's too advanced for me to check), and if you believe me that $\sigma$ being itself a derangement is the "worst case", then your conjecture is true, with bounds $c = 1/e^2, C = 1/e$.

Incidentally, I tried using the union bound, but as expected it is not strong enough. Let $k$ denote the number of non-fixed points of $\sigma$ (i.e. $n-k = $ no. of fixed points of $\sigma$), and let $\sigma(i) = j \neq i$ denote a typical such non-fixed point. We have:

$$\pi \circ \sigma \in D_n \iff \bigcap \pi(j) \neq i$$

$$P(\pi \circ \sigma \in D_n) = 1 - P(\bigcup \pi(j) = i) \ge 1 - k P(\pi(j) = i) = 1 - {k \over n-1}$$

where the intersection and union are both over all the non-fixed points of $\sigma$, and $P(\pi(j) = i) = 1/(n-1)$ by symmetry. However, this bound not only fails for $k=n$ (i.e. the "hard" case of $\sigma$ being a derangement), it also does not give an asymptotic bound away from $0$ for e.g. $k = n-10$.

I'll find an approximation for the distribution below, which proves your guess is right. Note that for any given $\pi \in S_n$, $\mathbb{P}[\sigma_i \circ \sigma_j = \pi]$ is $\frac{1}{|D_n|^2}$ times the number $N(\pi)$ of derangements $\sigma$ such that $\pi^{-1} \circ \sigma$ is also a derangement. Equivalently, $N(\pi)$ is the number of $\sigma \in S_n$ such that $\sigma(x) \neq x, \pi(x)$ for all $x \in [n]$. We can find an expression for $N(\pi)$ using an inclusion-exlusion argument similar to the standard one used to find the number of derangements. We have $$N(\pi) = \sum_{A \subset [n]} (-1)^{|A|} \#\{\sigma \in S_n \mid \sigma(a) \in \{a, \pi(a)\} \, \forall a \in A\}.$$ Now, given a set $A \subset [n]$ and an injective function $f : A \to [n]$ such that $f(a) \in \{a, \pi(a)\}$ for all $a \in A$, there are exactly $(n-|A|)!$ choices of $\sigma \in S_n$ which agree with $f$ on $A$, so we can write $$\#\{\sigma \in S_n \mid \sigma(a) \in \{a, \pi(a)\} \, \forall a \in A\} = F_A(\pi)(n-|A|)!$$ where $F_A(\pi)$ is the number of such functions. This gives $$N(\pi) = \sum_{A \subset [n]} (-1)^{|A|} F_A(\pi)(n-|A|)! = n! \sum_{k=0}^n \frac{(-1)^k}{k!} \binom{n}{k}^{-1} \sum_{|A| = k} F_A(\pi).$$ Clearly each $F_A(\pi) \leq 2^k$, so $\binom{n}{k}^{-1} \sum_{|A| = k} F_A(\pi) \leq 2^k$, so these terms decrease extremely quickly: $$\left|\frac{N(\pi)}{n!} - \sum_{k=0}^m \frac{(-1)^k}{k!} \binom{n}{k}^{-1} \sum_{|A| = k} F_A(\pi) \right| \leq \sum_{k=m+1}^n \frac{2^k}{k!} \leq \frac{2^m}{m!} \sum_{k=1}^\infty \frac{2^k}{m^k} \leq \frac{2^m}{m!}$$ holds for $m \geq 4$.

Now we find an approximation for the largest terms. Note that $k! \sum_{|A| = k} F_A(\pi)$ is the number of sequences of pairs $(a_1, b_1), \dots, (a_k, b_k) \in [n]^2$ such that all $a_i$ are distinct, all $b_i$ are distinct, and $b_i \in \{a_i, \pi(a_i)\}$ for each $i$. Call a sequence $a_1, \dots, a_k$ good if all $\{a_i, \pi(a_i)\}$ are disjoint (so in particular all $a_i$ are distinct), and note that for a good sequence, the number of corresponding sequences of pairs is $\prod_{i=1}^k |\{a_i, \pi(a_i)\}|$. As before, for any (not necessarily good) sequence $a_1, \dots, a_k$, the number of corresponding sequences of pairs is at most $2^k$. Finally, the number of sequences $a_1, \dots, a_k$ which are not good is at most $2k^2 n^{k-1}$ (since a sequence is not good if there are indices $i$ and $j$ with either $a_i = a_j$ or $a_i = \pi(a_j)$). Thus $$\left|k!\sum_{|A| = k}F_A(\pi) - \sum_{a_1, \dots, a_k} \prod_{i=1}^k |\{a_i, \pi(a_i)\}|\right| \leq 2^{k+1}k^2 n^{k-1}$$ but $\sum_{a_1, \dots, a_k} \prod_{i=1}^k |\{a_i, \pi(a_i)\}| = (\sum_a |\{a, \pi(a)\}|)^k = (2n-t)^k$, where $t$ is the number of fixed points of $\pi$. Letting $c = t/n$, we have $\left|\frac{k!}{n^k} \sum_{|A| = k}F_A(\pi) - (2-c)^k \right| \leq \frac{2^{k+1}k^2}{n}.$ This can be written as $\frac{k!}{n^k} \sum_{|A| = k}F_A(\pi) = (2-c)^k + O(\frac{2^k k^2}{n})$, and so since $\binom{n}{k}^{-1} = \frac{k!}{n^k}(1 + O(\frac{k^2}{n}))$ for small $k$, we get $$\binom{n}{k}^{-1} \sum_{|A| = k}F_A(\pi) = (2-c)^k + O\left(\frac{2^k k^2}{n}\right)$$ hence $\sum_{k=0}^m (-1)^k \binom{n}{k}^{-1} \sum_{|A| = k}F_A(\pi) = \sum_{k=0}^m \frac{(c-2)^k}{k!} + O(1/n)$. Noting that $|\sum_{k=0}^\infty \frac{(c-2)^k}{k!} - \sum_{k=0}^m \frac{(c-2)^k}{k!}| \leq \frac{2^m}{m!}$ for $m \geq 4$ as well, and taking $m$ large enough so that $2^m/m! \leq 1/n$, we get $$\frac{N(\pi)}{n!} = \sum_{k=0}^\infty \frac{(c-2)^k}{k!} + O(1/n) = e^{c-2} + O(1/n)$$ where the error term is uniform over all $\pi$. Thus since $|D_n| = n! e^{-1}(1 + O(1/n))$, we have $$\mathbb{P}[\sigma_i \circ \sigma_j = \pi] = \frac{N(\pi)}{|D_n|^2} = \frac{1}{n!} (e^c + O(1/n))$$ where again, $c = c(\pi) = t/n$ is the fraction of $[n]$ fixed by $\pi$.

Uniformly choose two derangements $\sigma_i,\sigma_j$. What is the distribution of $\sigma_i\circ \sigma_j$?

Tags:

Probability

Permutations

Group Theory

Probability Distributions

Symmetric Groups

Related

Recent Posts