Lorentz transformation of Gamma matrices $\gamma^{\mu}$

I think the clearest way to think about this is to say that the gamma matrices don't transform. In other words, the fact that they carry a vector index doesn't mean that they form a four vector. This is analogous to how the Pauli matrices work in regular quantum mechanics, so let me talk a little bit about that.

Suppose you have a spin $1/2$ particle in some state $|\psi\rangle$. You can calculate the mean value of $\sigma_x$ by doing $\langle \psi | \sigma_x | \psi\rangle$. Now let's say you rotate your particle by an angle $\theta$ around the $z$-axis. (Warning: There is about a 50% chance my signs are incorrect.) You now describe your particle with a different ket, given by $|\psi'\rangle = \exp(-i \sigma_z \theta /2)$. Remember that we are leaving the coordinates fixed and rotating the system, as is usually done in quantum mechanics. Now the expectation value is given by

$$\langle \psi' | \sigma_x | \psi' \rangle = \langle \psi |\, e^{i\sigma_z \theta /2}\, \sigma_x\, e^{-i \sigma_z \theta / 2}\, | \psi\rangle$$

There is a neat theorem, not too hard to prove, that says that

$$e^{i\sigma_z \theta /2}\, \sigma_x\, e^{-i \sigma_z \theta / 2} = \cos \theta\, \sigma_x -\sin \theta\, \sigma_y$$

So it turns out that the expectation value for the rotated system is also given by $\langle \psi |\, \cos \theta\, \sigma_x -\sin \theta\, \sigma_y \, |\psi\rangle = \cos \theta\, \langle \sigma_x \rangle - \sin \theta\, \langle \sigma_y \rangle$. It's as if we left our particle alone and rotated the Pauli matrices. But note that if we apply the rotation to $|\psi\rangle$, then we don't touch the matrices. Also, I never said that I transformed the matrices. I just transformed the state, and then found out that I could leave it alone and rotate the matrices.

The situation for a Dirac spinor is similar. The analogous identity is that $S(\Lambda) \gamma^\mu S^{-1}(\Lambda) \Lambda^\nu_{\ \mu} = \gamma^\nu$. This is just something that is true; nobody said anything about transforming $\gamma^\mu$. There's no $\gamma^\mu \to \dots$ here.

Now let's take the Dirac equation, $(i \gamma^\mu \partial_\mu - m)\psi = 0$, and apply a Lorentz transformation. This time I will change coordinates instead of boosting the system, but there's no real difference. Let's say we have new coordinates given by $x'^\mu = \Lambda^\mu_{\ \nu} x^\nu$, and we want to see if the Dirac equation looks the same in those coordinates. The field $\psi'$ as seen in the $x'^\mu$ frame is given by $\psi' = S(\Lambda) \psi \iff \psi = S^{-1}(\Lambda) \psi'$, and the derivatives are related by $\partial_\mu = \Lambda^\nu_{\ \mu} \partial'_\nu$. Plugging in we get $(i\gamma^\mu \Lambda^\nu_{\ \mu} \partial'_\nu-m) S^{-1}(\Lambda)\psi' = 0$, which doesn't really look like our original equation. But let's multiply on the left by $S(\Lambda)$. $m$ is a scalar so $S$ goes right through it and cancels with $S^{-1}$. And in the first term we get $S(\Lambda)\gamma^\mu S^{-1}(\Lambda) \Lambda^\nu_{\ \mu}$, which according to our trusty identity is just $\gamma^\nu$. Our equation then simplifies to

$$(i\gamma^\mu \partial'_\mu - m)\psi'=0$$

This is the same equation, but written in the primed frame. Notice how the gamma matrices are the same as before; when you're in class and the teacher writes them on the board, you don't need to ask in what coordinate system they are valid. Everyone uses the same gamma matrices. They're not really a four-vector, but their "transformation law" guarantees that anything written as if they were a four vector is Lorentz invariant as long as the appropiate spinors are present.


The gamma matrices do not change if one does not apply a change of representation (e.g., chiral -> standard) along with the Lorenz transformation. Recall that you can write Dirac's equation in any frame with gamma-matrices in the same (e.g., chiral) representation. If you change the representation of them by using an invertible matrix $\gamma^\mu \to S\gamma^\mu S^{-1}$ (which is not a Lorenz transformation, i.e. a scalar in this sense) the spinor also transforms by S: $\Psi \to S\Psi$. This is the source of confusion: is $S=I$ then your equation

$$\gamma^\mu\to S[Λ]\gamma^\mu S[Λ]^{-1} =Λ^\mu_\nu \gamma^\nu $$

should read as

$$ S[Λ]\gamma^\mu S[Λ]^{-1} = Λ^\mu_\nu \gamma^\nu. $$

I hope it helped to exorcize the confusion.