A.43 Relativistic neutrinos

It is certainly dubious to describe beta decay non­relativisti­cally. Neutrinos are highly relativistic particles with almost zero rest mass. So [15, p. 258] rightly states that it is absurd to treat them non­relativisti­cally. Then he immediately proceeds to do it anyway. (But he confirms the results later using a proper analysis.)

One big problem is that relativisti­cally, spin and orbital angular momentum become mixed-up. That problem was encountered earlier for the photon, which is an even more relativistic particle. Chapter 7.4.3 needed some contortions to talk around that problem.

But note that while the problem for the neutrino is quali­tatively similar, the details are quite different. While the photon is a boson, the neutrino is a fermion. Fermions, unlike bosons, are described by the Dirac equation, chapter 12.12 versus {A.21}.

To understand the Dirac equation requires some linear algebra, in particular matrices. But to understand the discussion here, the infor­mation in the notations section should be plenty.

Consider first the Dirac Hamiltonian eigenvalue problem for the electron and its anti­particle, the positron, in empty space. It is simplest thought of as a system of two equations:

\begin{displaymath}
\fbox{$\displaystyle
m c^2 \vec\psi^- + c {\skew 4\wideh...
...-.5}\vec p}}\cdot\vec\sigma \vec\psi^- = E \vec\psi^+
$}
%
\end{displaymath} (A.271)

Here $c$ is the speed of light and $m$ the rest mass of the electron or positron. So $mc^2$ is the rest mass energy according to Einstein’s famous relation. Further ${\skew 4\widehat{\skew{-.5}\vec p}}$ is the linear momentum operator. And $\vec\sigma$ is the spin angular momentum operator, except for a scale factor. More precisely, $\vec\sigma$ $\vphantom0\raisebox{1.5pt}{$=$}$ ${\skew 6\widehat{\vec S}}$$\raisebox{.5pt}{$/$}$$\frac12\hbar$. The three components of $\vec\sigma$ are the famous “Pauli spin matrices.”

In the non­relativistic limit, $\vec\psi^-$ becomes the wave function of an electron. Recall from chapter 5.5.1 that this wave function can be thought of as a vector with two components; the first component corre­sponds to the spin-up state and the second to the spin-down state. Similarly $\vec\psi^+$ becomes the wave function of a positron.

The non­relativistic limit means mathemati­cally that the rest mass energies $mc^2$ are much larger than the kinetic energies. Or more simply, it is the limit $c\to\infty$. Under those conditions, by approxi­mation,

\begin{displaymath}
m c^2 \vec\psi^- \approx E \vec\psi^-
\qquad
- m c^2 \vec\psi^+ \approx E \vec\psi^+
\end{displaymath}

So in the non­relativistic limit the energy of the electron is approximately the rest mass energy, as it should be. Note however that the value of $E$ for a positron is negative. That should not be taken to mean that the rest mass energy of a positron is negative. It is the same as the one of an electron, positive. The positron is an anti­particle, and the value of $E$ of an anti­particle picks up an extra minus sign because these particles move backward in time. Quantum mechanics relates $E$ to the time derivative as $E\to{\rm i}\hbar\partial/\partial{t}$, as seen in the Schrödinger equation, so a sign change in $E$ is equivalent to a reversal in time.

If you improve the accuracy of the non­relativistic approxi­mations a bit using some mathematical manipul­ation, the energy will also include the classical kinetic energy. To see how that works, (without using more proper matrix eigenvalue methods), premul­tiply the second equation in (A.271) by ${\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\sigma$$\raisebox{.5pt}{$/$}$$2mc$ and add it to the first. In doing so, note that $({\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\sigma)^2$ is simply ${\widehat p}^{\,2}$. That can be verified by multi­plying out the Pauli matrices given in chapter 12.10. In particular,

\begin{displaymath}
({\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\sigma)({\ske...
...igma_j\sigma_i = 0 \mbox{ if } i\ne j
\end{array}
\right.
\end{displaymath}

Taking that into account, you get

\begin{displaymath}
\left(m c^2 + \frac{{\widehat p}^{\,2}}{2m}\right) \vec\ps...
...t{\skew{-.5}\vec p}}\cdot\vec\sigma}{2mc^2} \vec\psi^+\right)
\end{displaymath}

This expression can be rewritten as

\begin{displaymath}
\left(m c^2 + \frac{{\widehat p}^{\,2}}{2m}\right) \left(\...
...{\skew{-.5}\vec p}}\cdot\vec\sigma}{2mc^2} \vec\psi^+ \right)
\end{displaymath}

(If you multiply this out, you get an additional term, but it is negligibly small compared to the rest.) Now note that this is an eigenvalue problem for an electron whose wave function has picked up a little bit of $\vec\psi^+$. You might say that the slightly relativistic electron picks up a bit of a non­relativistic positron wave function. Also its energy has picked up an additional term ${\widehat p}^{\,2}$$\raisebox{.5pt}{$/$}$$2m$, the classical kinetic energy.

Derivation {D.82} shows how take this further, and do it rigorously using proper linear algebra procedures.

But this addendum is not really inter­ested in the non­relativistic limit. Instead it is inter­ested in the ultra­relativistic limit, where it is the rest mass that is negligible compared to the kinetic energy. And for zero rest mass, the Dirac eigenvalue problem (A.271) becomes

\begin{displaymath}
c {\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\sigma \vec\...
...skew{-.5}\vec p}}\cdot\vec\sigma \vec\psi^- = E \vec\psi^+
%
\end{displaymath} (A.272)

To make some sense out of this, define two new partial wave functions as

\begin{displaymath}
\vec\psi_{\rm {R}} = \frac{\vec\psi^- + \vec\psi^+}{2}
\qquad
\vec\psi_{\rm {L}} = \frac{\vec\psi^- - \vec\psi^+}{2}
\end{displaymath}

By taking sums and differen­ces of the ultra­relativistic equations above, you then get
\begin{displaymath}
\fbox{$\displaystyle
c {\skew 4\widehat{\skew{-.5}\vec p...
...dot\vec\sigma \vec\psi_{\rm{L}} = E \vec\psi_{\rm{L}}
$}
%
\end{displaymath} (A.273)

The mathema­tician Weyl first noted how remarkable these equations really are. The equations for the two partial wave functions are not coupled! Partial wave function $\vec\psi_{\rm {L}}$ is not in the equation for $\vec\psi_{\rm {R}}$ and $\vec\psi_{\rm {R}}$ is not in the one for $\vec\psi_{\rm {L}}$. So it seems like each equation might describe a particle whose wave function is a simple two-di­mensional vector. In other words, each equation could describe a particle without a partner. Note incidentally that the second equation is the negative $E$ version of the first. So presumably the second equation could merely describe the anti­particle of the particle of the first equation.

Reasonably speaking, these particles should be electri­cally neutral. After all, their wave functions are equal combin­ations of $\vec\psi^-$, the equivalent of the negatively charged non­relativistic electron in this system, and $\vec\psi^+$, the equivalent of the positively charged non­relativistic positron. So these particles should be neutral as well as massless. In short, they should be neutrinos.

There are a couple of problems however. The Dirac equation includes both the electron and the positron, the time-reversed electron. If you write an equation for a particle that does not inherently include a time-reversed partner, are you violating time-reversal symmetry?

The other problem is what happens if you look at nature in the mirror. Or rather, what you really want to do is “inversion:” swap the positive direction of all three axes in a Cartesian coordinate system. This has the effect of creating a mirror image of nature, since the coordinate system is now left-handed instead of right-handed. And its effects are mathemati­cally easier to describe.

First consider what happens to the original Dirac equation (A.271). The linear momentum operator ${\skew 4\widehat{\skew{-.5}\vec p}}$ introduces a minus sign under inversion. That is because this operator is propor­tional to the gradient operator $\nabla$, and every Cartesian coordinate in $\nabla$ changes sign. However, the spin operators $\vec\sigma$ are like orbital angular momentum, ${\skew0\vec r}$ $\times$ ${\skew 4\widehat{\skew{-.5}\vec p}}$, so they introduce two minus signs, which means no sign change.

(These arguments as presented look at the dot product purely algebrai­cally. If you consider the gradient as a vector, there is an additional sign change since the unit vectors change direction. But the same holds for the $\vec\sigma$ vector in the dot product, so there is no difference in the final answer.)

At face value then, the ${\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\sigma$ terms in the Dirac system change sign. That would mean that nature does not behave in the same way when seen in the mirror; the Dirac equation is not the same. And that is definitely wrong; it is very accurately established that electro­magnetic systems involving electrons do behave in exactly the same way when seen in the mirror.

Fortunately there is a loophole to save the mathematics of the Dirac equation. It can be assumed that either the electron or the positron has negative “intrinsic” parity. In other words it can be assumed that either non­relativistic wave function changes sign under inversion. You can check that if either $\vec\psi^-$ or $\vec\psi^+$ changes sign under inversion, then the Dirac system stays the same under inversion. Which one of the two changes sign is not important, since the sign of the wave function does not make a difference for the physics. It is convention to assume that the anti­particles of fermions have the negative intrinsic parity. In that case, you do not need to worry about intrinsic parity if you have no anti­particles present.

But now reconsider the Hamiltonian eigenvalue problem (A.273) for the neutrinos. The trick no longer works! If you can have one of these particles by itself, not tied to a partner, the physics is no longer the same when seen in the mirror.

There is an important consequence to that. If nature is the same when seen in the mirror, there is a conserved mathematical quantity called “parity.” That is just like there is a conserved quantity called angular momentum because nature behaves in the same way when you look at it in a rotated coordinate system, chapter 7.3. What it boils down to is that Weyl was saying that parity might not be conserved in nature.

At the time, Pauli lambasted Weyl for daring to suggest something so obviously stupid like that. However, Pauli lived long enough to learn in 1956 that experi­ments showed that indeed neutrinos do not conserve parity. (Weyl had already died.)

The fact that the Weyl Hamiltonians do not conserve parity can be described more abstractly. Let ${\mit\Pi}$ be the “parity operator” that expresses what happens to a wave function when the axes are inverted. The above discussion can then be summarized abstractly as

\begin{displaymath}
{\mit\Pi}(c{\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\si...
...widehat{\skew{-.5}\vec p}}\cdot\vec\sigma) {\mit\Pi}\vec\psi
\end{displaymath}

In words, when you invert axes, the ${\skew 4\widehat{\skew{-.5}\vec p}}\cdot\vec\sigma$ operator introduces a minus sign. Also the wave function $\vec\psi$ changes into its mirror image. Now the expression in parentheses is the first Weyl Hamiltonian, or minus the second. So the expression above can be read that if you inter­change the order of the parity operator and either Hamiltonian, it introduces a minus sign. It is said that the parity operator and the Hamiltonian do not commute: the order in which they are applied makes a difference.

It is quite generally true that if an operator does not commute with the Hamiltonian, the corre­sponding quantity is not preserved. See for example chapter 4.5.1, 7.1.4 and {A.19}. So you might ask whether orbital angular momentum is preserved by the Weyl Hamiltonian. If you grind out the commutator of an orbital angular momentum component with the Weyl Hamiltonian using the rules of chapters 4.5.4 and 5.5.3, you find that you get something non­zero. So orbital angular momentum is not conserved by the Weyl neutrinos. That was no big surprise to physicists, because the Dirac equation does not conserve orbital angular momentum either. Similarly, if you work out the commutator of a component of spin, you find that spin is not preserved either.

However, if you add the two operators, you get the net angular momentum, orbital plus spin. That operator does commute with the Weyl Hamiltonians. So the Weyl Hamiltonians do conserve net angular momentum, like the Dirac equation does. That is very fortunate, because without doubt Pauli would have had a fit if Weyl had suggested that nature does not conserve net angular momentum.

But relativity does mix up orbital angular momentum with spin. Often this book describes particles as being in states of specific orbital angular momentum with specific values of the spin. For example, that is how the non­relativistic hydrogen atom was described, chapter 4.3. Such a description is not really justified for particles at relativistic speeds. Fortunately the electron in a hydrogen atom has a kinetic energy much less than its rest mass energy. Things are much worse in beta decay, where the neutrino is emitted at almost the speed of light.

You might ask what else the Weyl Hamiltonians commute with. It can be seen that they commute with the linear momentum operators: different linear momentum operators commute because basically they are just derivatives, and spin commutes with non­spin. Different spin components do not commute, but everything commutes with itself. Since the Hamiltonian only involves the spin component in the direction of the linear momentum, (because of the dot product), the Hamiltonian commutes with the spin in that direction.

It is instructive to form a mental picture of these neutrinos. According to the above, the neutrinos can be in states with definite linear momentum and with definite spin in the direction of that linear momentum. In those states the neutrinos move in a specific direction and also rotate around an axis in the same direction. You can think of it macroscopi­cally as screws; screws too rotate around their axis of motion while they move in or out of their hole.

Now there are two kinds of screws. Normal, “right-handed,” screws move into the hole if you rotate them clockwise with a screwdriver. Some special appli­cations use “left-handed” screws, which are machined so that they move into the hole when you rotate them counter-clockwise. (One of the screws that keep the pedals on a bicycle is normally a left-handed screw, to prevent you from loosening it up when pedaling.)

Experi­ments show that all anti­neutrinos observed in a normal lab setting act like right-handed screws. Neutrinos act like left-handed screws. Since a right-handed screw turns into a left-handed screw when seen in a mirror, nature is not the same when seen in a mirror. If, say, a beta decay is observed in a mirror, the anti­neutrino that comes out is left-handed, rather than right-handed as it should.

(Using arguments like in {D.71}, it can be seen that $\vec\psi_{\rm {R}}$ above is righthanded and $\vec\psi_{\rm {L}}$ left handed, [51, p. 95].)

Note that theoretical physicists do not know about screws. So they came up with a different way to express that anti­neutrinos are right-handed like normal screws, while neutrinos are left-handed. They call the component of spin in the direction of motion, scaled so that its values are $\pm1$, the “helicity.” So according to theoretical physicists, anti­neutrinos as seen in a lab have helicity 1, while neutrinos have helicity $\vphantom0\raisebox{1.5pt}{$-$}$1.

It is ironic that Pauli in turn did not live long enough to learn that Weyl’s neutrinos were wrong on at least one major count. Around 1998, experi­ments found that neutrinos and anti­neutrinos are not massless as was long thought. Their mass is extremely tiny, (far too tiny to be accurately measured by experi­mental methods available at the time of writing), but it is not zero.

Fundamentally, it makes a big difference, because then their speed must be less than the speed of light. Hypotheti­cally, an observer can then move with a speed faster than some anti­neutrino produced in a lab. The anti­neutrino then seems to go backward for that observer. Since the spin is still in the same direction, the observer sees a left-handed anti­neutrino, not a right-handed one. So the handed­ness, or helicity, of anti­neutrinos, and similarly neutrinos, is not fundamental. A process like the beta decay of an nucleus moving at a speed extremely close to the speed of light could produce a left-handed anti­neutrino trailing the nucleus. But since this does not happen for a nucleus at rest, nature is still not the same when seen in the mirror.

At the time of writing, the precise nature of neutrinos is not yet fully understood. Usually, it is assumed that neutrinos are distinct from anti­neutrinos. Neutrinos for which that is true are called Dirac neutrinos. However, so-called Majorana neutrinos would be their own anti­particles; neutrinos and anti­neutrinos would simply differ in the spin state.