This note needs more work, but as far as I know is basically OK. Unfortunately, a derivation of electron capture for zero spin transitions is not included.
This note derives the Fermi theory of beta decay. In particular, it gives the ballparks that were used to create figure 14.52. It also describes the Fermi integral plotted in figure 14.50, as well as Fermi’s (second) golden rule.
When beta decay was first observed, it was believed that the nucleus simply ejected an electron. However, problems quickly arose with energy and momentum conservation. To solve them, Pauli proposed in 1931 that in addition to the electron, also a neutral particle was emitted. Fermi called it the “neutrino,” for “small neutral one.” Following ideas of Pauli in 1933, Fermi in 1934 developed a comprehensive theory of beta decay. The theory justifies the various claims made about allowed and forbidden beta decays. It also allows predictions of the decay rate and the probability that the electron and antineutrino will come out with given kinetic energies. This note gives a summary. The ballparks as described in this note are the ones used to produce figure 14.52.
A large amount of work has gone into improving the accuracy of the Fermi theory, but it is outside the scope of this note. To get an idea of what has been done, you might start with [22] and work backwards. One point to keep in mind is that the derivations below are based on expanding the electron and neutrino wave functions into plane waves, waves of definite linear momentum. For a more thorough treatment, it may be a better idea to expand into spherical waves, because nuclear states have definite angular momentum. That idea is worked out in more detail in the note on gamma decay, {A.25}. That is news to the author. But it was supposed to be there, I think.
A classical quantum treatment will not do for beta decay. To see why,
note that in a classical treatment the wave function state before the
decay is taken to be of the form
You might think that maybe the electron and antineutrino were always
there to begin with. But that has some major problems. A lone
neutron falls apart into a proton, an electron and an antineutrino.
So supposedly the neutron would consist of a proton, an electron, and
an antineutrino. But to confine light particles like electrons and
neutrinos to the size of a nucleon would produce huge kinetic
energies. According to the Heisenberg uncertainty relation
![]()
![]()
,
,
Further, a high-energy antineutrino can react with a proton to create a neutron and a positron. That neutron is supposed to consist of a proton, an electron, and an antineutrino. So, following the same reasoning as before, the original proton before the reaction would consist of a positron, an electron, and a proton. That proton in turn would supposedly also consist of a positron, an electron, and an proton. So the original proton consists of a positron, an electron, a positron, an electron, and a proton. And so on until a proton consists of a proton and infinitely many electron / positron pairs. Not just one electron with very high kinetic energy would need to be confined inside a nucleon, but an infinite number of them, and positrons to boot. And all these electrons and positrons would somehow have to be prevented from annihilating each other.
It just does not work. There is plenty of solid evidence that
neutrons and protons each contain three quarks, not other
nucleons along with electrons, positrons, and neutrinos. The electron
and antineutrino are created out of pure energy during beta decay, as
allowed by Einstein’s famous relativistic expression
.
In particular, it is necessary to deal mathematically with the
appearance of the electron and an antineutrino out of nothing. To do
so, a more general, more abstract way must be used to describe the
states that nature can be in. Consider a decay that produces an
electron and an antineutrino of specific momenta
,
.
| (A.274) |
Similarly, the state before the decay is written as
| (A.275) |
(You could also add kets for different momentum states that the final electron and antineutrino are not in after the decay. But states that have zero electrons and neutrinos both before and after the considered decay are physically irrelevant and can be left away.)
That leaves the nuclear part of the wave function. You could use
Fock-space kets to deal with the disappearance of a neutron and
appearance of a proton during the decay. However, there is a neater
way. The total number of nucleons remains the same during the decay.
The only thing that happens is that a nucleon changes type from a
neutron into a proton. The mathematical trick is therefore to take the
particles to be nucleons, instead of protons and neutrons. If you
give each nucleon a “nucleon type” property, then the
only thing that happens during the decay is that the nucleon type of
one of the nucleons flips over from neutron to proton. No nucleons
are created or destroyed. Nucleon type is typically indicated by the
symbol
and is defined to be
if the nucleon is
a proton and
if the nucleon is a neutron. (Some older
references may define it the other way around.) The general form of
the nuclear wave function therefore becomes
Of course, the name “nucleon type” for
is not
really acceptable, because it is understandable. In the old days, the
names “isobaric spin” or “isotopic spin”
were used, because nucleon type has absolutely nothing to do with
spin. However, it was felt that these nonsensical names could cause
some smart outsiders to suspect that the quantity being talked about
was not really spin. Therefore the modern term “isospin”
was introduced. This term contains nothing to give the secret away
that it is not spin at all.
Next the source of the decay must be identified. Ultimately that must be the Hamiltonian, because the Hamiltonian describes the time evolution of systems according to the Schrödinger equation.
In a specific beta decay process, two states are involved. A state
describes the nucleus before the decay, and a state
describes the combination of nucleus, electron, and antineutrino after
the decay. That makes the system into a so-called “two state
system.” The unsteady evolution of such systems was discussed
in chapter 7.6 and {D.38}. The key to the
solution were the “Hamiltonian coefficients.” The first
one is:
The Hamiltonian coefficient for the final state is similarly
The Hamiltonian coefficient that describes the interaction between the
two states is crucial, because it is the one that causes the decay.
It is
If
is zero, no decay will occur. And most of the Hamiltonian
does not produce a contribution to
.
,
Unfortunately, Fermi had no clue what
was. He assumed that beta
decay would not be that much different from the better understood
decay of excited atomic states in atoms. Gamma decay is the direct
equivalent of atomic decay for excited nuclei. Beta decay is
definitely different, but maybe not that different. In atomic decay
an electromagnetic photon is created, rather than an electron and
antineutrino. Still the general idea seemed similar.
In atomic decay
is essentially proportional to the product of the
charge of the excited electron, times the spatial eigenstate of the
photon, times a “photon creation” operator
:
There is another term in
that involves an annihilation operator
instead of a creation operator. An annihilation operator
destroys photons. However, that does not produce a contribution to
;
would be zero without the creation
operator. Looked at the other way around, the presence of the
creation operator in the Hamiltonian ensures that the final state must
have one more photon for the decay to occur. (See addendum
{A.15} for more details on electromagnetic interactions,
including a more precise description of
.
Fermi assumed that the general ideas of atomic decay would also hold
for beta decay of nuclei. Electron and antineutrino creation
operators in the Hamiltonian would turn the zero-electron and
zero-antineutrino kets into one-electron and one-antineutrino ones.
Then the inner products of the kets are equal to one pairwise.
Therefore both the creation operators and the kets drop out of the
final expression. In that way the Hamiltonian coefficient simplifies
to
To write expressions for the wave functions of the electron and antineutrino, you face the complication that unbound states in infinite space are not normalizable. That produced mathematical complications for momentum eigenstates in chapter 7.9.2, and similar difficulties resurface here. To simplify things, the mathematical trick is to assume that the decaying nucleus is not in infinite space, but in an extremely large “periodic box.” The assumption is that nature repeats itself spatially; a particle that exits the box through one side reenters it through the opposite side. Space “wraps around” if you want, and opposite sides of the box are assumed to be physically the same location. It is like on the surface of the earth: if you move along the straightest-possible path on the surface of the earth, you travel around the earth along a big circle and return to the same point that you started out at. Still, on a local scale, the surface on the earth looks flat. The idea is that the empty space around the decaying nucleus has a similar property, in each of the three Cartesian dimensions. This trick is also commonly used in solid mechanics, chapter 10.
In a periodic box, the wave function of the antineutrino is
The wave function of the electron will be written in a similar way:
The usual way to deal with the problem is to stick with the exponential electron wave function for now, and fix up the problem later in the final results. The fix-up will be achieved by throwing in an additional fudge factor. While “Fermi fudge factor” alliterates nicely, it does not sound very respectful, so physicists call the factor the “Fermi function.”
The bottom line is that for now
That leaves the still unknown operator
.
is simply defined so that the operator
has a magnitude
that is of order one. That means that
should never be
greater than about one, though it could be much less if
and
turn out to be almost
orthogonal. It is found that
has a rough value of about 100 eV
![]()
![]()
The question of allowed and forbidden decays is directly related to
the Hamiltonian coefficient
,
First note that the emitted electron and antineutrino have quite
small momentum values, on a nuclear scale. In particular,
in their combined wave function
Since the first term in the Taylor series is by far the largest, you
would expect that the value of
,
However, clearly this approximation does not work if the value of
is zero for some reason:
If the simplified coefficientIf the decay is forbidden, higher order terms in the Taylor series will have to be used to come up with a nonzero value foris nonzero, the decay is allowed. If it is zero, the decay is forbidden.
Why would a decay not be allowed? In other words why would
and
be exactly
orthogonal? If you took two random wave functions for
and
,
and
are not
random wave functions. They satisfy a significant amount of symmetry
constraints.
One very important one is symmetry with respect to coordinate system
orientation. An inner product of two wave functions is independent of
the angular orientation of the coordinate system in which you evaluate
it. Therefore, you can average the inner product over all directions
of the coordinate system. However, the angular variation of a wave
function is related to its angular momentum; see chapter
7.3 and its note. In particular, if you average a wave
function of definite angular momentum over all coordinate system
orientations, you get zero unless the angular momentum is zero. So,
if it was just
in the inner product in
,
will
be zero unless the initial angular momentum is exactly balanced by the
net final angular momentum. And that is angular momentum
conservation. The decay has to satisfy it.
Note that the linear momenta of the electron and antineutrino have
become ignored in
.
is zero unless the angular momentum of the final nucleus
plus the spin angular momentum of electron and antineutrino equals the
angular momentum of the original nucleus. Since the electron and
antineutrino can have up to one unit of combined spin, the nuclear
spin cannot change more than one unit. That is the first selection
rule for allowed decays given in chapter 14.19.4.
Another important constraint is symmetry under the parity
transformation
.
is zero if the total integrand has odd parity. For a
nonzero value, the integrand must have even parity, and that means
that the parity of the initial nucleus must equal the combined parity
of the final nucleus electron, and antineutrino.
Since the electron and antineutrino come out without orbital angular momentum, they have even parity. So the nuclear parity must remain unchanged under the transition. (To be sure, this is not absolutely justified. Nuclear wave functions actually have a tiny uncertainty in parity because the weak force does not conserve parity, chapter 14.19.6. This effect is usually too small to be observed and will be ignored here.)
So what if either one of these selection rules is violated? In that
case, maybe the second term in the Taylor series for the electron and
antineutrino wave functions produces something nonzero that can drive
the decay. For that to be true,
And note that because
changes sign when every
is
replaced by ![]()
,
not to be zero. That is indeed the
parity selection rule for first-forbidden decays.
The higher order forbidden decays go the same way. For an
th-forbidden decay,
This subsection will have a closer look at the nuclear operator
.
Although Fermi did not know what
was, Pauli had already
established the possible generic forms for it allowed by relativity.
It could take the form of a scalar (S), a vector (V), an axial vector
(A, a vector like angular momentum, one that inverts when the physics
is seen in a mirror), a pseudoscalar (P, a scalar like the scalar
triple product of vectors that changes sign when the physics is seen
in the mirror), or a tensor (T, a multiple-index object like a matrix
that transforms in specific ways.) Fermi simply assumed the
interaction was of the vector, V, type in analogy with the decay of
excited atoms.
Fermi ignored the spin of the electron and antineutrino. However,
Gamow & Teller soon established that to allow for decays where
the two come out with spin, (Gamow-Teller decays),
also
should have terms with axial, A, and/or tensor, T, character. Work of
Fierz combined with experimental evidence showed that the Hamiltonian
could not have both S and V, nor both A and T terms. Additional
evidence narrowed
down to STP combinations or VA ones.
Finally, it was established in 1953 that the correct one was the STP combination, because experimental evidence on RaE, (some physicists cannot spell bismuth-210), showed that P was present. Unfortunately, it did not. For one, the conclusion depended to an insane degree on the accuracy of a correction term.
However, in 1955 it was established that it was STP anyway, because experimental evidence on helium-6 clearly showed that the Gamow-Teller part of the decay was tensor. The question was therefore solved satisfactorily. It was STP, or maybe just ST. Experimental evidence had redeemed itself.
However, in 1958, a quarter century after Fermi, it was found that beta decay violated parity conservation, chapter 14.19.6, and theoretically that was not really consistent with STP. So experimentalists had another look at their evidence and quickly came back with good news: “The helium-6 evidence does not show Gamow-Teller is tensor after all.”
The final answer is that
is VA. Since so much of our knowledge
about nuclei depends on experimental data, it may be worthwhile to
keep this cautionary tale, taken from the Stanford Encyclopedia of
Philosophy, in mind.
It may next be noted that
will need to include a isospin
creation operator to be able to turn a neutron into a proton. In
Fermi decays,
is assumed to be just that operator. The constant
of proportionality
,
,
describes the strength
of the electromagnetic interaction between charged particles and
photons. In Fermi decays it is found that
is about 88 eV
![]()
![]()
In Gamow-Teller decays,
is assumed to consist of products of
isospin creation operators times spin creation or annihilation
operators. The latter operators allow the spin of the neutron that
converts to the proton to flip over. Suitable spin creation and
annihilation operators are given by the so-called “Pauli spin
matrices,” chapter 12.10 When they act on a nucleon,
they produce states with the spin in an orthogonal direction flipped
over. That allows the net spin of the nucleus to change by one unit.
The appropriate constant of proportionality
is found to be a
bit larger than the Fermi one.
The relevant operators then become, [5],
So how do these nuclear operators affect the decay rate? That is best
understood by going back to the more physical shell-model picture. In
beta minus decay, a neutron is turned into a proton. That proton
usually occupies a different spatial state in the proton shells than
the original neutron in the neutron shells. And different spatial
states are supposedly orthogonal, so the inner product
will usually be
pretty small, if the decay is allowed at all. There is one big
exception, though: mirror nuclei. In a decay between mirror nuclei, a
nucleus with a neutron number
decays into one
with neutron number
.
If you allow for beta decay to excited states, more superallowed
decays are possible. States that differ merely in nucleon type are
called isobaric analog states, or isospin multiplets, chapter
14.18. There are about twenty such superallowed decays in
which the initial and final nuclei both have spin zero and positive
parity. These twenty are particularly interesting theoretically,
because only Fermi decays are possible for them. And the Fermi inner
product is
.
instead of 1 like for mirror nuclei can be seen from thinking of
isospin as if it is just normal spin. Mirror nuclei have an odd
number of nucleons, so the net nuclear isospin is half integer. In
particular the net isospin will be
in the ground state.
However, nuclei with zero spin have an even number of nucleons, hence
integer net isospin. The isospin of the twenty decays is one; it
cannot be zero because at least one nucleus must have a nonzero net
nucleon type
.
.
These decays therefore allow the value of the Fermi coupling constant
to be determined from the decay rates. It turns out to be about
88 eV ![]()
is used to analyze the decay rates of the mirror
nuclei, including the free neutron that has no neighbors, the data
show no such effect. The hypothesis that neighboring nucleons do not
affect the Fermi decay process is known as the “conserved vector current hypothesis.” What name could be
clearer than that? Unlike Fermi decays, Gamow-Teller decays are
somewhat affected by the presence of neighboring nuclei.
Besides the spin and parity rules already mentioned, Fermi decays must satisfy the approximate selection rule that the magnitude of isospin must be unchanged. They can be slowed down by several orders of magnitude if that rule is violated.
Gamow-Teller decays are much less confined than Fermi ones because of the presence of the electron spin operator. As the shell model shows, nucleon spins are uncertain in energy eigenstates. Therefore, the nuclear symmetry constraints are a lot less restrictive.
The previous four subsections have focussed on finding the Hamiltonian
coefficients of the decay from a state
to a state
.
that drives the decay. The next step is solution of the
Schrödinger equation to find the evolution of the decay process.
The quantum amplitude of the pre-decay state
will be
indicated by
and the quantum amplitude of the final decayed
state
by
.
increases from zero according to
Now picture the following. At the initial time there are a large
number of pre-decay nuclei, all with
1. All these nuclei
then evolve according to the Schrödinger equation, above, over a
time interval
that is short enough that
stays close to
one. (Because the perturbation of the nucleus by the weak force is
small, the magnitudes of the coefficients only change slowly on the
relevant time scale.) In that case,
can be dropped from the
equation and its solution is then seen to be
Then at the final time
,
of the amplitude
of the decayed state. Therefore, a fraction
of the
nuclei will be found to have decayed and
will be found
to be still in the pre-decay state
.
nuclei that did not decay.
The bottom line is however that a fraction
did.
Therefore, the ratio ![]()
![]()
gives the specific decay rate,
the relative amount of nuclei that decay per unit time. Plugging in
the above expression for
gives:
And there may be more. If the final nuclear state has spin you also
need to sum over all values of the magnetic quantum number of the
final state. (The amount of nuclear decay should not depend on the
angular orientation of the initial nucleus in empty space. However,
if you expand the electron and neutrino wave functions into spherical
waves, you need to average over the possible initial magnetic quantum
numbers. It may also be noted that the total coefficient
for the decay
will not be the same as the one
for
:
However, all these details are of little importance in finding a
ballpark for the dominant decay process. The real remaining problem
is summing over the electron and antineutrino momentum states. The
total ballparked decay rate must be found from
Based on energy conservation, you would expect that decays should only
occur when the total energy
of the nucleus, electron and
antineutrino after the decay is exactly equal to the energy
of
the nucleus before the decay. However, the summation above shows that
that is not quite true. For a final state that has
exactly
equal to
,
of, for example,
,
How can energy not be conserved? The reason is that neither the
initial state nor the final state is an energy eigenstate, strictly
speaking. Energy eigenstates are stationary states. The very fact
that decay occurs assures that these states are not really energy
eigenstates. They have a small amount of uncertainty in energy. The
nonzero value of the Hamiltonian coefficient
assures that,
chapter 5.3, and there may be more decay processes
adding to the uncertainty in energy. If there is some uncertainty in
energy, then
is not an exact relationship.
To narrow this effect down more precisely, the fraction is plotted in
figure 7.7. The spikes in the figure indicate the energies
of the possible final states. Now the energy states are almost
infinitely densely spaced, if the periodic box in which the decay is
assumed to occur is big enough. And the box must be assumed very big
anyway, to simulate decay in infinite space. Therefore, the summation
can be replaced by integration, as follows:
Now assume that the complete problem is cut into bite-size pieces for
each of which
is about constant. It can then be taken out
of the integral. Also, the range of energy in figure 7.7
over which the fraction is appreciable, the energy slop, is very small
on a normal nuclear energy scale: beta decay is a slow process, so the
initial and final states do remain energy eigenstates to a very good
approximation. Energy conservation is almost exactly satisfied.
Because of that, the density of states ![]()
![]()
will be
almost constant over the range where the integrand is nonzero. It can
therefore be taken out of the integral too. What is left can be
integrated analytically, [39, 18.36]. That gives:
Because of the assumptions involved, like dividing the problem into
bite-size pieces, the above expression is not very intuitive to apply.
It can be rephrased into a more intuitive form that does not depend on
such an assumption. The obtained decay rate is exactly the same as if
in an energy slop range
The good news is that phrased this way, it indicates the relevant
physics much more clearly than the earlier purely mathematical
expression for Fermi’s golden rule. The bad news is that it
suffers esthetically from still involving the poorly defined time
,
under the mat.
Therefore, it is more appealing to write things in terms of the energy
slop altogether:
It may be noted that the golden rule does not apply if the evolution
is not to a continuum of states. It also does not apply if the slop
range
is so large that ![]()
![]()
is not constant
over it. And it does not apply for systems that evolve without being
perturbed over times long enough that the decay probability
becomes significant before the system is “measured.” (If
becomes appreciable,
can no longer be close to one
since the probabilities
and
must add to
one.) “Measurements,” or rather interactions with the larger
environment, are called “collisions.” Fermi’s golden
rule applies to so-called “collision-dominated”
conditions. Typically examples where the conditions are not collision
dominated are in NMR and atomic decays under intense laser light.
Mathematically, the conditions for Fermi’s golden rule can be
written as
| (A.280) |
It should also be noted that the rule was derived by Dirac, not Fermi. The way Fermi got his name on it was that he was the one who named it a “golden rule.” Fermi had a flair for finding memorable names. God knows how he ended up being a physicist.
The previous subsections derived the basics for the rate of beta decay. The purpose of this section is to pull it all together and get some actual ballpark estimates for beta decay.
First consider the possible values for the momenta
and
of the electron and antineutrino. Their wave
functions were approximately of the form
In a periodic box the wave function must be the same at opposite sides
of the box. For example, the exponential factor
is 1 at
=0, and it must be 1 again at
.![]()
![]()
to be a whole multiple of
.
must be a whole multiple of ![]()
![]()
.
values are therefore spaced the finite
amount ![]()
![]()
apart. And so are successive
and
values.
![]() |
Graphically this can by visualized by plotting the possible momentum
values as points in a three-dimensional
axis system. That is
done in figure A.27. Each point correspond to one
possible momentum state. Each point is the center of its own little
cube with sides ![]()
![]()
.
.
is the
physical volume
of the periodic box, the “volume”
in momentum space taken up by each momentum state is
![]()
![]()
That allows the number of different momentum states to be computed.
In particular, consider how many states have magnitude of momentum
less than some maximum value
.
,
.
.![]()
![]()
,
is given by the number of such “volumes” in
the sphere:
The number of electron states that have momentum in a range from
to
can be found by taking a differential of the
expression above:
Now the kinetic energy of the antineutrino
is fixed in
terms of that of the electron and the energy release of the decay
by:
The electron and antineutrino kinetic energies are related to their
momenta by Einstein’s relativistic expression,
chapter 1.1.2:
These result shows that the neutrino momentum is fixed for given
electron momentum
.
and so no neutrino states.
However, Fermi’s golden rule says that the theoretical energy
after the decay does not need to be exactly the same as the one before
it, because both energies have a bit of uncertainty. This slop in the
energy conservation equation allows a range of energies
Each of these states adds a contribution to the specific decay rate
given by
The derived expression (A.277) for the Hamiltonian coefficient
can be written in the form
It is not easy to say much about
in general, beyond the fact
that its magnitude should not be much more than one. This book will
essentially ignore
to ballpark the decay rate, assuming that
its variation will surely be much less than that of the beta decay
lifetimes, which vary from milliseconds to ![]()
The decay rate becomes after clean up
The factor
that popped up out of nothing in the decay rate is
thrown in to correct for the fact that the wave function of the
electron is not really just an exponential. The nucleus pulls on the
electron with its charge, and so changes its wave function locally
significantly. The correction factor
for allowed decays is
called the “Fermi function” and is given by
To ballpark the effect of the nuclear charge on the electron wave function, this book will use the relativistic Fermi function above whether it is an allowed decay or not.
For allowed decays, the factor in the decay rate that is governed by
the
-
Note that
also depends a bit on the mass number through the
nuclear radius in
.
| (A.286) |
| (A.287) |
Electron capture is much more simply to analyze than beta decay, because the captured electron is in a known initial state.
It will be assumed that a 1s, or K-shell, electron is captured, though
L-shell capture may also contribute to the decay rate for heavy
nuclei. The Hamiltonian coefficient that drives the decay is
Here
is the hydrogen ground state wave function, but
rescaled for a nucleus of charge
instead of
.
is spherically symmetric. In other words, the 1s
electron has no orbital angular momentum and so cannot contribute to
conservation of angular momentum and parity. Therefore,
can safely be approximated by its value at the origin, from chapter
4.3,
The square Hamiltonian coefficient for
th-forbidden decays then
becomes
The decay rate for electron capture is
Put it all together, including the fact that there are two K
electrons, and the electron-capture decay rate becomes