In quantum mechanics, photons are the particles of the electromagnetic field. To actually use photons, something like a wave function for them is needed. But that is not quite trivial for a purely relativistic particle with zero rest mass like the photon. That is the primary topic of this addendum. It will be assumed throughout that the photon is in empty space.
To see the problem with a photon wave function, a review of the wave
function of the nonrelativistic electron is useful, chapters
3.1 and 5.5.1. The electron wave function
can be written as a vector with two components:
Now a wave function of type 1 is not really meaningful for a photon. What would it mean, find a photon? Since the photon has no rest mass, you cannot bring them to a halt: there would be nothing left. And anything you do to try to localize the electromagnetic field is likely to just produce new photons. (To be sure, with some effort something can be done towards a meaningful wave function of type 1, e.g. [Sype, J.E. 1995 Phys. Rev. A 52, 1875]. It would have two components like the electron, since the photon has two independent spin states. But wave functions of that type are not widely accepted, nor useful for the purposes here.)
So what? A wave function of type 1 is not that great anyway. For
one, it only defines the magnitudes of the components of the wave
function. If you only define the magnitude of a complex function, you
define only half of it. True, even as a type 2 wave function the
classical electron wave function is not quite unique. You can still
multiply either component by a factor
,
a real constant, without changing any of the physics. But
that is not by far as bad as completely ignoring everything else
besides the magnitude.
Furthermore, relativistic quantum mechanics has discovered that what we call an electron is something cloaked in a cloud of virtual particles. It is anybody’s guess what is inside that cloak, but it will not be anything resembling what we would call an electron. So what does it really mean, finding an electron within an infinitesimal volume around a point? What happens to that cloak? And to really locate an electron in an infinitesimal volume requires infinite energy. If you try to locate the electron in a region that is small enough, you are likely to just create additional electron-positron pairs much like for photons.
For most practical purposes, classical physics understands the particle behavior of electrons very well, but not their wave behavior. Conversely, it understands the wave behavior of photons very well, but not their particle behavior. But when you go to high enough energies, that distinction becomes much less obvious.
The photon most definitely has a wave function of type 2 above. In quantum electrodynamics, it may simply be called the photon wave function, [23, p. 240]. However, since the term already seems to be used for type 1 wave functions, this book will use the term “photon type 2 wave function.” It may not tell you where to find that elusive photon, but you will definitely need it to figure out how that photon interacts with, say, an electron.
What the type 2 wave function of the photon is can be guessed readily
from classical electromagnetics. After all, the photon is supposed to
be the particle of the electromagnetic field. So, consider first
electrostatics. In classical electrostatics the forces on charged
particles are described by an electric force per unit charge
.
But quantum mechanics uses potentials, not forces. For example, the
solution of the hydrogen atom of chapter 4.3 used a
potential energy of the electron
.
where ![]()
![]()
is called the electrostatic
potential. This potential is not directly observable nor unique; you
can add any constant to it without changing the observed physics.
Clearly, an unobservable function
for the electromagnetic
field sounds much like a wave function for the particle of that field,
the photon. But actually, the electrostatic potential
is
only part of it. In classical electromagnetics, there is not just an
electric field
,
.
.
The following relationships give the electric and magnetic fields in
terms of these potentials:
When both potentials are allowed for, the nonuniqueness becomes much
larger. In particular, for any arbitrary function
of position
and time, you can find two different potentials
and
that produce the exact same electric and magnetic fields as
and
.
Finally, it turns out that classical relativistic mechanics likes to
combine the four scalar potentials in a four-dimensional vector, or
four-vector, chapter 1.3.2:
That is the one. Quantum mechanics takes a four-vector potential of
this form to be the type 2 wave function of the photon
.
The photon wave functionWave functions are in general complex. The classical four-potential, and especially its physically observable derivatives, the electric and magnetic fields, must be real. Indeed, according to quantum mechanics, observable quantities correspond to eigenvalues of Hermitian operators, not to wave functions. What the operators of the observable electric and magnetic fields are will be discussed in addendum {A.23}.should not be confused with the classical four-potential
.
To use the photon wave function in practical applications, it is
essential to simplify it. That can be done by choosing a clever gauge
function
in the gauge property (A.87).
One very helpful simplification is to choose
so that
To achieve the Lorenz condition, assume an initial wave function
that does not satisfy it. Then
plug the gauge property (A.87) into the Lorenz condition
above. That shows that the needed gauge function
must satisfy
There is another reason why you want to satisfy the Lorenz condition.
The photon is a purely relativistic particle with zero rest mass.
Then following the usual ideas of quantum mechanics, in empty space
its wave function should satisfy the homogeneous Klein-Gordon
equation, {A.14} (A.43):
Maxwell’s classical electromagnetics provides additional support for that idea. There the Klein-Gordon equation for the potentials also requires that the Lorenz condition is satisfied, {A.36}.
Since the inhomogeneous Klein-Gordon equation for the gauge function
is second order in time, it still leaves two initial conditions
to be chosen. These can be chosen such as to make the initial values
for
and its time-derivative zero. That then makes
completely zero, because it satisfies the homogeneous
Klein-Gordon equation.
And so the fully simplified photon wave function becomes:
It should be noted that the Coulomb gauge is not Lorentz invariant. A
moving observer will not agree that the potential
is
zero and that
is solenoidal. In real life that means
that if you want to study a process in say a center-of-mass system,
first switch to that system and then assume the Coulomb gauge. Not
the other way around. The Coulomb-Lorenz gauge is too helpful not to
use, although that is possible, [23, p. 241].
Now that the photon wave function has been simplified the photon spin
can be determined. Recall that for the electron, the two components
of the wave function correspond to its two possible values of the spin
angular momentum
in the chosen
-
corresponds to
,
to
.
.
one of
,
,
,
.
The simplified wave function (A.90) has only three
nontrivial components. And the gauge property requires that this
simplified wave function still describes all the physics. Since the
only nontrivial part left is the three-dimensional vector
,
-
are
,![]()
.
However, that is not quite the end of the story. There is still that
additional condition
0 to
satisfy. In principle this constraint allows another component of the
wave function to be eliminated. However, all three remaining
components are spatial ones. So it does not make much sense to
eliminate one and not the other. More importantly, it is known from
relativity that
behaves like a normal three-dimensional
vector under rotations of the coordinate system, not like a
two-dimensional spinor like the electron wave function. That is
implicit in the fact that the complete four-vector transforms
according to the Lorentz transform, chapter 1.3.2. The
spin is really 1.
Still, the additional constraint does limit the angular momentum of the photon. In particular, a photon does not have independent spin and orbital angular momentum. The two are intrinsically linked. What that means for the net angular momentum of photons is worked out in subsection A.21.7.
For now it may already be noted that the photon has no state of zero
net angular momentum. A state of zero angular momentum needs to look
the same from all directions. That is a consequence of the
relationship between angular momentum and symmetry, chapter
7.3. Now the only vector wave functions that look the
same from all directions are of the form
.
is the distance from the origin around which the angular momentum
is measured and
the unit vector pointing away from the
origin. Such a wave function cannot satisfy the condition
0. That follows from applying the
divergence theorem for a sphere around the origin.
Following the rules of quantum mechanics, {A.14},
photon states of definite energy
take the form
Substitution in the Klein-Gordon equation and cleaning up shows that
this eigenfunction needs to satisfy the eigenvalue problem,
{A.14},
A classical wave function for a particle is normalized by demanding that the square integral of the wave function is 1. That does not work for a relativistic particle like the photon, since the Klein-Gordon equation does not preserve the square integral of the wave function, {A.14}.
However, the Klein-Gordon equation does preserve the following
integral, {D.36.1},
Consider for a second the case that
was a classical
potential
instead of a photon wave function. Then the above
integral can be rewritten in terms of the electric and magnetic fields
and
as, {D.36.1},
Of course, there needs to be an additional constant; the integral
above does not have units of energy. If you check, you find that the
“permittivity of space”
8.85
![]()
/J m
.
as half
of what it did.
Now the photon wave function is not physically observable and does not
have to conform to the rules of classical physics. But if you have to
choose a normalization constant anway? Why not choose it so that what
classical physics would take to be the energy is in fact the correct
energy
?
So, the photon wave function normalization that will be used in this
book is:
Assume that you start with an unnormalized energy eigenfunction
.
The simplest quantum states for photons are states of definite linear
momentum
.
-
In that case, the photon wave function takes the form
The vector
is not completely arbitrary; its
-
is zero as the Coulomb-Lorenz
gauge requires. So the wave function can be written as
There is another useful way to write the wave function:
There are still only two independent states. But another way of
thinking about that is that the spin angular momentum in the direction
of motion cannot be zero. The relative spin in the direction of
motion, ![]()
![]()
is called the “helicity.” It turns out that for a particle with zero rest mass
like the photon, the helicity can only be 1 (right handed) or -1 (left
handed), [23, p. 65].
Note further that the angular momenta in the
and
directions
are uncertain. It so happens that the angular momentum in the
direction of motion commutes with all three components of linear
momentum, chapter 4.5.4. So it can have definite values.
But the
and
angular momenta do not commute.
For later use, it is necessary to normalize the wave function using
the procedure described in the previous subsection. To do so, it must
be assumed that the photon is in a periodic box of volume
,
In order to compare to the classical electromagnetic wave in chapter
7.7.1, another example is needed. This photon wave function
has its linear momentum in the
-
-
For a general direction of the wave motion and its linear
polarization, the above expession becomes
For convenience, the density of states as needed for Fermi’s
golden rule will be listed here. It was given earlier in chapter
6.3 (6.7) and 6.19:
It is often convenient to describe photons in terms of states of definite net angular momentum. That makes it much easier to apply angular momentum conservation in the emission of radiation by atoms or atomic nuclei. Unfortunately, angular momentum states are a bit of a mess compared to the linear momentum states of the previous subsection. Fortunately, engineers are brave.
Before diving in, it is a good idea to look first at a spinless
particle. Assume that this hypothetical particle is in an energy
eigenstate. Also assume that this state has square orbital angular
momentum
where
is called the azimuthal quantum
number. And that the state has orbital angular momentum in the
-
where
is called the magnetic quantum
number. Then according to quantum mechanics, chapter 4.2.3,
must be a nonnegative integer and
must be an integer no
larger in magnitude than
.
,
.
Now the photon is a particle with spin 1. Its wave function is
essentially a vector
.
0. Normally, for
three-dimensional vectors you expect three types of angular momentum
states, like in {A.20}. But for the photon there are
only two types.
The two types of photon energy eigenfunctions with definite net
angular momentum are, {D.36.2} and with drums please,
The azimuthal quantum number
is at least 1; the expressions
above produce zero for
0. (
is just a constant
and the gradient of a radial function is in the direction of
.
as
with
the speed of light, (A.91). That is
really the Planck-Einstein relation, because
is the photon
frequency
.
The parity of the electric multipole wave functions is negative if
is odd and positive if
is even, {D.36.2.7}.
The parity of the magnetic multipole wave functions is exactly the
other way around. From that it can be seen, {D.36.2.8},
that magnetic multipole wave functions have orbital angular momentum
.
and
.
Atomic or nuclear transitions in which a photon in a state
is emitted or absorbed are called “electric multipole” transitions. They are indicated as
![]()
In particular, for net angular momentum
1, they are called
or electric dipole transitions. That is the normal kind.
However, as discussed in chapter 7.4, such transitions
may not be able to satisfy conservation of angular momentum and
parity. Since the photon in the state has
1, transitions in
which the atomic angular momentum changes by more than one unit cannot
be accomodated. Neither can transitions in which the atomic or
nuclear momentum parity does not change, because the
photon has odd parity.
Such transitions may be accomodated by transitions in which photons in
different states are emitted or absorbed, using the photon angular
momenta and parities as noted above. Electric multipole transitions
with
2 are called
or electric quadrupole
transitions. Those with
3 are
or electric
octupole ones, with
4
or electric
hexadecapole ones, with
5
or electric
triakontadipole ones, for
6
or electric
hexacontatetrapole ones and so on until your knowledge of latin and
greek powers of 2 runs out.
Similarly, transitions in which photons in a state
are emitted or absorbed are called “magnetic multipole transitions.” The same latin applies.
Like the states of definite linear momentum in the previous
subsection, the states of definite angular momentum cannot be
normalized in infinite space. To deal with that, it will be assumed
that the photon is confined inside a sphere of a very large radius
.
The normalized wave function and unobservable fields for electric
multipole photons are then, subsection A.21.5 and
{D.36},
The normalized wave function and unobservable fields for magnetic
multipole photons are
Assume now that there is an atom or atomic nucleus at the origin that
interacts with the photon. An atom or nucleus is typically very small
compared to the wave length of the photon that it interacts with.
Phrased more appropriately, if
is the typical size of the atom or
nucleus, then
is typically small. The atom or nucleus is just a
tiny speck at the origin.
Now the wave functions
are larger at small radii
than the
.
are of order
while the
are of order
,
smaller. These powers of
reflect the lowest measurable orbital
angular momentum of the states.
A glance at the unobservable fields of electric multipole photons
above then shows that for these photons, the field is primarily
electric at the atom or nucleus. And even the electric field will be
small unless
1, in other words, unless it is an electric dipole
photon.
For the magnetic multipole photons, it is the magnetic field that
dominates at the atom or nucleus. And even that will be small unless
1, which means a magnetic dipole photon. Note that the magnetic
field acts as if it had one unit or orbital angular momentum less than
the photon; the magnetic field is essentially the wave function of an
electric multipole photon.
For later reference, the density of states as needed for Fermi’s
golden rule will be listed here, {D.36.2.6}: