Most of the time in quantum mechanics, exact solution of the Hamiltonian eigenvalue problem of interest is not possible. To deal with that, approximations are made.
Perturbation theory can be used when the Hamiltonian
consists of
two parts
and
,
can be
solved and where
is small. The idea is then to adjust the found
solutions for the “unperturbed Hamiltonian”
so
that they become approximately correct for
.
This addendum explains how perturbation theory works. It also gives a few simple but important examples: the helium atom and the Zeeman and Stark effects. Addendum,{A.38} will use the approach to study relativistic effects on the hydrogen atom.
To use perturbation theory, the eigenfunctions and eigenvalues of the
unperturbed Hamiltonian
must be known. These eigenfunctions
will here be indicated as
and the corresponding
eigenvalues by
.
to indicate the quantum numbers of the eigenfunctions. If the basic
system is an hydrogen atom, as is often the case in textbook examples,
and spin is unimportant,
would likely stand for the set of
quantum numbers
,
,
.
might stand for the quantum
numbers
,
,
.
and
indicates that they ignore the
perturbation Hamiltonian
.
The key to perturbation theory are the “Hamiltonian perturbation coefficients” defined as
In the application of perturbation theory, the idea is to pick one
unperturbed eigenfunction
of
of interest and then
correct it to account for
,
.
is
degenerate, i.e. there is more than one unperturbed eigenfunction
of
with that energy, you must use a
“good” eigenfunction to correct the energy. How to do
that will be discussed in subsection A.37.3.
For now just assume that the energy is not degenerate or that you
picked a good eigenfunction
.
to account for the perturbation
is very simple, {D.80}; just add the corresponding
Hamiltonian perturbation coefficient:
Unfortunately, it does happen quite a lot that the above correction
is zero because of some symmetry or the other.
Or it may simply not be accurate enough. In that case, to find the
energy change you have to use what is called “second order perturbation theory:”
Sometimes you may also be interested in what happens to the energy
eigenfunctions, not just the energy eigenvalues. The corresponding
formula is
In some cases, instead of using second order theory as above, it may
be simpler to compute the first order wave function perturbation and
the second order energy change from
One application of perturbation theory is the “Hellmann-Feynman theorem.” Here the perturbation Hamiltonian is
an infinitesimal change
in the unperturbed Hamiltonian
caused by an infinitesimal change in some parameter that it depends
on. If the parameter is called
,
One prominent deficiency in the approximate analysis of the heavier atoms in chapter 5.9 was the poor ionization energy that it gave for helium. The purpose of this example is to derive a much more reasonable value using perturbation theory.
Exactly speaking, the ionization energy is the difference between the
energy of the helium atom with both its electrons in the ground state
and the helium ion with its second electron removed. Now the energy
of the helium ion with electron 2 removed is easy; the Hamiltonian for
the remaining electron 1 is
This Hamiltonian is exactly the same as the one for the hydrogen atom
in chapter 4.3, except that it has
where the hydrogen
one, with just one proton in its nucleus, has
.
in that solution,
replace it by
.
for
the helium ion is half the Bohr radius
for hydrogen,
It is interesting to see that the helium ion has four times the energy of the hydrogen atom. The reasons for this much higher energy are both that the nucleus is twice as strong, and that the electron is twice as close to it: the Bohr radius is half the size. More generally, in heavy atoms the electrons that are poorly shielded from the nucleus, which means the inner electrons, have energies that scale with the square of the nuclear strength. For such electrons, relativistic effects are much more important than they are for the electron in a hydrogen atom.
The neutral helium atom is not by far as easy to analyze as the ion.
Its Hamiltonian is, from (5.34):
Note however that the repulsion term is qualitatively similar to the
nuclear attraction terms, except that there are four of these nuclear
attraction terms versus a single repulsion term. So maybe then, it
may work to treat the repulsion term as a small perturbation, call it
,
given by the first four terms?
Of course, if you ask mathematicians whether 25% is a small amount,
they are going to vehemently deny it; but then, so they would for any
amount if there is no limit process involved, so just don’t ask
them, OK?
The solution of the eigenvalue problem
is simple:
since the electrons do not interact with this Hamiltonian, the ground
state wave function is the product of the ground state wave functions
for the individual electrons, and the energy is the sum of their
energies. And the wave functions and energies for the separate
electrons are given by the solution for the ion above, so
According to this result, the energy of the atom is
while the
ion had
,
,
To get a better ionization energy, try perturbation theory. According
to first order perturbation theory, a better value for the energy of
the hydrogen atom should be
The inner product of the final term can be written out as
The result of the integration is
The second order perturbation result should give a much more accurate result still. However, if you did the integral above, you may feel little inclination to try the ones involving all possible products of hydrogen energy eigenfunctions.
Instead, the result can be improved using a variational approach, like
the ones that were used earlier for the hydrogen molecule and
molecular ion, and this requires almost no additional work. The idea
is to accept the hint from perturbation theory that the wave function
of helium can be approximated as
where
is the hydrogen ground state wave function using a modified
Bohr radius
instead of
:
No new integrals need to be done to evaluate the inner product above.
Instead, noting that for the hydrogen atom according to the virial
theorem of chapter 7.2 the expectation kinetic energy
equals
![]()
![]()
and the potential energy
equals
,
Energy eigenvalues are degenerate if there is more than one
independent eigenfunction with that energy. Now, if you try to use
perturbation theory to correct a degenerate eigenvalue of a
Hamiltonian
for a perturbation
,
1 independent eigenfunctions
with energy
and that they are numbered as
Unfortunately, the full Hamiltonian
is not likely to agree
with
about that. As far as the full Hamiltonian is concerned,
normally only very specific combinations are acceptable, the
“good” eigenfunctions. It is said that the perturbation
“breaks the degeneracy” of the energy eigenvalue.
The single energy eigenvalue splits into several eigenvalues of
different energy. Only good combinations will show up these changed
energies; the bad ones will pick up uncertainty in energy that hides
the effect of the perturbation.
The various ways of ensuring good eigenfunctions are illustrated in
the following subsections for example perturbations of the energy
levels of the hydrogen atom. Recall that the unperturbed energy
eigenfunctions of the hydrogen atom electron, as derived in chapter
4.3, and also including spin, are given as
and
.
have the same energy
,
corresponding to the square orbital
angular momentum
;
corresponding to the
orbital angular momentum
in the
-
corresponding to the spin angular momentum
in
the
-
is two-fold degenerate, it is the same for both
,
and
,
.
is eight-fold
degenerate, it is the same for
,
,
,
,
.
There are two important rules to identify the good eigenfunctions,
{D.80}:
In particular, if the perturbation Hamiltonian commutes with all
additional operators that make the eigenfunctions of
unique,
stop worrying: every eigenfunction is good already.
For example, for the usual hydrogen energy eigenfunctions
,
,
,
make the eigenfunctions at a given unperturbed energy
level
unique. They correspond to the operators
,
,
.
commutes with any one of these operators, the corresponding
quantum number is good. If the perturbation commutes with all
three, all eigenfunctions are good already.
Otherwise linear algebra is required. For each set of energy
eigenfunctions
Unfortunately, if the eigenvalues of this matrix are not all different, the eigenvectors are not unique, so you remain unsure about what are the good eigenfunctions. In that case, if the second order energy corrections are needed, the detailed analysis of derivation {D.80} will need to be followed.
If you are not familiar with linear algebra at all, in all cases mentioned here the matrices are just two by two, and you can find that solution spelled out in the notations under “eigenvector.”
The following, related, practical observation can also be made:
Hamiltonian perturbation coefficients can only be nonzero if all the good quantum numbers are the same.
If you put an hydrogen atom in an external magnetic field
,
If for simplicity a coordinate system is used with its
-
| (A.245) |
For this perturbation, the
energy eigenfunctions are
already good ones, because
commutes with all of
,
and
.
The Zeeman effect can be seen in an experimental spectrum. Consider
first the ground state. If there is no electromagnetic field, the two
ground states
and
would have exactly
the same energy. Therefore, in an experimental spectrum, they would
show up as a single line. But with the magnetic field, the two energy
levels are different,
Some disclaimers should be given here. First of all, the 2 in
is only equal to 2 up to about 0.1% accuracy. More
importantly, even in the absence of a magnetic field, the energy
levels at a given value of
do not really form a single line in the
spectrum if you look closely enough. There are small errors in the
solution of chapter 4.3 due to relativistic effects, and so
the theoretical lines are already split. That is discussed in
addendum {A.38}. The description given above is a good one
for the “strong” Zeeman effect, in which the magnetic
field is strong enough to swamp the relativistic errors.
If an hydrogen atom is placed in an external electric field
instead of the magnetic one of the previous
subsection, its energy levels will change too. That is called the
“Stark effect.” Of course a Zeeman, Dutch for sea-man,
would be most interested in magnetic fields. A Stark, maybe in a
spark? (Apologies.)
If the
-
| (A.246) |
Since the typical magnitude of
is of the order of a Bohr radius
,
.
of the order of 0.000,5 eV,
[24, p. 339]. That is really small compared to the
typical electron energy levels.
And additionally, it turns out that for many eigenfunctions, including
the ground state, the first order correction to the energy is zero.
To get the energy change in that case, you need to compute the second
order term, which is a pain. And that term will be much smaller still
than even
for reasonable field strengths.
Now first suppose that you ignore the warnings on good eigenfunctions,
and just compute the energy changes using the inner product
.
So, since all first order energy changes that you compute are zero, you would naturally conclude that to first order approximation none of the energy levels of a hydrogen atom changes due to the electric field. But that conclusion is wrong for anything but the ground state energy. And the reason it is wrong is because the good eigenfunctions have not been used.
Consider the operators
,
,
that make
the energy eigenfunctions
unique. If
commuted with them all, the
would
be good eigenfunctions. Unfortunately, while
commutes with
and
,
,
is bad.
Still, the two states
with the ground state energy
are good states, because there are no states with the same energy and
a different value of the bad quantum number
.
are both good, the ground state energy does indeed not change to first
order.
But now consider the eight-fold degenerate
2 energy level.
Each of the four eigenfunctions
and
is a good one because for each of them, there is
no other
2 eigenfunction with a different value of the bad
quantum number
.
However, the remaining two
2 spin-up states
and
have different values for the bad quantum number
,
0 and
for the good quantum numbers of orbital and spin
-
and
are bad and will
have to be combined.
It suffices to just analyze the spin up states, because the spin down
ones go exactly the same way. The coefficients
and
of the
good combinations
must be
eigenvectors of the matrix
The eigenvectors of this matrix are simple enough to guess; they have
either equal or opposite coefficients
and
:

Remarkably, the good combinations of
and
are
the “sp” hybrids of carbon fame, as described in chapter
5.11.4. Note from figure 5.13 in that section
that these hybrids do not have the same magnitude at opposite
sides of the nucleus. They have an intrinsic “electric dipole
moment,” with the charge shifted towards one side of the atom,
and the electron then wants to align this dipole moment with the
ambient electric field. That is much like in Zeeman splitting, where
electron wants to align its orbital and spin magnetic dipole moments
with the ambient magnetic field.
The crucial thing to take away from all this is: always, always, check whether the eigenfunction is good before applying perturbation theory.
It is obviously somewhat disappointing that perturbation theory did
not give any information about the energy change of the ground state
beyond the fact that it is second order, i.e. very small compared to
.
is a good state with
0 and
,
for all
values of
and
.
,
It is however possible to find the perturbation in the wave function
from the alternate approach (A.243), {D.81}.
In that way the second order ground state energy is found to be
Finally, note that the typical value of 0.000,5 eV or so for
quoted earlier is very small compared to the
about 100 eV for
,
is much smaller than the first order energy changes
in the
energy level.
A weird prediction of quantum mechanics is that the electron will
eventually escape from the atom, leaving it ionized. The reason is
that the potential is linear in
,
-
,
1