4.3 Two-State Systems

The protons in the H$_2^+$ hydrogen molecular ion of chapter 3.5 are held together by a single shared electron. However, in the H$_2$ neutral hydrogen molecule of the previous section, they are held together by a shared pair of electrons. The main purpose of this section is to shed some light on the question why chemical bonds involving a single electron are relatively rare, while bonds involving pairs of shared electrons are common.

The unifying concept relating the two bonds is that of “two state systems.” Such systems involve two intuitive basic states $\psi_1$ and $\psi_2$. For the hydrogen molecular ion, one state, $\psi_1=\psi_L$, described that the electron was around the left proton, the other, $\psi_2=\psi_R$, that it was around the right one. For the hydrogen molecule, $\psi_1=\psi_L\psi_R$ had electron 1 around the left proton and electron 2 around the right one; $\psi_2=\psi_R\psi_L$ was the same, except with the electrons reversed.

There are many other physical situations that may be described as two state systems. Covalent chemical bonds involving atoms other than hydrogen would be an obvious example. Just substitute a positive ion for one or both protons.

The C$_6$H$_6$benzene molecular ring” consists of a hexagon of 6 carbon atoms that are held together by 9 covalent bonds. The way that the 9 bonds between the 6 atoms can be arranged is to make every second bond a double one. However, that still leaves two possibilities, by swapping the locations of the single and double bonds, hence two different states $\psi_1$ and $\psi_2$.

The NH$_3$ammonia molecule” consists of an nitrogen atom bonded to three hydrogen atoms. By symmetry, the logical place for the nitrogen atom to sit would surely be in the center of the triangle formed by the three hydrogen atoms. But it does not sit there. If it was in the center of the triangle, the angles between the hydrogen atoms, measured from the nitrogen nucleus, should be 120$^\circ$ each. However, as discussed later in chapter 4.11.3, valence bond theory requires that the angles should be about 90$^\circ$, not 120$^\circ$. (The actual angles are about 108$^\circ$ because of reasons similar to those for water as discussed in chapter 4.11.3.) The key point here is that the nitrogen must sit to the side of the triangle, and there are two sides, producing once again two different states $\psi_1$ and $\psi_2$.

In each case described above, there are two logical physical states $\psi_1$ and $\psi_2$. The peculiarities of two state systems arise from states that are combinations of these two states, as in

\begin{displaymath}
\Psi=a\psi_1+b\psi_2
\end{displaymath}

Note that according to the ideas of quantum mechanics, the square magnitude of the first coefficient of the combined state, $\vert a\vert^2$, represents the probability of being in state $\psi_1$ and $\vert b\vert^2$ the probability of being in state $\psi_2$. Of course, the total probability of being in one of the states should be one:

\begin{displaymath}
\vert a\vert^2+\vert b\vert^2=1
\end{displaymath}

(This is only true if the $\psi_1$ and $\psi_2$ states are orthonormal. In the hydrogen molecule cases, orthonormalizing the basic states would change them a bit, but their physical nature would remain much the same, especially if the protons are not too close.)

The key question is what combination of states has the lowest energy. The expectation value of energy is

\begin{displaymath}
\langle E\rangle =
\langle a\psi_1+b\psi_2\vert H\vert a\psi_1+b\psi_2\rangle
\end{displaymath}

This can be multiplied out as, (remember that numerical factors come out of the left of an inner product as complex conjugates,)

\begin{displaymath}
\langle E\rangle = a^*aH_{11} + a^*bH_{12} + b^*aH_{21} + b^*bH_{22}
\end{displaymath}

where the shorthand notation

\begin{displaymath}
H_{11} = \langle\psi_1\vert H\psi_1\rangle, \quad
H_{12}...
..._1\rangle, \quad
H_{22} = \langle\psi_2\vert H\psi_2\rangle
\end{displaymath}

was used. Note that $H_{11}$ and $H_{22}$ are real, (1.16), and the states will be ordered so that $H_{11}$ is less or equal to $H_{22}$. Normally, $H_{12}$ and $H_{21}$ are not real but complex conjugates, (1.16), but you can always change the definition of, say, $\psi_1$ by a factor of magnitude one to make $H_{12}$ equal to a real and negative number, and then $H_{21}$ will be that same negative number. Also note that $a^*a=\vert a\vert^2$ and $b^*b=\vert b\vert^2$.

The above expression for the expectation energy consists of two kinds of terms, which will be called:

 $\displaystyle \mbox{the averaged energy: }$ $\textstyle \vert a\vert^2 H_{11} +
\vert b\vert^2 H_{22}$    (4.11)
 $\displaystyle \mbox{the twilight terms: }$ $\textstyle (a^* b + b^* a) H_{12}%
$    (4.12)

Each of those contributions will be discussed in turn.

The averaged energy is the energy that you would intuitively expect the combined wave function to have. It is a straightforward average of the energies of the two component states $\psi_1$ and $\psi_2$ times the probabilities of being in those states. In particular, in the important case that the two states have the same energy, the averaged energy is that energy. What is more logical than that any mixture of two states with the same energy would have that energy too?

But the twilight terms throw a monkey wrench in this simplistic thinking. It can be seen that they will always make the ground state energy lower than the energy $H_{11}$ of the lowest component state. (To see that, just take $a$ and $b$ positive real numbers and $b$ small enough that $b^2$ can be neglected.) This lowering of the energy below the lowest component state comes out of the mathematics of combining states; absolutely no new physical forces are added to produce it. But if you try to describe it in terms of classical physics, it really looks like a mysterious new “twilight force” is in operation here. It is no new force; it is the weird mathematics of quantum mechanics.

So, what are these twilight terms physically? If you mean, what are they in terms of classical physics, there is simply no answer. But if you mean, what are they in terms of normal language, rather than formulae, it is easy. Just have another look at the definition of the twilight terms; they are a measure of the inner product $\langle\psi_1\vert H\psi_2\rangle$. That is the energy you would get if nature was in state $\psi_1$ if nature was in state $\psi_2$. On quantum scales, nature can get really, really ethereal, where it moves beyond being describable by classical physics, and the result is very concrete, but weird, interactions. For, at these scales twilight is real, and classical physics is not.

For the twilight terms to be nonzero, there must be a region where the two states overlap, i.e. there must be a region where both $\psi_1$ and $\psi_2$ are nonzero. In the simplest case of the hydrogen molecular ion, if the atoms are far apart, the left and right wave functions do not overlap and the twilight terms will be zero. For the hydrogen molecule, it gets a bit less intuitive, since the overlap should really be visualized in the six-dimensional space of those functions. But still, the terms are zero when the atoms are far apart.

The twilight terms are customarily referred to as “exchange terms,” but everybody seems to have a different idea of what that is supposed to mean. The reason may be that these terms pop up all over the place, in all sorts of very different settings. This book prefers to call them twilight terms, since that most clearly expresses what they really are. Nature is in a twilight zone of ambiguity.

The lowering of the energy by the twilight terms produces more stable chemical bonds than you would expect. Typically, the effect of the terms is greatest if the two basic states $\psi_1$ and $\psi_2$ are physically equivalent and have the same energy. This is the case for the hydrogen examples and most of the others mentioned. For such states, the ground state will occur for an equal mixture of states, $a=b=\sqrt{\frac12}$, because then the twilight terms are most negative. In that case, the lowest energy, call it $E_L$, is an amount $H_{12}$ below the energy $H_{11}=H_{22}$ of the component states.

On the other hand, if the lower energy state $\psi_1$ has significantly less energy than state $\psi_2$, then the minimum energy will occur for $\vert a\vert\approx 1$ and $\vert b\vert\approx 0$. (This assumes that the twilight terms are not big enough to dominate the energy.) In that case $ab\approx 0$, which pretty much takes the twilight terms (4.12) out of the picture completely.

This happens for the single-electron bond of the hydrogen molecular ion if the second proton is replaced by another ion, say a lithium ion. The energy in state $\psi_1$ where the electron is around the proton will be less than that of state $\psi_2$ where it is around the lithium ion. For such asymmetrical single-electron bonds, the twilight terms are not likely to help forge a strong bond. While it turns out that the LiH$^+$ ion is stable, the binding energy is only 0.14 eV or so, compared to 2.8 eV for the H$_2^+$ ion. Also, the LiH$^+$ bond seems to be best described as polarization of the hydrogen atom by the lithium ion, instead of as a true chemical bond.

In contrast, for the two-electron bond of the neutral hydrogen molecule, if the second proton is replaced by a lithium ion, states $\psi_1$ and $\psi_2$ will still be the same: both have one electron around the proton and one around the lithium ion. The two states do have the electrons reversed, but the electrons are identical. Thus the twilight terms are still likely to be effective. Indeed neutral LiH lithium hydride exists as a stable molecule with a binding energy of about 2.5 eV at low pressures. It should be noted that the LiH bond is very ionic, with the “shared” electrons mostly at the hydrogen side, so the actual ground state is quite different from the model. But the model should be better when the nuclei are farther apart, so the analysis can at least justify the existence of a significant bond.

For the ammonia molecule, the two states $\psi_1$ and $\psi_2$ differ only in the side of the hydrogen ring that the nitrogen atom is at. Since these two states are physically equivalent, there is again a significant lowering of the energy $E_L$ for the symmetric combination $a=b$. Similarly, there is a significant raising of the energy $E_H$ for the antisymmetric combination $a=-b$. Transitions between these two energy states produce photons of a single energy in the microwave range. It allows a maser (microwave-range laser) to be constructed, and the first maser was in fact an ammonia one. It gave rise to the subsequent development of optical-range versions. These were initially called “optical masers,” but are now known as “lasers.” Masers are important for providing a single frequency reference, like in some atomic clocks. See chapter 5.3.2 for the operating principle of l/masers.


Key Points
$\diamond$
In quantum mechanics, the energy of different but physically equivalent states can be lowered by mixing them together.

$\diamond$
This lowering of energy does not come from new physical forces, but from the weird mathematics of the wave function.

$\diamond$
The effect tends to be much less when the original states are physically very different.

$\diamond$
One important place where states are indeed physically the same is in chemical bonds involving pairs of electrons. The equivalent states here merely have the identical electrons interchanged.



4.3 Review Questions
1
The effectiveness of mixing states was already shown by the hydrogen molecule and molecular ion examples. But the generalized story above restricts the “basis” states to be orthogonal, and the states used in the hydrogen examples were not.

Show that if $\psi_1$ and $\psi_2$ are not orthogonal states, but are normalized and produce a real and positive value for $\langle\psi_1\vert\psi_2\rangle $, like in the hydrogen examples, then orthogonal states can be found in the form

\begin{displaymath}
\bar\psi_1 = \alpha\left(\psi_1 - \varepsilon\psi_2\right) ...
...ad\bar\psi_2 = \alpha\left(\psi_2 - \varepsilon\psi_1\right).
\end{displaymath}

For normalized $\psi_1$ and $\psi_2$ the Cauchy-Schwartz inequality says that $\langle\psi_1\vert\psi_2\rangle $ will be less than one. If the states do not overlap much, it will be much less than one and $\varepsilon $ will be small.

(If $\psi_1$ and $\psi_2$ do not meet the stated requirements, you can always redefine them by factors $ae^{{\rm i}{c}}$ and $be^{-{\rm i}{c}}$, with $a$, $b$, and $c$ real, to get states that do.) Answer

2
Show that it does not have an effect on the solution whether or not the basic states $\psi_1$ and $\psi_2$ are normalized, like in the previous question, before the state of lowest energy is found.

This requires no detailed analysis; just check that the same solution can be described using the nonorthogonal and orthogonal basis states. It is however an important observation for various numerical solution procedures: your set of basis functions can be cleaned up and simplified without affecting the solution you get. Answer