Nonrelativistic mechanics is often based on the use of a potential energy to describe the forces. For example, in a typical molecular dynamics computation, the forces between the molecules are derived from a potential that depends on the differences in position between the atoms. Unfortunately, this sort of description fails badly in the truly relativistic case.
The basic problem is not difficult to understand. If a potential depends only on the spatial configuration of the atoms involved, then the motion of an atom instantaneously affects all the other ones. Relativity simply cannot handle instantaneous effects; they must be limited by the speed of light or major problems appear. It makes relativistic mechanics more difficult.
The simplest way to deal with the problem is to look at collisions between particles. Direct collisions inherently avoid erroneous action at a distance. They allow simple dynamics to be done without the use of a potential between particles that is relativistically suspect.
As a first example, consider two particles of equal mass and opposite speeds that collide as shown in the center of figure 1.3. You might think of the particles as two helium atoms. It will be assumed that while the speed of the atoms may be quite high, the collision is at a shallow enough angle that it does not excite the atoms. In other words, it is assumed that the collision is elastic.
As seen by observer C, the collision is perfectly symmetric. Regardless of the mechanics of the actual collision, observer C sees nothing wrong with it. The energy of the helium atoms is the same after the collision as before. Also, the net linear momentum was zero before the collision and still zero afterwards. And whatever little angular momentum there is, it too is still the same after the collision.
But now consider an observer A that moves horizontally along with the
top helium atom. For this observer, the top helium atom comes down
vertically and bounces back vertically. Observer B moves along with
the bottom helium atom in the horizontal direction and sees that atom
moving vertically. Now consider the Lorentz transformation
(1.7) of the vertical velocity
of the top atom
as seen by observer A into the vertical velocity
of that atom as
seen by observer B:
Clearly, linear momentum conservation is too fundamental a concept to
be summarily thrown out. Instead, observer A perceives the mass of the
rapidly moving lower atom to be the moving mass
,
by the Lorentz factor:
It is not difficult to understand why things are like this. The
nonrelativistic definition of momentum allows two plausible
generalizations to the relativistic case:
A little thought shows that the right time differential has to be
.
transforms like
,
1,2,...is the same before and after,
Using the chain rule of differentiation, the components of
the momentum four-vector
can be written out as
| (1.14) |
![]() |
(1.15) |
How about the zeroth component? Since it too is part of the
conservation law, reasonably speaking it can only be the relativistic
equivalent of the nonrelativistic kinetic energy. Indeed, it equals
except for a trivial scaling factor 1![]()
to give it units of
momentum.
Note that so far, this only indicates that the difference between
and
gives the kinetic energy. It does not imply that
by itself also corresponds to a meaningful energy. However,
there is a beautifully simple argument to show that indeed kinetic
energy can be converted into rest mass, [20]. Consider
two identical rest masses
that are accelerated to high speed and
then made to crash into each other head-on, as in the left part of
figure 1.4. In this case, think of the masses as
macroscopic objects, so that thermal energy is a meaningful concept
for them. Assume that the collision has so much energy that the
masses melt and merge without any rebound. By symmetry, the combined
mass
has zero velocity. Momentum is conserved: the net momentum
was zero before the collision because the masses had opposite
velocity, and it is still zero after the collision. All very
straightforward.
But now consider the same collision from the point of view of a second
observer who is moving upwards slowly compared to the first observer
with a small speed
.
.
before the collision and so has the mass
after the collision. But then vertical momentum conservation
inevitably implies
Lagrangian mechanics can simplify many complicated dynamics problems. As an example, in this section it is used to derive the relativistic motion of a particle in an electromagnetic field.
Consider first the nonrelativistic motion of a particle in an
electrostatic field. That is an important case for this book, because
it is a good approximation for the electron in the hydrogen atom. To
describe such purely nonrelativistic motion, physicists like to define
a Lagrangian as
| (1.16) |
Physicists next define canonical, or generalized, momentum as the
partial derivative of the Lagrangian with respect to velocity. An
arbitrary component
of the canonical momentum is found as
![]() |
(1.17) |
![]() |
(1.18) |
Since the Lagrangian is a just a scalar, it is relatively simple to
guess its form in the relativistic case. To get the momentum right,
simply replace the kinetic energy by an reciprocal Lorentz factor,
The potential energy part of the Lagrangian is a bit trickier. The
previous section showed that momentum is a four-vector including
energy. Therefore, going from one observer to another mixes up energy
and momentum nontrivially, just like it mixes up space and time. That
has consequences for energy conservation. In the classical solution,
kinetic energy of the particle can temporarily be stored away as
electrostatic potential energy and recovered later intact. But
relativistically, the kinetic energy seen by one observer becomes
momentum seen by another one. If that momentum is to be recovered
intact later, there should be something like potential momentum.
Since momentum is a vector, obviously so should potential momentum be:
there must be something like a vector potential
.
Based on those arguments, you might guess that the Lagrangian should
be something like
![]() |
(1.20) |
The Lagrangian equations of motion become, the same way as before, but
after clean up and in vector notation, {D.6}:
| (1.21) |
Of course, if the Lagrangian above is right, it should apply to all
observers, regardless of their relative motion. In particular, all
observers should agree that the so-called “action”
integral
is stationary for the way that the particle
moves, {A.1.3}, {D.3.1} That
requires that
transforms according to the Lorentz
transformation.
(To see why, recall that dot products are the same for all observers,
and that the square root in the Lagrangian (1.19) equals
![]()
![]()
where the proper time interval
is the
same for all observers. So the action is the same for all observers.)
From the Lorentz transformation of
,
might be more
fundamental physically than the more intuitive electric and magnetic
fields. And that is in fact exactly what more advanced quantum
mechanics shows, chapter 13.1.
It may be noted that the field strengths are unchanged in a “gauge transformation” that modifies
and
into
| (1.22) |
The energy can be found following addendum {A.1} as