Professional Documents
Culture Documents
Relativistic Quantum Mechanics: Andreas Brandhuber
Relativistic Quantum Mechanics: Andreas Brandhuber
Andreas Brandhuber
Department of Physics
Queen Mary, University of London
Mile End Road, London, E1 4NS
United Kingdom
email: a.brandhuber@qmul.ac.uk
Abstract
Lecture notes for the course Relativistic Quantum Mechanics MSci 4241/QMUL PHY-
414. (These are preliminary notes and may contain typos, so use with care!)
1 Symmetries and Angular Momentum
First consider a single particle. If ~x is the position vector, then a translation is the
operation
~x → ~x0 = ~x + ~a . (1.5)
If H
b is invariant then
b x0 ) = H(~
H(~ b x + ~a) = H(~
b x) . (1.6)
For an infinitesimal displacement we can make a Taylor expansion2
b x + ~a) ∼
H(~ = H(~ ~ H(~
b x) + ~a · ∇ b x) , (1.7)
b x + ~a) − H(~
0 = H(~ ~ H(~
b x) = ~a · ∇ b x) . (1.8)
1
i.e. the QM operator A
b corresponding to the observable A obeys ∂ A/∂t
b = 0.
2
The symbol ∼= indicates that we only expand to first order in ~a and suppress all higher order terms.
1
In general for the momentum operator P~ and any other operator O(~
b x) we have
b
where we have suppressed the explicit ~x and t dependence. Since eqn. (1.9) is true for
arbitrary wavefunctions Ψ
[P~ , O(~ ~ O(~
x)] = −i~∇ b x) . (1.10)
b b
In particular for O
b=H
b
[P~ , H] ~H
= −i~∇ b. (1.11)
b b
Now,
~H
0 = −i~~a · ∇ ~ , H]
b = ~a · [P
b b (1.12)
and since this is true for an arbitrary displacement vector ~a we find
[P~ , H] = 0. (1.13)
b b
~i
dhP
We conclude that momentum is a conserved quantity dt
= 0 if the H
b is translationally
invariant.
Take e.g.
2
Hb =−~ ∇ ~ 2 + V (~x) (1.14)
2m
For the translation, ~x0 = ~x + ~a, in particular x0 = x + a
∂ ∂x0 ∂ ∂
⇒ = = (1.15)
∂x ∂x ∂x0 ∂x0
~ and ∇
and similarly for y and z. Thus ∇ ~ 2 are invariant under translations and
2
b x + ~a) = − ~ ∇
b x0 ) = H(~
H(~ ~ 02 + V (~x0 )
2m
~2 ~ 2
= − ∇ + V (~x + ~a) . (1.16)
2m
Hence, for a translationally invariant Hamiltonian we must require
which is only true for a (trivial) constant potential, i.e. for a free particle. Thus, the
momentum of a free particle is conserved in QM in the sense
dhP~ i
= 0. (1.18)
dt
2
Consider now a two particle system (easily generalised to N particles). If the two
particles have position vectors ~x1 and ~x2 , the invariance condition for the translation of
the system through ~a reads
H(~
b x1 , ~x2 ) = H(~
b x1 + ~a, ~x2 + ~a) . (1.19)
P~ = P~ 1 + P~ 2 , (1.22)
b b b
where
P~ 1 = −i~∇
~ 1 and P
~ 2 = −i~∇
~2. (1.23)
b b
and so
[P~ , O(~ ~1+∇
x1 , ~x2 )] = −i~(∇ ~ 2 )O(~
b x1 , ~x2 ) . (1.25)
b b
~1+∇
0 = −i~~a · (∇ ~ 2 )H ~ , H]
b = ~a · [P
b b . (1.26)
Since this must be true for arbitrary translation vector ~a, we have
[P~ , H] = 0, (1.27)
b b
~i
dhP
and total momentum is conserved in the sense of QM i.e. dt
= 0.
Take spherical polar coordinates and take the axis of rotation to be the z-axis. Spec-
ify the position vector ~x in spherical polar coordinates (r, θ, φ) of the point. Then the
3
symmetry operation ~x → ~x0 corresponding to a rotation by an angle α about the z−axis
is
(r, θ, φ) → (r0 , θ0 , φ0 ) = (r, θ, φ + α) . (1.28)
For the Hamiltonian to be invariant under rotations about the z-axis
b x0 ) = H(~
H(~ b x)
b 0 , θ0 , φ0 ) = H(r,
⇐⇒ H(r b θ, φ)
⇐⇒ H(r,
b θ, φ + α) = H(r, b θ, φ) . (1.29)
b θ, φ + α) ∼
H(r, b θ, φ) + α ∂ H(r,
= H(r, b θ, φ) , (1.30)
∂φ
0 = b θ, φ + α) − H(r,
H(r, b θ, φ) = α ∂ H(r,
b θ, φ)
∂φ
∂ b
−→ H(r, θ, φ) = 0 . (1.31)
∂φ
bz = −i~ ∂
L (1.32)
∂φ
b = −i~( ∂ (OΨ)
b = −i~[ ∂ , O] b ∂ Ψ) = −i~ ∂ O Ψ .
b
[L
bz , O]Ψ b −O (1.33)
∂φ ∂φ ∂φ ∂φ
This is true for arbitrary wavefuntions Ψ, thus
b = −i~ ∂O
b
[L
bz , O] , (1.34)
∂φ
[L
bz , H]
b = 0. (1.36)
3
In cartesian coordinates ~x = (x, y, z): L
b z = (~x ~ )z = −i~(x ∂ − y ∂ ) . The other components, L
b×P
b bx
∂y ∂x
and Lb y , can be obtained by cyclic permutation of (x, y, z).
4
We can define angles φx and φy analogous to φz ≡ φ for rotations about the x and y-axis.
If H
b is also invariant under rotations about the x and y-axis we will conclude that
[L
bx , H]
b = [L
by , H]
b = [L
bz , H] ~b H]
b = 0 i.e. [L, b = 0. (1.37)
H
b = H(r
b 1 , θ1 , φ1 ; r2 , θ2 , φ2 ) . (1.40)
The invariance condition for rotation of the system by an angle α about the z-axis is
H(r
b 1 , θ1 , φ1 ; r2 , θ2 , φ2 ) = H(r
b 1 , θ1 , φ1 + α; r2 , θ2 , φ2 + α) , (1.41)
0 = b 1 , θ1 , φ1 + α; r2 , θ2 , φ2 + α) − H(r
H(r b 1 , θ1 , φ1 ; r2 , θ2 , φ2 )
= α(∂φ1 + ∂φ2 )H b
−→ (∂φ1 + ∂φ2 )H
b = 0. (1.43)
The z-components of the orbital AM operator for the two particles are
b1z = −i~∂φ1 , L
L b2z = −i~∂φ2 . (1.44)
5
The z-component of the total orbital AM is:
L
bz = L
b1z + L
b2z . (1.45)
[L b = −i~(∂φ1 + ∂φ2 )O
bz , O] b, (1.46)
and in particular
[L b = −i~(∂φ1 + ∂φ2 )H
bz , H] b. (1.47)
If the Hamiltonian is invariant under rotations about the z-axis, we now conclude that
[L
bz , H]
b = 0. (1.48)
[L
bx , L
by ] = i~L
bz , [L
bx , L
by ] = i~L
bz , [L
bx , L
by ] = i~L
bz . (1.51)
~
Consequently, it is only possible to know simultaneously the values of one component L
~ 2 ~ 2
and L , for example, Lz and L . Therefore, we may take the conserved quantities to be
Lz and L~ 2.
An atom with atomic number Z with Coulomb forces between the nucleus and the
electrons and between the electrons is an example of a system with the necessary rotational
invariance for conservation of AM. In this case:
Z Z Z
~2 X ~ 2 X Ze2 X e2
H=−
b ∇ − + , (1.52)
2m i=1 i i=1
4π0 |~xi | i,j=1,i<j 4π0 |~xi − ~xj |
where ∇~ i acts on the coordinates of the i-th electron, and ~xi is the position vector of
the i-th electron relative to the nucleus. As before, ∇ ~ 2 and |~xi | are scalars (rotationally
i
invariant) and so is |~xi − ~xj |, so that H
b is rotationally invariant.
J~ = L
b ~b b ~
+S (1.53)
6
that commutes with the Hamiltonian
b~ b
[J, H] = 0 (1.54)
b is rotationally invariant, and then hJz i and hJ~2 i are constants of motion.
if H
For the atomic (non-relativistic) Hamiltonian above, hLz i and hL ~ 2 i are also constants
of motion, because the spin does not appear explicitly in the Hamiltion. This is a mani-
festation of the fact that spin is an effect of Special Relativity as we will see later in the
course. However, if we include the Spin-Orbit interaction due to Relativistic effects
1 1 dV ~b b ~,
H
b Spin−Orbit = L·S (1.55)
2m2e c2 r dr
For a particle with orbital AM operator L ~b and spin angular momentum operator S
~ we
b
often need to construct the eigenstates of the total AM operator
J~ = L
b ~b b ~,
+S (1.56)
Recall from QM that for eigenvalue of L ~ 2 equal to l(l + 1)~2 with l an integer, the
eigenvalues of Lz are ml ~ with ml taking values ml = −l, −l + 1, . . . , l − 1, l and similarly
~ and J~ where s and j are integer or half-integer.
for S
Also for a 2 particle system, we often need simultaneous eigenstates |jmi of the total
AM operators J~2 and Jz in terms of simultaneous eigenstates |j1 m1 i of J~12 and J1z and
|j2 m2 i of J~22 and J2z .
In the following we assume that all eigenstates are ”orthonormalized” in the familiar
fashion, e.g. hjm|j 0 m0 i = δjj 0 δmm0 .
4
In QM we encounter the eigenstates |l, ml i when the Schrödinger equation is solved in spherical
coordinates. Usually they are denoted as spherical harmonics Yl,ml (θ, φ) but their explicit form will not
be needed in the following.
7
Clebsch-Gordon coefficients are defined to be the coefficients in the expansion
X
|jmi = C(j1 m1 , j2 m2 |jm)|j1 m1 i|j2 m2 i . (1.57)
m1 ,m2
• We have
J~ = J~1 + J~2 −→ Jz = J1z + J2z , (1.58)
thus for any simultaneous eigenstate |Ψi of J1z and J2z we have
where mu ~, m1u ~ and m2u ~ are the maximal eigenvalues of Jz , J1z and J2z , respec-
tively. Since these maximal eigenvalues are just j~, j1 ~ and j2 ~, we must have
j ≤ j1 + j2 . (1.61)
⇒ j ≥ j1 − j2 . (1.62)
Similarly, if j2 > j1 ,
⇒ j ≥ j2 − j1 . (1.63)
In general:
j ≥ |j1 − j2 | (1.64)
Thus, the possible values of j that can be constructed from j1 and j2 are
8
Explicit Construction of Clebsch-Gordon Coefficients
j1 = 12 , j2 = 1
We shall need to know J− |j, mi where J± = Jx ± iJy are the familiar raising and
lowering operators. Note that they are the adjoint operators of each other, i.e. J+† = J−
and J−† = J+ and obey J+ J− = J~2 − Jz2 + ~Jz .
The constant of proportionality can be determined (without proof) with the result
p
J− |j, mi = (j − m + 1)(j + m)~|j, m − 1i . (1.66)
Begin by constructing the state with maximum value of j and maximum value of m,
i.e. |j = 23 , m = 32 i.
Since m = m1 + m2 and m1 = ± 21 and m2 = −1, 0, +1, there is only one way to get
m = 32 , namely 23 = 12 + 1. Thus,
3 3 1 1
|j = , m = i = |j1 = , m1 = i|j2 = 1, m2 = 1i . (1.67)
2 2 2 2
We can now use the lowering operator J− to construct states with the same value for
j but smaller values of m.
J~ = J~1 + J~2
⇒ J− = (J1 )− + (J2 )− . (1.68)
From now on we will take natural units with ~ = 1. Apply eqn. (1.68) to eqn. (1.67).
The left hand side gives
3 3 p 3 1 √ 3 1
J− |j = , m = i = (3/2 − 3/2 + 1)(3/2 + 3/2)|j = , m = i = 3|j = , m = i ,
2 2 2 2 2 2
(1.69)
9
whereas the right hand side gives
3 3 1 1
J− |j = , m = i = ((J1 )− + (J2 )− )|j1 = , m1 = i|j2 = 1, m2 = 1i
2 2 2 2
1 1
= ((J1 )− |j1 = , m1 = i)|j2 = 1, m2 = 1i +
2 2
1 1
|j1 = , m1 = i((J2 )− |j2 = 1, m2 = 1i)
2 2
p 1 1
= (1/2 − 1/2 + 1)(1/2 + 1/2)|j1 = , m1 = − i|j2 = 1, m2 = 1i +
2 2
p 1 1
(1 − 1 + 1)(1 + 1)|j1 = , m1 = i|j2 = 1, m2 = 0i , (1.70)
2 2
thus,
3 3 1 1
J− |j = , m = i = ((J1 )− + (J2 )− )|j1 = , m1 = i|j2 = 1, m2 = 1i
2 2 2 2
1 1
= |j1 = , m1 = − i|j2 = 1, m2 = 1i +
2 2
√ 1 1
2|j1 = , m1 = i|j2 = 1, m2 = 0i . (1.71)
2 2
Comparing left and right hand side we obtain,
3 1 1 1 1
|j = , m = i = √ |j1 = , m1 = − i|j2 = 1, m2 = 1i +
2 2 3 2 2
√
2 1 1
√ |j1 = , m1 = i|j2 = 1, m2 = 0i . (1.72)
3 2 2
Now we can apply J− again to this state, and after a similar calculation as before we
obtain
√
3 1 2 1 1
|j = , m = − i = √ |j1 = , m1 = − i|j2 = 1, m2 = 0i +
2 2 3 2 2
r
1 1 1
|j1 = , m1 = i|j2 = 1, m2 = −1i . (1.73)
3 2 2
10
1
How can we obtain the j = 2
states?
Start from the state with j = 12 and the maximum value of m (for that j) i.e. |j =
1
2
,m = 21 i. This state has the same value of m as |j = 32 , m = 21 i and must be a linear
combination of the same |j1 , m1 i|j2 , m2 i states. However, because it has different value of
j it must be orthogonal to |j = 23 , m = 21 i. (Distinct eigenvalues of a Hermitian Operator
have orthogonal eigenfunctions.)
Write,
1 1 1 1
|j = , m = i = c1 |j1 = , m1 = − i|j2 = 1, m2 = 1i +
2 2 2 2
1 1
c2 |j1 = , m1 = i|j2 = 1, m2 = 0i . (1.75)
2 2
Orthogonality to |j = 23 , m = 21 i implies
3 1 1 1
hj = , m = |j = , m = i = 0 , (1.76)
2 2 2 2
hence,
c 1 1 1 1
√1 hj1 = , m1 = − |j1 = , m1 = − ihj2 = 1, m2 = 1|j2 = 1, m2 = 1i +
3 2 2 2 2
√
c2 2 1 1 1 1
√ hj1 = , m1 = |j1 = , m1 = ihj2 = 1, m2 = 0|j2 = 1, m2 = 0i +
3 2 2 2 2
c 1 1 1 1
√2 hj1 = , m1 = − |j1 = , m1 = ihj2 = 1, m2 = 1|j2 = 1, m2 = 0i +
3 2 2 2 2
√
c1 2 1 1 1 1
√ hj1 = , m1 = |j1 = , m1 = − ihj2 = 1, m2 = 0|j2 = 1, m2 = 1i
3 2 2 2 2
= 0. (1.77)
Due to ”orthonormality” of the eigenstates, the third and fourth line of this equation
vanish and from the rest we get
√
r
c1 2
√ + c2 = 0 ⇒ c1 = − 2c2 . (1.78)
3 3
11
Take c1 and c2 real5 , then
2c22 + c22 = 3c22 = 1
r
1 2
⇒ c2 = √ ⇒ c1 = − , (1.81)
3 3
so
r
1 1 2 1 1
|j = , m = i = − |j1 = , m1 = − i|j2 = 1, m2 = 1i
2 2 3 2 2
1 1 1
+ √ |j1 = , m1 = i|j2 = 1, m2 = 0i . (1.82)
3 2 2
Using again the J− operator we can also obtain the state |j = 21 , m = − 21 i with the
result
r
1 1 2 1 1
|j = , m = − i = |j1 = , m1 = i|j2 = 1, m2 = −1i
2 2 3 2 2
1 1 1
− √ |j1 = , m1 = − i|j2 = 1, m2 = 0i . (1.83)
3 2 2
12
degenerate in energy. This degeneracy is a manifestation of the Rotational Symmetry of
the Hydrogen atom.
It is known that for 11 H, not only are all the states with the same AM Quantum Number
l degenerate in energy, but also are all the states with l = 0, 1, . . . , n − 1 corresponding to
the Principle Quantum Number n (the proof can be found in standard QM books). We
want to show here that this is a consequence of a larger symmetry group that includes
Rotational invariance.
In classical physics it can be shown that for a central potential V (|~x|) not only is
~ = ~x × p~ ,
L (1.87)
a constant of motion, but so also is the Lenz vector (in units where me = 1)
e2 ~x
~ 1 ~ ~
M=√ L × ~x − ~x × L + . (1.88)
−8E 2π0 |~x|
Correspondingly, in QM both
~b = ~x
L b×b
p~ , (1.89)
and !
2
~ =p1 ~b + e ~x ,
b
M
c ~b × ~x
L b − ~x
b×L (1.90)
−8H
b 2π0 |~x|
commute with H.
b If we define the linear combinations
b~ 1 ~b c ~ = 1 (L
~ ) and K ~b − M
~)
I = (L + M (1.91)
b c
2 2
it can be shown that I~ and K ~ commute with each other and they behave like AM operators
b b
i.e.
[Ibx , Iby ] = iIbz , [K
bx, K
b y ] = iK
b z , plus cyclic permutations (1.92)
with ~ = 1. This is referred to as the O(4) or SU (2) × SU (2) algebra. With some effort
it can be shown that the Hamiltonian can be written in the following simple form
2 2
b =− 1 1 e
H 2 2 (1.93)
4 b~ ~ + 1 4π 0
I +K
b
2
2 2
Because I~ and K~ each obey the AM algebra, the eigenvalues of b
I~ and K
~ are of the form
b b b
i(i + 1) and k(k + 1) with i and k integers (with ~ = 1).
13
These energy levels have degeneracy (2i + 1)(2k + 1).
Because of the properties of the triple scalar product ~a · (~b × ~c) we have
~b · M
L ~ = 0,
c
(1.95)
thus,
2
b~2 2 2
I =K~ =1
b ~b + M
L ~
c
(1.96)
4
and we must have i = k. If we write n = 2i + 1 = 2k + 1:
n2 − 1
⇒ i(i + 1) = (1.97)
4
Then the energy levels are
2 −1 2
e2 n2 − 1 n2 − 1 1 e2
1 1
En = − + + =− 2 . (1.98)
4 4π0 4 4 2 2n 4π0
The Degeneracy is also the Expected Degeneracy because we have degenerate states
with l = 0, 1, . . . , n − 1 and for each l, ml = −l, . . . , l. Thus there are
n−1
X
(2l + 1) = n2 (1.99)
l=0
14
If aµ and bµ are four-vectors
a · b ≡ a0 b 0 − a1 b 1 − a2 b 2 − a3 b 3 (2.1)
is a scalar (invariant) under LT’s. It has the same value for all observers just like
(x0 )2 − (x1 )2 − (x2 )2 − (x3 )2 = c2 t2 − x2 − y 2 − z 2 (2.2)
2 µ E2
p ≡ p pµ ≡ (p ) − (p ) − (p ) − (p ) = 2 − p~2 = m2 c2
0 2 1 2 2 2 3 2
c
⇒ E 2 = p~2 c2 + m2 c4 . (2.4)
If we define the differential operator 4-vectors, ∇ = µ ∂ ~
, −∇ = 1 ∂ ~
, −∇ and
∂x0 c ∂t
~ = 1 ∂ ,∇
∇µ = ∂x∂ 0 , ∇ ~ , then the 4-momentum operator is
c ∂t
~2
p
In non-relativistic QM, the free Hamiltonian H = E = 2m
is quantised by the substitution
∂ ~
H → i~ , p~ → −i~∇ (2.8)
∂t
15
to give the Schrödinger equation
∂Ψ ~2 ~ 2
i~ =− ∇ Ψ. (2.9)
∂t 2m
16
Furthermore, this implies
∇ · (Ψ∗ ∇Ψ − Ψ∇Ψ∗ ) = 0
∂ 1 ∗ ∗ ~
∗~ ~ ∗
⇒ (Ψ ∂t Ψ − Ψ∂ t Ψ ) − ∇ · Ψ ∇Ψ − Ψ∇Ψ = 0. (2.16)
∂t c2
After multiplying through by ic2 , in order to make ρ real, we can write this as desired as
a continuity equation with
ρ = i (Ψ∗ ∂t Ψ − Ψ∂t Ψ∗ ) and J~ = −ic2 Ψ∗ ∇Ψ~ − Ψ∇Ψ~ ∗ . (2.17)
Now because of the negative sign between the two terms in ρ, the probability density can
both take positive and negative values (in contrast to non-relativistic QM where ρ = |Ψ|2
is positive definite)! This is an absurd and nonsensical result for a probability density!!
Junk the KG equation for the moment and try harder. Schrödinger was the first
to write down this relativistic wave equation, but discarded it for a different reason; the
spectrum is not bounded from below. This was the historical route. See later for a rebirth
of the KG equation thanks to Feynman.
b = i~ ∂Ψ
HΨ (2.18)
∂t
and try to give a meaning to the square root in
q q
b 2 4 b 2 ~2.
H = m c + p~c = m2 c4 − ~2 c2 ∇ (2.19)
∂
Since the equation is linear in ∂t
, Lorentz covariance suggests it should be linear also
in the ∂x∂ i , i = 1, 2, 3.
So we write
H
b = c~ p~ + βmc2
α·b
= −i~c~α·∇~ + βmc2 , (2.20)
where αi and β are coefficients to be determined. More explicitly this equation can be
written
1 ∂ 2 ∂ 3 ∂
b = −i~c α
H +α +α + βmc2 . (2.21)
∂x1 ∂x2 ∂x3
17
Now we determine the coefficients αi and β by requiring that this linear operator ”squares”
to the KG operator
H ~ 2 + m 2 c4 .
b 2 = −~2 c2 ∇ (2.22)
We find
2 2 2
2 2 2 1 2 ∂ 2 2 ∂ 3 2 ∂
H = −~ c (α )
b + (α ) + (α ) + β 2 m2 c4
∂(x1 )2 ∂(x2 )2 ∂(x3 )2
3 1 1 ∂
−i~mc (α β + βα ) 1 + . . .
∂x
∂2
2 2 1 2 2 1
−~ c (α α + α α ) 1 2 + . . . . (2.23)
∂x ∂x
Thus we need to solve
(αi )2 = I , i = 1, 2, 3
αi αj + αj αi = 0 , i 6= j
β2 = I
αi β + βαi = 0 , i = 1, 2, 3 , (2.24)
where I denotes a unit matrix (if a subscript is added it denotes the dimensionality, e.g.
I2 denotes a 2 × 2 unit matrix).
It is obviously NOT possible to solve those equations if the coefficients are simply
complex numbers. So let us assume that they are N ×N matrices. With some (guess)work
it can be shown that the smallest value of N for which eq. (2.24) can be solved is N = 4.
This implies that the Dirac wave function is a 4-component column vector
Ψ1 (x)
Ψ2 (x)
Ψ(x) =
Ψ3 (x) (2.25)
Ψ4 (x)
where x ≡ (x0 , ~x) and the Dirac equation becomes a matrix equation
∂Ψ ~ + βmc2 )Ψ .
i~ = HΨ
b = (c~ α·bp~ + βmc2 )Ψ = (−i~c~ α·∇ (2.26)
∂t
This is a set of 4 first order linear differential equations to determine Ψ1 , . . . , Ψ4 .
A particular set of solutions of (2.24) for the 4 × 4 matrices αi , β can be written with the
help of the 2 × 2 Pauli matrices σ i , i = 1, 2, 3,
1 0 1 2 0 −i 3 1 0
σ = , σ = , σ = (2.27)
1 0 i 0 0 −1
18
which obey the following identities
2
σ i = I2 , i = 1, 2, 3
σ i σ j + σ j σ i = 0 , i 6= j . (2.28)
We may satisfy the first two lines in eq. (2.24) by taking αi to be the 4 × 4 matrices
i 0 σi
α = , i = 1, 2, 3 . (2.29)
σi 0
Now we may satisfy the remaining two lines in eq. (2.24) by taking
I2 0
β= . (2.30)
0 −I2
(αi )† = αi , β † = β . (2.31)
∂Ψ† ~ †·α
−i~ = i~∇Ψ ~ + mc2 Ψ† β . (2.33)
∂t
(Recall that αi and β are hermitian and (AB)† = B † A† .)
Now take Ψ† ×(Dirac eqn.) and (Hermitian conjugate eqn.)×Ψ and subtract the two
to obtain:
∂(Ψ† )
† ∂Ψ
i~ Ψ + Ψ = −i~c Ψ† α ~ + ∇(Ψ
~ · ∇Ψ ~ †) · α
~Ψ . (2.34)
∂t ∂t
Dividing this equation by i~ we obtain a Continuity Equation
∂ρ ~ ~
+∇·J =0 (2.35)
∂t
19
with the positive definite probability density given by
4
X 4
X
†
ρ=ΨΨ= Ψ∗k Ψk = |Ψk |2 > 0 , (2.36)
k=1 k=1
20
By acting with the Hamiltonian operator H b = i~ ∂ we find that the first term in the
∂t
solution (2.42) carries positive energy (+mc2 ) whereas the second term carries negative
energy (−mc2 ).
i~∂t Ψ = HΨ
b with H
b = c~ p~ + βmc2
α·b (2.43)
~
p~ = −i~∇.
with b
0 0 0 −1
1
⇒ we are describing spin 2
particles (and anti-particles).
21
2.8 The Covariant Form of the Dirac Equation
γ µ ≡ γ 0 , ~γ = γ 0 , γ 1 , γ 2 , γ 3 ,
(2.54)
which makes the Dirac equation now look Lorentz covariant.
Using the properties of the αi and β we may show that the gamma-matrices obey the
following anti-commutation identity
{γ µ , γ ν } ≡ γ µ γ ν + γ ν γ µ = 2g µν I4 , (2.58)
22
1 0 0 0
0 −1 0 0
with g µν =
0 0 −1 0 .
0 0 0 −1
In particular, (γ 0 )2 = I4 , (γ i )2 = −I4 , i = 1, 2, 3.
µ
A 4-vector, a , may be regardedas a contravariant vector (under LT’s) and
1 0 0 0
0 −1 0 0
gµν ≡ g µν = as a Metric Tensor. Then, aµ ≡ 3 gµν aν ≡ gµν aν is
P
0 0 −1 0 ν=0
0 0 0 −1
a covariant vector. Summation over repeated indices is understood (Einstein summation
convention), where one of the indices always is an upper (contravariant) index and the
other is a lower (covariant) index.
→ a · b = aµ b µ
23
2.11 Plane Wave Solutions of the Dirac Equation
From now on we will work in natural units ~ = c = 1. We look for plane wave solutions
of the Dirac equation of the form
∓ip·x φ
Ψ=e (2.62)
χ
where φ are the upper two components and χ the lower two components of Ψ and p · x =
Et − p~ · ~x, with E > 0.
The factor e−ip·x gives solutions with positive energy E and momentum p~, and the
factor e+ip·x gives solutions with negative energy −E and momentum −~p. Substituting
back into the Dirac equation
~ + βm Ψ = i ∂Ψ ,
−i~
α·∇ (2.63)
∂t
and using ∂t Ψ = ∓iEΨ and ∂x Ψ = ±ipx Ψ e.t.c., we obtain
α · p~ + βm)Ψ = i(∓iE)Ψ
(−i(±i)~
→ (±~α · p~ + βm)Ψ = ±EΨ . (2.64)
If we use the standard representation for the αi and β from Section 2.4 we obtain
mI ±~σ · p~ φ φ
= ±E (2.65)
±~σ · p~ −mI χ χ
which gives the coupled set of equations
(m ∓ E)φ ± ~σ · p~χ = 0
±~σ · p~φ − (m ± E)χ = 0 . (2.66)
We will construct the solutions in such a way that they have a straighforward p~ = 0 limit.
(m − E)φ + ~σ · p~χ = 0
~σ · p~φ − (m + E)χ = 0 . (2.67)
For p~ = 0, E = m and χ = 0 (in agreement with Section 2.6).
24
then,
−ip·x φ
Ψ=e σ ·~
~ p . (2.69)
(E+m)
φ
Note that the first equation in (2.67) only gives the on-(mass)shell condition E 2 = p~2 +m2 .
1 0
We can write φ in terms of φ1 = and φ2 = .
0 1
where
√
φs
U (p, s) = E+m σ ·~
~ p (2.71)
(E+m) s
φ
is a positive energy Dirac spinor. In the last expression a convenient normalization factor
has been introduced.
It can be checked that the first equation in (2.67) is automatically satisfied by using
the identity (~σ · p~)2 = p~2 I2 .
(m + E)φ − ~σ · p~χ = 0
−~σ · p~φ − (m − E)χ = 0 . (2.72)
For p~ 6= 0 it is convenient to solve for φ in terms of χ using the first equation in (2.72),
i.e.
~σ · p~
φ= χ (2.73)
(E + m)
then, σ ·~
~ p
+ip·x (E+m)
χ
Ψ=e . (2.74)
χ
Note that the second equation in (2.72) only gives the on-(mass)shell condition E 2 =
p~2 + m2 .
0 1
We can write χ in terms of χ1 = and χ2 = .
1 0
25
There are thus two independent negative energy solutions
Ψ = e+ip·x V (p, s) , s = 1, 2 (2.75)
where
√ σ ·~
~ p
(E+m) s
χ
V (p, s) = E+m (2.76)
χs
is a negative energy Dirac spinor. In the last expression a convenient normalization factor
has been introduced.
It can be checked that the second equation in (2.72) is automatically satisfied by using
the identity (~σ · p~)2 = p~2 I2 .
Interpretation
To find the physical interpretation for the four independent solutions we consider the
rest frame p~ = 0. Then:
1
√ √
1
φ 0
U (p, 1) = 2m = 2m
0 0
0
0
√ √
2
φ 1
U (p, 2) = 2m = 2m
0 0
0
0
√ √
0 0
V (p, 1) = 2m 1 = 2m
χ 0
1
0
√ √
0 0
V (p, 2) = 2m = 2m (2.77)
χ2 1
0
~ = ~x × p~ = 0 so that the total angular momentum
Furthermore, for p~ = 0 we have L
operator becomes
J~ = L~ +S ~=S ~ = 1Σ~
1 2
σ3 0
⇒ Sz = 2 (2.78)
0 12 σ3
Thus, U (p, 1) and U (p, 2) are positive energy solutions with Sz eigenvalues sz = +1/2
and sz = −1/2 respectively, whereas V (p, 1) and V (p, 2) are negative energy solutions
with Sz eigenvalues sz = −1/2 and sz = +1/2 respectively.
26
In general, U and V are the Lorentz boosts of these solutions to a frame where p~ 6= 0.
Interpret the negative energy solutions later.
i.e. U (p, s) obeys the Dirac equations with pbµ simply replaced by pµ . Similarly, we find
We will often make use of the adjoint spinors which are defined as
One may also check directly (using again (~σ · p~)2 = p~2 I2 ) that
and
U (p, s)U (p, s) = 2m , V (p, s)V (p, s) = −2m , s = 1, 2 . (2.84)
Since the Dirac equation has negative energy solutions, why do positive energy electrons
not radiate energy and fall into a negative energy state? Dirac: Negative energy states
are completely filled and the Pauli exclusion principle (which applies to fermions) forbids
the transition. Consequently, the Vacuum is a state with all positive energy states empty
but all negative energy states filled.
27
(picture goes here)
The absence of a spin-up electron of energy −|E| and momentum −~p is equivalent to
the presence of a spin-down positron of energy +|E| and momentum +~p. (Think about
time running backwards or the arrow in a Feynman diagram reversed)
Thus, the electron wavefunction eip·x V (p, s) corresponding to energy −E and momen-
tum −~p describes a positron of energy +E and momentum +~p. Also, V (p, 1) and V (p, 2)
which describe spin down and spin up negative energy electrons must describe spin up
and spin down positrons.
In general the infinite negative charge of the vacuum produces no effect because the
distribution of charge is homogeneous.
However, consider the effect of a positive energy electron with charge −|e| on the
vacuum. It repels the negative energy electrons and electrically polarises the vacuum.
Thus the physical charge −|e| seen by a test charge at a large distance from the electron
is numerically smaller than the bare charge −|e0 |, i.e. |e| < |e0 |.
However, if the test charge comes very close it will see the bare charge −|e0 |. For
S-wave electrons (l = 0) in an atom, the proton sees a charge numerically greater than
the ordinary electric charge |e|. Note that for l > 0 the wavefunction vanishes at the
origin and the proton feels a numerically smaller charge. This effect leads to measurable
shifts of the energy levels of atoms.
28
2.15 Charge Conjugation Symmetry C
C : Ψ → ΨC (2.85)
which turns a positive energy electron wavefunction (e− ) into a negative energy wave
function (e+ ) with the same momentum and spin state. If Ψ = e−ip·x U (p, s) then ΨC =
eip·x V (p, s). The required operation turns out to be
C : Ψ → ΨC = Cγ 0 Ψ∗ , (2.86)
We say that the Dirac equation is charge conjugation invariant. The Dirac equation
may be written as
(iγ µ ∇µ − m)Ψ = 0, (2.87)
taking the complex conjugate gives
(−iCγ 0 (γ µ )∗ ∇µ − m)Ψ∗ = 0
→ (iγ µ (Cγ 0 ))∇µ − mcγ 0 )Ψ∗ = 0
→ (iγ µ ∇µ − m)ΨC = 0 , (2.89)
where we have used the identity Cγ 0 (γ µ )∗ = −γ µ (Cγ 0 ). This shows that ΨC obeys the
same equation as Ψ.
The Dirac equation is also invariant under reflection of space coordinates in the origin
29
The corresponding operation7 on Dirac spinors is
P : Ψ → Ψ0 = P Ψ (2.91)
with P = γ 0 . It can be checked by direct calculation that P U (~p, s) = U (−~p, s) i.e.
p~ → −~p as expected for space inversion, but the spin state is unchanged. Also P V (~p, s) =
−V (−~p, s). The −1 factor indicates that anti-particles have opposite Parity to particles.
~ = ~x × p~, and
p
This is as expected for time reversal, since p~ = m~v / 1 − ~v 2 /c2 and L
~ → −L
thus under time reversal p~ → −~p and L ~ and in particular Lz → −Lz . We assume
that this applies to any AM operator, so that in particular Sz → −Sz . In this case, Ψ0
obeys the same equation as Ψ with ∂t replaced by −∂t .
It is important in the study of the Weak Interactions to know the properties of objects
like Ψγ µ Ψ, Ψγ µ γ5 Ψ, etc, where we introduced
0 1 2 3 0 I
γ5 ≡ iγ γ γ γ = . (2.95)
I 0
7
Since P also transforms the space time coordinates this operation should be written more properly
as Ψ(t, ~x) → Ψ0 (t0 , ~x0 ) = Ψ0 (t, −~x) = P Ψ(t, ~x). Hence Ψ0 (t, ~x) = P Ψ(t, −~x); a similar comment applies
to time reversal T .
30
We list here the behaviour of some of the Dirac covariants under Lorentz transforma-
tions, P, C:
Covariant LT’s P C
ΨΨ scalar +ΨΨ −ΨΨ
Ψγ5 Ψ pseudoscalar −Ψγ5 Ψ −Ψγ5 Ψ
Ψγ µ Ψ 4-vector +Ψγ 0 Ψ +Ψγ µ Ψ
−Ψγ i Ψ
Ψγ µ γ5 Ψ (pseudo) 4-vector −Ψγ 0 γ5 Ψ −Ψγ µ γ5 Ψ
+Ψγ i γ5 Ψ
Thus,
Ψγ 0 Ψ → Ψγ 0 γ 0 γ 0 Ψ = Ψγ 0 Ψ (2.97)
Ψγ i Ψ → Ψγ 0 γ i γ 0 Ψ
= −Ψ(γ 0 )2 γ i Ψ
= −Ψγ i Ψ (2.98)
which is invariant under both P and C because the two negative signs for the U γ i U
cancel for space inversion. Thus the Electromagnetic interactions are both space reflection
invariant and charge conjugation invariant.
31
The corresponding probability amplitude for the Weak Interaction has a factor
U (p2 , s2 )γµ (I − γ5 )U (p1 , s1 )U (p4 , s4 )γ µ (I − γ5 )U (p3 , s3 )
= U γµ U U γ µ U + U γµγ5 U U γ µ γ5 U
−U γµ γ5 U U γ µ U − U γµ U U γ µ γ5 U . (2.100)
The term
U γµ γ5 U U γ µ U
3
X
0 0
= U γ γ5 U U γ U − U γ i γ5 U U γ i U (2.101)
i=1
Note, that in a general theory C, P and T are not preserved, but the combination of
the three transformations CPT is always a symmetry.
2.19 Neutrinos
Some modification of RQM is needed in the physically important case of massless spin-1/2
particles — Neutrinos (with todays experimental evidence of Neutrino oscillations this is
not quite true, nevertheless it is a very good approximation.)
Experimental observation shows that whereas an e− can have Helicity +1/2 or −1/2,
a Neutrino (which is massless) can only have Helicity −1/2 and an Anti-Neutrino can
only have Helicity +1/2.
Thus, whereas we need four degrees of freedom to describe the 2 spin states of an
electron or positron, we need only 2 degrees of freedom to describe the spin states of the
neutrino and anti-neutrino. We need to discard 2 spin states of the Dirac particle.
32
Now we return to the Dirac equation for a positive energy solution of energy E and
momentum p~. However, we choose a different representation of the Dirac matrices (and
hence a different representation of the gamma-matrices). This does not effect the physics
but makes the proof much easier. It may be checked that
0 I ~σ 0
β= , α
~= (2.103)
I 0 0 −~σ
~σ · p~φ + mχ = Eφ
mφ − ~σ · p~χ = Eχ . (2.105)
~σ · p~φ = Eφ
~σ · p~χ = −Eχ , (2.106)
~σ · p~ 1
φ = φ
2|~p| 2
~σ · p~ 1
χ = − χ. (2.107)
2|~p| 2
Thus the upper 2 components of Ψ describe Helicity 1/2, and the lower two describe
helicity −1/2 when Ψ has positive energy. To obtain an appropriate Ψ to describe a
Neutrino we perform a projection that removes the upper 2 components.
and
5 −σ 1 σ 2 σ 3 0 I 0
γ =i 1 2 3 = (2.109)
0 σ σ σ 0 −I
33
1 0 0
If we form 2
(I−γ5 )
= , then we may use it to project the upper 2 components
0 I
and leave the Helicity −1/2 components. Thus,
1
ΨL ≡ (I − γ5 )Ψ , (2.110)
2
with Ψ a positive energy spinor, may be used to describe the neutrino.
If instead we start from a negative energy solution Ψ, the from Section 2.11
α · p~ + βm)Ψ = −EΨ .
(−~ (2.111)
ip·x φ
For Ψ = e we then have
χ
−~σ · p~ mI φ φ
= −E , (2.112)
mI ~σ · p~ χ χ
~σ · p~φ = Eφ
~σ · p~χ = −Eχ , (2.114)
~σ · p~ 1
φ = φ
2|~p| 2
~σ · p~ 1
χ = − χ. (2.115)
2|~p| 2
This is the same as for the positive energy solution. Thus, the upper 2 components of Ψ
still describe Helicity +1/2 and the lower 2 components describe Helicity −1/2. To obtain
an appropriate Ψ to describe an Anti-Neutrino with Helicity +1/2 we need a negative
energy state with Helicity −1/2.
34
2.20 Feynman’s Interpretation of the Klein-Gordon Equation
could give negative values (we have renamed the wavefuntion Ψ by φ).
∂2 ~ 2 + m2 )φ = 0
( 2 −∇ (2.117)
∂t
has positive energy solutions
ρ = |N |2 (±2E) . (2.120)
Thus, negative probabilities come from negative energy solutions. These are (as usual)
the problem.
We need an interpretation for the negative energy solutions of the KG equation. Dirac
Hole theory will NOT work for the spin-0 Bosons described by the KG equation, because
they do not obey the Dirac exclusion principle to give a filled negative energy sea.
Feynman gave an alternative way of interpreting negative energy solutions which works
for both bosons and fermions!
In Feynman diagrams, which are the rules of calculating scattering and decay am-
plitudes in RQM, when Anti-Particles are involved we draw lines for negative energy
particles propagating backwards in time and use Feynman’s interpretation. E.g. for elec-
tromagnetic Electron-Positron scattering via photon exchange there are two diagrams
that contribute:
35
2.21 Dirac Equation in an Electromagnetic Field
pµ → pµ + qAµ , (2.121)
The free particle Dirac equation is (iγ · ∇ − m)Ψ = 0. Making the above substitution
γ · ∇ → γ · ∇ − iqγ · A (2.125)
or
/ − m)Ψ = −qA
(i∇ /. (2.127)
It is sometimes convenient to write the equation in terms of the Dirac matrices i.e. in
Hamiltonian form. Begin with the equation
∂ ~ − m)Ψ = −q(A0 γ 0 − A
~ · ~γ )Ψ
(iγ 0 + i~γ · ∇ (2.128)
∂t
and multiply from left with γ 0 = β. Since (γ 0 )2 = I and γ 0 γ i = ββαi = αi we obtain
∂Ψ ~ + q A)
~ · α + βm Ψ − qA0 Ψ
= (−i∇
i
∂t
∂Ψ ~
→ i = α · Π + βm Ψ − qA0 Ψ , (2.129)
b
∂t
where
~b = −i∇
Π ~ + qA
~ =b ~.
p~ + q A (2.130)
36
2.22 The Magnetic Moment of the Electron
In the non-relativistic limit the rest mass mc2 is the largest energy in the problem (since
|~v |2 << c2 ) and we can write for a positive energy solution
−imt φ
Ψ=e (2.131)
χ
where φ and χ vary slowly with time and will be called large and small components for
reasons that will become clear in a moment.
Substituting in the Dirac equation (with the Dirac representation for β and αi , which
is more appropriat for studying non-relativistic limits) in an electromagnetic field
!
~
0
φ ∂ φ (−qA + m)I ~
σ · Π φ
b
t
me−imt + ie−imt = e−imt ,
χ ∂t χ ~
~σ · Π
b
(−qA0 − m)I χ
(2.132)
multiplying with e+imt and subtracting the first term on the left hand side from both sides
we obtain !
~b
i∂t φ −qA0 I ~σ · Π φ
= . (2.133)
i∂t χ ~ (−qA0 − 2m)I
~σ · Π
b χ
~b
~σ · Π
χ∼ φ (2.135)
2m
~b = b
where Π ~ Hence, for momenta and EM fields small compared to the rest mass
p~ + q A.
χ << φ . (2.136)
~b ,
i∂t φ = −qA0 φ + ~σ · Πχ (2.137)
~b 2
(~σ · Π)
0
⇒ i∂t φ = −qA φ + φ. (2.138)
2m
37
To simplify this further we may use the identity
~b Π)φ
(Π× ~b = (b ~
p~ +q A)×( ~ = (−i∇+q
p~ +q A)φ
b ~ ~
A)×(−i ~
∇+q ~ = −(∇+iq
A)φ ~ ~
A)×( ~
∇+iq ~ .
A)φ
(2.143)
The x component of this expression is
and hence
~b × Π)φ
(Π ~b = −iq Bφ
~ . (2.145)
Now
~b 2 φ = Π
(~σ · Π) ~b · Πφ
~b + q~σ · Bφ
~ , (2.146)
hence, the non-relativistic limit of the Dirac equation in an EM field (also called Pauli
equation) may be written as
!
∂φ (p
b~ + q ~ 2 q~σ · B
A) ~
i = −qA0 + + φ (2.147)
∂t 2m 2m
38
Comparing with the usual form of the non-rel. Schrödinger equation
∂Ψ 1 ~2
i = − ∇ + V Ψ, (2.149)
∂t 2m
we can interpret the last term in eqn. (2.148) as a potential energy −~µspin · B ~ due to
the spin magnetic moment of the electron in an external magnetic field. Thus the spin
magnetic moment is
~
qS ~
qS
~µspin = − ≡ −g (2.150)
m 2m
where g is the so-called gyromagnetic ration. Hence, the Dirac equation predicts g = 2
whereas classically we would expect g = 1. This prediction was confirmed experimentally
and is one of the spectacular successes of the Dirac equation! Including radiative correction
from Quantum Electrodynamics (QED) yields a more precise value of g = 2(1.0011 . . .)
which agrees up to nine digits after the dot with experiment!
l
The problem can be solved using simultaneous eigenstates ψj,m of J~2 , Jz and the parity
operator P (which takes ~x → −~x). The corresponding quantum numbers are j(j + 1), m
and (−1)l , where l is orbital angular momentum.
39
For the spin-1/2 electron, the allowed values of j are j = l ± 1/2. The gory details of
the calculation can be found in section 2.3.2 of [4], with the result for the energy levels
1 Z 2 α2 1 Z 4 α4
1 3 6
En,j = me 1 − − − + O((Zα) ) (2.156)
2 n2 2 n3 j + 1/2 4n
This result predicts correctly the splitting of the energy levels with the same princi-
ple quantum number n but different j (Fine Splitting); It does not predict the observed
splitting of energy levels with the same n and j but different parity (−1)l (Lamb Shift).
This requires the quantization of the EM field Aµ and, hence, the use of Quantum Elec-
trodynamics (QED). Other important quantum corrections are discussed in [4].
3 Propagator Theory
3.1 Introduction
We need a method for calculating the rates of physical processes such as scattering of
particles or the decay of a particle into other particles in RQM. In non-relativistic QM
this is done using time-dependent perturbation theory. We shall develope an alternative
approach that can be generalized to RQM → Porpagator Theory. This is equivalent to a
complete solution of the Schrödinger equation.
The starting point is the Huygen’s Principle: In Optics this says that the propagation of
a light wave can be understood by assuming that each point on the wave front acts as
a source of a secondary wave which spreads out spherically. This idea can be applied to
QM, since the Schrödinger equation is a linear differential equation and, hence, any linear
combination of known solutions is also a solution.
If the wave function ψ(t, ~x) is known at some time t, then for some later time t0 > t
the wave function ψ(t0 , ~x0 ) may be found by treating each point of space at time t as a
source of a spherical wave. At the time t0 , the amplitude of the wave which propagated
from ~x at time t to ~x0 at time t0 will be proportional to ψ(t, ~x). We write the contribution
to ψ(t0 , ~x0 ) as
iG(t0 , ~x0 ; t, ~x)ψ(t, ~x) (3.1)
40
and summing over all contributions gives
Z
ψ(t , ~x ) = i d3 xG(t0 , ~x0 ; t, ~x)ψ(t, ~x) .
0 0
(3.2)
G(t0 , ~x0 ; t, ~x) is referred to as the Green function or Propagator, and the knowledge of
G(t0 , ~x0 ; t, ~x) will enable us to construct the wave function at a later time from the wave
function at an earlier time.
We construct the propagator for a particle that interacts with a potential an arbitrary
number of times from the propagator of a freely moving particle. Later we shall construct
the free propagator which we will denote by G0 (t0 , ~x0 ; t, ~x) and we will use at times x and
x0 as shorthand for (t, ~x) and (t0 , ~x0 ).
Suppose that an interaction potential is turned on at time t1 , for a short time interval
∆t1 . For t < t1 , the wave function is the free particle wave function that we denote by
φ(t, ~x). It obeys the equation
Z
φ(t , ~x ) = i d3 xG0 (t0 , ~x0 ; t, ~x)φ(t, ~x) .
0 0
(3.3)
For t > t1 + ∆t1 , the wave function ψ differs from φ by an amount ∆φ induced by the
potential V . ∆φ can be calculated using the Schrödinger equation
∂φ
(a) = H0 φ ,
i
∂t
∂ψ
(b) i = (H0 + V )ψ , (3.4)
∂t
where H0 denotes the free Hamiltonian, and (a) and (b) are the Schrödinger equation
without and with interaction, respectively.
41
For t > t1 + ∆t1 , there is no interaction and so ∆φ evolves like a free particle wave
function, so that
Z
∆φ(t , ~x ) = i d3 x1 G0 (t0 , ~x0 ; t1 , ~x1 )∆φ(t1 , ~x1 )
0 0
Z
→ ∆φ(t , ~x ) = d3 x1 G0 (t0 , ~x0 ; t1 , ~x1 )V (t1 , ~x1 )φ(t1 , ~x1 )∆t1
0 0
(3.7)
The complete wave funtion at t0 , ~x0 for t0 > t1 + ∆t1 is therefore ψ(t0 , ~x0 ) ≡ ψ(x0 ) =
φ(t0 , ~x0 ) + ∆φ(t0 , ~x0 ) and hence
Z
ψ(t , ~x ) = i d3 x1 G0 (x0 ; x)φ(x)
0 0
Z
+ d3 x1 G0 (x0 ; x1 )V (x1 )φ(x1 )∆t1 . (3.8)
R
Also φ(x1 ) = i d3 xG0 (x1 ; x)φ(x), so
Z Z
0 3 0 3 0
ψ(x ) = i d x G0 (x ; x) + d x1 G0 (x ; x1 )V (x1 )G0 (x1 ; x)∆t1 ψ(x) (3.9)
with Z
0 0
G(x ; x) = G0 (x ; x) + d3 x1 G0 (x0 ; x1 )V (x1 )G0 (x1 ; x)∆t1 (3.11)
Continuous Interaction: if the interaction of the particle with the potential is occuring
continuously the Interaction Propagator becomes
Z
G(x ; x) = G0 (x ; x) + d4 x1 G0 (x0 ; x1 )V (x1 )G0 (x1 ; x) + . . .
0 0
(3.12)
R
where we have replaced ∆t1 simply by dt, where the . . . indicateR higher order
R R terms for
multiple interaction that go like V 2 , V 3 , . . . and we have defined d4 x = dt d3 x.
42
3.4 Scattering Amplitudes
We are interested in studying the scattering of a particle off a potential, e.g. an electron
scattering off a Coulomb potential due to a nucleus.
The incoming particle for t → −∞ is a free particle, and the outgoing particle for
t0 → +∞ is a free particle. We want to calculate the Probability Amplitude Sf i for the
transition from the state with the free particle wave function φi (t, ~x) for t → −∞ to the
state with the free particle wave function φf (t0 , ~x0 ) for t0 → +∞. If we work with plane
wave solutions, then for t → −∞,
Sf i is called the scattering amplitude at the time t0 > t. The Interacting particle wave
function ψi which develops from the free particle wave function φi is given by
Z
ψi (x ) = lim i d3 xG(x0 ; x)φi (x) .
0
(3.15)
t→−∞
If we substitute for G(x0 ; x) from eqn. (3.12) we can calculate the Interacting particle
wave function ψi (x0 ) to arbitrary order in V . Then,
Write ψ(x0 ) = i d3 xG(x0 , x)ψ(x) for t0 > t and ψ(x0 ) = 0 for t0 < t to ensure
R
that NO propagation of waves backwards in time occurs which would violate causality.
Equivalently:
G(x0 ; x) = 0 , for t0 < t . (3.17)
43
where θ is the step function, which is defined as
1 , t0 − t > 0
0
θ(t − t) = (3.19)
0 , t0 − t < 0
The step function also has an interesting integral representation
Z +∞
1 dωe−iωτ
θ(τ ) = lim+ − . (3.20)
→0 2πi −∞ (ω + i)
This can be checked by evaluating the corresponding contour integral in the complex
ω plane, where the contour is taken to be a path along the real axis which is closed at
infinity by a half-circle in the lower or upper half-plane. For τ > 0 integrate around a
half-circle in the lower half-plane to ensure exponential damping of the integrand, and
the value of the integral is 1 by Cauchy’s theorem, since the pole at ω = −i is inside
the closed integration contour. For τ < 0 the contour is closed above and the integral
vanishes because the pole lies outside the contour.
44
Eqn. (3.24) is the differential equation which we shall solve for G0 with the bound-
ary condition G0 (x0 ; x) = 0 for t0 < t. This defines the retarded Green’s function or
propagator.
Now we will solve (3.24) for the case of a free particle. Then,
1 ~2
H(~x) = H0 (~x) = − ∇ , (3.26)
2m
and (3.24) becomes
∂ 1 ~ 02
i 0+ ∇ G0 (x0 ; x) = δ 4 (x0 − x) (3.27)
∂t 2m
where ∇~ 0 denotes derivates with respect to (the components of) ~x0 . Because of the homo-
geneity of space-time, G0 can only depend on the combination x0 − x.
0 0
where we used e p·(~x −~x)
∂ i~
∂x0
= ipx ei~p·(~x −~x) , etc.
p~2
ω− G0 (ω, p~) = 1
2m
1
→ G0 (ω, p~) = (3.30)
~2
p
ω − 2m
p
~ 2
except for the singularity at ω = 2m . The correct presription to deal with this singularity
which ensures the retarded boundary condition turns out to be
1
G0 (ω, p~) = , (3.31)
~2
p
ω− 2m
+ i
45
where → 0+ after integration.
Now,
+∞ 0
d3 p i~p·(~x0 −~x) e−iω(t −t)
Z Z
0 dω
G0 (x − x) = lim+ e . (3.32)
→0 (2π)3 −∞ 2π ω − p~2 + i
2m
Using the same contour integrations as for θ(t0 − t) in section 3.5, we find that
+∞ 0
e−iω(t −t)
Z
dω 0 ~2
p
−i 2m (t0 −t)
= −iθ(t − t)e (3.33)
−∞ 2π ω − p~ + i
2
2m
thus
d3 p i~p·(~x0 −~x) −i p~2 (t0 −t) 0
Z
0
G0 (x − x) = −i e e 2m θ(t − t) . (3.34)
(2π)3
We can now check by inserting this expression for G0 (x0 −x) into (3.27) that this is indeed
a solution of (3.27). Because of the θ(t0 − t) factor it also satisfies the correct boundary
condition G0 (x0 − x) = 0 for t0 < t.
We can calculate the interacting propagator from the free propagator using
Z
G(x ; x) = G0 (x ; x) + d4 x1 G0 (x0 ; x1 )V (x1 )G0 (x1 ; x) + order V 2
0 0
(3.35)
Then we can use section 3.4 to calculate the scattering amplitude Sf i from the initial
state i to the final state f
Z
Sf i = d3 x0 φ∗f (x0 )ψi (x0 )
Z Z
3 0
→ Sf i = i 0 lim dx d3 xφ∗f (x0 )G(x0 ; x)φi (x) (3.36)
t →+∞,t→−∞
if we use Normalised wave functions. In the particular case of plane wave functions δf i is
replaced by δ 3 (~pf − p~i ).
46
Thus the complete expression is
Z Z Z
3 0 3
Sf i = δf i + i lim dx dx d4 x1
t0 →+∞,t→−∞
If the interaction potential V is not too large then the first few terms give a good approx-
imation to the Scattering Amplitude.
To generalize the propagator theory to the relativistic case we proceed somewhat intu-
itively. We need to generalize the differential equation for the non-relativistic propagator
G(x0 ; x), which we found in Section 3.5 to be
∂
i 0 − H(x ) G(x0 ; x) = δ 4 (x0 − x) .
0
(3.39)
∂t
/ − m) Ψ = −qγ · AΨ
(i∇
⇒ (i∇
/ − eA
/ − m) Ψ = 0 , (3.41)
this suggests that the Relativistic Propagator, which we will denote by SbF (x0 ; x) should
obey the differential equation
/ 0− eA
(i∇ / 0− m) SbF (x0 ; x) = Iδ 4 (x0 − x) , (3.42)
47
3.9 Free Electron Propagator
The free electron Propagator is denoted by SF (x0 ; x) and obeys the differential equation
/ 0− m) SF (x0 ; x) = Iδ 4 (x0 − x) .
(i∇ (3.44)
d4 p −ip·(x0 −x) 0
Z
e (γ p0 − ~γ · p~ − m)S̃F (p) = δ 4 (x0 − x)I . (3.46)
(2π)4
But on the other hand the Fourier transform of a Dirac δ function is 1, so we can write
d4 p −ip·(x0 −x)
Z
4 0
δ (x − x) = e , (3.47)
(2π)4
and we only have to compare Fourier coefficients. We conclude that
and hence we find for the free electron propagator in momentum space
/ − m)−1 ,
S̃F (p) = (p (3.49)
There are singularities at p2 = m2 because the denominator in S̃F (p) becomes zero
and the propagator develops poles. The singularities are localized at
p20 − p~2 = m2
p
⇒ p0 = ± p~2 + m2 ≡ ±E . (3.51)
This means that the singularities occur when the 4-momentum p is on-shell i.e. it obeys
the relativistic energy-momentum relation p2 = m2 .
48
x to x0 . Positive (negative) energy electrons are represented by wave functions with
positive (negative) frequency time behaviour, namely e−iωt (eiωt ) where ω > 0. Because
of hole theory, electrons are associated with positive energy electrons propagating forward
in time, and positrons are associated with negative energy wave functions propagating
backwards in time. (Recall that a negative energy wave function of 4-momentum −p,
which is propagating backwards in time, represents a positron of momentum +p, which
is therefore propagating forward in time.)
To ensure that an electron (positron) does not spontaneously change into a positron
(electron) we require that a positive (negative) frequency wave propagating from x into
the future (past) possesses ONLY positive (negative) frequency components. Therefore
we have to require that for t0 > t (t0 < t) the propagator SF (x0 − x) contains only positive
(negative) frequency components. These are the Feynman boundary conditions on SF ; it
turns out that in momentum space this translates into (i prescription)
(p
/ + m)
S̃F (p) = , (3.52)
(p2 − m2 + i)
where → 0+ .
Let us check this for t0 > t. Starting point is the Fourier transform of (3.52)
d4 p −ip·(x0 −x)
Z
0 (p
/ + m)
SF (x − x) = 4
e
(2π) (p − m2 + i)
2
Let us focus on the p0 integration. For t0 > t we evaluate the p0 integral by closing the
contour with a semi-circle in the lower half p0 −plane, since
0 0 0
e−ip0 (t −t) = e−i<(p0 )(t −t)+=(p0 )(t −t)
0 0
= e=(p0 )(t −t) e−i<(p0 )(t −t) (3.54)
0
and for t0 > t we have e=(p0 )(t −t) → 0 if =(p0 ) → −∞.
49
contour. Instead of adding the small imaginary parts, we can slightly deform the contour
to include the singularity at p = E and exclude the singularity at p0 = −E.
0
−ip0 (t0 −t) (p/ + m) (p0 γ 0 − p~ · ~γ + m)e−ip0 (t −t)
e = (3.56)
(p2 − m2 ) (p0 + E)(p0 − E)
p
with E = p~2 + m2 .
For t0 < t we have to close the contour in the upper half p0 plane and we find in a
similar fashion that only negative frequency components contribute.
/ 0− m).
which can be checked by acting on both sides with (i∇
50
• zeroth order: Neglecting eA/ we have SbF (x0 ; x) = SF (x0 ; x).
• first order: To first order in eA/, substitute the zeroth order solution for SbF on the
righthand side of (3.62) to obtain
Z
SF (x ; x) = SF (x ; x) + e d4 x1 SF (x0 ; x1 )A
b 0 0
/(x1 )SF (x1 ; x) , (3.63)
where Ψb denotes an Interacting electron, whereas the free wave function at t0 > t for a
positive energy electron is
Z
Ψ(x ) = i d3 xSF (x0 ; x)Ψ(x) .
0
(3.65)
The incoming particle for t → −∞ is a free, positive energy electron and the outgoing
particle for t0 → +∞ is a free electron. The electron has been scattered by the electro-
magnetic field Aµ . We wish to calculate the probability amplitude Sf i for the transition
from the state with the free electron wave function ψi (t, ~x) for t → −∞ to the state with
the electron wavefunction ψf (t0 , ~x0 ) for t0 → +∞.
At time t0 > t, the interacting electron wavefunction ψbi which develops from the free
electron wave function ψi is given by
Z
ψi (x ) = lim i d3 xSbF (x0 ; x)ψi (x) .
b 0
(3.66)
t→−∞
Substituting for SbF from (3.62) we get ψbi (x0 ) to any required order in e. Then
51
Proceeding as before
Z Z
3 0
Sf i = i lim dx d3 xψ f (x0 )SbF (x0 ; x)ψi (x) (3.68)
t0 →+∞,t→−∞
Note, that δf i has to be replaced by a Dirac δ function if we consider plane wave solutions.
If we use normalized wave functions the complete expression (to first order in the
interaction) for Sf i becomes
Z Z Z
3 0
Sf i = δf i + i 0 lim e dx d x d4 x1 ψ f (x0 )SF (x0 ; x1 )A
3
/(x1 )SF (x1 ; x)ψi (x) + . . .
t →+∞,t→−∞
Z
⇒ Sf i = δf i − ie d4 x1 ψ f (x1 )A/(x1 )ψi (x1 ) (3.70)
For a positron scattering off an electromagnetic field, we note that to lowest order in the
interaction the cross section is identical to electron scattering. Here the incoming state is
a negative energy electron in the future propagating backwards in time with wave function
ψi and the outgoing state is a negative energy electron in the past with wave function ψf .
Then,
Sf i = 0 lim hψf (x0 )|ψbi (x0 )i (3.71)
t →−∞
and we get essentially the same result with negative energy wavefunctions. Note that
the momentum of the ”incoming” negative energy eletron is related to the momentum
− +
of the corresponding outgoing positron via pei = −pef , and that the momentum of the
”outgoing” negative energy eletron is related to the momentum of the corresponding
− +
outgoing positron via pef = −pei .
4 Applications
This is the Rutherford scattering problem. An electron e− scatters off the Coulomb
potential due to a nucleus. We calculate the scattering amplitude Sf i to first order in e
52
(i.e. to first order in the interaction) then, if the final state is different from the initial
state (i 6= f ) Z
Sf i = −ie d4 xψ f (x)A/(x)ψi (x) (4.1)
If we choose free particle wave functions ψi and ψf normalised in a box of volume V , then
1
ψi (x) = √ e−ipi ·x U (pi , si ) and
2Ei V
1
ψf (x) = p e−ipf ·x U (pf , sf ) . (4.2)
2Ef V
ψ = N e−ip·x U (p, s)
⇒ ρ = ψ † ψ = |N |2 e−ip·x e+ip·x U † (p, s)U (p, s) = |N |2 U † (p, s)U (p, s) (4.3)
Then
ie2 Z 1 d4 x i(pf −pi )·x
Z
1
Sf i = p U (pf , sf )γ 0 U (pi , si ) e , (4.7)
4π 2V Ei Ef |~x|
| {z }
=I
53
where ~q = p~f − p~i .
Write
Z +∞
2πδ(Ef − Ei ) = dtei(Ef −Ei )t
−∞
Z +T /2
= lim dtei(Ef −Ei )t
T →∞ −T /2
1 i(E −Ei )t T /2
= lim e f −T /2
T →∞ i(Ef − Ei )
2 sin((Ef − Ei )T /2)
= lim , (4.11)
T →∞ (Ef − Ei )
thus
24 sin2 ((Ef − Ei )T /2)
[2πδ(Ef − Ei )] = lim . (4.12)
T →∞ (Ef − Ei )2
However, it is known that for α < Ei < β
Z β
sin2 ((Ef − Ei )T /2)
dEf = 2πT , (4.13)
α (Ef − Ei )2
for large T . Hence, we can interprete the RHS of (4.12) as 2πT δ(Ef − Ei ) and we can
take the intepretation
[2πδ(Ef − Ei )]2 = 2πT δ(Ef − Ei ) . (4.14)
The difficulty we encountered can be avoided by using wave packets instead of plane
waves.
With this intepretation of the square of the Dirac δ-function, the transition probability
per unit time becomes
54
We are interested in the transition probability with a chosen range of final momenta,
V 3
p~f to p~f + d~pf containing (2π) 3 d pf states. This is
V
d3 pf |Sf i |2 . (4.16)
(2π)3
V 2 |Sf i |2
p f dp f dΩ (4.18)
(2π)3 T
The differential cross section dσ(θ, φ) for scattering into the final solid angle dΩ is
defined to be
Probability per unit time of a transition into dΩ
dσ(θ, φ) = (4.19)
Flux of incident particles
where (Flux of incident particles) = (Incident probability per unit area per unit time).
Thus,
|~pi |
Flux = (ψi† ψi ) × (4.20)
| {z } Ei
P robability density |{z}
=|~vi |
|~pi |
Flux = . (4.22)
V Ei
55
Because Ef2 = p2f + m2 we have
dσ Z 2 α2
= |U (pf , sf )γ 0 U (pi , si )|2 , (4.26)
dΩ |~q|4
If we do not observe the spin state of the final electrons, then we should sum over final
dσ
spin states (polarisations) in dΩ . If in the incident beam of electrons each spin is equally
likely (unpolarised beam), then we should also average over the initial spin states. In that
case,
dσ Z 2 α2 X
= |U (pf , sf )γ 0 U (pi , si )|2 , (4.27)
dΩ 2|~q|4 s ,s
f i
all evaluated at Ef = Ei and |~pf | = |~pi |. Note that we multiplied with 1/2 for the average
over the two initial spin states.
Such spin summations are evaluated by reducing the problem to evaluating the trace
of a matrix and occur often in scattering problems, so we will digress a little into this
subject.
|U (f )ΓU (i)|2 = U (f )ΓU (i)(U (f )ΓU (i))∗ = U (f )ΓU (i)U (i)ΓU (f ) . (4.29)
56
Using the properties of γ-matrices we can show that
γµ = γµ
γ 5 = −γ 5
γ µ γ 5 = γ µ γ 5 = −γ 5 γ µ (4.30)
and
a//b/c . . . /q = /q . . . /c/ba
/. (4.31)
The spin sums are reduced to matrix traces as follows. We use the following property of
Dirac spinors which can be derived from their explicit representation
X
Uα (p, s)U β (p, s) = (p
/ + mI)αβ . (4.32)
s
X
2
P
sf ,si |U (f )ΓU (i)| = U (f )ΓU (i)U (i)ΓU (f )
sf ,si
X
= U (f )α Γαβ U (i)β U (i)γ Γγδ U (f )δ
sf ,si
X
= U (f )δ U (f )α Γαβ U (i)β U (i)γ Γγδ
sf ,si
= (p
/f + mI)δα Γαβ (p
/i + mI)βγ Γγδ
X
= (p
/f + mI)Γ(p /i + mI)Γ δδ (4.33)
δ
X
|U (f )ΓU (i)|2 = T r (p
⇒ /f + mI)Γ(p
/i + mI)Γ (4.34)
sf ,si
Therefore, the problem of evaluating the spin sums reduces to evaluating the trace of a
matrix.
Proof:
T r(a/1 . . . a/n ) = T r(a
/1 . . . a
/n γ5 γ5 )
= T r(γ5 a
/1 . . . a
/n γ5 )
n
= (−1) T r(a /1 . . . a
/n γ5 γ5 )
n
= (−1) T r(a /1 . . . a
/n ) , (4.35)
where we have used γ5 γ5 = I in the first line, the cyclic property of any matrix trace in
the second line. The identity γ5 γ µ = −γ µ γ5 was used n times in the third line to bring
57
γ5 from the left in the matrix trace to the right inside the matrix trace and in line four
we used again γ5 γ5 = I.
Theorem 2: T r(a//b) = 4a · b
Proof:
T r(a//b) = T r(b/a
/)
1
= (T r(a//b) + T r(b /a
/))
2
1
= T r(a//b + /ba
/)
2
1
= T r(aµ bν (γ µ γ ν + γ ν γ µ ))
2
1
= T r(aµ bν 2g µν I)
2
= a · b T rI
= 4a · b Q.E.D. (4.37)
Proof:
T r(a /) = −T r(b/a//cd
//b/cd /) + 2(a · b)T r(Ic
/d/)
= T r(b//ca/d/) − 2(a · c)T r(b /) + 2(a · b)T r(c
/Id /d
/)
= −T r(b//cd/a/) + 2(a · d)T r(b//cI) − 2(a · c)T r(b/) + 2(a · b)T r(c
/d /d
/) , (4.39)
from which follows
T r(a
//b/cd
/) + T r(b//cd/a/) = 2T r(a//b/cd
/)
= 2(a · d)T r(b //c) − 2(a · c)T r(b/) + 2(a · b)T r(c
/d /d
/) . (4.40)
58
Using Theorem 2 we find
Return to the spin avaraged differential cross-section for scattering an electron off a fixed
Coulomb potential — the Mott Cross-Section.
then introduce the 4-vector e ≡ (1, 0, 0, 0), so that /e = γ 0 . Hence, we can write
Therefore,
X
|U (pf , sf )γ 0 U (pi , si )|2 = 4[m2 + 2Ef Ei − pf · pi ] , (4.44)
si ,sf
59
X
⇒ |U (pf , sf )γ 0 U (pi , si )|2 = 8E 2 [1 − β 2 sin2 (θ/2)] , (4.46)
si ,sf
|~
p|
with β = E
.
hence,
|~q|4 = 16|~p|4 sin4 (θ/2) . (4.48)
dσ Z 2 α2 X
= 4
|U (pf , sf )γ 0 U (pi , si )|2
dΩ 2|~q| s ,s
i f
Z α2 2
= 8E 2 [1 − β 2 sin2 (θ/2)]
32|~p|4 sin4 (θ/2)
Z 2 α2
= 2 2 4 (1 − β 2 sin2 (θ/2)) , (4.49)
4|~p| β sin (θ/2)
|~
p|
where β = E
. This is the Mott Cross-Section.
dσ Z 2 α2
= (4.50)
dΩ 4m2 |~v |4 sin4 (θ/2)
Instead of considering the scattering in a fixed potential, as in the last section, we consider
the scattering of an electron in the EM field generated by a free Dirac proton. We visualise
this process in terms of the electron e− and the proton p exchanging a photon which may
be virtual, i.e. q 2 6= 0
60
The Scattering Amplitude to first order in the charge e is given by
Z
Sf i = −ie d4 xψ f (x)A /(x)ψi (x) (4.51)
where ψi and ψf are free particle wave functions for the initial and final electron and the
EM 4-vector potential Aµ is produced by the EM 4-vector current J µ due to the proton
p. Note that J µ has components (ρ, ~j) where ρ is the charge density and ~j is the current
density vector. We need to calculate Aµ in terms of J µ , which is done using the a photon
propagator (Green’s function).
∂ν ∂ ν Aµ (x) = 0 , (4.53)
∂ν ∂ ν DF (x − y) = δ 4 (x − y) , (4.54)
where the derivatives are taken w.r.t. x. The corresponding momentum space propagator
D̃F (q) is defined by
d4 q −iq·(x−y)
Z
DF (x − y) = e D̃F (q) . (4.55)
(2π)4
Therefore,
d4 q −iq·(x−y)
Z
ν
∂ν ∂ DF (x − y) = e (−(q 0 )2 + ~q2 )D̃F (q) = δ 4 (x − y) (4.56)
(2π)4
and, since the Fourier transform of the δ function is 1, −q 2 D̃F (q) = 1. Thus,
−1
D̃F (q) = (4.57)
q2
for q 2 6= 0, and in analogy with the e− propagator, we avoid the poles at q 2 = 0 by writing
−1
D̃F (q) = (4.58)
q2
+ i
with → 0+ . Thus the photon propagator is
d4 q −iq·(x−y) −1
Z
DF (x − y) = e , (4.59)
(2π)4 q 2 + i
and, hence, Z
µ
A (x) = d4 yDF (x − y)J µ (y) (4.60)
61
obeys
∂ν ∂ ν Aµ (x) = J µ (x) . (4.61)
as Z Z
4
Sf i = −ie dx d4 yψ f (x)γµ ψi (x)DF (x − y)J µ (y) . (4.63)
Recall that the probability density of a Dirac particle is ψ † ψ = ψγ 0 ψ and the probability
current is ψ † α
~ ψ = ψγ 0 α
~ ψ = ψ~γ ψ. For a particle of charge q, this will give the charge
density ρ = qψγ 0 ψ and the current density vector ~j = qψ~γ ψ. Thus the 4-vector current is
J µ = qψγ µ ψ. For a proton with initial wavefunction ψip (x), final wave function ψfp (x) and
charge +e, the generalization of J µ is the so called Electromagnetic Transition Current
p
J µ = eψ f (x)γ µ ψip (x) , (4.64)
so that
Z Z
p
4
d4 y − eψ f (x)γµ ψi (x) DF (x − y) + eψ f (x)γ µ ψip (x) .
Sf i = i dx (4.65)
The S-matrix element couples the EM transition current for the electron to the EM
transition current for the proton through the photon propagator.
Normalising in a box of volume V , as in the last section, the free particle wave functions
for the incoming and outgoing electrons and protons are:
1
ψi (x) = √ e−ipi ·x U (pi , si )
2Ei V
1
ψf (x) = p e−ipf ·x U (pf , sf )
2Ef V
1
ψip (y) = √ e−iPi ·y U (Pi , Si )
2Ei V
1
ψfp (y) = p e−iPf ·y U (Pf , Sf ) (4.66)
2Ef V
where Ei and Ef are the proton energies. Therefore,
1 1 1 1
Sf i = −ie2 √ p √ p U (pf , sf )γµ U (pi , si )U (Pf , Sf )γ µ U (Pi , Si )
2Ei V 2Ef V 2Ei V 2Ef V
Z Z
× d x d4 y ei(pf −pi )·x ei(Pf −Pi )·y DF (x − y) .
4
(4.67)
62
The integral in (4.67) is
d4 q −iq·(x−y) −1
Z Z Z
4 4 i(pf −pi )·x i(Pf −Pi )·y
I = d x d ye e e
(2π)4 q 2 + i
d4 q −1
Z Z
= (2π)4 δ 4 (q − pf + pi ) d4 yei(q+Pf −Pi )·y
(2π)4 q 2 + i
−1
Z
= 2 d4 yei(pf −pi +Pf −Pi )·y
q + i
−1
= 2 (2π)4 δ 4 (pf − pi + Pf − Pi ) (4.68)
q + i
where q is set to q = pf − pi due to the δ function in the second line!
Thus,
1 1 1 1
Sf i = √ p √ p (2π)4 δ 4 (pf − pi + Pf − Pi )Mf i (4.69)
2Ei V 2Ef V 2Ei V 2Ef V
where we have introduced the important quantity Mf i , which is also called Invariant
Amplitude or Invariant Matrix Element, and
ie2
Mf i = − U (pf , sf )γµ U (pi , si )U (Pf , Sf )γ µ U (Pi , Si ) (4.70)
q2
with q = pf − pi . The Transition Amplitude from state i to state f is given by
|Mf i |2 2
|Sf i |2 = 4 4
(2π) δ (p f − p i + P f − P i ) . (4.71)
16Ei Ef Ei Ef V 4
Using the interpretation of the square of the δ function as in Section 4.1 we obtain for
large times T
2 |Mf i |2
|Sf i | = 4
(2π)4 δ 4 (pf − pi + Pf − Pi )V T , (4.72)
16Ei Ef Ei Ef V
where we got an additional factor of the volume V compared to Section 4.1 because of
the square of three δ functions for the three spatial momenta!
As in Section 4.1, we assume that the incident beams of electrons and protons are un-
polarised with each spin state equally likely. Hence, we have to average over initial spin
states. We also assume that we do not observe the final spin states of the electrons and
63
protons. Therefore, we also have to sum over the final spin states. Thus, in order to
evaluate the cross section we need to evaluate
1 X
|Mf i |2 ≡ |M f i |2 , (4.73)
4 s ,s ,S ,S
f i f i
where the prefactor 1/4 comes from averaging over the initial electron and proton spin
states. In particular we need to calculate
X
|U (pf , sf )γµ U (pi , si )U (Pf , Sf )γ µ U (Pi , Si )|2
sf ,si ,Sf ,Si
X
U (pf , sf )γµ U (pi , si )U (Pf , Sf )γ µ U (Pi , Si )
=
sf ,si ,Sf ,Si
∗ ∗
× U (pf , sf )γν U (pi , si ) U (Pf , Sf )γ ν U (Pi , Si )
X
= U (pf , sf )γµ U (pi , si )U (pi , si ) γ ν U (pf , sf )
|{z}
sf ,si =γν
X
× U (Pf , Sf )γ µ U (Pi , Pi )U (Pi , Si ) γ ν U (Pf , Sf )
|{z}
Sf ,Si =γ ν
64
dσ
4.6 dΩ for Electron-Proton Scattering
The Transistion Amplitude per unit time per unit volume into the range of finite momenta
p~f to p~f + d~pf and P~f to P~f + dP~f is
|Sf i |2 V 3 V 3 |Sf i |2 V
d p f d P f = d3 pf d3 Pf
V T (2π)3 (2π)3 T (2π)6
|M f i |2 4 4 V
= 4
(2π) V T δ (p f + P f − p i − P i ) 6
d3 pf d3 Pf
16Ei Ef Ei Ef V T (2π)
2
|M f i |
= δ 4 (pf + Pf − pi − Pi )d3 pf d3 Pf . (4.78)
16V Ei Ef Ei Ef (2π)2
2
dσe
To obtain the differential cross section for the scattered electron dΩe
we need:
dσe = (4.79)
−
Probability of e scattering into solid angle dΩe per unit time
(Nr. of Target Particles)(Nr. of beam particles incident per unit area per unit time)
In our normalisation
1
(Nr. of Target Particles per unit volume) = , (4.80)
V
also for collinear beams
1 e
~vi − ~viP
(Nr. of beam particles incident per unit area per unit time) =
V
1 |~p | |P~ |
i i
= + (4.81)
V Ei Ei
dσe
To obtain dΩe
we must integrate d3 Pf and d|~pf | but NOT dΩe . Now,
R
The d3 Pf integration is trivial because of the δ functions in (4.78), i.e. we simply
have to evaluate everything at P~f = P~i + p~i − p~f . We find
65
evaluated at P~f = P~i + p~i − p~f . Therefore,
dσe |M f i |2 |~pf |
= (4.85)
dΩe 16Ei Ei Ef (2π) |~vi − ~viP |
2 e
evaluated at Pf = Pi + pi − pf .
For a target proton initially at rest pi = (Ei , p~i ), pf = (Ef , p~f ), Pi = (Ei = M, P~i = ~0).
Then,
~vi − ~viP = p~i
e
(4.86)
Ei
and
dσe |M f i |2 |~pf |
= (4.87)
dΩe 16(2π)2 Ei Ef |~pi |
In this case the proton recoil is tiny and Ef ∼ Ei = M , Ef ∼ Ei , |~pf | ∼ |~pi | and
P~f ∼ ~0. This is like scattering off a static nucleus with Z = 1 and we should recover the
Mott cross section formula.
dσe |M f i |2
= (4.88)
dΩe 16(2π)2 M 2
Also, with |M f i |2 from Section 4.6, we obtain
8e4
|M f i | ∼ 2 2 M 2 2Ei2 + m2 − pf · pi
2
(4.89)
(q )
with, q = pf − pi ∼ (0, p~f − p~i ) = (0, ~q). Using |~q|4 = 16|~pi |4 sin4 (θ/2) from Section 4.3
and after further simplifications we recover the Mott cross section (with Z = 1)
dσe α2
1 − β 2 sin2 (θ/2)
= 2 2 4 (4.90)
dΩe 4|~pi | β sin (θ/2)
66
4.7 Feynman Rules
More generally, the Invarian Amplitude Mf i derived from a Feynman diagram is given by
the following Rules:
(a) Associate with each incoming/outgoing external electron (or spin- 12 particle) line of
4-momentum pi /pf a factor U (pi , si )/U (pf , sf ).
(b) Associate with each incoming/outgoing negative energy electron line a factor V (pi , si )/
V (pf , sf ) where −pi / −pf are the 4-momenta of the negative energy electrons, i.e.
pi /pf are the 4-momenta of the positive energy positrons.
(c) Associate with each internal photon line of 4-momentum q a factor −i gqµν2 .
(d) Associate with each interaction vertex a factor −ieγ µ for a particle (or negative
energy particle) of charge e.
Example 1: For electron-positron scattering there are two Feynman diagrams contributing
to Mf i
Then,
(2π)4 δ 4 (p3 + p4 − p1 − p2 )
−i
Sf i = √ √ √ √ U (p3 )(−ieγµ )U (p1 )V (p2 )(−ieγ µ )V (p4 )
2E1 V 2E2 V 2E3 V 2E4 V (p3 − p1 )2
−i µ
− V (p2 )(−ieγµ )U (p1 )U (p3 )(−ieγ )V (p4 ) (4.92)
(p1 + p2 )2
Note that the relative minus sign between the first and second term has been introduced
to ensure ANTI-SYMMETRY under the interchange of the two incoming electrons (one
of which has negative energy) and under the interchange of the two outgoing electrons.
Note that this anti-symmetrisation is only necessary if identical particles, in this case
electrons, are considered:
Example 2: Electron scattering off a Dirac proton. There is only one diagram to consider
(photon exchange between the two particles) because the two particles are not identical. It
67
is left as an exercise to check that the above stated Feynman reproduce the same invariant
matrix element Mf i and, hence, the same scattering amplitude as derived in Section 4.4.
(2π)4 δ 4 (p3 + p4 − p1 − p2 )
−i
Sf i = √ √ √ √ 2
U (p3 )(−ieγµ )U (p1 )U (p4 )(−ieγ µ )U (p2 )
2E1 V 2E2 V 2E3 V 2E4 V (p3 − p1 )
−i µ
− U (p4 )(−ieγµ )U (p1 )U (p3 )(−ieγ )U (p2 ) (4.93)
(p1 − p4 )2
5.1 Background
The first example of a classical field theory was electromagnetism as described by Maxwell
The first Quantum Field Theory (QFT) was the theory of the electromagnetic field Aµ (x).
Quantisation of the electromagnetic field by imposing commutation relations led to a par-
ticle interpretation in terms of photons. The quantised electromagnetic field Aµ becomes
an operator that can create and annihilate photons. In simple terms, the quantised field
can be viewed as an infinite collection of harmonic oscillators.
To begin with let us consider the simpler case of the Klein-Gordon field φ(x). The
main novelty here is that φ is treated as an operator rather than a wave function as we
did in RQM.
68
5.2 Field Equations
To a field φ(x) we assign a Lagrangian density L(φ, ∂µ φ), and the corresponding La-
grangian is Z
L = d3 xL(φ, ∂µ φ) . (5.2)
where we dropped a boundary term coming from partial integration due to the boundary
condition on δφ. Since this must be true for arbitrary variation δφ we arrive at the field
equations
∂L ∂L
= ∂µ . (5.5)
∂φ ∂(∂µ φ)
Consider a Hermitian field operator φ(x) = φ† (x) which satisfies the Klein-Gordon equa-
tion
(∂µ ∂ µ + m2 )φ = 0 . (5.6)
Note that φ is an operator and NOT a wave function (which would not make much sense
since φ is Hermitian i.e. real now). The Klein-Gordon equation is the field equation
derived from the Lagrangian density
1
∂µ φ∂ µ φ − m2 φ2 .
L= (5.7)
2
69
Any solution of the KG equation can be decomposed in terms of positive and negative
energy plane wave solutions (complete set of eigenfunctions)
Z
(+) (−)
φ(x) = d3 k a+ (k)fk (x) + a− (k)fk (x) (5.8)
p
where fk± (x) = √ 1
e∓ik·x with Ek = + ~k 2 + m2 . If φ is Hermitian, φ† = φ, then
(2π)3 2Ek
a− (k) = a†+ (k). Write a+ (k) ≡ a(k), a− (k) ≡ a† (k). Hence,
Z
3 ~ (+) † ~ (−)
φ(x) = d k a(k)fk (x) + a (k)fk (x) . (5.9)
←→
where A ∂0 B = A∂0 B − (∂0 A)B. With the help of these orthonormality conditions we
can calculate a and a† in terms of φ.
←→
Z
(+)∗
d3 xfk0 (x) ∂0 φ(x)
←
→ (+)
Z Z
(+)∗
= d k a(k) d3 xfk0 (x) ∂0 fk (x)
3
←→ (−)
Z
† 3 (+)∗
+a (k) d xfk0 (x) ∂0 fk (x)
Z
= d3 k a(k)(−iδ 3 (k~0 − ~k)) = −ia(k 0 ) . (5.11)
Hence,
←→
Z
(+)∗
a(k) = i d3 xfk (x) ∂0 φ(x) (5.12)
70
In analogy to the QM commutation relation [p, x] = −i~ we impose the canonical com-
mutation relations
h i
0 3 0 0
[Π(t, ~x), φ(t, ~x )] = −iδ (~x − ~x ) = φ̇(t, ~x), φ(t, ~x )
[φ(t, ~x), φ(t, ~x0 )] = [Π(t, ~x), Π(t, ~x0 )] = 0 . (5.15)
The corresponding commutation relations for a(k) and a† (k) can now be derived
←→ ←
→
Z Z h i
† 0
3 3 (+)∗ (−)∗
a(k), a (k ) = i(−i) d x d y fk (x) ∂0 φ(x), fk0 (y) ∂0 φ(y)
Z Z
3 3 (+)∗ ∂ ∂ (+)∗ (−)∗ ∂ ∂ (−)∗
= d x d y fk (x) φ(x) − f (x)φ(x), fk0 (y) φ(y) − f 0 (y)φ(y)
∂x0 ∂x0 k ∂y0 ∂y0 k
Z Z
(+)∗ ∂ (−)∗ ∂ (+)∗ (−)∗
= d3 x d3 y −fk (x) [φ̇(x), φ(y)] fk0 (y) − f (x) [φ(x), φ̇(y)] fk0 (y)
| {z } ∂y0 ∂x0 k | {z }
=−iδ 3 (~
x−~
y) =+iδ 3 (~
y −~
x)
Z Z
∂ (−)∗
(+)∗ ∂ (+)∗ (−)∗
=i d3 xfk f 0 (x) −
(x) 3
dx f (x)fk0 (x)
∂x0 k ∂x0 k
←→ (−)∗
Z
(+)∗
= i d3 xfk (x) ∂0 fk0 (x)
= δ 3 (~k − ~k 0 ) , (5.16)
hence,
a(k), a† (k 0 ) = δ 3 (~k − ~k 0 ) .
(5.17)
Similarly,
[a(k), a(k 0 )] = a† (k), a† (k 0 ) = 0 .
(5.18)
Now write the Hamiltonian in terms of a and a† . Begin with the Lagrangian density
1 2 ~ 2 − m2 φ2 .
L= (φ̇) − (∇φ) (5.19)
2
Compare this with the Lagrangian L = T − V and Hamiltonian H = T + V in classical
mechanics. Therefore the Hamiltonian density is
1 2
~ 2 + m2 φ2 .
H= (φ̇) + (∇φ) (5.20)
2
71
R
and the Hamiltonian is H = d3 xH. Now,
Z
(+) (−)
φ̇ = −i d3 kk0 fk (x)a(k) − fk (x)a† (k)
Z
(+) (−)
∇φ = i d3 k~k fk (x)a(k) − fk (x)a† (k)
~
Z
1 3 1
2 ~ 2 2 2
H = d x (φ̇) + (∇φ) + m φ
2 2
Z Z Z
(+) (+)
= 3
dk dk 3 0
d3 x(−k0 k00 − ~k · ~k 0 ) × fk fk0 a(k)a(k 0 )
(−) (−) (+) (−) (−) (+)
+fk fk0 a† (k)a† (k 0 ) − fk fk0 a(k)a† (k 0 ) − fk fk0 a† (k)a(k 0 )
Z Z Z
3 0 (+) (+)
3
+ dk dk d3 xm2 × fk fk0 a(k)a(k 0 )
(−) (−) (+) (−) (−) (+)
+fk fk0 a† (k)a† (k 0 ) + fk fk0 a(k)a† (k 0 ) + fk fk0 a† (k)a(k 0 ) (5.21)
72
The first term acting on the vacuum gives zero, but the second term yields an infinite
vacuum energy. Since we are only interested in differences between energies, we subtract
off the infinite vacuum energy.
Thus, we replace the Hamiltonian H by its normal ordered counterpart H̃, which is
defined as Z
H̃ = d3 kEk a†k a(k) , (5.26)
h0|H̃|0i = 0 , (5.27)
The state obtained by m applications of a† (k) on the vacuum state has energy
Em = mEk . (5.28)
5.4 Interactions
Also the Dirac field ψ can be quantised, now using anti-commutation relations instead
of canonical commutation relations. The free Lagrangian density for Dirac particles and
photons turns out to be
1
Lf ree = ψ(iγ µ ∂µ − m)ψ − Fµν F µν . (5.29)
4
Introducing the electromagnetic interaction by the usual ”minimal substitution”
∂µ → ∂µ + ieAµ (5.30)
we get
1
L = Lf ree + LI = ψ(iγ µ ∂µ − eγ µ Aµ − m)ψ − Fµν F µν , (5.31)
4
where the interaction Lagrangian is given by
LI = eψγ µ Aµ ψ . (5.32)
The detailed development of QFT leads inR the end to the same Feynman rules as for
RQM. Notice the resemblance to Sf i = −ie d4 xψ f (x)γ µ Aµ (x)ψi (x).
73
References
[1] J. Bjorken and S. Drell, ”Relativistic Quantum Mechanics” and ”Relativistic Quan-
tum Fields”, McGraw-Hill.
[2] M.E. Peskin and D.V. Schroeder, ”An Introduction to Quantum Field Theory”,
Addison-Wesley.
[5] I.J. Aitchison and A.J. Hey, ”Gauge Theories in Particle Physics”, Adam Hilger.
74