Professional Documents
Culture Documents
Fall 2010
Matthew Schwartz
Lecture 10:
Spinors and the Dirac Equation
1 Introduction
1
From non-relativistic quantum mechanics, we already know that the electron has spin 2
. We
usually write it as a doublet
↑
|ψ i = (1)
↓
You probably learned that its dynamics in the non-relativistic limit are governed by the
Schrodinger-Pauli equation
K · LK ) 1 0 − 2µB
2
p Bz Bx − iB y
i∂t |ψ i = H |ψ i = ( + V (r) − µBB |ψ i (2)
2m 0 1 Bx + iB y − Bz
e
where µB = 2m is the “Bohr magneton” (the size of the electron’s orbital magnetic moment) and
K = Kx × Kp is thee angular momentum operator.
L
You may also have learned of a shorthand notation for this involving the Pauli matrices
0 1 0 −i 1 0
σ1 = , σ2 = , σ3 = (3)
1 0 i 0 0 −1
K · LK 12×2 − 2µBBK · σK ψx
2
p
i∂tψ = + V (r) − µBB (4)
2m
This equation is written with the Pauli matrices combined into Kσ = (σ1, σ2, σ3) to call atten-
tion to the fact that these σi′s transform as the components of a 3-vector, just like the magnetic
K · BK )ψ is rotationally invariant. This is non-trivial, and only works because
field Bi. Thus (σ
which are the same algebraic relations satisfied by infinitesimal rotations (we will review this
shortly). Another useful fact is
{σi , σ j } ≡ σiσ j + σ jσi = 2δi j (6)
Keep in mind that σi do not change under rotations – they are always given by Eq. (3) in any
frame. ψ is changing and Bi is changing, and these changes cancel when we write (σK · BK )ψ.
We could also have written down a rotationally invariant equation of motion for ψ
1 · ∂tψ − ∂iσiψ = 0 (7)
Since ∂i transforms like a 3-vector and so does σiψ, this equation is rotationally invariant. It
turns out it is Lorentz invariant too. In fact, this is just the Dirac equation!
If we write
σ µ = (12×2, σ1, σ2, σ3) (8)
Then it is
σ µ∂ µψ = 0 (9)
1
2 Section 2
which is nice and simple looking. Actually, this is the Dirac equation for a Weyl spinor, which is
not exactly the same as the equation commonly called the Dirac equation.
By the way, it does not follow that this equation is Lorentz invariant just because we’ve
written it as σ µ∂ µ. For example,
(σ µ∂ µ + m)ψ = 0 (10)
We have already argued that particles in our world should transform under unitary representa-
tions of the Poincare group. These are characterized by mass m and spin j. We motivated this
in our study of the spin 1 fields. Mass is Lorentz invariant, so it is an obvious quantum number.
Momentum is also conserved, but it is Lorentz covariant. If choose a frame in which the
momentum has some canonical form, for example p µ = (m, 0, 0, 0) for m > 0, then the particles
are characterized by the group that holds this momentum fixed. That is the little group, in this
case the group of 3D rotations, SO(3). The group of 3D rotations provides the second quantum
number, j. The way the polarizations transform under the full Lorentz group is then induced by
the transformations under SO(3) and the way the momentum transforms under boosts.
We also saw that to do field theory, we have to write down Largrangians involving fields.
These are things like V µ or φ or T µν which don’t have to involve particles per se, although obvi-
ously in a physical theory they will. As we saw for spin 1, there’s a lot a trouble that comes
from having to embed particles of fixed mass and spin into fields like V µ. For example, V µ has 4
degrees of freedom which describes spin 0 and spin 1, so the Lagrangian has to be carefully
chosen to make sure the physical theory never excites the spin 0 component. In addition, when
we want the particle to have m = 0 and spin 1, we needed to go further and make sure the longi-
tudinal polarization is never produced. This led directly to charge conservation. The next log-
ical step to make these embeddings a bit more systematic is to see what kinds of Lorentz-
1
invariant fields we can write down at all. This will reveal the existence of the spin 2 states, and
help us characterize their embeddings into fields.
A group is a set of elements {gi } and a rule gi ⊗ gj = gk which tells how each pair of ele-
ments is multiplied to get a third. The rule defines the group, independent of any particular way
to write the group elements down as matrices. More precisely, the mathematical definition
requires the group elements to have inverses, there to be an identity element for which 1 ⊗ gi =
gi, and for the rule to be associative (gi ⊗ gj ) ⊗ gk = gi ⊗ (gj ⊗ gk). A representation is a par-
ticular embedding of these gi′s into matrices. Often we talk about the vectors on which the
matrices act as being the representation, but technically the matrix embedding is the represen-
tation. Any group has the trivial representations r: gi → 1. But more generally we care about
embeddings which are faithful, for which each element gets its own matrix. Recall that the
Lorentz group is the set of rotations and boosts which preserve the Minkoswki metric
ΛTηΛ = η (11)
V µ → ΛµνVν (12)
Where Λ is a combination of rotations and boosts. We saw that we could write a Lorentz trans-
formation as the product of 3 rotations and 3 boosts:
Representations of the Lorentz Group 3
1 0 0 0 1 1
0 cos θz − sin θz cos θ y − sin θ y
1
Λ =
0 sin θz cosθz 0
(13)
1 cosθx − sin θx
0 0 0 1 sin θ y cosθ y sin θx cos θx
coshβx sinhβx coshβ y sinhβ y coshβz sinhβz
sinhβx coshβx 1 1
× (14)
1 sinhβ y coshβ y 1
1 1 sinhβz coshβz
This is a particular representation of the Lorentz group. That is, it is one embedding of the
group into a set of matrices.
The group itself is a mathematical object independent of any particular representation. To
extract the group away from its representations, it is easiest to look at infinitesimal transforma-
tions. For vectors, the infinitesimal transformations ∆V µ = Λ µνVν − Vν are
∆V0 = βiVi (15)
∆Vi = βiV0 − εij kθ jVk (16)
For any representation, we can always write these group elements as exponentials of the
infinitesimal generators
Λ = exp(iθiJi + iβiKi) = 1 + i[θiJi + βiKi] + (17)
More generally, you could write any element g of any group G as g = exp(i cigλi)
where cig
are
numbers and λi are group generators. They are called generators because by multiplying by
numbers and exponentiating, you can get any element of the group.
The generators are in an algebra, because you can add and multiply them, while the group
elements are in a group because you can only multiply them. For example, the real numbers
form an algebra (there is a rule for addition and a rule for multiplication) but rotations are a
group (there is only one rule, multiplication). Groups that have an infinite number of elements
but a finite number of generators and do not have any weird singular points are called Lie
groups. Lie groups are critical to understanding the standard model, since the weak force is
described by the special unitary group SU(2) and the strong force by the group SU(3). For
QED, the only group we need is the Lorentz group, SO(3, 1). This is a special (determinant 1)
orthogonal (preserves a metric) group corresponding to a metric with (3, 1) signature (i.e. η µν =
diag(1,-1,-1,-1)).
For G = SO(3), there are 6 generators: 3 rotations, λi = Ji, and 3 boosts λi+3 = Ki. For the
vector representation (above), the generators are
0 0 0
0 −1 0 −1 0
J1 = V12 = i , J2 = V13 = i , J3 = V23 = i
1 0 0 0 −1
0 1 0 1 0
0 1 0 1 0 1
1 0 0 0
K1 = V01 = i , K2 = V02 = i , K3 = V03 = i (18)
0 1 0 0
0 0 1 0
We can check that the generators in this representation satisfy some relations
[V01 , V12 ] = iV02 , [V12 , V23 ] = iV13 , (19)
which can be combined into the concise form
[V µν , V ρσ ] = i(ηνρV µσ − η µρVνσ − ηνσV µρ + η µσVνρ) (20)
The generators in any other representation must satisfy these same relations. In fact, you can
even define the the Lorentz group is the set of transformations generated by these generators.
4 Section 3
3 General representations
There is a very nice way to study the representations of the Lorentz group. Start with the rota-
tion generators Ji and the boost generators K j . For example, you could take J1 = V23 , K1 = V01 ,
etc. as a particular representation of this basis. These satisfy
[Ji , J j ] = iǫijkJk (22)
[Ji , K j ] = iǫijkKk (23)
[Ki , K j ] = − iǫijkJk (24)
where ǫijk is defined by ε123 = 1 and the rule that the sign flips when you swap any two indices.
For example, ε213 = − 1, ε231 = 1, etc.. It is called the totally antisymmetric tensor and
comes up a lot in field theory. As is probably clear to you already, [Ji , J j ] = iǫi jkJk is the
algebra for rotations, so(3) so the Ji generate the subgroup of rotations in 3D.
Now take the linear combinations
1
Ji+ = (Ji + iKi) (25)
2
1
Ji− = (Ji − iKi) (26)
2
which satisfy
[Ji+, J j+] = iǫijkJk+ (27)
The regular rotation generators are JK = JK + JK , where I now use the vector superscript to
+ −
call attention the fact that the spins must be added vectorially (remember Clebsch-Gordon coef-
ficients?). Since the 3D rotations, SO(3) is a subgroup of the Lorentz group, every representa-
tion of the Lorentz group will also be representations of SO(3). In fact, irreducible representa-
tions of the Lorentz group, which are characterized by two half-integers (A, B) produce many
representations of SO(3): with spins J = A + B, A + B − 1, , |A − B |. For example,
We saw that the regular 4D vector representation A µ contains spins 1, 0 so now we understand
1 1
that it corresponds to the ( 2 , 2 ) representation of the Lorentz group. The general tensor repre-
n n
sentations T µ1 µn corresponds to the ( 2 , 2 ). These are all irreducible representations of the
Lorentz group, but reducible representations of the SO(3) subgroup.
1
4 Spin 2
representation
All we know so far are the tensor representations, T µ1 µn which have only integer spins, so to
1
get representations containing spin 2 , we need something new.
1 1 1
There exist two spin 2 representations, ( 2 , 0) and (0, 2 ). What do these representations actu-
ally look like? We need to find 2x2 matrices that satisfy
K = JK + + JK −
What does this mean for actual Lorentz transformations? Well, the rotations are J
K
and the boosts are K = i(J− − J+ ) so
1 σi σi
( , 0): Ji = , Ki = − i (40)
2 2 2
1 σi σi
(0, ): Ji = , Ki = i (41)
2 2 2
Since the Pauli matrices are Hermetian σi† = σi we see that the rotations are Hermetian but the
boosts are anti-Hermetian. This is the same as what we found for the vector representation.
Also notice that these two representations are complex conjugates of each other. In contrast, the
vector representation was real .
1
Explicitly, if ψL is a ( 2 , 0) spinor, known also as a left-handed Weyl spinor, then under rota-
tions angles θi and boost angles βi
1 1
ψL = (1 + iθiσi + βiσi + )ψL
1
(iθiσi +βiσi)
ψL → e 2 (42)
2 2
Similarly,
1 1
ψR = (1 + iθiσi − βiσi + )ψR
1
(iθiσi − βiσi)
ψR → e 2 (43)
2 2
Infinitesimally,
1
δψL = (iθi + βi)σiψL (44)
2
1
δψR = (iθi − βi)σiψR (45)
2
Note again the angles θi and βi are real numbers. Although we mapped Ji− or Ji+ to 0, we still
have non-trivial action of all the Lorentz generators. So these are faithful irreducible representa-
tions of the Lorentz group. Similarly
1
δψL† = ( − iθi + βi)ψL† σi (46)
2
† 1 †
δψR = ( − iθi − βi)ψR σi (47)
2
Since the group element is the exponential of the generator Λ = eiλ, unitarity requires that λ † =
λ, that is, that λ be Hermetian.
Since su(2) is the special unitary algebra, all of its representations are unitary. So, the gener-
ators for the su(2) × su(2) decomposition J K ± = JK ± i KK are Hermetian. Thus exp(i θ+i J+i +
i i i i
iθ−J−) is unitary, for real θ+ and θ−. But this doesn’t mean that the corresponding representa-
tions of the Lorentz group are unitary. We said that a Lorentz group element is
Λ = exp(iθiJi + iβiKi) (49)
where the θi are the rotation angles and βi the boosts “angles”. These are real numbers. They
i
are related to the angles for the J± generators of su(2) × su(2) by θ+ = θi + iβi and θi− = θi − iβi.
So for a boost, the J+ and J− generators get multiplied by imaginary angles, which makes the
transformation anti-unitary. Thus none of the representations of the Lorentz group generated
this way will be unitary. We have just proven that there are no finite dimensional unitary repre-
sentations of the Lorentz group.
1
Spin 2
representation 7
To construct a unitary field theory, we will have to use the same trick we used for spin 1
particles. We will construct an infinite dimensional representation by having the basis depend
on the momentum ψi(p). For fixed momentum, say p µ = (m, 0, 0, 0) in the massive case, or p µ =
(E , 0, 0, E) in the massless case, the group reduces to SO(3) and ISO(2) respectively. These
groups do have unitary representations. For the case of spin 1, we were led to Lagrangians with
2 1
F µν for spin 1, and gauge invariance and charge conservation if m = 0. For spin 2 , we will find
that unitary and Lorentz invariance requires fermions to anticommute.
fields. Then we see that this is just the Lagrangian for a couple of scalars. So it’s not enough to
declare the Lorentz transformation properties of something, the Lagrangian has to force those
transformation properties. In the same way, a vector field is just 4 scalars until we contract it
with ∂ µ in the Lagrangian.
To proceed, let’s look at
ψL† σiψL (56)
This transforms as
1 1
δψL† σiψL = ψL† σi[(iθ j + β j )σ jψL] + [ψL† ( − iθ j + β j )σ j ]σiψL (57)
2 2
βj † iθ
= ψ (σiσ j + σ jσi)ψL + j ψL† (σiσ j − σ jσi)ψL (58)
2 L 2
= β jψL† ψL − θ jεijkψL† σkψL (59)
8 Section 5
5 Dirac matrices
Expanding them out, the Dirac matrices are
1 0 σi
γ0 = γi = (73)
1 − σi 0
Dirac matrices 9
Or,
0 1 0 1
01 0 1
γ0 = , γ1 = (74)
1 0 −1 0
1 0 −1 0
0 −i 0 1
0 i 0 −1
γ2 = , γ3 = (75)
i 0 −1 0
−i 0 1 0
They satisfy
{γ µ , γν } = 2η µν (76)
In the same way that the algebra of the Lorentz group is more fundamental than any particular
representation, the algebra of the γ ′s is more fundamental than any particular representation of
them. We say the γ ′s form the Dirac algebra, which is a special case of a Clifford algebra.
This particular form of the Dirac matrices is known as the Weyl representation.
The Lorentz generators are
i
S µν = [γ µ , γν ] (77)
4
They satisfy the Lorentz algebra for any γ ′s satisfying the Clifford algebra. That is, you can
derive from {γ µ , γν } = 2η µν that
Note: this is different from the 4-vector representation, with the matrices V µν . We have
found 2 different 4-dimensional representations. In each case, the group element is determined
1 1
by 6 real angles θ µν (3 rotations and 3 boosts). There is the vector or ( 2 , 2 ) representation,
which is irreducible, which has Lorentz group element
ΛV = exp(iθ µνV µν ) (79)
1 1
and the Dirac representation, ( 2 , 0) ⊕ (0, 2 ), which is reducible and has Lorentz generators
6 Rotations
Now let’s see what happens when we rotate by an angle θ in the z plane. We use
Λ(θz) = exp(iθzJz ) (85)
How do we exponentiate a matrix? The standard trick is to first diagonalizing it with a unitary
transformation, do the rotation, then transform back. This unitary transformation is like
choosing a direction, except it is purely mathematical as the direction must be complex!
First, for the vector representation
0 0
0 1
= U −1 −1
J3 = V12 = i U (86)
−1 0 1
0 0
So,
0
exp( − iθz)
ΛV (θz) = exp(iθzV12 ) = U −1 U (87)
exp(iθz)
0
ΛV (2π) = 1 (88)
That is, we rotate 360 degrees and we’re back to where we started.
For the spinor representation
Λs(θz ) = exp(iθzS12 ) (89)
the 12 rotation is already diagonal:
1
1 −1
S12 = (90)
2 1
−1
So,
i
exp( 2 θz)
i
− exp( 2 θz)
Λs(θz) = exp(iθzS12 ) = (91)
i
exp( 2 θz)
i
− exp( 2 θz)
−1
1
Λs(2π) = (92)
−1
1
Lorentz Invariants 11
Thus a 2π rotation does not bring us back where we started! If we rotate by 4π it would. So we
1
say spinors are spin 2 . What does this mean physically? I have no idea. There are plenty of
1
physical consequences of spinors being spin 2 , but this business of rotating by 2π is not actually
a physical thing you can do.
As an aside, note that the factor of 2 is determined by the normalization of the matrices,
which is set by the Lie Algebra. For each representation by itself, this would be arbitrary, but it
is important for expressions which combine the representations to be invariant.
7 Lorentz Invariants
The γ matrices themselves transform nicely under the Lorentz group.
where the Λs are the Lorentz transformations acting on each γ µ individually, as a matrix, and
the ΛV is the vector representation which mixes up the Lorentz indices. That is, writing out the
matrix indices γ µαβ , this means
(Λs−1)δαγ µαβ (Λs) βγ = (ΛV ) µνγναβ (94)
where µ refers to which γ matrix, and α and β index the elements of that matrix.
Then the equation
{γ µ , γν } = 2η µν (95)
really means
γ µαγγνγβ + γναγγ µγβ = 2η µνδ αβ (96)
And the equation
i
S µν = [γ µ , γν ] (97)
4
Should really be written as
αβ i αγ γβ
S µν = γ µ γν − γναγγ µγβ (98)
4
For an expression like
V µV µ = V µη µνVν = V µ {γ µ , γν }Vν (99)
For this to be invariant, we would need Λs† = Λs−1, that is, for the representation of the Lorentz
group to be unitary. The spinor representation, like the vector representation is not unitary,
because the boost generators are anti-Hermetian.
It is useful to study the properties of the Lorentz generators from the Dirac algebra itself,
without needing to choose a particular basis for the γ µ. First note that
So the eigenvalues of γ0 are ± 1 and the eigenvalues of γi are ± i. Thus if we diagonalize γ0, we
would find it Hermetian, and if we diagonalize γ1, γ2, or γ3 we would find they are anti-Herme-
tian. This is true in general.
γ0† = γ0, γi† = − γi (102)
Then,
†
† i ih † † i ih † † i
(S µν ) = [γ µ , γν ] =− γ , γ = γ µ , γν (103)
4 4 ν µ 4
12 Section 7
Which implies
† †
Sij = Sij S0i = − S0i (104)
Again, we see that the rotations are unitary and the boosts are not. You can see this from the
explicit representations above. But because we showed it algebraically, it is true in ANY repre-
sentation of the Dirac algebra.
Now, observe that one of the Dirac matrices is Hermetian, γ0 (γ0 is the only Hermetian
Dirac matrix because the metric signature is (1,-1,-1-,1)). Moreover
⇒ γ µ† = γ0 γµγ0 (106)
† ih † † i ih i i
⇒ γ0S µν γ0 = γ0 γ µ , γν γ0 = γ0 γ µ† γ0, γ0 γν† γ0 = [γ µ , γν ] = Sµν (107)
4 4 4
† †
(γ0Λsγ0) = γ0exp(iθ µνSµν ) † γ0 = exp( − iθ µνγ0S µνγ0) = exp( − iθ µνS µν ) = Λs−1 (108)
Then, finally,
ψ † γ0 ψ → (ψ †Λs†)γ0(Λsψ) = ψ † γ0Λs−1Λsψ = ψ † γ0 ψ (109)
ψ̄ ≡ ψ † γ0 (110)
The point is that ψ̄ transforms according to Λs−1. Thus ψ̄ψ is Lorentz invariant.
We can also construct objects like like
( + m2)ψ = 0 (118)
It is in this sense that people sometimes say the Dirac equation is the square-root of the Klein-
Gordon equation.
Coupling to the photon 13
We can integrate the Lagrangian by parts to derive the equations of motion for ψ̄ :
ψ̄ ( − i ∂ − m) = 0 (121)
where the derivative acts to the left. This makes the conjugate equation look more like the orig-
inal Dirac equation.
[(i∂ µ + eA µ)2 − m2 φ
(125)
Following the same route as before
0 = (i∂ + eA + m)(i∂ + eA − m)ψ (126)
2
= (i∂ µ + eA µ)(i∂ν + eAν )γ µγ ν − m ψ
(127)
1 1
= {i∂ µ + eA µ , i∂ν + eAν }{γ µ , γ ν } + [i∂ µ + eA µ , i∂ν + eAν ][γ µ , γ ν ] − m2 ψ (128)
4 4
This corresponds to a magnetic dipole moment. With conventional normalization, the size of the
e
magnetic moment is µB = 2m . We’ve made a physical prediction: charged fermions should have
e
magnetic dipole moments with size given by exactly µB .
This is pretty remarkable physical result. For a free spinor, we reproduce the equation of
motion of a scalar field. But when the spinor is coupled to the photon, we find an additional
interaction corresponds to a magnetic dipole moment. We can read off that the electron has spin
1
2
. Note: the coupling to the electric field is not an electric dipole moment – that would not have
an i, but is simply the effect of a magnetic moment in a boosted frame.
9 Probability current
The Noether current associated with the global symmetry ψ → eiαψ is
J µ = ψ̄γ µψ (134)
This, like any Noether current, is conserved on the equations of motion even if we set A µ = 0.
Note that the zero component of this current is
Q = ψ † ψ = ψL† ψL + ψR
†
ψR (135)
We originally hoped this would be Lorentz invariant, which it is not. Now we see that it trans-
forms as the 0 component of a conserved current. We can interpret this as the probability den-
sity for a fermion. The expectation value of Q is electron-number, which is the number of elec-
trons minus the number of positrons. The spatial components of J µ denote electron number-
flow. This is the same thing as the charge current, which couples to A µ up to a factor of the
electric charge e.
10 Helicity eigenstates
1 1
Dirac spinors, what we have been using, are 4 component complex fields in the ( 2 , 0) ⊕ (0, 2 )
representation of the Lorentz group. Let’s return for a moment to thinking about the 2-compo-
nent fields.
In the Weyl basis, the γ matrices have the form
0 σµ
γµ = (136)
σ̄ µ 0
We can write our spinor as the left and right handed parts
ψL
ψ= (138)
ψR
The Dirac equation is
− m iσ µDµ ψL
(139)
iσ̄ µD µ − m ψR
meaning
(iσ̄ µ∂ µ + eσ̄ µA µ)ψR = mψL (140)
So the electron mass mixes the left and right handed states.
Helicity eigenstates 15
1 + γ5 1 − γ5
0 1
PR = = , PL = = (149)
2 1 2 0
Spin is a vector quantity. We say spin up, or spin down, etc. It is the eigenvalue of S K = σK2
for a Fermion. If there is no angular momentum, the spin and the rotation operators are iden-
K = JK . We also talk about spin s, as a scalar, which is the eigenvalue s(s + 1) of the oper-
tical S
K 2
ator S . This is what we mean when we say spin 2
1
Helicity refers to the projection of spin on the direction of motion. Helicity eigenstates sat-
K · Kp
S
isfy E Ψ = ± Ψ. Helicity eigenstates exist for any spin. For spin 1, circularly polarized light are
the helicity eigenstates.
Chirality is a concept that only exists for Fermions. It refers to the representation of the
Lorentz group the fields transform under, i.e. ψL or ψR for a Dirac Fermion. These are eigen-
states of γ5 for which we use L and R as subscripts. We write γ5 ψL = ψL and γ5 ψR = − ψR.
3
Chirality works for higher half-integer spins too. For example, a spin 2 field can be put in a
Dirac spinor with a µ index, ψ µ. Then γ5 ψ µ = ± ψ µ are the chirality eigenstates. Up until now
we have been saying right-handed and left-handed, which was shorthand for chirality.
For free massless spinors, the spin eigenstates are also helicity eigenstates and chirality eigen-
states. The Hamiltonian for the massless Dirac equation commutes with the operators for chi-
JK · Kp K . The QED interaction ψ̄ Aψ = ψ̄Lγ µψL +
rality, E , the helicity, γ5, and the spin operators, S
ψ̄RAψR preserves chirality. Helicity, on the other hand, is not necessarily preserved by QED: if
a left-chirality spinor reverses direction, its helicity flips. Thinking about the helicity of spinors
at high energy is therefore useful, while thinking about chirality is not so much, because it never
changes.
For massive spinors, the free Hamiltonian no longer commutes with the chirality operator
due to the mψ̄ψ = mψ̄LψR + ψ̄RψL term. Thus even under free evolution ψL will pick up a ψR
component over time. However, for the free theory which is rotationally invariant, J K commutes
with the Hamiltonian. Thus, for a free particle, spin, momentum, and helicity are all conserved.
Helicity is therefore a good quantum number for a free massive theory. However, in the massive
case, when we go to the non-relativistic limit, it is often easier to talk about spin, the vector.
Projecting on the direction of motion doesn’t make so much sense when the particle is nearly at
rest, or in a gas, say, when its direction of motion is constantly changing. The QED interactions
do not preserve spin, however only a strong magnetic field can flip an electrons spin. So as long
as magnetic fields are weak, spin is good quantum number.
In practice, we hardly ever talk about chirality. The word is basically reserved for chiral the-
ories, which are theories that are not symmetric under L ↔ R, such as the theory of the weak
interactions. We often talk about helicity. In the high energy limit, helicity is used interchange-
ably with chirality. As a slight abuse of terminology, we say ψL and ψR are helicity eigenstates.
In the non-relativistic limit, helicity is only used for photons, when it is synonymous with polar-
ization of circularly polarized light. For fermions, we use spin, the vector, as the useful quantity.
In the rest frame, p20 = m2, so p0 = ± m. The solution with p0 = − m is confusing. It is a legiti-
mate solution to the equation of motion, but it says that these particles are going backward in
time! But note that
φ = φ peip0 t ⇔ φ⋆ = φ⋆pe−ip0 t (157)
So we can just as easily interpret these solutions as anti-particles going forward in time. Obvi-
ously this interpretation is easier to swallow, but Feynman spent some time showing that there
really is no physically distinguishable difference.
Now back to spinors. The Dirac equation is
ψ̄ ( − i ∂ − eA − m) = 0 (159)
with p2 = m2. These are like the solutions A µ(x) = ε µ(p)eipx for spin-1 plane waves. In fact
there are solutions to the equations of motion p2 = m2 with p0 < 0 as well as p0 > 0. We already
know that the negative frequency solutions are not a problem since they correspond to antipar-
ticles with positive frequency. Rather than work also with ψ̄ we simply write the other solutions
as
d3 p
Z
ψs(x) = vs(p)e−ipx (161)
(2π)3
with p0 = Kp 2 + m2 > 0 for both. Then we can think of us and v̄s are the polarizations for par-
p
ticles and anti-particles, respectively. Note that in QM notation, this is consistent with ψ, us
and vs being kets (column vectors).
Now let’s use the Dirac equation. In the Weyl basis, (i∂ − m)ψ becomes
− m p µσ µ − m − p µσ µ
us(p) = vs(p) = 0 (162)
p µσ̄ µ − m − p µσ̄ µ − m
The Dirac spinor is a complex 4-component object, with 8 degrees of freedom. The equations of
motion reduce it to 4 degrees of freedom: spin up and spin down for particle and anti-particle.
Now let’s boost in the z-direction. Peskin and Schroeder do the actual boost. But we’ll just
solve the equations again in the boosted frame and match the normalization. If p = (E , 0, 0, pz)
then
µ µ E − pz 0 E + pz 0
p σ = , pµσ̄ µ = (166)
0 E + pz 0 E − pz
√ √
Let a = E − pz and b = E + pz , then m2 = (E − pz)(E + pz)=ab.
Then the Dirac equation is
− ab 0 a2 0
0 − ab 0 b2
(i∂ − m)ψ = 2 u β (p) = 0 (167)
b 0 − ab 0
0 a2 0 − ab
The solutions are
aξ1
a 0
bξ2 0 b ξ
us = u ξ(p) = bξ1 = b 0 ξ
(168)
0 a
aξ2
which are easy to check. Note that in the rest frame pz = 0, a2 = b2 = m, and ξ reduces to ξs
above. So the solutions in the pz frame are
√ !
E − pz 0
√ ξs
0 E + pz
us(p) = √ ! (169)
E + pz 0
√ ξs
0 E − pz
Similarly,
√ !
E − pz 0
√ ηs
0 E + pz
vs(p) = √ ! (170)
− E + pz 0
√ ηs
0 − E − pz
Using
√ ! √ !
√ E − pz 0 √ E + pz 0
p·σ = √ , p · σ̄ = √ (171)
0 E + pz 0 E − pz
where the square root of a matrix means diagonalize it and then take the square root. In prac-
tice, we will always pick p along the z axis, so we don’t really need to know how to make sense
√
of p · σ . But this notation is useful because it makes Lorentz invariance manifest.
In the massless limit, the solutions are
! !
0 0 0 0
0 √2E ξs 0
√
2E
ηs
us(p) = √ ! , vs(p) = √ ! (173)
2E 0 ξ
− 2E 0
s ηs
0 0 0 0
Solving the Dirac equation 19
For Weyl spinors, there are only 4 degrees of freedom off-shell, so there can only be 2 on-shell
once we use the equations of motion. Recalling that the Dirac equation splits up into separate
equations for ψL and ψR, we see that there is only one particle and one antiparticle solution in
the top two rows, and one particle and one antiparticle solution in the bottom two rows. Thus
on shell, the 2 degrees of freedom Weyl spinors have are particle and antiparticle, for the same
helicity.
= 2mδss ′ (177)
This is the (conventional) normalization for the inner product .
For m = 0 this way of writing the normalization convention is not useful. Instead we can cal-
culate
√ ! √ !
p · σ ξs p · σ ξs ′
us†(p)us(p) = √ † √ = 2Eξs† ξs ′ = 2Eδss ′ (178)
p · σ̄ ξs p · σ̄ ξs ′
√ R 3
These are the same 2E factors which help make Lorentz invariance manifest for d p inte-
grals.
We can also compute the outer product
2
X
us(p)ūs(p) = p + m (179)
s=1
So when we sum Lorentz indices or internal spinor indices, we use an inner product and get a
number. When we sum over polarizations/spins, we get a matrix.
You know about spin-statistics already from quantum mechanics. Say we have two left-
handed Weyl spinors ψ and χ. Then ψ Tσ2 χ = i(ψ1 χ2 − ψ2 χ1) is Lorentz invariant. This may
look more familiar if we use arrows for the ψ and χ states:
1
|Ψi = √ (|↑i|↓i − |↓i|↑i) (197)
2
This kind of two-particle wavefunction is automatically antisymmetric to the exchange of parti-
cles. This is the Pauli exclusion principle. Here we see that it is intimately related to Lorentz
invariance.
This isn’t anything close to a derivation of the spin-statistics theorem. We just showed that
there is a Lorentz-invariant bi-linear we can construct from a Weyl fermion. In the next lecture,
we will show that the S-matrix is not Lorentz invariant unless Fermions anti-commute, which is
a much more convincing argument that the spin-statistics theorem must hold.
13 Majorana fermions
If we allow fermions to be Grassman numbers, then we can write down a Lagrangian for a single
Weyl spinor with a mass term
m
L = iψL† σ µ∂ µψL + i (ψL† σ2 ψL
⋆ T
− ψL σ2 ψL) (198)
2
These kinds of mass terms are called Majorana masses. Note that this mass term breaks the
symmetry under ψ → eiαψ, since
T T iα T
ψL σ2 ψL → ψLe σ2eiαψL = e2iαψL σ2 ψL (199)
So a particle with a Majorana mass cannot be charged.
The equation of motion for the Majorana fermion is
⋆
σ µ∂ µψL + mσ2 ψL =0 (200)
which follows from the Lagrangian above.
If we have a Majorana fermion, we can still use the Dirac algebra to describe it, but we have
to put it into a 4-component spinor
ψL
ψ= ⋆ (201)
iσ2 ψL
⋆
This transforms like a Dirac spinor because σ2 ψL transforms like ψR. Then
m m
ψ̄ψ = − i (ψL† σ2 ψL
⋆ T
− ψL σ2 ψL) (202)
2 2
which is the Majorana mass term.
Since (in the Weyl basis), using σ22 = 1,
( − i)( − i)σ2σ2⋆ ψL
0 σ2 ψL ψL
− iγ2 ψ ⋆ = − i ⋆
⋆
= = ⋆ =ψ (203)
− σ2 0 iσ2 ψL ( − i)( − i)( − σ22)ψL
⋆ iσ2 ψL
13.2 summary
In summary,
We have seen three types of spinors
• Dirac Spinors: massive, left AND right handed
• Weyl Spinors: massless, left OR right handed
• Majorana spinors: real constrained Dirac spinors.
For a little physics on top of this algebra, there’s a particle in nature called the neutrino which
is not charged. So it can be a Majorana or a Dirac fermion. In fact, we’re not sure what it is,
but we’re trying hard to find out. Weyl spinors are also important. They play a key role in the
theory of the Weak interactions. Weyl spinors are also critical for supersymmetry and string
theory. But for QED, we can just stick with Dirac spinors.