Professional Documents
Culture Documents
Introduction to
General Relativity
Amol Sasane
, . , World Scientific
A Mathematical
Introduction to
General Relativity
Other World Scientific Titles by the Author
Amol Sasane
London School ofEconomicJ'~ UK
All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means,
electronic or mechanical, including photocopying, recording or any i,iformation storage and retrieval
system now known or to be invented, without written permission from the publisher.
For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance
Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy
is not required from the publisher.
Printed in Singapore
To my teachers
This page intentionally left blank
Preface
vii
viii A Mathematical Introduction to General Relativity
What determines the spacetime (M, g)? The answer is the field equation 1:
--------------
Terms appearing here will be explained eventually:
1 Ones on the left-hand side (cap-
turing the geometry of M), will be covered in Chapter 9, while the one on the right-hand
side (describing the matter content in M) will be specified in Chapter 13.
Preface ix
where the right-hand side T describes the matter distribution (called the
'energy-momentum tensor field of matter'), and the left-hand side is ex-
pressible in terms of the metric g. The metric g is something that equips
the tangent space TpM at each point of the manifold M with a Lorentzian
scalar product, and essentially determines the 'shape' of M. So to obtain
M, one needs to find a metric g which is a solution to this field equation.
Given the metric g, the motion of 'test' matter (that is matter whose
effect on the T can be neglected) is given by the geodesic equation 2 ,
'v'v-r V.,, = 0,
where V.,, is the velocity field along 'Y, and this describes the motion of a
particle in the spacetime M by means of a curve 'Y : [a, b] -+ M, called its
'worldline'.
Does space 'evolve'? Or, what is the reality experienced by us, in terms
of our view of the spatial world, with a notion of time passing in our minds?
Firstly, we are a 'worldline', namely a curve 'Y: J-+ M, where I c JR is an
interval, thought of as the list of events we encounter in our lifetimes-the
curve merely describes the order in which we meet these events. The 'time'
we experience between two events p="((a) and p' ="f(b) along our worldline
is the 'proper time', defined by
(The notation will be clarified eventually in due course.) We will see that
our worldline 'Y at each event p = 'Y(t) along it, has a 'timelike' tan-
gent vector v.,, ,P in the 4-dimensional tangent space TpM. This v.,, ,P has
a 3-dimensional orthogonal complement (v.,,,p)1_ (with respect to the scalar
product g(p)), and this is roughly the 'space' we perceive at each moment.
Thus the time experienced is 'personal' to each observer, and so is the
perceived 'space' at each moment. Different observers thus have different
ticking clocks (depending on what worldline they have in spacetime, and
also different spatial realities.
How does one use this theory of spacetime in classical (i.e. nonquan-
tum) physics? Given a matter distribution, for example a star, or even the
universe, the field equation is solved to obtain the metric g (analogous to
finding the 'gravitational potential' U by solving a Poisson equation involv-
ing the matter density distribution pin classical Newtonian gravity; minus
2 The notation which appears here will be clarified in Chapter 7, and we will discuss
geodesics in Chapter 8.
X A Mathematical Introduction to General Relativity
the gradient of the potential then gives the gravitational field g). This can
then be used to obtain the geodesic equations for test matter (analogous
to Newton's second law of motion; knowing the acceleration g, and solving
the resulting equation describes the particle trajectory).
It is well-known that one road towards the goal of mastering a subject is
to teach it, since it makes it necessary to organise what has been learnt, so
that the core principles are laid bare, and these notes were originally writ-
ten with this intention. I hope that readers of this book will find it useful.
For readers who want to delve deeper into the subject, some of the sources
that have influenced the writing of this book, are recommended. A deeper
and a more complete treatment is given in the book [Sachs and Wu(1977)],
but it assumes prior familiarity with differential geometry, while we develop
this language along the way, starting from scratch. A more modern text-
book, with an emphasis on Lorentzian geometry, is [O'Neill(1983)]. More
elementary than these two books, but also approaching the subject mathe-
matically, are the clear online lectures by Frederic Schuller [Schuller(2015)],
and the book [Oloff(2018)] (in German3 ). I have used these and many other
sources which are listed in the bibliography, but no claim to originality is
made for any of the discussed material in case there is a missing reference.
I am grateful to Jose Natario for reading the book, for pointing out
various errors and typos, and for suggesting improvements in the text and
additional exercises. Of course, any remaining errors or deficiencies are
solely my responsibility. In particular, an improved proof of Lemma 5.1,
and Exercises 5.19, 5.20, 8.8, 9.15, 12.9, 13.4, 14.5, 14.6, 14.9, 16.3, 16.6,
16. 7 were suggested by Jose. The flagged errors, and the suggestions for
improvements, are too numerous to list here.
Thanks are also due to OlofRubin for pointing out a gap in Theorem 1.2,
to Sara Sal Santos for useful feedback, including spotting a mistake in an
earlier version of the picture on page 144, and to Adam Ostaszewski for
his comments on the Preface. It is a pleasure to thank Rochelle Kronzek
and Rok Ting Tan at World Scientific for their help and enthusiasm. I am
indebted to Sara Maad Sasane, for useful discussions, detailed feedback on
the first six chapters, and encouragement.
Amol Sasane
Lund, 2021
We use N for the set {1, 2, 3, • • •} of natural numbers, Z for the set
{· • • , -1, 0, 1, • • •} of integers, and JR for the set ofreal numbers. The equiv-
alence class of an element a E S under an equivalence relation on S will be
denoted by [a]. We denote the identity map on a set S, S 3 x ...... x E S,
ids, and if Sis clear from context, simply by id. For a function f: X-> Y,
and a y E Y, the notation f = y means that for all x EX, f(x) = y. RHS
means 'right-hand side', and LHS means 'left-hand side'.
Points or vectors in JR 3 are often denoted by bold face letters, for exam-
ple, x = (x,y,z) E JR 3 . The 'vector/cross product' of vectors x,y E JR 3
is denoted by x x y E JR3 . The components of a vector x in !Rm are
often denoted by xi, 1 ~ i ~ m, in accordance with the physics liter-
ature, and if e.g. the square of xi E JR is meant, we will use paren-
thesis, writing (xi) 2 . The standard basis vectors in !Rm are given by
e1 = (1, 0, · · · , 0), · · · , em = (0, · · · , 0, 1). We will also use the nota-
tion (x, y) = x 1 y 1 + ••• + xmym for the Euclidean inner product of
x = (x1,- · · , xm) and y = (y1,- · · , ym) E !Rm, and llxll := -./(x, x) for
the Euclidean 2-norm. Given x E !Rm and r > 0, the open ball B(x, r) in
the Euclidean topology is B(x,r) = {y E !Rm: IIY - xii < r}. The closure
of a subset S of a topological space will be denoted by S. Thus the clo-
sure of the open ball B (x, r) in the Euclidean topology is the closed ball
B(x, r) = {y E !Rm : IIY - xii ~ r}. Throughout the book, unless otherwise
stated, repeated indices, mostly one appearing as a superscript and one as
a subscript, will be summed over the range of the index in question. Thus
here xiei means x 1 e 1 + •••+ xmem. The Kronecker delta symbol is denoted
by 8j, or sometimes by 8ij, and means the following:
8i = { 1 if i = j,
1 0 if i =I- j.
xi
xii A Mathematical Introduction to General Relativity
Given a real vector space V, its dual vector space consisting of all linear
maps w : V -+ IR, will be denoted by V* , and if {ei, i E J} is a basis for
V , the dual basis is denoted by {Ei, i E J}, and is defined by €i ( ej) = 8).
If Sc V, then spanS is the subspace consisting of all linear combinations
of vectors from S (with an empty sum being defined as the zero vector).
Also, -S := {-v : v E S}. An invertible linear transformation T : V -+ W
between vector spaces V, W is referred to as a linear isomorphism.
Let a~, bij , cij , 1 :,;;; i :,;;; n and 1 :,;;; j :,;;; m, be collections of numbers
or functions. Then writing A = [a~], B = [bij], C = [cij] means A , B , C
are the n x m matrices, whose entry in the ith row and jth column is a~,
respectively bij, cij. The identity matrix is denoted by I or sometimes by
Im (to emphasise its size as an m x m matrix). Thus Im = [8j] . The
transpose of a matrix M will be denoted by Mt. The determinant of a
square matrix M is denoted by <let M. !Rnxm denotes the set of n x m
matrices with real entries. GL(IR,m) stands for the group of invertible in
!Rmxm with the operation of matrix multiplication. The zero entries in a
matrix are sometimes left as blanks.
For n E N the nth order derivative of an n times differentiable function
f: JR-+ JR at a point t E JR will be denoted by J<nl(t). If Uc !Rm is an open
set, then for an f : U -+ !Rn having the component functions / 1 , · · · , fn,
and differentiable at p E !Rm , the Jacobian matrix off is given by
OU 1 1
of 1 (p) ... OUm
of (p) 1
[f'(p)] =
ror :
or
: .
OU 1 (p) ... OUm (p)
Preface vii
1. Smooth Manifolds 1
1.1 Charts and atlases 1
1.2 Topology on a smooth manifold . 10
1.3 Smooth maps . . . . . . . . . . . 13
2. Co-/Tangent space 17
2.1 Tangent vectors to a surface as derivations. 18
2.2 Tangent vector definition . . . . . . . . . 22
2.3 Bump functions . . . . . . . . . . . . . . . 23
2.4 Chart-induced basis for the tangent space 26
2.5 Derivatives of smooth maps 30
2.6 Cotangent space .. 34
2. 7 Pull-back of 1-forms 37
3. Co-/Tangent bundles 39
3.1 Tangent bundle 40
3.2 Vector fields .. 41
3.3 Push forward of vector fields under diffeomorphisms 44
3.4 Lie algebra of vector fields . 46
3.5 Cotangent bundle . . . . . 48
3.6 1-form fields . . . . . . . . . 50
3.7 Pull-back of a 1-form field . 53
3.8 Integral curves and the flow of vector fields 54
xiii
xiv A Mathematical Introduction to General Relativity
4. Tensor fields 61
4.1 (r, s)-tensor fields . . . . . . . . . . . . . . . . . . . . . . . 61
4.2 Point evaluation of tensor fields . . . . . . . . . . . . . . . 63
4.3 Tensor field components, tensor product and contraction . 68
4.4 Pull-back of (0, s)-tensor fields . . . . . . . . . . . . . . . 72
5. Lorentzian manifolds 75
5.1 Scalar product 76
5.2 Semi-Riemannian, Riemannian, Lorentzian manifolds. 85
5.3 Time-orientation, observers, and proper time 90
5.4 Raising/Lowering indices or musicality . 103
8. Geodesics 155
8.1 Geodesic equation 155
8.2 Existence and uniqueness 158
8.3 Affine parameter . . . . . 162
8.4 Lightlike geodesics in Schwarzschild spacetime. 165
8.5 The exponential map . . . . . 170
8.6 Normal coordinates . . . . . . 174
8. 7 Geodesics as extremal paths . 176
9. Curvature 181
9.1 Ricci and scalar curvatures . . . . . . . . . . . . . . . . 188
9.2 Cosmological time and cosmological red-shift in FLRW. 192
9.3 Geodesic deviation and curvature . . . . . . . . . . . . . 195
Contents xv
Solutions 361
Bibliography 477
Index 479
Chapter 1
Smooth manifolds
We have prior experience with using coordinates, for example, for the sur-
face of a sphere, one could use polar and azimuthal angles. But then we
realise that in order to obtain an injective mapping from the sphere to the
set of parameter ranges of the coordinates, we must work with patches on
the sphere, instead of the whole sphere in one go. So we anticipate that
also in the case of a manifold, it will need to be covered by patches, and
coordinates need to be set up in each patch.
1
2 A Mathematical Introduction to Geneml Relativity
Recall that !Rm can be equipped with its usual Euclidean topology.
Henceforth, we will often drop the specification 'm' in 'm-chart', and simply
refer to 'charts' for M, with the understanding that for a given M, them is
fixed. A chart allows us to talk about the coordinates of a point p E U c M,
with respect to the chart (U, <p), as the m-tuple of numbers cp(p) E !Rm.
Smooth Manifolds 3
Example 1.2. (A chart for !Rm). Let U = !Rm, and cp: !Rm-+ !Rm be the
identity map id. Then (!Rm, id) is a chart on !Rm. ◊
We will see later on that locally a manifold looks like !Rm (its tangent space).
In Example 1.1, there is a distinguished point, namely the origin, but
as nature does not provide natural coordinate systems, we also introduce
!Rm which has forgotten its origin, namely an affine space.
Fig. 1.1 Charts (Un, 'Pn) and (Us, cp 8 ) on S 2 . Looking at the two similar triangles in
the left picture, we have (1- z) : 1 = x: u and (1- z) : 1 = y: v. Analogously, from the
two similar triangles in the right picture, (1 + z): 1 = x: u and (1 + z): 1 = y: v . This
holds irrespective of the sign of z.
Exercise 1.2. Show that the inverse cp;; 1 : JE. 2 -+ S2 \{n} of the map 'Pn is given by
(u , v)
'P;:; 1 (
I-----+
2u 2v u 2 +v 2
u2 + v2 + 1 ' u2 + v2 + 1 ' u2 + v2 + 1 .
-1)
. 2 2 x 2 + y2 1 - z2
Hint: u + v = (l - z)2 = (1 - z)2.
A collection of charts for M (with the same m) will form an atlas provided
they cover the set M and satisfy a compatibility condition.
- 1
'Pi O<p j
Just as with charts, we will often drop them, and speak simply of an atlas,
instead of an m-atlas. Note that '{}j o <p-; 1 : 'Pi (Ui n Uj) --> '{}j (Ui n Uj) is a
bijection. Recall that a function f : !Rm --> !Rn is C 00 if the components Ji
of f , 1 ~ i ~ n, have all partial derivatives of all orders with respect to the
variables xJ, 1 ~ j ~ m, which are also continuous.
The single charts in Examples 1.2, 1.3 and Exercise 1.3 are all atlases
in a trivial manner.
Example 1.5. (S 2 revisited) .
The charts (Un , 'Pn) and (Us , 'Ps) from Example 1.4 form an atlas for S 2.
Firstly, UnuUs = S 2 . Secondly, 'Pn(UnnUs) = IR 2 \ {(0, 0)} = 'Ps(UnnUs)-
Finally, <p 8 o<p~1, 'Pno<p-;; 1 : IR 2\ {(0,0)}--> IR 2\ {(0, 0)} , the chart transition
maps, are both
(u ,v)
(u , v),..... 2 2'
u +v
which is C 00 • ◊
Exercise 1.5. Show that the four charts in Exercise 1.4 form an atlas for the
cylinder C in JE. 3 .
M
---+
r.p
g )
p' e'
~ 1 r.p'
---+
2
Given x E ~m, q := r.p- 1 (x) EM, and we wish to find the coordinates x' of
this q using the point p' and the basis B'. So we need to write q = p' +vp'q,
and find x' by expanding v p' q using the basis B'. We have v p' q = v p' P + v pq,
and as q = r.p- 1 (x), also Vpq = xiei. Introduce the change of basis matrix
A = [A{] E GL(~, m), where A; denotes the entry in the l h row and i th
column of A, defined by ei = A{ e1. Also, let b = (b1, • • • , bm) E ~m be
defined by vp'p = bJe1. Then
vp'q = vp'p + Vpq = l?e1+ xiei = bJe 1+ xi(A{e1)=(bl+ A{xi)e1,
and so the chart transition map r.p' o r.p- 1
is the affine linear map given by
r.p' o r.p- 1 (x) = b + Ax, for all x E ~m , which is C 00 . Its inverse r.p o (r.p')- 1 is
given by r.p o (r.p')- 1(x) = -A- 1b + A- 1x, x E ~m. So the atlases {(M, r.p)}
and {(M, r.p')} are compatible. ◊
Smooth Manifolds 7
Exercise 1.6. Show that the set Rm x GL(m, R) is a group with the composition
(h2,A2) · (h1,A1) = (h2+A2h1,A2A1) for (h2,A2), (h1,A1) in Rm x GL(m,R).
Thus the atlases given in Examples 1.2, 1.3, 1.4 and Exercise 1.4 can be
used to make the respective set M into a smooth manifold. As we will
use it frequently, we will call !Rm with the atlas comprising the single chart
(!Rm, id) as the smooth manifold !Rm with the standard smooth structure.
In Example 1.6, the two atlases A 1 and A 2 are not compatible. Hence
(JR, [A1]) and (JR, [A2]) are two different smooth manifolds.
The angle 0(p) is called the polar angle of the point p E 8 2 , and the angle
cp(p) is called the azimuthal angle of the point p E 8 2 . For a point p E U,
we have cp(p) := (0(p), cp(p)) E (0, n) x (0, 2n), where if p = (x, y, z), then
1 The name 'manifold' comes from the German word 'mannigfaltigkeit' used by Riemann
in his doctoral thesis, which contained, among other things, a discussion of multi-valued
complex functions and their (now called) Riemann-surfaces.
8 A Mathematical Introduction to General Relativity
l:~-:;~:k : ::::
0(p) = cos- 1 z and
</>(p) := 2\'..(x, y) :=
21r-cos- 1 ~ ify<0.
yx2+y2
Here (-1, 1) -> (0, 1r) and
cos- 1 : (-1 , 1) -> ( - ~ , ~) are the inverse
sin- 1 :
trigonometric functions. It can be checked that </> is well-defined and that
the map JR 2 \{(x, y) E JR 2 : x;;,, O} 3 (x,y) >-+ 4-(x, y) is C 00 . Using this, it
can be checked that (U, <.p) is an admissible chart: e.g., if v > 0, then
2 2
(u, v) cpocp;; cos- 1 u 2 +v 2 -1 , cos- 1 ~ u )
1
(
I.Pn(Un n U) 3 >------>
u +v +1 vu2+v2
is C 00 , and
(U 11 Un) 3 ( 0, </>) cp~ 1 ( (sin0) cos</> , (sin0) sin</>)
<.p 1- cos0 1- cos0
is C 00 .
z
The chart U covers 8 2 except for a 'slit', namely the intersection of 8 2 with
the half plane {(x, y, z) : y = 0, x;;,, 0} . In order to cover 8 2 , we can take
another chart (V, '¢), defined in a similar manner, by taking a differently
placed slit, in a plane perpendicular to the one containing the original slit.
Then V together with U, covers 8 2 . More explicitly, V covers 8 2 except
for the intersection of 8 2 with the half plane {(x , y, z) : z = 0, x,;;; 0}. Vis
the image of the map
(0, 1r) x (0, 21r) 3 (0, </>) >-+ (-(sin 0)(cos </>), cos 0, (sin 0)(sin </>)) E 8 2 c JR3 ,
and this map is obtained by taking the polar angle with the positive y-axis,
and the azimuthal angle with the negative x-axis counterclockwise in the
y = 0 plane; see the picture above. ◊
Smooth Manifolds 9
Exercise 1.8. Consider the square S := {(x,y) ER2 : lxl+IYI = 1}. Show that the
collection A:= {(U+, 'P+ ), (U-, cp-), (V+, "P+ ), (V-, 'I{'-)} is an atlas for S, where
U+:={(x,y)ES:x>O} cp+(x,y)=y,
U-=={(x,y)eS:x<O} cp-(x,y)=y,
V+:={(x,y) ES: y>O} "P+(x,y)=x,
V-=={(x,y) ES: y<O} "P-(x,y)=x.
Thus (S, [A]) is a smooth manifold. So a smooth manifold may not necessarily
'appear' smooth.
Example 1.11. (FLRW spacetime). Let I:= (0, oo). Then M :=Ix JR3 ,
with the atlas {(Ix JR3 , idixR.3)} is a 4-dimensional smooth manifold, called
the FLRW spacetime (after Friedman-Lemaitre-Robertson-Walker). ◊
We will want to talk about continuous maps between smooth manifolds, for
example a 'worldline' in a spacetime M (Definition 1.8). The way we equip
a smooth manifold with a topology is by insisting that the chart maps are
homeomorphisms (Theorem 1.2). This is the motivation for the following
definition.
Definition 1.6. (Open set in a smooth manifold).
Let (M,[A]) be an m-dimensional smooth manifold and {(Ui, rpi), iEI}E [A].
A set Uc Mis open if for all i EI, rpi(U n Ui) is open in !Rm, where !Rm is
given its standard Euclidean topology, described by the Euclidean metric
m
d(x,y) := ~(xi-yi) 2 x=(x1,··· ,xm), y=(y1,··· ,ym)E!Rm.
Proof. Let {(Ui,'Pi),i EI} E [A]. Then 0 = 'Pi(0 n Ui) is open in !Rm
for all i EI, and so 0 E 0. Also, for all i EI, 'Pi(Ui) = 'Pi(M n Ui) is open
in !Rm since (Ui, 'Pi) is a chart, and so ME 0.
Let U, V E O. Then for all i E I, 'Pi ((Un V) n Ui) = 'Pi (Un Ui) n 'Pi (V n Ui)
( c is always true for any map, and ::::, holds by the injectivity of 'Pi). Being
the intersection of open sets, 'Pi((U n V) n Ui) is open in !Rm for all i EI,
and consequently, U n V E O.
Let ½EO for all j E J. Then we have that for all i EI,
the inverse image under the (C 00 and hence) continuous map (cp j o cpi 1 )- 1
of the open set V n 'Pi(Ui n Uj) (c 'Pi(Ui)), is open. Hence cpj((cpi 1 V) n Uj)
is open for all j E I, that is, cpi 1 V is open in M. So 'Pi : Ui -+ 'Pi(Ui) is
continuous.
Let U c Ui be open. We want to show that 'Pi(U) = (cpi 1 )- 1 U is
open in !Rm (and hence open in 'Pi(Ui)). The fact that U is open means in
particular that 'Pi(U n Ui) is open in !Rm. But 'Pi(U n Ui) = 'Pi(U), since
Uc Ui. Thus the inverse map cpi 1 : 'Pi(Ui)-+ Ui is also continuous. □
Exercise 1.10. Let Rm be equipped with the standard smooth structure. Show
that the topology induced by this smooth structure coincides with the standard
Euclidean topology.
Exercise 1.14. Let M1, M2, M3 be smooth manifolds, and let fi2 : M1 -+ M2,
h3 : M2 -+ M3 be smooth maps. Prove that h3 o fi2 : M1 -+ M3 is smooth.
ter 2, we'll construct nonzero functions that vanish outside a neighbourhood of a point.
16 A Mathematical Introduction to General Relativity
Exercise 1.17. Let R be equipped with the two incompatible atlases A1 and A2
given in Example 1.6. Prove that (R, [A1]) is diffeomorphic to (R, [A2]). (From
our earlier considerations, the incompatibility of A1 with A2 can be expressed
by saying that the identity map fails to be a diffeomorphism between the smooth
manifolds (R, [A1]) and (R, [A2]). However, this exercise shows that there may
nevertheless be other maps which serve as a diffeomorphism.)
Exercise 1.20. (Lie group and left translation diffeomorphisms). A Lie group is
a group (G, •) equipped with a smooth structure, such that the multiplication
map G x G 3 (p, q) >-+ p • q E G, and the inverse map G 3 q >-+ q- 1 E G, are
smooth. Given p E G, the left-translation by pis the map Lp : G-+ G defined by
G 3 q >-+ p • q. Show that Lp is a diffeomorphism for each p E G.
Before moving on, we make a final remark on some notation used. Charts
will often be denoted by (U, cp), but also by (U, x), where the understanding
is that the component functions of the map x : U -> !Rm are denoted by
xi : U -> JR, 1 ~ i ~ m. Moreover, given a function f : M -> JR, a point
p EM, and an admissible chart (U, cp), we will denote the partial derivative
off o cp- 1 : cp(U) -> JR with respect to the ith variable at the point cp(p) by
Co-/Tangent space
Fig. 2.1 Tangent plane TpM and tangent vector vat a point p on the surface M .
17
18 A Mathematical Introduction to General Relativity
This may appear strange, but it is not too far-fetched. Indeed, imagine
what a child running in a playground experiences as its own velocity vector
in spacetime: The child perceives the changes in (the smooth functions
on spacetime such as) temperature, pressure and so on. The faster these
change, the larger the perceived velocity. Thus if 'Y: (-e:, e:) -+ M describes
the child's worldline, with 7(0) = p, then it makes sense to think of its
velocity at a point p E M as a map
f,..... :;f O 'Y) (0), f E C 00 (M),
since the above expression registers how fast f (temperature, pressure, etc)
is changing at p, as the child goes about playing!
showing that v,,..,,P is linear. It also satisfies the Leibniz rule since
v (f·) = d((J-g)o'Y)(O) = d((Jo'Y)·(go'Y))(O)
'"Y,P g dt dt
=(JO "f)(0) :;9 0 'Y) (0) + (g O "f)(0) :? O "f) (0)
= f(p)v,,..,,p(g) + g(p)v,,..,,p(J).
Now let us show that every derivation arises from a tangent vector to some
curve. To do this we will first prove the following 'division' lemma.
Lemma 2.1. Let B be an open ball in !Rm centered at 0. Suppose that
f E C 00 (B) and f(O) = 0. Then there exist 91, · · · ,9m E C 00 (B) such that
f=x 1g1+···+xmgm in U.
f 1 of f 1 of d(txk) f1 d
Xk9k = Jo oxk(tx)xkdt = Jo oxk(tx)~dt = Jo dt(J(tx))dt
Either from the formula for 9k in the proof, or upon differentiating both
sides off= x 1 g1 + · · · + xmgm with respect to Xk, we see that
of
9k(O) = oxk (0), 1 ~ k ~ m.
We will also need the following two lemmas saying that derivations annihi-
late constant functions and that they are 'local'.
We have
x(q) - x(p) = x(r(u, v)) - x(r(uo, vo)) = X(u, v) - X(uo, vo)
= (u - uo)e(u, v) + (v - vo)'Tl(u, v)
= (,/(q) - ,/(p))e(,/(q), ,p2(q))
+ (,p2(q) - ,p2(p)),,,(,p1(q), ,p2(q)).
Operating by v on both sides, we obtain, using the Leibniz rule, that
vx - 0 = v(q,..... ,p1(q))e(,p1(p), ,p2(p))
+ (,p1(p) - ,p1(p))v(q I-+ e(,p1(q), ,p2(q)))
+v(q,..... ,p2(q))'Tl(,p1(p), ,p2(p))
+ (,p2(p) - ,p2(p))v(q,..... 'T/(</(q), ,p2(q)))
ax
= v(q,..... ,p1(q)) ou (uo, vo)
ax
+ 0 + v(q,..... ,p2(q)) a:;-(uo, vo) + 0.
With similar computations carried out for vy and vz, we get
ax ax
ou (uo, vo) a;(uo, vo)
where U := v(q,..... cp 1 (q)) and V := v(q,..... cp 2 (q)). Since the normal vector
np is nonzero, the above U, V are the unique. Recall that (u 0 , v 0 ) E D is
such that r(uo,vo)=p. Define u(t)=uo+ Ut, v(t)=vo+ Vt fort E (-1:,1:),
where the 1: > 0 is taken small enough so that (u(t),v(t)) lies in a ball
centered at (u 0 ,v0 ) contained in D. Define -y(t) = r(u(t),v(t)), t E (-1:,1:).
Then
= f(p)v7,p(g) + g(p)v7,p(J).
◊
'lj;(x) := J~ 00
cp(t)dt,
-1 0 1 -1 0 1
By scaling and shifting, we get a function depicted in the top left-hand side
of the following figure, and by reflecting, the one in the bottom left-hand
side. Their pointwise product yields the bump function a in one variable,
shown on the right in the following picture.
Co-/Tangent space 25
-3
I
/ I
-1
I' I I'.\
I'.\.
1 3
-3
I I
-1 1 3
(0 ,0) X
26 A Mathematical Introduction to General Relativity
We will now show the following crucial result, which will allow tangent
vectors to act on smooth functions defined 'locally'.
Lemma 2.5. Let M be a smooth manifold, and pEM. Let vETpM. Then
if f,g E C 00 (M) agree in a neighbourhood ofp, we have vf = vg.
It is clear that Oxi,p E TpM: It is linear, since for f, g E C 00 (M) and c E JR,
1 )( (p))
::i.
ux, p(f +cg )=o((J+cg)ox-
::i . x
' uui
o(J O x- 1 ) o(g O x- 1 )
= 0ui. (x(p))+c Oui. (x(p))=(oxipf)+c(oxipg).
, ,
Co-/Tangent space 27
:(~'i
uJ
o x-1) (x(p))
as required.
Co-/Tangent space 29
Swapping the roles of x and x', ox,i,p = ;;: (e')oxi,p, where f =x'(p). ◊
Lemma 2.6. (Vectors as the tangent vectors of curves).
Let v E TpM. Then there exists an 1:. > 0 and a smooth curve 'Y : (-1:., 1:. )-+ M
such that 7(0) = p and v'Y,P = v.
Proof. Let (U, x) be an admissible chart containing p. If Oxi,p, 1 ~ i ~ m,
denote the chart-induced tangent vectors, then we can write v = viOxi,p for
some numbers vi. Let 1:. > 0 be small enough so that x(p) + tviei E x(U)
for ltl < 1:., where e1, · · · , em are the standard basis vectors in !Rm. Set
7(t) = x- 1(x(p) + tviei), t E (-1:., 1:.). Then 7(0) = x- 1(x(p)) = p, and for
all f E C 00 (M),
v f = d(J O 'Y) (0) = d(J(x- 1(x(p) + tviei))) (0)
'Y,P dt dt
o(Jox-1) i i
= au•. (x(p))v = V Oxi ,p/ = vf.
Consequently, v'Y,P = v. □
In particular, from the above, Oxi,P is the velocity vector v'Y;,P of a locally
defined curve 'Yi passing through p = 'Yi(0): 'Yi(t) = x- 1(x(p) + tei), for
ltl < f. for a small enough 1:. > 0.
Example 2.4. (Tangent vectors to curves decomposed using chart-induced
basis). Let 'Y : J -+ M be a smooth curve, where J is an open interval, and
suppose that 7(J) c U, where (U, x) is an admissible chart for M. We
claim that
30 A Mathematical Introduction to General Relativity
Exercise 2.7. Consider the smooth manifold IR 2 with the standard smooth struc-
ture. Consider the admissible chart (V, v,), where V = {(x,y): y > O}, vi= (r,0),
r(x,y) = ,Jx 2 +y 2 and0(x,y)=cos- 1 (x/,Jx 2 +y 2 ) . Letp=(x,y)EV. Express
Or,p and 00 ,p in terms of Ox ,p and Oy ,p•
Exercise 2.8. Let M x N be the product of the smooth manifolds M, N, and let
(p, q) E M X N. For VE TpM and w E TqN, define V EB w : C 00 (M X N) -+ IR by
(v EB w)f = v(J(- , q)) + w(J(p, ·)) for all f E C 00 (M x N). Show that v EB w belongs
to T(p ,q)(MxN). Prove that the map TpMxTqN 3 (v, w) >-+ vEBw E T(p ,q)(MxN)
is linear, injective, and surjective. Thus T(p ,q) ( M x N) is isomorphic to the direct
sum of TpM and TqN, written as T(p ,q)(MxN) "'TpMxTqN .
If 'Y : JR. --> M is a smooth curve, and f : M --> N is a smooth map, then
f o 'Y : JR.--> N is a smooth curve. For p = -y(0) E M and g E Coc,(N),
V = d(g O (f O 'Y)) (0) = d( (g O f) 0 'Y)) (0) = V ( 0 /).
f o--y,f(p)9 dt dt --Y,P g
Hence f : M --> N induces a mapping sending tangent vectors at p E M to
tangent vectors at f(p) EN. This motivates the following definition.
f
---+
For v E TpM, dfp(v) is a tangent vector at f(p) EN: For all g, h E C 00 (N)
and c Ell,
(dfp(v))(g+ch) = v((g+ch) of) =v(g of +c(h of)) =v(g of)+cv(h of)
= (dfp(v))(g) + c(dfp(v))(h), and
(dfp(v))(g-h) = v((g-h) o J) = v((g o J)-(h o J))
= (go J)(p)v(h o J) +(ho J)(p)v(g o J)
= g(J(p))(dfp(v))(h) + h(J(p))(dfp(v))(g).
Exercise 2.9. Prove that dfP : TpM -+ Tt(p)N is a linear map.
Exercise 2.10. Let 'Y : R -+ M be a smooth curve passing through the point
p = 7(0). Let Ot,o denote the tangent vector at t = 0 E R induced by the chart
(R, id). Let v'Y,P be defined by (2.3). Show that v'Y,P = d7o(Ot,o).
Exercise 2.11. Consider R 2 as a smooth manifold with the standard smooth struc-
ture. Fort ER, let '¢t: R 2 -+ R 2 be given by (x,y) >-+ (x-yt,y). Show that
(d'¢t)p(oy,p) = -tox,,J,,(p) + oy,,J,,(p)•
Exercise 2.12. Let M1, M2, M3 be smooth manifolds. Suppose that f: M1 -+ M2
and g: M2-+ M3 are smooth maps. Let p E M1. Show d(g o f)p = dgf(p) o dfp-
Exercise 2.13. Let f: M-+ N be a diffeomorphism between the smooth manifolds
Mand N. Let p EM. Show that dfp: TpM-+ Tt(p)N is an isomorphism.
Conclude that M and N have the same dimension.
32 A Mathematical Introduction to General Relativity
We end this section with a useful result, namely the inverse function the-
orem, in the smooth manifold setting. First we give a C 00 version in the
Euclidean setting, as a consequence of the 'usual' continuously differentiable
version; see for example [Rudin(1983), Thm. 9.24, p. 221].
Proposition 2.1. (Euclidean+C00-inverse function theorem).
Let U, V c !Rm be open. Let f: U-+ V be C 00 , and f'(p) be invertible at
p E U. Then there exist neighbourhoods Uo c U of p and Vo c V of f(p)
such that fluo : Uo-+ Vo is a diffeomorphism.
[Artin(1991), Thm. 5.7, p. 29], the (i,j)th entry of the matrix for (J- 1 )'(x)
is given by a polynomial combination of the partials of f, divided by the
nonzero determinant of the Jacobian matrix off, and as these are C 00 , it
follows that 1- 1 is C 00 • □
Vo
Vo := 'ljJ- 1 c Vis a neighbourhood of f(p). Moreover,
-1 ~
flu" = '1/J O flu" 0 cp,
that is, flu0 : Uo -+ Vo is the composition of the diffeomorphism fluo with
the diffeomorphisms 'ljJ- 1 and ¢. So f lu0 : Uo-+ Vo is a diffeomorphism. □
Remark 2.2. We now give the justification of a technical fact, as an ap-
plication of the Euclidean C 00 inverse function theorem. This fact will be
used later on in Remark 7.1.
Co-/Tangent space 33
g'(/3) = [r,'(/31) 0 ]
* Im-1
In this section, we will consider the dual space (TpM)* of TpM consisting
of linear maps w : TpM -+ R Elements of (TpM)* will be called cotangent
vectors or l-forms 5 • Later on, we will also learn that the instantaneous
momentum is a 1-form. This can be intuitively understood as follows: we
know that force is the rate of change of momentum, and so as a geometric
object, the nature of the force is the same as that of momentum. We know
that work, which is a scalar, is obtained by taking the 'line integral' of the
force. But when doing so, we take the dot product of the force with a little
vectorial displacement along the curve, namely a tangent vector. So the
force/momentum acts on tangent vectors and produces scalars.
We will see later on that tangent spaces together with their duals (called
cotangent spaces) at a point p E M can be used to build the space of
tensors at the point p, and when we consider a varying p, one gets a map
pi--+ (tensor at p), which is referred to as a tensor field. The field equations
of spacetime will involve such tensor fields. The modest aim in this section,
though, is to introduce the cotangent space as the dual of the tangent space.
Example 2.5. (Gradient of a function). Let jEC00 (M) and pEM. Define
the 1-form dfp E (TpM)* by dfp(v) = v(J) for all v E TpM. For v, w E TpM
and c E JR, dfp(v+cw) = (v+cw)(J) = v(J) + cw(!) = dfp(v) + cdfp(w).
The 1-form dfp is called the gradient off at p. ◊
Proof. In fact, {(dx 1 )v,··· ,(dxm)p} is the dual basis to the chart-
induced basis {ilx1,p, · · · , ilx"',p}:
· · o(xi O x- 1 ) oui ·
(dx')p(Oxi,p) = Oxi,pX' = oui (x(p)) = oui (x(p)) = 8}.
If cidxi = 0, then by acting on Oxi,p, we obtain Ci = 8}ci = 0. Thus
(dx 1 )p, • • • (dxm)p are independent. Also, for w E (TpM)* and v E TpM, we
can write v = vi Oxi ,p, and so
(w - w(ox;,p)(dxi)p) (v) = (w - w(ox;,p)(dxi)v) (vi Oxi,p)
= viw(Oxi,p) - w(ox;,p)vi8J = 0,
showing that w = w(ox;,p)(dxi)p- Hence the I-forms (dx 1 )p, • • • (dxm)p span
(TvM)*. □
l
chosen a basis, if we represent the 1-form as row vector of its components with
respect to the basis, then the above transformation expression can be rewritten
as matrix multiplication as follows:
a(x 1 o x'- 1 ) (x'(p)) ___ a(x 1 o x'- 1 ) (x'(p))
au 1 au"'
[
[ w~ · · · w~ ] = [ W1 • • • Wm ] : ··. : .
1 1
a(x'"' 0 x'- ) (x'(p)) ... a(x'"' 0 x'- ) (x'(p))
au 1 au"'
In particular, as (dxi)p=8l(dxk)p, we obtain
i ( i 0Xk I ) lj 0Xi I lj
(dx )p = 8k ox'i (e) (dx )p = ox'i (e )(dx )p-
Exercise 2.16. Let (U, x), (U', x') be admissible charts for a smooth manifold M.
Show that the matrices describing the change of components of tangent vectors
and 1-forms, with respect to the chart-induced bases, are inverses of each other:
If
Exercise 2.17. Let M be a smooth manifold. Let f,g EC 00 (M) and p EM. Prove
the Leibniz rule d(f-g)p = f(p)dgp + g(p)dfp-
Co-/Tangent space 37
2. 7 Pull-back of 1-forms
The first hurdle, to intrinsically define a vector field, does not seem to be
problematic. We want to associate to each point p E M a tangent vector
VP E TpM. So the vector field must be a map V with domain M. But
what should the target space be? At the point p, we need to pick a tangent
vector in TpM, and not from some other tangent space TqM with q =Ip.
Thus we consider the 'disjoint union' of all the tangent spaces,
TM= LJ TpM = {(p,v): pE Mand v E TpM} = LJ ({p} x TpM).
pEM pEM
39
40 A Mathematical Introduction to General Relativity
As the sets U from A cover M, the corresponding sets U from A cover TM.
Also, as cp : U -+ cp(U) is surjective, and as every m-tuple (v 1 , • • • , vm) E !Rm
results in a vector v = vi Oxi,p E TpM, <p is a surjective map. It is also
injective: If 1.p(p,v) = 1.p(q,w), then cp(p) = cp(q), so that p= q, thanks to
the injectivity of cp, and moreover, v = v(xi) Oxi,p = w(xi) Oxi,q = w. Thus
<p : U -+ cp(U) x !Rm is a bijective map onto the open subset cp(U) x !Rm of
JR 2m, and so (U,<p) is a chart for TM.
Moreover, if (U, 1.p), (V, 1/J) are lifts of the charts (U, cp), (V, '¢) for M,
then we have 1.p(U n V) = cp(U n V) x !Rm, which is open in JR2 m.
Let (U, 1.p), (V, 1/J) be lifts of admissible charts (U, cp), (V, '¢) for M. We
will show that the chart transition maps corresponding to (U, <p), (V, 1/J) are
smooth. Besides completing the verification that A is an atlas, this will also
show that the construction of the smooth structure on TM is independent
of the choice of the defining atlas A from the smooth structure [A] on M.
Let (o.,/3) E 1.p(U n V). With /3 = (/31,··· ,/3m), cp = (x1,··· ,xm) and
'¢ = (y 1 , • • • , ym), we have
(1/J O l.{)-l)(o.,/3) = 1/J(cp-la.,/3iOxi,cp-lo.)
= ( ('¢ O cp-l )( a.), /3i Oxi,cp-lo.y1, · · · , /3i Oxi,cp-lo.Ym).
The map cp(U n V) 3 a.>-+('¢ o cp- 1 )(o.) is C 00 . Also, the map
j o(yi ocp- 1 ) -l o('l/Jocp-l)j
a. f-+ Oxi cp-lo.Y = a . (cp(cp a.)) = a . (a.)
' u• u•
is a C 00 function (since the chart transition map'¢ o cp- 1 is C 00 ). It follows
that the map (a., /3) >-+ (('¢ o cp- 1)(a.), /3i Oxi,cp-l(o.)Y1, · · · , /3i Oxi,cp-l(o.)Ym)
is C 00 , as wanted.
Consequently, TM is a smooth manifold of dimension 2m, where m
is the dimension of the smooth manifold M (and also the vector space
dimension of each tangent space TpM).
Exercise 3.1. Let M be a smooth manifold. Show that the canonical projection
TM -+ M is a smooth map.
1r :
Now that we have a tangent bundle with the structure of a smooth manifold,
we can talk about its sections which are smooth, and these are called vector
fields.
42 A Mathematical Introduction to General Relativity
(1) V is smooth.
(2) For every f E C 00 (M), VJ E C 00 (M).
(3) For all charts (U, x) from an atlas defining the smooth manifold M, the
maps given by U 3 p >-+ Vpxi E JR, 1 ,:;;; i ,:;;; m, belong to C 00 (U).
Proof.
(1)=?(2): Let (U, cp) be an admissible chart for M. For o: E cp(U), we have
((VJ) ocp- 1 )(o:) = Vip-taf = Vip-1a(xi)Oxi,ip-1af.
But by virtue of the fact that f is smooth, the following map is smooth:
cp(U) Oxi ,ip-la! = 0u•.
3 0: >-+
au
o cp- 1 )
(a).
Let (U, c,o) be the lift of (U, cp). As V is smooth, it follows that the map
c,o o Vo cp- 1 : cp(U) >-+ JR 2 m is smooth, that is, the map
cp(U) 3 o: >-+ (o:, Vip-ia(x 1 ), · · • , Vip-ia(xm))
is smooth, and in particular, also the projection to any of the last m com-
ponents, o: >-+ Vip-1a(xi), is smooth. So (VJ) o cp- 1 is smooth on cp(U).
But cp : U -+ cp(U) is a diffeomorphism. Hence V flu is smooth for each
admissible chart (U,cp) for M, and by Exercise 1.16, VJ is smooth on M.
(2)=?(3): Let p E U. Let x E C 00 (M) be a bump function which is iden-
tically equal to 1 in a neighbourhood U0 c U of p, and is O outside as
slightly bigger neighbourhood U1 , with U1 c U. Then xxi E C 00 (M), and
so V(xxi) is smooth. In particular, the map given by
cp(Uo) 3 0: >-+ (V(xxi))(cp- 1 0:) = Vip-la(xxi) = Vip-lc,(xi)
1 The somewhat odd notation will become clear when we discuss tensor fields (Chap. 4).
Co-/Tangent bundles 43
We will henceforth identify V(p) = (p, Vp) with Vp itself, and so the vector
field V sends p E M to Vp E TpM.
Clearly, the result of pointwise adding a pair of vector fields is a new
vector field, and so is the result of scaling a vector field pointwise. This
makes the set TJ M of all smooth vector fields into a real vector space.
However, given a vector field V, rather than just producing a uniform
scaling everywhere by a factor c E JR, that is, M 3 p i--+ c Vp E TpM, we
can consider a less restrictive possibility of having the c depend on p in a
smooth manner. Thus, we want to consider not just multiplication by real
scalars, • : JR x TJ M -+ TJ M, but rather, replace JR by C 00 ( M), and have a
'scalar' multiplication with smooth functions,•: C 00 (M) x TJM-+ TJM.
The set C 00 (M) with its pointwise operations is not a field, but only a ring.
We will see that scalar multiplication by elements of C 00 ( M) makes TJ M a
C 00 (M)-module. First we give the definition of multiplication of arbitrary
vector fields by elements of the ring C 00 ( M).
Definition 3.4. (Multiplication of vector fields by smooth functions).
Let VE TJM and f E C 00 (M). Define f· VE TJM by (J-V)(p)=f(p)Vp,
for all p EM.
TJM is a module over the ring C 00 (M). Thus the usual the vector space
axioms with respect to vector addition hold, with vector addition on TJ M
defined pointwise: For V, WE TJ M, V +Wis given by (V + W)p = Vp + Wp,
for all p EM. Let 1 E C 00 (M) denote the constant function taking value 1
everywhere on M. For all f, g E C 00 (M), and all V, WE TJ M,
1-V=V
(f g) · V = f · (g · V)
(f + g) · V = f · V + g · V
f · (V + W) = f · V + f · W
Exercise 3.3. (Vector fields are derivations 2 ). Let M be a smooth manifold. Show
that V(fg) = f V g + g V f for all f, g E C 00 (M) and VE TJ M.
2 A derivation 8 on an algebra A over JR is a linear map 8 : A -+ A satisfying the Leibniz
rule 8(ab) = a8b + bfo for all a, b EA. An algebra A over JR is a real vector space with a
bilinear product • : A x A -+ A.
44 A Mathematical Introduction to General Relativity
Exercise 3.4. Let IR.2, R be given standard smooth structures. We use the global
charts (R 2 , (x, y) I-+ (x, y)) and (R, u I-+ u). Suppose that V =xyox+x2 oy E TJR 2 '
f =x2 y E C 00 (R 2 ) and p= (1, 1) E R 2 . Determine (JV)p, (V f)(p), dfp(Vp)-
• The map f : M -> N may not be surjective, and so not every point
q EN might be f(p) for some p EN. As the action of dfp only produces
vectors at Tf(p)N, we cannot define a vector field on N, since a vector
field on N has to be defined everywhere on N.
• The map f : M -> N may not be injective. This gives rise to the
problem that if /(m 1 ) = /(m 2 ), then we have a conflict: should the
evaluation of the vector field at this point be the push forward of Vm 1
or of Vm 2 ? They can be different, as shown in the following example.
function, for example, the cubic f(x) = (x - l)x(x + 1). Note that / is
surjective, but not injective (e.g. /(0) = /(1) = 0). Let V E TJ!R be the
vector field V = u Ou- Then note that Vo = 0 E To!R, and Vi = Ou,1 E T1R
But dfo(Vo) #dfi(V1) since if we take g= [u >-+ u] E C 00 (1R), then we have
(dfo(Vo))g=O, while
d((u - l)u(u + 1)) 2
(dfi (Vi))g = Vi (go J) = du (1) = (3u - l)lu=l = 2. ◊
Exercise 3.6. (Left-invariant vector field on a Lie group, Lie algebra of a Lie
group). Let G be a Lie group with identity element e E G. For p E G, recall
(from Exercise 1.20) the left translation map, G 3 q >-+ Lpq = p·q E G. A vector
field VE TJG is called left-invariant if (dLp)q Vq = Vpq for all p, q E G. Show that
V E TJG is left-invariant if and only if (LP)* V = V. The set of all left-invariant
vector fields is denoted by g. With pointwise operations, g forms a vector space,
called the Lie algebra3 of G. The aim of this exercise is to show that TeG :,, g
as vector spaces. For a vector v E TeG, define I(v) = V, where V : G-+ TG is
defined by V(p) = (p, (dLp)ev) for all p E G. It can be shown that the section V
is smooth, so that VE TJG. Show that Vis left-invariant. Prove that the linear
map I : TeG-+ g, sending v E TeG to Iv E TJG, is bijective.
Exercise 3. 7. Consider the smooth manifold ~.2 with the standard smooth struc-
ture, and the global chart (l~.2, (x, y) >-+ (x, y)). Fort ER, let "P-t : R 2 -+ R 2 be
the diffeomorphism given by (x, y) >-+ (xcost-ysin t, xsin t+ycos t). Let V = Oy.
Determine ("P-t)*V.
Now we show that the Lie bracket of chart-induced vector fields vanishes.
Lemma 3.3. Let (U, x) be an admissible chart for a smooth manifold, and
let Oxi E TJU, 1 ,:;;; i ,:;;; m, be the coordinate induced vector fields on U.
Then [ox;,Oxi]=O for all l,,;; i,j,,;; m.
Exercise 3.10. Let G be a Lie group, and g its Lie algebra. Show that if V, Weg,
then [V, W] E g.
= LJ ({p} x (TpM)*).
pEM
(a, /3) >-+ ( ( '¢ o cp- 1 ) (a), /3i (dxi)"'-•a. (8y1 ,<p-•a.), •• • , /3i (dxi)"'-•a. (oy"' ,<p-•a.))
is C 00 , as wanted.
Consequently, TM* is a smooth manifold of dimension 2m, where m
is the dimension of the smooth manifold (and also the dimension of each
cotangent space (TpM)*).
Exercise 3.13. Let M be a smooth manifold. Show that the canonical projection
TM* -+ M is a smooth map.
1r :
50 A Mathematical Introduction to General Relativity
In this section, we will introduce the 'dual' object to a vector field, namely
a 'I-form field', which is, roughly speaking, a smooth distribution of 1-
forms on the manifold. We will see that, just like the C 00 (M)-module TJ M
of vector fields, the collection Tf M of all I-form fields will also have the
structure of a C 00 (M)-module. Together, TJ Mand Tf M form the building
blocks for (r, s)-tensor fields (which will be 'C 00 (M)-multilinear maps' from
r copies of Tf M ands copies of TJ M to C 00 (M), as we will learn in the
next chapter). These tensor fields form the vocabulary to discuss geometric
notions in Lorentzian geometry, such as the metric g (a (0, 2)-tensor field),
the curvature tensor field R (a (1, 3)-tensor field), etc. The field equation
for spacetime is an equality between two (0, 2)-tensor fields.
Recall that in the previous section, using the smooth manifold structure
on M, we made TM* a smooth manifold by gluing together the cotangent
spaces smoothly. Hence we can talk about smooth 'TM*-sections', which
are called I-form fields.
(1) 0 is smooth.
(2) For every VE Tf M, OV E C 00 (M).
(3) For all charts (U, x) from an atlas defining the smooth manifold M, the
maps given by U 3 p >-+ Op(lJxi,p) E JR, 1,:;;; i,:;;; m, belong to C 00 (U).
0
Co-/Tangent bundles 51
Proof.
(1)=?(2): Let (U, cp) be an admissible chart for M. For a E cp(U), we have
((0 V) 0 cp- 1 )a = o,.,-la V,.,-la = o,.,-1a(V,.,-1a(xi)Oxi,cp-la)
= V,.,-1a(xi)0,.,-1aOxi,cp-la·
As V is smooth, a >-+ V,.,-1a (xi) is smooth on cp(U). Also, since O is
smooth, <po O o cp- 1 is smooth, that is,
0 f-+ ( a, o,.,-la (Oxl,cp-la), • • • , o,.,-1a(Ox"',cp-la))
is smooth. So the map a >-+ 0,.,-1aOxi,cp-1a is smooth for all i. Thus
a>-+ ((OV) o cp- 1 )a is smooth.
We identify O(p) = (p, Op) with Op E (TpM)*, and so the 1-form field 0
sends points p EM to corresponding 1-forms Op E (TpM)*.
Clearly, the result of pointwise adding a pair of 1-form fields is a new
1-form field, and so is the result of scaling a 1-form field pointwise. This
makes the set Tf M of all 1-form fields into a vector space. However, given a
1-form field 0, rather than just uniformly producing a scaling with a factor
c E JR, that is, M 3 p >-+ c Op E (TpM)*, we can consider a less restrictive
possibility of having the c depend on pin a smooth manner. We will see
that this makes Tf M into a C 00 (M)-module. First we give the definition
of multiplication of arbitrary 1-form fields by elements of the ring C 00 (M).
52 A Mathematical Introduction to General Relativity
□
Exercise 3.16. (A 1-form field which is not a gradient field).
Consider R 2 with the standard smooth structure. Show that not every 1-form field
0 on R 2 is df for some f E C 00 (R2 ) as follows. If O = df for some f E C 00 (R 2 ),
then in the chart (R 2 ,(x,y) >-+ (x,y)), writing n = (oxf)dx +(oyf)dy, we get
Oox = oxf and Ooy = oyf, implying oy(Oox) = ox(Ooy)- Construct a 1-form field
n, which is not a gradient field, by choosing the component functions n Ox, n Oy
appropriately.
Co-/Tangent bundles 53
The previous two exercises give a recipe for the computation of the pull-back
in any chart. Let p EM, and (U, x), (V, y) be admissible charts for M, N,
respectively, containing p, f(p), and such that f(U) c V. If n E TP N and
we define nj = n(oyi) E C 00 (V), then in u,
j*n = j*(nidyi) = (ni o J)-f*(dyi) = (ni o J)-d(yi o J)
= (ni o J)-((d(yi o J))(oxi))-dxi = (ni o J)-(ox;(yi o J))dxi.
Example 3.4. Let M = !R3 and N = JR2 be equipped with the standard
smooth structures. Consider the admissible charts:
(U = IR 3 , (x, y, z) >-+ (x, y, z)) for IR3, and
(V = IR 2 , (u,v) >-+ (u,v)) for JR 2 .
Let f :IR3 --+ IR 2 be the smooth map f(x, y, z) = (x +y, yz) for (x, y, z) E IR3 .
Let n = vdu + udv E Tf N. Then we have
j*n = (v o J)-(ox(u o J)dx + oy(u o J)dy + oz(u o !)dz)
+(u o J)-(ox(v o J)dx + oy(v o J)dy + oz(v o !)dz)
= (yz)·(ox(x + y)dx + oy(x + y)dy + oz(x + y)dz)
+(x + y)·(ox(yz)dx + oy(yz)dy + oz(yz)dz)
= (yz)-(dx + dy) + (x + y)(zdy + ydz)
= yzdx + (x + 2y)zdy + (x + y)ydz. ◊
:t O
1')i (t) = vi(1'(t)), 1,:;;; i,:;;; m, t E J. (3.1)
, t E J. (3.2)
Example 3.5. Consider the smooth manifold IR2 with the standard smooth
structure given by the global chart (IR2 , (x, y) i--+ (x, y)). Let V = xoy-YOx-
The differential equations describing an integral curve 1'(t) = (x(t), y(t)) of
V are given by
{ ±(t) = -y(t)
y(t) = x(t)
where· = ft-. If the curve starts at t = 0 from (xo, Yo), then the unique
solution is
Exercise 3.19. In contrast to the above example, where the integral curve existed
for all t E R, the curve may exist only on a finite interval. As an example, consider
V = u 2 Ou E TJ R on the smooth manifold R with the standard smooth structure
given by the global chart (R, u >-+ u). Show that if the integral curve starts at
t = 0 at p = -y(O) > 0, then -y(t) = p/(1 - pt) for all t < 1/p, which 'escapes to oo'
in the finite time 1/p < oo.
From the theory of differential equations, it follows that (since the vi are
all smooth) there exists a neighbourhood (-e:, e:) of O and a neighbourhood
e
N of x(p) in x(U) such that for each = (~ 1, · · · , ~m) E N, the initial value
problem for the system of ordinary differential equations (3.2), has a unique
smooth solution (-e:, e:) 3 t 1--+ (1' 1 (t; e), · · · , 1'm(t; e)). Also, the solution to
56 A Mathematical Introduction to General Relativity
(3.2) depends smoothly on the initial condition f We will not prove this
here, but refer the interested reader to [Hartman(2002), Thm. 4.1, p.100].
Thus (3.1) has a unique smooth solution 'Y(t; x(q)) for each initial condition
q in the neighbourhood x- 1 N of p, living on the time interval (-1:, 1:), and
the map x- 1 N 3 q >-+ 'Y(t; x(q)) is smooth.
Let us suppose from now on that for our vector field on the smooth
manifold M, the integral curves exist for all t E JR, and the pathological
cases as in Exercise 3.19 do not arise. We will call such vector fields as
complete vector fields.
Since for each p E M, 'Yp(O) = p, we have 'I/Jo = idM. Also the following
result makes composition a group operation on Fv.
Proof. For p EM, let "Ip be the integral curve of V such that 'Yp(O) = p.
Lets ER. Set q = "fp(s) = 1/Js(p).
______'Yq
q = 'Yp(s)
Consider the curve 'Y: R-+ M, given by 'Y(t) = 'Yp(s+t) for all t ER Then
'Y(O) = "fp(s) = q, and V7 (t) = V7 p(s+t) = v7 p,'°Yp(s+t)· But for f E C 00 (M),
_ d(J o "fp) ( t) _ 1. f('Yp(s + t + h)) - f('Yp(s + t))
v,,,,p, "",p (s+t) f - d t s + - h-+O
Im h
Hence V7 (t) = v7 p,'°Yp(s+t) = v7 ,7 (t), that is, 'Y is an integral curve of V such
that 'Y(O) = q. By the uniqueness of solutions, 'Y = 'Yq· So
for all p EM. Thus 1Pt+s = 1Pt o 1Ps· Also, 1Pt+s = 1Ps+t = 1Ps o 1Pt· □
Exercise 3.21. Consider the smooth manifold JR3 with the standard smooth struc-
ture given by the global chart (JR3 , (x, y, z) >-+ (x, y, z)). Let Lx, Ly, Lz E TJJR3 be
the vector fields given by Lx =yoz - ZOy, Ly =ZOx - XOz, Lz =XOy -yOx-
Compute [La, Lb] for a, b = x, y, z. Find the flow maps for Lx, Ly, Lz.
Finally we give the geometric meaning of the Lie bracket. More generally,
we define a Lie derivative below, which measures the rate of change of the
quantity at hand (function, vector field, and later on even tensor fields)
along the flow of a vector field.
58 A Mathematical Introduction to General Relativity
Proof. Let 7q(t) = 1/Jt(q) fort ER and q EM. Let p EM. Define H by
H(t, s) = W,t,.p(/ o 1/J-s)- Let f E C 00 (M). Then
Thus
[V, W]pf = ([V, W]J)(p) = (V(WJ))(p) - (W(VJ))(p)
= oH(O 0) oH(O 0)= d(t>-+H(t,t))(O)
at ' + as ' dt
= :;W,t,.p(/ o 1P-t)) (O).
Notes
The proof of Proposition 3.2 is based on [Godinho and Natario(2014), see Exer-
cise 12(a), p.33, and its solution, pp.332-333].
Tensor fields
In this chapter, using the building blocks TJ- M and Tf M, namely the
C 00 (M)-modules of all vector fields and of all 1-form fields, we will in-
troduce (r, s)-tensor fields, which will be C 00 (M)-multilinear maps defined
on the Cartesian products of these modules to C 00 (M). As we had men-
tioned earlier, tensor fields are fundamental in Lorentzian geometry and
in spacetime physics. The Lorentzian geometry will be specified by the
metric g on a smooth manifold, where the metric will be a certain type
of a (0, 2)-tensor field. This metric will induce a curvature tensor field R
(via something called the Levi-Civita connection), and R will be a (1, 3)-
tensor field. Vector field and 1-form fields are themselves simple examples
of tensor fields: vector fields are (1, 0)-tensor fields, and 1-form fields are
(0, 1)-tensor fields.
We are familiar with multilinear maps in the context of vector spaces (for
example, the inner product on Rm is a bilinear map, and the determinant
on Rm is an m-linear form with some extra properties). An (r, s)-tensor
field is just a special type of a multilinear map, except that the field R of
scalars is replaced by the ring C 00 ( M) consisting of all smooth functions on
a smooth manifold M.
Definition 4.1. ((r, s)-tensor field).
Let M be a smooth manifold. Let r, s ~ 0 be integers. An (r, s)-tensor
field is a map
T: TfMx ·· · xTfM x TJ-Mx · · · xTJ-M-+ C 00 (M)
r copies s copies
61
62 A Mathematical Introduction to General Relativity
More explicitly,
for all 1 ~ i ~ r, for all D, 8 E Tf M , f E C 00 (M) , and
for all n 1, · · · , ni-l, [1i+l, · · · , n r Erp M , Vi,···
, Vs E TJ M,
T(D1,· ·· f8 , Di+1, . . . , nr , Vi , ··· ,Vs)
,ni- 1 , D+ (4.1)
T(D ,··· ,ni- 1 , n , ni+ 1 , ... ,nr,Vi,··· ,Vs)
1
Exercise 4.4. (Bilinearity of®). With the same notation as in the previous ex-
ercise, show that for any C 00 (M) function f, f(O ® 8) = (f 0) ® 8 = 0 ® (!8).
Also, prove that for 01, 02, 8 E Tf M, (01 + 02) (8) 8 = 01 (8) 8 + 02 (8) 8 and
8® (01 +02) =8®01 +8®02.
and
for all 1 !(j !( s , for all v , w E V, c E JR, and
forallw 1,·· · ,wrEV*, w1,··· ,Vj-1,Vj+l,· · · ,Vs EV, we have
T(w 1 ,· ·· ,wr,v1, · · · ,Vj-1, v + cw , v 1+1,··· ,vs ) (4.4)
T(w1, ·· · ,wr,v1,··· , Vj - 1, v , v 1+1,·· · ,vs)
+ cT(w 1, · ·· ,wr,v1, · ·· , Vj - 1, w , v 1+1 ,· ·· , vs)-
We set TJ V = (V* )*, Tf V = V* and rgv = JR. If V = TpM, then we denote
the set of all (r, s)-tensors by r;M(p).
Since (V*)* ~ V, we identify v EV as an element of TJV with the action
v(w) = w( v) for all w E V*. With pointwise addition and scalar multiplica-
tion, r;v is a real vector space.
Exercise 4.5. Let V = R 3 . Define T : V* x V x V----> R by T(w , v, w) = w(v x w),
for w E (R 3 )* , v , w E R 3 , and where v x w denotes the cross product in R 3 of
v, w. Show that Tis a (1, 2)-tensor on R 3 .
Exercise 4.6. (Tensor product of tensors). Let V be a real vector space. Let T be
an (r, s)-tensor on V and r' be an (r', s')-tensor on V. Define their tensor product
T © 'T 1 : (V* r+r' X v s+s'-: R by
(T@T')(w\··· , wr+r,V1, ··· , Vs+s')
= T(w\· ·· ,w: v1,· ·· , Vs )T1 (wr+1, ... ,wr+r:Vs+1,··· ,Vs+s')-
Then T © r' is an (r + r', s + s')-tensor on V. Prove that the tensor product©
is associative.
1 The stress in continuum mechanics is an example, and the word 'tensor' comes from
the La tin word 'tendere ', meaning 'stretch'.
Tensor fields 65
We will now show that every (r, s)-tensor field T can be viewed as a map
= LJ ({p}xT;M(p)),
pEM
where T(p) Er; M(p) is an (r, s)-tensor on TpM. Before giving the defini-
tion of the (r, s )-tensor T (p) arising from a tensor field T, we will prove the
following result.
T(pJ
---~------ M
p
Just like a vector field and a 1-form field have chart-induced components
that represent the field at hand, we can represent a tensor field using its
chart-induced components.
Definition 4.4. (Chart-induced tensor field components at a point).
Let (U, x) be an admissible chart for an m-dimensional smooth manifold
M, p E M, and T E r;
M. We define the chart-induced tensor components
of Tat p by
T(p)t:t = T(p)((dxi 1 )p, · · ·, (dxir)p, oxii,p, · · · , Oxis,p),
where 1 ~ i1, · · · ,ir,J1, · · · ,Js ~ m, where mis the dimension of M.
Exercise 4.8. Let (U,x) be an admissible chart for a smooth manifold M, and
TE r; M. The mr+s chart-induced tensor field components TJ;.".".t are the maps
U 3 p >-+ T(p)t::t Show that TJ;.".".t E C 00 (U).
Exercise 4.9. Let T E Tf M be the (1, 1)-tensor field on a smooth manifold M,
given by T(n, V) = nv for all n E Tf Mand all VE TJM. Suppose that (U,x)
is an admissible chart for M. Determine the chart-induced components of T.
Exercise 4.10. (Transformation rule for components).
Let M be a smooth manifold and T E r; M. Prove that if (U, x) and (U', x') are
two overlapping admissible charts, then the chart-induced tensor field components
T 1i1
ii ···ir
···is, Ti1 ···ir
ii ···is . h
wit respect to (U' , x ') , (U, x ) , respective
. 1y, trans £orm as £o11ows.
For pEU n U',
Cl li1 Cl lir c, i~ c, i~ 1 1
T'i1 •··ir ( UX ( ( ))
) UX ( ( )) UX ( '( )) UX ( '( ))Til ···ir ( )
ii ···is = OXi~ X p ... OXi~ X p ox'il X p ... ox'is X p i~ ···i~ p
p
0Xi / O(XiOXl-l) / 0X 1i O(x'iox-l)
where -;;--,. (x (p)) := 0 . (x (p)) and ~(x(p)) := 0 . (x(p)).
ux' u' ux' u'
Exercise 4.11. (Physicist's definition of a tensor field).
Let M be a smooth manifold described by an atlas A. Suppose that for each chart
(U,x) in A, there exist functions TJ;.".".t;u E C 00 (x(U)), such that the following
transformation rule holds for any two overlapping charts (U, x) and (U', x'):
···ir;U' (x'( ))
Ti1
•_-·Js
Jl p . ., .,
0X/t 1 0Xlir OX11 I OX1s I i~ •··i~;U
= - . , (x(p)) • • • -;-:,(x(p)),;--;-:-(x (p)) • • • ,;--;-:-(x (p))T., ...., (x(p)).
ox'1 ux'r ux J1 ux Js Ji 1s
An admissible chart has the nice feature that there are available chart-
induced 1-form fields dxi and vector fields Oxi. This suggests that we
could use the tensor field component functions of T to analyse the 're-
striction TI u' to the chart U by defining, for example in the r = s = 1 case,
Tlu(nidxi, ViOxi) = TJn;Vi, where n = nidxi E TfU, V = ViOxi E TJM.
But if U is the intersection of two admissible charts, then on U there are
two available coordinate chart maps, say x and x'. Let us show that the
above is well-defined, that is, it does not depend on the choice of the chart
map: We have
Proof. For ease of exposition, we show this just for the case of a (1, 1)-
tensor field. Let x be a bump function around p, that is, a C 00 (M) element
that is identically 1 in a neighbourhood of p, and identically zero outside
Uo, with Uo c U. Then we have
T(p)(w,v) = (T(xwidxi,xvi Dxi))(p) = wivi(T(xdxi,XOxi))(p)
= WiviT(p)((dxi)p, Oxi,p) = WiviT(p);.
Also, if we define O=widxi E rpu and V =vi Oxi E TJU, then we have
Tlu(p)(w,v) = (Tlu(O, V))(p) = (Tlu(widxi,vi Dxi))(p)
= Wivi(Tlu(dx\oxi))(p) = WiviTf(p) = WiviT(p);.
This completes the proof. □
In order to get the representation of the tensor field using its chart-induced
components, we will produce a (C 00 (U)-module) spanning set for r;u using
the building blocks {dx 1 , • • • , dxm} and {Oxi, • • • , Ox=}. To this end, we first
introduce the notion of the tensor product of two tensor fields.
Let 01, ···,or E TfU and V1, · · ·, V,, E TJU. Then Oi=Oidxk (1,:;;;i ,,;;r)
and ½ = vfOxe E TJU (1 ,:;;; j ,,;; s), where ot
= Oi Oxk = Oxkoi and
v~/= Vj(x') = (dx1)Vj. Thus
Tlu(0 1 , ... , or, Vi, ... , Vs)
= T!l """!r 0~i1 ... 0'.'Lr v,li1 ... vis
Jl •·•Js 8
The tensor product produces new tensor fields from existing ones. We end
this section by learning about yet another operation, called 'contraction'.
We begin with the following crucial result, which gives an intrinsic function
associated with a (1, 1)-tensor field, akin to the trace of a linear transfor-
mation T : V ---+ V on a finite-dimensional vector space V.
Now we are ready to define the operation of contraction. Given an (r, s)-
tensor field T with r and s both at least 1, we note that if we fill in all
the I-form field slots except for one, and all the vector field slots except
for one, then we are left with a (1, 1)-tensor field. We can then apply the
contraction map C to this. The result is a C 00 function. But clearly this
depends C 00 (M)-multilinearly on the initially chosen I-form fields and the
vector fields. Thus we have a (r -1, s - 1)-tensor field. We give the precise
formulation below.
Definition 4. 7. (Contraction).
Let M be a smooth manifold. Let 1 ~ i ~ r, 1 ~j ~ s. Let TE TJ M. Define
the contraction C}T E r;~l
M with respect to the i and j indices by
Exercise 4.16. Consider M = Rm with the standard smooth structure given by the
global chart (Rm, id). Let T= [Tj] be an mxm matrix. Then matrix multiplication
of column vectors in Rm by T induces a (smooth) map T : Rm -+ Rm. Let the
tensor field TE T{M be defined by (T(O, V))(p) = Or(p)((dT)pVp) for p EM.
Show that CT= trace(T).
field f*D. E Tf M. Now we will see that this operation of pulling back can
be extended to (0, s)-tensor fields on N.
So far, we have introduced notions using only the smooth structure on the
manifold. But the smooth structure is still rather flexible. Within this
picture, two spheres in JR 3 of different radii would be considered the same,
and even the surface of a potato would be the same smooth manifold as a
sphere. In this sense the smooth manifold has not yet acquired a 'shape'.
We now introduce the notion of a metric, which is roughly speaking, a
(0, 2)-tensor field g on a manifold M, whose evaluation at any point p EM
equips the tangent space TpM with a scalar product g(p) (analogous to
the dot product of vectors in the plane). This structure allows us to assign
lengths to tangent vectors and angles between them. It will also enable us
to assign lengths to curves.
In the mathematical subject of Riemannian geometry, one studies
smooth manifolds equipped with a metric, which in each tangent space
TpM gives an 'inner product' g(p), namely a 'positive definite' scalar prod-
uct. On the other hand, in spacetime physics, the metric g on spacetime
is such that the scalar product g(p) on TpM at each p EM is 'indefinite',
and in fact 'Lorentzian', which means that the scalar product in each tan-
gent space TpM has the 'index 1' (roughly speaking, there exists a vector
v E TpM so that the scalar product g(p)(v,v) < 0, and the restriction of
g(p) to the 'orthogonal complement' v1_ c TpM of v is positive definite).
We will see what this means precisely, in the course of this chapter. This
structure allows us to talk about observers, the observer's instantaneous
perception of space, and of time. We can then introduce the notion of
geodesics, which are, loosely speaking, the straightest curves. We will also
see in later chapters that the matter content in the spacetime essentially
'curves' spacetime, and determines a tensor field derived from metric g.
Test matter moves along geodesics in this curved spacetime.
75
76 A Mathematical Introduction to General Relativity
Exercise 5.1. Let V be a finite-dimensional vector space with the scalar product
g. Let V* be the dual space of V . Show that the map V 3 v >-+ v" := g(v, •)EV*
is a linear isomorphism.
If Ng= {O}, where O denotes the subspace {O}, and OE Vis the zero vector,
then ig=O.
Proof. Set U' = {w E V* : w( u) = 0 for all u E U}. Let us first show that
U', which is a subspace of V* has dimension m - k. Let {e 1 , · · · , ek} be
a basis for U, and complete it to a basis {e 1 ,··· ,ek,ek+1,··· ,em} for V.
Let {E1 , • • • , Em} be the corresponding dual basis for V*, that is, Ei ( ej) = 8}
for all i, j. Clearly, Ek+l, • • • , Em E U'. Now let w E U', and decompose
78 A Mathematical Introduction to General Relativity
Thus the map U 1- 3 v >-+ vi> E U' is an isomorphism, and so it follows that
dimU1_= dimU'= m - k. □
Exercise 5.2. Let V be a finite-dimensional vector space with the scalar product
g. Let Ube a subspace of V. Show that U_j__j_ = U.
Theorem 5.2.
Let V be an m-dimensional vector space, and g a scalar product on V.
Let U be a k-dimensional subspace of V. Then the following are equivalent:
(1) V=U+U1_ andUnU1-={0}. (WewriteV=UEBU1-.)
(2) g : U x U ---+ JR is nondegenerate, i.e., the restriction gluxu is nonde-
generate.
Proof.
(1) ~ (2): If u E U is such that for all u' E U, g(u, u') = 0, then we have
(U3)uEU1-. SouEUnU1-={0}.
(2)~(1): Ifu EU n u1-, then for all u' EU, g(u,u')=0. As g: U x U---+ JR
is nondegenerate, u=0. So Un u1- = {0}. We have dimU1_ =m-k. Let
B= {e1, · · · , ek} be a basis for U, and B' ={Ji,··· , fm-k} be basis for u1-.
As Un u1- = {0}, it follows that B=B u B' is independent, and since there
are m elements in B, span B = V. So U + U 1- = V. □
we have that
span{e1, · · · , et, et+d = span{u1, · · · , Ut, et+1} = span{u1, · · · , Ut, Ut+d-
This completes the proof. □
80 A Mathematical Introduction to General Relativity
_ ·) _ {- oii if i ~ i*,
g (e,, e1 - ;: "f . .
Uij 1 Z > Z*.
Proof. Let the subspace U E Ng be such that dimU = ig. Define the map
L: U-+ span{e1, • •· ,ei.} as follows: For u=uiei, Lu:=u 1e1 + • • • + ui*ei.·
Then L is easily seen to be linear. We now show it is injective. If Lu= 0,
that is, ui = 0 for i = 1, • • •, i*, then we have
m
Besides the Minkowski plane considered earlier in Example 5.2, the follow-
ing higher-dimensional generalisation is an example of a vector space with
a Minkowski scalar product.
timelike
_ ____;_:;;..__----!_ spacelike
timelike
Lorentzian manifold, this will imply that if we have two future-pointing timelike vectors
at a point, then their scalar product will be negative.
84 A Mathematical Introduction to General Relativity
Exercise 5.6. Let us denote by [77] E Rmxm the diagonal matrix with the entries
-1, 1, • • • , 1 along the diagonal. The entry in the ith row and jth column of [77]
(respectively A) is denoted by 11ii (respectively Aj ), the indices being counted
from 0, i.e., 0 ~ i,j ~ m-1. Show that the following are equivalent:
(1) A E 0(1, m-1).
(2) N[77]A = [77].
(3) 77ij = Af A}T/kl•
Exercise 5.7. Show the inclusion 0(1,m-l)cGL(m,R), that is, every Lorentz
transformation is invertible.
Exercise 5.8. Show that 0(1, m-1) is a subgroup of the group GL(m, R).
Exercise 5.9. Show that in R 4 , the following are Lorentz transformations. These
are examples of a 'boost', a spatial rotation, 'time-reversal', and a spatial reflec-
tion. Here ef>, 0 E R.
(l) Bis an orthonormal basis for V, g(/0 , Jo)= -1, g(f;, Ji)= 1 (1 ~i~m-1).
(2) A E 0(1, m-1).
Proof. Let [T/ij] be the diagonal matrix with the diagonal entries
-1, 1, • • • , 1. Then
g(fi, Ji)= g(A~ek, A;ee) =A~ A;g(ek, ee) =A~ A]T/kl· (5.2)
(1)=?(2): T/ij ~ g(fi, Ji) <5~l Af A]TJke, and by Exercise 5.6, AE 0(1, m -1).
(2)=?(1): g(fi, Ji)(~) Af A]TJke = %· The last equality is due to (2) and
Exercise 5.6. □
Example 5.5. In Example 5.3, we had seen that {eo, e1, · · · , em-1} forms
an orthonormal basis with respect to the Minkowski scalar product TJ on
!Rm. For any matrix A E 0(1,m -1), define the vectors /j = AJei for all
j = 0, 1, · · ·, m - 1. Then {/o, Ji,··· , fm-d is an orthonormal basis. We
note that Jo, Ji,··· , fm-1 are them columns of the matrix A. ◊
Lorentzian manifolds 85
Exercise 5.11. Suppose that (M, g) and (N, h) are semi-Riemannian manifolds.
We call a map f : M -+ N an isometry if f is a diffeomorphism, and f*h = g.
Show that Iso( M) = {J : M -+ MI f is an isometry} is a subgroup of (Diff( M), o)
(the group of diffeomorphisms; see Exercise 1.19, p.16).
Exercise 5.12. Consider R 3 as a smooth manifold with the standard smooth struc-
ture. Equip R 3 with the metric g, given in the chart (R3 , (x,y,z) >-+ (x,y,z)) by
g = dx ® dx + dy ® dy + dz ® dz. Determine g in the 'spherical coordinates' chart
(U, x) described in Exercise 2.15 (p.36). For t E R, let 'lf;t : R 3 -+ R 3 be the map
'lf;t(x, y, z) = (x, (cos t)y - (sin t)z, (sin t)y + (cos t)z) for (x, y, z) E R 3 . Show that
'lf;t is an isometry.
The chart map (x 0 , x1, x 2 , x 3 ) is such that for any q EM, q = p + xi(q)fi
(summation over i = 0, 1, 2, 3). Define the metric g by specifying its chart-
induced components:
gij = g(Oxi, Oxi) = T/ij,
where T/oo = -1, T/11 = T/22 = T/33 = 1, and whenever i =I= j, T/ij = 0. Hence
g(q): TqM x TqM-+ JR is given by
· v} 0x1,q
g(q)(ViO x-i,q, •1iJ·ViWj = -VOWO + v w + v w + v w ·
· ) = 'l'I· 1 1 2 2 3 3
For a surface M in JR3 , any tangent vector v E TpM can act on the three
smooth functions x,y,z: M-+ JR, and produce a vector (vx,vy,vz) in JR3 .
So if JR3 is equipped with a scalar product g (for example the Euclidean
inner product), then for p E M, define g(p) : TpM x TpM -+ JR by
g(p)(v, w) = g((vx, vy, vz), (wx, wy, wz)) for v, w E TpM.
Then g(p) : TpM x TpM -+ JR is bilinear, and symmetric. In particular,
g(p) is a (0, 2)-tensor on TpM. Using Proposition 4.1, it can be shown that
g E TJ M. Suppose that g(p) is nondegenerate on TpM for each p E M,
and that the index of g(p) is constant. Then g is a metric on M, called
the induced metric on M from (JR3 ,g), and (M,g) is a semi-Riemannian
manifold. We note that if we use the Euclidean inner product on JR 3 , then
g(p) will be positive definite on TpM for each p E M, and so (M, g) is a
Riemannian manifold in this case. We will now consider an example.
Lorentzian manifolds 87
Example 5. 7. (Sphere).
Consider the sphere as a smooth manifold with the smooth structure given
by the atlas in Example 1.5 (p.5). We use the compatible atlas of spherical
polar coordinates described in Example 1.8 (p. 7). The image of the injective
map (0, 7r) x (0, 27r) 3 (0, <p) >-+ ((sin 0)( cos¢), (sin 0)(sin <p ), cos 0) E 8 2 c JR 3
is called U, and the inverse of this map is denoted by cp. Then (U, cp) is an
admissible chart. We equip JR 3 with the Euclidean inner product. Then the
chart-induced component g(p)00 of g at p E 8 2 , where cp(p) = (0 0, ¢ 0 ) is
g(p)00 =g(p)(o0,p, O0,p) = (O0,px) 2 + (O0,py) 2 + (O0,pz) 2
= ( o((sin 0) cos <I>) (0 ,I.. )~ 2 (o((sin 0) sin <I>) (0 ,I.. )~ 2 ( ocos 0 (0 ,I.. )~ 2
a0 o, '1'0 ') + a0 o, '1'0 ') + a0 o, '1'0 ')
= (cos 0o)2(cos ¢o) 2 + (cos 0o)2(sin¢o) 2 + (- sin0o) 2 = 1.
Similarly,
g(p)0ct, = g(p)ct,0 = g(p)(o0,p, oct,,p)
= (O0,px)(oct,,pX) + (O0,py)(oct,,pY) + (O0,pz)(oct,,pZ)
= (cos 0o)( cos </Jo)(sin 00)(- sin </Jo)
+(cos 0o)(sin </Jo)(sin 0o)( cos </Jo) + (- sin 0o)0
= 0,
g(p)ct,ct, =g(p)(oct,,p, oct,,p) = (oct,,px) 2 + (oct,,pY) 2 + (oct,,pz) 2
= (sin0o)2(-sin¢o) 2 + (sin0o)2(cos¢o) 2 + 02 = (sin0o) 2 •
Summarising, we arrange the components in a matrix, describing the metric
gin the chart (U, cp): If p E 8 2 is such that cp(p) = (0 0, ¢ 0 ), then
[ g(p)00 g(p)0ct,] [1 0 ]
g(p)ct,0 g(p)ct,ct, = 0 (sin0o) 2 ·
Exercise 5.13. Consider the surface Min l~.3, where M = {(x, y, z) E R 3 : z =xy},
as a smooth manifold with the smooth structure [A], where A consists of the
single chart M with the chart map M 3 p = (u, v, uv) >-+ ( u, v) E R 2 . Let R 3 be
equipped with the Euclidean inner product (·, •), and M be equipped with the
induced metric g from (R3 ,(·, •)). Determine the component matrix for g using
the chart-induced basis.
[ g(p)00 g(p)0t] [1 OJ
g(p)t0 g(p)tt = 0 -1 '
where p E U c M is such that cp(p) = (0 0, t 0 ). We see that this is an
invertible matrix. Hence g(p) is nondegenerate for all p E U. Also the
index is 1. In particular, (U, g) is a Lorentzian manifold. Taking V to be
the image of (-1r, 1r) x JR 3 (0, t) i--+ (cos 0, sin 0, t) E 8 1 x JR = M c JR3 , we
obtain another admissible chart (V, '¢), which together with U, covers M.
In a manner similar to the above, we can compute component matrix for
Lorentzian manifolds 89
the induced metric on V from (R3 , r,), and in fact it turns out to be identical
to the one above for the chart (U, cp). From these component matrices, we
see that the metric has index i(g) = 1 everywhere on C, and so (M, g) is a
Lorentzian manifold. ◊
[
g(p)tt g(p)tx g(p)ty g(p)tz
g(p)xt g(p)xx g(p)xy g(p)xz _
g(p)yt g(p)yx g(p)yy g(p)yz -
l [-1
be the metric on M which has the metric component matrix for g(p) with
0
0
(a(~)) 2
0
~ ~
(a(t)) 2 0
l '
g(p)zt g(p)zx g(p)zy g(p)zz 0 0 0 (a(t)) 2
where p= (t, x, y, z) EM. Then (M, g) is a Lorentzian manifold. ◊
l·
in this order, is given by:
~
-(1- 2/ : ) 0
[ Q (1-!;)-l
0 0 r~
0 0 0 r~ (sin 00 ) 2
We note that this has determinant -r6(sin0o) 2 =I= 0, and so the scalar
product is nondegenerate, and moreover, the index is 1, since for r 0 > 2m,
the first diagonal entry is negative, while the others are positive. We obtain
the same expression for the metric component matrix for g(p) in the chart
JR x (2m, oo) x V, where V is as in Example 1.8 (p. 7). ◊
In each tangent space TpM of a Lorentzian manifold (M, g), where p EM,
the metric g(p) at p gives a scalar product with index 1, resulting in a 'causal
structure', partitioning nonzero vectors of TpM into spacelike, lightlike and
timelike vectors.
Lorentzian manifolds 91
The following result shows that the proper time experienced is independent
of the parametrisation of the curve.
Proposition 5.4. Let 'Y: [a, b] -> M be a smooth curve such that v-y,-y(t) is
timelike for all t E [a, b]. Let cp : [c, d] -> [ a, b] be a smooth map such that
cp(c) = a, cp(d) = b, and cp'(t) >0 for all t E [c, d]. Let 7 : [c, d] -> M be given
by 7 ='Yo cp. Then r(7(c), 7(d)) = r('Y(a), "f(b)).
Thus v7 ,:=y(t) = cp' (t) v-y,-y(<p(t)) · So v7 ,7(t) is timelike for all t E [c, d], thanks
to the facts that v-y,-y(t) is timelike for all t E [a, b], cp'(t) > 0 for all t E [a, b],
and the relation
i d
-g('Y( cp( t))) (cp' (t) V-y,-y(<p(t))' cp' (t) V-y,-y(<p(t))) dt
i d
-g('Y(cp(t)))(v-y,-y(<p(t)), v-y,-y(<p(t))) cp'(t) dt (as cp'(t) >0)
s=<p(t)
=
Jb v-g('Y(s))(v-y,-y(s),V-y,-y(s))
a
.j ds = T-y(p,q).
s!:. 'Y
L,
u
p
p
d
Thus if 'YA, 'Y8 are imagined to be the worldlines of two twins, then twin
A ages faster. We will see later on that "/A is a 'geodesic', implying that
the motion of twin A is 'unaccelerated', and in this sense twin A can be
thought of as 'resting', while the other twin undergoes 'accelerated motion'.
The inequality in their experienced proper times can then be summarised
by saying: Resting is rusting!
Intuitively, the twin paradox inequality follows from a repeated appli-
cation of the reversed triangle inequality by a 'discretisation' process. A
picture in the Minkowski plane with a coarse discretisation is shown below.
q=p+v
Proof. We will use a basis induced by a global chart. From the proof of
Theorem 5.4, we can construct a basis B = {e0 ,e 1 ,e2 ,e 3 } for V, orthonor-
mal with respect to the Minkowski scalar product g on V, such that
V
eo = --;:=::;:====;::.
,J-g(v, v)
We use the chart (M,<p) induced by 6 p and B. For all t E [0,1], we have
'YB (t) = p + 'Y~ (t)ei, where 'Y~ := (<po "f8)i are real-valued smooth functions
oftE[0,1]. Thenvw,w(t)='Y~Oxi,w(t),where ·=~t· ForapointxEM,
recall 7 the isomorphism I : V --> TxM, which sends w ~ v,,w,x, where
6 See Example 1.3 (p.3) .
7 See Example 2.2 (p.28).
96 A Mathematical Introduction to General Relativity
Tw(p,p+ v) = f .J-g(1'B(t))(vw,w(t),Vw,w(t))dt
~f V(i'g(t)) 2 dt
Proof. Using the chart (V, '¢) from Example 5.8, it can be seen that the
proper time experienced by observer "/A is given by
For the observer "/B, we will use the chart (U, r.p), which covers "/Bl(o, 2,,.),
and only the endpoints p, q do not belong to U. We have
f27r
Tw(p,q) = Jo
f27r
= Jo V-(1-4)dt=v'3·(211").
We reiterate that in Minkowski space, the symmetry among the two twins
was broken because one of them was a geodesic while the other one wasn't.
So the non-resting twin 'feels' accelerated. In the case of the cylindrical
spacetime, the symmetry is broken by the fact that one of them winds
around the cylinder, while the other doesn't. However, neither twin 'feels'
accelerated! It is the global topology which distinguishes the two geodesics
and the resulting difference in the experienced proper times.
The Schwarzschild spacetime models the spacetime outside a spherically
symmetric body of mass m, in an otherwise empty universe. It turns out
that realistically, this is a good approximation near the body (where the
effect of the body is strongly felt, and the rest of the universe doesn't
matter). In particular, it can be used for modelling the spacetime geometry
near the Earth. We end this section with the discussion of a result in
Schwarzschild spacetime, which says roughly speaking that objects near
the surface of the Earth age slower, and that the light emanating from the
surface of the Earth, observed high above, is red-shifted. But first we give
the following definition.
98 A Mathematical Introduction to General Relativity
Definition 5.10.
(Energy/frequency of a light signal measured by an observer at an event).
Let (M, g) be a Lorentzian manifold with a time-orientation. A light signal
is a smooth curve8 >. : J -> M, where I c JR is an interval, such that
for each t E I, v,\,,\(t) is lightlike. Let 'Y be an observer, and suppose that
p E M belongs to the intersection of the images of 'Y and >.. Then the
energy/frequency of>. measured by 'Y at pis defined to be
g(p)(v")',p, V,\,p)
v-g(p)(v")',p, V"Y,P)
0 = g(>.(r))(v,\,,\(T), V,\,,\(T))
= g(.X(r) )(t' (r) Ot,,\(T) + r' (r) Or,,\(T), t' (r) Ot,,\(T) + r' (r) Or,,\(T))
= -(1- r2(m)) (t'(r))2+ l _\m (r'(r))2.
T r(t)
As the light signal travels radially outwards, we suppose that r' (T) > 0 and
t'(r) > 0. Let T := r 1 : [t1, t 2] -> [a, b] be the inverse oft: [a, b] -> [t1, t 2].
With h:=ror: [t1,t2]-> (2m,oo), we get the differential equation
t A
>- P2
A
>-
A
2m r1 r2 r
'
')'1 ')'2
Suppose another radial light signal 3: is sent by the first observer at the event
A = (t1 , r1 , p) , with t1 > t1 , towards the second observer, reaching it at the
event A= =
(t2 , r2, p). Then it follows from the above that t2 - t1 t2 - ti,
and so also t2 - t2 = t1 - t1. We have that the proper time experienced by
the first observer between the events A and Pi
is
~ = ~l--(t2
T,2 (P2, P2) m ~ - t2).
r2
Thanks to the equality t 2 - t 2 = t 1 - ti=: f::lt, and the inequality r 2 > r 1 we
obtain that
~ =~
T,2 (P2 , P2) - - f::lt > ~
1m 1m ~
- - f::lt = T,1 (P1 , A).
r2 r1
If A , A were the subsequent ticks of a clock carried by the first observer,
then the information about these ticks is 'seen' and recorded by the second
observer along the light signals >-, 3: at the events P 2 , A. By the above
inequality, the second observer reckons that the first observer's clock runs
slow (as compared to the clock carried by him/ her). We say that 'a clock
in a stronger gravitational field is seen as running slow by an observer in a
weaker gravitational field '. So near the surface of the Earth, we age slower
as compared to someone higher up.
100 A Mathematical Introduction to General Relativity
Red-shift: The frequency of the outgoing light signal >. measured by the
first observer at A is
g(A)(v71,Pi, V>.,Pi)
The frequency of the incoming light signal >. measured by the second ob-
server at P2 is
g(P2)(V72,P2, V>.,P2)
V2 = ---;=:::;:=::::;::::;======;::
vf-g(P2)( V72,P2, V72,P2)
= _ g(P2)(t'(b) Ot,P2 + r'(b) Or,P2, Ot,P2) = t'(b) ✓l _ 2m.
vf-g(P2)(8t,P2, Ot,P2 ) r2
It will be shown later (see (15.6) in the proof of Proposition 15.1, p.332)
that since >. is a lightlike geodesic, there exists a constant E such that for
all TE [a, b],
So
t'(bh/1 - ~
t'(a)-Jl - 2r7
As r2 > r1, we have v2 < v1, that is, the light observed by the second
observer is 'redder' than that observed by the first observer. The red-shift
z is defined by
Here •' := ~r. Using these, prove that there exists a constant E > 0 such that
f:::..7>f:::..tF3f!-.
(The particle experiences more proper time than the stationary observer at r = r 0 .)
Hint: Compare the derivatives of !:::..T and t::..t,Jl- 2rom with respect to r 0 using
9 Follows from the r-component of the geodesic equation, using r' =0, 0 =O; see the
1
Assuming the uncertainly relation EL:::..t = %, find E, and also show that the
energy Eobs erved of the photon >-+ measured by the observer is given by
Eobs erved = na.
(This can be related to an effective temperature of the horizon perceived by the
observer,
T = Eobserved !ia
2Jrks
where ks is the Boltzmann constant.)
11 Roughly speaking, a horizon is the boundary of a region R of spacetime such that the
events from within R cannot send a light signal to observers (that is, paths whose range
lies) outside R.
Lorentzian manifolds 103
by Vlu = giiOiOxi, where we use the notation gii from Exercise 5.16 for
the chart-induced inverse metric components. As the charts cover M, this
procedure allows us to define a vector in TpM at each p E M. To check well-
definition, we note that if (U', x') were an overlapping chart, then using the
transformation properties of the I-form components, of the basis elements
of TpM, and of the inverse metric components (Exercise 5.17), we obtain
104 A Mathematical Introduction to General Relativity
in Un U' that
HiUx'1
OX 1i kf. OX 1j r, OXr OX 8 :l
.
In the above we also used the result from Exercise 2.16. The smoothness
of V follows from Theorem 3.1, since we know that the chart-induced com-
ponents are smooth. To complete the proof of the surjectivity, we will now
show that for this V, we have that Vb = n. Indeed, for any W E TJ M, we
have in any admissible chart (U, x) that
b · · k. · k · ·
V W = g(V, W) = gijV"W1 = gijg ·nkw1 = 8jnkw1 = njW1 = nw.
As the charts cover M, VbW =OW on M. Since WE TJM was arbitrary,
Vb =0. □
w~ = f w(vi)g(vi, Vi)Vi-
i=I
Since the Jacobian matrices of the chart transition maps are equal to the
identity matrix, and also as
au(cp + 21r, z) = au(cp, z) = av(cp, z),
av(0 - 21r, z) = av(0, z) = au(0, z),
it follows that au, av define a (0, 2)-tensor field g on C.
g is a metric: Firstly, the matrices au, av, are both pointwise symmetric
and invertible, giving the symmetry and nondegeneracy of g(p) for each
p EC. To see that the metric index of g is 1, we note that with
v = ( cos i) e1 + ( sin i) e2, w = -( sin i) e1 + ( cos i) e2,
we have vta(a)v= 1, wta(a)w= -1, vta(a)w=0.
Consider the smooth curve 'Y: (0, 21r)-> C given by
'Y(0) = (cos0,sin0,0) for all 0 E (0,21r).
Let WE TJU be the vector field
Wp = -(sin;) Ot,p + (cos;) 00,p
for all points p = (cos 0, sin 0, z) E U, 0 < 0 < 21r, z E R. Then Wp is timelike
for all p E U since g(p)(Wp, Wp) = -1 < 0. Now suppose X E TJC is a
time-orientation for (C,g), and write X = xtot + X'Pocp in V, for some
Xt, X'P E C 00 (V). Then for all p E Un V, we have
ob oz
ocp,p = ocp (y(p))o0,p + ocp (y(p))oz,p = l-00,p + O·oz,p = 00,p-
Thus in Un V, we have X = Xtot + X'Pocp = Xtot + X'Po0. Define smooth
function f: (0, 21r)-> R by
/(0) = g('Y(0))(W.,,(0),X.y(0)) for all 0 E (0,21r).
Then by the continuity of f, and the fact that W is timelike everywhere, we
must have f either everywhere positive, or everywhere negative. (Indeed,
otherwise f will be zero at some 0*, giving a point 'Y(0*) E C, where the
nonzero timelike vectors X.,,c0.), W.,,c0.) are orthogonal, a contradiction to
Lemma 5.1.) Set p = (l, 0, 0) E C. Then as Xp is timelike, we obtain
(Xt(p)) 2 - (X'P(p)) 2 < 0. This implies that X'P(p) =I- 0. But we have
. ( )-[ . 0 0 ][cos0 sin0][Xt(y- 1 (0,0))]- 'P()
J~f 0 - -sm 2 cos 2 sin0 -cos0 X'P(y- 1 (0,0)) --X P'
while
. ( )-[ . 0 0 ][cos0 sin0][Xt(y- 1 (0-21r,O))]- 'P()
hm f 0 - -sm 2 cos 2 Sln
0 /21r
. 0 - COS 0
X'P( y -1( 0 - 27r, 0)) -X P,
a contradiction to the fact that / is either everywhere positive or everywhere
negative. So (C, g) is not time-orientable.
Chapter 6
Levi-Civita connection
107
108 A Mathematical Introduction to General Relativity
The basic difficulty is that W -y (t) E T-y(t)M, while W -y (O) E T -y (o)M , since the
spaces T-y(t)M, T -y (o)M are different, we cannot make sense of W-y(t)-W-y(O)·
It would be great if we could somehow 'connect' the tangent spaces
T -y (t)M, T -y (o)M , as in the happy situation of !Rm, where (as we shall see
below), there is a standard way of doing so (using the vector space structure
of !Rm), making the above idea work. In a general manifold, however, there
is no such 'connection' available from the smooth structure alone.
W -y( t )
-----t---4. , (t)
p= , 0)
We will have to put such an additional structure 'by hand' , and this is
exactly the notion of a 'connection' , which we will soon introduce. There
are infinitely many connections available on a given smooth manifold, and
so one has to make a choice in order to be able to do the aforementioned
differentiation. However, for a semi-Riemannian manifold, there is a certain
natural choice, induced from the metric, called the Levi-Civita connection.
The Levi-Civita connection allows the definition of a geometric object called
the curvature tensor field R. The spacetime field equation involves tensor
fields derived from R .
The smooth manifold M = !Rm , with its standard smooth structure, has
a global chart provided by the identity map, and so we can identify each
tangent space TpM with !Rm , using the map
smooth curve such that 'Y(O)=O and v"f,p=v, e.g., 'Y(t)=p+tv. Then
The reason behind calling (C4) the 'torsion-free' property is that the map
T 01 MxT01 M 3 (V, W) >-+
-
B(V, W):=v'vW -v'wV - [V, W]
-
E T 01 M
is C 00 (M)-bilinear (see Exercise 6.4), and so the map
T: TfMxTJMxTJM-> JR
(n, V, W) >-+ O(B(V, W)) = O(v'vW - v'wV - [V, W])
is a (1, 2)-tensor field, which is called the torsion tensor-field.
Exercise 6.1. Check (Cl)-(C5) for the flat connection V on (Rm, g).
Exercise 6.2. Consider R 3 with the standard smooth structure and the flat con-
nection. Let V =yzox+zxoy+xyoz, and W = (xy 2 +z)ox +(y 2 -x)oy+(x+z 3 )oz.
Determine VvW, VvP W, and (VvW)(p), where p= (1, 1, 1).
Exercise 6.3. Consider the smooth manifold Rm with the standard smooth struc-
ture. Suppose that rfi E C 00 (Rm) are m 3 arbitrary smooth functions on Rm. Let
V: TJMxTJM-+ TJM be given by (V, W) >-+ VvW = (ViWirfi + VWk)Oxk,
for all V = Vi Oxi, W = Wio"'; E TJRm (xi are the component functions of id!R"' ).
Show that V satisfies (Cl), (C2), (C3), and so is a connection on Rm.
Example 6.1. Consider IR 2 with its standard smooth structure, and define
the connection v' as in Exercise 6.3, where
I'"' -
xx -
rxyy -- I'Yxx -- I'Y - Q
yy - ,
rxxy -- rxyx -- 1 + x2y + y2 ' I'Y - I'Y -
xy - yx -
X
1 + x2 + y2
Then v' defines a connection on IR 2 . In Example 6.2, we will see that this
connection gives a 'non-flat saddle shape' to IR2 . ◊
Exercise 6.6. Let IR.2 be given the standard smooth structure, and suppose it is
also equipped with the flat connection V. Show that 2V is not a connection on
IR.2. Hint: The Leibniz rule (C2) can be shown to fail, e.g. by taking f = x and
V =Ox= W in the admissible chart (IR.2, (x, y) >-+ (x, y)).
Exercise 6.8. (Lie derivative revisited). We had seen in Definition 3.15 (p.58) that
the Lie derivative £ : TJ M x TJ M -+ TJ M can be defined on a smooth manifold,
using simply its smooth structure, by Cv W = [V, W] for all V, W E TJ M.
Prove that£ satisfies (Cl) and (C2). However (C3) fails in general, and show this
concretely, by taking M = R 2 with its standard smooth structure, the function
f = x, and the vector fields W =Ox= V, X = 0, in the global admissible chart
(R 2 , (x,y) >-+ (x,y)).
1 For an (r, s )-tensor field, the top indices on its components are historically referred
to as contravariant, and the ones below as covariant. Fixing W, we associate v'.W
with the (1, 1)-tensor field sending (n, V) E Tf M x TJ M to n(v'v W) E C 00 (M). Thus
W,..... v'.W takes a (1, 0)-tensor field W, and makes it a (1, 1)-tensor field, increasing its
covariance index from O to 1. Thus W ,..... v'. W is referred to as 'taking the covariant
derivative'.
2 The name originates from the fact that this tensor field describes how a tangent space
twists along a curve when it is 'parallelly transported' along the curve. The notion of
parallel transport will be discussed in the chapter on geodesics.
112 A Mathematical Introduction to General Relativity
Remark: So the Lie derivative Lv W is not the appropriate notion of the 'di-
rectional derivative of W in the direction of V' we seek. Indeed, the sought-after
directional derivative at a point should depend only on the value Vp of V at p,
and not on what V does in the vicinity of p . But the Lie derivative does take into
account how V changes. To see this, consider a V of the form V = f X. Then we
have L f x W = f Lx W - (W f)X. The presence of the extra term (W f)X shows
that, if we use two different f s that match at p, so that the Vp is the same, the
different values of f in the neighbourhood of p will have an effect, even at the
single point p being considered (since the W acts on f as a differential operator;
so although the two fs match at p , if their derivatives differ, then this will be
detected by a suitable differential operator W) . Also, one of the applications of
the connection we want to have, is to be able to define the acceleration of a curve
by calculating the derivative v' v-,,r (·> v,.,,,,.,,(·) of the velocity vector to the curve in
the direction of the velocity vector of the curve, and again the Lie derivative is
not useful since Lv V = 0 for any vector field V which is an extension of the v,.,, ,,.,,(·) ·
*
Example 6.2. Consider the parametrised surface Mc JR 3 given by
JR 2 3 (u,v) I-+ r(u , v)=(u, v , uv) c JR 3 .
This is the graph of z=f(x,y)=xy, which has a 'saddle' shape.
Thus we define v' : TJ- M X TJ- M --> TJ- M by v' V w = v' V w - (v' V w,n) n,
where n is the 'unit normal field ' to M , given by
or or
-x-
ou OV (-v , -u , 1) 1
n= ✓1 2 2(-v, -u, l).
11(-v, -u, 1)11 +u +v
v'vW = v'vW-(Wuvv+wvvu)((0,0,1),n.)n.
= V(Wu)ou + V(Wv)ov + (Wuvv + wvvu)oz
-(Wuvv + wvvu)((0, 0, 1), n)fi.
= V(Wu)ou + V(Wv)ov + (Wuvv + wvvu)(oz - ((0, 0, 1), fi.)fi.).
We have
oz - ((0, 0, 1), fi.)fi.
1
= (0,0,1)- 1 2 2 (-v,-u,1)= 1 12 2 (v,u,u 2 +v)
2
+u +v +u +v
So with
we have
In this section, we will learn that the covariant derivative can be computed
locally, and the connection information is encoded in any admissible chart
by m 3 functions I'fi called the 'connection coefficients or Christoffel sym-
bols'. We begin with the following crucial fact.
Proof. We prove this in two steps. First keeping V fixed, and then re-
placing also V. Let x E C 00 (M) be a bump function such that x = 1 in
a neighbourhood Uo of p, and x = 0 outside a neighbourhood U1 where
U1 c U. Then we have x(W - W) = 0 E TJ M. With c = 0 E JR, we have
Levi- Civita connection 115
Also,
(v'v(JW))p = (v'x2v(x 2 JW))p
= ((x 2 V)(xf))(p)· (xW)p + (xf)(p) · (v' x2v(xW))p
= (VJ)(p)·(xW)p + (xf)(p)·x(p)·(v'xv(xW))p
= ((VJ)W + f·v'vW)p,
proving (C2). Finally,
(v'Jv +x W)p = (v' x 2 uv +x) (x 2 W))p
+ x(p)(v'xx(x 2 W))p
= (xf)(p) · (v'xv(x 2 W))p
= (xf)(p)·((xV)x)(p)·(xW)p + (xf)(p)-x(p)-(v'xv(xW))p
+l·((xX)x)(p)·(xW)p + 1-x(p)-(v'xx(xW))p
= 1-f(p)·0·Wp + f(p)-1-(v'vW)p + 1-0·Wp + 1-1-(v'~W)p
= (Jv'vW+v'~W)p,
showing (C3). This completes the proof. □
Exercise 6.9. The aim of this exercise is to show the following stronger version of
Lemma 6.1, which, roughly speaking, says that VvW is determined at a point
p by just knowing the value Vp of V at p and the values of W in any arbitrarily
small neighbourhood of p.
Let M be a smooth manifold with a connection V on M. Suppose that p E M
and U c M be an open set containing p. Let the vector fields V, V, W, W E TJ M
be such that Vp=Vp, and W=W in U. Then (VvW)p=('v\7 W)p-
Hint: Proceed as follows. Keeping V fixed, the claim about replacing W by W
was proved in Lemma 6.1. It remains to show (v'v-vW)p = 0. Set X = V-V.
Choose a chart (U, x) containing p, and write X = Xi Oxi, and note that X\p) = 0.
Using Proposition 6.1, we may use the restriction of the connection to U. Note
that (V x W)p = (v'~; 0.,, Wlu )p-
kth compoment of v' i'J.,; V is (v' i'J.,; v)k = vk ,i + rji Vi' which is written as
k k
V ;i = V ,i + r iik V 1. .
The notation Vk ;i for (v' 0,,; V)k may cause confusion, as the latter means we
take the kth component of the vector field v' 0,,i V, while the former suggests
that we are taking a new sort of derivative of only the kth component (and
this is clearly wrong, since (v' a,,; V)k does involve all the other components
of V too, as is evident by the right-hand side of the above display). We will
not use the notation V\i- *
Proposition 6.2. Let M be a smooth manifold with a connection v'. Let
(V, y), (U, x) be admissible charts for M, with connection coefficients de-
. l
noted by riJk·V
, riJk-U , respective y. T h en
Proof. Let us first show that for any function f E C 00 (M), we have
(oyiXr)oxrf = Oyif- Indeed, we have that the right-hand side is
which matches with the left-hand side of the equality we wanted to prove. In
order to show the claimed transformation law for the connection coefficients,
we use the definition of the connection coefficients in the chart (V, y), and
convert all the coordinate expressions relying on the (V, y) chart using the
118 A Mathematical Introduction to General Relativity
Example 6.3. Consider IR 2 with its standard smooth structure and the flat
connection v'. Let us determine the connection coefficients with respect to
the chart (U = JR 2 , idIR2). We have
v'oxl-Oxi = 0,
for all i,j E {1, 2}, and so it follows that r:f
=0 for all i,j, k E {1, 2}. How-
ever, the connection coefficients do not necessarily vanish in other charts,
e.g. with polar coordinates, as we shall see explicitly now. Suppose that
(for ease of handling the angular coordinate) V = {(x, y) E IR2 : y > 0}, and
y(x, y) = (r, 0), where
r:=,v'x 2 +y 2 and 0:=cos- 1 x ,
y'x2+y2
and cos- 1 : (-1, 1) ---+ (0, 1r) is the inverse of the cosine function. Then,
using the above transformation rule, we have for example (where we use
Levi- Civita connection 119
the more suggestive 0, r to label the r-symbols rather than number indices)
r00 = 0 + (oxr)o0(00x) + (oyr)o0(00y)
x (-x) + y (-y) = -,Jx2+y2 = -r.
,Jx2+y2 ,Jx2+y2
Similarly,
and
Substituting v' a.,; Oxi = r;ioxr and multiplying throughout by the inverse
metric component gek (and summing over the es) yields
r~j = rji = r;i 8: = r;i gre gfk = g(r;i Oxr, Ox£) gfk
gfk
= g(v' o.,; Oxi, Ox£) gfk = 2 (ox;gje + Oxigfi - Oxtgij)-
This completes the proof. □
Exercise 6.13. Let (M, g) be an m-dimensional Lorentzian manifold with the Levi-
Civita connection v'. Let (U, x) be an admissible chart for M, and define G = [gii]-
Let Sm denote the set of all permutations of {1, • • • m}. For a permutation 1r E Sm,
let sign 1rE {-1, 1} denote its signature 4 .
4 See page 210.
Levi- Civita connection 123
Similarly,
where a:= ~~. Similarly one can also calculate that for 1 ~ i, j, k ~ 3 that
rgo = 0, rgi = r?o = 0, r?j = 8ijaa, rio = 0, rij = rjo = 8ij ~' and r)k = 0.
◊
Similarly,
t t m
rtr = rrt = r2(1- 2;."),
rr m rr ( 2m)
rr = - r 2(l-2;."), 00 = -r 1---;:- ,
0 0 1 0
rr0 = r0r = -, rcfuf, = -(sin0)(cos0),
r
</> </> cos0 cf, - cf, - 1
r 0</> = r ¢0 = ---:--0,
sm
rrcf, - rcf,r - -,
r
and all other r-symbols are zeros. ◊
Levi- Civita connection 125
Exercise 6.14. For the saddle surface M considered in Exercise 5.13, with the
induced metric g on M from the Euclidean inner product on 113 , determine the
Christoffel symbols for the Levi-Civita connection with respect to the global chart
given there. Note that these coincide with the connection coefficients determined
in Example 6.2, showing that the connection considered in Example 6.2 was in
fact the Levi-Civita connection corresponding to g.
A similar strategy works with any (r, s)-tensor field T. By the desired
Leibniz rule, for 0 1, · · · , nr E TP M and V, W1, · · · , Ws E TJ- M, we have
V(T(01, · · · , or, W1, · · ·, Ws))
= v'v(T(01, · · · , nr, W1, · · ·, Ws))
= (v'vT)(01,- · ·, nr, W1, · · · , Ws)
r
+ ~ T(n1, · · · ,ni-l, v'vfl\ni+ 1, · · · ,nr, W1, · · ·, Ws)
i=l
s
(v'vT)(fn+ e, W)
= V(T(JO + e, W)) -T(v'v(JO + 8), W) -T(JO + e, v'vW)
= V(JT(O, W) + T(e, W))-T((Vf)O + fv'vO + v'v8, W)
-JT(O, v'vW) -T(8, v'vW)
= (Vf)T(O, W) + JV(T(O, W)) + V(T(8, W))
-(Vf)T(O, W)
- f (T(v'vO, W)) -T(v'v8, W) - JT(O, v'vW) -T(8, v'vW)
= f (V(T(O,W)) -T(v'vO,W) -T(O, v'vW))
+ V(T(e, W)) -T(v'v8, W) -T(e, v'vW)
= f (v'vT)(n, W) + (v'vT)(e, W).
C 00 ( M)-linearity in the second slot follows similarly. The above motivates
the following.
128 A Mathematical Introduction to General Relativity
for 0 1 ' ... 'nr E Tf M and W1, ... 'Ws E TJ- M (and where an empty sum
is taken as the zero function).
From the definition we see that v'v(T + S) = v'vT + v'vS for (r, s)-tensor
fields T and S.
Exercise 6.18. Let M be a smooth manifold with a connection v'. Let R be a
(1, 3)-tensor-field such that it has the symmetry R(n, X, Y, Z) = -R(n, Y, X, Z)
for all n E Tf M and all X, Y, Z E TJ M. Show that the covariant deriva-
tive inherits this symmetry, that is, for n E Tf M and all V, X, Y, Z E TJ M,
(v'vR)(n, X, Y, Z) =-(v'vR)(n, Y,X, Z).
Exercise 6.19. Let M be a smooth manifold with a connection v'. Let f E C 00 (M),
TE T;M and VE TJM. Prove that v'v(JT)=(Vf)T + f(v'vT).
v' satisfies a Leibniz product rule over the tensor product ® of tensor fields.
Proposition 6.5. Suppose that M is a smooth manifold with a connection
v'. Let S E TJ M and T E r:; M. Then for all V E TJ M, we have that
v'v(S®T) = (v'vS)®T+S®v'vT.
130 A Mathematical Introduction to General Relativity
(v'v(O®T))(e, W,X)
= V(O(W)T(e, X)) - (n ® T)(v've, w, X)
-(O®T)(e, v'vW,X)- (O®T)(e, w, v'vX)
= V(OW)T(e, X) + O(W)V(T(e, X)) - O(W)T(v've, X)
-O(v'vW)T(e,x) - O(W)T(e, v'vX)
-------------
= (V(OW) - O(v'vW))T(e,x)
+n(W)(V(T(8,X))-T(v'v8,X)-T(e, v'vX))
= (v'vO)(W)T(e, X) + O(W)(v'vT)(e, X)
= ((v'vO) ® T)(e, w, X) + (n ® v'vT)(e, w, X)
= ((v'v0)®T+O®v'vT)(8,W,X).
Let M be a smooth manifold. Recall that if VE TJ- M, then the Lie deriva-
tive .Cv f := VJ for f E C 00 (M), and .CvW := [V, W] if W ETJ-M (Defini-
tion 3.15). We now extend this, and define the Lie derivative of arbitrary
tensor fields by demanding a Leibniz rule, akin to what was done in the
previous section for the covariant derivative. We begin by extending the
definition of the Lie derivative to 1-form fields.
r
- ~ T(n1,- · · ,ni- 1,.cvn\ni+ 1, · · · ,nr, W1, .. ·, Ws)
i=l
s
- ~ T(n1,- · · , nr, W1, · · ·, Wj-1, .CvWJ, Wj+1, · · · , Ws),
j=l
Again, we must check (01, • • • , nr, W1 , • • • , Ws) >-+ [the RHS of the above],
is C 00 (M)-multilinear. We check this for (1, 1)-tensor fields T. The proof
in the general case is analogous. For f E C 00 (M), e, n E Tf M, WE TJ M,
(CvT)(JO+ e, W)
= V(T(JO + e, W)) -T(Cv(JO + 8), W) -T(JO + 8,CvW)
= V(JT(O, W) + T(e, W))
-T((VJ)O + JCvO + Cve, W)-JT(O,CvW) -T(8,CvW)
= (VJ)T(O, W) + JV(T(O, W)) + V(T(e, W))
-(VJ)T(O, W) - JT(Cvn, W) -T(Cve, W)
-JT(O, CvW) -T(e, CvW)
= f(V(T(O, W)) -T(Cvn, W) -T(O, CvW))
+ V(T(e, W)) - T(Cve, W) - T(e, Cv W)
= J(CvT)(O, W) + (CvT)(e, W).
The C 00 (M)-linearity in the second slot follows similarly.
Exercise 6.26. (Killing vector field and Killing's equation).
Let (M, g) be a semi-Riemannian manifold with the Levi-Civita connection v'
induced by the metric g. A vector field V E TJ M is called a Killing" vector field
if .Cvg=O. Show that if Vis a Killing vector field, then for all X, YE TJ M,
• g([V, X], Y) + g(X, [V, Y]) = V(g(X, Y)), and
• (Killing's equation) g(v' x V, Y) + g(X, v'y V) = 0.
Exercise 6.27. Let M = IR.3 be equipped with the standard smooth structure and
the metric g described in Exercise 5.12. Show that the vector fields Lx, Ly, Lz
given in Exercise 3.21 are Killing vector fields for g.
ForX,YETJM, andpEM,
IL(((1Pt)*T-T)(X, Y)(p))
= IL(T('lt,tp)((d'lt,t)pXp, (d'lt,t)vYv) -T(p)(Xp, Yp)}
= IL(T('lt,tp)((d'lt,t)pXp, (d'lt,t)vYv) -T('lt,tp)(X,t,,p, Y,i,,p))
+IL(T('lt,tp)(X,t,,p, Y,t,,v) -T(p)(Xp, Yp)}. (6.2)
We will consider the last two summands above separately. First, with
1'v (t) := 1PtP, t E R, we have v'Yp ,P = Vp as 1'v is an integral curve of V. With
f :=T(X, Y), we have for the second summand that
So
So
Proof.
'If part': Suppose that each1Pt is an isometry. Then (1/Jt)*g = g, so that
Proposition 6.6 immediately yields .Cvg = 0, that is, V is a Killing vector
field.
6 At least formally, this is clear, but we do not include a detailed proof here (which can
be carried out by considering a chart containing p and all 1PtP for t close enough to 0,
and using the smoothness of gij, "/p, and using the chart representative Jacobian matrix
of (d'I/Jt)p)-
Levi- Civita connection 135
'Only if part': Let .Cvg = 0. Suppose that X, Y E TJ- M. For a fixed s E JR,
set X = ('1/Js)*X and Y = ('1/Js)*Y. For anyp EM, ((.Cvg)(X, Y))('I/JsP) = 0,
and so by Proposition 6.6,
O = lim g('l/Jt'l/JsP)((d'l/Jt),J,.pX,j,.p, (d'l/Jt),J,.pY,p.p) - g('l/J 8 p)(X,J,.p, Y,p.p)
t--+O t
= lim g('l/Jt+sP)((d'l/Jt),J,.p(d'l/Js)pXp, (d'l/Jt),J,.p(d'l/Js)pYp)-g('l/J.p)((d'l/Js)pXp, (d'l/Js)pYp)
t--+O t
= lim g('l/Jt+sP)((d('l/Jt o 'l/Js))pXp, (d('l/Jt o 'l/Js))pYp) - g('l/J.p)((d'l/Js)pXp, (d'l/Js)pYp)
t--+O t
= lim g('l/Jt+sP)((d'l/Jt+s)pXp, (d'l/Jt+s)pYp) - g('l/J.p)((d'l/Js)pXp, (d'l/Js)pYp)
t--+O t
= d(g('l/JuP)((d'l/Ju)pXp, (d'l/Ju)pYp)) ( )
~ s.
As s E JR was arbitrary, we conclude that the map
JR 3 a 1--+ g('I/JuP)((d'I/Ju)pXp, (d'I/Ju)pYp)
is constant, so that (putting a = 0)
g('I/JuP)((d'I/Ju)pXp, (d'I/Ju)pYp) = g(p)(Xp, Yp) for all a E JR.
Hence ('1/Ja)*g = g for all a ER □
Example 6.10. Consider JR 3 with the standard smooth structure, and the
Riemannian metric g, given in the global chart (JR3, (x, y, z),..... (x, y, z)) by
g = dx ® dx + dy ® dy + dz ® dz (Exercise 5.12). Recall from Exercise 3.21,
that the flow maps for the vector fields
Lx = YOz - ZOy
Ly= ZOx -XOz
Lz = XOy-YOx
are rotations about the x-, y-, z-axis, respectively, which are isometries
(Exercise 5.12). In Exercise 6.27 above, we had verified that Lx, Ly, Lz are
Killing vector fields. ◊
Notes
The proof of Theorem 6.3 is based on [O'Neill{1983), Chap. 9, Prop. 21].
This page intentionally left blank
Chapter 7
Parallel transport
The tangent vector along a curve exists only along the points of the curve.
In order to talk of the acceleration, we would like to differentiate this 'vector
field' that 'lives along the curve'. This prompts the following definition.
137
138 A Mathematical Introduction to General Relativity
A vector field along a curve is not necessarily the restriction of a vector field
on M to 'Y(J). Indeed, the curve may intersect itself, say at p="((a) ="f(b),
where a, b E J with a # b, and if the vector field V along 'Y is such that
V"Y(a) # V"Y(b), then we cannot create a global extension W of the V to a
vector field on M, since there is a problem of specifying what Wp ought to
be. The picture below illustrates the problem when we take the tangent
vectors along 'Y (which we will see is a vector field along 'Y)- However, a
'local' extension is possible for a part of the curve in the neighbourhood of
'Y(to) if v"Y,"Y(to) #0; see Remark 7.1.
V"Y ,"Y(b)
Proof. We recall that the admissible chart (U, cp) for M induces an ad-
missible chart (U,<p) for TM, where U = {(p,v): p EU, v E TpM} and
<p(p,v)=(cp(p),v(x 1 ),· · · ,v(xm)). Then <p: U--> <p(U) c IR. 2 m is a diffeo-
morphism. Now suppose that V is smooth. Then <po V is smooth, that is,
I 3 t >-> (<po V)(t) = (cp('Y(t)), V 1 (t), • • • , vm(t)) is smooth. In particular,
Vi E C 00 (J).
Pamllel tmnsport 139
(1) V is smooth.
(2) For any chart (U, x) in A such that Iu -# 0, we have Vi E C 00 (Iu) for
all i E {1, · · · , m}, where V(t) = ('Y(t), Vi(t)Oxi,-y(t)), t E Iu.
Proof.
(1)=?(2): Let (U,x) EA and t E Iu. Then 7(t) EU. By the continuity
of 'Y, there exists a small enough open interval It c Iu containing t. Then
7(It) c U. By Lemma 7.1 above, Vil1, is C 00 • As the choice oft E Iu was
arbitrary, vi E C 00 (Iu ).
(2)=?(1): Lett EI. Let (U, x) EA be such that 7(t) E U. Then t E Iu -# 0-
As Iu is open, there exists an open interval It c Iu containing t. By the
hypothesis, Vi E C 00 (Iu), and so Vi E C 00 (It) as well. By Lemma 7.1, VI It
is smooth. As t E I was arbitrary, V is smooth. □
For a vector field V E TJ7 along a curve 'Y : I -+ M, just as we have been
doing with vector fields on M, we will identify V(t) = ('Y(t), v(t)) (t E I),
where v(t) E T'Y(t)M, with v(t).
'Y)
so that v;-= dtd(xi o
, and these are smooth on I u ( as 'Y is smooth). ◊
We note that by Exercise 6.9, the notation v' v-r,-r<•J W is legitimate, since
v' x Y at a point p depends only on Xp (and on the values of Y in a neigh-
bourhood of p). Thus, here we mean that if t 0 E J and V E TJU is any
vector field that coincides with the vector v7 ,7 (to) at the point 'Y(to) E M
(and such a vector field can be constructed using a bump function; see
Lemma 3.1), then v'v-r,-r<•l W := (v'y-W) 7 (t)·
Proof.
Existence: Given WE TJ'Y~ write W = WiOxi- Also,
TT() •i();i h ·i() d(xio'Y)()
v 7 t =v7 ,7 (t) ="f t uxi,'"l'(t), w ere 'Y t := dt t.
· dWk
Set Wk=&' and define
(v'v'Y W)(t) = (Wk(t) + rt/'Y(t))')'i(t) Wi(t))Oxk,7(t), t E J. (7.1)
As wk+(rfjo'Yhj wi E C 00 (J), we have v'v'Y WE TJ'Yu by Proposition 7.1.
We now verify that v'v,'Y has the
•
claimed
.
properties. The JR-linearity follows
.
immediately since (cW +X)'=cW'+X'.
Pamllel tmnsport 141
Theorem 7.1. Let M be a smooth manifold with a connection v'. For each
smooth curve 'Y: I-> M, where I is an open interval in R, there exists a
map v'v-r : TJ-'Y-> TJ-'Y such that the following hold for W, X E TJ-'Y, c ER
and f E C 00 (I):
Proof.
Uniqueness: Let there exist a map D7 : TJ-'Y -> TJ-'Y satisfying the given
properties. Let t 0 E I, and (U, x) be an admissible chart containing the
point 'Y(t 0 ). Let Jc I be an open interval containing t 0 such that 'Y(J) c U.
Given WE TJ-'Y, write W(t) = Wi(t) Oxi,'"l'(t), t E J. Using the Leibniz rule,
To find (D7 oxi,'"l'(·))(to), we first take any vector field V E TJ- M extending
V7 (to), so that V7(to) = V7 (to) = Vi(to)Oxi,'"l'(to) = v7 ,7 (to) = i'i(to)Oxi,')'(to)·
We note that Vi(t 0 ) =i'i(t0 ). Thus
(D70xi,7(•))(to) = (v' v-y,-y(to)Oxi )7(to) = (v' y-Oxi )7(to) = (v' Vio,,i Oxi )7(to)
~. ~. k
= (V3 v' O,,j Oxi )7(to) = (V 3 r i j Oxk )7(to)
~. k
= V 3 ('Y(to))rij('Y(to))oxk,')'(to)
= Vi(to)rfj('Y(to))Oxk,7(to) = 7i(to)rfj('Y(to))oxk,7(to)·
Substitution in (*) now yields (D7 oxi,'"l'(·))(to) = (v'~-r W)(to), showing the
claimed uniqueness.
Existence: We will define v' V-r by using the charts from an atlas A defining
the smooth structure on M. (Then by uniqueness, it follows that the con-
struction of v'v-r does not depend on the atlas A.) Let t 0 EI, and (U, x) be
Pamllel tmnsport 143
Then v' V-r as defined above has the claimed properties by the proof of the
existence part of Lemma 7.2. Also, we note that if (fi, x) is another chart
containing the point 'Y(t), then the uniqueness part of Lemma 7.2 applied
to U n ft shows that
(v'v,'Y W)(t) = (v'v,"uW)(t)
'Y
= (v'i'Y W)(t).
That I 3 t >-+ (v'v-r W)(t) is an element of TJ'Y follows from Lemma 7.1.
The last claim on the expression for v' V-r W in a chart follows from the
existence part combined with the uniqueness part. □
Exercise 7.1. Let M be a smooth manifold with a connection v'. Let I, J be open
subsets of IR, and J 3 u >-+ h( u) E J be a C 00 function. Let , : J -+ IR be a smooth
curve. From Exercise 2.4, the velocity vector field of , o h : J -+ M is given by
V-yoh(s) = h(s)V-y(h(s)) for s E J. For a given Z E TJ,, define WE TJ(, oh) by
W(s) = Z(h(s)), s E J. Show that (v'v-roh W)(s) = h(s) (v'v-rZ)(h(s)), s E J.
Exercise 7.2. The aim of this exercise is show a further strengthening of the results
in Lemma 6.1 and Exercise 6.9, namely that v'vW is determined at a point p by
just knowing
• the value Vp of V at p and
• the values of W along a curve , passing through p whose velocity vector at
p is Vp (instead of knowing W in an arbitrarily small neighbourhood of p as
in Exercise 6.9).
Let M be a smooth manifold with a connection v', p E M and v E TpM. Let
V, V E TJ M be any two vector fields such that
Vp = V = Vp.
Let , : J -+ M be a smooth curve, where I is an open interval in IR containing 0,
such that 1 (0) = p and V-r,-r(o) = v. Suppose that W, W E TJ M are such that
W-y(t) = W-y(t) (t E I).
Prove that
(v'vW)p=(v'yW)p-
1
Hint: Define the vector field X E T 0 , by X(t)
~ and consider
= W-y(t) = W-y(t)
(v'v-rX)(0).
144 A Mathematical Introduction to General Relativity
Example 7.3. (Sphere). Recall Example 5.7 on page 87, where we consid-
ered the sphere S 2 as a Riemannian manifold using the metric g induced
from the Euclidean inner product on IR 3 . Let 'v denote the Levi-Civita con-
nection induced by g. Consider the meridian , with a constant longitude
</>o E (0, 27r), which in the chart (U, <p) described in Example 5.7, is given by
,(t) = ((cos</>o)(sint),(sin</>o)(sint) , cost) , for all t E (0, 7r). We claim that
the vector field W along , given by the latitudinal vector field O<f>, is not
parallel along,. Firstly, we have W(t) = o<l>,'!(t) , and so W<I> = 1, w 0 = 0,
w
giving W<I> = 0 = 0. Next, v"Y,'!(t) = ,:.t9 00 ,"Y(t) + ,y<I> o<l>,"Y(t) , where
,y9 (t) = d(O O , ) (t) = dt (t) = 1 and ,y<f>(t) = d(</> 0 ,) (t) = d</>o (t) = 0.
dt dt ' dt dt
Thus, keeping track of the nonzero terms, we have that
fi(t) = d(O O µ) (t) = d(~) (t) = 0 and µ<f>(t) = d(</> 0 µ) (t) = dt (t) = 1.
dt dt ' dt dt
146 A Mathematical Introduction to General Relativity
= 0-1-180,µ,(t) + cos01
sinO O=f ·l-loq,,µ,(t) = 0 + 0 = 0.
Exercise 7.3. Consider the same curveµ from Example 7.3: µ(t) = (cost,sint,0),
t E (0, 211-). Take Y(t) = Oef>,µ(t), t E (0, 211-). Prove that Y is parallel alongµ.
Conclude that for a, /3 E R, aX + f3Y is also parallel along µ, where X is the
vector field along µ given in Example 7.3, namely X(t) = oo,µ(t), t E (0, 21r).
Proof. It is enough to show this in admissible charts that meet 'Y(J). Let
(U, x) be such a chart. Then we have
Now we use the metric compatibility property (C5) of the Levi-Civita con-
nection to expand the right-hand side (V(g(ox, , Oxj)))(,(t)) , obtaining
!~g(ox,,Oxj)) (t)
w
148 A Mathematical Introduction to General Relativity
7 .3 Parallel transport
In this section, we will meet the parallel transport map. Given a smooth
curve 'Y : J -> M, where M is a smooth manifold with a connection v',
and I is an open interval in R containing the points a and b the parallel
transport map
P:b : T'Y(a)M -> T"f(b)M
will transport tangent vectors from T'Y(a)M to tangent vectors in T"f(b)M,
parallelly along the curve 'Y· In this manner, it 'connects' the two 'distant'
tangent spaces using the connection v'. The transportation itself will be
done by working locally in charts, setting up a system of ordinary differen-
tial equations for components, and using a known existence and uniqueness
result from the theory of differential equations.
Proposition 7 .3.
Let I be an open interval in R and A: J-> Rmxm have C 00 entries. Then
given a E J and B E Rm, there exists a unique Y : J -> Rm such that the
entries of Y are C 00 , and
Y(t) = A(t) Y(t), t EI,
{
Y(a) = B.
Proof. This follows from a C 1 -version 1 of the result (where the A is just
assumed to be continuous), by the following argument: First view A as c 0 ,
from which we deduce that there is a unique C 1 solution Y. But then look
at the right-hand side AY, which is C1, since A is C 1 and so is Y. Hence
we conclude that Y = A Y is C 1 , which is the same as saying that Y is
C 2 . Again Y = A Y is now C 2 , since the right-hand side is the product of
A, Y E C 2 , and so Y E C 3 , and so on. □
Proof. First suppose that 'Y(J) lies entirely within an admissible chart
(U, x). We write W = WiOxi. Then Wis parallel along 'Y if and only if for
all t EI, (v'v,, W)(t) =0, i.e.,
Wk(t) + ,yi(t)rt/'Y(t)) Wi(t) = o, 1 ~ k ~ m.
1 See for example, [Apostol(1969), §7.21].
Parallel transport 149
{ Y(t) = A(t)Y(t) ,
t E J,
Y(a) = B.
Here A(·) is the matrix whose entry in the ith row and jth column is given
by the function --l(•)qkb(·)) E C 00 (J). By Proposition 7.3, there exists
a unique C 00 solution Y. Consequently, there exists a unique W E TJ---y
parallel to ')' satisfying W (a) = w.
I --y(/3)
--y(/3- !)
Now 2 suppose that there does not exist a single admissible chart covering
all of --y(J). Let (a, (3) be the maximal interval containing a on which a
W E TJ---yl(a,/3) exists which is parallel along '°Yl(a,/3) and W(a) = w. If
(3 = sup J and a= inf J, then we are done. Suppose that (3 < sup J. (The
proof for a > inf J is analogous.) There is an admissible chart (V, y) such
that --y(/3) EV. We can then choose a 8 > 0 such that --y( (/3 - 8, (3 + 8)) E V.
We know that there exists a vector W(,B - !)-
Taking this as an initial
condition, there exists a unique vector field W on (/3 - 8, (3 + 8) such that
Proof. Let w,x E T"Y(a)M, and denote by W,X, respectively, their par-
allel transports along"(: That is, W, XE T1h, "Vv-r W = 0, W(a) = w, and
"Vv-rX = 0, X(a) = x. Let c ER Then by the R-linearity of "Vv-r, we have
"Vv-r(W + cX) = "Vv-r W + c"Vv-rX = 0 + 0 = 0,
and by the definition of addition in TJ'Y,
(W + cX)(a) = + cX(a) = w + ex.
W(a)
Hence W + cX is the parallel transport of w + ex along 'Y· In particular,
Pdb(w +ex)= (W + cX)(b) = W(b) + cX(b) = Pdbw + cPdbx.
Thus Pdb is a linear map. Next we show that Pdb : T"Y(a)M-> T"f(b)M is
invertible. If Mis m-dimensional, then since dimT"Y(a) = m = dimT"f(b)M,
it is enough to show injectivity. Let w E T"Y(a), and W denote the corre-
sponding parallel transport of w along 'Y· Then Pdb(w) = W(b). But now
suppose we want the parallel transport of the vector x := W(b) E T"f(b)M
along the curve 'Y· Clearly W already satisfies "Vv-r W =0 and W(b) =x. By
the uniqueness established in Theorem 7.2, we conclude that the parallel
transport of x along 'Y is W. In particular, P~ x = W (a) = w. Hence
(P~ 0 pdb)w = P~(P:bw) = P~x = W = idT-y(a)MW-
As w E T"Y(a)M was arbitrary, we have P~ o Pdb = idT-rcaiM• Thus Pdb is
injective. So Pdb: T"Y(a)M-> T"f(b)M is a linear isomorphism. □
Proof. We have already seen that P:b is linear and invertible. It remains
to show that it preserves the scalar product. Let w, x E T"Y(a)M , and denote
by W, XE TJ'°Y, respectively, the corresponding parallel transports along '°Y·
Corollary 7.1 (p.147) implies that
g('Y(b))(P:bw, P:bx) = g('Y(b))(W(b) , X(b)) = g('Y(a))(W(a) , X(a))
= g('Y(a))(w , x).
Hence P:b : T"Y(a)M-> T"Y(b)M is an isometry. □
Remark 7.2. In this chapter, we have seen that once we have a connection
v' on a smooth manifold, then it induces a notion of a covariant derivative
v' v.., along each smooth curve ')', and also a notion of parallel transport
along each smooth curve ')'. One can also show3 that the parallel transport
maps determine the covariant differentiation operators along curves via
(v'v, W)(t) = lim P;t(W(r)) - W(t) , WE Th ,
-y T-->t T - t
and also the manifold's connection by
( nv W) = 1· P;o(W"Y(h)) - Wp 1
v P h~ h , p E M, V, W E T0 M,
where')' is any smooth curve such that 'Y(O) = p and v'°Y,P = VP. As we will
not need this, we will not prove this here. *
Example 7.4. (Parallel transport under the flat connection on IR.m).
Consider IR.m with the standard smooth structure and the flat connection
v'. We use the global admissible chart (IR.m, id JR= ). Then for any p E M,
we have a linear isomorphism
m,m
11'>.
( 1
3 W = W , ·· · ,W
m) >--4
l i ::i T m,m
W Vxi,p E pl1'>. •
Let '°Y : I -> IR.m be any smooth curve, where I is an open interval in JR. that
passes through the two points p, q E IR.m. Let a, b E J be such that 'Y( a) = p
and 'Y(b) = q.
In the example above, the parallel transport that maps vectors in TpM to
vectors in TqM was independent of the path 'Y connecting the two points
p, q EM. However, this was because the space was 'flat' (and in the calcu-
lation above, in fact the terms containing 'Yj vanished in our chosen chart
as the f-symbols happened to be zeros). The parallel transport between
two tangent spaces at p, q E M in general does depend on the curve used. In
fact, it is a manifestation of the 'curvature' of the manifold. The following
is an illustration of this in the case of the sphere.
Example 7.5. (Sphere). Consider the sphere S2 as a Riemannian manifold
with the metric induced from the Euclidean inner product on JR 3 , and the
corresponding Levi-Civita connection. In Example 7.3 (p.145), we had
considered the smooth curveµ, given by µ(t) = (cost,sint,0), t E (0,21r).
In the admissible chart (U, cp) considered there, the chart map gives the
usual spherical polar coordinates (0u,</P). In this chart (U,cp), let the
vector field X alongµ be given by X(t) = o0u,µ(t), t E (0, 21r). Then we had
seen that X is parallel along µ (the equator in the chart U). Let a:= ~ and
b := 3~, so that p :=µ(a)= (0, 1, 0), and q := µ(b) = (0, -1, 0). Then with
x = 00u,P = X(a), we have P::bx =X(b) = 00u,q.
Now consider another admissible chart (ff, ip), where we use new spher-
ical coordinates (0u, ¢P), measuring the polar 0u angle of a point from the
positive x-axis (instead of the usual positive z-axis), and measuring the
azimuthal angle ¢P from the negative z-axis (instead of the usual positive
x-axis). Consider the curve µ given by µ(t) = (0, sin t, - cost), t E (0, 21r),
and take Y(t) = -oc/>u,µ,(t)' t E (0,21r).
By Exercise 7.3 (or by a direct calculation), Y is parallel alongµ. With
a=~ and b=3~, we have µ(a)= (0, 1, 0) =p, and µ(b) = (0, -1, 0) =q, and
(*)
Y(a) = - oc/>u,p = o0up
, = x.
That (*) holds is convincing based on the following picture, but a justifica-
tion is given at the end of this example.
Parallel transport 153
~ (•)
Referring to the picture above, P/:b x = Y(b) = - o,,,u
'f' ,q
= - o0uq
'
= - P/:b x.
Thus the result of parallel transporting x from p to q is different along the
two curves µ and µ.
Justification of ( *) and ( *):
(2.2) (p.22) implies that TpS 2 3 v >-+ v = (vx , vy , vz) E IR.3 is injective.
Thus it suffices to show (00upX,00upy,09upz)
, , ,
= - (o,,,uo/~
x,o,,,u
o/~
y,o,,,u
o/~
z)
o
in order to prove 0 u,P = - ov,v·
We have
o( (sin 0)( cos¢))
00
o( (sin 0)(sin ¢))
00
o(cos0)
~
Ul
Also,
o(cos0)
~
o( (sin 0)(sin ¢))
o,p
o(-(sin 0)(cos ¢))
o,p 0=f
t/>=t=f
Thus - o,t,fl,p = o0u,p, that is, ( *) holds. Similarly,
o( (sin 0)( cos¢))
o0
:;(sin
o(cos0)
O)(filn ¢)) - [_ l~
~ -
o( cos 0)
;~sin O)(sin,;))
o(-(sin0)(cos¢))
00 &=f o<f>
1>= 3;
justifying o'¥,,,u,q = o0 u ' q, that is, ( *) holds. ◊
This page intentionally left blank
Chapter 8
Geodesics
We call a(t) E T.,,(t)M the instantaneous acceleration at the point 'Y(t) EM.
We refer to v' V-r V.,, = 0 as the geodesic equation.
155
156 A Mathematical Introduction to General Relativity
d i d"i
Here 'Yi = xi o 'Y, :Yi = d7 , -yi = d7 , and rfj are the Christoffel symbols in
(U,x).
The following example shows that when we think of a geodesic 'Y : J -> M,
we should not only think of the range 'Y(J), as the parameterisation matters.
Example 8.1. Consider JR 2 with the standard smooth structure and the
flat connection v'. In the chart (JR2 , id), all the connection coefficients
vanish. If Jc JR is an open interval, then a curve I 3 t ...... 'Y(t) = (x(t), y(t))
is a geodesic if and only if we have x(t) = 0 = jj(t) for all t E J, that is,
(x(t),y(t)) = (xo, Yo)+ tv for some (xo, Yo), v E JR2 . So 'Y is a straight line.
However, not all curves whose image lies along a straight line will be a
geodesic. As an example, consider µ : JR -> JR2 , µ(t) = (t3, t 3 ) fort E R
Then V,,(t) = 3t2 Ox,µ(t) + 3t2 oy,µ(t), and so we have :yx(t) = 3t2 = lyY(t),
giving ,:;,x(t) = 6t = -yY(t). Thus the geodesic equation is not satisfied, as
-yk(t) + rfj('Y(t)) :Yi(t) lyi(t) = 6t =I= 0 fort =I= 0. Soµ is not a geodesic.
Hence, geodesics lie along straight lines, but not all curves lying along a
straight line are geodesics. ◊
We now show that given any point p E M and a direction v E TpM, there
exists a geodesic 'Y : I -+ M defined on some open interval I c R containing
0, such that "f(0) = p and v"Y,"Y(o) = v, called the 'geodesic passing through
pin the direction of v'. To show this, we will use the following result, which
follows from a well-known existence and uniqueness result from the theory of
ordinary differential equations; see e.g. [Apostol(1969), Thm. 7.19, p.229].
Proposition 8.3. Let Ve Rm be an open set, Fh E C 00 (V), l~i,j,k~m,
P E V, v E Rm. Then there exists an open interval I c R containing 0, and
a map
J 3 t 1-+ Y(t) = (Y 1 (t), · · · , ym(t)) EV
which is a solution to the following initial value problem:
f-k(t) + Fij(Y(t))Yi(t)Yi(t) = 0 (l~k~m)
{ Y(0) = P (8.1)
Y(O) = V.
Moreover, any two solutions Y: I-+ V and Y: 1-+ V coincide on In I.
Proof. For all l~k~m, let us define Ak(t) = Yk(t) and Bk(t) = Yk(t).
Set A= (A1,- ••,Am) and B = (B1,- • • , Bm). Then Y satisfies the given
initial value problem if and only if (A, B) is a solution to the following
Let r > 0 be such that the closed ball B(P, r) with center P and radius r
is contained in V. Then the function
K := B(P, r) x B(v, r) _. JR 2 m
(a 1 · · · am b1 · · · bm) >-+ (b1 · · · bm -F-1-(a)bilJi · · · -F'!'.'(a)bilJi)
~ ' ' ' ' ' ' 1,J '' 1,J
=:a
is a Lipschitz function (since its first order partial derivatives are all
bounded on the compact set K). So by [Apostol(1969), Thm. 7.19, p.229],
there exists a C 1 solution (A, B) : J -> IR 2 m on some interval I containing
0, satisfying the initial value problem (8.2). But as the functions Fij are all
smooth, the right-hand sides of the differential equations in (8.2) are C 1 .
So (A, B) is C 1 , i.e., (A, B) is C 2 . Again as the Fij are all smooth, the
right-hand sides of the differential equations in (8.2) are C 2 , implying that
(A, B) is C 2 , i.e., (A, B) is C 3 • Continuing in this manner, (A, B) E C 00 (J).
So there exists a C 00 ( J) solution Y: = A to (8.1).
By the local uniqueness of solutions [Apostol(1969), Thm. 7.19], Y, Y
coincide in a small interval (-1:, 1:) around 0. Set
Tmax = sup{T E J" J: Y(t) = Y(t) for all t ~ r}.
We want Tmax = sup(J n J). ~ Let Tmax < sup(J n J). ~ As Y, Y~ are C 1 ,
Theorem 8.1. Let M be a smooth manifold with a connection v'. For any
p E M and any v E TpM, there exists an open interval I c R containing 0,
and a geodesic 'Y: I--+ M such that "f(0) = p and v 7 ,p = v. Moreover, any
two such geodesics 'Y : I --+ M and ~ : J--+ M coincide on I n J.
So the differential equation in (8.1) implies that 'Y satisfies the geodesic
equation. Moreover, we have that 'Y(0) = x- 1 (Y(0)) = x- 1 (P) = p, and
V7,p =,yi(0) Oxi,"f(O) = Yi(O) Oxi,"f(O) = ViOxi,p = V.
The uniqueness part is immediate from Proposition 8.3 as long as one is
in a single chart. The argument in the general case is easily adapted from
the uniqueness part of the proof of Proposition 8.3. First, it follows from
Proposition 8.3 that in a small interval (-1:, 1:) around 0, the curves 'Y, ~
do coincide. Set Tmax = sup{r EI n J: 'Y(t) = ~(t) for all t ~ r}. Suppose
that Tmax <sup(J n 1). By continuity, 'Y(Tmax) =~(Tmax) =:pmax• Moreover,
v'Y,Pmax = v7,Pmax =: VPmax (since their components can be seen to be the
same by considering the limit from below Tmax)- By the uniqueness in a
single chart (containing the point Pmax), it follows that a geodesic passing
through Pmax with velocity Vmax is unique, and so 'Y and ~ must coincide in a
neighbourhood of Tmax, contradicting the choice of Tmax• Hence 'Y(t) =~(t)
for all nonnegative t E I n J. A similar argument shows the agreement also
for nonpositive t EI n J. Consequently, 'Y(t) =~(t) for all t EI n J. □
Corollary 8.1. Let M be a smooth manifold with a connection v'. For any
p E M and any v E TpM, there exists a unique maximal geodesic 'Y : I --+ M,
where I is an open interval containing 0, such that 'Y(0) = p and v 7 ,p = v.
Geodesics 161
But now take I = IR., and 'Y(t) defined by the same expression as above.
Then 'Y( t) E C, and 'Y also satisfies the geodesic equation in V ( unless o: = 0,
when it does not enter Vat all, and 'Y(IR.) c U). So 'Y is a maximal geodesic
defined on R Similarly, we can show that every geodesic with image in V
can be extended to a maximal geodesic defined on R Consequently, C is
geodesically complete. ◊
162 A Mathematical Introduction to General Relativity
Exercise 8.5. Let M be a smooth manifold with a connection v'. Let p EM, and
'Y : I-+ M, where I is an open interval in R containing 0, be a geodesic such that
-y(0) = p and v-y,p = 0. Prove that -y(t) = p for all t EI.
Hint: Consider ::Y(t) := p for all t E R, and show that it is a maximal geodesic
passing through p with velocity 0.
Exercise 8.6. Consider M = R 2 with the standard smooth structure, and a con-
nection v' described via its connection coefficients in the global chart (R 2 , id)
(see Exercise 6.3, p.110), given by r~Y = r~x = 1, and all other r-symbols iden-
tically 0. Determine the geodesics. Given any two points p, q in the plane, is
there a geodesic that starts at p and passes through q? Plot, using the computer,
geodesics starting at p = (0, 0) in the direction v = (cos 0)ox,p + (sin 0)oy,p, for
various values of 0, say 0 = k· ~~, for k = 0, • • • , 23.
Consequently, using the facts above, and the Leibniz rule for the covariant
derivative along the curve 'Y o h, we obtain, for all s E J
("Vv.., h V-yoh)(s) = ("Vv.., h (h V7 (h(-))))(s)
0 0
d(h) .
= -ud (s) V7 (h(s)) + h(s)("Vv V7 (h(-)))(s)
'Y
0h
r(t) = t
lto
lg('Y(s))(v'Y,'Y(s),V'Y,'Y(s))I ds, t EI.
h(r) = ar+b, where a= 1/-Jicf and b = -t0 , then 'Yoh is also a geodesic,
which we refer to as the geodesic 'Y 'parametrised by arclength'. Note that
V-yoh,yoh(r) = ~~ (r) V-y,-y(h(r)) = av'Y,'Y(h(r)), so that the tangent vector of"( 0 h
has everywhere a 'unit length':
lg(('Y O h)(r))(v-yoh,-yoh(r), V-yoh,-yoh(r))I
= a 2lg('Y(h(r)))(v7 ,-y(h(r)), V-y,-y(h(r)))I = a 2icl = 1.
For lightlike geodesics 'Y: J-> M, we have that r(t) = 0, that is, the 'proper
time' experienced by a lightlike particle (photon) between any two events
along its worldline is zero. Thus to label each point along the lightlike
geodesic, the arclength parameter is to no avail. On the other hand, by
Theorem 8.2, we know that any affine linear transformation of the interval
I gives an alternative parameterisation. ◊
TE..th 2r R,ai
Hint: Assume that the light signal from the events at the start and end of the
circular orbit reaches the observer on Earth radially, and revisit Example 5.16.
For Global Positioning System (GPS) satellites, r ~ 27000km, giving
To,b;t - TEuth ~ 4 _5 X l0-10,
TEarth
We now give the calculation behind one of the first observational verifica-
tions of the spacetime viewpoint of gravity. In the Newtonian viewpoint,
the path JR 3 t >-+ ( t, x( t)) E JR x JR3 of light is a straight line, or 'light travels
in a straight line'. However, a light ray, from our spacetime viewpoint, is
just a lightlike/null geodesic. The region of spacetime in the vicinity of a
star is described reasonably well by the Schwarzschild spacetime1 . We will
now consider the path of a light ray, and show that its path is 'bent' by the
star. One can first photograph a certain celestial region, recording the light
rays emanating from distant fixed stars, measured at night, and when the
Sun is far away from the path of the light rays reaching the observer (so
that the bending effect on light due to the Sun's gravity can be neglected).
1 0r we imagine that the universe contains only the star and nothing else.
166 A Mathematical Introduction to General Relativity
------------ .
Observer
*
Dista nt
fixed star Sun
One photographs the same celestial zone during a solar eclipse2 and a dif-
ferent time of the year:
* ------------------ ~
◊~ · -------------
A comparison the two photographs should reveal a change in the apparent
position of the stars due to the bending of the light ray path by the Sun's
gravity. This angular deflection can be calculated by using the geodesic
equation, and the predicted value was actually measured during the 1919
solar eclipse.
2 The solar eclipse is relevant here only because during a solar eclipse, the stars are
visible, and can be photographed.
Geodesics 167
Proof. The first four equations follow immediately from the geodesic
equation expressed in the chart (U, cI>) by using the connection coef-
ficients listed in Example 6.8. The constraint equation is the conse-
quence of the causal character of 'Y being lightlike, that is, for all T E J,
g('Y(r))(v7 ,7 (r), v7 ,7 (r)) = 0, expressed in coordinates, and using the com-
ponent matrix for the metric given in Example 5.10. □
( 2ri- + r 2 *)r - r 2 r
r4/p2
(using (8.4))
r r-3m
= - r2¢>2 = __r_2_ (using (8.6))
1 1
= -- +3m-.
r 2 r
168 A Mathematical Introduction to General Relativity
Here, a light ray starts in the equatorial plane, from a fixed distant star.
So, we assume that the light ray comes in from r = +oo and leaves with
r = +oo eventually. We assume that the point of closest approach of the
light ray to the centre of the Sun is exactly equal to the radius R 0 of the
Sun, so that the ray just grazes its surface. We choose this point of closest
approach as (-R 0 , 0), lying in the equatorial xy-plane, as shown above. In
order to compute the deflection angle, we need to compute the angles with
the x-axis of the two asymptotes, that is, the azimuthal angle ¢ at r = +oo
when coming in, and the angle ¢ at r = +oo when going out. In both cases,
r = +oo, which corresponds to u = 0. Moreover, u(1r) = r{0 and ~¢ (1r) = 0
(r-minimisation/ u-maximisation condition).
Equation (8.7), namely u" +u = 3mu 2 (where -' = ~,t,), is a nonlinear
differential equation. Note that
1 1
u = - ,( -
r R0 '
where R 0 ~ 7 x 105 km, while 3m = 3M 0 ~ 4.5 km. In units where R 0 = 1,
we have E := 3m « 1. So the term 3mu 2 = w 2 is small in comparison with
u, and we can use a perturbative method to solve the differential equation
approximately. To this end, we first consider the differential equation
u~ + u 0 = 0, with u 0 (1r) = ~ 0 and u~(1r) = 0.
Geodesics 169
The general solution is u 0 ( ¢)=A cos¢+ B sin¢, and using the above 'initial'
conditions, the solution is the vertical straight line, parallel to the y-axis:
uo(c/>)=- cos¢_
R0
Now, for our original nonlinear equation,
U 11 +U = €.U 2 , (8.8)
we seek a solution u having the form u = u0 + f.u 1 + f. 2u 2 + • • • . Substituting
this in (8.8), we obtain by comparing terms in f. that
,, 2 (cos¢) 2 l+cos(2¢)
U1 + U1 = Uo = 2 = 2
R0 2R0
The initial conditions for u1 are u1 (1r) = 0 and Ui (1r) = 0. To find a solution,
we first try to find a particular solution by making the assumption that
u1,p=a+bcos(2¢), so that Ui,p=-2bsin(2¢), and ur,p=-4bcos(2¢). Sub-
stituting this in the differential equation, and comparing the coefficients of
1 and cos(2¢), we see that a=~ and b= -~- Thus u 1 ,p = 3 -~~t
2<1>l. We
add a homogeneous solution to u 1 ,p in order to satisfy the initial conditions
u1(1r) =0 and ui(1r) =0:
. 3-cos(2¢)
u1(¢) = A cos¢+ Bsm¢ + 2
6R0
Using u1(1r) = 0 and ui(1r) = 0, we get A=~ and B = 0. So
,1..) cos¢ 3-cos(2¢)
U1 ('I' = - - 2 + 2
3R0 6R0
Hence
,1..) cos¢ (cos¢ 3-cos(2¢)) 2
U ('I' = - - - + €. - -2 + 2 + € U2 + · · · .
R0 3R0 6R0
To find the approximate value of the incoming azimuthal angle ¢in of the
light ray, we set u( ¢in) = 0, and neglect the € 2 terms, giving the following
quadratic equation in cos ¢in:
3-(2(cos¢in) 2-l)) 0
-cos¢in
- - + € (cos¢in
--2- + 2 = .
R0 3R0 6R0
Defining c5in = ~ - ¢in> 0, we obtain
= 4 ( 3~0 -1 (
+ ✓ 3~0 -1)2+ 8 )-1.
170 A Mathematical Introduction to General Relativity
This prediction was confirmed by observation during the 1919 solar eclipse.
/~v\
~
Note that expP O = p. The following example gives some motivation for the
terminology.
Geodesics 171
Example 8.5. (Exponential map for the unit circle). In the Euclidean
plane IR. 2 , consider the unit circle 8 1 = {(x , y) E IR. 2 : x 2 + y 2 = 1}. Let
U denote the circular arc which is the image of the injective map given by
(-1r, 1r) 3 0 >-+ (cos0, sin0) E 81, and denote its inverse by ¢ . Similarly, let
V be the image of the injective map (-1r, 1r) 3 0 >-+ ( - cos 0, sin 0) E 8 1 ,
and denote its inverse by '1/J . Then it can be seen that A= {(U, ¢ ), (V, '1/J)}
forms an atlas for 8 1 . We consider 8 1 as a Riemannian manifold with
metric induced from the Euclidean inner product on IR. 2 , that is, for p E 8 1
and v, w E Tp81, g(p)(v , w) = ((vx , vy) , (wx , wy) ) 111.2 , where x , y denote the
coordinate maps given by 8 1 3 (a , b) >-+ x(a, b) := a, y(a, b) := b. In (U, ¢),
g(p)00 = g(p)(oe,p, 00,p) = (00,px) 2 +(00,py) 2 = (-sin0) 2 +(cos0) 2 = 1, for
p = (cos 0, sin 0) E U. Let v' be the corresponding Levi-Ci vita connection.
Using Proposition 6.3 (p.122), the connection coefficient in (U, ¢) is 0 = 0. ri
For v E B: = ( -1 , 1) c IR., consider the curve 'Yv : (-1r, 1r) --> 8 1 defined by
'Yv(t) = (cos(vt) , sin(vt)) for t E (-1r, 1r). Then 'Yv(O) = (1 , 0) =: p E U c 8 1 .
Moreover, we have 'Y!(t) =vt, so that i'!(t) =v, and v'Y v,P=voe,p- The curve
'Yv is a geodesic because
.. e re . e . e o o o
'Yv + 00 'Yv 'Yv = + = ·
Thus {voe ,p : v E B} c VP c Tp8 1 and the exponential map at p = (1, 0)
sends voe,p, VE B, to 'Yv(l) = (cosv , sin v) E 8 1 .
0 p
Identifying IR. 2 with C, and Tp8 1 with IR. (by sending IR. 3 v >-+ voe ,p E TpM),
we write expp(v)=(cosv,sinv)=cosv+isinv= eiv _ ◊
Proof. Let J = (a, b). Then we have ¾I=(%,~)- Set h(t) :=ct fort E ¾I.
Then µ = 'Y o h. As 'Y is a geodesic, it follows from Theorem 8.2 that so is
µ. Moreover, µ(0)=7(c0)=7(0)=p and vµ,,p=h(O)v'Y,P=cv. □
Corollary 8.2. Suppose that M is a smooth manifold with a connection
v'. Let p EM, v E TpM and c > 0. Let 'Yv: Iv--+ M be the unique maximal
geodesic such that 'Yv(O) = p and v'Yv,P = v. Let 'Ycv : Icv --+ M be the unique
maximal geodesic such that 'Ycv(O) = p and v'Ycv,P = CV. Then elev= Iv and
for all t E Icv, 'Ycv(t) ="fv(ct).
Proof. We use the Scaling lemma, applied twice with the roles of 'Yv
and 'Ycv swapped, and the maximality of the two geodesics. First, by the
Scaling lemma, 'Yv (c •) : ¾Iv --+ M is a geodesic passing through p at t = 0
with velocity at p equal to cv. The maximality of 'Ycv : Icv --+ M implies
1 Iv c Icv and 'Ycvl1. 1 = 'Yv(c ·). Also, starting from 'Ycv, it follows by the
C cV
We note that TpM ~ !Rm via the isomorphism i, v=viOxi,p ...!'.. (v1, • • • ,vm),
where (U, x) is an admissible chart containing p. Thus we may endow
TpM ~ !Rm with the usual Euclidean metric topology induced by the Eu-
clidean norm II· II- Recall from Example 2.3 (p.28) that the matrix relating
the components of a vector in TpM, with respect to induced bases from two
charts, is invertible. It follows that this topology on TpM is independent of
the admissible chart (U, x) containing p used to define the isomorphism i.
Geodesics 173
Theorem 8.3. Let M be a smooth manifold with a connection v', and let
p E M. Then 'Dp contains a neighbourhood V of O E TpM, and there is a
neighbourhood U of p E M such that expP Iv : V --+ U is a diffeomorphism.
Proof. By the previous Lemma 8.3, there exists a neighbourhood W of
0 E TpM such that W c 'Dp and expP Iw : W --+ M is smooth. We will show
that the derivative of this smooth map at 0 E TpM is invertible, and use the
inverse function theorem (Theorem 2.3, p.32). Recall that the derivative
map at 0 E TpM is a linear transformation d(expp)o : T 0 (TpM) --+ TpM.
As dimT0 (TpM) = dimTpM, it is enough to show surjectivity of d(expp) 0 .
Let v E TpM. Consider the straight line fv : JR--+ TpM given by fv(t) =tv.
For small enough t>0, tv E W, and expp(fv(t)) =expp(tv) ="ftv(l) ="fv(t).
(Here the notation 'Yw is used for the maximal geodesic in M such that
'Yw(0)=p and V-yw,p=w.) Then for any f E C 00 (M),
d(f o expp ofv)
((d(expp))o(vev,o))(f) = vev,oU o expp) = dt (0)
d(f o 'Yv)
= dt (0) = V-yv,Pf = vf.
Hence, (d(expp))o(vev,o) =v E TpM. As the chosen v E TpM at the outset
was arbitrary, we have shown that d(expp)o is surjective (and hence invert-
ible), as wanted. So by the inverse function theorem (Theorem 2.3), there
exists a neighbourhood V c W of 0 E TpM and a neighbourhood U of
p E M such that expP Iv : V --+ U is a diffeomorphism. □
174 A Mathematical Introduction to General Relativity
expP l l exp;l
-
/ "\
e,,L ]Rm
CE) V --+
0 el
TpM l-1
\. .)
3 The admissibility claim follows from the fact that x is a diffeomorphism (being the
composition of diffeomorphisms).
Geodesics 175
In Euclidean space JR3 (i.e., JR 3 endowed with the Euclidean inner product),
we know that the straight line segment between two points p, q, is also the
shortest path from p to q. Likewise, on the sphere in JR 3 considered as
a Riemannian manifold, the 'short arcs' of great circles give the shortest
path between the endpoints of the arc. In this section, we will give an
interpretation of timelike geodesics as being stationary for the proper time
'functional'. More precisely, we will show the following.
Geodesics 177
In order to prove this result, we will prove the necessity of the Euler-
Lagrange equations for maximising curves.
Proposition 8.5. (Necessity of Euler-Lagrange equations).
Suppose that a, b E JR, a< b, U c !Rm is open, and Sl c U x !Rm is open.
Let£ : {l --+ JR, U X ]Rm ::J {l 3 (o., /3) >-+ .C(o., /3) E JR, be a C 00 function.
Let p, q EU, and define the set
Opq := {µ: [a,b]--+ U I µ(a)=p, µ(b)=q}.
Define f : Opq --+ JR by
i b
a
o.C ..
o/3/µ*(t), µAt)) h'(t)dt
= - ib
a
d ( o.C ) . o.C . ,t=b
dt of3i(µAt),µ,*(t)) h'(t)dt+ of3i(µ*(t),µ,*(t))h'(t) t=a
= - f 1t(!~(µAt),µ,*(t)))hi(t)dt+O-O,
where we used the boundary conditions h(a) = 0 = h(b). Putting this ex-
pression into (8.9), we obtain that for all smooth h: [a, b] ---+ Rm satisfying
h(a) =O=h(b),
We will show that this implies the ith Euler-Lagrange equation. Suppose
on the contrary that there is some t 0 E [a, b] such that rp(t 0 ) # 0, where
Without loss of generality, suppose that rp(t 0 ) > 0. Also, by the continuity
of rp, we may choose t 0 in the open interval (a, b). Let 8 > 0 be such that
rp(t) > 'P~o) fort E (t 0 -8,t 0 +8). Take any nonnegative bump function h
such that h = 1 on (to - ! , to+!) and h = 0 outside (to - 8, to+ 8). Then we
get the contradiction that
0= ia
b
rp(t)h(t)dt =
rto+.S
Jto-.s rp(t)h(t)dt >
rto+! (t ) (t )
Jo-% rp / ldt > rp / 8 > 0.
This completes the proof. □
s(t) = i a
t di di
-gij(µ.(t)) ):• (t) dt (t) dt, tE[a,b].
Curvature
181
182 A Mathematical Introduction to General Relativity
The (1, 3)-tensor field R, called the curvature tensor field, is defined by
R(O, X, Y, Z) = O(R(X, Y)Z)
= O(v'xv'yZ - v'yv'xZ - v'[x,YJZ)
for X, Y, Z E TJ M and n E Tf M. When (M, g) is a semi-Riemannian man-
ifold with the Levi-Civita connection v', we call R the Riemann curvature
tensor field.
Lemma 9.1. Let M be a smooth manifold with a connection v'. Then for
all vector fields X, Y, Z E TJ M, we have R(X, Y)Z = -R(Y, X)Z.
Curvature 183
In a chart (U, x), the covariant curvature tensor field has components
Rijk£ = g£rRijk, as
We now show the second Bianchi identity, which involves derivatives, and
bears some resemblance to the first Bianchi identity seen earlier. First we
note that as R is a (1, 3)-tensor field, given any V E TJ M, its covariant
derivative v'v R is a (1, 3)-tensor field. For n E Tf M and X, Y, Z E TJ M,
(v'vR)(O,X,Y,Z)
= V(R(O,X, Y,Z)) - R(v'vO,X, Y,Z)
-R(O, v'vX, Y,Z) - R(O,X, v'vY, Z) - R(O,X, Y, v'vZ)
= V(O(R(X, Y)Z)) - (v'vO)(R(X, Y)Z)
-O(R(v'vX, Y)Z) - O(R(X, v'vY)Z) - O(R(X, Y)v'vZ)
= O(v'v(R(X, Y)Z)-R(v'vX, Y)Z-R(X, v'vY)Z-R(X, Y)v'vZ).
186 A Mathematical Introduction to General Relativity
0 = (VaxhR)(dx\oxk,Oxe,Oxi)
+ (VaxkR)(dxi, Oxe, Oxh, Oxi)
+ (V oxeR)(dxi' Oxh, Oxk' Oxi)
= (VaxhR)i,j + (VaxkR)}hj + (VaxeR)hkj·
This completes the proof. □
188 A Mathematical Introduction to General Relativity
We have seen that the curvature tensor field Risa complicated (1, 3)-tensor
field, but by contracting it, simpler tensor fields are obtained.
We can further take the 'trace' of Ric~, i.e., operate by C, and obtain a
C 00 ( M)-function.
We note that for fixed Vi, · · · , Vs-1 E TJ M, the map 'I' given by
Tf MxTJM 3 (n, V) L (v'vT)(n", Vi,···, Vs_i)
is a (1, 1)-tensor field (the linearity in V follows from property (C3) of the
connection, and the linearity inn is a consequence of the C 00 (M)-linearity
of -"~and of v'vT in its first argument). So the contraction map C can act
on T.
4 After Erich Kretschmann (1887-1973), a German physicist. In Chapter 14 on the
field equation, Exercise 14.1 shows that 'vacuum spacetimes' (spacetimes for which the
energy-momentum tensor field T = 0 and the cosmological constant is 0) are 'Ricci-flat'
(Ric= 0). So the scalar curvature S = 0 for vacuum spacetimes. But the Kretschmann
scalar can then be useful since it may be still nonzero. For example, the Schwarzschild
spacetime is Ricci-flat, but the Kretchmann scalar is nonzero; see (15.4).
190 A Mathematical Introduction to General Relativity
Proof. Let (U, x) be any admissible chart. Then by the second Bianchi
identity, we have (after replacing h by i, and summing over i)
(9.6)
As contraction commutes with the covariant derivative, we obtain using the
symmetry property of R (Lemma 9.1) that
C~(v' Oxk R) = v' Oxk (C~R) = -v'axk (CtR) = -ct (v' Oxk R),
and so
(v'axkRic)tj = (v'axk(CtR))tj = (CUv'axkR))tj
1 i
= -(C2(v'axkR))tj = -(v'axkR)lij•
Moreover, (v' Oxl R)~kj = (CH'v' Oxl R))kj = (v' Oxl (Cf R))kj = (v'Oxl Ric)kj·
Using these in (9.6), and multiplying by gkr, we obtain
0 = gkr(v'ax;R)tlj - gkr(v'axkRic)tj + gkr(v'axlRic)kj·
Using (v'. Ric)U = v'. (Rieu) (Exercise 6.24, p.130), we have
gkj (v' Oxl Ric)kj = gkj (v' Oxl Ric)(Oxk' Oxi) = (v'Oxl Ric)( (dxi)U' Oxi)
= (v'i)xl Ric)U (dxj' Oxi) = (v'i)xl (Rieu) )(dxj' Oxi)
= (v'axl (Rieu)){
Using also the symmetry of Ric, (v'axkRic)tj = (v'axkRic)jt, and so
(div Ric)£ = (div Ric)(oxt) = (v' axk Ric)( (dxk)U, Oxt))
= (v' i)xk Ric)(gkj Oxi, Oxl))
= gkj (v'Oxk Ric)(Oxi' Oxl) = gkj (v'Oxk Ric)(oxt, Oxi)
k.
= g 1(v'axkRic)£j· (9.7)
Curvature 191
Thus
k"3 . k"
0= g (v'a.,;R)l,ej - g 3 (v'a.,kRic)ej + g k"3 (v'a.,£Ric)kj
= gki(v'a.,;R)1ej - (divRic)e + (v'a.,£(Ricu))1
= gki(v'a.,;R)1ej - (divRic)e + Cv'a.,£(Ricu)
= gki(v'a.,;R)1ej - (divRic)e + v'a.,£C(Ricu)
= gki(v'a.,;R)1ej - (divRic)e + v'a.,£S
= gki(v'a.,;R)1ej - (divRic)e + DxtS
= gki(v'a.,;R)1ej - (divRic)e + (dS)e-
Finally, we will show that the first summand on the right-hand side of the
last equality is -(divRic)e, completing the proof. Since R = CHR ® g),
v'.R=CH(v'.R) ®g), we obtain
(v'a.,;R)pqrs = (v'a.,;R)!qrgst, and gsu(v'a.,;R)pqrs = (v'a.,;R);qr·
Using the skew-symmetry of R in its last two entries, it follows that
(v' o.,; R)kfjs = - (v' o.,; R)kfsj, and so
kj i kj is ~ kj is
g (v' o.,; Rhej = g g (v' o.,; R)kfjs = -g g (v' o.,; R)kfsj
~
= -g kj g is gjr ('7 vo.,; R)kkfs
vo.,; R)rkfs = -gis(-7
= -gis(Ctv'a.,;R)es = -gis(v'a.,; (CtR))es
= - g is ('7
v o.,;
R.lC) fs (9.=7) - (d"IV R"lC)f.
In this section, we will consider the FLRW spacetime (M, g), as described
in Example 5.9 (p.89), where M = I x l~.3, and I= (0, oo ). In the chart
(M, idixIRa ), the chart-induced component matrix G = [gij] of the metric is
the diagonal matrix with diagonal entries -1, a 2, a 2, a 2. Now we will take
a special5 a: J-+ I, namely
2
a(t) = ta, tE (O,oo).
We will first calculate the scalar curvature, and note that the chart-
coordinate t has a geometric meaning. Then we will derive a 'cosmological
version' of the red-shift.
Thus the t-coordinate in the global chart (M, id1 xJR3) can be expressed in
terms of the geometric quantity S. We call this the 'cosmological time'.
t(p) = ~ f o r allpE M,
We note that as t(p) -+ 0, S(p) -+ oo, and so the scalar curvature blows up
as we go back in cosmological time (and we think of this as approaching
the 'big-bang singularity').
Geodesics in FLRW
Let J c JR be an open interval, and suppose that 'Y : J -+ M is a lightlike
geodesic. In the global chart (J x JR3 , id), write ,y(s) = (t(s ), x(s ), y(s ), z(s) ),
s E J. As 'Y is lightlike, g('Y(s))(v.,,,-y(s),v-y,-y(s))=0 for alls E J, giving
t
T
EA= -g(A)(vµq,A,V, ,A )
✓- g(A)(vµq ,A, Vµq,A)
= - g(A)(ot ,A , t' (sA)Ot ,A +x'(sA)Ox,A +y'(sA)Oy,A + z'(sA)Oz,A) = t' (SA)-
✓- g(A)(ot,A, Ot,A)
Consequently,
EA = t'(sA) = a(t(sB)) = a(T) = (T)f > l
EB t'(sB) a(t(sA)) a(t) t ·
Thus the frequency, when received by a 'comoving' observer along the
galaxy worldline µp (imagine this to be us), is shifted towards the red
end of the spectrum (i.e., lower frequency), as compared to what is mea-
sured by a comoving observer with galaxy µq. This reduction in frequency,
described by the quantity
EA
Z=--1=
EB
is called the cosmological red-shift.
Exercise 9.13. The age of the universe is estimated to be T = 13.8 billion years.
If the light from a distant galaxy, reaching us now, has the cosmological red-
shift z = ½, then determine how long ago the light was emitted. (This exercise
assumes that the Einstein-de Sitter model is the correct model for our universe.
As mentioned earlier, this is approximately correct from the Big Bang until today,
but not at all in the far future.)
M
i\
V.
r
We already know that the velocity vector field V(s, •) along a main curve
fs is a vector field along fs, that is, V(s, ·) E TJrs for each s E (-E, E).
Similarly, any transverse curve I\ has its velocity vector field J (·, t) E Tc} I\
for each t E J. Moreover, we have the following.
Lemma 9.2. We use the same notation as in Definition 9.7. We have
(1) V(·, t) E TJI\ for each t E J.
(2) J(s, ·) E TJrs for each s E (-E, E).
1~
(3) v' J(•,t) V(-, t) E T0 ft.
(4) v'v(s,-)J(s, ·) E TJfs.
(5) v'v(s,-)V(s, ·) = 0 for alls E (-E,E).
Proof. Below, (U, x) is any admissible chart containing the point f(t, s).
o(x O r)i o(x O r)i .
(1) We have vr.,r.(t) = ct (s, t) Oxi ,r,(s)· As ct (·, t) 1s smooth,
(-E,E) 3 s ~ V(s, t) E TJI\.
. o(x O r)i
(2) Analogous to the above, by observmg 08 (s, •) E C 00 (J).
(3) Follows from (1).
(4) Follows from (2).
(5) Since fs is a geodesic, we have v'v(s,·)V(s , ·) = 0 for alls E (-E, E). □
Curvature 197
Exercise 9.14. With the notation used in Definition 9.7, show that
(1) l 3 t >-+ (V J(•,t) V(·, t))(s) E TJr. for each s E (-E, E).
(2) (-E, E) 3 s >-+ (Vvcs,-)J(s, ·))(t) E TJi\ for each t EI.
Theorem 9.1.
Let (M, g) be a semi-Riemannian manifold with the Levi-Civita connection
'v, I c JR. be an open interval, f > 0, and r: (-1:, 1:) x I-+ M be a family
of geodesics. Suppose that there exist vector fields J, V E TJ- M such that
Jqt,s) = J(t,s) and Vr~,-?] ': V(t,s), for all (s,t) E (-1:,1:) x I. Then we
have7 'vv'vvJ = -(R(J, V)V) or on (-1:, 1:) x I.
((dr)(s,t)(Ot,(s,t)))f = Ot,(t,s)U O
au or) (s, t)
r)(s, t) = ot
d(f or.)
= dt (t) = vr.,r.(t)i = V(s, t)f.
Similarly, J(s,t) = (dl')(s,t)(os,(s,t))- Secondly, for (s,t) E (-1:,1:) x I, and
f EC 00 (M),
((V f) 0 r)(s, t) = (V f)(r(s, t)) = Vqs,ti! = V(s, t)f
= ((dl')(s,t) (ot,(s,t)))(f) = ot,(s,t) (for)= (ot (for) )(s, t).
Similarly, ((]!) 0 r)(s, t) = Os,(s,t)U Or)= (o.(f Or))(s, t). Hence for all
(s, t) E (-€, t:) XI and all/ E C 00 (M),
[], V]r(s,t)f = Jqs,t)(Vf)-Vqs,t)(]f) = J(s,t)(Vf)-V(s,t)(]f)
= ((dl')(s,t)(Os,(s,t)))(Vf) - ((dl')(s,t)(Ot,(s,t)))(]f)
= Os,(s,t)((Vf) 0 r) - Ot,(s,t)((]f) 0 r)
Theorem 9.1, relating the geodesic deviation with the curvature operator R,
can be used to explain the notion of tidal forces in the spacetime context.
We first adopt the classical Newtonian perspective. Consider any two con-
stituent particles of an extended body in an inhomogeneous gravitational
field. The resulting different accelerations lead to internal stresses in the
body, which are called tidal forces. Imagine for example a long rod falling
in the gravitational field of the Earth as shown in the picture below.
A B
8 This is guaranteed if J(O,O) Ev.land ('v'v(o,-)J(O, -))(0) E v.l; see Exercise 9.15.
200 A Mathematical Introduction to General Relativity
Fv is well-defined, i.e., Fv (w) does belong to v_1_ for each x E v_1_, because
where we have used [or, Ot] = 0. Using the connection coefficients noted in
Example 6.8 (p.124), we have v' 0,ot = r;tor = -;!½(1- 2;,")or, and so
2m 6m 2 ) m( 2m) r 2m 5m ) 2
= ( - -;y+7 or+ r 2 1---;:- rrror =
(
--;y+--;:r- or.
Also,
2m 5m m )o
(-----;:F"+--;:r-,:r
2 2
1 Fvur,p=
Consequenty, :i r,p 2m :i
-(l- 2;,") =~ur,p·
Exercise 9.17. Suppose that an astronaut can withstand an acceleration per unit
length of 100 s- 2 along his/her height. Consider the Schwarzschild spacetime,
which in this exercise, is assumed to model the spacetime in the vicinity of a
black hole. Determine the minimum mass m of a black hole, in terms of multiples
of solar masses, that a radially aligned astronaut can tolerate arbitrarily close to
the 'event horizon', i.e., r=2m+o, o>O arbitrary. Suppose that the astronaut is
an instantaneous observer with v = Ot,p E TpM as in Example 9.2. The mass M0
of the sun is M0 ~ 5µs= 5 x 10- 5 s (in units such that GN =c= 1).
1 ~ i ~ m, 1 ~ j ~ n,
(9.10)
1 ~ j ~ n.
Then there exists an open set U c V containing p such that there exists a
unique C 00 solution Y 1 , · • · , yn : U-+ JR. to (9.10).
Before giving the proof, we note that if there were a solution, then clearly
the Schwarz theorem on the exchangeability of the order of differentiation
for smooth functions implies
which yields the consistency condition. The content of the theorem is that
this is the only obstruction to the solvability of (9.10).
and the consistency condition is trivially satisfied. Then the usual result 12
on existence and uniqueness of a smooth solution guarantees a solution in
some neighbourhood of p.
12 See [Apostol(1969), Thm. 7.19, p.229] for the continuously differentiable (C 1 ) version.
The C 00 version can be obtained by successively differentiating the ODE and applying
the C 1 result.
Curvature 203
Now suppose that the statement is true for m=k-1. Let m=k. We write
p = (p 1 , ... , pk). Consider the following (k-1)-variate problem in the hyper-
plane H={(x1,---,xk) E ]Rk: xk= pk}:
oYk (X 1, • · • , X k-1) =
~
f ij (X 1, · · · ,X
k-1 , Pk , yl
H, .•. ,
yn)
H (9.11)
ux'
(1,;;;i,;;;k-1, 1,;;;j,;;;n)
(1,;;;j,;;;n). (9.12)
By the induction hypothesis, this has a unique smooth solution in the neigh-
bourhood of a compact subset of H given by
K 1 ={(x 1,··· ,xk-l , pk): lxi-pil,;;;81 , 1,(i,(k-1} ,
for some 81 >0.
(9.14)
By the ODE result on existence and uniqueness, this problem also has a
unique solution in a neighbourhood of
K 2 : = {( X~1 , · · · , X~k-1 , X k) : IX k - p kl ,( u2
s: } ,
for some 82 > 0. The smooth dependence on initial conditions 13 implies that
Yl, · · · ,YE depend smoothly on the parameters x1 , · · · , xk. A compactness
argument can be given to arrange a uniform 82 > 0 which works for all lines
emanating from all initial conditions (x1, · · · , xk-l ,pk) E K1. Let
U:= {(x1, · · · , xk) E ]Rm: lxk -pkl < 82, lxi -pil < 81 for 1,( i ,( k-1} 3 p.
13 See e.g. [Hartman(2002) , Thm. 4.1, p.100].
204 A Mathematical Introduction to General Relativity
It now follows from (9.15), and the uniqueness of solutions to the initial
value problem for OD Es, that T/i must be the trivial solution, that is, T/i = 0,
as wanted. □
Exercise 9.18. Consider the smooth manifold M = IR.3 with the standard smooth
structure. Show that a 1-form field n E Tf M is the gradient df of some smooth
function f E C 00 (M) if and only if the 'curl of n~' is zero, that is,
oyOx = oxOy, ozOy = oyOz, oxnz = oznx,
where n is decomposed as 0=0xdx+Oydy+Ozdz in the global admissible chart
(IR.3, (x, y, z) >-+ (x, y, z)).
Proof. The 'only if' part is trivial, and was noted in Exercise 6.15 (p.125).
We now prove the 'if' part. Let p EM, and v1 , • • • , Vm be a basis of TpM.
Step 1. We first show that there exist vector fields Vi,••• , Vm defined in
a neighbourhood U of p such that for all 1,:;;; i,:;;; m, we have (½)p = Vi,
v'.½ = 0, and for all q EU, {(V1)q, · · ·, (Vm)q} is a basis for TqM. To do
this, we first take an admissible chart (V, x) containing p, and use it to
write the as-yet-undetermined vector fields in terms of their components.
Fix an r E { 1, • • • , m}. Then the conditions v'. Vr = 0 and (Vr )p = Vr give the
following initial value problem for the component functions V/, • • • , vrm:
the various r) that contains p. Next we will show that for each q E U,
(Vi)q, • • • , (Vm)q form a basis for TqM. It is enough to show that they are
linearly independent. Let c1 , • • • , cm Ell be such that ci(½)q = 0. Consider
the vector field W = ci ½. This solves the equation v'. W = 0 with the initial
condition Wq = 0. But from the calculation done above, this is equivalent
to an initial value problem
:i
Ux'
· Wi. -- -
risi ws , l -=::,
.,,,. z,· J· .,,,.
-:::, m,
{
w~ = 0, 1 ,:;;; i ,:;;; m.
And just as done above, again the consistency conditions are satisfied due to
the vanishing of R. The Frobenius theorem then implies that the solution
is unique, implying that W = 0 in a small neighbourhood of q. In fact, the
subset ff:= {q E U : Wq = 0} is nonempty (q E ff), and (using the above
argument) open. Also the set U\ff consisting of the set of points where
the evaluation of W is a nonzero tangent vector is open (this is true for
any smooth vector field, since the component function are smooth, and in
particular, continuous). Thus we have a decomposition U = ff u (U\ff) as
a disjoint union of open sets. By shrinking U at the outset of necessary, we
could have assumed that U is connected, and thus we are able to conclude
that the nonempty set ff must be all of U. In particular, ci(½)v = 0, that
is, civi = 0, and so c1 =•••=cm= 0 (since v1 , • • • , Vm was a basis for TpM).
Step 2. In this step we will construct a new chart (U, y) containing p,
which will be the desired affine chart. To do this, we will first construct
I-form fields Oi from the ½ of Step 1, and then solve a system of differential
equations involving the components of the Oi in order to find the coordinate
functions yi. For each q EU, let (0 1 )q, • • • , (nm)q be the basis for (TqM)*
which is dual to the basis (V1 )q, • • • , (Vm)q for TqM.
Claim: For 1 ,:;;; k,,;; m, U 3 q >-+ (Ok)q E (TqM)* defines a I-form field
ni E rpu.
Firstly, since the vectors (V1 )q, • • • , (Vm)q form a basis for TqM, the matrix
M(q) := [((°Vj)q)'] is invertible for all q EU Also, as q >-+ ((Vj)ql = (V/)(q)
are smooth maps, the entries of M(•)- 1 belong to C 00 (U). Finally, we
have 8j = (Ok)q(Vj)q = (Ok)q(((Vj)qloxe,q) = ((Vj)ql(Ok)q,£, showing that
(Ok)q,t=(M(q)- 1 )r So q >-+ (Ok)q,£ belongs to C 00 (U), and nk is smooth.
Also, v'.Ok=0. To see this, we note that for all WE TJU,
o = W(8f) = v'w(Oi(½)) = (v'wOi)½ + ni(v'w½)
= (v'wOi)½ + ni(o) = (v'wOi)K
Curvature 207
But for each q, (V1 )q, • • • , (Vm)q forms a basis for TqM. Consequently,
v'wf2i =0, l~j~m.
Now we seek real-valued smooth functions yi, 1 ~ i ~ m such that
(9.17)
(We do not specify an initial condition for yi(p), 1 ~ i ~ m, which can be
taken as an arbitrary m-tuple.) Again, using the Frobenius theorem, this
is possible if Oxk n} = Oxi ni, 1 ~ i, j, k ~ m. But this is true, and can be seen
as follows. Firstly,
Oxkn} = v'a.,k(ni(Oxi)) = (v'a.,kni)(Oxi) +niv'a.,kOxi
e
= O(Oxi) + ni (rjkOxi) e i
= rjkne.
Similarly, Dxint = rf,jn~. By the torsion-freeness of the Levi-Civita con-
nection, rf,j = r;k. Hence the consistency condition in Frobenius theorem
is satisfied for all i,j, k. We will continue to denote the possibly smaller
subset of U containing p on which the functions yi exist, still by U. Define
the matrix
N := [oxiYi] l
C9 7 l [n{].
Then N is pointwise invertible in U since (!1 1 )q, • • • , (nm)q form a basis
for (TqM)* for each q EU. Hence the Jacobian of the 'transition function'
yo x- 1 is pointwise invertible on x(U). Also, as N is smooth on U, it
follows that yo x- 1 is smooth on x(U). By the inverse function theorem,
and by shrinking the neighbourhood U of p if necessary, it follows that
yo x- 1 : x(U) -+ (yo x- 1 )(x(U)) = y(U) is a diffeomorphism. Thus
y =yo x- 1 ox : U -+ JR.m is a diffeomorphism onto the open set y(U).
Hence (U, y) is an admissible chart for M. We have
k f fk k k .
8j = n (½) = nkdx (l'j Ox£)= nk l'j <>e = nk l'j = l'j OxiYJ,
i i i i i
Form fields
10.1 k-forms
209
210 A Mathematical Introduction to General Relativity
Clearly the action of a k-form with k;;,:: 2 on k vectors, any two of which
are identical, is 0. This also true if the k vectors are linearly dependent. In
particular, if Vis m-dimensional, then if k > m, every k-form w is 0, and
so ;\kV*= {0} for all k>m.
We now develop notation and collect preliminary facts on permutations,
which will be needed in the proofs.
We recall that a bijective map 1r : { 1, • • • , k} --+ { 1, • • • , k} is called a
permutation. We specify a permutation by writing the image 1r(i) below
i E {1, • • • , n} in a row as follows:
boils down to showing that the identity permutation cannot be written as a product
of an odd number of transpositions. Each element of the set must appear in an even
number of transposition factors of the identity (to be able to return back to itself).
Form fields 211
Let us check that Alt T E /\kV*. Indeed, for v1, •••, Vk E V, we have
• ~~71" •
(sign 1r)(Alt r)( v.,,.(1), · · ·, V1r(k)) = ~ ~ (s1gna) r(V1r(a(l)), · · ·, V1r(a(k)))
. ues.
sign1r .
= ~ ~ (s1gna)r(v.,,. 0 u(l),···,V.,,.ou(k))
. ues.
= :, ~(sign(1roa))r(v.,,. 0 ,,.(1),···,V.,,.oa(k))
. ues.
= t, ~
. µ.es.
(signµ)r(vµ(l),···,Vµ(k))
= (Altr)(v1, · · ·, vk)-
since for eachµ E Sk, there is a unique a E Sk such thatµ= 1r o a, namely
(T = 71"-10 µ.
We now list some useful properties of alternation, which will be used later.
212 A Mathematical Introduction to General Relativity
= (-ll£Alt(0©w)(v1,··· ,Vk+e)-
For a, b E JR= /\0 v*, we set a" b = ab. For a E JR=/\0 v* and w E /\kV*,
we define aAw=wAa:=aw.
□
Form fields 215
Proof. We have
(k+f)! (k+f)! kl kl
w I\ 0 = JJ:e!Alt (w ® 0) = JJ:e!(-1) Alt (0 ® w) = (-1) 0 AW. □
As w is alternating, we have
w = Altw = w( ei 1 , • • • , eik) Alt (€i 1 ® • • • ® ik)
1 i1
= w (ei 1 , • · • , eik
) k! € /\ • · • /\ €
ik
.
Now it can be seen, using Proposition 10.2, that the summands with some
ir equal to some is, with r-# s, will be zero. In a summand where the
i1, · · · , ik are all distinct, we can write €i 1 /\ • . . A f.ik = ( -1 lf.ii /\ ... /\ f.ik
for some e E N, with j1, · · · , Jk a permutation of ii,··· , ik and such that
J1 < · · · <Jk· This shows that B spans;\ kV*.
216 A Mathematical Introduction to General Relativity
l
Bis independent: Suppose that we have c:; 1 ··•ik 1:i 1 A· · ·A 1:ik = 0. Then acting
both sides on the tuple (ej 1 , • · • , eik), with 1 ~j1 < · · · <jk ~m, yields
l
Ei1 ( ei1) ... Ei1 ( eik)
=:t..5
We give an inductive argument (on k) to show that the determinant det b. 5
in a summand is nonzero only if J := (ii, ... ,ik) = (j1, ... ,jk) =: J. If
5
k = l, then clearly det b. =I= 0 implies I= J. Suppose that the claim is true
for some k. Then for a matrix of size k+l, the top row must be nonzero,
so that ii= JR. for some£. Expanding the determinant along the top row,
0 =I= det b.5 = (-1)£ det b.5',, where I', J' are defined by J' := (i2, · · · , ik+1),
J' := (j1, · · · ,Jt-1,JR.+1, · · · ,Jk+1)- Thus I'= J' by the induction hypoth-
esis. Now if £ > 1, then we get the contradiction that ii = JR. > j1 = i2.
So i 1 = j 1, and then together with J' = J', we obtain J = J. Moreover, if
I= J, then clearly b.5 is the identity matrix, and det b.5 = l. So we obtain
0 = Ci 1 , .. ik 1:i1 A· · · A1:ik ( eiu · · · , eik) = Cfr .. ik. Thus B is independent. □
A k-form field is just a (0, k)-tensor field non a smooth manifold M whose
pointwise evaluations at each point p EM is a k-form n(p) E /\k(TpM)*
on TpM. But then for all p EM, all permutations 71' of {1, • • • , k} and all
vector fields V1, · · · , vk E TJ M,
(sigll'lr)(n(V,,.(1), · · · , V,r(k)))(p) = (sigll'lr)n(p)((V,,.(1))p, · · · , W1r(k))p)
= i1(p)((Vi)p, · · · , (Vk)p)
= (!1(Vi, .. · , Vk))(p),
and so we can equivalently define a k-form field as follows.
Definition 10.5. (k-form field).
Let M be a smooth manifold. For 71' E Sk and TE T2M, define 7rT E T2M
by (7rT)(V1, .. · , Vk) =T(V1r(l), .. · , V,r(k)) for all Vi, .. · , vk E TJM.
Let k ;;,:: 2. A k-form field is a (0, k)-tensor field n E T2M which is
alternating, that is, for all permutations 71' E sk' n = (sign 71') 71' n.
Form fields 217
(0 + n) A e = 0 A 8 + n A e and e A (0 + n) = 8 A O+ 8 A n.
Thus the wedge product is bilinear over C 00 (M). We now show that the
wedge product is associative.
Form fields 219
= ~ (sigll7r)i1 1(V1r(1))···!1n(V7r(n))
1rESn
= det[n\½)].
□
We have the following consequence of Lemma 10.3.(3).
Proposition 10.6. Let M be a smooth manifold, n E {lk M and 8 E n£ M.
Then n /\ 8 = (-l)k£(8 /\ !1).
Proof. We have
(k+f)! (k+f)! kl kl
n A8=/Je!Alt(i1®0)=/Je!(-1) Alt(8®i1)=(-1) e/\n. □
for some functions Wii•••ik E C 00 (U). The functions Wii•·•ik are actually
ni 1 ··•ik. Indeed, recall first the definition of ni 1··•ik and note that a re-
arrangement of indices produces the sign of the permutation used, as n
is alternating. The same factor is picked while rearranging the indices in
dxi 1I\ ... I\ dxik. Finally, each term in the rearranged sum collects k! terms
in the original one. In particular, if M is m-dimensional, then every m-
form field n E nm M can be decomposed in an admissible chart (U, x) as
n = W dx 1 I\""" I\ dxm, where W = n(ox1,""" , Ox-m) E C 00 (U).
Exercise 10.10. Consider the smooth manifold M =R 3 with the standard smooth
structure, and the global admissible chart (R3 , (x, y, z) >-+ (x, y, z)). Suppose that
n := xdx+ydy+zdx E 0 1 M, 8 := ydx+zdy+xdz E 0 1 M. Determine rlA 8.
Exercise 10.11. Let M be a smooth manifold. In an admissible chart (U, x),
suppose that fl := j dxi 1 I\••• I\ dxik, for a function j E C 00 (U) and a fixed
k-tuple I= (i1, · · · ,ik), where l ,;;;;i1 < · · · <ik ,;;;;m. Show that if J = (j1, · · · ,jk),
with l ,;;;;j1 < · · · <jk ,;;;;m, then
J = I,
flil···jk =
{ Qf if
if J =f. /.
Pull-back
We had seen that if / : M -+ N is a smooth map between smooth manifolds
M and N, then a (0, k)-tensor field TE Tf N can be pulled back under f
to a (0, k)-tensor field f*T on M. If n E Ok N, then f*!1 E Ok M. Indeed,
for all Vi, ... ' vk E TJ M, any permutation 7r E sk, and all p EM, we have
((!*!1)(V1r(l), · · · , V1r(k)))(p) = !1(f(p))(dfp((V1r(1))p), · · · , dfp((V1r(k))p))
= (sign1r)n(f(p))(d/p((Vi)p),· .. ,dfp((Vk)p))
= (sign1r)((f*!1)(Vi, · · · , Vk))(p),
and so (f*i1)(V1r(l), · · ·, V1r(k)) = (sign1r) (!*!1)(Vi, · · · , Vk)- Consequently,
f*!1 E f2kM.
Form fields 221
where the product rule gives the last equality, and S is given by
k
s = Li (oxer yir )(Oxi OyirXir) dxi I\ dxi 1I\••• I\ dxir-l I\ dxlr I\ dxir+l I\•• • I\ dxik,
r=l
In the previous exercise, we note that for some form fields the exterior
derivative vanishes, while it does not for others. This prompts the first of
following definitions.
Proof. Let (U, x) be any admissible chart for M. Using Notation 10.1,
d(dn) = d((Oxin1)dxiAdx 1 )
. . I
= (OxiOxinJ)dx 3 Adx'Adx
. . I
= (OxiOxin1)dx 3 Adx'Adx
. . I
= -(OxiOxin1)dx'Adx3 Adx
= -d(dn),
where we used [ox;,Oxi]=O, and dxiAdxi = -dxiAdxi. So d(di1)=0. □
Exact form fields are closed, but the converse may not always hold.
224 A Mathematical Introduction to General Relativity
However, we now argue that n is not exact. Suppose, on the contrary, that
there exists an f E C 00 (M), such that O=df =oxfdx + oydy. Then
= 171" (
0
2 (- sin t)
(cost) 2 +(sint) 2
. (cost)
- - - - - - ( - s m t ) +------(cost) dt
(cost)2 + (sint) 2
~
Exercise 10.16. Consider the smooth manifold N =R 2 with the standard smooth
structure. Show that every closed 1-form field n E 0 1 Mis exact.
Form fields 225
We also know that we can create a vector field from a smooth function
f : R 3 -+ R by taking its 'gradient'
of
ox
-+ of
v' f :=
oy
of
oz
We note that the exterior derivative off En° M is the I-form field
of of of
df = ox dx + oy dy + oz dz,
which has the same components as V f.
Let V be a vector-valued function with the component functions given
by (Vx, VY, vz). We also recall from multivariable calculus that the dif-
ferential operator V x, the curl, acts on V to produce the vector-valued
function
ovz oVY
-----
oy oz
ovx ovz
-----
oz ox
oVY ovx
-----
ox oy
If we consider the 1-form field
0:= vxdx + VYdy + vzdz
using the components of V, then its exterior derivative d n is the 2-form
field given by
dO = d(Vxdx+ VYdy+ Vzdz)
ovx ovx oVY oVY
= ajidy A dx + Tzdz A dx + ox dx Ady + Tzdz Ady
ovz ovz
+ ox dxAdz + oy dyAdz
ovz oVY ovx ovz oVY ovx
= (----)dyAdz+ (----)dzAdx+ (-----)dxAdy,
~ ~ ~ fu fu ~
which has the same components as V x V. We recall that the divergence
map V• in multivariable calculus acts on a vector-valued function V, and
produces a smooth function,
-+ -+ ovx oVY ovz
v'·V=-+-+-.
ox oy oz
Form fields 227
Pull-back
Pulling back commutes with exterior differentiation.
Then it follows from Exercises 10.12, 3.17 (p.53) and 3.18 that
(10.2)
Proof. Let us call the right-hand side 8(V1, · · · , Vk+1), and the two
summands as 81(Vi, · · · , Vk+1) and 82(V1, · · · , Vk+1)- We note that
8 : TJ-M x • • • x TJ-M -> C 00 (M) is R-multilinear. We will show that
it is in fact also C 00 (M)-multilinear. For 1:::;;£:::;;k+l, and f E C 00 (M),
81(Vi, ... '!½, ... 'Vk+1)
= (-1l- 1(J½,)(O(V1,··· ,½,··· ,Vk+1))
+ ~ (-l)i- 1V;(O(V1,··· ,f½,··· ,¼,··· ,Vk+1))
i<i~k+l
Similarly, to compute 82(Vi, •••, J½, •••, Vk+1), we split the sum into the
parts when i=f, when j=f, and the rest (and the f factors out from this
last part). When i = e or j = e, we use [!½, ½] = ![½, ½] - (½!)½ and
[¼,J½]=f[¼, ½]+(¼!)½,. Thus
l
Then di1=ox;f dxiAdx 1. So we have
(dn)(oxii, """, Oxik+1) = (ox;J)(dxi I\ dxi 1 I\ """ I\ dxik )(oxii," "", Oxik+l)
DiJl ·--6~Jk+l
6i1 ... 6~1
= (Oxi J) det [ ~1 J~+i .
where the second equality follows by the Leibniz rule and by noting that
we have dx 1(Dxii, · · ·, oxi•+i) = a constant. D
So if n E S1 1 (M), then
(dn)(X, Y) =X(n(Y)) - Y(n(X)) - n([X, Y]),
for all X, YE TJM.
While the exterior derivative increases the degree of the form field, we now
learn about the operation of interior multiplication by a vector field V which
decreases the degree. Sometimes interior multiplication by V is referred to
as the interior derivative with respect to V, because it also satisfies a type
of Leibniz rule akin to the one for the exterior derivative.
Form fields 231
r=l
We will now show the following Leibniz type of rule for iv.
Proposition 10.10. Let M be a smooth manifold, V E TJ- M, nE nk M
and e E neM. Then iv(i1A8) = (ivi1)A8 + (-1)kn/\ (iv8).
Proof. It is enough to show this locally, using an admissible chart (U, x).
Decompose f2 = !11dx 1 = !11dxi 1 I\··· I\ dxik, 8 = 8 JdXJ = 8 Jdxi 1 I\· · · I\ dxit.
Then
iv(i1A8) = !118Jiv(dxi 1 I\·•• I\ dxik I\ dxi 1 I\••• I\ dxit)
k -
= n1eJ~(-1r- 1 dxir(V)dxi 1 I\ ... Adxirl\ ... Adxikl\dXJ
r=l
f -
+ n1eJ ~ (-1)k+r- 1 dxir(V)dx 1Adxi 1 I\ . . . I\ dxirl\ ... Adxit
r=l
= (ivi1)A8+(-1tn/\(iv8).
Note that after the second equality, in the second summand the extra ( -1 )k
is accounted by the fact that the summation index runs from r = 1 to r = e,
instead of from r = k+ 1 to r = k+e. □
Note that in the first sum we get an extra (-1) since the i runs from i = 0
to k (instead of from i = 1 to k+ 1 as in Theorem 10.1), but in the second
sum, now as both i and j now have been reduced by 1, the factor (-l)i+i
stays the same.
(d(ivi1))(Vi, · · ·, Vk)
k
= ~(-l)i-l½((iv!1)(V1,··· ,¼,··· ,Vk))
i=l
i=l
Integration
235
236 A Mathematical Introduction to General Relativity
11.1 Orientation
IR2
IR3
e3
e2
JR e2
e1 e1 e1
!})
Exercise 11.1. Let V be an m-dimensional vector space. Then each ordered basis
B = (e 1, · · · ,em) induces an orientation [w B] on V , called the induced orientation
on V by the basis B by setting w B = E 1 /\ • • • /\ Em, where (E 1 , • • • , Em) is the
dual basis for V* corresponding to B. Show that WB =I= 0, and (e1, ··· ,em) is
positively oriented with respect to [wB] - Prove that for another ordered basis
B' = (e~, • • • , e:,,), the following are equivalent:
(1) B' positively oriented with respect to [wB] -
(2) If ej =a.ie;, then the change of basis matrix [a}] has a positive determinant.
(3) WB ~WB'·
3 Since at any point x along the equator, (V2)x is perpendicular to (Vi )x , so that
{(V1)x , (V2)x} forms a basis for the tangent space TxU.
Integration 239
Now that we have the notion of orientability, we formally give the definition
of a spacetime.
Exercise 11.2. Consider the atlas A= {(Un, 'Pn), (Us, <ps)} from Example 1.5 (p.5),
defining the smooth structure of the sphere S2 • Show that A is not oriented.
Keeping the chart (Us, <ps), modify the chart map in (Un, 'Pn) by now setting
/Pn = Ro 'Pn, where R is a reflection in the u-axis, and check that the resulting
atlas .A= {(Us,'Ps),(Un,<pn)} is oriented. Show that .Au {(U,ip)} is oriented,
where (U, ip) is the 'spherical coordinate' chart given in Example 1.8 (p.7).
Proposition 11.1.
Suppose that M is a smooth manifold with the smooth structure [A].
Then M is orientable if and only if there exists an oriented atlas A E [A].
Given an open cover C = {Uj: j E J}, a partition of unity {rpi: i EI} is said
to be subordinate to C if for each i E I, there exists an j(i) E J such that
SUpprpi C Uj(i)·
M = uBn = B1
00
n=l
U ••• U Bn1 U ••• U Bn2 U •••
00 00
Exercise 11.3. Set Vo = 0- Show that for {Vn : n EN} from Proposition 11.2,
IJ Vn+l \Vn =M.
n=O
'I/J := I: I: 'I/Jpe'
n-1 f-1
ael:U0 r,Up#0
and so the action of the top form n(p) E /\m(TpM)* on the ordered basis
(ox},P' ... 'Ox;r,p) for TpM is
where we have used det[oX/3j x~] >0 on U,a n U0 • Thus n(p) #0. Asp EM
was arbitrary, n is nonvanishing everywhere, and so M is orientable. □
11.3 Integration
We first provide some heuristic motivation for the definition that is to follow.
The aim is to give intrinsic definition of the integral on a manifold.
246 A Mathematical Introduction to General Relativity
We note that, firstly, the absolute value on the determinant would disappear
if we used an oriented atlas. Secondly, if we formally replace dx 1 • • • dxm
by dx 1 A • • • A dxm, then the definition of the integral of a top form field n
by setting
= f (f o y- 1 o <I>) (u) nv( (y- 1 o <I>) (u)) det [ ~<I>i (u)] du 1 ... dum
J(xoy- 1 oy)(UnV) uuJ
Next, we need to check that the integral depends neither on the chosen [8]-
compatible oriented atlas from the smooth structure of M, nor on the parti-
tion of unity. Suppose that A= {(Un, xn) :a EA} and A'= {(V,a, Y,a) :/3 EB}
are two oriented [8]-compatible atlases, with corresponding subordinate
partitions of unity {'Pi: i EI} and {'l/Jj : j E J}, respectively. We have
and by symmetry, we get the same last expression also starting from
~f 'lpjn. This completes the justification of the well-posedness.
i Jv/l(j)
Integration 249
J, d(rpd1) =
Uo:(i)
i
Xo:(i)Uo:(i)
(0.,1/1 + · · · + Ox=fm)(x~(i)u)du 1 ···du"'.
f f f f·
M = M volg.
In the above, while extracting det[oyixk] outside the square root sign, we
have used its positivity, thanks to the fact that A is oriented. This shows
that the definition of vol g does not depend on the chosen chart from A.
So for each p EM, we have a well-defined top form vol g (p) E /\ m (TpM) *.
For vector fields Vi, · · · , Vm E TJ M, the map
p >-+ volg(p)((Vi)p, · · · , (Vm)p)
= VI det[gij(p)]I ((dx 1 )p A · · · A (dxm)p)((Vi)p, · · ·, (Vm)p)
= VI det[gij(p)]I det[Vj(xi)](p)
is smooth in each chart (U, x), and hence is smooth on M. It follows from
Proposition 4.1 that volg is a (0, m)-tensor field. As it is clearly alternating
in each chart, it belongs to nm M. We also note that for any p E (U, x),
the action ofvolg(p) E /\m(TpM)* on the ordered basis (oxi,p,··· ,ox"',p)
for TpM is given by
(volg(p))(oxi,p, · · · , Ox"',p) = Idet[g(Oxk,p, Oxt,p)]I det I> 0.
Integration 251
IM!= IMJ·volg,
where the integral of the right-hand side is that of the compactly supported
top form field f • vol g E nm M in the sense of Definition 11. 7. If M is
compact, we define its volume by
VolgM = IM l = IM volg.
Exercise 11.6. Consider the sphere (S 2 , g) as a Riemannian manifold with the
metric g induced on its tangent spaces from the Euclidean inner product on
R 3 , as described in Example 5. 7, p.87. Determine its volume VolgS2 with the
orientation induced by the oriented atlas .A from Exercise 11.2.
Exercise 11.7. Given the oriented atlas A*:= .Au{(U,rp)} for S 2 in Example 11.2,
consider the oriented atlas
A. = {(Rx (2m, oo) x W, p= (T,p,p) >-+ (p,u(p), T)), (W, a-) EA*}
for the Schwarzschild spacetime (M, g). Show that the volume form field volg is
the same as (11.2).
252 A Mathematical Introduction to General Relativity
If V is an m-dimensional real vector space, then dim/\ kV* = (1;:) for all
O~k~m. So
1rES,n
= (det[a;])w(e1, · · · ,em),
and since w(e1, · · · , em) > 0 too, it follows that det[a;] = 1. □
11 Named after W. Hodge (1903-75), a British geometer.
Integration 253
Exercise 11.9. Let V be an oriented m-dimensional real vector space with a scalar
product g of index lg, and the volume form volg. Let (v1, • • • , vm) be an arbitrary
(not necessarily orthonormal) positively oriented basis for V, and let (w 1, • • • , w m)
be the corresponding dual basis for V*. Show volg = -Jldet[g(vi, Vj)]lw 1 A••• I\ Wm.
By the JR-linearity of the wedge product in each of its factors, the map
fw: vm-k-+ JR, given by vm-k 3 (v1,··· ,Vm-k) >-+ fw(v1,··· ,Vm-k)
is multilinear. Also, being a wedge product, it is also alternating in
V1, · · · , Vm-k· Hence fw E Am-kv*. Again by the linearity of the wedge
product, the map A kV* 3 w ...... fw E 1-,m-kV* is linear. This map is called
the Hodge star operator (one for each fixed O:::;; k < m).
If k = m, then we set *(w) = w(e1, · · · , em) E JR, for w E AmV*, where
(e1, • • • , em) is any positively oriented orthonormal basis. This gives a linear
map * : AmV* -+ JR= A0 v*. To show that the map does not depend on
the chosen ordered orthonormal basis, we note that since {vol 9 } is a basis
for Amv*, for w E Amv*, there exists a CE JR such that w = cvolg, and if
(e1, · · · , em), (!1, · · · , fm) are positively oriented orthonormal bases, then
w(e1, ···,em) =cvolg(e1, ···,em) =cl =cvol 9 (fi, · ··, fm) =w(J1, · · ·, fm)-
(1) *VOlg = 1.
(2) *1 = (-l)f 9 vol 9 .
(3) (*w)(v1, · · · ,Vm-k) =*(WI\ (vd I\··· I\ (vm-k/).
Proof.
(l)*vol 9 =volg(e1,··· ,em)=l. Here(e1,··· ,em)isanyorderedpositively
oriented orthonormal basis.
(2) Let w1, · · · , Wm EV. The top form w:= (w1/ /\ · · · /\ (wm)I> is equal to
w(e1, · · ·, em)vol 9 , as w=cvol 9 , and we had seen that c=w(e1, · · · , em)
for any positively oriented orthonormal basis (e1, • • • , em). We have
w(e1, · · · , em) = ((wd I\··• I\ (wm)l>)(e1, · · · , em)
= det[(wdej] =det[g(wi, ej)] = det[(ejlwi]
= ((ed I\··· I\ (em/)(w1, · · · , Wm)
= (-li9 volg(w1, · · · , Wm)-
Integration 255
Thus
(* 1) (W1, · · · , Wm) vol 9 = 1 /\ (wd I\ · · · I\ (wml
= (-1)1 9 volg(w1, · · · , wm)vol 9 .
Consequently, that * 1 = (-1 )19 vol 9 .
(3) It follows by the IR-homegeneity of* : /\ mV* -+ JR = /\ 0 v* that
*(WI\ (vd I\ · · · I\ (Vm-kY') = *( (*W )( V1, · · · , Vm-k) vol 9 )
= (*W)(v1, · · · , Vm-k) · (*VOlg)
= (*w)(v1,··· ,Vm-k)· l. □
Analogous to the interior multiplication of form fields, we introduce the
following 'pointwise' version for forms.
Definition 11.12. (Interior multiplication).
Let V be a real vector space, and v EV. If k-;;:: 1 and w E /\kV*, then the
interior multiplication ivw E /\k- 1V* of w by vis defined by
(ivw)(v1, · · · , Vk-1) =w(v, v1, · · · , Vk-1), for all v1, · · · , Vk-1 EV.
If k= 1, then for a 1-form w EV*, we set ivw=w(v) E IR=/\ 0 V*.
Exercise 11.10. Let V be a vector space. For all v E V, w1, • • • , wk E V* = I\ 1 V*,
k ~
show that iv(w 1 A··· A wk)= ~ (-1r- 1 wr(v)w 1 A··· A wr A··· A wk.
r=l
Exercise 11.11. Let V be a vector space, and v EV. For all w E /\kV*, 0 E /\lV*,
show that iv(w A0) = (ivw) A0 + (-ltw Aiv 0.
Proof. We have
(-lr-l(*W) /\Vi>= (-1r- 1(-1r-kvi> /\ *W= (-lt+lvi> I\ *W.
Let *W=fIE 1 , l=(i1,··· ,im-k), where {€1,··· ,fm} is a basis for V*. For
all V1, · · · , Vm-k+l EV,
((-lr-l( *W) I\ Vb)(v1, · · · , Vm-k+l)
= (-1t+l(vl> I\ (*W))(v1, · · · , Vm-k+1)
vl>v1 · · · vl>Vm-k+l 1
= (-1t+lJidet
r Ei1_v1 Ei1v~-k+1 .
f.i=~•v1 • • · f.i=-•v~-k+l
Expanding the determinant along the top row, and using Lemma 11.4
((-lr-l(*W) /\Vb)(v1, · · · , Vm-k+l)
m-k+l
= (-lt+l ~ (-lf- 1(vi>vr)(*W)(v1,··· ,'Ur,··· ,Vm-k+l)
r=l
m-k+l -
= ~ (-1t+r(vl>vr) *(WI\ (vd I\··· I\ (vr)i> I\··· I\ (vm-k+d)
r=l
□
Exercise 11.12. Let V be an oriented m-dimensional real vector space with a
scalar product g of index 19 , and the volume form vol 9 • Let w E !\kV* and
v1, • • • , Vk EV. Prove that
l
(*€ )(eju · · · 'ei,n-k) =
{Q(sign1rIIJi(lc)
if lnJ=/=0,
if J =le.
} (11.3)
We will show that this coincides with the action of (sign1rIIJ i(lc) €1e on
(ej 1 , • • • , ei,n-k). Let le= (1!1, · · · , l!m-k)- Then
€ 1e(eji,""" ,ej,n-k) = (i 1 A ... Ai,n-k)(eji,""" ,ej,,,_k) = det[/r(ej.)].
0, then at least one column of the matrix [/-r (ej.)] is zero, so
If l n J =I=
that the determinant above vanishes. On the other hand, if J = le, then
det[/r(ej.)] = det[/r(ee.)] = det[8;] = 1,
and so E1e (ej 1 , • • • , ei,n-k) = 1. Summarising, we have shown that
(11.4)
*dX = *dx 1 = sign G~: !) g(oy, Oy) g(oz, Oz) g(£7t, (7t) dy A dz A dt
0 0
= (1)(1)(1)(-l)dyAdzAdt = -dyAdzAdt,
*dy = *dx 2 = sign G~: !) g(ox, Ox) g(oz, Oz) g(ot, Ot) dx A dz A dt
= (-1)(1)(1)(-l)dxAdzAdt = dxAdzAdt,
*dz = *dx 3 = sign G~ ~ !) g(ox, Ox) g(oy, Oy) g(ot, Ot) dx Ady A dt
= (1)(1)(1)(-l)dxAdyAdt = -dxAdyAdt,
*dt = *dx4 = sign ( !~ ~ !)g(ox, Ox) g(oy, Oy) g(oz, Oz) dx Ady A dz
= (-l)(l)(l)(l)dxAdyAdz = -dxAdyAdz.
To determine * on 0 2 M, we compute for example
= (l)(l)(l)dyAdz = dyAdz.
Analogously, one obtains
*(dx Ady)= -dz Adt, *(dx Adz) =dy Adt, *(dy Adz)= -dx Adt,
*(dy Adt) = -dx Adz, *(dz Adt) =dx Ady.
For f2 E 0 2 M, * f2 can be found by decomposing
0= ~ nijdXi Adxi,
= (-l)(l)dz = -dz.
Similarly *(dx Ady Adz)= -dt, *(dx Adz Adt) = dy, *(dy Adz Adt) = -dx.
These determine* on 0 3 M. We also have *(dxAdyAdzAdt) = *VOlg = 1,
the constant function equal to 1 everywhere. ◊
Exercise 11.15. In the same set up as Example 11.5, verify directly that
**n = (-l)k(m-l)+i(g)n
from Exercise 11.13 for all n E {} 3 M.
Chapter 12
1 After Hermann Minkowski (1864-1909), who realised that space and time should be
considered as a single whole (a 4-dimensional geometric object called spacetime), rather
than two separate entities, and that special relativity, introduced by Einstein (who was
Minkowski's former student) is best understood in this mathematical setting.
261
262 A Mathematical Introduction to General Relativity
k = t(v)
t(w) ·
So it makes sense that the relative velocity of the particle as reckoned by
the instantaneous observer v is defined by
kw-v w V
W V ~ W V
Proof. Subtract u = t(w) - t(v) from u = t(w) - t(v) · □
264 A Mathematical Introduction to General Relativity
future
past "'-
past
(__ ___:.:>
Classical spacetime Minkowski spacetime
q r
We note that a> 0, since Vf."i(t) is future-pointing for all t E ~ (as ::Y is an
observer) . For some t* > 0, consider the event q = 1(t*) = p + t*e 0 . Then
the instantaneous observer v"f ,q perceives space as q + (e 0 )_!_. In this space,
there is an event r belonging to the worldline of the observer ::Y, that is,
r=::Y(s*). To determines*, we use s*(ae0 +,Be 1) - t*e 0 E (e 0 )_!_, so that
t* = s*a = s * ~ > s*. Now the proper time elapsed between p and q
for the observer 'Y is given by
But the proper time elapsed between p and r for the observer 1 is
Thus, although the events q and rare deemed simultaneous by v--y ,q, we note
that 'Y has experienced more proper time to go from p to q than the proper
time experienced by 1 in going from p to r. But this is strange only if we
have not freed ourselves from the shackles of classical-spacetime-thinking.
◊
s 'Y
'Y
2£ := 2£
The instantaneous observer v'Y,P at p, perceives the rod to lie along the
spatial section {p + b e1 : lbl:::;; L }, and reckons the rod's length to be 2£.
The inequality 2£ < 2£ says that the length of the rod 'measured' by 'Y at
p is strictly greater than that measured by 1' at p. ◊
,J-g(p)(v, v) v
U= ~....,.....,.....,....-,--w - ---;====
g(p)(w,v) ,J-g(p)(v,v)
Exercise 12.1. We use the same notation as in Definition 12.6. Show that u E v_j_.
Also, prove that u=O if and only if v=cw for some c>O.
The rationale behind this definition is analogous to what was done in the
previous section, although the time elapsed for the instantaneous observer
in going from p top+ vis now given by y1-g(p)(v, v).
268 A Mathematical Introduction to General Relativity
Consider the particle 'Y passing through p with w = v-y,p· By referring to the
picture above, we seek the point q = p + kw in the parallel translate of v_j_
(space perceived by v) that passes through p + v. Thus kw - v E v_j_, i.e.,
g(p)(kw-v, v) =0, and so k= :e:N:·,:)-
So it makes sense that the relative
velocity of the particle as reckoned by the instantaneous observer is
kw - v ,J-g(p)(v, v) v
~======-~----W-~=====
-J-g(p)(v, v) g(p)(w, v) -J-g(p)(v, v)
We now show there is a universal speed limit, c:= 1, for relative speeds.
{ < 0+1=1,
;:,,-1+1=0
using the fact that as v, ware timelike vectors, the reversed Cauchy-Schwarz
inequality (p.83) gives (g(p)(v, w)) 2 ;:,, g(p)(v, v)·g(p)(w, w) > 0. □
Exercise 12.2. Let us revisit Example 12.2 on length contraction. Determine the
relative velocity u of~ at p with respect to 'Y at p . Also find lul := vg(p)(u, u),
and express the ratio of lengths, ~ ~ , reckoned by the two observers at p, in terms
of lul.
Minkowski spacetime physics 269
.J-g(7(a))(v'Y,"i(a),v'Y,"i(u))'
~ /3v+e1
e1:=~,
Thus
Using the definition of e1 , the components ui in terms of the ui, can now
be read off. □
272 A Mathematical Introduction to General Relativity
Now we examine the limiting case where the instantaneous particle velocity
w approaches the light cone. With the same notation as in Proposition 12.3,
let w= e0 + >. e 1 . Then g(p)(w, w) = -1 + >. 2 , and so to make this vector
approach the light cone, we take the limit>.-+ 1 (from below, as wis meant
to be timelike). As we want a normalised w, we take
w eo+>.e1
w = -v-;::::_=g::;:::(p::;::::)(::::;::w=
, w;=.: :) ✓ 1 - >,2 .
The picture shows that the tips of the arrows representing the future-
pointing vector w lie along the top branch of the hyperbola
Exercise 12.3. We use the same notation as in Proposition 12.3. Suppose that the
relative velocity of w with respect to vis u = a e1 , where o: E (0, 1) . Show that
the relative velocity of w with respect to v is given by (o: ® /3)e1, where
o:®/3 := o:+/3/3 _
l+o:
(This corresponds to the situation that if a person sitting in a train moving with
speed f3 with respect to the ground, sees a fly go past at a speed o:, then a person
on the ground reckons that the speed of the fly is o: ® /3.)
Given o:, f3 E ( 0, 1) , the number o: ® f3 can be interpreted geometrically as follows.
Draw a unit circle in JR 2 with center O at (0, 0) and vertical diameter AA', where
A= (0, 1) and A'= (0, -1). Join A to (o:, O) and extend it to meet the circle at
B. Similarly, join A' to (/3, O) and extend it to meet the circle at B' . Join BB'
and let it meet the x-axis at P. Show that the x-coordinate of Pis o: ® (3.
Hint: This can be done using coordinates, but we could also use the work already
done in establishing the 3D formula given Exercise 1.2.
Exercise 12.4. Show that the operation® from Exercise 12.3 is the following:
®=((o:, /3) >-+ tanh(tanh- 1o:+tanh- 1f3)),
that is, if o:, f3 E (0, 1) are written as o: = tanh a and /3 = tanh b for some a, b > 0,
then o: ® /3 = tanh(a+b).
Exercise 12.5. A cart rolls on a table with relative speed f3 E (0, 1). A smaller cart
rolls on the first cart in the same direction with a relative speed f3 with respect
to the first cart. A third cart rolls on the second cart in the same direction with
a relative speed f3 with respect to the second cart, and so on, up to n carts.
Determine the relative speed Un of the nth cart with respect to the table.
What is lim Un?
n- oo
274 A Mathematical Introduction to General Relativity
The reason for calling this the energy can be motivated as follows. Imagine
the scenario when u is much smaller than the speed of light (so that the
classical spacetime picture is approximately valid). Consider the function
f: (-1, 1)-+ IR, given by
1
f (t) = \/1 _ t 2 , t E ( -1, 1).
By a Taylor expansion of f around the point t = 0, we obtain
1 1 2
/,7) = 1 + -2 t + h(t),
vl-t 2
where lim h(3t) =0. So if lul := -Jg(p)(u,u), then for lul«l, we have
t-+O t
E= m = m ~m+ mlul 2-
-Jl-g(p)(u,u) -JI=Tui2 2
We identify the second summand as the kinetic energy possessed by the
particle as measured by the instantaneous observer, and this motivates also
calling E as an energy. We note that if u = 0 (i.e., the particle is at rest
with respect to the instantaneous observer), then E = m. Som is sometimes
referred to as the 'rest energy'.
We remark that if we know Plv_J_ and E, then we can recover p. Choose
an orthonormal basis {e1, e2, e3} for v1_. For i = 1, 2, 3, we define
Thus in the limiting case that the particle relative velocity approaches the
light cone, i.e. , lul = ,Jg(p)(u, u) approaches the speed of light c = 1, we
note that (p 1 ) 2 + (p 2 ) 2 + (p 3 ) 2 = E 2 . But then (12.5) shows that m --> 0,
that is, the light 'particles', i.e. , photons, must have zero mass.
Momentum of light
If we think of light as an electromagnetic wave, then the fact that it carries
momentum can be loosely justified as follows. Consider a charged particle
like an electron with a charge q encountering monochromatic light, assumed
to be an electromagnetic wave, propagating the z-direction. With e x, ey , ez
forming an orthonormal basis for IR 3 , E(x, y , z, t) = E 0 sin(kz - wt)ex and
B(x, y, z, t) = Bo sin(kz - wt)ey, where Eo = cBo, and c is the speed of
light (which is set to 1 with an appropriate choice of units). The electric
field E will move the charge up and down, along the x direction. This
moving charge (with an instantaneous velocity, say Vx, along the positive
x-direction), is subject to a perpendicular magnetic field , namely B, along
the positive y-direction. By the Lorentz force law, the charge experiences
a Lorentz force F(t) due to the magnetic field:
F() _ qvxEosin(kz-wt)
t - ------'----'-ez.
C
y
z
Particles and photons can 'collide' at an event, and can exchange energies
and momenta. In such an event, the total energy-momentum is conserved.
We make this precise below.
where E is the energy of the incident photon and 0 is the 'scattering angle'.
7 We will only consider allowed collisions.
Minkowski spacetime physics 279
12.5 Electromagnetism
Let us see that the above definition makes sense. Firstly, F(p) E A2(TpM)*,
and so in particular, also the restriction of F (p) to v_1_ x v_1_ belongs to
/\ 2 (v_1_)*. We also know that the restriction of g(p) the vector space v_1_ is
Exercise 12. 7. In the same setting as Definition 12.14, show that Bis alternatively
characterised by F(p)(x,y)=volg(p)(x,y,B,v), where volg(p) is the volume form
on the vector space TpM with the orientation [O(p)] and the scalar product g(p).
Example 12.4. Let (M, g) be the Minkowski spacetime with the standard
orientation induced by (p, B) as in Example 11.5 (p.259). Let f E C 00 (M).
Suppose that F = f dz I\ dx E !1 2 M. For a q E M, consider the normalised
instantaneous observer v = Ot,q E TqM. The electric field E measured by vat
q satisfies F(q)(v,w) = g(q)(E,w) for all w E v_1_. With the decomposition
W=WXOx,q+wYoy,q+wzaz,q, we have
F(q)(w, v) = F(q)(w, Ot,q) = (!dz Adx)(q)(wxax,q+wYoy,q+WzOz,q, Ot,q)
= f(q)(wXdz A dx(ox, Ot)+wYdz Adx(oy, Ot) +wzdz A dx(oz, Ot))(q)
= f(q)(O + 0 + 0) = 0.
So the electric field measured by v is E = 0.
Let v' = ½( .J5ot,q+ Ox,q) be another normalised instantaneous observer.
From above, F(q)(wxax,q+wYoy,q+wzaz,q,Ot,q) = 0 for all wX,wY,wZ ER
Now if w E (v')J_, and we decompose w = WtOt,q + wxax,q+wYoy,q+wzaz,q
for some wt, wx, wY, wz E JR, then we obtain
F(q)(w, v') = 0 + ½F(q)(wxax,q+wYoy,q+wzaz,q, Ox,q)
= ½Udz Adx)(q)(wxax,q+wYoy,q+wzaz,q, Ox,q)
= ½J(q)(O + 0 + Wzdz Adx(oz, Ox))(q)
= ½J(q)wz 1 = g(q)(E', w),
where E' = ½J(q)oz,q· So the electric field measured by v' is E' #0 if f(q) #0.
Consequently, we see explicitly from the two calculations above, that the
electric field perceived at a spacetime point by an instantaneous observer
depends on the observer's state of motion. A similar thing also happens
with the magnetic field. ◊
Proposition 12.4. Let (M, g) be the Minkowski spacetime with the stan-
dard orientation induced by (p, B) as in Example 11.5. Let F E !1 2 M be
the Faraday tensor field. Let q E M. Suppose that the normalised instan-
taneous observer v = Ot,q E TqM perceives the electric field E and magnetic
field B, where
E = Exox,q + EYoy,q + Ezaz,q
B = Bxox,q + BYoy,q + Bzaz,q·
Then the Faraday tensor field at q is given by
F(q) = (ExdxAdt + EYdyAdt + EzdzAdt
+BzdxAdy + BxdyAdz + BYdzAdx)(q).
One starts with a given J, from which the Faraday tensor field is determined
F by solving the Maxwell equations (assuming that the effect of F on the
spacetime (M, g) is negligible).
Definition 12.17. (Maxwell's equations).
Let (M, g) be a spacetime. Let J E TJ M be a charge-current density.
A (0, 2)-form field F E 0 2 M is said to satisfy the 11 Maxwell equations
corresponding to J if
• dF = 0 (closed)
• d*F = *(JP).
F is then called the Faraday tensor field.
version of the Maxwell PDEs have been absorbed in the charge-current density.
286 A Mathematical Introduction to General Relativity
2m e 2
wheref(r)=l--+-2 .Also,r+:=
r r
{o
m+vm--e-
~
if e 2 >m 2,
·r e2 ~m 2.
1
12 The Maxwell equations were known since the 1860s, but not in the vectorial form
that we learn nowadays. The latter were given by Oliver Heaviside in 1884, reducing
the more than 20 equations to the compactly written 4 equations familiar to us. The
Maxwell equations played an important role in the genesis of the spacetime viewpoint.
Encoded in them was the refutal of the earlier-fundamental 'instantaneous action-at-a-
distance viewpoint' of the operation of forces. Indeed, the Maxwell equations show that
the electric field and the magnetic field behave like 'waves' (see Exercise 12.8), showing
that there is a finite propagation speed, which is the speed of light!
13 ~E means~ acting component-wise on E (and a similar meaning for ~B).
Minkowski spacetime physics 287
Show that at each p E U, ((Vi)p, (Vi)p, (½)p, (Vi)p) is a positively oriented or-
thonormal basis for TpM with the induced orientation. Define F E Tf M by
F = ~n
r2
1 /\ n4
'
and let J := 0 E TJ M. Prove that F satisfies the Maxwell equations.
Notes
The diagrammatic procedure in Exercise 12.3 was given in [Kocik(2012)]; see
also the exposition [Sasane and Ufnarovski(2016)]. Exercise 12.5 is based on
[Lightman, Press, Price and Teukolsky(1975), Problem 1.4]. The discussion on
the momentum of light is based on [Feynman(1963), §34-9]. Lemma 12.2 and
Proposition 12.3 are based on [Oloff(2018), §5.1].
This page intentionally left blank
Chapter 13
Matter
We now focus on the right hand side of this equation, and also provide
some motivation for this equation. (The real belief in the equation as the
governing equation for spacetime rests on its ability to predict physical
phenomena, and we have already met two of these tests, namely the red
shift, and the deflection of light. We will also meet a third test, on the
perihelion shift of the orbit of Mercury, where the observed discrepancy with
the Newtonian prediction 1 was correctly accounted for within the spacetime
geometric view of gravitation.) We begin by revisiting Newtonian gravity,
and recalling the Poisson equation, that shows how the matter sources
determine the 'gravitational potential'. This serves as a precursor to the
field equation above.
(e,,, ey, ez) forming an orthonormal basis for ker t with respect to the inner product 1/·
3 In units where the universal gravitational constant is set to 1.
289
290 A Mathematical Introduction to General Relativity
For smooth f,g, V · (gV J) = (Vg) · V f +gtlf. Using the Gauss divergence
theorem 5 , for smooth f, g, either of which is compactly supported,
r gtlJdV(y)=r
JR3\B, (0)
-(vg)-vJdV(y)+r
JR3\B, (0)
v-(gvf)dV(y)
JR3\B, (0)
=- r (v g) -v I dV(y) +
JR3\B, (0)
(gv
JaB, (0)
r n .fiy ds(y),
where ny is the radially inward unit normal on the sphere oB,(O) at the
pointy, and dS(y) is the area element on the sphere. With J(y) = p(y + x)
and g(y) = IIYll- 1 , we get that J, equals
-f (llxp(t,y+x))-llllldV(y) = - f (llyp(t,y+x))-llllldV(y)
JR3\B,(O) Y JR3\B,(O) Y
= f (Vyp(t,y+x))·Vy-llllldV(y) - f -1 1 1 (Vyp(t,y+x))•fiydS(y).
JR3 \B,(O) Y JaB,(O) Y
Call these two last summand integrals as K, and L,, respectively. Firstly,
-+
IL,I:::;; llv'p(t,•)IIL (R3) 00
i oB,(0)
-1 1 lldS(y) = llv'p(t,·)IIL
Y
-+
(R3)4m,.
00 (13.3)
K, = r (Vyp(t,y+x))·Vy-llllldV(y)
JR3\B, (0) Y
= - f p(t,y+x)lly-llllldV(y)+ f Vy·(p(t,y+x)Vy-lllll)dV(y)
JR3\B, (0) Y JR3\B, (0) Y
= - f p(t,y+x)lly-llllldV(y)+ f
JR3\B,(O) Y JaB,(O)
1 11)•nydS(y)
p(t,y+x)(vy-11
Y
=-0+ f p(t,y+x)(vy-lllll)·nydS(y).
JaB,(O) Y
We used tly@ = 0 in JR3 \{0} to get the last equality. As Vy@ = 11;rr3'
f -y -y
K, = JaB,(o) p(t, y+x) IIYll 3 "jjyjf dS(y)
= ioB,(0)
( 1
pt, y+x)-II
l 2 dS(y)
Y
e--+O
---+ 4np(t, x). (13.4)
Taking the limit €-> 0 in (13.1), and using (13.2), (13.3), (13.4), we have
llx<I>(t,x) = lim(J,+J,) = lim(I,+K,+L,) = 0 + 41rp(t,x) + 0 = 41rp(t,x).
E--+0 e--+O
With 1'0 (t) = t, 1' 1 (t) = x(t), 1'2(t) = y(t), 1'3 (t) = z(t), we obtain
·=r°(t) = 0
·/(t) + !: (1'(t))-y°(t)-y°(t) = 0
·l(t) + !:
o<I>
b(t))-r°(t)-r°(t) = o
·l(t) + oz b(t))-r°(t)-r°(t) = o.
A naive comparison with the equations obtained in Proposition 8.1 (p.156)
suggests that the connection coefficients are all zero except the following,
where (t,x,y,z)=(x 0 ,x1,x2 ,x3 ):
2 o<I> 3 o<I>
roo(t,x) = oy (t,x), r 00 (t,x) = oz (t,x).
Define a corresponding connection "V as in Exercise 6.3 (p.110) with these
coefficient functions. Using Proposition 9.1 (p.182), it can be checked that
for the curvature tensor field R, we have
o2<I>
R~oo = ox2,
Ricoo = 4'11'p.
As mentioned earlier, this equation is not appropriate in the Lorentzian
manifold case, and an appropriate generalisation is sought of (to begin with)
the right-hand side. Here only the mass appears, but we have already seen
that the energy perceived by an instantaneous observer depends on the
observer's state of motion, that is, his own normalised velocity. A wild
guess is to replace the right-hand side by an appropriate (0, 2)-tensor field
T, so that we could maybe have the aesthetically pleasing equation
Ric= 471'T.
It turns out that this train of thought is in the right direction, but it does
not work out exactly like this. We first investigate what an appropriate
generalisation of the right-hand side is.
294 A Mathematical Introduction to General Relativity
Remark 13.1. Where does T come from? In modern physics, the evolu-
tion equations for physical (matter) fields are the Euler-Lagrange equations
for an 'action', which is an integral over spacetime of a Lagrangian. We
have seen the Euler-Lagrange equation in a one-dimensional context in Sec-
tion 8. 7. A similar equation can also be derived in the multivariable setting.
The variation of the Lagrangian with respect to the metric then yields the
energy-momentum tensor field. The vanishing divergence condition is the
energy-momentum conservation or the continuity equation, and in the La-
grangian viewpoint, it arises as a consequence of the invariance of the action
under diffeomorphisms. We refer the reader to [Wald(1984), Appendix E]
for these matters. *
Before looking at examples, we mention that if we imagine a continuous
matter distribution in spacetime, and v E TpM is a normalised instanta-
neous observer, then for this observer, T(p)(v,v) is meant to capture phys-
ically the energy density (energy per unit 'spatial' volume in v_1_). Similar
physical interpretations8 can be given for T(p)(u, w) for u, w E TpM.
7A vector field such that for each p EM, Vp is causal, that is, either timelike or lightlike.
8 E.g.T(p)(x, v) is the momentum density in the direction of the unit vector x Ev_!_, and
T(p)(x, y) is the stress (force component in the x-direction per unit area perpendicular
to they-direction), where x, y are unit vectors in v_!_. For a detailed treatment, we refer
the reader to [Misner, Thorne and Wheeler(2017), Chap. 5].
Matter 295
Dust
We begin with a model of matter where we imagine non-colliding, non-
interacting particles which are 'freely falling' (so that their worldlines are
geodesics), and close-by particles have approximately parallel worldlines
(so that we can imagine these worldlines as being the integral curves of a
smooth vector field). We give the precise definition below.
v:J_p
L. vvolg(· , •, •, V).
Before we check that this T satisfies all the properties demanded of the
energy-momentum tensor field , let us provide some heuristic motivation for
the above definition by checking that T(p)(v, v) does give the energy density
measured by any normalised instantaneous observer v E TpM. Suppose
that v1 , v2, v 3 E v1_ are linearly independent vectors with 'small' lengths
lvil = vg(p)(vi, vi) « 1, i = 1, 2, 3, and K c V J_ be an infinitesimal
parallelepiped formed by vi, v2 , V 3 .
10 Since volg(p)(- , •, •,v) = volg(p)(-, ·, •,v) E A3 (v_i)*; see Exercise 11.14, p.259.
Matter 297
But ½, =cv + w, for some c E JR and some vector w E span{v1, v2, v3} =v_1_.
Taking the scalar product with v, we obtain g(p)(Vp, v) = - c + 0, so that
½, = (E/m)v + w. By the linearity and skew-symmetry of volg, it follows
that lvolg(p)(v1, v2, v3, ½,)I = (E/m) lvolg(p)(v1, v2, v3, v)I- So the energy
density measured by v is
(div(!Vb ® Vb))W
= C((n,x) I-+ (v'x(/Vb®Vb))(nU,w))
= C((n,x) 1-+((v'x(/Vb))®Vb)(nu, W))
+C((!1,X) ,_.(!Vb ®v'x(Vb))(nU, W))
= (VbW) C((n, X) I-+ (v' x(/Vb))nU) + f C ((n, X) I-+ Vb(nU)(v' x(Vb))W}
= (VbW)div(!Vb) + /(v'c((n,xJ ...... v•(ni)X)(Vb))W
By Exercise 9.12 (p.190), div(!Vb) = div(JV). Also, in an admissible chart
(U, x), we have that
Hence div(! Vb® Vb)= div(! V)Vb + f v'v(Vb) = div(! V)Vb + f (v'vV)b,
where the last equality follows using Exercise 6.17 (p.126). □
Perfect fluid
Yet another example of an energy-momentum tensor field is that of a perfect
fluid, which is a bit more complicated than dust, because now we suppose
that there is also a pressure pat each spacetime point. This pressure arises
from the random thermal motion of the particles in the fluid, and is equal
in all spatial directions for any instantaneous observer. The fluid is 'perfect'
in the sense that there is no interparticle binding or viscosity.
• p: M--+ (0, oo) and p: M--+ [0, oo) are smooth maps,
• VE TJ Mis such that for all q EM, g(q)(Vq, Vq) = -1, and
Vq is future-pointing,
1 -((Vp)V + (dp)U).
• div(pV)=-pdivV and v'vV = - -
p+p
p is called the pressure, p is called the density, and V is called the velocity
field. The energy-momentum tensor field of a perfect fluid (p, p, V) is
T = (p + p)Vi,®Vi, +pg.
Matter 299
We note that ifp = 0, the energy-momentum tensor field of the perfect fluid
is that of dust. Intuitively, the condition on 'v' v V makes sense, because
'v'v V represents the acceleration of the fluid particles, and so the equation
roughly says that mass density times acceleration is driven by the pressure
gradient, akin to the wind flow on the surface of the Earth. The first
equation, div (p V) = - p div V can likewise be roughly interpreted as saying
that the change in energy density is driven by the work done by the pressure.
The symmetry of the T given above is clear.
Exercise 13.2. With the objects as in Definition 13.4, show that T satisfies the
weak energy condition.
Often, there is also available a relation between the pressure and density,
of the form
p = f(p).
y'l-(uo(q)) 2
[T(q)n] := mv(q) n 2 uo(q)
y'l-(uo(q)) 2 y'l-(uo(q)) 2
Intuitively, we expect that the 'effective' contribution of energy-momentum
tensor at q will be the expected value of the above. Denoting the 'expecta-
tion operator' by IE, we have
niuo(q) ) . (nini(uo(q)) 2 ) . .
IE ( l-(uo(q)) 2 =Ofori=l,2,3, and IE l-(uo(q)) 2 =Oforic/=J.
[T(q)] :=IE[T(q)n]
1
0 0 0
1-(uo(q)) 2
1 (uo(q)) 2
0 0 0
31-(uo(q)) 2
= mv(q) 1 (uo(q)) 2
0 0 0
31-(uo(q)) 2
1 (uo(q)) 2
0 0 0
3 l-(uo(q)) 2
Let us set
1 1 (uo(q)) 2 1 2
p(q):=mv(q) 1-(uo(q))2' p(q):=mv(q)31-(uo(q))2 = 3p(q)(uo(q)).
Then
On the other hand, we have that the components with respect to the basis
(Vq =:v, e1, e2, e3), of the (0, 2)-tensor ((p + p)VI> © v1> + pg)(q) E Tf M(q),
are given by the same matrix above. This motivates the Definition 13.4.
302 A Mathematical Introduction to General Relativity
Thus
a a2
-2- - - +A= 81rp. (13.7)
a a2
·2
Solving for ~, and substituting the resulting expression in the first Fried-
mann equation (13.6), results in the second Friedmann equation,
a A 41r
- - - = --(p+3p). (13.8)
a 3 3
The other mixed components in the field equation just reduce to O = 0. ◊
(b) Klein-Gordon scalar field: The Klein-Gordon equation for a scalar field
¢ : M - JR is given by
O¢-m2 ¢ = 0,
where O¢:=div((d¢)U). In an admissible chart (U,x), this becomes
.. .. k 2
g'3 OxiOxi</J - g'3 rij0xk¢ - m ¢ = 0.
The Klein-Gordon equation is the Euler-Lagrange equation for the
'Klein-Gordon action'. The corresponding energy-momentum tensor
field T is given by
Field equation
In the last chapter, we saw that the energy-momentum tensor field Ton a
spacetime (M, g) is assumed to satisfy
divT = 0.
If this is to match with a (geometric) (0, 2)-tensor field, then its divergence
ought to be zero too, and we had seen 1 one such tensor field in Chapter 9:
305
306 A Mathematical Introduction to General Relativity
Exercise 14.1. Let the cosmological constant be A=0. Recall the definition of the
trace of a (0, 2)-tensor field from Exercise 5.24 (p.105). By the definition of the
scalar curvature, S=traceRic. Show that the field equation is equivalent to
Ric= 87r(T-½(traceT)g).
It can be shown3 that the Riemann curvature tensor field is nonzero (so that the
3 See for example [Sachs and Wu(1977), §7.6] or [Beem, Ehrlich and Easley(1996), §13.1].
Field equation 307
spacetime (M, g) is 'not flat'), and that the Ricci curvature tensor field is given
by Ric= -(ox(oxh) +oy(oyh))(dt-dz)®(dt-dz). Show that the scalar curvature
S = 0. Assuming that the cosmological constant is A= 0, prove that the vacuum
field equation is satisfied if and only if his harmonic in the x, y variables, that is,
02 h 02 h
iJx2 + iJy2 = 0.
Let f, g : R -+ R be smooth functions such that /2 + g 2 is not identically 0. Set
f(u) 2 2
H(x,y,u) = - 2-(x -y ) + g(u)xy.
Check that the resulting h is harmonic. The spacetime (M, g) is called a plane
gravitational wave.
We expect that if the objects are not too massive, then the metric differs
only slightly from the flat Minkowski spacetime metric described by 'T/ from
Example 5.6 (p.85). So defining
hii = gii - 'T/ii,
we assume that lhiil « 1 and loxihjkl « 1. We also suppose that we have a
'stationary' spacetime metric where the components of g (and hence also
those of h) do not depend on t. If G= [gij], Y =[%],and H = [hij], then
Y=Y- 1 , and
a- 1 = (Y +H)- 1 = (Y(I +YH))- 1 = (I +YH)- 1 Y
= (I -Y H + (Y H) 2 - + · · ·) Y ~ Y - Y HY.
So gii = 'T/ij-'T/ik hke'T/ej- Note that traceT= T((dxi)~, Oxi) = Tjigii = pg00 .
We will need this below (when we use the field equation in the form given
in Exercise 14.1).
8M can be identified with li x li3 as described in the footnote on page 289.
Field equation 309
In the geodesic equation, which we will soon also use, we shall need the
Christoffel symbols. The Christoffel symbols are given by (suspending the
summation convention)
3 gfk 3 TJ
rtj = L 2 (Oxigje+Oxigei- Oxlgij) ~ L ;k (Oxihje+Oxihie- Oxihij)-
l=O l=O
Also, we note that thanks to the stationarity assumption, for i = 1, 2, 3,
rt0 = r~i ~ T/;£ Oxiho1,. (14.1)
Neglecting the rr terms in the expression from Proposition 9.1 (p.182) for
the curvature tensor field components, we obtain
R R,ijk ~ ::i -r£
~ Ux• ::i -r£
kj - Ux1 ki·
Hence Ricjk = RLk ~ Oxirtj - Oxirti, and in particular,
Ricoo ~ OxSbo - Ot rii = Oxirbo - 0 = Oxirbo,
where we used Ot r&i = 0 since the spacetime metric is 'stationary'.
X. + --+(
y' - 2lh00 ) = 0.
This coincides with the Newtonian equation of motion x + V<I> = 0, if
hoo = -2<I>,
(up to an additive constant). So the Newtonian picture would be complete
if we manage to show that -½hoo is in fact the solution to the Poisson
equation D.<I> = 41rp (which, in Newtonian gravity, is the equation that
describes how the matter source p produces a gravitational potential <I>,
whose gradient -V<I> then gives the gravitational field). This is done below.
= D-x( -½hoo)-
Here we used the fact that the time derivatives of the components of h are
zero (our assumption that the spacetime metric is 'stationary'). But while
looking at the equation of motion, we had identified ½h 00 = <I>, and so we
have Ric00 = D-x<I>. Thus the 00-component of the field equation (14.2)
above delivers the Poisson's equation in Newtonian gravity,
D-x<I> = 41rp,
as wanted.
Field equation 311
Proof. We use the chart (M, id). Then none of the components of g have
t-dependence. Indeed, dr = (l/r)(xdx + ydy + zdz), so that
dr ® dr = r- 2 (xdx + ydy + zdz) ® (xdx + ydy + zdz).
Inserting this in the expression for g, and collecting like terms dxi ® dxi,
we see that the coefficients of these do not have any t-dependence.
10 Under the diffeomorphism M 3 (t, (x, y, z)) >-+ (t, r, (x,~,z) ), where r= -Jx 2 +y 2 +z 2 •
Field equation 313
Lemma 14.2. If (M, g) is as above, and the vector fields Lx, Ly, Lz E TJ M
are given by
Lx=YOz-ZOy
Ly=ZOx-XOz
Lz = XOy -yox,
then LL.,g = LLyg = LL.g = 0.
Proof. Let X,Y E TJM, and decompose X=Xiox;, Y=Yioxi, where
Xi, yi E C 00 (M). Below we use (t,x,y,z) and (x 0 ,x1,x2 ,x3 ) interchange-
ably. For any smooth function cp E 0 00 ( M),
[Lx,X]cp
= yoz(Xiox;cp) - zoy(Xiox;cp) - Xiox;(yozcp) + Xiox;(zoycp)
= y(ozXi)oxi'P - z(oyXi)(ox;cp) - Xi(ox;y)(ozcp) + X\ox;z)(oycp)
+yXiOzOxi'P - zXioyOxi'P - XiYOxiOz'P + XiZOxiOycp
= y(ozXi)(ox;cp) - X\ox;y)(ozcp) - z(oyXi)(ox;cp) + Xi(ox;z)(oycp)
= (y(ozXi)Oxi - XYoz - z(oyXi)Oxi + Xzoy)cp
= ((LxXi)Oxi -XYoz +xzoy)cp.
Thus it follows that [Lx,X] = (LxXi)Oxi - XYoz + xzoy. We also note
3
LXiLxxi = Xi(YOz - zoy)xi = yXZ - zXY.
i=l
We have
3
(xdx+ydy+zdz)((LxXi)Oxi -XYoz+xzoy) = -zXY +yxz+ L xi LxX\
i=l
3
(xdx+ydy+zdz)Y =xYx +yYY +zYz = LXiyi_
i=l
314 A Mathematical Introduction to General Relativity
Also,
The field equations were published in 1915, but besides the Minkowski
spacetime solution, no other exact solutions were known then. In 1916,
Schwarzschild derived another solution, by making the simple symmetry
assumptions of 'time-independence' and 'spherical symmetry', and that the
spacetime metric looks like the Minkowski metric far away from the source.
The resulting Schwarzschild spacetime is a simple model of a spacetime
containing a single star. It is a good model for the exterior region in the
proximity of a star or a planet. Moreover, if the matter source is sufficiently
dense, then it also gives a model of the simplest possible 'black hole'.
Field equation 315
rg 1 = r~ 0 =
gtt
2 (iltgtr + ilrgtt -
1
iltgtr) = - 2/ or (- f)) = 2f'/"
Similarly, the other nonzero r-symbols can be found to be
r1 - L
oo - 2k,
1 k'
r u = 2k,
1 r
r 22 = - k,
r1 - - r(sin 0) 2
33 - k ,
2 2 1 1
r12=f21 =-, r~3=-(sin0)(cos0), rf3=f~1 =-, r~3=f~2=cot0.
r r
Knowing the Christoffel coefficients, we can compute the components of the
Ricci curvature tensor field. For example,
Ricoo = Oxkr~o - Otf~k + roor~k - rokr~o
Passing the limit r-+ oo in the relation f(r)k(r)=C, we get C=l. Thus
1
k(r) = f(r).
Substituting for k in Ric22 = 0 gives
rf'
-f +l - 2 +2
rf 2 (1)'
f = 0.
So rf'+ f =l, i.e., (rf)' =l. Thus by integrating, we obtain rf(r) = r + A,
where A E JR is an integration constant. This yields f(r) = 1 + 1-- So far,
we have not specified Tmin ~ 0. We want J(r) > 0 on (rmin, 00 ), with Tmin ~ 0
the smallest possible such number. We have the following cases:
1° A~ 0. Then Tmin := 0.
2° A< 0. Then Tmin := -A.
We will soon argue on physical grounds that A=: -2m < 0, where m > 0 is
thought of as the mass of the star. For now, given an A E JR, we just take
Tmin as specified above. Then,
f(r) =
A
1 +-;: and k(r) =
(
1 +-;:A)-1 .
We can plug these expressions back and check that indeed Ricii = 0 for
i = 0, 1, 2, 3, so that the field equations do hold (the other Ricci curvature
field components Ricij, i =I= j, were already known to be zero for arbitrary
f, k). Thus we obtain the Schwarzschild metric
g=-(1+4 )dt® dt + ( 1+4 )- dr ®dr + r d01 2 2 + r 2 (sin0) 2 d<f>®d<f>
for the spacetime region JR x (r min, oo) x S 2 ( thought of as the exterior of
a star), under the assumptions that the metric is stationary 11 , spherically
symmetric, and it approaches the Minkowski spacetime metric as r-+ oo.
11 Birkhoff's theorem says that the assumptions of stationarity and that the metric ap-
proaches the Minkowski metric away from the source, are superfluous here. For a precise
statement and a proof, we refer the reader to [Kriele(2001), Thm. 7.2.1].
Field equation 317
We now give an argument to justify that A=: -2m < 0, and that then
Tmin=-A= 2m, where m can be interpreted to be the mass of the star.
Recall that in §14.1, we had found that in the Newtonian limit, if we write
the metric g on JR x IR3 as gij = T/ij + hij, then hoo = - 2<1>, where <I> is the
gravitational potential. But at a distance r from the source of mass m, the
Newtonian gravitational potential is
<l>=-m.
r
So for large r, writing the Schwarzschild metric as gij = T/ij + hij, we have
goo = -1 + hoo ~ -1 - 2 <I> = -1 - 2 - ; = - ( 1 - 2~) .
Thus ifwe identify A with -2m, i.e., m:=-A/2, then this m corresponds
to the mass of the star in the classical viewpoint, and in particular m > 0.
So A=: -2m > 0, and Tmin =2m, where m has the interpretation of being
the mass of the star.
Alternatively, we could look at the geodesic equation to justify this
correspondence. We do this in the following exercise.
Exercise 14. 7. (Radial geodesic in Schwarzschild spacetime).
Let (M,g) be the Schwarzschild spacetime. Let 00 E (0,71") and ¢ 0 E (0,271") be
fixed, and p E S 2 be the point with the spherical polar coordinates (0 0 , ¢ 0 ).
Let 'Y: J-+ M, I 3 T >-+ (t(T),r(T),p), be the worldline of a free-falling particle,
parametrised by the arclength/proper time. Show that if •' := ~r, then:
-(l- 2m)
r
(t') + (l- 2m)-\r')2 = -l,
2
r
t" = _ 2m (l- 2m)-\'r', r" =
~ r
_ m.
~
Let 'Y pass through p EM. Consider a normalised instantaneous observer
v = ✓ (1- 2r; r
1
Ot,p E TpM.
Show that v perceives the relative speed of the particle 'Y to be
u= f,✓(1-o/-f 1 Or,p•
Determine the magnitude lul := -Jg(p)(u, u) of u as perceived by v.
Now suppose that r » 2m, and lul « 1 (that is, low speed as reckoned by a
'stationary observer' far away from the source). Conclude that lr'I «t', and t' R< 1.
(In physics parlance, t' R< 1 is expressed as 'coordinate time is proper time'.) Thus
if the map h denotes the inverse of the map T >-+ t(T), then 'Yoh is the map
t >-+ (t, r(h(t)), 0o, </>o). Show that
d2 m
dt2 (r oh) R< - r2.
From the Newtonian viewpoint, in the spacetime Rx R 3 , a radially freely falling
particle in the gravitational field of a mass m at the origin experiences an accel-
eration which matches with the above. So the constant m in the Schwarzschild
spacetime can be thought of as being approximately the mass of the star.
318 A Mathematical Introduction to General Relativity
We note that the Schwarzschild metric can be used only for r > 2m, since
if r = 2m, the coefficient of dt ® dt becomes zero, and also the coefficient of
dr ® dr is not defined. For a star like our Sun, we have 2m ~ 2M0 ~ 3 km,
and so when we are outside the body of the Sun, we do have r > 2m. For
the Earth, 2m ~ 2MffJ ~ 9 mm, which is again much smaller than its own
radius. For a body of mass m, we define its Schwarzschild radius Rs to
be 2m. A body with a radius smaller than its own Schwarzschild radius is
thought of as a 'black hole', and will be discussed in the next chapter.
Exercise 14.8. Find the minimum density needed for a body with radius equal to
that of the Earth (Re= 6400km), so that its Schwarzschild radius is at least as
large as Re- Compare it with the density of water, Pwater = 1000kg/m3 •
Thus if x(t) = (x(t), y(y), z(t)) E IR3 is the position of the planet at time
t E JR, then the equation of motion is
.. m
x = - llx ll3 x.
d( ) . . ..
dt xxv =xxv+xxv=vxv+xxx
m
= 0 - llxll3 (x x x) = 0.
Thus the 'angular momentum' vector L =xx vis a constant. We assume
henceforth that L =I= 0. Then the planet is confined 12 to move in the plane
which contains the 'initial' position vector x(0) of the planet, and the initial
velocity v(0) of the planet. See the following picture.
<
rr + (i-) 2 = (v, v) + (x, x) = llvll 2 + x, -11~3 X) = llvll2 - 1: 11. (14.4)
Consider an ellipse in the xy-plane with foci at (0, 0) and at (-2c, 0) where
c > 0. Then the centre C of the ellipse is at (-c, 0). If the lengths of the
major and minor axes are 2a, respectively 2b, then the ellipse is given by
(x + c)2 y2
a2 + b2 = 1. (14.8)
Recall also that the ellipse is the locus of a movable point P such that the
sum S of the distances of P to the two foci is a constant. By taking the
point P along the major axis, we obtain that S = c+a+ (a-c) = 2a. Next
taking the movable point P along the minor axis, and using Pythagoras
theorem in the right angled triangle formed by P, C and the origin, we
obtain b2 + c2 = (S/2) 2 =a 2. Thus the eccentricity e of the ellipse is
(- 2c, 0) C
a C
which matches (14.8) with a= l~e 2 > b= ✓ i~e 2 > 0, and c= 1 _:_~ 2 =ae. So
1 k
cpt---->--=----
u(cp) l+ecoscp
describes an ellipse, with (0, 0) as one of its foci. Consequently, the trajec-
tory of the planet is an ellipse with the Sun its focus.
322 A Mathematical Introduction to General Relativity
= r" + r2
m J(r)(t') 2 - ~(r') 2 - rf(r)(c/>') 2
r2 f(r)
2 2 d2u 2 (1 ) 2 4 • d2u m 2 )
-Lu d</>2 =-mu + ~-3m Lu , 1.e., d</>2 +u= £ 2 +3mu . (14.12
Comparing with (14.6), we see that there is now an extra term 3mu 2. As
opposed to (14.6), which we could solve explicitly, now because of the extra
term, we can no longer solve the equation in an elementary manner. But
since 2m ~ 3km, and for planetary orbits r is in the order of millions 13
of kilometers, it follows that 3mu « l. As 3mu 2 = 3muu « u, in equation
(14.12), the term 3mu 2 is quite small as compared to the u on the left-hand
side. Also, assuming that the orbital speed rep' is much less than the speed
of light, we have (mu 2)/"D" = ,f? = (rc/>') 2 « 1. So we proceed perturbatively
as in §8.4. We seek a solution u = uo + 1:u1 + 1:2u2 + · · · , to
(14.13)
Substituting the series expansion for u into the differential equation (14.13),
and by comparing terms in 1: that
d2u1 2 m2 2 m2 ( e2 e2 ~
d¢2 +u1= (uo) = L4(1+ecos¢) = L 4 1+ 2 +2ecos¢+ 2 cos(2¢)1.
Set u1,p = a+ ¢(bi cos <p + b2 sin¢)+ c cos(2¢) in order to find a particular
solution u 1 ,p. Plugging this into the equation, and using the independence
of 1, sin¢, cos¢, cos(2¢), we can determine a, b1, b2, c, and obtain
m2 ( e2 e2 ~
u1,p = L 4 1 + 2 + e¢sin¢- 6 cos(2¢)).
m 3m3 ( e2 ) m du
u(0) = L 2 (l+e)+ L 4 1+ 3 ~ L 2 (l+e) =uo(0), d<p (0) =0.
So we may take u as an approximation of the solution u to (14.12) with
m du
u(0)= L 2 (1+e) and d¢(0)=0.
The perihelion of a planet's trajectory is the point along the trajectory that
is closest to the Sun. Thus r is minimised, and this corresponds to u being
maximised. In the case of the ellipse described by¢ i--+ 1/u0 (¢), where
m
uo(</>) = L 2 (1 + ecos¢),
the perihelion occurs at ¢=0, ±2n, ±4n, • • •. To determine the local maxi-
u,
mum of we seek solutions to ~~ (¢) = 0. We have
Thus the Newtonian elliptical planetary orbit is now prevented from closing
in on itself, and the resulting trajectory resembles a rosette pattern, as
shown (much exaggerated) below, with an advancement of the perihelion
in each traversal of a 'petal'. This is called the perihelion precession 15 .
15 The terminology is borrowed from the mechanics of a spinning top in Earth's gravity,
where the axis of rotation itself rotates, spanning a cone, and one says the axis ' precesses'.
16 See for example [Misner, Thorne and Wheeler(2017) , p.1113].
17 An arcsecond (denoted henceforth by•") is l
3 00 of a degree, i.e., 3600 = 1° .
11
This page intentionally left blank
Chapter 15
Black holes
We had remarked that for a typical star, 2m is much smaller than its radius,
and so the above describes the exterior region of the star, where we assumed
that the cosmological constant is A= 0, and that there is vacuum, that is,
the energy-momentum tensor field is T=O.
But now let us consider the situation, where all the mass is located 'at
r = 0'. We do not think of this necessarily as a point, but rather simply
something not in our manifold (where in the manifold we have r > 0).
Let MaH := RxJxS2 , where J = (0,2m) u (2m,oo) c R. Then MaH
is a Lorentzian manifold with the same metric g given above, and again
the field equations are satisfied in MaH with A = 0 and with the vacuum
energy-momentum tensor T = 0. While Ot is timelike in the region r > 2m,
we have Or is timelike in the region r < 2m. Thus we cannot have a particle
trajectory with r = constant in the region r < 2m. The r-coordinate of the
particle in this region must necessarily increase or decrease, depending on
what time-orientation is chosen in this part.
By looking at the behaviour of lightlike geodesics, we will see that light-
rays emanating from the region 0 < r < 2m cannot travel to the exterior
region, while null geodesics are allowed to fall in. Thus no light emerges
from such an object, and it is legitimate to call it a black hole. We will only
study Schwarzschild black holes, roughly collapsed objects characterised by
a unique parameter, namely their mass m.
1 A time-orientation is given by Ot, and an orientation is induced by the orientation
[volg] on~ X ~ 3 , where volg = vldet[gij]I dxAdyAdzAdt.
327
328 A Mathematical Introduction to General Relativity
In this section we will show that the spacetime (MaH, g) is 'singular' with
a singularity at r = 0. But first, we will introduce the spacetime (MaH, g)
in more precise terms than the description given in the introduction.
Definition 15.1. (The spacetime (MaH, g)).
Let (MaH, g) be the smooth manifold
MBH = JR X J X S 2, where J = (O,2m) u (2m,oo) C JR,
with a Lorentzian metric g, given as follows. The smooth manifold J x S 2
is identified with {(x, y, z) E JR3 I O < r := .Jx2 +y 2 +z 2 =I= 2m}, using the
diffeomorphism (t, (x, y, z)) >-+ (t, r, (x,;,z)) E MaH· Then g is given in the
global chart (MaH, (t, x, y, z) >-+ (t, x, y, z)) by
t
y
► f y
0
r
~ r y
► y
Black holes 329
The expression for the metric g seems to break down at r = 2m. But we
shall see that this is a mere artefact of the chosen coordinates. To explain
what we mean by this, let us first consider the following example.
Example 15.1. Consider the Minkowski plane JR 2 with the standard
smooth structure, and the metric
g = -dt®dt + dx®dx
in the global admissible chart (JR 2 ,(t,x) >-+ (t,x)). Then (JR 2 ,g) is a
Lorentzian manifold. Consider the admissible 2 chart
(JR 2 \{(0,x): x E JR}, (t,x) >-+ (r(t,x):=t3 ,x)).
In this new chart, the metric is given by
g = -(1/9)r- 4/ 3 dr ® dr + dx ® dx.
The coefficient function of dr ® dr above, namely
1
JR 2 \{(0,x): XE JR} 3 (t,x) = p >-+ - 9t4'
does not stay bounded asp= (t, x)-> (0, 0). ◊
Example 15.2. The Minkowski plane (M, g) from Example 15.1 is geodesi-
cally complete, since each geodesic is a straight line
7(t) = (xo, Yo)+ tv (t
JR),E
for some point (x 0 , y 0 ) E JR 2 and some vector v E JR2 . This follows from the
fact that the Levi-Civita connection in this case is just the flat connection
on JR 2 , and we had determined the geodesics for the flat connection in
Example 8.1 (p.156).
Also, from Example 8.4 (p.164), the 4-dimensional Minkowski spacetime
is geodesically complete. ◊
2 As opposed to Example 1.6 (p.5), where we also had the chart map t ,..... t 3 , we took
the chart domain as the whole manifold, while we now remove the problematic points
(i.e. the t-axis), so that there is no obstacle for the chart transition maps to be smooth.
330 A Mathematical Introduction to General Relativity
One way3 to detect if the spacetime is singular is to find out if the curva-
ture goes to infinity along a geodesic as the affine parameter approaches
a finite value. In the example below we will see in this manner that the
Schwarzschild Lorentzian manifold (MaH, g) is singular, by showing that a
radial geodesic in the interior region (r < 2m) hits r = 0 in finite proper
time, while the Kretschmann scalar goes to oo.
We will now construct a solution to the above equations. Let r 0 E (0, 2m),
and let r satisfy the initial value problem
r' -- - V~rm'
--:;:- r (O) = ro E (0, 2m ) .
Then r(r) = (r!- 3 :v:;f°r)f, and (r') 2 = 2;'. Lett'= f(r), and t(0) =to, i.e.,
_;j_
2r 2
t(r)=to+ fT(l- 2m 2/3)-lds, for O~T< ~ -
J0 ( ro3/2 --2-s
3v'2m ) 3v2m
We check that (15.1), (15.2), (15.3) are satisfied. Indeed, (15.3) holds, as
, 2 1 , 2 1 1 2m 1 ( 2m)
-f(r)(t) + f(r)(r) =-/(r)(J(r))2+ f(r)--:;:-=- f(r) 1---:;:- =-1.
3 However, this is not a foolproof way. The curvature tensor field may behave benignly
even for geodesically incomplete semi-Riemannian manifolds. This can happen if a point
is missing and cannot be added, or has been viciously removed. For example, consider
the double cone C, = {(x, y, z) E li3 : x 2 + y 2 = z 2 }\{(0, 0, 0)} c li3 as a smooth manifold.
Equipping C, with the metric induced from the Euclidean inner product on li3 , the
curvature tensor field of C, can be shown to be everywhere zero. But if p E C,, then the
line L(t) =tp, t E (0, oo) can be shown to be a maximal geodesic, and leaves C, at t=0.
Black holes 331
Thus 'r = O' is a genuine singularity for the Lorentzian manifold (MaH, g).
What about r = 2m? From the above expression for the Kretschmann
scalar, we suspect that the problem at r = 2m for the expression for g
might very well be just a coordinate artefact. In order to see what happens
when r = 2m, we study null geodesics in (MaH, g). We refer to the 'surface'
JR x {2m} x 8 2 c JR x (0, oo) x 8 2 as the event horizon. The Schwarzschild
radius is r 8 :=2m (i.e., ~2m, after GN and c have been reinstated).
4 See e.g. [Grave and Mueller(2010), (2.2.5), p.18] or [O'Nei11(1983), Exercise 8, p.399].
We will not carry out this computation here. We remark that the scalar curvature is not
an indicator of the presence of the singularity: S = 0. This is because the Ricci curvature
tensor field Ric = 0 for the Schwarzschild spacetime. We had seen this in the exterior
region (r > 2m) while deriving the Schwarzschild metric (p.316). But since the connection
coefficients in the interior region (0<r<2m) are given by the same expressions as those
in the exterior region, the Ricci curvature tensor field is zero everywhere.
332 A Mathematical Introduction to General Relativity
Proposition 15.1. Let 0o E (0, 1r) and </>o E (0, 21r) be fixed, determining
a radial direction in S 2 . Let p E S 2 be the point with the spherical polar
coordinates (0o, </>o). Then the corresponding radial lightlike geodesics in
(M8 H, g) are affine reparametrisations of"(: I --> M 8 H, 1(r) = (t(r), r, p),
r E I , where the interval I c J, and J and t are given in the table below:
Proof. First consider the radial lightlike geodesic ::Y : J--> M 8 H described
by an affine parameter TE l: ::Y(T) = (t(T),r(T),p), TE J. With.'= ~7 ,
its </>- and 0-components satisfy </>' = 0 and 0' = 0. The t-component of the
geodesic equation for ::Y gives
~11 2m ~,,
t = - r2 f(r) tr
Let us now see how things appear to an observer outside the event horizon.
Definition 15.3. (Schwarzschild observer).
Let r 0 > 2m, 00 E (0, 7r), ¢ E (0, 27r). Let p E S2 be the point with the
spherical polar coordinates (0o , <Po). The worldline "Is : IR.--> M aH given by
"/s(r) = (r)(1- 2
is called a Schwarzschild observer.
r:r 1
, ro, p) , TEIR. ,
and so g("/s(r))(v"Y,,"Ys(-r), v"Y, ,"Ys(-r)) = -1 for all TE R Thus proper time for
"Is between two events p = "Is (T P) and q = "Is (Tq) along "Is is
lrp - Tql = lt(p) - t(q)I.
0
The (r, 0, ¢)-coordinates of "Is are fixed as (r0 , 00 , ¢ 0 ). Now consider a radial
infalling lightlike geodesic ,\ as shown in the picture below.
"Is
' '
'' ''
''
2m ::
0 >-------+-~-'l------r➔
334 A Mathematical Introduction to General Relativity
The radial light geodesics emanating from the events along >. will reach the
worldline of 'Ys at later and later times, so that the Schwarzschild observer
will think that the photon never reaches the event horizon. On the other
hand, if we consider the photon parametrised5 by the affine parameter r,
then starting from the parameter value r 0 > 2m, the photon reaches the
event horizon at the finite affine parameter value r 0 - 2m.
Let p E S 2 be the point with the spherical polar coordinates (0 0 , ¢ 0 ) . Then the
same steps as in Example 15.3 show that --y(T) := (t(T),r(T),p), TE (O,T) is a
timelike geodesic (particle) starting along the Schwarzschild observer's worldline
at (0,ro,p), and reaching the event horizon in the particle's proper time T<oo.
Set T : ( 2m, ro) --+ ( 0, T) to be the inverse of the map r, that is, r o T = id( 2 m,ro) .
Using(*) and(**), and setting t = toT,
No light rays emanating from the region inside the event horizon, reach
the Schwarzschild observer 'Ys, and also do not reach the events along any
timelike curve living in the exterior of the event horizon. Thus a black hole
appears black. Of course the starry light from elsewhere, e.g. 'behind' the
black hole, can travel to the observers outside the event horizon, as shown .
•
5 This fixes its energy/frequency as measured e.g. by the Schwarzschild observer at the
observer
• In the first step, we keep the Schwarzschild coordinate time t, but re-
place r by r, r
given by = r + 2m log( r - 2m). This has the effect of
mapping the half-plane {(r, t) :r > 2m} to the entire plane R 2 , in such a
way that the radial incoming and outgoing lightlike geodesics become
45° straight lines, as shown in Figure 15.1: Indeed, a point (r, t) on
such a curve in the (r, t)-plane is mapped to the point (r, t), where
t = ±(r+2mlog(r-2m))+c = ±r+c.
• In the second step, the transformation is (r, t) >-+ (t- r, t + r) =: (U, V),
which is a 45° anticlockwise rotation, followed by a reflection in the
vertical axis, followed by a dilation by y'2.
• In the third step, we compress the entire planar region to the second
quadrant via the map (U, V) >-+ ( - e 4~, e ln ) =: (u, v).
• In the final step, we use again the composition of a 45° anticlockwise
rotation, a reflection in the vertical axis, and a dilation, now by 1/v'2:
(u, v) >-+½(v - u, v + u) =: (u, v).
Altogether, incoming and outgoing light rays run along the 45° half-lines
v + u = c, respectively v = u - c in the (u, v) plane. We have brought the
original r = 2m event horizon in the (r, t )-plane (along which lied the edges
of light cones in the (r, t)-plane) along a 45° line L, where u= v, in the first
quadrant of the (u, v)-plane. Soon, we will transform the interior region
0 < r < 2m in a similar manner, and glue the so-obtained new region in the
(u, v )-plane along this line L.
We now calculate the metric in the new coordinates (v, u, 0, ¢). First,
we find the overall transformation (t, r) >-+ (v, u):
r
e 4-m t t t t
= -2-(e4"' +e- 4,,.' e4"' -e- 4,,.)
/
1:
N
r II
.... r
l U= t - r
V = t +r
U = t -r l
V=t +r
V V
u u
l ~
U
~
_..JL
= e 4m
v =e=V
~
U = -e _..JLl
~ = e-:rm
V
V
4m
V
v
u
u
V- U v- u
U=-
2 / "= -
2
V+U V+ U
V =- V =-
2 2
Proof. For x > -l, h'(x) = ex +xex = (1 +x) ex> 0, and so his strictly
increasing (and hence injective) on (-1,oo). As xexlx=-1 =-¼,and
lim xex = +oo,
X-+CX)
From these two equations, using v2 - u 2 = - re 2~ f (r), we solve for Ou, Ov:
10 We remark that for y ~ 0, the unique x ~ 0 such that xex = y is sometimes denoted
by Wo(y), and the so-obtained function Wo is called the Lambert W function; see e.g.
[Corless, Gonnet, Hare, Jeffrey and Knuth (1996)].
Black holes 339
16m2
Similarly, g(ov, ov) = -------,,--- and g(ou, ov) = g(ov, ou) = 0. Thus,
re2m
16m2
g = ----::i::-(-dv © dv +du© du)+ r 2 d0 ® d0 + r 2 (sin0) 2 d¢ © d<j), (15.10)
re2m
where r=r(u, v), given by (15.9).
Indeed,
1 v -U v -U 1 .!.±.f -t+r .!.±.f -t+r
(u, v) = 2(e'im -e"im, e'im +e"im) = 2(e 4 ,,, -e 4,,, , e 4,,, +e 4,,, )
= e 4 :'n ( sinh _t_ cosh-t-) = e 4:;. e½ Iog( 2m-r) ( sinh _t_ cosh-t-)
4m' 4m 4m' 4m
= e 4:;.V2m-r(sinh-t-, cosh-t-).
4m 4m
Thus again, we have v 2 - u 2 = (2m-r)e2:-n = -re2:-nf(r), and so r(u,v)
is given by (15.9). The equations after (15.9), all the way to the metric
expression in (15.10) remain the same.
Consequently, we take MBH as the smooth manifold n x 8 2 , where
n= {(u, V) E IR 2 : V > - u and v2 - u 2 < 2m}
with the metric given by
16m2
g = -----;:-(-dv ® dv +du® du) + r 2 d0 ® d0 + r 2 (sin 0)2d¢ © de/>,
re2m
where r=r(u, v), given by
TH= 8 G k
nc3
= 6.18 x 10
-1M0
-Kelvm.
.
(15.11)
7r N am m
Here, ka is the Boltzmann constant, n = h/(21r), h being the Planck con-
stant, m is the mass of the black hole, and M0 is the mass of our Sun. We
refer the interested reader to [Wald(1984), Chapter 14].
Exercise 15.3. When an object falls into a black hole, information is lost. Lack of
information is associated with entropy in statistical physics. Entropy is related
to temperature. Let us use these ideas to give a 'back of the envelope' heuristic
derivation of the Hawking temperature formula.
It was shown by Hawking in 1971 that if two black holes collide forming a
new black hole, then the area of the event horizon surface is bigger than the sum
of that of the original black holes. This result is reminiscent of the second law
of thermodynamics about entropy increasing in any dynamical process. This led
J. Bekenstein 12 to guess that the entropy of a black hole is an increasing function
of the area. But as entropy is additive, and since in the Hawking inequality
mentioned above, the right-hand side takes the sum of the areas, the simplest
guess is to take S ocA. As entropy is ka times a dimensionless quantity, we divide
the area A by the square of an appropriate 'universal' length.
The elementary particles, with kinematics governed by the constants c and
Ii, do not have a length scale, because these two constants cannot be combined
to form a quantity with the dimension of length. But adding mass, we do ob-
tain length, for example the Compton length Re := li/(mc). Similarly GN and c
also cannot be combined to give length. Adding mass again rescues the situa-
tion, since GNm/c2 has dimensions of length. Taking the geometric mean of the
lengths li/(mc) and GNm/c2, the pesky m cancels, and we obtain the universal
length scale, the so-called Planck length, given in terms of the universal physical
constants by fp=,JliGN/c3. Using this, we have S=akaA/.e;, where a is a nu-
merical proportionality constant, which was shown in Hawking's 1975 work to be
12 1947-2015, a Mexican-born Israeli-American theoretical physicist.
Black holes 343
equal to ¼. As we are only giving a heuristic derivation, let us use this numerical
constant, and see ifwe recover the rest of the terms in the formula (15.11) for TH.
A particle of mass !::!..m has 'rest energy' !::!..U = c2 !::!..m. If this enters the event
horizon, then, using T !::!..S = !::!..U, we obtain T !::!..S = c2 !::!..m. Using A= 471" ~, where
Rs := 2GNm/c2 is the Schwarzschild radius, derive (15.11).
Remark 15.2. (Emergent gravity paradigm). We have seen two instances
in spacetime physics, where the existence of a 'horizon' creates a thermo-
dynamics temperature: the Unruh effect in Exercise 5.21 and the black
hole temperature. In the classical viewpoint, the field equations of space-
time are fundamental, and the thermodynamics of spacetime is a conse-
quence. Roughly speaking, in the 'emergent gravity paradigm' one tries
to go the other way, obtaining the field equations as an effective macro-
scopic description of a more fundamental microscopic theory ('atoms of
spacetime'/associated degrees of freedom). The reader who wants to delve
deeper is referred to [Padmanabhan(2008)] and [Padmanabhan(2016)]. *
This page intentionally left blank
Chapter 16
Cosmology
345
346 A Mathematical Introduction to General Relativity
If (U,x) is an admissible chart for S, then the metric g on Min the chart
(Ix U, (t,p) ,_. (t,x(p))) is
g = -dt ® dt + (a(t)) 2 ~ <Tijdxi © dxi.
Here R( X, Y) Z = v' xv' y Z - v' y v' x Z - v' [x, Y] Z is the curvature operator
(Exercise 6.15, p.125). We note that the linear independence of v, w shows
that the denominator is positive:
•- u(p)(v,w) 0
z .- v - u(p)(w, w) w =/-
thanks to the linear independence of v, w, and so
0< ( )( )= ) + (u(p)(v,w)) 2
( )( _ 2 (u(p)(v,w))
2
CT p z, Z CT p V, V
( )(
(T p w,w
)
(T
(p)( w,w )
(u(v,w)) 2
= u(p)(v, v) - (T (p )(w,w )
We also need to check that Kp is well-defined, i.e., that it does not depend
on the spanning set {v, w} chosen. We first note that
u(p)(R(p)(v,w)w,v) = vl>(R(p)(v,w)w) = R(p)(vl>,v,w,w)
= R(p)(v, w, w, v).
1 We use this notation, as it is customary to call the set consisting of all k-dimensional
subspaces of a vector space Vas the Grassmannian, and this set is denoted by Gr (k, V).
Cosmology 347
u(p)(v' v') u(w' w')-(u(p)(v' w')) 2 = det [ u(p)( v'' v') u(p)( v'' w') ]
' ' ' u(p)(w',v') u(p)(w',w')
= det [ au(p)(v, v') +bu(p)(w, v') au(p)(v, w') +bu(p)(w, w')]
cu(p)(v, v') +du(p)(w, v') cu(p)(v, w') +du(p)(w, w')
= det ( [a b] [u(p)(v,v') u(p)(v,w')])
c d u(p)(w, v') u(p)(w, w')
= (detA)det [u(p)(v,v') u(p)(v,w')]
u(p)(w, v') u(p)(w, w')
= (det A) det [au(p)(v, v) +bu(p)(v, w) cu(p)(v, v) +du(p)(v, w) ]
au(p)(w, v)+bu(p)(w, w) cu(p)(w, v)+du(p)(w, w)
t
= (detA)det ( [u(p)(v,v) u(p)(v,w)] [a b] )
u(p)(w,v) u(p)(w,w) c d
= (detA)(det(At))det [u(p)(v,v) u(p)(v,w)]
u(p)(w,v) u(p)(w,w)
To check (*), we work locally in an admissible chart (U, x): for all W E TJ S,
(div(ku))(W)
= C(Tf S x TJS 3 (0, V) ,_. (v'v(ku))(O~, W))
= C(Tf S x TJS 3 (0, V) ,_. (Vk)u(O~, W) + k(v'vu)(O~, W))
= C(Tf S x TJS 3 (0, V) ,_. (Vk)u(O~, W) + kO)
= (ox;k)u((dxi)~, W) = (ox;k)u(uiioxi, Weoxt) = (ox;k)Weuiiuie
= (ox;k)We8~ = (ox;k)Wi = (Wiox;)k =Wk= (dk)W
From (16.3) and (16.4), 2(n-l)dk= (n-l)ndk, that is, (n-l)(n-2)dk=0.
As n ~ 3, this implies that dk = 0. Hence in any admissible chart (U, x),
Oxik = 0 for 1 ,::;; i ,::;; n, implying the local constancy of k. Since S is
connected, k is constant on S. □
3 We use 'b' rather than 'a', since, in our discussion of the FLRW spacetime so far, the
expression for the 'spatial part of the metric' of the metric g of the full spacetime M,
already has a function a(t), and we want to avoid confusion with that function.
4 These metrics are those of the 3-sphere of radius v'k if k > 0, or Euclidean space when
k = 0, or the 'hyperbolic space of radius ~ • if k < 0. The hyperbolic space of radius
a>0 is the open half-space {(x 1 ,x 2 ,x 3 ) E JR3 : x 3 >0} equipped with the Riemannian
metric given by gij = a 2 8ij/(x 3 )2; see e.g. [Godinho and Natario(2014), Example 4.2]
or [Lee(2018), pp.62-67].
352 A Mathematical Introduction to General Relativity
As mentioned earlier, the isotropy of the spatial section for each instan-
taneous observer motivates the spacetime model M =Ix S, where S has
constant sectional curvature at each point. So we consider the following.
Here · = ¼t-- Using the above, we can calculate the components of the Ricci
curvature tensor field, and the nonzero components are given as follows:
.
R lCrr _ aa + 2a2 + 2k
Rieu= -3~, - k ,
a 1- r 2
Ric00 = (aa + 2a 2 + 2k)r2 , Ricct,ct, = (sin 0)2Ric00-
5 U := JR3 \{(x, y, z) E JR 3 : x ~ 0 and y = O} and x := (r, 0, ¢), where r, 0, <p are the usual
spherical polar coordinates.
Cosmology 353
S= 6 aa + a2 + k
a2
So we have now collected all the information for writing the 'geometry' side
of the field equation.
and so 'Y is a geodesic. The curve 'Y is the worldline of a freely-falling galaxy
in the FLRW spacetime. ◊
rr</.> - r</.>r - r
</.> - </.> - 1
-, r0</.>
</.> </.>
= r<J.>0 = cot 0.
The aim in this exercise is show that a certain curve µ, joining the origin in S
to the point pin S with spherical coordinates (ro, 00 , </Jo), is a geodesic. We will
calculate the 'length' ofµ, which will then serve as a notion of distance between
the origin and the point p. Let so > 0, and let r be the solution to
dr(s) = -Jl - k(r(s)) 2 (O ~s~so ) ,
ds ()
at
r(O) = 0.
Let ro := r(so). Defineµ : [0, so] -+ S by µ(s) = x- 1 (r(s), 0o, </>o) for 0 ~ s ~ so.
Determine vµ,µ(s), and tr(µ(s))(vµ,µ(s),Vµ,µ(s))- Show that the length d(t) ofµ,
d(t) :=
o
iso
✓tr(µ(s))(vµ,µ(s),Vµ,µ(s)) ds =so= a(t)
1
--;:;:===;;::::::;;:dr.
o v'l - kr 2
iro
Show thatµ is a geodesic in (S,tr).
354 A Mathematical Introduction to General Relativity
Remark: A space slice in the FLRW spacetime (M, g) is a level set oft : M --+ I.
Identifying a space slice {t} x S with S, it is reasonable to define the distance
between (t, 0), (t, p) E {t} x Sc Mas d(t). Indeed, if the distance d(t) between 0
and pis defined as the length of the shortest curve joining Oto p, then d(t) is the
length of the geodesic joining these points by an analogue of Theorem 8.5 (p.177)
in the Riemannian manifold context; see e.g. [Lee(2018), Chap. 6 and Thm. 6.4].
We assume that on the matter side, we have a perfect fluid (p, p, V), where
V = Ot in the chart J x U described in the previous section. Here p and p
are assumed to be functions oft only. This is again due to our homogeneity
assumption that in each 'spatial slice', one point is not more special than
the other. Then the energy-momentum tensor field T= (p+p)V1,(8)V1, +pg
has the nonzero components given by
T _ p(t)(a(t)) 2
Tu=p(t), rr- l-kr2 ,
We also assume that the cosmological constant A is absorbed into the matter
side as described in Remark 13.2 (p.300). Then the field equation becomes
Ric-;g = 81rT.
6 Not really a constant, since it depends on t! The constancy is within the spatial slice.
7 After Edwin Hubble (1889-1953), an American astronomer. This was also discovered
by Georges Lemaitre (1894-1966), a Belgian physicist, two years prior to Hubble. The
law shows that the spatial slices are 'expanding'. Historically, this was an important
milestone, since ended the then-prevalent static view of the universe.
Cosmology 355
.
The tt-component gives - - 3aa + a2
- 3a 2
+ k (- 1) = 8n:p. Rearrangmg,
. s
a a
a2 k 8n: (16.9)
2a + 2a = -3 p.
The rr component of the field equation gives
aa+2a 2 +2k_ 3 aa+a 2 +k a2
l-kr 2 a2 l-kr 2
which reduces to the equation9
a a2 k
2- + - + - = -81rp. (16.10)
a a2 a2
The 00, </></> components of the field equation also give the same equation
as above, while the mixed components just reduce to O = 0.
The pair of equations (16.9) and (16.10) have three unknowns: a, p, p.
So we need one more equation if we hope to determine each of these func-
tions. The extra equation is often an equation of state, giving the pressure
as a function of the density. In a linear model p = wp, where w is a constant,
and the table below gives the interpretation of the type of matter:
matter type
I
3 radiation
0 dust
-1 cosmological constant
= a3 (p + 3(p + p)~) = a 0 = 0.
3
Below, we will assume that the universe is not empty, so that pa3 =C>0.
The case k = 0
. a2 871' 871' 0 . 871'
Equation (16.9) becomes a 2 = 3 p= 3 a3 , and so aa 2 = 3 o. Thus 13
d .a. 3 C· 3 ~
dta 2 = 2 vaa= 2 y 3 0=.a. (16.12)
Solving this on I= (0, oo) such that 14 lima(t)=O, we obtain, with ,8:=ai,
t--+O
a(t) = ,Bti.
Then the scalar curvature S is given by
s = 6 aa + a2 + 0 4
3t2
a2
and so S - oo as t '\. 0. After a rescaling of the t variable, the metric is a
constant multiple of the one considered in §9.2.
·2 k
Also (16.10) is satisfied: Using the above expression for \ + 2 , we have
a a
a a2
2-+-+-
k
a2 2 a a
-k 2 k 2k 3
=2~----- -2- ~ - - - - -1- - + ~ - - - - - -1- -3
4;C(l- cos(h- t)) 4;C(l- cos(h- t))
1 (4;C)2(1- cos(h- t))
= 0 = -81rp.
The graph oft >--> a(t) is a 'scaled version' of the cycloid. To see this, we
note that if a circle of radius R rolls on the x-axis, then the coordinates of
the point P on the circle, starting from the origin, are given by
(Rr-Rsin r , R-Rcosr) ,
where T is the angle that OP makes with the vertical, as shown in the
following picture.
(0 , 0) X
In our case, define R = 4; f Also, for t E (0, tmax) , if T E (0, 21r) is such
that h( T) = t, then
•· •2 k 6k 3 1
S = 6 aa + a + = --,---------- ~ oo
a2 (4; C) 2 (1 - cos(h- 1 t)) 3 ·
Using these expressions, one verifies in the same manner as in the k > 0
case, that (16.9) holds after substituting p=C/ a 3 :
a2 k 2(-k) 3
- + - = - - - - - - - - - = - - = -p
871' C 871'
a2 a2 ( 4;C) 2 (cosh(h- 1 t) -1) 3 3 a3 3 ·
Also, (16.10) is satisfied: Using the above expression for ~ + !,r, we have
ii a2 k
2-+-+- = 0 = -87l'p.
a a2 a2
We have a singularity at t = 0, since
S= 6 aii+a 2 +k=6(-k) 3 1 ~ 00
a2 ( 4; C) 2 (cosh(h- 1 t) - 1) 3 ·
Qualitative plots of a versus t in each of the three cases are shown below.
k= O
k >O
0 t
360 A Mathematical Introduction to General Relativity
Notes
Section 16.1 is based on [Oloff(2018), §15.1].
15 The 'CDM' part of the abbreviation, standing for 'cold dark matter'.
16 Indeed, M>O gives a(O)>O too, which yields H(0)2 = ~+-b->-b--
Solutions
Chapter 1
361
362 A Mathematical Introduction to General Relativity
Then
2 2 2 4u 2 +4v 2 +u4 +v 4 +1+2u 2v2-2u 2-2v2
x +y +z = (u2+v2+1)2
So p E S 2 • Also p =I= n, since otherwise
(
2u 2v u 2 +v 2-1) = (O O l)
+v +1' u +v +1' u 2 +v 2 +1
2 u2 2 2 ' ' '
and comparing the first two entries, we get u = 0 and v = 0, implying
u 2 +v 2 -1
- - - - = -1 =I= 1
u 2 +v 2 +1 '
a contradiction. Hence p E S2 \ { n}. Finally,
1 ( 2u 2v )
'Pn(P) = 1- u2+v2-1 u2+v2+1' u2+v2+1 = (u, v).
u 2 +v 2 +1
z
Solutions 363
(b3, A3) · ((b2, A2) · (b1, A1)) = (b3, A3)· (b2 + A2b1, A2A1)
= (b3 + A3(b2 + A2b1), A3(A2A1))
= ((b3 + A3b2) + (A3A2)b1, (A3A2)A1)
= (b3 +A3b2,A3A2)·(b1,A1)
= ((b3, A3)· (b2, A2)) · (b1, A1).
Let (b, A) E llm x GL(m, ll). With the zero vector OE llm and the identity
matrix I E GL(m, ll), (0, I)· (b, A) = (0 + I b, IA) = (b, A), and also
(b, A)· (0, I) = (b + A 0, AI) = (b, A). So (0, I) serves as the identity
element with respect to •.
Finally, for (b, A) E llm x GL(m, ll), (-A- 1 b, A- 1 ) E llm x GL(m, ll).
Moreover, (b,A)·(-A- 1b,A- 1) = (b + (A(-A- 1 b),AA- 1 ) = (0,I), and
(-A- 1 b, A- 1 )·(b, A)= (-A- 1 b + A- 1 b, A- 1 A)= (0, I). So every element
in llm x GL(m, ll) has an inverse with respect to •.
Solution to Exercise 1. 7
Denote the compatibility relation by~. Reflexivity (Au A= A is an atlas
on M for every atlas A on M) and symmetry (A1 uA2 = A2 uA1 is an
atlas on M for atlases A1,A2 on M such that A1 ~ A2) are clear. We
now show transitivity. Let A1, A2, A3 be atlases on M such that A1 ~ A2
and A2 ~ A3. Thus A1 u A2 and A2 u A3 are atlases on M. As the
charts in A 1 alone cover M, certainly the charts in A 1 u A3 cover M. So
to show A 1 u A 3 is an atlas, we only need to show that the transition
maps are C 00 • Clearly if both charts are from A 1 , or if both are from
A3, then there is nothing to prove. So let (U, cp) E A1, (V, '¢) E A3, and
Un V =I= 0- Let p E Un V. Then there exists a chart (W, a) E A 2 such
that p E W. As p E U n W, and since A1 u A2 is an atlas, the map
a o cp- 1 : cp(U n W)-> a(U n W) is C 00 • Similarly, asp E W n V, and
as A 2 u A3 is an atlas, the map '¢ o a- 1 : a(W n V) -> '¢(W n V) is
C 00 • Now the map '¢ o cp- 1 : cp(U n V n W) _. '¢(U n V n W) is C 00
because'¢ o cp- 1 = ('¢ o a- 1) o (a o cp- 1) is the composition of two C 00
maps. Hence '¢ o cp- 1 is C 00 in a neighbourhood of cp(p) E cp(U n V).
As p E U n V was arbitrary, '¢ o cp- 1 is C 00 on cp(U n V). Similarly,
cp o '¢- 1 : '¢(U n V) -> cp(U n V) is C 00 • So all the transition maps in
A1 u A3 are smooth, and so A1 u A3 forms an atlas. Thus A1 ~ A3.
Solutions 365
is open in llm+n, because 'Pi (Ui n Ui,) is open in llm, and 1Pi (½ x l'j,) is
open in lln.
366 A Mathematical Introduction to General Relativity
(cp; 2 x 'lj;h)((U;, x ½,) n (Uh x ½ 2)) 3 ((cp; 2 o cp~ 1)(a), ('lj;h o 'lj;;,1)(,B))
and
(cp; 2 x 'lj;h)((U;, x ½,) n (Uh x ½2 )) 3 (a,,B)
I(-Pi1 x VJ)l ) o ('Pi2 x VJ j2 )-1
<p
--+
Solutions 367
p u
Theniq(U)={(p,q):pE U} c UxV. Also, (xxy)oiqox- 1 :x(U)----> rntm+n
is given by
((x x y) o iq o x- 1 )(() =(xx y)(iq(x- 1(())) =(xx y)(x- 1((),q)
= (x(x- 1(()),y(q)) = ((,y(q))
for ( E xU, and the map x(U) 3 ( f--4 ((,y(q)) E rntm+n is clearly C 00 . But
p E M was arbitrary. Thus iq is smooth.
370 A Mathematical Introduction to General Relativity
~-~
9)
~.
0 f( U )
IR.= N
l
M = IR.
• (U, cp) = (ll, id), an admissible chart for ll, with t Ell = U.
• Let {e 1 , • • • , em} be a basis for V, and let '¢ : M -+ llm be the map
q >-+ 'l/J(q) := (y 1 , • • • , ym), where q = p + Vpq and Vpq = yiei. Then we
had seen that (M, '¢) is an admissible chart for M. Let v = viei. We
have 'Yv(U) = 'Yv(ll) c M, and for s E cp(U) = ll,
('¢ o 'Yv o cp- 1 )(s) = '¢ o 'Yv(s) = '¢(p + sv) = 'l/J(p + sviei)
= (sv 1 , · · · , svm).
Clearly, the map ll 3 s >-+ ( sv 1 , • • • , svm) E llm is C 00 .
As the choice oft was arbitrary, it follows that 'Yv is smooth.
Chapter 2
V72,-if = :? 0
1'2 ) (0) = :?(t+ 3t 2 - l)) (0) = f'(t+3t 2 - 1) (1 +6t)lt=O = J'(-1).
Thus for all f E C 00 (M), we have v71 ,-if = v72 ,-if. Hence v71 ,-1 = v72 ,-1-
Solution to Exercise 2. 7
The partial derivatives of the chart transition map '¢ o (id)- 1 are
or X or y
ox ,Jx2 + y2' oy ,Jx2 + y2'
80 -y 80 X
ox x2 + y2' oy x2 + y2 ·
vl . . vl
[ v2 ] ' g1vmg [ v2 ] = [
,Jx2X+ y2
y
,Jx2 + y2
l
.
For f E C 00 (M),
1
(d1o(Ot,o))(f) = Ot,o(f 01) = ~~O"(oidiR \dR(O)) = ~~ 01 \0)
= v,,,(o)f = v,,vf.
As f E C (M) was arbitrary, d"fo(ot,o) = v,,p·
00
So multiplying from the left by the row vectors (1, 0, 0), (0, 1, 0), (0, 0, 1),
respectively, we can read off the claimed relations.
378 A Mathematical Introduction to General Relativity
Chapter 3
Solution to Exercise 3. 7
For p E R 2 , using Theorem 2.2, we determine the matrix of the linear map
(d'l/J-t)p with respect to the ordered bases (ox,p, oy,p) and (ox,,t,_,p, oy,,t,_,p):
[ c~st -sint]
smt cost
[OJ=
1
[-sint].
cost
So
(('1/J-t)*V)'P-tP = (d'l/J-t)p½, = (-sint)ox,,t,_,p+(cost)oy,,P-tP
= ((- sin t) Ox+ (cost) oy),t,_,p•
Hence, as '1/J-t: R 2 -+ R 2 is surjective, ('1/J-t)*V = (-sint)ox+(cost)oy.
J ')'(t)
P
d ( 1)
- - du= -
du u
lt
0
dt.
1 1 p l 1
So - ()--=-t. Thus 'Y(t)=-- fort<-. (Ast/'-, 'Y(t)/+oo.)
'Y t p I-pt p p
[ 01 -tJ
1 .
Hence the action of (d1/J-t)p on oy,p is the multiplication of the above matrix
with the vector e2 E JR 2 . Consequently, (d1/J-t)p(oy,p) = -tox,,J,_,p + Oy,,J,_,p•
Solutions 385
[Lx,Ly]f
= (yoz - zoy)(zoxf - xozf) - (zox - XOz)(yozf - zoyf)
= y(ozz)(oxf) + yzozoxf - ~ - 0 - z 2 oyoxf - 0 + ZXOyOzf + 0
-zyoxozf - 0 + z 2 oxoyf + 0 + xyo;J + 0 - x(ozz)(oyf) - XZOzOyf
------ ~
Chapter 4
Also for V, WE TJ M,
((01 + 02) ®0)(V, W) = ((01 + 02)V)-0W = (01 V + 02V)-0W
= (n1 v)0w + (n2 v)0w
= (01 (8) 0)(V, W) + (02 (8) 0)(V, W)
= (01 ® 0 + 02 ® 0)(V, W), and
(0 ® (01 + 02))(V, W) = (0V)·((01 + 02)W) = (0V)-(01 W + 02W)
= (0V)01W + (0V)02W
= (0 ® 01)(V, W) + (0 ® 02)(V, W)
= (0 ® 01 + 0 ® 02)(V, W),
and so
(01 + 02) ®0 = 01 ®0 + 02 ®0 and
0 ® (n1 + n2) = 0 ® n1 + 0 ® n2.
Solution to Exercise 4. 7
First we show the spanning property. Let T be an (r, s)-tensor on V. For
any v E V, we have v = 1:i(v)ei, and for w E V*, w = w(ei)Ei. Thus for
w 1 , • • • , wr E V* and v1, · · · , Vs EV, we have
= i
T( 1 , · · · , ir, ej1 , · · · , ej.) ·
(ei 1 (8) · · · (8) eir (8) €j 1 • • • (8) €j" )(w1, · · · , Wr, V1, · · · , Vs)-
Next we show the independence. Let ct:t ei 1 (8) • • • (8) eir (8) 1:i1 • • • (8) 1:i• = 0.
Operating on (€k1, ... '€kr, eR,i, ... 'eR,.) E (V*t X vs'
Q = Ci_1 ·••i, e· (8) ... (8) e·tr @ €j1 ... @ €j" (1:k1 , . . .
Jl •••Js i1 ,
€kr , e•.c.1, . . . , e•.c. 8 )
£1 ... £. •1
= Ck1 •kr 00
£1 ... £•.
Solutions 389
=
,I/
,il ,•r J.Ji
,I/ .,
So T is well-defined.
Since smoothness is a local property (Exercise 1.16, p.14), the map
M 3 p >-+ T(01,- • • , nr, V1, • • • , Vs) is smooth. Hence Tis a map from
(Tf Mt x (TJ M) 8 to C 00 (M). The multilinearity of Tin each component
follows from the observations that
k ~
+ JO~k )pOxik,p = (0 k )pOxik,p + f(p)-(O)pOxik,p,
(n
(dxie)p(½ + J½)p = (½ + J½)pxie = (½)pxie + J(p)·(Ve)pxie
= (dxie)p(Vi)p + J(p)·(dxie)p(Ve)p
Then
Thus (T®T')(p)=T(p)®T'(p).
(T ® S)~: :::;.:~, (p) = (T ® S)(p)((dxi' )p, ... '(dxir+r' )p, Oxi,,p, ... 'Oxi•+•',p)
= ((T ® S)(xdxi', · · · , xdxir+r', XDxi,, · · · , xoxi•+•' ))(p),
Similarly,
((f*T) (or, 00)) (q) = r 2 ( c~s 0) 2 ( O+ (or,qr) (00,q0)( or,qr sin 0) (00,qr sin 0) +O+O)
(
1 ) (sin0)rcos0 = !tan0,
r 2 cos 0 2 r
((f*T) (00, or)) (q) = r 2 ( c~s 0) 2 ( O+O+ (00,q0) (or,qr) (00,qr sin 0) (or,qr sin 0) +O)
(
1 ) (rcos0) sin0 =!tan 0,
r 2 cos 0 2 r
((f*T)( 00, 00)) (q) = r 2 ( c~s 0) 2 ( O+O+ O+ (00,q0) (00,q0) (00,qr sin 0)( 00,qr sin 0))
1
( )2 (rcos0)(rcos0) = 1.
r 2 cos 0
Thus
1 1
f*T = 2 (tan 0)2dr ® dr + -(tan 0)(dr ® d0 + d0 ® dr) + d0 ® d0
r r
in the chart (M, (r, 0) >-+ (r, 0)).
Solutions 395
Chapter 5
(1)=?(2): Let A E O(1,m -1). For all u,v E !Rm, r,(Au,Av) = r,(u,v), that
is, (Aur[r,]Av = ut[r,]v, that is, utAt[r,]Av = ut[r,]v. For 1 ~ k ~ m, let
ek E !Rm be the column vector with all entries equal to zero, except for the
kth entry, which is equal to one. Taking succesively v = ek, 1 ~ k ~ m, we
obtain utAt[r,]A = ut[r,]. Taking successively u = ek, l~k~m, we obtain
At[r,]A = [r,].
(2)=?(3): The (i,j)th entry of At[r,]A is (At)iT/ktA] = AfT/ktAf Thus
T/ij = Af A]T/k£·
(3)=?(1): A E 0(1, m - 1), because for all u, v E !Rm that
r,(Au,Av) = (Au)t[r,]Av = utAt[r,]Av = uiAfT/ktA3vi
. k £ . . .
= u"AiAjT/ktV1 = u"T/ijV1 = r,(u,v).
Solution to Exercise 5. 7
If A E 0(1, m - 1), then At[r,]A = [r,]. Thus taking determinants,
-1 = det[r,] = det(At[r,]A)
= (det(A t))(det[r,]) det A
= (detA)(-l)detA = -(detA) 2.
So detA E {-1, 1}. As detA#0, A is invertible. So 0(1, m-1) c GL(m, JR).
We have
Finally,
Let w E TpM. Suppose that the vector field WE TJ Mis such that Wp = w.
(Such a vector field W exists, by Lemma 3.1, p.44.) Then the equality
g(p)(Vp, Wp) = g(p)(Vp, Wp) above gives g(p)(Vp,w) = g(p)(Vp,w), i.e.,
g(p)(½, - vv,w) = o.
As the choice of w E TpM was arbitrary, g(p)(Vp - Vp, w) = 0 for all
w E TpM. Since g(p) is nondegenerate, it follows that Vp - Vp = 0, that is,
Vp = Vp. Asp EM was arbitrary, we conclude that V = V.
Solutions 399
((('1/Jt)*g)(V, W))(p)
= g('l/JtP)((d'l/Jt)pVp, (d'l/Jt)pWp)
= ((dx),;,.p ® (dx),;,.p + (dy),;,.p ® (dy),;,.p
+(dz),;,.p ® (dz),;,.p)((d'l/Jt)pVp, (d'l/Jt)pWp)
= vx(p)Wx(p)
+ ((cos t)VY(p) - (sin t)Vz(p))((cos t)WY(p) - (sin t)Wz(p))
+ ((sin t)VY(p) + (cos t)Vz(p))((sin t)WY(p) + (cos t)Wz(p))
= vx(p)Wx(p) + ((cost)2 + (sint)2)VY(p)WY(p)
+((-sint) 2 + (cost)2)Vz(p)Wz(p)
= g(p)(½,, Wp) = (g(V, W))(p).
= 12 + 02 + v5 = 1 + v5,
g(p )vv = (Ov,pX ) 2 + (Ov,pY ) 2 + (Ov,pZ ) 2
= (!: r + (!: r +
(p) (p) (!~uv) (p) r
= 02 + 12 + u5 = 1 + u5,
g(p)uv = g(p)vu = (ou,px)(ov,pX) + (ou,pY)(ov,pY) + (ou,pz)(ov,pZ)
Thus
But p >--+ [adj G(p)]ij belongs to C 00 (U) as well. Hence p >--+ [(G(p))- 1]ij
belongs to C 00 (U).
= f✓ ~ d2 ( - 1) dt = ✓d2( : 2 - 1)
< vd 2(1+ ~:-1) = T.
In the above, the map t : M -> JR is the t-coordinate map, that is, the map
M = !Rx (2m,oo) xS 2 3 (t,r,p) >-+ t.
Solutions 405
Tmbit = ✓ (1- 3r7) (t(q) - t(p)) < ✓ (1- 2r7) (t(q) - t(p)) = T.tationaxy•
2m)- 1( r ') 2 = -1
2m) (t ') 2 + ( 1--;=-
- ( 1--;=-
E=vl- 2m_
r1
Hence
( dr) = E 2 _ 1 + 2m = 2m _ 2m.
2
dr r r r1
So, while r increases from r = ro tor= r1, we have
dr _. f2m 2m
dT - v-;::- - -;:;,
406 A Mathematical Introduction to General Relativity
and on the return trip from r = r 1 tor= r 0 , we have a minus sign in front
of the square root. Consequently the total proper time to make the round
trip is
Let To, T2 E J be such that r(ro) =ro =r(r2). Ifs denotes the inverse of the
(increasing) function r: (ro,r1)-+ (ro , r1) , then
. /i 2m
d(t o s) I = t'(r)-1- = V - -;:-;- 1
du r=r(T) r'(r) 1- 2m . /2m _ 2m
r V r r1
(17.1)
Also, we compute
d(Vl - ~l:!..t)
dr 0
Solutions 407
Using (17.1) and the above expressions, we obtain (after some algebra)
ro >-+ P , f 1 t - f1r,
(ta, t a)= B
ea .L A=,( to)
tB - l. sinh(toa)
The slope of the segment AB is -1, giving
tB -
r (t 0a) = -1, that is,
a cosh
1 etoa
tB = -(cosh(toa) + sinh(toa)) = - .
2a 2a
So the time f1t it takes for the photon ,\_ to reach the horizon in the
(U, x)-coordinates is
2a
Hence
E= Ii = Ii = lia.
2f1t 2 e- toa e-toa
2a
408 A Mathematical Introduction to General Relativity
17 More precisely, with p :=>..+(to), we ought to write V>.+,P = o,,o,p+ 0,,1,P = I(eo +e1),
where I : V -+ TpM is the isomorphism described in Example 2.2, p.28.
Solutions 409
Chapter 6
= V(W,X).
=VvW+cVvX.
(2) Vv(JW) = Vi(JW)irfjOxk = JViWiI'fjOxk = JVvW.
(3) VJV+xW = (JV +X)iWiI'fjOxk = JViWiI'fjOxk + XiWiI'fjOxk
=JVvW+VxW.
Hence:
(Cl) v'v(W +cX) = Vv(W +cX)+v'v(W +cX)
~ VvW+cVvX+v'vW+cv'vX
= v'vW +cv'vX.
(C2) v'v(JW) = Vv(JW)+v'v(JW)
~ JVvW+(VJ)W+fv'vW
= fv'vW+(VJ)W.
(C3) v' JV+xW = VJV+xW + v' JV+xW
~ JVvW+VxW+fv'vW+v'xW
= fv'vW+v'xW.
So v' is a connection.
Solutions 411
Solution to Exercise 6. 7
For f E C 00 (M), V, W,X E TJM, !1,8 E TfM, we have:
T(n + Je, V, W) = (n + f8)(v'vW-VvW)
= n(v'vW-VvW) + f8(v'vW-VvW)
= T(n, V, W) + f T(e, V, W),
T(n,Jv+x, W) = n(v'1v+xW-V1v+xW)
= n(Jv'vW + v'xW-JVvW-VxW)
= Jn(v'vW-VvW) + n(v'xW-VxW)
= f T(n, V, W) + T(n, X, W).
Finally,
T(n,V,JW+X)
= n(v'v(JW +X) - Vv(JW +X))
= n((VJ)W + fv'vW
+ v'vX - (VJ)W - JVvW - VvX)
= n(J(v'vW-VvW) + v'vX-VvX)
= Jn(v'vW-VvW) + n(v'vX-VvX)=JT(n, v, W) + T(n, v,x).
So T is a (1, 2)-tensor field.
Solutions 413
that is,
k k .
Thus Oi = riidx1 .
Existence: Define Of : TJU --> 0 00 (U) by Of = rfidxi E TfU. If V E TJ M,
then Vlu E TJU, and we can decompose Vlu = vtaxl, giving
. . £
(Oi V)Oxi = r]rdxr (V Oxl )Oxi = rtr. £
V 8I Oxi
-- rjir vr UxJ
:l . -
-
vrr, :l .
V Oxr Ux-i
m
= I; (sign 1r) I; gl7r(l) · · · g(i-l)1r(i-l) (Oxkgi,r(i)) g(i+1)1r(i+l) · · · gm1r(m)
1rES-m i=l
m
= I; I; (sign1r) gl7r(l) · · · g(i-l)1r(i-l) (Oxkgi,r(i)) g(i+1)1r(i+l) · · · gm1r(m)
i=l 1rES-m
gml gmm
where the summation convention is not used here, and Qii is the (i,j)th
minor of G, that is the matrix obtained from G by deleting its ith row and
jth column.
As G is invertible, we have a- 1 = de!a[adj G], where adj G is the
adjugate of G (the matrix whose entry in the ith row and jth column is
(-l)i+i detGii_ Since G is a symmetric matrix, we have Qii = (Gii)t, and
so detGii =det((Gii)t) =detGii_ Hence
(- 1 )i+j
(G- 1)ii = detGii_
detG
So (not using the summation convention),
m m
Oxk detG = ~ ~(-l)i+i(Oxkgij)(detG)(G-l)ij(-l)i+j
i=l j=l
m m
= (detG)~ ~ gijOxkgij·
i=l j=l
k gkl
Recall that rii = 2 (oxigje + Oxig;e - oxtgij)-
m j m m gif
~ rij = ~ ~ 2(0xigje + Oxigie - Oxlgij)
j=l j=ll=l
m m gif m m gif
= ~ ~ 2 ox;gje +~ ~ 2 (oxigie - Oxtgij)
j=ll=l j=ll=l
m m gif
= ~ ~ 2 ox;gje + 0,
j=ll=l
416 A Mathematical Introduction to General Relativity
where we have used gif = gfi, and exchanged the dummy indices j, e in
order to obtain the O in the line above. (Indeed, if
m m gif
S := ~ ~ 2 (c'\igu - c'\tgij),
j=lf=l
then after the j +-+ eexchange,
m m gfj m m gif
S = ~ ~ 2 (oxtgij - Oxigie) = ~ ~ 2 (-l)(Oxigie - Oxtgij) = -S,
f=lj=l j=lf=l
giving S = 0, as claimed.) On the other hand, also
Dxi (log-v'- det G) = Dxi log(-:et G) =
-2 et
~ G (- det G) f: f: gieoxigje
i=U=l
m m gif
= ~ ~ -Oxigje-
i=lf=l 2
Here we used, for h E C 00 (U) and a real-valued f smooth on an open set
containing h(U), that OxiU Oh)=(!' 0 h)ox;h: for all p Eu,
o(J oho x- 1 ) , ,
(ox;(Joh))(p) = 0u•. (x(p)) = f (h(p))oxi ,Ph=((! oh)ox;h)(p).
We have, in the case of the sphere with the chart (U, r.p), that:
R(o0,0¢)00 = "va 8 "va"'o0 - '\la"' "va 8 00 - "v[a 8 ,a<t>Jo0
= "v i3a (r:cf,00+ rtcf,a¢) - "v a<p (r:000 + rt0ocp)
= "v a8 (( cot 0)0¢) - 0 = (00 cot 0)o¢ + (cot 0)'\l a8 oc/>
(v'vR)(n,x, Y, Z)
= V(R(O,X, Y, Z)) - R(v'vO,X, Y, Z) - R(O, v'vX, Y, Z)
-R(O,X, v'vY, Z) - R(O,X, Y, v'vZ)
= V(-R(O, Y,X,Z)) + R(v'vO, Y,X,Z) + R(O, Y, v'vX, Z)
+R(O, v'vY,X, Z) + R(O, Y,X, v'vZ)
-------------
= -(V(R(O, Y, X, Z)) - R(v'vO, Y, X, Z) - R(O, v'v Y, X, Z)
-R(O, Y, v'vX, Z) - R(O, Y,X, v'vZ))
-------------
= -(v'v R)(n, Y, X, Z).
r
- ~T(01,--- ,v'Jv+wO\··· ,nr,X1,··· ,Xs)
i=l
s
(Vv(W ® T))(n, e, X)
= V(W(n)T(e, X)) - (W ® T)('\lvn, e, X)
- (W © T)(n, '\lv0, X) - (W © T)(n, e, '\lv X)
= V(Wn)T(e, X) + (Wn)V(T(e, X)) - W('\lvn)T(e, X)
- (Wn)T('\lv0, X) - (Wn)T(e, '\lv X)
= (V(Wn) - W('\lvn))T(e, X)
+ W(n)(V(T(0,X)) -T('\lv0,X) -T(e, '\lvX))
= (v(nw) - (Vvn)W)T(e, x) + w(n)(VvT)(e, x)
= (v(nw) - v(nw) + n(VvW))T(e,x) + (W® (VvT))(n,e,x)
= ('\lvW)(n)T(e,x) + (W®('\lvT))(n,e,x)
= (('\lvW) ®T)(n, e, X) + (W ® ('\lvT))(n, e, X)
= (('\lvW) ®T + w © ('\lvT))(n, e, X).
Chapter 7
Chapter 8
that is,
0(t) - (sin0(t))(cos0(t))(¢(t)) 2 = 0
¢(t) + 2(cot0(t))0(t)¢(t) = 0.
We had seen that in (U, rp), the component matrix for g is given by
Integrating from t = 0 tot, we get log ~(t) = -2 log si~ 00(t), that is,
¢(0) sm o
· ¢(0)(sin0o) 2 c
(17.11)
¢(t) = (sin0(t)) 2 = (sin0(t)) 2 '
where c:=¢(0)(sin00 ) 2. Substituting¢ from (17.11) in (17.10) gives
. 2 . 2 c2
(0) +(sm0) (sin0)4 =l.
So
c2
1---- (17.12)
(sin0)2'
where we consider the portion of the geodesic traversed in the 'increasing 0
direction', so that 0 > 0. Thus we can invert the function 0. Leth= 0- 1 .
Solutions 427
With a slight abuse of notation, we denote (<I> o h)(0) simply by¢. Then
<p-<po =Sin-l(ncot0o)-sin-l(ncot0),
1-c2 1-c2
(sin0)(cos¢)(sina) - (sin0)(sin</>)(cosa) - n c o s 0 = o.
1 - c2
dg('Y(-))(v.,,,.,,c-), X.,,c,))
dt
= g('Y(·))(v'v,, V.,,,X.,,(,)) + g('Y(-))(v.,,,.,,(·), (v'v,,X.,,(,))(-))
= g('Y(·))(0,X.,,c,)) + g('Y(-))(v.,,,.,,c-), (v'v,,X.,,(,))(-))
= g('Y(·))(v.,,,.,,c-), (v'v,,X.,,c,))(·).
By Remark 7.1, for each 7(t), t E I, there exists an open set U and a
V E TJU which extends V.,, in U, i.e., V.,,(t) = V.,,(t) for all t E J. By the
Killing equation (Exercise 6.26),
x(t) =a+ a lo
t
e
_ 213 T
dr =
{a+ at
i - e - 2 '3•
a + a ----;;:r i
if /3 = 0,
"f /3 O
=I= •
It can be checked that the y and x obtained above solve the geodesic equa-
tion and the initial conditions.
Let p = (a, b) and q = (c, d). We consider the two possible cases:
1° b = d. Take for example f3 = 0, and a = 1. We solve for T E JR so that
x(T) = c: c = x(T) =a+ aT =a+ lT yields T = c - a.
Then y(T) = b + (3T = b +OT= b = d.
2° b =I= d. Take for example f3 = 1 and T = d - b.
Theny(T) = b+f3T = b+l(d-b) = d. Wesolveforasothatx(T) = c:
2(c-a)
c = x (T) = a + a
i-e-2,ST i-e2(d-b) •
213 = a+a 2 gives a = i-e- 2 (d-b).
So given any p, q E JR 2 , there is a geodesic which passes through p and q.
430 A Mathematical Introduction to General Relativity
-2
-3
Solution to Exercise 8. 7
With J = (a - c,b- c) and h(s) = s + c, s E J, we have h(J) = (a,b).
Fors E J, h(s) E (a, b), and as, is a geodesic, (v'v., V1)(h(s)) = 0, giving
This gives the first two equations. For the third equation, we look at the
r-component in the geodesic equation:
0 = r" + r;t(t') 2 + r~r(r') 2 + f00(0') 2 + r;1(¢') 2 = r;i(t') 2 + r;1(¢') 2
-1 = - (t ,) 2 ( 1 - --;-
2m) 1r) 2(')
+ r 2 (.sm 2 ¢ 2 = - (')
t 2 ( 1 - --;-
2m) + r 2(')
<p 2.
Using (17.13) to replace (¢') 2 in the above yields, after a rearrangement,
(1 - 3~)(t')2 = 1.
Let P2 = 'Y(O), Pz
= 'Y(T ,b,,) E M be the events along 'Y corresponding to
0
Chapter 9
Solution to Exercise 9. 7
Recall that r:4> = -( sin 0) (cos 0) , rt¢ = cot 0 = r: 0 are the only nonzero
connection coefficients in the given chart. We have
ft1>1>1>1> = o ft 1>1>1>0 = o =o
ft 1>1>01> ~ ¢¢00 =0
~ ¢0¢¢ =0 R 4>04>0 = -(sin0) 2 R 4>004>= (sin 0) 2 ~ ¢000 = 0
~ 0¢¢4> =0 ~04>4>0 = (sin 0) 2 ~04>04> = -(sin0) 2 ~ 0¢00 = 0
R 001>1> =0 R 004>0 = 0 R 0004> = 0 R 0000 = 0
are the 16 components of the covariant Riemann curvature tensor field in
the given chart. Hence
R!e1> = 0 =1
R ! 00
Rte¢ = 0 Rt00 = 0
R: 4> =
0 -(sin0) 2 R:00 = 0
R: =04> 0 R:00 = 0.
~ ..9 ..9
= R.,, .,,
~
R¢0¢0 + R.,,..99 .,,R¢00¢
~ .. ~ ....9 ~
+ R 9 .,,.,,~ 9 ..9 .. ~
R0¢¢0 + R .,, .,,R0¢0¢
~
= - (si: 0)2 (-(sin0)2)+ (si: 0)2 (sin0)2+ (si: 0)2 (sin0)2- (si:0)2 (-(sin0)2)
= 4.
(div(T+cS))(V1, · · · , Vs-1)
= C((O, V) ,_. (v'v(T+cS))(nU, Vi,···, Vs-1))
= C((O, V) ,_. ((v'vT)(nu, Vi,··· , Vs-1)+c(v'vS)(nU, Vi,··· , Vs-1)))
= C((n, V) ...... (v'vT)(nu, V1, ... , Vs-1))
+c C((O, V) ,_. (v'vS)(nU, Vi,···, Vs-1))
= (divT)(Vi, · · · , Vs-1)+c(divS)(V1, · · · , Vs-1)
= ((divT)+c divS)(Vi, · · · , Vs-1).
div(Vi,) = (v'a.,;(Vi,))((dxi)U)
= Oxi(Vi,(dxi)U) - Vi,(v'a.,;(dxi)U)
= ox;(g(V, (dxi)U)) - g(V, v'a.,; (dxi)U)
= ox;(g(V, (dxi)U)) + g(v'a.,; V, (dxi)U) - ox;(g(V, (dxi)U))
= ((dxi)Ul(v'a.,; V)
= dxi(v' a.,; V)
= C((O,X) ,_. O(v'xV)} = divV.
Hence the light was emitted 13.8 - 10.5 = 3.3 billion years ago.
436 A Mathematical Introduction to General Relativity
= ( oosot
2 (x o rt k o(x o r)i
(s,t)+rij(r(s,t))ot
o2 (x o r)i ~
(s,t)os2 (s,t))Oxk,r(t,s)·
So (v'J(•,t) V(·, t))(s) = Wk(t)oxk,r.(t), where
Wk= o2(xor)k( ·) I'~-(I'( ·))o(xor)i( ·)o2(xor)i( ·)
osot s, + •J s, ot s, os2 s,
is smooth on I. Consequently, t ....... (v' J(·,t) V(·, t))(s) E TJrs.
(2) Analogous to the proof of (1). We have
(v'v(s,-)J(s, ·))(t)
o2 (x o rt o(x o r)i o2 (x o r)i ~
= ( otos
k
(s,t)+rij(r(s,t))os (s,t)ot2 (s,t))Oxk,r(t,s)·
so that (v'v(-,t)J(·, t))(s) = Xk(s)oxk,f,(s)' where
Xk = o2(x o r)k (· t) I'~- (I'(· t)) o(x o r)i (· t) o2(x o r)i (· t)
osot ' + •J ' OS ' ot2 '
1~
is smooth on (-1:, 1:). Hence s ....... (v'v(s,-)J(s, ·))(t) E T 0 I't.
:;g(rs(· ))(J(s, ·), V(s, -))) (to)= g(r s (to))((v' V(s,-)J(s, ·) )(to), V(s, to))= 0,
the ordinary differential equation
d2
dt2 (g(r s (t))(J(s, t), V(s, t))) = 0 (t E I)
has the unique solution given by g(I's(t))(J(s,t), V(s,t)) = 0 for all t EI.
Solutions 437
_2 2m 1 . m 1 1 1 1
So 100s ~ (2m)3 = 4m2, i.e., M0 ~ 2 IOs-1 5 x IQ-6s = IQ-4 = 10000.
Consequently, the mass of the black hole should be at least ten thousand
solar masses.
Chapter 10
(Altw)(v1,··· ,vk) = t, ~
. 1res.
(signa)w(vu(l),··· ,Vu(k))
= t, ~
"1res.
(signa)(signa)·w(v1,··· ,vk) = t, ~
"1res.
lw(v1,··· ,vk)
(7r(.Cv!1))(Vi, · · ·, Vk)
= (sign7r)(.Cv!1)(Vi, · · · , Vi).
Thus 7r(.Cv0) = (sign7r) (.CvO) for all 7r E sk, and so .Cvn E nk M.
Solutions 441
n/\e = (xdx+ydy+zdz)A(ydx+zdy+xdz)
= (xydxAdx+xzdxAdy+x 2 dxAdz
+y2 dy I\ dx+yzdy Ady+yxdy Adz
+zydz I\ dx+z 2 dz Ady+zxdz I\ dz)
= (0+xzdxAdy- x 2 dzAdx-y 2 dxAdy+0
+~~/\~+~~/\~-~~/\~+~
= (zx -y2 )dxAdy+(xy- z 2 )dyAdz+(yz - x 2 )dzAdx.
442 A Mathematical Introduction to General Relativity
((/*(0A8))(Vi, · · · , Vk+e))(p)
= (0 A 8)(J(p))(dfp(V1)p, · · · , dfp(Vk+e)p)
Chapter 11
= a i11 · ·· a i,n ( 1
m € /\ • · • /\ € m) (e'l.1,
· · )
• • • , e Z,n
= (det[aj])·l=det[a;].
We note that the third equality can be justified by noting that a sum-
mand will be nonzero only when (i 1, • • • , im) is a permutation of (1, • • • , m),
thanks to the skew-symmetry of w B = €1/\ • • •/\ €m.
(1)=?(2): If Bis positively oriented with respect to WB, then we have that
WB(e~,··•,e~)>0. From the above, det[a;]=wB(e~,··•,e~)>0.
(2)=?(3): If det[a;] > 0, then WB(e~, •••, e~) >0. But WB,(e~, •••, e~) > 0.
The k relating these nonzero top forms, WB' = kwB, is easily seen
to be >0 by operating on them-tuple (e~, •••, e~)- So WB ~wk.
(3)=?(1): If WB~wk, then we have WB=kwB, for a k>0. Hence we obtain
that WB(e~, · · · , e~) = kwB,(e~, · · · , e~) = kl >0. Consequently,
(e~, •••, e~) is positively oriented with respect to [wB]-
J(u, v) = [ _ (::,::~:,
(u2+v2)2
1
Thus det J(u, v) = ( 2 2) 2 < 0. So A is not oriented.
u +v
Solutions 447
l
having positive determinants.) Recalling 'Pn o <p- 1 from Example 1.8 (p.7),
we can compute its Jacobian to be
-cos¢> _ (sin0)(sinef>)
[ 1-cos0 1-cos0
-sin¢> (sin0)(cosef>)'
1-cos0 1-cos0
having the determinant
-sin0
(1-cos0) 2 < O.
Consequently, the determinant of the Jacobian of 'Pn o <p- 1 is
(- sin 0) sin 0 0
(detR)· (1-cos0) 2 (1-cos0) 2 > ·
J,Xa(i)Ua(i) (ox•fk)(x:(i)u)du 1 1
···du
m
=
i o(fkox:hi)
0 k
Xa(i)Ua(i) U
1
(u)du ···du
m
=i 09 ~(u)du 1 ···dum=
Xa(i)Ua(i) OU
f
JR= OU
09~(u)du 1 ---dum
f M
d(cpin) = L J,
j Ua(j)
cpjd(cpin) = L J,
j Ua(j)f"'>Ua(i)
cpjd(cpin)
since the contributions of the thin sets 0n(Un\U) and cp8 (Us\U) are 0. Here
(U, cp) is the spherical coordinate chart from Example 1.8 (p.7).
We now use a change of variables given by it>= IPn o cp- 1 in the first
integral, and <I>= 'Ps o cp- 1 in the second integral. Using the transformation
relations among gg and g~n, respectively g~•,
as expected.
450 A Mathematical Introduction to General Relativity
l
belongs to A*. It is enough to show that n coincides with volg on (U, x)
(since then by continuity they coincide also on M). In (U, x), we have
(1-2;1)-l
[g(ox;, Oxi )] = [ r2 r2(sin0)2 .
-(1-2;')
So det[g(ox;,Oxi)] = -r4 (sin0) 2 , and y'ldet[g(ox;,ilxi)]I = r 2 sin0. Thus
in the chart (U, x), we have
volg = r 2 (sin0)dr A d0 I\ d<p I\ dt = r 2 dr I\ ((sin0)d0 I\ d¢) I\ dt
= r 2 dr I\ (volg} lu) A dt.
Note that in the above, we used det[v;] > 0, which follows from the fact
that if [w] is the orientation on V, then w(e 1 , • • • , em) > 0, and also
1rES-m
= det [
w~v w~v 1 · · · w 1 ~k-l
.. ..
wkv wkv1 · · · WkVk-1
..
l .
k
Consequently, iv(w 1 I\··· I\W 1 ) = ~ (-1r- 1 (wrv)w 1 /\ · · · Aw"/\ ···/\Wk.
r=l
k -
= WJ0J I:(-1r-lfir(v)fi1 /\ ... /\fir/\ ... /\ fik I\ fJ
r=l
f -
+w10J I:(-lt+r-lfjr(v)fl /\ fj1 /\ ... /\ fjr /\ ... /\ fjl
r=l
= (ivW)/\0 + (-llwA(iv0).
452 A Mathematical Introduction to General Relativity
and so
**n = -Wx * (dx) + Wy * (dy) - Wz * (dz) - Wt* (dt)
= -wx(-dyAdzAdt) +wy(dxAdzAdt)
-wz(-dxAdyAdt)-wt(-dxAdyAdz)
= WxdyAdzAdt + wydxAdzAdt + wzdxAdyAdt + WtdXAdyAdz
= n.
With k=3, m=4, and i(g)=l, we have (-1?(m-l)+L(g)=(-1)1°=L
454 A Mathematical Introduction to General Relativity
Chapter 12
·
R earrangmg, v=cw, wh ere c := g(p)(v,v)
( )( ) . Clear1y c# 0 .
g p w,v
c> 0: Taking scalar product in (17.14) with the time-orientation V at p,
g(p)(v, v)
g(p)(v, Vp) = g(p)(w, v) g(p)(w, Vv),
Thus
.
1.e., lul
1/31-- ---;::::=;::::::::;;.:. so -2£ -- ~1 -1 - ✓ - Iu 12 .
yl - lul 2 2L 1+132
Solutions 455
B= ( ~ a2-1)
a 2 +1' a 2 +1 ·
To get the coordinates of B', we note that the reflection B of B' in the
. . ~ 2/3 /3 2 - 1 2/3 1 - /3 2
x-axis has the coordmates B = ( 132 + 1 , 132 + 1 ), and so B = ( 132 + 1 , 132 + 1 ). So
the equation of BB' is
a 2 -1 1- 132 a2 - 1
y-~ -----
132 + 1 a2 + 1
2a 2/3 2a
x---
a2 +1 132 + 1 a2 +1
To find the x-coordinate of the intersection point of BB' with the x-axis,
we set y = 0, and solve for x, which after some algebraic manipulations,
yields
a+/3
X= - - =a@/3.
l+a/3
456 A Mathematical Introduction to General Relativity
Claim: If BE v1_ satisfies for all x, y E v1_, volg(p) (x, y, B, v) = F(p)(x, y),
then B=B.
Suppose that w := B - B =I 0. Set Ji= aw, where a= 1/y'g(p)(w, w).
Then we can find vectors h, h E v1_ so that {/1, h, h} is an orthonormal
basis for v1_. As (Ji, h, h, v) is an orthonormal basis for TpM, we have
that volg(p)(/i, h, h, v) =I 0. But
0 = F(p)(h,h)-F(p)(h,h) = volg(p)(h,h,B,v)-volg(p)(h,h,B,v)
= volg(p)(h,h,afi,v) = avolg(pJ(/1,h,h,v),
and so volg(p)(/1, h, h, v) =0, a contradiction.
458 A Mathematical Introduction to General Relativity
ot2 = ot at = o/- J + v x 8 ) = - ot + o/ v x 8 ) = - ot + v x at
= -
oI
ot + v'
-+
X (-
-+
v' X
-+
E) = -
oI
ot -
-+ -+ -+
v' ( v'. E) + D. E
-+
oI -+
= - - - v'p + D.E and
-+
ot '
o2 B o oB o -+ -+ -+ oE -+ - -+ -+
ot2 = ot at = o/- v' X E) = - v' X at = - v' X (- j + v' X B)
Recall the volume form field volg 52 on (S2 , g 8 2) as in Example 11.4 (p.251).
Then the ordered basis (00,p, o,t,,p) is positively oriented with the induced
orientation on TpS 2 , for all p EU, and volg 52 (p)(o0,p,o,t,,p) = sin0 > 0.
With the orientation [O] on M, where n := r2dr I\ volg 52 A dt, for all p EU,
So for each p EU, ((V1)p, (V2)p, (Vi)p, (Vi)p) is a positively oriented ordered
basis for TpM.
Finally, we show that F satisfies the Maxwell equations d F = 0 and
d * F = 0. (The RHS of the second equation is 0 since J = 0.) That d F = 0
is easily seen, since
*F = e2 * ( ~ d r " v'i[r)dt)
r V f(r)
= ; 2 sign ( ! !)g(Vi, V2) g(Vi, Vi)
~ ~ 0 2A 0 3
Chapter 13
T~ ~ ~ l·
Since Tii =T(oxi, oxi) =mv*g(ot, Oxi) g(8t, oxi) = mv*(-8i0)(-8jo),
[Tij] = [
0 0 0 0
As the curvature tensor field for the Minkowski spacetime is identically
zero, it follows that the Ricci curvature tensor field Ric= 0, as well as the
scalar curvature S = 0, and so the field equation becomes 0 - 0 +Ag= 81rT,
and in the chart used above, this acquires the form
This holds if and only if 0 =A= - m v*. As m > 0, the field equation holds
if and only if v* = 0.
-3aa(:2
·2
-
A
3) = -3: ·3
+ Aaa = 81ra(3pa + 2pa + pa). (17.17)
But, by first Friedmann equation
a2
---
A 81rp
a2 3 3
So, replacing the bracketed expression on the LHS of (17.17) by~,
. B1rp = 81ra (3 pa+2pa+pa.
- 3 aa . . . )
3
Simplifying and rearranging, it follows that pa+3pa+3pa=0. Multiplying
throughout by a 2,
Chapter 14
[g.J] - 1
1
l[
trix of g is (with h :=H(x, y, t - z)):
. . -[-1 l
+
2~
-2h
0
~ ~
0 0
O O
-2~] - [-1 + 2h
0 -
2h -2h
l
1
-2h
1+2h
l
.
To see that Y is lightlike every where, we note that using the basis
(ot, ox, oy, oz), the vector field Y is represented by the column vector
(1, 0, 0, 1), and so computing
-1 + 2h - 2h ] [ 1 ] [ 1]
[11] [ -2h 1+2h 1=[-11] 1 =-1+1=0,
J
given above to compute
[ 2h + 3 2h + 1 ] [-1 + 2h - 2h ] [ : ~
2v'2 2v'2 -2h 1+2h 2h+l
l 2
2y'2
3
= _1
.
From the form of [g(¼, ½)], it follows that at each point p E M, the set
{ (¼)v : i = 1, 2, 3, 4} is an orthonormal basis for TpM with respect to the
scalar product g(p), and the index of g(p) is 1. Thus g is Lorentzian.
The scalar curvature is S = gii Ricij. To find [gii] = [gij J- 1, note
-1
l
-2h 1+2h -2h 1-2h .
Thus
.. [-1-2h -2h
[g'J] = 1 1 .
-2h 1-2h
As the only nonzero components of Ric are the (i,j)-components with
i,j E {1,4}, we obtain, with tlh := ox(oxh) + oy(oyh), that
S = g 11 Rieu + g 14 Ric14 + g 41 Ric41 + g 44 Ric44
= (-1- 2h)(-tlh) + (-2h)(tlh) + (-2h)(tlh) + (1- 2h)(-tlh)
= (tlh)(l + 2h - 2h - 2h - 1 + 2h) = (tlh)(0) = 0.
For a harmonic (in x, y) h, Llh=0. So Ric= -(tlh) (dt-dz) ® (dt-dz) =0.
So the vacuum (T = 0) field equation is satisfied with the cosmological
constant A= 0:
Ric - ~g +Ag= 0 - ~g + 0g = 0 = 87!"0 = 811"T.
Conversely, if the vacuum field equation is satisfied, then we have
Ric= Ric - ~g + 0g = Ric - ~g +Ag= 811"T = 871"0 = 0.
So Ric=0. In particular, Ric(8t, 8t) =0, i.e., -Llh=0. Thus his harmonic.
We have Ll(x 2 -y2 ) = 2 + (-2) =0, and tl(xy) =0 + 0=0. Thus
1 1
tlh = 2/(t- z)tl(x 2 -y 2 ) + g(t- z)tl(xy) = 2/(t- z)0 + g(t- z)0 = 0.
464 A Mathematical Introduction to General Relativity
J
satisfies (dt + exdy)½ =0 and g(½, ½) = 1.) Then
-y
Recall that (g<•) )ii is the entry in the i th row and i h column of c<s). Hence
(7 (g<•) )ij I
= [-Y HY]ij = - T/ik hkf T/fj .
OS s=O
466 A Mathematical Introduction to General Relativity
Consequently, hv = .Cvg.
2m)(t ') 2 +
- ( 1---;=- ( 2m)- 1( r ') 2 + 0 + 0 = -1.
1---;=- (17.18)
The other two equations are obtained by looking at the t- and r-components
of the geodesic equation ;;/ + rf/fyi = 0. Recall the connection coefficient
formulae from Example 6.8 (p.124). The t-component of the geodesic equa-
tion , O = t" + rttt (t') 2 + 2rttr t'r' + rtrr (r') 2 , gives
2m(1 -
2m)- 1
O=t"+O+- - t'r'+O.
r 2 r
Similarly, the r-component, 0 = r" + rrt (t') 2 + 2r~tr't' + r~r(r') 2 , gives
1 (') 2
0 = r ,, + -m( l -2m)(')
- t 2 +0- -m(1 -
2m)-
- r
r2 r r2 r
1
= r ,, - rm(
2 - ( 1- --;::-
2m)(') 2
t 2m)-
+ ( 1- --;::- (r' ) 2 )
= r II - -m ( -l ) (using (17.18)).
r2
Solutions 467
-J
g(p) ( t' Ot,p + r' Or,p, (1- ";' )- Ot,p) V
2 1 r '
-- -r'v(
t'
2m)-1
1-r- ar,p·
Thus
For lul « 1 and r » 2m, we obtain lr'I « lt'I- Equation (17.18) now shows
that t' ~ 1. From the equation for t 11 above, we see that if we neglect the
small terms, we get t 11 ~ 0. We note that
dh __l _
and
d2h t 11 (h(·)) 1~
t 11 (h(·))
dt t'(h(-)) dt 2 (t'(h(-))) 2 (t'(h(-))) 3 .
Thus
d(r oh) dr dh , l
dt = dr dt = r (h(-)) t'(h(·))'
d2(r oh) 11 l 1 , t 11 (h(·))
dt2 = r (h(·)) t'(h(·)) t'(h(·)) - r (h(·)) (t'(h(-))) 3 .
Using t' ~ 1, lr'I « t', and r»2m, we get
d2 ( ) 11 l , 1 11 m l , 1 ( - 2m) ( 2m )- 1 , ,
dt2 roh =r (t')2-r (t')3t =-r2(t')2 -r (t')3 r2 1- r tr
Chapter 15
s; (
ro
= - ~ t~(u-lva, _ u+ly1o,))du
u2 +a+
_
-
~
c
ya
3 + Ya Y ro Yr + a log (Fo-va.
ro.jro-ryr 2 r::( ~- r.::)
c
vr+va)
c . r,;;::
yro+ya
r,;:
yr-ya
The plot with the given numerical values is shown below.
-I
Finally, for the star to not shine, we must have that the escape velocity
v(R) at its surface exceeds the speed of light, that is, v(R) > c. Since
m = (volume)· (density)= 4; R3 p, substituting this in the expression for the
escape velocity, the inequality v(R) > c gives
2 GN \,rR R p > c,
3 .
1.e., R >
~c 2
- -0 .
81r NP
470 A Mathematical Introduction to General Relativity
Consequently,
Solutions 471
Chapter 16
dr d0o d<po
As v,,,,,(r) = dr or,,,(r) + dro0,,,(r) + dr oc/>,,,(r) = or,,,(r), we have
1
u('Y(r)) (v,,,,,(r), v,,,,,(r)) = l-kr 2 .
Thus if k = 0, then
L(r) = lr l
~ = = dr =
o ✓l-kr 2
-
o 1-0
dr = r.
lrl
If k < 0, then we use the substitution sinh u = H r E (0, oo) to obtain
L(r)= l r
~==dr=
o ✓ l-kr 2
l lsinh-i(v"='icr)
o
1 coshu sinh- 1 (Hr)
----du=--=--.
coshu R R
If k > 0, then we use the substitution sin u = v'kr E (0, 1) to obtain
By the mean value theorem, for x > 0, there is a Cx E (0, x) such that
. .. + 0 = 81T (pa
2 aa . 2 + 2 paa. ) ,
3
.. 81Tpa 2 +2paa . . . a a2 k .
and so a= -3 2 a. . Substitutmg m 2- + 2 + 2 = -81Tp yields
a a a
281T pa+ 2pa a 2 .!5_ _ -8
3 2 a' + a2 + a2 - 1Tp,
and using a:a + a\ = 831T pin the left-hand side of the above now gives
iI=~ _ a2 =~3 _41r3 p- 81r3 p-~=-~81r pC:J ~(~ -H2) =~(~ -H2)
a a2 3 2 3 2 3 2 £ 2 ·
. if J(p-H 2 ) Hi+H) 3 3 1 3 3
q= (i-H) 2 = (i-H) 2 = i-H = -2+£ (i-H) = Lq - 2·
a 3 e-ft - 1
Integrating from t = 0 tot, we obtain e--rtq(t) - q(0) = 2 3/L .
1 1 ~ 1( ~)
I±H(o)=I±y-;p+v=I i±y1+A3.
Using these, and by substituting for H(t) from (17.19) into (17.20),
= IT
to
1
--dT
a(r) ·
477
478 A Mathematical Introduction to General Relativity
479
480 A Mathematical Introduction to General Relativity
vacuum, 294
vector field, 42
vector field along a smooth curve, 137
vector-valued function in R 3 , 225
velocity field of a perfect fluid, 298
velocity vector field along a curve, 139
volume, 251
volume form field, 250
volume form on a vector space with a
scalar product, 253