Professional Documents
Culture Documents
IN MATHEMATICS 22
A Course in
Differential Geometry
and Lie Groups
Texts and Readings in Mathematics
Advisory Editor
C. S. Seshadri, Chennai Mathematical Institute, Chennai.
Managing Editor
Rajendra Bhatia, Indian Statistical Institute, New Delhi.
Editors
V. S. Borkar, Tata Institute of Fundamental Research, Mumbai.
R. L. Karandikar, Indian Statistical Institute, New Delhi.
C. Musili, University of Hyderabad, Hyderabad.
K. H. Paranjape, Institute of Mathematical Sciences, Chennai.
T. R. Ramadas, Tata Institute of Fundamental Research, Mumbai.
V. S. Sunder, Institute of Mathematical Sciences, Chennai.
s. Kumaresan
University of Mumbai
~HINDUSTAN
U LQj UBOOKAGENCY
Published by Hindustan Book Agency (India)
P 19 Green Park Extension, New Delhi 110016
All export rights for this edition vest exclusively with Hindustan
Book Agency (India). Unauthorized export is a violation of Copy-
right Law and is subject to legal action.
Preface ix
1 Differential Calculus 1
1.1 Definitions and examples . . . . . . . . . . . . . . 1
1.2 Chain rule, mean value theorem and applications 16
1.3 Directional derivatives . . . . . . . . . . . . 21
1.4 Inverse mapping theorem . . . . . . . . . . 32
1.5 Local study of immersions and submersions 44
1.6 Fundamental theorem of calculus . . . . 46
1. 7 Higher derivatives and Taylor's theorem 48
1.8 Smooth functions with compact support 55
1.9 Existence of solutions of ODE . . . . . . 58
4 Integration 207
4.1 Orient able manifolds .207
4.2 Integration on manifolds . .214
4.3 Stokes' theorem . . . .223
Bibliography 288
Index 292
Preface
versational tone. I hope that beginners will find it easy for self-study. I
crave the indulgence of experts in case they find my style a bit too abra-
sive. I hope that this book will instill confidence in the reader, provide
a working knowledge and create a desire for further studies. I shall be
happy if readers appreciate my efforts in making their learning curve of
this subject less steep.
I welcome corrections and suggestions for improvement.
Mumbai S. K umaresan
December 2001 kumaresa@math.mu.ac.in
Chapter 1
Differential Calculus
Definition 1.1.3 We say two norms, IIII and 1111', on a vector space E
are equivalent if they induce the same topology.
Using the fact that IXil ~ IIxlb, it is an easy exercise to show the
following facts about the norms II Ill' II 112 and II 1100 on Rn:
(1.1.1)
S := {x E Rn : IIxlb = 1}.
Let m := minxEs Ixl. Then m > 0, since otherwise, there exists a vector
xES with Ixl = 0. By the definition of norm, X = 0, a contradiction
to the fact that xES. If.x E Rn is any nonzero vector, then u := ~ X II II2
lies in S so that lui ~ m. But lui = ,,~112 so that we have Ixl ~ m IIx1l2.
We may then take C2 := ;k and finally get Cllxl ~ IIxll2 ~ C 2 1xl.
o
As an application of this, we observe the following fact which will be
needed later.
4 1. Differential Calculus
Lemma 1.1.6 Let JRN := JR~ x JRm so that N = n rt- m. For any
z E JRN we write z = (x, y) with x E JRn and y E JRm. Then the map
z >-+ max{llxll, Ilyll} is a norm on JRN equivalent to the Euclidean norm.
o
Exercise 1.1.9 Let E and F be normed linear spaces. Let dimE < 00.
Show that any linear map I: E -+ F is continuous.
Differentiability
Definition 1.1.12 Let I: (a, b) -+ lR be a function. We say that 1 is
differentiable at x if the limit
. I(x + h) - I(x)
11m ';""';"---''---'-'--'-
h-tO h
exists. If the limit exists it is usually denoted by f'(x) and called the
derivative of 1 at x. We can reformulate this using Landau's notation.
Let (X,d) be a metric space and a E X. If I,g: X \ {a} -+ Care
functions, we say that
then the derivative of 1 at x gives the slope of the tangent line at (x, 1(x))
to the curve, that is, "the line corresponding to the linear map above" ,
which is the best approximate at that point. Now the formulation (1.1.2)
of the derivative of a function 1 is easily adapted to maps I: U C E -+ F
where E and F are real Banach spaces and U an open subset of E. To
understand most of what follows, you may assume that E = lRm and
F = lRn.
6 1. Differential Calculus
x +W := {y E E : y := x + w} C U.
Ilf(x + h) - f(x) - Ahll :S € Ilhll for all h with Ilhli <8. (1.1.5)
IIA(tv) - B(tv)11
~ lif(x + tv)- f(x)-A(tv) II + II f(x + tv) - f(x)-B(tv) II
:S 2€ It I for all t with It I < €.
1.1. Definitions and examples 7
Remark 1.1.15 Notice that we make crucial use of the fact that U is
open in Exercise 1.1.14.
Exercise 1.1.19 Can you define the second derivative f"(X) of a func-
tion? What is 1" (x) for f as defined in Example 1.1.17?
f(x) := Ax + v.
Hence,
IIf(X +H) - f(X) - (XH + HX)II = o(IIHID.
So if we define Df(X)(H) = XH + HX, then H H XH + HX is
continuous and linear, and Df(x) satisfies (1.1.3).
Exercise 1.1.22 Can you generalize Example 1.1.21 to (i) infinite di-
mensional spaces, and (ii) to higher powers?
f(t) := (fl~t)) .
fn(t)
Then, for any t E (a, b), the derivative Df(t) is a linear map from IR to
IRn. We claim that Df(t)(h) = hUf(t), ... , f~(t))t.
point t or I(t) and denote it by f'(t). More on this later. Such a map I
has a physical interpretation which is worth knowing. We think of the
vector I(t) as the position vector of a particle at time t. Then I'(t) is
thought of as the velocity vector of the movement of the particle.
Before considering the second special case, we recall a fact from linear
algebra in the form of an exercise.
Exercise 1.1.26 Let ep: IRn -+ lR be linear. Then ep(x) = (x,v) where
v =(ep(el), ... , ep(e n )) where ej is the standard j-th basic vector.
where the inner product is the Euclidean inner product. This vector
grad I (x) is called the gradient of I at x. We shall identify this vector
later. (See Lemma 1.3.4.)
for a a permutation of {I, ... , n}. Using Example 1.1.29 we find the
familiar rule for the differentiation of determinants:
(1.1. 7)
The reader is strongly urged to check this. The reader might have seen
this when the entries are differentiable functions lij: lR -+ lR in the
following form: if F := det(J;j) := det(FI , ... , Fn) then
in an obvious notation.
We can reformulate the above in a different set-up. As in Exam-
ple 1.1.21 we can identify lRn x ... x IRn (n-times) with M(n,lR) so
that the map in 1.1.6 is I: M(n,lR) -+ lR given by I(X) = det(X) .
The tuple (el' ... ' en) of the canonical basis vectors goes under this
identification to I, the identity matrix. Now what is I'(I)(H)? Let
H := (h ij ) E M(n, lR). We use (1.1.7) where hi = (h il , .. . , h in )
E E = IRn and Xi = ei. We then get
1 0 0 0 0
0 1 0 0 0
0 0
det(xI , ... ,hi, ... ,x n ) = det h ii
hil hi2 hin
0 0 0 0 0 1
= h ii ·
Thus we have
D I(I)(H) =L hii = tr (H). (1.1.8)
Exercise 1.1.32 Let E be a Banach space and Xk EE. If'L:kllxk II < 00,
then the sequence Sn := L:~=1 Xk converges to an element x E E and we
then write x := L:k Xk·
We now consider the map f: U ~ U given by A H A-I. We want
to show that f is differentiable and compute its derivative. Let A E U
and H E BL(E). We have
it).
Hence we have
DF(x,y) = ( i~ay ax
But then ~: = a and ~; = -b. This implies that J'(z) = ~: - i~; at
z, as claimed. Thus we have shown that DF(x, y) is the ]R-linear map
underlying the (:-linear map ( H !' (z )(.
On first reading the reader may skip the following and go
to the next example.
DF(x, y) = ( aug~ By
au)
au
ax ay
is Clinear if and only if DF(x, y) 0 J = J 0 DF(x, y). This means that
( auauay au)
ax _ (au
au - - ax au)
au ay
au .
ay - ax ax ay
That is, if and only if
8u = 8v and 8u = _ 8v .
8x 8y 8y 8x
The next couple of examples deals with the infinite dimensional set-
up.
Example 1.1.36 Let E be a real Hilbert space, that is, a Banach space
whose norm comes from an inner product. Consider I(x) := (x, x) for
x E E. What is DI(x)(h) for h E E? What can you say if we take
g(x) := (x, x)1/2? Investigate what happens when I is as above but E
is a Hilbert space over C.
Chain rule
Lemma 1. 2.1 Let I: U C X -+ Y be differentiable at a E U. Then I
is locally Lipschitz at a, i. e., there exists a·"8 > 0 and L > 0 such that
II/(a + h) - l(a)11 ~ L Ilhll il Ilhll < 8. Hence I is continuous at a.
Ilg(b') - g(b) - B(b' - b) II < c111b' - bll, II b' - b II < "l. (1.2.5)
Since f is differentiable at a it is locally Lipschitz at a: There exists a
02 > 0 and an L > 0 such that
IIf(x+t(y-x))-f(x)1I :S IIf(x+t(y-x))-f(x+s(y-x))1I
+ II f(x + s(y - x)) - f{x) II. (1.2.10)
Since f is differentiable on [x, y], given c > 0 with c < M - M o, for all
t near s we have:
IIf(x + t(y - x))-f(x + s(y - X))_f'(X + s(y - x))(t - s)(y -,x) II
:S cit - silY - xl·
Hence
Corollary 1.2.5 Let vEE. Applying the mean value inequality to the
function g(x) := f(x) - xv, we get
Notice that t r--+ ,(t) := x + t(y - x) is the curve through the point
x := ,(0) and having ,'(0) = y - x as its tangent at x. We are now
ready to prove the general version of the mean value inequality.
E := {y E U : f(y) = f(x)}.
I(x) = {O u3
u 2 +y2
if x = 0;
if x = (u, v) =1= (0,0) .
Note that I(ax) = al(x). Then for any vector w E }R2, we have
Exercise 1.3.3 Show that the function I: }R2 ---* }R defined as follows:
x (X 2 + y)-l if x 2 + y =1= 0;
I(x,y) = { oY .
otherwIse
We use the notion of partial derivatives to identify the vector grad I(x)
of a differentiable function I: }Rn ---* lR.
Now let 1:}Rn ---* }R be differentiable. Given x E E, DI(x) is a
linear functional on lRn. That is, D I(x) E (}Rn)*, the dual of}Rn. Since
1.3. Directional derivatives 23
DI(x)(v) := (grad/(x),v) = L 01
ox. (x)Vj.
j J
o
We use this to arrive at the matrix representation of the linear map
DI(x) of a differentiable map I: UC lRm -+ lRn. Write 1= (h,· .. , In).
From Proposition 1.1.24, we have
2: j ~(X)Vj)
(
2:j ~(x)Vj
( ~:(X)
~(x)
:::
...
~(x)). (~l)
:1: (x) Vm
. (1.3.3)
I(X+tei)-/(x)-t (ali)
:. <eltl·
a n1
24 1. Differential Calculus
d
Dvl(x) = dt (f oa(t))lt=o' where 17(0) =x and 17'(0) = v.
(Check this.)
kT· I:
k=O
For more on this map see Exercise 1.3.14 at the end of this Section.
Then the curve ,(t) := exp(tX) = etX satisfies ,(0) = I and ,'(t) = X.
Hence it follows from Example 1.1.30 and the displayed box that
(1.3.4)
Before going into a proof let us ask ourselves whether it sounds plausible.
If X is a diagonal matrix, say X = (x!, . .. , xn), that is, Xij = 6ijXi.
(6 ij is the Kronecker delta which is 1 if i = j and 0 otherwise.) Then
exp(X) = (eXt, . .. , e Xn ) so that Equation (1.3.5) is true for all diagonal
matrices. Hence it is true for all diagonalizable matrices. (Check this.)
(A matrix X is diagonalizable if there exists an invertible matrix A
such that AX A -1 is diagonal.) But unfortunately not all matrices are
diagonalizable. If you remember your linear algebra well, the second
best thing to test Equation (1.3.5) on matrices which are in the Jordan
canonical form. It is easily verified that Equation (1.3.5) remains valid
in this case too and hence it is true for all matrices. (Or an alternative
is to realize that any matrix can be put in an upper triangular form
and hence ... ) We shall not go into the details of a proof along these
lines. Instead, we shall give a proof using calculus based on " det' (I) =
tr". The proof below makes use of the crucial fact that the map A H
det(A) is a differentiable map from CL(n,JR) to JR which is also a group
homomorphism.
Let us consider the function g(t) := det(e tX ) for a fixed X E M(n, JR).
We then have
g'(s) = .!!:.g(s
dt
+ t) = .!!:.
dt
det(e(sH)X)1
t=o
d
= -(det(e Sx ) det(etx))1
dt t=O
d
= det(e Sx ) - (det(e tx ))
dt
It=O
= g(s)tr (X)
26 1. Differential Calculus
f'(A)(X) = ! f 0 'Y(t)lt=o
= ~det(Aetr1X)1
dt t=o (1.3.6)
= det(A) ~I e t tr (A -1 X)
dt t=o
= det(A) tr (A- 1 X).
You should work this out using the approach in Example 1.1.30 and
verify that the expression obtained that way is the same as the one on
the right side of Equation 1.3.6. This will convince you of the merit of
the geometric principle enunciated above.
There is one more thing that we would like to point out: the light
that the geometric principle sheds on Taylor's formula for an lR-valued
function of several variables. Let U be a convex or star-shaped open
set in lRn. Let x E U and y E lRn. Assume that x + y E U and
f: U --+ lR is a (k + I)-times continuously differentiable function. Then
Taylor's formula gives us an expression for f(x+y) in terms of f and its
derivatives in the direction of y. The increment in the variable x is y
and we wish to approximate (express) the increment in the value. That
is, f(x + y) - f(x) by means of the change of f at x in the direction of
y. This intuitive idea suggests us that we consider the function
Proof If we write I = (iI, ... , In), it suffices to prove the result for
each of /i's by Proposition 1.1.24. So we consider I: U -+ R.
If the result is true, then we must have, for a E U
28 1. Differential Calculus
Hence we define a linear map h H Ah by the right hand side of the above
equation and show that f is differentiable at a. Let h = (hI,"" h n ).
Then we let
Because, Xk and Xk-l differ only in their k-th coordinate, we can apply
the mean value theorem of one variable calculus to find a point Yk on
the line segment joining Xk and Xk-l to get
We therefore have
Hence,
Duf(x) = Df(x)(v).
To prove the nontrivial. part of the lemma, the obvious choice for
Df(x) is the linear map Ax given by E 3 v I-t Duf(x). Thus we must
show that
II f(x + h) - f(x) - Ax(h) II = 0(11 h II)·
We set g(t) := f(x + th). Then g'(t) = Ax+th(h). We now have by the
mean value inequality (1.2.13)
Since f has CI-directional derivatives, the left side is 0(11 y II) and hence
we see that Df(x) = Ax.
o
There is an important corollary of Lemma 1.3.9.
where except Xi all are kept fixed. If D;J are all continuous then f is
C I and we have, for hi E E i ,
Since 9 is continuous, the left side of (1.3.7) is o(IIY - xII) and hence we
see that f'(x) = g(x).
o
Then, for any x E U, the sequence {A(x)} has a limit, say f(x) E F,
A -t f locally uniformly on U and f is differentiable on U with f' = g.
8. Set g(t) := etXe- tX and conclude that etX is invertible for all
t E IR and for all X E M(n, 1R).
9. There exists a unique solution for f'(t) = AI(t) with initial value
1(0) = B given by I(t) = etA B. Hint: If 9 is any solution, consider
h(t) = g(t) e- tA .
Now
so that
(1.4.3)
Suppose y E B(Yo,8') for some 8' > O. We want to solve for the
equation f(x) = y with x E B(xo, 8). We define recursively
Xk = Xk-l - Y - f(Xk-l).
We need to check whether Xk E B(xo,8). We have
Xk - Xk-l = Xk-l - Xk-2 - (f(xk-d - f(Xk-2)).
Taking norm on both sides, using (1.4.3) and induction we get
1
II Xk - xk-lll ~ "2 II Xk-l - xk-211
1
~ 2k- 1 Ilxl - xoll
~ 21 - k 8'.
Hence if we choose 8' = (8/2), we have Ilxk - xk-lll ~ 2- k 8. In particu-
lar, II Xo - Xk II ~ =~=o 82- i < 8. This also shows that Xk is Cauchy. Let
34 1. Differential Calculus
x = limxk = lim(xk-l
k
- y - f(Xk-d) =x - y - f(x).
.!!L
ax,
The determinant of this matrix is --.E.La
a
Xn+l
f 0 and hence <I>' (p) is
invertible. Thus <I> is a "Cl-diffeomorphism" of an open neighborhood
1.4. Inverse mapping theorem 35
of V of pin IRn +1 onto an open set cp(V) C IRn +l, by IMT. (By a e 1 _
diffeomorphism we mean a map F which is e 1 , one-one on its domain
U and F(U) is open and F- 1 is also e 1 on F(U).) Now we introduce a
new set of coordinates on V by setting Yi(r) := Ui 0 CP(r), where Ui are
the 'usual' coordinates on IRn +l: Ui(X) := Xi. In plain language this is:
if 1 :::; i :::; n;
ifi=n+1.
With respect to this new set of coordinates Yi, S has a local description
around p on V n S: it is the "hyperplane" {Yn+l = O}. Thus by taking
a suitable system of coordinates we "straighten" the hypersurface to a
hyperplane locally.
To see how this change of coordinates can help us, we pose the follow-
i~g question: Suppose the hypersurface S is also described around p as
g-I(O). That is, there exists an open set U 3 P such that SnU = 9- 1 (0),
with g: U ---+ IR being a e 1 -function. Is 9 divisible by f at least locally
around p? That is, does there exist another function h defined in an
open set containing p on which we can write 9 = fh?
Let F := f 0 cp-l and G := 9 0 cp-l. Then it follows that
has a solution in C(I), for all 9 E C(I) with IIgll < c. This is easy.
Consider the operator A given by AI := the left side - the right side of
Equation (1.4.5). Then A'(a) = >.I - K, which is invertible. Complete
the details.
Example 1.4.4 Let AI(x):= fo1 k(x,y, I(y)) dy for I E C(I). Assume
that k, ~:, the partial derivative with respect to the third variable u,
are continuous as functions from I x I x C -+ C. Then show that
A'(f)(h) = f: ~: (x, y, I(x)) h(y) dy.
Before closing this section we shall prove the implicit function the-
orem. First recall that if I: U C E x F -+ G is differentiable at (x, y)
then we have
Remark 1.4.6 Note that we can compute the derivative of Dg(x). Let
G: U -+ X x Y be the map G(x) = (x,g(x)) and cp := foG. Then
cp(x) = f 0 G(x) = f(x,g(x)) = 0 so that Dcp(x) = 0 for all x E U. We
apply the chain rule:
Hence
0= D<p(x)h = Df(x,g(x))(h,Dg(x)h)
= Dd(x,g(x))h + D2I(x,g(x)) 0 Dg(x)h.
T, S '= {
p' wE
]Rn+k I with
there exists c: (-c, c) -+ S C ]Rn+k}
c(O) = p and c'(0) = w .
notation, let us write for zEn, Z = (x, y) E ]Rn x ]Rk. Let p = (a, b)
and w = (u, v).
We wish to use the implicit function theorem. We repeat part of
the argument ofits proof to fix the notation. Let C(x,y):= (x,g(x,y)).
Then DG(p) is invertible and there exist neighbourhoods U of pin ]Rn+k
and V of a in ]Rn and a Cl-function h: V -+]Rk such that
(i) h(a) = b,
(ii) {z En: g(x, y) = O} = {(x, h(x)) : x E V},
so that g( x, h( x)) = 0 for all x E V. In particular, the portion S n U of
the surface S is parameterized by V.
We consider the curve ,(t) := a + tu. Since V is open ,(t) E V,
for sufficiently small t. Let c(t) := C- 1 (!(t),0). Since C is a C 1 _
diffeomorphism, c is Cl. Also,
DC(p)(u,v) = (u,O),
as DC-1(a,0) = DC(a,b)-I. It is easily verified that
DC(p)(u, v)
1 0 0 0 0
0 1 0 0 0
0 0 1 0 0
= £9..!. !lIn.. ~ £9..!.
8Xl 8x n 8Yl 8Yk
Inxn
DG(p)(u,v} = ( (~)
aXj 1$i$k
1$j$n
o
A special case worth noting is when g takes values only in R In
that case, the condition on Dg(p} =I- 0 for all pES is equivalent to
requiring that V'g(p} =I- 0 for all pES. (Here V'h(x) :=
for any differentiable function h: U C ]Rn -+ R
(:::1"'"
Recall that the
txhJ
derivative Dh(x): ]Rn -+ ]R is a linear functional given by Dh(x}v =
(v, V'h(x}).} Note also that in this case, the vector space TpS is the
orthogonal complement of V' g(p) in ]Rn+l. That is why 'V' g is called the
normal to the "hypersurface" S.
More generally, if g is as in Theorem 1.4.7, we write g = (g1, . .. , gk).
We then note that v E TpS if and only if (v, V'gi(p)) = 0 for 1 ~ i ~ k.
We say a vector w E ]Rn+k is normal to the surface S at the point pES
if and only if (w, v) = 0 for v E TpS. Thus, the set of vectors normal to
S at p is a vector subspace of ]Rn+k and has {V' gi (p) : 1 ::; i ::; k} as a
basis.
Let M be a smooth hypersurface in ]Rn+l defined by g = O. This
means that M := {x E ]Rn+l : g(x) = O} and that g : ]Rn+l -+]R is a C1_
function such that Dg(x) =I- 0 for any x E M. If f is a smooth function
on M, we want to investigate conditions for local extremum. Let p E M
be a local extremum for f. If 'Y: (-e, e) -+ M is a smooth curve through
p, that is, 'Y(O) = p, then f 0'Y: ]R -+ ]R has a local extremum at O. Hence
by calculus, we have (10 'Y),(O) = O. That is,
Now Equations (1.4.6) and (1.4.7) imply that V'f(p) = >..'V'g(p) for some
real number >... >.. is called the Lagrange multiplier. The above can
very easily be generalized for submanifolds which are not necessarily
hypersurfaces and which are defined by the equation g = 0 where g
satisfies the conditions of Theorem 1.4.7. In classical language the above
problem is posed as follows: Find the extrema of a function f subject
to the constraints gi(X) = 0 for 1 ::; i ::; k. Here n = n + k - k is the
1.4. Inverse mapping theorem 41
Remark 1.4.9 How is this related to analytic geometry? You can think
of the set {x E lRn : g(x) = I} as a quadric surface. Thus what we have
done above is to find the principal axes of this quadric surface.
Let ri(x) := (2: j X~j)1/2 be the norm of the i-th row vector. Let di > 0
be given. We wish to find the maximum of f(x) := det(x) subject to
the constraints
(1.4.10)
j j
f{x,y) = x2 - y2 on S := {x 2 + y2 = 1}.
Draw pictures and understand the geometric meaning of your solution.
Proof Once again, we write I = (ft, ... , 1m). Assume without loss of
generality that (U;(P)hSi,ism is invertible. Let k = n - m. Consider
the map rp: U -t ]Rm x ]Rk given by
rp(X) = (f(x),Xm+l, ... ,xn )' if X=(Xl, ... ,Xm,Xm+l, ... ,xn ).
*)
Then
Drp(p,O) = ((U;(P)OhSi,iS m
hxk
is invertible. By IMT, there exists a neighbourhood Vp of p such that
rp: Vp -t rp(Vp) is a diffeomorphism. Now 10 rp-l: rp(Vp) -t]Rn is given
by (Yl, ... , Ym, Ym+l,···, Yn) t-+ (Yb ... , Ym).
o
46 1. Differential Calculus
J:
extends to a continuous linear map from the closure S(J, E) of S(J, E)
in ~(J, ) to E. If IE S(J, E) we denote I(f) by I(t)dt.
The most important class offunctions that lie in S(J, E) is Cp, E),
the space of continuous functions: If IE C(J, E), by uniform continuity
of Ion J, for e > 0 given, there exists a 8> 0 such that for all x, y E J
with Ix-YI <8 implies that II/(x)-/(Y) II < e. Now for any partition
a:=tO<tl<·· ·<tn:=b of J with 1tHl - ti 1< 8 we define a step function 9
by setting g(t) := I(ti) if ti ~ t < ti+1. Then by the triangle inequality
it follows that II 1- gil < 2e.
We point out an important property of this integral: Let F be
another Banach space and let T E BL(E, F). If I E S(J, E), then
To IE S(J, F) and hence for every I E S(J, E) we have To IE S(J, F).
We also have I(T(f)) = T(I(f)). A particular case of this is got when
F = lR. In this case we get back I(f) as soon as we know I(u(f)) for all
u E E* , provided that E is reflexive. (Ignore this and the next line if you
do not know what E* and reflexive spaces are.) For, in this situation,
we can define I(f) E E to be the vector given by the duality:
(I(f), u) := I(u 0 f) for all u E E*.
A special case of this is when E is a Hilbert space. Here we can describe
the integral of I as that vector vEE which satisfies:
(v, w) = I( (f(t), w)) for all wEE.
1.6. Fundamental theorem of calculus 47
Exercise 1.6.1 Verify all the claims made in the previous paragraph.
After these preliminaries are over, we are ready for the fundamental
theorem of calculus.
Recall that a function f: J -t E is differentiable iff
IIF(r+h)-F(r)-hf(r)11 = iT+h(f(t)-f(r))dtl
Higher derivatives
In this section we shall define derivatives of higher order. Let I : U c
E -t F be differentiable. We thus have a map f' : U -t L(E, F) given
by x I-t f'(x). While working with higher. derivatives, it is simpler to
use f'(x) in place of DI(x) so that we can denote the second derivative
as f" (x) etc.
We now ask whether f' is differentiable on U. Notice that the deriva-
tive of I' if it exists is a continuous linear map from E to L(E, F). That
is, (f')' E L(E, L(E, F». As is customary we shall denote by f" the
second derivative of I. We say I is C 2 on U if f" (x) exists for every
x E U and the map x I-t f" (x) is continuous. It is now clear how to
define higher derivatives I(k) of f. By recursion we set I(k) := (f(k-I»,.
Thus I(k) E L(E, ... ,L(E, F». We say I is C k on U if x I-t I(k) (x) is
continuous.
In dealing with these higher derivatives the crucial observation is the
fact that the space L(E, ... ,L(E, F)) is norm isomorphic (that is, linear
isometric) to the space Lk(E, F) of all continuous k-linear maps from E
to F. Recall that the norm on Lk(E, F) is defined as follows:
Examples 1.7.1
(1) If I is a linear map from E to F, then f" = O. (Was not this
answered by you earlier?)
(2) If I: EI X ••• x En -t F is a continuous map which is linear in
each variable, what is I(k) for kEN?
(3) Let U := GL(E) be the open set in BL(E) consisting of all
invertible bounded linear maps on E. We now consider the map I : U -t
U given by I(A) = A-I. Recall that f'(A) is given by
f"(x) = ( (PI) .
ax. ax; 1<. '<n
_ ,3_
We shall indicate a proof of this. First we notice that f" (x) E L2 (an, a)
is a biline,ar map so that the matrix representation of f"(x) with respect
to the usual basis {e.} has entries b'j given by b'j = f"(x)(e.,ej). Now
by definition I" = g' where 9 := I' and we have seen (or remarked in
Section 1.1) that
1
f(x + y) = f(x) + J'(x)y + ... + k! fk(x) yk + o(yk)
The first term on the right side of the last eq\lation is 0 because of our
wise choice. Thus we get
f(x+y)
k-1
_ '" 1 j j 1
1
k k
-~ -I)!f (x)y +(k-I)!io f (x+ty)y (I-t)
k-1 r
dt.
]=1 J 0
Notice that in this proof we once again used only the directional deriva-
tive of f at x in the direction of y. Since
We now specialize to the case when E = lRn and F = lR. With the
above notation, if we set g(t) := f(x + ty) then
g'(t) = LyjDjf(x+ty)
j
f(x + y) = L DO f(x)(1/a!)yO
1019-1
+k 11
D
(1 - t)k-l L
lol=k
DO f(x + ty)(1/a!)yO dt.
where
(i) If Q(h) > 0 for all h =j:. 0, then f has a local minimum at a.
(ii) If Q(h) < 0 for all h =j:. 0, then f has a local maximum at a.
(iii) If Q(h) is indefinite, in every neighborhood of a, we can find points
x, y such that f(x) < f(a) < f(y). Such a point a is said to be a saddle
point of f.
Exercise 1.7.15 There are two well-known criteria for the positive def-
initeness of a symmetric matrix A = (aij).
(i) All the eigen values of A are positive.
(ii) All the matrices (aijh~i,j9 for 1 ::; k ::; n have positive determi-
nants.
Prove the first criterion for all n and the second criterion for n = 2.
1.8. Smooth functions with compact support 55
For later use we need the existence of smooth functions with compact
support on any open subset of IRn.
The basic ingredients in the construction of smooth function with
compact support is the function f defined on IR as follows:
f(t) = {o
exp(-l/t)
for t
for t
~ 0;
> o.
For any point x =I- 0, f is smooth since it is the composition of two
smooth functions. (Prove that the composition of two smooth functions
is smooth!) To prove that f is smooth on IR we need only show that f
is smooth at O.
We first observe that for x > 0, we have
'"""' xk
e x = L...J k! > k!1 x k c
lor ~T
a 11 k E 1"1.
k
Since inpk(x- l ) the power of X-I is at most k+2 and since If(x)1 ~ n!x n ,
for every n we see that for all n > k + 2 the following holds:
Thus f(k+1) (0) exists and is O. An analogous argument yields the conti-
nuity of f(k+l)(x) = Pk+1(x)f(x) at x = O. Thus f is a smooth function
56 1. Differential Calculus
1
1
0.8
0.6 0.6
0.4 0.4
0.2 0.2
-2 -1 3 4
0.12
0.1 0.8
0.08 0.6
0.06
0.04 0.4
0.02 0.2
1 3 4
o 3
See Figure 1.8.4. Then h is smooth with h(x) :5 1 for x :5 a and h(x) = 0
if x 2: b. If we now define 1/J(x) := h(Li xn for x := (Xl>.'" x n ) E
an, then 1/J(x) = 1 for x E B(O,a) and 1/J(x) = 0 for IIxll 2: b. The
following consequence of the above construction is of vital importance
in differential geometry for showing the existence of global objects on
smooth manifolds.
Then 2c: > O. (Prove this.) Choose a finite number of points Xj E K such
that K c Uj=1B(Xj,c:/2). Choose 1/Jj E coo(an ) such that 1/Jj(x) = 1
tt
on B(xj,c:/2) and 1/Jj(x) = 0 for x B(xj,c:). Then the function
k
cp(x) := 1 - II (1 -1/Jj)
j=1
does the job.
o
Corollary 1.8.3 Let I E COO(U) with U c an open. Let Xo E U.
Then there exists an open set V such that Xo EVe U and a function
9 E Coo (an) such that
g(X) = {/(X)' x EV
0, tt
x u.
Assume that we have proved the result for all k ::; n. Now, since xn{s) E
B[xo,r] for s E [-c,c], we have
:5 ML n Itl n+! .
Since M Ln Ln Itl n+! :5 Me Ln (Lc)n is a convergent geometric series,
it follows from Weierstrass M-test that the series Ln[xn(t) -.Xn-l(t)]
and hence (xn) is uniformly convergent on [-c,c] to a continuous func-
tion x: [-c,c]-t B[xo,r]. Hence appealing to the result on interchange
of uniform limit and the Riemann integral for a sequence of continuous
functions, we deduce that x satisfies the integral equation (1.9.2).
To prove uniqueness, let y be another solution ofthe IVP (1.9.1) and
hence the integral equation (1.9.2) on [-c, c]. The continuous function
t t-t IIx(t) - yet) II assumes its maximum value, say, at to. We then have
:5 Lc sup II x( s) - y( s) II
~
Then there exists a unique continuous solution x(t,,x) oil [-e, e] x A for
some suitable e.
(it
Then
f(t) ~ Oexp 9(S)dS) for t E [a, b].
so that
h'(t)
h(t) ~ g(t).
1.9. Existence of solutions of ODE 61
Theorem 1.9.5 Let A C ]Rk and U C ]RN be open. Let X: U x A ---+ ]RN
be Lipschitz continuous on U, uniformly in the variable from A:
Fix a point Xo E U. Choose r > 0 such that B(xo, 2r) C U. Then there
exists an c: > 0 and a continuous function
such that ftF(t, x, oX) = X(F(t, x, oX)) and F(O, x, oX) = x for all x E
B(xo, r), t E [-c:,c:] and oX E A.
In fact, F is Lipschitz in x uniformly in the variables (t, oX).
Proof We shall only highlight the arguments as the details are as in the
proof of Proposition 1.9.1. We shall not write the parameter variables
explicitly in what follows.
For x E B(xo, r), consider the integral equation
:$ eL1t11lx _ yll,
62 1. Differential Calculus
Proof Let x E B(xo,r/4). Choose hE JRn such that Ilhll < r/4 so that
x + hE B(xo, r). Let F(t, x) := IX(t) and F(t, x + h) := Ix+h(t) be the
unique solutions of the IVP with initial values x and x + h respectively.
We recall that F is Lipschitz in x-variable uniformly in t:
M1 := max{IIDX(x) II : x E B[O,r)}.
We have
Let f(t) := II F(t, x + h) - F(t, x) - ",,(t, x)h II. The integrand in the first
integral is dominated by
The first equation shows that F is CH1 in the t-variable while the second
shows that F is C k in the x-variable.
o
Chapter 2
In a similar way starting with the functions Yj, we see that the map
'IjJ 0 'P- 1 : 'P(U n V) --+ 'IjJ(U n V)
is C k . These two maps, being inverses of each other, are C k diffeo-
morphisms on their respective domains. So if we want a meaningful,
consistent definition of Ck-functions on M then we are led to impose
compatibility conditions on all the possible overlaps of the charts (U, 'P)
and (V,'IjJ).
We are now ready to make a precise definition.
local coordinates: if p E Uo: and if we denote by x~(p) :=. Ui( tpo:(p)), that
is, the i-th coordinate of the point tpo: (p) E JRm(o:). An atlas is said to
be smooth if it is a Ck-atlas for all kEN. The manifold M is said to
be smooth if the atlas is smooth.
Examples of manifolds
Example 2.1.3 Let M be a discrete topological space. By convention
JRo is a single point. Then M is a O-dimensional C k manifold for all
kEN U 00. What are the Ck-functions on M? (Of course we are yet to
define Ck-functions!)
Example 2.1.4 Let M be any nonempty open set in some JRm. Then
taking U = M and tp to be identity (inclusion) map of U into JRm we
get a Ck-atlas for all k with 1 :S k :S 00. Thus M is an m-dimensional
manifold.
As a particular example let us take M := GL(n, JR), the set of all
invertible linear maps of JRn to itself. Then as we have seen in Sec-
tion 1.1, M is an open subset of M(n,JR) :::= JRn 2 • Thus GL(n,JR) is an
n 2 -dimensional manifold. We shall have lots of occasions to look into
this manifold in future.
(2.1.2)
Let cP be the stereographic projection from the north pole onto lRn.
Proceed as before to show that for all x E lRn (2.1.2) corresponds to
Exercise 2.1.8 Verify that these form a Ck-atlas. This will help you
understand the next example better!
Now a natural question arises. We have endowed sn with two C k_
atlases. We want to know whether they are in some sense the same. In
what sense? Namely in the sense that a function f on M is "C k with
respect to the atlas A" if and only if it is "C k with respect to the atlas
~". If you have followed our motivation for the compatibility condition
(2.1.1) in the definition of a manifold it should be clear why we make
the following definition.
is C k . Notice that due to the symmetry in a and /3, it follows that the
above map is in fact a Ck-diffeomorphism. Hence we see that a function
f on M is Ck with respect to A if and only if it is so with respect to ~.
2.1. Differential manifolds 69
a) Let F :lRn +1 -+1R be given by F(x) :=I:i xr-1. Then F- 1 (O) =sn.
S n .-
.-
{x E
M( n, lTll)
ll'\o.
Ix
1<
= tx, that is,
_ . .<
_ n.
~,J
Xij = Xji for all } .
F := (ft,· .. , In).
Let pES. Then the matrix ( ~(p) )l:::;i:::;n+k has rank n by hypothesis.
Without loss of generality we assume that the submatrix ( ~ (p) ) l:::;i,i:::;n
is invertible. Consider now the map cp : U -+ JRn+k given by
Then we have
.'
Yi (r) =
{f;(r) 1 ~i~ n;
lli(r) if i > n
for all rEV, Then with respect to these new coordinates we see that
S n V' = {r E V: y;(r) = qi, 1 SIS n}. Look, for instance, at the case
2.1. Differential manifolds 71
.1.
'I-' 0 r.p- 1 ()
t. = ",('
'I-'
.
sm t, sm2t )
= {tt - ')
_7r I
< i <._._7r:
0_.--
, t <. :L7r.
72 2. Manifolds and Lie Groups
~1
~~~~
-1
Thus 'IjJ 0 <p-l (-6",6") looks like (-7r, -7r + 8) U {O} U (7r - 8,7r) which is
not an open set in (-7r, 7r). Look at Figure 2.1.4".
We shall have many occasions to look at these manifolds. We will
refer to them as oo-manifolds or Figure 8 manifolds.
This example is a typical way of constructing manifolds on sets which,
to start with, do not come with any topology. The following theorem for-
malizes the above construction. This formulation will help us construct
lots of manifolds in the sequel.
i) uUo=M.
iii) For all Q, f3 E A with Uo n Uf3 :f=. 0, the map <Po 0 <p~l from the
open set <pf3(Uo n U(3) C ]Rm to the open set <Po(Uo n U(3) is a
Ok -diffeomorphism.
2.1. Differential manifolds 73
for any x = (Xl>.", X n +1) in the equivalence class [y]. Again notice
that the map 'Pi is well defined, that is, it is independent of the choice
of x E [y].
74 2. Manifolds and Lie Groups
It is now an easy exercise to check that the family {(Ui , r,oi)} satisfies
the assumptions of the theorem. Thus lP'" (JR) is a Ck-manifold of dimen-
sion n. If you know about quotient topology you may be interested in
verifying that the topology we gave to lP'" (JR) is the quotient topology
induced from JRn+1 \ {O} with respect to "'.
Exercise 2 .1.17 There is another way to realize the set lP'" (JR) as the
quotient of the sphere sn: since any line through the origin meets the
sphere exactly at two points which are antipodal to each other (that is,
x and -x), the equivalence", on ]Rn+l \ {O} induces a relation on sn:
x '" y if and only if x = ±y, for x, y E sn.
Exercise 2.1.19 What can you say about Gr(r, V) and Gr(n - r, V)
for 0 < r < n?
Exercise 2.1.21 (Flag manifolds) Let l' denote the set of all flags
of subspaces: VI C V2 C ... C Vn - I C V = Vn in V, with dimension
dim Vi = i. Use Theorem 2.1.14 to give a manifold structure to1'. Prove
that l' is compact.
2.2. Smooth maps and diffeomorphisms 75
function on the domain U01.. These functions Xi will be called the coordi-
nate functions of the local chart (UOI.' <P0l.). In future most often we shall
use (U, x) for (U, <p) and call it a local chart.
Now the above construction can be generalized: If F is any smooth
function on <POl.(UOI.) then its pull-back f := F 0 <POI. is smooth on UOI..
(Check this.) However notice that these functions are in general local
in the sense that they may not admit any extension to the whole space
M. For example consider the coordinate function of the stereographic
atlas of the sphere given by Xi(S) = sd(l - Sn+1), the first coordinate
function corresponding to the chart (U, <p). This is defined at all points
of sn except (0, ... ,0,1). Clearly it is not possible to extend this even
as a continuous function to the whole of sn.
Thus while it is relatively easy to define local objects on M simply
by transferring the corresponding objects on JRm via the local charts, it
may not be possible to get globally defined objects on M.
Now comes the trick which will enable uS to produce global objects
on M. If F is a smooth function on <p(U) with compact support then
the function f defined as above at first on U by setting f := F 0 <p-l
and then f := 0 on M \ U is smooth on whole of M \ U. (Recall that
the support of a function f on a topological space M is defined to be
the closure of the set {p EM: f(p) =I- O}.) It is trivial to see that f is
smooth on M. We denote by Coo(M) the set of smooth functions on
M. Before we list the properties of smooth functions and of COO (M), we
want you to familiarize yourself with the concept of smooth functions
by checking which of the following functions are smooth on M.
Exercise 2.2.4 Let M be the manifold JR with the atlas <p(t) := tan- 1 t.
Then <p maps JR bijectively onto the open set (-7r /2, 7r /2). Is the function
f defined by f(t) := sin(t) smooth?
Exercise 2.2.5 Can you think of some smooth functions on JPl'" (JR)?
If f, 9 E Coo(M) and a, b E JR then af + bg E Coo(M), that is,
Coo (M) is a vector space over R (Check this.) Further if we define f g
on M by setting fg(p) := f(p)g(p), then fg E Coo(M). Thus Coo(M) is
an algebra over R By the existence of sufficiently many smooth functions
with compact support it follows that Coo(M) is an infinite dimensional
vector space over JR provided M is infinite as a set.
2.2. Smooth maps and diffeomorphisms 77
Example 2.2.9 Let X, Y be the manifolds (i) and (ii) in Figure 2.1.3.
Consider the oo-manifold in JR2 (see Example 2.1.13 in Section 2.1). Let
f be the identity map of the underlying sets of the manifolds: That is,
if (x, y) E X C ]R2 then f((x, y)) := (x, y) EYe JR2. Then f is not
smooth. This is yet another way of rephrasing a fact which we have seen
earlier. (What is this fact?)
Example 2.2.10 Let f : JRn+l \ {O} --t JP"(JR) be the quotient map,
namely, x --t [xl. Then f is a smooth map. Similarly the quotient map
from sn to JP" (JR) is smooth.
1. lRn with addition as the group operation and with the usual dif-
ferential structure is a Lie group.
2. lR· := lR \ {O} with multiplication as the group operation and the
smooth structure (as an open subset of lR) is a Lie group.
3. Sl considered as the subset {z E C : Izl = I} of C with the usual
multiplication in C as the group operation and with the smooth
structure introduced in Chapter 1 is a Lie group.
4. A most important example is G L( n, lR). This is a Lie group with
the smooth structure and with matrix multiplication as its group
operation. You should check the details of this example instead of
taking them for granted, as this will make you realize the signifi-
cance of some of our earlier exercises and examples.
5. The manifolds O(n,lR), U(n), SU(n) and SL(n,lR) are defined as
follows:
These are all Lie groups with smooth structures we have already
introduced and with matrix multiplication. Even though we are
fully equipped to substantiate our claim at this stage, we will post-
pone a proof of this. (See Remark 2.6.17, p.106.) Meanwhile the
reader should try to prove this. Even if one does not succeed it
will make one understand the various concepts introduced so far
and also enhance one's appreciation of things to come.
Exercise 2.2.21 Let Ra, Lb, Ag denote the maps of a Lie group G
into itself given by Rax := xa, Lax := ax, Agx := gxg- 1 respectively.
2.3. Tangent spaces to a manifold 81
Dvf(x) = 'L..
" Vi 8Ui
8f (x) = (grad f(x), v).
-8 I
8 (f) :=
8f (x).
-8
Ui x Ui
82 2. Manifolds and Lie Groups
for all j, so that the vectors v and w have the same components. Hence
v=w.
The set {Dx,v : v E ]Rn} has a natural vector space structure over JR,
as can be seen from the following:
To prove 8 = Dx,1) for some v E IRn, we first observe that (2) and (3)
imply
8(1) = 8(1· 1) = 18(1) + 8(1)1 = 28(1)
so that 8(c) = 0 for any constant function c. Let us now look at the
case n = 1. If the above representation is true then we should have
8(1) = a{d~)ix(l) = af'{x) for some constant a. (We assume without
loss of generality that x = 0.) This suggests that we use the Fundamental
Theorem of Calculus:
g(O) = 10r d
J' (0) du = J' (0) = du iu=o(l)·
Hence we get
F(x) = F(O) +L 1
Xi
iOU,
8F
1
a-:
(tX) dt
8F 8
gi(O) = 8Ui (0) = 8Ui 10(F). (2.3.3)
Remark 2.3.2 This remark is about a technical point and hence may
be omitted on first reading.
The reader may be puzzled as to why we always talked of smooth
functions and 8 as operators on smooth functions around p. After all we
wanted to talk of tangent vectors. For this purpose it may very well be
enough to have restricted to C 1-functions. If we had done so, in the first
order Taylor (-like) expansion f(x) = f(O)+xg(x) we obtained above, 9
will in general be continuous, not necessarily C 1 We cannot apply 8 to
9 as the domain of 8 is C 1-functions around O. Thus we could not have
concluded that 8 is of the form Do,v for some vERn.
~O
VXi P
I (f):= nO (f 0 cp-l )(0)
VUi
= nO
VUi 0
I (f 0 cp-l).
Then it is easy to check that a~i fp E TpM. They are nontrivial since if
we take f = Xi, then
J:l o I (f) = 0
J:l(Ui 0 cp-l)(O)
VXi P VUi
o
= J:l (Ui)(O) = 1.
VUi
86 2. M ani/olds and Lie Groups
d
,",/'(0)(/) := dt (/0,",/)(0)
d
= dt It=o(/ 0,",/).
We claim that ,",/'(0) is a tangent vector at p.
Xl - curves
Proof Most of our work is already done. We simply transfer the prob-
lem to one on the open set ep(U) of IRm. We shall assume that the chart
is centered at p.
For f E COO (p), we define F := foep-I. Then FE COO(O) in IRm. We
then have, as above, (2.3.2 and 2.3.3 on page 84) the first order Taylor
expression for F in IRm: F(x) = F(O)+ L:i XiGi(X) where G i are smooth
around 0 and Gi(O) = {J~i F(O). We now pull this expression back to U
via ep to get
f(x) = f(p) + LXigi(X),
8 8 8
gi(p) = Gi 0 ep(p) = Gi(O) = 8Ui 10(F) = 8Ui lo(f 0 ep-I) = 8Xi Ip(f),
by the very definition of {J~i Ip'
Now to deal with the case when (U, ep) is not centered at p is easy.
We define (U, '¢) by '¢(q) = ep(q) - ep(p). Thus, if the local coordinates
for '¢ are denoted by Yi, we see Yi(q) = Xi(q) - Xi(p). Hence we have
Ip Ip
{J~i = {J~i and also V(Yi) = V(Xi). These observations prove that the
expression for v in the statement of the theorem holds for all charts.
o
88 2. Manifolds and Lie Groups
Remarks 2.3.5
Ip
1) As a~i are linearly independent (since, a~i Ip(Xj) = c5ij ), we infer
dim TpM = dim M.
2) We wish to point out the importance of the representation (2.3.4)
in the Theorem 2.3.4, whose significance is most often overlooked.
Ip
It not only tells us that a~i span TpM but also how to write the
expression, that is, what the coefficients are if we write the vector
v in terms of the basis a~i Ip.
To wit, the i-th coefficient is the real
number V(Xi)!
We shall have many occasions to use this observation. Let us show
one such use now.
Example 2.3.6 Suppose that p also lies in another chart (V, 1/J) with
Ip
coordinates Yi· Since a~i E TpM we want to write a~i as a linear Ip
combination of the basic vectors a~i Ip. We denote a~i Ip by w. Then by
the basis theorem, we have
a a a
w="'w(Xj)-1 ="'-(Xj)·-I·
~
j
ax'J p ~ ay'
j'
ax'J p
v = '~
" via~x· I P
i '
vJ =
. L'v'-a-
aaxj
(p) = v' a-'· (2.3.6)
Xi x'
2.3. Tangent spaces to a manifold 89
Exercise 2.3.7 Show that for any tangent vector v E TpM, there exists
at least one smooth curve '1 passing through p such that v = '1'(0).
_--...L..----_lR
Figure 2.4.1 Derivative of smooth maps-functorial definition
Examples 2.4.2
1. Let us take M = (-g, g) with g > 0, an open interval in lR en-
dowed with the usual manifold structure. Let N be any smooth
manifold and '"'( : (-g, g) --t N be smooth. For t E (-g, g), we have
'"'(.(t)U!.lt) E T...,(t)N. Can you guess what it should be? (We were
very deliberate in our choice of notation!) Did you say "It is '"'(' (t)"?
If so, well and good. If not, now that you know the answer, why
don't you prove it?
2. Let us now look at the other analogous situation. Let 1 : M --t lR
be smooth. Then DI(p)(v) E Tq (lR) , with the obvious notation.
Now, on lR, we have a global chart, namely, u : t t-+ t so that
I
TqlR = lR d~ q' Hence
DI(p)(v) = a(v)-d
u
d
Iq
for some real number a depending on v. Can you guess what this
constant should be?
3. This example is perhaps best understood after Section 2.5. The
setting is as in Example 2.1.11. Let 1 : lRn +1 --t lR be smooth. Let
S := 1-1(q). Let pES. What is TpS? It can be considered as
a subset of TplRn as follows: The natural inclusion L : S --t lRn +1
is smooth. (Why?) Also we have DL : TpS --t TqlRn+l is one-
Ip
one. For, 8~j for j ~ 2 span TpS and DL(p)(8~j Ip)
= a~j Ip'
(The reader should try to verify this using the first definition of
derivative of smooth maps.) Thus TpS is a subspace of TplRn+1 .
Can we identify this subspace using our original function I?
Suppose v E TpS. Let '"'( be a curve that passes through p and is
such that '"'('(0) = v. (Such curves exist by the important exercise
(Exercise 2.3.7.)) Since '"'((t) E S for all t E (-g,g), we have
Ib(t)) = q. Thus the smooth function 10'"'( is constant on the
connected set (-g, g) and hence we have
DI(P)(v) = (f 0 '"'()'(O) = O.
Therefore we see that Tp(S) is contained in the kernel of the lin-
ear map D 1(p). Now D 1(p): TplRn+1 --t TqlR is a nonzero linear
94 2. Manifolds and Lie Groups
Let us start with the sphere 5 n described as a level set of the function
f : ]Rn+1 ---+ JR where f(XI, ... , I n+1) = Li x; - 1. We have for p E 5,
Df(p)(v)
of (p). Vi = (gradf(p),v)jRn+l
= L ou .
i '
Since gradf(p)=2(PI, ... ,Pn+l) for P:=(PI, ... ,Pn+1), we see that the
kernel of DF(p) on Tp(JRn+1) ::::JRn+1 via the canonical global coordinates
Ui and hence via the map LVi a~i f-t (VI, ... , V n+ 1) is
and
'1/;-1: 'I/;(V):= (-rr,rr) x lR -+ S with 'I/;-1(U,v) = (cosu,sinu,v).
Then we have Df(x)(w) = (2(Xb X2, 0), w)llP' Hence the kernel of D f(x)
is "the real span of the vectors (-Xl, X2, 0) and (0,0,1)", that is,
a ) +lR-
a +x1-a a
kernel(Df(x))=lR ( -x2-a
Ul X2 aUa .
Geometrically if you take the circle parallel to the Ul u2-plane through
the point x = (Xl,X2,0), then the tangent to the circle at x and the line
perpendicular to this plane span the tangent space. The first of these is
nothing but (-X2,X1,0), etc. See Figure 2.4.3.
We now give the answers to Examples 2.4.2 (1) and (2), in case you
have not got them. In the case of (1), let f E coo(,(t)). Then
d d
D'Y(t)(du It)(f) := du lu=t(f 0 'Y)(u) = (f 0 'Y)'(t) = 'Y'(t)(f).
In (2), the only number one could associate with the data is v(f). So, let
us prove that Df(p)(v) = v(f)d'!.l q. Using the Jacobian representation
or the basis theorem we have Df(p)(v) = v(u 0 f) d'!.lq' But u : lR -+ lR
is the identity map u(t) = t and hence the result follows.
Thanks to Example 2.4.2 (2), we can think of Df(p) as the linear
fun~tional df(p) defined on TpM by df(p)(v) = v(f) for v E TpM. We
den.)te by T; M the real dual of the vector space TpM for any p EM.
Thus df (p) E T; M. Let us now take f = Xi, a local coordinate function
of a chart (U, <p). Then df : U -+ UpEUT; M is such that dXi(p) form
a basis "for T; M. This basis is dual to the basis 8~i Ip of TpM. (Verify
this.) The reader should also find the transformation rule which the
components of WET; M obey analogous to the one which those of
v E TpM do. If you look up any classical book on tensor analysis you
will find that the rule for WET; M is the same as that of a covariant
tensor of rank 1.
If f E COO(M), then we see that the map df : p t-+ df(p) is from
M to UpT; M. df is called a smooth differential I-form. If (U, <p) is
a local chart with local coordinates Xi, then on U we can write df as
df(p) = Ei 9i (p}dXi (p) for some 9i(p) E lR and for all p E U. Omitting
p we can write this as df = E 9idxi, for some smooth (?!) functions 9i
on U. Can you guess what the 9i'S are? We believe that you guessed it
right and ask you to prove your guess. This establishes rigorously what
you learnt in advanced calculus about total differentials.
96 2. Manifolds and Lie Groups
We shall presently prove results which will show that locally any
immersion (respectively, submersion) is a canonical immersion (respec-
tively, submersion). More precisely we shall prove the following
Theorem 2.5.3
ExaInple 2.5.5 The quotient map c.p : sn -+ lP' (from lR.n +1\ 0 -+ IP') is
an immersion as well as a submersion.
Example 2.5.6 The map 'P or 1/J from (0, 21T) or from (-1T, 1T) to ]R2
considered in Example 2.1.13 is a one-one immersion.
Exercise 2.5.9 Work out the proof for F : ]R -+ ]R2 given by t t-t
(cost,sint) at the point p = O. Specify the maximum possible U and V
in the notation of Theorem 2.5.3.
We now prove the second part of Theorem 2.5.3. Here the proof is a
lot easier. Using standard notation, we may assume that the Jacobian
matrix of f at p is such that its first n x n-minor is invertible, that is,
the matrix ( ~(p) )l$i,j:5n is invertible. Then we consider 1] on V C N
given by
1](Z) := (h(z), ... ,1n(z)) for z E V.
From our hypothesis it follows that 1] is a local diffeomorphism at q.
Hence the required new charts are (U, '1') and (W,1]) with W C V on
which 1] is a diffeomorphism.
o
Exercise 2.5.11 Try to write a proof of the second part along the lines
similar to the conceptual proof indicated for immersions.
Did you ever wonder why one never gives an example of a local diffeo-
morphism which is not global in 1R? The following exercise tells you
why.
2.6 Submanifolds
If M is a topological space and S is a subspace then there is a natural
way of making S into a topological space. However if M is a smooth
manifold and S is a subset of M there is no natuml way in which we
can make S into a submanifold. A correct way to define a submanifold
is as follows:
Examples of submanifolds
Example 2.6.2 (JRm, £) is a submanifold of JRn for m S; n where £ is
the natural inclusion given by
Exercise 2.6.6 What are all the open submanifolds of a given manifold
M?
However notice that this does not say that L(U) = SnV. For instance, in
Example 2.6.3 above, (what else?) one can check that such a possibility
arises.
The neighbourhoods U and V as above will be referred to as adapted
neighbourhoods.
Before we proceed we shall introduce some definitions.
Using the formula for the tangent space of a level set manifold we get
Since this is true for all v E TpS it follows that there exists A E JR. such
that V'g(p) = AV'f(p).
o
Remark 2.6.14 This proof is decidedly simpler than the one in Section
1.4.1 thanks to the machinery!
Let us fix one ofthem, say, O(n). Then the product manifold O(n) x
O(n) is a regular submanifold of GL(n, JR.) x GL(n, JR.) and the map
sends O(n) x O(n) to O(n). From what we said earlier it is clear that the
restriction of ex to O(n) x O(n) is a smooth map into GL(n, JR.). Also,
ex is smooth from O(n) x O(n) to O(n) as follows from Theorem 2.6.16.
In other words, O(n) is a Lie group, called the orthogonal group. The
reader should observe that this was mentioned earlier and more signifi-
cantly that this proof could have been given there. In the same way, the
other submanifolds such as SL(n,JR), U(n) are all Lie groups.
Let us see how these two vector fields give directions at various points
of M. For brevity, we set cp-l(r,8) = re i8 . Now let p = re i9 EM' . We
r re
first express {}8 1 i8 in terms of the usual coordinates:
0 1
-08 ·n O nO
re = -rsmu -ax + rcosu-
i8 oy
at reiO. That is, tOlre
i8 is the tangent vector to the circle minus a point
second case the path is a circle. In either case it is the coordinate curve.
What was the path corresponding to Xi of Example 2.7.2 above?
(1) XI E Coo(M),
The properties (1), (2) and (3) above characterize vector fields; that is,
if D is a map from Coo(M) to Coo(M) satisfying (1), (2) and (3) then
there exists a unique vector field X on M such that D = X. While the
proof of this is not difficult, it is somewhat technical. One should use
the fact that such a D induces Du on any open set U c M and the
110 2. Manifolds and Lie Groups
Notice that we could have used the equivalent definition of a vector field
as a derivation of Coo (M) to define the bracket operation:
Example 2.7.7 Let (U, cp) be a local chart with Xi as local coordinates.
If we take Xi = {)~i we have [Xi, X j ] = 0 on U. (Check this.) As a
specific .example on JR2 \ {(x,O) E JR2: x ~ O} take Xl = tr
and
X2 = to' Verify that [XI,X2] = o.
=X~(Y~f)
ay ay -y~(x~f)1
ay ay p
a
= x-a (f)1 = XpJ.
y p
Thus [X, Yj = X. One can work this out in a more compact form as
follows, ignoring "second order terms":
[X, Yj = x~
ay
(y~)
ay
- y~ (x~)
ay ay
a a a ,a
= x ay (y) ay - y ay (x) ay
=X-.
a
ay
We shall often use this short form of the computation of the bracket.
Since COO(M) acts on X(M) we want to find out how the bracket
behaves under this action. It is easy to see as above that
[fX,gYj = fX(gY) - gY(fX)
f((Xg)Y + gXY) - g((Yf)X + fYX)
= fg[X, Yj + f(Xg)Y - g(Yf)X. (2.7.1)
The reader may wish to work out the above by using the definition on
[f X, gYj(h). The important thing to notice in this is that the bracket
operation is not COO(M)-linear in any of its arguments. In particular,
/:';jxY =I- f/:.;xY.
We can use the above equation (2.7.1) to derive the coordinate ex-
pression for the bracket operation. Let X = ~ fi a~i and Y = ~ 9j a~j
on a coordinate chart (U, x). Then
(2.7.2)
The most important thing in this equation to observe is, as earlier, that
the derivatives of the coefficients of each of the vector fields are involved.
The bracket operation [ , j has the following properties: For all
X, Y, Z E X(M),
112 2. Manifolds and Lie Groups
Proof The proof is carried out in three steps. We start with the fol-
lowing
2.7. Vector fields 113
Claim (1): If for X, Y E X(M) and for all f E Coo(M) we have X(f) =
Y(f), then X = Y.
It is enough to prove that if X(f) = 0 for all f E Coo(M), then
X = O. This follows if we prove that Xp = 0 for all p EM. Let (U,x)
be a chart around p. Then there exists agE Coo(M) such that 9 is 1
in a neighborhood of pin U and is 0 outside U. If we take f = gXi then
f is smooth on M and by hypothesis X(f) = O. This means that the
coordinates of Xp with respect to the local coordinates Xi are O. Hence
claim (1) follows.
Example 2.7.14 What are the left (= right) invariant vector fields on
an? We use Equation (2.7.7): X(f 0 La) = X(f) 0 La to solve this. Let
X = 2:>:ti 8~i ' with 0i E Coo (an). Let f E Coo (an), and a E an. Then
(2.7.8)
and
X (f) 0 La = ('L 0i 8~i f) 0 La· (2.7.9)
Therefore
(2.7.10)
and
(2.7.12)
Exercise 2.7.15 Carry out a similar argument for right invariant vector
fields.
Notice that the formula dim 9 = dim G implies that the only left-
invariant vector fields on lRn are the constant vector fields.
Let Xi E X(M) be cp-related to Yi E X(N), for i = 1,2. Are the
vector fields [Xl, X 2 ] and [Yl , Y2 ] cp-related? In view of Equation (2.7.5)
we need only show
Now
Here we used Equation (2.7.5) in the second and the third equations.
Similarly we have
Hence
[Xl! X 2 ](g 0 cp) = [Y1 , Y2 ](g 0 cp) .
Thus we have proved the following
= LXe(Xik)Ye(Xkj) - LYe(Xik)Xe(Xkj)
k k
= [X(e), Y(e»)ij.
when the latter is equipped with the bracket operation inherited from
M(n, 1R). Recall that we have computed the tangent space at e = I to
the Lie subgroups H = SL(n,IR), O(n,IR), U(n), SU(n) and that the
corresponding vector subspaces in M( n, 1R) are closed under the bracket
operation.
We now want to show how our study of cp-related vector fields help
us associate a derived morphism of the Lie algebras to a morphism of
Lie groups.
First some definitions.
For, we have
If X is compact, then f is Lipschitz, that is, there exists L > 0 such that
o
We let S(x,r):= {y E IRn : lIy - xII = r} and sn-I = S(O, 1).
Lemma 2.7.23 Let v: S(O, I} ~ IRn be a CI-unit tangent vector field.
Then the function ft: u t-+ u + tv(u) maps S(O, 1) onto S(O, v'f".+t2) if
It I is sufficiently small.
Proof Note that ft(S(O, 1)) C S(O, VI + t 2 ). Observe that ft(ru) =
r ft (u). We shall assume that n ~ 2. For It I sufficiently small, D ft (x) is
non-singular on K and hence by the inverse mapping theorem, ft maps
open subsets in the interior of K to open sets. Hence ft(K) is a relatively
open subset of S(O, Jf+t2). Since ft(S(O,I)) is compact and hence
closed, by connectedness of the spheres, it follows that ft(S(O,I)) =
S(O, v'f".+t2).
o
Proposition 2.7.24 There does not exist a CI-unit tangent vector field
on an even dimensional sphere.
122 2. Manifolds and Lie Groups
°
IIv(u) - p(u) II < m/2 for all u E sn-l. Let w(u) := p(u) - (P(u),u)u
for u E sn-l. Then (w(u), u) = and hence u f-t w(u) is a tangent
vector field on sn-l. We claim that w(u) ::j:. 0 for u E sn-l. First, we
observe that 1(P(u),u)1 < m/2. For, (P(u),u) = (P(u) - v(u),u) so that
IIg(x)-g(Y)II~Lllx-yll, forx,yEB.
If 0 ~ t < I/L, then It is one-one. For, if It(x) = It(y), then we have
x + tg(x) = y + tg(y) and hence
II x - y II = t II g( x) - g(y) II ~ tL II x - y II < II x - y II ,
a contradiction. Hence It is one-one if 0 ~ t < 1/ L.
We claim that It maps B bijectively onto itself if t is sufficiently
small. We prove this by a typical connectedness argument. Observe
that DIt(x) = 1+ tDg(x). Since Dg(x) is continuous, it follows that
D It is non-singular provided 0 ~ t < to for sufficiently small to. By
inverse function theorem It maps B onto an open subset Ut of B for
o ~ t < to. Let y E B\ Ut . Join y to a point x E Ut . Let z be a point on
the line segment joining x and y and such that z lies on the boundary
of Ut . Since It(B) is compact, there exists u E B such that z = It(u).
As z f/. Ut , u cannot be in B. Hence we infer that lIull = 1. But then
It(u) = u. We deduce that z = u E S. We therefore see that Ut = B
for 0 ~ t < to.
We now consider the integral
This means that the vectors ~ lie in the tangent space of S so that
detDfl(x) = 0, x E Bn.
Thus 1(1) = o.
o
124 2. Manilolds and Lie Groups
If I(x) #- x for x E B,
II I (x) - x II > c for
°
then by compactness there is an c > such that
all x E B. Now if we choose a polynomial function
2.8. Flows and exponential map 125
9 such that II f - gil < e, then, h defined as above cannot have a fixed
point. For, if h(xo) = Xo then we have
e > II f(xo) - g(xo) II
II (f(xo) - xo) + (xo - g(xo)) II
II f{xo) - Xo II > e,
a contradiction.
o
If you are still interested in the continuous version of the no retraction
theorem, you can derive it from Thm. 2.7.28!
For another simple (but not so elementary) proof of Brouwer's the-
orem see Section 4.3.
(2 .8.2)
126 2. Manifolds and Lie Groups
Equations (2.8.1) and (2.8.2) yield the following system of ordinary dif-
ferential equations (ODE):
d
dt(Xi 0"Y) = X(Xi) for all t E (-c,c),I:$ i:$ n. (2.8.3)
with the initial value (i.v., for short) Xi(p) = O. Thus finding an integral
curve through p is same as solving the above system of ODE.
Using local coordinates, we transfer the problem to one on an open
set containing the origin in lRn. Thus we wish to find an x : (-c, c) -+ U
satisfying the following vector-valued ordinary differential equations:
d
dt (x) = X{x) with i.v. condition x(O) = O.
Here the curve x and the vector field X are assumed to be smooth. The
basic theorem of ODE (see Section 1.9) assures us that such a system
has a unique solution. Let us look at some examples.
Example 2.8.2 Let U denote an open set in lRR. Consider the vector
field X = 8~;' Given a point a = (at, . .. ,an), the integral curve of X
through a is
dx
- = x with i.v. condition x(O) = a;
dt
~~ = y with i.v. condition y(O) = b.
The integral curve is given by "Y(t) = (x(t),y(t)) = et(a,b). That is, "Y
is the open line joining the origin and the point extending to infinity:
{ex(a, b) : ex > O}.
dx dy
-=-y and -=X.
dt . dt
2.8. Flows and exponential map 127
d2 x dy
= = x with i.v. x(O) = a
dt 2 dt
d2 y dx
= = -y with i.v. y(O) = b.
dt 2 dt
The solution is x(t)=acos(t)-bsin(t) and y(t)=asin(t)+bcos(t). Thus
if (a, b) ::f:. (0, 0), then the integral curve is non-constant and is a circle
with radius ...;a 2 + b2 and center at the origin.
Example 2.8.6 Let M = 1R2 \ {(x,O) : x ~ O}, i.e. 1R2 with the non-
negative x-axis removed, and X = :r' Take p = (a, b) := roe i90 E M.
Then the integral curve is 'Y(t) = (r + t, ( 0 ), Is this related to any of the
previous examples?
We now state the basic theorem of ODE (Section 1.9) since we need
to fix some notation.
F : (-e, e) x V -t U
I-y,,(s) = Ip - s.
Proof We set a(u) = "'((u - s). Then a'(O) = "'('( -s) = X-y(-s)' Hence
a is an integral curve of X through a(O) = "'(( -s), whence the observa-
tion.
o
Definition 2.8.12 A vector field X is said to be complete if the domains
of definition of all its maximal integral curves through points of Mare
lR. Otherwise X is said to be incomplete.
Example 2.8.13 The vector fields of Examples 2.8.3, 2.8.4 and 2.8.8
are complete while those of 2.8.5, 2.8.6 and 2.8.7 are not. What can you
say about the vector field in Example 2.8.2?
2.8. Flows and exponential map 129
n. dk k
f (O,p) = dt k (f 0 <pd(p) = Dx f(p)·
d n +!l
= dtn+! (O,p).
so that the smooth map r.pt has r.p-t as its inverse. Equation (2.8.4) is
an easy consequence of Observation 2.8.11 made above. This family r.pt
is referred to as the one parameter grou.p of diffeomorphisms associated
to X. In the general case, when X is not complete, the basic theorem
still yields what is known as the local one parameter family of local
diffeomorphisms.
This local family is very closely related to the vector field X. For, if
we are given a collection as in Definition 2.8.15 without any reference to
a vector field and if we set Xp(/) = It
10(/0 r.pt) then the map p -t Xp
is a smooth vector field with the given collection as the associated local
family. (Exercise.)
On~ thing should be noticed by the reader. There is no question of
uniqueness about this local family. For, if the data are as above we can
take 80: := ~c", and the resulting family defined by replacing 8's by c's
will still serve as a local family for X. Yet these are of use as we shall
see below.
terms of that of X and the map 1/;? We shall give the answer to this,
since we need it below.
The family {(1/;(Uo ),1/; 0 4>f 01/;-1, eo)} is a local family of Y.
Proof The idea is simple. Let e := inf eo. We define, for any p E M,
4>t(p):= cpf(p) ifp E Uo and t E (-e,e), and extend it to the entire real
line by setting
CPt := 4>1. 0 ... 04>1. = (4)1.)''
~ n
n times
Example 2.8.22 Let A = (aij) E M(n, lR). Then we have a vector field,
which we denote by A on lRn as follows: Li,j aij Xj a~i' The integral
curves of this vector field are given by F(t, x) = etAx. (Verify this.)
Here x is treated as a column vector.
Example 2.8.23 Let G be a Lie group, with 9 as the Lie algebra. Then
any left invariant vector field X Egis complete. So we can define Exp
as above. In this case we define
exp(X) := Expx(e) = cpl(e).
More generally, we define exp(tX) := cpt(e). Thus we have a map,
exp : 9 -+ G, again called the exponential map of G.
For example, if we take G = GL(n, lR), so that 9 = M(n, lR), the
exponential map defined above is nothing other than our favourite one!
That is, we have
Xk
LT!'
00
exp(X) = eX :=
k=O
This follows from the fact that the curve t t--; etX from lR to GL(n, lR) is
the maximal integral curve for the vector field X through e, as we have
seen earlier.
Do you wonder how the map Exp looks like at other points of the
group GL( n, lR)? Well, then solve the following
Exercise 2.8.25 Can you guess what the exp map for R is? Prove or
disprove your guess by finding the map from the first principles. Beware,
many beginners make mistakes here! What is the exp map of R+?
Proof We use Exercise 2.8.25 dealing with X, D¢>(X) and the corre-
sponding local families. For g E G, we have, by left invariance of X
Now we have
Proof If you grant that exp is smooth then Lemma 2.8.28 is more or
less obvious. For, we must show that the derivative of' exp at 0 is a
bijection. Now for any X E g the curve t -+ tX has X as its tangent at
o. The image of this curve under the exp map is the curve t t-+ exp(tX),
whose tangent vector at e is Xe c:::: X. Hence the result follows.
So the non-trivial point is to prove the smoothness of expo We resort
to a trick. First of all, what is the smooth structure on g? Since g
is a finite dimensional vector space over JR., canonically identified with
TeG, we may use a local chart around e to identify Te(G) c:::: g with JR.m.
(How?) Thus g is endowed with a smooth structure which is easily seen
to be independent of the local chart chosen. In particular, g becomes a
Lie group with addition as the group operation. If we can somehow show
that the flow F of X is a smooth function of t, x, and also of Xc:::: Xe
then the smoothness of exp follows. To this end, we consider the Lie
group g x G. On this Lie group, we consider the left invariant vector
field Z by setting Z(X, g) = (0, Xg). Then Z is a smooth vector field
on g x G. Also, its integral curve through (g, X) is t t-+ (X, 9 exp tX).
The corresponding one parameter group is <p(Z.t) (X, g) = (X,g exp tX).
Now exp = 7r 0 <P(Z,l) , where 7r is the projection onto the second factor.
Since <p(z.t) is the flow of a smooth vector field it is smooth and hence
exp is also smooth.
o
cp
I I
G - - - -....
>H
g----~> .fj
Proof We need only observe that the curves t t-+ ¢(exp(tX)) and t t-+
exp t(d¢(X)) are both integral curves of d¢(X) through e and hence
they are the same.
o
The importance of Lemma 2.8.29 lies in the fact that it yields the
geometric way of computing the derived homomorphism d¢ which we
2.8. Flows and exponential map 135
One-parameter subgroups
We now wish to establish the connection between left-invariant vector
fields and one-parameter subgroups. Let a: IR -+ G be a one-parameter
subgroup of a Lie group G. Then we have a one-parameter group of
diffeomorphisms, namely, {La. : t E 1R}. Let Y be the vector field
corresponding to this family. It is more or less clear that Y must be
right or left-invariant. We look into this:
d
Y(f 0 La)(g) = dt It=o (f(atga))
= Yga(f) = Y(f) 0 Ra(g)
[Xi, X j ] = L Cijk X k .
k=l
Let us look for the simplest case of an involutive system. Given p EM,
suppose that there exists open neighborhood U of p and Xl, ... , Xd E
X(U) such that Xi E 1> for all i and [Xi,X j ] = 0 for all i,j. (We assume
Xl, .. " Xd span 1> on U.) A prototype of such an involutive system is
Xi = 8~i' the coordinate vector fields on U. Can we say that these are
the only examples (locally)?
Let us take up a test case. If 1> is I-dimensional and 1> is spanned by
X on an open set U with p E U, we should then have a coordinate chart
(U, cp) around p with local coordinates Xi such that X = 8~1 on U. But
observe that a~l has the xl-coordinate curve t I-t cp-l(t, 0, ... ,0) as its
integral curve. Hence we should have the integral curve of X as the first
coordinate curve. This suggests us that if (V, t/J) with local coordinates
Yi is the original coordinate chart, we should replace the Yl-coordinate
curve by the integral curve of X. This idea is carried out to prove
°
following Theorem 2.8.9. F is defined for a sufficiently small neighbor-
hood of in IRm. Then
=p
around a neighborhood of 0 E ]Rm . Then F(O)
DF(O) is non-singular as it carries :610
to X p, :t 10 and F is smooth.
to Yp and 8~j to
8~; Ip for j F is a diffeomorphism of a neighborhood of 0 on
~ 3. Hence
to a neighborhood U of p in M. We set (U, F-l , x) as the coordinate
chart.
Are we through? No. For we can imitate the argument in Theo-
rem 2.9.2 to conclude X = ~88
Xl
on U and Y = -88
X2
on the set
{Z E U : Xl(Z) = O}
but not on the whole of U! (Convince yourself of this; the trouble is the
place of tp! in the expression for F and hence we cannot cross 'over!)
Since 8~i are coordinate vector fields, we have
That is, X(xd = 8il on U and Y = E Y(Xi) 8~i. So, what we know is
that the function Y(xd = 8i2 on {z E U : Xl(Z) = O} ='8. Through
every point of the slice 8 as we move along the Xl curves, we describe
the open set U. If we show that the function Y(Xi) remains constant as
it moves along the Xl curves, we are done. For, the constant has to be
the value of Y(Xi) on 8, namely, 8i2 .
Since Xl curves are the integral curves of X, to show that the function
Y(Xi) is constant along Xl curves is equivalent to proving X(Y(Xi)) = o.
But X(Y(Xi)) = Y(X(Xi)) as [X, Y] = o. By what we said above
X(Xi) = 8il , a constant and hence Y(X(Xi)) = 0 or X(Y(Xi)) = 0, that
is, Y(Xi) =
8i2 , or Y =
8~2 on U.
o
Remarks 2.9.5
span {Xi (p) : 1 :::; i :::; d} = span {8~i :1 :::; i :::; d}.
Therefore the submanifold is given by the slice
Exercise 2.9.6 Let S be the set of all lines through (0, y) parallel to
x-axis, where y varies over the set of irrationals. Is it an integral sub-
manifold for tx?
Exercise 2.9.7 Draw a few integral submanifolds of X = -y tx + tx y'
We shall now deduce the general Frobenius theorem from the weaker
version (Theorem 2.9.4 and its extension in item 1 of Remark 2.9.5)
using an algebraic trick.
Yi = -
a + "'. Ciij-a
ax''j~d+l
L....i ax· J
so that
a
[Yi, Yk] = L aikj ax.· (2.9.1)
i~d+l J
Since 'D is involutive, [Yi, Yk] E 'D for 1 ~ i, k ~ d and since {Yi}1=1
span'D we can write
Now if [Yi, Yk) =I- 0, by linear independence, bikl =I- 0 for some I with 1 ~
I ~ d. Therefore [Yi, Yk ) will have a non-zero 8~1 term for some 1 with 1 ~
1 ~ d. This contradicts Equation (2.9.1). We thus conclude [Yi, Yk) = 0
for 1 ~ i, k ~ d. Theorem 2.9.4 (or more precisely, Remark 2.9.5-1) now
completes the proof.
o
We are now ready to prove the global version of F'robenius theorem.
{(S . i(q)
i(q),X 1 , ... ,xi(q»)}
d
qEN
2.9. F'robenius theorem 143
Theorem 2.9.12 Let G be a Lie group with Lie algebra g. Then there
exists a bijective correspondence between connected Lie subgroups of G
and Lie subalgebras of g.
o
tk
Lo
n
f(g exp(tX)) = k,Xk(f)(g) + o(tn).
.
The following theorem lists some facts about the exponential map
which are the tools needed to establish the relation between the analytic
structure of a Lie group G and the algebraic structure of Lie(G).
Proof We point out that all these formulas are obvious generalizations
of the corresponding ones in the case of G = GL(n,JR). For example,
let us see how to prove the second one in this special case. Here, as we
pointed out earlier, we have
mt n
L
2
f(exp(sX) exp(tY)) = ;'xmyn(f)(e) + 0(3).
m.n.
m+n=O
2.10. Lie groups and Lie algebras 147
Here we have used 0(3) to denote terms in which the sum of exponents
of sand t are at least 3. Therefore we have
t +
L -,-,
2 m n
f(exp(tX) exp(tY)) = xmyn(f)(e) + 0(t 3 ). (2.10.1)
m+n=O m.n.
But ift is small enough then exp(tX) and exp(tY) lie in a neighbourhood
of e E G and so does exp(tX) exp(tY). Therefore there is a unique Z(t)
in a neighbourhood of 0 E g such that
exp(tX) exp(tY) = exp(Z(t)).
We note'that Z(O) = 0 and that t t---t Z(t) is smooth. (Why?) We can,
therefore, write
f(exp(Z(t))) = L2 1(
k! tZl
t2
+ 2"Z2)
k
f(e) + 0(t 3 )
k=O
t2 t2
= f(e) + tZ1(f)(e) + 2" Z2f(e) + 2"ZU(e) + 0(t 3 )
(2.10.2)
Comparing the coefficients of powers of t in Equations (2.10.1) and
(2.10.2) and taking f as the coordinate functions of a system of canon-
ical coordinates, we have
1 1
Zl = X +Y and 2"(Zf + Y2Z2) = XY + 2"(X 2 + y2).
Using the first in the second we have
~(X2 + y2 + XY + YX) + ~ 2 = XY + ~(X2 + y2)
which yields Z2 = [X, Yj. Hence (1) follows. Comparing the last of the
Equation (2.10.2) with the following equation
= L2
;
tm+n( t)r+s
xm yn xr YS(f)(e) + 0(t 3 )
m,n,r,s=O
Proof If the result is true, then we should be able to talk of the Lie
algebra of H. Why are we interested in finding it? The reason is that
using the "Lie subalgebra" and the exp we may be able to endow a
smooth structure on H. If K is any Lie subgroup of G then the Lie
algebra of K is given by
Adjoint representation
Let G be a Lie group. For a, g E G, let La denote the inner conjugation:
g H aga- 1 • Then La is a Lie group homomorphism. For example, in
the case of G = GL(n,lR), we have LA(B) = ABA-l for A, B in G.
What is the derived map of the corresponding Lie algebras? From what
150 2. Manifolds and Lie Groups
we learnt from the section on flows, we know that we must use the exp
map to compute the derivative. Let Ad(A) := dt(A) be the Lie algebra
homomorphism of 9 to itself. Then we have
d
Ad(A)(X) = dt A exp(tX)A
-11 t=o = d tAXA-II
dt e t=O'
so that Ad(A)(X) = AXA- 1 • Thus Ad : G --+ GL(g) := Aut(g) is a
Lie group homomorphism. Now Aut(g) has End(g) as its Lie algebra.
(The reader who is not comfortable with these abstract objects may start
with a basis of 9 and use it to identify GL(g) and End(g) with GL( n 2 , JR)
and M(n 2 ,JR) respectively.) Now again the same question; what is the
derived map of the Lie algebras? Notice that if we denote by ad the
derived map dAd, then ad : 9 --+ End(g). To find an expression for ad,
we take At := exp(tX) so that for Y E g,
Ad(Ad(Y) = etxYe- tX = e(tXY(-tX)).
Hence
ad(X)(Y) = ~e(tXY(-tX))1 = XY - YX.
dt t=O
That is, ad(X)(Y) = [X, Yj.
We now want to prove the analogues of these in the general case
using the Taylor expressions of the last section. The differential dt a :=
Ad(a) is a Lie algebra homomorphism of g. Thus Ad: G--+ GL(g) is
a group homomorphism of G into GL(g) = Aut(g), the group of linear
automorphisms of the vector space g. It is easily verified that Ad is
indeed smooth. (Exercise.) That is, Ad is a Lie group homomorphism
of G into Aut(g). The map ad is a Lie algebra homomorphism of 9 to
End(g), the Lie algebra of the Lie group Aut(g).
Let X E 9 and t E lR. Let 9 = exp(tX). Then we have
exp(Ad(g))(tY) = tg(exp(tY))
= exp( tX) exp( tY) exp( -tX)
= exp{tY + t2[X, Y] + O(t 3 )}.
But in view of the relation" cp(expX) = exp(dcp(X)) ", we get
Ad(exp(tX)) = e ad tX = et ad X.
r
G _ _A_d_-----t) GL(g)
exp jexp
ad
9 ) nd~)
2.10. Lie groups and Lie algebras 151
The reader should notice that on the right we have used the fact that
we have exp(Z) = e Z in GL(g). Thus we get
and the fact that the dyadic rationals are dense in lR. (Recall that the
set of all dyadic rationals is of the form {; : m, n E Z}.)
Let 9 E exp(~U). Then 9 = exp(~X) for some X E U. We can
also write 9 = exp(~X)2 with ~X E ~U. Thus 9 has a square root in
exp( ~U). If a E exp( ~U) is such that 9 = a 2 , then a = exp( ~ Y) for
some Y E U so that
<P G) 2= <p(c:)
(1
= exp 2X )2
2.10. Lie groups and Lie algebras 153
Proof Follows by considering the identity map from one to the other.
o
The fifth problem of D. Hilbert asks whether there exists any smooth
structure on G, as above, which makes G into a Lie group. It was
solved in the affirmative by Gleason and Montgomery. The details of
this solution can be found in the monograph Topological Transformation
Groups by of Montgomery-Zippin.
The following result shows that the only Lie subgroups which are
also topological subgroups are the closed subgroups.
a a
--j._/
- ax; aXj e·
J
The coefficients in the last sum are smooth and they constitute the
matrix elements of Ad(g) with respect to the basis 8~; Ie ofTe(G). Hence
Ad is smooth at e.
1jJ(g, v) := gv := g(v),
the image of the vector v E lRn under the linear map g. Then 1jJ is a
smooth action of G on lRn. Is this action transitive?
Example 2.11.5 Let H be the Lie subgroup O(n) of GL(n, lR) . Take
M = sn-l, the (n - 1)-dimensional unit sphere in lRn. Then the restric-
tion of the action 1jJ of the previous example gives rise to an action of
H on sn-l. This action is smooth, since O(n) (respectively, sn-l) is a
regular submanifold of GL(n, JR) (respectively, lRn).
Example 2.11.6 Let G = 8L(2, JR) and M be the upper half plane
M:= {(x,y) E JR2 : y > O} C JR2 . We think ofJR2 as theone-dimensional
2.11. Homogeneous spaces 157
complex plane C and write z for (x, y) E ]R2. Then G acts on M via
fractional linear transformations:
.
If g:=
(ac d
b) ' then gz :=
+ d·
az +
cz b
Example"2.11.7 Let
Remarks 2.11.9
1. 11' : V -+ U is smooth.
a(a) 0 7r(g) = 7r 0 La
To prove uniqueness, let (x, a) and (y, "1) be the data for Po. Then the
map x 0 y-l = X 0 (7r 0 "1) 0 y-l = (x 0 7r) 0 ("10 y-l) is a composition of
smooth maps on the common domain.
Let JL denote the multiplication map: (a, b) H abo Then the map a
IS
G x fj ----} G x G _ _J.' ~> G _ _1r ~> G/ H
Proof Let /I(a, b) := ab- 1 for a, bin G. Then /I is a smooth map from
G x G -+ G. The map (aH, bH) H (ab- 1 )H is the map 7r 0/1 0 (aa X ab)
on an appropriate domain.
o
Let us return to the action of a group on a set. If a group G acts
on a set M, then the isotropy or isotropy subgroup Hp at a point p is
defined to be the set {g E G : gp = pl. It is easily seen to be a subgroup
of G. If G acts on M and if q = gp then the isotropy at q is gHpg-l.
(Check this.) The most important thing in this situation is the fact
that there is an almost natural bijection Y from G / Hp to M given as
2.11. Homogeneous spaces 161
follows: gHp t-+ gpo Notice that the map is independent of the choice of
the representative of the coset gH. If g' H = gH then g' = gh for some
h E H and hence g' p = (gh)p = g( hp) = gpo The map is onto, for, if
q E M then by transitivity there is agE G such that q = gpo Thus
T(gH) = gp = q.
If, in the above set-up, M is a smooth manifold which is homogeneous
under the action of a Lie group G then it is natural to ask whether
the map T is a diffeomorphism. The next theorem answers this in the
affirmative. Before stating the theorem, let us look at some examples. In
Example 2.11.5, if we choose p = en+! := (0, ... 0,1), then the isotropy
at p is
be the map g r-t gpo. The map T is smooth since we have T = f3o~07r-l
on iJ. We claim that T has rank equal to dim G / H. This will show that
T has maximal rank so that by the Remark 2.11.14 below it is a diffeo-
morphism.
The idea is to show that f3 has maximal rank at e. Consider the map
Df3(e) : 9 -t Tpo(M). Let X E 9 be in the kernel of Df3(e). Then for
any f E COO(M), we have
Then we see
Lemma 2.11.15 Let G be a connected abelian Lie group and 9 its Lie
algebra. Then the exponential map is surjective.
vn E {v E H : l (v) :5 1/n }
for every n so that Vn --t 0 in an. In other words, any neighborhood of
o in H will have infinitely many points of H. Hence H is not discrete,
a contradiction.
Let Cl := min{l{v) : 0 #- v E H} > o. Let VI be an element of H
with l{vd = Cl. Let HI := ZVl. If HI = H we are through. So we
assume that HI #- H. We enlarge {vd to a basis {Vi} of an and define
ll{V) := I:~2Iail, where v = I:~1 aiVi. We claim that
{x E an : l{x) :5 C}
for some C > o. Let Y be a limit point of {Yk}. Then Y E H since H
is closed. This contradicts the assumption that H is discrete. Hence it
follows that Yr = Ys for sufficiently large rand s. We therefore conclude
that {ll{V) : v E H, v ¢ Hd has a positive minimum. Let W2 E H be
a point at which this minimum is attained .• It is clear that VI and W2
are linearly independent. We define H2 := ZVl EI3 ZW2. If H2 #- H, we
extend {VI, W2} to a basis {Wi} of an such that WI = VI. We proceed as
above by defining l2, etc. This process must stop at a finite stage, say,
r. (Why?)
Let V denote the vector subspace spanned by H. If
satisfy the conditions of Theorem 2.11.16, then both sets are bases of
the vector space V and hence r = s.
o
Chapter 3
Tensor Analysis
Definition 3.1.2 Let Vi, for 1 :s; i :s; n and V be R-modules. A map
t.p : Vi X ••• X Vn -+ V is said to be multilinear or n-linear if the map
166 3. Tensor Analysis
is linear in each of its variables. That is, for each i with 1 SiS n, the
map
The set Lk(Vb"" Vk; V), of all k-linear maps can be made an R-
module in a natural way. For <p and 1/1 in the above set ·and a E R we
set
and
To get a hang of this, let us have a closer look at the case when
R = JR. and bilinear maps between finite dimensional vector spaces over
lR. The case of multilinear maps is treated similarly. Let {vn~l and
{Wi}j=l be a basis of V* and W*, respectively. Now if BE L 2(V·, W*),
B is uniquely determined by its values on the pairs (vi, wi), For, if
3.1. Multilinear algebra 167
This means that if we know B(v;, wi) for all i and j, then we know the
bilinear map B. For v E V and w E W, let v ® w denote the bilinear
map
.Bv,w(v· ,w·) := v*(v) w·(w).
From what we said above it follows that the collection
Let {e i } denote the basis of V~ dual to the basis {ei}. .These are the
linear maps ei such that we have ei(e;) = oj for all i, j. Here oj is the
so called Kronecker delta which is 1 if i = j and 0 otherwise. Now if we
choose u i := ei , the dual basis of ei, then Li,; at u i ® e; corresponds to
the linear map A : V -t V given by
(Latei®e;)(ek) = LLat(ei®ej)(ek)
iJ i j
= LLatei(ek)ej
i ;
= Lato~e;
i,;
That is, the linear map A has matrix representation (at) with respect
to the basis {ei}. In particular, the identity map I corresponds to
; .
Li,j hi C' ® ej.
This is perhaps the right place to tell you a standard way of writing
subscripts and superscripts in differential geometry. Learning it now will
save some real headache for you later if you get into some nasty "local
computations" .
Let V be an n-dimensional vector space over R Let {ei}f=l be a
basis for V over R Let V· denote the dual space of V. That is, V·
consists of all real linear functionals from V to R Let {e i } denote the
basis of V· dual to the basis {ei}. Let {Vi} be another basis of V over
R Then there exists an invertible linear map A from V to itself which
takes ei to Vi' If we write A as a matrix (a1) with respect to the basis
{ei} then we have:
Late; = Vi· (3.1.1)
;
We also have
(3.1.3)
3.1. Multilinear algebra 169
ci = a;ui E a;ui
:= (3.1.4)
i
Ui = b~ci:= E b~ci , (3.1.5)
i
where (b~) is the inverse of the matrix (a~). We have also used the Ein-
stein summation convention, namely, the repeated indices are summed
unless otherwise specified. We shall always use this summation conven-
tion witQout saying so in the sequel. There is a subtle change in the way
the summations for Vi and that for u i go. If we think of the index i in
the matrix (at) as the row index then in the expression for Vj in terms of
ej the summation was over the row index whereas in the dual situation
This follows, for example, from the identification of each of the above
three quantities with the bilinear maps f3(av,w) , etc. It is impractical to
think of elements of V ® W as bilinear maps always. The reader will do
well to deal with objects of the form v ® w.
We now fix a finite dimensional vector space V and let ®kV denote
V ® ... ® V := Lk(V*, . .. , V*). For k = 0, we define ®oV = R We call
®kV as the k-th tensor power of V. We define ®V := $~=o ®k V, the
direct sum of all the k-th tensor powers of V. Recall that the direct sum
means that any element t E ®V is uniquely of the form t = Lk aktk
with tk E ®kV and ak E JR, with only finitely many of them non-zero.
There is a map from ®kV x ®lV to ®k+lV as follows: Let s E ®kV
and t E ®lV. Then s ® t is the natural element of (®kV) ® (®lV). This
170 3. Tensor Analysis
for all (1 in the symmetric (permutation) group of k-symbols, and for all
Ui in V. For example, Lid aijei ® ej E ®2V is symmetric if and only if
aij = aji for all i and j where ei are elements of a basis of V.
since TU varies all over the group as u varies. The last expression is the
definition of Sk t(Ul, ... , Uk). Notice that if t is already symmetric then
Sk(t) = t.
Classically, tensors were treated by their components. Let us fix a
basis {ed for V. Then any t E ®kV can be uniquely written as a finite
linear combination of the form
Thus the tensor t is known with respect to the basis {ed as soon as
we know its components ai 1 i 2 ... i h' What happens if we choose another
basis? We have a transformation rule for the tensor. (Do you recall the
transformation laws we derived for the components of tangent vectors
and cotangent vectors?) For example, if {Vi} is another basis of V, then
for the basis {Vi ® Vj} of ®2V we have
_ k
Vi ® Vj - ai ek ® ajel
l
= aik ajek
l
® et·
Here we assume that the matrix (at) takes {ed to {vd: Vi := atej.
Now if t E ®2(V) has components tili2 with respect to ei and Tili2 as
components with respect to another basis Vi, then these components are
related by
We write 1)~(p) for the tensor product of TpM (r-times) and T;(M)
(s-times). That is,
r times _ times
..... A..
1 "
= k! ~" (sgn(O")) 2 cp(V1,"" vn)
UESk
= CP(Vll"" vn),
(k + f)!
(cp,1/;) H cp 1\ 1/; := kIf! Alt(cp ® 1/;).
Here we assume that cp E I\kV and 1/; E I\lV. The expression cp 1\ 1/; is
read as "phi wedge psi". We shall later give a convincing reason for the
weird constant in the definition of the wedge product. It is easy to see
that for a, b E JR, we have
(k+£+r)!
cp 1\ (1/; 1\ "') = k!f!r! Alt( cp ® 1/; ® "') = (cp 1\ 1/;) 1\ ",.
Lemma 3.2.3 If cp E NkV and t/J E ®lV, then Alt(cp ® t/J) = O. Simi-
larly if cp E ®kV and t/J E NlV, then Alt(cp ® t/J) = o.
L sgn(a)a(cp) = o.
aEH
Hence
1
Alt(cp ® t/J) = (k + i)! L L sgn(ra)r(a(cp ® t/J»
T aEH
=0.
This proves the first part of the lemma. The other part is proved simi-
larly.
o
Lemma 3.2.4 The wedge product is associative. That is, for cp E t\kv,
t/J E t\lV and 1/ E NV we have
(k+i+r)!
cp t\ (t/J t\ 1/) = k!i!r! Alt( cp ® t/J ® 1/) = (cp t\ t/J) t\ 1/.
176 3. Tensor Analysis
(k+i+r)!
(<p 1\ 1/;) 1\ "7 = (k.+ i)!r! Alt((<p 1\ 1/;) ® "7)
(k+i+r)! ((k+i)! )
= (k+i)!r! Alt k!i! Alt(<p®1/;)®"7
(k + i + r)! (k + i)!
= (k+t')!r! k!t'! Alt(<p®1/;®"7)
where the third follows from the second in view of Equation (3.2.1). The
other equality is proved the same way.
o
Thus 1\ V becomes an associative algebra under the wedge product.
Notice that if x, yare in V = 1\ 1 V then
Let
1 2 ... s s+1
r·- (
.- r +1 r +2 . .. r +s 1
3.2. Exterior algebra 177
t
Then sgn( r) = (-1 3 , as can be seen by counting the number of inver-
sions in r. We leave it for the reader to show that r( cp ® 1/J) = 1/J ® cpo
Then we have
Alt( cp ® 1/J) = L sgn( 0- )o-( cp ® 1/J)
ITESr+.
= L sgn(o-)o-(r(1/J ® cp))
ITESr+.
= L sgn(o-or)sgn(r)o-r(1/J®cp)
ITESr+.
forms a basis of AkV where [:= (it, ... ,ik) with 1~ il < ... < ik ~ n. This
follows from the fact that any cp E AkV can be written as
and the skew symmetry of cpo Thus the dimension of the vector space
A kV = (~). AV is called the exterior algebra of the vector space V.
There is a formal way of looking at the exterior algebra of V. We fix
a basis {ei} of V as above. Then we want to constru~t an associative
algebra over lR. which is as "free" as possible subject to the following
relations:
e~ = 0 and ei.ej = -ej.ei
for all i and j with i =f. j. To construct such an algebra we start with
all real linear combinations of possible finite expressions of the form
eil ei2 ... ei r where the juxtaposition of the basis elements means the
"product". We add them formally and multiply them by real scalars
formally. For example, if we have an expression of the form e2el e2 we
then use the relations above to see that
178 3. Tensor Analysis
since e; = o. Similarly
The reader should try to write some more expressions like this and sim-
plify them using the relations. It is easy to see that this algebra is
isomorphic to the exterior algebra of V. One thing to notice here is that
while the definition of exterior algebra of V does not depend on any
basis of V, the second definition depends on the choice of a basis of V.
If A is a linear endomorphism of V then A induces a natural map on
the vector spaces I\kV, which we again denote by A. If e[ is as above
then Ae[ := (AeiJ 1\ ... 1\ (AeiJ. In fact we have
Proof We choose a basis {e;} for V. Let 1 ~ i i < ... < ik ~ n, and
set
) ·=Ae·'1 1\···I\Ae·'k·
A(e·'1 1\···l\e·'Ic·
Then extend A linearly to I\kv. If for a collection it, ... ,ik we have
ei = ei with i =1= j, then
Now the top degree space /\nv is one-dimensional so that the in-
duced map is multiplication by a scalar. We claim that this scalar is the
determinant of the operator. That is to say, if we choose a basis {ed for
V and express A as a matrix (a{) we have
U1(Vk»)
U 2 (Vk)
. .
Uk(Vk)
k! 1 k-l k
(k_l)!Alt k ((U A···Au )0u )(Vb V2, ... ,Vk) =
(k ~ I)! I: sgn(a)(u 1 A··· A uk-1)(vU(1)' ... ,Vu(k_l»Uk(Vu(k».
UESk
(3.2.3)
By induction hypothesis, we have
(u 1 A··· A uk-1)(VU(1), ... , Vu(k-I) = det (ui(vu(j h~i,j::;k-l.
As earlier, we identify the subgroup H of G (:= Sk) of all permutations
that leave k fixed with Sk-l. Then Equation (3.2.3) becomes
For T fixed, as 0' varies over G, aT varies over G so that the expression
within the parenthesis is det (ui(vj)h~i,j9' Now the result follows since
the right side is (k - 1)!( (l/(k - I)!) det (u i (Vj) h~i,j~k'
o
Remark 3.2.10 It is essential to observe that if Vi is a basis of V and
u i is the basis of V* dual to
Vi then the formula above reads
This is true because of our definition of the wedge product, for in some
books, the wedge product is defined by
so that
A'-
.-
(1+. xl
X2 X l
XIX2
1 +x~ x'xn)
X2 X n
. .
XnXl X n X2 1 +x~
182 3. Tensor Analysis
To compute its determinant we notice that the i-th row is ei +Xixt where
{ed is the canonical basis vector where we think of ei and x as column
vectors and xt denotes the transpose of x. Hence we have
Exercise 3.2.12 For v and w in ]R3, show that the vector v x w is the
unique vector such that thp following holds for all u E JR3:
r times ~ times
A A
An element tp of D:
(p) is called a tensor of contravariant rank rand
covariant rank s at p. Just as in the case of a vector field we say tp is
a tensor field if it is a map p f-t tpp E D:(p). How do we impose some
smoothness conditions on a tensor field? Given p E M we choQse any
local chart (U,x) of M with p E U. Then we have seen that a~i form Ip
a basis for TpM and dx;lp form a basis for T;M. Hence we can write tp
on U as follows:
tp =L .. _I
tp'.l"".r a ® ... ® -
11···1. ax II' p a x 'Ir
a®ldx·l
p 11 p
® ... ® dx·
1. P
1.
We say tp is a smooth teIisor field of type (r, s) if the coefficients tp~ (in
an obvious notation) are smooth for all coordinate charts.
In classical tensor analysis, tensors were recognized by their compo-
nents tp~ and the transformation rule they obeyed. As an instance of
this we have pointed out when we were dealing with vector fields, that
is, contravariant tensor fields of rank 1 or tensor fields of type (1,0).
Just to give you a sample of this formulation, let us see how a tensor
field of type (1, 2) is expressed in terms of two charts (U, x) and (V, y)
wit.h UnV #: 0.
We first recall the transformation rule for the tangent vectors and
the cotangent vectors.
a LaYj a
= (3.3.1)
ax; j ax', ay·1
dx; = Lax;
~dYj. (3.3.2)
j Y1
184 9. Tensor Analysis
(3.3.3)
with respect to the two charts. Then we use the transformation formulas
for the tangent vectors and the cotangent vectors to get
'P -
8 Ym (p) 8
" a ijk -8
- L.....J -8 Ip ® -8 p dYr Ip ® -8
8xj () p dys Ip
8Xk ()
Ok X, Ym Yr Ys
0
'1
(3.3.4)
= "L.....J -8
8Ym
°
8Xj
(p) -8 8Xk
(p) -8 (p) a:ok -88 Ip ® dYr Ip ® dys Ip
° Ok X, Yr Ys Ym
'1
We are now ready for the global definition of tensor fields. We set
1>r(M) to be the set of r mqltilinear maps
Note that the left side is a COO(M)-module and the linearity is with
respect to Coo (M).
Similarly we set 1>S(M) to be the set of all s-linear maps
Proof The proof follows the pattern of the fact that a derivation of
Coo(M) gives rise to a vector field on M.
Let F E (1)1)*. Ifw E 1>1(M) vanishes on an open set V, then F(w)
vanishes on V. The argument is same as in the case of vector fields
and hence is omitted. (Use the fact that F(fw) = fF(w).) If p E M
and (U, x) is a local chart containing p then any I-form on U looks
like Edidxi on U with Ii smooth on U. This implies., by the usual
reasoning, that F(w)(p) = 0 whenever wp = O.
The second step shows that the map wp I-t F (w) (p) is a well defined
linear functional on 1> 1(p) which is T; M. Hence there exists a unique
Xp E TpM such that
F(w)(p) = wp(Xp) for all wE 1>1(p).
Thus F-gives rise to p I-t Xp with Xp E TpM. We claim that the above
assignment is smooth. For, if Z E U, we write X z = Ei ai(z) 8~i Iz
for some ai(z) E lit Now for any i there is a I-form Wi = dXi on a
neighbourhood of z contained in U. Thus we have
as follows:
Example 3.3.7 Fix w E 1)1(M) and consider F : 1)1 (M) X 1)1 (M) -+
COO(M) defined by F(X, Y):= X(w(Y)). Is F a (0,2)-tensor?
188 9. Tensor Analysis
..
r times
on UnV.
Let us look at the simplest possible case: Let w = L at.dxi = L bjdYj
on (U,x) and (V,y). Then, as usual, ai = bj~j bj = ai~. In the
following if cp is a tensor field over M and p EM, then cp(p) (or ,
vlip) will stand for the map at the appropriate product of tangent and
cotangent vector spaces at p. For instance, if cp is a one form, then cp(p)
denotes the restriction of cp to Tp(M).
Therefore dLw(x) = L dai /\ dXi and
3.4. The exterior derivative 191
i=l
(3.4.1)
Does dw E 1)rH (M) for w E 1)r (M)? That is, we want to know
whether dw is an alternating covariant tensor of rank (r + 1): dw is
clearly additive. We show that for all f E COO(M)
k=i<j
+ L w([Xi ,fX k],X 1 , ... ,X:, ... ,fZ, ... ,X +d
r
i<j=k
192 3. Tensor Analysis
and
it follows that the second and the third terms add up to zero. Finally
we get
D(dx '1· ) A dx''2 A··· A dx'Ir - dx''1 A D(dx·'2 ) A··· A dx·'r + ...
by property (3) . Since D f = df for all f E Coo (M),
Thus the above properties characterize d L locally, that is, if at all there
exists D on all of I\M satisfying the properties, then for all w E 'D(M),
Dwlu = dL(wl u ) where (U,x) is any coordinate neighborhood in M.
Does d L satisfy the above properties (locally) on U? The first two
properties are clear. Since dL is linear, it is enough to prove the third
one when w and .,., are given as
and
Then
and
dL(w 1\.,.,) = d(ab) 1\ dXi1 1\ ... 1\ ... 1\ dXj.
= (da.b + a.db) 1\ dXi1 1\ ... 1\ dXir 1\ dxj, 1\ ... 1\ dXj.
= da 1\ dXil 1\ ... 1\ dXir 1\ bdxjl 1\ ... 1\ dXj.
+ a dXi1 1\ ... 1\ dXir 1\ (-Ir db 1\ dxj, 1\ ... 1\ dXj.
= dw 1\.,., + (-Irw 1\ d.,.,.
This proves the third property for d L . To prove the fourth one for d L ,
observe that (d L )2(J) = dL(d LJ) = 0 since
dL(d L J) = dL(I:
j
:!. 1
dXj)
= ~ (~a~i (:~)dx}dx;
I: 8 8.8f
2
= .dXi I\dxj
.. X, Xl
't)
=0,
as ~ ~
81:;81:j = 81: j 81:;· Now w is a sum of products of r I-forms of type
df and hence, by the third property, each term in (d L )2W has a factor
(d L )2 f = 0 and so (d L )2w = o.
194 3. Tensor Analysis
dw(Xill···,Xir+l) = 'L.,(-l)
" k+1 XikW(Xill···,Xik
~ ... ,Xir+l )
Proof Notice that we have already seen a very special case of this: If
9 : N ~ IR is smooth, then dg E 'D(N) and 'P·(dg) = d(g 0 'P). For, if
X E 'Dl (M), then
dw = da /\ dYl /\ . . . /\ dYr
and
(3.4.3)
Thus we see that d on I-forms can be identified with the curl (Y') of the
vector field F.
Let cp = g1 dX2 A dX3 + g2 dX3 A dX1 + g3 dX1 A dX2 be a 2-form on 1R3 .
Then proceeding as above we easily see that
In some books there is a factor (1/2) on the right side of the above equa-
tion. The reason for this anomaly is the way the exterior multiplication
wedge is defined. In the definition adapted by these books the definition
of wedge is
9.4. The exterior derivative 197
Let us verify the above equation for dJ.JJ from the basics so that we
can perceive how the definition of wedge product affects the exterior
derivative. It is enough to check it on the I-forms of the type W = f dX1'
By the local expression for d we have dJ.JJ = df /\ dX1' Now if Ok := {J~k'
etc., then we have
+ I)!
(df/\dxt}(ok,OI) = (1 1.1.
" Alt(df®dxt}(ok,ot}
(1 + I)! 1
= 1!1! 2!{(df®dx1)(ok,0t}
- (dX1 ® df)(OI, Ok)}
= df(ok) dx1(01) - df(0t}dx 1(ok)
= ok(f)81- 8lol(f).
On the other hand using the global definition we see that
dJ.JJ(Ok, 01) = Ok(W(Oz)) - OI(W(Ok)) - W([Ok, aiD
= Ok (fdx1 (01)) - 01 (fdX1 (Ok))
= 810k(f) - 8lol(f).
Thus we have verified the equation. Similarly in the expression for the
global definition of d in the other convention one will see a factor of
(11k) if the form is of degree k.
Now we attend to the dual formulation of Frobenius. If 'D is a dif-
ferential system of rank r, then we can describe 'D also by 'D* where
dJ.JJi = L wi /\ Wj'
i=1
Proof This is an easy exercise using the vector field version of the
Frobenius theorem and the above formula for dw.
o
198 3. Tensor Analysis
(iii) Compute dU*w) where f : ]R -t ]R2 with f(x) = (x, -x) and
w = dx + dy. (Compare your work with your friends!)
We saw above that if c.p = dw then dc.p = o. We may ask whether the
converse is true. This question is intimately connected with topology.
Before explaining what we mean by this let us show that locally the
converse is true. This follows from the following
We now compute
It=o 8t
(3.4.5)
3.5. Lie derivatives 199
When k > 0 we have .B1(o,u) = (Id)*cp = cpo Since F(I,x) = p for all
x E U, we have .B1(1,u) = O. Hence we get
o
We remark that Lemma 3.4.6 shows the following: On any manifold
M, if we are given a k-form dcp = 0 and a point p, then we can find a
neighborhood U of this point and a (k - I)-form wp such that we have
dwp = cp on U. However it may not be possible to find a form w on
the manifold M so that we have dw = cpo An example can be found
in Section 4.2. Thus to solve dw = cp locally, a necessary condition is
that we have dcp = O. However to solve this equation globally there are
topological obstructions. To explain this we need to introduce the de
Rham cohomology.
Definition 3.5.1 Let X E 1)1, with the corresponding flow CPt. Let
p EM and consider a basis {ei} of TpM. Since CPt is a diffeomorphism
around p, {cps.(ei)} is a basis of T",.(p)(M). Thus Ei : S I-t cps.(ei)
can be thought of as curves lying above the integral curve of X through
the point p. The map S I-t E = (Eb ... , Em(s)) can be considered as a
curve in the frame bundle l' of M. (This is the union UpFp of frames Fp,
that is, bases of TpM as p EM. Each Fp is diffeomorphic to GL(m,lR)
and we can make l' into a manifold of dimension m + m 2 • See Appendix
A for related notions.)
integral curve of - X
P integral curve of X
and we have
. (""-t).(Y<p.(p))-Yp = hm-
hm . 1 (( h () (( (
t , ... ,Im t))- h O), ... ,lm(O)))
t-+O t t-+O t
alI
( aXl'···' aIm)
= aXm (p),
and
[X, Y](p)
a a
= [ aXl' ~ Ii aXi
1(p) = ~ aXi
ali a
aXi Ip.
202 3. Tensor Analysis
Thus, in this case the result is proved. The case when Xp = is handled °
°
by a simple limiting argument.
Let now X = in a neighborhood of p. Then [X, Y](p) = 0. In this
case the integral curves starting at p are constant so that <Pt is identity
for all t. Hence the numerator in the difference quotient is zero and the
result follows in this case too.
If Xp = 0, there exists a sequence {Pn} of points such Pn --+ p and
such that XPn #- 0. The result follows in this case by continuity of X
and the first case.
o
Proof (2) The reader should compare this proof with that of the Basis
theorem (for the tangent space). We set F(t,p) := f(<pt(p» for an
arbitrary smooth function f on M. We then have the first order Taylor
expansion
F(t,p) - F(O,p) = t 10
f 1 at
8F
(st,p)ds = t . h(t,p), (3.5.1)
h(t) = df ((<p-t).Y",t(p»)
= ((<p-d. (Y",.(p»)) (J)
= Y",.(p) (J 0 <P-t).
We have
of d
Ft(O, 0) = dt It=o (Yp(f 0 tp-t)) (3.5.2)
= Yp(-Xf) = -Yp(Xf).
1t
In Equation 3.5.2 above, we used the fact that It=o and Yp are tangent
vectors to the factor spaces of lR x M and hence they commute.
o
We now wish to define the Lie derivative in a more systematic way
that will be of practical use.
Assume that one such £, exists. Let C : 'Dl -+ COO(M) be the contrac-
tion. Then property (2) implies that
The left side Equation (3.5.3) is X(w(Y)) whereas the right side is
(£'xw)(Y) + w([X, Y]). Hence it follows that we must define £'xw for
any I-form was
Dropping p we get
and hence
k
X(CP(Xl"'" Xk)w(Xk+d) - LCP(XI, ... , [X, Xi]"", Xk)W(XkH)
i=l
206 3. Tensor Analysis
We group the terms as follows: (first term + third term) + (second term
+ fourth term) to get
k
X(1/J(Xb ··., Xk+l» - L 1/J(X b ... , [X, Xi], ... , Xk+l).
i=l
.
zx<p
(Xb ... ,
X ) ._ {<p(X, Xl, ... ,Xk- l ), if k 2: 1
k-l . -
0, if k = o.
Cartan's formula relates d, £'x and i x .
ixd(<p)(Yb ... , Yk )
o
Chapter 4
Integration
Definition 4.1.1 We say that <p and 1jJ (both non-zero) are equivalent
or say that they define the same orientation if we have <p = a 1/1 with
a> O.
Notice that /\n V \ {O} = [<p] U [-<p], where <p is a non-zero element
and [<p] stands for the equivalence class of <po Thus, each V has two
orientations. For any ordered basis B := {VI, V2,' .. ,vn } of V, 1jJ(B) :=
1/1 (VI , ... , v n ) has the same sign for all1jJ E [<pl. The space (V, [<p]) with
a fixed orientation <p E /\n V \ {O} is said to be oriented. We also write
(V, <p) in place of (V, [<p]).
If (V, <p) is an oriented vector space, we sayan ordered basis {edk=l
of V is positively oriented if <peel, e2,"" en) > O.
Exercise 4.1.2 Two ordered bases {ed and {vd are both positively
oriented (respectively both negatively oriented if and only if the (invert-
ible) linear map A taking one to the other has positive (respectively
negative) determinant.
Now we want to extend the notion of orientability to smooth mani-
folds. The naive attempt to define this concept in terms of vector fields
would be: M is orient able if we can find Xj E X(M), 1 ::; j ::; dimM
208 4. Integration
such that {Xj (p)} is a basis of TpM for all p EM. This definition is not
the correct one, as we know that we may not even find one smooth vec-
tor field on M which is nowhere vanishing. There is a more appropriate
way of defining orient ability.
Since we want to talk of infinitesimal volume elements we would
like to assign oriented volumes to parallelopiped in TpM, for all p EM.
Thus we need to choose an oriented volume element wp in 1\m T; M \ {O}.
Naturally, we wish to stipulate that this choice be smooth, that is, p ~
wp is a smooth m-form on M, where m is the dimension of M. This
suggests the following
Definition 4.1.4 Two charts (UQ, xQ) and (U{3, x{3) are said to be con-
sistently oriented if either they do not intersect or if U n U{3 ::I 0, then
Q
8 8
wp (-8. '···'-8 . )=
xi x~
where the sum is over the finite number of Vi such that h(p) =f. o.
o
Example 4.1.6 If M is a manifold and if there exists an atlas consisting
of two coordinate charts {(U, x), (V, y)} with Un V connected, then M
is orientable. (Exercise.) Hence snis orientable.
(- 1, 1) 1 (1 1)
r---.4---,-----r--...,
(- 1. - 1) - 1 (1 -1 )
Figure 4.1.1 Mobius band
U1 = {(x, y) : x f- ±1},
U2 = {(x,y): ~ < x:::; 1 and -1:::; x < -~}.
Note that as (1, y) = (-1, -y) in M, we need to check that 'P'2 is well
defined. It is so and it is smooth. Clearly we have on
1
U1 n U2 = {(x,y) : 2" < x < I} U {(x,y) : -1 < x < -1/2}.
Note that U2 is twisted and then joined to U1 . (See Figure 4.1.1.) The
Jacobian of the coordinate changes has determinant +1 on the first set
011 the right side above and determinant -Ion the second set. Now this
implies that M is not orientable. (Why?)
4.1. Orient able manifolds 211
. 0.5
z-aX1S
y-axis
x-axis 1 -2
2
This defines wp E /\ nTpM and wp -:f. O. For, if {VI, ... , v n } is a basis for
TpM, then {Vl, ... , Vn , l1p} is a basis for TplRn+l and hence Wp(VI, ... , vn )
is the oriented volume of [7]p, VI, ... ,vnJ. The latter can be shown to be
Jdet((vi,Vi))O~i,i~n, where Vo = l1p. (See Lemma 4.2.10.)
We now prove the converse. From the local description of subman-
ifolds, we know that given p EM, there exists a coordinate chart
Up around p such that it is the level set of a regular function fp de-
fined on an open neighbourhood of Up, fp: Vp ::> Up -+ lR. The vec-
tor field 7]p: q t-+ II :~::! ~:~:~ II is a smooth normal field on Up- We let
(xf, ... , x~) be a positively oriented system of coordinates on Up. We
let vp(q) := ±7]p(q) according as
[ w= ±i t.p*(w). (4.2.1)
4.2. Integration on manifolds 215
where on the right side we have the Riemann integral of the smooth
function f 0 cp. It is finite since this function is compactly supported
on cp(U). Equation (4.2.1) tells us that the left side of Equation (4.2.2)
is well defined. For, if (V, '1/1-1) is another local chart inside which the
support of w lies we could have defined
r
JM
w:= r
J",(V)
f('I/I(p))dY1".dYm,
any partition of unity {Ii} subordinate to this oriented atlas. Here again
notice that we can and do take positively oriented coordinates on the
sets which refine the given atlas. Then we set
1M
r W:= L lUir liw.
i
The first thing to observe in the definition above is that the right side is
finite since the summation is over a finite number of indices and each of
these integrals is finite due to the locally finite refinement. The second
thing to see is whether this notion of integral is independent of the choice
of the partition of unity. Thus, if {gj} is another partition of unity, then
we claim that
L J gjw= L J liw. (4.2.3)
J •
This is easy. We carry this out in two steps. First we notice that if W
has support in the intersection of two positively oriented charts and if
we take integrals with respect to each of them then the integrals are the
same by the differential form version of the change of variable formula.
The second step is as follows. Note that gjW = L:i Ji9jW so that
J gjW = J L:i ligjw. By our assumption on the supports of the func-
tions, we get J 9jW = L:i J ligjw and hence
(4.2.4)
1..,
w:= f
l(a,b)
'Y*(W) = Lib i a
(Ii 0 'Y)(t) dd'Yi dt,
t
the last integral being the Riemann integral.
1. ,
Hence
w= fTr dt = 211' .
l-Tr
Example 4.2.1 is of significance to us. From the Poincare lemma
(Lemma 3.4.6) we know that if we are given any closed k form w on
M then locally we can find a k - I-form tp such that dtp = w. That
is, if p E M then there is an open set U 3 P and a form tp on U such
that dtp = w on U. In general, it is not possible to solve this equation
globally. That is to say, we may not find a tp on the whole of M such that
dtp = w holds on M. Example 4.2.1 furnishes one such instance. For,
if possible, let 9 be a smooth function on the manifold M := ]R2 \ {O}
such that dg = w. We then have 'Y·(w) = 'Y·(dg) = d(g 0 'Y). We claim
that f.., dg = O. This will contradict what we have shown above, namely,
f.., w = 211'. Now we prove the claim:
'Y*(w) = 'Y·(dg) = d(g 0 'Y)
so that
f w = r d(g 'Y) dt
i:
0
1.., l-Tr dt
= (g 0 'Y)'(t)dt
fa I(ag) fa I(g)
= for all a E G. (4.2.6)
This suggests the definition W := x-I dx. For then we shall have
Let us see how the I-form dx changes under left translations. Let
g = [a, bJ E G. The diffeomorphism Lg is given in local coordinates
by (x, y) f-t (ax, ay + b) since Lg([x, y]) = {ax, ay + bJ. This shows that
we have
L;(dx) adx;
L;(dy) ady.
! G
f(g)dg = 1 IR+ xlR
dxdy
f(x'y)-----2 .
X
Exercise 4.2.6 Let G = 8L(2, JR) act on the upper half plane X via
fractional linear transformations on X, as in Example 2.11.6. Find a
basis for G-invariant I-forms on X.
Now we compute
1. M is orientable
(Hint: Recall that you may assume M = G / H for some closed subgroup
H.)
The rest of the section should be read after Section 5.2.
so that
det((vi,1Jj)) = det(AAt) = (detA)2.
By definition, we have
where we have set G :=det(gij). Here, of course, gij(P):= g(8~; 8~; Ip' Ip)'
Thus, the form dV is smooth and is called the volume lorm correspond-
ing to the Riemannian metric g. This is the canonical choice for an
orientable Riemannian manifold. Hence we can define the integral of
any compactly supported smooth function I on M as
r
1",(u)
(f 0 cp-l)v'c(cp-l(p))dJ.L(p) = r
1~(v)
(f o1/l-1)v'c(1/I-l(p))dJ.L(p).
This follows trivially from the (non-differential form version of) the
change of variable formula and the following exercise.
4.3. Stokes'theorem 223
Exercise 4.2.11 Let (U,x) and (V,y) be two coordinate charts in the
Riemannian manifold M such that Un V is nonempty. We observe that
r,s ' J
Hence,
r f:= r
JM J",(U)
f 0 cp-l (x) det(gf,j) (p) dx,
where (U,x) == (U,cp) and cp(p) = x. Equation (4.2.7) shows that this is
well-defined.
Now using a partition of unity we can extend the notion of integral of any
(continuous) function with compact support. Let us once again remark
that if we wish to integrate forms we must assume that the manifold is
orientable.
n
That is, under the induced coordinate map Un is mapped to a set in
the half-space
1R~ := {x E IRm : Xm ~ a}.
qI q
( ax ) _
(axqm ) (ax I ) (4.3.1)
ax~ - ax~ a~
J l~i,j~m J l~i,j~m-l
::t
Yj l~i,j~m-l
o
Let M be a smooth oriented manifold and n be a domain in M with
smooth boundary. We want to fix an orientation on an. To motivate
our choice of orientation of an, let us look at the case when M = IRn+l
and n = 1R~+l = {x : Xm 2: O}. If {ei} is the canonical basis and {Ui}
the dual basis, then we have
n= {x E IRn+l : Un+l(x) 2: O}
and
an = {x E IRn+l : Un+l(x) = O}::: IRn.
Now on M = IRn+l, we have the orientation determined by the (n + 1)-
form dXll\.dx21\.·· ·I\.dxn+l' If we identify as usual, TxlRn+l with IRn+l so
that on each TxlRn+l we have the form ull\. u21\.· .. 1\. Un+l corresponding
to (dXll\. dX21\.···1\.
.
dxn+dl p .
Now the outward normal to an is given by the vector field p t-+
-,,2-1
UXn+l P
' that is, -en+l under the usual identification. We fix an ori-
entation on Tp (an) ::: IRn by requiring that {VI, V2, ... , V n }, a basis of
Tp(an), is a positively oriented basis if
(Ui I\. U2 1\. ... I\. Un+l)( -en+l, VI,.'" Vn ) > o.
For example, this means that if we take Vi = ei we have
-lLI(entd
lLI(e n ) )
- 1L2(e n+l) 1L2(en)
det ( .
We keep the above notation. Now for any p E an, we have a positively
oriented local chart (U, x) of p in M such that
Un an = {z E U : xm(z) = xm(p)}.
On this chart U n an, we stipulate that the orientation is given by the
form
dYl 1\ . . . 1\dYm-l if m - 1 is even ;
-dYl 1\ .. . 1\ dYm-l if m - 1 is odd.
n
Here U'" n = {q : xm(q) 2: a}.
Let {j",} be partition of unity subordinate to {U",}. Since an and
n are compact and {U",} is locally finite the number of a's such that
n
U'" n an -I- 0 or U'" n -I- 0 is finite. Hence we have
( i*(w) = { dw.
lan ln
4.3. Stokes'theorem 227
so that
Therefore
on an. Therefore
(4.3.2)
Hence we have
(4.3.3)
Proof Suppose that there is a smooth map f with the property that
I is identity on aBo We write f := (It, ... , In). We consider the two
(n - I)-forms defined as follows:
We now wish to apply Stokes theorem to the n-forms dw and dry. First
we observe that
dry := dlt /\ ... /\ din = 0,
since dJi E Tf(x)sn-l, an (n - I)-dimensional space. On the other hand
dw = dx l /\···/\ dx n , the volume form on B. Stokes' theorem yields the
following absurdity:
We now give the proof of the fixed point theorem essentially for
cultural reasons, since the proof is a beautiful mixture of analysis and
elementary topology.
r
1M
Div(X)dV= r
1M
(doix)dV= r
IBM
t*(ix dV ).
230 4. Integration
I
We then have
u
Div (X)dV =
Jau
r
(X, u) w. (4.3.4)
o
We derive an expression for the divergence of a vector field X on
U Let X := Li /jaj , where aj := 8~j on U. We claim that
c IRn.
· X -_ ""'
D IV fj
~ au.' a (4.3.5 )
j J
First observe that if rlV := dUl/l.· .. /1. dUn is the standard volume form,
then
so that
Proof Take X = (0, ... ,0, I, 0, ... ,0) with I at the i-th place in
(4.3.4).
o
Theorem 4.3.9 (Integration-by-parts) Let I,g E C1(O) n C(o}
l -l
Then
Ix;gdx = Igx; dx + l Igvi dS. (4.3.7)
°
Illg = on S (in particular ilone 01 th?m has compact support in OJ,
l -l
we have
Ix;gdx = Igx; dx (4.3.8)
r
In(u~v-v~u)dx= isr (av
uav-v au)
av dS. (4.3.11)
Riemannian Geometry
To(Y(-y(h)))
Y(-y(h))
In other words,
(5.1.3)
if we think of a vector field Z = E Z (ud 8~; = L Zi 8~; simply as
(Zl, ... ,Zn).
Let us summarize our findings in the form of a proposition for later
use.
234 5. Riemannian Geometry
a a
Dx Y =L X(Yi) aUi where Y = LYi.aui '
then Dx Y(p) depends only on the value Xp of X at p. That is, we can
find DxpY as soon as we know Y along any curve "I through p having
Xp as its tangent vector at p.
We call D x Y the covariant derivative of Y with respect to X. What
made things work on IRn was the existence of a natural parallelism or a
parallel transport from TqIRn to TpIRn, Tpq : TqIRn -7 TpIRn given by
Remark 5.1.2 (The reader can omit this on first reading). The formula
Equation (5.1.3) for DxY above clearly brings the distinction between
the covariant derivative Dx Y and the Lie derivative Lx Y = [X, Yj: in
Dx Y, X does not get differentiated whereas in Lx Y coefficients of X
get differentiated with respect to Y. Compare property (3) below with
(2.7.1).
Properties of D on M:= Rn
Since X(M) is a COO(M)-module it is natural to look what happens
when we take f X in place of X, gY instead of Y. (Is there any relation
between DxY and DyX etc.) We have for all X,Y,Z E X(M) and for
all f,g E COO(M)
3. D/xY = fDxY
4. Dx(gY) = gDX Y + X(g)Y
5. DxY - DyX = [X,Y]
We now explain the notation of Property (6) and then prO~'"2 i::. On
each TpJRn, we have a natural inner product gp defined as follows. Let
I
v = E Vi 8~i p ' and W = E Wi 8~i Ip' Then we set
n
gp(v,w) = (v,w) = I>iWi.
Z! = Z(LXj}j)
i
(5.1.4)
236 5. Riemannian Geometry
Similarly
(5.1.5)
From the Equations (5.1.1), (5.1.4) and (5.1.5) we get Property (6).
To say D 'on M' has Property (6) we need the notion of a Riemannian
metric, that is, an inner product on TpM for all p E M which varies
smoothly. This is the subject matter of Section 5.2
Before looking for connections on general manifolds, let us apply our
knowledge of D on IRn to find candidates for the covariant differentiation
on submanifolds of IRn. Let us assume for simplicity that S is an n-
dimensional submanifold (usually called a hypersurface) of IRn+1. We
denote by D what was D above: namely, the covariant differentiation
defined by Dx Y = "(X(Yl), ... , X(Yn+1))".
So, given X, Y E X(S) we want to define DxY, another vector field
on S. How about setting DxpY = DxpY? Notice that right side makes
sense: if"( is a curve in S such that "((0) = p and "('(0) = Xp, then
"( : (-g, g) -+ S '---+ IRn+1 is a curve in IRn+1 with the same properties.
By Proposition 5.1.1 of D, Dxp Y is well defined. The only trouble here
5.1. Covariant differentiation 237
- "'(I(t)"!' (
D 8
t )= " ( -y 8x 8 (-¥
+ x 8y ) ,-y 8x
8 + x 8y
8 (x ))"
= "- (x,y)"
= -(x~
8x
+y~).
8y
This is the normal field n, that is, at each point p = (x, y), this vector
is perpendicular to Tp(S) (with respect to the inner product on R2: we
have Tp(R2) = Tp(S) EBRnp , an orthogonal direct sum).
But there is a way out! Why not define Dx Y = (15x Yf, where
(Dx y)T is the tangential component: Tp(Rn+l) = Tp(S) + (Tp(S))J.
with respect to the natural inner product given by
/ ~,~ \ =8ij
\8Ui 8uj /
Thus we define
(5.1.6)
r
Dx Y is smooth on S if X, E X(S): Given pES choose adapted
coordinate neighborhood U and U· of p in_S ~n<!..JRn+l, and extend
every vector field in sight to U·. Call them X, Y, N, etc. Then
for all q E U. The right side depends only on Xq and the values of Yon
UCS.
That D : X(S) x X(S) -+ X(S) satisfies Properties (1)-(6) is easily
verified. Since TpS C TpJRn+l, TpS has a natural inner product which
smoothly varies as p does. For, if p E U is as above, then q f-t (Xq, Yq)
is the restriction of the smooth function z f-t (Xz, Yz) to U. Now it is
trivial to see that Property (6) also holds for D.
For q E U as above, we have
We require this map to be smooth in the sense that if for all coordi-
nate neighborhoods (U,x), the functions p I--t gij(p) = gp(8~i Ip' Ip)
8~j
is smooth for all i, j. This smoothness condition is the same as requiring
that for all X, Y E X(M), p I--t gp(Xp, Yp) is smooth. (Verify this.)
a
-a I I--t ei = (0, ... ,0, 1,0, ... ,0)
Ui P
°
Since Ii (p) =I for finitely many i only, the right hand side above is
a finite sum. It is clear that gp is a positive definite inner product on
TpM. The map p I--t gij(p) is smooth. (Check.)
240 5. Riemannian Geometry
First notice that the right side is defined. To show that 9 is smooth, we
use adapted coordinate systems to show that p t-t gij (p) is the compo-
sition p t-t gij 0 <p(p).
As a concrete example, let S be a surface in 1R3 . That is, S is
a 2-dimensional submanifold of 1R3 • Let (U, <p) be a coordinate chart.
We let the coordinates on 1R2 be (u, v) and the induced coordinates
on U be XI,X2. Then the coordinate vector fields {)~l = D1j;(:J and
,ft-
UX2
= D1j;(!
uV
) are written with respect to the coordinate fields QUi
o.{) for
1 ~ i ~ 3 on 1R as follows: Let
3
Similarly
Therefore
(Compare this with the formula (2.1.3) on page 68.) We pull the Rie-
mannian metric on S to ]Rn via cpo
For v E Tpsn, Dcp(p)(v) = -y'(O), where -y(t) = cp(p + tv). Therefore,
l("'t)
r (g"((t)("'t'(t),"'I'(t)))
= Ja
b 1
2 dt. (5.2.1)
with "'I = ("'11, "'12, .•. , "'In) where ]Rn is given the usual Riemannian metric
as in Example 5.2.2.
Then ds 2 = g(!fit, !fit). Assume "'I lies in a single coordinate chart (U,x)
it
so that
ds 2 = L gijdxidxj := I.
ij
Exercise 5.2.10 Calculate the first fundamental form for the flat torus
in JR.4 : (u,v) I-t (cosu,sinu,cosv,sinv).
Example 5.2.13 Let M = JR.2 with the usual Riemannian metric and
let M be the cylinder {(x,y,z) : x 2 + y2 = I} In JR.3 with the induced
Riemannian metric. Then cp : JR.2 -t M given by (u, v) I-t (cos u, sin u, v)
is a local isometry which is not a global isometry.
We have TpM = JR.:u EBJR.a: with
Now
x = Dcp(~)
au = -sinu~
ax +cosu~
ay
and
Example 5.2.16 Let D = {(x, Y) E JR.2 : x 2 + y2 < I}. Define the Poin-
care metric on D:
(v,w)
g(v, w) = (1 _ (x 2 + y2))2'
Therefore
(D (v) D (v)) _ (-y'{0),-y'{0)) (5.2.2)
<p ,<p - (bz + a)2{bz + a)2
But
2 1 - I Z 12 1 - I Z 12
(5.2.3)
1-1 <p{z) I = I a+bz 12 = (a+bz)(a+bz)
Equations (5.2.2) and (5.2.3) imply that
Example 5.2.17 Consider the helicoid (a spiral stair case) with the
parameterization
((),</» t-t (cosh () cos </>,cosh()sin</>,()) </>,() E JR and 0 < </> < 211".
Then the map <p : M -t M given by u t-t sinh (), v t-t </> is a local
isometry. (Exercise.)
.
Example 5.2.19 H of Example 5.2.18 and D of Example 5.2.16 are
isometric via the Cayley transformation D -+ H given by z t-+ -i !=~:~
(Prove this.)
We make an important remark below:
(DzX,Y) = Z(X,Y)-(X,DzY)
= Z(X,Y)-(X,DyZ)-(X,[Z,YJ). (5.3.1)
250 5. Riemannian Geometry
Subtracting Eq. 5.3.4 from the sum of Eq. 5.3.2 and Eq. 5.3.3 yields
1
(DzX, Y) = 2 «(X, [Y, Zl) + (Y, [Z, Xl) - (Z, [X, Yl))
1
+ 2 (Z (X, Y) - Y (Z,X) +X (Y, Z)). (5.3.5)
Since the right side does not involve D, we get uniqueness. One can also
check that D can be defined using this formula.
o
We shall indicate the relation of our formulation with the
local study. Let (U, x) be a local coordinate system such that
g'j(p) = gp(8~, Ip' Ip)'
8~j We write 8, for 8~, at times.
Let (gi j (p») be the inverse of (gij (p». (It exists since (gij (p» is
positive definite and hence non-singular.) g'j are smooth functions on
U.
Now if D is a map from X(U) x X(U) -+ X(U) satisfying the condi-
tions (i) and (ii), then, D8,8j = L:;;'=l rfj 8 k for uniquely defined smooth
funct.ions r~ Knowing rfj is equivalent to knowing the connection D.
For, If X = L fi8, and Y = L: hj 8j , then
Dx Y = L:D/;8i(L:hj 8j )
i j
= L:f,(L: D 8,(hj 8j ))
, j
= L:h{L:8,(hk)8k + L:hj
,k ~k
r: j 8k }
= L: {X(hk) + L:
k '.j
r:
j f,h j 8k}.
Hence we can compute DxY, once we know rt. rfj are called the
Christoffel symbols. We derive the classical expressions for these symbols
5.3. The Levi-Civita connection 251
8gij + 89
8
jk _ 8gki =
8
2~rl . . '= 2~rl.
LJ k,gl}· LJ k,· (5.3.6)
8 Xk Xi Xj I I
By changing 1 -+ k, k -+ i, i -+ j, j -+ l,
Surface of revolution
We now. take up the example of a surface of revolution and show how the
proof of Theorem 5.3.4 can be carried out to compute the Levi-Civita
connection of the surface with respect to the induced metric.
We consider a smooth curve cr: (a, b) -+ IR2, the (x,z)-plane in IRs
given by cr( u) = (x(u), z( u)). We assume that the curve lies in the open
right half (x, z) plane. We also assume that cr is parameterized by the
arc length so that the tangent vector crt (u) is of unit norm: II crt (u) II = 1.
If x( u) = 0 for some u in the domain we demand that z' (u) = 0 at that
point. The surface S of revolution is got by revolving the profile curve
cr about the z-axis. It has local parameterization given by
X: (u,v) ~ (x(u) cosv,x(u) sin v, z(u)) 0:5 v < 211".
The curves u = constant are called the parallels and those with v =
constant are called the meridians. The coordinate vector fields are given
252 5. Riemannian Geometry
by:
8) 8 . 8 8
Xu :=X. ( 8u =x'(u)cosv 8x +x'(u)smv 8y +z'(u)8z
8) . 8 8 8
Xv := X. ( 8v = -x(u) sm v 8x + x(u) cos v 8y + 0 8z'
1
(Da 980,80 ) = 280 (80,80 ) = 280(r
1 2
)= o. (5.3.8)
(Da980,8u) = 80 (80, 8u ) - (80, Da98u)
= - (80 , Dau 80)
1
= -2 8u (80,80)
= -~8u(r2) = -r'(u)r(u) . (5.3.9)
From Equations (5.3.8) and (5.3.9) it follows that Da980 has only 8 u -
component and
(5.3.10)
Also,
(5.3.13)
We also have
= - \ 8u , ~(~} 8e ) = 0, (5.3.14)
1
(Dau8u,8u) = 2"8u (8u,8u) = O. (5.3.15)
Thus we have
Dau8u = 0 (5.3.16)
Da88e = -r'(u)r(u)8u (5.3.17)
r'(u)
Da8 8u = r(u) 8e = Dau 8e. (5.3.18)
( 8 ) ax a 8y 8 az 8
Xu = ru = D<p(u, v) 8) (u,tJ) = 8u 8x + 8u 8y + 8u 8z'
254 5. Riemannian Geometry
That is,
(5.4.1)
and hence
(5.4.2)
Notice that Dxpo E TpS . For, the fact that (0,0) = 1 implies that
Xp (0,0) = 0 and hence 2(Dxpo, 0) = O. Thus Dxpo ..L o. Hence the
claim.
We define A: TpS -+ TpS by Av = -D"o. Thus we can write
(5.4.3)
Equation (5.4.3) is called the Gauss equation and the map A : TpS -+
TpS is called the Weingarten map. Since Equation (5.4.3) is a recast
5.4. Gauss theory of surfaces in IR3 255
= (Dxii, Y) Ip - (X,Dyii)
= X ( ii, Y) - (ii, D x Y) - Y( X, ii) + ( DyX, ii)
= (ii,DyX - DxY)
= (ii, [Y,Xl) = O.
Hence (v, Aw) = (Av, w).
Now the symmetric map A on TpS gives rise to a symmetric bilinear
form B : TpS x TpS -+ IR as follows:
B(v, w) := (Av, w) ,
(,) the induced inner product on TpS. B is called the second fundamental
form and is denoted by II. We now obtain the local expression for II
with respect to the parameterization. We use the notations introduced
above.
= - (Druo,ru)
(5.4.5)
= - \~: ,ru)
= (0, ruu) := L.
Equations (5.4.5) are obtained as follows : Since (0, ru) = 0, we have
tu(0, ru) = 0, which in turn implies (~~, ru) + (0, ruu) = O. Similarly
and
B(rv, rv) = (0, rvv) := N.
L, M and N are classical notations. We then write
II := Ldu 2 + 2M dudv + N dv 2 •
The above computation furnishes a second proof for the self adjointness
of A.
o
5.4. Gauss theory of surfaces in lR3 257
then
LN-M2
K= EG-F2'
Exercise 5.4.3 Show that in the above notation the Riemannian vol-
ume element of S is given by JEG - F 2 dx /\ dy.
Remark 5.4.4 Most often the parameterizations given below cover only
the good portion of the surface. We shall not comment, in general, upon
the rest. However the continuity of K, etc., will imply that the equation
for K will continue to hold for the whole of S.
Therefore
LN-M2
K= EG-F2
bcosu(a + bcosu)
= b2(a + bcosu)2
cosu
- b(a + bcosu)·
ruu == (-vcosu,-vsinu,O)
ruv = (-sinu,cosu,O)
rvv = (0,0,0)
-b 2
K = (b 2 + v2 ) .
Example 5.4.7 (Sphere) (u,v) f-t (cosu cos v, cosusinv, sinu). Com-
pute I, II and K and so on.
(ii) using the parameterization formula etc. Do the same for the sphere
too.
Compute its volume form with respect to the induced Riemannian met-
ric. (Hint: See Example 3.2.11.)
81 = {p E 1R3 : z(p) = O}
and
5.4. Gauss theory of surfaces in IR3 261
(x,y,O) H (cosx,sinx,y).
and st = Sand Si= the image of st under cp that is, the cylinder
with a line removed (parallel to the z-axis), then Sl is an imbedding of
S into IR3 in two different ways. However the first fundamental forms at
the corresponding points are the same and hence by Gauss theorem we
should expect Kl = K 2, etc.
cp(u,v) = (ucosv,usinv,v)
with °< 4> < 271' and 0 E IR, and the local isometry given by
v H 4>, u H sinhO.
The reader should calculate I, II, K, etc., and verify Theorem 5.4.12 in
this case. See also Figure 5.4.1 on page 263.
To prove Theorem 5.4.12, let us go back to the question posed at the
end of Section 5.1. The answer is that we would like to investigate
Then we have
Therefore
(R(X, Y)Y, X) = K. (5.4.8)
Thus the curvature K is computed as (R(X, Y)Y, X) which depended
only on D, the Levi-Civita connection and the Riemannian metric (1)
of 8. Since the connection D is determined by the metric, the theorem
is completely proved.
o
5.4. Gauss theory of surfaces in ]R3 263
We first show that (R(X, Y)Z)(P) depends only on X p, Yp, and Zp-
This means that R(X, Y)Z is a tensor:
R(fX,hY)(uZ) = fhuR(X,Y)Z
DfXDyZ = fDxDyZ
DyDfX Z = Dy(fDx Z )
= Y(f)DxZ + fDyDxZ.
Since (fX,Y] = f[X,Y]- Y(f)X, we have
R(fX,Y)Z = fR(X,Y)Z.
Similarly
R(X, fY)Z = f R(X, Y)Z = R(X, Y)(f Z).
5.5. Curvature and parallel transport 265
Proof Let us assume that such an operation exists and derive its local
expression. Let '{J: U --t]Rn be a local chart. Assume that c(I) c U. Let
c(t) := (Cl(t), ... , cn(t)) where Ci(t) = Xi(C(t)). Let Oi := 8~i' Given
a vector field along C we can write it as X = L JiOi where Ji(t) =
X(Oi(C(t))). By properties 1) and 2), we have
D "dfi D
dt X = L....J di Oi + Ji dt Oi. (5.5.1)
By 3) we have
D "d(xj 0 c)
dt Oi = Dc'(t)oi = DE d("JtO C) 8; Oi = L....J dt D 8 ;Oi'
Using this in Eq. 5.5.1, we get
DX
dt
=" dJi a. "f.d(Xi oc) D
L....J dt + L....J
l dt l
a.
8;.·
(5.5.2)
i,j
Thus if ¥t
exists, it is unique and its local expression is as in Eq. 5.5.2.
To prove the existence, we define ¥tx
on U by Eq. 5.5.2. It is easily
verified that this definition satisfies the properties 1)-3). The uniqueness
assertion above implies that ¥t
is well defined.
Note: Most often in the sequel we shall use this theorem without any
further comment. We also use the sloppy notation Dc'X, X' or X t for
¥tX for any vector field along c, if there is no cause of confusion.
Remark 5.5.2 The above theorem may be best understood if one thinks
in terms of induced bundles and induced connection. What goes on over
there is to pull back the tangent bundle on M to the domain of c via c.
Then ¥tis the induced connection on this bundle. See also Section 2.7.1.
D
dt X = "fI(t)oO
L....J X·
IP .
i l
Note that in this case, the pull-back bundle is the trivial bundle (a, b) x
Tp(M}.
5.5. Curvature and parallel transport 267
Example 5.5.5 Take 'Y(t) = (t, 0, ... ,{)) in M = an and D the usual
connection. X = E Ii 8~i' Then
· l' ( It
D ,,('(t) X = O Imp a
les OUI"'"
aIn)
OUI
0
= .
That is, Ii's are constant along the ul-axis. Clearly X = E Ci 8~i on
ul-axis, with Ci E R That is, X is parallel in the usual sense. More
generally, if 'Y is any curve and X is parallel along 'Y, then X ('Y( s)) is
parallel to X("((t)) in the usual sense for all s, t in domain of 'Y.
On the other hand Y = sin( 7r /2 + t) + Ei>2 .
- 8~ is not parallel on 'Y.
Example 5.5.6 Let 'Y(t) := (cos(t), sin(t), 0), tEa, be a curve on the
unit sphere S2 C a 3 endowed with the induced Riemannian metric. Let
X(t) := e3 for all t. Then X is parallel on 'Y.
dlk
= d
t
+"
L...JIj 1'i r ij
..
I k on l' for all k.
',]
These are the differential equations for the components Ii of a vector
field parallel along 1'.
So from the global existence and uniqueness of solution of linear
systems of ODE, we have a unique system of solutions (ft, 12,···, In)
as soon as we impose an initial condition, say li(t o) = Vi for 1 ~ i ~ m.
This means that a vector field X parallel along l' is uniquely determined
once we know its value at a point of the curve 1'. In other words, given
v E Ty(to)M, there exists a unique vector field X such that X is parallel
along l' and X('Y(to)) = v.
We therefore get a map T-y(to) : T-y(to)M --+ T-y(t)M as follows: for
all v E T-y(to)M, let X be the unique parallel vector field on l' such
that X('Y(to)) = v, then T-y(to)(v) = X('Y(t)). T-y(to) is called the parallel
transport from 1'(to) to 1'(t) along l' or with respect to 'Y. It has the
following property: for any Riemannian connection D, T-y(to) is a linear
isometry, as we show below.
It is linear: Let 1'(to) = P and 1'(t) = q. If v, w E TpM, with
corresponding parallel fields X, Y along 1', then X + Y will be parallel
along l' with (X + Y)(p) = v + w etc. Hence it is linear.
It is injective: For, v = 0 is the vector corresponding to X == 0 by
the uniqueness of ODE.
It is an isometry: For, if X, Yare as above, we have along 1', since
D is Riemannian,
d
dt (X, Y) = (D-y/(t)X, Y) + (X, D-y/(t)Y) = O. (5.5.3)
Then by the defining ODE for parallel vector fields along, (note that
XHt), as, is the integral curve for X) we have
,~(t) =
°
Theorem 5.5.10 Let M, D be as in Theorem 5.5.9. Fix p E M. Let
Xp =f. 0, Yp =f. and Zp E TpM. Let Xi be coordinates around p such
that Xi(p) = 0, Ip
8~1 = Xp, 8~21p = Yp. (This is always possible.) Let
"Ie denote the boundary of the square
T03 T32 T21 TlOZO - Zo = (T03 T32 T21 TlOZO - T03 T32 T21 ZI)
+ (T03 T32 T21 ZI - T03 T32 Z2)
+ (T03 T32 Z2 - T03 Z3) + (T03Z3 - Zo)
- TO T3 T2 TI0 Z 0 - TO321Z)
_(321 T3 T2 1
+ (TJ T; Ti ZI - TJ T; Z2)
+ TJ T;(Z2 - Ti Z3) + TJ(Z3 -
Tf ZO)
= T03T32T21 (-E(DyZh + ~ 2(D~ h)
We now group first and fifth term together and apply Theorem 5.5.9
above then second and sixth and apply Theorem 5.5.9, etc: We then get
1. Rxy = -Ryx
2. (Rxyv, w) = - (Rxyw, v)
4. (Rxyv, w) = (Rvwx, y)
Proof Exercise.
o
The above theorem lists the important symmetry properties enjoyed
by the curvature operator on a Riemannian manifold with Levi-Civita
connection.
Deiej = L fi~ek.
k=l
These forms wf
are called the connection forms of the connection
D. Notice that if we take ei = Xi, the coordinate vector fields, then
wf(ej) = r~, the Christoffel symbols. If we denote by wi the I-forms
dual to ei, then we refer to wi as a dual frame. We recall that for any
I-form w, the exterior derivative is given by (see (3.4.4) page 196)
dw(X, Y) = X(w(Y)) - Y(w(X)) - w([X, Y]).
We fix a local frame {ei} and its dual frame {wi}. We wish to compute
dw i . We have (using the symmetry of the connection)
r s
From Equations (5.6.2) and (5.6.3) we get the first structural equation
of Cartan:
dw i = - A j. L w; w (5.6.4)
j
We now derive the second equation of Cartan, which deals with the
relation between the exterior derivative of the connection form and the
curvature. We compute R(X, Y)ei:
R(X,Y)ei
= (DxDy - DyDx - D[X,Yj}ei
= Dx(LwHY)er ) -Dy(LwHX)e r ) - LwH[X,Yj)er
r r r
r r r
r r
+ LwHY)(Lw:(X)e a) - LwHX)(Lw:(Y)e a )
r $ r 8
a r
We set
~j kl == Rtkl := {R(ek' el)ei, ej}. (5.6.6)
If we define two-forms
then the curvature operator R(X, Y) has matrix entries n{ (X, Y) with
respect to the basis {ed. The 2-forms n{ are called the curvature forms
of the connection D. We have thus derived the second equation of Car-
tan:
n{ = dw{ + LW~ A wf. (5.6.8)
k
5.6. Cartan structural equations 275
Dxe; := I:wf(X)ej.
j
R(ek,el)e;:= I: Rfklej.
j
= dK 1\ wi 1\ wj + Kdw i 1\ wj - Kw i 1\ dw j
- L dwt 1\ w; +L wt 1\ dw;
k k
= dK 1\ wi 1\ wj +K L w; 1\ w k 1\ wj
k
+L wt 1\ (0,: - L wf 1\ w!)
k I
= dK 1\ wi I\w j .
278 5. Riemannian Geometry
Thus we get
,",(j au aU) _
L...J wi - ax _dXi + ax- dXj 1\ dXj - O.
j J l
Hence we have
-1 (u')'
- + (u')2_
- -0. (5.7.1)
r r r
dx~
1f(L: x n'
d 2 "'"
s = L... 1 +
by setting
¢(v) = (Xl(7r(V)), ... ,xm(7r(v)), V(Xl), ... , v(xm)) E 1R2m.
Then ¢ is one-one on 7r-l(U) onto </>(U) x IRm, an open subset of 1R2m.
We define local coordinates ~j on D by
for 1 :::; j :::; 2m
where Uj stand for the usual coordinates on 1R2m. That is,
We now show that this collection {(D, ¢)}, as U varies over all charts
of the atlas for M, forms a smooth atlas for T M. The only thing to be
checked is that the overlaps are smooth.
Notice that (D, ¢) and (V, 7;;) overlap if and only if the correspond-
ing charts (U, </» and (V,7/1) of the atlas overlap. Let Xi and Yi be the
corresponding local coordinates on U and V respectively. Then given
(a, b) E ;f(U n V) x IRm we see that
~-1
7/1
~
(a, b) = ~bia.
a I.p-l(a)"
;=1 y,
282 A. Tangent Bundles and Vector Bundles
That is, these are the components of the Jacobian of the map ¢ 0 ;j;-l.
Thus these maps are also smooth.
o
This smooth manifold T M is called the tangent bundle of the given
manifold M.
(c) There is a fixed r E N (called the rank of E) such that for each
p E M, 7r- 1 (p) is an r-dimensional vector space over R 7r- 1 (p) is
called the fiber over p and is usually denoted by Ep.
such that for any fixed q E U, the map r 1-+ cp(q, v) is a linear
isomorphism of]Rr onto the fiber E q •
satisfying condition (d). Now if U", n Uf3 f:. 0, then we have the diffeo-
morphisms:
3. If Uo. n U{3 n U·Y =1= 0, then g,,(o. = g"({3 0 g{3o. on the open set Uo. n
U{3 n U"(.
Partitions of Unity
2. gi ~ 0 on M for all i E I.
3. For all p EM, I:i gi(p) = 1. Note that by (1) this sum is finite.
G i is compact, G i ~ GHl, M = U Gi •
iEN
Notice that the sum in the denominator is > 0 and is finite. Then
{gi : i E J} is a partition of unity subordinate to the given atlas A.
D
Bibliography
A g ,80 Dvf(x),21
Altk,173 ~, 266
C k ,48 r(tp),188
df,184 rfj,251
Df(x),6 GL(E),48
a~iL,81 H(p),257
291
La, 80 sn,66
Lie(G),117 SU(n),80
L k (E,F),48 su(n),112
M(n,lR),7 Tp(n),81
\1 f(x), 40 T;M,95
nf,274 U(n), 80
O(n),69 I\V,174
R a ,80 XU),109
292
INDEX 293
Vector
normal,40
tangent, 9
Vector field, 106
ip-related, 114
along a curve, 265
complete, 128
divergence, 229
exponential map of a, 132
flow of a, 128
left invariant, 114
parallel along a curve, 267
Vector fields
right invariant, 115
Volume
oriented, 43, 173
Volume form
of a Riemannian manifold,
222