Professional Documents
Culture Documents
ADVANCED MATHEMATICS 64
EDITORIAL BOARD
D.J.H. GARLING, W. FULTON, K. RIBET, T. TOM DIECK,
P. WALTERS
CALCULUS OF VARIATIONS
Already published
1 W . M . L . Holcombe Algebraic automata theory
2 K . Petersen Ergodic theory
3 P.T. Johnstone Stone spaces
4 W,H. Schikhof Ultrametric calculus
5 J.-P. Kahane Some random series of functions, 2nd edition
6 H. Cohn Introduction to the construction of class fields
7 J . Lambek & P.J. Scott Introduction to higher-order categorical logic
8 H. Matsumura Commutative ring theory
9 C . B . Thomas Characteristic classes and the cohomology of finite groups
10 M. Aschbaeher Finite group theory
11 J . L . Alperin Local representation theory
12 P. Koosis The logarithmic integral I
13 A. Pietsch Eigenvalues and S-numbers
14 S.J. Patterson An introduction to the theory of the Riemann
zeta-function
15 H.J. Baues Algebraic homotopy
16 V.S. Varadarajan Introduction to harmonic analysis on semisimple
Lie groups
17 W. Dicks & M. Dunwoody Groups acting on graphs
18 L . J . Corwin & F.P. Greenleaf Representations of nilpotent Lie groups
and their applications
19 R. Pritsch & R. Piccinini Cellular structures in topology
20 H Klingen Introductory lectures on Siegel modular forms
21 P. Koosis The logarithmic integral II
22 M.J. Collins Representations and characters of finite groups
24 H. Kunita Stochastic flows and stochastic differential equations
25 P. Wojtaszczyk Banach spaces for analysts
26 J . E . Gilbert & M.A.M. Murray Clifford algebras and Dirac operators
in harmonic analysis
27 A. Prohlich & M.J. Taylor Algebraic number theory
28 K . Goebel & W . A . Kirk Topics in metric fixed point theory
29 J . F . Humphreys Reflection groups and Coxeter groups
30 D.J. Benson Representations and cohomology I
31 D.J. Benson Representations and cohomology II
32 C . Allday & V . Puppe Cohomological methods in transformation groups
33 C . Soule et al Lectures on Arakelov geometry
34 A. Ambrosetti & G . Prodi A primer of nonlinear analysis
35 J . Palis & F . Takens Hyperbolicity and sensitive chaotic dynamics at
homoclinic bifurcations
36 M. Auslander, I. Reiten & S. Smalo Representation theory of Artin algebras
37 Y . Meyer Wavelets and operators
38 C . Weibel An introduction to homological algebra
39 W. Bruns & J . Herzog Cohen-Macaulay rings
40 V . Snaith Explicit Brauer induction
41 G . Laumon Cohomology of Drinfeld modular varieties I
42 E . B . Davies Spectral theory and differential operators
43 J . Diestel, H. Jarchow & A. Tonge Absolutely summing operators
44 P. Mattila Geometry of sets and measures in Euclidean spaces
45 R. Pinsky Positive harmonic functions and diffusion
46 G . Tenenbaum Introduction to analytic and probabilistic number theory
47 C . Peskine An algebraic introduction to complex projective geometry I
48 Y . Meyer & R. Coifman Wavelets and operators II
49 R. Stanley Enumerative combinatories
50 I. Porteous Clifford algebras and the classical groups
51 M. Audin Spinning tops
52 V . Jurdjevic Geometric control theory
53 H. Voelklein Groups as Galois groups
54 J . Le Potier Lectures on vector bundles
55 D. Bump Automorphic forms
56 G. Laumon Cohomology of Drinfeld modular varieties II
59 P. Taylor Practical foundations of mathematics
60 M. Brodmann & R. Sharp Local cohomology
64 J . Jost & X . Li-Jost Calculus of variations
Calculus of Variations
CAMBRIDGE
U N I V E R S I T Y PRESS
PUBLISHED BY T H E PRESS SYNDICATE OF T H E UNIVERSITY OF CAMBRIDGE
The Pitt Building, Trumpington Street, Cambridge C B 2 1RP, United Kingdom
1 T h e classical theory 3
1.1 The Euler-Lagrange equations. Examples 3
1.2 The idea of the direct methods and some regularity
results 10
1.3 The second variation. Jacobi fields 18
1.4 Free boundary conditions 24
1.5 Symmetries and the theorem of E. Noether 26
4 T h e theory of H a m i l t o n and J a c o b i 79
4.1 The canonical equations 79
4.2 The Hamilton-Jacobi equation 81
4.3 Geodesies 87
4.4 Fields of extremals 89
4.5 Hilbert's invariant integral and Jacobi's theorem 92
4.6 Canonical transformations 95
vii
viii Contents
5 D y n a m i c optimization 104
5.1 Discrete control problems 104
5.2 Continuous control problems 106
5.3 The Pontryagin maximum principle 109
2 B a n a c h spaces 125
2.1 Definition and basic properties of Banach and Hilbert
spaces 125
2.2 Dual spaces and weak convergence 132
2.3 Linear operators between Banach spaces 144
2.4 Calculus i n Banach spaces 150
p
3 L and Sobolev spaces 159
p
3.1 L spaces 159
3.2 Approximation of LP functions by smooth functions
(mollification) 166
3.3 Sobolev spaces 171
3.4 Rellich's theorem and the Poincare and Sobolev
inequalities 175
6 T-convergence 225
6.1 The definition of T-convergence 225
Contents ix
Index 319
Preface and summary
x
Preface and summary xi
The authors thank Felicia Bernatzki, Ralf Muno, Xiao-Wei Peng, Mar-
ianna Rolf, and Wilderich Tuschmann for their help in proofreading and
checking the contents and various corrections, and Michael Knebel and
Micaela Krieger for their competent typing.
The present authors owe much of their education in the calculus of
variations to their teacher, Stefan Hildebrandt. In particular, the pre
sentation of the material of Chapters 1 and 4 in Part I is influenced
by his lectures that the authors attended as students. For example, the
regularity arguments in Section 1.2 are taken directly from his lectures.
For these reasons, and for his generous support of the authors over many
years, and for his profound contributions to the subject, in particular to
geometric variational problems, the authors dedicate this book to him.
Remarks on notation
d
A dot always denotes the Euclidean scalar product in R , i.e. if
d d d
x = (x\...,x ) ,y = (y\...,y )eR ,
then
d
x% % x% %
x -y y y (Einstein summation convention) ,
2=1
and
2
|x| = X X.
xv
xvi Remarks on notation
Co(ft)
denotes the space of functions of class C on ft that vanish identically
outside some compact subset K C ft (where K may depend on the
function, of course). Occasionally, we also use the notation
fc
<? (fi)
0
k
for C functions on ft that again vanish outside some compact subset
K Cft.
Finally, we use the notation
1.1 T h e E u l e r - L a g r a n g e equations. E x a m p l e s
The classical calculus of variations consists in minimizing expressions of
the form
d d
where F : [a, 6] x R x R > E is given. One seeks a function u : [a, 6]
d
R minimizing J. More generally, one is also interested in other critical
points of J. Usually, u has to satisfy some constraints, the most common
one being a Dirichlet boundary condition
u(a) = u\
u(b) = 1*2-
mm.
Of course, one knows and easily proves that the solution is the
straight line between u(a) and u(b), i.e. satisfies u(t) = 0.
3
4 The classical theory
I[u)=
L i-^w d t
Df(z )
0 - 0,
(at least for a local minimizer) the sufficient condition that i t is positive
definite.
In the present case, however, we do not have a function / of finitely
many independent real variables, but a functional Z o n a class of func
tions. Nevertheless, we expect that a first derivative of J something
still to be defined needs to vanish at a minimizer, and moreover that
a suitably defined second derivative is positive (semi)definite.
1.1 The Euler-Lagrange equations. Examples 5
Now
= J {F (t,u{t),u{t))-r)(t)
u + F (t,u{t),u{t))-T](t)}dt,
p
Ja
where F is the vector of partial derivatives of F w.r.t. the components
u
We now keep 77 fixed and let s vary. We are thus just in the situation of
a real valued f(s), s G R, (f(s) = I(u + srj)), and the condition / ' ( 0 ) = 0
translates into
0= / (1.1.2)
/ a
and this actually then has to hold for all rj CQ. We now assume that F
2
and u are even of class C . Equation (1.1.2) may then be integrated by
parts. Noting that we do not get a boundary term since 77(a) = 0 = 77(6),
we thus obtain
b
0 = ^ |(F(t, (t),u(<))-|(F (i u p ) U ( < ) ^ ) ) ) ) !,() J d
) (1.1.3)
d
for all 7] Co ([a, 6 ] , R ) . I n order to proceed, we need the so-called
fundamental lemma of the calculus of variations:
M*o) ^ 0.
l
Thus, h (to) i=- 0 for some index io { 1 , . . . , d}. Since / i is continuous,
there exists some <5 > 0 w i t h
a<t -6<t
0 0 + 6<b
and
io i o
\h (t)\ > ^ |/i (t )| 0 whenever | t - t\ < 6.
0
d
We then choose <p G C ((a, 6), R ) with
<p(t) = 0 if \t -t\>6
0
io
<p (t)>0 if |t -t|<(5
0
/ h(t)(p(t)dt= / h(t)(p(t)dt ^ 0,
./a J to 6
r
contradicting our assumption. Thus, necessarily /i(o) = 0 f all
(a, 6).
g.e.d.
2
Lemma 1.1.1 and (1.1.3) imply that a minimizer of I of class C has
to satisfy the so-called Euler-Lagrange equations, namely:
2 d d 2 D
T h e o r e m 1.1.1. Let F G C ( [ a , 6 ] x R x R , R ) , and letu G C ( [ a , 6 ] , R )
be a minimizer of
I(u) = J F(t,u{t),u(t))dt
Ja
among all functions with prescribed boundary values u(a) andu(b). Then
u is a solution of the following system of second order ordinary differ
ential equations, the Euler-Lagrange equations
(F (t,u(t),u(t)))
p - F (t,u(t),u(t))
u = 0. (1.1.4)
1.1 The Euler-Lagrange equations. Examples 7
F (t,
pp u(t),u(t))u(t) + F (t, pu u(t), u(t))u(t)
+ F (t,u(t),u(t))
pt - F (t,u(t),u(t))
u = 0, (1.1.5)
(1) Here F = 0, F =
u p / t ^ a > and we get
i/i+u(t)
d u(
u(t) u(t) 2
u(t) il(t)
0 =
3 '
i.e.
u(t) = 0
o= | + 2 ^ + ^
2 8
d*7(t,(*))>/l + ( * ) T
2
u{t) ii{t) u(t) 7 t
hence
2 2
0 = u(t) - ^ u ( t ) (1 + 7i(t) ) + ^ (1 + < i ( t ) ) . (1.1.6)
0 = fi(t) + (l+(*))
The classical theory
j (Ft - uF ) = u(F
p u - jF ) p = 0 by (1.1.4),
3 { ) 2
f jgftW ) Ls ^' )
for i = 1 , . . . , d.
d
We now recall that any smooth curve g(t) C R may be parameterized
by arc-length, i.e.
= 1. (1.1.9)
0 for i= l d
d
so that we see again that a length minimizing curve in E is a straight
line.
1.1 The Euler-Lagrange equations. Examples
I(u) = / F(t,u{t),u(t))dt
Ja
j t (F (t,u(t),ii(t))
p + \G (t,u(t),ii(t)))
p
- (F (t,u(t),u(t))
u + \G (t,u(t),u(t)))
u = 0. (1.1.12)
2
J( ) = / <i(*) d*
M
./a
2
5(ti)= / u(t) dt=l, (1.1.13)
with
I(u) = J F(t,u(t),ii(t))dt
Ja
10 The classical theory
d
for given F and unknown u : [a, 6] R . I f F and u are differentiate,
one may consider some kind of partial derivative, namely
d
for rj E Co([a,6],R ). For a minimizer u then
-F (t,u(t),ii(t))
p - F*(t,u(t),u(t)) = 0.
has been rather vague, because we did not specify in which class of
functions u we are trying to minimize / . The only things we did prescribe
were boundary conditions of Dirichlet type, i.e. we prescribed the values
d
u(a) and u(b) for our functions u : [a, 6] * R .
Because of our derivation of the Euler-Lagrange equations in the pre
2
ceding section, i t would be desirable to have a solution u of class C .
So one might want to specify in advance that one minimizes / only
2
among functions of class C . This, however, directly leads to the ques
2
tion whether / achieves its infimum among functions of class C (with
prescribed Dirichlet boundary conditions, as always) or not, and if i t
1
does, whether the infimum of / in some larger class of functions, say C ,
2
could be strictly smaller than the one in C . I n the light of this question,
it might be preferable to minimize / in the class of all functions u for
which
1.2 Direct methods, regularity results 11
u{t )-u(h)
2 = / ii(t)dt.
Jti
Note that F(t, u(t),u(t)) is a measurable function of t for u AC by our
assumptions on F and the fact that the composition of a measurable and
a continuous function is measurablef. The idea of the direct methods in
the calculus of variations, as opposed to the classical methods described
in the preceding section then consists i n minimizing / in a class of func
tions like AC([a,b]) and then trying to show that a solution u because
of its minimizing character actually enjoys better regularity properties,
2
for example to be of class C , provided F satisfies suitable assumptions.
This minimizing procedure will be treated later J, since we want to
return to the classical theory for a while. Nevertheless, even for the
classical theory, one occasionally needs certain regularity results, and
therefore we now briefly address the regularity theory. To simplify our
notation, we put / : = [a, 6]. A class of functions intermediate between
1
C and AC is
1 d d
D ( / , E ) : = {u : / M , u continuous and piecewise
continuously differentiable, i.e. there exist
a = to < t\ < ... < t = b with u G
m
d
C H M j + i ] , M ) for j = 0 , . . . , m - l } .
1
u G D then has left and right derivatives u~(tj) and u+(tj) even at the
points where the derivative is discontinuous, and
Examples
Example 1.2.1. [a, 6] = [ - 1 , 1 ] , d = 1
2 2
I(u) = ( l - ( f k ( t ) ) ) di,
t i ( - l ) = 1 = u(l).
A minimizer is
1
ti(0 = | t | D ( / , R )
1
which is not of class C . The minimizer of / is not unique (exercise:
1
determine all minimizers), but none of them is of class C .
2 2
I(u) = J (l-u(t)) u(t) dt
u(-l) = 0 , ti(l) = l .
2 2
J(u) = ^ (2t~u(t)) u(t) dt,
u(-l) = 0 , ti(l) = l .
6I(u,rj) = J {F (t,u,u)-rj
u + F (t,u,u)-f)}dt
p =0 (1.2.1)
Ja
1.2 Direct methods, regularity results 13
d d
for all rj G AC (I,R )0 (i.e. rj G AC(I,R )) and we require that if I =
[a, 6], there exist a < a\ <b\ < b with rj(x) 0 if x is not contained in
d
[ai,&i], as in the definition of C$([a, 6], R )). Then for almost all points
in I
j F (t,
t p u(t), u(t)) = F (t, u(t), u(t)) u (1.2.2)
(note, however, that the derivative on the left hand side cannot be com
l d
puted by the chain rule). If u G C (I,R ), (1.2.2) holds for all t G J,
1 d
and if u G J D ( / , E ) , at those points tj where u(tj) is discontinuous
T]dt = Fudy
jf F rjdt
u = j[ ^ (j[ F d
^ y) ~J a (/ ) V dt
Proof. We put
1
c := / h(t)dt
b a
~ Ja
14 The classical theory
and
<p(t):= J (h(y)~c)dy.
Ja
Then
0 = f \h{y)-c)h{y)dy = f \h(y) - cf dy
Ja Ja
F {t,u(t),u(t))
p = f F {y,u{y),u(y))dy
u +c (1.2.5)
Ja
1 1
(1.2.5) gives (1.2.3). The claims for u e C or D are obvious from the
proof.
q.e.d.
d d 1
T h e o r e m 1.2.2. Let F : I x R x R be of class C , and let F be also p
l or
of class C , and let det (F i j (t, u ( t ) , ^
p p f all t E I
l d
and a solution u G C (I,R ) of
d
6I(u,T])=0 for allr)eCl(I,R ).
2
Then u is of class C .
Proof. We define
d d d
<f>: R x R x R x R R
via
<t)(t,u,p,q) := F (t,u,p) p - q.
<t>(t,u,p,q) = 0
F(to,u ,po)
0 for any to G I . Thus, there exists a neighbourhood U of
(to,uo,qo) such that for each (t,u,q) G /, <t> 0 has a unique solution
d 1
p = <p(t, u, q) and that (p : U E is of class C . Since we already know
a solution of <fi = 0, namely (, u(t), u(t), F (t, u(t),u(t))), the uniqueness p
1 2
Since <p is of class C , so then is ii(t), hence t i G C . Since to e I was
2 d
arbitrary, i i G C ( J , E ) .
Thus
/ F (t,u,
pp Pl + s(p ~ ))ds 2 Pl (p -pi)=0.
2 (1.2.6)
Jo
By our assumption on F , (1.2.6) is invertible, hence p = P i , hence
pp 2
uniqueness.
Using this global uniqueness together w i t h the existence result of
the implicit function theorem, we now see that for any (t,u,q) in a
sufficiently small neighbourhood of ( ,^(b<Zo) (^o I , Ho = u(to), 0
qo = F (t ,^(bPo),
p 0 Po = ^o(^o)), there is a unique solution (p(t,u,q)
of
F (t,u,p)
p -q =0
1
and <p is of class C . Thus, as in the proof of Theorem 1.2.2,
to
1
w then is of class C . Since u is absolutely continuous, by a theorem of
Lebesgue
1
Since v = u almost everywhere, we conclude u = w, hence u G C near
2
to, which was arbitrary in / . Theorem 1.2.2 then gives u G C .
q.e.d.
TF (t,u(t),ii(t))
p - F (t,u(t),u(t))
u = 0 (1.2.7)
or equivalently of
q.e.d.
d d k
T h e o r e m 1.2.4. Let F : I x R x R -+ R be of class C , and let F p
k 1
also be of class C , k G { 2 , 3 , . . . , o o } . Suppose u is of class C and a
d
solution of 6I(u,rj) = 0 for all rj G C o ( / , R ) , and suppose
k+l d 1
Then u G C (I,R ). (The same result holds if we assume that u G C
is a solution of the Euler-Lagrange equations (1.2.8).)
2
Proof. By Theorem 1.2.2, u is of class C , and by Corollary 1.2.1, i t
1.2 Direct methods, regularity results 17
{-F (t,u(t),u(t))
pu - F (t,u(t),u(t))
pt + F (t,u(t),ii(t))}
u
(1.2.10)
3
Let now j < k, and suppose inductively u E C . The right hand side of
3 x 7 1
(1.2.10) then is of class C ~ . Therefore, u is of class C- " , hence u is
j+1
of class C .
q.e.d.
k
are of class C , then a solution u of 6I(u,rj) = 0 for all rj ACo is of
k + 1
class C .
q.e.d.
7(u)= J F(t,u(t),u(t))
is well defined, assuming F(t,u,p) to be continuous in u and p and
measurable in t. However, if one then finds a minimizer u, it might not
be a solution of the Euler-Lagrange equations, because it is not regular
enough. I f the invertibility condition d e t F ^ 0 is satisfied, however, p p
I(u) = / F(t,u(t),u(t))dt,
Ja
i.e.
d
6I(u,T]) = 0 for all 77 G >o(/,M ). (1.3.1)
We recall that
:= ^ / ( u + s77) u = 0 ,
f(s)=I(u + sri).
2 2
m-f(0) = \s f"(0)+o(s ) fors^O.
with
for some e > 0. (Note: I t is not clear that e may be chosen independently
of v.) We define the second variation of / at u in the direction rj e DQ
as
2
d
2
6 I(u,rj) := _ / ( w - f s 7 ? ) u = 0 .
1.3 The second variation. Jacobi fields 19
In order that this variation exists, we require for the rest of the section
2
that F is of class C . We then compute
2
6 I(u, 77) = ^ J" F(t, u(t) + sr](t), ii(t) + 7(<))d*|.
rb
I
J+a 2F i (t,u(t),u(t))r) {t)r {t)
p UJ i )j
We abbreviate (1.3.5) as
fb
2
6 I{u,r])= {F r)r)
pp + 2F r)rj pu + F rjrj} uu dt. (1.3.6)
Ja
<p(t, 77, TT) : = F pipj (t, u(t), u(t))'Ki'K + 2F i j p UJ (t, u(t), u^))-*^
(,u(),^))^%,
and we define the accessory variational problem for J(M) min as
d
Q(rj) : = / cf)(t,r)(t),r)(t))dt -> min among all 77 G Z ^ ( J , R ) .
(1.3.8)
If u satisfies the assumptions of Theorem 1.3.1, then
= ^(*,r?W,i)W), (1.3.10)
20 The classical theory
i.e.
= F (t,
pu u(t), u(t))fj(t) + F (t, uu u(t), u(t))rj(t). (1.3.11)
2 d d
for all t e I , u e C ( J , R ) . Then any solution of rj e AC {I,R ), 0
d 2
6Q(rj,(p) = 0 for all <p ACo(I, R ) is of class C and hence a Jacobi
field.
and so the assumption det F (t, u(t), u(t)) ^ 0, that is seemingly weaker
pp
than the one of Theorem 1.2.3, indeed suffices to apply that Theorem.
q.e.d.
F (t,u(t),u(t))
pp is positive semidefinite for all t G / ,
i.e.
d d
F pipj (t, u(t), u{t))?? > 0 for all = (\ ..., ) e R.
(At points where ii(t) is discontinuous, this holds for the left and right
derivatives.)
{ 0
e
for a < t < 10 e and to 4- e < t < b
for t - to
linear for to e < t < 10 and for to < t < to + e
1.3 The second variation. Jacobi fields 21
d
for given R . Then
{ 0
-
for a < t < t or t + e < t < 6
for t - e < t < t
0
0
0
0
2 + C J 2
0 < 6 I(u, rj) = r F p V (t, ix(t), u(t))CZ dt + 0(e ) for c 0,
Jto-e
since all other terms contain a factor e, and we integrate over an interval
of length 2e. Hence
j j
F i (t ,u{t ),u(t ))Cl;
p pJ 0 0 o - lim - / F {t,u{t),u(t))C^ dt
pipj > 0.
0 Jt -e 0
q.e.d.
The Jacobi equations and the notion of Jacobi fields are meaningful
for arbitrary solutions of the Euler-Lagrange equations, not only for
minimizing ones. I n fact, Jacobi fields are solutions of the linearized
Euler-Lagrange equations. Namely:
3 d d
T h e o r e m 1.3.3. Let F e C {I x R x R , R ) , and let u (t) s be a family
2
of C -solutions of the Euler-Lagrange equations
rj(t) := - ^ ( % s = 0
ds
is a Jacobi field along u = uo.
-F (t,u(t),u(t))r)(t)
pu - F {t,u(t),u(t))r](t)
uu = 0.
l d
in [ a i , a ] , and suppose r\ G C ([a\,a ],R )
2 is a Jacobi field on [ a i , a ] 2 2
/ <p(t,r)(t),r)(t))dt = 0. (1.3.13)
22 The classical theory
2<f>(t, 77, 7T) = (f) (t, 77, 7r)77 -h (f>n(t, 77, 7T)7T.
v
Therefore
3 2
As before, let F be of class C , and let u(t) be a solution of class C
on [a, 6] of the Euler-Lagrange equations
j F {t,
t p u(t), ii(t)) - F (t, u(t), ii(t)) = 0.
u
Definition 1.3.2. Let a < a\ < a < b. We call the parameter value 2
77(01) = 0 = 77(02).
3 d d 2 d
T h e o r e m 1.3.4. LetF e < 7 ( J x R x R , R) and suppose u e C (I,R ).
Suppose that F (t,u(t),u(t))
pp is positive definite on I. If there exists a*
with a < a* < b that is conjugate to a, then u cannot be a local mini
l d
mum of I. More precisely, for any e > 0, there exists v D (I, R ) with
v(a) = u(a), v(b) = u(b),
and
Q(V*)= f* <P(t,v*,v*)dt = o.
Ja
7)*(a*)=0.
Since also 77*(a*) = 0, and since 77* solves the Jacobi equation, a (linear)
second order ordinary differential equation, the uniqueness theorem for
solutions of such equations implies
F (t,u(t),ii(t))
p = F (t,u(t),ii(t))
u
to minimize
Qfa) 2
: = 6 I(U,T ) ] = - ^ I ( U+ ST )
1 1 for r? Do-
24 The classical theory
If, for fixed u, we consider the variational problem Q(rj) * 0, we are led
to the Jacobi equations
= F (t,
up u(t),u(t))r)(t) + F (t,
uu u(t),u(t))ri(t)
for 77.
Solutions rj with 77(a) = 77(6) = 0 are called Jacobi fields, a* G (a, 6) for
which there exists a nontrivial Jacobi field on [a, a*] is called conjugate
to a, and if there exists such a*, u cannot be locally minimizing on [a, 6].
In other words, a solution of the Euler-Lagrange equations cannot be
minimizing beyond the first conjugate point.
M nv = f(U).
n
A n example is the sphere S described in detail in Section 2.1 (Exam
ple 2.1.1). The tangent space T M of M at p then is the vector space
P
n d
D / ( z ) ( E ) . I t can be considered as a subspace of the vector space T E , p
d
the tangent space of E at p.
As in 1.1, we now consider the variational problem
d
(differentiable, embedded) of E , we require that
u(a) G M i , i i ( 6 ) G M . 2
d
from / into M. depending differentiably on s G (e, c), w i t h u(t) = Uo(t)
and
u (a)
s GMi , u (b) G M
s 2 for all s.
Let
Then again
0= ^ / K ) | . _ 0 = F(t,u(t),u(t))dt^ 0
= f {F {t,u(t),u(t))-f,{t)
p + F {t,u(t),u{t))-T}(t)}dt
u
Ja
= f a \ - j F P + ^ } - v + F P ( * . ( * ) , ( * ) ) m l z i
i
= F (t,u(t)M*))-V(t)\ Za>
p
(~, e). Since / is injective, there then has to exist a curve 7(5) C U w i t h
1 ,
u (a) = fo>y(s) for all s. Hence 77(a) = u (a)
s = D/(/- 7i(a)) (0) s u=0 7
F (a,u(a),u(a))
p V = 0 for all V G T M
u{a) u
and likewise
d
of R ), i.e. ^ ^ ( ^ s ) | = 0 for all variations u (t) differentiable in
s = 0 s
d
and T ( 5 ) M 2 , respectively. In particular, if for example Mi = R , then
U
F (a,u(a),u(a))
p = 0.
I(u) = / F(t,u(t),u(t))dt,
Ja
2 d d
with F G C ([a,6] x l x E , E ) . We suppose that there exists a smooth
one-parameter family of differentiable maps
d d
h : R
s -> R
h(s,z) := h (z)s
2 d
is of class C ( ( - e , e ) x E , E ) for some e > 0),
0 0 0
with
d
h (z) 0 = z for all zeR
1.5 Symmetries and the theorem of E. Noether 27
and satisfying
j\(t,h (u(t)),
s f h (u(t)^J
t s dt = j\(t,u(t), j u(t^J
t dt (1.5.1)
2 d
for all s G (~e,e) and all u G C ( [ a , 6], R ).
Then, for any solution u(t) of the Euler-Lagrange equations (1.1.4)
fori,
F (t, u(t),ii(t))
p ~h (u(t))\ s s=0 (1.5.2)
Proof of Theorem 1.5.1: Equation (1.5.1) yields for any t G [a, 6], using 0
h (z) = z,
0
= F t h s k s { u { t ) ) d t s = 0
^ s J a { ' ^ ^ Jt ) ^
h
+F P (t, u(t),u(t)) J f sW))}dt\s=o-
t s
h u
+FP (t,u{t),u(t)) J -ff s( (t))}dt\ =o
t s s (1-5.5)
= f j t (F (t,u(t),u(t))^- h (u(t))\ )
p s 3 s=0 dt.
Therefore
(1.5.6)
for any to G [a, 6]. This means that (1.5.2) is constant on [a, 6].
q.e.d.
28 The classical theory
Examples
3 n
Example 1.5.1. We consider for u : E > E , u = (ui,...,u )
n with
2
F(t,u(t),u(t)) = pmj-^f-- ^II^H = E ^ i j,
3
i.e. a mechanical system in E w i t h point masses m*, and a potential
V(u) that is independent of the third coordinates of the Ui. Then
h (z)
a = z + se , 3
3
where e is the t h i r d unit vector i n M , leaves F invariant i n the sense
3
we conclude that
1=1
i.e. the third component of the momentum vector of the system is con
served.
axis,
n
m u e
^ F ez A Ui = ] P ( i i)
v ' ( 3 A Ui) = ] P (ui A ra^) e . 3
i=l i i
(s G ( - e o , e ) as before) with
0
d
h (t, z) = (t, z)
0 for all (t, z) G [a, b] x E
and satisfying
b
rh (b)
3 ( d \ r
/ F(t ,h (u(t )),h (u(t )))dt =
s s s s s s / F(t,u(t),u(t))dt
(1.5.7)
2 d
fort = h (t), all s G ( - e , e ) and a// it G C ( [ a , 6 ] , E ) . Then, for any
s s 0 0
F {t,u(t),u(t))
p h {u{t))\
s s=0
'ds'
f c
+ ( F ( t , t i ( t ) , t i ( t ) ) - F (*,ti(*),A(*))wW) ^ 2 W I - = o p (1-5.8)
F(t(T),u(t(T)),^,^u(t(r))
:=F t M t ) , * ^ ) Z (1.5-9)
dr
\ dr
Then
I(t,u) := j H F ( i ( r ) , u ( i ( r ) ) , | : ( * ( r ) ) ) dr
= J F(t,u(t),u{t))dt, i f i ( r ) = a, f ( n ) = 6
0 (1.5.10)
= /()
F {t,u(t)Mt))~h (u(t))\ ^
p s s=Q
30 The classical theory
Fp
Fpi
Fo
p = F - Fu
p
h (t,z)
s = (t + s,z)
Exercises
d
1.1 For mappings u : [a, 6] > E , consider
2
E(u) : = i f\u{t)\ dt
d l d
(| | is the Euclidean norm of E , i.e. for z (z ,..., z ),
1 2
| | 2 _ J2i i(z ) ).
z = Compute the Euler-Lagrange equations and
the second variation. Also, let
L(u) := I \ii{t)\dt.
Ja
Show that
L(u) < yj2(b-a)E(u),
Exercises 31
w i t h u(~l) = 0 = u{l).
1.3 Develop a theory of Jacobi fields for variational problems with
free boundary conditions. I n particular, you should obtain an
analogue of Jacobi's theorem.
d
1.4 For mappings u : [a, 6] E , consider
c(t) := Jt{t).
a y
L(c):=\c(t)\dt = ^ ( n j dt, (2.1.1)
1 d
where ( c , . . . , c ) are the coordinates of c = c(t). We also define the
energy of c as
2 2
E(c) := ^ \c(t)\ d t = \ Y . (<H (2.1-2)
We let now
7 ( t ) C C/
32
2.1 The length and energy of curves 33
with
c(t) = f( (t)). 1 (2.1.3)
Since the derivative Df(z) has maximal rank everywhere (by definition
of a chart, cf. 1.4), 7 is absolutely continuous, since c is, and we have
the chain rule
c(t) = (Df) ( ( ) ) o 7 ( t ) , 7
or
< , 2
L(c) = (^(7W)7 ()^(7(*))7 '(*)) dt
and
T a a
1 f df df
j h
L{c)= I * (9iMt))fm (t)) dt (2.1.5)
Jo
Definition 2.1.1.
QfOt QfOt
z f o r a 1 1
9ij(*) =9ji( ) hi
and positive definite, i.e.
1 n
9ij{z)rfrf > 0 whenever 77 = ( 7 7 , . . . , rf) ^ 0 G E .
34 Geodesic curves
Remark 2.1.1. The use of local charts for M seerns to have the obvious
disadvantage that the expressions for length and energy of curves be
come more complicated. The advantage of this approach, namely not to
d
consider curves on M as curves in R satisfying a constraint, is that this
constraint now is automatically fulfilled. A l l curves represented in local
charts lie on M . This more than compensates for the complication in
the formulae for L and E.
d
for any curve c : [0, T] -> R .
Secondly, L is parameterization invariant in the sense that whenever
T:[0,S]->[0,r]
l
is a diffeornorphisrn (i.e. r is bijective, and both r and its inverse r~
are everywhere differentiable), then
L(c) = L ( c o r ) ,
Namely
L(cor)
2.1 The length and energy of curves 35
2
L(c) = J l<\c(t)\dt<U dt\ [J \c(t)\ dt) =VWy/E(cj,
(2.1.10)
w i t h equality iff
We have shown:
d
L e m m a 2.1.1. For every c e A C ( [ 0 , T ] , E )
L(c) < V^>/E(cj,
If
\c(t)\ = constant almost everywhere ,
r:[0,L(c)]^[0,T]
with
T:[0,L(C)}^[0,L(C)}
36 Geodesic curves
(i.e. we keep the interval of definition fixed, namely [0, L(c)]), the par
ameterization by arc-length leads to the smallest energy. Namely, if
d
c : [0, L(c)] E is parameterized by arc-length
f:U-+V
C ([0,T])c/(/).
f == r 1
of-.r1
(/([/) n / ( # ) ) ^ r 1
(/([/) n /"(#))
/ o ( t ) = c(t) = / o 7 ( t ) ,
7
and from
fob)) = f(z)
2.1 The length and energy of curves 37
Figure 2.1.
we get
j
9iJ (7(0) (t)i (t) = hi m)) t W (*), (2-1.17)
and this shows again the equivalence of (2.1.5) and (2.1.1), and likewise
for the corresponding expressions of the energy. The important transfor
mation formula (2.1.16) shows how the metric tensor transforms under
coordinate transformations. This invariance property of L and E makes
it possible to express the length and energy of an arbitrary curve c on
M that is not necessarily contained in the image of a single chart as
follows:
One finds a subdivision
c([^_i,^])
U :U - v V v
l
m f x
= E / (^(7,W)7iW7iW)'*
tiy 1
T h
L(c) = f ( ^(t))f(t)jHt))
9ij dt, (2.1.18)
Jo
We now assume that the charts for M are twice differentiable and return
to the question of finding shortest curves on M , for example between two
given points. By Corollary. 2.1.1, it is preferable to minimize E instead
of L , because a minimizer for E contains more information than one
for L ; namely, minimizers for E are precisely those minimizers for L
that are parameterized proportionally to arc-length. Thus, minimizing
E not only selects shortest curves but also convenient parameterizations
of such curves.
We now compute the Euler-Lagrange equations for E as given by
(2.1.19):
d
0= E^i for i = 1 , . . . , m
J
^ 0 = j t (2 (7(W (t))
9ij - ( ^ ) (7(*))7*(t)7>(*)
(the factor 2 in the first term results from the symmetry = gji)
f c
& 0 = 2 rf9i +2^y7 7> - ^iflyyV- (2.1.20)
ij
(g ) ,
W / t,j = l,...,n
2.1 The length and energy of curves 39
= f + ^ +Abu - ffiM) 7 * y
by using symmetries. Thus:
Examples
Example 2.1.1. The sphere
in+l
and define
n n
gi : fix - E , g : fi -
2 2 E
as
and
+,
*" > - ( I ^ j t nSsr)
(#1 and g are the stereographic projections from the south and north
2
1 n
/ 1 n , _ f X X \
2 n + 1 n + 1
1 = <* <* = * i(l
X X z z - " + ! ) 4-
x x X ,
hence
4-1
and then
2 z J
/0 = l1, . . . , n ^) .
1 4- ^
Thus
l n l {
i / 2z 2z zz - 1
,...,2
dfj_ 26 jk 4*V
2
dzk
~ (1 + z V ) '
2.1 The length and energy of curves 41
Hence
()
9ij z = = ^-6ij. (2.1.22)
Actually, the metric tensor w.r.t. the chart f is given by the same 2
9ij = (2.1-23)
n +
where (\>: R E is positive and differentiable. Then
ij 2
g =<l> 8 . ij (2.1.24)
We also put
<p : = log</>.
Then
_ % i _ _, _2__^0 _ _ r 2 dip
Next
kl
It,- = \g (9iu + 9ji,i ~ 9ij,i) (2.1.25)
= (9ikj + ?M - 9ij,k)
dip dip dip
= 6 i k 6 j k + 6 i j
- dzl~ ^ dz^'
Thus, vanishes if all three indices i, j , k are distinct, and for all i , j
T
)i T
= tj = - ^ j , a n d l ^ g fari^j. (2.1.26)
2
<p = log(l 4- | z | ) - log 2
hence
J
dip _ 2z '
j 2
dz~ " 1 + N "
42 Geodesic curves
+
= V - 2
r r r V y V TTTP - - S W ( 2 L 2 7 )
1
1 + |7l j^x + 171
We now claim that the geodesic ( t ) through the origin, i.e. 7 ( 0 ) = 0,
7
n
with 7(0) = a R is given by
2 2 2
fr[l + a \a\ fril + a*\a\
2
if.. 2\a\ a . \ 2 . 1
1 L L
= a a - cr i = 1 , . . . , n.
2
V l + \a\ a* y
Since we may assume a ^ 0 (otherwise the solution with 7 ( 0 ) = a is
a point curve, hence uninteresting), this equation holds, if a(t) satisfies
the ordinary differential equation (ODE)
0 = a !-4a. (2.1.29)
2
1 4- |a| a
The theorem of Picard-Lindelof implies that (2.1.29) has a unique solu
tion in a neighbourhood of t = 0. We then have found a solution j(t) of
(2.1.27) of the desired form (2.1.28). The image of 7(f) is a straight line
through 0. By Lemma 2.1.3, we have thus found all solutions through 0.
The images of the straight lines under the chart / 1 are the great circles
n
on S through the south pole. We can now use a symmetry argument
n
to conclude that all the geodesic lines on S are given by the great cir
n
cles on S . Namely, the south pole does not play any distinguished role,
and we could have constructed a local chart by stereographic projection
n
from any other point on S as well, and the metric tensor would have
assumed the same form (2.1.22). More generally, one may also argue as
n
follows: We want to find the geodesic arc j(t) on S with 7 ( 0 ) = po,
n n
7 ( 0 ) = V for some p e S ,V
0 0 e T S. Let c (t) be the great circle on
0 Po 0
2.2 Fields of geodesic curves 43
n
S parameterized such that Co(0) = po> co(0) = V . Co is contained in a
0
n + 1
unique two-dimensional plane through the origin in E . Let i denote
n + 1 n
the reflection across this plane. This is an isometry of R mapping S
n
onto itself. I t therefore maps geodesies on S onto geodesies, because
we have observed that the length and energy functionals are invariant
under isornetries, and so isometries have to map critical points to crit
ical points. Now i maps po and Vo to themselves. I f 7 were not invariant
under i , i o 7 would be another geodesic w i t h initial values po, Vo, con
tradicting the uniqueness result of Lemma 2.1.3. Therefore, 2 0 7 = 7 ,
and therefore 7 = c . 0
3
M q has dimension n 1, and that i t is also of class C .
n
(i) U contains the origin o / E , / ( 0 ) = #o-
n
(ii) M nV
0 = f{UD{x = 0})
l
(Hi) The curves x = C\, C \ constant, i = l , . . . , n 1, are geodesies
n
parameterized by arc-length. The arcs 1 < x < 2 on any such
f We do not introduce the concept of an abstract Riemannian manifold here, but
some readers may know that concept already, and in fact it provides the natural
setting for the theory of geodesies. O n the other hand, the embedding theorem
of J.Nash says that any Riemannian manifold can be isometrically embedded into
d d
some Euclidean space E , hence considered as a submanifold of M . Therefore, from
that point of view, no generality is gained by considering Riemannian manifolds
d
instead of submanifolds of R .
44 Geodesic curves
n n
curve between the hypersurfaces x = 1 and x = 2 are M of
the same length 1 2
(iv) The metric tensor on U satisfies
n (p)eT M,
p
||n(p)|| = l
( n ( p ) , v) = 0 for all v G r M p 0 C T M.
P
71 1
for M q (Uo C M "" ), possibly choosing V smaller, if necessary. For every
p G Mo f l V , we then consider the geodesic arc 7 ( ) with P
7P(0) = P,
7P(0) = n ( p ) . (2.2.2)
This geodesic exists for |f| < e = e(p) by Lemma 2.1.3. By choosing
V smaller i f necessary, we may assume that e > 0 is independent of p.
Instead of 7 (), we write ~/(p,t). Since the solution of (2.2.2) depends
P
/ :0b* (-e,)-M
x = y?(0),
0
-(dx
) + r * ^ dx
n 2 Sdx: t3 n n (* n
= *) ***=!,...,n.
Hence in particular
r* n = 0 for fc = l , . . . , n .
Now
1
r
1 M
n n = ^9 (l9nl,n ~ 9nn,l) = ^'ffnl.n,
since # n n = 1. Therefore
9nk = 0.
> r n
i7 (o *>7 n
(<2)-7 n
(<i)=7 n
(e)-7 n
(-e)
= 1(7).
Thus 7 A is defined on
v j], if 7 is defined on [c, e]. Since *y depends v
n
differentiably on v, and since v G E , \v\ = 1, is compact, there exists
0 > 0 with the property that for all v w i t h \v\ = 1, 7 is defined V
n
on [eo,eo]. From (2.2.5), we then conclude that for any w R with
M < eo, 7tu is defined on [1,1]. For later purposes, we also note that
by Lemma 2.1.3, CQ may be chosen to depend continuously on ZQ.
2.2 Fields of geodesic curves 47
w H- 7tu(l).
Then e(0) = z . We compute the derivative of e at 0 as
0
De(0)(v) = | 7 t (l)|
= ^7(%.o by (2.2.5)
= 7(0)
n
Hence, the derivative of e at 0 G E is the identity, and the inverse
mapping theorem implies:
n
T h e o r e m 2.2.3. e maps a neighbourhood of 0 G E diffeomorphically
l
(i.e. e is bijective, and both e and e~ are differentiable) onto a neigh
bourhood of ZQ G U. q.e.d.
rj (0) - 0 -
fc ftj fc , (0) for all i, j , fc. (2.2.9)
Proof. B y (2.1.16), <ftj = 6ij holds, since the metric tensor w.r.t. the
chart / satisfies this property and De(0) is the identity by the proof
of Theorem 2.2.3. I n order to verify (2.2.9), we observe that in our new
n
chart, the straight lines tv (v G E , t \v\ < e) are geodesies. Namely, tv is
mapped to 7 ^ ( 1 ) = y (t) (see (2.2.5)), where *y (t) is the geodesic with
v v
48 Geodesic curves
17,(0) = 0 for a l i i a n d / .
l
We next insert v = \(ei 4- e ) , / ^ m . The symmetry TJ. = T - (which m fc h
l
directly follows from the definition of T - and the symmetry = g j) k k
then yields
rj (0)
m = 0 for a l i i , / , m .
n
of gu, I = 2, . . . , n , and g instead of (9ki) ,i=2,...,d' * Particular, by
w k
ffrr(0) = l , f f r ( 0 ) = 0 . V (2.2.10)
The lines through the origin are geodesies by the construction of Rie
mannian normal coordinates, and in polar coordinates, they now become
1 n l
the curves (p ( y ? , . . . , <p ~ ) = constant; thus they can be written as
r* = 0 r for all i
l
(where of course T rr stands for T ^ ) , i.e.
il
7}9 (29rt,r ~ g ,i) rr = 0 for a l i i ,
2.2 Fields of geodesic curves 49
hence
Putting r I gives
grr = 0,
(I 0 ... o x
Vo /
Note that this generalizes the situation for Euclidean polar coord
2
inates. The Euclidean metric on M , written in polar coordinates, e.g.
takes the form
{(r,^):0<r<e }, 0
Proof. The first claim follows from Theorem 2.2.3, since Riemannian
polar coordinates are based on the diffeomorphism e (see the construc
tions before Theorems 2.2.4 and 2.2.5). As already noted before Theo
rem 2.2.3, Lemma 2.1.3 implies that .we may choose e as a continuous
0
function of p. I n order to verify the second claim, let c(t) be a curve from
p to g, w i t h c(0) = p. Let
(2.2.14)
Since the curve (,y>o)> 0 < f < e, has length e as easily follows from
Theorem 2.2.5, this will imply the claim. I n order to verify (2.2.14), we
proceed as follows:
Jo
Here, equality only holds i f g ifiip = 0, i.e. (p(t) = constant, r > 0, i.e. i f
w
C|[0 j is a straight line through the origin. The second claim now easily
follows. q.e.d.
d(p,q) : = i n f { L ( c ) | c : [a, 6] M
rectifiable curve with c(a) = p, c(6) = q}.
d(p, q) < to
can be connected by a unique shortest geodesic arc (i.e. of length d(p,q)).
This geodesic arc depends continuously on p and q.
arc-length. Thus
Cn(0) = p, c ( l ) = q, n
L(c )
n > d(p, q) for n oo.
with
geodesic. Since the length of the c are bounded because of the mini
n
the unique shortest geodesic arc between Pj-\ and pj (for this point, one
verifies that limits of geodesic arcs are again geodesic arcs, that limits
of shortest arcs are again shortest arcs, that d(pj-\,pj) < Q, and one
uses Theorem 2.3.1). We thus obtain a piecewise geodesic limit curve c,
with c(0) = p, c ( l ) = g, and
L(c) = l i m L ( c ) , n
noo
since we have for the geodesic pieces
L c = L c
( iii-..,.) B ^ ( i^-.-^-i)
noo '
L(c) = d(p,g),
2.3 The existence of geodesies 53
but w i t h C |
( S Inot being geodesic. Replacing c j by the shortest{ai 8 2 )
geodesic arc between c(s\) and c(s ) would yield a shorter curve (cf.
2
Let us point out that these normalizations are not at all essential, but
only convenient for our constructions. We look at the family of curves
Then
and
7(0, s) = c(0), 7(1, s) = c ( l ) for all s.
Redefining the parameter t, we may also assume that all curves 7(-,s)
are parameterized proportionally to arc-length. By Theorem 2.3.1, there
exists o > 0 such that the shortest geodesic between any p, q G M, w i t h
d(p, q) < o is unique. Let
be a partition of [0,1] w i t h
r
To = t < l < h < T2 < < T
0 M < tm = T m + i
and
T j T j 1 < :
~ ~ ? forj = l,...,m + l. (2.3.4)
r(7) : = r o r i ( 7 ) 2
sais/ies
i(ri (7))<i(7)
7*1(7) = 7-
L(r ( '))<L(V)
2 7
; ;
r (7 ) = 7 -
2
Therefore
L(r( ))<L( ) 7 7
w i t h equality only i f
r(7) = 7.
m times
n
Since M is compact, a subsequence of A converges to some limit
( p i , . . . , P m ) 6 M x ... x M .
N
r ( 7 ) then converges uniformly towards the piecewise geodesic 7 0 w i t h
endpoints 70(0) = 7(0)>7o(l) = 7(1) and nodes 70(r<) = Pi (i =
1,... , m ) w i t h segments 7 | 0 { T i t } being the shortest geodesic arcs be
tween their endpoints. This follows from the continuous dependence of
the occurring geodesic arcs on their endpoints (Theorem 2.3.1). We de
N
note the convergent subsequence of ( r ( 7 ) ) N N by ( 7 J , ) . For all v N
g N
then
n M
7 l / + 1 = r 7 with n[y) 6 N .
L r
OHT,-!,^)
= d
( l v ( i ~ i ) 7i/ fo)) >
hence
ra+1
L
(7^) = <*(7/ ,7 (rj)).
n
L ( ) = lim L (
7 o 7 l / + 1 ) = l i m L(r <"> ) 7
i/oo 1/oo
< l i m L(7i/) by Lemma 2.3.1
v>oo
= i(7o),
and
L(r( o))=
7 Urn L ( r ( ) )7
Voo
N
> lim L f r ^ 7 ^ ) by Lemma 2.3.1 again
= (7o).
Lemma 2.3.1 then implies that 7 0 is geodesic.
q.e.d.
K 0 := L(c),
and
N
K\ : = sup lim L(r 7(-,s)).
N
Since 7 ( - , 0) = c(-) is geodesic, r 7 ( - , 0) = 7 ( - , 0) for all n, hence K\ > K . 0
(1) K\ > K 0
N
Since 7 ( - , s) is continuous in s, so is r 7 ( - , s) for every n G N . We
now claim:
Whenever
N
s u p L ( r 7 ( - , s ) ) < KI + e (2.3.6)
n n + 1
L ( r ( - , *)) - i ( r
7 7 ( - , )) < 2c (2.3.7)
and
n
L(r (-, ) ) > ! - .
7 (2.3.8)
Indeed, otherwise
n + 1
supL(r 7 (-,)) < i -c,
N + 1
contradicting the definition of K\ (note that sup L ( r 5 7 ( - , $))
58 Geodesic curves
n
s u p L ( r 7 ( - , 5 ) ) < fti + e . n
3
n
A subsequence of ( r 7 ( - , s ))neN has to converge to some limit
n
L(r(c)) = L(c),
We are going to show that in this case, there even exist infinitely
many geodesies from p to q. For that purpose, we consider the
curve
7(^,5) = /io(sin27T5sin7rt,cos27T5sin7rt,cos7rt),
n
r 7 ( , s ) has to map [ 0 , 1 ] x [ 0 , 1 ] surjectively onto M . Therefore, for
every n N and every 5 G [ 0 , 1 ] , there exists cr (s) w i t h n
n
7(5) G r 7 ( . , a ( ) ) =:7n,s(0
n 5
n
(in other words, r 7 ( - , o - ( s ) ) is a curve passing through 7 ( 5 ) ) . 7 ,s(*)
n n
7n,a(0) = C ( 0 ) = p , 7 n , ( l ) = c ( l ) = q,
Figure 2.2.
c (0)
s =p,c (l) = q
a
and
By (2.3.5),
and since K,Q is the infimum of the energies of all curves from p to q
(o = L(c), and c is minimizing), c (-) is a minimizing curve itself,
s
hence geodesic.
Therefore, we have shown that for every 5, there exists a geodesic
from p to q that passes through 7(5). Hence there exist infinitely many
geodesies from p to g, as claimed.
q.e.d.
Remarks:
(2) We may construct the curves 7 ,s(*) at the end of the proof also
n
Exercises
2 2 2 2
2.1 For curves j(t) = (j\j ) : R -+ {(x\x ) e E |x > 0 } , con
sider
3.1 A f i n i t e d i m e n s i o n a l e x a m p l e
d 1
Let F : E E be a function of class C which is bounded from below
and which is 'proper' in the following sense:
s > inf
0 F(x).
d
xR
Then
d
{x G E : F(x) < so}
3<5 Vt/
0 with 0 < \y-x\ < 6 : F(y) > F(x).
0 (3.1.3)
F(x ) 3 > m a x ( F ( x i ) , F ( x ) ) = : o
2
d
Proof. We consider curves 7 : [0,1] R with
7(0) = x i , 7 ( 1 ) = * 2 . (3.1.4)
62
3.1 A finite dimensional example 63
We first observe that there exists a > 0 with the property that for any
such curve, there exists t (0,1) with
0
| ( 1 ) - x \ = 0, | ( 0 ) - x \ = \xi -
7 2 7 2 x \,
2
\7(t ) - x \ = 6
0 2 (recall (3.1.6)) .
By (3.1.7) then
F ( ( ^ o ) ) > o 4- a,
7
i > K . 0 (3.1.8)
F(x ) = i. 3
Since
F(xi),F(x )< , 2 0
64 Saddle point constructions
with
sup F(7(0) <n\+6 (3.1.9)
t[0,l]
31 0 [0,1] with:
F(-y(t ))0 > i - 6 (3.1.10)
|(VF)( (*o))|<c.
7 (3.1.11)
|(VF)(7(o))|> . 0 (3.1.14)
d
assumption (3.1.2) and (3.1.12), 7 ( ) stays in a bounded subset of E ,
n
and VF will then be bounded on that bounded set, and hence for any
So > 0 and all 0 < 5 < so, the curves 7 ,s(0 stay in some bounded n
set, too. This set is independent of n (as long as 0 < 5 < 5 , for fixed 0
2
F(7n,.W) < ^(7(*)) - I |VF(7n(<))| (3.1.15)
3.1 A finite dimensional example 65
Thus, in particular,
F( ,s (t))
ln 0 < F( (t)) ln - ^4 (3.1.17)
for all such t and all n. We now simply choose n so large that
i < f 4 (3.1.18)
2
F(ln,s (to))
0 < F( (*o)) -
7 |e
< i by (3.1.18).
Having proved (3.1.19), there are now various ways to construct a path
7 from X\ to X2 w i t h
v(t):[0, l]-[0,* ] 0
with
and
?
F(7(0) = i ( 7 n ( 0 ) < i - e , 0
(cf. (3.1.15), (3.1.16), (3.1.14)), and finally for all t w i t h F(y (t)) n >
K l
~ 2
F( (t)) = F(7n,. (*))<!
7 0 (cf- (3.1.19)).
F(~f (tn))>Ki-e
n n (3.1.22)
|(VF)( (* ))|<e .
7 n n n (3.1.23)
/
After selection of a subsequence, ( y (tn))neN then converges to some
n
of F and V F
F ( x ) = i
3 (3.1.24)
V F ( x ) = 0. 3 (3.1.25)
F(x )
3 > max(F(xi),F(x )) = , 2 0
Proof. For the argument of the proof of Theorem 3.1.1, we only need
F(x )
x < F(x ). 2
The argument at the beginning of the proof of Theorem 3.1.1 then shows
that (3.1.26) holds if x is a strict relative minimum. I f x is a relative
2 2
minimum, which is not strict, for all sufficiently small 6 > 0, say <5 < <5Q,
we have
and there always exists some x$ with 0 < \xs x \ < 6 and 2
F(x )
6 = F{x ). 2 (3.1.28)
0 < \x - x \ < \x _
6n 2 - x | , we put
6n 1 2
3.2 T h e construction of L y u s t e r n i k - S c h n i r e l m a n
In this section, we want to prove the following theorem, in order to ex
hibit some important global construction in the calculus of variations, in
troduced by Lyusternik-Schnirelman. The result presented is much more
elementary than the theorem of Lyusternik-Schnirelman, which says
that on any surface w i t h a Riemannian metric, e.g. a surface embedded
in some Euclidean space, diffeomorphic to the two-dimensional sphere,
there exist at least three closed geodesies without self-intersections. The
more elementary character of our setting allows us to bypass essential
geometric difficulties encountered in a detailed proof of the Lyusternik-
Schnirelman Theorem.
68 Saddle point constructions
Figure 3.1.
1
T h e o r e m 3 . 2 . 1 . Let 7 be a closed convex Jordan curved of class C in
2
the plane E . (7 then divides the plane into a bounded region A, and an
unbounded one, by the Jordan curve Theorem. That 7 is convex means
that the straight line between any two points of 7 is contained in the
closure A of A.) Then there exist at least two such straight lines between
points on 7 meeting 7 orthogonally at both end points (see Figure 3.1).
Proof. We start by finding one such line. Let C be the set of all straight
lines / in A w i t h dl C 7. We say that a sequence (l )neN C converges
n
v:I-*C
two regions into which v(t) divides A. Having chosen A\(0) and
A (0), A\(t) and A (t) then are determined by the continuity
2 2
d
t A closed Jordan curve is a curve 7 : [0, T] R with 7 ( 0 ) = 7(T") that is injective
on [0, T ) . Cf. the definition of a Jordan curve on p. 35.
3.2 The construction of Lyusternik-Schnirelman 69
Figure 3.2.
A (1)
1 = A (0).
2
K\ : = inf
v y
supL(v(t)).
^i tei
Figure 3.3.
70 Saddle point constructions
i > 0.
For this purpose, let p > 0 be the inner radius of 7, i.e. the largest p for
which there exists a disc
B(x p) 0l C A
2
for some XQ G A (B(xo,p) := {x G E : \x - x | < p})- Then
0
some to I w i t h
Area ( t ) ) = Area
0 (A' (t )).
2 0
Thus v(to) divides B(xo,p) into two subregions of equal area. v(to) then
has to be a diameter of B(xo,p), i.e.
L(v(to)nB(x ,p))=2p.0
Therefore
i > i = 2p > 0
Otherwise
3e > 0 : V n G N
0 3v n G V\ w i t h
s u p L ( v ( t ) ) < i 4-
n
t
_
V t w i t h L (v (*o)) > K I
0 n o
e
|cosai (v (*o))| > o
n
or |cosa2 (vn(*o))| > o-
3
L e m m a 3 . 2 . 1 . For every planar closed Jordan curve 7 of class C ,
2
there exists (3 > 0 with the following property: Whenever x G E satisfies
7 n v = f (u n {x = 0 } ) .
2
1
Furthermore, the curves x = constant in U correspond to geodesies, i.e.
straight lines in V perpendicular to 7, and they form shortest connec
tions to 7 fl V. By shrinking U, i f necessary, we may assume that i t is of
the form ( - , ) x (-77, rj), w i t h > 0, rj > 0. Since 7 is compact, i t can
be covered by finitely many such charts
: y
fi ( - 6 , 6 ) x (-WiVi) ~* i , i = l,...,m.
x 1 1
If we then restrict fi to ( - 6 , 6 ) ("f ' ^J, lines x = constant,
2
~ k < x < ^ , then correspond to shortest geodesies to 7, since the part
of 7 not contained in V{ is not contained in the image of fi, and hence has
distance at least ^ from the image of the smaller set ( & ) x , ).
This is indicated in Figure 3.4 where the broken lines correspond to
2
x = ^ and this is depicted for two different indices i.
Therefore, (3 : = min ( ^ ) satisfies the claim.
i=l,...,n
q.e.d.
Saddle point constructions
Figure 3.5.
v (to)
n with
|ai(*o)-Pi(to)|=0,
where p\(to) is the endpoint of v (to) where i t forms the angle a\(to)
n
f
by the shortest line segment v (to) from s\(to) to 7. By the theorem of
n
<L(vi(t ))yfl^.
0
We then let
Vn(to)
be the straight line from the second endpoint p (to) 2 of v (to)n to the
f
point where v (to) meets 7. Then, letting v(to)
n denote the segment of
v (to)
n between s\(to) and p (to), 2 by the triangle inequality
2
L(v* (t ))<L(v' (t ))
n 0 n 0 + L{v (t ))
n 0
2
= f 3 ^ 4 + L(v (t )).
n 0
2
L(v (t ))n 0 < KI-/? +
Py/l ~4 + Ki-0+^<Ki-ri
with
and
and
\Pi(t) - Si(t)\ < 0 foralU.
74 Saddle point constructions
We then choose again the shortest lines from Si (t) to 7 and replace v (t) n
by the straight line v (t) between those points, where these two shortest
n
whenever
L(v n (t)) > KI - e. 0
L(v n (t ))
n > K\ - -
n
v : J x J -+ C
with
v({0} x / ) and v({l} x I) C C 0 (3.2.1)
T(8) = (h(8)MB))
3.2 The construction of Lyusternik-Schnirelman 75
f -o
2 t2 = 1/4 t 2 = 1/2 f = 3/4
2 r -l
2
Figure 3.6.
with
t i ( l ) = 1 - t i ( 0 ) , t ( 0 ) = 0 , t ( l ) = 1,
2 2
(3.2.2)
we have
the v\ G V\ of Figure 3.5 where i>i(0) and V\(\) were point curves on
7, and we rotate v\ via the parameter t so that at 2 = 1 we have 2
t =0.
2
We define
K2 := inf sup L (v (t)).
vV [22te
Then
K2 > Ki
(2) K = tti.
2
supL(i>o (t)) = ^ 2 -
ten
76 Saddle point constructions
indeed exists.)
2
We then have for any r : J -+ I w i t h (3.2.2)
that VOOT is a critical family for i , and i t then has to contain a solution
l of our problem.
r
lim s u p ( | 7 ( r ) - 7 ( r ) | +
n = 0.
noo r \
^
ti
Figure 3.7.
Exercises
3.1 Let F C ^ M j R ) ( M an embedded, connected, differentiable
d
submanifold of R ) be bounded from below and proper (i.e. for
all 5 R, {x M : F(x) < s} is compact), and suppose F has
two relative minima X i , x . Let2
KO : = m a x ( F ( # i ) , Ffa))-
variational integral
2 1 n l n
I = f L fax ^),... ,x (t),x (t),... ,x (t)) dt (4.1.1)
Jti
1 n
for the unknown functions x(t) = (x ^),..., x (t)) w i t h fixed endpoints
x(t\) and x(t ). Here,
2
.i _ <W_
X :
~ dt'
2
We assume that L is of class C . The Euler-Lagrange equations for /
are
^-L -L =0
i xi (i = l , . . . , n ) . (4.1.2)
2
As shown in 1.2, this implies that solutions of (4.1.2) are of class C .
(4.1.3) also implies that we may perform a Legendre transformation.
Namely, by the implicit function theorem, we may then locally solve
Pi = L i (4.1.4)
l
w.r.t. x , i.e.
i
x =x*(t,x,p) ( p = (pi,...,Pn)). (4.1.5)
79
80 The theory of Hamilton and Jacobi
We obtain
H x i p j L i J L x i
- M ~ d x ^ ~ >
and with (4.1.4) then
H% x = ~L i. x
Hi x = -p.. (4.1.7)
Also
n
Pi Pj r X L/ jX ,
dpi dpi
and thus again with (4.1.4)
H =x\ Pi (4.1.8)
2 j
I = j [x Pj -H(t,x,p)) dt, (4.1.9)
Jti
where the unknown functions are x(t) and p(t), has Euler-Lagrange
equations (4.1.7) and (4.1.8), and so does
2 j
J = - [ (x pj+H(t,x,p))dt. (4.1.10)
l
~H (x(t),p(t)) = H ix x + H pi Pi = 0 (4.1.11)
1 2
H= -\p\ + V(x),
x =p
P=-V . x
4.2 T h e H a m i l t o n - J a c o b i equation
n + 1 1 n
Assumption. There is given a set fi C M = { ( ^ x , . . . , ) } with
1 71
the property that for any points A, B fi, A = (a, ft , . . . , ft ), B =
1 n n
(s, q ,..., q ), there is a unique solution x(t) = ..., x (t)) of
(4.1.2) contained i n fi w i t h (a, x(o~)) = A, (s,x(s)) = B.
Thus, fi is covered by solutions of (4.1.2), and those can be considered
as functions of their endpoints. Thus
n n
= (t;s,q\...,q ;a^\...^ ) (4.2.1)
and also
n l n
Pi = gi(t;s,q\...,q ;o-,K, ,...,K, ) = L i. (4.2.2)
In particular,
1 1 71
= (a; s , ? , . . . , ^ ; a, ft ,..., ft ) (4.2.3)
i 1 n 1 n
Q* = f (s\s,q ,...,q ;a,K, ,...,K, ).
We also define
l n 1 n
ipi := gi(a; 5 , q ,..., q ; a, ft ,..., ft ) = L i (a, k ft, ft) (4.2.4)
x n 1 n
Vi := gi(s; s,q ,... ,q ;a, ft ,..., ft ) = L^(s,q,q).
In the sequel, / * etc. will mean a derivative w.r.t. the first independent
l
variable, f etc. a derivative w.r.t. the second one. Inserting (4.2.1),
(4.2.2) into i " , we obtain
J = I(s,q,a,K) (4.2.5)
82 The theory of Hamilton and Jacobi
and call this expression the geodesic distance betweeen A and B. I n this
connection, I is called eiconal. Recalling (4.1.9), we may write
I = j ( < -H(t,x,p))dt.
p< (4.2.6)
{
I s = q
Vi - H(s, q, v) + j f {g'J + f' 9i - H f' Xi - H gty
Pi dt
p + f = 0 for t = s
f = 0 for t = a,
l
h = Vi<j - H(s,q,v) - Vitf
= -H{s,q, v)
i
= L(s,q,q)-q L^. (4.2.7)
Next
q3 f + ]
~l [W *W~ *W
t=s
9i^\ again by (4.1.7), (4.1.8)
1 n
^(s; 5, g , . . . , q ; 1
G , K , . . . , K) N
by (4.2.3)
Thus
I q j = Vj =L#(s,q,q). (4.2.8)
4-2 The Hamilton-Jacobi equation
Analogously,
4* = ->j = -L (o-,K,,k).
kJ (4.2.10)
h + H( (4.2.11)
1= f L(t,x(t),x(t))dt= f L(t,fJ)dt
J a J o
and
and so
= L(s,q,q)+ L if
p =-f fort = 5
/*' = 0 for t = a,
hence
I s = L(s,q,q) - L#q\
84 The theory of Hamilton and Jacobi
3 x 3
dq dq
I t= s
dp
rT"
L +i
with
Po = >t
Pi = <Px*-
D e f i n i t i o n 4 . 2 . 1 . If
1 n 2
<p = G(t, x , . . . , x , A i , . . . , A ) n with G e C (4.2.14)
and
det(G x % ). . = 1 n ^ 0 (4.2.15)
we call
n
<p = G(t, x\ . . . , x , A i , . . . , A ) + A n (4.2.16)
canonical equations
{
H =x Pi (4.2.17)
H. x = -Pi (4.2.18)
1 n
depending on 2n parameters A i , . . . , A , / i , . . . , / i n by solving
G Xi = V? (4.2.19)
Gi =x P i . (4.2.20)
l
purpose, we differentiate (4.2.13) w.r.t. x and obtain:
Gtxi + H G k Pk x xi + H i = 0. x (4.2.21)
G +H G =0,
tXi Pk xkXi (4.2.22)
^ = G xH + G x i x ^ . (4.2.24)
Comparing (4.2.24) and (4.2.21) and using the relation (4.2.17) just
derived, we then obtain (4.2.18).
q.e.d.
2
/ = / </?(, x) y/l + x dt (</?(, x) > 0 ) ,
Jti
already explained in Example (3) of Section 1.1 in a slightly different
notation. The physical meaning is that x(t) is considered as the graph
of a light ray travelling in a medium with light velocity ^ ^ y , where c
is the velocity of light in vacuum. I n this example, putting
2
L(t, x, x) = <p(t, x) \ / l + x , (4.2.25)
we have
p= L = ipx
x
2
VTTx
2
H = px-L = - vV-P - (4.2.26)
7(s, g, cr, K) here is the time that a light ray needs to travel from A =
(a, K) to B = (5, q). The Hamilton-Jacobi equation I + H (5, g, I ) = 0 s q
2 2 2
I +I =<p .
s q (4.2.27)
2 2
L(t, x, x) = i ( x + a x ) (a = constant). (4.2.28)
Then
2 2
p= L x = x,H = p x - L = \(p - ax ), (4.2.29)
2 2
/ + i(/
( x - a x ) = 0. (4.2.30)
2
If we substitute I = p(t)x , we are led to the Riccati equation
2
p + 2p -^=0. (4.2.31)
4.3 Geodesies 87
- A + ^ (' lJ<\2
( x ) - a x )2\ = 0 ,
n
2 2
i.e.
2
i/>'(x) = \ A * x + 2A
and a solution
x
f
2
I = -\t+ v /a + 2Ad. (4.2.32)
Jo
The equation
I\(t, x, A) = \i
means
2 1
7o Vx /<a f + 2A
This can be solved for x; let us assume for example a < 0; then the
solution is
x = ax.
p= I x , I + H(x,I )
t x = 0,
A= H(x,p),
4.3 Geodesies
We consider the case where L is homogeneous of degree 1, i.e.
L = L&x\ (4.3.1)
Then
det L i x xj =0, (4.3.2)
88 The theory of Hamilton and Jacobi
H = ~ L + x % i =0, (4.3.3)
i
I, = L(8,q,q)-q L i=0 4 (4.3.4)
L = s/Q
with
n i j
Q = g (x\...,x )x x .
ij (4.3.5)
Q Q = 0
s(^j ")-75 " ( 4 3
- ' 6 )
Since t does not occur explicitely in (4.3.5) and since I is invariant under
transformations of t, we may choose t such that
Q = 1, (4.3.7)
j Qi*
t ~ = 0. (4.3.8)
Q iX x
l
= 2Q. (4.3.9)
Q i + Q i i = 2 Q = 2 Q x i i + 2 Q x i
{lt ) ^ Jt ^
~Q = 0 along a solution.
dt
4-4 Fields of extremals 89
E = x t
\ja Q( ( )^(t))dt=^^ gijixityxWxi^dt. (4.3.10)
Moreover,
j
Pi = Q =2 x . xi gij (4.3.12)
Thus
ij 1
H = \g Pi Pj (with g** = (gij)- ). (4.3.13)
ij
Et + \g E <E x XJ = 0 cf. (4.3.13), (4.2.11), (4.3.10) (4.3.14)
ij
x* = ^g Pj cf. (4.1.8), (4.3.13) (4.3.15)
ki
1 da
cf 4 1 7 4 3 N 4 3 5
pi = - j - ^ r P f e P i -1 - - )' ( - - )> ( - - )-
As observed at the end of Section 4.2, E depends linearly on t.
4.4 F i e l d s of extremals
n + 1
Let Q C M satisfy the assumptions of 4.2, T C ^ f y R ) . The equa
tion
T(a,K,\...,K, ) N
= 0 (4.4.1)
90 The theory of Hamilton and Jacobi
1 n
as a function of (a, K , . . . , tt ) satisfying (4.4.1). A t such a minimizing
A, we have with some Lagrange multiplier A
I a + \T a = 0 (4.4.2)
4;+AT^=0 (j = l , . . . , n ) .
1 n
These are equations for the tangent vector ( K , . . . , k ) of the solution
from A to B. A solution satisfying (4.4.3) is called orthogonal to E. We
want to use the following:
1 n
For each B (s, g , . . . , q ), we thus find a unique
A = (a (5, q), K (5, q)) G E minimizing 7(s, g, a, K). We call
J9 = -H(s,q,L ) 4 (4.4.4)
and
JJ Q =Ly, (4.4.5)
J +H(s,q,J )=0.
s q (4.4.6)
44 Fields of extremals 91
Proof.
J =Is
s + I<rO-s + I iK
K s (4.4.7)
7 - T T T D
^
aT s a + fc*T^ = 0
and likewise
Js Is
=
Jqi Iqi
Conversely
2
T h e o r e m 4.4.2. If J(s,q) is a solution of (4-4-6) of class C , there
exists a field of solutions orthogonal to the hypersurfaces J(s, q) = con
stant, and J is the geodesic distance from the hypersurface J = 0.
P i = ~H . qi (4.4.10)
92 The theory of Hamilton and Jacobi
Equations (4.4.9) and (4.4.10) state that the curves q(s) constitute a
field of solutions. (4.4.6) and (4.4.8) yield
-H = J S
Pj = Jqi
J(B)-J(A) = Jj J(T,x(T))dT
t
dx
r * r \ j
-h Ji
x +J T J dr
does not depend on this particular path, but only on the end points A
and B. We rewrite this integral as
i
' dx \
Pi -^-H(r,x(T),p(T)))dT (4.5.1)
Since this integral does not depend on the path connecting A and B, we
must have
J i =
x P i (4.5.3)
J =
t -H(t,x,p).
J then solves the Hamilton-Jacobi equation. By Theorem 4.4.2, any so
lution of the Hamilton-Jacobi equation is the geodesic distance function
for a field of solutions of the canonical equations. Thus, any invariant
integral of the form (4.5.1) yields a field of solutions.
Let us now reconsider Jacobi's Theorem 4.2.1. Let
1 n
? = G(t,x ,...,x ,A ,...,A ) + A
v 1 n (4.5.4)
be a complete integral of
n
p + H(t, x \ ..., x , P l , . . . ,p ) = 0 n (4.5.5)
Gi
x = Pi,
n
where the parameters are A i , . . . , A , fi\ . . . , /Li . For fixed values of A i , . . . ,
n
l
where x (r) now denotes the derivative in the direction of the solution
l
and not in the direction of the arbitrary curve x (r) connecting A and B.
We now vary A i , . . . , A , but keep the curve x*(r) fixed. Then the field
n
94 The theory of Hamilton and Jacobi
a < 4 5 8 )
* - -
In the same way as G(B), this expression only depends on B (A is kept
3
fixed for the moment) but not on the particular x (r). For each J3, we
find Bo on the surface
1 n
G(t,x ,...,x ,A ,...,A ) = 0 1 n
and the integrand in (4.5.8) thus vanishes along this curve. Instead of
integrating from A to B, i t therefore suffices to integrate from A to JBo,
and we obtain
GXi = ii\ (4.5.9)
% %
w i t h \i being the value of the integral from A to Bo- Thus, \i can be
considered as a constant for the solution passing through BQ.
l J
If, conversely, (4.5.9) defined a family of curves x (t, \j,fjt ) (the family
is locally unique because of (4.5.6)), then, since G\ is constant, the 3
integrand in (4.5.8) has to vanish along any curve of the family Thus
3
/dx \
l -x>)L , =0 x Xi (t = l , . . . , n ) . (4.5.10)
this means that the curves defined by (4.5.9) are solutions of the canoni
1 n
cal equations contained in the field defined by G(t, x , . . . , x , A i , . . . , A ) . n
We also observe that the parameter A is only used for specifying the sur
face G = 0 and has no geometric meaning beside that.
4-6 Canonical transformations 95
4.6 C a n o n i c a l transformations
2 n 2 n
ip : R -> R
H-> (,7r),
x = H p
(4.6.1)
(4.6.2)
b y ( 4 6 1 }
- -
and
i
dx dpi
H r + HPi
d7T4 On,
i
dx dpi
dpi dp
dnj ~
i
dx dp
drci dpi
dpi dwj
dp dx*
i
dx diTj
(4.6.3)
dp dpi'
or in matrix notation
07T
dp (r -() (4.6.4)
T
l-(ff)
T
where A denotes the transpose of a matrix A. Obviously, this is a con
dition that does not depend anymore on the particular Hamiltonian H.
mation.
Then obviously
J2
= ~hn. (4.6.5)
4-6 Canonical transformations 97
1 T
(DI/J)~ = -J(Dil>) J, (4.6.6)
or equivalently
T
(Di)) JD<4> = J. (4.6.7)
z = -JVH(t,z) (4.6.8)
For a reader who knows the calculus of exterior differential forms, the
following explanation should be useful. We consider the two-form
1 2 n
u = dx A dpi on E
1
(here, as always, we use a summation convention: dx A dpi means
l
Y17-1 dx Adpi). According to the transformation rules for exterior differ
3 1
ential forms (i.e. d^ = J^dx etc.), we have, for = ( # , p ) , n = 7r(x,p),
J x
d A dnj = y ^ i g ^ ~ ~g~[ j d> A dpk-
n n z l n
uj : = ( J A - A ( J = n\(l) ^ ^'dx A' - A d x A d p i A - -Adp . (4.6.10)
n
n times
Since
x n l n
d A A d AdTTi A Ad7T = (det Dip)dx n A Adx Adp A A d p ,
x n
we conclude Liouville's:
98 The theory of Hamilton and Jacobi
2 n 2 n
T h e o r e m 4 . 6 . 1 . Every canonical transformation ip : R > R satis
fies
det Dip-zl. (4.6.11)
q.e.d.
One also expresses this result by saying that a canonical transforma
l n
tion is volume preserving in phase space as dx A A dx A dpi A A dp n
2n
can be interpreted as the volume form of R . By what was observed in
the beginning of this section, this applies in particular to the Hamilto-
nian flow which constitutes Liouville's original statement.
After this excursion and interruption, we return to our canonical equa
tions (4.6.1) and t r y to simplify them by suitable canonical transfor
mations. Canonical transformations may be easily obtained from the
variational integral
I I L(t,x,x)dt
with
dW\
* = J* (^L(t x,x)dt +J ~)dt
)
Then, with
(4.6.12) becomes
x-(p-W )--(n
x + Wz)-H + H*-W =0 t (4.6.13)
P^W X
7T = -\\\ (4.6.14)
H* H , i.e. W =0.
t
I n the same manner, we may also take a function W(t,p, ), W(t, x, 7r)
or W ( t , p , 7r). I n the first case, we obtain for example the equations
x = W p
H* H , i.e. W = 0. t
H(t,x,W (x,Q)
x = H*(Q, (4.6.15)
i =o
7r = - i f | . (4.6.17)
1 n
c = ( c , . . . , c ) are n-dimensional tori, if compact and connected. Thus,
2n
the so-called phase space { ( x , p ) G R } is foliated by tori that are
invariant under the motion, and on each such torus, the motion is given
by straight lines.
It should be pointed out, however, that completely integrable dynam
ical systems are quite rare, in the sense that the complete integrabil-
ity usually depends on particular symmetries, and their dynamical be
haviour is quite exceptional in the class of all Hamiltonian systems.
The invariant tori may disappear under arbitrarily small perturbations.
By way of contrast, the Kolmogorov-Arnold-Moser theory asserts that
these invariant tori persist under sufficiently small and smooth pertur
bations if the coordinates of H are rationally independent and satisfy
certain Diophantine inequalities, and if the matrix H^ of second deriva
tives is invertible.
In the older literature, the notion of 'canonical transformation' is usu
2n 2n
ally applied to any transformation ip : R > R that preserves the
form of the canonical equations, i.e. (4.6.1) is transformed into (4.6.2),
but without requiring that
H*(t,,ir)=H(t,x,p).
= 2x , TV = p
with H * = IE.
p W x , 7T = W$ (4.6.19)
4-6 Canonical transformations 101
still hold. This may be used to explain Jacobi's theorem once more, as
we now shall see.
1 n
Let I(t,x ,..., x , A i , . . . , A ) be a solution of the Hamilton-Jacobi
n
equation
J + #(*,x,J )=0,
t x (4.6.20)
We now choose
n 1 n
W{t,x\...,x ,t ,...,S )
1 n = I(t,x ,...,x ,tu...,Z ).
n
P= h
7T = -It: (4.6.22)
H*(t,t,n) = H(t,x,p) + It.
Because of (4.6.20),
H* = 0 .
i =o
7T = 0.
= A = constant
7r = I\ = fi = constant.
h =
Ix = P
1 n
w i t h parameters A = ( A i , . . . , A ) , fi = ( / i , . . . , /L* ).
n
102 The theory of Hamilton and Jacobi
That text will give readers a good perspective on the present research
directions in the field.
Exercises
4.1 Discuss the relation between the canonical equations for the
energy functional E and the equations for geodesies derived in
Chapter 2.
4.2 (Kepler problem) Consider the Lagrangian
2 3
L(x,x) = \ \x\ + r i r for x G E .
2 |x|
Compute the corresponding Hamiltonian and write down the
canonical equations. Show that the three components of the
angular momentum x A x are integrals of the Hamiltonian flow.
2 n
4.3 For smooth functions F,G : E E, define their Poisson
bracket as
X ]
' ' dxidpj dpjdxi'
1 n
where z = (x,p) = ( x , . . . , x , p \ , . . . ,p ) are Euclidean coordi
n
2 n
nates of E . Let z(t) = (x(t),p(t)) be a solution of a canonical
system
x = H p
P = ~H X
Exercises 103
j F{z{t))
t = {F,H}.
{F,G} = -{G,F}
K e Ai, (5.1.1)
104
5.1 Discrete control problems 105
c
where A* is a given control restriction (A* C M ) to determine
w i t h cost
ki(Xii A$).
K (x ,\ ,...
u u u ,\ )
n := ^ f c t ( x j , A i ) , with x\ i+ = (fi(xi, A<). (5.1.3)
We wish to minimize the total cost of the process and define the Bellman
function
I (x )
v v = inf (k (x , A) + I +i
v v v ((ft, (x, A))) forv=l,...,n
AA
(5.1.5)
(here, we put I +i = 0). Furthermore, ( A , . . . , A ) G A^ x x A ,
n n n
Proof. Since
K. (x \ v v \ , . . . , A ) = k (x ,
v n v v \ ) -h i ^ - i - i ((p (x ,
v v v \v)\ A | / 4 - i , . . . , A ) ) ,
n
we get
J,/(x) = inf jFf(x;A,...,A ) n
i = t/,...,n
= inf \k (x ,\ )+
v v v inf K +1 ( < / v ( ^ A); A + i , . . . , A ) ]
v n
A e A
AA \ i i /
j = i / , . . . , n,
x +\ = Kp (x , X (x )).
v v For an arbitrary initial value x\, an
v v v
Ai : = A i ( x i ) , x 2 := ^ i ( x i , A i ) , A = A (x ),... 2 2 2 .
d
K(h,x(ti)) for a path x : [ t , * i ] 0 R
x(t )
0 = x 0
x(h) G B t
d
w i t h a given set B\ G R - We have the control equation
X(t) G A
5.2 Continuous control problems 107
c
for some given A C R . Pairs (\(t),x(t)) satisfying all these restrictions
are called admissible, and the set of admissible pairs is called P(to,xo).
We put
I{t ,x ):=
0 0 inf K(ti,x(ti))
(A(t),x(t))P(to,*o)
(Bellman function).
L e m m a 5.2.1.
Proof, (i) is obvious. For (ii), if t < T\ < r < t\, the set of all admiss
0 2
J{t ,x )
0 0 = J(h,x{ti)) = K(ti,x(ti)),
Lemma 5.2.1 implies that for those t for which I(t,x(t)) is differen
tiable ((\(t),x(t)) e P(t ,x )) 0 0
inf: ( / t ( r , O + / ( r , O / ( r , f , A ) ) = 0 .
Proof. This follows from the proof of Lemma 5.2.1. Namely, the assump
tion implies that we may select A such that the path is optimal at the
point (r, ) under consideration.
q.e.d.
Jt0
v(h).
Given p : [to,t\] E with
p{h) = 0
we put
2
J(t,u,v) = p(t)u (t) +v(t).
Then
J(h,u{t ),v(t ))=v(t )
1 1 1
2 2 2 2 2
^-J(t,u(t),v(t)) = (3 p u + 2p(5u\ + A = (/Jpu + A ) > 0,
which is essentially the same as the one considered at the end of 4.2 with
integrand given by (4.2.28). We recall that the latter one had also been
reduced to a Riccati equation.
Equation (5.2.5) expresses the control parameter as a function of the
state of the system. We just have a feedback control: knowing the state
at a given time determines the control needed to reach an optimal state
at the next time.
(x
(5.3.1)
with the control conditions
x(t )
0
(5.3.2)
x(t)=f(t,x(t),X(t))
110 Dynamic optimization
w i t h controls
X(t) A C R C
1
C . Also, to is fixed, whereas t\ > to is variable subject to the restriction
(5.3.3). We define the Pontryagin function
H(x, A , p , t , / / ) :=p-
0 f(t,x,\) - ii F(t,x,\).
0
ond
p = -H x , x = Hp (5.3.5)
and of f/ie end poinf f i , we /love the trans vers ality condition
j
da
P(h) = --:(tux(t ))-a . 1 j (5.3.6)
ri(t)=H(x{t),\(t),p{t),t,iM>) (5.3.7)
and
r)(t) = H t (5.3.8)
V(h) = ^ ( t u x ^ a j . (5.3.9)
Remarks:
x = f(t,x(t),\(t)).
c
(2) I f A = M , then (5.3.4) becomes
H (x(t),\(t),p(t),t,t )=0-
x M)
d
with x(to) = #o, #(^i) = x i , x : [to, t i ] R and where x(t) is required
to have piecewise continuous first derivatives. We introduce the control
variable through the control equation
X(t) = x(t)
d
with A = M , i.e. no constraint imposed. We have g(ti,x(ti)) = x\
x(t\). The Pontryagin function of this problem is
P = -n x
p(ti) = a
H(t,x(t),\(t),p(t),ii ) 0 = max(t,x(t),A,PW,W>)
and 77 C ( [ t , t i ] , R ) w i t h
0
r?(t)=W(t,rc(t),A(t),p(t), ) W)
q(ti)=0.
and
p = Hx=0 , hence p = a since p(t\) = a.
Thus
W = a A,
a = 0,
and
W (t,x(t),A(t),p(t),l)=0
A (5.3.11)
L (t,x(t),x(t))
xx is positive semidefinite. (5.3.12)
~M = L .
x (5.3.13)
at
A basic reference for the variational aspects of optimization and control
theory where also a detailed proof of the Pontryagin maximum principle
together with many applications is given is
E. Zeidler, Nonlinear Functional Analysis and its Applications, I I I ,
Springer, New York, 1984, pp. 93-6, 422-40.
Part two
Multiple integrals in the calculus of
variations
1
Lebesgue measure and integration theory
d
Definition 1.1.1. A collection E of subsets ofR is called a a-algebra
d
(on R ) if
d
(i) R G E
d
(ii) / / A G E, then also R \ A G E
A
(iii) / / A G E, n = 1,2,3..., then also JJ^Li n e E.
n
The Borel a-algebra is the smallest a-algebra containing all open sub
d
sets of R . The elements of the Borel a-algebra are called Borel sets.
(iv) 0 G E
(v) I f A n G E, n = 1,2,3 . . . , then also (XLi 6 E.
(vi) ISA, Be E, then also A - B := A \ (A f l B) G E.
fi: E R+ Pi {oo}.
117
118 Lebesgue measure and integration theory
1
'Countably additive here means that
oo \ oo
(n=l
U A n
)
/
=
TO=1
E
(vii) jx(0) = 0
(viii) I f A , B G E, A c B , then fi(A) < fi(B)
(ix) I f A G E, n = 1,2,3,... and A C A + 1 for all n , then
n n n
00 \
1J A = lim
n fi(A ).
n
/ n>oo
n=l /
d
T h e o r e m 1.1.1. There exist a (unique) a-algebra E on R and a
(unique) measure [i i n E satisfying
d
(x) i4nt/ open subset ofR is contained in E (i.e. E contains the Borel
a-algebra)
(xi) For
d d j
Q := [ x = (\x . ..,x )eR \ aj < x < bj , j = 1 , . . . , d] ,
f : A E U { o c }
is called measurable if
{xeA\ f(x)<\}
D e f i n i t i o n 1.1.4.
d
(1) Let AcR be measurable with fi(A) < oo,
k
I f(x)dx := l i m / s (x)dx
n
N
JA -* JA
{ m
n
f(x)
if f(x) < m
iff(x)>n
ifm<f(x)<n.
We say that f is integrable if
x d:i
lim / fmA )
integrable on A and n
{xeA\ f(x)^g(x)}=0,
X
then g e C (A), and
I f(x)dx = I g(x)dx.
JA JA
In particular,
f f(x)dx = Oiffi(A) = 0.
JA
l l
(2) Iffe C (A), then \f\ e C (A), and
\[ f{x)dx\< [ \f(x)\dx.
\JA I JA
1
(3) If f C ^), h: A-> R U { 0 0 } measurable with \h\ < / , then
l
h C {A) and
and if
00 .
/ \f(x)\dx<oo,
j A
n=l n
l
then f e C (A), and
[ f(x)d X = f; / f(x)dx.
jA
JA n=i n
X
Conversely, if f e C (A), then this equation holds for any such
sequence (A ) ^.
n ne
122 Lebesgue measure and integration theory
l
(6) Iff G (A), then for every e > 0, there exists
d d
<p G C$(R ) := {g :R -+R continuous] {x \ g(x) ^ 0} bounded}
with
r
\f{x) - (p{x)\dx < e.
c d
T h e o r e m 1.1.5 ( F u b i n i ) . Let A C R , B C R be measurable, and
write x = (,77) G A x B. If f : A x B R U { o c } is integrable, then
j f(x)dx= j ( [ f(a,r))dri)dt
JAxB JA \JB /
ffor,)di) dn.
IB [JA
(Here, for example j B /(, n)dn exists for almost all A)
l
For / C (A), we put
lim / f (x)dx
n < 00,
n-^ooJ A
00 ~
< OC,
Y] / fn(x)dx
s
then YlnLi fn i> integrable, and
J A ^ ^ J A
d
T h e o r e m 1.2.2 ( F a t o u ) . Let A C R be measurable, f : A RU n
L
1A
f (x)dx
n < K < 00 for some K independent of n.
/ l i m i n f f (x)dx
n < liminf / f (x)dx.
n
n
JA N
^ ^ JA
d
T h e o r e m 1.2.3 (Lebesgue). Let A C R fee measurable, f : A * n
U(x)
t ( \
:=<
fn for 1/n < x <
1 - -
2/n 1 , ^ ^
(n>2)
10 otherwise.
124 Lebesgue measure and integration theory
Then
lim fn = 0,
and
lim / f (x)dx
n = 1^0= / f(x)dx.
n Jo Jo
The f do not form a monotonically increasing sequence so that B.Levi's
n
theorem does not apply, and they are not bounded by some integrable
function that is independent of n so that Lebesgue's theorem does not
apply either. Considering f instead of / , we finally obtain a sequence
n n
57/OM) <<Kx).
Then
q.e.d.
2
Banach spaces
In this chapter, we present some results from functional analysis that will
be needed in the sequel, in particular in the next chapter. A l l proofs are
supplied. As a reference, one may use any good book on functional anal
ysis, e.g. K . Yosida, Functional Analysis, Springer, Berlin, 5th edition,
1978, pp. 52-5, 81-3, 90-92, 102-28, 139-45 or F. Hirzebruch, W . Schar-
lau, Einfuhrung in die Funktionalanalysis, Bibliograph. Inst., Mannheim,
1971, pp. 60-88, 107-12. (These were also our main sources when com
piling this chapter.)
satisfying
(iii) \\v + w\\ < \\v\ \ + ||w|| for all v,w G V (triangle inequality)
A sequence (f )nN C
n V is said to converge to v V if
125
126 Banach spaces
\\v -Vm\\
n < .
Examples
(1) Every finite dimensional normed vector space is a Banach space,
d
for example R w i t h its Euclidean norm |-|.
d
(2) Let K C R be compact. C(K) := {/ : K R continuous},
: = x r
ll/lloo P Z G A : l / ( ) l f f C(K)> defines a Banach space.
S U
m
If we equip C (K) := {/ : K R m-times continuously dif
ferentiable}, m G N , w i t h the norm I H I ^ , i t is not a Banach
space, because it is not complete. Namely the convergence w.r.t.
s u n r m
IN loo * i f convergence, and while the uniform limit of con
tinuous functions is continuous, in general the uniform limit of
differentiable functions is not necessarily differentiable.
(3) Let (V, 11-||) be a Banach space, W cV & linear subspace that is
closed w.r.t. ||-|| i.e. if (u> )nN C W converges to v G n
In addition, we require
(iv) H is complete w.r.t. the norm \ \v\\ : = (v, i.e. a Banach space.
Inserting A = and expanding with the help of (i), (ii) yields the
Schwarz inequality
2 2 2
Since + w | | = (v + w, v + w) = \\v\\ + | | w | | + 2 ( v , i u ) , the Schwarz
inequality in turn implies the triangle inequality.
q.e.d.
Example 2.1.1. Let (V, ||-||) be a normed space. Then for every \i < 0,
:= {# G V | ||x|| < fi} is convex. Namely if x,y G i.e. |x| <
< A*, then for 0 < t < 1
hence -h (1 t)y G
2 ^ + 2/) (2.1.3)
Nl<i, llvll<i.
Proof. I n the situation of Definition 2.1.3, for eo > 0, we may find So > 0
such that for all z,w with \\z\\ = \\w\\ = 1, we have
1 - 6 <
then
x
2(lWI +
IMl) > 2 ^ + ^)
R llyll
> 1 - 3<5.
x y e
R llyll <2-
Now
x x
1^ < y
Ml + IMI +
by the triangle inequality
<4* + f.
2.1 Basic properties of Banach and Hilbert spaces 129
>l-6=>\\x-y\\<e
and
lim 2 (n
x
H" %m) l. (2.1.6)
n,moo
Then (x ) n converges to some x G V with \\x\\ = 1.
|\%n %m j j ^ ^
\\x\\ = lim | | x | | = 1. n
q.e.d.
+
p : V -+ R+ (R : = {t G R | t > 0 } )
130 Banach spaces
is called convex if
f\v0 = /o-
and satisfies
Remark 2.1.2. We shall need the Hahn-Banach theorem only in the case
where V possesses a countable basis, i.e. is separable (see p. 130).
r
Proof. We may assume VQ ^ V. Let v G V \Vb, V\ be the linear subspace
of V spanned by Vo and v, i.e.
Vi : = {x + tv \ x G Vb, , t G l } .
f (x)
0 + ta< p(x 4- tv). (2.1.10)
hence
~fo(x )
2 +p(x 2 + v) > -fo(xi) - p ( - x i - v). (2.1.13)
Thus
a2 : = inf ( - f {x )
0 2 + p ( x - f f))
2
Therefore, any a w i t h
ot\ < cx < a 2
satisfies (2.1.11) and (2.1.12), hence (2.1.10). Thus, the desired extension
/ i exists. I f V possesses a countable basis, we may use the preceding
construction to extend /o inductively to all of V.
If V does not possess a countable basis, we need to use Zorn's lemma
to complete the proof. For that purpose, let
(2.1.14)
and
| | X | | < / * + .
Bp :={xeV\
+e \\x\\ < + }.
( A i , . . . , A ) with
m
^2 ^ iOLi
S U
P $>/<(*)
2=1 =1
TO
0* + ) 5 > / <
f:V-+R
2.2 Dual spaces and weak convergence 133
then also form a vector space. I f (V, ||-||) is a normed vector space, we
define the norm of a linear functional / : V E as
+ : = s u p I M G | + u W (2.2.1)
x*o IFII
The easy proof is left to the reader. (See also Lemma 2.3.1 below.)
q.e.d.
\\fn-fm\l <.
We now consider
(V*)* = : V**,
a linear functional
Proof. We have
l ( / , * ) l < l l / I U M I ,
and therefore
f (tx)
0 := t\\x\\ for t G E.
= 1
and
l ( / , * ) l = N I -
Therefore
i :V -> V**
Remark 2.2.2.
D e f i n i t i o n 2.2.3.
f G V*, in symbols:
v
XJI x.
+ 1
v < m, we may find a subsequence ( / ^ ) of (f) for which also
+1
( / r ( 2 / m + i ) ) n N converges. The diagonal sequence (/)nN then con
verges at every y , v G N , and since (y )v^
v is dense in V ,
v (fn{x)) en n
q.e.d.
Remark 2.2.3.
sequence.
f Separable means that V contains a countable subset {y )veN that is dense w.r.t.
u
q.e.d.
I M = I I ^ ) I I = sup (2.2.4)
fev* 11/11*
is bounded (see Lemma 2.2.3 for the first equality). Since i(x ) is linear, n
with
Bj+x C JBJ and l i m Qj = 0
j*oo
/o e n -Si-
Because of (2.2.5), we have
Example 2.2.1.
w i t h ||e$|| < 1 for all i and ||e* ej\\ > 1 for i ^ j . Such a
sequence cannot converge w.r.t. ||-||, because i t is not a Cauchy
sequence, but i t always contains a weakly convergent subsequence
according to Corollary 2.2.1 (we have shown Corollary 2.2.1 only
under the assumption of separability, but i t holds true in general).
reflexive, then V* satisfies the first axiom of countability w.r.t. the weak
topology.
U , ,..., (f)-={geV*\
t vl Vk \g(vi) - f(v )\ t < fori = l,...,fc}.
u ... (f)
tWilt tWik
i.e. g G v . ., (f)
Vlim as required.
iVk
/o(x ) = 1
0
/ok=0.
/(*) o
/ k = o .
Corollary 2.2.2. yl Banach space (V, ||-||)is reflexive if and only if its
dual (V*, \ is reflexive.
\\x\\ < l i m i n f | | x | | .
n
2.2 Dual spaces and weak convergence 139
||x|| > l i m | | x | | . n
n>oo
As in the proof of Lemma 2.2.3, we may find f eV* with
11/11. = i
l/(x)| = I N I -
But then
| / ( x ) | > lim | | x | | > l i m s u p | / ( x ) | ,
n n
n*oo >oo
n
n+oo
This contradiction establishes the claim.
q.e.d.
Proof (Kakutani). Let (V, ||-||)be a uniformly convex Banach space, and
let XQ* G V**. We need to show that there exists some x V w i t h 0
t ( x ) = x*,*
0 (2.2.6)
11*5*11 = 1- (2-2.7)
For every n N , we may then find / V* w i t h | | / | | = 1 and n
and
M<||xS*|| + 1+ -. (2.2.10)
n
For any A i , . . . , A G R, we have
n
and so the claim follows from Helly's Theorem 2.1.2. Since in addition
to (2.2.10) also
we must have
lim | | x | | = 1. n
n*oo
For ra > n, we have
2 2
2 - - < fn(Xn) + fn(Xm) < ||x + X \\ < \\x \\ + | | x | | < 2 + - . m m m
n n
By Lemma 2.1.3, ( # ) n N is a Cauchy sequence and converges to some
n
xo eV, satisfying
I |xo|| = 1 (2-2.11)
and
/<(*o)=x5*(/<) f o r i = 1,2,3,... (2.2.12)
The solution XQ of (2.2.11), (2.2.12) is unique. Namely, if there were
f
another solution x , on one hand, we would have
0
hence
hence
2
I N + 4 1 1 > -
and
/<K)=*S*(/*) for i = 0 , l , 2 , 3 , . . . (2.2.15)
2.2 Dual spaces and weak convergence 141
f
ever, we must have x = x$. Equation (2.2.15) for i = 0 then is (2.2.14).
0
q.e.d.
f {y)
x '= {x,y) for y G H.
We have
ll/xll = I N | .
f (y)
x = (x,y) for all y G H.
space itself, and f*f is an element of the dual M*. By Corollary 2.2.3,
it corresponds to some X\ M , i.e.
/ f ( y ) = ( * i , y ) for all y G M .
w n 1
x X\ + #2 = # i 4- #2 ^ # i Af, #2, #2 ^ Af" ,
Or, j/) = =
q.e.d.
Of course, the reader knows the preceding result in the case where H
is finite dimensional, i.e. a Euclidean space. x\ is interpreted as the
orthogonal projection of x onto the subspace M , and therefore Corollary
2.2.4 is called the projection theorem.
The next result will be needed for Sections 4.2 and 4.3 when we estab
lish the existence of minimizers for lower semicontinuous, convex func
tionals.
some Banach space V. For every e > 0, we may then find a convex
combination
N N
]P A n X n ( A > 0, n ]T A n = 1)
71=1 71=1
with
N
< e. (2.2.16)
( N N \
X n X n w i t h
Co : = I > 0, ^ A = 1> .
n
<n=l n=l
2.2 Dual spaces and weak convergence 143
is convex and contains the ball with radius | and center 0. We consider
the Minkowski functional p of C\ defined by
p(z) : = i n f { A > 0 ; A - ^ G C i } .
we have
p(x) > 1.
V = {Mo,tiR}C
0 V
/o() = fionV .
0
Then
fo <p on Vb,
2.3 L i n e a r o p e r a t o r s b e t w e e n B a n a c h spaces
The results of this section will be used in Chapter 8. I n Section 2.2, we
considered linear functionals
T:V ->W,
and we put
\\Tx\
|r||:=supiL-i E+U{oo}. (2.3.1)
IF!I
L e m m a 2 . 3 . 1 . The linear operator T : V W is continuous if and
only if\\T\\ < oo.
IW<||r|||N| (2.3.2)
implies that T is continuous. (Of course, this uses the linearity of T.)
Conversely, if T is continuous, we recall the usual e 6 criterion for
continuity, and so for e = 1, we find some 6 > 0 with the property that
\\Tx\ [
Ty
Thus
l|T||<<oo.
q.e.d.
2.3 Linear operators between Banach spaces 145
The proof is the same as the one of Lemma 2.2.2, simply replacing (R, | |)
b y W I N D .
Remark 2.3.1. Again, (V, ||-||) need not be a Banach space here.
k e r T : = {x G V : Tx = 0}
T :V ->W
L e m m a 2.3.4. Let
T:V ->W
IIT-SIKpijjp (2.3.3)
l
then S is bijective, and S~ is continuous, too.
Proof. We have
1
S = T(Id-T~ (T-S)).
146 Banach spaces
1
^(r~ (r-s)H T~\ (2.3.4)
^=0
l
l
Y^{T- {T-s)y <J2\\(T- (T-s)y\\
vm
< ^(WT-'WWT-SWY,
and since |J^ 11J \\T - S\\ < 1 by assumption, the series satisfies the
Cauchy property and hence converges to a linear operator w i t h finite
norm.
q.e.d.
v = Vi e v 2
x = X\ + x.
2
als with
e s
fo( i) = ij (hj = l , . - , n ) .
3
By Corollary 2.1.1, we may find extensions p : V R with f ^ = f^.
2.3 Linear operators between Banach spaces 147
We define 7 r : V > V as
n
Vi : = ker 7r
x = n(x) + (x n(x))
V = VbSVi. (2.3.5)
W = W 0 W\ (2.3.6)
with
the index ofT. The set of all Fredholm operators T : V > W is denoted
by F(V,W).
V - Vo 0 Vi w i t h Vo = k e r T
W = Wo 0 Wi w i t h W = coker T 0
S' :V xWo^W
x
(x, z) v-* Sx - f 2,
L(V, W ) > L ( V i x i y , W ) 0
;
Since T\ : V\ > VFi is bijective w i t h a continuous inverse, T is also
bijective w i t h a continuous inverse, and by Lemma 2.3.4 this then also
holds for all S in some neighbourhood of T. For such 5, S'(Vi) is closed
as Vi is closed and S' is continuous, and we have the decomposition
f
w= s'(v )es {w ),
1 0
;
and since 5 ( V i ) = 5 ( V i ) also
/
W = 5(V"i)0 5 ( i y ) , o (2.3.7)
v = v 'ev "eVi,
0 0
and thus
get a decomposition
w= s(v )s(v)w^
1
Consequently
/;
= dim Vo - dim W 0 since S is injective on V 0
= indT.
Tx = y. (2.3.11)
150 Banach spaces
Either
or
2.4 C a l c u l u s in B a n a c h spaces
In this section, we collect some material that will only be used in Chap
ters 8 and 9.
Definition 2.4.1. Let (V, \\-\\v), (W, ||-||w) be Banach spaces, F :V ->
W a map. F is called differentiable (in the sense of Frechet) at u V
if there exists a bounded linear map
DF(u) : V -+ W
with
lim \\nu + v)-F(u)-DF um\\ { w =
0
<:;o> IHIv
/ is called differentiable in U CV if it is differentiable at every u EU.
1
f is said to be of class C if DF(u) depends continuously on u. f is
2
said to be of class C if DF(u) is differentiable in u and the derivative
2
D F(u) := D(DF)(u) depends continuously on u.
Gy = y. (2.4.3)
If we have a continuous family G(x) where all the G(x) satisfy (2.4-2)
(with q not depending on x), then the solution y = y(x) of (2.4-3) de
pends continuously on X.
Vn := Gy -\. n
We have
n n
G _ 1 Gi l
Vn = (W - W - 0 +2/0 = 2 ( ' yi - ~ Vo) + Vo- (2-4.4)
i=l i=l
We obtain from (2.4.2)
1 1
E l l G ' - V - G^j/oll < X y - | | y i - ftll < j - Wvi ~ Vo\\
9
t= l i=l
V = l i m Gy n = G (lim y) n = Gy,
noo \noo /
1
U, F C (J7, W ) , i.e. F is continuously differentiable. For purposes of
normalization solely, we assume
(p:U ^U 1 2
with
F(x,(p(x))=0 (2.4.6)
and
1
D<p(x) = - ( D F ( x , ifix)))-
2 o D F(x 1 1 tp(x)) for all xeUx
(2.4.7)
(D F(-,y)
x : Vi -+ W is the derivative of F(-,y) : Vi -+ I V ) . I n /ac*, /or
et;er?/ x C/i, <p(x) is the only solution of (2.4-6) in U . 2
F(x,y)=0
Proof. The idea is to transform the problem into a fixed point problem
for which the Banach fixed point theorem is applicable. We put
l:=D F( yo)-
2 X(h
&(x,y).
l
Using l~ o / == id (note that / is invertible by assumption), we get
x
*(ar, yi) - y ) = l~ (D
2 2 F(x 0j y )(yi
0 - y ) - (F(x, yi) - F(x,
2 y )).
2
and
we have
II*-soil
we have
||*(ar,2/o) - * ( o , 2 / o ) | | < | -
Since $(sco, t/o) = 2/o by assumption, we then have for ||y j/o|| <
whenever x | | < <5 : = min(<5',6"). This means that i f \\x #o|| < <5,
0
A : = {y Vi : | | y - l / o | | <
onto itself. By Lemma 2.4.1, for every x with \\x x \\ < <5, there exists 0
a n
a unique y =: tp(x) w i t h ||y y | | < d 2/ = $ ( # , t/), i.e. F(x,y)
0 = 0.
Moreover, t/ depends continuously on x. We consider the open balls
(<&(sc, ) also maps the open ball t/2 onto itself.) By choosing <$, > 0
smaller, i f necessary, we may assume
U1XU2C U.
(x <p(xi))
u eUxX U 2y
h := DiF(xi,yi),l 2 := D F(x ).
2 uyi
F(x,y)=h(x-xi)+ l (x - x ) + r(ar,y)
2 2
V-+V1
154 Banach spaces
Thus
By (2.4.10), (2.4.11),
hence
with
r
lim , , ^ = 0 from (2.4.9). (2.4.13)
x-+xi \\x Xi\\
(2.4.12) and (2.4.13) yields the differentiability of (p and (2.4.7).
q.e.d.
if : Ui -+ V
2.4 Calculus in Banach spaces 155
implies
= L e 2 4 1 4
Df(<p(x ))
0 Zty(a? )) 0 " ( - - )-
q.e.d.
x . I ^ V
/ x(t)dt.
Ja
$ : R x V -+ V,
Proof. We shall solve (2.4.16) w i t h the help of Lemma 2.4.1. For a con
tinuous y : I V , we define Gy G C ( J , V ) ,
\\y\\ :=
c0 sup||y(t)||.
(To verify this, one just needs to observe that any sequence (y )neN n C
C(I,V) with
lim \\y - y \\ o
n m C (= l i m sup \y (t) - y (t)\)n m = 0
n,moo \ n,m*oo f^j J
(x(0), the value at 'time' 0, is called initial value). We denote the solu
tion by X(XQ, t). Then for s, t > 0,
Exercises
2.1 Let (V,\\'\\y) (W, \ \ - \ \ ) be normed linear spaces. For a linear
w
functional
/ : V - . W,
put
x
xev\{o} \\ \\v
Show that / is continuous iff | | / | | < oo. Let L(V,W) := {/ :
V -+ W linear w i t h | | / | | < oo}. Show that i f ( W ^ I H I ^ ) is a
Banach space then so is (L(V, W), ||-||).
2.2 Show that a normed space (V, ||-||) is uniformly convex i f the
following condition holds:
Whenever ( x ) , (y )nN C V satisfy
n n N n
and
lim ||ar + y \ n n
nco
then
lim (x - y ) = 0. n n
71OO
2.3 A normed space (V, ||-||) is called strictly normed i f the following
condition holds: Whenever x , t / G ^ , x , | / / 0 satisfy
l k + y|| = I W I+ llll
x = ay.
Banach spaces
p
3.1 L spaces
In the sequel, instead of functions / : A R U { 0 0 } (A measurable),
we shall consider equivalence classes of functions, where / and g are
equivalent if f(x) = g(x) for almost all x G A. We shall be lax w i t h
the notation, however, not distinguishing between a function and its
equivalence class. The equivalence class of the zero function is called the
null class, and a function in that class is called a null function.
d
D e f i n i t i o n 3 . 1 . 1 . Let A C R be measurable, p G R \ { 0 } .
P
L (A) = {(equivalence classes of) measurable
functions f : A R U {=boc} with
l
\f(x)f C (A)}.
p
For f e L (A), we put
I I / I I P ^ I I / I I L P ( A ) : = ( / J / W I P
^ ) P
- (3-1.1)
Thus, ||-|| is positive definite (on the set of equivalence classes). Next,
for c G R,
l|c/|| = |c|||/|| .
p p (3.1.3)
159
p
160 L and Sobolev spaces
H/iH = l . P I l / 2 | | , = 1. (3-1.6)
/ 1 W / 2 W ^ zp + ~g
have p-1 v q p
^ = l/i+/ | , 2
P
we get
||/ +/ || <||/lV'll
1 2
P
1 + ll/2V'll 1
< l l / i l l p l M I , + l l / 2 | | p I I V ' l l ,
by Holder's inequality
P
then converges in L (A). Since all elements of the series are nonnega-
+
tive, ( # m ) m N converges to some g : A R U {oo} pointwise in A,
P
and Corollary 1.2.1 implies that (g ) also converges to g in L ( A ) . I n
m
particular, g(x) < oo for almost all x G A. Thus, our original sequence
(3.1.10) is absolutely convergent for almost all x A, towards some /
p
w i t h \f\<g+ |/m|; in particular / G L ( 0 ) . We interrupt the proof to
record:
P
L e m m a 3.1.3. Let (f )neN converge to f in L (A).
n Then some subse
quence converges pointwise almost everywhere to f.
P
assumed to be a Cauchy sequence in L (A), the whole sequence has to
P
converge in L (A). I t is in general not true, however, that the whole
sequence also converges pointwise almost everywhere to / . )
This is easy:
CO
/n>(*) + E (/.+(*) " " /(*)
p
162 L and Sobolev spaces
2 2
/m(s) + E O W * ) ~ / . ) - /(*) ^ + \fm(x)\,
i/=i
2
C o r o l l a r y 3.1.1. L (A) is a Hilbert space with scalar product
JA
|(/l,/2)|<|l/l|l ll/2ll2- 2
2 2
Thus (, ) is finite on L (A) x L (A). A l l the other properties are obvious
or follow from Theorem 3.1.1.
q.e.d.
d
D e f i n i t i o n 3.1.2. Let A C R be measurable, f : A -> R U { 0 0 }
measurable.
Thus
m , n > n
xeA xeA\N
\ \fi{x)f (x))\dx
2 < esssup|/ (x)| / 2 |/i(x)|dx
JA xeA JA
Hl/alUI/ill!-
q.e.d.
d
T h e o r e m 3.1.3. Let A C R be measurable. Let 1 < p < oo, q =
9 P
i.e. ~ + ^ = 1. T/ien L ( A ) is t/ie dual space of L (A). In particular,
P
L (A) is reflexive.
l
Remark 3.1.1. The dual space of L (A) is given by L(A) while the
l l
dual space of L(A) is larger than L (A). Therefore, neither L (A) nor
L(A) is reflexive.
Remark 3.1.2. Clarkson's theorem holds more generally for 1 < q < oo.
The proof for 1 < q < 2 is a little more complicated than the one for
2 < q < oo.
q q 1
11/ + g\\ +11/ - \\
q 9 q < 2 * - (H/H; + y i;). (3.1.12)
(In order to verify the left inequality in (3.1.13), we may assume w.l.o.g.
2 2 q 2 q 2
x + y = 1. Then x < x , y < y since g < 2, and the desired
inequality easily follows. The right inequality follows for example from
Holder's inequality (Lemma 3.1.1) applied to the following functions
/i,/ :(-U)-R
2
fx = 1,
2
, . / a for - 1 < t < 0
h ( t } 2
~ \ 6 for 0 < t < 1. )
q 9 2 2
(|a + b\ + \a- 6 | ) ' < (|a + 6| + |a - 6 | ) *
2 2
<v^(a +6 )5 (3.1.14)
By (3.1.12),
q q
\\f+9\\ q + \\f-9\\ <y- q
\\f-9\\ <e g
with
i(f)(9) := J f(x)g(x)dx.
q
Thus i(f) is indeed an element of L (A)*. We claim that we have equality
q
in (3.1.17). This means that there exists some g G L w i t h
/ f(x)g(x)dx l l / I L I M L . (3.1.18)
JA
p q q
We put g(x) : = s i g n / ( x ) \f(x)\ 'K Then \g\ = hence g G L (A),
and
/ f(x)g(x)dx\ = / |/(x)^(a
(x)\ dx
p
= / j / ( * ) ii dx ?
p p
= ( j \f{x)\ d y
y A X (j \f(x)\ d y
A X
= H/llplMI,-
This verifies (3.1.18), hence equality in (3.1.17). Equality in (3.1.17)
implies that i is an isometry, in particular injective. I n order to complete
the proof we need to show that i is surjective. Suppose on the contrary
that
L"(A)* \ i{W{A)) / 0.
p
166 L and Sobolev spaces
P p
Since L (A) is complete and i is continuous, i(L (A)) is complete, hence
closed. By the Hahn-Banach theorem (Corollary 2.1.1), there then exists
veL<*(A)**,v^0, with
v\i(LP(A)) = 0.
We now suppose for a moment that 1 < p < 2. Then 2 < q < oo, and
q
L (A) is reflexive by Theorems 3.1.4 and 2.2.3. We may therefore find a
q
g in L {A) with
q
F(g) = v(F) for all F G L (A)*.
P
We then have for any <p G L ( A )
p
3.2 Approximation of L functions by smooth functions
(mollification)
p
In this section, we shall smooth out L functions by integrating them
against smooth kernels. As these kernels approach the Dirac distribution,
these regularizations will tend towards the original function. For that
d
purpose, we need some g G C o ( R ) f with
d
g(x) > 0 for all x G R (3.2.1)
Q{X) = 0 for |x| > 1 (3.2.2)
d
supp Q C B(y, h) := {z R \ \z-y\< h}> (3.2.5)
f
L e m m a 3 . 2 . 1 . Let Q C C fif> h < d i s t ( f y , d f i ) . Then
fh C(fi').
Proof. We have
and
f
f 'ft' CC H ' means that the closure of Q is compact and contained in fi. We say
that Q' is relatively compact in Q.
168 LP and Sobolev spaces
P
T h e o r e m 3 . 2 . 1 . Let f G L (Q), 1 < p < oo. Then fh converges to f
p
in L (Q) as h -+ 0.
P
Proof. We have for g G L (Q)
p
/ \g (x)\ dx
h
= I I g(w)g(x hw)dwdx
JQ J\W\<1
p
< / I / g(w)dw J I / g(w) \g(x - hw)\ dw J .
JQ \J\W\<1 J \J\U)\<1 J
by Holder's inequality
= / Q(W) / \g(x - p
hw)\ dxdw,
J\w\<i Jn
Thus
3 2 9
H^llLp(n)<IMlLp(n)- ( - - )
3.2 Approximation of LP functions by smooth functions 169
D
Let e > 0. By Theorem 1.1.4, (6), we may find <p G C$(R ) with
3 2 1 0
I I / - V H L P ( R - ) < | - ( - - )
Since <p has compact support, we may apply Lemma 3.2.2 to conclude
that for sufficiently small h > 0,
Ik -WIIILP(R*) ^ |- (3.2.11)
(3.2.10)-(3.2.12) yield
g.e.d.
p
Corollary 3.2.1. For 1 < p < oo, Cg(f2) is dense i n L ( f 2 ) .
p
Proo/. Let / G L ( 0 ) , e > 0. We may then find Q' C C 0 w i t h
<
H/llLp(n\n') 2*
/ 3 2 1 4
H/-/ llLp n)<5- ( ( - - )
f f
\\f -f h\\L (n)<~- P (3.2.15)
By (3.2.13), (3.2.14)
< 3 2 1 6
llv-alli-(n) ( - - )
170 LP and Sobolev spaces
d
Let B the set of all functions a on R of the following form: There exist
some fc, N G N and rational numbers c * i , . . . , a* and cubes Qi,..., Qk G
d
M w i t h corners having all their coordinates in -^Z and of edge length
fa such that
for x Qi
otherwise.
Clearly, B is countable. Since a continuous function (p w i t h compact
support is uniformly continuous, we may easily find some a G B w i t h
a A E
Il ~ ^IILP(Q) ^ H - <P\\LP(**) < - (3.2.17)
/(an) { 00 for k G N.
for < x < 2FTT i f a k = 0
= L
||/(a ) -/(M|| ~((0,1))
w L
j f(x)(p(x)dx = 0.
Then f = 0.
3.3 Sobolev spaces 1 1 1
2
Proof. Since Co(fi) is dense in L ( f i ) , and since
9*-> I f(x)g(x)dx
JQ
2
is a continuous linear functional on L ( f i ) , we obtain that
/IQ
JQ
f(x)g(x)dx = 0 for all g G L ( Q ) . 2
|| : = | > ,A:=(^r) ( ^ )
1
Definition 3.3.1. Let u,v G L (Q). Then v is said to be the oc-th weak
derivative ofu, v := D u,a if
k P
W *(Q) := {u G L (Q) | D u exists and lies in a
p
L (Q) for all\a\ < k } ,
jQ
\\<*\<k
k k k,p
Finally, let H *(n) andH *(Q) be the closures of C(fi) f l W (l)
k k
and C$ n W *(Q), respectively in W *(Q).
p
172 L and Sobolev spaces
k
L e m m a 3 . 3 . 1 . Let u G C (Q), and suppose all derivatives ofu of order
p k,p
< k are in L ( f i ) . Then u G W (Q), and the weak derivatives are given
by the ordinary derivatives.
q.e.d.
k p
Thus, the W > spaces constitute a generalization of the spaces of
k,p
k times differentiable functions. The W norm is considerably weaker
fc kyP k
than the C -norm, and so the W spaces are larger than the C spaces.
Before investigating the properties of these spaces, it should be useful
to consider an example: Let fi = ( - 1 , 1 ) C E, u(x) : = We claim that
l p
u G W ' (Q) for 1 < p < oo. I n order to see this it suffices that the first
weak derivative of u is given by
2,p
We claim, however, that u is not contained in W (ft). Namely i f w(x)
were the second weak derivative of u, i t would have to be the first weak
derivative of and consequently, we would have w(x) = 0 for x ^ 0.
The rule for integration by parts (3.3.1) would then require that for all
V>C3((-1,1))
0 =
= 2^(0)
which is not the case. Thus, v does not have a first weak derivative.
3.3 Sobolev spaces 173
Remark 3.3.1. Some readers may have encountered the notion of a dis
tributional derivative. I t is important to distinguish between weak and
1
distributional derivatives. Any L ( f i ) function possesses distributional
derivatives of any order, but as the preceding example shows, not nec
essarily weak derivatives. I n the example, of course, the second distri
butional derivative of u is 2<5o, where <5o is the Dirac delta distribution
at 0. u does not possess a second weak derivative because the delta
1
distribution cannot be represented by an L function.
kyP
T h e o r e m 3.3.1. The Sobolev spaces W (ft) are separable Banach
spaces w.r.t. ||-|liyfc,P(n)-
18 a
Proof. That |Hlw*.p(n) norm follows from the fact that I H I X ^ Q ) is
a norm (see section 3.1). Similarly, we shall now derive completeness of
k p
W *(ft) from the completeness of the L (ft) spaces (Theorem 3.1.1).
k,p
Thus, let ( v ) n N C W (ft)
n be a Cauchy sequence w.r.t. ||-||w*.p(n)-
This implies that (D u ) ^ is a Cauchy sequence w.r.t. I H I ^ Q ) for
a n ne
p
all |Q| < k. By Theorem 3.1.1, (D u ) therefore converges in L (ft) a n
a]
towards some v . For a<peCl (n)
(3.3.2)
p
Therefore, v is the a - t h weak derivative of t>o, the L - l i m i t of ( u ) N ,
a n n
k,p
and consequently vo G W (ft). The separability again follows from the
p
corresponding property for L (ft) (Corollary 3.2.2).
q.e.d.
k k
T h e o r e m 3.3.2. W *(fl) = H *(ft).
kyP
This result says that elements of W (ft) can be approximated by
n
C(Q) functions w.r.t. IHIivfc.p(Q)' I general, however, for k > 1 one
p k p k
has # o ' ( Q ) ^ W ' (ft) so that W *(ft) functions cannot be approxi
p
mated by C g ( f i ) functions, in contrast to L (ft) functions where this
is possible (Corollary 3.2.1). This is seen from the following simple ex
ample:
ft = (1,1) C K, u(x) = 1. I f (<p )neN C C^ (ft) converges to u in n
3
1
L ( f i ) , then after selection of a subsequence, i t converges pointwise al
most everywhere (Lemma 3.1.3), and therefore, for sufficiently large n ,
there exists x n G ( - 1 , 1 ) w i t h ip (x ) n n > \. Since <p (l) = 0 = y> (l)>
n n
1,p
(p cannot converge to u in W ( ( 1 , 1 ) ) .
n
k p
Proof (Theorem 3.3.2). We have to show that any u E W < (fl) can
be approximated by C(fi) functions. As in 3.2, we extend u to be 0
outside f i and consider the mollifications UH C(fi). We compute
u d u s i n
D (u (x))
a h = Da Q (^J^jiX ' (y) V ( g Corollary 1.2.2)
by definition of D u a
= (D u) (x). a h (3.3.3)
D u in LP(Q)> for all | a | < fc, and this means that Uh converges to u in
a
k
w *(n).
q.e.d.
kyP
Proof. I t follows from Theorem 3.1.3 that the dual space of W (ft) is
k,q
given by W (fl), w i t h ~ + ~ = 1. This implies reflexivity.
g.e.d.
,p
Proof. This follows from Lemma 2.2.5, since HQ (Q) by its definition is
k,p
a closed subspace (w.r.t. strong convergence) of W (fl).
q.e.d.
k,p
T h e o r e m 3.3.5. For 1 < p < oo, k N , any sequence in W (Q) that
is bounded w.r.t. IHIjy*.p(n) contains a weakly convergent subsequence.
3.4 Rellich's theorem, Poincare and Sobolev inequalities 175
k p
Proof. By Theorems 3.3.1 and 3.3.3, W ' (ft) is separable and reflexive.
Therefore, the result follows from Corollary 2.2.1.
q.e.d.
d
T h e o r e m 3 . 4 . 1 . Let ft C R be open and bounded. Let (w )nN C n
p
HQ' (Q) be bounded, i.e. \\u \\ i, ^ < c (independent of n). Then a
n W P
p
subsequence of (u )neN converges in L (ft).
n
Remark 3.4-1-
Rellich originally proved the theorem for p = 2. Kon-
drachev proved the stronger result that some subsequence converges in
L(fi) for 1 < q < ^ if p < d and for 1 < q < oo if p > d. Of
course, these exponents come from the Sobolev Embedding Theorem
(see (3.4.12)). See Corollary 3.4.1 below.
p
Proof. Since u n G HQ' (Q), for every n G N and e > 0, there exists some
vn G CQ (ft) w i t h
t;
r n ~ n||v^i.p(n) < o (3.4.1)
Therefore
v x
n,h( ) = Jfi J^Q (~/~) v (y)dy
n
of v n and estimate
\v (x)
n - V (x)\
nyh
L \w\<\
g(w)(v (x) n
h w
v (x
n hw))dw by (3.2.7), (3.2.8)
r r\\
w . (3.4.3)
< / Q{w) / v (xn - n?) drdw with
J\w\<l JO
\dr M
176 LP and Sobolev spaces
This implies
p
/ \v (x) n - v , (x)\ dx
n h
JQ
( f
f f h M
\ d Y
< / I / g(w) / v (x-rd) n drdw] dx
JQ \J\W\<I JO \ur J
w
f ( f f i\ i f^\ \ I Q
= 1 6 w Vn X
drdw dx
Jn\J\ |<i V ^ ^ " " / ^^ J \dr ^
p p p
< ( j g{w)dw] ( [ g{w)h \w\ f \Dv (x)\ dxdw]n ,
VM<i / vM<i J j
Dv n
Vn
dxi "-"dx<* J' Vn
I K - nM\ P(Q)
v
L ^ H
11 D v
n 11 LP (Q)
< he' by (3.4.2)
< - if h is sufficiently small. (3.4.4)
Next,
X c
KM )\ ^ ^ o l K I I L i ( n )
by Holder's inequality,
and similarly
c m e a s n lp
X
^ J^i i( y \K\\ (3.4.6)
dx' Vn,h( ) LP{Q)
K , h | | i ( ) < constant
C Q
(3.4.7)
space (e.g. a Banach space) is totally bounded, there exist finitely many
p
wi,... ,U)N L ( f i ) such that for every n N there exists 1 < j < N
with
v 3 4 8
\\ n,h - W j | | LP( n ) < I' (--)
p
Thus, ( w ) n N is totally bounded in L ( f i ) . Therefore, the closure of
n
p
(u ) w
n n in L ( f i ) is compact (again, a general result for metric spaces),
p
and i t thus contains a convergent subsequence in L ( f i ) .
q.e.d.
d p
T h e o r e m 3.4.2. Let fi C R be open and bounded. For any u e H^ (ft)
i
E d
IHIz, n)<( ^) |l^llLp(n)
P( (3-4-9)
d
where u?d is the Lebesgue measure of the unit ball in R .
p
Proof. Since CQ(Q) is dense in iJQ' (fi), we may assume u e C(j(fi). We
d d
put u(x) = 0 for all x e E \ S l . For $ e R with = 1, we have
f d
ulx) = u(x 4- rd)dr.
Jo dr
1 f t d
/ / u(x + rfodddr
duj Jo J d w = 1 dr
I f 1 , .
p
178 L and Sobolev spaces
Therefore
(J\u{x)\>dxY
p-1
I D u i v ) r d y d y dx
k ( i (L J T ^ F ) [L ^ )
by Holder's inequality
i /
p 1
|Du(y)| d) ( / - ~dx\ 1
(3.4.10)
du, \ J
d 1
W \x - V "
N
L \ x -y\
y\ - d l d X
'
we choose R with
d
meas ft = meas B(y, R) = UdR
d
(B(y,R):={zeR \ \z\ < R}).
Since
J ^ ^ - J F * **\*-v\>R
^ T ^ - ^ ior\x-y\<R,
we have
j d-i^ x
- / d~i^ x
y\ JB(y,R) \x - y\
JQ \x - y\ JB(y R)
y \x - y\
= dw R d (3.4.11)
1 1
= dw d (meas ft) .
P
T h e o r e m 3.4.3. Let u G H^ (Q).
d
(i) If p < d, then u G L ~^p(Q), and
\\u\\^<c\\Du\\ . p (3.4.12)
with exponent 1 ^ . )
We only prove (i) as (ii) will not be used in the present book:
d
a ( f
H y ) \ ^ < Ml / IAti(W
Using Holder's inequalityf, we compute
oo d
1
\u(x)\*=* dx
/ -oo
\J oo / J oo \ J oo J
1
1 d
1
< (y iDMy^dy ^ (jlf |A()|dy'dx^ .
1
jlPILi(n)- (3-4-14)
M
H I I <gjf Nx)rMiM*)i<fc
lA
<^||H-
by Holder's inequality.
p
For p < d, we may take = ^7}^ and obtain
P
HQ (ft) be bounded for some 1 < p < d. 77ien a subsequence converges
q
in L (ft) for anyl<q<-^.
if fi satisfies - = - f (1 /z) ( -
q \p d
M
< c\\u n - tz ||i
m ( n ) \ \D(u n - Wm)|lip n)
( (3.4.15)
by Theorem 3.4.3 (i).
Exercises 181
P
Since Du is bounded in L (Q) by assumption, and (u ) is a Cauchy
n n
p
sequence in L ( f i ) , hence also in (3.4.15) then implies the Cauchy
q
property i n L (ft).
q.e.d.
Exercises
3.1 Let
d
Ai :== {x G R | > 1} , A : = {x E 2
<i}.
and consider
x
f{x) = \\x\\ for A E R .
P P
For which values of d,p, A is / G L ( A ) , or / G ( A ) ? X 2
d =
3.2 Let A C R be measurable. Let pi,... ,p/t > 1, ]Ci=i jr 1>
X
/< G LP* (A) for * = 1 , . . . , k. Show / i ... f k G L ( A ) , with
^nii/'iu-
i <=i
d
3.3 Let A C R be measurable, meas A < oo, 1 < p < g < oc. Then
q P
L (A) C L ( A ) , and for / G L(i4)
l-Q
L(A) ^ LP(A) L"(A) '
d
3.5 Let A C R be measurable, meas A < oo, / : A - R U { 0 0 }
measurable. Then
||/-/|| ^0 p asn^oo?
2
3.8 Consider the sequence ( s i n ( n x ) ) N in L ( ( 0 , 1 ) ) . Does i t con
ne
2
verge in the L -norm? Does i t converge weakly? I f so, what is
the limit?
4
The direct methods in the calculus of
variations
d
where fi is some open subset of R (in most cases, fi is bounded), among
functions
u : fi -+ R
183
184 Direct methods
4.2 L o w e r s e m i c o n t i n u i t y
We say that a topological space X satisfies the first axiom of countability,
if the neighbourhood system of each point x X has a countable base,
i.e. there exists a sequence (t/ ) eN f open subsets of X w i t h x G U
I/ 1/ v
with the property that for every open set U C X with x G U there exists
n G N with
V cV.n
the property that for every open subset V of X , there exists n G N with
U CV.n
{ 7 ( x , r ) := {x G X : d{x,x )
M v < r^}
4-2 Lower semicontinuity 185
n00
L e m m a 4.2.1.
Examples.
Definition 4.2.2.
F ( x ) < l i m F ( x ) = inf F ( x ) ,
0 n
noo xX
Remark 4-2.1. The argument of the preceding proof also shows that in a
separable reflexive Banach space, a weakly proper functional is coercive
w.r.t. the weak topology.
F(x )
n converges to some K G R. By Theorem 2.2.4, for every m G N and
every e > 0, we may find a convex combination
N N
Vm '= ^ ^ X x n n (X > 0, ^ ^ A = 1)
n n
nm n=m
with
Since F is convex,
N
X F
F(y ) m < n (*n)- (4.2.1)
n=m
F(x )n < K+ e.
l i m s u p F ( ? / ) < K. m
raoo
moo m-^oo
d d
L e m m a 4.3.1. Let Q C R be open, f : fl x R R, with f(-,v)
d
measurable for all v G M. , /(#, ) continuous for all x eft, and
p
f(x,v) > -a(x) + b\v\
188 Direct methods
d 1
for almost all x G ft, and all v G R , with a G L (ft), b G R, p > 1.
$(v) := / f(x,v(x))dx
JQ
p p
25 o /ower semicontinuous functional on L (ft), $ : L ( f i ) > R U {oo}.
p
converges to v in L (ft). Then a subsequence converges pointwise almost
everywhere to v by Lemma 3.1.3. We shall denote this subsequence again
by (t; ), noting that the subsequent arguments may also be applied to
n
p p
f(x,v(x)) - b\v(x)\ < liminf ( / ( x , v ( x ) ) n -b\v (x)\ ).
n
noo
p
f(x,v (x))
n -b\v (x)\
n > -a(x)
x
with a G L (ft), we may apply the Theorem 1.2.2 of Fatou to conclude
p
(f(x,v(x)) b\v(x)\ )dx < l i m i n f / {f(x,v (x)) n - p
b\v (x)\ )dx.
n
p
Since v converges to v in L ( f i ) ,
n
p p
f b\v(x)\ dx lim / b\v (x)\ dx,
n
JQ JQ
q.e.d.
*(*v + ( l = / /(x,*v(x) + ( l - * ) ^ ( x ) ) d x
< / {*/(x,t;(x)) + ( l - t ) / ( ^ ^ ) ) } d x
by the convexity of /
= **(v) + ( l - * ) * ( w ) .
g.e.d.
d
(i) / ( , v) fcs measurable for all v G E .
(ii) / ( # , ) 25 convex for all x Efl.
p d
(iii) f(x,v) > a(x) 4- 6|t;| / o r almost all x fl, all v R , with
x
aL (fl), 6 > 0 , p > 1.
1>p P
Le* # G i f (f2), and /e* A : = ^ + #o' (fi)-
F(u) := / f(x,Du(x))dx
Jn
assumes its infimum on A, i.e. there exists uo G A with
F(u )
0 = inf F(u).
liP
Proof. By Lemma 4.3.1, F is lower semicontinuous w.r.t. H (fl) con-
vergencef, and by Lemma 4.2.2, F then is also lower semicontinuous
1,p 1,p
w.r.t. weak H (fl) convergence, since H (fl) is separable and reflex
ive for p > 1 (see Theorems 3.3.1 and 3.3.3). Let ( u ) n e N be a minimizing n
sequence in A, i.e.
lim F(u ) n = inf F(u).
noo uA
Since
p
/ \Du \ n < \F(u )+ n \ [ a(x)dx,
Jn o b J Q
p p
(Du ) n
n ne is bounded in L ( f i ) , hence ( w ) n N C g+H^ (fl) is bounded n
1 p
in H *^) by the Poincare inequality (see Theorem 3.4.2). Since H^ (fl)
d
f Note that convex functions on R are continuous.
190 Direct methods
q.e.d.
p ,p
Remark 4.3.1. The condition u g^H^ (ft), i.e. u-g HQ (Q), is a
(generalized) Dirichlet boundary condition. I t means that u = g on dQ
in the sense of Sobolev spaces.
Definition 4.4.1. Let X be a metric space with metric d(-,-)> and let
F : X > R U {oo} be a functional. For X > 0, we define the Moreau-
x
Yosida approximation F of F as
x 2
F (x) := mi(\F(y) + d (x, y)) (4.4.1)
yex
for x X.
Remark 4-4-1- This is different from the definition in Section 5.1 where
2 a
we shall take d(x,y) instead of d (x,y). Here, one might take d (x,y)
for any exponent a > 1. For our present purposes, it is most convenient
to work with a = 2.
F ( ( * ) ) < t F ( ( 0 ) ) + (1 - i ) F ( ( l ) ) .
7 7 7 (4.4.2)
y =:
x
J\x)
with
x x 2 x
F (x) = \F(y ) + d (x,y ) (4.4.3)
Vo = \(Vi +02)
x x
F(yZ)<\(F(y )+F(y )), (4.4.4)
x
and by Euclidean geometry, i f y ^ y , we have
2
A 2
lk-J/o H <^(|k-^ir + ||x-^|| ), 2
(4.4.5)
hence
x A 2
XF(y ) + | | x - j/o || < mvi) + Ik - 2/2 If
A A 2
= AF(j ) + | | z - /
/2 2 2 || ,
x
contradicting the minimizing property of y and 1/2 Thus, we must have
2/i = 2/2 > proving uniqueness.
Existence: (4.4.5) may be refined as follows: For 1/1,1/2 i f and
2/o : = ^(2/1+1/2)
2 2 2 2
l b ~ 2/o|| = 5 ( | | * ~ yi\\ + \\x - 1/2II ) - \ lift - 2/2II (4.4.6)
192 Direct methods
2 2
XF(y )
n + \\x - y \\ n - inf (xF(y) + ||x - y | | ) = : . A (4.4.7)
yH \ /
Vi,k '= +
2 2
< \ (\F(y )k + \\x - y \\ ) k + \ (\F(y ) t + \\x - yi\\ ) - \\\yk yi\\ >2
(4.4.8)
By definition of K\ (see (4.4.7)), the left hand side of (4.4.8) cannot be
smaller than K\, and so we conclude that
2
llifc-wll -o
as fc, I > oo, establishing the Cauchy property. Since the norm is con
x
tinuous and F is assumed to be lower semicontinuous, the limit y of
(Vn)neN then solves (4.4.3). q.e.d.
x x
L e m m a 4.4.2. Let F and y = J (x) be as in Lemma 4-4-1- Let x be
in the closure of D(F). Then
x
x = l i m J (x). (4.4.9)
A+0
Proof. Since x is in the closure of D(F), for every 6 > 0, we may find
x B(x,6)
6 := {yeH :\\x~y\\<6}
with
F(xs) < oo.
Then
lim (\F(x ) 6 + \\x - 2
,511 ) < 62
and therefore
Xn 2
\\x-y \\ >a>0 for all n. (4.4.11)
4-4 Convex Junctionals 193
x Xn 2
]im8up(\ F(y )
n + \\x-y \\ ) < 0, (4.4.12)
hence
Xn
F (y ) -> - o o asn^oo. (4.4.13)
1 x 2 Xn Xn
Fiy ) + | | x - y \\ < F(y ) + \\x - y \f -+ - o o as n ^ oo
x
then (y )\>o converges to a minimizer of F as X oo.
Xn
Proof. Since (y ) eN n is bounded and since y Xn
minimizes
2
F(j/) + i - | | a ; - j , | | ,
we obtain
x
F(y )^ inf F(y)
yen
Xn
so that (y )nen is a minimizing sequence for F. We now claim that
A
n*- ir
is monotonically increasing in A. Indeed, let 0 < fi\ < fa- Then by
1
definition of y^
2 2 l 2
W ) + \\x - y^\\ > W ) + ||x - y^\\ ,
Mi Mi
hence
2 2
W 2
) + - | | x - y^\\ > F(y^) + \\x - y ||
M2 M2
+ l 2 2 2
( r - r ) ( l ^ - ^ l l - l l x - ^ i i ) .
\Mi M2/ V /
2
This is compatible w i t h the minimizing property of y^ only i f
2 2 2
| | * - J H I > I I * - 2 / ' I I
194 Direct methods
a
\\*-ir\\
is bounded independently of A since it is assumed to be bounded for the
sequence A > oo. We next claim that
n
x
F(y )
inf F(y).
yH
A
Indeed, from the definition of t / ,
x
F(y ) = inf F(t/),
{y.\\x-y\\<\\x-y>\\}
x x
and therefore y has to decrease since | \x y 1 1 increases. The limit has
Xn
to be i n f # F(y) since this is so for the subsequence {y ) eN-
y We now n
a
claim that (?/ )A>O satisfies the Cauchy property, i.e. for every e > 0,
there exists Ao > 0 such that for all A,// > A 0
x 2
\\y -y\\ <e.
For that purpose, we choose AQ SO large that for A, \i > AQ
< \ (4-4.14)
We let
F(y^) + {\\x-y^\f
x x 2 x 2
< * V ) + \{\\\ -y \\ + \ I I * - w " l l a
-\\\y - y"\\ )
>M .. 1
(ll~ ..AM , 2 6 1
|L,A
<F (y*) + ^\\ -y*\\+- -- \\y*-y\
x 4 4
by (4.4.14).
4-5 Euler-Lagrange equations 195
q.e.d.
d
on a bounded, open subset fl of R , and we make the following assump
( i
tions o n / : f i x E x E ^ E = E U { o o } :
d
(i) / ( , u, v) is measurable for all u E , v R .
(ii) f(x, , ) is differentiable for almost all x ft.
p p
(iii) \f(x,u,v)\ < c - f ci \u\ - f c \v\ , c , c i , c constants, for almost
0 2 0 2
1,p
Condition (iii) implies that # ( u ) is finite for u H (fl), since fl is
bounded. (If fl is unbounded, this still holds provided c = 0.) I n the 0
196 Direct methods
(iv)
df p p
(x, u, v) r(x,W, v) < c 3 + c \u\ + c \v\ ,
4 5
du dv
2=1
d
C3, C4, C5 constants, for almost all x G fl, and all u G R , V G M. .
p
Let u be a minimizer for # m he c/ass g + HQ* (l) (g G i f ^ f i ) given).
We then have for all <p G C^(ft)
f fd f d
df d *\
= 0. (4.5.1)
p
Proof. Since u is a minimizer for $ in # - f i f Q ' ( 0 ) ,
n
< *(w 4- tip) for t G E, <p G <7g( )- (4.5.2)
We have
By (ii), (iii), (iv), we may apply Corollary 1.2.2 to conclude that $(u+t(p)
is differentiable w.r.t. </?, and
$ ( u 4- tip)
= J | ^ ( x , u ( x ) +t<p(x),Du(x) +tDip(x))ip(x)
d</?(x)'
x d x 4 5 3
+J2^~i( ^ ^+^fr)+^( ))
x u
^?} - (--)
t=l
4-5 Euler-Lagrange equations 197
j $(u
t + t<p)\t= = 0. 0 (4.5.4)
Remark 4-5.1. From the preceding proof, it is clear that we do not need
to assume that u is a minimizer for I f suffices that u is a critical point
for # in the sense that
du
ij=l i=l
d
a : w x I ) u a :
+E^( ' ( )' ( )) - ^(x,u(x),Du(x))=0.
(4.5.6)
1=1
{ x u { x ) D u { x ) ) ] i p { x ) d x ( 4 5 7 )
- Y . S L ' ' - -
1=1
From Lemma 3.2.3 (applied to supp</? C C fi so that the term in { }
2
is in L ) , we then obtain (4.5.6).
q.e.d.
198 Direct methods
show that without such an assumption, in general one does not get
smoothness of minimizers. On the positive side, however, we do have de
Giorgi's and Nash's:
d d
T h e o r e m 4.5.2. Let ft be open and bounded inR , f : ft x R * E be
of class C, with
(i)
2 2
A M < / ( x , t ; ) < A ( l + |t;| )
and
d
for all x G fi, u E E , G E , with constants X > 0, A < oo,
(")
2
Let u G g + HQ (Q) be a bounded minimizer of F(u) :=
1,p
f(x,Du(x))dx (g G H (ft) given). Then u is smooth in ft (u G
/ n
C(ft)).
The proof of the theorem of de Giorgi and Nash is too long to be pre
sented here. We refer to M . Giaquinta, Introduction to Regularity Theory
for Nonlinear Elliptic Systems, Birkhauser, Basel, 1993, pp. 76-99 and
4-5 Euler-Lagrange equations 199
2
D(u) := I \Du(x)\ dx (4.5.8)
JQ
yP
in the class g + HQ (ft). By Theorem 4.3.1, a minimizer u exists, and
by Theorem 4.5.1, i t satisfies
ThenuC(Q).
Remark 4-5.2.
Uh{x) = u{y)dy
hjjici^)
as in Section 3.2. Given (p Co(Q), we restrict h to be smaller than
dist(supp</?, dft). We obtain
u(y)dyA(p(x)dx
= / u(x)A<ph{x)dx, (4.5.11)
JQ
Remark 4-5.3. We have also used the fact that A commutes with mol
lification, i.e.
(A<p) = A(<p ).
h h (4.5.12)
For this, one needs that g is a function of |x| only, i.e. rotationally
symmetric. Also, this point needs the rotational invariance of the Laplace
operator A . Therefore, the present proof does not generalize to other
variational problems.
/ u (x)Aip(x)dx
h =0 (4.5.13)
JQ
Au h = 0
4-5 Euler-Lagrange equations 201
/ K ( y ) l \dy
<
< / M * ) l \dx
< (4.5.14)
Jn
X
by Fubini's theorem, using ~ J ^ fe ^ dy = 1 by (3.2.3)
1
< oo since u L ( f i ) .
2
L e m m a 4 . 5 . 1 . Let f C (Q) be harmonic, i.e.
Af(x) = 0 into.
r
Then f satisfies the mean value property, i.e. for every ball B(xo, ) C 0,,
d
where uJd is the volume of the unit ball in R.
0 = / Af(x)dx
JB{X ,Q)
0
I j(x)da(x),
JdB(x ,Q)0
in polar coordinates UJ
Q
Q JdB(0,\)
1
Q -* f f(x)da(x))
dg
1
(dWd^" J DL
202 Direct methods
Thus,
d l
-i-y / f(x)dx = 4 M T - V T / f(x)da(x)) g ~d Q)
Uh(x ) 0 = ~ j u (x)dx
h
\u (xi)
h - u (x )\
h 2 < ~ (-) [ \u {x)\dx
h
dWd XT 1 / B(x ,r)\B(8 r)
1 2 )
X /
UB(i r)\B(*i,r)
2 l
Av = 0 in Q.
p 1 p
in the class A : = # + # d ' ( f i ) , with given # G i f (f2). By the Picard-
Lindelof theorem, the ordinary differential equation
P = (4-5.18)
1 1
admits a solution u(v) of class C ' . We then have
. . du du ,, .
( . ) - - = !. (4.5.19)
Since ^ > A~* > 0, the inverse function v(u) exists and is of class C '
as well, and we have by (4.5.19) and a chain rule for Sobolev functions
that easily follows from the chain rule for differentiable functions by an
approximation argument that
d d
a(u)DiuDiU ^2 F>ivDiV.
i=l i=l
Therefore, (4.5.17) is transformed into Dirichlet's integral
F(u) = D(v).
Since the latter admits a smooth minimizer, the original problem (4.5.17)
1 1
then admits a minimizer that is of class C ' in fi.
Exercises
4.1 Weaken the growth assumption required for | in (iv) of The
orem 4.5.1. Hint: Use the Sobolev Embedding Theorem.
4.2 Compute the Euler-Lagrange equations for the variational in
tegral
2
A(u) := / y/l + \Du{x) \dx.
Jn
(A(u) represents the volume of the graph of u over fi. Critical
points are minimal hypersurfaces that can be represented as
graphs over fl.)
204 Direct methods
lj
E(u) := / g (x)Diu(x)Dju(x) (detgij(x))^ dx,
Jn
u
where (<7 ())i,j=i,.,,,d is the inverse matrix of {gij{x))ij=i,...,d.
Assume that (0ij(x))i,j:=i,...,d is positive definite for all x G ft.
x 2
Show that for given g G H ' (ft), there exists a unique minimizer
1 , 2 1 , 2
of J? among all u G i ? ( f i ) with u g G i f ( f i ) . (Minimizers
for J? are harmonic functions w.r.t. the metric gij(x).)
5
Nonconvex functionals. Relaxation
d d
T h e o r e m 5.1.1. Let ft C R be open, l < p < o o , / : f 2 x E - > E
measurable and suppose:
d
(i) For almost all x eft, / ( # , ) is convex on R
1
(ii) There exist a G L (ft), b G E with
p
f{x,v) > -a(x) + b\v\
d
for almost all x G ft and all v G E .
Then
1,p
is Isc and convex on H (ft) equipped with its weak topology and assumes
lyP yP
its infimum in the class of all f G H (ft) with f g G H (ft) for 0
lyP
some given g G H (ft).
205
206 Nonconvex functionals. Relaxation
2 2 2
F(u) = (u (x) + (u'(x) - l ) ) dx.
We claim that
4
i n f { F ( u ) : u G H ^ ( ( 0 , 1 ) ) } = 0. (5.1.1)
Mn (0)=o = ti (l),
n (5.1.3)
F(u) = 0,
;
then u(x) = 0 for almost all x G (0,1) and | w ( x ) | = 1 for almost all
x G (0,1), and these two conditions are not compatible. (In fact, since
4
d = 1 here, any u G # Q ' ( ( 0 , 1)) is absolutely continuous, and so u = 0 i f
u(x) = 0 a.e., hence u is differentiable and u' = 0. (More generally, any
Sobolev function that is constant on some set A has a representative u
whose derivative Du vanishes on A.)
We have thus shown that the problem
M
F(u) -+ min in # 0 (fi)
noo
1
Therefore, F is not lsc w.r.t. weak H ^-convergence although the inte
grand is continuous in u'. As we shall see this results from the lack of
convexity of the integrand. We also observe that any sequence of saw
tooth functions u , i.e. satisfying
n
\u' \ = 1 a.e.
n
2
that converges to 0 in L is a minimizing sequence for F .
u'(t)=g(t,u(t),a(t)) (5.1.7)
a(t)=<p(t,u(t))
for an optimal control a at time t assuming a given state u(t) of the sys
tem. I f one knows the optimal control, one can reconstruct the evolution
u(t) of the state of the system from (5.1.6) and (5.1.7) under appropriate
assumptions. The simplest control equation (5.1.7) is
u'(t)=a(t),
208 Nonconvex functionals. Relaxation
/ f(t,u(t),u'(t))dt.
Jo
f 2 2
Expressions of the type (u (t) l ) can occur in many technical exam
ples, like boats sailing against the wind.
Proof. sc~F is Isc as a supremum of Isc functions, see Lemma 4.2.1 (iv).
Obviously, sc~F < F , and for all Isc $ w i t h # < F , we have
$ < sc~F
by definition of sc~F.
q.e.d.
(sc~F)(x )
0 < l i m i n f ( s c ~ F ) ( x ) by lower semicon-
n
noo , . _ .
tinuity of sc F (see
Lemma 5.1.1)
< lim inf F(x )n since sc~ F < F
n>oo
= inf F(y) since (x ) is a min-
n (5.1.8)
lmizing sequence tor b .
* ( * ) = inf
yX
(i) whenever x n x
(sc~F)(x) < l i m i n f F ( x ) n
noo
(5.1.12)
whenever y > x. Indeed, otherwise, for some 5 > 0, we would find some
v
diagonal sequence y ,n ^ as v oo w i t h
u u
noo n+co
and hence
< F"(x).
Thus, F"* is the largest lsc functional < F , and (5.1.11) follows from
Lemma 5.1.1. I t is then easy to see (and left as an exercise) that F ~ ( x )
satisfies and is characterized by the properties (i) and (ii).
q.e.d.
i i x A
t A ( x ) . = l *
{oo if X iA.
We then have
SC~%A i A,
G A
XA{X)1
={1 tf* iA.
Then
sc \A = XA
where A is the complement of X \ A.
d
Example 5.1.2. Let fl C R be open, 1 < p < oo,
p
/ : L (fl) -> R
defined by
/( ) = I L
U :
Du
\ \" d x
+ fa M " d x i f
C ^) 1
i oo otherwise.
1
(Note that 7(u) may also be infinite for some u G C (f2).) We claim
u d x d i f u 1 , P n
(*r/)() = ( /n l ^ l " + /n M " * # ( )
1 oo otherwise.
p
(i) (sc~I) is lower semicontinuous on L which yields condition (i).
The lower semicontinuity is seen as follows:
p
Suppose u u in L (fl).
n For the purpose of lower semicontinu
ity, we may select a subsequence (w)eN C (w )nN w i t h n
v>oo noo
and we may also assume that this limit is finite. ( W ) N then
c 1 p
is bounded in J f ' (f2). A subsequence of (w ) then converges v
p
weakly in H^ (fl) (Theorem 3.3.5), and by the Rellich-Kondra-
chev compactness Theorem 3.4.1, i t also converges strongly in
p
L (fl). The limit has to be u, because the original sequence (u ) n
liP
was assumed to converge to this limit. Since the H -norm is Isc
l,p
w.r.t. weak H convergence (Lemma 2.2.7), we have
1 , p
may find a sequence ( w ) N C C ^ f i ) f l ff ( f i ) w i t h
n n
p p p P
lim f/ |>u | + / K |
n ) = / |Du| + H ,
N
^ \JQ JQ ) JQ
i.e.
lim I(u ) n (sc~I)(u).
1
HugH *^), then
o K 1
'' \oo if w L P ( n ) \ c ( n ) , 0
p
instead of L ( f i ) . The relaxed functionals will be given by the same
formulae.
c 1 1
anymore by the J f ' -norm, but by the BV-norm which is defined in
Chapter 7.
F (x)
x : = inf (F(y) + \d(x,y)). (5.1.13)
\F ( )
x Xl - F (x )\ x 2 < Ad(xi,x ) 2 (5.1.14)
hence
F {xi)
x < F ( x ) + Ad(x ,x ).
A 2 1 2
\F (xi)-Fx(x2)\<\d(x x ).
x u 2
FA < 5C"F,
s u p F ( x ) < (sc'F)(x).
A (5.1.16)
A>0
F ( x ) + A d ( x , x ) <F (x)
A A x +j .
Therefore
lim x A = x
A+00
and
hp
F(u) := / f(Du(x))dx , (u G H (Q)0 given).
JQ
lyP
Then the relaxed function of F w.r.t. the weak H topology is given by
Proof. First
p p p p
/ \f (x)\ dx
n = I \f(nx)\ dx = ^ f \f(y)\ dy= f \f(x)\ dx
n
JW JW JnW JW
by the periodicity of / . Thus
\\fn\\ LP{w) = ll/H L P { W ) - (5-2-2)
In the same manner,
/ f {x)dx=
n [ f(x)dx= [ fdx. (5.2.3)
Jw Jw Jw
Let now WQ be a subrectangle of W, written in the form
d
W0 = a + bW (a = ( a , . . . , a ) , 6 = (&i, ...,&<*))
x d
5.2 Representation of relaxed functionals via convex envelopes 215
Then
= - d l (f(y)~f)dy
n
Jna+nbW
= A / ( / ( ) - f) V d
7 1
Jna+[nb]W
d
+ - d I (/() - f) y
n
Jna+(nb-\nb\)W
+ A / {f(y)~f)dy
n i+(nb-[nb})W
Jna+(nb-[nb})W
by periodicity of / .
The first term in the right-hand side vanishes by (5.2.3), and thus, again
using the periodicity of / ,
1/ {fn{x)-f)dx\<- d [ \f(y)~f\dy.
\JWo I 7 1
JW
lim / f (x)g(x)dx
n = / fg(x)dx. (5.2.5)
n
-*Jw Jw
Given e > 0, we then find subrectangles W\,..., Wk {k k(e)) and
\ e R (i = l , . . . , f c ) w i t h
{
< e (5.2.6)
L*(W)
q d
(The possibility of approximating L (fl) functions g (Q open in R ) in
such a manner by step functions can easily be seen as follows:
9
Since Cg(ft) is dense in L ( f i ) , there exist y> C(fi) w i t h
e
sup 2 meas supp (p
supp <p
216 Nonconvex functionals. Relaxation
Then indeed
Xi w
9 ~ Yl * * LP(Q)
Then
/ (fn(x) ~ f) 9(x)dx
\Jw
I {fn(x) ~ f)Y^XiXWi(x)
Jw {
x
+ {fn(x) - f) ^g(x) - ]T\XwA )
k
< I > l | / {fn(x)-f)dx +e||/ -/||
n
The first term tends to zero as n oo by (5.2.4), whereas the second one
is bounded by 2e | | / | | , P ( W ) by (5.2.2) and can hence be made arbitrarily
small. Therefore, (5.2.5) holds.
q.e.d.
(1) We put
and we claim:
L e m m a 5.2.2.
(sc-F)(u) < [ (q~f)(Du(x))dx.
JQ
P
H*' (W) with
n meas W J w
(5.2.8)
d
We extend (p periodically from W to E
n and put
and
u (x)
n := u(x) + - ( / ? ( n x ) . n
n
lyP
By Lemma 5.2.1, u n converges to u weakly in H . Then u n = u
on dW by periodicity of (p and ^ n n ( a v v = 0. We have
/ f(Du (x))dx
n = j f(v 0 + -D(p (nx))dx n
n
Jw Jw
= v D(
I f( o + Pn(y))dy
n d
JnW
/ W
f(v 0 + D<p (y))dy n (5.2.9)
since (p is periodic. n
n-KX) J w
Jw
The claim then follows from the characterization of (sc~F), see
e.g. Lemma 5.1.2(i).
q.e.d.
218 Nonconvex functionals. Relaxation
(2) We observe
With
(q~F)(u) := / (q-f)(Du(x))dx,
Jn
s c - F = 5C-(g~F), (5.2.11)
n
(Q- f)(v)
(Qf)(v).
n n
l i m (q~ F)(u) = lim f (q~ f)(Du(x))dx
noo n>oo J
Since by (5.2.12)
n
(sc~F)(u) < (q~ F)(u) for all n ,
P d
<p e H^ (U), UcR open, bounded}. (5.2.14)
l
g{v) < r I g{v + D<p{x))dx. T (5.2.15)
meas U J v
Proof. '=>':
Jensen's inequality says that i f / is convex, for every ip G
1 d d
L {R ,R )
meas U = 1
and put
ip(x) = v 4- D<p(x).
p
Since <p G H^ , f ip(x) = v meas U = v, and (5.2.16) therefore
implies that / is quasiconvex.
f{Vo) = f{vo)dx
^bjl
P
for all ip e HQ' {U).
d
We have to show that for all v i , v R , 0 < t < 1 2
d d
shall take a cube W : = (a, b) C R as our set U and construct a
family of functions
(v>) N C 6 H^{W)
with
on a set W C W w i t h meas
{l-t)( -V2) Vl d 1
W? = t(b-a)(b-a-) -
V(p (x)
n =
on a set Wg C W w i t h meas
-t(v x - v) 2
d _ 1
WT = (i-*)(fc-o)(fc-o-s)
and
n 1
It remains to construct (p . We divide the interval (a, 6) into 2 +
n
subintervals as follows:
h = (a,a+^(b-a))
l2 = (a+^(b-a),a+^(b-a))
h = (a+~(b~a),a+~(b-a) + ^(b-a))
i.e. the intervals hv-i have length ~(b a), and they alternate
with the intervals I 2v of length ^ ^ ( 6 - a). We then put
d 1
Wr'=((jl2,-i)x(a+- b--) - )
X
i/=i
n n
*$>=0*i = 2,...,d.
(p = 0 on dW,
n
2
linear i n x , Since
, /., t(l t),, ., , Ci
sup \ipn(x)\ < (6 - a)|vi - v | r *
2
n w
\- 2
n
we get
xdip (x)\n Ci
sup < n.
xW xEWfUW? ux
5.2.3.
q.e.d.
(4) We may now complete the proof of Theorem 5.2.1 From (2), we
know
(sc-F)(u) = J (Qf)(Du(x))dz
222 Nonconvex functionals. Relaxation
G(u) := j g(Du(x))dx
ltP
is a weakly H Isc functional < F . Therefore, from the defini
tion of sc~F, the convex function Qf must in fact be the largest
convex function < / . This completes the proof.
q.e.d.
Proof. Lemma 4.3.1 says that convex functionals are weakly lower semi-
continuous. I f / is not convex, then by Theorem 5.2.1 sc~~F ^ F , hence
F is not weakly Isc by Lemma 5.1.1.
q.e.d.
Remark 5.2.1. One may also consider variational problems for vector
d n
valued functions u : fl C R -+ E ,
F(u) := [ f(Du(x))dx.
Jn
d
Again, / is called quasiconvex i f for all open and bounded U cR and
p n nd
sl\ ip e H^ (U;R ), v eR
/ ( / (
^ n ^ / ^ ^
In this case, however, while convex functions are still quasiconvex, the
converse is no longer true. Theorem 5.2.1 continues to hold but w i t h con
vexity replaced by quasiconvexity. Also, one may consider more general
problems of the form
F(u) = J f(x,u(x),Du(x))dx
tinuous ones. For detailed references to the work of Morrey and other
researchers, see the book of Dacorogna quoted at the end of this chap
ter.
2 2 2
F(u) = j f * | u ( x ) + (u'(x) - l ) } dx
M
for u # ( ( 0 , l ) ) . F(u) is the sum of a functional which is continuous
o
2 1 4
w.r.t. strong L -convergence, hence also w.r.t. weak H * convergence,
and another one to which Theorem 5.2.1 applies. We conclude that
1 2
(sc~F)(u)= f {u (x) + Q(u'(x))}dx,
Jo
with
( v 2 1 ) 2 i f H
Q(v) = l - ^
\ 0 otherwise,
2 2
the largest convex function < (v l ) .
References
For the definition of relaxation and its general properties:
G. dal Maso, An Introduction to F-Convergence, Birkhauser, Boston 1993,
pp. 28-37.
G. Buttazzo, Semicontinuity, Relaxation and Integral Representation in the
Calculus of Variations, Pitman Research Notes in Math. 207, Longman
Scientific, Harlow, Essex, 1989, pp. 7-28.
For Theorem 5.2.1 and generalizations thereof:
B. Dacorogna, Direct Methods in the Calculus of Variations, Springer,
Berlin, 1989, pp. 197-249.
Nonconvex functionals. Relaxation
Exercises
2 2 1 4
F(u) = J (l-u'(x)) u(x) dx foruGtf '
with = 0 , u ( l ) = 1,
, 2 2 1A
F(u) = J {2x-u {x)) u(x) dx for ueH
with = 0 , u ( l ) = 1,
and
1 2 ; 2 1 4
F(u) = J ((u(x) - a) + ( u ( x ) - 1)) dx for u G / J '
with a G i
p d
Determine sc"I for / : L ( f i ) R ( f i G R open and bounded),
1
J(ii) : = { / n W <** + Jn cfa i f ti C ^ )
I oo otherwise.
Why does the proof of Lemma 5.3.3 not work for vector-valued
d n d n d n
mappings R -+ M with n > 1, i.e. # : M -+ R, v G R , y> G
p n
# o ' ( R , R ) as in Remark 5.2.1?
6
r~convergence
T-converges to F,
F = T- l i m Fn r
if
and
F(x) = lim F (x ).
n n
Example 6.1.1. F : R -+ R
n
1 for x >
n
F (x)
n nx for < x <
n n
-1 for x < .
n
Then
for x > 0
for x < 0
225
226 r'-convergence
while the pointwise limit is 0 for x = 0, 1(1) for x > 0 ( < 0).
Example 6.1.2. F n : R R
(r-UmF )(x) = 0 n
Example 6.1.3. F n : R R
nx
for 0 < x <
n
F (x)
n := I 1 2
nx 2 for < x <
n n
0 otherwise.
Then
-1 for x = 0
(r-limF )(x) = {
n
0 otherwise.
whereas the pointwise limit is again identically 0. Note that the F of n
(r-limF )^r-lim(-F ).
n n
Example 6.1.4- F : n
Example 6.1.5. F n : R R
F ( x ) = sinnx.
n
Then
(r-limF )(x) = - l ,
n
From Examples 6.1.4 and 6.1.5, we see that among the two notions of
pointwise convergence and T-convergence, neither one implies the other.
F (i/) G V
n
F n EE F : X -> E
r-limF = n (sc~F)
l i m s u p F ( x ) < F(x).
n n
noo
l i m s u p F ( x ) < F{x).
m
moo
Suppose that (ii') is satisfied for every x (i.e. for every m, there exists
m
a sequence {x ,n)neN
m converging to x with m
noo
228 T-convergence
tions so that
xm G U m for all m G N,
and that every sequence (y^^n with y G U^ ) for all v and some v u
Then
lim m = oo. n
n+oo
Namely, otherwise, we would find fco G N with
or
%k,n Uk u for all k > ko and some
sequence n oo for
z/ > oo.
To see that this is impossible we simply observe that since x ko G Uk0
r a
^fc ,n 4
0 0 f ^ sufficiently large n,
71 OO
we have
We then have
%m n ,n G f/m n
1
F (x
n m n , ) < ^OmJ +
n
F ( x ) < inf +
for Fj and
F(x) = l i m F (x ).
n n (6.1.1)
n-+oo
,
\imF (x )=F(x').
n n
e <6
n (6.1.3)
F (x' )<F(x')
n n +6 (6.1.4)
Since x n is an e -minimizer of F ,
n n
F )
n > F (x )n n - e n (6.1.6)
>F (x )-S
n n by (6.1.3)
>F(x)-2 by (6.1.5).
F(x) = l i m F ( x ' ) , n n
q.e.d.
with
lim Xm = x
m*oo
tions. We let
6 := ] (F(x) - l i m F(x j)
m >0 by (6.1.7).
4 \ ra+00 /
For every ra G N , we may find n m G N with
Fn (x n )-F(x )<6
m mt m m (6.1.9)
F(x)< lim i n f F n m (x ,
m n m ). (6.1.10)
ra>oo
and
F (x , J
nm m n > F(x) - 6. (6.1.12)
Equations (6.1.9), (6.1.11) and (6.1.12) are not compatible, and the re
sulting contradiction proves the lower semicontinuity.
q.e.d.
6.2 Homogenization
In this section and the next one, we describe two important examples
of T-convergence. They are taken from H . Attouch, Variational Conver
gence for Functions and Operators, Pitman, Boston, 1984.
In the discussion of these two examples, we shall be more sketchy
about some technical details than in the rest of the book, because
the main point of these examples is to show how the concept of T-
convergence can be usefully applied to concrete problems that arise in
various applications of the calculus of variations.
232 T-convergence
d d
Let M be a smooth subset of the open unit cube (0, l) of R . M is
considered as a hole. Let
M := c ( J e ( M + ra)
d
mGZ
(e(M + 777.) : = { x = y + em with ^ G M}) be a periodic lattice of 'holes'
of scale .
d
Let ft C R , fi : = ft \ ( M f l ft), i.e. a domain with many small
c c
holes. Such domains occur in many physical problems like crushed ice,
porous media etc. Often, the physical value of e is so small that it is
useful to perform the mathematical analysis for e 0. This is called
homogenization. Let
d
, x / x f0 forxGM \Mi
^ oo for x G M i
d
be the indicator function of R \ M\. a ( ~ ) then is the indicator function
d
of R \M . eWe consider the functional
2 2 2
F (u):=\e
e [ | D u ( x ) | d x + / a (-) u (x)dx (6.2.1)
2 V e
Jn Jn '
2
for u G # o ' ( n ) . A minimizer of the functional
F (u)e / f(x)u(x)dx
Jn
2
(for given / G ( f i ) ) satisfies
2
the requirement that u G jfiFo' (fi), while the boundary condition on dM e
r - l i m F = F, c (6.2.3)
with
where
2
fi(M):= j \Drj{x)\ dx= [ r/(x)dx,
d d
J(o,i) \M J(o,i)
6.2 Homogenization 233
2 2
L ( f i ) to fJ>(M) as e 0. Let now u G L (ft). By approximation, we
/1 2
may assume that u is smooth, e.g. contained in W ' ( f i ) f l C ( Q ) . We
put
2
Then u converges weakly in L (Q)
c to u, and
u 0
e on M . c
Moreover
^ e K ) = ~ / l ^ c l 2
(6.2.5)
2
^2
l^jf 2
2
(u \Drif + 2ur Du-Dri ]e e + 2 2
r \Du\ )
1
2 /x(M)
2
I f C/ C fl is open, because of the periodicity, J V |-DT7 | asymptotically
c
behaves like
meas U f .^ ,2 meas U
d d
J(0,e) \M e J(0,l) \M
2 2 2 2
lime / u \Drj \ e = fi(M) [ u (6.2.7)
2 2 2
lime / 77 |>u| = 0, (6.2.8)
tx, then
as before. We have
2
F (v )
e e + F (u )
e >e [ Dv Du e e
f (m
2
e !
V(M) w*Dve'Du + uDv -Dri ).
(6.2.12)
2
UminfF (t; ) + I r j r
e e f u > lim inf - 1 / uDv Dr] e e (6.2.13)
since the other term on the right hand side of (6.2.12) goes to 0 by a
similar reasoning as above. Equation (6.2.4) implies
2
e Ar) e = -1 in fi . c (6.2.14)
Moreover
6 2
Duv D e Vt <e 2
(^j \Du\ v )j ' (^J 2 2
\D \ ^Ve
2
' - 0, (6.2.15)
2
since v as a weakly converging sequence is bounded in L , \Du\ is
e
u 2
=
M M ) i '
6.3 Thin insulating layers 235
2
since v converges weakly in L to u. This implies (6.2.11) and concludes
e
the proof.
q.e.d.
S c : = fl f l S e
fl c :=fl\S . c
Conductivity coefficient
_ f 1 on fl c
a c A :
' ~\A onE c (A>0).
Variational problem:
C A 1 , 2
J ' :# 0 ( f l ) -> R
2 2
r*(u):=\f \Du\ dx+~f \Du\ dx (6.3.1)
2 2
JQ / c
c A 2
J ' (u) - / fu -> min (/ L (fl) given).
Au eA + / = 0 on flc (6.3.2)
AAu C ) A + / = 0 on S c (6.3.3)
W ,A|Q
C C = W ,A|E
C c on d f l f l d c c (6.3.4)
A on d f l f l d
c c (6.3.5)
dux*
(where nu denotes the exterior normal of a set U)
u iX = 0 on d f l . (6.3.6)
236 r -convergence
w/iere w solves
Aw + / = 0 on fl \ E
w|an = 0
^p
and
= ^a n | = H E
2
w/iere is the jump ofu across E, and and ore the exterior
normal derivatives for the two components of ft \ E. (In case a = oo, u
is continuous across E, and! A u = / in fi.) Furthermore
2 2
\ f l^e,A| + ^ / |^e,A|
2
/ | D u | + ^ / [u]|dE.
e , A 2
J r-converges w.r.t. the weak L -topology to I(u):
e 2
0 < a < oo : I(u) = ( J / n \ l ^ f + f # " o ' ( " \ E)
I oo otherwise
2
a = oo:I(u)=iUn\Du\ ifuZH^{Sl)
I oo otherwise
(in this case,
the result holds
2 2
a = Q : 1 { u ) = / \ J n V E \Du\ if u e H^ (Q \ E) for the strong
L
loo otherwise -topology
in place of
the weak one)
Thus, in case a = 0, we obtain a perfect insulation in the limit,
whereas for a = oo, the limiting layer does not insulate at all.
3
We assume for simplicity S = {x = 0 } .
Proof.
2 2
[ \Du ,\\
e + \ f \Bu , \ e x
n
Jn e JdQ n Jz
+ A / W C , A ^ ~ -
2
^ l/L (Q)' M
l .Alx,2( ) N
2
/ < A <C2 [ \Du , \ e x
2
[ < A <C3 / | ^ C , A | .
By a change of scale
3 3 1 2 2
y = ex (y = x\y = x ),
2
( < A <c e 3 [ \Du \ yX
2
(we only get e instead of e , because the area of the portion of # E on C
2 2
< A < C 4 ( l + ^) (jf \Du , \ e X + \J^ |> W e ,A| )
Proof (Theorem 6.3.1). We only consider the case 0 < a < oo (the
other cases follow from a limiting argument). We first observe
c A 2 2
T- lim J ' ( u ) = oo i f u e L ( f i ) \ H^ (Q \ E).
1 2 1 2
3
if rc <.
x
1 2 1 2
+{u(x ,x ,e)-u(x ,x ,-e)}
2 2
Then u # o ' ( f i \ E), w w weakly in L ( f i ) for c -> 0,
e
J(ti ) e
2 2 2
= 5 / \Du\ + ^ [ \\D(u(z\x ,e)+u(x\x ,-e))
2
* JQ J\x*\<e \*
/ 3 \ ^
+D ( y (u(x\x ,e) 2
- u(x\x ,-e))J 2
2 2 2 2
- I I \Du\ + ?L [ \D(x*(u(x\x ,e)-u(x\x ,-e)))\
D u 2 2 2 2
~\l \ \ + rf Hx\x ,e)-u(z\x ,-e)\
z 3
-f terms that contain x and go to zero as e > 0 ( | # | < e).
If u is smooth (which we may assume by an approximation argument),
therefore for e > 0
2
i2 / |D| + f /[]
JQ\E 4
/E
c
liminf/ (t; )>/(u). c
e>0
For u as above,
e
2 2
~ f \Dv \ e + ^ [ \Du \ >aef e Du e Dv (
vE e 'Eg </ E e
> f / e (!>{u(.,6)+u(.,-6)}
+ ) { ( - , e) - ( - , - e ) }
+^(u(., )-u(.,- )) e e
3
where 3 of course is the unit vector in the x direction.
6.3 Thin insulating layers 239
3
> lim inf / (D e) - -c))) D f z e
c
Without loss of generality l i m i n f _ ^ J (^e) < oo. Then e 0
2
supe / \Dv \e < oo. (6.3.7)
Consequently,
c a e Dv
limae J Ql)w(-,c) + ^ > w ( - , - ) ^ >v < (^J \ *
cae
-+ 0 for e -+ 0.
Similarly,
3
lim sup a / (D (w(-, e) - it(-, - e ) ) ) Dt; x c -* 0
3
since | # | < e. Thus
3
Since u(-, c) and u(-, e) do not depend on x , we obtain by integration
^ / e -/?v (w(-,c)-w(-,-c))
3
= 77 / v
e ) - W(., - ) ) - ^ / V ( u ( . , C) -
C -c)) ,
3
where here of course dUf = 0 0 {x = e}. Since we may assume
c ly2
l i m i n f _ o ^ (^c) < oo, v is bounded in H (Q ).
c e Therefore, we may e
240 r-convergence
-
assume that the traces of v on d converge*) . Since u is assumed smooth
e c
2
and v converges to u weakly in L , we may assume
e
^,(0E ) e - w(',0 ) weakly in L ( d ) . 2
We then get
h
e ^2 / ^Dv (u(-,e)-u(-,-e))
{ = - / []..
J Eg J E
Altogether
Therefore
2
liminfrK)>i / \Du\ + ^ f [u)l.
q.e.d.
Exercises
6.1 Determine the T-limits of the following sequences of functions
F (x)
n := n(smnx + 1)
for a: = 0
n z 2
for 0 < x < -
F () : - | ^
n Q f^ I
^ ^
^ n2
2
2n - n x for < x < ~
n n
F (x) : = sinnrr + cosnrr.
n
Does one get equality instead of ' < ' here? (Hint: Consider
F (x) sinnrr, G (x) = - s i n n r r . )
n n
f For this technical point, see e.g. W . Ziemer, Weakly Differentiable Functions,
Springer, G T M 120, New York, 1989, pp. 189ff.
7
BV-functionals and T-convergence:
the example of Modica and Mortola
d
L : C$(R ) -* R
L(f) = f fudpi.
d
JR
Conversely, we have the Riesz representation theorem, given here with
out proof (see e.g. N . Dunford, J. Schwartz, Linear Operators, Vol. I ,
Interscience, New York, 1958, p. 265).
d
T h e o r e m 7.1.1. Let L : Co(M ) R be a linear functional with
d
\\L\\K : = s u p { L ( / ) : / C ( E ) , | / | < l , s u p p / C K] < oo
0 (7.1.1)
d
for each compact K C R . Then there exist a Radon measure ii on
d d
R and a ji-measurable function v : R R with \u\ = 1 ji-almost
everywhere with
d
L(f)= f fisdfi forallfC(R ). (7.1.2)
241
242 Modica-Mortola example
d
Thus, the Radon measures on R are precisely the nonnegative linear
d
functionals on C o ( R ) . (Note that (7.1.1) automatically holds i f L is
nonnegative; namely
\\L\\ K = L( ) XK
d
for any open ft C E .
The expression z/d// in (7.1.4) = 1 //-almost everywhere) is called
a vector-valued signed measure. (// is supposed to be a Radon measure
and v a //-measurable function with values in H.)
d
D e f i n i t i o n 7 . 1 . 1 . Let ft R be open. The space BV(ft) consists of
1
all functions u L (ft) for which there exists a vector-valued signed
measure z/// with //(fi) < oo and
\\Du\\(n) :=// =
d G d
sup { / udivgdx
n :g = (g\...,g )C% ( f i , R ), \g(x)\ < 1
for all x fi}
< oo
and
n
W \\ v(n) :=IHlLt(n) +
u
B IPll()-
7.1 The space BV{SI) 243
||Z>||(no)=: / ll^ll,
/Oo
Jftn
and also
\\Du\\(f)
|Sld-i = / dS.
Js
d
L e m m a 7.1.2. Let E be a bounded open set in R with a boundary dE
of class C. Then
1 ^ 1 ^ = 1 1 ^ 1 1 ^ ) , (7.1.7)
[ dlvg= [ g(x)n(x)d(dE)
JE JdE
/ divg = f ipd(dE).
JE IdE
JdE
Consequently
sup j ^ d i v : c ? e C5 0
o o d
(lR lR ),|5| <
)
d
l j
in that case.
d
D e f i n i t i o n 7.1.2. A Borel set c R has finite perimeter in an open
set fi if X E \ BV(Q).
N The perimeter of E in fi m that case is
P(E,Q) := \\DxE\m
d
( = sup|^div f f : C (n,R ),| | < l | ) .
5 0 5 (7.1.8)
d
E is a set of finite perimeter if XE S V ( R ) .
7.1 The space BV(Q) 245
then
u BV(Q).
d
Proof. Let g C${U,R ) w i t h \g\ < 1. Then
i/ nn
Diu((p) : = - / uDup,
in
symmetry condition
d
p {x)
h := h- p{j^
l l d
and for u L (fl), we extend u to L (R ) by defining u(x) = 0 for
d
x R \ fl and put
u (x)
h : = p^ * u(x) : = / p^(x - y)u(y)dy C(Q).
In particular,
Proof. Uh u in L (ft) l
by Theorem 3.2.1. I t suffices to consider the
case / > 0. Prom (7.1.3) i t follows as in the proof of Theorem (7.1.2)
that for every / C (fi) with / > 0
/ f\\Du \\ h =
Ja
s u p j y g(x)Du (x)dx
h : g Cf(Sl,R),\g(x)\ < f(x) V x fij .
(7.1.16)
Here, Dun = ( g f r w ^ , . . . , ^ J U A ) is the gradient of u/,, since UH is smooth.
7.1 The space BV(ft) 247
= - J J p (x - y)u(y)dy
h divg(x)dx
\\Un ~ ^ n | | i ( n ) <L n
\\Dv \\(Q)<K
n + l.
lyl
Therefore ( f ) n N is bounded in W (ft).
n By the Rellich-Kondrachev
compactness theorem 3.4.1, after selection of a subsequence, ( f ) n G N n
1 1
converges in L (ft) to some u L (ft). (u ) has to converge to u as well
n
x
(in L ( f i ) ) . By Theorem 7.1.1, u BV{Q), and
\H\ <K.
BV
q.e.d.
248 Modica-Mortola example
7.2 T h e e x a m p l e o f M o d i c a - M o r t o l a
We now come to the theorem of Modica-Mortola:
T h e o r e m 7.2.1. Let
v oo otherwise,
] D U l = l | Z ? U | 1 U B V ( R d )
F{u) := { t L *
\ oo otherwise.
1 d
Then w.r.t. to L (M ) convergence
F = T- l i m F . n (7.2.1)
Proof.
\h (s)
n - h {t)\
n <\s-t\ for all n G N , 5, t G .
Therefore
Also
We now obtain
, 2 , 2
< ||/i ou -/i ou|| -f h ou u
nn ou n u n n n L 1
n
1
L 1
L
7T 7T
0 as n oo (7.2.5)
l i m i n f F (u )
n n > 2 lim inf / \Du \ |sin(7rnw )|
n n
= 21iminf / \D(h ou )\ n n
noo J
= F(u).
This shows (7.2.2).
x d
(ii) We want to show that for every u G L ( E ) , there exists a se
1 d x d
quence (tXn)nGN C L (R ) converging to u in L ( R ) w i t h
v v+l
n n
and satisfying
/ \Du(x)\dx^ I [u^^l^dt
d
Ju Ju
oo rl
* E / " l ^ w L v
J
l / = - 0 0 n
OO j
u _ 1
* E ^l (^)L-i
v oo
oo 1
=
2 D
^ l l X { t i > t , , } | | by Lemma 7.1.2.
n
t>= oo
N(n) 1 ^
t/=-AT(n)
n>oo
n
s u p p u C suppu, n
2=1
where the fti are disjoint bounded open sets with bound
ary dfti of class C. Since the general case is completely
analogous, for simplicity, we only consider the case N = 1,
i.e.
u = an X (7.2.8)
Q G R , na n G Z ,
2
4>n{x) : = fjy ^ + nsin (7mx())} *
Lipschitz functions \n R R
XnW = 0 /orKO
with
lim a n = a,
noo
fin := jxefi:d(x)<--^=J
and
w (a:) : = Xn(d(x))
n w i t h %n as in Lemma 7.2.1.
(7.2.11)
Then
d
u (x)
n = 0 for xeR \fi
(x) = a
n n for X G fi \ fi n
We also note
lim I f i J . = 0. (7.2.12)
lim / |u(x) - u ( x ) | dx = 0,
n (7.2.13)
1
and u n converges to u in L . We also let (as in Appendix
B)
d
E : = {x R : d(x) = t}.
t
We note
d
Du (x)
n = 0 , sin(n7n/ (z)) = 0 n for x e R \ fi ,
n
(7.2.14)
and
Then
lim sup F ( u )
n n
l D U n { x ) l 2
= limsup / ( + nsm (n7ru (x)) n \ \Dd{x)\dx
l i m s u p ^ ^Xn(*)l 2
+ n s i n 2( n 7 r X w ( )) j
t
>
rft
n->oo y)<t<^= y
4
< -a by Lemma 7.2.1 and Lemma B . l
= F(u) (cf. (7.2.9)).
This is (7.2.7).
(4) I t only remains to prove Lemma 7.2.1:
The idea is of course to minimize 4> {x) under the given n
are
=
^x" 7rnsin(7rnx) cos(7rnx),
2 2
^ X ' = s i n ( 7 m x ) + ci. (7.2.16)
*n(0 : = / ~ I 1 ^ r I
2
ds
7o n ^ +sm (n7T5) y
Vn : = </>n(c*n).
Then
We let
Xn : [0, Vn] -+ [0, a ]
254 Modica-Mortola example
1
be the inverse of i/j . Then x-n is of class C
n and
Then
<MXn)
+ nsin^(7rnxn(0) I ^
2
^ v ' (f> / 1
X b 1 2
' - + n ( - +sin (7rn nW) X
n \n
i
2
n + sin (7rn n(0))' X n b y
X (7.2.17)
2
= 2 / f h sin (7rns) 1 ds
io V /
and Lemma 7.2.1 follows.
q.e.d.
References
L. Modica and St. Mortola, Un esempio di r~-convergenza, Boll. U.M.I. (5),
14-B (1977), 285-99.
L. Modica, The gradient theory of phase transitions and the minimal
interface criterion, Arch. Rat. Mech. Anal. 98 (1987), 123-42
a
F {u)
n := { In ( IP(*)H + nW(u(x))) dx for u e H{Q)
v oo otherwise
and
u
Fo(u) = { h 2 c
H^ ll for u e
BV(Q) and for almost all x G E
I oo otherwise
7.2 The example of Modica-Mortola 255
with
co= W2(s)ds.
Ja
1
Then F is the T-limit of F w.r.t L -convergence.
0 n
The proof is similar to the one of Theorem 7.2.1, except that we cannot
apply Sard's lemma anymore, because even for a smooth function u, a
and (3 need not be regular values. Thus, one has to consider nonsmooth
level sets as well and appeal to some general results about BV-functions
and sets of finite perimeter.
The interpretation of Modica's theorem is the following:
Consider first the problem
W(u(x)) dx min
L
under the constraint
1_ /
meas S 2
u(x) = 7,
with a < 7 < (3 (w.l.o.g. assume a < (3). A minimizer then is of the
form
f a for A\ C ft / p ? n l o ,
( ? 2 l 8 )
= for^Cfi - -
such that Ai U A<i = ft,
u thus jumps from the value a to the value (3 along dAiilft = dA2C\fl =:
y
Exercises
d
7.1 Try to construct bounded sets in R that do not have a finite
perimeter.
7.2 Prove the preceding theorem of L . Modica for d = 1.
Appendix A
The coarea formula
has one-dimensional Lebesgue measure zero, and thus, for almost all
1
t G R, w~" (0 is a smooth hypersurface by the implicit function theorem.
d
We then have for every open ft C R
Proof.
I = A O 7T o R.
d
For every measurable subset E of R , we have by Fubini's theorem
\E\d = ^ {Enir-Wl^dt,
J OO
where
\E\ d = [ XE
257
258 Appendix A
{Enr'is^ds. (A.2)
Since \A\ = |dZ|, and / is linear, this is the coarea formula for
linear maps.
(2) Let
d
S u = {x G R : Du(x) = 0}
d
U := {x G R
t : u(x) > t} for t G R.
We put
f Xt/ t at > 0
U t
" 1 -Xn*\c/ t i f * < 0.
Then
u(x) = / u (x)dt.
t
JR
d
Let G Q ( R \ S ), u \(p\ < 1. Then
- I RS JR d
d
R \ S , we may apply the divergence theorem to obtain
u
and
jR*\U t J (dUt)r\R<*\S u
The coarea formula 259
/ Du(x)ip(x)dx
d
JR
= [ [ <p(x)n(x)d(dU )(x)dt t
d
Jm Jdu nm \s t u
1 d
< / \ - (t)nR \S \ _ dt
u u d 1 since we assume \(p < 1|
JR
JR
d
(3) We now prove the reverse inequality. We let l n : R > E be
piecewise linear maps w i t h
lim / \l - u\ = 0
n (A.5)
n d
-^ jR
Let
d
U? := {x e R : l {x)
n > t}.
T := Ti U T . 2
Appendix A
e
/ \Xt-Xt\ (A.9)
<M [ | X t - ?|ds<5.
x (A.10)
div g{x)dx + -
2
(*)>*}
<
J{1
// div g(x)dx + e
./0{M*)>t}
n(x) denoting the exterior normal of {l (x) n > t}
1
<K (t)\ _ d 1 +^
Thus, for t < T,
/ R d I t i - V * ) ! ^ ! * < l i m t a f jf I ^ W I ^ dt
n->oo JRd
= / |Du(x)|dx. (A.ll)
g+(t) : = m a x ( 0 , (?(*))
g~(t) : = m a x ( 0 , -g(t))
with
lim p n = 0
n*oo
oo
X) Pn = OO,
n=l
and put inductively
d
Then for all x R
oo
0(x) = ] T p X A ( z ) .n n (A.13)
n=l
Remark A.l. The coarea formula is due to Federer. I t holds more gen
d
erally for Lipschitz functions u : R > R. See H . Federer, Geometric
Measure Theory, Springer, New York, 1969, pp. 241-760, 268-71.
Appendix B
The distance function from smooth
hypersurfaces
We also need some elementary results about the (signed) distance func
d
tion from a smooth hypersurface. Let ft C E be open with nonempty
boundary dft. We put
M \ _ / dist(x,dft) if x G ft
d : d
W~ \_dist(x,0ft) ifxGE \a
< \x - 7r | < |x - y\ 4- |y - ?r | = |x - y| 4- % ) ,
y y
\d(x)-d(y)\ <\x-y\.
2
We now assume that dft is of class C . Let #o dft. Let n ( x ) be the 0
outer normal vector of ft at #o, and let be the tangent plane of dft
d
at xo. We rotate the coordinates of W* so that the x coordinate axis
1
is pointing in the direction of - n ( # o ) . I * some neighbourhood U(xo) of
#o, # 0 can then be represented as
(B.l)
1 1 d 1 2
with x = ( x , . . . , x - ) , where / G C ( T i D C/(x )), Df(x' ) = 0. The 0 0 Q
2
Hessian D f(xo) is symmetric, and therefore, after a further rotation of
coordinates, it becomes diagonalized,
( Kl 0 \
2
D f(x ) 0 = (B.2)
262
The distance function from smooth hypersurfaces 263
2
K I , . . . , K d - i are the eigenvalues of D f(xo), and they do not depend on
the special position of our coordinates. They are invariants of dfi, and
are called the principal curvatures of dQ at x$. The mean curvature of
dft at #o is
d _ 1
1 1
H ( x o ) = = A
E ^ rfn /(^o). (B.3)
2 =1
The outer normal vector n(x) at x G dfl f l ^ ( X Q ) has components
TZ(X')
n'(x) = ^ ' ^ ' , , i= l,...,d-l (B.4)
d
n (a;) = (B.5)
( l + |Z?/(x')|)'
1 d _ 1
(#' = ( x , . > x ) ) . I n particular
d
L e m m a B . l . Suppose is open in R and that dfl is bounded and of
k
class C with k > 2. For rj e R, pu
d
:= { x R : d(x) = r/}.
TTiere exists eo > 0 (depending on dQ) with the property that for
M < c, 0
k
is a hypersurface of class C . vMso,
2
Proof Since d f i is compact and of class C , there exists e > 0 with the
following property: Whenever \rj\ < e for each x$ d f i , there exist two
d
unique open balls B B with B C fi, JB C R \ f i ,
u 2 x 2
Bi n an = x = B n an 0 2
2
of radius |r/|. The eigenvalues of the Hessian D f(xo) of a normalized
representation / of d Q at #o as above then have to lie between - and
i,i.e.
\i*\<\ (B.8)
#o dft. Thus, i f \rj\ < e, each x is the centre of such a ball, and
( 1 - Kid(x) 0 \
DF by (6) . (B.ll)
K -\d(x)
d
V 0 1/
By (B.8) and since \q\ < e,
det D F ^ 0.
k x
By the inverse function theorem, x' and d therefore locally are C -
functions of x (cf. (B.9)). Since
we have
Dd(x) - U(XQ) = 1.
\Dd(x)\ = 1
and
1
Dd(x) =-n(x ) 0 C*- .
k k
Thus d e C locally, and the level hypersurfaces Y> are of class C . For v
(B.7), we may w.l.o.g. take n > 0 as the case n < 0 succumbs to the
same reasoning. We consider the vector field
V(x) = Dd(x).
J{0<d(x)<rj} JHo
The distance function from smooth hypersurfaces 265
References
D. Giibarg, N. Trudinger, Elliptic Partial Differential Equations, Springer,
Berlin, 2nd edition, 1983, pp. 354-6.
8
Bifurcation theory
1
A is supposed to vary in some open set A C E . Often, one has / = 1.
We assume that
F : [a, 6] x R x R x A R
d d
u(a) = ui(X)
u(b) = u (\),
2
a = a(A)
6 = 6(A).
266
8.1 Bifurcation problems in the calculus of variations 267
Let
T(-,A):[a(Ao),&(Ao)]-[a(A),6(A)]
dr(t, A)
dt
rb(X )
0
F (t,
pp u(t), u(t), X)u(t) + F (t, pu u(t), ii(t), X)u(t) (8.1.1)
+ F ( * , u(t), ii(t), A) - F (t, u(t), ii(t), A) = 0.
pt u
We abbreviate (8.1.1) as
L\u = 0. (8.1.2)
In the light of Theorems 1.2.2 and 1.2.4 and Lemma 1.3.1 of Part I , we
shall assume
det F (t,u(t),u(t),X)
pp ^0 (8.1.3)
for all functions u occurring in the sequel. Equation (8.1.3) implies that
(8.1.1) can be solved for u in terms of u and i.e.
1
ii = -Fpp(t, ti(t), ii(t), A ) " {F (t, pu u(t), u(t), X)u(t)
+F (t,
pt u(t), u(t), A) - F (t, u{t), u(t), A)}
u (8.1.4)
=:f(t,u(t),u(t),X),
268 Bifurcation theory
u(t) ~ / ( t , u { t ) , A) = 0. (8.1.5)
2 2
I(u 4 sr), A) = J(u, A) 4 ~s 6 I(u, ly, A) + o(s) for s - 0, (8.1.6)
with
2 b
d f
2
6 I(u, r/, A) = Q (n)
x := ^ J F(t, u(t) 4 sn(t), u{t) + A)cft, a==0
= / {F i j{t,u,u,X)rji7jj
p p + 2F t (t,u,u,\)rjirjj
p UJ (8.1.7)
4- F i (t,u,u,\)r)ir)j}dt,
u uj
abbreviated as
b
{ F , r ) r ) 4 2F p 'nV + F uVV} dt.
f A pp Xy U XyU
d_
J\(u)n = L (u 4 srj) . x ls=0 (8.1.9)
ds'
Of course, this is not surprising since L \ represents the first variation of
/ ( , A) and J\ the second one. From the expansion (8.1.6) we see that
2
I(u 4 si/, A) < A) if 6 I{u y n A) < 0.
y (8.1.10)
Jx(u)rj = 0, (8.1.12)
2
H^Hc (7) 1-
By the Arzela-Ascoli theorem, after selection of a subsequence, (f] )neN n
l
then converges in C (I) to some limit denoted by 770. (8.1.13) then i m
plies that (77) N converges in C(I) (as it follows from our assumptions
n
(8.1.13). From this compactness result, one easily deduces that the space
of solutions of (8.1.13) has finite dimension.
270 Bifurcation theory
Lu
x = 0, (8.2.1)
with
V x A-> W
(u, A) H-+ L\u.
J {u)v
x : = (D L {u))v
u x : = ^ L ( u + tv)^
A 0 (8.2.2)
Lu
Xo 0 =0 (8.2.4)
We shall see that in this case, no bifurcation can occur at Ao- Namely,
we have:
Lu
x = 0.
8.2 The functional analytic approach to bifurcation theory 271
JA 0 : V - W
VxA->W
(u, A) i L\u
L\u = 0
U(X ) 0 - V(u ) 0
A i-+ u(X)
such that
Lu x = 0
precisely i f
u = u(X).
q.e.d.
Lu Xo 0 = 0 (8.2.6)
K := ker J \ ( U Q ) is one-dimensional.
0 (8.2.7)
u = 0
Q
272 Bifurcation theory
V = KV U (8.2.8)
W = KW U (8.2.9)
with
We denote by
:V-+K
u = -f w
with K, w V i , then
n{u) = .
I n particular,
TT(0) = 0.
AXo :V^W
u LA W +
0 n(u).
L e m m a 8 . 2 . 1 . A\
0 is a local diffeomorphism, i.e. the derivative
DA\ 0 = DA\ (0) : V W is an isomorphism.
0
DA v
Xo = J A o f + 7r(v) for v G V. (8.2.11)
isomorphism.
q.e.d.
8.2 The functional analytic approach to bifurcation theory 273
A: V x A - > W
(tx, A) >-+ A\(u) := L\{u) + 7r(tx).
i4(0,A ) = 0,
0 (8.2.12)
with
A(u,\) = , (8.2.13)
i.e.
L n + T T ( U ) = .
A (8.2.14)
We write
tx = tx(f,A)
u ( 0 , A ) = 0,
0 (8.2.15)
since L\ 0
o = 0, 7r(0) = 0 (remember txo = 0). In this notation, (8.2.13)
is
A{u{t,\)A)) = Z-
T(tf,A))=, (8.2.16)
L tx(,A) = 0,
A (8.2.17)
d u and d\u,
a respectively.
Since G K, also
0
</A o + 7r(o) = o.
0 (8.2.19)
du = .
a 0 (8.2.20)
We are now ready for the essential point, namely the asymptotic expan
sion of the equation (8.2.16), i.e.
7r(u(,A)) = (8.2.21)
near 0, A = Ao-
2
We let d u, d\u be the second derivatives of u(ao> A) w.r.t. a and A,
respectively, at a = 0, A = Ao, and likewise d\ u be the mixed second x
2 2
0 = ir{d\u)fji - f ^n(d u)a 4- higher order terms in a only (8.2.23)
-f 7r(<9^ u)a/i
x - f higher order terms that also involve / i .
Thus, the first term in the expansion of Q in (8.2.24) can be expressed via
DJ\. I n a variational context, J\ represents the second variation, and so
DJ\ represents the third variation of the variational integral. Likewise,
2
if d u vanishes, i.e. if the third variation vanishes on the Jacobi field o>
then 7r(d^u) can be expressed by the fourth variation, and so on.
2 2
We put a : = tr, ji = : /i, ao : = 7v(d\u)p, a\ : = ~7r(d u), y and (8.2.23)
becomes
2 2 2 2
0 = at 0 + ait r + t E ( t , r, p) (8.2.29)
with
2
0 = a + a i r - f (, r, p) = : #(, r ) .
0
(8.2.33)
276 Bifurcation theory
# ( 0 , T i ) = 0 f o r i = 1,2, (8.2.34)
whereas
n{t) : ->R
for i = 1,2, 0 < |t| < , for some > 0 that satisfy
$(t,r<(t)) = 0. (8.2.36)
hence (8.2.16), hence (8.2.17), i.e. (8.2.1) for t ^ 0, for the parameters
2
A = A + t Jl.
t 0 (8.2.37)
In the other case, ^ > 0, (8.2.30) implies that for sufficiently small
\t\ ^ 0, there is no solution of (8.2.33), i.e. of (8.2.1). Thus, as promised,
the bifurcation behaviour in case 7r(9^n) ^ 0 ( (8.2.28)) is completely
described by the simple quadratic equation (8.2.32). Of course, replacing
ft by ft changes the sign of ao and thus interchanges the cases ^ > 0
and < 0.
ai
We summarize our result in:
T h e o r e m 8.2.2. We consider a parameter dependent family of equa
tions
Lu
x = 0 (8.2.38)
as above,
V x A->W
(u, A)i L u,
x
L0
Xo = 0,
J\{u)v := (D L\(u))v u
L {u
= -j;L x + *v)| t = 0 ,
dt
we assume that there is a canonical isomorphism
and we let
a 0 : = 7T(d u)ft
x (= j v(u(Q, t Ao + t/2))| =o) ^ 0t (8-2.41)
2 a i :=7T(dlu)^0 (8.2.42)
(nonvanishing of the third variation, see Remark 8.2.1). Then there exist
2
e > 0 and a variation X = Ao - f t ft of Ao with the property that for
t
number of solutions u U of t
Lu Xt = 0 (8.2.43)
2
dimensional, we have simply considered ir(d u), 7r(d u) as scalar quan x
tities.
but
n(dlu) ^ 0. (8.2.46)
278 Bifurcation theory
3 w a i
0 = ir(d\u)ii 4- ^7r(dlu)a + 7 r ( ^ a , A ) / + higher order terms. (8.2.47)
7r(9 w)/ii + 0
A (see Lemma 8.2.2 below) (8.2.48)
and
K(dl u)ti2
)X + 0, but ir(d u)ii x 2 = 0- (8.2.49)
3 2
We put a : = t r , /x = 6 i / i i -f 6 / i , with parameters b\, 6 , and rewrite 2 2 2
(8.2.47) as
3
0 = t (7r(a w)/ii6i + TT(dl u)fi b T A )X 2 2 (8.2.50)
3
+ h{dlu)T + Z{t,T^ fl )) U 2
3 3
=: c t (a -f a i r -f r
0 0 - f E(, r, / i i , / i ) ) , 2
3
with c = ^ T T ( ^ ^ )
0 0 (see (8.2.46)). For t 7^ 0, this is equivalent to
3
0 = a + air + r 0 + E(, r, / i i , / i ) . 2 (8.2.51)
Again
7T(9AU)/XI + 0, (8.2.54)
Lu
Xt = 0 (8.2.57)
(3 = TX(D L U(0,
X XQ AO) fa) (:= 7 r ( i L A o + t M u(0, A ))|
0 T = 0 ). (8-2.59)
Tx((d u)v)
x - /?. (8.2.60)
Proof. We abbreviate
By (8.2.14)
LA.(, A T ) + A ))
T = .
n((d u)fi)
x = -^(L A t w(0, A ))| t t = 0
L 0 A
= ~ ~ ( ; ^ * t M ' o)|t=o
-(D L u A o )^u(0, A )| t t = 0 .
Since D L\ U = J , and 7r o J
0 A o = 0 by definition of 7r, applying 7r to
A o
and by assumption (8.2.59), we may find fi for which the right-hand side
becomes /3o- (We take -/? in place of (3 in (8.2.59).)
a.e.d.
L\u = 0 (8.2.61)
with
V xA-*W
(u, A) i- L\u.
L uo
Xo = 0. (8.2.62)
8.2 The functional analytic approach to bifurcation theory 281
V : = ker
0 J\ {u )
0 0
V = V 0 Vi0 (8.2.63)
W = W Wi, with W i = R{J\ {u )),
0 0 0 W0 finite dimensional.
(8.2.64)
TV : W -+ Wo
TXL U
X = 0 and {Id - TT)L U X = 0. (8.2.65)
We first consider
{Id-n) : Vi x V x A W i , 0
v" = <p(v',\)
with
j F (t,u{t),u{t))
t p -F (t,u{t),u{t))
u
= 0
/
y/l -f 6 2
dt V v l T ^ 2
2
2
/ 2 3
+ - 7 ^ = ^ " \ A + u - 0,
/ 2
(8.3.3)
(v TT^ ) v IT^
or equivalently
2
nix - i i - 1 = 0. (8.3.4)
2
in our case F = u\/l - f u ,
= constant = : A.
u{t) = A cosh
A
8.3 Example: bifurcation of catenoids 283
u(t) = A c o s h ^ . (8.3.5)
The curve u(t) is called a catenary, and the minimal surface of revolution
obtained by revolving u(t) about the t-axis is called a catenoid. For the
sake of normalization, we consider the interval I = [1,1]. In order to
use the general theory of Section 8.2, we need to choose appropriate
Banach spaces V, W and A = E and consider the operator
L x : VxA-^W (8.3.6)
( n , A ) f _ >
(it (vTT6*) " ^ 1 + i 2
^ ~ Aeoshl,?x(-f)
A cosh j
k+2 P k p 2
V = W ' (I), W = W > (I) x E for some k,p
but the reader should also convince herself or himself that the other
2
choices work as well, although the space L will always play some aux
iliary role.
2 2
In the sequel, we shall denote the scalar product in L (I) by (, - ) L ,
i.e.
(WI,W )L*
2 = J wi(t)w (t) 2 dt
284 Bifurcation theory
2 2 2
for wi,w 2 G L (I). The scalar product on W = L (I) x E for w x =
2 2
w = (w ,s )
2 2 2 (wi,w 2 G L (I),s ,s x 2 K ),
2 2
is obtained from the scalar products on L (I) and on E .
The Jacobi operator is given by
J\(u)v = D L\(u)vu
( 8 3 8 )
- ( M r * ) ' -
J (u)v
x = 0. (8.3.9)
This is equivalent to
Vi(t) = sinh j
v (t) 2 = cosh j - j sinh j .
cf. Theorem 1.3.3 of Part I.) The boundary condition (8.3.11) cannot be
satisfied by V\, and so we have to find out for which values of A
A = tanh A. (8.3.13)
cf. (8.3.5).
8.3 Example: bifurcation of catenoids 285
The only solutions of (8.3.10), (8.3.11) are av(t) with a G R and v(t)
given in (8.3.12), and so we have
dimker J ( u ) = 1. A o 0 (8.3.14)
In the sequel, we shall need a little regularity result, namely that any
2
solution v of (8.3.15) of class L (I) is automatically smooth, in fact of
class C(I). As we are dealing with a one-dimensional problem here,
this result is not too hard to demonstrate, but since that would lead us
too far astray, we omit the proof. I t can be found in most good books on
differential equations or functional analysis, e.g. K . Yosida, Functional
Analysis, Springer-Verlag, Berlin, 5th edition, 1978, pp. 177-82.
2
Of course, i f v is of class C , (8.3.15) is equivalent to
for all 17 G C n i ) ,
22 2 2
J X O : W (I) - L (I) x R .
2 2
If w G Ro(J\ )
0
; =
R(J\ \H > (I))I
0
2 2 L
-- E
there exists v G H ' (I) with
J\ v = w, then for all cp G ker J
0 A o
= 0
(in the same manner as the equivalence of (8.3.15) and (8.3.16) and
noting that cp is smooth and v and <p both vanish on dl.) Thus i f
1 1
w G Ro(J\ ),
0 then also w G (ker J ^ ) - , where - denotes the orthog
2
onal complement in the Hilbert space L (I), as in Corollary 2.2.4. Con
2 2
sequently, i f we denote the closure of a linear subspace M of L (I) x R
286 Bifurcation theory
by M , then also
Ro(Jxo) C (ker J A J - . 1
1
Conversely, if w G Ro(J\ ) (= 0 ( ^ o ^ A o ) ) " ) , then
2 2
(w, ^A ^)vy = 0 0 for all v e H ' (I).
1
ker J A o = RoiJxo)- .
2 L2 1
L (I) = Ro(Jx ) Q (BRoiJxo)- ,
L
we may also consider Ro(J\ )~ 0 as coker J , and so we get the required
A o
identification
Remark 8.3.1. The situation here is slightly different from the one in
L
Section 8.2 inasmuch as we identify coker J here with Ro(J\ )' and A() 0
^(^Ao^) = 0
2 2 2 2
only for v G H ' (I), but not for all v G H ' (I). This is for example
relevant for the argument of the proof of Lemma 8.2.2.
J\o^n fni
2 2 2 2
and f n converges to fo in L {I) x E , then ||f ||vi/ < (/) is bounded. n
8.3 Example: bifurcation of catenoids 287
1 .. d ( 1 \ . 1 1
V V Vn
2 nn + "J! , 2 t2 nn +
1 To 7TT2 = fn,
cosh ^ ' dt ^ c o s h i ^ A2 c o s h ^
2
we then see that v n also converges in L (I). Thus, v n converges in
2 2
W ' (I), and the limit VQ then satisfies
v
J\o o = /o-
d
J T / w x ( 2
3Atanh^ . 2 \ 1 . 2
By (8.2.27), we have to project this onto the kernel of J\ (uo) and check 0
that the result is nonzero, for our Jacobi field v given in (8.3.12), i.e.
v = cosh tjA t / A s i n h t / A . Since here the projection TT is given by the
2 2
orthogonal projection in the Hilbert space L (I) x E onto ker J ( u o ) , Ao
1
cosh A
3Atanh| , 3 0 I
3
/ | cosh ^ cosh' j
by an integration by parts
2
3v(t)
, (Atanh { ?)() - v(t)) dt.
cosh A-
and so
(dJ (u )(v,v),v)
Xo 0 L2 = < 0.
Thus, indeed
3
ir(d u) a > 0. (8.3.19)
r a
it{d\u) ^ 0, w i t h d\u = ^t)\t=o f suitable family
A of parameters.
t (8.3.20)
L n ( , A t ) + 7r(n(,At)) = .
At (8.3.21)
In the present case L \ is given by (8.3.6), and IT is the orthogonal
2
projection in L (I) onto ker J , the one-dimensional space generated
A o
However, since
^(Acoshi) | A = A o =0
i / := A cosh-^
0 0
J (u )d u
Xo 0 x -f T J - ^ (d\u,v) 2v L = 0 (8.3.23)
v 2
\\ \\ L ( / )
dxU
(J\ (u )d u,v) 2
0 0 x L =j ^ c Q s ^ 2 _L ( W^J
+ dxU{t)Ht)dt
^ ^^0
Exercises 289
1 l
2-rd u(t)v{t)\ _
x v
cosh A q
= TT^TQ V)V ^ 0,
v 2
ll ll L ( / )
i.e. (8.3.20).
We thus have verified all the assumptions of Theorem 8.2.2 (for the
family X defined by (8.3.22) in place of the family At = Ao + tfi).
t
Exercises
2
I(u) = J u(t)^/l + u(t) dt
U(K) = U(K) = 1
9.1 T h e P a l a i s - S m a l e condition
d
in a bounded set. The local compactness of R then allowed the extrac
tion of a convergent subsequence whose limit XQ satisfied DF(xo) = 0,
because of the continuity of DF. I n Sections 2.3 and 3.2 of Part I , we
also presented examples where variational problems could be reduced to
such finite dimensional problems. The domain was a little more compli
d
cated than E , but being finite dimensional, i t was still locally compact
so that we had no difficulties finding limits of subsequences for critical
sequences. I n the remainder of this book, however, we have had am
ple opportunity to realize that variational problems are typically and
naturally posed on some infinite-dimensional Hilbert or Banach space.
Such a space is not locally compact anymore w.r.t. its Hilbert or Banach
space topology, so that the previous strategy encounters a serious prob
lem. Also weak topologies do not help much as the functionals under
consideration typically are not continuous w.r.t. the weak topology. I f
one searches for minimizers, this problem can be overcome by introduc
ing convexity assumptions as we have seen in Chapters 4 and 8, but any
convexity assumption excludes the existence of critical points other than
minima.
291
292 The Palais-Smale condition
(i) | F ( x ) | < c
n for some constant c
(ii) \\DF(x )\\->0 n /orrwoo
K a := {x e V : F(x) = a, DF(x) = 0}
q.e.d.
We also have:
erty of the i V 5 , let us assume on the contrary that there exist a neigh
a <
q.e.d.
(i) | F ( x ) | < c
n for some constant c
0 for n + oo
2 (
\\DG(x )\\
n
G(x )
Q = f3 (9.1.1)
and
2
\\DG(x )\\ 0
F : H R
2
of class C on some Hilbert space (H, (, )) that satisfies (PS). For each
294 The Palais-Smale condition
2 1
Since F is assumed to be of class C , DF and hence V F are of class C
in their dependence on u. I n particular, V F is locally Lipschitz.
We now consider the (negative) gradient flow induced by F :
V>(u,0) = u. (9.1.6)
V>(M + s) = ^ ( M ) , * ) (9.1.7)
= F(u) + jT DF(i/>(u,t))-^il>(u,T)dT
= F(u)- f \\DF(ip(u,T))\\ 2
dr by (9.1.5), (9.1.3)
Jo
< F(u) for t > 0, i f >F(u) = D F ( ^ ( u , 0 ) ) ^ 0, (9.1.9)
i.e. i f u is not a critical point of F .
may become unbounded in finite ' t i m e ' t . This can be easily remedied
by using the Lipschitz function
+ +
77 : M -> M
1 for 0 < s < 1
T](S) = { 1
for 5 > 1
s
and putting
VF(u) :=n(\\VF(u)\\)VF{u)
2
F(iKu,t)) = F(u) - T r / ( | | V F ( ^ ( u , r ) ) | | ) ||DF(^(u, r ) ) | | d r
Jo
F(-0(u, t)) < F(V>(u, 5)) whenever 0 < 5 < t, for all u.
Next, we wish to localize the construction near a level a. Thus, for given
eo > 0 and a neighbourhood U of K we want to have a flow ip(u, t)a
and the following more explicit local decrease of the value of F : For
a E R, we put
F : = {veH
a I F(v) <a}.
^(F a + e \ U, 1) C F _ Q C (9.1.12)
^(f/,l)cF _ U[/, a e (9.1.13)
296 The Palais-Smale condition
Again, a solution ip(u,t) exists for all > 0 and satisfies (9.1.7) for all
5,t > 0, as well as (9.1.8) and (9.1.14) (for the latter i t was necessary to
require (p > 0). (9.1.11) also is clear from the choice of (p. We now verify
(9.1.12), (9.1.13). I f 0 < c < f and u G F and i f F{i>(u, 1)) > a - e,
a + C
from (9.1.14)
and therefore
F(i>(u,l))
1
f d
= F(u)+ / F(V(w,r))dr
d T
Jo
; , 2
= F(u)- f v (i W,T))i7(||VF(V'(u,T))||)||Z?i!'W,r))|| dT
JO
2
<a + e- [ min(l,||DF(^(u,r)|| )(ir (9.1.17)
Jo
since we assume u G F a + C , using (9.1.16) and the properties of 77.
2 2
\\DF(ip(u T)\\
} > 6 whenever ip(u,r) ^ N f Without loss of gen a
dist(v,N s)
a I := inf w|| ) > p.
1
' wN ai6
Since
8
M
< 1, (9.1.20)
d t ^
therefore, i f u fi /, then also t/>(u, r ) ^ 7V 5 for 0 < r < p, and similarly,
a<
2
Thus, for 0 < e < min(, | p 6 ) , we get (9.1.12), (9.1.13).
In conclusion, we have shown the following deformation result:
2
T h e o r e m 9.1.1. Let F : H > R be a C functional on a Hilbert space
H, satisfying (PS). Let a G R, and put
FA : = {v G H : F ( v ) < a } ,
Ka := {v e H : = a, DF(v) = 0} .
with the semigroup property ip(ijj(u, s),t) = ip(u,s + t) for all s,t > 0,
u H and with
(vi) IfF(u) is even (i.e. F(u) = F(u) for all u), then also F(IJJ(U, t))
is even in u for all t (i.e. F(ip(u,t)) = F(ip(u,t))).
298 The Palais-Smale condition
|GWM))
G G
= -<p(F(i,(u, t)))r,(\\V F(u)\\) (V FW(u, t)), VG(i>(u, t)))
from the chain rule and the analogue of (9.1.15)
= 0,
G
since (V F(v),VG(v)) = 0 for all v G H. Therefore, the flow tp(u,t)
now leaves G = /3 invariant. We obtain:
2
T h e o r e m 9.1.2. Let F,G : H R beC functionals on a Hilbert space
( i f , (, )) with F satisfying (PS) relative to G = (3. Let a G R,
:= {veH\F(v)<a,G(v) = (3},
G
K>P := {v G H | F(v) = a, G(v) = (3, V F(v) = 0} .
9.1 The Palais-Smale condition 299
satisfying
(i) H ) | | < m i n ( l , | | D F ( u ) | | )
U
2
(ii) DF(u)(v) > Imin(|| DF( )||,||F( )|| )
J M M
for all u G U.
1
L e m m a 9.1.3. Let F : V R be a functional of class C on the
Banach space V. Then F admits a pseudo-gradient vector field on
V':={ueV \ DF(u)^0}.
p (v)
a :=dist(t;,V"\M ). a
f T h i s holds for any open covering of a paracompact set, see e.g. J . Dieudonne,
Grundziige der Modemen Analysis, 2, Vieweg, Braunschweig, second edition, 1987,
pp. 26-9; V is paracompact for example because it is metrizable.
300 The Palais-Smale condition
/ x Pa(v)
V U U W U S O m e U e M
() '' = YlalV<*( ) ( <*) <*
is a convex combination of vectors satisfying (9.1.24), (9.1.25) and hence
satisfies these relations, too.
v(u) thus is a pseudo-gradient vector field for F.
q.e.d.
1 2
Note that we only need to require F G C , and not F G C , in order
to construct a locally Lipschitz pseudo-gradient field. We then have the
1
following deformation for C -functionals on Banach spaces.
1
T h e o r e m 9.1.3. Let F : V > E be a C -functional on a Banach space
V satisfying (PS). Let a G E , eo > 0, U a neighbourhood of K as a
in Theorem 9.1.1. Then there exist 0 < e < 1 and a continuous family
I/J : V x [0, oo) V satisfying the semigroup property w.r.t. t > 0, and
(v) ^(F
a+e \ U, 1) C F _ , 1>(U, 1) C F _ U U
a c a c
Proof. The proof is the same as the one of Theorem 9.1.1, replacing
VF(u) by a pseudo-gradient vector field v(u) except for the following
technical point: Lemma 9.1.3 asserts the existence of a pseudo-gradient
field only on {x G V \ DF(x) ^ 0}. We therefore have to choose another
Lipschitz continuous cut-off function 7 : V E with 0 < 7 < 1, 7(1*) =
0 i f u G J V j , 7(14) = 1 for u G V \ N ^. We may then consider
a j a
W i t h such a </?, the right-hand side of (9.1.26) vanishes near any critical
point of F , and i t is therefore defined on all of V. I f we then also impose
the additional restriction
4
everything works out as before.
q.e.d.
9.2 T h e m o u n t a i n pass t h e o r e m
W i t h the help of the deformation theorems of the previous section, one
may easily derive existence results for critical points of a functional sat
isfying (PS). To illustrate this point, we start with the trivial
1
L e m m a 9 . 2 . 1 . Let F : V R be a C functional on a Banach space
satisfying (PS). If
a := inf F(u) > - 0 0 ,
of a,
F a + C ^ 0 , F _ = 0.
a c
302 The Palais-Smale condition
(i) 3p > 0,/3 > 0: F(u) > (3 for all u with \\u\\ = p
(ii) 3t*i with \\u\\\ > p and F(u) < (3.
We let
r := { C([0,1], V) | 7(0) = 0,7(1) = Ui} .
7
Then
a := inf sup F(I(T)) (> 0)
7r r [ 0 ) 1 ]
is a critical value ofF (i.e. there exists UQ with F(u ) 0 = a, DF(UQ) = 0).
sup F ( 7 O ( T ) ) < a + e,
T[0,1]
point.
q.e.d.
9.2 The mountain pass theorem 303
(1) One chooses a family of sets, here T, that exploits some properties
of F and is invariant under the deformation ?/>(, 1).
(2) This family yields a minimax value a.
(3) a can be estimated from above w i t h the help of any member of
our family r (a < s u p [ j F ( 7 ( t ) ) ) for any 7 G T), and from
r 0 x
d
T h e o r e m 9.2.2. Let Q C R be a bounded domain, 2 < p < ^
(respectively < 00 for d = 1,2). Then the Dirichlet problem
p 2
Au+ \u\ ~ u = 0 in Q (9.2.1)
u = 0 on dn (9.2.2)
2
F(u) = \ f \Du\ -- f \uf. (9.2.3)
^ JQ P JQ
2 2
This functional is a continuous functional on HQ' (Q), because J \Du\ Q
p
clearly is continuous there, and J \u\ too, because of the Sobolev Em
bedding Theorem 3.4.3 as we assume p < -j^. F is also differentiable,
with
p 2
DF(u)(<p)= j Du Dip - j \u\ ' u(p. (9.2.4)
JQ JQ
Again
(p 1 / Du Dip
JQ
304 The Palais-Smale condition
,2
clearly is continuous on HQ (Q), whereas
Jn
is continuous, because we have by Holder's inequality
p-i i
2 p P P (9.2.5)
I\ur u<p <([\u\A (f M )
<co\\u\\ i\ \\<p\\ i,2 p
H {Q) H {Q)
(9.2.6)
by the Sobolev Embedding Theorem 3.4.3
2 1
for some constant c . Thus F : H^ (Vt)
0 R is of class C .
We shall verify the Palais-Smale condition for F: Suppose ( u ) n n N C
H Q ' ( 0 ) satisfies
2
Thus
< ci (9.2.9)
and
(9.2.10)
a n
In (9.2.10), we use (p = n^)"^ 2 d obtain
2 p
- J \Du \ n +j \u \ n < c \\u \\ , .
2 n H1 2 (9.2.11)
2
J \Du \ n < c ||w||i,2 + c .
3 A (9.2.12)
=/ +j \Du \ n
2
<c J 5 \Du \
n
2
, (9.2.13)
C
HnllHi.(fl) ^ 6 - (9-2-14)
Thus, any 'critical sequence' ( u ) n N is bounded. We now claim that n
9.2 The mountain pass theorem 305
2
j \Du n Du \ m 0 for n, ra oo (9.2.15)
J Du D(u
n n - Um) - j |u | ~
n
p 2
u (u
n n - u) m 0 for 71, ra > oo
(9.2.16)
by (9.2.10), (9.2.14).
By the Rellich-Kondrachev theorem (Corollary 3.4.1), we may also as
sume (by selecting a subsequence) that (u ) ^ is a Cauchy sequence in n ne
Jy \u \ ~ n
P 2
U (un n - U)
m < (^J \u \ ^j (^J n
P
|tZ -U | ^
n m
P
^ 0
forn,ra->oo. (9.2.17)
which implies (9.2.15). We have thus verified (PS) for F. We shall now
check the remaining assumptions of Theorem 9.2.1. First of all,
F(0) = 0.
M C |D |!
(/n f '(/ " )"' n
we have
D U 2 P
^(l) = T / \ 2 \ ~ - f \U2\ <0.
2
J P Jn
306 The Palais-Smale condition
We have now verified all the assumptions of the mountain pass Theorem
9.2.1, and we consequently get a critical point u of F w i t h
Remark 9.2.1. By the same method, we can also treat the equation
p 2
Au - Xu + \u\ ' u = 0 for any A > 0. (9.2.18)
t/> : X x [0,1] ^ X
satisfying
F(y) < a + e,
we have
F(^(,l))<a-c.
Suppose
xo G spec F. a
We may find AQ G M w i t h
q.e.d.
We now also require ip(j(x),t) j(ip(x, t)) for all the deformations of
Definition 9.3.1.
i : M -+ { 0 , 1 , 2 , . . . , 0 0 }
(i) i(A) = 0 ^ A = 0
(ii) A finite (A ^ 0) i(A) = 1
(iii) 2(^1 U A ) < i(Ai) + i(A )
2 2
M :=
n {AeM\ i(A) >n}.
(i) Then
spec Q n F^0 (9.3.6)
spec F 25 infinite.
Qn
spec F ,
Qn
i(A )
0 < k. (9.3.8)
tion ip w i t h
f Since the infimum over an empty set is 00, this contains the assumption Mn / 0-
310 The Palais-Smale condition
hence
sup F(z)<a -e. n (9.3.10)
zeif(A\u,i)
We have
i(A\U)>i(A)-i(U) by (iii)
> n + k - k, using (9.3.8), (9.3.9), A e M+
n k
= n.
Thus
A\UeM , n
hence
sup F(y) > a , n
yV(^\t/,l)
contradicting (9.3.10).
q.e.d.
while g e n ( 0 ) : = 0.
As an example, we state:
n 1 n
L e m m a 9.3.2. The genus of the unit sphere S' "" = {||x|| = 1} in R
(with involution j(x) = x) is equal to n.
9.3 Topological indices and critical points 311
n 1 n
Proof. The inclusion map S' "" R satisfies the properties of Def
n _ 1 n _ 1
inition 9.3.3, and so g e n ( 5 ) < n . I f n > 2, 5 is connected,
and therefore, by the mean value theorem, there is no continuous map
1 1
/ : S^- -+ R ^ j o } w i t h f(-x) = -f(x) for all x. Hence g e n ^ " ) > 2.
In fact, by the Borsuk-Ulam theoremf, there is no such continuous map
m n 1
to R \ {0} w i t h m < n. Therefore, g e n ( 5 ~ ) > n .
q.e.d.
(i) is obvious.
(ii) I f A A I is finite, then A is of the form {x ,j(x ) \ v 1 , . . . , k} 1/ 1/
1
for some k. We define / : A -+ R \ {0} by f(x) = 1, f{j{x )) = u
- 1 for all v (of course, we may assume x ^ j(x ) for all fi, v). p u
(hi) Let gen(A) = n < oo, v = 1,2, and let the continuous f : v
n
A -> R " \ { 0 } satisfy U(j(x))
v = ~U(x) for all x. By the
Tietze extension theorem^, f can be continuously extended to
v
n
f v :X-+R ".
(iv) is obvious.
(v) follows, since / o j shares the necessary properties w i t h / .
f See e.g. E . Zeidler, Nonlinear Functional Analysis and its Applications, I , Springer,
New York, 1984, p. 708, for a proof.
$ See E . Zeidler, loc. cit., p. 49.
312 The Palais-Smale condition
</?j,(x) > 0 for x /|,, <p (x) = 0 for sc E X \ t/. We then define
u
n n
h = (h\...,h ): A^R \{0}by
^( ).= J
x forxt/^
\ <Pv{x) for x A \ U , in particular for x
y j{U ).
u
n = gen(A)
< gen(U) by (iv)
n
< n since J(U) is contained in V C R \ { 0 } .
Proof. Since (PS) holds, by Theorem 9.1.2, all special points (in the
9.3 Topological indices and critical points 313
00 < ot\ < cx<i < < a n < - - - < 0 0 whenever n < 7 0 .
xn spec F Qn
produced by Theorem 9.3.2 (i) are all different, because their values
F(x )
n are all different. I f however any two such numbers a _ i and a n n
are equal, then by Theorem 9.3.2 (ii) we even obtain infinitely many
special points. Thus, in any case, we have at least 70 special, hence
critical points.
q.e.d.
F{u) = \j^(\Du\ 2
+ \u ) 2
G(u) = lf
P Jn
\u\ . p
(DGMDFM)
0 forn->oo (9.3.14)
\\DG(u )\f n
2
where all norms and scalar products are from H Q ' ( Q ) . From (9.3.13)
(and the Poincare inequality in case A = 0), we obtain
(9.3.15)
\DG(u )(u
n n - u )\ m = j \u \ n
p 2
u (u -Um)
n n
2^1
| U n | P P U P P 9 3 1 6
" ( / ) ( / l ^ - - | ) ' ( - - )
p
sequence in L ( f i ) . Thus, from (9.3.16)
DG(u )(u
n n Um) > 0 for n, m oo. (9.3.17)
Also
\DG(u )(w)\
n
\\DG(u ) n sup
\DG(u )(u )\ n n
> ||U||1,2
P
!K\
(9.3.18)
||n||fl-.a
- / \u \ n
p
= G(u ) n = 1. (9.3.19)
PJ
9.3 Topological indices and critical points 315
2
Prom (9.3.17), (9.3.18) we conclude that there exist h n m e H^ (ft) with
DG(u )(u
n n - Um + h ) nm = 0 for all n , m (9.3.20)
l l ^ n m ! ! / ^ - 2
0 for n, ra oo. (9.3.21)
DF(u )(u n n - u m 4 h ) nm -+ 0,
i.e.
for n, ra ^ oo
2 2
J (\{D(u n - u )\ m 4 A | ( u - u )\ ^ n m -+ 0 forn,ra->oo,
,2
and consequently, ( u ) N is a Cauchy sequence in H o ( n ) . This verifies
n n e
(PS) relative to G = 1.
p
is the intersection of a sphere centered at the origin in L (ft) with the
,2
subspace HQ (Q). Therefore, the argument of Lemma 9.3.2 easily im
plies 7 0 = 0 0 . Theorem 9.3.2 thus produces infinitely many solutions
n ! 2 y n >
\\DG(u )\\ n
i.e. w i t h
= (DG(u ),DF(u )) n n
a
^' ||r>G()|| '
weak solutions of
P 2
Au n - Xu 4 fi \u \ ~ n n n u =0 n in ft
u n = 0 on dft.
316 The Palais-Smale condition
2
If we choose v with z / P ~ / i = 1, then v := v u
n solves (9.3.11),
n n n n
d 2
and the reader may wonder whether this is necessary. To pursue this
question, we shall now discuss the theorem of Pohozaev:
d
T h e o r e m 9.3.4. Let fi C R be a smooth domain which is strictly star
d
shaped w.r.t. 0 R (this means that the outer normal v of Vt satisfies
(x, v(x)) > 0 for all x dft). Then for X > 0, any solution of
Au - Xu - f M ^ u = 0 in fi (9.3.22)
u = 0 on dft (9.3.23)
vanishes identically.
We shall present a complete proof only for A > 0 and for smooth
solutions u (elliptic regularity implies that any weak solution of (9.3.21),
(9.3.22) is automatically smoothf on fi, but the present book does not
treat this topic):
a n d o b t a i n
We multiply (9.3.22) by YlLi
= div ( W g ~^f~r M 2
+ w*
2 2
+ ^ | D u | + ^ H - - ^ H ^ . (9.3.25)
(9.3.27)
Jn Jn Jn
Equations (9.3.26) and (9.3.27) imply
(9.3.28)
Jn Jan du
I f A > 0, this implies u = 0, hence the result. (If A = 0, one still concludes
that | ^ = 0 on dft. Since also u = 0 on dft by (9.3.23) one may invoke a
unique continuation theorem for solutions of elliptic equations to obtain
u = 0 in ft. We omit the details.)
q.e.d.
and
318 The Palais-Smale condition
Exercises
9.1 W h y is Theorem 9.2.1 called 'mountain pass theorem'? Hint:
Try to find an analogy between the statement of that result and
the geometry of mountain passes.
9.2 Try to find conditions for a function
/ : ft x R->R
x y = J2i=i = x v
l
k
C (Q), xv C {A), 120
d
fc d <p C g ( R ) , 122
C ( n , R ) , xvi
c(n), xvi INI. 125
c (n), xvi
0
(v), 126
qjf(n), xvi R + := {t e R | t > 0 } , 130
b
/(u) = / F(t,u(t),u(t))ift,3 0 H/IL :=sup^ i^|l,133 0
D/, 4
d
V * := { / V R linear with :
r7Ci([a,6],IR ),5
| | / I U < o o } , 133
F , 5 u
(V*)* = : V * * , 133
i^P, 5 3Jn * X , 135
6 J ( U , I J ) := / ( u 4 - s r / ) , _ 1 0 a = 0 M := x
2
6 / ( U , T J ) := ^ / ( u + t y ) , ^ 1 9 = 0 L ( V , W), 145
k e r T : = {a; 6 V : To; = 0 } , 145
Fpipj'Hi'nj = 1 2 ^ = 1 FpipiViVj, 19
V = V i e V , 146 2
c(t) : = g ( t ) , 32
coker, 147
^(c) :=/ |c(t)|dt = 0
T
H ( T ) , 147
T 2
i n d T , 147
/ o ( E i i ( 0 ) * * , 32 F ( V , W ) , 147
E(c) :=yT\c(t)\ dt 2
= DF(u), 150
1
C , 150
2
C , 150
2
> F ( u ) , 150
:= 3 9
ODE, 155
fl'y.fc ~$t9v> I M | o : = s u p | | y ( t ) | | , 156
c t /
r : =
Jfc ^(gjitk+gku-gjid), 39 H/llp = H / H L P ) : = :
U \f(x)\ dx)p, M A
p
159
n
esssup f(x) := inf { A G R | f(x) <
x j 4
5 :=
X for almost all xeA}, 162
39 C (n), 1660