Professional Documents
Culture Documents
p,
p,
A SCIENCE PUBLISHERS BOOK
A SCIENCE PUBLISHERS BOOK
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted,
or utilized in any form by any electronic, mechanical, or other means, now known or hereafter invented, includ-
ing photocopying, microfilming, and recording, or in any information storage or retrieval system, without written
permission from the publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright.com
(http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers,
MA 01923, 978-750-8400. CCC is a not-for-profit organization that provides licenses and registration for a variety
of users. For organizations that have been granted a photocopy license by the CCC, a separate system of payment
has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for
identification and explanation without intent to infringe.
The first part of this book develops in details the basic tools of functional analysis, measure and integration
and Sobolev spaces theory, which will be used in the development of the subsequent chapters. The results are
established in a rigorous and precise fashion, however we believe the proofs presented are relatively easy to
follow, since all steps are developed in a very transparent and clear way.
Of course, in general the results in this first part are not new and may be found in similar format in many
other excellent books on functional analysis, such as those by Bachman and Narici [6], Brezis [24], Rudin
[67] and others.
In many cases we give our own version of some proofs we believe the original ones are not clear enough,
always aiming to improve their clarity and preciseness.
In the second book part, we develop the basic concepts on the calculus of variations, convex analysis,
duality theory and constrained optimization in a Banach spaces context.
We start with a more basic approach on the calculus of variations which gradually evolves to the
development of more advanced results on convex analysis and optimization. The text includes a formal proof
of the Lagrange multiplier theorem, in a first step, for the equality-restriction case and in the subsequent
sections we address the general case of constrained optimization also in a Banach spaces context.
Finally, in the third part we address the development of several important examples which apply the
previous theoretical approaches exposed.
Applications include duality principles for non-linear plate and shell models, duality principles for
problems in non-linear elasticity, duality for shape optimization models, existence and duality for the
Ginzburg-Landau system in superconductivity, duality for a semi-linear model in micro-magnetism and
others.
About such applications we highlight the results on the Generalized Method of Lines, which is a
numerical method for which the domain of the partial differential equation in question is discretized in lines,
and the concerning solution is written on these lines as function of the boundary conditions and boundary
shape. Some important models are addressed and among them we highlight the applications of such a method
to the Navier-Stokes system.
Also as an application of the Generalized Method of Lines, we finish the book with a chapter which
presents a numerical procedure for the solution of an inverse optimization problem.
At this point, it is worth emphasizing that the Chapter 21 is co-authored by myself and my colleague
Alexandre Molter and Chapter 23 is co-authored by myself and my graduate student Eduardo Pandini Barros,
to whom I would like to express my gratitude for their contributions.
As a final note we highlight that the content of the present book overlaps something with the ones of my
previous books ‘‘Topics on Functional Analysis, Calculus of Variations and Duality” published by Academic
Publications, Sofia, 2011 and, ‘‘Functional Analysis and Applied Optimization in Banach Spaces” published
by Springer in 2014, but only relating a part on basic standard mathematics, so that, among some other new
chapters, the applications presented from Chapters 17 and 30 are almost all new developments.
Acknowledgements
I would like to thank some people of Virginia Tech—USA, where I got my Ph.D. degree in Mathematics in
2009. I am especially grateful to Professor Robert C Rogers for his excellent work as advisor. I would like to
thank the Department of Mathematics for its constant support and this opportunity of studying mathematics
in advanced level. Among many other Professors, I particularly thank Martin Day (Calculus of Variations),
William Floyd and Peter Haskell (Elementary Real Analysis), James Thomson (Real Analysis), Peter
Linnell (Abstract Algebra) and George Hagedorn (Functional Analysis) for the excellent lectured courses.
Finally, special thanks to all my Professors at I.T.A. (Instituto Tecnológico de Aeronáutica, SP-Brasil) my
undergraduate and masters school. Specifically about I.T.A., among many others I would like to express
my gratitude to Professors Leo H Amaral, Tânia Rabelo and my master thesis advisor Antônio Marmo de
Oliveira, also for their valuable work.
January 2020 Fabio Silva Botelho
Florianópolis, SC, Brazil
Contents
Preface iii
Acknowledgements iv
1. Metric Spaces 1
1.1 Introduction 1
1.2 The main definitions 1
1.2.1 The space l ∞ 2
1.2.2 Discrete metric 2
1.2.3 The metric space s 3
1.2.4 The space B(A) 3
1.2.5 The space l p 3
1.2.6 Some fundamental definitions 6
1.2.7 Properties of open and closed sets in a metric space 7
1.2.8 Compact sets 10
1.2.9 Separable metric spaces 13
1.2.10 Complete metric spaces 14
1.3 Completion of a metric space 15
1.4 Advanced topics on compactness in metric spaces 19
1.5 The Arzela-Ascoli theorem 24
3. Hilbert Spaces 52
3.1 Introduction 52
3.2 The main definitions and results 52
3.3 Orthonormal basis 57
3.3.1 The Gram-Schmidt orthonormalization 59
3.4 Projection on a convex set 60
3.5 The theorems of Stampacchia and Lax-Milgram 63
17. Global Existence Results and Duality for Non-Linear Models of Plates and Shells 367
17.1 Introduction 367
17.2 On the existence of a global minimizer 369
17.3 Existence of a minimizer for the plate model for a more general case 372
17.4 The main duality principle 375
17.5 Existence and duality for a non-linear shell model 381
17.6 Existence of a minimizer 384
17.7 The duality principle for the shell model 386
17.8 Conclusion 390
18. A Primal Dual Formulation and a Multi-Duality Principle for a Non-Linear Model of Plates 391
18.1 Introduction 391
18.2 The first duality principle 393
18.3 The primal dual formulation and related duality principle 399
18.4 A multi-duality principle for non-convex optimization 402
18.5 Conclusion 408
19. On Duality Principles for One and Three-Dimensional Non-Linear Models in Elasticity 409
19.1 Introduction 409
19.2 The main duality principle for the one-dimensional model 410
19.3 The primal variational formulation for the three-dimensional model 414
19.4 The main duality principle for the three-dimensional model 415
19.5 Conclusion 419
20. A Primal Dual Variational Formulation Suitable for a Large Class of Non-Convex 420
Problems in Optimization
20.1 Introduction 420
20.2 The main duality principle 424
20.3 Conclusion 426
21. A Duality Principle and Concerning Computational Method for a Class of Optimal 427
Design Problems in Elasticity
Fabio Silva Botelho and Alexandre Molter
21.1 Introduction 427
21.2 Mathematical formulation of the topology optimization problem 428
21.3 About the computational method 431
21.4 Computational simulations and results 432
21.5 Final remarks and conclusions 435
22. Existence and Duality Principles for the Ginzburg-Landau System in Superconductivity 437
22.1 Introduction 437
22.2 The main result 440
22.3 A second multi-duality principle 446
22.4 Another duality principle for global optimization 452
x < Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
22.5 The existence of a global solution for the Ginzburg-Landau system in the presence 456
of a magnetic field
22.6 Duality for the complex Ginzburg-Landau system 461
22.7 Conclusion 464
23. Existence of Solution for an Optimal Control Problem Associated to the 465
Ginzburg-Landau System in Superconductivity
Fabio Silva Botelho and Eduardo Pandini Barros
23.1 Introduction 465
23.2 An existence result for a related optimal control problem 466
23.3 A method to obtain approximate numerical solutions for a class of partial 471
differential equations
23.4 Conclusion 475
25. About Numerical Methods for Ordinary and Partial Differential Equations 482
25.1 Introduction 482
25.2 On the numerical procedures for Ginzburg-Landau type ODEs 482
25.3 Numerical results for related P.D.E.s 485
25.3.1 A related P.D.E on a special class of domains 485
25.3.2 About the matrix version of G.M.O.L. 486
25.3.3 Numerical results for the concerning partial differential equation 488
25.4 A proximal algorithm for optimization in Rn 488
25.4.1 The main result 489
25.5 Conclusion 497
26. On the Numerical Solution of First Order Ordinary Differential Equation Systems 498
26.1 Introduction 498
26.2 The main results 498
26.3 Numerical results 501
26.4 The Newton’s method for another first order system 504
26.4.1 An example in nuclear physics 507
26.5 Conclusion 511
27. On the Generalized Method of Lines and its Proximal Explicit and Hyper-Finite 512
Difference Approaches
27.1 Introduction 512
27.1.1 Some preliminaries results and the main algorithm 514
27.1.2 A numerical example, the proximal explicit approach 516
27.2 The hyper-finite differences approach 520
27.2.1 The main algorithm 520
27.2.2 A numerical example 522
27.3 Conclusion 525
Contents < xi
28. On the Generalized Method of Lines Applied to the Time-Independent Incompressible 526
Navier-Stokes System
28.1 Introduction 526
28.2 On the solution of the time-independent incompressible Navier-Stokes system through 527
an associated linear one
28.3 The generalized method of lines for the Navier-Stokes system 529
28.3.1 Numerical examples through the generalized method of lines 539
28.4 Conclusion 545
29. A Numerical Method for an Inverse Optimization Problem through the Generalized 547
Method of Lines
29.1 Introduction 547
29.2 The mathematical description of the main problem 547
29.3 About the generalized method of lines and the main result 548
29.3.1 The numerical results 552
29.4 Conclusion 553
30. A Variational Formulation for Relativistic Mechanics based on Riemannian Geometry 554
and its Application to the Quantum Mechanics Context
30.1 Introduction 554
30.2 Some introductory topics on vector analysis and Riemannian geometry 556
30.3 A relativistic quantum mechanics action 560
30.3.1 The kinetics energy 561
30.3.2 The energy part relating the curvature and wave function 562
30.4 Obtaining the relativistic Klein-Gordon equation as an approximation of the 565
previous action
30.5 A note on the Einstein field equations in the vacuum 567
30.6 Conclusion 568
References 569
Index 573
SECTION I
FUNCTIONAL ANALYSIS
Chapter 1
Metric Spaces
1.1 Introduction
In this chapter we present the main basic definitions and results relating to the concept of metric spaces.
We may recall that any Banach space is a metric one, so that the framework here introduced is
suitable for a very large class of spaces.
The main references for this chapter are [14, 82].
Example 1.2.5 V = C([a, b]), where C([a, b]) is the metric space of continuous functions u : [a, b] → R
with the metric d : V ×V → R+ , where
Example 1.2.6 V = C([a, b]), is a metric space with the metric d : V ×V → R+ where
Z b
d(u, v) = |u(x) − v(x)| dx, ∀u, v ∈ V.
a
l ∞ = {u = {un }n∈N : un ∈ C and there exists M > 0 such that |un | < M, ∀n ∈ N} .
Definition 1.2.8 Let V be a non-empty set. We define the discrete metric for V by
0, if u = v,
d(u, v) = (1.1)
6 v.
1, if u =
V = {u = {un }, : un ∈ C, ∀n ∈ N},
B(A) = {u : A → R, such that there exists M > 0 such that |u(x)| < M, ∀x ∈ A}.
Exercise 1.2.13 Show that B(A) is a metric space with the metric
h0 (x) = x p−1 − y = 0,
Thus,
1 p 1 q
h(x) = x + y − xy ≥ h(x0 ) = 0, ∀x > 0.
p q
Therefore, since y > 0 is arbitrary, we obtain
1 p 1 q
xy ≤ x + y , ∀x, y > 0.
p q
Metric Spaces 5
so that
1 p 1 q
xy ≤ x + y , ∀x, y ≥ 0. (1.3)
p q
Let u = {un } ∈ l p and v = {vn } ∈ l q .
Denote !1/ p
∞
kuk p = ∑ |un | p
n=1
and !1/q
∞
kvkq = ∑ |vn |q .
n=1
Define also ( )
u un
û = = ,
kuk p (∑∞ p 1/p
n=1 |un | )
and ( )
v vn
v̂ = = .
kvkq (∑∞ q 1/q
n=1 |vn | )
From this and (1.3) we obtain,
∞
1 ∞ 1 ∞
∑ |ûn v̂n | ≤ ∑ |ûn | p + ∑ |v̂n |q
n=1 p n=1 q n=1
1 1
= +
p q
= 1. (1.4)
Thus,
∞
∑ |un vn | ≤ kuk p kvkq , ∀u ∈ l p , v ∈ l q .
n=1
¨
This last inequality is well known as the Holder one.
Exercise 1.2.15 Prove the Minkowski inequality, namely
ku + vk p ≤ kuk p + kvk p , ∀u, v ∈ l p .
Hint
∞
ku + vk pp = ∑ |un + vn | p
n=1
∞
≤ ∑ |un + vn | p−1 (|un | + |vn |). (1.5)
n=1
Definition 1.2.17 (limit point) Let (U, d) be a metric space and E ⊂ U. A point u ∈ U is said to be a
limit point of E if for each r > 0 there exists v ∈ Vr (u) ∩ E such that v 6= u.
Remark 1.2.19 In the next definitions U shall denote a metric space with a metric d.
Definition 1.2.20 (Isolated point) Let u ∈ E ⊂ U. We say that u is an isolated point of E if it is not a
limit point of E.
Example 1.2.21
U = R2 , E = B1 ((0, 0)) ∪ {(3, 3)}. Thus (3, 3) is an isolated point of E.
Definition 1.2.22 (Closed set) Let E ⊂ U and let E 0 be the set of limit points of E. We say that E is
closed if E ⊃ E 0 .
Example 1.2.23
Let U = R2 and r > 0, thus E = Br ((0, 0)) is closed.
Definition 1.2.24 A point u ∈ E ⊂ U is said to be an interior point of E if there exists r > 0 such that
Vr (u) ⊂ E, where
Vr (u) = {v ∈ U | d(u, v) < r}.
Example 1.2.25
For U = R2 , let E = B1 ((0, 0)) ∪ {(3, 3)}, for example u = (0.25, 0.25) is an interior point
p of E, in fact,
for r = 0.5, if v ∈ Vr (u) then d(u, v) < 0.5 so that d(v, (0, 0)) ≤ d((0, 0), u) + d(u, v) ≤ 1/8 + 0.5 < 1
that is, v ∈ B1 ((0, 0)) and thus Vr (u) ⊂ B1 ((0, 0)). We may conclude that u is an interior point of
B1 ((0, 0)). In fact all points of B1 ((0, 0)) are interior.
Definition 1.2.26 (Open set) E ⊂ U is said to be open if all its points are interior.
Example 1.2.27
For U = R2 , the ball B1 (0, 0) is open.
E c = {v ∈ U | v 6∈ E}.
Definition 1.2.29 A set E ⊂ U is said to bounded if there exists M > 0 such that
sup{d(u, v) | u, v ∈ E} ≤ M.
Example 1.2.31
The set Q is dense in R. Let u ∈ R and let r > 0. Thus, from a well known result in elementary analysis
there exists v ∈ Q such that u < v < u + r, that is, v ∈ Q ∩Vr (u) and v 6= u, where Vr (u) = (u − r, u + r).
Therefore u is a limit point of Q. Since u ∈ R is arbitrary, we may conclude that R ⊂ Q0 , that is, Q is
dense in R.
Theorem 1.2.32 Let (U, d) be a metric space. Let u ∈ U and r > 0. Then Vr (u) is open.
Therefore w ∈ Vr (u), ∀w ∈ Vr1 (v), that is Vr1 (v) ⊂ Vr (u), so that we may conclude that v is an interior point
of Vr (u), ∀v ∈ Vr (u), thus, Vr (u) is open. The proof is complete.
Theorem 1.2.33 Let u be a limit point of E ⊂ U, where (U, d) is a metric space. Then each neighborhood
of u has an infinite number of points of E, distinct from u.
Proof 1.2 Suppose to obtain contradiction, that there exists r > 0 such that Vr (u) has a finite number of
points of E distinct from u. Let {v1 , ..., vn } be such points of Vr (u) ∩ E distinct from u. Choose 0 < r1 <
min{d(u, v1 ), d(u, v2 ), ...., d(u, vn )}. Hence Vr1 (u) ⊂ Vr (u) and vi 6∈ Vr1 (u), ∀i ∈ {1, 2, ..., n}. Therefore either
Vr1 (u) ∩ E = {u} or Vr1 ∩ E = 0, / which contradicts the fact that u is a limit point of E.
The proof is complete.
u ∈ (∪α∈L Eα )c ⇔ u 6∈ ∪α∈L Eα
⇔ u 6∈ Eα , ∀α ∈ L
⇔ u ∈ Eαc , ∀α ∈ L
⇔ u ∈ ∩α∈L Eαc . (1.6)
Exercise 1.2.36
Prove that
(∩α ∈L Eα )c = ∪α∈L Eαc .
Theorem 1.2.37 Let (U, d) be a metric space and E ⊂ U. Thus, E is open if and only if E c is closed.
8 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 1.4 Suppose E c is closed. Choose u ∈ E, thus u 6∈ E c and therefore u is not a limit point E c . Hence
there exists r > 0 such that Vr (u) ∩ E c = 0/. Hence, Vr (u) ⊂ E, that is, u is an interior point of E, ∀u ∈ E, so
that E is open.
Reciprocally, suppose E is open. Let u ∈ (E c )0 . Thus for each r > 0 there exists v ∈ Vr (u) ∩ E c such that
v=6 u, so that
Vr (u) * E, ∀r > 0.
Therefore u is not an interior point of E. Since E is open we have that u 6∈ E, that is, u ∈ E c . Hence (E c )0 ⊂ E c ,
that is, E c is closed.
The proof is complete.
Corollary 1.2.38 Let (U, d) be a metric space, F ⊂ U is closed if and only if F c is open.
∩ni=1 Gi
is open.
∪ni=1 Fi
is closed.
Proof 1.5
1. Let Gα ⊂ U, where Gα is open ∀α ∈ L. Let u ∈ ∪α ∈L Gα . Thus u ∈ Gα0 for some α0 ∈ L. Since Gα0 is
open, there exists r > 0 such that Vr (u) ⊂ Gα0 ⊂ ∪α ∈L Gα . Hence, u is an interior point, ∀u ∈ ∪α ∈L Gα .
Thus ∪α∈L Gα is open.
2. Let Fα ⊂ U, where Fα is closed ∀α ∈ L. Thus, Fαc is open ∀α ∈ L. From the last item, we have ∪α∈L Fαc
is open so that
∩α∈L Fα = (∪α∈L Fαc )c
is closed.
3. Let G1 , ..., Gn ⊂ U be open sets. Let
u ∈ ∩ni=1 Gi .
Thus,
u ∈ Gi , ∀i ∈ {1, ..., n}.
Since Gi is open, there exists ri > 0 such that Vri (u) ⊂ Gi .
Metric Spaces 9
Define r = min{r1 , ..., rn }. Hence, Vr (u) ⊂ Vri (u) ⊂ Gi , ∀i ∈ {1, ..., n} and therefore
Vr (u) ⊂ ∩ni=1 Gi .
This means that u is an interior point of ∩ni=1 Gi , and being u ∈ ∩ni=1 Gi arbitrary we obtain that ∩ni=1 Gi
is open.
4. Let F1 , ..., Fn ⊂ U be closed sets. Thus, F1c , ..., Fnc are open. Thus, from the last item, we obtain:
∩ni=1 Fic
is open, so that
∪ni=1 Fi = (∩ni=1 Fic )c
is closed.
The proof is complete.
Exercise 1.2.40
Let (U, d) be a metric space and let u0 ∈ U. Show that A = {u0 } is closed. Let B = {u1 , ..., un } ⊂ U. Show
that B is closed.
Definition 1.2.41 (Closure) Let (U, d) be a metric space and let E ⊂ U. Denote the set of limit points of E
by E 0 . We define the closure of E, denoted by E, by:
E = E ∪ E 0.
Examples 1.2.42
1. Let U = R2 , E = B1 (0, 0), we have that E 0 = B1 (0, 0), so that in this example E = E ∪ E 0 = E 0 .
2. Let U = R, A = {1/n : n ∈ N}, we have that A0 = {0}, and thus A = A ∪ A0 = A ∪ {0}.
Proof 1.6
c
1. Observe that E = E ∪ E 0 . Let u ∈ E . Thus u 6∈ E and u 6∈ E 0 (u is not a limit point of E). Therefore,
/ that is, Vr (u) ⊂ E c , thus, u is an interior point of E c .
there exists r > 0 such that Vr (u) ∩ E = 0,
We shall prove that Vr (u) ∩ E = 0/. Let v ∈ Vr (u) and define r1 = r − d(u, v) > 0. We shall show that
Definition 1.2.45 (Compact set) Let (U, d) be a metric space and K ⊂ U. We say that K is compact if each
open covering {Gα , α ∈ L} of K admits a finite sub-covering. That is, if K ⊂ ∪α ∈L Gα , and Gα is open
∀α ∈ L, then there exist α1 , α2 , ..., αn ∈ L such that K ⊂ ∪ni=1 Gαi .
Theorem 1.2.46 Let (U, d) be a metric space. Let K ⊂ U where K is compact. Then K is closed.
Proof 1.7 Let us show that K c is open. Let u ∈ K c . For convenience, let us generically denote in this proof
Vr (u) = V (u, r).
For each v ∈ K we have d(u, v) > 0. Define rv = d(u, v)/2. Thus,
Observe that
∪v∈K V (v, rv ) ⊃ K.
since K is compact, there exist v1 , ..., vn ∈ K such that
Hence,
n
V (u, r0 ) ∩ (∪i=1 V (vi , rvi )) = 0/.
/ that is V (u, r0 ) ⊂ K c . Therefore u is an interior point of K c
From this and (1.8) we obtain, V (u, r0 ) ∩ K = 0,
c c
and being u ∈ K arbitrary, K is open so that K is closed.
The proof is complete.
Theorem 1.2.47 Let (U, d) be a metric space. If F ⊂ K ⊂ U, K is compact and F is closed, then F is
compact.
Metric Spaces 11
F ⊂ ∪α∈L Gα .
F c ∪ (∪α∈L Gα ) ⊃ K.
Therefore, since F c is open {F c , Gα , α ∈ L} is an open covering of K, and since K is compact, there exist
α1 , ..., αn ∈ L such that
F c ∪ Gα1 ∪ ... ∪ Gαn ⊃ K ⊃ F.
Therefore
Gα1 ∪ ... ∪ Gαn ⊃ F,
so that F is compact.
Theorem 1.2.48 If {Kα , α ∈ L} is a collection of compact sets in a metric space (U, d) such that the inter-
section of each finite sub-collection is non-empty, then
6 0/.
∩α∈L Kα =
Hence
Kα0 ⊂ (∩α∈L1 Kα )c ,
that is,
Kα0 ⊂ ∪α∈L1 Kαc .
Since, Kα0 is compact and Kαc is open, ∀α ∈ L, there exist α1 , α2 , ..., αn ∈ L1 such that
c
Kα0 ⊂ ∪nj=1 Kαc j = ∩nj=1 Kα j ,
therefore,
Kα0 ∩ ∩nj=1 Kα j = Kα0 ∩ Kα1 ∩ ... ∩ Kαn = 0/,
Corollary 1.2.49 Let (U, d) be a metric space. If {Kn , n ∈ N} ⊂ U is a sequence of compact non-empty sets
such that Kn ⊃ Kn+1 , ∀n ∈ N then ∩∞
n=1 Kn 6= 0
/.
Theorem 1.2.50 Let (U, d) be a metric space. If E ⊂ K ⊂ U, K is compact and E is infinite, then E has at
least one limit point in K.
Proof 1.10 Suppose, to obtain contradiction, that no point of K is a limit point of E. Then, for each
u ∈ K there exists ru > 0 such that V (u, ru ) has at most one point of E, namely, u if u ∈ E. Observe that
{V (u, ru ), u ∈ K} is an open covering of K and therefore of E. Since each V (u, ru ) has at most one point of
E which is infinite, no finite sub-covering (relating the open cover in question), covers E, and hence no finite
sub-covering covers K ⊃ E, which contradicts the fact that K is compact. This completes the proof.
12 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 1.2.51 Let {In } be a sequence of bounded closed non-empty real intervals, such that In ⊃
In+1 , ∀n ∈ N. Thus, ∩∞ 6 0/.
n=1 In =
Theorem 1.1
Let I = [a, b] ⊂ R be a bounded closed non-empty real interval. Under such hypotheses, I is compact.
Proof 1.12 Observe that if x, y ∈ [a, b] then |x − y| ≤ (b − a).. Suppose there exists an open covering of I,
denoted by {Gα , α ∈ L} for which there is no finite sub-covering.
Let c = (a + b)/2. Thus, either [a, c] or [c, b] has no finite sub-covering related to {Gα , α ∈ L}. Denote
such an interval by I1 . Dividing I1 into two connected closed sub-intervals of same size, we get an interval I2
for which there is no finite sub-covering related to {Gα , α ∈ L}.
Proceeding in this fashion, we may obtain a sequence of closed intervals {In } such that
1. In ⊃ In+1 , ∀n ∈ N.
2. No finite sub-collection of {Gα , α ∈ L} covers In , ∀n ∈ N.
3. If x, y ∈ In then |x − y| ≤ 2−n (b − a).
From the last theorem, there exists x∗ ∈ R such that x∗ ∈ ∩∞ n=1 In ⊂ I ⊂ ∪α ∈L Gα . Hence, there exists α0 ∈ L
such that x∗ ∈ Gα0 . Since Gα0 is open, there exists r > 0 such that
Vr (x∗ ) = (x∗ − r, x∗ + r) ⊂ Gα0 .
Choose n0 ∈ N such that
2−n0 (b − a) < r/2.
Hence, since x∗ ∈ In0 , if y ∈ In0 then from item 3 above, |y − x∗ | ≤ 2−n0 (b − a) < r/2, that is y ∈ Vr (x∗ ) ⊂
Gα0 .
Therefore
y ∈ In0 ⇒ y ∈ Gα0 ,
so that In0 ⊂ Gα0 , which contradicts the item 2 above indicated.
The proof is complete.
Proof 1.13
1 implies 2: Let E ⊂ R be a closed and bounded. Thus, since E is bounded there exists [a, b], a
bounded closed interval such that E ⊂ [a, b]. From the last theorem [a, b] is compact and since E is
closed, from Theorem 1.2.47 we may infer that E is compact.
2 implies 3: This follows from Theorem1.2.50.
3 implies 1: We prove the contrapositive, that is, the negation of 1 implies the negation of 3.
The negation of 1 is: E is not bounded or E is not closed. If E ⊂ R is not bounded, choosing x1 ∈ E,
for each n ∈ N there exists xn+1 ∈ E such that |xn+1 | > n + |xn | ≥ n. Hence {xn } has no limit points
so that we have got the negation of 3.
On the other hand, suppose E is not closed. Thus there exists x0 ∈ R such that x0 ∈ E 0 and x0 6∈ E.
Since x0 ∈ E 0 , for each n ∈ N there exists xn ∈ E such that |xn − x0 | < 1/n (xn ∈ V1/n (x0 )).
Let y ∈ E, we are going to show that y is not limit point {xn } ⊂ E. Observe that
Theorem 1.2.52 (Weierstrass) Any real set which is bounded and infinite has a limit point in R.
Proof 1.14 Let E ⊂ R be a bounded infinite set. Thus, there exists r > 0 such that E ⊂ [−r, r] = Ir . Since
E is infinite and Ir is compact, from Theorem 1.2.50, E has a limit point in Ir ⊂ R. The proof is complete.
Example 1.2.54 V = R is separable. Indeed, Q, the set of rational number, is dense in R and countable.
B = {un }n∈N ⊂ l ∞
is dense in l ∞ .
Thus, for each v ∈ A, we may select a nv ∈ N such that
d(v, w) = 1,
so that
d(v, w) ≤ d(v, unv ) + d(unv , w),
and thus
6 w, then
So to summarize, if v =
6 unw .
unv =
Let T : A → B, where
T (v) = unv .
Thus T is a bijection on Im (T ) ⊂ B.
Therefore, generically denoting C ∼ D if, and only if, there exists a bijection between the sets C and D,
we have
A ∼ Im(T ) ∼ B ∼ N.
This contradicts A to be non-countable.
So, we may infer that l ∞ is non-separable.
Exercise 1.2.57 Let (V, d) be a metric space and let {un } ⊂ V be a convergent sequence.
Show that {un } is bounded.
Definition 1.2.58 (Cauchy sequence) Let (V, d) be a metric space and let {un } ⊂ V be a sequence. We say
that such a sequence is a Cauchy one as for each ε > 0 there exists n0 ∈ N such that if m, n > n0 , then
d(un , um ) < ε.
Exercise 1.2.59 Let (V, d) be a metric space and let {un } ⊂ V be a Cauchy sequence.
Show that {un } is bounded.
Definition 1.2.60 (Complete metric space) Let (V, d) be a metric space. We say that V is complete as each
Cauchy sequence in V converges to an element of V.
1. Show that u ∈ M if, and only if, there exists a sequence {un } ⊂ M such that
un → u, as n → ∞.
2. Show that M is closed if, and only if, the following property is valid:
If {un } ⊂ M and un → u, then u ∈ M.
Exercise 1.2.62 Let (V, d) be a metric space and let M ⊂ V. Show that M is complete if, and only if, M is
closed in V.
Exercise 1.2.64 Prove that c is complete, where c is the space of complex convergent sequences.
Exercise 1.2.65 Let (V, d) be a metric space where V = C([a, b]) whit the metric
Z b
d(u, v) = |u(x) − v(x)| dx, ∀u, v ∈ V.
a
If there exists an isometry betwween V and V1 , we say that V and V1 are isometric.
Theorem 1.3.2 (Completion) Let (V, d) be a metric space which is not complete.
Under such hypotheses, there exists a metric space (V̂ , dˆ) such that V is isometric to a sub-space W of V̂
which is dense in V̂ . Moreover, V̂ is complete.
16 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 1.15
1. First part: Construction of V̂ .
Let {un } and {u0n } be Cauchy sequences in V .
We define a relation of equivalence in the set of Cauchy sequences in V by declaring
{un } ∼ {u0n }
as
lim d(un , u0n ) = 0.
n→∞
Let
V̂ = {{d
un } : {un } is a Cauchy sequence in V },
and where
un } = {{u0n } ⊂ V, such that {u0n }is a Cauchy sequence and {u0n } ∼ {un }}.
{d
Thus
lim d(un , vn ) = lim d(u0n , v0n ), ∀{u0n } ∈ û, {v0n } ∈ v̂.
n→∞ n→∞
Define
W = {b̂ = {b,\
b, b, . . .} : b ∈ V }.
Define also T : V → W by
T (b) = b̂ = {b,\
b, b, . . .}.
Thus,
dˆ(b̂, ĉ) = lim d(b, c) = d(b, c).
n→∞
Therefore, T is an isometry.
We are going to show that W is dense in V̂ .
Let û ∈ V̂ and {un } ∈ û. Let ε > 0.
Since {un } is a Cauchy sequence, there exists n0 ∈ N such that if m, n > n0 , then
Choose N > n0 .
Thus, d(un , uN ) < ε/2, ∀n > n0 .
Observe that
ûN = {uN , u\
N , uN , . . .} ∈ W,
and
dˆ(û, ûN ) = lim d(un , uN ) ≤ ε/2 < ε .
n→∞
18 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
û ∈ W 0 ∪W, ∀û ∈ V̂ ,
Observe that
Let ε > 0 (a new one). Hence, there exists n0 ∈ N such that if m, n > n0 , then
ε
dˆ(ûn , ûm ) < .
3
Thus, if
3
m, n > max , n0 ,
ε
then
Indeed,
Therefore,
lim dˆ(ûn , û) = 0.
n→∞
Thus, V̂ is complete.
The proof is complete.
Metric Spaces 19
Definition 1.4.2 Let (U, d) be a metric space. We say that {Fk } ⊂ U is a nested sequence of sets if
F1 ⊃ F2 ⊃ F3 ⊃ ....
Theorem 1.4.3 If (U, d) is a complete metric space then every nested sequence of non-empty closed sets
{Fk } is such that
lim diam(Fk ) = 0
k→+∞
Proof 1.16 Suppose {Fk } is a nested sequence and lim diam(Fk ) = 0. For each n ∈ N select un ∈ Fn .
k→∞
Suppose given ε > 0. Since
lim diam(Fn ) = 0,
n→∞
d(un , um ) < ε.
Hence, {un } is a Cauchy sequence. Being U complete, there exists u ∈ U such that
un → u as n → ∞.
u ∈ F̄m = Fm .
Theorem 1.4.4 Let (U, d) be a metric space. If A ⊂ U is compact then it is closed and bounded.
Proof 1.17 We have already proved that A is closed. Suppose, to obtain contradiction that A is not bounded.
Thus for each K ∈ N there exists u, v ∈ A such that
d(u, v) > K.
Observe that
A ⊂ ∪u∈A B1 (u).
Since A is compact there exists u1 , u2 , ..., un ∈ A such that
A =⊂ ∪nk=1 B1 (uk ).
20 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Define
R = max{d(ui , u j ) | i, j ∈ {1, ..., n}}.
Choose u, v ∈ A such that
u ∈ B1 (ui ), v ∈ B1 (u j ).
Thus
Definition 1.4.5 (Relative compactness) In a metric space (U, d) a set A ⊂ U is said to be relatively com-
pact if A is compact.
Definition 1.4.6 (ε - nets) Let (U, d) be a metric space. A set N ⊂ U is said to be a ε-net with respect to a
set A ⊂ U if for each u ∈ A there exists v ∈ N such that
d(u, v) < ε.
Definition 1.4.7 Let (U, d) be a metric space. A set A ⊂ U is said to be totally bounded if for each ε > 0
there exists a finite ε-net with respect to A.
Proposition 1.4.8 Let (U, d) be a metric space. If A ⊂ U is totally bounded then it is bounded.
Proof 1.18 Choose u, v ∈ A. Let {u1 , ..., un } be the 1 − net with respect to A. Define
Thus
Theorem 1.4.9 Let (U, d) be a metric space. If from each sequence {un } ⊂ A we can select a convergent
subsequence {unk }, then A is totally bounded.
Proof 1.19 Suppose, to obtain contradiction, that A is not totally bounded. Thus there exists ε0 > 0 such
that there exists no ε0 -net with respect to A. Choose u1 ∈ A, hence {u1 } is not a ε0 -net, that is, there exists
u2 ∈ A such that
d(u1 , u2 ) > ε0 .
Metric Spaces 21
Again {u1 , u2 } is not a ε0 -net for A, so that there exists u3 ∈ A such that
Clearly, we cannot extract a convergent subsequence of {un }, otherwise such a subsequence would be Cauchy
contradicting (1.19). The proof is complete.
Definition 1.4.10 (Sequentially compact sets) Let (U, d) be a metric space. A set A ⊂ U is said to be se-
quentially compact if for each sequence {un } ⊂ A there exists a subsequence {unk } and u ∈ A such that
unk → u, as k → ∞.
Theorem 1.4.11 A subset A of a metric space (U, d) is compact if and only if it is sequentially compact.
Proof 1.20 Suppose A is compact. By Proposition 2.4.8, A is countably compact. Let {un } ⊂ A be a
sequence. We have two situations to consider.
1. {un } has infinitely many equal terms, that is in this case we have
unk → u, as k → ∞.
δ (un ) → δ0 as n → ∞.
Since A is sequentially compact, we may obtain a subsequence {unk } and u0 ∈ A such that
δ (u0 )
d (unk , u0 ) < . (1.20)
4
We claim that
δ (u0 )
δ (unk ) ≥ , if k > K0 .
4
To prove the claim, suppose
z ∈ B δ (u0 ) (unk ), ∀k > K0 ,
4
Since
δ (u0 )
< δ (u0 ),
2
there exists some α1 ∈ L such that
B δ (u0 ) (u0 ) ⊂ Gα1 .
2
However, since
δ (u0 )
d(unk , u0 ) < , if k > K0 ,
4
we obtain
B δ (u0 ) (u0 ) ⊃ B δ (u0 ) (unk ), if k > K0 ,
2 4
so that
δ (u0 )
δ (unk ) ≥ , ∀k > K0 .
4
Therefore,
δ (u0 )
lim δ (unk ) = δ0 ≥ .
k→∞ 4
Choose ε > 0 such that
δ0 > ε > 0.
From the last theorem, since A it is sequentially compact, it is totally bounded. For the ε > 0 chosen above,
consider an ε-net contained in A (the fact that the ε-net may be chosen contained in A is also a consequence
of last theorem) and denote it by N, that is,
N = {v1 , ..., vn } ∈ A.
Proof 1.21 Suppose A is relatively compact. Thus A is compact so that from the last Theorem, A is
sequentially compact.
Thus from each sequence in A we may select a subsequence which converges to some element of A. In
particular, for each sequence in A ⊂ A, we may select a subsequence that converges to some element of A.
Conversely, suppose that for each sequence in A we may select a convergent subsequence. It suffices
to prove that A is sequentially compact. Let {vn } be a sequence in A. Since A is dense in A, there exists a
sequence {un } ⊂ A such that
1
d(un , vn ) < .
n
From the hypothesis we may obtain a subsequence {unk } and u0 ∈ A such that
unk → u0 , as k → ∞.
Thus,
vnk → u0 ∈ A, as k → ∞.
Therefore, A is sequentially compact so that it is compact.
2. If (U, d) is a complete metric space and A ⊂ U is totaly bounded then A is relatively compact.
Proof 1.22
1. Suppose A ⊂ U is relatively compact. From the last theorem, from each sequence in A, we can extract
a convergent subsequence. From Theorem 1.4.9, A is totally bounded.
2. Let (U, d) be a metric space and let A be a totally bounded subset of U.
Let {un } be a sequence in A. Since A is totally bounded for each k ∈ N, we find a εk -net where εk = 1/k,
denoted by Nk , where
(k) (k) (k)
Nk = {v1 , v2 , ..., vnk }.
In particular for k = 1 {un } is contained in the 1-net N1 . Thus at least one ball of radius 1 of N1
(1)
contains infinitely many points of {un }. Let us select a subsequence {unk }k∈N of this infinite set
(which is contained in a ball of radius 1). Similarly, we may select a subsequence here just partially
(2) (1)
relabeled {unl }l∈N of {unk } which is contained in one of the balls of the 21 -net. Proceeding in this
24 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
(k)
fashion for each k ∈ N, we may find a subsequence denoted by {unm }m∈N of the original sequence
contained in a ball of radius 1/k.
(k)
Now consider the diagonal sequence denoted by {unk }k∈N = {zk }. Thus
2
d(zn , zm ) < , if m, n > k,
k
that is, {zk } is a Cauchy sequence, and since (U, d) is complete, there exists u ∈ U such that
zk → u as k → ∞.
Proof 1.23 Let {un } be a countable dense set in (U, d). By hypothesis, { fn (u1 )} is a bounded sequence,
therefore it has a convergent subsequence, which is denoted by { fnk (u1 )}. Let us denote
f˜2,k (u1 ) → g1 , as k → ∞.
f˜2,k (u2 ) → g2 , as k → ∞.
Proceeding in this fashion for each m ∈ N we may obtain { f˜m,k } such that
where the set {g1 , g2 , ..., gm } is obtained as above. Consider the diagonal sequence
{ f˜k,k },
Metric Spaces 25
{ fnk } = { f˜k,k }
which is convergent in
E = {un }n∈N .
Now suppose K ⊂ U, being K compact. Suppose given ε > 0. From the equi-continuity hypothesis there
exists δ > 0 such that if u, v ∈ U and d(u, v) < δ we have
ε
| fnk (u) − fnk (v)| < , ∀k ∈ N.
3
Observe that
K ⊂ ∪u∈K B δ (u),
2
and being K compact we may find {ũ1 , ..., ũM } such that
K ⊂ ∪M
j=1 B δ (ũ j ).
2
lim fnk (v j )
k→∞
| fnk (u) − fnl (u)| ≤ | fnk (u) − fnk (v jˆ)| + | fnk (v jˆ) − fnl (v jˆ)|
+| fnl (v jˆ) − fnl (u)|
ε ε ε
≤ + + = ε. (1.21)
3 3 3
Since u ∈ K is arbitrary, we conclude that { fnk } is uniformly Cauchy on K.
The proof is complete.
Chapter 2
2.1 Introduction
The main objective of this chapter is to present an outline of the basic tools of analysis necessary to develop
the subsequent chapters. We assume the reader has a background in linear algebra and elementary real anal-
ysis at an undergraduate level. The main references for this chapter are the excellent books on functional
analysis, Rudin [67], Bachman and Narici [6] and Reed and Simon [62]. All proofs are developed in detail.
Remark 2.2.2 From now on we may drop the dot (·) in scalar multiplications and denote α · u simply as αu.
Topological Vector Spaces 27
Definition 2.2.3 (Vector subspace) Let U be a vector space. A set V ⊂ U is said to be a vector subspace
of U if V is also a vector space with the same operations as those of U. If V 6= U we say that V is a proper
subspace of U.
Definition 2.2.4 (Finite dimensional space) A vector space is said to be of finite dimension if there exists
fixed u1 , u2 , ..., un ∈ U such that for each u ∈ U there are corresponding α1 , ...., αn ∈ F for which
n
u = ∑ αi u i . (2.1)
i=1
Definition 2.2.5 (Topological spaces) A set U is said to be a topological space if it is possible to define a
collection σ of subsets of U called a topology in U, for which are valid the following properties:
1. U ∈ σ ,
2. 0/ ∈ σ ,
3. if A ∈ σ and B ∈ σ then A ∩ B ∈ σ , and
4. arbitrary unions of elements in σ also belong to σ .
Any A ∈ σ is said to be an open set.
Remark 2.2.6 When necessary, to clarify the notation, we shall denote the vector space U endowed with the
topology σ by (U, σ ).
Definition 2.2.7 (Closed sets) Let U be a topological space. A set A ⊂ U is said to be closed if U \ A is open.
We also denote U \ A = Ac = {u ∈ U | u 6∈ A}.
A \ B = {u ∈ A | u 6∈ B}.
Proof 2.1
1. Since 0/ is open and U = 0/c , by Definition 2.2.7, U is closed. Similarly, since U is open and 0/ =
U \U = U c , 0/ is closed.
2. A, B closed implies that Ac and Bc are open, and by Definition 2.2.5, Ac ∩ Bc is open, so that A ∪ B =
(Ac ∩ Bc )c is closed.
3. Consider A = ∩λ ∈L Aλ , where L is a collection of indices and Aλ is closed, ∀λ ∈ L. We may write
A = (∪λ ∈L Acλ )c and since Acλ is open ∀λ ∈ L we have, by Definition 2.2.5, that A is closed.
¯ as the intersection of
Definition 2.2.10 (Closure) Given A ⊂ U, we define the closure of A, denoted by A,
all closed sets that contain A.
Remark 2.2.11 From Proposition 2.2.9 Item 3 we have that Ā is the smallest closed set that contains A, in
the sense that, if C is closed and A ⊂ C then Ā ⊂ C.
28 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Definition 2.2.12 (Interior) Given A ⊂ U we define its interior, denoted by A◦ , as the union of all open sets
is contained in A.
Definition 2.2.14 (Neighborhood) Given u0 ∈ U we say that V is a neighborhood of u0 if such a set is open
and contains u0 . We denote such neighborhoods by Vu0 .
Proposition 2.2.15 If A ⊂ U is a set such that for each u ∈ A there exists a neighborhood Vu 3 u such that
Vu ⊂ A, then A is open.
Proof 2.2 This follows from the fact that A = ∪u∈A Vu and any arbitrary union of open sets is open.
Definition 2.2.16 (Function) Let U and V be two topological spaces. We say that f : U → V is a function
if f is a collection of pairs (u, v) ∈ U × V such that for each u ∈ U there exists only one v ∈ V such that
(u, v) ∈ f .
In such a case we denote
v = f (u).
Proposition 2.2.19 A function f : U → V is continuous if and only if f −1 (V ) is open for each open V ⊂ V ,
where
f −1 (V ) = {u ∈ U | f (u) ∈ V }. (2.2)
Proof 2.3 Suppose f −1 (V ) is open whenever V ⊂ V is open. Pick u ∈ U and any open V such that
f (u) ∈ V . Since u ∈ f −1 (V ) and f ( f −1 (V )) ⊂ V , we have that f is continuous at u ∈ U. Since u ∈ U
is arbitrary we have that f is continuous. Conversely, suppose f is continuous and pick V ⊂ V open. If
f −1 (V ) = 0/ we are done, since 0/ is open. Thus, suppose u ∈ f −1 (V ), since f is continuous, there exists Vu
a neighborhood of u such that f (Vu ) ⊂ V . This means Vu ⊂ f −1 (V ) and therefore, from Proposition 2.2.15,
f −1 (V ) is open.
6 u2 , there
Definition 2.2.20 We say that (U, σ ) is a Hausdorff topological space if, given u1 , u2 ∈ U, u1 =
exists V1 , V2 ∈ σ such that
Definition 2.2.21 (Base) A collection σ 0 ⊂ σ is said to be a base for σ if every element of σ may be repre-
sented as a union of elements of σ 0 .
Definition 2.2.22 (Local base) A collection σ̂ of neighborhoods of a point u ∈ U is said to be a local base
at u if each neighborhood of u contains a member of σ̂ .
Definition 2.2.23 (Topological vector spaces) A vector space endowed with a topology, denoted by (U, σ ),
is said to be a topological vector space if and only if
1. Every single point of U is a closed set,
2. The vector space operations (addition and scalar multiplication) are continuous with respect to σ .
Topological Vector Spaces 29
More specifically, addition is continuous if, given u, v ∈ U and V ∈ σ such that u + v ∈ V then there exists
Vu 3 u and Vv 3 v such that Vu + Vv ⊂ V . On the other hand, scalar multiplication is continuous if given
α ∈ F, u ∈ U and V 3 α · u, there exists δ > 0 and Vu 3 u such that, ∀β ∈ F satisfying |β − α| < δ we have
β Vu ⊂ V .
and
The continuity of such functions is a straightforward consequence of the continuity of vector space operations
(addition and scalar multiplication). It is clear that the respective inverse maps, namely T−u0 and M1/α0 are
also continuous. So if V is open, then u0 + V , that is, (T−u0 )−1 (V ) = Tu0 (V ) = u0 + V is open. By analogy
α0 V is open. Thus σ is completely determined by a ‘local base’, so that the term local base will be understood
henceforth as a local base at θ . So to summarize, a local base of a topological vector space is a collection Ω
of neighborhoods of θ , such that each neighborhood of θ contains a member of Ω.
Now we present some simple results, namely:
Proposition 2.2.24 If A ⊂ U, then ∀u ∈ A, there exists a neighborhood V of θ such that u + V ⊂ A
Proof 2.5 Let A ⊂ U open and u ∈ A. V = A − u is a neighborhood of θ and by definition of local base,
there exists a set VΩu ⊂ V such that VΩu ∈ Ω. Thus, we may write
V1 + V2 ⊂ V (2.7)
u0 + U + U ⊂ u0 + V ⊂ U \ {u1 } (2.8)
so that
6 u1 , ∀v1 , v2 ∈ U ,
u0 + v1 + v2 = (2.9)
30 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
or
u0 + v1 6= u1 − v2 , ∀v1 , v2 ∈ U , (2.10)
and since U = −U
Definition 2.3.2 (Bounded sets) A set A ⊂ U is said to be bounded if to each neighborhood of zero V there
corresponds a number s > 0 such that A ⊂ tV for each t > s.
is said to be convex.
Definition 2.3.4 (Locally convex spaces) A topological vector space U is said to be locally convex if there
is a local base Ω whose elements are convex.
Definition 2.3.5 (Balanced sets) A set A ⊂ U is said to be balanced if αA ⊂ A, ∀α ∈ F is such that |α| ≤ 1.
Proof 2.7
1. Suppose U is a neighborhood of zero. From the continuity of scalar multiplication, there exist V
(neighborhood of zero) and δ > 0, such that αV ⊂ U whenever |α| < δ . Define W = ∪|α|<δ αV ,
thus W ⊂ U is a balanced neighborhood of zero.
2. Suppose U is a convex neighborhood of zero in U. Define
As 0 · θ = θ (where θ ∈ U denotes the zero vector) from the continuity of scalar multiplication there
exists δ > 0 and there is a neighborhood of zero V such that if |β | < δ then β V ⊂ U . Define W as
the union of all such β V . Thus, W is balanced and α −1 W = W as |α| = 1, so that W = αW ⊂ αU ,
and hence W ⊂ A, which implies that the interior A◦ is a neighborhood of zero. Also, A◦ ⊂ U . Since
A is an intersection of convex sets, it is convex and so is A◦ . Now will show that A◦ is balanced and
complete the proof. For this, it suffices to prove that A is balanced. Choose r and β such that 0 ≤ r ≤ 1
and |β | = 1. Then
Since αU is a convex set that contains zero, we obtain rαU ⊂ αU , so that rβ A ⊂ A, which completes
the proof.
Proposition 2.3.7 Let U be a topological vector space and V a neighborhood of zero in U. Given u ∈ U,
there exists r ∈ R+ such that β u ∈ V , ∀β such that |β | < r.
Topological Vector Spaces 31
Proof 2.8 Observe that u + V is a neighborhood of 1 · u, then by the continuity of scalar multiplication,
there exists W neighborhood of u and r > 0 such that
so that
βu ∈ u+V , (2.16)
or
and thus
Proof 2.9 Let u ∈ U, then αu ∈ V for any α sufficiently small, from the last proposition u ∈ αV
1
. As
rn → ∞ we have that rn > α1 for n sufficiently big, so that u ∈ rn V , which completes the proof.
Proposition 2.3.9 Suppose {δn } is sequence such that δn → 0, δn < δn−1 , ∀n ∈ N and V a bounded neigh-
borhood of zero in U, then {δn V } is a local base for U.
Proof 2.10 Let U be a neighborhood of zero; as V is bounded, there exists t0 ∈ R+ such that V ⊂ tU
for any t > t0 . As lim δn = 0, there exists n0 ∈ N such that if n ≥ n0 then δn < t10 , so that δn V ⊂ U , ∀n such
n→∞
that n ≥ n0 .
Definition 2.3.10 (Convergence in topological vector spaces) Let U be a topological vector space. We say
{un } converges to u0 ∈ U, if for each neighborhood V of u0 then, there exists N ∈ N such that
un ∈ V , ∀n ≥ N.
Definition 2.3.11 (Dense set) Let (V, σ ) be a topological vector space (T.V.E.). Let A, B ⊂ V . We say that A
is dense in B as
B ⊂ A.
Definition 2.3.12 We say that topological vector space V is separable as it has a dense and countable set.
α ≺ γ and β ≺ γ.
Definition 2.3.14 (Net) Let (V, σ ) be a topological space. A net in (V, σ ) is a function defined on a directed
system I with range in V , where we denote such a net by
{uα }α∈I ,
and where
uα ∈ V, ∀α ∈ I.
Definition 2.3.15 (Convergent net) Let (V, σ ) be a topological space and let {uα }α∈I be a net in V .
We say that such a net converges to u ∈ V as for each neighborhood W ∈ σ of u there exists β ∈ I such
that if α β , then
uα ∈ W.
Definition 2.3.16 Let (V, σ ) be a topological space and let {uα }α ∈I be a net in V . We say that u ∈ V is a
cluster point of the net in question as for each neighborhood W ∈ σ of u and each β ∈ I, there exists α β
such that
uα ∈ W.
Definition 2.3.17 (Limit point) Let (V, σ ) be a topological space and let A ⊂ V. We say that u ∈ V is a limit
point of A as for each neighborhood W ∈ σ of u, there exists v ∈ W ∩ A such that v 6= u.
W1 ≺ W2 ⇔ W2 ⊂ W1 .
From the exposed above we may obtain a net {uw }W ∈I such that
uw → u.
uw2 ∈ A \ {u}.
Summarizing,
u ∈ A, ∀u ∈ A ∪ A0 .
Therefore,
A ∪ A0 ⊂ A. (2.19)
Reciprocally, suppose u ∈ A.
Topological Vector Spaces 33
If u ∈ A, then u ∈ A ∪ A0 .
Suppose, to obtain contradiction, that u 6∈ A and u 6∈ A0 .
Thus there exists a neighborhood W ∈ σ of u such that
W ∩ A = 0/.
uα → u.
f (Vu ) ⊂ W f (u) .
uα ∈ Vu .
Therefore,
f (uα ) ∈ W f (u) , if α β .
Since W f (u) is arbitrary, it follows that
f (uα ) → f (u).
Reciprocally, suppose
f (uα ) → f (u)
whenever
uα → u.
Suppose, to obtain contradiction, that f is not continuous at u.
Thus there exists W f (u) ∈ σ2 such that f (u) ∈ W f (u) and so that for each neighborhood W ∈ σ1 of u there
exists uW ∈ W such that
f (uW ) 6∈ W f (u) .
34 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
W1 ≺ W2 ⇔ W2 ⊂ W1 .
Definition 2.4.2 (Compact sets) A set B ⊂ U is said to be compact if each open covering of B has a finite
sub-covering. More explicitly, if B ⊂ ∪α ∈A Oα , where Oα is open ∀α ∈ A, then, there exist α1 , ..., αn ∈ A
such that B ⊂ Oα1 ∪ ... ∪ Oαn , for some n, a finite positive integer.
Proof 2.13 Suppose K is compact. Let {uα }α∈I ⊂ K be a net with infinite distinct terms (otherwise the
result is immediate).
Denote E = {uα }α ∈I . Suppose, to obtain contradiction, that no point of K is a limit point of E.
Hence, for each u ∈ K, there exists a neighborhood Wu of u such that
Wu ∩ E = 0/,
or
Wu ∩ E = {u} if u ∈ E.
In any case each Wu has no more than one point of E.
Observe that ∪u∈K Wu ⊃ K. Since K is compact, there exist u1 , . . . , un ∈ K such that
E ⊂ K ⊂ ∪nj=1Wu j .
From this we may conclude that E has no more than n distinct elements, which contradicts E to have
infinity distinct terms.
Reciprocally, suppose that each net {uα }α ∈I ⊂ K has at least one limit point in K.
Suppose, to obtain contradiction K is not compact.
Thus there exists an open covering {Gα , α ∈ L} of K which admits no finite sub-covering.
Denote by F the finite sub-collections of {Gα , α ∈ L}.
Hence, for a W ∈ F we may select a uW 6∈ W where uW ∈ K.
Let us partially order F through the relation
W1 ≺ W2 ⇔ W1 ⊂ W2 .
Observe that
u ∈ K ⊂ ∪α∈L Gα .
Thus, there exists α0 ∈ L such that
u ∈ Gα0 .
Since u is a limit point of {uW }W ∈F ⊂ K, there exists W1 Gα0 such that
uW1 ∈ Gα0 ⊂ W1 .
Proof 2.14 Let U be a Hausdorff space and consider A ⊂ U, A compact. Given x ∈ A and y ∈ Ac , there
exist open sets Ox and Oyx such that x ∈ Ox , y ∈ Oyx and Ox ∩ Oyx = 0. / It is clear that A ⊂ ∪x∈A Ox and since A
is compact, we may find {x1 , x2 , ..., xn } such that A ⊂ ∪ni=1 Oxi . For the selected y ∈ Ac we have y ∈ ∩ni=1 Oyxi
n O xi ) ∩ (∪n O ) = 0.
and (∩i= / Since ∩ni=1 Oyxi is open, and y is an arbitrary point of Ac we have that Ac is
1 y i=1 xi
open, so that A is closed, which completes the proof. The next result is very useful.
Theorem 2.4.5 Let {Kα , α ∈ L} be a collection of compact subsets of a Hausdorff topological vector space
U, such that the intersection of every finite sub-collection (of {Kα , α ∈ L}) is non-empty.
Under such hypotheses
∩α ∈L Kα 6= 0/.
∩α∈L Kα = 0/.
That is,
6
α=α
Kα0 ∩ [∩α∈L 0 Kα ] = 0/.
Thus,
6
∩α ∈L 0 Kα ⊂ Kαc 0 ,
α=α
so that
α 6=α
Kα0 ⊂ [∩α∈L 0 Kα ]c ,
6
Kα0 ⊂ [∪α∈L 0 Kαc ].
α=α
Proposition 2.4.6 Let U be a topological Hausdorff space and let A ⊂ B where A is closed and B is compact.
Under such hypotheses, A is compact.
36 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 2.16 Consider {Oα , α ∈ L} an open cover of A. Thus {Ac , Oα , α ∈ L} is a cover of U, so that
it is a cover of B. As B is compact, there exist α1 , α2 , ..., αn such that Ac ∪ (∪ni=1 Oαi ) ⊃ B ⊃ A, so that
{Oαi , i ∈ {1, ..., n}} covers A. From this we may infer that A is compact. The proof is complete.
Definition 2.4.7 (Countably compact sets) A set A is said to be countably compact if every infinite subset
of A has a limit point in A.
Proposition 2.4.8 Every compact subset of a Hausdorff topological space U is countably compact.
Proof 2.17 Let B an infinite subset of A compact and suppose B has no limit point in A, so that there is no
any limit point. Choose a countable infinite set {x1 , x2 , x3 , ....} ⊂ B and define F = {x1 , x2 , x3 , ...}. It is clear
that F has no limit point. Thus for each n ∈ N, there exist On open such that On ∩ F = {xn }. Also, for each
x ∈ A \ F, there exist Ox such that x ∈ Ox and Ox ∩ F = 0. / Thus {Ox , x ∈ A \ F; O1 , O2 , ...} is an open cover
of A without a finite subcover, which contradicts the fact that A is compact.
Theorem 2.4.10 let (V, σ ) be a topological vector space. Let A ⊂ V be such that A 6= 0/.
Under such hypotheses, Conv(A) is convex.
Proof 2.18 Let u, v ∈ Conv(A) and let λ ∈ [0, 1]. Thus, there exist n1 , n2 ∈ N such that
n1 n2
u= ∑ λk uk and v = ∑ λ̃k vk ,
k=1 k=1
where
λ λk ≥ 0, uk ∈ A, ∀k ∈ {1, . . . , n1 }
and
(1 − λ )λ̃k ≥ 0, vk ∈ A, ∀k ∈ {1, . . . , n2 }
so that
n1 n2
∑ λ λk + ∑ (1 − λ )λ̃k = λ + (1 − λ ) = 1.
k=1 k=1
Therefore,
λ u + (1 − λ v) ∈ Conv(A), ∀u, v ∈ Conv(A), λ ∈ [0, 1].
Hence, Conv(A) is convex.
The proof is complete.
Topological Vector Spaces 37
Theorem 2.4.11 Let (V, σ ) be a topological vector space. Let A ⊂ V be such that A =
6 0/.
Under such hypotheses, A is convex if, and only if, Conv(A) = A.
We shall do it by induction on n.
Observe that for n = 1 and n = 2, from the convexity of A we obtain A = B1 and A = B2 .
Let n ∈ N. Suppose A = Bn . We are going to prove that A = Bn+1 which will complete the induction.
Clearly Bn ⊂ Bn+1 , so that A ⊂ Bn+1 .
Reciprocally, let u ∈ Bn+1 . Thus, there exist u1 , . . . , un+1 ∈ A and λ1 , . . . λn+1 such that λk ≥ 0, ∀k ∈
+1
{1, . . . , n + 1}, ∑nk=1 λk = 1, and
n+1
u= ∑ λk u k .
k=1
With no loss in generality, assume 0 < λn+1 < 1 (otherwise the conclusion is immediate).
Thus,
λ1 + · · · + λn = (1 − λn+1 ) > 0.
Hence,
λ1 λn
+···+ = 1.
1 − λn+1 1 − λn+1
Therefore, defining
λk
λ̃k = ≥ 0, ∀k ∈ {1, . . . , n}
1 − λn+1
we have that
n
∑ λ̃k = 1,
k=1
so that
n
w= ∑ λ̃k uk ∈ Bn = A.
k=1
Since A convex, we obtain
w1 = (1 − λn+1 )w + λn+1 un+1 ∈ A,
that is,
n+1
w1 = ∑ λk uk = u ∈ A, ∀u ∈ Bn+1 .
k=1
Thus,
Bn+1 ⊂ A,
and hence
Bn+1 = A.
This completes the induction, that is,
A = Bn , ∀n ∈ N.
Hence,
A = ∪∞
n=1 Bn = Conv(A).
Reciprocally, assume A = Conv(A). Since Conv(A) is convex, A is convex.
The proof is complete.
38 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Conv(A) ⊂ B = Conv(B).
Proposition 2.4.13 Let (V, σ ) be a topological vector space. Suppose that a non-empty A ⊂ V is open. Under
such hypotheses, Conv(A) is open.
Proof 2.20 Let u ∈ Conv(A). Thus, there exist n ∈ N, uk ∈ A, λk ≥ 0, ∀k ∈ {1, . . . , n} such that ∑nk=1 λk = 1,
and u = ∑nk=1 λk uk .
With no loss in generality, assume λ1 6= 0 (redefine the indices, if necessary).
Since u1 ∈ A and A is open, there exists a neighborhood Vu1 of u1 such that Vu1 ⊂ A.
Thus, W = λ1Vu1 + λ2 u2 + · · · + λn un ⊂ Conv(A).
Observe that W is open and u ∈ W ⊂ Conv(A).
Therefore u is an interior point of Conv(A), ∀u ∈ Conv(A). Thus, Conv(A) is open.
This completes the proof.
Proposition 2.4.14 Let (V, σ ) be a topological vector space. Suppose A ⊂ V is convex and A◦ 6= 0/. Under
such hypotheses, A◦ is convex.
Proof 2.21 Let u, v ∈ A◦ and λ ∈ [0, 1]. Thus, there exist neighborhoods Vu of u and Vv of v such that Vu ⊂ A
and Vv ⊂ A. Hence,
B ≡ Vu ∪Vv ⊂ A.
Therefore, since A is convex, we obtain
Conv(B) ⊂ Conv(A) = A.
From the last proposition Conv(B) is open and moreover Conv(B) ⊂ A◦ . Thus,
Remark 2.4.15 Let (V, σ ) be a topological vector space a let A ⊂ V be a non-empty open set..
Thus, tA is open, ∀t ∈ F such that t 6= 0.
Let B ⊂ V be a balanced set such that 0 ∈ B◦ .
Let α ∈ F be such that 0 < |α| ≤ 1. Thus,
αB◦ ⊂ αB ⊂ B.
The following are some basic definitions concerning metric and normed spaces:
Definition 2.5.3 (Convergent sequences) Given a metric space U, we say that {un } ⊂ U converges to u0 ∈
U as n → ∞, if for each ε > 0, there exists n0 ∈ N, such that if n ≥ n0 then d(un , u0 ) < ε. In this case we write
un → u0 as n → +∞.
Definition 2.5.4 (Cauchy sequence) {un } ⊂ U is said to be a Cauchy sequence if for each ε > 0 there exists
n0 ∈ N such that d(un , um ) < ε, ∀m, n ≥ n0
Definition 2.5.5 (Completeness) A metric space U is said to be complete if each Cauchy sequence related
to d : U ×U → R+ converges to an element of U.
Definition 2.5.6 (Limit point) Let (U, d) be a metric space and let E ⊂ U. We say that v ∈ U is a limit point
6 v.
of E if for each r > 0 there exists w ∈ Br (v) ∩ E such that w =
Definition 2.5.7 (Interior point, topology for (U, d)) Let (U, d) be a metric space and let E ⊂ U. We say
that u ∈ E is interior point if there exists r > 0 such that Br (u) ⊂ E. If a point of E is not a limit point is said
to be an isolated one. We may define a topology for a metric space (U, d), by declaring as open all set E ⊂ U
such that all its points are interior. Such a topology is said to be induced by the metric d.
Proposition 2.5.8 Let (U, d) be a metric space. The set σ of all open sets, defined through the last definition,
is indeed a topology for (U, d).
Proof 2.22
1. Obviously 0/ and U are open sets.
2. Assume A and B are open sets and define C = A ∩ B. Let u ∈ C = A ∩ B, thus from u ∈ A, there exists
r1 > 0 such that Br1 (u) ⊂ A. Similarly from u ∈ B there exists r2 > 0 such that Br2 (u) ⊂ B.
Define r = min{r1 , r2 }. Thus Br (u) ⊂ A ∩ B = C, so that u is an interior point of C. Since u ∈ C is
arbitrary, we may conclude that C is open.
40 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
3. Suppose {Aα , α ∈ L} is a collection of open sets. Define E = ∪α ∈L Aα and we shall show that E is
open.
Choose u ∈ E = ∪α ∈L Aα . Thus there exists α0 ∈ L such that u ∈ Aα0 . Since Aα0 is open there exists
r > 0 such that Br (u) ⊂ Aα0 ⊂ ∪α ∈L Aα = E. Hence u is an interior point of E, since u ∈ E is arbitrary,
E = ∪α ∈L Aα is open.
The proof is complete.
Definition 2.5.9 Let (U, d) be a metric space and let E ⊂ U. We define E 0 as the set of all the limit points
of E.
Theorem 2.5.10 Let (U, d) be a metric space and let E ⊂ U. Then E is closed if and only if E 0 ⊂ E.
Proof 2.23 Suppose E 0 ⊂ E. Let u ∈ E c , thus u 6∈ E and u 6∈ E 0 . Therefore there exists r > 0 such that
Br (u) ∩ E = 0, / so that Br (u) ⊂ E c . Therefore u is an interior point of E c . Since u ∈ E c is arbitrary we may
infer that E is open, so that E = (E c )c is closed.
c
Remark 2.5.11 From this last result, we may conclude that in a metric space E ⊂ U is closed if and only if
E 0 ⊂ E.
Remark 2.5.13 Let (U, σ ) be a topological space. We say that the topology σ is compatible with a metric
d : U × U → R+ if σ coincides with the topology generated by such a metric. In this case we say that
d : U ×U → R+ induces the topology σ .
Definition 2.5.14 (Metrizable spaces) A topological vector space (U, σ ) is said to be metrizable if σ is
compatible with some metric d.
Definition 2.5.15 (Normable spaces) A topological vector space (U, σ ) is said to be normable if the in-
duced metric (by this norm) is compatible with σ .
Definition 2.6.2 (Null space and range) Given f : U → V , we define the null space and the range of f,
denoted by N( f ) and R( f ) respectively, as
N( f ) = {u ∈ U | f (u) = θ } (2.26)
and
Proposition 2.6.3 Let U,V be topological vector spaces. If f : U → V is linear and continuous at θ , then it
is continuous everywhere.
f (U ) ⊂ V . (2.28)
Thus
or
Proposition 2.7.2 A set E is bounded if and only if the following condition is satisfied: whenever {un } ⊂ E
and {αn } ⊂ F are such that αn → 0 as n → ∞ we have αn un → θ as n → ∞.
Proof 2.25 Suppose E is bounded. Let U be a balanced neighborhood of θ in U, then E ⊂ tU for some t.
For {un } ⊂ E, as αn → 0, there exists N such that if n > N then t < |α1n | . Since t −1 E ⊂ U and U is balanced,
we have that αn un ∈ U , ∀n > N, and thus αn un → θ . Conversely, if E is not bounded, there is a neighborhood
V of θ and {rn } such that rn → ∞ and E is not contained in rn V , that is, we can choose un such that rn−1 un
is not in V , ∀n ∈ N, so that {rn−1 un } does not converge to θ .
Then,
1 implies 2,
2 implies 3,
if U is metrizable with invariant metric,then 3 implies 4, which implies 1.
Proof 2.26
f (V ) ⊂ W . (2.31)
Thus since 2 implies 3 we have that { f (γn un )} is bounded so that, by Proposition 2.7.2 f (un ) =
γn−1 f (γn un ) → θ as n → ∞.
4. 4 implies 1: suppose 1 fails. Thus there exists a neighborhood of zero W ⊂ V such that f −1 (W )
contains no neighborhood of zero in U. Particularly, we can select {un } such that un ∈ B1/n (θ ) and
f (un ) not in W so that { f (un )} does not converge to zero. Thus 4 fails.
We claim that for each ε > 0 there exists δ > 0 such that if kukU < δ then kA(u)kV < ε.
Suppose, to obtain contradiction that the claim is false.
Topological Vector Spaces 43
1
Thus there exists ε0 > 0 such that for each n ∈ N there exists un ∈ U such that kun kU ≤ n and kA(un )kV ≥
ε0 .
Therefore un → θ and A(un ) does not converge to θ , which contradicts (2.34).
Thus the claim holds.
In particular, for ε = 1 there exists δ > 0 such that if kukU < δ then kA(u)kV < 1. Thus given an arbitrary
not relabeled u ∈ U, u 6= θ , for
δu
w=
2kukU
we have
δ kA(u)kV
kA(w)kV = < 1,
2kukU
that is,
2kukU
kA(u)kV < , ∀u ∈ U.
δ
Defining
2
K=
δ
the first part of the proof is complete. Reciprocally, suppose there exists K > 0 such that
Proof 2.28 Suppose there exists u0 ∈ U in the interior of C. Thus, there exists r > 0 such that
Br (u0 ) = {u ∈ U | ku − u0 kU < r} ⊂ C.
and hence
∀u ∈ U such that kukU < r. Fix an arbitrary not relabeled u ∈ U such that u 6= θ . From (2.35)
u r
w=
kukU 2
is such that
kA(u)kV r
kA(w)kV = ≤ 1 + kA(u0 )kV ,
kukU 2
so that
2
kA(u)kV ≤ (1 + kA(u0 )kV )kukU .
r
Since u ∈ U is arbitrary, A is bounded.
Reciprocally, suppose A is bounded. Thus
kA(u)kV ≤ KkukU , ∀u ∈ U,
Definition 2.9.2 A set S in a metric space U is said to be nowhere dense if S has an empty interior.
Theorem 2.9.3 (Baire Category theorem) A complete metric space is never the union of a countable num-
ber of nowhere dense sets.
Proof 2.29 Suppose, to obtain contradiction, that U is a complete metric space and
U = ∪∞
n=1 An
where each An is nowhere dense. Since A1 is nowhere dense, there exist u1 ∈ U which is not in Ā1 , otherwise
we would have U = Ā1 , which is not possible since U is open. Furthermore, Āc1 is open, so that we may
obtain u1 ∈ Ac1 and 0 < r1 < 1 such that
B1 = Br1 (u1 )
satisfies
B1 ∩ A1 = 0/.
Since A2 is nowhere dense we have B1 is not contained in Ā2 . Therefore we may select u2 ∈ B1 \ Ā2 and since
B1 \ Ā2 is open, there exists 0 < r2 < 1/2 such that
that is
B2 ∩ A2 = 0/.
Proceeding inductively in this fashion, for each n ∈ N we may obtain un ∈ Bn−1 \ Ān such that we may choose
an open ball Bn = Brn (un ) such that
B̄n ⊂ Bn−1 ,
Bn ∩ An = 0/
Topological Vector Spaces 45
and
0 < rn < 21−n .
Observe that {un } is a Cauchy sequence, considering that if m, n > N then un , um ∈ BN , so that
Define
u = lim un .
n→∞
Since
un ∈ BN , ∀n > N,
we get
u ∈ B̄N ⊂ BN−1 .
Therefore, u is not in AN−1 , ∀N > 1, which means u is not in ∪n=1
∞ A = U, a contradiction.
n
The proof is complete.
U = ∪∞
n=1 Bn .
Moreover, each Bn is closed. By the Baire category theorem there exists n0 ∈ N such that Bn0 has non-empty
interior. That is, there exists u0 ∈ U and r > 0 such that
Br (u0 ) ⊂ Bn0 .
kT (u)kV ≤ n0 , ∀u ∈ Br (u0 ).
kT (u + u0 )kV ≤ n0 ,
that is,
kT (u)kV − kT (u0 )kV ≤ n0 .
Thus
Proof 2.31 First we will prove that given r > 0, there exists r0 > 0 such that
V = ∪∞
n=1 A(nB1 (θ )).
By the Baire Category theorem, there exists n0 ∈ N such that the closure of A(n0 B1 (θ )) has non-empty
interior, so that A(B1 (θ )) has a non-empty interior. We will show that there exists r0 > 0 such that
A(Br2 (θ )) ⊃ BVr1 (θ ),
y ∈ A(B1 (θ ))
thus there exists u ∈ U such that kukU < 1 and A(u) = y. Therefore
ku − u0 kU ≤ kukU + ku0 kU
< 1 + ku0 kU
= r2 , (2.39)
Topological Vector Spaces 47
so that
A(u − u0 ) ∈ A(Br2 (θ )).
This means
y = A(u) ∈ A(u0 ) + A(Br2 (θ )),
and hence
A(B1 (θ )) ⊂ A(u0 ) + A(Br2 (θ )).
That is, from this and (2.38), we obtain
and therefore
A(Br2 (θ )) ⊃ BVr1 (θ ).
Since
A(Br2 (θ )) = r2 A(B1 (θ )),
we have, for some not relabeled r1 > 0 that
A(B1 (θ )) ⊃ BVr1 (θ ).
Define
∞
u= ∑ un ,
n=1
Br (u0 ) ⊂ O.
From (2.37),
A(Br (θ )) ⊃ BVr0 (θ ),
48 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 2.9.6 (The Inverse Mapping theorem) A continuous linear bijection of one Banach space onto
another has a continuous inverse.
Proof 2.32 Let A : U → V satisfying the theorem hypotheses. Since A is open, A−1 is continuous.
Theorem 2.9.8 (The Closed Graph theorem) Let U and V be Banach spaces and let A : U → V be a linear
operator. Then A is bounded if and only if its graph is closed.
Proof 2.33 Suppose Γ(A) is closed. Since A is linear Γ(A) is a subspace of U ⊕ V . Also, being Γ(A)
closed, it is a Banach space with the norm
A = Π2 ◦ Π−1
1 ,
s = |α1 | + · · · + |αn |.
vm = β1m u1 + . . . + βnm un
such that
n
∑ |β jm | = 1, ∀m ∈ N
j=1
and
kvm kV → 0, as m → ∞.
In particular
|β jm | ≤ 1, ∀m ∈ N, j ∈ {1, . . . , n}.
It is a well-known result in elementary analysis that a bounded sequence Cn has a convergent subse-
quence.
Hence, there exists a subsequence {mk } of N and
β j0 ∈ C, ∀ j ∈ {1, . . . , n}
such that
m
β j k → β j0 , ∀ j ∈ {1, . . . , n},
and
n
∑ |β j0 | = 1.
j=1
Thus
n
vmk → ∑ β j0 u j .
j=1
Since
n
∑ |β j0 | = 1,
j=1
this contradicts
vmk → 0.
Therefore, there exists c > 0 such that (2.40) holds.
The proof is complete.
Now we present the following result about the completeness of finite dimensional subspaces in a normed
vector space.
Theorem 2.10.2 Let V be a complex normed vector space and let M be finite dimensional subspace of V .
Under such hypotheses, M is complete (closed). In particular, each normed finite dimensional vector
space is complete.
50 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 2.35 Let {vm } ⊂ M be a Cauchy sequence. Let n ∈ N be the dimension of M. Let {u1 , . . . , un } be a
basis for M.
Hence, there exists a sequence {α m
j } ⊂ C such that
vm = α1m u1 + · · · + αnm un .
Let ε > 0. Since {vm } is a Cauchy sequence, there exists n0 ∈ N such that, if m, l > n0 , then
kvm − vl kV < ε.
Hence, from this and the last theorem, there exists c > 0 such that
n
ε > ∑ (α mj − α lj )u j
j=1 V
n
≥ c ∑ |α m l
j − α j|
i=1
≥ c|α m l
j − α j |, ∀m, l > n0 , ∀ j ∈ {1, · · · , n}. (2.42)
Thus {α m
j } ⊂ C is a Cauchy sequence.
Therefore, there exists a α 0j ∈ C such that
0
αm
j → α j , ∀ j ∈ {1, . . . , n}.
Definition 2.10.3 (Equivalence between two norms) Let V be a vector space. Two norms
k · k0 , k · k1 : V → R+
Theorem 2.10.4 Let V be a finite dimensional vector space. Under such hypotheses, any two norms defined
on V are equivalent.
so that there exists c > 0 which does not depend on u, such that
Hilbert Spaces
3.1 Introduction
At this point we introduce an important class of spaces namely, the Hilbert spaces, which are a special class
of metric spaces.
The main references for this chapter are [62, 24].
that is
|(u, v)H | ≤ kukH kvkH .
The proof of the remaining conclusions is left as an exercise.
k · kH : H → R
Proof 3.2 The only non-trivial property to be verified, concerning the definition of norm, is the triangle
inequality.
Observe that, given u, v ∈ H, from the Cauchy-Schwartz inequality we have,
ku + vk2H = (u + v, u + v)H
= (u, u)H + (v, v)H + 2(u, v)H
≤ (u, u)H + (v, v)H + 2|(u, v)H |
≤ kuk2H + kvk2H + 2kukH kvkH
= (kukH + kvkH )2 . (3.2)
Therefore
ku + vkH ≤ kukH + kvkH , ∀u, v ∈ H.
The proof is complete.
Definition 3.2.5 A pre-Hilbert space H is said to be a Hilbert space if it is complete, that is, if any cauchy
sequence in H converges to an element of H.
Definition 3.2.6 (Orthogonal Complement) Let H be a Hilbert space. Considering M ⊂ H we define its
orthogonal complement, denoted by M ⊥ , by
Theorem 3.2.7 Let H be a Hilbert space, M a closed subspace of H and suppose u ∈ H. Under such hy-
potheses there exists a unique m0 ∈ M such that
Moreover, n0 = u − m0 ∈ M ⊥ so that
u = m0 + n0 ,
where m0 ∈ M and n0 ∈ M⊥. Finally, such a representation through M ⊕ M ⊥ is unique.
54 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
ku − mi kH → d, as i → ∞.
Thus {mi } ⊂ M is a Cauchy sequence. Since M is closed, there exists m0 ∈ M such that
mi → m0 , as i → +∞,
so that
ku − mi kH → ku − m0 kH = d.
Define
n0 = u − m0 .
We will prove that n0 ∈ M ⊥ .
Pick m ∈ M and t ∈ R, thus we have
d2 ≤ ku − (m0 − tm)k2H
= kn0 + tmk2H
= kn0 k2H + 2(n0 , m)H t + kmk2H t 2 . (3.4)
Since
kn0 k2H = ku − m0 k2H = d 2 ,
we obtain
2(n0 , m)H t + kmk2H t 2 ≥ 0, ∀t ∈ R
so that
(n0 , m)H = 0.
Being m ∈ M arbitrary, we obtain
n0 ∈ M ⊥ .
It remains to prove the uniqueness. Let m ∈ M, thus
ku − mk2H = ku − m0 + m0 − mkH 2
since
(u − m0 , m − m0 )H = (n0 , m − m0 )H = 0.
Hilbert Spaces 55
ku − mk2H = ku − m1 + m1 − mk2H
= ku − m1 k2H + km − m1 k2H ,
≥ ku − m1 kH (3.6)
Theorem 3.2.8 (The Riesz Lemma) Let H be a Hilbert space and let f : H → R be a continuous linear
functional. Then there exists a unique u0 ∈ H such that
f (u) = (u, u0 )H , ∀u ∈ H.
Moreover
k f kH ∗ = ku0 kH .
f (u) = α f (v)
f (v)(αv, v)H
=
kvk2H
f (v)v
= αv,
kvk2H H
= (αv, u0 )H . (3.7)
56 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Therefore, f (u) equals (u, u0 )H in the space spanned by N and v. Now we show that this last space (then span
of N and v) is in fact H. Just observe that given u ∈ H we may write
f (u)v f (u)v
u = u− + . (3.8)
f (v) f (v)
Since
f (u)v
u− ∈N
f (v)
we have finished the first part of the proof, that is, we have proven that
f (u) = (u, u0 )H , ∀u ∈ H.
f (u) = (u, u1 )H , ∀u ∈ H.
Thus,
Hence, u1 = u0 .
Let us now prove that
k f kH ∗ = ku0 kH .
First observe that
Remark 3.2.9 Similarly as above, we may define a Hilbert space H over C, that is, a complex one. In this
case the complex inner product (·, ·)H : H × H → C is defined through the following properties:
1. (u, v)H = (v, u)H , ∀u, v ∈ H,
Hilbert Spaces 57
where for α = a + bi ∈ C, we have α = a − bi. Finally, similar results as those proven above are valid for
complex Hilbert spaces.
kukH = 1,
and
(u, v)H = 0, ∀u, v ∈ S, such that u 6= v.
If S is not properly contained in any other orthonormal set, it is said to be an orthonormal basis for H.
Theorem 3.3.2 Let H be a Hilbert space and let {un }Nn=1 be an orthonormal set. Then for all u ∈ H, we have
2
N N
kuk2H = ∑ |(u, un )H | 2
+ u − ∑ (u, un )H un .
n=1 n=1 H
Corollary 3.3.3 (Bessel inequality) Let H be a Hilbert space and let {un }Nn=1 be an orthonormal set. Then
for all u ∈ H, we have
N
kuk2H ≥ ∑ |(u, un )H |2 .
n=1
58 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 3.6 Define by C the collection of all orthonormal sets in H. Define an order in C by stating S1 ≺ S2
if S1 ⊂ S2 . Then C is partially ordered and obviously non-empty, since
6 θ.
v/kvkH ∈ C, ∀v ∈ H, v =
Now let {Sα }α ∈L be a linearly ordered subset of C. Clearly ∪α ∈L Sα is an orthonormal set which is an
upper bound for {Sα }α∈L .
Therefore, every linearly ordered subset has an upper bound, so that by Zorn’s lemma C has a maximal
element, that is, an orthonormal set not properly contained in any other orthonormal set.
This completes the proof.
Theorem 3.3.5 Let H be a Hilbert space and let S = {uα }α∈L be an orthonormal basis. Then for each v ∈ H
we have
v = ∑ (uα , v)H uα ,
α∈L
and
kvk2H = ∑ |(uα , v)H |2 .
α∈L
Proof 3.7 Let v ∈ H. Let L0 ⊂ L a finite subset of L. From Bessel’s inequality we have,
From this, we may infer that the set An = {α ∈ L | |(uα , v)H | > 1/n} is finite, so that
∞
A = {α ∈ L | |(uα , v)H | > 0} = ∪n=1 An
is at most countable.
6 0 for at most countably many α 0 s ∈ L, which we order by {αn }n∈N . Since the sequence
Thus (uα , v)H =
N
sN = ∑ |(uαi , v)H |2 ,
i=1
Observe that
N
(v − v0 , uαl )H = lim (v − ∑ (uαi , v)H uαi , uαl )H
N →∞
i=1
= (v, uαl )H − (v, uαl )H
= 0. (3.14)
Also, if α 6= αl , ∀l ∈ N then
∞
(v − v0 , uα )H = lim (v − ∑ (uαi , v)H uαi , uα )H = 0.
N→∞
i=1
Hence,
v − v0 ⊥uα , ∀α ∈ L.
If
v − v0 =
6 θ,
then we could obtain an orthonormal set
v − v0
uα , α ∈ L,
kv − v0 kH
which would properly contain the complete orthonormal set
{uα , α ∈ L},
a contradiction.
Therefore, v − v0 = θ , that is
N
v = lim∑ (uαi , v)H uαi .
N→∞
i=1
Observe that clearly {vn } is an orthonormal set and for each m ∈ N, {vk }m m
k=1 and {uk }k=1 span the same
vector subspace of H.
Such a process of obtaining the orthonormal set {vn } is known as the Gram-Schmidt orthonormalization.
We finish this section with the following theorem.
Theorem 3.3.6 A Hilbert space H is separable if and only if has a countable orthonormal basis. If dim(H) =
N < ∞, the H is isomorphic to CN . If dim(H) = +∞ then H is isomorphic to l 2 , where
( )
∞
l2 = {yn } | yn ∈ C, ∀n ∈ N and ∑ |yn |2 < +∞ .
n=1
60 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 3.8 Suppose H is separable and let {un } be a countable dense set in H. To obtain an orthonormal
basis it suffices to apply the Gram-Schmidt orthonormalization procedure to the greatest linearly independent
subset of {un }.
Conversely, if B = {vn } is an orthonormal basis for H, the set of all finite linear combinations of elements
of B with rational coefficients are dense in H, so that H is separable.
Moreover, if dim(H) = +∞ consider the isomorphism F : H → l 2 given by
d ≤ k f − vn kH < d + 1/n.
that is,
k2 f − (vn + vm )k2H + kvn − vm k2H = 2(k f − vn k2H + k f − vm k2H ),
so that
!
2
vn + vm
kvn − vm k2H = −4 f− + 2(k f − vn k2H + k f − vm k2H )
2 H
→ −4d 2 + 2d 2 + 2d 2
= 0, as m, n → ∞. (3.15)
kvn − ukH → 0, as n → ∞.
Hilbert Spaces 61
Therefore,
k f − vn kH → k f − ukH = d, as n → ∞.
Now let v ∈ K and t ∈ [0, 1]. Define
w = (1 − t)u + tv.
Observe that, since K is convex, w ∈ K, ∀t ∈ [0, 1].
Hence,
k f − uk2H ≤ k f − wk2H
= k f − (1 − t )u − tvk2H
= k( f − u) + t(u − v)k2H
= k f − uk2H + 2( f − u, u − v)H t + t 2 ku − vk2H . (3.16)
( f − u, v − u) ≤ 0, ∀v ∈ K.
Summarizing,
k f − vkH ≥ k f − ukH , ∀v ∈ K.
Suppose now that u1 , u2 ∈ K be such that
( f − u1 , v − u1 )H ≤ 0, ∀v ∈ K,
( f − u2 , v − u2 )H ≤ 0, ∀v ∈ K.
With v = u2 in the last first inequality and v = u1 in the last second one, we get
( f − u1 , u2 − u1 )H ≤ 0
62 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
( f − u2 , u1 − u2 )H ≤ 0.
Adding these two last inequalities, we obtain
( f , u2 − u1 )H + ( f , u1 − u2 )H − (u1 , u2 − u1 )H + (u2 , u2 − u1 )H ≤ 0,
that is,
ku2 − u1 k2H ≤ 0,
so that
ku1 − u2 kH = 0,
and therefore,
u1 = u2 .
Hence, the u ∈ K in question is unique.
The proof is complete.
Proposition 3.4.3 Let H be a Hilbert space and K ⊂ H a non-empty, closed and convex set.
Let f ∈ H. Define PK ( f ) = u where u ∈ K is such that
kPK f1 − PK f2 kH ≤ k f1 − f2 kH , ∀ f1 , f2 ∈ H.
( f1 − u1 , v − u1 )H ≤ 0, ∀v ∈ K
and
( f2 − u2 , v − u2 )H ≤ 0, ∀v ∈ K.
With v = u2 in the first last inequality and v = u1 in the last second one, we obtain
( f1 − u1 , u2 − u1 )H ≤ 0,
( f2 − u2 , u1 − u2 )H ≤ 0.
Adding these two last inequalities, we get
( f1 , u2 − u1 )H − ( f2 , u2 − u1 )H + (u2 − u1 , u2 − u1 )H ≤ 0,
that is,
ku2 − u1 k2H ≤ ( f2 − f1 , u2 − u1 )H
≤ k f2 − f1 kH ku2 − u1 kH , (3.18)
so that
ku2 − u1 kH ≤ k f2 − f1 kH .
This completes the proof.
Hilbert Spaces 63
Corollary 3.4.4 Let H be a Hilbert space and let M ⊂ H be a closed vector subspace of H.
Let f ∈ H. Thus, u = PM ( f ) is such that u ∈ M and
( f − u, v)H = 0, ∀v ∈ M.
( f − u, v − u)H ≤ 0, ∀v ∈ M.
Let v ∈ M be such that v 6= 0.
Thus,
( f − u,tv − u)H ≤ 0, ∀t ∈ R.
Hence,
t( f − u, v)H ≤ ( f − u, u)H , ∀t ∈ R.
From this we obtain
( f − u, v)H = 0, ∀v ∈ M.
Theorem 3.5.2 (Stampacchia) Let H be a Hilbert space and let a : H × H → R be a bounded and coercive
bilinear form.
Let K ⊂ H be a non-empty, closed and convex set. Under such hypotheses, for each f ∈ H there exists a
unique u ∈ K such that
a(u, v − u) ≥ ( f , v − u)H , ∀v ∈ K. (3.19)
Moreover, if a is symmetric, that is, a(u, v) = a(v, u), ∀u, v ∈ H, such u ∈ K in question is also such that
1 a(v, v)
a(u, u) − ( f , u)H = min − ( f , v)H . (3.20)
2 v∈K 2
64 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
α2
0 < 1− < 1.
c
Hence,
kT (v1 ) − T (v2 )kH ≤ λ kv1 − v2 kH ,
where r
α2
λ= 1−
< 1.
c
From this and Banach fixed point theorem, there exists u ∈ K such that
T (u) = u,
that is,
PK (ρ f − ρAu + u) = u.
Hilbert Spaces 65
Therefore,
a(g, g) − 2a(g, u) + a(u, u) ≤ a(g, g) − 2a(g, v) + a(v, v),
that is,
a(u, u) a(v, v)
− a(g, u) ≤ − a(g, v),
2 2
so that
a(u, u) a(v, v)
− ( f , u)H ≤ − ( f , v)H , ∀v ∈ K .
2 2
The proof is complete.
Corollary 3.5.3 (Lax-Milgram theorem) Assume a(u, v) is a bounded, coercive and symmetric bilinear
form on H. Under such hypotheses, there exists a unique u ∈ H such that
a(u, v) = ( f , v), ∀v ∈ H.
Moreover, such a u ∈ H is such that
1 1
a(u, u) − ( f , u)H = min a(v, v) − ( f , v)H .
2 v∈H 2
Proof 3.14 The proof follows from the Stampacchia theorem with K = H and Remark 3.4.5.
Chapter 4
4.1 Introduction
In this chapter we present the Hahn-Banach theorems and some important applications. Also, a study
on weak topologies is developed in details.
Finally, we highlight the main reference for this chapter is Brezis [24].
Proof 4.1 Choose z ∈ U \V . Denote by Ṽ the space spanned by V and z, that is,
Ṽ = {v + αz | v ∈ V and α ∈ R}. (4.6)
The Hahn-Banach Theorems and the Weak Topologies 67
and therefore,
1 1
[−p(v1 − αz) + g(v1 )] ≤ [p(v2 + β z) − g(v2 )],
α β
∀v1 , v2 ∈ V, α, β > 0. Thus, there exists a ∈ R such that
1 1
sup (−p(v − αz) + g(v)) ≤ a ≤ inf ( p(v + αz) − g(v)) . (4.9)
v∈V,α >0 α v∈V,α>0 α
g̃(αz + v) = aα + g(v)
1
≤ (p(v + αz) − g(v)) α + g(v)
α
= p(v + αz). (4.10)
g̃(αz + v) = aα + g(v)
1
≤ (−p(v + αz) + g(v)) α + g(v)
−α
= p(v + αz) (4.11)
and hence,
g̃(u) ≤ p(u), ∀u ∈ Ṽ .
Define now by E the set of all extensions e of g, which satisfy e(u) ≤ p(u) on the domain of e, where such a
domain is always a subspace of U. We shall also define a partial order for E denoting e1 ≺ e2 , as the domain
of e2 contains the domain of e1 and e1 = e2 on the domain of e1 . Let {eα }α ∈A be an ordered subset of E .
Let Vα be the domain of eα , ∀α ∈ A. Define e on ∪α ∈AVα by setting e = eα on Vα . Clearly, eα ≺ e, ∀α ∈ A
so that each ordered subset of E has an upper bound. From this and Zorn Lemma, E has a maximal element
f defined on some subspace Ũ ⊂ U such that f (u) ≤ p(u), ∀u ∈ Ũ. Suppose, to obtain contradiction, that
6 U and let z1 ∈ U \Ũ. As indicated above we may obtain an extension f1 from Ũ to the subspace spanned
Ũ =
by z1 and Ũ, which contradicts the maximality of f .
The proof is complete.
68 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Definition 4.2.2 (Topological dual spaces) Let U be a Banach space. We will define its dual topological
space, as the set of all linear continuous functionals defined on U. We suppose such a dual space of U
may be represented by another vector space U ∗ , through a bilinear form h·, ·iU : U ×U ∗ → R (here we are
referring to standard representations of dual spaces of Sobolev and Lebesgue spaces, to be addressed in the
subsequent chapters). Thus, given f : U → R linear and continuous, we assume the existence of a unique
u∗ ∈ U ∗ such that
Corollary 4.2.3 Let V ⊂ U be a proper subspace of U and let g : V → R be a linear and continuous func-
tional with norm
and
Proof 4.2 It suffices to apply Theorem 4.2.1 with p(u) = kgkV ∗ kukV . Indeed, from such a theorem, there
exists a linear functional f : U → R such that
f (u) = g(u), ∀u ∈ V
and
f (u) ≤ p(u) = kgkV ∗ kukU ,
that is,
| f (u)| ≤ p(u) = kgkV ∗ kukU , ∀u ∈ U.
Therefore,
k f kU ∗ = sup{| f (u)| : kukU ≤ 1} ≤ kgkV ∗ .
u∈U
On the other hand,
k f kU ∗ ≥ sup{| f (u)| : kukU ≤ 1} = kgkV ∗ .
u∈V
Thus,
k f kU ∗ = kgkV ∗ .
Finally, since f is linear and continuous, there exists u∗ ∈ U ∗ such that
f (u) = hu, u∗ iU , ∀u ∈ U,
and hence
hu, u∗ iU = f (u) = g(u), ∀u ∈ V.
Moreover,
ku∗ kU ∗ = k f kU ∗ = kgkV ∗ .
The proof is complete.
The Hahn-Banach Theorems and the Weak Topologies 69
Corollary 4.2.4 Let u0 ∈ U. Under such hypotheses, there exists u∗0 ∈ U ∗ such that
Proof 4.3 It suffices to apply the Corollary 4.2.3 with V = {αu0 | α ∈ R} and g(tu0 ) = tku0 kU2 so that
kgkV ∗ = ku0 kU .
Indeed, from the last corollary, there exists u∗0 ∈ U ∗ such that
and
ku∗0 kU ∗ = kgkV ∗ ,
where,
kgkV ∗ = sup{tku0 kU2 : ktu0 kU ≤ 1} = ku0 kU .
t∈R
Finally,
htu0 , u∗0 iU = g(tu0 ) = tku0 kU2 , ∀t ∈ R,
so that
hu0 , u∗0 iU = ku0 kU2 .
This completes the proof.
Corollary 4.2.5 Let u ∈ U. Under such hypotheses
we have
However, from the last corollary, there exists u∗0 ∈ U ∗ such that ku∗0 kU ∗ = kukU and hu, u∗0 iU = kukU2 . Define
u∗1 = kukU−1 u∗0 . Thus, ku∗1 kU = 1 and hu, u∗1 iU = kukU .
The proof is complete.
Definition 4.2.6 (Affine hyperplane) Let U be a Banach space. An affine hyperplane H is a set defined by
H = {u ∈ U | hu, u∗ iU = α} (4.20)
Proof 4.5 The result follows directly from the continuity of hu, u∗ iU as functional on U.
70 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Definition 4.2.8 (Separation) Let A, B ⊂ U. We say that a hyperplane H, as above indicated separates A
and B, as there exist α ∈ R and u∗ ∈ U ∗ such that
We say that H separates A and B strictly if there exists ε > 0 such that
Theorem 4.2.9 (The Hahn-Banach theorem, the geometric form) Let A, B ⊂ U be two non-empty, convex
sets such that A ∩ B = 0/ and A is open. Under such hypotheses, there exists a closed hyperplane which
separates A and B, that is, there exist α ∈ R and u∗ ∈ U ∗ such that
hu, u∗ iU ≤ α ≤ hv, u∗ iU , ∀u ∈ A, v ∈ B.
and
Moreover,
p(u + v) ≤ p(u) + p(v), ∀u, v ∈ U.
Proof 4.6 Let r > 0 be such that B(0, r) ⊂ C. Let u ∈ U such that u 6= 0. Thus,
u
r ∈ B(0, r) ⊂ C,
kukU
and therefore
kukU
p(u) ≤ , ∀u ∈ U (4.26)
r
which proves (4.24). Suppose now u ∈ C. Since C is open there exists ε > 0 sufficiently small such that
(1 + ε )u ∈ C. Thus, p(u) ≤ 1+1 ε < 1. Reciprocally, if p(u) < 1, there exists 0 < α < 1 such that α −1 u ∈ C
and hence, since C is convex, we get u = α (α −1 u) + (1 − α)0 ∈ C.
u v tu (1−t )v
Finally, let u, v ∈ C and ε > 0. Thus, p(u)+ε ∈ C and p(v)+ε ∈ C so that p(u)+ε + p(v)+ε ∈ C, ∀t ∈ [0, 1].
p(u)+ε u+v
Particularly, for t = p(u)+p(v)+2ε we obtain p(u)+p(v)+2ε ∈ C, and thus,
Lemma 4.2.11 Let C ⊂ U be an non-empty, open and convex set and let u0 ∈ U such that u0 6∈ C. Under
such hypotheses, there exists u∗ ∈ U ∗ such that hu, u∗ iU < hu0 , u∗ iU , ∀u ∈ C
The Hahn-Banach Theorems and the Weak Topologies 71
Proof 4.7 By translation, if necessary, there is no loos in generality in assuming 0 ∈ C. Consider the
functional p defined in the last lema. Define V = {αu0 | α ∈ R}. Define also g on V , by
g(tu0 ) = t, ∀t ∈ R. (4.27)
Here, we have applied the Lemma 4.2.10. In particular, f (u0 ) = g(u0 ) = g(1u0 ) = 1, also from the last
lemma, f (u) < 1, ∀u ∈ C. The existence of u∗ satisfying this lemma conclusion follows from the continuity
of f , indicated in (4.28).
Proof of Theorem 4.2.9. Define C = A + (−B) so that Cis convex and 0 6∈ C. From Lemma 4.2.11, there
exists u∗ ∈ U ∗ such that hw, u∗ iU < 0, ∀w ∈ C, and thus,
Proposition 4.2.12 Let U be a Banach space and let A, B ⊂ U be such that A is compact, B is closed and
A ∩ B = 0/.
Under such hypotheses, there exists ε1 > 0 such that
kvn j − v0 kU → 0, as j → ∞.
kun j − v0 kU → 0, as j → ∞,
72 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
kwn j − v0 kU → 0, as j → ∞.
Since A and B are closed we may infer that
v0 ∈ A ∩ B,
Theorem 4.2.13 (The Hahn-Banach theorem, the second geometric form) Let A, B ⊂ U be two non-
empty, convex sets such that A ∩ B = 0/. Suppose A is compact and B is closed. Under such hypotheses,
there exists an hyperplane which separates A and B strictly.
Proof 4.9 Observe that, from the last proposition, there exists ε > 0 sufficiently small such that Aε =
A + B(0, ε) and Bε = B + B(0, ε) are disjoint and convex sets. From Theorem 4.2.9, there exists u∗ ∈ U ∗ such
that u∗ 6= 0 and
Corollary 4.2.14 Suppose V ⊂ U is a vector subspace such that V 6= U. Under such hypotheses, there exists
u∗ ∈ U ∗ such that u∗ 6= 0 and
hu, u∗ iU = 0, ∀u ∈ V. (4.35)
Proof 4.10 Let u0 ∈ U be such that u0 6∈ V . Applying Theorem 4.2.9 to A = V and B = {u0 } we obtain
u∗ ∈ U ∗ and α ∈ R such that u∗ 6= 0 e
Proposition 4.3.2 A Banach space U is Hausddorff as endowed with the weak topology σ (U,U ∗ ).
The Hahn-Banach Theorems and the Weak Topologies 73
Proof 4.11 Choose u1 , u2 ∈ U such that u1 6= u2 . From the Hahn-Banach theorem, second geometric form,
there exists an hyperplane separating {u1 } and {u2 } strictly, thats is, there exist u∗ ∈ U ∗ and α ∈ R such that
Define
and
We claim that
Vw1 (u1 ) ∩Vw2 (u2 ) = 0/.
Suppose, to obtain contradiction, there exists u ∈ Vw1 (u1 ) ∩Vw2 (u2 ).
Thus,
hu − u1 , u∗ iU < α − hu1 , u∗ iU ,
and therefore
hu, u∗ iU < α.
Also
−hu − u2 , u∗ iU < hu2 , u∗ iU − α,
and hence
hu, u∗ iU > α.
We have got
hu, u∗ iU < α < hu, u∗ iU ,
a contradiction.
Summarizing, we have obtained u1 ∈ Vw1 (u1 ), u2 ∈ Vw2 (u2 ) and Vw1 (u1 ) ∩ Vw2 (u2 ) = 0.
/
The proof is complete.
Remark 4.3.3 If {un } ∈ U is such that un converges to u in σ (U,U ∗ ), then we write un * u, weakly.
4. If un * u weakly and u∗n → u∗ strongly (in norm) in U ∗ , then hun , u∗n iU → hu, u∗ iU .
Proof 4.12
1. The result follows from the definition of σ (U,U ∗ ).
Indeed, suppose that {un } ⊂ U and un * u, weakly.
Let u∗ ∈ U ∗ and let ε > 0.
Define
Vw (u) = {v ∈ U : |hv − u, u∗ iU | < ε}.
74 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
un ∈ Vw (u).
That is,
|hun − u, u∗ iU | < ε,
if n > n0 .
Therefore,
hun , u∗ iU → hu, u∗ iU , as n → ∞
∀u∗ ∈ U ∗ .
Reciprocally, suppose that
hun , u∗ iU → hu, u∗ iU , as n → ∞
∀u∗ ∈ U ∗ .
Let V (u) ∈ σ (U,U ∗ ) be a set which contains {u}.
Thus, there exists a weak neighborhood a Vw (u) such that u ∈ Vw (u) ⊂ V (u), where there exist m ∈ N,
εi > 0 and u∗i ∈ U ∗ such that
From the hypotheses, for each i ∈ {1, · · · m}, there exists ni ∈ N such that if n > ni , then
Define n0 = max{n1 , · · · , nm }.
Thus,
un ∈ Vw (u) ⊂ V (u), if n > n0 .
3. For each u∗ ∈ U ∗ the sequence {hun , u∗ iU } is convergent for some bounded sequence. From this and
the Uniform Boundedness Principle, there exists M > 0 such that kun kU ≤ M, ∀n ∈ N. Moreover, for
u∗ ∈ U ∗ , we have
Thus,
Theorem 4.3.5 Let U be a Banach space and let A ⊂ U be a non-empty convex set. Under such hypotheses,
A is closed for the topology σ (U,U ∗ ) if, and only if A is closed for the topology induced by k · kU .
Proof 4.13 If A = U, the result is immediate. Thus, assume A =6 U. Suppose that A is strongly closed. Let
u0 6∈ A. From the Hahn-Banach theorem there exists a closed hyperplane which separates u0 and A strictly,
that is, there exist α ∈ R and u∗ ∈ U ∗ such that
Define
so that u0 ∈ V , V ⊂ U \ A.
Let
Vw (u0 ) = {v ∈ U : |hv − u0 , u∗ iU | < α − hu0 , u∗ iU .
Let v ∈ Vw (u0 ).
Thus,
hv, u∗ iU = hv − u0 + u0 , u∗ iU
= hv − u0 , u∗ iU + hu0 , u∗ iU
≤ |hv − u0 , u∗ iU | + hu0 , u∗ iU
< α − hu0 , u∗ iU + hu0 , u∗ iU
= α. (4.48)
From this we may infer that Vw (u0 ) ⊂ V ⊂ U \ A, that is, u0 is an interior point for σ (U,U ∗ ) of U \ A,
∀u0 ∈ U \ A
Therefore, V is weakly open.
Summarizing, U \ A is open in σ (U,U ∗ ) and thus A is closed for σ (U,U ∗ ) (weakly closed).
Finally, the reciprocal is immediate.
Theorem 4.3.6 Let (Z, σ ) be a topological space and let U be a Banach space. Let φ : Z → U be a function,
considering U with the weak topology σ (U,U ∗ ).
Under such hypotheses, φ is continuous if, and only if, fu∗ : Z → R, where
is continuous, ∀u∗ ∈ U ∗ .
76 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 4.14 Assume φ is continuous. Let z0 ∈ Z and let {zα }α∈I be a net such that
zα → z0 .
is continuous, ∀u∗ ∈ U ∗ .
Suppose, to obtain contradiction, that φ is not continuous.
Thus, there exists z0 ∈ Z such that φ is not continuous at z0 .
In particular, there exists a net {zα }α∈I such that zα → z0 and we do not have
hφ (zα ), u∗ iU → hφ (z0 ), u∗ iU ,
Under such hypotheses, BU ∗ is compact for U ∗ with the weak-star topology σ (U ∗ ,U).
Proof 4.15 For each u ∈ U, we shall associate a real number ωu and denote
ω= ∏ ωu ∈ RU ,
u∈U
φ (u∗ ) = ∏ hu, u∗ iU .
u∈U
We shall show that φ is continuous. Suppose, to obtain contradiction that φ is not continuous. Thus, there
exists u∗ ∈ U ∗ such that φ is not continuous at u∗ .
Hence there exist a net {u∗α }α∈I such that
u∗α → u∗ in σ (U ∗ ,U),
Thus,
|Pu j (φ (u∗αβ )) − Pu j (φ (u∗ ))| = |hu j , u∗αβ − u∗ iU |
≥ ε j , ∀αβ ∈ I. (4.49)
Therefore, we do not have,
hu j , u∗αβ iU → hu j , u∗ iU ,
that is, we do not have,
u∗α → u∗ , em σ (U ∗ ,U),
a contradiction.
Hence, φ is continuous with RU with the topology σ above specified.
We shall prove now that
φ −1 : φ (U ∗ ) → U ∗
is also continuous.
This follows from a little adaptation of the last proposition, considering that
Theorem 4.5.2 (Kakutani) Let U be a Banach space. Then U is reflexive if and only if
BU = {u ∈ U | kukU ≤ 1} (4.56)
Proof 4.16 Suppose U is reflexive, then J(BU ) = BU ∗∗ . From the last theorem BU ∗∗ is compact for the
topology σ (U ∗∗ ,U ∗ ). Therefore it suffices to verify that J −1 : U ∗∗ → U is continuous from U ∗∗ with the
topology σ (U ∗∗ ,U ∗ ) to U, with the topology σ (U,U ∗ ).
From Theorem 4.3.6 it is sufficient to show that the function u 7→ hJ −1 u, f iU is continuous for the topol-
ogy σ (U ∗∗ ,U ∗ ), for each f ∈ U ∗ . Since
hJ −1 u, f iU = h f , uiU ∗ ,
2.
n n
∑ βi αi ≤ ∑ βi fi , ∀β1 , ..., βn ∈ R. (4.57)
i=1 i=1 U∗
n
Proof. 1 ⇒ 2: Fix β1 , ..., βn ∈ R, ε > 0 and define S = ∑i=1 |βi |. From 1, we have
n n
∑ βi huε , fi iU − ∑ βi αi < εS (4.58)
i=1 i=1
and therefore
n n
∑ βi αi − ∑ βi huε , fi iU < εS (4.59)
i=1 i=1
or
n n n
∑ βi αi < ∑ βi fi kuε kU + εS ≤ ∑ βi fi + εS (4.60)
i=1 i=1 U∗ i=1 U∗
so that
n n
∑ βi αi ≤ ∑ βi f i (4.61)
i=1 i=1 U∗
since ε is arbitrary.
80 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Now let us show that 2 ⇒ 1. Define ~α = (α1 , ..., αn ) ∈ Rn and consider the function ϕ(u) =
(h f1 , uiU , ..., h fn , uiU ). Item 1 is equivalent to ~α belongs to the closure of ϕ(BU ). Let us suppose that ~α
does not belong to the closure of ϕ(BU ) and obtain a contradiction. Thus we can separate ~α and the closure
of ϕ(BU ) strictly, that is there exists ~β = (β1 , ..., βn ) ∈ Rn and γ ∈ R such that
Proof 4.17 Let u∗∗ ∈ BU ∗∗ and consider Vu∗∗ a neighborhood of u∗∗ for the topology σ (U ∗∗ ,U ∗ ). It suffices
to show that J(BU ) ∩ Vu∗∗ = / As Vu∗∗ is a weak neighborhood, there exists f1 , ..., fn ∈ U ∗ and ε > 0 such
6 0.
that
n n n
∑ βi αi = h ∑ βi fi , u∗∗ iU ∗ ≤ ∑ βi fi , (4.64)
i=1 i=1 i=1 U∗
so that from Helly lemma, there exists uε ∈ U such that kuε kU ≤ 1 and
or,
and hence
Now we will complete the proof of Kakutani theorem. Suppose BU is weakly compact (that is, compact
for the topology σ (U,U ∗ )). Observe that J : U → U ∗∗ is weakly continuous, that is, it is continuous with U
endowed with the topology σ (U,U ∗ ) and U ∗∗ endowed with the topology σ (U ∗∗ ,U ∗ ). Thus, as BU is weakly
compact, we have that J(BU ) is compact for the topology σ (U ∗∗ ,U ∗ ). From the last lemma, J(BU ) is dense
BU ∗∗ for the topology σ (U ∗∗ ,U ∗ ). Hence J(BU ) = BU ∗∗ , or J(U) = U ∗∗ , which completes the proof.
Proposition 4.5.5 Let U be a reflexive Banach space. Let K ⊂ U be a convex closed bounded set. Then K is
weakly compact.
Proof 4.18 From Theorem 4.3.5, K is weakly closed (closed for the topology σ (U,U ∗ )). Since K is
bounded, there exists α ∈ R+ such that K ⊂ αBU . Since K is weakly closed and K = K ∩ αBU , we have that
it is weakly compact.
Proposition 4.5.6 Let U be a reflexive Banach space and M ⊂ U a closed subspace. Then M with the norm
induced by U is reflexive.
The Hahn-Banach Theorems and the Weak Topologies 81
It can be easily verified that these two topologies coincide (through restrictions and extensions of linear
forms). From the Kakutani Theorem it suffices to show that BM is compact for the topology σ (M, M ∗ ). But
BU is compact for σ (U,U ∗ ) and M ⊂ U is closed (strongly) and convex so that it is weakly closed, thus from
last proposition, BM is compact for the topology σ (U,U ∗ ), and therefore it is compact for σ (M, M ∗ ).
Theorem 4.6.3 Let U be a Banach space such that U ∗ is separable. Then U is separable.
so that for each n ∈ N, there exists un ∈ U such that kun kU = 1 and hu∗n , un iU ≥ 12 ku∗n kU ∗ .
Define U0 as the vector space on Q spanned by {un }, and U1 as the vector space on R spanned by {un }.
It is clear that U0 is dense in U1 and we will show that U1 is dense in U, so that U0 is a dense set in U. For,
suppose u∗ is such that hu, u∗ iU = 0, ∀u ∈ U1 . Since {u∗n } is dense in U ∗ , given ε > 0, there exists n ∈ N such
that ku∗n − u∗ kU ∗ < ε, so that
1 ∗
ku kU ∗ ≤ hun , u∗n iU = hun , u∗n − u∗ iU + hun , u∗ iU
2 n
≤ ku∗n − u∗ kU ∗ kun kU + 0
< ε (4.70)
or
Therefore, since ε is arbitrary, ku∗ kU ∗ = 0, that is u∗ = θ . By Corollary 4.2.14 this completes the proof.
Proof 4.21 Suppose U is reflexive, as BU ∗ is compact for σ (U ∗ ,U) and σ (U ∗ ,U) = σ (U ∗ ,U ∗∗ ) we have
that BU ∗ is compact for σ (U ∗ , U ∗∗ ), which means that U ∗ is reflexive.
Suppose U ∗ is reflexive, from above U ∗∗ is reflexive. Since J(U) is a closed subspace of U ∗∗ , from
Proposition 4.5.6, J(U) is reflexive. From the Kakutani Theorem J(BU ) is weakly compact. At this point
we shall prove that J −1 : J(U) → V is continuous from J(U) with the topology σ (U ∗∗ ,U ∗ ) to U with the
topology σ (U,U ∗ ).
Let {u∗∗α }α∈I ⊂ J(BU ) be a net such that
u∗∗ ∗∗
α * u0
82 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
weakly in σ (U ∗∗ ,U ∗ ).
Let u∗ ∈ U ∗ . Thus,
hu∗ , u∗∗ ∗ ∗∗
α iU ∗ → hu , u0 iU ∗ .
From this
hu∗ , J(J −1 (u∗∗ ∗ −1 ∗∗
α ))iU ∗ → hu , J(J (u0 ))iU ∗ ,
so that
h(J −1 (u∗∗ ∗ −1 ∗∗ ∗
α )), u iU → h(J (u0 )), u iU .
Since the net is question and u∗ ∈ U ∗ have been arbitrary, we may infer that J −1 is weakly continuous for
the concerning topology.
Hence J −1 (J(BU )) is also weakly compact so that from this, from the fact that BU is weakly closed and
BU ⊂ J −1 J(BU ),
Proposition 4.6.5 Let U be a Banach space. Then U is reflexive and separable if and only if U ∗ is reflexive
and separable.
Proof 4.22 Let {un } be a dense countable set in BU . For each u∗ ∈ U ∗ define
∞
1
ku∗ kw = ∑ 2n | hun , u∗ iU |.
n=1
ku∗ kw ≤ ku∗ kU .
d(u∗ , v∗ ) = ku∗ − v∗ kw .
Now we shall prove that the topology induced by d coincides with σ (U ∗ ,U) in U ∗ .
For, let u∗0 ∈ BU ∗ and let V be neighborhood of u∗0 in σ (U ∗ ,U).
We need to prove that there exists r > 0 such that
Since {un } is dense in U, for each i ∈ {1, ..., k} there exists ni ∈ N such that
ε
kuni − vi kU < .
4
Choose r > 0 small enough such that
ε
2ni r < , ∀i ∈ {1, ..., k}.
2
We are going to show that Vw ⊂ V , where
For u∗ ∈ Vw we have
k
1
d(u∗ , u∗0 ) = ∑ 2n |hun , u∗ − u∗0 iU |
n=1
∞
1
+ ∑ n
|hun , u∗ − u∗0 iU |
n=k+1 2
∞
1
< ε +2 ∑ n
n=k+1 2
1
= ε+ . (4.73)
2k−1
Hence, it suffices to take ε = r/2, and k sufficiently big such that
1
< r/2.
2k−1
The first part of the proof is finished.
Conversely, assume BU is metrizable in σ (U ∗ ,U). We are going to show that U is separable.
84 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Define,
∗ ∗ 1
Ṽn = u ∈ BU ∗ | d(u , θ ) < .
n
From the first part, we may find Vn a neighborhood of zero in σ (U ∗ ,U) such that
Vn ⊂ Ṽn .
Thus D is countable and we are going to prove that such a set is dense in U.
For, suppose u∗ ∈ U ∗ is such that
hu, u∗ iU = 0, ∀u ∈ D.
Hence,
u∗ ∈ Vn ⊂ Ṽn , ∀n ∈ N,
so that u∗ = θ .
The proof is complete.
Theorem 4.7.2 (Milman Pettis) Every uniformly convex Banach space is reflexive.
Proof 4.23 Let η ∈ U ∗∗ be such that kηkU ∗∗ = 1. It suffices to show that η ∈ J(BU ). Since J(BU ) is closed
in U ∗∗ , we have only to show that for each ε > 0 there exists u ∈ U such that kη − J(u)kU ∗∗ < ε.
Thus, suppose given ε > 0. Let δ > 0 be the corresponding constant relating the uniformly convex prop-
erty.
Choose f ∈ U ∗ such that k f kU ∗ = 1 and
δ
h f , ηiU ∗ > 1 − . (4.74)
2
Define
δ
V = ζ ∈ U ∗∗ | |h f , ζ − ηiU ∗ | < .
2
Observe that V is a neighborhood of η in σ (U ∗∗ ,U ∗ ). Since J(BU ) is dense in BU ∗∗ concerning the topology
σ (U ∗∗ ,U ∗ ), we have that V ∩ J(BU ) 6= 0/ and thus there exists u ∈ BU such that J(u) ∈ V. Suppose, to obtain
contradiction, that
kη − J(u)kU ∗∗ > ε.
Therefore, defining
W = (J(u) + εBU ∗∗ )c ,
The Hahn-Banach Theorems and the Weak Topologies 85
2h f , ηiU ∗ < hu + v, f iU + δ
≤ ku + vkU + δ . (4.75)
ku − vkU ≤ ε. (4.76)
The main references for this chapter are Reed and Simon [62] and Bachman and Narici [6].
The topology related to the metric induced by this norm is called the uniform operator topology.
Let us introduce now the strong operator topology, which is defined as the weakest topology for which
the functions
Eu : L (U,Y ) → Y
are continuous where
Eu (A) = Au, ∀A ∈ L (U,Y ).
For such a topology a base at origin is given by sets of the form
kAn u − AukY → 0, as n → ∞, ∀u ∈ U.
In the next lines we describe the weak operator topology in L (U,Y ). Such a topology is weakest one
such that the functions
Eu,v : L (U,Y ) → C
are continuous, where
Eu,v (A) = hAu, viY , ∀A ∈ L (U,Y ), u ∈ U, v ∈ Y ∗ .
For such a topology, a base at origin is given by sets of the form
as n → ∞, ∀u ∈ U, v ∈ Y ∗ .
Hence, T (u) is a continuous linear functional on U and considering our fundamental representation hypoth-
esis, there exists u∗ ∈ U ∗ such that
T (u) = hu, u∗ iU , ∀u ∈ U.
We define A∗ by setting u∗ = A∗ v∗ , so that
that is,
hu, A∗ v∗ iU = hAu, v∗ iY , ∀u ∈ U, v∗ ∈ Y ∗ .
We call A∗ : Y ∗ → U ∗ the adjoint operator relating A : U → Y.
Theorem 5.2.2 Let U,Y be Banach spaces and let A : U → Y be a bounded linear operator. Then
kAk = kA∗ k.
4. kAA∗ k = kAk2 .
Proof 5.2
1. Observe that
(ABu, v)H = (Bu, A∗ v)H = (u, B∗ A∗ v)H , ∀u, v ∈ H.
2. Observe that
(Au, v)H = (u, A∗ v)H = (A∗ v, u)H = (v, A∗∗ u)H = (A∗∗ u, v)H , ∀u, v ∈ H.
3. We have that
I = AA−1 = A−1 A,
so that
I = I ∗ = (AA−1 )∗ = (A−1 )∗ A∗ = (A−1 A)∗ = A∗ (A−1 )∗ .
4. Observe that
kA∗ Ak ≤ kAkkA∗ k = kAk2 ,
and
and hence
kA∗ Ak = kAk2 .
A = A∗ .
Theorem 5.2.5 Let U and Y be Banach spaces and let A : U → Y be a bounded linear operator. Then
[R(A)]⊥ = N(A∗ ),
where
[R(A)]⊥ = {v∗ ∈ Y ∗ | hAu, v∗ iY = 0, ∀u ∈ U}.
Proof 5.3 Let v∗ ∈ N(A∗ ). Choose v ∈ R(A). Thus there exists u in U such that Au = v so that
N(A∗ ) ⊂ [R(A)]⊥ .
Topics on Linear Operators 89
hAu, v∗ iY = 0,
so that
hu, A∗ v∗ iU , ∀u ∈ U.
Therefore, A∗ v∗ = θ , that is, v∗ ∈ N(A∗ ). Since v∗ ∈ [R(A)]⊥ is arbitrary, we get
[R(A)]⊥ ⊂ N(A∗ ).
Lemma 5.1
Let U,Y be Banach spaces and let A : U → Y be a bounded linear operator. Suppose also that R(A) =
{A(u) : u ∈ U} is closed. Under such hypotheses, there exists K > 0 such that for each v ∈ R(A) there exists
u0 ∈ U such that
A(u0 ) = v
and
ku0 kU ≤ KkvkY .
Proof 5.4 Define L = N(A) = {u ∈ U : A(u) = θ } (the null space of A). Consider the space U/L, where
U/L = {u : u ∈ U},
where
u = {u + w : w ∈ L}.
Define A : U/L → R(A), by
A(u) = A(u).
Observe that A is one-to-one, linear, onto and bounded. Moreover R(A) is closed so that it is a Banach space.
Hence by the inverse mapping theorem we have that A has a continuous inverse. Thus, for any v ∈ R(A) there
exists u ∈ U/L such that
A(u) = v
so that
−1
u = A (v),
and therefore
−1
kuk ≤ kA kkvkY .
Recalling that
kuk = inf {ku + wkU },
w∈L
Theorem 5.1
Let U,Y be Banach spaces and let A : U → Y be a bound linear operator. Assume R(A) is closed. Under such
hypotheses
R(A∗ ) = [N(A)]⊥ .
u∗ = A∗ (v∗ ).
R(A∗ ) ⊂ [N(A)]⊥ .
hu, u∗ iU = 0, ∀u ∈ N(A).
Fix v ∈ R(A). From the Lemma 5.1, there exists K > 0 (which does not depend on v) and uv ∈ U such
that
A(uv ) = v
and
kuv kU ≤ KkvkY .
Define f : R(A) → R by
f (v) = huv , u∗ iU .
Observe that
| f (v)| ≤ kuv kU ku∗ kU ∗ ≤ KkvkY ku∗ kU ∗ ,
so that f is a bounded linear functional. Hence by a Hahn-Banach theorem corollary, there exists v∗ ∈ Y ∗
such that
f (v) = hv, v∗ iY ≡ F(v), ∀v ∈ R(A),
that is, F is an extension of f from R(A) to Y .
In particular
f (v) = huv , u∗ iU = hv, v∗ iY = hA(uv ), v∗ iY ∀v ∈ R(A),
where A(uv ) = v, so that
huv , u∗ iU = hA(uv ), v∗ iY ∀v ∈ R(A).
Now let u ∈ U and define A(u) = v0 . Observe that
u = (u − uv0 ) + uv0 ,
and
A(u − uv0 ) = A(u) − A(uv0 ) = v0 − v0 = θ .
Since u∗ ∈ [N(A)]⊥ , we get
hu − uv0 , u∗ iU = 0
Topics on Linear Operators 91
so that
hu, u∗ iU = h(u − uv0 ) + uv0 , u∗ iU
= huv0 , u∗ iU
= hA(uv0 ), v∗ iY
= hA(u − uv0 ) + A(uv0 ), v∗ iY
= hA(u), v∗ iY . (5.3)
Hence,
hu, u∗ iU = hA(u), v∗ iY , ∀u ∈ U.
We may conclude that u∗ = A∗ (v∗ ) ∈ R(A∗ ). Since u∗ ∈ [N(A)]⊥ is arbitrary we obtain
[N(A)]⊥ ⊂ R(A∗ ).
The proof is complete.
Theorem 5.3.2 A compact operator maps weakly convergent sequences into norm convergent sequences.
un * u weakly in U.
hAun , v∗ iY = hun , A∗ v∗ iU
→ hu, A∗ v∗ iU
= hAu, v∗ iY . (5.4)
Suppose Aun does not converge in norm to Au. Thus there exists ε > 0 and a subsequence {Aunk } such that
kAunk − AukY ≥ ε, ∀k ∈ N.
As {unk } is bounded and A is compact, {Aunk } has a subsequence converging para ṽ = 6 Au. But then such a
sequence converges weakly to ṽ 6= Au, which contradicts (5.5). The proof is complete.
Theorem 5.3.3 Let H be a separable Hilbert space. Thus each compact operator in L (H) is the limit in
norm of a sequence of finite rank operators.
Proof 5.8 Let A be a compact operator in H. Let {φ j } be an orthonormal basis in H. For each n ∈ N define
It is clear that {λn } is a non-increasing sequence that converges to a limit λ ≥ 0. We will show that λ = 0.
Choose a sequence {ψn } such that
ψn ∈ [φ1 , ..., φn ]⊥ ,
kψn kH = 1 and kAψn kH ≥ λ /2. Now we will show that
ψn * θ , weakly in H.
and
∞
∑ |b j |2 = 1.
j=n+1
Therefore,
∞
|(ψn , ψ ∗ )H | = ∑ (φ j , φ j )H a j · b j
j=n+1
∞
= ∑ aj ·bj
j=n+1
s s
∞ ∞
≤ ∑ |a j |2 ∑ |b j |2
j=n+1 j=n+1
√
≤ ε, (5.6)
Observe that
1. E is linear,
2. E is idempotent, that is E 2 = E,
94 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
3. R(E) = M,
4. N(E) = M ⊥ .
Also observe that from
Ez = x
we have
kEzk2H = kxk2H ≤ kxk2H + kyk2H = kzk2H ,
so that
kEk ≤ 1.
Definition 5.4.2 Let A, B ∈ L (H). We write
A≥θ
if
(Au, u)H ≥ 0, ∀u ∈ H,
and in this case we say that A is positive. Finally, we denote
A≥B
if
A−B ≥ θ.
Theorem 5.4.3 Let A and B be bounded self-adjoint operators such that A ≥ θ and B ≥ θ . If AB = BA then
AB ≥ θ .
Proof 5.9 If A = θ , the result is obvious. Assume A 6= θ and define the sequence
A
A1 = , An+1 = An − A2n , ∀n ∈ N.
kAk
We claim that
θ ≤ An ≤ I, ∀n ∈ N.
We prove the claim by induction.
For n = 1, it is clear that A1 ≥ θ . And since kA1 k = 1, we get
so that
A1 ≤ I.
Thus,
θ ≤ A1 ≤ I.
Now suppose θ ≤ An ≤ I. Since An is self adjoint we have,
where v = An u. Therefore
A2n (I − An ) ≥ θ .
Topics on Linear Operators 95
A1 = A21 + A2
= A21 + A22 + A3
... ...................
= A21 + ... + A2n + An+1 . (5.8)
Hence,
(ABu, u)H ≥ 0, ∀u ∈ H.
The proof is complete.
Theorem 5.4.4 Let {An } be a sequence of self-adjoint commuting operators in L (H). Let B ∈ L (H) be a
self adjoint operator such that
Ai B = BAi , ∀i ∈ N.
Suppose also that
A1 ≤ A2 ≤ A3 ≤ ... ≤ An ≤ ... ≤ B.
Under such hypotheses there exists a self-adjoint, bounded, linear operator A such that
An → A in norm ,
and
A ≤ B.
Cn = B − An ≥ 0, ∀n ∈ N.
and also
Cn (Cm −Cn ) ≥ θ .
Thus,
(Cm2 u, u)H ≥ (CnCm u, u)H ≥ (Cn2 u, u)H ,
Topics on Linear Operators 97
Definition 5.4.5 Let A ∈ L (A) be a positive operator. The self-adjoint operator B ∈ L (H) such that
B2 = A
Theorem 5.4.6 Suppose A ∈ L (H) is positive. Then there exists B ≥ θ such that
B2 = A.
kAk ≤ 1,
so that if
A
C2 =
kAk
then
B = kAk1/2C.
Let
B0 = θ ,
and consider the sequence of operators given by
1
Bn+1 = Bn + (A − B2n ), ∀n ∈ N ∪ {0}.
2
Since each Bn is polynomial in A, we have that Bn is self-adjoint and commute with any operator with
commutes with A. In particular
Bi B j = B j Bi , ∀i, j ∈ N.
First we show that
Bn ≤ I, ∀n ∈ N ∪ {0}.
1
Since B0 = θ , and B1 = 2 A, the statement holds for n = 1. Suppose Bn ≤ I. Thus,
1 1
I − Bn+1 = I − Bn − A + B2n
2 2
1 2 1
= (I − Bn ) + (I − A) ≥ θ (5.13)
2 2
so that
Bn+1 ≤ I.
The induction is complete, that is,
Bn ≤ I, ∀n ∈ N.
Topics on Linear Operators 99
B0 ≤ B1 ,
and supposing
Bn−1 ≤ Bn ,
we have
1 1
Bn+1 − Bn = Bn + (A − B2n ) − Bn−1 − (A − B2n−1 )
2 2
1 2 2
= Bn − Bn−1 − (Bn − Bn−1 )
2
1
= Bn − Bn−1 − (Bn + Bn−1 )(Bn − Bn−1 )
2
1
= (I − (Bn + Bn−1 ))(Bn − Bn−1 )
2
1
= ((I − Bn−1 ) + (I − Bn ))(Bn − Bn−1 ) ≥ θ .
2
The induction is complete, that is
θ = B0 ≤ B1 ≤ B2 ≤ ... ≤ Bn ≤ ... ≤ I.
Bn → B in norm.
Fixing u ∈ H we have
1
Bn+1 u = Bn u + (A − B2n )u,
2
so that taking the limit in norm as n → ∞, we get
θ = (A − B2 )u.
6.1 Introduction
We start by presenting some results about the spectrum and resolvent sets for a bounded operator defined on
a normed space. The main reference for this chapter is Bachman and Narici [6].
Definition 6.1.1 Let V be a complex normed vector space and let A : D ⊂ V → V be a linear operator, where
D is dense on V . We say that A−1 : R(A) → D is the inverse operator related to A, as A is a bijection from D
to R(A) and
A−1 y = u if, and only if, Au = y, ∀u ∈ D, y ∈ R(A),
where R(A) = {Au : u ∈ D}, is the range of A.
In such case we have,
A−1 Au = u, ∀u ∈ D
and
AA−1 y = y, ∀y ∈ R(A).
Let λ ∈ C.
1. If R(λ I − A) is dense in V and λ I − A has a bounded inverse, we write λ ∈ ρ(A), where ρ(A) denotes
the resolvent set of A.
2. If R(λ I − A) is dense in V and (λ I − A)−1 exists but it is not bounded, we write λ ∈ Cσ (A), where
Cσ (A) denotes the continuous spectrum of A.
3. If R(λ I − A) is not dense in V and λ I − A has an inverse either bounded or unbounded, we write
λ ∈ Rσ (A), where Rσ (A) denotes the residual spectrum of A.
4. If (λ I − A)−1 does not exist, we write λ ∈ Pσ (A) where Pσ (A) denotes the point spectrum of A.
In such a case there exists u ∈ V such that
Au − λ u = 0
Observe that, since M is closed, we have d > 0, otherwise if we had d = 0 we had v ∈ M = M, which
contradicts v 6∈ M.
Also,
d/α > d.
Hence, there exists u0 ∈ M such that
0 < d ≤ ku0 − vkV < d/α.
Define
v − u0
uα = .
kv − u0 kV
Thus, kuα kV = 1 and also for u ∈ M we have
v u0
ku − uα kV = u− +
kv − u0 kV kv − u0 kV V
1
= k u(kv − u0 kV ) + u0 − vkV . (6.1)
kv − u0 kV
From this, since ukv − u0 kV + u0 ∈ M, we have
d
ku − uα kV ≥ > α, ∀u ∈ M.
kv − u0 kV
The proof is complete.
102 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 6.1.4 Let V be a complex normed vector space and let A : D ⊂ V → V be a linear compact operator.
Under such hypotheses, Pσ (A) is countable and 0 is its unique possible limit point.
Proof 6.2 Let ε > 0. We shall prove that there exists at most a finite number of points in Pε where
Pε = {λ ∈ Pσ (A) : |λ | ≥ ε}.
Observe that in such a case
Pσ (A) \ {0} = ∪∞
n=1 P1/n
and such a set is countable and has 0 as the unique possible limit point.
Suppose, to obtain contradiction, there exists ε > 0 such that Pε has infinite points. Hence, there exists a
sequence {λn }n∈N ⊂ Pε and a sequence of linearly independent eigenvectors {un } such that
Aun = λn un , ∀n ∈ N.
Define
Mn = Span{u1 , . . . , un },
so that
Mn−1 ⊂ Mn
and Mn is finite dimensional, ∀n ∈ N. Observe that Mn−1 ⊂ Mn properly.
From the Riesz theorem, there exists yn ∈ Mn such that kyn kV = 1 and
kyn − uk ≥ 1/2, ∀u ∈ Mn−1 , ∀n > 1.
Let
n
u = ∑ αi ui ∈ Mn .
i=1
Thus,
n n
Au = ∑ αi Aui = ∑ αi λi ui .
i=1 i=1
Therefore,
(λn − A)u = λn u − Au
n−1
= ∑ αi (λn − λi )ui ∈ Mn−1 . (6.2)
i=1
Therefore,
(λn − A)(Mn ) ⊂ Mn−1
and from this
A(Mn ) ⊂ Mn , ∀n ∈ N.
Let 1 < m < n. Thus,
w = (λn − A)yn + Aym ∈ Mn−1 ,
so that
Ayn − Aym = λn yn − w = λn (yn − λn−1 w).
Since, λn−1 w ∈ Mn−1 we get
|λn | ε
||Ayn − Aym k = |λn |kyn − λn−1 wk ≥ ≥ ,
2 2
∀1 ≤ m < n ∈ N.
Therefore, {yn } is a bounded sequence and such that {Ayn } has no Cauchy subsequence, that is, {Ayn }
has no convergent subsequence, which contradicts A be compact.
The proof is complete.
Spectral Analysis, a General Approach in Normed Spaces 103
Definition 6.1.5 Let V be a normed vector space and let A : D ⊂ V → V be a linear operator, where D is
dense in V.
We say that λ ∈ C is a proper approximate value of A if for each ε > 0 there exists u ∈ D such that
Theorem 6.1.6 Considering the statements of the last definition, we have that λ ∈ π(A) if, and only if, λ I −A
has no a bounded inverse.
Proof 6.3 Suppose λ ∈ π(A). Thus, for each n ∈ N, there exists un ∈ D such that kun kV = 1 and
In particular we have
k(λ I − A)un kV ≥ 1, ∀n ∈ N,
which contradicts (6.3).
Reciprocally, suppose λ I − A has no bounded inverse.
Thus, there is no K > 0 such that
Therefore λ ∈ π(A).
Remark 6.2.2 Let H be a complex Hilbert space and let A : H → H be a linear operator. Hence f : H × H →
C defined by
f (u, v) = (Au, v)H , ∀u, v ∈ H
is a sequilinear functional.
104 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Defining
B = {K > 0 such that (6.6) is satisfied }
we also define the norm of f , denoted by k f k as
k f k = inf{K : K ∈ B}.
Moreover, defining
C = {K > 0 such that | fˆ(u)| ≤ KkukV2 , ∀u ∈ V },
we define also the norm of fˆ, denoted by k fˆk, as
and
k fˆk = sup{| fˆ(u)| : kukV = 1}.
u∈V
Observe that
| f (u, v)| ≤ k f kkukV kvkV , ∀u, v ∈ V,
so that
α ≤ k f k. (6.7)
On the other hand,
kukV kvkV
| f (u, v)| = f u ,v
kukV kvkV
u v
= f , kukV kvkV
kukV kvkV
≤ α kukV kvkV , ∀u =
6 0, v 6= 0. (6.8)
Hence α ∈ B so that
α ≥ inf B = k f k. (6.9)
From (6.7) and (6.9) we may infer that
α = k f k.
Similarly, the second result may be proven.
The proof is complete.
106 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 6.2.9 Let V be a complex normed vector space and let F : V ×V → C be a sesquilinear, bounded
and symmetric functional. Under such hypotheses,
k f k = k fˆk.
1 1
|Re[ f (u, v)]| ≤ fˆ (u + v) + fˆ (u − v)
2 2
1 ˆ 1
≤ k f kku + vkV2 + k fˆkku − vkV2
4 4
1 ˆ 2
k f k 2kukV + 2kvkV2 , ∀u, v ∈ V.
= (6.11)
4
Thus, if kukV = kvkV = 1, we get
|Re[ f (u, v)]| ≤ k fˆk.
Observe that in its polar form, we have
f (u, v) = reiθ .
Thus,
k f k = sup{| f (u, v)| : kukV = kvkV = 1} ≤ k fˆk.
However, from the definitions, k f k ≥ k fˆk.
From these last two lines, we may infer that
k f k = k fˆk.
Definition 6.2.10 (Normal operator) Let H be a complex Hilbert space. We say that a bounded linear op-
erator A : H → H is normal as
A∗ A = AA∗ .
Theorem 6.2.11 Let H be a complex Hilbert space. and let A : H → H be a bounded linear operator. Under
such hypotheses, A is normal if, and only if,
Theorem 6.2.12 Let H be a complex Hilbert space and let A ∈ L(H). Under such hypotheses the following
proprieties are equivalent.
1. There exists λ ∈ π(A) such that |λ | = kAk.
2.
kAk = sup{|(Au, u)H | : kukH = 1}.
u∈H
Proof 6.8
1 implies 2: Suppose λ ∈ π(A) is such that
|λ | = kAk.
Thus,
|(Aun , un )H − λ | = |(Aun , un )H − λ (un , un )H |
= |(Aun − λ un , un )H |
≤ kAun − λ un kH kun kH
→ 0, as n → ∞. (6.13)
Thus,
λ ∈ {(Au, u)H : u ∈ H, kukH = 1}.
Thus {(Aun , un )H } ⊂ C is a bounded sequence. From this there exists a subsequence {(Aunk , unk )H }
of {(Aun , un )H } and λ ∈ C such that
(Aunk , unk )H → λ , as k → ∞.
Therefore,
kAunk − λ unk k2H = kAunk k2H − λ (Aunk , unk )H − λ (unk , Aunk )H + |λ |2
≤ kAk2 kunk k2H − λ (Aunk , unk )H − λ (Aunk , unk )H + |λ |2
→ |λ |2 − λ λ − λ λ + |λ |2 = 0. (6.14)
Summarizing,
kAunk − λ unk kH → 0, as k → ∞,
so that λ ∈ π(A) and |λ | = kAk.
The proof is complete.
Theorem 6.2.13 Let H be a Hilbert space and let A ∈ L(H) be a self-adjoint operator.
Define
M = sup{(Au, u)H : kukH = 1}
u∈H
and
m = inf {(Au, u)H : kukH = 1}.
u∈H
Under such hypotheses, m ∈ σ (A) and M ∈ σ (A).
Spectral Analysis, a General Approach in Normed Spaces 109
kÂk = M̂.
Thus,
ˆ n − M̂un , un )H → 0, as n → ∞.
(Au
Hence,
ˆ n − M̂un k2H
kAu ˆ n − M̂un , Au
= (Au ˆ n − M̂ un )H
ˆ n k2H − 2M̂(Au
= kAu ˆ n , un )H + M̂ 2
2 2 ˆ n , un )H + M̂ 2
≤ kÂkH kun kH − 2M̂(Au
→ M̂ 2 − 2M̂ 2 + M̂ 2
= 0. (6.15)
Summarizing,
ˆ n − M̂un kH → 0, as n → ∞,
kAu
so that
kAun − Mun kH → 0, as n → ∞.
From this we may infer that
M ∈ π(A) ⊂ σ (A).
Similarly, select β ∈ R such that
−m + β ≥ −M + β > 0.
Define  = −A + β I. The remaining parts of the proof are similar to those of the previous case.
This completes the proof.
Theorem 6.2.14 Let H be a complex Hilbert space. Let U : H → H be a linear bounded operator.
Under such hypotheses, U is a isometry if, and only if,
U ∗U = I,
Theorem 6.2.15 Let H be a complex Hilbert space. Under such hypothesis, U : H → H is a bijective isometry
if, and only if,
U ∗U = UU ∗ = I, in H.
Theorem 6.2.16 Let H be a complex Hilbert space. Suppose U : H → H is a linear operator such that
kUukH = kukH , ∀u ∈ H
Thus,
(U ∗Uu, u)H = (u, u)H , ∀u ∈ H,
so that
((U ∗U − I)u, u)H = 0, ∀u ∈ H.
Since U ∗U − I is self adjoint, it follows that
so that U ∗U = I.
From this and Theorem 6.2.14, we have that U is a isometry.
|λ | = 1.
Theorem 6.2.18 Let H be a complex Hilbert space. Let A : DA ⊂ H → H be a linear self-adjoint operator
but not necessarily bounded, where DA is dense in H, that is, DA = H. Let U be the Cayley transform of A,
that is U = (A − i)(A + i)−1 .
Under such hypotheses, U is unitary.
Spectral Analysis, a General Approach in Normed Spaces 111
Proof 6.14 First, we shall prove that A ± i is injective, so that its inverse is well defined on R(A ± i).
Since A is self-adjoint, we have that
From this
(Au, z)H = ((u, iz), ∀u ∈ H,
so that
A∗ z = Az = iz,
that is,
(A − i)z = 0.
Thus, z = 0.
Summarizing these last results, if z ⊥ R(A + i) then z = 0, so that
R(A + i) = H.
Now we are going to show that R(A + i) = R(A + i) = H. Let v ∈ H. Thus there exists a sequence {vn } ⊂
R(A + i) such that
vn → v, in norm, as n → ∞.
Therefore there exists a sequence {un } ⊂ DA such that
Aun + iun = vn → v, as n → ∞.
kvn − vm k2H = kAun + iun − Aum − ium k2H = kA(un − um )k2H + kun − um kH
2
, ∀m, n ∈ N.
From this, since {vn } is a Cauchy sequence, we may infer that {un } and {Aun } are Cauchy sequences, so
that there exists u ∈ H, and w ∈ H such that
Aun → w
and
un → u, as n → ∞.
Since A = A∗ is closed (see Theorem 6.6.7 for details), we may infer that w = Au and
since A is closed we get (u, Au) ∈ Gr(A) where Gr(A) denotes the graph of A, so that Au + iu = v ∈ R(A + i).
Summarizing, if v ∈ H then v ∈ R(A + i), so that R(A + i) = H = R(A + i).
112 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
R(A − i) = H.
Observe that
U = (A − i)(A + i)−1
and
R((A + i)−1 ) = DA = D(A+i) ,
and R(A − i) = H.
Thus R(U) = H, that is U : H → H is linear and onto (recalling that D(A+i)−1 = H).
At this point, we shall prove that U is unitary.
Let v ∈ H. Since R(A + i) = H, there exists u ∈ DA such that
(A + i)u = v.
Hence
Uv = (A − i)(A + i)−1 v = (A − i)u,
so that
Summarizing,
kUvkH = kvkH , ∀v ∈ H,
so that U is unitary.
The proof is complete.
Remark 6.2.19 Let v ∈ H. Since R(A + i) = H, we may obtain u ∈ H such that v = (A + i)u.
From this we have
Uv = (A − i)(A + i)−1 v = (A − i)u,
so that
(I +U)v = 2Au
and
(I −U)v = 2iu.
Thus, if (I −U)v = 0, then u = 0 so that v = (A + i)u = 0.
Therefore, I −U is injective and its inverse exists on R(I −U).
Moreover, for u ∈ R(I −U) as above, we have,
so that
Au = i(I +U)(I −U )−1 u, ∀u ∈ R(I −U).
In the next lines we shall show that R(I −U) = DA .
Indeed, let v ∈ H. Thus, from the last lines above, there exists u ∈ DA such that
(I −U)v = 2iu
Spectral Analysis, a General Approach in Normed Spaces 113
in DA .
Rλ (A) = (λ I − A)−1
the resolvent of A in λ .
If λ 6∈ ρ(A), we write
λ ∈ σ (A) = C − ρ(A),
where σ (A) is said to be the spectrum of A.
Theorem 6.3.3 Let U be a Banach space and suppose that A ∈ L (U). Then ρ(A) is an open subset of C
and
F(λ ) = Rλ (A)
is an analytic function with values in L (U) on each connected component of ρ(A). For λ , µ ∈ σ (A), Rλ (A)
and Rµ (A) commute and
Rλ (A) − Rµ (A) = (µ − λ )Rµ (A)Rλ (A).
114 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 6.15 Let λ0 ∈ ρ(A). We will show that λ0 is an interior point of ρ(A).
Observe that symbolically we may write
1 1
=
λ −A λ − λ0 + (λ0 − A)
1 1
=
λ0 − A 1 − λ0 −λ
λ0 −A
∞ n !
1 λ0 − λ
= 1+ ∑ . (6.18)
λ0 − A n=1 λ0 − A
Define,
∞
R̂λ (A) = Rλ0 (A){I + ∑ (λ − λ0 )n (Rλ0 )n }. (6.19)
n=1
Observe that
k(Rλ0 )n k ≤ kRλ0 kn .
Thus, the series indicated in (6.19) will converge in norm if
Hence, for λ satisfying (6.20), R̂(A) is well defined and we can easily check that
Therefore
R̂λ (A) = Rλ (A), if |λ − λ0 | < kRλ0 k−1 ,
so that λ0 is an interior point. Since λ0 ∈ ρ(A) is arbitrary, we have that ρ(A) is open. Finally, observe that
Corollary 6.3.4 Let U be a Banach space and A ∈ L (U). Then the spectrum of A is non-empty.
Rλ (A) = θ , ∀λ ∈ C,
which is a contradiction.
Proposition 6.3.5 Let H be a Hilbert space and A ∈ L (H).
1. If λ ∈ Res[σ (A)] then λ ∈ Pσ (A∗ ).
2. If λ ∈ Pσ (A) then λ ∈ Pσ (A∗ ) ∪ Res[σ (A∗ )].
Proof 6.17
1. If λ ∈ Res[σ (A)] then
R(A − λ I) =
6 H.
Therefore there exists v ∈ (R(A − λ I))⊥ , 6 θ such that
v=
(v, (A − λ I)u)H = 0, ∀u ∈ H
that is
((A∗ − λ I)v, u)H = 0, ∀u ∈ H
so that
(A∗ − λ I)v = θ ,
which means that λ ∈ Pσ (A∗ ).
2. Suppose there exists v 6= θ such that
(A − λ I)v = θ ,
and
λ 6∈ Pσ (A∗ ).
Thus,
(u, (A − λ I)v))H = 0, ∀u ∈ H,
so that
((A∗ − λ I)u, v)H = 0, ∀u ∈ H.
Since
(A∗ − λ I)u =
6 θ , ∀u ∈ H, u =
6 θ,
we get v ∈ (R(A∗ − λ I))⊥ , so that R(A∗ − λ I) =
6 H.
Hence, λ ∈ Res[σ (A∗ )].
116 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
k(A − (λ + µi))uk2 ≥ µ 2 kuk2 .
6 0, A − (λ + µi) has a bounded inverse on its range, which is closed. If R(A − (λ + µi)) =
Therefore, if µ = 6 H
then by the last result (λ − µi) would be in the point spectrum of A, which contradicts the last inequality.
6 0 then λ + µi ∈ ρ(A). To complete the proof, suppose
Hence, if µ =
Au1 = λ1 u1 ,
and
Au2 = λ2 u2 ,
where
λ1 , λ2 ∈ R, λ1 =
6 λ2 and u1 , u2 =
6 θ.
Thus,
Since λ1 − λ2 =
6 0 we get
(u1 , u2 )H = 0.
kAun − λ un kH → 0, as n → ∞.
Solution: Suppose λ ∈ C is such that there exists {un } ⊂ H such that kun k = 1, ∀n ∈ N and
kAun − λ un kH → 0, as n → ∞. (6.25)
Suppose, to obtain contradiction, that λ ∈ ρ(A). Thus, (A − λ I)−1 exists and it is bounded, so that there
exists K > 0 such that
kAu − λ IukH ≥ KkukH , ∀u ∈ H.
From this we obtain
kAun − λ un kH ≥ K, ∀n ∈ N
which contradicts (6.25).
Spectral Analysis, a General Approach in Normed Spaces 117
Thus, (A − λ I)−1 exists and it is bounded. Since λ ∈ σ (A), we must have that R(A − λ I) is not dense, so
that λ ∈ Res[σ (A)].
From Proposition 6.3.5, we have λ ∈ Pσ (A∗ ).
Since A = A∗ , from this we obtain λ = λ ∈ P(σ (A)) which contradicts (A − Iλ )−1 to exist.
Thus, we may infer that it does not exist K > 0 such that (6.26) holds.
From this, for each n ∈ N there exists un ∈ H such that
kun kH = 1
and
kAun − λ un kH < 1/n
so that
kAun − λ un kH → 0.
The solution is complete.
and
M = sup{(Au, u)H | kukH = 1}.
u∈H
Remark 6.4.1 It is possible to prove that for a linear self-adjoint operator A : H → H we have
This propriety, which prove in the next lines, is crucial for the subsequent results, since for example for A, B
linear and self-adjoint and ε > 0 we have
−εI ≤ A − B ≤ εI,
and
M = sup{(Au, u)H | kukH = 1}.
u∈H
Then,
kAk = α.
and
(A(u − v), u − v)H = (Au, u)H + (Av, v)H − 2(Au, v)H .
Thus,
4(Au, v) = (A(u + v), u + v)H − (A(u − v), u − v)H ≤ Mku + vkU2 − mku − vkU2 ,
so that
4(Au, v)H ≤ α(ku + vkU2 + ku − vkU2 ).
Hence, replacing v by −v we obtain
and therefore
4|(Au, v)H | ≤ α(ku + vkU2 + ku − vkU2 ).
Replacing v by β v, we get
β = kukU /kvkU ,
we obtain
|(Au, v)H | ≤ αkukU kvkU , ∀u, v ∈ U.
Thus
kAk ≤ α.
On the other hand,
|(Au, u)H | ≤ kAkkukU2 ,
so that
|M| ≤ kAk
and
|m| ≤ kAk,
so that
α ≤ kAk.
The proof is complete.
At this point we start to develop the spectral theory. Define by P the set of all real polynomials defined in
R. Define
Φ1 : P → L (H),
by
Φ1 (p(λ )) = p(A), ∀p ∈ P.
Thus we have
1. Φ1 (p1 + p2 ) = p1 (A) + p2 (A),
2. Φ1 (p1 · p2 ) = p1 (A)p2 (A),
3. Φ1 (α p) = α p(A), ∀α ∈ R, p ∈ P
4. if p(λ ) ≥ 0, on [m, M], then p(A) ≥ θ ,
We will prove (4):
Consider p ∈ P. Denote the real roots of p(λ ) less or equal to m by α1 , α2 , ..., αn and denote those that
are greater or equal to M by β1 , β2 , ..., βl . Finally denote all the remaining roots, real or complex by
Observe that if µi = 0 then vi ∈ (m, M). The assumption that p(λ ) ≥ 0 on [m, M] implies that any real root in
(m, M) must be of even multiplicity.
Since complex roots must occur in conjugate pairs, we have the following representation for p(λ ) :
n l k
p(λ ) = a ∏(λ − αi ) ∏(βi − λ ) ∏((λ − vi )2 + µi2 ),
i=1 i=1 i=1
gn ↓ f , pointwise ,
that is,
gn (λ ) ↓ f (λ ), ∀λ ∈ R.
Considering the Weierstrass theorem, since gn ∈ C([m, M]) we may obtain a sequence of polynomials {pn }
such that
1 1
gn + n − pn < n ,
2 ∞ 2
120 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
pn (λ ) ↓ f (λ ), on [m, M].
Therefore,
p1 (A) ≥ p2 (A) ≥ p3 (A) ≥ ... ≥ pn (A) ≥ ...
Since pn (A) is self-adjoint for all n ∈ N, we have
pn ↓ f , and qn ↓ f ,
First, observe that being {pn } and {qn } sequences of continuous functions we have that
hk (λ ) = max{pk (λ ), qn (λ )}.
observe that
hk (λ ) ↓ qn (λ ), ∀λ ∈ R,
so that by Dini’s theorem
hk → qn , uniformly on [m, M].
It follows that for each n ∈ N there exists kn ∈ N such that if k > kn then
1
hk (λ ) − qn (λ ) ≤ , ∀λ ∈ [m, M].
n
Since
pk (λ ) ≤ hk (λ ), ∀λ ∈ [m, M],
we obtain
1
pk (λ ) − qn (λ ) ≤ , ∀λ ∈ [m, M].
n
By analogy, we may show that for each n ∈ N there exists k̂n ∈ N such that if k > k̂n then
1
qk (λ ) − pn (λ ) ≤ .
n
Spectral Analysis, a General Approach in Normed Spaces 121
lim pk (A)
k→∞
we obtain
1
lim pk (A) ≤ lim qn (A) +
k→∞ n→∞ n
≤ lim qn (A). (6.27)
n→∞
so that
lim qn (A) = lim pn (A) = f (A).
n→∞ n→∞
Hence, we may extend Φ1 : P → L (H) to Φ2 : Cup → L (H) where Cup as earlier indicated, denotes the set
of upper semi-continuous functions, where
Φ2 ( f ) = f (A).
h = f1 − g1 and h = f2 − g2 .
Thus,
f1 − g1 = f2 − g2 ,
that is,
f1 + g2 = f2 + g1 ,
so that from the definition of Φ2 we obtain
that is,
f1 (A) − g1 (A) = f2 (A) − g2 (A).
Therefore, Φ3 is well defined. Finally, observe that for α < 0
α( f − g) = −αg − (−α) f ,
Φ3 (α f ) = α f (A) = αΦ3 ( f ), ∀α ∈ R.
hν (λ )hµ (λ ) = hµ (λ )hν (λ ) = hµ (λ ),
so that
E(ν)E(µ) = E(µ)E(ν) = E(µ), ∀ν ≥ µ.
If µ < m, then hµ (λ ) = 0, on [m, M], so that
E(µ) = 0, if µ < m.
E(µ) = I, if µ ≥ M.
Next we show that the family {E(µ)} is strongly continuous from the right. First we will establish a sequence
of polynomials {pn } such that
pn ↓ hµ ,
and
pn (λ ) ≥ hµ + 1 (λ ), on [m, M].
n
Observe that for any fixed n there exists a sequence of polynomials {pnj } such that
Thus,
gn (λ ) ≥ hµ + 1 (λ ), ∀λ ∈ R,
n
and we obtain
lim gn (λ ) ≥ lim hµ + 1 (λ ) = hµ (λ ).
n→∞ n→∞ n
Therefore,
Thus,
lim gn (λ ) = hµ (λ ).
n→∞
Observe that gn are not necessarily polynomials. To set a sequence of polynomials, observe that we may
obtain a sequence {pn } of polynomials such that
1
|gn (λ ) + 1/n − pn (λ )| < , ∀λ ∈ [m, M], n ∈ N.
2n
so that
pn (λ ) ≥ gn (λ ) + 1/n − 1/2n ≥ gn (λ ) ≥ hµ+1/n (λ ).
Thus,
pn (A) → E(µ),
and
pn (A) ≥ hµ + 1 (A) = E(µ + 1/n) ≥ E (µ).
n
Thus,
lim E(µ + 1/n) = E(µ).
n→∞
From this we may easily obtain the strong continuity from the right.
For µ ≤ ν we have
µ(hν (λ ) − hµ (λ )) ≤ λ (hν (λ ) − hµ (λ ))
≤ ν(hν (λ ) − hµ (λ )). (6.30)
To verify this observe that if λ < µ or λ > ν then all terms involved in the above inequalities are zero. On
the other hand if
µ ≤λ ≤ν
then
hν (λ ) − hµ (λ ) = 1,
124 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that in any case (6.30) holds. From the monotonicity property we have
µ(E(ν) − E(µ)) ≤ A(E(ν) − E(µ))
≤ ν(E(ν) − E(µ)). (6.31)
Now choose a, b ∈ R such that
a < m and b ≥ M.
Suppose given ε > 0. Choose a partition P0 of [a, b], that is
P0 = {a = λ0 , λ1 , ..., λn = b},
such that
max {|λk − λk−1 |} < ε.
k∈{1,...,n}
Hence,
λk−1 (E(λk ) − E(λk−1 )) ≤ A(E(λk ) − E(λk−1 ))
≤ λk (E(λk ) − E(λk−1 )). (6.32)
Summing up on k and recalling that
n
∑ E(λk ) − E(λk−1 ) = I,
k=1
we obtain
n
∑ λk−1 (E(λk ) − E(λk−1 )) ≤ A
k=1
n
≤ ∑ λk (E(λk ) − E(λk−1 )). (6.33)
k=1
Let λk0 ∈ [λk−1 , λk ]. Since (λk − λk0 ) ≤ (λk − λk−1 ) from (6.32) we obtain
n n
A − ∑ λk0 (E(λk ) − E(λk−1 )) ≤ ε ∑ (E(λk ) − E (λk−1 ))
k=1 k=1
= εI. (6.34)
By analogy
n
−εI ≤ A − ∑ λk0 (E(λk ) − E(λk−1 )). (6.35)
k=1
Since
n
A − ∑ λk0 (E(λk ) − E(λk−1 ))
k=1
is self-adjoint we obtain
n
kA − ∑ λk0 (E(λk ) − E (λk−1 ))k < ε.
k=1
Being ε > 0 arbitrary, we may write
Z b
A= λ dE(λ ),
a
that is, Z M
A= λ dE(λ ).
m−
Spectral Analysis, a General Approach in Normed Spaces 125
Let H be a complex Hilbert space and let A ∈ L(H), where A is a self-adjoint operator.
Suppose f ∈ C([m, M]) where
and
M = sup{(Au, u)H : kukH = 1}.
u∈H
Let ε > 0. Since f is uniformly continuous on the compact set [m, M], there exists δ > 0 such that if
x, y ∈ [m, M] and |x − y| < δ , then
| f (x) − f (y)| < ε. (6.37)
Let P = {λ0 = m, λ1 , . . . , λn = M} be a partition of [m, M], such that kPk = max{λk − λk−1 : k ∈
{1, . . . , n}} < δ .
Choose
λk0 ∈ (λk−1 , λk ), ∀k ∈ {1, . . . , n}
and observe that
1, if λk−1 < λ ≤ λk
hλk (λ ) − hλk−1 (λ ) = (6.38)
0, otherwise.
From this and (6.37), we may obtain
n
f (λ ) − ∑ f (λk0 )[hλk (λ ) − hλk−1 (λ )] < ε, ∀λ ∈ [m, M].
k=1
Since ε > 0, the partition P and {λk0 } have been arbitrary, we may denote
Z M
f (A) = f (λ )dE(λ ).
m−
where {E(φ )} is a spectral family on [0, 2π]. Furthermore E(φ ) is continuous at 0 and it is the limit of
polynomials in U and U −1 .
Also if
p(eiφ ) ≥ 0
there exists a polynomial q such that
so that
p(U) = [q(U)]∗ q(U).
Therefore
(p(U)v, v)H = (q(U)∗ q(U)v, v)H = (q(U)v, q(U)v)H ≥ 0, ∀v ∈ H,
which means
p(U) ≥ 0.
Define the function hµ (φ ) by
1, if 2kπ < φ ≤ 2kπ + µ,
hµ (φ ) = (6.39)
0, if 2kπ + µ < φ ≤ 2(k + 1)π,
for each k ∈ {0, ±1, ±2, ±3, ...}. Define E(µ) = hµ (U). Observe that the family {E(µ)} are projections and
in particular
E(0) = 0,
E(2π) = I
and if µ ≤ ν, since
hµ (φ ) ≤ hν (φ ),
Spectral Analysis, a General Approach in Normed Spaces 127
we have
E(µ) ≤ E(ν).
Suppose given ε > 0. Let P0 be a partition of [0, 2π] that is,
P0 = {0 = φ0 , φ1 , ..., φn = 2π}
such that
max {|φ j − φ j−1 |} < ε.
j∈{1,...,n}
φ ∈ [φ j−1 , φ j ].
n
|eiφ − ∑ eiφk (hφk (φ ) − hφk−1 (φ ))| = |eiφ − eiφ j |
k=1
≤ |φ − φ j | < ε. (6.40)
Thus,
n
0 ≤ |eiφ − ∑ eiφk (hφk (φ ) − hφk−1 (φ ))|2 ≤ ε 2
k=1
so that, for the corresponding operators
n n
0 ≤ [U − ∑ eiφk (E (φk ) − E (φk−1 )]∗ [U − ∑ eiφk (E(φk ) − E(φk−1 )]
k=1 k=1
2
≤ ε I (6.41)
and hence
n
kU − ∑ eiφk (E(φk ) − E(φk−1 )k < ε.
k=1
Being ε > 0 arbitrary, we may infer that
Z 2π
U= eiφ dE(φ ).
0
A2 u = A1 u, ∀u ∈ D1 .
Definition 6.6.4 A linear operator A : D → H is said to be closable if it has a linear closed extension. The
smallest closed extension of A is denote by A and is called the closure of A.
Γ(A) = Γ(A).
and C(ψ) = φ , where φ is the unique point such that (ψ, φ ) ∈ Γ(A). Hence
so that
A ⊂ C.
However, C ⊂ B and since B is an arbitrary closed extension of A we have
C=A
so that
Γ(C) = Γ(A) = Γ(A).
Observe that by the Riesz lemma, φ ∈ D(A∗ ) if and only if there exists K > 0 such that
|(Aψ, φ )H | ≤ KkψkH , ∀ψ ∈ D.
A = A∗∗ .
Proof 6.21
1. We define the operator V : H × H → H × H by
V (φ , ψ) = (−ψ, φ ).
V (φ , ψ) = (−ψ, φ ) = (φ1 , ψ1 ).
Hence
ψ = −φ1 and φ = ψ1 ,
so that for (ψ1 , −φ1 ) ∈ E⊥ and (w1 , w2 ) ∈ E we have
Thus,
(φ1 , −w2 )H + (ψ1 , w1 )H = 0,
and therefore
((φ1 , ψ1 ), (−w2 , w1 ))H×H = 0,
that is
((φ1 , ψ1 ),V (w1 , w2 ))H ×H = 0, ∀(w1 , w2 ) ∈ E.
This means that
(φ1 , ψ1 ) ∈ (V (E))⊥ ,
so that
V (E ⊥ ) ⊂ (V (E))⊥ .
It is easily verified that the implications from which the last inclusion results are in fact equivalences,
so that
V (E ⊥ ) = (V (E))⊥ .
Γ(A) = [Γ(A)⊥ ]⊥
= V 2 [Γ(A)⊥ ]⊥
= [V [V (Γ(A))⊥ ]]⊥
= [V (Γ(A∗ )]⊥ (6.42)
Γ(A) = Γ[(A∗ )∗ ].
Conversely, suppose D(A∗ ) is not dense. Thus there exists ψ ∈ [D(A∗ )]⊥ such that ψ 6= θ . Let
(φ , A∗ φ ) ∈ Γ(A∗ ). Hence
((ψ, θ ), (φ , A∗ φ ))H ×H = (ψ, φ )H = 0,
so that
(ψ, θ ) ∈ [Γ(A∗ )]⊥ .
Therefore, V [Γ(A∗ )]⊥ is not the graph of a linear operator. Since Γ(A) = V [Γ(A∗ )]⊥ A is not closable.
3. Observe that if A is closable then
Definition 6.7.3 Let A : D → H be a symmetric operator. We say that A is essentially self-adjoint if its closure
A is self-adjoint. If A is closed, a subset E ⊂ D is said to be a core for A if A|E = A.
Theorem 6.7.4 Let A : D → H be a symmetric operator. Then the following statements are equivalent
1. A is self-adjoint.
2. A is closed and N(A∗ ± iI) = {θ }.
3. R(A ± iI) = H.
Proof 6.22
1 implies 2:
Suppose A is self-adjoint let φ ∈ D = D(A∗ ) be such that
Aφ = iφ
Spectral Analysis, a General Approach in Normed Spaces 131
so that
A∗ φ = iφ .
Observe that
i(φ , φ )H = (iφ , φ )H
= (Aφ , φ )H
= (φ , Aφ )H
= (φ , iφ )H
= −i(φ , φ )H , (6.43)
N(A − iI) = {θ }.
Similarly, we prove that N(A + iI) = {θ }. Finally, since A∗ = A∗ = A, we get that A = A∗ is closed.
2 implies 3:
Suppose 2 holds. Thus the equation
A∗ φ = −iφ
has no non trivial solution. We will prove that R(A − iI) is dense in H. If ψ ∈ R(A − iI)⊥ then
R(A − iI) = H.
Given φ ∈ D we have
k(A − iI)φ k2H = kAφ kH
2 2
+ kφ kH . (6.44)
Let ψ0 ∈ H be a limit point of R(A − iI). Thus we may find {φn } ⊂ D such that
(A − iI)φn → ψ0 .
From (6.44)
kφn − φm kH ≤ k(A − iI)(φn − φm )kH , ∀m, n ∈ N
so that {φn } is a Cauchy sequence, therefore converging to some φ0 ∈ H. Also from (6.44)
so that {Aφn } is a Cauchy sequence, hence also a converging one. Since A is closed, we get φ0 ∈ D
and
(A − iI)φ0 = ψ0 .
Therefore, R(A − iI) is closed, so that
R(A − iI) = H.
Similarly,
R(A + iI) = H.
132 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
(A∗ − iI)(φ − η) = θ .
Since R(A + iI) = H we have N(A∗ − iI) = {θ }. Therefore, φ = η, so that D(A∗ ) = D. The proof is
complete.
The operator U is called the Cayley transform of A. We have already proven that
Z 2π
U= eiφ dF(φ ),
0
where {F(φ )} is a monotone family of orthogonal projections, strongly continuous from the right and we
may consider it such that
0, if φ ≤ 0,
F(φ ) = (6.45)
I, if φ ≥ 2π.
Therefore,
F(φ )v = F(φ )[(F(2π) − F(2π − ))u],
so that
0, if φ < 2π,
F(φ )v = (6.46)
v, if φ ≥ 2π.
Spectral Analysis, a General Approach in Normed Spaces 133
Observe that Z 2π
U −I = (eiφ − 1)dF(φ ),
0
and Z 2π
∗
U −I = (e−iφ − 1)dF(φ ).
0
Let {φn } be a partition of [0, 2π]. From the monotonicity of [0, 2π] and pairwise orthogonality of
{F(φn ) − F(φn−1 )}
The last two equalities results from e2πi − 1 = 0 and dF(φ )v = θ on [0, 2π). Since v 6= θ , the last equation
implies that 1 ∈ Pσ (U), which contradicts the existence of
(I −U)−1 .
this implies that the range of Tn is invariant under U and A. Observe that
∑ Tn = ∑(F(φn ) − F(φn−1 ))
n n
= lim F(φ ) − lim F(φ )
φ →2π φ →0
= I − θ = I. (6.48)
Hence,
∑ R(Tn ) = H.
n
134 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
Z 2π
(I −U)Tn u = (1 − eiφ )dF(φ )Tn u
0
Z φn
= (1 − eiφ )dF(φ )u. (6.50)
φn−1
Therefore,
Z φn
(1 − eiφ )−1 dF(φ )(I −U)Tn u
φn−1
Z φn Z φn
= (1 − eiφ )−1 dF(φ ) (1 − eiφ )dF(φ )u
φn−1 φn−1
Z φn
= (1 − eiφ )−1 (1 − eiφ )dF(φ )u
φn−1
Z φn
= dF(φ )u
φn−1
Z 2π
= dF(φ )Tn u = Tn u. (6.51)
0
Hence,
−1 Z φn
(1 − eiφ )−1 dF(φ ).
(I −U)|R(Tn ) =
φn−1
Therefore, defining
φ
λ = −cot ,
2
and
E(λ ) = F(−2cot −1 λ ),
we get
iφ iφ −1 φ
i(1 + e )(1 − e ) = −cot = λ.
2
Hence, Z n
ATn u = λ dE(λ )u.
n−1
Finally, from
∞
u= ∑ Tn u,
n=−∞
Spectral Analysis, a General Approach in Normed Spaces 135
we can obtain
!
∞
Au = A ∑ Tn u
n=−∞
∞
= ∑ ATn u
n=−∞
∞ Z n
= ∑ λ dE(λ )u. (6.52)
n=−∞ n−1
The main references for this chapter are Rudin [68], Royden [69] and Stein and Shakarchi [72], where more
details may be found. All these three books are excellent and we strongly recommend their reading.
Definition 7.1.2 (Measurable spaces) If M is a σ -algebra in U we say that U is a measurable space. The
elements of M are called the measurable sets of U.
Definition 7.1.3 (Measurable function) If U is a measurable space and V is a topological space, we say
that f : U → V is a measurable function if f −1 (V ) is measurable whenever V ⊂ V is an open set.
Remark 7.1.4 1. Observe that 0/ = U \U so that from 1 and 2 in Definition 7.1.1, we have that 0/ ∈ M .
2. From 1 and 3 from Definition 7.1.1, it is clear that ∪ni=1 Ai ∈ M whenever Ai ∈ M , ∀i ∈ {1, ..., n}.
∞ A = (∪∞ Ac )c also from Definition 7.1.1, it is clear that M is closed under countable
3. Since ∩i= 1 i i=1 i
intersections.
4. Since A \ B = Bc ∩ A we obtain: if A, B ∈ M then A \ B ∈ M .
Theorem 7.1.5 Let F be any collection of subsets of U. Then there exists a smallest σ -algebra M0 in U
such that F ⊂ M0 .
Basic Results on Measure and Integration 137
Proof 7.1 Let Ω be the family of all σ -Algebras that contain F . Since the set of all subsets in U is a
σ -algebra, Ω is non-empty.
Let M0 = ∩Mλ ∈Ω Mλ , it is clear that M0 ⊃ F , it remains to prove that in fact M0 is a σ -algebra. Observe
that:
1. U ∈ Mλ , ∀Mλ ∈ Ω, so that, U ∈ M0 ,
2. A ∈ M0 implies A ∈ Mλ , ∀Mλ ∈ Ω, so that Ac ∈ Mλ , ∀Mλ ∈ Ω, which means Ac ∈ M0 ,
3. {An } ⊂ M0 implies {An } ⊂ Mλ , ∀Mλ ∈ Ω, so that ∪∞
n=1 An ∈ Mλ , ∀Mλ ∈ Ω, which means ∪n=1 An ∈
∞
M0 .
From Definition 7.1.1 the proof is complete.
Definition 7.1.6 (Borel sets) Let U be a topological space, considering the last theorem there exists a small-
est σ -algebra in U, denoted by B, which contains the open sets of U. The elements of B are called the Borel
sets.
Theorem 7.1.7 Suppose M is a σ -algebra in U and V is a topological space. For f : U → V , we have:
1. If Ω = {E ⊂ V | f −1 (E) ∈ M }, then Ω is a σ -algebra.
2. If V = [−∞, ∞], and f −1 ((α, ∞]) ∈ M , for each α ∈ R, then f is measurable.
Proof 7.2
1. (a) V ∈ Ω since f −1 (V ) = U and U ∈ M .
(b) E ∈ Ω ⇒ f −1 (E) ∈ M ⇒ U \ f −1 (E) ∈ M ⇒ f −1 (V \ E) ∈ M ⇒ V \ E ∈ Ω.
(c) {Ei } ⊂ Ω ⇒ f −1 (Ei ) ∈ M , ∀i ∈ N ⇒ ∪∞
i=1 f
−1 (E ) ∈ M ⇒ f −1 (∪∞ E ) ∈ M ⇒ ∪∞ E ∈ Ω.
i i=1 i i=1 i
Thus Ω is a σ -algebra.
2. Define Ω = {E ⊂ [−∞, ∞] | f −1 (E) ∈ M } from above Ω is a σ - algebra. Given α ∈ R, let {αn } be a
real sequence such that αn → α as n → ∞, αn < α, ∀n ∈ N. Since (αn , ∞] ∈ Ω for each n and
C
[−∞, α ) = ∪∞ ∞
n=1 [−∞, αn ] = ∪n=1 (αn , ∞] , (7.1)
we obtain, [−∞, α) ∈ Ω. Furthermore, we have (α, β ) = [−∞, β ) ∩ (α, ∞] ∈ Ω. Since every open set
in [−∞, ∞] may be expressed as a countable union of intervals (α, β ) we have that Ω contains all the
open sets. Thus, f −1 (E) ∈ M whenever E is open, so that f is measurable.
Proposition 7.1.8 If { fn : U → [−∞, ∞]} is a sequence of measurable functions and g = supn≥1 fn and h =
lim sup fn then g and h are measurable.
n→∞
Theorem 7.2.2 Let f : U → [0, ∞] be a measurable function. Thus there exists a sequence of simple functions
{sn : U → [0, ∞]} such that
1. 0 ≤ s1 ≤ s2 ≤ ... ≤ f ,
2. sn (u) → f (u) as n → ∞, ∀u ∈ U.
Proof 7.4 Define δn = 2−n . To each n ∈ N and each t ∈ R+ , there corresponds a unique integer K = Kn (t)
such that
Defining
Kn (t)δn , i f 0 ≤ t < n,
ϕn (t ) = (7.4)
n, i f t ≥ n,
7.3 Measures
Definition 7.3.1 (Measure) Let M be a σ -algebra on a topological space U. A function µ : M → [0, ∞] is
said to be a measure if µ(0/) = 0 and µ is countably additive, that is, given {Ai } ⊂ U, a sequence of pairwise
disjoint sets then
∞
µ(∪∞
i=1 Ai ) = ∑ µ(Ai ). (7.5)
i=1
Proposition 7.3.2 Let µ : M → [0, ∞], where M is a σ -algebra of U. Then we have the following.
1. µ(A1 ∪ ... ∪ An ) = µ(A1 ) + ... + µ(An ) for any given {Ai } of pairwise disjoint measurable sets of M .
2. If A, B ∈ M and A ⊂ B then µ(A) ≤ µ(B).
3. If {An } ⊂ M , A = ∪∞
n=1 An and
A1 ⊂ A2 ⊂ A3 ⊂ ... (7.6)
4. If {An } ⊂ M , A = ∩∞
n=1 An , A1 ⊃ A2 ⊃ A3 ⊃ .... and µ(A1 ) is finite then,
Proof 7.5
1. Take An+1 = An+2 = .... = 0/ in Definition 7.1.1 item 1,
2. Observe that B = A∪(B−A) and A∩(B−A) = 0/ so that by above µ(A∪(B−A)) = µ(A)+ µ(B−A) ≥
µ(A),
where
1, if u ∈ Ai ,
χAi (u) = (7.11)
0, otherwise,
Definition 7.4.2 (Integral for non-negative measurable functions) If f : U → [0, ∞] is measurable, for
E ∈ M , we define the integral of f on E, denoted by E f dµ, as
R
Z Z
f dµ = sup sdµ , (7.13)
E s∈A E
where
Definition 7.4.3 (Integrals for measurable functions) For a measurable f : U →R [−∞, ∞] and E ∈ M , we
define f + = max{ f , 0}, f − = max{− f , 0} and the integral of f on E, denoted by E f dµ, as
Z Z Z
f dµ = f + dµ − f − dµ .
E E E
140 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 7.4.4 (Lebesgue’s monotone convergence theorem) Let { fn } be a sequence of real measurable
functions on U and suppose that
1. 0 ≤ f1 (u) ≤ f2 (u) ≤ ... ≤ ∞, ∀u ∈ U,
2. fn (u) → f (u) as n → ∞, ∀u ∈ U.
Then,
(a) f is measurable,
R R
(b) U f n dµ → U f dµ as n → ∞.
R R
Proof 7.6 Since U f n dµ ≤ U f n+1 dµ, ∀n ∈ N, there exists α ∈ [0, ∞] such that
Z
fn dµ → α, as n → ∞, (7.15)
U
Let s be any simple function such that 0 ≤ s ≤ f , and let c ∈ R such that 0 < c < 1. For each n ∈ N we define
En = {u ∈ U | fn (u) ≥ cs(u)}. (7.18)
Clearly En is measurable and E1 ⊂ E2 ⊂ ... and U = ∪n∈N En . Observe that
Z Z Z
fn dµ ≥ fn dµ ≥ c sdµ. (7.19)
U En En
so that
Z
α≥ sdµ, ∀s simple and measurable such that 0 ≤ s ≤ f . (7.21)
U
This implies
Z
α≥ f dµ. (7.22)
U
The next result we do not prove it (it is a direct consequence of last theorem). For a proof see [68].
Corollary 7.4.5 Let { fn } be a sequence of non-negative measurable functions defined on U ( fn : U →
[0, ∞], ∀n ∈ N). Defining f (u) = ∑∞
n=1 f n (u), ∀u ∈ U, we have
Z ∞ Z
f dµ = ∑ fn dµ .
U n=1 U
Basic Results on Measure and Integration 141
Theorem 7.4.6 (Fatou’s lemma) If { fn : U → [0, ∞]} is a sequence of measurable functions, then
Z Z
lim inf fn dµ ≤ lim inf fn dµ. (7.23)
U n→∞ n→∞ U
Then
gk ≤ fk (7.25)
so that
Z Z
gk dµ ≤ fk dµ, ∀k ∈ N. (7.26)
U U
Theorem 7.4.7 (Lebesgue’s dominated convergence theorem) Suppose { fn } is sequence of complex mea-
surable functions on U such that
Proof 7.8
1. This inequality holds since f is measurable and | f | ≤ g.
142 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
Z
lim sup | fn − f | d µ ≤ 0. (7.33)
n→∞ U
Hence,
Z
lim | fn − f | dµ = 0. (7.34)
n→∞ U
Definition 7.5.2 (Signed measure) Let (U, M ) be a measurable space. We say that ν : M → [−∞, +∞] is
a signed measure if
ν may assume at most one the values −∞, +∞.
ν(0/) = 0.
ν (∑∞ ∞
n=1 En ) = ∑n=1 ν(En ) for all sequence of measurable disjoint sets {En }
We say that A ∈ M is a positive set with respect to ν if A is measurable and ν(E) ≥ 0 for all E measurable
such that E ⊂ A.
Similarly, We say that B ∈ M is a negative set with respect to ν if B is measurable and ν(E) ≤ 0 for all
E measurable such that E ⊂ B.
Finally, if A ∈ M is both positive and negative with respect to ν, it is said to be a null set.
Lemma 7.5.3 Considering the last definitions, we have that a countable union of positive measurable sets
is positive.
En = (E ∩ An ) \ (∪ni=−11 Ai ).
Basic Results on Measure and Integration 143
Lemma 7.5.4 Considering the last definitions, let E be a measurable set such that
0 < ν(E) < ∞.
Then there exists a positive set A ⊂ E such that ν(A) > 0.
Proof 7.10 Observe that if E is not positive then it contains a set of negative measure. In such a case, let
n1 be the smallest positive integer such that there exists a measurable set E1 ⊂ E such that
ν(E1 ) < −1/n1 .
Reasoning inductively, if E \ ∪k−1
j=1 E j is not positive, let nk be the smallest positive integer such that
there exists a measurable set
Ek ⊂ E \ ∪k−1
j=1 E j
such that
ν(Ek ) < −1/nk .
Define
A = E \ (∪∞
k=1 Ek ) .
Then
E = A ∪ (∪∞
k=1 Ek ) .
Since such a union is disjoint, we have,
∞
ν(E) = ν(A) + ∑ ν(Ek ),
k=1
Proposition 7.5.5 (The Hahn decomposition) Let ν be a signed measure on a measurable space (U, M ).
Then there exist a positive set A and a negative set B such that U = A ∪ B and A ∩ B = 0/.
Proof 7.11 Without loosing generality, suppose ν does not assume the value +∞ (the other case may be
dealt similarly). Define
λ = sup{ν(A) | A is positive}.
Since the empty set 0/ is positive, we obtain λ ≥ 0.
Let {An } be a sequence of positive sets such that
lim ν(An ) = λ .
n→∞
Define
A = ∪∞
i=1 Ai .
λ ≥ ν(A).
λ ≥ ν(E ∪ A)
= ν(E) + ν(A)
= ν(E) + λ , (7.35)
so that ν(E) = 0.
Thus, B contains no positive set of positive measure, so that, by Lemma 7.5.4, B contains no subsets of
positive measure, that is, B is negative.
The proof is complete.
Remark 7.5.6 Denoting the Hahn decomposition of U relating ν by {A, B}, we may define the measures ν +
and ν − by
ν + (E) = ν(E ∩ A),
and-
ν − (E) = −ν(E ∩ B),
so that
ν = ν + − ν −.
We recall that two measures ν1 and ν2 are mutually singular if there are disjoint measurable sets such
that
U = A∪B
Basic Results on Measure and Integration 145
and
ν1 (A) = ν2 (B) = 0.
Observe that the measures ν + and ν − above defined are mutually singular. The decomposition
ν = ν+ − ν−
is called the Jordan one of ν. The measures ν + and ν − are called the positive and negative parts of ν
respectively.
Observe that either ν + or ν − is finite since only one of the values +∞, −∞ may be assumed by ν. We
may also define
|ν|(E) = ν + (E) + ν − (E),
which is called the absolute value or total variation of ν.
Definition 7.6.1 (Absolutely continuous measures) We say that a measure ν is absolutely continuous with
respect to a measure µ and write ν µ, if ν(A) = 0 for all set such that µ(A) = 0. In case of a signed
measure we write ν µ if |ν| |µ|.
Theorem 7.6.2 (The Radon-Nikodym theorem) Let (U, M , µ) be a σ -finite measure space. Let ν be a
measure defined on M which is absolutely continuous with respect to µ, that is, ν µ.
Then there exists a non-negative measurable function f such that
Z
ν(E) = f dµ, ∀E ∈ M .
E
since Z Z
k f k2L2 (µ ) = | f |2 dµ ≤ | f |2 dλ = k f k2L2 (λ ) .
U U
Thus, Z Z
f dµ = f g dλ ,
U U
and in particular, Z Z
f dµ = f g (dµ + dν).
U U
Hence, Z Z
f (1 − g) dµ = f g dν. (7.37)
U U
Assume, to obtain contradiction, that g < 0 in a set A such that µ(A) > 0.
Thus, Z
(1 − g) dµ > 0,
A
so that from this and (7.37) with f = χA we get
Z
g dν > 0.
A
and hence
ν(B) = 0.
R
Thus, Bg dν = 0 so that Z
(1 − g) dµ = 0,
B
which implies that µ(B) = 0, a contradiction.
Basic Results on Measure and Integration 147
so that Z Z Z
(1 − g) dµ = g dν − dν + ν(E),
E E E
and therefore Z
ν(E) = (1 − g) (dµ + dν),
E
that is, Z
ν(E) = (1 − g) dλ , ∀E ∈ M
E
Define
B = {u ∈ U : g(u) = 0}.
R
Hence, µ(B) = B g dλ = 0.
From this, since λ µ, we obtain
g−1 g = 1, a.e. [λ ].
Therefore, for a not relabeled E ∈ M we have
Z Z
λ (E) = g−1 g dλ = g−1 dµ.
E E
Thus,
Z
ν(E) = g−1 dµ − µ(E)
E
Z
= (g−1 − 1) dµ
E
Z
= (1 − g)g−1 dµ, ∀E ∈ M . (7.39)
E
The proof for the finite case in complete. The proof for σ -finite is developed in the next lines.
Since U is σ -finite, there exists a sequence {Un } such that U = ∪∞ n=1Un , and µ(Un ) < ∞ and ν(Un ) <
∞, ∀n ∈ N.
Define
Fn = Un \ ∪n−1
j=1 U j ,
thus U = ∪∞n=1 Fn and {Fn } is a sequence of disjoint sets, such that µ(Fn ) < ∞ and ν(Fn ) < ∞, ∀n ∈ N.
Let E ∈ M . For each n ∈ N from above we may obtain fn such that
Z
ν(E ∩ Fn ) = fn dµ, ∀E ∈ M .
E∩Fn
148 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
From this and the monotone convergence theorem corollary we may write
∞
ν(E) = ∑ ν(E ∩ Fn )
n=1
∞ Z
= ∑ fn dµ
n=1 E ∩Fn
∞ Z
= ∑ fn χFn dµ
n=1 E
Z ∞
= ∑ fn χFn dµ
E n=1
Z
= f dµ, (7.40)
E
where
∞
f= ∑ fn χFn .
n=1
The proof is complete.
Theorem 7.6.3 (The Lebesgue decomposition) Let (U, M , µ) be a σ -finite measure space and let ν be a
σ -finite measure defined on M .
Then we may find a measure ν0 , singular with respect to µ, and a measure ν1 , absolutely continuous with
respect to µ, such that
ν = ν0 + ν1 .
Furthermore, the measures ν0 and ν1 are unique.
λ = ν + µ.
Observe that ν and µ are absolutely continuous with respect to λ . Hence, by the Radon-Nikodym theorem,
there exist non-negative measurable functions f and g such that
Z
µ(E) = f dλ , ∀E ∈ M
E
and Z
ν(E) = g dλ ∀E ∈ M .
E
Define
A = {u ∈ U | f (u) > 0},
and
B = {u ∈ U | f (u) = 0}.
Thus,
U = A ∪ B,
and
A ∩ B = 0/.
Also define
ν0 (E) = ν(E ∩ B), ∀E ∈ M .
Basic Results on Measure and Integration 149
ν1 (E) = ν(E ∩ A)
Z
= g dλ . (7.41)
E∩A
Therefore,
ν = ν0 + ν1 .
To finish the proof, we have only to show that
ν1 µ.
and in particular Z
f dλ = 0.
E∩A
Since f > 0 on A ∩ E we conclude that
λ (A ∩ E) = 0.
Therefore, since ν λ , we obtain
ν(E ∩ A) = 0,
so that
ν1 (E) = ν(E ∩ A) = 0.
From this we may infer that
ν1 µ.
The proof of uniqueness is left to the reader.
µ ∗ (A) = µ ∗ (A ∩ E) + µ ∗ (A ∩ E c ), ∀A ⊂ U.
150 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
µ ∗ (A) = µ ∗ (A ∩ E2 ) + µ ∗ (A ∩ E2c ),
Since,
A ∩ (E1 ∪ E2 ) = (A ∩ E2 ) ∪ (A ∩ E1 ∩ E2c ),
we obtain
µ ∗ (A ∩ (E1 ∪ E2 )) ≤ µ ∗ (A ∩ E2 ) + µ ∗ (A ∩ E2c ∩ E1 ). (7.43)
From this and (7.42) we obtain
Hence E1 ∪ E2 is µ ∗ -measurable.
By induction, the union of a finite number of µ ∗ - measurable sets is µ ∗ -measurable.
Assume E = ∪∞ ∗
i=1 Ei where {Ei } is a sequence of disjoint µ -measurable sets.
n ∗
Define Gn = ∪i=1 Ei . Then Gn is µ -measurable and for a given A ⊂ U we have
µ ∗ (A) = µ ∗ (A ∩ Gn ) + µ ∗ (A ∩ Gcn )
≥ µ ∗ (A ∩ Gn ) + µ ∗ (A ∩ E c ), (7.45)
since E c ⊂ Gcn , ∀n ∈ N.
Observe that
Gn ∩ En = En
and
Gn ∩ Enc = Gn−1 .
Thus, from the measurability of En we may get
µ ∗ (A ∩ Gn ) = µ ∗ (A ∩ Gn ∩ En ) + µ ∗ (A ∩ Gn ∩ Enc )
= µ ∗ (A ∩ En ) + µ ∗ (A ∩ Gn−1 ). (7.46)
By induction we obtain
n
µ ∗ (A ∩ Gn ) = ∑ µ ∗ (A ∩ Ei ),
i=1
so that
n
µ ∗ (A) ≥ µ ∗ (A ∩ E c ) + ∑ µ ∗ (A ∩ Ei ), ∀n ∈ N,
i=1
Basic Results on Measure and Integration 151
Definition 7.7.3 A measure on an Algebra A ⊂ U is a set function µ : A → [0, +∞) such that
1. µ(0/) = 0,
2. if {Ei } is a sequence of disjoint sets in A so that E = ∪∞
i=1 Ei ∈ A , then
∞
µ(E) = ∑ µ(Ei ).
i=1
where Ai ∈ A , ∀i ∈ N.
152 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
A ⊂ ∪∞
i=1 Ai .
Proof 7.16 The only not immediate property to be proven is the countably subadditivity.
Suppose E ⊂ ∪∞ ∗
i=1 Ei . If µ (Ei ) = +∞ for some i ∈ N the result holds.
∗
Thus, assume µ (Ei ) < +∞, ∀i ∈ N.
Let ε > 0. Thus for each i ∈ N there exists {Ai j } ⊂ A such that Ei ⊂ ∪∞j=1 Ai j , and
∞
ε
∑ µ(Ai j ) ≤ µ ∗ (Ei ) + 2i .
j=1
Therefore,
∞ ∞ ∞
µ ∗ (E) ≤ ∑ ∑ µ(Ai j ) ≤ ∑ µ ∗ (Ei ) + ε.
i=1 j=1 i=1
Proof 7.17 Let E ∈ U such that µ ∗ (E) < +∞. Let ε > 0.
Thus, there exists {Ai } ⊂ A such that E ⊂ ∪∞
i=1 Ai and
∞
∑ µ(Ai ) < µ ∗ (E) + ε.
i=1
Observe that
µ(Ai ) = µ(Ai ∩ A) + µ(Ai ∩ Ac ),
Basic Results on Measure and Integration 153
µ ∗ (E) ≥ µ ∗ (E ∩ A) + µ ∗ (E ∩ Ac ).
The proof is complete.
Proposition 7.7.8 Suppose µ is a measure on an Algebra A ⊂ U, µ ∗ is the outer measure induced by µ and
E ⊂ U is a set. Then, for each ε > 0, there is a set A ∈ Aσ with E ⊂ A and
µ ∗ (A) ≤ µ ∗ (E) + ε.
Also, there is a set B ∈ Aσ δ such that E ⊂ B and
µ ∗ (E) = µ ∗ (B).
Proof 7.18 Let ε > 0. Thus, there is a sequence {Ai } ⊂ A such that
E ⊂ ∪∞
i=1 Ai
and
∞
∑ µ(Ai ) ≤ µ ∗ (E) + ε.
i=1
Define A = ∪∞
i=1 Ai , then
∞
µ ∗ (A) ≤ ∑ µ ∗ (Ai )
i=1
∞
= ∑ µ(Ai )
i=1
∗
≤ µ (E) + ε. (7.50)
Now, observe that we write A ∈ Aσ if A = ∪∞ where Ai ∈ A , ∀i ∈ N.
i=1 Ai
Also, we write B ∈ Aσ δ if B = ∩∞
n=1 A n , where ∈ Aσ , ∀n ∈ N.
An
From above, for each n ∈ N there exists An ∈ Aσ such that
E ⊂ An
and
µ ∗ (An ) ≤ µ(E) + 1/n.
n=1 An . Thus B ∈ Aσ δ , E ⊂ B and
Define B = ∩∞
µ ∗ (B) ≤ µ ∗ (An ) ≤ µ ∗ (E) + 1/n, ∀n ∈ N.
Hence,
µ ∗ (B) = µ ∗ (E).
The proof is complete.
154 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proposition 7.7.9 Suppose µ is a σ -finite measure on a σ -algebra A , and let µ ∗ be the outer measure
induced by µ.
Under such hypotheses, a set E is µ ∗ measurable if and only if E = A \ B where A ∈ Aσ δ , B ⊂ A,
µ ∗ (B) = 0.
Finally, for each set B such that µ ∗ (B) = 0, there exists C ∈ Aσ δ such that B ⊂ C and µ ∗ (C) = 0.
Observe that
E = ∪∞
i=1 Ei
where
Ei = E ∩Ui ,
is µ ∗ -measurablefor each i ∈ N.
Let ε > 0. From the last proposition for each i, n ∈ N there exists Ani ∈ Aσ such that
1
µ(Ani ) < µ ∗ (Ei ) + .
n2i
Define
An = ∪∞
i=1 Ani .
Thus,
E ⊂ An
and
An \ E ⊂ ∪∞
i=1 (Ani \ Ei ),
and therefore,
∞ ∞
1 1
µ(An \ E) ≤ ∑ µ(Ani \ Ei ) ≤ ∑ i
= .
i=1 i=1 n 2 n
Since An ∈ Aσ , defining
A = ∩∞
n=1 An ,
Theorem 7.7.10 (Caratheodory) ´ Let µ be a measure on a algebra A and µ ∗ the respective induced outer
measure.
Then the restriction µ of µ ∗ to the µ ∗ -measurable sets is an extension of µ to an σ -algebra containing
A . If µ is finite or σ -finite, so is µ. In particular, if µ is σ -finite, then µ is the only measure on the smallest
σ -algebra containing A which is an extension of µ.
Basic Results on Measure and Integration 155
Proof 7.20 From the Theorem 7.7.2, µ is an extension of µ to a σ -algebra containing A , that is, µ is a
measure on such a set.
Observe that, from the last results, if µ is σ -finite so is µ.
Now assume µ is σ -finite. We will prove the uniqueness of µ.
Let B be the smallest σ -algebra containing A and let µ̃ be another measure on B which extends µ
on A .
Since each set Aσ may be expressed as a disjoint countable union of sets in A , the measure µ̃ equals µ
on Aσ . Let B be a µ ∗ -measurable set such that µ ∗ (B) < ∞.
Let ε > 0. By the Proposition 7.7.9 there exists an A ∈ Aσ such that B ⊂ A and
Since B ⊂ A, we obtain
µ̃(B) ≤ µ̃(A) = µ ∗ (A) ≤ µ ∗ (B) + ε.
Considering that ε > 0 is arbitrary, we get
µ̃(B) ≤ µ ∗ (B).
µ ∗ (A) ≤ µ ∗ (B) + ε
so that
µ ∗ (A) = µ ∗ (B) + µ ∗ (A \ B),
from this and above
µ̃(A \ B) ≤ µ ∗ (A \ B) ≤ ε,
if µ ∗ (B) < ∞.
Therefore,
Remark 7.7.11 We may start the process of construction of a measure by the action of a set function on a
semi-algebra. Here, a semi-algebra C is a collection of subsets of U such that the intersection of any two
sets in C is in C and, the complement of any set in C is a finite disjoint union of sets in C .
If C is any semi-algebra of sets, then the collection consisting of the empty set and all finite disjoint
unions of sets in C is an algebra, which is said to be generated by C . We denote such algebra by A .
If we have a set function acting on C , we may extend it to A by defining
n
µ(A) = ∑ µ(Ei ),
i=1
where, A = ∪ni=1 Ei and Ei ∈ C , ∀i ∈ {1, ..., n}, so that this last union is disjoint. We recall that any A ∈ A
admits such a representation.
U ×V = {(u, v) | u ∈ U and v ∈ V }.
(A × B) ∩ (C × D) = (A ∩C) × (B ∩ D),
and
(A × B)c = (Ac × B) ∪ (A × Bc ) ∪ (Ac × Bc ).
We define λ : M1 × M2 → R+ by
λ (A × B) = µ1 (A)µ2 (B).
Lemma 7.8.3 Let {Ai × Bi }i∈N be a countable disjoint collection of measurable rectangles whose the union
is the rectangle A × B. Then
∞
λ (A × B) = ∑ µ1 (Ai )µ2 (Bi ).
i=1
Basic Results on Measure and Integration 157
Proof 7.21 Let u ∈ A. Thus each v ∈ B is such that (u, v) is exactly in one Ai × Bi . Therefore
∞
χA×B (u, v) = ∑ χAi (u)χBi (v).
i=1
Hence for the fixed u in question, from the corollary of Lebesgue monotone convergence theorem we may
write
Z Z ∞
χA×B (u, v)dµ2 (v) = ∑ χAi (u)χBi (v)dµ2 (v)
V V i=1
∞
= ∑ χAi (u)µ2 (Bi ) (7.53)
i=1
Observe that
Z Z Z Z
dµ1 (u) χA×B (u, v)dµ2 (v) = dµ1 (u) χA (u)χB (v)dµ2 (v)
U V U V
= µ1 (A)µ2 (B).
Eu = {v | (u, v) ∈ E},
and
Ev = {u | (u, v) ∈ E}.
Observe that
χEu (v) = χE (u, v),
(E c )u = (Eu )c ,
and
(∪Eα )u = ∪(Eα )u ,
for any collection {Eα }.
We denote by Rσ as the collection of sets which are countable unions of measurable rectangles. Also,
Rσ δ will denote the collection of sets which are countable intersections of elements of Rσ .
Lemma 7.8.5 Let u ∈ U and E ∈ Rσ δ . Then Eu is a measurable subset of V .
Proof 7.22 If E ∈ R the result is trivial. Let E ∈ Rσ . Then E may be expressed as a disjoint union
E = ∪∞
i=1 Ei ,
158 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where Ei ∈ R, ∀i ∈ N. Thus,
E = ∩∞
i=1 Ei ,
Thus as from above χ(Ei )u (v) is measurable for each i ∈ N, we have that χEu is also measurable so that Eu is
measurable.
Lemma 7.8.6 Let E be a set in Rσ δ with (µ1 × µ2 )(E) < ∞. Then the function g defined by
g(u) = µ2 (Eu )
Proof 7.23 The lemma is true if E is a measurable rectangle. Let {Ei } be a disjoint sequence of measurable
rectangles and E = ∪∞
i=1 Ei . Set
gi (u) = µ2 ((Ei )u ).
Then each gi is a non-negative measurable function and
∞
g = ∑ gi .
i=1
Thus g is measurable, and by the corollary of the Lebesgue monotone convergence theorem, we have
Z ∞ Z
g(u)dµ1 (u) = ∑ gi (u)dµ1 (u)
U i=1 U
∞
= ∑ (µ1 × µ2 )(Ei )
i=1
= (µ1 × µ2 )(E). (7.56)
Basic Results on Measure and Integration 159
Ei+1 ⊂ Ei
and
E = ∩∞
i=1 Ei .
that is,
gi → g, a.e. in E.
We may conclude that g is also measurable. Since
0 ≤ gi ≤ g, ∀i ∈ N
the Lebesgue dominated convergence theorem implies that
Z Z
g(u) dµ1 (u) = lim gi dµ1 (u)
U i→∞ U
= lim(µ1 × µ2 )(Ei )
i→∞
= (µ1 × µ2 )(E).
Lemma 7.8.7 Let E be a set such that (µ1 × µ2 )(E) = 0. then for almost all u ∈ U we have
µ2 (Eu ) = 0.
Proof 7.24 Observe that there is a set in Rσ δ such that E ⊂ F and
(µ1 × µ2 )(F) = 0.
Proposition 7.8.8 Let E be a measurable subset of U ×V such that (µ1 × µ2 )(E) is finite. The for almost all
u the set Eu is a measurable subset of V . The function g defined by
g(u) = µ2 (Eu )
is measurable and Z
g dµ1 (u) = (µ1 × µ2 )(E).
160 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 7.25 First observe that there is a set F ∈ Rσ δ such that E ⊂ F and
(µ1 × µ2 )(F) = (µ1 × µ2 )(E).
Let G = F \ E. Since F and E are measurable, G is measurable, and
(µ1 × µ2 )(G) = 0.
By the last lemma we obtain
µ2 (Gu ) = 0,
for almost all u so that
g(u) = µ2 (Eu ) = µ2 (Fu ), a.e. in U.
By Lemma 7.8.6 we may conclude that g is measurable and
Z
g dµ1 (u) = (µ1 × µ2 )(E).
U
Theorem 7.8.9 (Fubini) Let (U, M1 , µ1 ) and (V, M2 , µ2 ) be two complete measure spaces and f an inte-
grable function on U ×V . Then
1. fu (v) = f (u, v) is measurable and integrable for almost all u.
2. fv (u) = f (u, v) is measurable and integrable for almost all v.
R
3. h1 (u) = V f (u, v) dµ2 (v) is integrable on U.
R
4. h2 (v) = U f (u, v) dµ1 (u) is integrable on V .
5.
Z Z Z Z
f dµ2 (v) dµ1 (u) = f dµ1 (u) dµ2 (v)
U V V U
Z
= f d(µ1 × µ2 ). (7.58)
U×V
Proof 7.26 It suffices to consider the case where f is non-negative (we can then apply the result to f + =
max( f , 0) and f − = max(− f , 0)). The last proposition asserts that the theorem is true if f is a simple function
which vanishes outside a set of finite measure. Similarly as in in Theorem 7.2.2, we may obtain a sequence
of non-negative simple functions {φn } such that
φn ↑ f .
Observe that, given u ∈ U, fu is such that
(φn )u ↑ fu , a.e. .
By the Lebesgue monotone convergence theorem we get
Z Z
f (u, v) dµ2 (v) = lim φn (u, v) dµ2 (v),
V n→∞ V
so that this last resulting function is integrable in U. Again by the Lebesgue monotone convergence theorem,
we obtain
Z Z Z Z
f dµ2 (v) dµ1 (u) = lim φn dµ2 (v) dµ1 (u)
U V n→∞ U V
Z
= lim φn d(µ1 × µ2 )
n→∞ U×V
Z
= f d(µ1 × µ2 ). (7.59)
U×V
Chapter 8
8.1 Introduction
In this chapter we will define the Lebesgue measure and the concept of Lebesgue measurable set. We show
that the set of Lebesgue measurable sets is a σ − algebra so that the earlier results, proven for more gen-
eral measure spaces, remain valid in the present context (such as the Lebesgue monotone and dominated
convergence theorems).
The main references for this chapter are [72, 69].
We start with the following theorems without proofs.
Theorem 8.1.1 Every open set A ⊂ R may be expressed as a countable union of disjoint open intervals.
Remark 8.1.2 In this text Q j denotes a closed cube in Rn and |Q j | its volume, that is, |Q j | = ∏ni=1 (bi − ai ),
where Q j = ∏ni=1 [ai , bi ]. Also we assume that if two Q1 and Q2 closed or not, have the same interior, then
|Q1 | = |Q2 | = |Q̄1 |. We recall that two cubes Q1 , Q2 ⊂ Rn are said to be quasi-disjoint if their interiors are
disjoint.
Theorem 8.1.3 Every open set A ⊂ Rn , where n ≥ 1 may be expressed as a countable union of quasi-disjoint
closed cubes.
Definition 8.1.4 (Outer measure) Let E ⊂ Rn . The outer measure of E, denoted by m∗ (E), is defined by
( )
∞
m∗ (E) = inf ∑ |Q j | : E ⊂ ∪∞j=1 Q j ,
j=1
1. Monotonicity: If E1 ⊂ E2 then m∗ (E1 ) ≤ m∗ (E2 ). This follows from the fact that if E2 ⊂ ∪∞j=1 Q j then
E1 ⊂ ∪∞j=1 Q j .
2. Countable sub-additivity : If E ⊂ ∪∞j=1 E j , then m∗ (E) ≤ ∑∞j=1 m∗ (E j ).
Proof 8.1 First assume that m∗ (E j ) < ∞, ∀ j ∈ N, otherwise the result is obvious. Thus, given ε > 0
for each j ∈ N there exists a sequence {Qk, j }k∈N such that
E j ⊂ ∪∞
k=1 Qk, j
and
∞
ε
∑ |Qk, j | < m∗ (E j ) + 2 j .
k=1
Hence
E ⊂ ∪∞j,k=1 Qk, j
and therefore
!
∞ ∞ ∞
m∗ (E) ≤ ∑ |Qk, j | = ∑ ∑ |Qk, j |
j,k=1 j=1 k=1
∞
ε
≤ ∑ m∗ (E j ) + j
j=1 2
∞
= ∑ m∗ (E j ) + ε. (8.1)
j=1
3. If
E ⊂ Rn ,
and
α = inf{m∗ (A) | A is open and E ⊂ A},
then
m∗ (E) = α.
Thus,
m∗ (E) ≤ α.
Suppose given ε > 0. Choose a sequence {Q j } of closed cubes such that
E ⊂ ∪∞j=1 Q j
The Lebesgue Measure in Rn 163
and
∞
∑ |Q j | ≤ m∗ (E) + ε.
j=1
therefore
α ≤ m∗ (E) + 2ε.
Being ε > 0 arbitrary, we have
α ≤ m∗ (E).
The proof is complete.
E ⊂ ∪∞j=1 Q j ,
and
∞
∑ |Q j | ≤ m∗ (E) + ε.
j=1
hence,
m∗ (E1 ) + m∗ (E2 ) ≤ ∑ |Q j | + ∑ |Q j |
j∈J1 j∈J2
∞
≤ ∑ |Q j | ≤ m∗ (E) + ε. (8.3)
j=1
E = ∪∞j=1 Q j
then
∞
m∗ (E) = ∑ |Q j |.
j=1
Being
∪Nj=1 Q˜ j ⊂ E
we obtain
N N
m∗ (E) ≥ ∑ |Q̃ j | ≥ ∑ |Q j | − ε.
j=1 j=1
Therefore,
∞
∑ |Q j | ≤ m∗ (E) + ε.
j=1
m∗ (A) = m∗ (A ∩ E) + m∗ (A ∩ E c ).
m(E) = m∗ (E).
Proposition 8.3.2 Let E ⊂ Rn be such that m∗ (E) = 0. Under such hypotheses, E is measurable.
m∗ (A ∩ E) = 0.
m∗ (A) ≤ m∗ (A ∩ E) + m∗ (A ∩ E c )
= m∗ (A ∩ E c )
≤ m∗ (A). (8.4)
Thus,
m∗ (A) = m∗ (A ∩ E) + m∗ (A ∩ E c ).
Since A is arbitrary, we may infer that E is measurable.
m∗ (A) = m∗ (A ∩ E1 ) + m∗ (A ∩ E1c )
= m∗ (A ∩ E1 ) + m∗ ((A ∩ E1c ) ∩ E2 ) + m∗ ((A ∩ E1c ) ∩ E2c ). (8.5)
(A ∩ E1 ) ∪ [A ∩ E1c ∩ E2 ] = (A ∩ E1 ) ∪ (A ∩ E2 ) ∩ E1c )
= A ∩ (E1 ∪ E2 ) ∩ [A ∩ (E1 ∪ E1c )]
= A ∩ [E1 ∪ E2 ] ∩ A
= A ∩ (E1 ∪ E2 ). (8.6)
Theorem 8.3.4 Let A ⊂ Rn and let {Ek }nk=1 be finite collection of measurable disjoint sets.
Under such hypotheses,
n
m∗ (A ∩ [∪nk=1 Ek ]) = ∑ m∗ (A ∩ Ek ).
k=1
In particular,
n
m∗ (∪nk=1 Ek ) = ∑ m∗ (Ek ).
k=1
Proof 8.7 We prove the result by induction on n. The result is immediate for n = 1.
Assume the result holds for n − 1.
Since the collection {Ek }nk=1 is disjoint, we have
A ∩ [∪nk=1 Ek ] ∩ En = A ∩ En ,
and
A ∩ [∪nk=1 Ek ] ∩ Enc = A ∩ [∪n−1
k=1 Ek ].
Thus, from the measurability of En and the induction assumption, we get
Remark 8.4.1 Let A ⊂ B ⊂ Rn be sets such that A is a measurable set which has a finite outer measure.
In such a case
m∗ (B) = m∗ (B ∩ A) + m∗ (B ∩ Ac )
= m∗ (A) + m∗ (B \ A), (8.9)
so that
m∗ (B \ A) = m∗ (B) − m∗ (A).
The Lebesgue Measure in Rn 167
Theorem 8.4.2 Let E ⊂ Rn . Under such hypothesis, the following properties are equivalent to the measur-
ability of E:
E ⊂O
and
m∗ (O \ E) < ε.
F ⊂E
and
m∗ (E \ F) < ε.
Proof 8.8 Assume E is measurable. Let ε > 0. Assume first m∗ (E) < ∞. Hence, there exists a sequence of
open blocks {Q j } ⊂ Rn such that
E ⊂ ∪∞j=1 Q j
and
∞
∑ m∗ (Q j ) < m∗ (E) + ε.
j=1
E ⊂O
and
∞
m∗ (O) ≤ ∑ m∗ (Q j ) < m∗ (E) + ε.
j=1
Thus
m∗ (O) − m∗ (E) < ε.
From the measurability of E we get,
m∗ (O) = m∗ (O ∩ E) + m∗ (O ∩ E c )
= m∗ (O ∩ E) + m∗ (O \ E), (8.10)
so that
m(O \ E) = m∗ (O) − m∗ (E) < ε.
Assume now m∗ (E) = ∞. Let Fk = E ∩ Bk (0), ∀k ∈ Rn , where Bk (0) denotes the open ball of center 0
and radius k.
168 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
O \ E = (∪∞
k=1 Ok ) \ E = ∪k=1 (Ok \ E) ⊂ ∪k=1 (Ok \ Ek ).
∞ ∞
Thus,
∞
m∗ (O \ E) ≤ ∑ m∗ (Ok \ Ek )
k=1
∞
ε
≤ ∑ 2k
k=1
= ε. (8.11)
m∗ (G \ E) = 0.
Hence, G \ E is measurable.
We now are going to show that
E = G ∩ (G \ E )c .
Indeed, since E ⊂ G, we have,
x ∈ G ∩ (G \ E )c ⇔ x ∈ G ∩ (G ∩ E c )c
⇔ x ∈ G ∩ (Gc ∪ E)
⇔ x ∈ (G ∩ Gc ) ∪ (G ∩ E)
⇔ x ∈ E. (8.12)
Summarizing
E = G ∩ (G \ E)c ,
so that E is measurable.
From this we may infer that the measurability of E is equivalent of 1 and equivalent to 2.
Reasoning with the complements, we may obtain that 1 is equivalent to 3 and 2 is equivalent to 4.
This completes the proof.
The Lebesgue Measure in Rn 169
∩∞j=1 E j = (∪∞j=1 E cj )c .
Theorem 8.5.1 If {Ei } is sequence of measurable pairwise disjoint sets and E = ∪∞j=1 Ei then
∞
m(E) = ∑ m(E j ).
j=1
Proof 8.10 First assume that E j is bounded. Being E cj measurable, given ε > 0 there exists an open H j ⊃ E cj
such that
ε
m∗ (H j − E cj ) < , ∀ j ∈ N.
2j
Denoting Fj = H cj we have that Fj ⊂ E j is closed and
ε
m∗ (E j − Fj ) < , ∀ j ∈ N.
2j
For each N ∈ N the sets F1 , ..., FN are compact and disjoint, so that
N
m(∪Nj=1 Fj ) = ∑ m(Fj ).
j=1
As
∪Nj=1 Fj ⊂ E
we have
N N
m(E) ≥ ∑ m(Fj ) ≥ ∑ m(E j ) − ε.
j=1 j=1
Hence,
∞
m(E) ≥ ∑ m(E j ) − ε.
j=1
For the general case, select a sequence of cubes {Qk } such that
Rn = ∪∞
k=1 Qk
E j,k = E j ∩ Sk , ∀ j, k ∈ N.
Thus,
E = ∪∞j=1 ∪∞ ∞
k=1 E j,k = ∪ j,k=1 E j,k ,
where such an union is disjoint and each E j,k is bounded. Through the last result, we get
∞
m(E) = ∑ m(E j,k )
j,k=1
∞ ∞
= ∑ ∑ m(E j,k )
j=1 k=1
∞
= ∑ m(E j ). (8.13)
j=1
m(E \ K) < ε.
F = ∪Nj=1 Q j
such that
m(E 4 F) ≤ ε,
where
E 4 F = (E \ F) ∪ (F \ E).
Proof 8.11
1. Choose a closed set such that F ⊂ E and
ε
m(E \ F) < .
2
Let Bn be a closed ball with center at origin and radius n. Define Kn = F ∩ Bn and observe that Kn is
compact, ∀n ∈ N. Thus
E \ Kn & E \ F.
Being m(E) < ∞ we have
m(E \ Kn ) < ε,
for all n sufficiently big.
The Lebesgue Measure in Rn 171
E ⊂ ∪∞j=1 Q j ,
and
∞
ε
∑ |Q j | ≤ m(E) + 2 .
j=1
Being m(E) < ∞ the series converges and there exists N0 ∈ N such that
∞
ε
∑ |Q j | < .
N0 +1 2
N
0
Defining F = ∪ j=1 Q j , we have
is measurable.
Observe that:
1. If
f −1 ([−∞, a))
is measurable for each a ∈ R then
f −1 ([−∞, a)) = ∪∞
k=1 f
−1
([−∞, a − 1/k])
Proof 8.12 The first implication is obvious. For the second, being φ continuous
φ −1 ([−∞, a))
is open, and therefore
(φ ◦ f )−1 (([−∞, a)) = f −1 (φ −1 ([−∞, a)))
is measurable, ∀a ∈ R.
and
lim sup fk (x), lim inf fk (x)
k→∞ k→∞
are measurable.
Proof 8.13 We will prove only that supn∈N fn (x) is measurable. The remaining proofs are analogous. Let
f (x) = sup fn (x).
n∈N
Thus
f −1 ((a, +∞]) = ∪∞ −1
n=1 f n ((a, +∞]).
Being each fn measurable, such a set is measurable, ∀a ∈ R. By analogy
inf fk (x)
k∈N
is measurable,
lim sup fk (x) = inf sup f j (x),
k→∞ k≥1 j≥k
and
lim inf fk (x) = sup inf f j (x)
k→∞ k≥1 j≥k
are measurable.
The Lebesgue Measure in Rn 173
Then f is measurable.
The next result we do not prove it. For a proof see [72].
Proposition 8.6.5 If f and g are measurable functions, then
1. f 2 is measurable.
2. f + g and f · g are measurable if both assume finite values.
Observe that
Thus g−1 ((a, +∞]) ∩ B is measurable. As g−1 ((a, +∞]) ∩ A ⊂ A we have m∗ (g−1 ((a, +∞]) ∩ A) = 0, that is,
such a set is measurable. Hence being g−1 ((a, +∞]) the union of two measurable sets is also measurable.
Being a ∈ R arbitrary, g is measurable.
Theorem 8.6.7 Suppose f is a non-negative measurable function on Rn . Then there exists a increasing
sequence of non-negative simple functions {ϕk } such that
Proof 8.16 Let N ∈ N. Let QN be the cube with center at origin and side of measure N. Define
f (x), if x ∈ QN and f (x) ≤ N,
FN (x) = N, if x ∈ QN and f (x) > N,
0, otherwise.
174 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 8.6.8 Suppose that f is a measurable function defined on Rn . Then there exists a sequence of
simple functions {ϕk } such that
|ϕk (x)| ≤ |ϕk+1 (x)|, ∀x ∈ Rn , k ∈ N,
and
lim ϕk (x) = f (x), ∀x ∈ Rn .
k→∞
Theorem 8.6.9 Suppose f is a measurable function in Rn . Then there exists a sequence of step functions
{ϕk } which converges to f a.e. in Rn .
Proof 8.18 From the last theorem, it suffices to prove that if E is measurable and m(E) < ∞ then χE may be
approximated almost everywhere in E by step functions. Suppose given ε > 0. Observe that from Proposition
8.5.2, there exist cubes Q1 , ..., QN such that
Thus,
M
f (x) = ∑ χR j ,
j=1
possibly except in a set of measure < 2ε. Hence, for each k > 0, there exists a step function ϕk such that
m(Ek ) < 2−k where
Ek = {x ∈ Rn | f (x) =
6 ϕk (x)}.
Defining
Fk = ∪∞j=k+1 E j
we have
∞
m(Fk ) ≤ ∑ m(E j )
j=k+1
∞
≤ ∑ 2− j
j=k+1
2−(k+1)
=
1 − 1/2
= 2−k . (8.17)
Theorem 8.6.10 (Egorov) Suppose that { fk } is a sequence of measurable functions defined in a measurable
set E such that m(E) < ∞. Assume that fk → f , a.e in E. Thus given ε > 0 we may find a closed set Aε ⊂ E
such that fk → f uniformly in Aε and m(E − Aε ) < ε.
fk → f , ∀x ∈ E.
176 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
1
m(E − EkNN ) < .
2N
Observe that
1
| f j (x) − f (x)| < , ∀ j ≥ kN , x ∈ EkNN .
N
Choose M ∈ N such that
∞
ε
∑ 2−k ≤ 2 .
k=M
Define
N
Ãε = ∩∞
N ≥M EkN .
Thus,
∞
ε
m(E − Ãε ) ≤ ∑ m(E − EkNN ) < .
N=M 2
Suppose given δ > 0. Let N ∈ N be such that N > M and 1/N < δ . Thus, if x ∈ Ãε then x ∈ EkNN so that
Hence, fk → f uniformly in Ãε . Observe that Ãε is measurable and thus there exists a closed set Aε ⊂ Ãε
such that
ε
m(Ãε − Aε ) < .
2
That is:
ε ε
m(E − Aε ) ≤ m(E − Ãε ) + m(Ãε − Aε ) < + = ε,
2 2
and
fk → f
uniformly in Aε . The proof is complete.
Definition 8.6.12 We say that a set A ⊂ L1 (Rn ) is dense in L1 (Rn ), if for each f ∈ L1 (Rn ) and each ε > 0
there exists g ∈ A such that Z
k f − gkL1 (Rn ) = | f − g| dx < ε.
Rn
Proof 8.20
1. From the last theorems given f ∈ L1 (Rn ) there exists a sequence of simple functions such that
as k → ∞.
2. From the last item , it suffices to show that simple functions may be approximated by step functions.
As a simple function is a linear combination of characteristic functions of sets of finite measure, it
suffices to prove that given ε > 0 and a set of finite measure, there exists ϕ a step function such that
Thus,
k f − gkL1 (Rn ) < 2ε.
for the general case of a rectangle in Rn , we just recall that in this case f is the product of the charac-
teristic functions of n intervals. Therefore we may approximate f by the product of n functions similar
to g defined above.
Chapter 9
In this chapter we present some important results which may be found in similar form at Chapters 2, 6 and 7
in the excellent book Real and Complex Analysis [68] by W. Rudin, where more details may be found.
then
f = 0, a.e. in E.
Therefore, µ(An ) = 0, ∀n ∈ N.
Define
A = {u ∈ E | f (u) > 0}.
Hence,
A = ∪∞
n=1 An ,
so that µ(A) = 0.
Thus,
f = 0, a.e. in E.
Other Topics in Measure and Integration 179
At this point we present some preliminary results to the development of the well known Urysohn’s lemma.
180 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 9.1.4 Let U be a Hausdorff space and K ⊂ U compact. Let v ∈ K c . Then there exists open sets V
and W ⊂ U such that v ∈ V , K ⊂ W and V ∩W = 0/.
Proof 9.4 For each u ∈ K there exists open sets Wu ,Vvu ⊂ U such that u ∈ Wu , v ∈ Wvu and Wu ∩Vvu = 0/.
Observe that K ⊂ ∪u∈K Wu so that, since K is compact there exist u1 , u2 , ..., un ∈ K such that
K ⊂ ∪ni=1Wui .
Hence,
Kα0 ⊂ [∩α∈L\{α0 } Kα ]c ,
that is,
Kα0 ⊂ ∪α ∈L\{α0 } Kαc = ∪α ∈L\{α0 }Vα .
Since Kα0 is compact, there exists α1 , ..., αn ∈ L such that
so that,
Kα0 ∩ Kα1 ∩ ... ∩ Kαn = 0/.
The proof is complete.
Definition 9.1.6 We say that a space U is locally compact if each u ∈ U has a neighborhood whose closure
is compact.
Theorem 9.1.7 Let U be a locally compact Hausdorff space. Suppose W ⊂ U is open and K ⊂ W , where K
is compact. Then there exists an open set V ⊂ U whose closure is compact and such that
K ⊂ V ⊂ V ⊂ W.
Proof 9.6 Let u ∈ K. Since U is locally compact there exists an open Vu ⊂ U such that u ∈ Vu and V u is
compact.
Observe that
K ⊂ ∪u∈K Vu
Other Topics in Measure and Integration 181
K ⊂ ∪nj=1Vu j .
C ∩ G ∩W v1 ∩ ... ∩W vn = 0/.
Defining
V = G ∩Wv1 ∩ ... ∩Wvn
we obtain
V ⊂ G ∩W v1 ∩ ... ∩W vn .
Also,
K ⊂ V ⊂ V ⊂ W.
This completes the proof.
Observe that from this last definition f is continuous if and only if, it is both lower and upper semi-
continuous.
Here we state and prove a very important result namely, the Uryshon’s lemma.
Lemma 9.1.9 (Urysohn’s lemma) Assume U is a locally compact Hausdorff space, V ⊂ U is an open set
which contains a compact set K. Under such assumptions, there exists a function f ∈ Cc (V ) such that
0 ≤ f (u) ≤ 1, ∀u ∈ V,
f (u) = 1, ∀u ∈ K.
Proof 9.7 Set r1 = 0 and r2 = 1, and let r3 , r4 , r5 , ... be an enumeration of the rational numbers in (0, 1).
Observe that we may find open sets V0 and V1 such that V 0 is compact and
K ⊂ V1 ⊂ V 1 ⊂ V0 ⊂ V 0 ⊂ V.
Reasoning by induction, suppose n ≥ 2 and that Vr1 , ...,Vrn have been chosen so that if ri < r j then V r j ⊂ Vri .
Denote
ri = max{rk | k ∈ {1, ..., n} and rk < rn+1 },
and
r j = min{rk , | k ∈ {1, ..., n} and rk > rn+1 }.
We may find again an open set Vrn+1 such that
Thus, we have obtained a sequence Vr of open sets such that for every r rational in (0, 1) V r is compact and
if s > r then V s ⊂ Vr . Define,
r, if u ∈ Vr ,
fr (u) =
0, otherwise,
and
1, if u ∈ V s ,
gs (u) =
s, otherwise.
Also define
f (u) = sup fr (u), ∀u ∈ V,
r∈Q∩(0,1)
and
g(u) = inf gs (u), ∀u ∈ V.
s∈Q∩(0,1)
0 ≤ f ≤ 1,
and
f = 1, if u ∈ K.
Observe also that the support of f is contained in V 0
To complete the proof, it suffices to show that
f = g.
The inequality
fr (u) > gs (u)
is possible only if r > s, u ∈ Vr and u 6∈ V s .
But if r > s then Vr ⊂ Vs , and hence fr ≤ gs , ∀r, s ∈ Q ∩ (0, 1), so that f ≤ g. Suppose there exists u ∈ V
such that
f (u) < g(u).
Thus there exists rational numbers r, s such that
Theorem 9.1.10 [Partition of unity] Let U be a locally compact Hausdorff space. Assume K ⊂ U is compact
so that
K ⊂ ∪ni=1Vi ,
where Vi is open ∀i ∈ {1, ...n}. Under such hypotheses, there exists functions h1 , ..., hn such that
n
∑ hi = 1, on K,
i=1
Proof 9.8 Let u ∈ K ⊂ ∪ni=1Vi . Thus there exists j ∈ {1, ..., n} such that u ∈ V j . We may select an open set
Wu such that u ∈ Wu , W u is compact and W u ⊂ V j .
Observe that
K ⊂ ∪u∈K Wu .
From this, since K is compact, there exist u1 , ..., uN such that
K ⊂ ∪Nj=1Wu j .
For each i ∈ {1, ..., n} define by W̃i the union of those Wu j , contained in Vi .
By the Uryshon’s lemma we may find continuous functions gi such that
gi = 1, on W̃i ,
gi ∈ Cc (Vi ),
0 ≤ gi ≤ 1, ∀i ∈ {1, ..., n}.
Define,
h1 = g1
h2 = (1 − g1 )g2
h3 = (1 − g1 )(1 − g2 )g3
... .................
hn = (1 − g1 )(1 − g2 )...(1 − gn−1 )gn . (9.1)
Thus,
0 ≤ hi ≤ 1 and hi ∈ Cc (Vi ), ∀i ∈ {1, .., n}.
Furthermore, by induction, we may obtain
Finally, if u ∈ K then u ∈ W̃i for some i ∈ {1, .., n}, so that gi (u) = 1, and hence
The set {h1 , ..., hn } is said to be a partition of unity on K subordinate to the open cover {V1 , ...,Vn }.
The proof is complete.
4. µ(E) = sup{µ(K) | K ⊂ E, K compact} holds for all open E and all E ∈ M such that µ(E) < ∞.
5. If E ∈ M , A ⊂ E and µ(E) = 0 then A ∈ M .
Proof 9.9 We start by proving the uniqueness of µ. If µ satisfies (3) and (4), then µ is determined by
its values on compact sets. Then, if µ1 and µ2 are two measures for which the theorem holds, to prove
uniqueness, it suffices to show that
µ1 (K) = µ2 (K)
for every compact K ⊂ U. Let ε > 0. Fix a compact K ⊂ U. By (2) and (3), there exists an open V ⊃ K such
that
µ2 (V ) < µ2 (K) + ε.
By the Urysohn’s lema, there exists a f ∈ Cc (V ) such that
0 ≤ f (u) ≤ 1, ∀u ∈ V,
and
f (u) = 1, ∀u ∈ K.
Thus,
Z
µ1 (K) = χK dµ1
ZU
≤ f dµ1
U
= F( f )
Z
= f dµ2
ZU
≤ χV dµ2
U
= µ2 (V )
< µ2 (K) + ε. (9.2)
µ2 (K) ≤ µ1 (K),
so that
µ1 (K) = µ2 (K).
The proof of uniqueness is complete.
Now for every open V ⊂ U, define
Finally, define by M the collection of all sets such that E ⊂ U and E ∩ K ∈ MF for all compact K ⊂ U. Since
µ(A) ≤ µ(B),
if A ⊂ B we have that µ(E) = 0 implies E ∩ K ∈ MF for all K compact, so that E ∈ M . Thus, (5) holds and
so does (3) by definition.
Observe that if f ≥ 0 then F( f ) ≥ 0, that is if f ≤ g then F( f ) ≤ F(g).
Now we prove that if {En } ⊂ U is a sequence then
∞
µ (∪∞
n=1 En ) ≤ ∑ µ(En ). (9.3)
n=1
0 ≤ hi ≤ 1
and so that h1 + h2 = 1 on the support of g. Hence hi ∈ Cc (Vi ), 0 ≤ hi g ≤ 1, and g = (h1 + h2 )g and thus
Furthermore, if µ(En ) = ∞, for some n ∈ N, then (9.3) is obviously valid. Assume then µ(En ) < ∞, ∀n ∈ N.
Let a not relabeled ε > 0. Therefore for each n ∈ N there exists an open Vn ⊃ En such that
ε
µ(Vn ) < µ(En ) + .
2n
Define
V = ∪∞
n=1Vn ,
and choose f ∈ Cc (V ) such that 0 ≤ f ≤ 1. Since the support of f is compact, there exists N ∈ N such that
spt( f ) ⊂ ∪Nn=1Vn .
Therefore,
µ ∪Nn=1Vn
F( f ) ≤
N
≤ ∑ µ(Vn )
n=1
∞
≤ ∑ µ(En ) + ε. (9.4)
n=1
186 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
αg ≤ f .
Hence,
µ(K) ≤ µ(Vα )
= sup{F(g) | g ∈ Cc (Vα ), 0 ≤ g ≤ 1}
≤ α −1 F( f ). (9.6)
Letting α → 1 we obtain
µ(K) ≤ F( f ).
Thus, µ(K) < ∞, and obviously K ∈ MF .
Also there exists an open V ⊃ K such that
and hence MF contains every open set such that µ(V ) < ∞.
Let V ⊂ U be an open set such that µ(V ) < ∞.
Let α ∈ R be such that α < µ(V ). Therefore there exists f ∈ Cc (V ) such that 0 ≤ f ≤ 1 and such that
α < F( f ).
If W ⊂ U is an open set such that K = spt( f ) ⊂ W, we have that f ∈ Cc (W ) and 0 ≤ f ≤ 1 so that
F( f ) ≤ µ(W ).
From the Uryshon’s lemma there exists f ∈ Cc (U) such that f = 1 on K1 , f = 0 on K2 and,
0 ≤ f ≤ 1.
µ(E) ≥ µ(KN )
N
= ∑ µ(Hn )
n=1
N
≥ ∑ µ(En ) − ε, ∀N ∈ N. (9.11)
n=1
Let ε0 > 2ε. If µ(E) < ∞, there exists N0 ∈ N such that µ(KN0 ) > ∑∞
n=1 µ(En ) − ε0 .
From this and (9.12) we obtain
µ(E) ≤ µ(KN0 ) + ε0 .
Therefore, since ε > 0 and ε0 > 2ε are arbitrary we may conclude that E satisfies (4) so that E ∈ MF .
Now we prove the following.
If E ∈ MF there is a compact K ⊂ U and an open V ⊂ U such that K ⊂ E ⊂ V and
µ(V \ K) < ε.
For, from above, there exists a compact K and an open V such that
K ⊂E ⊂V
and
ε ε
µ(V ) − < µ(E) < µ(K) + .
2 2
Since V \ K is open and of finite measure, it is in MF . From the last chain of inequalities we obtain
so that
µ(V \ K) < ε.
In the next lines we prove that if A, B ∈ MF then
A \ B, A ∪ B and A ∩ B ∈ MF .
By above there exist compact sets K1 , K2 and open sets V1 ,V2 such that
K1 ⊂ A ⊂ V1 , K2 ⊂ B ⊂ V2
and
µ(Vi \ Ki ) < ε, ∀i ∈ {1, 2}.
Since
(A \ B) ⊂ (V1 \ K2 ) ⊂ (V1 \ K1 ) ∪ (K1 \V2 ) ∪ (V2 \ K2 ),
we get
µ(A \ B) < ε + µ(K1 \V2 ) + ε,
Since K1 \V2 ⊂ A \ B is compact and ε > 0 is arbitrary, we get,
A \ B ∈ MF .
Since
A ∪ B = (A \ B) ∪ B,
we obtain
A ∪ B ∈ MF .
Since
A ∩ B = A \ (A \ B)
Other Topics in Measure and Integration 189
we get
A ∩ B ∈ MF .
At this point we prove that M is a σ -algebra in U which contains all the Borel sets.
Let K ⊂ U be a compact subset. If A ∈ M then
Ac ∩ K = K \ (A ∩ K),
where An ∈ M , ∀n ∈ N.
Define B1 = A1 ∩ K and
Bn = (An ∩ K) \ (B1 ∪ B2 ∪ ... ∪ Bn−1 ),
∀n ≥ 2, n ∈ N.
Then {Bn } is disjoint sequence of sets in MF .
Thus,
n=1 Bn ∈ MF .
A ∩ K = ∪∞
Hence A ∈ M . Finally, if C ⊂ U is a closed subset, then C ∩ K is compact, so that C ∩ K ∈ MF . Hence
C ∈ M.
Therefore, M is a σ -algebra which contains the closed sets, so that it contains the Borel sets.
Finally, we will prove that
MF = {E ∈ M | µ(E) < ∞}.
For, if E ∈ MF then E ∩ K ∈ MF for all compact K ⊂ U, hence E ∈ M .
Conversely, assume E ∈ M and µ(E) < ∞. There is an open V ⊃ E such that µ(V ) < ∞. Pick a compact
K ⊂ V such that
µ(V \ K) < ε.
Since E ∩ K ∈ MF there is a compact K1 ⊂ (E ∩ K) such that
Since
E ⊂ (E ∩ K) ∪ (V \ K),
it follows that
µ(E) ≤ µ(E ∩ K) + µ(V \ K) < µ(K1 ) + 2ε.
This implies E ∈ MF .
To finish the proof, we show that
Z
F( f ) = f dµ, ∀ f ∈ Cc (U)
U
.
From linearity it suffices to prove the result for the case where f is real.
Let f ∈ Cc (U). Let K be the support of f and let [a, b] ⊂ R be such that
R( f ) ⊂ (a, b),
∪ni=1 Ei = K.
Observe that
ε
µ(Ei ) + > µ(Vi )
n
= sup{F( f ) | f ∈ Cc (Vi ), 0 ≤ f ≤ 1}
> F(hi ), ∀i ∈ {1, .., n}. (9.13)
Thus,
n
F( f ) = ∑ F(hi f )
i=1
n
≤ ∑ F(hi (yi−1 + 2ε))
i=1
n
= ∑ (yi−1 + 2ε)F(hi )
i=1
n ε
< ∑ (yi−1 + 2ε) µ (Ei ) +
i=1 n
n n n
ε
< ∑ y i −1 µ(Ei ) + ∑ (yi−1 )
n
+ 2ε ∑ µ(Ei ) + 2ε 2
i=1 i=1 i=1
Z
< f dµ + bε + 2ε µ(K) + 2ε 2 . (9.14)
U
Other Topics in Measure and Integration 191
From this Z
F(− f ) ≤ (− f ) dµ, ∀ f ∈ Cc (U),
U
that is, Z
F( f ) ≥ f dµ, ∀ f ∈ Cc (U).
U
Hence, Z
F( f ) = f dµ, ∀ f ∈ Cc (U).
U
The proof is complete.
Consider a Borel measure µ on Rn . We may associate to µ, the function Frµ (u), denoted by
µ(Br (u))
Frµ (u) = ,
m(Br (u))
where m denotes the Lebesgue measure.
We define the symmetric derivative of µ at u, by (Dµ)(u), by
Proof 9.10 Let us first order the balls Bri (ui ) so that
r1 ≥ r2 ≥ ... ≥ rN .
Let Bi2 be the first of the remaining balls, if any. Discard all B j such that j > i2 and Bi2 ∩ B j 6= 0/.
Let Bi3 the first of the remaining balls as long as possible. Such a process stops after a finite number of
steps. Define S = {i1 , i2 , ...}. It is clear that (1) holds. Now we prove that each discarded B j is contained in
{B3ri , i ∈ S}.
For, just observe that if r0 < r and Br0 (u0 ) intersects Br (u) then Br0 (u0 ) ⊂ B3r (u).
The proof is complete.
m(Aλ ) ≤ 3n λ −1 kµk,
where
Aλ = {u ∈ U | Gµ (u) > λ },
and
kµk = |µ|(Rn ).
Since K is compact, there exists a finite number of such balls which covers K. By Lemma 9.3.1, there
exists a disjoint sub-collection here denoted by {Br1 , ..., BrN } such that K ⊂ ∪Nk=1 B3rk , so that
N
m(K) ≤ 3n ∑ m(Brk )
k=1
N
≤ 3n λ −1 ∑ |µ|(Brk )
k=1
n −1
≤ 3 λ kµk. (9.15)
Remark 9.3.3 Observe that, if f ∈ L1 (Rn ) and λ > 0, for Aλ = {u ∈ Rn | | f | > λ }, we have
m(Aλ ) ≤ λ −1 k f k1 .
and
Aλ = {u ∈ U | G f (u) > λ },
we have
m(Aλ ) ≤ 3n λ −1 k f k1 .
Other Topics in Measure and Integration 193
1
Z
lim | f (v) − f (u)| dm(v) = 0,
r→0 m(Br (u)) Br (u)
also define
H f (u) = lim sup Hr f (u).
r→0
1
Z
Hrh (u) = |h − h(u)| dm
m(Br (u)) Br (u)
1
Z
≤ |h| dm + |h(u)|, (9.16)
m(Br (u)) Br (u)
so that
Hh < Gh + |h|.
Since
Hr f ≤ Hrg + Hrh ,
we obtain
H f ≤ Gh + |h|.
Define
Ay = {u ∈ Rn | H f (u) > 2y},
By,k = {u ∈ Rn | Gh (u) > y}
and
Cy,k = {u ∈ Rn | |h| > y}.
Observe that khk1 < 1/k, so that from remark 9.3.3 we obtain
3n
m(By,k ) ≤ ,
yk
and
1
m(Cy,k ) ≤ ,
yk
194 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and hence
3n + 1
m(By,k ∪Cy,k ) ≤ ,
yk
Therefore
3n + 1
m(Ay ) ≤ m(By,k ∪Cy,k ) ≤ .
yk
Since k is arbitrary, we get m(Ay ) = 0, ∀y > 0 so that m{u ∈ Rn | H f (u) > 0} = 0.
The proof is complete.
Theorem 9.3.6 Suppose µ is a complex Borel measure on Rn such that µ m. Suppose f is the Radon-
Nikodym derivative of µ with respect to m. Under such assumptions,
Dµ = f , a.e. [m],
and Z
µ(E) = Dµ dm,
E
for all Borel set E ⊂ Rn .
Distributions
Thus φ ∈ D(Ω) if and only if φ ∈ C∞ (Ω) and the support of φ is a compact subset of Ω.
and
2. σ̂ denotes the collection of all convex balanced sets W ∈ D(Ω) such that W ∩ DK ∈ σK for every
compact K ⊂ Ω.
3. We define σ in D(Ω) as the collection of all unions of sets of the form φ + W , for φ ∈ D(Ω) and
W ∈ σ̂ .
Proof 10.1
1. From item 3 of Definition 10.1.2, it is clear that arbitrary unions of elements of σ are elements of σ .
Let us now show that finite intersections of elements of σ also belongs to σ . Suppose V1 ∈ σ and
V2 ∈ σ , if V1 ∩ V2 = 0/ we are done. Thus, suppose φ ∈ V1 ∩ V2 . By the definition of σ there exist two
sets of indices L1 and L2 , such that
where, Wiλ ∈ σ̂ , ∀λ ∈ Li . Moreover, since φ ∈ V1 ∩ V2 there exist φi ∈ D(Ω) and Wi ∈ σ̂ such that
φ ∈ φi + Wi ⊂ Vi , for i = 1, 2. (10.5)
Thus there exists K ⊂ Ω such that φi ∈ DK for i ∈ {1, 2}. Since DK ∩ Wi ∈ σK , DK ∩ Wi is open in DK
so that from (10.5) there exists 0 < δi < 1 such that
φ − φi + δi Wi ⊂ (1 − δi )Wi + δi Wi = Wi (10.7)
so that
φ + Wφ ⊂ Vi , (10.9)
1 1
φ1 + V + φ2 + V = φ1 + φ2 + V . (10.12)
2 2
(c) To prove the continuity of scalar multiplication, first consider φ0 ∈ D(Ω) and α0 ∈ R. Then:
1
φ − φ0 ∈ cV = V, (10.14)
2(|α0 | + δ )
so that
1
(|α0 | + δ )(φ − φ0 ) ∈ V . (10.15)
2
This means
1 1
α(φ − φ0 ) + (α − α0 )φ0 ∈ V + V = V . (10.16)
2 2
Therefore, αφ − α0 φ0 ∈ V whenever |α − α0 | < δ and φ − φ0 ∈ cV .
For the next result the proof may be found in Rudin [67].
Proposition 10.1.4 A convex balanced set V ⊂ D(Ω) is open if and only if V ∈ σ .
Proposition 10.1.5 The topology σK of DK ⊂ D(Ω) coincides with the topology that DK inherits from
D(Ω).
V ∈ σ implies DK ∩ V ∈ σK . (10.17)
Now suppose V ∈ σK , we must show that there exists A ∈ σ such that V = A ∩ DK . The definition of σK
implies that for every φ ∈ V , there exist N ∈ N and δφ > 0 such that
{ϕ ∈ DK | kϕ − φ kN < δφ } ⊂ V . (10.18)
Define
Then Uφ ∈ σ̂ and
DK ∩ (φ + Uφ ) = φ + (DK ∩ Uφ ) ⊂ V . (10.20)
The proof for the next two results may also be found in Rudin [67].
Proposition 10.1.6 If A is a bounded set of D(Ω) then A ⊂ DK for some K ⊂ Ω, and there are MN < ∞ such
that kφ kN ≤ MN , ∀φ ∈ A, N ∈ N.
Proposition 10.1.7 If {φn } is a Cauchy sequence in D(Ω), then {φn } ⊂ DK for some K ⊂ Ω compact, and
Proposition 10.1.8 If φn → 0 in D(Ω), then there exists a compact K ⊂ Ω which contains the support of
φn , ∀n ∈ N and Dα φn → 0 uniformly, for each multi-index α.
198 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
2. T is bounded.
3. If φn → θ in D(Ω) then T (φn ) → θ as n → ∞.
4. The restrictions of T to each DK are continuous.
Proof 10.3
1 ⇒ 2. This follows from Proposition 2.7.3 .
2 ⇒ 3. Suppose T is bounded and φn → 0 in D(Ω), by last proposition φn → 0 in some DK so that
{φn } is bounded and {T (φn )} is also bounded. Hence by Proposition 2.7.3, T (φn ) → 0 in V .
3 ⇒ 4. Assume 3 holds and consider {φn } ⊂ DK . If φn → θ then, by Proposition 10.1.5, φn → θ in
D(Ω), so that, by above T (φn ) → θ in V . Since DK is metrizable, also by proposition 2.7.3 we have
that 4 follows.
4 ⇒ 1. Assume 4 holds and let V be a convex balanced neighborhood of zero in V . Define U =
T −1 (V ). Thus U is balanced and convex. By Proposition 10.1.5, U is open in D(Ω) if and only
if DK ∩ U is open in DK for each compact K ⊂ Ω, thus if the restrictions of T to each DK are
continuous at θ , then T is continuous at θ , hence 4 implies 1.
Definition 10.1.10 (Distribution) A linear functional in D(Ω) which is continuous with respect to σ is said
to be a Distribution.
Proposition 10.1.11 Every differential operator is a continuous mapping from D(Ω) into D(Ω).
Proof 10.4 Since kDα φ kN ≤ kφ k|α |+N , ∀N ∈ N, Dα is continuous on each DK , so that by Theorem 10.1.9,
Dα is continuous on D(Ω).
Theorem 10.1.12 Denoting by D 0 (Ω) the dual space of D(Ω) we have that T : D(Ω) → R ∈ D 0 (Ω) if and
only if for each compact set K ⊂ Ω there exists an N ∈ N and c ∈ R+ such that
|T (φ )| ≤ ckφ kN , ∀φ ∈ DK . (10.22)
Proof 10.5 The proof follows from the equivalence of 1 and 4 in Theorem 10.1.9.
Dα Tn → Dα T in D 0 (Ω). (10.26)
Proof 10.6 Let K be an arbitrary compact subset of Ω. Since (10.25) holds for every φ ∈ DK , the principle
of uniform boundedness implies that the restriction of T to DK is continuous. It follows from Theorem 10.1.9
that T is continuous in D(Ω), that is, T ∈ D 0 (Ω). On the other hand
(Dα T )(φ ) = (−1)|α| T (Dα φ ) = (−1)|α | lim Tn (Dα φ ) = lim (Dα Tn (φ )), ∀φ ∈ D(Ω). (10.27)
n→∞ n→∞
T : D(Ω) → R
given by Z
T (φ ) = f φ dx,
Ω
T (φ ) = φ (1/2) + φ 0 (1/3).
Thus,
|T (φ )| = |φ (1/2) + φ 0 (1/3)| ≤ kφ k∞ + kφ 0 k∞ ≤ 2kφ k1 ,
so that T is also a distribution (bounded and linear).
200 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Here we emphasize that the two main references for this chapter are Adams [2] and Evans [34]. We start with
the definition of Lebesgue spaces, denoted by L p (Ω), where 1 ≤ p ≤ ∞ and Ω ⊂ Rn is an open set. In this
chapter, integrals always refer to the Lebesgue measure.
Proof 11.1 The result is clear if p = 1 or p = ∞. You may assume kuk p , kvkq > 0, otherwise the result is
also obvious. Thus suppose 1 < p < ∞. From the concavity of log function on (0, ∞) we obtain
1 p 1 q 1 1
log a + b ≥ log a p + log bq = log(ab). (11.4)
p q p q
Thus,
1 p 1
ab ≤ (a ) + (bq ), ∀a ≥ 0, b ≥ 0. (11.5)
p q
Therefore,
1 1
|u(x)||v(x)| ≤ |u(x)| p + |v(x)|q , a.e. in Ω. (11.6)
p q
1 1
Z
|uv|dx ≤ kuk pp + kvkqq . (11.7)
Ω p q
λ p−1 1
Z
|uv|dx ≤ kuk pp + kvkqq . (11.8)
Ω p λq
q/ p
For λ = kuk−1
p kvkq ¨
we obtain the Holder inequality.
Proof 11.2 The only non-trivial property to be proved concerning the norm definition, is the triangle
inequality. If p = 1 or p = ∞ the result is clear. Thus, suppose 1 < p < ∞. For u, v ∈ L p (Ω) we have
¨
and hence, from Holder’s inequality
p−1
ku + vk pp ≤ ku + vk p−1
p kuk p + ku + vk p kvk p , (11.11)
that is,
Proof 11.3 Suppose p = ∞. Suppose {un } is Cauchy sequence in L∞ (Ω). Thus, given k ∈ N there exists
Nk ∈ N such that, if m, n ≥ Nk then
1
kum − un k∞ < . (11.13)
k
Therefore, for each k, there exist a set Ek such that m(Ek ) = 0, and
1
|um (x) − un (x)| < , ∀x ∈ Ω \ Ek , ∀m, n ≥ Nk . (11.14)
k
Observe that E = ∪∞k=1 Ek is such that m(E) = 0. Thus {un (x)} is a real Cauchy sequence at each x ∈ Ω \ E.
Define u(x) = lim un (x) on Ω \ E. Letting m → ∞ in (11.14) we obtain
n→∞
1
|u(x) − un (x)| < , ∀x ∈ Ω \ E, ∀n ≥ Nk . (11.15)
k
Thus, u ∈ L∞ (Ω) and kun − uk∞ → 0 as n → ∞.
Now suppose 1 ≤ p < ∞. Let {un } a Cauchy sequence in L p (Ω). We can extract a subsequence {unk }
such that
1
kunk+1 − unk k p ≤ , ∀k ∈ N. (11.16)
2k
To simplify the notation we write uk in place of unk , so that
1
kuk+1 − uk k p ≤ , ∀k ∈ N. (11.17)
2k
Defining
n
gn (x) = ∑ |uk+1 (x) − uk (x)|, (11.18)
k=1
we obtain
kgn k p ≤ 1, ∀n ∈ N. (11.19)
From the monotone convergence theorem and (11.19), gn (x) converges to a limit g(x) with g ∈ L p (Ω). On
the other hand, for m ≥ n ≥ 2 we have
|um (x) − un (x)| ≤ |um (x) − um−1 (x)| + ... + |un+1 (x) − un (x)| ≤ g(x) − gn−1 (x), a.e. in Ω. (11.20)
Hence, {un (x)} is Cauchy a.e. in Ω and converges to a limit u(x) so that
which means u ∈ L p (Ω). Finally, from |un (x) − u(x)| → 0, a.e. in Ω, |un (x) − u(x)| p ≤ |g(x)| p and the
Lebesgue dominated convergence theorem we get
kun − uk p → 0 as n → ∞. (11.22)
Theorem 11.1.6 Let {un } ⊂ L p (Ω) and u ∈ L p (Ω) such that kun − uk p → 0. Then there exists a subsequence
{unk } such that
1. unk (x) → u(x), a.e. in Ω,
204 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 11.4 The result is clear for p = ∞. Suppose 1 ≤ p < ∞. From the last theorem we can easily obtain
that |unk (x) − u(x)| → 0 as k → ∞, a.e. in Ω. To complete the proof, just take h = |u| + g, where g is defined
in the proof of the last theorem.
Theorem 11.1.7 L p (Ω) is reflexive for all p such that 1 < p < ∞.
u+v p u−v p
+
2 L p (Ω) 2 L p (Ω)
1
≤ kukLp p (Ω) + kvkLp p (Ω) , ∀u, v ∈ L p (Ω). (11.23)
2
and therefore
u+v
< 1−δ, (11.28)
2 p
for δ = 1 − (1 − (ε /2) p )1/p > 0. Thus, L p (Ω) is uniformly convex and from Theorem 4.7.2 it is
reflexive.
The Lebesgue and Sobolev Spaces 205
3. L p (Ω) is reflexive for 1 < p ≤ 2. Let 1 < p ≤ 2, from 2 we can conclude that Lq is reflexive. We will
define T : L p (Ω) → (Lq )∗ by
Z
hTu, f iLq (Ω) = u f dx, ∀u ∈ L p (Ω), f ∈ Lq (Ω). (11.29)
Ω
¨
From the Holder inequality, we obtain
so that
Pick u ∈ L p (Ω) and define f0 (x) = |u(x)| p−2 u(x) ( f0 (x) = 0 if u(x) = 0). Thus, we have that f0 ∈
Lq (Ω), k f0 kq = kuk p−1
p and hTu, f0 iLq (Ω) = kuk pp . Therefore,
Thus, T is an isometry from L p (Ω) to a closed subspace of (Lq (Ω))∗ . Since from the first part Lq (Ω)
is reflexive, we have that (Lq (Ω))∗ is reflexive. Hence T (L p (Ω)) and L p (Ω) are reflexive.
Theorem 11.1.8 (Riesz representation theorem) Let 1 < p < ∞ and let f be a continuous linear functional
on L p (Ω). Then there exists a unique u0 ∈ Lq such that
Z
f (v) = vu0 dx, ∀v ∈ L p (Ω). (11.34)
Ω
Furthermore
We have to show that T is onto. Define E = T (Lq (Ω)). As E is a closed subspace, it suffices to show that E
is dense in (L p (Ω))∗ . Suppose h ∈ (L p )∗∗ = L p is such that
Choosing u = |h| p−2 h we may conclude that h = 0 which, by Corollary 4.2.14 completes the first part of the
proof. The proof of uniqueness is left to the reader.
p
Definition 11.1.9 Let 1 ≤ p ≤ ∞. We say that u ∈ Lloc (Ω) if uχK ∈ L p (Ω) for all compact K ⊂ Ω.
206 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
6 0}.
spt(φ ) = {x ∈ Ω | φ (x) =
Cc∞ (Ω) denotes the set of functions in C∞ (Ω) with compact support contained in Ω.
The sets C0 (Ω) and C0∞ (Ω) consist of the closure of Cc (Ω) (which is the set of functions in C(Ω) with
compact support in Ω) and Cc∞ (Ω) respectively, relating the uniform convergence norm. On the other hand,
CBm (Ω) denotes the set of functions u ∈ Cm (Ω) for which Dα u is bounded on Ω for 0 ≤ |α| ≤ m. Observe
that CBm (Ω) is a Banach space with the norm denoted by k · kB,m given by
Also, we define Cm (Ω) as the set of functions u ∈ Cm (Ω) for which Dα u is bounded and uniformly continuous
on Ω for 0 ≤ |α| ≤ m. Observe that Cm (Ω) is a closed subspace of CBm (Ω) and is also a Banach space with
the norm inherited from CBm (Ω). An important space is the one of Holder
¨ continuous functions.
¨
Definition 11.1.10 (Spaces of Holder continuous functions) If 0 < λ < 1, for a nonnegative integer m we
define the space of Holder
¨ continuous functions denoted by Cm,λ (Ω), as the subspace of Cm (Ω) consisting
of those functions u for which, for all 0 ≤ |α| ≤ m, there exists a constant K such that
Cm,λ (Ω) is a Banach space with the norm denoted by k · km,λ given by
α
|D u(x) − Dα u(y)|
kukm,λ = kukB,m + max sup , x 6 = y .
0≤|α |≤m x,y∈Ω |x − y|λ
From now on we say that f : Ω → R is locally integrable, if it is Lebesgue integrable on any compact K ⊂ Ω.
p
Furthermore, we say that f ∈ Lloc (Ω) if f ∈ L p (K) for any compact K ⊂ Ω. Finally, given an open Ω ⊂ Rn ,
we denote W ⊂⊂ Ω whenever W is compact and W ⊂ Ω.
Theorem 11.1.11 The space C0 (Ω) is dense in L p (Ω), for 1 ≤ p < ∞.
Then f = 0 a.e. in Ω.
First suppose f ∈ L1 (Ω) and Ω bounded, so that m(Ω) < ∞. Given ε > 0, since C0 (Ω) is dense in L1 (Ω),
there exists f1 ∈ C0 (Ω) such that k f − f1 k1 < ε and thus, from (11.39) we obtain
Z
f1 u dx ≤ εkuk∞ , ∀u ∈ C0 (Ω). (11.40)
Ω
The Lebesgue and Sobolev Spaces 207
Defining
and
As K1 and K2 are disjoint compact sets, by the Urysohn Theorem there exists u0 ∈ C0 (Ω) such that
+1, if x ∈ K1 ,
u0 (x) = (11.43)
−1, if x ∈ K2
and
so that
Z Z Z
| f1 | dx = | f1 | dx + | f1 | dx ≤ ε + εm(Ω). (11.47)
Ω K Ω−K
Hence,
k f k1 ≤ k f − f1 k1 + k f1 k1 ≤ 2ε + εm(Ω). (11.48)
Proof 11.8 The result follows from last theorem and from the fact that C0 (K) is separable for each K ⊂ Ω
compact (from the Weierstrass theorem, polynomials with rational coefficients are dense C0 (K)). Observe
n=1 Ωn , Ωn defined as in (11.49), where Ω̄n is compact, ∀n ∈ N.
that Ω = ∪∞
208 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Definition 11.2.2 We define the norm k · km,p for W m,p (Ω), where m ∈ N and 1 ≤ p ≤ ∞, as
( )1/p
kukm,p = ∑ kD α
uk pp , if 1 ≤ p < ∞, (11.51)
0≤|α|≤m
and
Proof 11.9 Consider {un } a Cauchy sequence in W m,p (Ω). Then {Dα un } is a Cauchy sequence for each
0 ≤ |α| ≤ m. Since L p (Ω) is complete there exist functions u and uα , for 0 ≤ |α| ≤ m, in L p (Ω) such that
1 (Ω) and so u determines a distribution
un → u and Dα un → uα in L p (Ω) as n → ∞. From above L p (Ω) ⊂ Lloc n
Tun ∈ D (Ω). For any φ ∈ D(Ω) we have, by Holder’
0 ¨ s inequality
Z
|Tun (φ ) − Tu (φ )| ≤ |un (x) − u(x)||φ (x)|dx ≤ kφ kq kun − uk p . (11.53)
Ω
Hence Tun (φ ) → Tu (φ ) for every φ ∈ D(Ω) as n → ∞. Similarly TDα un (φ ) → Tuα (φ ) for every φ ∈ D(Ω).
We have that
for every φ ∈ D(Ω). Thus uα = Dα u in the sense of distributions, for 0 ≤ |α| ≤ m, and u ∈ W m,p (Ω). As
lim ku − un km,p = 0, W m,p (Ω) is complete.
n→∞
Remark 11.2.4 Observe that distributional and classical derivatives coincide when the latter exist and are
continuous. We define S ⊂ W m,p (Ω) by
Thus, the completion of S concerning the norm k · km,p is denoted by H m,p (Ω).
Proof 11.10 Since W m,p (Ω) is complete we have that H m,p (Ω) ⊂ W m,p (Ω).
Theorem 11.2.6 W m,p (Ω) is separable if 1 ≤ p < ∞, and is reflexive and uniformly convex if 1 < p < ∞.
Particularly, W m,2 (Ω) is a separable Hilbert space with the inner product
Proof 11.11 We can see W m,p (Ω) as a subspace of L p (Ω, RN ), where N = ∑0≤|α|≤m 1. From the relevant
properties for L p (Ω), we have that L p (Ω; RN ) is a reflexive and uniformly convex for 1 < p < ∞ and separable
for 1 ≤ p < ∞. Given u ∈ W m,p (Ω), we may associate the vector Pu ∈ L p (Ω; RN ) defined by
Pu = {Dα u}0≤|α |≤m . (11.57)
Since kPuk pN = kukm,p , we have that W m,p is closed subspace of L p (Ω; RN ). Thus from Theorem 1.21 in
Adams [1], we have that W m,p (Ω) is separable if 1 ≤ p < ∞ and, reflexive and uniformly convex, if 1 < p < ∞.
Lemma 11.2.7 Let 1 ≤ p < ∞ and define U = L p (Ω; RN ). For every continuous linear functional f on U,
there exists a unique v ∈ Lq (Ω; RN ) = U ∗ such that
N
f (u) = ∑ hui , vi i, ∀u ∈ U. (11.58)
i=1
Moreover,
k f kU ∗ = kvkqN , (11.59)
where k · kqN = k · kLq (Ω,RN ) .
¨
From Holder’s inequality we obtain
N
| f (u)| ≤ ∑ ku j k p kv j kq ≤ kuk pN kvkqN , (11.62)
j=1
and hence k f kU ∗ ≤ kvkqN . The equality in (11.62) is achieved for u ∈ L p (Ω, RN ), 1 < p < ∞ such that
|v j |q−2 v̄ j , if v j 6= 0
u j (x) = (11.63)
0, if v j = 0.
If p = 1 choose k such that kvk k∞ = max1≤ j≤N kv j k∞ . Given ε > 0, there is a measurable set A such that
m(A) > 0 and |vk (x)| ≥ kvk k∞ − ε, ∀x ∈ A. Defining u(x) as
v̄k /vk , if i = k, x ∈ A and vk (x) =
6 0
ui (x) = (11.64)
0, otherwise,
we have
Z
f (uk ) = hu, vk iL2 (Ω) = |vk |dx
A
≥ (k(vk k∞ − ε)kuk k1
= (kvk∞N − ε)kuk1N . (11.65)
Since ε is arbitrary, the proof is complete.
210 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 11.2.8 Let 1 ≤ p < ∞. Given a continuous linear functional f on W m,p (Ω), there exists v ∈
Lq (Ω, RN ) such that
f (u) = ∑ hDα u, vα iL2 (Ω) . (11.66)
0≤|α |≤m
Proof 11.13 Consider f a continuous linear operator on U = W m,p (Ω). By the Hahn Banach Theorem,
we can extend f to f˜, on L p (Ω; RN ), so that k f˜kqN = k f kU ∗ and by the last theorem, there exists {vα } ∈
Lq (Ω; RN ) such that
f˜(û) = ∑ hûα , vα iL2 (Ω) , ∀v ∈ L p (Ω; RN ). (11.67)
0≤|α|≤m
Finally, observe that, also from the Hahn-Banach theorem k f kU ∗ = k f˜kqN = kvkqN .
Definition 11.2.9 Let Ω ⊂ Rn be a domain. For m a positive integer and 1 ≤ p < ∞ we define W0m,p (Ω) as
the closure in k · km,p of Cc∞ (Ω), where we recall that Cc∞ (Ω) denotes the the set of C∞ (Ω) functions with
compact support contained in Ω. Finally, we also recall that the support of φ : Ω → R, denoted by spt(φ ), is
given by
spt(φ ) = {x ∈ Ω |φ (x) =
6 0}.
Theorem 11.3.2 (The Sobolev imbedding theorem) Let Ω be an open bounded set in Rn such that ∂ Ω is
Ĉ1 . Let j ≥ 0 and m ≥ 1 be integers and let 1 ≤ p < ∞.
1. Part I
(a) Case A If either mp > n or m = n and p = 1 then
Moreover,
and, in particular
and, in particular
2. Part II
If mp > n > (m − 1)p, then
Definition 11.4.1 Let Ω ⊂ Rn be an open bounded set. For each ε > 0 define
where | · |2 refers to the Euclidean norm in Rn , that is for x = (x1 , ..., xn ) ∈ Rn , we have
q
|x|2 = x12 + ... + xn2 .
fε = ηε ∗ f ,
that is,
Z
fε (x) = ηε (x − y) f (y) dy
Ω
Z
n
= (−1) ηε (y) f (x − y) dy. (11.78)
B(0,ε)
Theorem 11.4.4 (Properties of mollifiers) The mollifiers have the following properties:
1. fε ∈ C∞ (Ωε ),
2. fε → f a.e. as ε → 0,
3. If f ∈ C(Ω) then fε → f uniformly on compact subsets of Ω.
Proof 11.14
x + hei ∈ Ωε ,
The Lebesgue and Sobolev Spaces 213
V ⊂⊂ W ⊂⊂ Ω,
and note that f is uniformly continuous on W Thus the limit indicated in (11.81) holds uniformly on
V , and therefore fε → f uniformly on V .
Theorem 11.4.5 Let Ω ⊂ RN be an open bounded set. Let u : Ω → R be such that u ∈ L p (Ω), where 1 ≤
p < ∞. Then
ηε ∗ ũ ∈ L p (Ω),
kηε ∗ ũk p,Ω ≤ kuk p,Ω , ∀ε > 0
and
lim kηε ∗ ũ − uk p,Ω = 0.
ε→0+
Suppose given ρ > 0. As Cc (Ω) is dense in L p (Ω), there exists φ ∈ Cc (Ω) such that
ku − φ k p < ρ/3.
kηε ∗ ũ − uk p = kηε ∗ ũ − ηε ∗ φ + ηε ∗ φ − φ + φ − uk p
≤ kηε ∗ ũ − ηε ∗ φ k p + kηε ∗ φ − φ k p + kφ − uk p
≤ ρ/3 + ρ/3 + ρ/3 = ρ. (11.85)
2.
Z
|u(x)| p dx < ε p . (11.87)
Ω−G
Proof 11.16 Suppose K is relatively compact in L p (Ω). Suppose given ε > 0. As K is compact we may
find a finite ε/6-net for K. Denote such a ε/6-net by N where
Since Cc (Ω) is dense in L p (Ω), for each k ∈ {1, ..., m} there exists φk ∈ Cc (Ω) such that
ε
kφk − vk k p < .
6
Thus defining
S = {φ1 , ..., φm },
given u ∈ K, we may select vk ∈ N such that
ε
ku − vk k p < ,
6
so that
where
spt(φk ) = {x ∈ Rn | φk (x) =
6 0}.
We have that
G ⊂⊂ Ω,
216 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Since u ∈ K is arbitrary, (11.87) is proven. Since φk is continuous and spt(φk ) is compact we have that φk is
uniformly continuous, that is, for the ε given above, there exists δ̃ > 0 such that if |h| < min{δ˜ , 1} then
ε
|φk (x + h) − φk (x)| < , ∀x ∈ G,
3(|G| + 1)
Thus, Z ε p
|φk (x + h) − φk (x)| p dx < .
Ω 3
Also observe that since
ε
ku − φk k p < ,
3
we have that
ε
kTh u − Th φk k p < ,
3
where Th u = u(x + h). Thus if |h| < δ = min{δ̃ , 1}, we obtain
φk → u, in L p (Rn ).
The Lebesgue and Sobolev Spaces 217
Observe that
ηρ ∗ φk → ηρ ∗ u, in L p (Rn ),
as k → ∞. Also
Th φk → Th u, in L p (Rn ),
as k → ∞. Thus,
kTh φk − φk k p → kTh u − uk p ,
in particular ( )
lim sup sup {kTh φk − φk k ≤ sup kTh u − uk p .
k→∞ h∈Bρ (θ ) h∈Bρ (θ )
Therefore, as
kηρ ∗ φk − φk k p → kηρ ∗ u − uk p ,
as k → ∞, from (11.92) we get
kηρ ∗ u − uk p → 0, uniformly in K as ρ → 0.
where K1 = K2 K3 ,
!1/p
K2 = sup ηρ0 (y) ,
y∈Rn
uε = ηε ∗ u in Ωε .
Then,
1. uε ∈ C∞ (Ωε ), ∀ε > 0,
m,p
2. uε → u in Wloc (Ω), as ε → 0,
Proof 11.17 Assertion 1 has been already proved. Let us prove 2. We will show that if |α| ≤ m, then
Dα uε = ηε ∗ Dα u, in Ωε .
Therefore, Z Z
Dαy (ηε (x − y)) u(y) dy = (−1)|α| ηε (x − y)Dαy u(y) dy,
Ω Ω
and hence,
Z
Dα uε (x) = (−1)|α|+|α | ηε (x − y)Dα u(y) dy
Ω
= (ηε ∗ Dα u)(x). (11.96)
Now choose any open bounded set such that V ⊂⊂ Ω. We have that
Dα uε → Dα u, in L p (V ) as ε → 0,
as ε → 0.
Theorem 11.4.8 Let Ω ⊂ Rn be a bounded open set and suppose u ∈ W m,p (Ω) for some 1 ≤ p < ∞. Then
there exists a sequence {uk } ⊂ C∞ (Ω) such that
uk → u in W m,p (Ω).
where
Ωi = {x ∈ Ω | dist(x, ∂ Ω) > 1/i}.
Define
Vi = Ωi+3 − Ω̄i+1 ,
and choose any open set V0 such that V0 ⊂⊂ Ω, so that
Ω = ∪∞
i=0Vi .
Let {ζi }∞
i=0 be a smooth partition of unit subordinate to the open sets {Vi }i=0 . That is,
∞
0 ≤ ζi ≤ 1, ζi ∈ Cc∞ (Vi )
∑∞i=0 ζi = 1, on Ω,
Now suppose u ∈ W m,p (Ω). Thus ζi u ∈ W m,p (Ω) and spt(ζi u) ⊂ Vi ⊂ Ω. Choose δ > 0. For each i ∈ N
choose εi > 0 small enough so that
ui = ηεi ∗ (ζi u)
satisfies
δ
kui − ζi ukm,p,Ω ≤ ,
2i+1
220 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Thus such a function belongs to C∞ (Ω), since for each open V ⊂⊂ Ω there are at most finitely many non-zero
terms in the sum. Since
∞
u = ∑ ζi u,
i=0
we have that for a fixed V ⊂⊂ Ω,
∞
kv − ukm,p,V ≤ ∑ kui − ζi ukm,p,V
i=0
∞
1
≤ δ∑ = δ. (11.97)
i=0 2i+1
kv − ukm,p,Ω < δ .
The next result is also relevant. For a proof see Evans, [34] page 232.
Theorem 11.4.9 Let Ω ⊂ Rn be a bounded set such that ∂ Ω is C1 . Suppose u ∈ W m,p (Ω) where 1 ≤ p < ∞.
Thus there exists a sequence {un } ⊂ C∞ (Ω) such that
un → u in W m,p (Ω), as n → ∞.
kuk − ukm,p,Ω → 0, as k → ∞.
Proof 11.19 Fix x0 ∈ ∂ Ω. Since ∂ Ω is Ĉ1 , denoting x̂ = (x1 , ..., xn−1 ) for a local coordinate system, there
exists r > 0 and a function f (x1 , ..., xn−1 ) = f (x̂) such that
and
vn ∈ C∞ (W ).
Hence,
kvn − ukm,p,W ≤ kvn − uδ km,p,W + kuδ − ukm,p,W < ε/2 + ε/2 = ε.
Clarifying the dependence of r on x0 ∈ ∂ Ω we denote r = rx0 . Observe that
∂ Ω ⊂ ∪M
i=1 Bri (xi ).
We denote Bri (xi ) = Bi and Wi = Ω∩Bi , ∀i ∈ {1, ..., M}. We also choose an appropriate open set B0 ⊂⊂ Ω
such that
Ω ⊂ ∪M i=0 Bi .
Let {ζi }M M
i=0 be a concerned partition of unity relating {Bi }i=0 .
Thus ζi ∈ Cc∞ (Bi ) and 0 ≤ ζi ≤ 1, ∀i ∈ {0, ..., M} and also
M
∑ ζi = 1 on Ω.
i=0
From above, we may find vi ∈ C∞ (W i ) such that kvi − ukm,p,Wi < ε, ∀i ∈ {1, ..., M}. Define u0 = v0 =
u on B0 ≡ W0 ,
ui = ζi u, ∀i ∈ {0, ...., M}
and
M
v = ∑ ζi vi .
i=0
We emphasize
v ∈ C∞ (Ω).
Therefore
M
kv − ukm,p,Ω = ∑ (ζi u − ζi vi )
i=0 m,p,Ω
M
≤ C2 ∑ ku − vi km, p,(Ω∩Bi )
i=0
M
= C2 ∑ ku − vi km,p,Wi
i=0
< C2 Mε. (11.98)
11.4.3 Extensions
In this section we study extensions of Sobolev spaces from a domain Ω ⊂ Rn to Rn . First we enunciate a
result found in Evans, [34].
Theorem 11.4.11 Assume Ω ⊂ Rn is an open bounded set, and that ∂ Ω is C1 . Let 1 ≤ p < ∞ and let V be a
bounded open set such that Ω ⊂⊂ V . Then there exists a bounded linear operator
Proof 11.20 Let u ∈ W 1,p (Ω). Fix N ∈ N and select φN ∈ C∞ (Ω) such that
for some r > 0 and so that denoting x̂ = (x1 , ..., xn−1 ), f (x1 , ..., xn−1 ) = f (x̂) is a Lipschitz continuous function
such that
∂ f (x̂) n−1
∂ xk k=1
is classically defined almost everywhere on its domain and
ψ −1 (B+
r1 (y0 )) ⊂ Ω ∩ Br (x0 ),
+ −
where y0 = (x01 , ..., x0n−1 , 0). We define W + = ψ −1 (Br1 (y0 )) and W − = ψ −1 (Br1 (y0 )) where we denote
B+ = B+
r1 (y0 ) = {y ∈ Br1 (y0 ) | yn ≥ 0},
and
B− = B−
r1 (y0 ) = {y ∈ Br1 (y0 ) | yn < 0}.
We emphasize that locally about x0 we have that ∂ Ω and ψ(∂ Ω) correspond to the the equations xn −
f (x1 , ..., xn−1 ) = 0 and yn = 0 respectively.
At this point we are going to show that {φ N } is a Cauchy sequence in W 1,p (B+ ).
Observe that, denoting W = W + ∪W − , we have
Also, since ψ 0 (x) is classically defined and |ψ 0 (x)| < C2 , a.e. in Ω, for an appropriate C2 > 0 we have
that ψ 0 ∈ L∞ (W , Rn×n ). Similarly,
det(ψ 0 ) ∈ L∞ (W ),
det[(ψ −1 )0 ] ∈ L∞ (B),
∂ xk (y)
∈ L∞ (B), ∀ j, k ∈ {1, · · · , n},
∂yj
and
∂ y j (x)
∈ L∞ (W ), ∀ j, k ∈ {1, · · · , n}.
∂ xk
We denote by Ŵ ⊂ W the set on which det(ψ 0 ) and
∂ y j (x)
, ∀ j, k ∈ {1, · · · , n},
∂ xk
are classically defined.
Similarly, we denote by B̂ ⊂ B the set on which det((ψ −1 )0 ) and
∂ xk (y))
, ∀ j, k ∈ {1, · · · , n},
∂yj
φ N → ũ a.e. in B+ .
However, up to to a not relabeled subsequence of this last subsequence, from (11.99) we obtain,
φN → u, a.e. in Ω, (11.102)
so that
ũ(y) = u(x(y)), a.e. in B+ .
Moreover, φ N is Lipschitz continuous on B+ so that φ ∈ W 1,p (B+ ), and therefore there exists φ̃N ∈
+
C∞ (B ) such that
kφ̃N − φ N k1,p,B+ < 1/N.
Define φ̂N : B → R by
+
φ̃N (y) if y ∈ B
φ̂N (y) =
−3φ̃N (y1 , ..., yn−1 , −yn ) + 4φ̃N (y1 , ..., yn−1 , −yn /2) if y ∈ B− .
It may be easily verified that φ̂N ∈ C1 (B). Also, there exists C2 > 0 such that
Hence {φ̂N } is a Cauchy sequence and thus there exists û ∈ W 1,p (B) such that
In particular, up to a not relabeled subsequence of indices {N} of the sequence indicated in (11.102), we
have,
φ̂N → û, a.e. in B,
Observe that,
as N → ∞. From this, since φ N → u(x(y)), in W 1, p (B+ ), we obtain for an appropriate not relabeled subse-
quence
φ̂N |B+ → u(x(y)), in W 1,p (B+ ),
so that
û(y) = u(x(y)), a.e. in B+ ,
and thus,
û(y(x)) = u(x), a.e. in W + .
Denoting, u(x) = u(y(x)) in W = W + ∪W − , we obtain
u = u, a.e. in W + .
ku − φ̂N (y(x))k1,p,W → 0, as N → ∞.
Observe that,
Z
|uN (x)) − φ̂N (y(x))| p dx
W
Z
≤ |u(x(y)) − φ̂N (y)| p | det((ψ −1 )0 (y))| dy
B̂
Z
≤ K1 |û(y) − φ̂N (y)| p dy
B
p
≤ K1 kû − φ̂N k1,p,B → 0, as N → ∞. (11.106)
Therefore,
ku(x) − φ̂ (y(x))k1,p,W → 0, as N → ∞.
226 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Now choose ε > 0. Thus there exists N0 ∈ N such that if N > N0 we have
kuk1,p,W ≤ C5 kuk1,p,W + .
Now denoting W = Wx0 we have that ∂ Ω ⊂ ∪x0 ∈∂ ΩWx0 and since ∂ Ω is compact, there exist x1 , ..., xM ∈
∂ Ω, such that
∂ Ω ⊂ ∪M i=1Wxi .
Ω ⊂ ∪M
i=0Wi .
M
∑ ζi = 1, in Ω,
i=0
where C = (M + 1)C5 .
We recall that the partition of unity may be chosen so that its support is on V .
Finally, we denote Eu = u.
The proof is complete.
The Lebesgue and Sobolev Spaces 227
Theorem 11.4.14 (Gagliardo-Nirenberg-Sobolev inequality) Let 1 ≤ p < n. Thus there exists a constant
K > 0 depending only p and n such that
Proof 11.21 Suppose p = 1. Let u ∈ Cc1 (Rn ). From the fundamental theorem of calculus we have,
Z xi
∂ u(x1 , ..., xi−1 , yi , xi+1 , ..., xn )
u(x) = dyi ,
−∞ ∂ xi
so that Z ∞
|u(x)| ≤ |Du(x1 , ..., xi−1 , yi , xi+1 , ..., xn )| dyi .
−∞
Therefore,
n Z ∞
1/(n−1)
n/(n−1)
|u(x)| ≤∏ |Du(x1 , ..., xi−1 , yi , xi+1 , ..., xn )| dyi .
i=1 −∞
¨
From this and the generalized Holder inequality, we obtain,
Z ∞
|u(x)|n/(n−1) dx1
−∞
Z ∞
1/(n−1)
≤ |Du| dy1
−∞
n Z ∞ Z ∞ 1/(n−1)
×∏ |Du| dx1 dyi . (11.111)
i=2 −∞ −∞
Integrating in x2 we obtain,
Z ∞Z ∞
|u(x)|n/(n−1) dx1 dx2
−∞ −∞
Z ∞ Z ∞
1/(n−1)
≤ |Du| dy1
−∞ −∞
n Z ∞ Z ∞ 1/(n−1) !
×∏ |Du| dx1 d yi dx2 ,
i=2 −∞ −∞
228 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
Z ∞Z ∞
|u(x)|n/(n−1) dx1 dx2
−∞ −∞
Z ∞ Z ∞ 1/(n−1)
≤ |Du| dy2 dx1
−∞ −∞
Z ∞ Z ∞
1/(n−1)
× |Du| dy1
−∞ −∞
n Z ∞ Z ∞ 1/(n−1) !
×∏ |Du| dx1 dyi dx2 . (11.112)
i=3 −∞ −∞
¨
By applying the generalized Holder inequality we get
Z ∞Z ∞
|u(x)|n/(n−1) dx1 dx2
−∞ −∞
Z ∞ Z ∞ 1/(n−1)
≤ |Du| dy2 dx1
−∞ −∞
Z ∞ Z ∞ 1/(n−1)
× |Du| dy1 dx2
−∞ −∞
n Z ∞ Z ∞Z ∞ 1/(n−1)
×∏ |Du| dx1 dx2 dyi . (11.113)
i=3 −∞ −∞ −∞
p(n−1)
that is γ = n−p , so that
γn (γ − 1) p np
= = ,
n−1 p−1 n− p
we get
Z ((n−1)/n−(p−1)/p) Z 1/p
r p
|u| dx ≤C |Du| dx .
Rn Rn
Theorem 11.4.15 Let Ω ⊂ Rn be a bounded open set. Suppose ∂ Ω is Ĉ1 , 1 ≤ p < n and u ∈ W 1,p (Ω).
Then u ∈ Lr (Ω) and
kukr,Ω ≤ Kkuk1,p,Ω ,
where the constant depends only on p, n and Ω.
Proof 11.22 Since ∂ Ω is Ĉ1 , from Theorem 11.4.12, there exists an extension Eu = ū ∈ W 1,p (Rn ) such
that ū = u in Ω the support of ū is compact and
kūk1,p,Rn ≤ Ckuk1,p,Ω ,
where C does not depend on u. As ū has compact support, from Theorem 11.4.10, there exists a sequence
{uk } ∈ Cc∞ (Rn ) such that
uk → ū in W 1,p (Rn ).
from the last theorem
kuk − ul kr,Rn ≤ KkDuk − Dul k p,Rn .
Hence,
uk → ū in Lr (Rn ).
also from the last theorem
kuk kr,Rn ≤ KkDuk k p,Rn , ∀k ∈ N,
so that
kūkr,Rn ≤ KkDūk p,Rn .
Therefore, we may get
kukr,Ω ≤ kūkr,Rn
≤ KkDūk p,Rn
≤ K1 kūk1,p,Rn
≤ K2 kuk1,p,Ω . (11.116)
Theorem 11.4.16 Let Ω ⊂ Rn be a bounded open set such that ∂ Ω ∈ Ĉ1 . If mp < n, then W m,p (Ω) ,→ Lq (Ω)
for p ≤ q ≤ (np)/(n − mp).
230 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where
r1 = np/(n − (m − 1)p) = np/(n − mp + p),
whenever n > (m − 1)p. If u ∈ W m,p (Ω) where n > mp, then u and D j u are in W m−1,p (Ω), so that from
(11.117) we have u ∈ W 1,r1 (Ω) and
Since n > mp we have that r1 = np/((n − mp) + p) < n, from q0 = nr1 /(n − r1 ) = np/(n − mp) by the last
theorem we have
kukq0 ,Ω ≤ K2 kuk1,r1 ,Ω ,
where the constant K2 does not depend on u, and therefore from this and (11.118) we obtain
¨
Through Holder’s inequality, we get
Z
kukqq,Ω = |u(x)|s |u(x)|q−s dx
Ω
Z 1/t Z 1/t 0
0
≤ |u(x)|st dx |u(x)|(q−s)t dx
Ω Ω
p/t q /t 0
= kuk p,Ω kukq00 ,Ω
p/t 0 q /t 0
≤ kuk p,Ω (K3 )q0 /t kukm,p,Ω
0
0 p/t q /t 0
≤ (K3 )q0 /t kukm,p,Ω kukm,p,Ω
0
0
= (K3 )q0 /t kukqm,p,Ω , (11.120)
since
p/t + q0 /t 0 = q.
This completes the proof.
Proof 11.24 If q ≥ p0 = p/(p − 1) then q = ns/(n − ms) where s = pq/(p + q) is such that 1 ≤ s ≤ p.
Observe that
W m,p (Ω) ,→ W m,s (Ω)
with the imbedding constant depending only on |Ω|. Since ms < n, by the last theorem we obtain
0
Now if p ≤ q ≤ p0 , from above we have W m,p (Ω) ,→ L p (Ω) and the obvious imbedding W m, p (Ω) ,→ L p (Ω).
Define s = (p0 − q)p/(p0 − p) and the result follows from a reasoning analogous to the final chain of inequal-
ities of last theorem, indicated in (11.120).
About the next theorem, note that its hypotheses are satisfied if ∂ Ω is Ĉ1 (here we do not give the details).
Theorem 11.4.18 Let Ω ⊂ Rn be an open bounded set, such that for each x ∈ Ω there exists a convex set
Cx ⊂ Ω whose shape depends on x, but such that |Cx | > α, for some α > 0 that does not depend on x. Thus
if mp > n, then
W m,p (Ω) ,→ CB0 (Ω).
Proof 11.25 Suppose first m = 1 so that p > n. Fix x ∈ Ω and pick y ∈ Cx . For φ ∈ C∞ (Ω), from the
fundamental theorem of calculus, we have
Z 1
d (φ (x + t(y − x))
φ (y) − φ (x) = dt.
0 dt
Thus, Z 1
d (φ (x + t(y − x))
|φ (x)| ≤ |φ (y)| + dt,
0 dt
and hence Z 1
d(φ (x + t(y − x))
Z Z Z
|φ (x)| dy ≤ |φ (y)| d y + dt dy,
Cx Cx Cx 0 dt
¨
so that, from Holder’s inequality and Fubini theorem we get,
|φ (x)|α ≤ |φ (x)| · |Cx |
0
≤ kφ k p,Ω |Cx |1/ p
Z 1Z
d(φ (x + t(y − x)) (11.121)
+ dy dt.
0 Cx dt
Therefore Z 1Z
1/p0
|φ (x)|α ≤ kφ k p,Ω |Ω| + |∇φ (z)|δt −n dz dt,
0 V
where |V | = t n |Cx | and δ denotes the diameter of Ω. From Holder’
¨ s inequality again, we obtain
Z 1 Z 1/p
1/p0 0
|φ (x)|α ≤ kφ k p,Ω |Ω| +δ |∇φ (z )| dz p
t −n (t n |Cx |)1/p dt,
0 V
and thus Z 1
0 0 0
|φ (x)|α ≤ kφ k p,Ω |Ω|1/p + δ |Cx |1/p k∇φ k p,Ω t −n(1−1/p ) dt.
0
Since p > n we obtain
Z 1 Z 1
0 1
t −n(1−1/p ) dt = t −n/p dt = .
0 0 1 − n/p
From this, the last inequality and from the fact that |Cx | ≤ |Ω|, we have that there exists K > 0 such that
Here the constant K depends only on p, n and Ω. Consider now u ∈ W 1,p (Ω).
232 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
φk → u, in W 1,p (Ω).
φk → u, a.e. in Ω. (11.123)
Fix x ∈ Ω such that the limit indicate in (11.123) holds. Suppose given ε > 0. Therefore, there exists k0 ∈ N
such that
|φk0 (x) − u(x)| ≤ ε/2
and
kφk0 − uk1,p,Ω < ε/(2K).
Thus,
Since ε > 0 is arbitrary, the proof for m = 1 is complete, because for {φk } ∈ C∞ (Ω) such that φk → u
in W 1,p (Ω), from (11.122) we have that {φk } is a uniformly Cauchy sequence, so that it converges to a
continuous u∗ , where u∗ = u, a.e. in Ω.
For m > 1 but p > n we still have
r̂ = np/(n − j p),
Now define
r̂1 = p1 = np/(n − ( j − 1)p)
and 1 ≤ p2 ≤ n such that
r̂1 = np2 /(n − p2 ),
so that
np/(n − ( j − 1)p) = np2 /(n − p2 ).
Hence p2 = np/(n − ( j − 2)p) so that by the last theorem
Theorem 11.4.19 Let Ω ⊂ Rn be an open and bounded set with a boundary Ĉ1 . If mp > n, then W m, p (Ω) ,→
Lq (Ω) for p ≤ q ≤ ∞.
Proof 11.26 From the proof of the last theorem, we may obtain
kuk∞,Ω ≤ Kkukm,p,Ω , ∀u ∈ W m,p (Ω).
If p ≤ q < ∞, we have
Z
kukqq,Ω = |u(x)| p |u(x)|q−p dx
Ω
Z q− p
≤ |u(x)| p Kkukm,p,Ω dx
Ω
≤ K q−p
kuk pp,Ω kukq−p
m,p,Ω
p
≤ K q−p kukm,p,Ω kukq−p
m,p,Ω
= K q−p kukqm,p,Ω . (11.125)
The proof is complete.
234 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 11.4.20 Let S ⊂ Rn be a n-dimensional ball of radius bigger than 3. If n < p, then there exists a
constant C, depending only on p and n, such that
kukC0,λ (S) ≤ Ckuk1,p,S , ∀u ∈ C1 (S),
Proof 11.27 First consider λ = 1 − n/p and u ∈ C1 (S). Let x, y ∈ S such that |x − y| < 1 and define
σ = |x − y|. Consider a fixed cube denoted by Rσ ⊂ S such that |Rσ | = σ n and x, y ∈ R̄σ . For z ∈ Rσ , we may
write: Z 1
du(x + t(z − x))
u(x) − u(z) = − dt,
0 dt
that is, Z Z Z 1
u(x)σ n = u(z) dz − ∇u(x + t(z − x)) · (z − x) dt dz.
Rσ Rσ 0
Thus, denoting in the next lines V by an appropriate set such that |V | = t n |Rσ |, we obtain
Z
√ Z Z 1
|u(x) − u(z) dz/σ n | ≤ nσ 1−n |∇u(x + t(z − x))| dt dz
Rσ Rσ 0
√ Z 1 Z
≤ nσ 1−n t −n |∇u(z)| d z dt
0 V
√ Z 1
0
≤ nσ 1−n t −n k∇uk p,S |V |1/p dt
0
√ 1−n n/p0 Z 1
0
≤ nσ σ k∇uk p,S t −nt n/ p dt
0
√ Z 1
≤ nσ 1−n/p k∇uk p,S t −n/p dt
0
1−n/p
≤σ kuk1,p,S K, (11.126)
where
√ Z 1 −n/p √
K= n t dt = n/(1 − n/p).
0
A similar inequality holds with y in place of x, so that
|u(x) − u(y)| ≤ 2K|x − y|1−n/p kuk1,p,S , ∀x, y ∈ Rσ .
Now consider 0 < λ < 1 − n/p. Observe that, as |x − y|λ ≥ |x − y|1−n/p if |x − y| < 1, we have,
|u(x) − u(y)|
sup |x=
6 y, |x − y| < 1
x,y∈S |x − y|λ
|u(x) − u(y)|
≤ sup | x 6
= y, |x − y| < 1 ≤ Kkuk1,p,S . (11.127)
x,y∈S |x − y|1−n/ p
Also,
|u(x) − u(y)|
sup | |x − y| ≥ 1 ≤ 2kuk∞,S ≤ 2K1 kuk1,p,S
x,y∈S |x − y|λ
so that
|u(x) − u(y)|
sup 6 y ≤ (K + 2K1 )kuk1,p,S , ∀u ∈ C1 (S).
|x=
x,y∈S |x − y|λ
The proof is complete.
The Lebesgue and Sobolev Spaces 235
Theorem 11.4.21 Let Ω ⊂ Rn be an open bounded set such that ∂ Ω is Ĉ1 Assume n < p ≤ ∞.
Then
W 1,p (Ω) ,→ C0,λ (Ω),
for all 0 < λ ≤ 1 − n/p.
Proof 11.28 Fix 0 < λ ≤ 1 − n/p and let u ∈ W 1,p (Ω). Since ∂ Ω is Ĉ1 , from Theorem 11.4.12, there exists
an extension Eu = ū such that ū = u, a.e. in Ω, and
kūk1,p,Rn ≤ Kkuk1,p,Ω ,
where the constant K does not depend on u. From the proof of this same theorem, we may assume that spt(ū)
is on a n-dimensional sphere S ⊃ Ω with sufficiently big radius and such sphere does not depend on u. Thus,
in fact, we have
kūk1,p,S ≤ Kkuk1,p,Ω .
Since C∞ (S) is dense in W 1,p (S), there exists a sequence {φk } ⊂ C∞ (S) such that
uk → ū, a.e. in Ω.
so that {uk } is a cauchy sequence in C0,λ (S), and thus uk → u∗ for some u∗ ∈ C0,λ (S). Hence, from this and
(11.128), we have
u∗ = ū, a.e. in S.
Finally, from above and last theorem we may write:
kTuk p,∂ Ω ≤ Ckuk1,p,Ω , ∀u ∈ W 1,p (Ω),
where the constant C depends only on p and Ω.
236 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where denoting x̂ = (x1 , ..., xn−1 ), f (x̂) is continuous and such that its partial derivatives are classically de-
fined a.e. and bounded on its domain. Furthermore
Ψ−1 (B+
r1 (y0 )) ⊂ Ω ∩ Br (x0 ).
Observe that this is possible since Ψ and Ψ−1 are continuous, where y = Ψ(x). Here
B+
r1 (y0 ) = {y ∈ Br1 (y0 ) | yn > 0}.
+
For each N ∈ N, choose, by mollification for example, φN ∈ C∞ (Br1 (y0 )) such that
1
kφN − ûk∞,B+ (y ) < .
r1 0 N
Also denote
Γ̃ = {y ∈ B | yn = 0},
and
Γ̃1 = {y ∈ Br1 (y0 ) | yn = 0}.
Observe that
Z Z
|φN | p dΓ ≤ η |φN | p dΓ
Γ̃ Γ̃1
Z
= − (η|φN | p )yn dy
B+
r1
Z
≤ − (ηyn |φN | p ) dy
B+
r1
Z
+ (p|φN | p−1 |(φN )yn |η) d y. (11.129)
B+
r1
a p bq 1 1
ab ≤ + , ∀a, b ≥ 0, where + = 1.
p q p q
The Lebesgue and Sobolev Spaces 237
Thus,
|(φN )yn | p η p |φN |(p−1)q
(|φN | p−1 |)(|(φN )yn |η) ≤ + ,
p q
so that replacing such an inequality in (11.129), since (p − 1)q = p we get
Z Z Z
!
p p p
|φN | dΓ ≤ C1 |φN | dy + |DφN | dy . (11.130)
Γ̃ B+
r1 B+
r1
Letting N → +∞ we obtain
Z Z
|u(x)| p dΓ ≤ C2 |û(y)| p dΓ
Γ Γ̃
Z Z
!
≤ C3 |û| p dy + |Dû| p dy
B+
r1 Br+1
Z Z
≤ C4 |u| p dx + |Du| p d x . (11.131)
W+ W+
kuk − uk1,p,Ω → 0, as k → ∞.
From above
kTuk − Tul k p,∂ Ω ≤ Ckuk − ul k1,p,Ω ,
so that
{Tuk }
is a Cauchy sequence. Hence we may define
Remark 11.5.2 Similar results are valid for W0m,p , however in this case the traces relative to derivatives of
order up to m − 1 are involved.
238 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 11.30 The Imbeddings (11.133) and (11.134) follows from the inequalites
and also,
α
D φ (x) − Dα φ (y)|
sup | | |x − y| ≥ 1 ≤ 2 sup{|Dα φ |}. (11.137)
x,y∈Ω |x − y|ν x∈Ω
Now suppose Ω is bounded. If A is a bounded set in C0,λ (Ω) then there exists M > 0 such that
kφ kC0,λ (Ω) ≤ M, ∀φ ∈ A.
But then
|φ (x) − φ (y)| ≤ M|x − y|λ , ∀x, y ∈ Ω, φ ∈ A,
so that by the Ascoli-Arzela theorem, A is pre-compact in C(Ω). This proves the compactness of (11.134)
for m = 0.
If m ≥ 1 and A is bounded in Cm,λ (Ω), then A is bounded in C0,λ (Ω). Thus, by above there is a sequence
{φk } ⊂ A and φ ∈ C0,λ (Ω) such that
φk → φ in C(Ω).
However, {Di φk } is also bounded in C0,λ (Ω), so that there exist a not relabeled subsequence, also denoted
by {φk } and ψi such that
Di φk → ψi , in C(Ω).
The Lebesgue and Sobolev Spaces 239
The convergence in C(Ω̄) being the uniform one, we have ψi = Di φ . We can proceed extracting (not rela-
beled) subsequences until obtaining
Dα φk → Dα φ , in C(Ω), ∀ 0 ≤ |α| ≤ m.
This completes the proof of compactness of (11.134). For (11.135), let S be a bounded set in Cm,λ (Ω).
Observe that
|D φ (x) − Dα φ (y)| ν /λ
α
|Dα φ (x) − Dα φ (y)|
=
|x − y|ν |x − y|λ
·|Dα φ (x) − Dα φ (y)|1−ν /λ
≤ K|Dα φ (x) − Dα φ (y)|1−ν /λ , (11.138)
for all φ ∈ S. From (11.134), S has a converging subsequence in Cm (Ω). From (11.138) such a subsequence
is also converging in Cm,ν (Ω). The proof is complete.
Theorem 11.6.2 (Rellich-Kondrachov) Let Ω ⊂ Rn be an open bounded set such that ∂ Ω is Ĉ1 . Let j, m
be integers, j ≥ 0, m ≥ 1, and let 1 ≤ p < ∞.
1. Part I- If mp < n, then the following imbeddings are compact:
4. Part IV- All the above imbeddings are compact if we replace W j+m,p (Ω) by W0j+m,p (Ω).
Remark 11.6.3 Given X,Y, Z spaces, for which we have the imbeddings X ,→ Y and Y ,→ Z and if one of
these imbeddings is compact then the composite imbedding X ,→ Z is compact. Since the extension oper-
ator u → ũ where ũ(x) = u(x) if x ∈ Ω and ũ(x) = 0 if x ∈ Rn − Ω, defines an imbedding W0j+m,p (Ω) ,→
W j+m,p (Rn ) we have that Part-IV of above theorem follows from the application of Parts I-III to Rn (despite
the fact we are assuming Ω bounded, the general results may be found in Adams [2]).
240 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Remark 11.6.4 To prove the compactness of any of above imbeddings it is sufficient to consider the case
j = 0. Suppose, for example, that the first imbedding has been proved for j = 0. For j ≥ 1 and {ui } bounded
sequence in W j+m,p (Ω) we have that {Dα ui } is bounded in W m,p (Ω) for each α such that |α| ≤ j. From the
case j = 0 it is possible to extract a subsequence (similarly to a diagonal process) {uik } for which {Dα uik }
converges in Lq (Ω) for each α such that |α| ≤ j, so that {uik } converges in W j,q (Ω).
Proof of Parts II and III. If mp > n > (m − 1)p and 0 < λ < (m − n)/p, then there exists µ such that
λ < µ < m − (n/p). Since Ω is bounded, the imbedding C0,µ (Ω) ,→ C0,λ (Ω) is compact by Theorem 11.6.1.
Since by the Sobolev Imbedding Theorem we have W m,p (Ω) ,→ C0,µ (Ω), we have that imbedding (11.144)
is compact.
If mp > n, let j∗ be the non-negative integer satisfying (m − j∗ )p > n ≥ (m − j∗ − 1)p. Thus we have the
chain of imbeddings
∗ ,p
W m,p (Ω) ,→ W m− j (Ω) ,→ C0,µ (Ω) ,→ C(Ω), (11.146)
where 0 < µ < m − j∗ − (n/p). The last imbedding in (11.146) is compact by Theorem 11.6.1, so that
(11.143) is compact for j = 0. By analogy (11.141) is compact for j = 0. Therefore from the above remarks,
(11.142) is also compact. For the proof of Part I, we need the following lemma:
Lemma 11.6.6 Let Ω be an bounded domain in Rn . Let 1 ≤ q1 ≤ q0 and suppose
is compact.
Proof 11.31 Define λ = q1 (q0 − q)/(q(q0 − q1 )) and µ = q0 (q − q1 )/(q(q0 − q1 )). We have that λ > 0
and µ ≥ 0. From Holder’
¨ s inequality and (11.147) there exists K ∈ R+ such that,
λ µ λ µ
kuk0,q,Ω ≤ kuk0,q1 ,Ω
kuk0,q0 ,Ω ≤ Kkuk0,q 1 ,Ω
kukm,p,Ω ,
∀u ∈ W m,p (Ω). (11.150)
Thus considering a sequence {ui } bounded in W m,p (Ω), since (11.148) is compact there exists a subsequence
{unk } that converges, and is therefore a Cauchy sequence in Lq1 (Ω). From (11.150), {unk } is also a Cauchy
sequence in Lq (Ω), so that (11.149) is compact.
Suppose A is a bounded set of functions in W m,p (Ω), that is suppose there exists K1 > 0 such that
Also, suppose given ε > 0, and define, for u ∈ W m,p (Ω), ũ(x) = u(x) if x ∈ Ω, ũ(x) = 0, if x ∈ Rn \ Ω. Fix
u ∈ A. From Holder’
¨ s inequality and considering that W m,p (Ω) → Lq0 (Ω), we have
Z Z 1/q0 Z 1−1/q0
q0
|u(x)|dx ≤ |u(x)| dx 1dx
Ω−Ωk Ω−Ωk Ω−Ωk
Thus, since A is bounded in W m,p (Ω), there exists K0 ∈ N such that if k ≥ K0 then
Z
|u(x)|dx < ε, ∀u ∈ A (11.154)
Ω−Ωk
Observe that if |h| < 1/k, then x + th ∈ Ω2k provided x ∈ Ωk and 0 ≤ t ≤ 1. If u ∈ C∞ (Ω) we have that
Z 1
d u(x + th)
Z Z
|u(x + h) − u(x)| ≤ dx dt
Ωk Ωk 0 dt
Z 1 Z
≤ |h| dt |∇u(y)|dy ≤ |h|kuk1,1,Ω
0 Ω2k
≤ K2 |h|kukm,p,Ω . (11.156)
Since C∞ (Ω) is dense in W m,p (Ω), from above for |h| sufficiently small
Z
|ũ(x + h) − ũ(x)|dx < 3ε, ∀u ∈ A. (11.157)
Ω
From Theorem 11.4.6, A is relatively compact in L1 (Ω) and therefore the imbedding indicated (11.151) is
compact for q = 1. This completes the proof.
Before introducing the exercises, we work out some examples.
We start start with the following J.L. Lions Lemma.
Lemma 11.1
Let U,V,W be Banach spaces. Assume U ,→ V , where such an imbbeding is compact, and V ,→ W .
Under such assumptions, for each ε > 0 there exists C(ε) > 0 such that
Proof 11.32 Suppose, to obtain contradiction, there exists ε0 > 0 such that for each n ∈ N we may find
un ∈ U such that
kun kV > ε0 kun kU + nkun kW . (11.158)
242 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
kukV ≤ K1 kukU , ∀u ∈ U,
6 0, ∀n ∈ N, we also obtain,
and since from (11.158), kun kV > 0 so that un =
un
≤ K1 , ∀n ∈ N. (11.159)
kun kU V
un un
K1 ≥ > ε0 + n , ∀n ∈ N. (11.160)
kun kU V kun kU W
unk
− u0 → 0, as k → ∞.
kunk kU V
unk
− u0 < ε = ku0 kW /2,
kunk kU W
so that
unk
≥ ku0 kW /2, if k > k0 .
kunk kU W
From this and (11.160), we get,
u nk
K1 > ε0 + n k
kunk kU W
ku0 kW
≥ ε0 + n k , if k > k0 . (11.162)
2
6 0.
This contradicts u0 =
The proof is complete.
The Lebesgue and Sobolev Spaces 243
Example 11.6.7 With such a result in mind, we work out the following example.
Let Ω ⊂ Rn be an open bounded set with a Ĉ1 class boundary.
Let 1 < p < n where n ≥ 2.
Let
np
1≤q< .
n− p
In such a case the imbbeding
W 1,p (Ω) ,→ Lq (Ω)
is compact, and we have also the obvious imbbeding,
Lq (Ω) ,→ L1 (Ω).
Let ε > 0. From the last lemma with U = W 1,p (Ω), V = Lq (Ω) and W = L1 (Ω), there exists C(ε) > 0
such that
Exercise 11.6.8 Let Ω ⊂ Rn be an open bounded set with a Ĉ1 class boundary.
Let p > n.
Prove that for each ε > 0 there exists C(ε) > 0 such that
Hint: From the Rellich-Kondrachov theorem, the imbbeding W 2,p (Ω) ,→ C1 (Ω) is compact.
and in particular,
Z
Fu (ϕ) = F̃u (ϕ) = f j ϕ dx, ∀ϕ ∈ Cc1 (Ω),
Ω
so that
Z Z
∂ϕ
u dx = Fu (ϕ) = f ϕ dx, ∀ϕ ∈ Cc1 (Ω).
Ω ∂xj Ω
so that
u ∈ W 1,p (Ω).
Example 11.6.10 In this example, let Ω ⊂ Rn be an open bounded set. Let 1 < p < ∞.
Assume u ∈ W 1,p (Ω).
Let f : R → R be such that
| f (x) − f (y)| ≤ K|x − y|, ∀x, y ∈ R.
We are going to show that
( f ◦ u) ∈ W 1,p (Ω).
First observe that, from the hypotheses,
so that
| f (u(x))| ≤ K|u(x)| + | f (0)|, a.e. in Ω.
Since ( f ◦ u) is measurable, we may infer that
( f ◦ u) ∈ L p (Ω).
Observe that
ϕ(x + he j ) − ϕ(x) ∂ ϕ(x)
f (u(x)) → f (u(x)) , as h → 0, a.e. in Ω,
h ∂xj
ϕ(x + he j ) − ϕ(x)
f (u(x)) ≤ | f (u(x))|C,
h
where
∂ ϕ(x)
C = max .
x∈Ω ∂xj
Recalling that ( f ◦ u) ∈ LP (Ω) ⊂ L1 (Ω), from the Lebesgue dominated convergence theorem, we obtain,
ϕ(x + he j ) − ϕ(x)
Z
lim ( f (u(x)) dx
h→0 Ω h
Z
∂ϕ
= ( f (u(x)) dx. (11.164)
Ω ∂xj
The Lebesgue and Sobolev Spaces 245
ϕ(x + he j ) − ϕ (x)
Z
f (u(x)) dx
Ω h
f (u(x − he j )) − f (u(x))
Z
= ϕ(x) dx
Ω h
| f (u(x − he j )) − f (u(x))|
Z
≤ |ϕ (x)| dx
Ω |h|
K|u(x − he j ) − u(x)|
Z
≤ |ϕ(x)| dx
Ω |h|
u(x − he j ) − u(x)
≤ K kϕ k p0
h p
≤ KC1 kϕk p0 . (11.165)
( f ◦ u) ∈ W 1,p (Ω).
ai j ξi ξ j ≥ c|ξ |2 , ∀ξ ∈ Rn .
Let f ∈ L2 (Ω) and consider the problem of minimizing J : U = W01,2 (Ω) → R, where,
1 ∂u ∂u
Z Z
J(u) = ai j dx − f u dx,
2 Ω ∂ xi ∂ x j Ω
∀u ∈ U.
Observe that J is strictly convex, coercive and continuous, so that it is lower semi-continuous.
Hence, from the direct method of calculus of variations, there exists a unique u0 ∈ W01,2 (Ω) such that
we obtain,
∂ u0 ∂ ϕ
Z Z
ai j dx − f ϕ dx = 0, (11.166)
Ω ∂ xi ∂ x j Ω
∀ϕ ∈ U.
246 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
η ≡ 1, on V ,
spt(η) ⊂ W,
and
0 ≤ η(x) ≤ 1, ∀x ∈ Rn .
Select k ∈ {1, · · · , n} and define
ϕ(x) = −D−h 2 h
k [η Dk u0 (x)],
where generically,
u(x + hek ) − u(x)
Dhk u(x) = , ∀h =
6 0.
|h|
Observe that, for |h| > 0 sufficiently small, ϕ ∈ U = W01,2 (Ω), so that from (11.166), we obtain,
∂ u0 ∂ D−h 2 h
k [η Dk u0 (x)]
Z
− ai j dx
Ω ∂ xi ∂xj
Z
+ f Dk−h [η 2 Dkh u0 (x)] dx = 0, (11.167)
Ω
Define,
∂ u0 ∂ D−h 2 h
k [η Dk u0 (x)]
Z
A=− ai j dx,
Ω ∂ xi ∂xj
and Z
B= f D−h 2 h
k [η Dk u0 (x)] dx.
Ω
Observe that
∂ u0 −h ∂ η 2 h
Z
A = − ai j
Dk Dk u0 (x) dx
Ω ∂ xi ∂xj
∂ u0 −h 2 h ∂ u0 (x)
Z
− ai j Dk η Dk dx
Ω ∂ xi ∂xj
∂ u0 ∂ η 2 h
Z
h
= Dk [ai j ] D u0 (x) dx
Ω ∂ xi ∂ x j k
∂ η2 h
h ∂ u0
Z
+ ai j Dk D u0 (x) dx
Ω ∂ xi ∂xj k
∂ u0 2 h ∂ u0 (x)
Z
h
+ Dk [ai j ] η Dk dx
Ω ∂ xi ∂xj
∂ u0 ∂ u0 (x)
Z
+ ai j Dhk η 2 Dhk dx (11.168)
Ω ∂ xi ∂xj
The Lebesgue and Sobolev Spaces 247
where
∂ η2
K1 = ,
∂ xk ∞
and
K2 = kηk∞ .
From (11.167) and (11.168) we get
∂ u0 (x)
∂ u0
Z
ai j Dhk η 2 Dhk dx
Ω ∂ xi ∂xj
∂ u0 ∂ η 2 h
Z
= − Dhk [ai j ] Dk u0 (x) dx
Ω ∂ xi ∂ x j
2
h ∂ u0
Z
∂η h
− ai j Dk D u0 (x) dx
Ω ∂ xi ∂xj k
∂ u0 2 h ∂ u0 (x)
Z
h
− Dk [ai j ] η Dk dx
Ω ∂ xi ∂xj
−B, (11.170)
where
∂η
K3 = max kDhk ai j k∞ max η
i, j∈{1,··· ,n} j∈{1,··· ,n} ∂xj ∞
∂ ai j ∂η
≤ max max η , (11.172)
i, j∈{1,··· ,n} ∂ xk ∞ j∈{1,··· ,n} ∂xj ∞
∂η
K4 = max kai j k∞ max η ,
i, j∈{1,··· ,n} j∈n ∂xj ∞
K6 = k f k2 K1 ku0 k1,2 ,
and
K7 = K2 k f k2 .
We claim
n
∂ u0 (x)
lim sup ∑ η Dhk ≤ K8 ,
h→0 j=1 ∂xj 2
for some K8 > 0.
Suppose, to obtain contradiction, there is a sequence {hm } such that hm → 0+ and
n
hm ∂ u0 (x)
∑ ηDk → +∞, as m → ∞.
j=1 ∂xj 2
∂ u0
DhK ≤ K8 , ∀ j, k ∈ {1, · · · , n},
∂xj V
for some appropriate K8 > 0, so that
∂ 2 u0
∈ L2 (V ), ∀ j, k ∈ {1, ..., n},
∂ x j ∂ xk
that is,
u0 ∈ W 2,2 (V ), ∀V ⊂⊂ Ω,
so that
u0 ∈ Wl2,2
oc (Ω).
The Lebesgue and Sobolev Spaces 249
and
∂ u0 ∂ ϕ
Z Z
ai j dx = f ϕ d x, (11.175)
Ω ∂ xi ∂ x j Ω
∀ϕ ∈ U = W01,2 (Ω).
Define V = Br/2 (x0 ) ∩ Rn+ .
Choose η ∈ C1 (Rn ) such that
η ≡ 1, on V
and
spt(η) ⊂⊂ Br (x0 ).
Select k ∈ {1, · · · , n − 1}, and define for 0 < |h| < r/4
ϕ(x) = −D−h 2 h
k [η Dk u0 (x)].
Observe that ϕ ∈ W01,2 (Ω) so that with the same development presented in the last section we may obtain
∂ u0 (x)
ηDhk ≤ K8 ,
∂xj 2,Ω
∂ 2 u0
≤ K8 , ∀ j ∈ {1, · · · , n}, k ∈ {1, · · · , n − 1}.
∂ xk ∂ x j 2,V
Hence
∂ 2 u0
≤ K9 ,
∂ xn2 2,V
so that
u0 ∈ W 2,2 (V ).
Our more general result is summarized by the next theorem.
250 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
x = ψ −1 (y) = (y1 , · · · , yn−1 , f (y1 , · · · , yn−1 ) − yn ),
we have,
ãi j ηi η j ≥ cx0 |η|2 , ∀η ∈ Rn ,
where
∂ yk (x(y)) ∂ yl (x(y))
ãi j (y) = akl (x(y)) | det(ψ −1 )0 (y)|.
∂ xi ∂xj
Let u0 ∈ U = W01,2 (Ω) be such that
J(u0 ) = min J(u),
u∈U
where, once more, J : U → R is given by
1 ∂ u0 ∂ u0
Z Z
J(u) = ai j dx − f u dx,
2 Ω ∂ xi ∂ x j Ω
W = ψ −1 (B+
r1 ) ⊂ V ⊂ Ω,
However,
∂ u0 (x) ∂ ϕ(x)
Z
ai j dx
W ∂ xi ∂xj
∂ u0 (x(y)) ∂ yk (x(y)) ∂ ϕ(x(y)) ∂ yl (x(y))
Z
= ai j (x(y)) | det[(ψ −1 )0 (y)]| dy
+
Br1 ∂ yk ∂ xi ∂ yl ∂xj
∂ ũ0 (y) ∂ ϕ̃(y)
Z
= ãi j (y) dy. (11.177)
B+
r1 ∂ yi ∂yj
The Lebesgue and Sobolev Spaces 251
Hence,
∂ ũ0 (y) ∂ ϕ̃(y)
Z Z
ãi j (y) dy = f˜(y)ϕ̃(y) dy,
B+
r1 ∂ yi ∂yj B+
r1
where,
f˜(y) = f (x(y))| det[(ψ −1 )0 (y)| ∈ L2 (B+
r1 ).
∂ 2 u˜ 0 ∂ u˜0
+K2 ∑
i, j,k ∂ yi ∂ y j 2,B+ ∂ yk 2,B+
r1 /2 r /2
1
n 2
∂ ũ0
+K3 ∑ , (11.178)
j=1 ∂yj 2,B+
r /2
1
so that
∂ 2 u0
∈ L2 (W̃ ).
∂ xi ∂ x j
At this point we remark that x0 ∈ W, and clarifying the dependence of W relating x0 , we also denote
W = Wx0 .
Observe that from the compactness of ∂ Ω, we may find x1 , · · · , xM ∈ ∂ Ω such that
∪M
j=1Wx j ⊃ ∂ Ω,
so that we may also find W0 ⊂⊂ Ω such that
[∪M
j=1Wx j ] ∪W0 ⊃ Ω,
and in such a case
W = [∪M ˜
j=1Wx j ] ∪W0 .
From the previous results
ku0 k2,2,W̃x ≤ K j , ∀ j ∈ {1, · · · , M},
j
and
ku0 k2,2,W0 ≤ K0 ,
for appropriate real constants K0 > 0, K j > 0, ∀ j ∈ {1, · · · M}.
Joining the pieces, we obtain
u0 ∈ W 2,2 (Ω).
The proof is complete.
SECTION II
CALCULUS OF VARIATIONS,
CONVEX ANALYSIS AND
RESTRICTED OPTIMIZATION
Chapter 12
Definition 12.1.1 Let V be a vectorial space. A norm in V is a function denoted by k·kV : V → R+ = [0, +∞),
for which the following properties hold:
and
kukV = 0, if, and only if u = 0.
2. Triangular inequality, that is
Definition 12.1.2 (Convergent sequence) Let V be a normed space and let {un } ⊂ V be a sequence. We say
that {un } converges to u0 ∈ V , if for each ε > 0, there exists n0 ∈ N such that if n > n0 , then
kun − u0 kV < ε.
Definition 12.1.3 (Cauchy sequence in norm) Let V be a normed space and let {un } ⊂ V be a sequence.
We say that {un } is a Cauchy one, as for each ε > 0, there exists n0 ∈ N such that if m, n > n0 , then
kun − um kV < ε.
kun − u0 kV → 0, as n → ∞.
Solution:
1. Clearly
k f kV ≥ 0, ∀ f ∈ V
and
k f kV = 0 if, and only if f (x) = 0, ∀x ∈ [a, b],
that is if, and only if f = 0.
2. Let f , g ∈ V .
Thus,
Proof 12.1 The proof that C([a, b]) is a vector space is left as an exercise.
From the last exercise, k · kV is a norm for V .
Let { fn } ⊂ V be a Cauchy sequence.
We shall prove that there exists f ∈ V such that
k fn − f kV → 0, as n → ∞.
Let ε > 0.
Thus, there exists n0 ∈ N such that if m, n > n0 , then
k fn − fm kV < ε.
Hence
max{| fn (x) − fm (x)| : x ∈ [a, b]} < ε,
that is,
| fn (x) − fm (x)| < ε, ∀x ∈ [a, b], m, n > n0 . (12.3)
Let x ∈ [a, b].
From (12.3), { fn (x)} is a real Cauchy sequence, therefore it is convergent.
So, define
f (x) = lim fn (x), ∀x ∈ [a, b].
n→∞
| f (y) − f (x)| = | f (y) − fn2 (y) + fn2 (y) − fn2 (x) + fn2 (x) − f (x)|
≤ | f (y) − fn2 (y)| + | fn2 (y) − fn2 (x)| + | fn2 (x) − f (x)|
ε ε ε
< + +
3 3 3
= ε. (12.4)
Exercise 12.1.8 Let V = C1 ([a, b]) be the space of functions f : [a, b] → R which the the first derivative is
continuous on [a, b].
Define the function (in fact a functional) k · kV : V → R+ by
k fn − fm kV < ε/2.
Therefore,
| fn (x) − fm (x)| + | fn0 (x) − fm0 (x)| < ε/2, ∀x ∈ [a, b], m, n > n0 . (12.5)
Let x ∈ [a, b]. hence, { fn (x)} and { fn0 (x)} are real Cauchy sequences, and therefore, they are convergent.
Denote
f (x) = lim fn (x)
n→∞
and
g(x) = lim fn0 (x).
n→∞
| fn (x) − f (x)| + | fn0 (x) − g(x)| = lim | fn (x) − fm (x)| + | fn0 (x) − fm0 (x)|
m→∞
≤ ε/2, ∀x ∈ [a, b], n > n0 . (12.6)
Similarly to the last example, we may obtain that f and g are continuous, therefore uniformly continuous
on the compact set [a, b].
Thus, there exists δ > 0 such that if x, y ∈ [a, b] and |y − x| < δ , then
Hence, if 0 < |h| < δ , then from (12.6) and (12.7) we have
where from mean value theorem, t ∈ (0, 1) (it depends on h). Therefore, letting n1 → ∞, we get
f (x + h) − f (x)
f 0 (x) = lim = g(x), ∀x ∈ (a, b).
h→0 h
The cases in which x = a or x = b may be dealt similarly with one-sided limits.
From this and (12.6), we have
k fn − f kV → 0, as n → ∞
and
f ∈ C1 ([a, b]).
The solution is complete.
Definition 12.1.9 (Functional) Let V be a Banach space. A functional F defined on V is a function whose
the co-domain is R (F : V → R).
Observe that if y00 (x) = c in [a, b], we have (12.10) satisfied, since in such a case,
Z b
J(y0 + v) − J(y0 ) ≥ 2 y00 (x)v0 (x) dx
a
Z b
= 2c v0 (x) dx
a
= 2c[v(x)]ab
= 2c(v(b) − v(a))
= 0. (12.12)
J(y0 + v) ≥ J(y0 ), ∀v ∈ Va .
y0 (x) = cx + d,
for some d ∈ R.
However, from
y(a) = 0, we get ca + d = 0.
From y0 (b) = 1, we have cb + d = 1.
Solving this last system in c and d we obtain,
1
c= ,
b−a
and
−a
d= .
b−a
From this, we have,
x−a
y0 (x) = .
b−a
Observe that the graph of y0 corresponds to the straight line connecting the points (a, 0) and (b, 1).
12.2 ˆ
The Gateaux variation
Definition 12.2.1 Let V be a Banach space and let J : D ⊂ V → R be a functional. Let y ∈ D and v ∈ Va
We define the Gateaux
ˆ variation of J at y in the direction v, denoted by δ J(y; v), by
∂ J(y + εv)
δ J(y; v) = |ε=0
∂ε
ρ (x)y0 (x)v0 (x)
Z b
= p dx. (12.16)
a 1 + y0 (x)2
Example 12.2.4 Let V = C1 ([a, b]) and f ∈ C1 ([a, b] × R × R). Thus f is a function of three variables,
namely, f (x, y, z).
Consider the functional F : V → R, defined by
Z b
F(y) = f (x, y(x), y0 (x)) dx.
a
Let y, v ∈ V . Thus,
∂
δ F(y; v) = F(y + εv)|ε=0 .
∂ε
Observe that Z b
F(y + εv) = f (x, y(x) + εv(x), y0 (x) + εv0 (x)) dx,
a
and therefore
b
Z
∂ ∂ 0 0
F(y + εv) = f (x, y(x) + εv(x), y (x) + εv (x)) dx
∂ε ∂ε a
Z b
∂
f (x, y(x) + εv(x), y0 (x) + εv0 (x)) dx
=
a ∂ε
∂ f (x, y(x) + εv(x), y0 (x) + εv0 (x))
Z b
= v(x)
a ∂y
∂ f (x, y(x) + εv(x), y0 (x) + εv0 (x)) 0
+ v (x) d x. (12.17)
∂z
Thus
∂ F(y + εv)
δ F(y; v) = |ε=0
∂ε
∂ f (x, y(x), y0 (x)) ∂ f (x, y(x), y0 (x)) 0
Z b
= v(x) + v (x) dx. (12.18)
a ∂y ∂z
hx, yiRn = x1 y1 + · · · + xn yn ,
∀x = (x1 , · · · , xn ), y = (y1 , · · · , yn ) ∈ Rn .
Thus,
f (x + λ (y − x)) − f (x)
≤ f (y) − f (x), ∀λ ∈ (0, 1).
λ
Therefore,
f (x + λ (y − x)) − f (x)
h f 0 (x), y − xiRn = lim
λ →0+ λ
≤ f (y) − f (x), ∀x, y ∈ Rn . (12.19)
that is,
Summarizing,
f (x) = sup {hx, x∗ iRn − f ∗ (x∗ )}, ∀x ∈ Rn .
x∗ ∈Rn
Definition 12.3.4 (Convex functional) Let V be a Banach space and let J : D ⊂ V → R be a functional. We
say that J is convex if
J(y + v) − J(y) ≥ δ J(y; v), ∀v ∈ Va (y),
where
Va (y) = {v ∈ V : y + v ∈ D}.
Theorem 12.3.5 Let V be a Banach space and let J : D ⊂ U be a convex functional. Thus, if y0 ∈ D is such
that
δ J(y0 ; v) = 0, ∀v ∈ Va (y0 ),
then
J(y0 ) ≤ J(y), ∀y ∈ D,
that is, y0 minimizes J on D.
v ∈ Va (y0 ).
that is,
J(y0 ) ≤ J(y), ∀y ∈ D.
The proof is complete.
Example 12.3.6 Let us see this example of convex functional. Let V = C1 ([a, b]) and let J : D ⊂ V → R be
defined by
Z b
J (y) = (y0 (x))2 dx,
a
Basic Topics on the Calculus of Variations 263
where
D = {y ∈ V : y(a) = 1 and y(b) = 5}.
We shall show that J is convex.
Indeed, let y ∈ D and v ∈ Va where
Thus,
Z b Z b
J (y + v) − J (y) = (y0 (x) + v0 (x))2 dx − y0 (x)2 dx
a a
Z b Z b
0 0
= 2y (x)v (x) dx + v0 (x)2 dx
a a
Z b
≥ 2y0 (x)v0 (x) dx
a
= δ J(y; v). (12.24)
Therefore, J is convex.
Suppose f is convex in (y, z) for all x ∈ [a, b], which we denote by f (x, y, z) to be convex.
From the last section, we have that
where we denote
∇ f (x, y, y0 ) = ( fy (x, y, y0 ), fz (x, y, y0 )).
Therefore,
Z b
F(y + v) − F(y) = [ f (x, y + v, y0 + v0 ) − f (x, y, y0 )] dx
a
Z b
≥ [ fy (x, y, y0 )v + fz (x, y, y0 )v0 ] dx
a
= δ J(y; v). (12.26)
Thus, F is convex.
264 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 12.4.2 Let V = C1 ([a, b]). Let f ∈ C2 ([a, b] × R × R) where f (x, y, z) is convex. Define
where a1 , b1 ∈ R.
Define also F : D → R by
Z b
F(y) = f (x, y(x), y0 (x)) dx.
a
Under such hypotheses, F is convex and if y0 ∈ D is such that
d
[ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b],
dx
then y0 minimizes F on D, that is,
F(y0 ) ≤ F(y), ∀y ∈ D.
Proof 12.5 From the last remark, F is convex. Suppose now that y0 ∈ D is such that
d
[ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b].
dx
Let v ∈ Va = {v ∈ V : v(a) = v(b) = 0}. Thus,
Z b
δ F(y0 ; v) = ( fy (x, y0 (x), y00 (x))v(x) + fz (x, y0 (x), y00 (x))v0 (x)) dx
a
Z b
d
= ( fz (x, y0 (x), y00 (x))v(x)) + fz (x, y0 (x), y00 (x))v0 (x) dx
a dx
Z b
d
= [ fz (x, y0 (x), y00 (x))v(x)] dx
a dx
= [ fz (x, y0 (x), y00 (x))v(x)]ba
= fz (b, y0 (b), y00 (b))v(b) − fz (a, y0 (a), y00 (a))v(a)
= 0, ∀v ∈ Va . (12.27)
Since F is convex, from this and Theorem 12.3.5, we may infer that y0 minimizes J on D.
Define F : D → R by
Z 1
F(y) = [y0 (x)2 + 5y(x)] dx, ∀y ∈ D.
0
Observe that Z 1
F(y) = f (x, y, y0 ) dx
0
where
f (x, y, z) = z2 + 5y,
that is, f (x, y, z) is convex.
Thus, from the last theorem F is convex and if y0 ∈ D is such that
d
fz (x, y0 (x), y00 (x)) = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b],
dx
Basic Topics on the Calculus of Variations 265
then y0 minimizes F on D.
Considering that fz (x, y, z) = 2z and fy (x, y, z) = 5, from this last equation we get,
d
(2y0 (x)) = 5,
dx 0
that is,
5
y000 (x) = , ∀x ∈ [0, 1].
2
Thus,
5
y00 (x) = x + c,
2
and
5
y0 (x) = x2 + cx + d.
4
From this and y0 (0) = 0, we obtain d = 0.
From this and y0 (1) = 1, we have
5
+ c = 1,
4
so that
c = −1/4
Therefore
5x2 x
y0 (x) = −
4 4
minimizes F on D.
The example is complete.
Theorem 12.5.1 Let V = C1 ([a, b]). Let f ∈ C2 ([a, b] × R × R) be such that f (x, y, z) is convex. Define
D = {y ∈ V : y(a) = a1 },
where a1 ∈ R.
Define also F : D → R by
Z b
F(y) = f (x, y(x), y0 (x)) dx.
a
Under such hypotheses,F is convex and if y0 ∈ D is such that
d
[ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b]
dx
and
fz (b, y0 (b), y00 (b)) = 0
then y0 minimizes F on D, that is,
F(y0 ) ≤ F(y), ∀y ∈ D.
266 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 12.6 Since f (x, y, z) is convex from the last remark F is convex. Suppose now that y0 ∈ D is such
that
d
[ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b]
dx
and
fz (b, y0 (b), y00 (b)) = 0.
Let v ∈ Va = {v ∈ V : v(a) = 0}. Thus,
Z b
δ F(y0 ; v) = ( fy (x, y0 (x), y00 (x))v(x) + fz (x, y0 (x), y00 (x))v0 (x)) dx
a
Z b
d
= ( fz (x, y0 (x), y00 (x))v(x)) + fz (x, y0 (x), y00 (x))v0 (x) dx
a dx
Z b
d 0
= [ fz (x, y0 (x), y0 (x))v(x)] dx
a dx
= [ fz (x, y0 (x), y00 (x))v(x)]ba
= fz (b, y0 (b), y00 (b))v(b) − fz (a, y0 (a), y00 (a))v(a)
= 0v(b) − fz (a, y0 (a), y00 (b))0
= 0, ∀v ∈ Va . (12.28)
Since F is convex, from this and Theorem 12.3.5, we may infer that y0 minimizes J on D.
Remark 12.5.2 About this last theorem y(a) = a1 is said to be an essential boundary condition, whereas
fz (b, y0 (b), y00 (b)) = 0 is said to be a natural boundary condition.
Theorem 12.5.3 Let V = C1 ([a, b]). Let f ∈ C2 ([a, b] × R × R) where f (x, y, z) is convex. Define
D=V
and F : D → R by
Z b
F(y) = f (x, y(x), y0 (x)) dx.
a
Under such hypotheses, F is convex and if y0 ∈ D is such that
d
[ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b],
dx
fz (a, y0 (a), y00 (a)) = 0
and
fz (b, y0 (b), y00 (b)) = 0,
then y0 minimizes F on D, that is,
F(y0 ) ≤ F(y), ∀y ∈ D.
Proof 12.7 From the last remark F is convex. Suppose that y0 ∈ D is such that
d
[ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)), ∀x ∈ [a, b]
dx
and
fz (a, y0 (a), y00 (a)) = fz (b, y0 (b), y00 (b)) = 0.
Basic Topics on the Calculus of Variations 267
Let v ∈ D = V . Thus,
Z b
δ F(y0 ; v) = ( fy (x, y0 (x), y00 (x))v(x) + fz (x, y0 (x), y00 (x))v0 (x)) dx
a
Z b
d 0 0 0
= ( fz (x, y0 (x), y0 (x))v(x)) + fz (x, y0 (x), y0 (x))v (x) dx
a dx
Z b
d 0
= [ fz (x, y0 (x), y0 (x))v(x)] dx
a dx
= [ fz (x, y0 (x), y00 (x))v(x)]ba
= fz (b, y0 (b), y00 (b))v(b) − fz (a, y0 (a), y00 (a))v(a)
= 0v(b) − 0v(a)
= 0, ∀v ∈ D. (12.29)
Since F is convex, from this and from Theorem 12.3.5, we may conclude that y0 minimizes J on D = V .
The proof is complete.
Remark 12.5.4 About this last theorem, the conditions fz (a, y0 (a), y00 (a)) = fz (b, y0 (b), y00 (b)) = 0 are said
to be natural boundary conditions and the problem in question a free extremal one.
Exercise 12.5.5 Show that F is convex and obtain its point of global minimum on D, D1 and D2 , where
Z 2 0 2
y (x)
F(y) = dx,
1 x
and where
1.
D = {y ∈ C1 ([1, 2]) : y(1) = 0, y(1) = 3},
2.
D1 = {y ∈ C1 ([1, 2]) : y(2) = 3}.
3.
D2 = C1 ([1, 2]).
For D, from Theorem 12.4.1, sufficient conditions of optimality are given by,
dx [ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)) in [1, 2],
d
y (1) = 0, (12.30)
0
y0 (2) = 3.
For D1 , from Theorem 12.5.1, sufficient conditions of global optimality are given by
dx [ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)) in [1, 2],
d
y (2) = 3, (12.31)
0
fz (1, y0 (1), y00 (1)) = 0.
For D2 , from Theorem 12.5.3, sufficient conditions of global optimality are given by
dx [ fz (x, y0 (x), y00 (x))] = fy (x, y0 (x), y00 (x)) in [1, 2],
d
Thus, we have
cx2
y0 (x) = + d.
4
Basic Topics on the Calculus of Variations 269
represents the energy of a straight beam with rectangular cross section with inertial moment I. Here y(x)
denotes the vertical displacement of the point x ∈ [0, 1] resulting from the action of distributed vertical load
P(x) = αx, ∀x ∈ [0, 1], where E > 0 is the Young modulus and α > 0 is a real constant.
And also
D = {y ∈ V : y(0) = y(1) = 0}.
Under such hypotheses,
then y0 minimizes F on D.
3. Find the optimal solution y0 ∈ D.
Solution:
Let y ∈ D and v ∈ Va = {v ∈ V : v(0) = v(1) = 0}.
We recall that
F(y + εv) − F(y)
δ J(y; v) = lim
ε→0 ε
(EI/2) 0 [(y + εv00 )2 − (y00 )2 ] dx − 01 (P(y + εv) − Py) dx
R 1 00 R
= lim
ε→0 ε
Z 1 Z 1
εEI
= lim (EIy00 v00 − Pv) dx + (v00 )2 dx
ε →0 0 2 0
Z 1
= (EIy00 v00 − Pv) dx. (12.34)
0
270 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Thus,
Z 1
δ J(y; v) = (EIy00 v00 − Pv) dx
0
Z 1
= (EIy00 v00 − EIy(4) v) dx
0
Z 1
000
= (EIy00 v00 + EIy v0 ) dx − [EIy000 (x)v(x)]ba
0
Z 1
000
= (EIy00 v00 + EIy v0 ) dx
0
Z 1
00
= (EIy00 v00 − EIy v00 ) dx + [EIy00 (x)v0 (x)]ba
0
= 0 (12.37)
Summarizing
δ J(y0 ; v) = 0, ∀v ∈ Va .
Therefore, since J is convex, we may conclude that y0 minimizes J on D.
To obtain the solution of the ODE is question, we shall denote
d4
EI [y p (x)] = P(x), ∀x ∈ [0, 1].
dx4
The homogeneous associated equation
d4
EI [yh (x)] = 0,
dx4
has the following general solution
yh (x) = ax3 + bx2 + cx + d,
Basic Topics on the Calculus of Variations 271
and thus,
αx5
y0 (x) = y p (x) + yh (x) = + ax3 + bx2 + cx + d.
120EI
From y0 (0) = 0, we obtain d = 0.
Observe that y00 (x) = 120EI
5α
x4 + 3ax2 + 2bx + c e y000 (x) = 6EI
α 3
x + 6ax + 2b.
00
From this and y0 (0) = 0, we get b = 0.
From y000 (1) = 0, we obtain,
α 3
1 + 6a 1 = 0,
6EI
and thus
α
a=− .
36EI
From such results and from y0 (1) = 0, we obtain
α α α
+ a 13 + c 1 = − + c = 0,
120EI 120EI 36EI
that is,
α 1 1 7α
c= − = .
EI 36 120 360EI
Finally, we have that
αx5 αx3 7αx
y0 (x) = − +
120EI 36EI 360EI
minimizes J on D.
The solution is complete.
where
Va = {v ∈ C1 ([a, b]) : v(a) = v(b) = 0}.
Under such hypotheses, there exists c ∈ R such that
Moreover, Z a
v(a) = (h(t) − c) dt = 0,
a
and Z b Z b
v(b) = (h(t) − c) dt = h(t) dt − c(b − a) = c(b − a) − c(b − a) = 0,
a a
so that v ∈ Va .
Observe that, from this and the hypotheses,
Z b
0 ≤ (h(t) − c)2 dt
a
Z b
= (h(t) − c)(h(t) − c) dt
a
Z b
= (h(t) − c)v0 (t) dt
a
Z b Z b
= h(t)v0 (t) dt − c v0 (t) dt
a a
= 0 − c(v(b) − v(a))
= 0. (12.38)
Thus, Z b
(h(t) − c)2 dt = 0.
a
Since h is continuous, we may infer that
that is,
h(x) = c, ∀x ∈ [a, b].
The proof is complete.
where
Va = {v ∈ C1 ([a, b]) : v(a) = v(b) = 0}.
Under such hypotheses, h ∈ C1 ([a, b]) and
From this and from the du Bois - Reymond lemma, we may conclude that
for some c ∈ R.
Thus
g(x) = G0 (x) = h0 (x), ∀x ∈ [a, b],
so that
g ∈ C1 ([a, b]).
The proof is complete.
Lemma 12.6.3 (Fundamental lemma of calculus of variation for one dimension) Let g ∈ C([a, b]) = V.
Assume Z b
g(x)v(x) dx = 0, ∀v ∈ Va ,
a
where again,
Va = {v ∈ C1 ([a, b]) : v(a) = v(b) = 0}.
Under such hypotheses,
g(x) = 0, ∀x ∈ [a, b].
Let v ∈ Va .
Let w ∈ C([a, b]) be such that
w(x) = v0 (x), ∀x ∈ [a, b].
274 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Observe that Z b Z b
w(x) dx = v0 (x) dx = [v(x)]ab = v(b) − v(a) = 0.
a a
Rb
From this a h(x)w(x) dx = 0, and thus
Z b
h(x)v0 (x) dx = 0.
a
Since v ∈ Va is arbitrary, from this and the du Bois-Reymond lemma, there exists c ∈ R such that
h(x) = c, ∀x ∈ [a, b].
The solution is complete.
where we denote
dx = dx1 · · · dxn .
Assume f : Ω × R × Rn → R is of C2
class. Suppose also f (x, y, z) is convex in (y, z), ∀x ∈ Ω, which we
denote by f (x, y, z) to be convex.
Observe that for y ∈ D and v ∈ Va , where
D = {y ∈ V : y = y1 on ∂ Ω},
and
Va = {v ∈ V : v = 0 on ∂ Ω},
where
y1 ∈ C1 (Ω),
we have that
∂
δ F(y; v) = F(y + εv)|ε=0 ,
∂ε
where Z
F(y + εv) = f (x, y + εv, ∇y + ε∇v) dx.
Ω
Therefore,
Z
∂ ∂
F(y + εv) = ( f (x, y + εv, ∇y + ε∇v) dx
∂ε Ω ∂ε
Z n
= [ fy (x, y + εv, ∇y + ε∇v)v + ∑ fzi (x, y + εv, ∇y + ε∇v)vxi ] dx. (12.40)
Ω i=1
Thus,
∂
δ F(y; v) = F(y + εv)|ε=0
∂ε
Z n
= [ fy (x, y, ∇y)v + ∑ fzi (x, y, ∇y)vxi ] dx. (12.41)
Ω i=1
Basic Topics on the Calculus of Variations 275
∇ f (x, y, ∇y) = ( fy (x, y, ∇y), fz1 (x, y, ∇y), · · · , fzn (x, y, ∇y)).
Theorem 12.7.1 Let Ω ⊂ Rn be a set of Ĉ1 class and let V = C1 (Ω). Let f ∈ C2 (Ω × R × R) where f (x, y, z)
is convex. Define
D = {y ∈ V : y = y1 em ∂ Ω},
where y1 ∈ C1 (Ω)
Define also F : D → R by Z
F(y) = f (x, y(x), ∇y(x)) dx.
Ω
From such hypotheses, F is convex and if y0 ∈ D is such that
n
d
∑ dxi [ fzi (x, y0 (x), ∇y0 (x))] = fy (x, y0 (x), ∇y0 (x)), ∀x ∈ Ω,
i=1
Proof 12.11 From the last remark, F is convex. Suppose now that y0 ∈ D is such that
n
d
∑ dxi [ fzi (x, y0 (x), ∇y0 (x))] = fy (x, y0 (x), ∇y0 (x)), ∀x ∈ Ω,
i=1
where n = (n1 , · · · , nn ) denotes the outward normal field to ∂ Ω = S. Since F is convex, from this and Theorem
12.3.5, we have that y0 minimizes F on D.
276 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
12.8 ˆ
The second Gateaux variation
Definition 12.8.1 Let V be a Banach space. Let F : D ⊂ V → R be a functional such that δ F(y; v) exists on
Br (y0 ) for y0 ∈ D, r > 0 and for all v ∈ Va .
Let y ∈ Br (y0 ) and v, w ∈ Va . We define the second Gateaux
ˆ variation of F at the point y in the directions
v and w, denoted by δ 2 F(y; v, w), as
Remark 12.8.2 Observe that from this last definition, if the limits in question exist, we have
∂
δ F(y; v) = F(y + εv)|ε=0 ,
∂ε
and
∂2
δ 2 F(y; v, v) = F(y + εv)|ε=0 , ∀v ∈ Va .
∂ ε2
Thus, for example, for V = C1 (Ω) where Ω ⊂ Rn is of Ĉ1 class and F : V → R is given by
Z
F(y) = f (x, y, ∇y) dx
Ω
and where
f ∈ C2 (Ω × R × Rn ),
for y, v ∈ V , we have
∂2
δ 2 F(y; v, v) = F(y + εv)|ε=0 ,
∂ ε2
where
∂2 ∂2
Z
F(y + εv) = f (x, y + εv, ∇y + ε∇v) dx
∂ ε2 ∂ ε2 Ω
∂2
Z
= [ f (x, y + εv, ∇y + ε∇v)] dx
∂ ε2
Ω
"
Z n
= fyy (x, y + εv, ∇y + ε∇v)v2 + ∑ 2 fyzi (x, y + εv, ∇y + ε∇v)vvxi
Ω i=1
#
n n
+ ∑ ∑ fzi z j (x, y + εv, ∇y + ε∇v)vxi vx j dx (12.44)
i=1 j=1
so that
∂2
δ 2 F(y; v, v) = 2
F(y + εv)|ε=0
∂ ε"
Z n
= fyy (x, y, ∇y)v2 + ∑ 2 fyzi (x, y, ∇y)vvxi
Ω i=1
#
n n
+ ∑ ∑ fzi z j (x, y, ∇y)vxi vx j dx. (12.45)
i=1 j=1
Basic Topics on the Calculus of Variations 277
Theorem 12.9.2 [First order necessary condition] Let V be a Banach space. Let F : D ⊂ V → R be a
functional. Suppose that y0 ∈ D is a point of local minimum for F on D. Let v ∈ Va and assume δ F(y0 ; v) to
exist.
Under such hypotheses,
δ F(y0 ; v) = 0.
Proof 12.12 Define φ (ε) = F(y0 + εv), which from the existence of δ F(y0 ; v) is well defined for all ε
sufficiently small.
Also from the hypotheses, ε = 0 is a point of local minimum for the differentiable at 0 function φ .
Thus, from the standard condition for one variable calculus, we have
φ 0 (0) = 0,
thst is,
φ 0 (0) = δ F(y0 ; v) = 0.
The proof is complete.
Theorem 12.9.3 (Second order sufficient condition) Let V be a Banach space. Let F : D ⊂ V → R be a
functional. Suppose y0 ∈ D is such that δ F(y0 ; v) = 0 for all v ∈ Va and there exists δ > 0 such that
φ 0 (0) 1
φ (1) = φ (0) + (1 − 0) + φ 00 (t0 )(1 − 0)2 ,
1! 2!
That is,
F(y) = F(y0 + v)
1
= F(y0 ) + δ F(y0 ; v) + δ 2 F(y0 + t0 v; v, v)
2
1
= F(y0 ) + δ 2 F(y0 + t0 v; v, v)
2
≥ F(y0 ), ∀y ∈ Bδ (y0 ) ∩ D. (12.46)
12.11 ˆ
The Gateaux variation, the formal proof of its formula
ˆ
In the previous sections we had obtained the Gateaux variations formulas with some informality for a rela-
tively large class of functionals.
In this section we intend to provide a formal proof for such formulas.
Our main result is summarized by the following theorem.
Theorem 12.11.1 Let Ω ⊂ Rn be sets of Ĉ1 class and let V = C1 (Ω).
Let f : Ω × R × Rn → R be a function of C1 class.
Define F : V → R by Z
F(y) = f (x, y(x), ∇y(x)) dx.
Ω
Let y, v ∈ V . Under such hypotheses
!
Z n
δ F(y; v) = fy (x, y(x), ∇y(x))v(x) + ∑ fzi (x, y(x), ∇y(x))vxi (x) d x.
Ω i=1
Hence,
where Z
m(Ω) = dx.
Ω
Therefore, if j > j0 , then
Z Z
[Gn j (x) − G(x)] dx ≤ |Gn j (x) − G(x)| dx
Ω Ω
Z
≤ cn j dx
Ω
= cn j m(Ω)
< ε. (12.51)
Thus, Z Z
lim Gn j (x) dx = G(x) dx.
j→∞ Ω Ω
where
f (x, y(x) + ε̃n v(x), ∇y(x) + ε̃n ∇v(x)) − f (x, y(x), ∇y(x))
G̃n (x) = ,
ε̃n
∀n ∈ N, x ∈ Ω.
However, as above indicated, we may obtain a subsequence {ε̃n j } of {ε̃n } such that
Z Z
lim G̃n j (x) dx = G(x) dx,
j→∞ Ω Ω
that is,
1.
∂ f (x, s, ξ )
∂s
and
2.
∂ f (x, s, ξ )
∂ξ
are continuous ∀(x, s, ξ ) ∈ Ω × RN × RN×n .
13.2 ˆ
The Gateaux variation, a more general case
Theorem 13.2.1 Consider the functional F : U → R, where
U = {u ∈ W 1,2 (Ω, RN ) | u = u0 in ∂ Ω}.
Suppose Z
F(u) = f (x, u, ∇u) dx,
Ω
where f : Ω × RN × RN×n is such that, for each K > 0 there exists K1 > 0 such that
Define
f (x, u + εϕ, ∇u + ε∇ϕ) − f (x, u, ∇u)
G(x, u, ϕ, ε) = ,
ε
and
∂ f (x, u, ∇u) ∂ f (x, u, ∇u)
G̃(x, u, ϕ) = ·ϕ + · ∇ϕ.
∂s ∂ξ
Thus we have
lim G(x, u, ϕ, ε) = G̃(x, u, ϕ), a.e in Ω.
ε→0
Now will show that Z Z
lim G(x, u, ϕ, ε) dx = G̃(x, u, ϕ) dx.
ε→0 Ω Ω
It suffices to show that (we do not provide details here), for an arbitrary sequence {εn } ⊂ R such that
εn → 0, as n → ∞
we have Z Z
lim G(x, u, ϕ, εn ) dx = G̃(x, u, ϕ) dx.
n→∞ Ω Ω
Observe that, for an appropriate K > 0, we have
|G(x, u, ϕ, εn )| ≤ K(|ϕ| + |∇ϕ|), a.e. in Ω. (13.1)
By the Lebesgue dominated convergence theorem, we obtain
Z Z
lim G(x, u, ϕ, εn ) dx = G̃(x, u, ϕ) dx,
n→∞ Ω Ω
that is, Z
∂ f (x, u, ∇u) ∂ f (x, u, ∇u)
δ F(u, ϕ) = ·ϕ + · ∇ϕ dx.
Ω ∂s ∂ξ
13.3 ´
Frechet differentiability
´
In this section we introduce a very important definition namely, Frechet differentiability.
Definition 13.3.1 Let U,Y be Banach spaces and consider a transformation T : U → Y. We say that T is
´ het differentiable at u ∈ U if there exists a bounded linear transformation T 0 (u) : U → Y such that
Frec
kT (u + v) − T (u) − T 0 (u)(v)kY
lim 6 θ.
= 0, v =
v→θ kvkU
In such a case T 0 (u) is called the Frec
´ het derivative of T at u ∈ U.
where
a(x) = fξ ξ (x, u(x), Du(x)),
b(x) = 2 fsξ (x, u(x), Du(x)),
and
c(x) = fss (x, u(x), Du(x)).
Thus given x0 ∈ Ω for λ sufficiently small we have that ϕ(x) = λ v x−x
v ∈ Cc∞ (B1 (0), RN ). 0
Now consider λ
is an admissible direction. Now we introduce the new coordinates y = (y1 , ..., yn ) by setting y = λ −1 (x − x0 )
and multiply (13.2) by λ −n to obtain
Z
{a(x0 + λ y)Dv(y) · Dv(y) + 2λ b(x0 + λ y)v(y) · Dv(y) + λ 2 c(x0 + λ y)v(y) · v(y)} dy > 0,
B1 (0)
αβ β
where a = {ai j }, b = {b jk } and c = {c jk }. Since a, b and c are continuous, we have
where
f˜jk = a jk (x0 ) = fξαi ξ k (x0 , u(x0 ), ∇u(x0 )).
αβ αβ
β
By analogy for
v j = ρ j sin((η · y)t)ζ (y)
286 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
we obtain
Z
f˜jk ρ j ρ k
αβ
0 ≤ (ηα t(cos((η · y)t)ζ + sin((η · y)t)Dα ζ )
B1 (0)
· ηβ t(cos((η · y)t)ζ + sin((η · y)t)Dβ ζ dy (13.5)
Summing up these last two equations, dividing the result by t 2 and letting t → +∞ we obtain
Z
0 ≤ f˜jk ρ j ρ k ηα ηβ ζ 2 dy,
αβ
B1 (0)
u0 (t j +) =
6 u0 (t j −), ∀ j ∈ {1, · · · , n}.
Theorem 13.5.2 (Weierstrass) Let Ω = (a, b) and f : Ω̄ × RN × RN → R be such that fs (x, s, ξ ) and
¯ × RN × RN .
fξ (x, s, ξ ) are continuous on Ω
Define F : U → R by
Z b
F(u) = f (x, u(x), u0 (x)) dx,
a
where
U = {u ∈ Ĉ1 ([a, b]; RN ) | u(a) = α, u(b) = β }.
Suppose u ∈ U minimizes locally F on U, that is, suppose that there exists ε0 > 0 such that
and
E(x, u(x), u0 (x−), w) ≥ 0, ∀x ∈ [a, b], w ∈ RN ,
where
u(x + h) − u(x)
u0 (x+) = lim ,
h→0+ h
and
u(x + h) − u(x)
u0 (x−) = lim , ∀x ∈ (a, b)
h→0− h
and,
E(x, s, ξ , w) = f (x, s, w) − f (x, s, ξ ) − fξ (x, s, ξ )(w − ξ ).
Proof 13.4 Fix x0 ∈ (a, b) and w ∈ RN . Choose 0 < ε < 1 and h > 0 such that u + v ∈ U and
kvk∞ < ε0
where
ε
ε̃ = .
1−ε
From
F(u + v) − F(u) ≥ 0
we obtain
Z x0 +h Z x0 +h
f (x, u(x) + v(x), u0 (x) + v0 (x)) dx − f (x, u(x), u0 (x)) dx ≥ 0. (13.6)
x0 x0
Define
x − x0
x̃ = ,
h
so that
dx
dx̃ = .
h
From (13.6) we obtain
Z 1
h f (x0 + x̃h, u(x0 + x̃h) + v(x0 + x̃h), u0 (x0 + x̃h) + v0 (x0 + x̃h) dx̃
0
Z 1
−h f (x0 + x̃h, u(x0 + x̃h), u0 (x0 + x̃h)) dx̃ ≥ 0. (13.7)
0
Therefore
Z ε
f (x0 + x̃h, u(x0 + x̃h) + v(x0 + x̃h), u0 (x0 + x̃h) + w) dx̃
0
Z ε
− f (x0 + x̃h, u(x0 + x̃h), u0 (x0 + x̃h)) dx̃
0
Z 1
+ f (x0 + x̃h, u(x0 + x̃h) + v(x0 + x̃h), u0 (x0 + x̃h) − ε̃w) dx̃
ε
Z 1
− f (x0 + x̃h, u(x0 + x̃h), u0 (x0 + x̃h)) dx̃
ε
≥ 0. (13.8)
Letting h → 0 we obtain
f (x0 , u(x0 ), u0 (x0 +) + w) − f (x0 , u(x0 ), u0 (x0 +)) − fξ (x0 , u( x0 ), u0 (x0 +)) · w ≥ 0.
f (x0 , u(x0 ), u0 (x0 −) + w) − f (x0 , u(x0 ), u0 (x0 −)) − fξ (x0 , u( x0 ), u0 (x0 −)) · w ≥ 0.
Theorem 13.1
Assume u ∈ C1 (Ω; RN ) is a point of strong minimum for a Frec
´ het differentiable functional F : U → R that
is, in particular, there exists ε > 0 such that
F(u + ϕ) ≥ F(u),
Under such hypotheses, for all x ∈ Ω and each rank-one matrix η = {ρi β α } = {ρ ⊗ β }, we have that
where
that is Z
(ϕ · fs (x, u(x), Du(x)) + Dϕ · fξ (x, u(x), Du(x)) dx = 0,
Ω
and hence,
Z
( f (x, u(x), Du(x) + Dϕ(x)) − f (x, u(x), Du(x)) dx
Ω
Z
− (ϕ(x) · fs (x, u(x), Du(x)) − Dϕ(x) · fξ (x, u(x), Du(x)) dx
Ω
≥ 0, (13.11)
∀ϕ ∈ V , where
V = {ϕ ∈ Cc∞ (Ω; RN ) : kϕk∞ < ε}.
Choose a unity vector e ∈ Rn and write
x = (x · e)e + x,
where
x · e = 0.
Denote De v = Dv · e, and let ρ = (ρ1 , ...., ρN ) ∈ RN .
Also, let x0 be any point of Ω. Without loss of generality assume x0 = 0.
Choose λ0 ∈ (0, 1) such that Cλ0 ⊂ Ω, where,
and such that φk0 converges uniformly to φλ0 on each compact subset of
Aλ = {t : −λ 2 < t < λ , t =
6 0}.
290 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
We emphasize the choice of {φk } may be such that for some K > 0 we have kφ k∞ < K, kφk k∞ < K and
kφk0 k∞ < K, ∀k ∈ N.
Observe that for any sufficiently small λ > 0 we have that ϕk defined by
Cλ = {x ∈ Rn : |x · e| ≤ λ and kxk ≤ λ }.
Define Cλ+ and Cλ− by
Cλ− = {x ∈ Cλ : x · e ≤ 0},
and
Cλ+ = {x ∈ Cλ : x · e > 0}.
Hence, denoting
and
Now define
y = ye e + y,
where
x·e
ye = ,
λ2
and
x
y= .
λ
More Topics on the Calculus of Variations 291
The sets Cλ− and Cλ+ correspond, concerning the new variables, to the sets Bλ− and Bλ+ , where
B−
λ = {y : kyk ≤ 1, and − λ
−1
≤ ye ≤ 0},
−1
B+ e
λ = {y : kyk ≤ 1, and 0 < y ≤ λ }.
Therefore, since dx = λ n+1 dy, multiplying (13.15) by λ −n−1 , we obtain
Z Z Z
g(x(y)) dy + g(x(y)) dy + g(x(y)) dy ≥ 0, (13.16)
B−
1 B−
λ
\B−
1 B+
λ
where
x = (x · e)e + x = λ 2 ye + λ y ≡ x(y).
Observe that
ρφ (kyk2 ) if − 1 ≤ ye ≤ 0,
De ϕ(x) = ρφ (kyk2 )(−λ ) if 0 ≤ ye ≤ λ −1 , (13.17)
0, otherwise.
that
|g(x(y))| ≤ o(λ ), as λ → 0.
Since the Lebesgue measures of B− +
λ and Bλ are bounded by
2n−1 /λ
lim o(1)/λ = 0,
λ →0+
we get,
Z
[ f (0, u(0), Du(0) + ρφ (kyk2 )e) − f (0, u(0), Du(0))
B
−ρφ (kyk2 )e fξ (0, u(0), Du(0))] dy2 ...dyn
≥ 0, (13.19)
292 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where B is an appropriate limit set (we do not provide more details here) such that
Finally, inequality (13.19) is valid for a sequence {φn } (in place of φ ) such that
if
Z b
( fs (x, u(x), u0 (x)) · ϕ + fξ (x, u(x), u0 (x)) · ϕ 0 (x)) dx = 0,
a
Proof 13.6 Fix ϕ ∈ Cc∞ ([a, b]; RN ). Integration by parts of the extremal condition
δ F(u, ϕ) = 0,
implies that
Z b Z bZ x
fξ (x, u(x), u0 (x)) · ϕ 0 (x) dx − fs (t, u(t), u0 (t)) dt · ϕ 0 (x) dx = 0.
a a a
More Topics on the Calculus of Variations 293
Since ϕ is arbitrary, from the du Bois-Reymond lemma, there exists c ∈ RN such that
Z x
fξ (x, u(x), u0 (x)) − fs (t, u(t), u0 (t)) dt = c, ∀x ∈ [a, b].
a
Theorem 13.7.3 (Weierstrass-Erdmann corner conditions) Let I = [a, b]. Suppose u ∈ Ĉ1 ([a, b]; RN ) is
such that
F(u) ≤ F(v), ∀v ∈ Cr ,
for some r > 0. where
Cr = {v ∈ Ĉ1 ([a, b]; RN ) | v(a) = u(a), v(b) = u(b), and ku − vk∞ < r}.
Let x0 ∈ (a, b) be a corner point of u. Denoting u0 = u(x0 ), ξ0+ = u0 (x0+ ) and ξ0− = u0 (x0− ), then the
following relations are valid:
1. fξ (x0 , u0 , ξ0− ) = fξ (x0 , u0 , ξ0+ ),
2.
Remark 13.7.4 The conditions above are known as the Weierstrass-Erdmann corner conditions.
Proof 13.7 Condition (1) is just a consequence of equation (13.21). For (2), define
τε (x) = x + ελ (x),
where λ ∈ Cc∞ (I). Observe that τε (a) = a and τε (b) = b, ∀ε > 0. Also τ0 (x) = x. Choose ε0 > 0 sufficiently
small such that for each ε satisfying |ε| < ε0 , we have τε0 (x) > 0 and
Define
φ (ε) = F(x, ũε , ũ0ε (x)).
Thus φ has a local minimum at 0, so that φ 0 (0) = 0, that is
Defining
x̄ = τε−1 (x),
we obtain
1
dx̄ = dx,
1 + ελ 0 (x̄)
that is
dx = (1 + ελ 0 (x̄)) dx̄.
Dropping the bar for the new variable, we may write
u0 (x)
Z b
1 + ελ 0 (x) dx.
F(ũε ) = f x + ελ (x), u(x),
a 1 + ελ 0 (x)
From
dF(ũε )
|ε=0 ,
dε
we obtain
Z b
(λ fx (x, u(x), u0 (x)) + λ 0 (x)( f (x, u(x), u0 (x)) − u0 (x) fξ (x, u(x), u0 (x)))) dx = 0. (13.22)
a
Proof 13.8 Observe that δ F(u, ϕ) = 0, ∀ϕ ∈ Cc∞ (Ω; RN ), thus u is a extremal of F and through integration
by parts and the fundamental lemma of calculus of variations, we obtain
L f (u) = 0, in Ω,
where
L f (u) = fs (x, u(x), ∇u(x)) − div( fξ (x, u(x), ∇u(x)).
Defining
V = {ϕ ∈ C1 (Ω; RN ) | ϕ = 0 on Γ0 },
for an arbitrary ϕ ∈ V , we obtain
Z
δ F(u, ϕ) = L f (u) · ϕ dx
Ω
Z
+ nα fξαi (x, u(x), ∇u(x))ϕ i (x) dΓ
Γ1
Z
= nα fξαi (x, u(x), ∇u(x))ϕ i (x) dΓ
Γ1
= 0, ∀ϕ ∈ V . (13.23)
and in particular
G(x) > β /2, in Br (x0 ) ∩ Γ1 .
Choose 0 < r1 < r such that Br1 (x0 ) ∩ Γ0 = 0/. This is possible since Γ0 is closed and x0 ∈ Γ1 .
Choose ϕ i ∈ Cc∞ (Br1 (x0 )) such that ϕ i ≥ 0 in Br1 (x0 ) and ϕ i > 0 in Br1 /2 (x0 ). Therefore
Z Z
β
G(x)ϕ i (x) dx > ϕ i dx > 0,
Γ1 2 Γ1
G(x) ≤ 0, ∀x ∈ Γ1 ,
and by analogy
G(x) ≥ 0, ∀x ∈ Γ1 ,
so that
G(x) = 0, ∀x ∈ Γ1 .
The proof is complete.
Chapter 14
Definition 14.1.1 (Convex functional) Let U be a vector space and let S ⊂ U be a convex set. A functional
F : S → R̄ = R ∪ {+∞, −∞} is said to be convex, if
Definition 14.2.2 Let U be a Banach space. Consider the weak topology σ (U,U ∗ ) for U and let F : U →
R ∪ {+∞} be a functional. Let u ∈ U. We say that F is weakly lower semi-continuous at u ∈ U if for each
λ < F(u), there exists a weak neighborhood Vλ (u) ∈ σ (U,U ∗ ) such that
Proof 14.1 Assume F is w.l.s.c.. We are going to show that E pi(F )c is open for σ (U,U ∗ ) × R. Choose
(u, r) ∈ E pi(F)c . Thus (u, r) 6∈ E pi(F), so that r < F(u). Select λ such that r < λ < F(u). Since F is w.l.s.c.
at u, there exists a a weak neighborhood Vλ (u) such that
Thus,
Vλ (u) × (−∞, λ ) ⊂ E pi(F )c
so that (u, r) is an interior point of E pi(F )c and hence, since such a point is arbitrary in E pi(F )c , we may
infer that E pi(F)c is open so that E pi(F) is closed for the topology in question. Assume now (2). Observe
that
HγF × {γ} = E pi(F) ∩ (U × {γ}).
From the hypotheses E pi(F) is closed, that is, HγF × {γ} is closed and thus HγF is closed.
Assume (3). To obtain (4), it suffices to consider the complement of HγF . Suppose (4) is valid. Let u ∈ U
and let γ ∈ R be such that
γ < F(u).
Since GFγ is open for σ (U,U ∗ ) there exists a weak neighborhood V (u) such that
V (u) ⊂ GFγ ,
so that
F(v) > γ, ∀v ∈ V (u),
and hence
inf F(v) ≥ γ .
v∈V (u)
In particular, we have
lim inf F(v) ≥ γ.
v*u
Thus, there exists a weak neighborhood V (u) such that F(v) ≥ F(u) − ε > λ , ∀v ∈ V (u).
The proof is complete.
298 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Remark 14.2.4 A similar result is valid for the strong topology (in norm) of a Banach space U so that a
F : U → R ∪ {+∞} is strongly lower semi-continuous (l.s.c.) at u ∈ U, if
lim inf F(v) ≥ F(u). (14.2)
v→u
Proof 14.2 The result follows from the fact for F l.s.c., its epigraph being convex and strongly closed, it is
also weakly closed.
where
A∗ = {(u∗ , α) ∈ U ∗ × R | hv, u∗ iU + α ≤ F(v), ∀v ∈ U} (14.5)
Definition 14.3.2 (Bipolar functional) Let U be a Banach space and let F : U → R̄ be a functional. We
define the bi-polar functional related to F, denoted by F ∗∗ : U → R̄, as
F ∗∗ (u) = sup {hu, u∗ iU − F ∗ (u∗ )}, ∀u ∈ U. (14.7)
u∗ ∈U ∗
Proposition 14.3.3 Let U be a Banach space and let F : U → R̄ be a functional. Under such hypotheses
F ∗∗ (u) = CF(u), ∀u ∈ U and in particular, if F ∈ Γ(U), then F ∗∗ (u) = F(u), ∀u ∈ U.
Proof 14.3 By the definition, the convex envelop of F is the supremum of affine-continuous functionals
bounded by F at the point in question. In fact we need only to consider those which are maximal, that is,
only those in the form
u 7→ hu, u∗ iU − F ∗ (u∗ ). (14.8)
Thus,
CF(u) = sup {hu, u∗ iU − F ∗ (u∗ )} = F ∗∗ (u). (14.9)
u∗ ∈U ∗
Convex Analysis and Duality Theory 299
Corollary 14.3.4 Let U be a Banach space and let F : U → R̄ be a functional. Under such hypotheses,
F ∗ = F ∗∗∗ .
F ∗ ≤ F ∗∗∗ . (14.10)
so that
ˆ
At this point, we recall the definition of Gateaux differentiability.
ˆ
Definition 14.3.5 (Gateaux differentiability) Let U be a Banach space. A functional F : U → R̄ is said to
be Gateaux
ˆ differentiable at u ∈ U, if there exists u∗ ∈ U ∗ such that
F(u + λ h) − F (u)
lim = hh, u∗ iU , ∀h ∈ U. (14.13)
λ →0 λ
The vector u∗ is said to be the Gateaux
ˆ derivative of F : U → R at u and may denoted by
∂ F(u)
u∗ = or u∗ = δ F(u) (14.14)
∂u
Definition 14.3.6 (Sub-gradients) Let U be a Banach space and let F : U → R̄ be a functional. We define
the set of sub-gradients of F at u, denoted by ∂ F(u), by
Lemma 14.3.7 (Continuity of convex functions) Let U be a Banach space and let F : U → R be a convex
functional. Let u ∈ U and suppose there exists a > 0 and a neighborhood V of u such that
Proof 14.5 Redefining the problem with G(v) = F(v + u) − F(u) we need only consider the case in which
u = 0 and F(u) = 0. Let V be a neighborhood of 0 such that F(v) ≤ a < +∞, ∀v ∈ V . Define W = V ∩(−V ).
Choose ε ∈ (0, 1). Let v ∈ εW , thus
v
∈V (14.16)
ε
and since F is convex, we have that
v
F(v) = F (1 − ε)0 + ε ≤ (1 − ε)F(0) + εF(v/ε) ≤ εa. (14.17)
ε
300 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Also
−v
∈V. (14.18)
ε
Hence,
v (−v/ε) F(v) ε
F(0) = F +ε ≤ + F(−v/ε),
1+ε 1+ε 1+ε 1+ε
so that
F(v) ≥ (1 + ε)F(0) − εF(−v/ε) ≥ −εa. (14.19)
Therefore
|F(v)| ≤ εa, ∀v ∈ εW , (14.20)
that is, F is continuous at u = 0.
Proposition 14.3.8 Let U be a Banach space and let F : U → R̄ be a convex functional, which is finite and
continuous at u ∈ U. Under such hypotheses, ∂ F(u) =
6 0/.
Proof 14.6 Since F is convex, E pi(F) is convex. Since F is continuous at u, we have that E pi(F)0 is
non-empty. Observe that (u, F(u)) is on the boundary of E pi(F). Therefore, denoting A = E pi(F), from the
Hahn-Banach theorem there exists a closed hyperplane H which separates (u, F(u)) and A0 , where H
´
Definition 14.3.9 (Caratheodory function) Let S ⊂ Rn be an open set. We say that g : S × Rl → R is a
Caratheodory
´ function if
∀ξ ∈ Rl , x 7→ g(x, ξ ) is a measurable function,
and
a.e. in S, ξ 7→ g(x, ξ ) is a continuous function.
The proof of next results may be found in Ekeland and Temam [33].
Proposition 14.3.10 Let E and F be two Banach spaces, let S be a Borel subset of Rn , and g : S × E → F
be a Caratheodory
´ function. For each measurable function u : S → E, let G1 (u) be the measurable function
x 7→ g(x, u(x)) ∈ F.
Under such hypotheses, if G1 maps L p (S, E) on Lr (S, F) for 1 ≤ p, r < ∞, then G1 is strongly continuous.
Proposition 14.3.11 Considering the statement in the last proposition we may express G∗ : U ∗ → R̄ by
Z
G∗ (u∗ ) = g∗ (x, u∗ (x))dx, (14.29)
S
Definition 14.4.1 (Legendre transform and associated functional) Consider the function of C2 class, g :
Rn → R. Its Legendre transform, denoted by g∗L : RnL → R, is expressed by
n
g∗L (y∗ ) = ∑ x0i · y∗i − g(x0 ), (14.30)
i=1
∂ g(x0 )
y∗i = , (14.31)
∂ xi
Proposition 14.4.2 Considering the last definitions, suppose for x̂0 ∈ Rn we have
2
∂ g(x̂0 )
det 6= 0.
∂ xi ∂ x j
Let y∗0 be such that
∂ g(x̂0 )
(y0∗ )i = , ∀i ∈ {1, . . . , n}.
∂ xi
Under such hypotheses, y∗0 ∈ RnL and
∂ g −1 ∗
x0 (y∗ ) = [ ] (y )
∂x
is of C1 class in a neighborhood of y∗0 .
Moreover, for all y∗ in such a neighborhood, we have that
∂ g(x0 ) ∂ g∗ (y∗ )
y∗i = , ∀i ∈ {1, ..., n} ⇔ x0i = L ∗ , ∀i ∈ {1, ..., n}.
∂ xi ∂ yi
Proof 14.7 The proof that x0 (y∗ ) is of C1 class in a neighborhood of y∗0 results from the inverse function
theorem.
Suppose now that:
∂ g(x0 )
y∗i = , ∀ i ∈ {1, ..., n}, (14.33)
∂ xi
thus:
∂ g∗L (y∗ )
∗ ∂ g(x̄0 ) ∂ x̄0 j
= yj − + x̄0i (14.42)
∂ y∗i ∂xj ∂ y∗i
∀ i ∈ {1, ..., n}, which from (14.38) and (14.39), implies that:
∂ g∗L (y∗ )
x̄0i = = x0i , ∀ i ∈ {1, ..., n}, (14.43)
∂ y∗i
Theorem 14.4.3 Consider the functional J : U → R̄ defined as J(u) = (G ◦ Λ)(u) − hu, f iU where Λ(=
{Λi }) : U → Y (i ∈ {1, ..., n}) is a continuous linear operator and, G : Y → R is a functional that can be
expressed as G(v) = S g(v)dS, ∀v ∈ Y (here g : Rn → R is a differentiable function that admits Legendre
R
Transform denoted by g∗L : RnL → R. That is, the hypothesis mentioned at Proposition 14.4.2 are satisfied).
Under these assumptions we have:
∂ G(Λu0 )
Λ∗ − f =θ (14.47)
∂v
∂ G(Λu0 )
which, as v∗0 = ∂v implies that:
Λ∗ v∗0 − f = θ , (14.48)
and
∂ g(Λu0 )
v∗0i = . (14.49)
∂ xi
Thus from the last proposition we can write:
∂ g∗L (v∗0 )
Λi (u0 ) = , for i ∈ {1, .., n} (14.50)
∂ y∗i
which means:
∂ G∗L (v∗0 )
Λu0 = . (14.51)
∂ v∗
304 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
J(u0 ) = G(Λu0 ) − hu0 , Λ∗ v∗0 iU = G(Λu0 ) − hΛu0 , v∗0 iY = −G∗L (v∗0 ). (14.60)
then h is convex.
Proof 14.9 We have to show that given p, q ∈ Y and λ ∈ (0, 1), we have
h(λ p + (1 − λ )q) ≤ λ h(p) + (1 − λ )h(q). (14.67)
If h(p) = +∞ or h(q) = +∞ we are done. Thus let us assume h(p) < +∞ and h(q) < +∞. For each a > h(p)
there exists u ∈ U such that
h(p) ≤ φ (u, p) ≤ a, (14.68)
and, if b > h(q), there exists v ∈ U such that
h(q) ≤ φ (v, q) ≤ b. (14.69)
Thus
so that
h∗ (p∗ ) = sup {hp, p∗ iY − φ (u, p)} = φ ∗ (0, p∗ ). (14.74)
(u,p)∈U ×Y
Proposition 14.5.3 The set of solutions of the problem P ∗ (the dual problem) is identical to ∂ h∗∗ (0).
−h(p∗0 ) = sup {h0, p∗ iY − h∗ (p∗ )} ⇔ −h∗ (p∗0 ) = h∗∗ (0) ⇔ p∗0 ∈ ∂ h∗∗ (0). (14.77)
p∗ ∈Y ∗
306 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 14.5.4 Consider φ : U ×Y → R̄ convex. Assume infu∈U {φ (u, 0)} ∈ R and there exists u0 ∈ U such
that p 7→ φ (u0 , p) is finite and continuous at 0 ∈ Y , then
inf {φ (u, 0)} = sup {−φ ∗ (0, p∗ )}, (14.78)
u∈U p∗ ∈Y ∗
Proof 14.11 By hypothesis h(0) ∈ R and as was shown above, h is convex. As the function p 7→ φ (u0 , p)
is convex and continuous at 0 ∈ Y , there exists a neighborhood V of zero in Y such that
φ (u0 , p) ≤ M < +∞, ∀p ∈ V , (14.79)
for some M ∈ R. Thus, we may write
h(p) = inf {φ (u, p)} ≤ φ (u0 , p) ≤ M, ∀p ∈ V . (14.80)
u∈U
Now we apply the last results to φ (u, p) = G(Λu+ p)+F(u), where Λ : U → Y is a continuous linear operator
whose adjoint operator is denoted by Λ∗ : Y ∗ → U ∗ . We may enunciate the following theorem.
Theorem 14.5.5 Suppose U is a reflexive Banach space and define J : U → R by
J(u) = G(Λu) + F(u) = φ (u, 0), (14.82)
where lim J(u) = +∞ as kukU → ∞ and F ∈ Γ0 (U), G ∈ Γ0 (Y ). Also suppose there exists û ∈ U such that
J(û) < +∞ with the function p 7→ G(p) continuous at Λu.
ˆ Under such hypothesis, there exist u0 ∈ U and
p∗0 ∈ Y ∗ such that
Proof 14.12 The existence of solutions for the primal problem follows from the direct method of cal-
culus of variations. That is, considering a minimizing sequence, from above (coercivity hypothesis), such
a sequence is bounded and has a weakly convergent subsequence to some u0 ∈ U. Finally, from the lower
semi-continuity of primal formulation, we may conclude that u0 is a minimizer. The other conclusions follow
from Theorem 14.5.4 just observing that
φ ∗ (0, p∗ ) = sup {hp, p∗ iY −G(Λu+ p)−F(u)} = sup {hq, p∗ i−G(q)−hΛu, p∗ i−F(u)}, (14.84)
u∈U,p∈Y u∈U,q∈Y
so that
φ ∗ (0, p∗ ) = G∗ (p∗ ) + sup{−hu, Λ∗ p∗ iU − F(u)} = G∗ (p∗ ) + F ∗ (−Λ∗ p∗ ). (14.85)
u∈U
Thus,
inf {φ (u, 0)} = sup {−φ ∗ (0, p∗ )} (14.86)
u∈U p∗ ∈Y ∗
and solutions u0 and p∗0 for the primal and dual problems, respectively, imply that
Definition 14.1 Let U,Y be Banach spaces, A ⊂ U and B ⊂ Y and let L : A × B → R be a functional. We
say that (u0 , v0 ) ∈ A × B is a saddle point for L if
Proposition 14.1
Let U,Y be Banach spaces, A ⊂ U and B ⊂ Y . A functional L : U ×Y → R has a saddle point if and only if
Hence
so that
sup inf L(u, v) ≤ sup L(u, v), ∀u ∈ A,
v∈B u∈A v∈B
and hence
sup inf L(u, v) ≤ inf sup L(u, v). (14.93)
v∈B u∈A u∈A v∈B
Conversely suppose
max inf L(u, v) = min sup L(u, v).
v∈B u∈A u∈A v∈B
As above defined,
and
G(v) = inf L(u, v).
u∈A
so that
F(u0 ) = sup L(u0 , v) = inf L(u, v0 ) = G(v0 ).
v∈B u∈U
In particular
L(u0 , v0 ) ≤ sup L(u0 , v) = inf L(u, v0 ) ≤ L(u0 , v0 ).
v∈B u∈U
Therefore
Proposition 14.2
Let U,Y be Banach spaces, A ⊂ U, B ⊂ Y and let L : A × B → R be a functional. Assume there exist u0 ∈ A,
v0 ∈ B and α ∈ R such that
L(u0 , v) ≤ α, ∀v ∈ B,
Convex Analysis and Duality Theory 309
and
L(u, v0 ) ≥ α, ∀u ∈ A.
Under such hypotheses (u0 , v0 ) is a saddle point of L, that is,
L(u0 , v0 ) ≤ α,
and
L(u0 , v0 ) ≥ α,
so that
L(u0 , v) ≤ α = L(u0 , v0 ) ≤ L(u, v0 ), ∀u ∈ A, v ∈ B.
This completes the proof.
In the next lines we state and prove the min − max theorem.
Theorem 14.1
Let U,Y be reflexive Banach spaces, A ⊂ U, B ⊂ Y and let L : A × B → R be a functional.
Suppose that
1. A ⊂ U is convex, closed and non-empty.
2. B ⊂ Y is convex, closed and non-empty.
3. For each u ∈ A, Fu (v) = L(u, v) is concave and upper semi-continuous.
4. For each v ∈ B, Gv (u) = L(u, v) is convex and lower semi-continuous.
5. The set A and B are bounded.
Under such hypotheses L has at least one saddle point (u0 , v0 ) ∈ A × B such that
Proof 14.15 Fix v ∈ B. Observe that Gv (u) = L(u, v) is convex and lower semi-continuous, therefore it is
weakly lower semi-continuous on the weakly compact set A. At first we assume the additional hypothesis
that Gv (u) is strictly convex, ∀v ∈ B. Hence Gv (u) attains a unique minimum on A. We denote the optimal
u ∈ A by u(v)
Define
G(v) = min Gv (u) = min L(u, v).
u∈A u∈U
Thus,
G(v) = L(u(v), v).
The function G(v) is expressed as the minimum of a family of concave weakly upper semi-continuous
functions, and hence it is also concave and upper semi-continuous.
310 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Moreover, G(v) is bounded above on the weakly compact set B, so that there exists v0 ∈ B such that
G(v0 ) = max G(v) = max min L(u, v).
v∈B v∈B u∈A
Observe that
G(v0 ) = min L(u, v0 ) ≤ L(u, v0 ), ∀u ∈ U.
u∈A
Observe also that, from the concerned concavity, for u ∈ A, v ∈ B and λ ∈ (0, 1) we have
L(u, (1 − λ )v0 + λ v) ≥ (1 − λ )L(u, v0 ) + λ L(u, v).
In particular denote u((1 − λ )v0 + λ v) = uλ , where uλ is such that
G((1 − λ )v0 + λ v) = min L(u, (1 − λ )v0 + λ v)
u∈A
= L(uλ , (1 − λ )v0 + λ v). (14.96)
Therefore,
G(v0 ) = max G(v)
v∈B
≥ G((1 − λ )v0 + λ v)
= L(uλ , (1 − λ )v0 + λ v)
≥ (1 − λ )L(uλ , v0 ) + λ L(uλ , v)
≥ (1 − λ ) min L(u, v0 ) + λ L(uλ , v)
u∈A
= (1 − λ )G(v0 ) + λ L(uλ , v). (14.97)
From this, we obtain
G(v0 ) ≥ L(uλ , v). (14.98)
Let {λn } ⊂ (0, 1) be such that λn → 0.
Let {un } ⊂ A be such that
G((1 − λn )v0 + λn v) = min L(u, (1 − λn )v0 + λn v)
u∈A
= L(un , (1 − λn )v0 + λn v). (14.99)
Since A is weakly compact, there exists a subsequence {unk } ⊂ {un } ⊂ A and u0 ∈ A such that
unk * u0 , weakly in U, as k → ∞.
Observe that
(1 − λnk )L(unk , v0 ) + λnk L(unk , v) ≤ L(unk , (1 − λnk )v0 + λnk v)
= min L(u, (1 − λnk )v0 + λnk v)
u∈A
≤ L(u, (1 − λnk )v0 + λnk v), (14.100)
∀u ∈ A, k ∈ N.
Recalling that λnk → 0, from this and (14.100) we obtain
L(u0 , v0 ) ≤ lim inf L(unk , v0 )
k→∞
= lim inf((1 − λnk )L(unk , v0 ) + λnk L(u, v))
k→∞
≤ lim sup L(u, (1 − λnk )v0 + λnk v)
k→∞
≤ L(u, v0 ), ∀u ∈ U. (14.101)
Convex Analysis and Duality Theory 311
From these last two results and Proposition 14.2 we have that (u0 , v0 ) is a saddle point for L. Now assume
that
Gv (u) = L(u, v)
is convex but not strictly convex ∀v ∈ B.
For each n ∈ N Define Ln by
Ln (u, v) = L(u, v) + kukU /n.
In such a case
(Fv )n (u) = Ln (u, v)
is strictly convex for all n ∈ N.
From above we main obtain (un , vn ) ∈ A × B such that
Since A × B is weakly compact and {(un , vn )} ⊂ A × B, up to subsequence not relabeled, there exists
(u0 , v0 ) ∈ A × B such that
un * u0 , weakly in U,
vn * v0 , weakly in Y,
so that,
Hence,
L(u0 , v) ≤ L(u, v0 ), ∀u ∈ A, v ∈ B,
so that
L(u0 , v) ≤ L(u0 , v0 ) ≤ L(u, v0 ), ∀u ∈ A, v ∈ B.
This completes the proof.
Theorem 14.2
Let U,Y be reflexive Banach spaces, A ⊂ U, B ⊂ Y and let L : A × B → R be a functional.
Suppose that
1. A ⊂ U is convex, closed and non-empty.
2. B ⊂ Y is convex, closed and non-empty.
3. For each u ∈ A, Fu (v) = L(u, v) is concave and upper semi-continuous.
4. For each v ∈ B, Gv (u) = L(u, v) is convex and lower semi-continuous.
5. Either the set A is bounded or there exists ṽ ∈ B such that
Proof 14.16 We prove the result just for the special case such that there exists ṽ ∈ B such that
L : An × B → R.
Hence,
L(un , v) ≤ L(un , vn ) ≤ L(u, vn ), ∀u ∈ An , v ∈ B.
For a fixed ũ ∈ A1 we have
is convex, lower semi-continuous and coercive, so that it is bounded below. Thus there exists a ∈ R such that
Hence
a ≤ L(un , ṽ) ≤ L(un , vn ) ≤ b, ∀n ∈ N.
Convex Analysis and Duality Theory 313
a ≤ L(un , ṽ) ≤ b, ∀n ∈ N,
unk * u0 , weakly in U,
vnk * v0 , weakly in Y,
L(unk , vnk ) → α ∈ R,
as k → ∞. Fix (u, v) ∈ A × B. Observe that if nk > n0 = kukU , then
that is,
L(u0 , v) ≤ α ≤ L(u, v0 ), ∀u ∈ A, v ∈ B.
From this and Proposition 14.2 we may conclude that (u0 , v0 ) is a saddle point for L : A × B → R.
The proof is complete.
Theorem 14.7.2 Let r ∈ N and let uk , 1 ≤ k ≤ r be piecewise affine functions from Ω into R and {αk } such
that αk > 0, ∀k ∈ {1, ..., r} and ∑rk=1 αk = 1. Given ε > 0, there exists a locally Lipschitz function u : Ω → R
and r disjoint open sets Ωk , 1 ≤ k ≤ r, such that
r
u(x) − ∑ αk uk < ε, ∀x ∈ Ω, (14.109)
k=1
r
u(x) = ∑ αk uk (x), ∀x ∈ ∂ Ω. (14.110)
k=1
r
|u(x) − ∑ αk uk (x)| < ε, ∀x ∈ Ω (14.114)
k=1
r
u(x) = ∑ αk uk (x), ∀x ∈ ∂ Ω. (14.115)
k=1
Proof 14.17 It is sufficient to establish the result for functions uk affine over Ω, since Ω can be divided
into pieces on which uk are affine, and such pieces can be put together through (14.115). Let ε > 0 be given.
We know that simple functions are dense in L1 (Ω), concerning the L1 norm. Thus there exists a partition of
Ω into a finite number of open sets Oi , 1 ≤ i ≤ N1 and a negligible set, and there exists f¯k constant functions
over each Oi such that
Z
| f (x, ∇uk (x)) − f¯k (x)|dx < ε, 1 ≤ k ≤ r. (14.116)
Ω
Now we apply Theorem 14.7.2, to each of the open sets Oi , therefore there exists a locally Lipschitz function
u : Oi → R and there exist r open disjoints spaces Ωik , 1 ≤ k ≤ r, such that
r
u(x) − ∑ αk uk (x) ≤ δ , ∀x ∈ Oi (14.122)
k=1
r
u(x) = ∑ αk uk (x), ∀x ∈ ∂ Oi . (14.123)
k=1
From | f (x, ∇u(x))| ≤ c(x), m(Oi − ∪rk=1 Ωik ) ≤ δ and (14.118) we obtain
Z r Z Z
f (x, ∇u(x))dx − ∑ f (x, ∇uk (x)dx = f (x, ∇u(x))dx ≤ ε/N1 . (14.125)
Oi i
k=1 Ωk Oi −∪rk=1 Ωki
Considering that f¯k is constant in Oi , from (14.117), (14.118) and (14.119) we obtain
r Z Z
∑| f¯k (x)dx − αk f¯k (x)dx| < ε/N1 . (14.126)
k=1 Ωik Oi
The next result we do not prove it. It is a well known result from the finite element theory.
Proposition 14.7.4 If u ∈ W01,p (Ω) there exists a sequence {un } of piecewise affine functions over Ω, null
on ∂ Ω, such that
un → u, in L p (Ω) (14.129)
and
Proposition 14.7.5 For p such that 1 < p < ∞, suppose that f : Ω × RN → R is a Caratheodory
´ function ,
for which there exist a1 , a2 ∈ L1 (Ω) and constants c1 ≥ c2 > 0 such that
Then, given u ∈ W 1,p (Ω) piecewise affine, ε > 0 and a neighborhood V of zero in the topology
σ (L p (Ω, RN ), Lq (Ω, RN )) there exists a function v ∈ W 1,p (Ω) such that
∇v − ∇u ∈ V , (14.132)
u = v on ∂ Ω,
and
Z Z
f (x, ∇v(x))dx − f ∗∗ (x, ∇u(x))dx < ε. (14.134)
Ω Ω
Proof 14.18 Suppose given ε > 0, u ∈ W 1, p (Ω) piecewise affine continuous, and a neighborhood V of
zero, which may be expressed as
Z
V = {w ∈ L p (Ω, RN ) | hm · wdx < η, ∀m ∈ {1, ..., M}}, (14.135)
Ω
N+1
∑ αk ξk = ∇u, ∀x ∈ ∆i , (14.136)
k=1
and
N+1
∑ αk f (x, ξk ) = f ∗∗ (x, ∇u(x)). (14.137)
k=1
Convex Analysis and Duality Theory 317
Define βi = maxk∈{1,...,N+1} {|ξk | on ∆i }, and ρ1 = maxi∈{1,...,r} {βi }, and ρ = max{ρ1 , k∇uk∞ }. Now, ob-
serve that we can obtain functions ĥm ∈ C0∞ (Ω; RN ) such that
η
max kĥm − hm kLq (Ω,RN ) < . (14.138)
m∈{1,...,M } 4ρm(Ω)
We recall that ρ does not depend on ε. Furthermore, for each i ∈ {1, ..., r} there exists a compact subset
Ki ⊂ ∆i such that
Z
ε1
[a1 (x) + c1 (x) max {|ξ | p }]dx < . (14.140)
∆i −Ki |ξ |≤ρ r
Also, observe that the sets Ki may be obtained such that the restrictions of f and f ∗∗ to Ki ×ρB are continuous,
so that from this and from the compactness of ρB, for all x ∈ Ki , we can find an open ball ωx with center in x
and contained in Ω, such that
ε1
| f ∗∗ (y, ∇u(x)) − f ∗∗ (x, ∇u(x))| < , ∀y ∈ ωx ∩ Ki , (14.141)
m(Ω)
and
ε1
| f (y, ξ ) − f (x, ξ )| < , ∀y ∈ ωx ∩ Ki , ∀ξ ∈ ρB. (14.142)
m(Ω)
Therefore, from this and (14.137) we may write
N+1
2ε1
f ∗∗ (y, ∇u(x)) − ∑ αk f (y, ξk ) < , ∀y ∈ ωx ∩ Ki . (14.143)
k=1 m(Ω)
We can cover the compact set Ki with a finite number of those open balls ωx , denoted by ω j , 1 ≤ j ≤ l.
j−1
Consider the open sets ω 0j = ω j − ∪i=1 ω̄i , we have that ∪lj=1 ω̄ 0j = ∪lj=1 ω̄ j . Defining functions uk , for 1 ≤
k ≤ N + 1 such that ∇uk = ξk and u = ∑N+1 k=1 αk uk we may apply Proposition 14.7.3 to each of the open sets
ω 0j , so that we obtain functions vi ∈ W 1,p (Ω) such that
Z N+1 Z
ε1
f (x, ∇vi (x)dx − ∑ αk f (x, ξk )dx < , (14.144)
ω 0j k=1 ω 0j rl
and
Finally we set
vi = u on ∆i − ∪lj=1 ω j . (14.148)
318 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
We have that v(x) = u(x), ∀x ∈ ∂ Ω and k∇vk∞ < ρ. Also, from (14.140)
Z
ε1
| f ∗∗ (x, ∇u(x)|dx < (14.151)
∆i −Ki r
and
Z
ε1
| f (x, ∇v(x)|d x < . (14.152)
∆i −Ki r
0
ε1 ε1 m(ω j ∩ Ki )
Z Z
f (x, ∇v(x))dx − f ∗∗ (x, ∇u(x))dx ≤ + (14.153)
Ki ∩ω 0j Ki ∩ω 0j rl m(Ω)
so that
ε1 ε1 m(Ki )
Z Z
| f (x, ∇v(x))dx − f ∗∗ (x, ∇u(x))dx| ≤ + . (14.154)
Ki Ki r m(Ω)
Now summing up in i and considering (14.151) and (14.152) we obtain (14.134), that is
Z Z
| f (x, ∇v(x))dx − f ∗∗ (x, ∇u(x))dx| < 4ε1 ≤ ε. (14.155)
Ω Ω
and thus
Z Z
ĥm · (∇v(x) − ∇u(x))dx = − div(ĥm )(v(x) − u(x))dx
Ω Ω
≤ kdiv(ĥm )kLq (Ω) kv − ukL p (S)
≤ Cε1 m(Ω)1/p
η
< . (14.157)
2
Also we have that
Z
η
(ĥm − hm ) · (∇v − ∇u)dx ≤ kĥm − hm kLq (Ω,RN ) k∇v − ∇ukL p (Ω,RN ) ≤ . (14.158)
Ω 2
Thus
Z
hm · (∇v − ∇u)d x < η, ∀m ∈ {1, ..., M}. (14.159)
Ω
Convex Analysis and Duality Theory 319
Theorem 14.7.6 Assuming the hypothesis of last theorem, given a function u ∈ W01,p (Ω), given ε > 0 and a
neighborhood of zero V in σ (L p (Ω, RN ), Lq (Ω, RN )), we have that there exists a function v ∈ W01,p (Ω) such
that
∇v − ∇u ∈ V , (14.160)
and
Z Z
f (x, ∇v(x))dx − f ∗∗ (x, ∇u(x))dx < ε. (14.161)
Ω Ω
Proof 14.19 We can approximate u by a function w which is piecewise affine and null on the boundary.
Thus, there exists δ > 0 such that we can obtain w ∈ W01,p (Ω) piecewise affine such that
ku − wk1,p < δ (14.162)
so that
1
∇w − ∇u ∈ V , (14.163)
2
and
Z Z
∗∗ ε
f (x, ∇w(x))dx − f ∗∗ (x, ∇u(x))dx < . (14.164)
Ω Ω 2
Finally, from (14.163), (14.165) and from the fact the weak neighborhoods are convex, we have
∇v − ∇u ∈ V . (14.168)
To finish this chapter, we present two theorems which summarize the last results.
´ ory function from Ω × RN into R which satisfies
Theorem 14.7.7 Let f be a Caratheod
a2 (x) + c2 |ξ | p ≤ f (x, ξ ) ≤ a1 (x) + c1 |ξ | p (14.169)
where a1 , a2 ∈ L1 (Ω), 1 < p < +∞, b ≥ 0 and c1 ≥ c2 > 0. Under such assumptions, defining Û = W01,p (Ω),
we have
Z Z
∗∗
inf f (x, ∇u)dx = min f (x, ∇u)dx (14.170)
u∈Û Ω u∈Û Ω
The solutions of relaxed problem are weak cluster points in W01,p (Ω) of the minimizing sequences of primal
problem.
320 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 14.20 The existence of solutions for the convex relaxed formulation is a consequence of the reflexiv-
ity of U and coercivity hypothesis, which allows an application of the direct method of calculus of variations.
That is, considering a minimizing sequence, from above (coercivity hypothesis), such a sequence is bounded
and has a weakly convergent subsequence to some û ∈ W 1,p (Ω). Finally, from the lower semi-continuity of
relaxed formulation, we may conclude that û is a minimizer. The relation (14.170) follows from last theorem.
where a1 , a2 ∈ L1 (Ω), 1 < p < +∞, b ≥ 0 and c1 ≥ c2 > 0. Let u0 ∈ W 1, p (Ω). Under such assumptions,
defining Û = {u | u − u0 ∈ W01,p (Ω)}, we have
Z Z
inf f (x, ∇u)dx = min f ∗∗ (x, ∇u)dx (14.172)
u∈Û Ω u∈Û Ω
The solutions of relaxed problem are weak cluster points in W 1,p (Ω) of the minimizing sequences of
primal problem.
Proof 14.21 Just apply the last theorem to the integrand g(x, ξ ) = f (x, ξ + ∇u0 ). For details see [33].
u1 = u,
and having u1 , ..., un , select un+1 as specified in the next lines. First, define
On the other hand, from the definition of infimum, we may select un+1 ∈ Sn such that
1
F(un+1 ) ≤ F(un ) + inf {F(w)} . (14.173)
2 w∈Sn
Remark 14.8.2 We may introduce in U a new metric given by d1 = ε 1/2 d. We highlight that the topology
remains the same and also F remains lower semi-continuous. Under the hypotheses of the last theorem, if
there exists u ∈ U such that F(u) < infu∈U F(u) + ε, then there exists v ∈ U such that
1. d (u, v) ≤ ε 1/2 ,
2. F(v) ≤ F(u),
3. F (v) ≤ F (w) + ε 1/2 d(u, w), ∀w ∈ U.
Remark 14.8.3 Observe that, if U is a Banach space,
F(v) − F(v + tw) ≤ ε 1/2tkwkU , ∀t ∈ [0, 1], w ∈ U, (14.178)
so that, if F is Gateaux
ˆ differentiable, we obtain
−hδ F(v), wiU ≤ ε 1/2 kwkU . (14.179)
Similarly
F(v) − F(v + t (−w)) ≤ ε 1/2tkwkU ≤, ∀t ∈ [0, 1], w ∈ U, (14.180)
so that, if F is Gateaux
ˆ differentiable, we obtain
hδ F (v), wiU ≤ ε 1/2 kwkU . (14.181)
Thus
kδ F(v)kU ∗ ≤ ε 1/2 . (14.182)
We have thus obtained, from the last theorem and remarks, the following result.
Theorem 14.8.4 Let U be a Banach space. Let F : U → R be a lower semi-continuous Gateaux
ˆ differentiable
functional. Given ε > 0 suppose that u ∈ U is such that
F(u) ≤ inf {F(u)} + ε. (14.183)
u∈U
and
√
kδ J(uε )kU ∗ < ε. (14.190)
Convex Analysis and Duality Theory 323
Theorem 14.9.1 (Toland, 1979) Let U be a Banach space and let F, G : U → R be functionals such that
u∗0
Assume also ∈ ∂ F(u0 ).
Under such hypotheses,
F ∗ (u∗0 ) − G∗ (u∗0 ) = α,
so that
Thus
G(u) − F(u) ≥ α, ∀u ∈ U.
Therefore, for u∗ ∈ U ∗, we have
so that,
hu0 , u∗0 iU − G(u0 ) ≥ hu, u∗0 iU − G(u), ∀u ∈ U,
that is
Exercise 14.9.2 Let Ω ⊂ R2 be a set of Ĉ1 class. Let V = C1 (Ω) and let J : D ⊂ V → R where
Z Z
γ
J(u) = ∇u · ∇u dx − f u dx, ∀u ∈ U
2 Ω Ω
and where
D = {u ∈ V : u = 0 on ∂ Ω}.
3. Prove that
inf J(u) ≥ sup {−G∗ (v∗ ) − F ∗ (−Λ∗ v∗ )},
u∈U v∗ ∈Y ∗
where
1
Z
G(∇u) = ∇u · ∇u dx,
2 Ω
and
G∗ (v∗ ) = sup{hv, v∗ iY − G(v)},
v∈Y
where Y = Y∗ = L2 (Ω).
4. Defining Λ : U → Y by
Λu = ∇u,
and
F :D→R
as Z
F(u) = f u dx,
Ω
so that
F ∗ (−Λ∗ v∗ ) = sup{−h∇u, v∗ iY − F(u)}
u∈D
Z
= sup hu, div v∗ iY + f u dx
u∈D Ω
Z
∗
= sup (div v + f ) u dx
u∈D Ω
if div(v∗ ) + f = 0, in Ω
0,
= (14.197)
+∞, otherwise,
Hence,
Z Z
J(u + v) − J(u) = (γ/2) (∇u + ∇v) · (∇u + ∇v) dx − (γ/2) ∇u · ∇u dx
Ω Ω
Z
− (u + v − u) f dx
ZΩ Z Z
= γ ∇u · ∇v dx − f v dx + (γ/2) ∇v · ∇v dx
ZΩ ZΩ Ω
≥ γ ∇u · ∇v dx − f v dx
Ω Ω
= δ J(u; v) (14.200)
∀u ∈ D, v ∈ Va .
From this we may infer that J is convex.
From the hypotheses u0 ∈ D is such that
γ∇2 u0 + f = 0, in Ω.
Let v ∈ Va .
Therefore, we have
Z Z
δ J(u0 ; v) = γ ∇u0 · ∇v dx − f v dx
ZΩ Ω
Z
= γ ∇u0 · ∇v dx + γ ∇2 u0 v dx
Ω Ω
Z Z Z
= γ ∇u0 · ∇v dx − γ ∇u0 · ∇v dx + ∇u0 · n v ds
Ω Ω ∂Ω
= 0 (14.201)
where n denotes the unit outward field to ∂ Ω.
Summarizing, we got δ J(u0 ; v) = 0, ∀v ∈ Va .
Since J is convex,from this we may conclude that u0 minimizes J on D.
Observe now that,
and from
div v∗0 + f = 0
we get
F ∗ (−Λ∗ v∗0 ) = 0.
Hence
G(∇u0 ) − hu0 , f iL2 = −G∗ (v∗0 ) − F ∗ (−Λ∗ v∗0 ),
so that from this and (14.203) we have
Constrained Variational
Optimization
Proposition 15.1.3 Let U be a Banach space and P be a convex closed cone in U. If u ∈ U satisfies hu, u∗ iU ≥
0, ∀u∗ ≥ θ ∗ , then u ≥ θ .
Proof 15.1 We prove the contrapositive. Assume u 6∈ P. Then by the separating hyperplane theorem there
is an u∗ ∈ U ∗ such that hu, u∗ iU < hp, u∗ iU , ∀p ∈ P. Since P is a cone, given any p ∈ P we must have
h p, u∗ iU ≥ 0, otherwise we would have hu, u∗ i > hα p, u∗ iU for some α > 0. Thus u∗ ∈ P+ . Finally, since
inf p∈P {hp, u∗ iU } = 0, we obtain hu, u∗ iU < 0 which completes the proof.
Definition 15.1.4 (Convex mapping) Let U, Z be vector spaces. Let P ⊂ Z be a convex cone. A mapping
G : U → Z is said to be convex if
G(αu1 + (1 − α)u2 ) ≤ αG(u1 ) + (1 − α)G(u2 ), ∀u1 , u2 ∈ U, α ∈ [0, 1]. (15.2)
Define
Now we establish the Lagrange multiplier theorem for convex global optimization.
Theorem 15.1.6 Let U be a vector space, Z a Banach space, Ω a convex subset of U, P a positive convex
closed cone of Z. Assume that P contains an interior point. Let F be a real convex functional on Ω and G a
convex mapping from Ω into Z. Assume the existence of u1 ∈ Ω such that G(u1 ) < θ . Defining
Furthermore, if the infimum in (15.8) is attained by u0 ∈ U such that G(u0 ) ≤ θ , it is also attained in (15.9)
by the same u0 and also hG(u0 ), z∗0 iZ = 0. We refer to z∗0 as the Lagrangian Multiplier.
and
B = {(r, z) ∈ R × Z | r ≤ µ0 , z ≤ θ }, (15.11)
where µ0 = infu∈Ω {F(u) | G(u) ≤ θ }. Since F and G are convex, A and B are convex sets. It is clear that A
contains no interior point of B, and since N = −P contains an interior point , the set B contains an interior
point. Thus, from the separating hyperplane theorem, there is a non-zero element w∗0 = (r0 , z∗0 ) ∈ W ∗ such
that
From the nature of B it is clear that w∗0 ≥ θ . That is, r0 ≥ 0 and z∗0 ≥ θ . We will show that r0 > 0. The point
(µ0 , θ ) ∈ B, hence
r0 r + hz, z∗0 iZ ≥ r0 µ0 , ∀(r, z) ∈ A. (15.13)
If r0 = 0 then hG(u1 ), z∗0 iZ ≥ 0 and z∗0 =
6 θ . Since G(u1 ) < θ and z∗0 ≥ θ we have a contradiction. Therefore
r0 > 0 and, without loss of generality we may assume r0 = 1. Since the point (µ0 , θ ) is arbitrarily close to A
and B, we have
µ0 = inf {r + hz, z∗0 iZ } ≤ inf {F(u) + hG(u), z∗0 iZ } ≤ inf{F(u) | u ∈ Ω, G(u) ≤ θ } = µ0 . (15.14)
(r,z)∈A u∈Ω
Corollary 15.1.7 Let the hypothesis of the last theorem hold. Suppose
F(u0 ) = inf {F(u) | G(u) ≤ θ }. (15.17)
u∈Ω
Proof 15.5 Suppose there is a u1 ∈ Ω such that F(u1 ) < F(u0 ) and G(u1 ) ≤ G(u0 ). Thus
hG(u1 ), z∗0 iZ ≤ hG(u0 ), z∗0 iZ (15.24)
so that
F(u1 ) + hG(u1 ), z∗0 iZ < F(u0 ) + hG(u0 ), z∗0 iZ , (15.25)
which contradicts the hypothesis of the theorem.
Constrained Variational Optimization 331
Theorem 15.1.9 Let F be a convex real functional and G : Ω → Z convex and let u0 and u1 be solutions to
the problems P0 and P1 respectively, where
and
Suppose z∗0 and z∗1 are the Lagrange multipliers related to these problems. Then
15.2 Duality
Consider the basic convex programming problem:
where
and
Γ = {z ∈ Z | G(u) ≤ z f or some u ∈ Ω}.
so that
Theorem 15.2.2 (Lagrange duality) Consider F : Ω ⊂ U → R a convex functional, Ω a convex set, and
G : U → Z a convex mapping. Suppose there exists a u1 such that G(u1 ) < θ and that infu∈Ω {F(u) | G(u) ≤
θ } < ∞, with such order related to a convex closed cone in Z. Under such assumptions, we have
If the infimum on the left side in (15.38) is achieved at some u0 ∈ U and the max on the right side at z∗0 ∈ Z ∗ ,
then
or
ϕ(z∗ ) ≤ µ0 . (15.41)
In this section we develop a new and simpler proof of the Lagrange multiplier theorem in a Banach space
context. In particular, we address the problem of minimizing a functional F : U → R subject to G(u) = θ ,
where θ denotes the zero vector and G : U → Z is a Frechet ´ differentiable transformation. Here U, Z are
Banach spaces. General results in Banach spaces may be found in [2, 34], for example. For the theorem in
question, among others we would cite [57, 54, 19]. Specially the proof given in [57] is made through the
generalized inverse function theorem. We emphasize such a proof is extensive and requires the continuous
´
Frechet differentiability of F and G. Our approach here is different and the results are obtained through other
hypotheses.
The main result is summarized by the following theorem.
Theorem 15.3.2 Let U and Z be Banach spaces. Assume u0 is a local minimum of F(u) subject to G(u) = θ ,
where F : U → R is a Gateaux
ˆ differentiable functional and G : U → Z is a Frec
´ het differentiable transfor-
mation such that G0 (u0 ) maps U onto Z. Finally, assume there exist α > 0 and K > 0 such that if kϕkU < α
then,
kG0 (u0 + ϕ) − G0 (u0 )k ≤ KkϕkU .
Constrained Variational Optimization 333
that is,
hϕ, F 0 (u0 )iU + hG0 (u0 )ϕ, z∗0 iZ = 0, ∀ϕ ∈ U.
Proof 15.9 First observe that there is no loss of generality in assuming 0 < α < 1. Also from the general-
ized mean value inequality and our hypothesis, if kϕkU < α, then
that is,
A(ū) = G0 (u0 ) · u,
where ū = {u + v | v ∈ L0 }.
Since G0 (u0 ) is onto, so is A, so that by the inverse mapping theorem A has a continuous inverse A−1 .
Let ϕ ∈ U be such that G0 (u0 ) · ϕ = θ . For a given t such that 0 < |t| < 1+kαϕ kU , let ψ0 ∈ U be such that
H(tϕ)
G0 (u0 ) · ψ0 + = θ,
t2
Observe that, from (15.42),
kH(tϕ)k ≤ Kt 2 kϕkU2 ,
and thus from the boundedness of A−1 , kψ0 k as a function of t may be chosen uniformly bounded relating t
(that is, despite the fact that ψ0 may vary with t, there exists K1 > 0 such that kψ0 kU < K1 , ∀t such that 0 <
α
|t| < 1+kϕ kU ).
Now choose 0 < r < 1/4 and define g0 = θ .
Also define
r
ε= −1
.
4(kA k + 1)(K + 1)(K1 + 1)(kϕkU + 1)
Since from the hypotheses G0 (u) is continuous at u0 , we may choose 0 < δ < α such that if kvkU < δ then
H (tϕ + t 2 ψ)
G0 (u0 ) · ψ + = θ.
t2
Define
H(tϕ + t 2 (ψ0 − g0 ))
L1 = A−1 G0 (u0 ) · (ψ0 − g0 ) + ,
t2
so that
H (tϕ + t 2 (ψ0 − g0 ))
L1 = A−1 [A(ψ0 − g0 )] + A−1
t2
= ψ0 − g0 + w1
= ψ0 + w1
= {ψ0 + w1 + v | v ∈ L0 }.
So we have that
H(tϕ) H(tϕ + t 2 (ψ0 − g0 ))
−1
L1 = A − 2 + . (15.43)
t t2
However
Observe that
H (tϕ + t 2 (ψ0 − gn−1 ))
Ln = A−1 G0 (u0 ) · (ψ0 − gn−1 ) + + Ln−1
t2
2
−1 −1 H(t ϕ + t (ψ0 − gn−1 ))
= A A(ψ0 − gn−1 ) + A + gn−1
t2
H (t ϕ + t 2 (ψ0 − gn−1 ))
= ψ0 − gn−1 + A−1 + gn−1
t2
H (tϕ + t 2 (ψ0 − gn−1 ))
= ψ0 + A−1
t2
= {ψ0 + wn + v | v ∈ L0 }.
Thus
H (tϕ + t 2 (ψ0 − gn−1 ))
Ln = A−1 G0 (u0 ) · (ψ0 − gn−1 ) + + G0
(u 0 ) · gn−1 .
t2
336 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
By analogy
Observe that
kA−1 k
kLn − Ln−1 k ≤ kH(tϕ + t 2 (ψ0 − gn−1 )) − H (t ϕ + t 2 (ψ0 − gn−2 ))k
t2
< εkA−1 kkgn−1 − gn−2 kU
< (r/4)kgn−1 − gn−2 kU
1
< kgn−1 − gn−2 kU . (15.48)
4
Thus,
1
kgn − gn−1 kU ≤ 2kLn − Ln−1 k < kgn−1 − gn−2 kU .
2
Finally
gn → g, in norm, as n → ∞.
Hence
Ln → L = ḡ, in norm, as n → ∞,
Constrained Variational Optimization 337
so that,
H (t ϕ + t 2 (ψ0 − g))
−1 0
Ln − Ln−1 → L − L = θ = A G (u0 ) · (ψ0 − g) + .
t2
Since A−1 is a bijection, denoting ψ̃0 = (ψ0 − g), we get
H(tϕ + t 2 (ψ˜0 ))
G0 (u0 ) · ψ˜0 + =θ
t2
Clarifying the dependence on t we denote ψ˜0 = ψ̃0 (t) where as above mentioned, t ∈ R is such that
δ
0 < |t| < .
2(1 + kϕkU + K1 )
Therefore
G(u0 + tϕ + t 2 ψ̃0 (t)) = θ .
Observe also that kt 2 ψ̃0 (t )kU = kt 2 (ψ0 (t) − g)kU ≤ t 2 (K1 + r) ≤ t 2 (K1 + 1) so that t 2 ψ̃0 (t) → θ as t → 0.
Thus, by defining t 2 ψ̃0 (t)|t=0 = θ (observe that in principle such a function would not be defined at t = 0),
we obtain
d (t 2 ψ̃0 (t))
2
t ψ̃0 (t) − θ
|t=0 = lim = θ,
dt t→0 t
considering that
ktψ̃0 (t)kU ≤ |t|(K1 + 1) → 0, as t → 0.
Finally, defining
φ (t ) = F(u0 + tϕ + t 2 ψ̃0 (t)),
from the hypotheses we have that there exists a suitable t˜2 > 0 such that
Since R[G0 (u0 )] is closed, we get F 0 (u0 ) ∈ R([G0 (u0 )]∗ ), that is, there exists z∗0 ∈ Z ∗ such that
it is clear that (locally) minimizing F̃(u, v) subject to G(u, v) = (θ , θ ) is equivalent to the original problem.
We clarify the domain of F˜ is denoted by U ×Y , where
Therefore, if u0 is a local minimum for the original constrained problem, then for an appropriate and
easily defined v0 we have that (u0 , v0 ) is a point of local minimum for the extended constrained one, so that by
the last theorem there exists a Lagrange multiplier z∗0 = (z∗1 , z2∗ ) ∈ [Lq (Ω)]m1 ×[Lq (Ω)]m2 where 1/p+1/q = 1
and
F̃ 0 (u0 , v0 ) + [G0 (u0 , v0 )]∗ (z∗0 ) = (θ , θ ),
that is,
F 0 (u0 ) + [G01 (u0 )]∗ (z∗1 ) + [G02 (u0 )]∗ (z∗2 ) = θ , (15.50)
and
(z∗2 )i v0i = θ , ∀i ∈ {1, ..., m2 }.
In particular for almost all x ∈ Ω, if x is such that v0i (x)2 > 0 then z∗2i (x) = 0, and if v0i (x) = 0 then
(G2 )i (u0 (x)) = 0, so that (z∗2 )i (G2 )i (u0 ) = 0, a.e. in Ω, ∀i ∈ {1, ..., m2 }.
Furthermore, consider the problem of minimizing F1 (v) = F̃(u0 , v) = F(u0 ) subject {G2i (u0 ) + v2i } = θ .
From above such a local minimum is attained at v0 . Thus, from the stationarity of F1 (v) + hz∗2 , {(G2 )i (u0 ) +
vi2 }i[L p (Ω)]m2 at v0 and the standard necessary conditions for the case of convex (in fact quadratic) constraints
we get (z∗2 )i ≥ 0 a.e. in Ω, ∀i ∈ {1, ..., m2 }, that is, z∗2 ≥ θ .
Summarizing, for the order in question the first order necessary optimality conditions are given by
(15.50), z∗2 ≥ θ and (z∗2 )i (G2 )i (u0 ) = θ , ∀i ∈ {1, ..., m2 } (so that hz∗2 , G2 (u0 )i[L p (Ω)]m2 = 0), G1 (u0 ) = θ , and
G2 (u0 ) ≤ θ .
Remark 15.4.1 For the case U = Rn and Rmk replacing [L p (Ω)]mk , for k ∈ {1, 2} the conditions (z∗2 )i vi = θ
means that for the constraints not active (for example vi 6= 0 ) the corresponding coordinate (z∗2 )i of the
Lagrange multiplier is 0. If vi = 0 then (G2 )i (u0 ) = 0, so that in any case (z∗2 )i (G2 )i (u0 ) = 0.
Summarizing, for this last mentioned case we have obtained the standard necessary optimality condi-
tions: (z∗2 )i ≥ 0, and (z∗2 )i (G2 )i (u0 ) = 0, ∀i ∈ {1, ..., m2 }.
15.5 The Lagrange multiplier theorem for equality and inequality con-
straints
In this section we develop more rigorous results concerning the Lagrange multiplier theorem for the case
involving equalities and inequalities.
Theorem 15.1
Let U, Z1 , Z2 be Banach spaces. Consider a cone C in Z2 as above specified and such that if z1 ≤ θ and z2 < θ
then z1 + z2 < θ , where z ≤ θ means that z ∈ −C and z < θ means that z ∈ (−C)◦ . The concerned order is
Constrained Variational Optimization 339
supposed to be also that if z < θ , z∗ ≥ θ ∗ and z∗ 6= θ then hz, z∗ iZ2 < 0. Furthermore, assume u0 ∈ U is a
point of local minimum for F : U → R subject to G1 (u) = θ and G2 (u) ≤ θ , where G1 : U → Z1 , G2 : U → Z2
and F are Frec ´ het differentiable at u0 ∈ U. Suppose also G01 (u0 ) is onto and that there exist α > 0, K > 0
such that if kϕkU < α then
kG01 (u0 + ϕ) − G01 (u0 )k ≤ KkϕkU .
Finally, suppose there exists ϕ0 ∈ U such that
G01 (u0 ) · ϕ0 = θ
and
G02 (u0 ) · ϕ0 < θ .
Under such hypotheses, there exists a Lagrange multiplier z∗0 = (z∗1 , z∗2 ) ∈ Z1∗ × Z2∗ such that
z∗2 ≥ θ ∗ ,
and
hG2 (u0 ), z∗2 iZ2 = 0.
and
kψ0 (t)kU < K1 , ∀|t| < ε.
Observe that
G02 (u0 ) · ϕα
= αG02 (u0 ) · ϕ0 + (1 − α)G02 (u0 ) · ϕ
= αG02 (u0 ) · ϕ0 + (1 − α)(v − λ G2 (u0 ))
= αG02 (u0 ) · ϕ0 + (1 − α)v − (1 − α)λ G2 (u0 ))
= v0 − λ0 G2 (u0 ), (15.51)
where,
λ0 = (1 − α)λ ,
and
v0 = αG02 (u0 ) · ϕ0 + (1 − α)v < θ .
Hence, for t > 0
where
kr(t)k
lim = 0.
t→0+ t
Therefore from (15.51) we obtain
where
kr1 (t)k
lim = 0.
t→0+ t
Observe that there exists ε1 > 0 such that if 0 < t < ε1 < ε, then
r1 (t)
v0 + < θ,
t
and
G2 (u0 ) − tλ0 G2 (u0 ) = (1 − tλ0 )G2 (u0 ) ≤ θ .
Hence
G2 (u0 + tϕα + t 2 ψ0 (t)) < θ , if 0 < t < ε1 .
From this there exists 0 < ε2 < ε1 such that
where
|r2 (t)|
lim = 0.
t→0+ t
Dividing the last inequality by t > 0 we get
Letting t → 0+ we obtain
hϕα , F 0 (u0 )iU ≥ 0.
Letting α → 0+ , we get
hϕ, F 0 (u0 )iU ≥ 0,
if
G01 (u0 ) · ϕ = θ ,
and
G02 (u0 ) · ϕ = v − λ G2 (u0 ),
for some v ≤ θ and λ ≥ 0. Define
∀ ϕ ∈ U, r ≥ 0, v ≤ θ , λ ≥ 0. Suppose β = 0. Fixing all variable except v we get z∗2 ≥ θ . Thus, for ϕ = cϕ0
with arbitrary c ∈ R, v = θ , λ = 0, if z∗2 =
6 θ , then hG02 (u0 ) · ϕ0 , z∗2 iZ2 < 0, so that we get z∗2 = θ . Since G01 (u0 )
is onto, a similar reasoning lead us to z∗1 = θ , which contradicts (β , z∗1 , z∗2 ) = 6 (0, θ , θ ).
Hence, β =6 0, and fixing all variables except r we obtain β > 0. There is no loss of generality in assuming
β = 1.
Again fixing all variables except v, we obtain z∗2 ≥ θ . Fixing all variables except λ , since G2 (u0 ) ≤ θ we
get
hG2 (u0 ), z∗2 iZ2 = 0.
Finally, for r = 0, v = θ , λ = 0, we get
hϕ, F 0 (u0 )iU + hG01 (u0 )ϕ, z∗1 iZ1 + hG02 (u0 ) · ϕ, z∗2 iZ2 ≥ 0, ∀ϕ ∈ U,
that is, since obviously such an inequality is valid also for −ϕ, ∀ϕ ∈ U, we obtain
hϕ, F 0 (u0 )iU + hϕ, [G01 (u0 )]∗ (z∗1 )iU + hϕ, [G02 (u0 )]∗ (z∗2 )iU = 0, ∀ϕ ∈ U,
so that
F 0 (u0 ) + [G01 (u0 )]∗ (z∗1 ) + [G02 (u0 )]∗ (z∗2 ) = θ .
The proof is complete.
342 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Theorem 15.2
Let U, Z1 , Z2 be Banach spaces. Consider a cone C in Z2 as above specified and such that if z1 ≤ θ and z2 < θ
then z1 + z2 < θ , where z ≤ θ means that z ∈ −C and z < θ means that z ∈ (−C)◦ . The concerned order is
supposed to be also that if z < θ , z∗ ≥ θ ∗ and z∗ 6= θ then hz, z∗ iZ2 < 0. Furthermore, assume u0 ∈ U is a point
of local minimum for F : U → R subject to G1 (u) = θ and G2 (u0 ) ≤ θ , where G1 : U → Z1 , G2 : U → (Z2 )k
and F are twice Frec ´ het differentiable at u0 ∈ U. Assume G2 (u) = {(G2 )i (u)} where (G2 )i : U → Z2 , ∀i ∈
{1, ..., k} and define
A = {i ∈ {1, ..., k} : (G2 )i (u0 ) = θ },
and also suppose that (G2 )i (u0 ) < θ , if i 6∈ A. Moreover, suppose {G01 (u0 ), {(G2 )0i (u0 )}i∈A } is onto and that
there exist α > 0, K > 0 such that if kϕkU < α then
where
G̃(u) = {G1 (u), {(G2 )i (u)}i∈A }.
Finally, suppose there exists ϕ0 ∈ U such that
G01 (u0 ) · ϕ0 = θ
and
G02 (u0 ) · ϕ0 < θ .
Under such hypotheses, there exists a Lagrange multiplier z∗0 = (z∗1 , z∗2 ) ∈ Z1∗ × (Z2∗ )k such that
Hence the point u0 ∈ U is a local minimum for F(u) under the constraints
From the last Theorem 15.1 for such an optimization problem there exists a Lagrange multiplier
(z∗1 , {(z∗2 )i∈A }) such that (z∗2 )i ≥ θ ∗ , ∀i ∈ A, and
F 0 (u0 ) + [G01 (u0 )]∗ (z∗1 ) + ∑ [(G2 )0i (u0 )]∗ ((z∗2 )i ) = θ . (15.55)
i∈A
The choice (z∗2 )i = θ , if i 6∈ A leads to the existence of a Lagrange multiplier (z∗1 , z∗2 ) =
(z∗1 , {(z∗2 )i∈A , (z∗2 )i6∈A }) such that
z∗2 ≥ θk∗
and
h(G2 )i (u0 ), (z∗2 )i iZ = 0, ∀i ∈ {1, ..., k}.
Let ϕ ∈ V0 , that is, ϕ ∈ U,
G01 (u0 )ϕ = θ
and
(G2 )0i (u0 ) · ϕ = θ , ∀i ∈ A.
Recall that G̃(u) = {G1 (u), (G2 )i∈A (u)} and therefore, similarly as in the proof of the Lagrange multiplier
theorem 15.3.2, we may obtain ψ0 (t), K > 0 and ε > 0 such that
(G2 )i (u0 + tϕ + t 2 ψ0 (t)) = (G2 )i (u0 ) + G0i (u0 ) · (tϕ + t 2 ψ0 (t)) + r(t),
where
kr(t)k
lim = 0,
t →0 t
that is,
(G2 )i (u0 + tϕ + t 2 ψ0 (t)) = (G2 )i (u0 ) + t(G2 )0i (u0 ) · ϕ + r1 (t),
where,
kr1 (t )k
lim = 0,
t→0 t
and hence there exists, 0 < ε1 < ε, such that
Therefore, since u0 is a point of local minimum under the constraint G(u) ≤ θ , there exists 0 < ε2 < ε1 ,
such that
F(u0 + tϕ + t 2 ψ0 (t)) − F(u0 ) ≥ 0, ∀|t| < ε2 ,
344 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that,
Remark 15.7.2 Observe that if kT 0 (u)kU ≤ α < 1, on a convex set C then T is a contraction mapping, since
by the mean value inequality,
Theorem 15.7.3 (Contraction mapping theorem) Let C be a closed subset of a Banach space U. Assume
T is contraction mapping on C, then there exists a unique u0 ∈ C such that u0 = T (u0 ). Moreover, for an
arbitrary u1 ∈ C defining the sequence
we have
un → u0 , in norm, as n → +∞.
un+1 = T (un ), ∀n ∈ N.
kun+p − un kU
= kun+p − un+p−1 + un+p−1 − un+p−2 + ... − un+1 + un+1 − un kU
≤ kun+p − un+p−1 kU + kun+p−1 − un+p−2 kU + ... + kun+1 − un kU
≤ (α n+ p−2 + α n+p−3 + ... + α n−1 )ku2 − u1 kU
≤ α n−1 (α p−1 + α p−2 + ... + α 0 )ku2 − u1 kU
!
∞
≤ α n−1 ∑ αk ku2 − u1 kU
k=0
α n−1
≤ ku2 − u1 kU (15.57)
1−α
Denoting n + p = m, we obtain
α n−1
kum − un kU ≤ ku2 − u1 kU , ∀m > n ∈ N.
1−α
Let ε > 0. Since 0 ≤ α < 1 there exists n0 ∈ N such that if n > n0 then
α n−1
0≤ ku2 − u1 kU < ε,
1−α
so that
kum − un kU < ε, if m > n > n0 .
From this we may infer that {un } is a Cauchy sequence, and since U is a Banach space, there exists
u0 ∈ U such that
un → u0 , in norm, as n → ∞.
346 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Observe that
Hence,
F(x0 , u0 ) = 0,
Under such hypotheses, there exist 0 < ε1 < r/2 and 0 < ε2 < 1 such that for each u ∈ Bε1 (u0 ), there
exists x ∈ Bε2 (x0 ) such that
F(x, u) = 0,
where we denote x = x(u) so that
F(x(u), u) = 0.
Constrained Variational Optimization 347
Moreover, there exists δ > 0 such that 0 < δ ρ < 1, such that for each u, v ∈ Bε1 (u0 ) we have
ρ 2δ
kx(u) − x(v)k ≤ kF(x(v), u) − F(x(v), v)k.
1 − ρδ
Finally, if there exists K > 0 such that
so that
kF(x, u) − F(x, v)k ≤ Kku − vk, ∀(x, u) ∈ Bε2 (x0 ) × Bε1 (u0 ),
then
kx(u) − x(v)k ≤ K1 ku − vk,
where
ρ 2δ
K1 = K .
1−δρ
Proof 15.13 Let 0 < ε < r/2. Choose δ > 0 such that
0 < ρδ < 1.
Define
T (x) = F(x0 , u0 ) + Fx (x0 , u0 )(x − x0 ) = Fx (x0 , u0 )(x − x0 ),
and
h(x, u) = F(x0 , u0 ) + Fx (x0 , u0 )(x − x0 ) − F(x, u).
Choose 0 < ε3 < r/2 and 0 < ε2 < 1 such that
δ
kFx (x, u) − Fx (x0 , u0 )k < .
2
Select 0 < ε1 < ε3 such that if u ∈ Bε1 (u0 ), then
Observe that since 0 < ρδ < 1 we have that φu is a candidate to be a contraction mapping.
Observe also that
x0 = T −1 (0),
so that
T (x0 ) = 0 = h(x0 , u0 )
and
x0 = T −1 (h(x0 , u0 )).
Thus,
x(u) = φu (x(u))
= T −1 (h(x(u), u))
= T −1 (Fx (x0 , u0 )(x(u) − x0 ) − F(x(u, u)))
= [Fx (x0 , u0 )]−1 [Fx (x0 , u0 )(x(u) − x0 ) − F(x(u), u)] + x0
= x(u) − x0 + x0 − [Fx (x0 , u0 )]−1 (F(x(u), u))
= x(u) − [Fx (x0 , u0 )]−1 (F(x(u), u)). (15.63)
Letting n → ∞, we obtain
ρδ
kx(u) − xk ≤ kφu (x) − xk.
1 − ρδ
In particular for v ∈ Bε1 (u0 ) and x = x(v), we get
ρδ
kx(u) − x(v)k ≤ kφu (x(v)) − x(v)k
1 − ρδ
ρδ
= kφu (x(v)) − φv (x(v))k
1 − ρδ
ρ 2δ
≤ kh(x(v), u) − h(x(v), v)k
1 − ρδ
ρ 2δ
≤ kF(x(v), u) − F(x(v), v)k
1 − ρδ
ρ 2δ
≤ K ku − vk
1 − ρδ
= K1 ku − vk, ∀u, v ∈ Bε1 (u0 ). (15.65)
Corollary 15.9.2 Consider the hypotheses and statements of the last theorem. Moreover, assume Fx : V ×
U → W is such that [Fx (x, u)]−1 exists and it is bounded in Br (x0 , u0 ).
´ het differentiable in Br (x0 , u0 ).
Suppose also, F is Frec
Let ϕ ∈ U.
Under such hypotheses,
where
x(u + tϕ) − x(u)
x0 (u, ϕ) = lim .
t →0 t
since
kx(u + tϕ) − x(u)k
kr(u, ϕ,t)k ≤ kW (u, ϕ,t)k + kϕk
t
≤ kW (u, ϕ,t)k(K1 kϕ k + kϕ k)
→ 0, as t → 0. (15.68)
Summarizing,
15.9.1 ˆ
The main results about Gateaux differentiability
Again let V,U be Banach spaces and let F : V ×U → R be a functional. Fix u ∈ U and consider the problem
of minimizing F(x, u) subject to G(x, u) ≤ θ and H(x, u) = θ . Here the order and remaining details on the
primal formulation are the same as those indicated in section 15.4.
Hence, for the specific case in which
G : V ×U → [L p (Ω)]m1
and
H : V ×U → [L p (Ω)]m2 ,
Constrained Variational Optimization 351
(the cases in which the co-domains of G and H are Rm1 and Rm2 respectively are dealt similarly) we redefine
the concerned optimization problem, again for a fixed u ∈ U, by minimizing F(x, u) subject to
and
H(x, u) = θ .
At this point we assume F (x, u), G̃(x, u, v) = {Gi (x, u) + v2i } ≡ G(u) + v2 (from now on we use this
general notation) and H(x, u) satisfy the hypotheses of the Lagrange multiplier theorem 15.3.2.
Hence, for the fixed u ∈ U we assume there exists an optimal x ∈ V which locally minimizes F(x, u)
under the mentioned constraints.
From Theorem 15.3.2 there exist Lagrange multipliers λ1 , λ2 such that denoting [L p (Ω)]m1 and [L p (Ω)]m2
simply by L p , and defining
G(x, u) + v2 = θ , (15.71)
λ1 · v = θ , (15.72)
λ1 ≥ θ , (15.73)
H(x, u) = θ . (15.74)
Clarifying the dependence on u, we denote the solution x, λ1 , λ2 , v by x(u), λ1 (u), λ2 (u), v(u), respec-
tively. In particular, we assume that for a u0 ∈ U, x(u0 ), λ1 (u0 ), λ2 (u0 ), v(u0 ) satisfy the hypotheses of
the implicit function theorem. Thus, for any u in an appropriate neighborhood of u0 , the corresponding
x(u), λ1 (u), λ2 (u), v(u) are uniquely defined.
We emphasize that from now on the main focus of our analysis is to evaluate variations of the optimal
x(u), λ1 (u), λ2 (u), v(u) with variations of u in a neighborhood of u0 .
For such an analysis, the main tool is the implicit function theorem and its main hypothesis is satisfied
´
through the invertibility of the matrix of Frechet second derivatives.
Hence, denoting, x0 = x(u0 ), (λ1 )0 = λ1 (u0 ), (λ2 )0 = λ2 (u0 ), v0 = v(u0 ), and
A2 = G(x0 , u0 ) + v20
A3 = H(x0 , u0 ),
A4 = (λ1 )0 · v0 ,
we reiterate to assume that
A1 = θ , A2 = θ , A3 = θ , A4 = θ ,
and M −1 to represent a bounded linear operator, where
(A1 )x (A1 )λ1 (A1 )λ2 (A1 )v
(A2 )x (A2 )λ (A2 )λ2 (A2 )v
M= 1 (15.75)
(A3 )x (A3 )λ1 (A3 )λ2 (A3 )v
(A4 )x (A4 )λ1 (A4 )λ2 (A4 )v
352 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where
A = Fxx (x0 , u0 ) + (λ1 )0 · Gxx (x0 , u0 ) + (λ2 )0 · Hxx (x0 , u0 ).
Moreover, also from the implicit function theorem,
k(x(u), λ1 (u), λ2 (u), v(u)) − (x(u0 ), λ1 (u0 ), λ2 (u0 ), v(u0 ))k ≤ Kku − u0 k, (15.77)
for some appropriate K > 0, ∀u ∈ Br (u0 ), for some r > 0.
We highlight to have denoted λ (u) = (λ1 (u), λ2 (u)).
Let ϕ ∈ [C∞ (Ω)]k ∩U, where k depends on the vectorial expression of U.
ˆ
At this point we will be concerned with the following Gateaux variation evaluation
Observe that
and
F̃x (x(u0 + εϕ), u0 + εϕ, λ (u0 + εϕ), v(u0 + εϕ)) = θ .
On the other hand,
F̃(x(u0 ), u0 + εϕ, λ (u0 + εϕ), v(u0 + εϕ))
ε
F̃(x(u0 ), u0 , λ (u0 ), v(u0 ))
−
ε
F̃(x(u0 ), u0 + εϕ, λ (u0 + εϕ), v(u0 + εϕ))
=
ε
F̃(x(u0 ), u0 + εϕ, λ (u0 ), v(u0 ))
−
ε
Also,
λ1 (u0 )v(u0 ) = θ ,
λ1 (u0 )v(u0 + εϕ) → θ , as ε → 0,
and
λ1 (u0 )(v(u0 + εϕ)2 − v(u0 )2 )
ε
λ1 (u0 )(v(u0 + εϕ) + v(u0 ))(v(u0 + εϕ) − v(u0 ))
=
ε
kλ1 (u0 )(v(u0 + εϕ) + v(u0 ))kKkϕkε
≤
ε
→ 0, as ε → 0. (15.79)
Finally,
and
H(x(u0 ), u0 + εϕ) → θ , as ε → 0.
From these last results, we get
In the last lines we have proven the following corollary of the implicit function theorem,
Corollary 15.1
Suppose (x0 , u0 , (λ1 )0 , (λ2 )0 , v0 ) is a solution of the system (15.70), (15.71),(15.72), (15.74), and assume the
corresponding hypotheses of the implicit function theorem are satisfied. Also assume F̃(x, u, λ1 , λ2 , v) is such
´ het second derivative F̃xx (x, u, λ1 , λ2 ) is continuous in a neighborhood of
that the Frec
we have
16.1 Introduction
In this short communication we develop sufficient conditions of local optimality for a relatively large class
of problems in the calculus of variations. The concerning approach is developed through a generalization of
some theoretical results about central fields presented in [79]. We address both the scalar and vectorial cases
for a domain in Rn . Finally the Weierstrass Excess function has a fundamental role in the formal proofs of
the main results.
16.2 Central fields for the scalar case in the calculus of variations
Let Ω ⊂ Rn be an open, bounded, simply connected set with a regular (Lipschitzian) boundary denoted by
∂ Ω.
Let f ∈ C1 (Ω × R × Rn ) ( f (x, y, z)) and V = C1 (Ω). We suppose f is convex in z, which we denote by
f (x, y, z) be convex.
Choose x̃1 , . . . , x̃n , ỹ1 , . . . , ỹn ∈ R such that if x = (x1 , . . . , xn ) ∈ Ω, then
and where
D = {y ∈ V : y = y0 in ∂ Ω}.
On Central Fields in the Calculus of Variations 357
Assume there exists a family of functions F , such that for each (x, y) ∈ D1 ⊂ Rn+1 , there exists a unique
stationary function y3 ∈ V1 = C1 (B0 ) for f in F , such that
F1 = {φ (t, Λ(x, y)) stationary for f such that φ (x, Λ(x, y)) = y for a Λ ∈ Br (0) ⊂ R}
φ ((t1 , . . . ,tk = x̃k , . . . ,tn ), Λ(x, y)) = y0 (t1 , . . . ,tk = x̃k , . . . ,tn ), on ∂ B0
and
∇φ ((t1 , . . . ,tk = x̃k , . . . ,tn ), Λ(x, y)) · n = Λ ∈ R, on ∂ B0 , ∀k ∈ {1, . . . , n},
where n denotes the outward normal field to ∂ B0 so that, clarifying the dependence of Λ on (x, y), we have
denoted Λ = Λ(x, y).
Define the field θ : D1 → Rn by
θ (x, y) = ∇y3 (x),
where as above indicated y3 is such that
(x, y) = (x, y3 (x)),
so that
θ (x, y3 (x)) = ∇y3 (x), ∀x ∈ B0 .
At this point we assume the hypotheses of the implicit function theorem so that φ (x, Λ(x, y)) is of C1 class
and θ (x, y) is continuous (in fact, since φ is stationary, the partial derivatives of θ (x, y) are well defined).
Define also
n
h(x, y) = f (x, y, θ (x, y)) − ∑ fz j (x, y, θ (x, y))θ j (x, y)
j=1
and
Pj (x, y) = fz j (x, y, θ (x, y)), ∀ j ∈ {1, . . . , n}.
Observe that
n
hy (x, y3 (x)) = fy (x, y3 (x), θ (x, y3 (x))) + ∑ fz j (x, y3 (x), θ (x, y3 (x)))(θ j )y (x, y3 (x))
j=1
n
− ∑ {(Pj )y (x, y3 (x))θ j (x, y3 (x)) + Pj (x, y3 (x))(θ j )y (x, y3 (x))}
j=1
n
= fy (x, y3 (x), θ (x, y3 (x))) + ∑ Pj (x, y3 (x))(θ j )y (x, y3 (x))
j=1
n
− ∑ {(Pj )y (x, y3 (x))θ j (x, y3 (x)) + Pj (x, y3 (x))(θ j )y (x, y3 (x))}
j=1
n
= fy (x, y3 (x), θ (x, y3 (x))) − ∑ (Pj )y (x, y3 (x))θ j (x, y3 (x)). (16.1)
j=1
Thus,
n
hy (x, y) = fy (x, y, θ (x, y)) − ∑ (Pj )y (x, y)θ j (x, y), ∀(x, y) ∈ D1 .
j=1
358 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Therefore,
n
∂ Pj (x, y)
hy (x, y) = ∑ , ∀(x, y) ∈ D1
j=1 ∂xj
so that
n
∂ H j (x, y)
h(x, y) = ∑ +W (x),
j=1 ∂ xk
for some W : B0 → R.
Hence, assuming D1 contains an open which contains C0 , where C0 = {(x, y0 (x)) : x ∈ Ω}, for y ∈ D
sufficiently close to y0 in L∞ norm, the generalized Hilbert integral, denoted by I(y), will be defined by
n
∂ y(x)
Z
I(y) = h(x, y(x)) dx + ∑ Pj (x, y(x)) dx
Ω j=1 ∂xj
n
∂ H j (x, y(x)) ∂ y(x)
Z Z
= ∑ + (H j )y (x, y(x)) dx + W (x) dx
Ω j=1 ∂xj ∂xj Ω
Z n
dH j (x, y(x))
Z
= ∑ dx j dx + W (x) dx
Ω j=1 Ω
Z n Z
=
∂ Ω j=1
∑ (−1) j+1 H j (x, y(x))dx1 ∧ · · · ∧ dcx j ∧ · · · ∧ dxn + Ω
W (x) dx
= W1 (y|∂ Ω )
= W1 ((y0 )|∂ Ω ), (16.3)
On Central Fields in the Calculus of Variations 359
Therefore,
Z
F(y) − F(y0 ) = [ f (x, y(x), ∇y(x)) − f (x, y(x), θ (x, y))] dx
Ω
n Z
−∑ fz j (x, y(x), θ (x, y(x))(yx j (x) − θ j (x, y(x))) dx
j=1 Ω
Z
= E (x, y(x), θ (x, y(x)), ∇y(x)) dx, (16.5)
Ω
where
Theorem 16.2.1 Let Ω ⊂ Rn be an open, bounded and simply connected set, with a regular (Lipschitzian)
boundary denoted by ∂ Ω. Let V = C1 (Ω) and let f ∈ C1 (Ω × R × Rn ) be such that f (x, y, z) is convex. Let
F : D → R be defined by Z
F(y) = f (x, y(x), ∇y(x)) dx,
Ω
where
D = {y ∈ V : y = y1 em ∂ Ω}.
Let y0 ∈ D be a stationary function for f which may be extended to B0 , being kept stationary in B0 , where
B0 has been specified above in this section. Suppose we may define a field θ : D1 → Rn , also as it has been
specified above in this section. Assume D1 ⊂ Rn+1 contains an open set which contains C0 , where
where
Bδ (y0 ) = {y ∈ V : ky − y0 k∞ < δ }.
Proof 16.1 From the hypotheses and from the exposed above in this section, there exists δ > 0 such that
θ (x, y(x)) is well defined for all y ∈ D such that
ky − y0 k∞ < δ .
so that, Z
F(y) − F(y0 ) = E (x, y(x), θ (x, y(x)), ∇y(x)) dx ≥ 0, ∀y ∈ Bδ (y0 ) ∩ D.
Ω
16.3 Central fields and the vectorial case in the calculus of variations
Let Ω ⊂ Rn be an open, bounded, simply connected set with a regular (Lipschitzian) boundary denoted by
∂ Ω.
Let f ∈ C1 (Ω×RN ×RNn ) ( f (x, y, z)) and V = C1 (Ω; RN ). We suppose f is convex in z, which we denote
by f (x, y, z) be convex.
Choose x̃1 , . . . , x̃n , ỹ1 , . . . , ỹn ∈ R such that if x = (x1 , . . . , xn ) ∈ Ω, then
Suppose that y0 ∈ V1 = C1 (B0 ; RN ) is stationary for f in B0 = ∏nk=1 [x̃k , ỹk ], that is, suppose that
n
d
∑ dxk fz jk (x, y0 (x), ∇y0 (x)) = fy j (x, y0 (x), ∇y0 (x)), ∀x ∈ B0 , ∀ j ∈ {1, . . . , N }.
k=1
and where
D = {y ∈ V : y = y0 in ∂ Ω}.
On Central Fields in the Calculus of Variations 361
Assume there exists a family of stationary functions F , such that for each (x, y) ∈ D1 ⊂ Rn+N , there
exists a unique stationary function y3 ∈ V1 for f in F , such that
F1 = {Φ(t, Λ(x, y)) stationary for f such that Φ(x, Λ(x, y)) = y for a Λ ∈ Br (0) ⊂ RN }
such that Br (0) is an open ball of center 0 and radius r, for some r > 0.
Here, Φ is stationary and
Φ((t1 , . . . ,tk = x̃k , . . . ,tn ), Λ(x, y)) = y0 (t1 , . . . ,tk = x̃k , . . . ,tn ), on ∂ B0
and
∇(Φ) j ((t1 , . . . ,tk = x̃k , . . . ,tn ), Λ(x, y)) · n = Λ j , on ∂ B0 , ∀k ∈ {1, . . . , n}, j ∈ {1, . . . , N}.
Here, n denotes the outward normal field to ∂ B0 .
Define the field θ : D1 ⊂ Rn+N → RNn by
and thus
θ j (x, (y3 )(x)) = ∇(y3 ) j (x), ∀x ∈ B0 .
At this point we assume the hypotheses of the implicit function theorem so that Φ j (x, Λ(x, y)) is of C1
class and θ j (x, y) is continuous (in fact, since Φ j is stationary, the partial derivatives of θ j (x, y) are well
defined, ∀ j ∈ {1, . . . , N}).
Define also
n n
h(x, y) = f (x, y, θ (x, y)) − ∑ ∑ fz jk (x, y, θ (x, y))θ jk (x, y)
j=1 k=1
and
Pjk (x, y) = fz jk (x, y, θ (x, y)), ∀ j ∈ {1, . . . , N}, k ∈ {1, . . . , n}.
Observe that
N n
hy j (x, y3 (x)) = fy j (x, y3 (x), θ (x, y3 (x))) + ∑ ∑ fzlk (x, y3 (x), θ (x, y3 (x)))(θlk )y j (x, y3 (x))
l=1 k=1
N n
−∑ ∑ {(Plk )y j (x, y3 (x))θlk (x, y3 (x)) + Plk (x, y3 (x))(θlk )y j (x, y3 (x))}
l=1 k=1
N n
= fy j (x, y3 (x), θ (x, y3 (x))) + ∑ ∑ Plk (x, y3 (x))(θlk )y j (x, y3 (x))
l=1 k=1
n
− ∑ {(Pj )y (x, y3 (x))θ j (x, y3 (x)) + Pj (x, y3 (x))(θ j )y (x, y3 (x))}
j=1
N n
= fy j (x, y3 (x), θ (x, y3 (x))) − ∑ ∑ (Plk )y j (x, y3 (x))θlk (x, y3 (x)). (16.8)
l=1 k=1
362 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Thus,
N n
hy j (x, y) = fy j (x, y, θ (x, y)) − ∑ ∑ (Plk )y j (x, y)θlk (x, y), ∀(x, y) ∈ D1 .
l =1 k=1
On the other hand, considering that y3 (x) is stationary, we obtain Hk (x, y) such that
∂ Hk (x, y)
Pjk (x, y) = , ∀ j ∈ {1, . . . , N}, k ∈ {1, . . . , n}.
∂yj
Indeed, we define
Z xk
Hk (x, y) = staφ ∈Dˆ k f (x1 , . . . ,tk , . . . , xn , φ1 (tk ), . . . , φN (tk ), ∇φ̃ (x,tk )) dtk ,
x̃k
and where
D̂k = {φ ∈ C1 ([x̃k , xk ]; RN ) : φ (x̃k ) = y3 (x̂˜k ) and φ (xk ) = y3 (x̂k )},
∀k ∈ {1, . . . , n}.
Observe that, since φ (tk ) is stationary, we have
d
fy j [φ (tk )] − fz [φ (tk )] = 0, in [x̃k , xk ], ∀ j ∈ {1, . . . , N } (16.10)
dtk jk
where generically, we denote
f [φ (tk )] = f (x, φ (tk ), ∇φ̃ (tk )).
Observe that from these Euler-Lagrange equations we may obviously obtain
Therefore,
Z xk
Hk (x, y) = f [φ (tk , Λ̃(x, y)] dtk ,
x̃k
On Central Fields in the Calculus of Variations 363
and thus,
Z xk
[Hk (x, y)]y j = ˜ y)](φl ) Λ̃y (x, y) dtk
fyl [φ (tk , Λ(x, Λ̃ j
x̃k
Z xk
+ ˜ y (x, y) dtk
fzlk [φ (tk , Λ̃(x, y)](φl )tk Λ̃ Λ j
x˜k
Z xk
= fyl [φ (tk , Λ̃(x, y)](φl )Λ̃ Λ̃y j (x, y) dtk
x̃k
Z xk
+ ˜ y (x, y) dtk .
fzlk [φ (tk , Λ̃(x, y)][(φl )Λ̃ ]tk Λ (16.11)
j
x̃k
Therefore,
n
d
0 = fy j (x, y3 (x), ∇y3 (x)) − ∑ fz jk (x, y3 (x), ∇y3 (x))
k=1 dx k
N n
= hy j (x, y3 (x)) + ∑ ∑ (Plk )y j (x, y3 (x))θlk (x, y3 (x))
l=1 k=1
!
n ∂ Pjk (x, y3 (x)) N ∂ (y3 )l (x)
−∑ + ∑ (Pjk )yl (x, y3 (x))
k=1 ∂ xk l=1 ∂ xk
N n
= hy j (x, y3 (x)) + ∑ ∑ (Plk )y j (x, y3 (x))θlk (x, y3 (x))
l=1 k=1
!
n
∂ Pjk (x, y3 (x)) N
−∑ + ∑ (Pjk )yl (x, y3 (x))θlk (x, y3 (x))
k=1 ∂ xk l=1
n ∂ (H ) (x, y (x))
k yj 3
= hy j (x, y3 (x)) − ∑ . (16.14)
k=1 ∂ xk
Thus,
n
∂ Hk (x, y)
hy j (x, y) = ∑ , ∀(x, y) ∈ D1
k=1 ∂ xk yj
so that
n
∂ Hk (x, y)
h(x, y) = ∑ +W (x),
k=1 ∂ xk
for some W : B0 → R.
Hence, assuming D1 contains an open set which contains C0 , where
for y ∈ D sufficiently close to y0 in L∞ norm, the generalized Hilbert integral, denoted by I(y), will be defined
as
N n
∂ y j (x)
Z
I (y) = h(x, y(x)) dx + ∑ ∑ Pjk (x, y(x)) dx
Ω j=1 k=1 ∂ xk
!
n n
∂ Hk (x, y(x)) ∂ y j (x)
Z Z
= ∑ + ∑ (Hk )y j (x, y(x)) dx + W (x) dx
Ω k=1 ∂ xk j=1 ∂ xk Ω
n
dHk (x, y(x))
Z Z
= ∑ dx + W (x) dx
Ω k=1 dxk Ω
Z n Z
= ∑ (−1)k+1 Hk (x, y(x))dx1 ∧ · · · ∧ dcxk ∧ · · · ∧ dxn +
∂ Ω k=1 Ω
W (x) dx
= W1 (y|∂ Ω )
= W1 ((y1 )|∂ Ω ), (16.15)
Thus,
Z
F(y) − F(y0 ) = [ f (x, y(x), ∇y(x)) − f (x, y(x), θ (x, y(x)))] dx
Ω
N n Z
−∑ ∑ fz jk (x, y(x), θ (x, y(x))((y j )xk (x) − θ jk (x, y(x))) dx
j=1 k=1 Ω
Z
= E (x, y(x), θ (x, y(x)), ∇y(x)) dx, (16.17)
Ω
where
Theorem 16.3.1 Let Ω ⊂ Rn be an open, bounded, simply connected set with a regular (Lipschitzian) bound-
ary denoted by ∂ Ω. Let V = C1 (Ω; RN ) and let f ∈ C1 (Ω × RN × RNn ) be such that f (x, y, z) is convex. Let
F : D → R be defined by Z
F(y) = f (x, y(x), ∇y(x)) dx,
Ω
where
D = {y ∈ V : y = y1 on ∂ Ω}.
Let y0 ∈ D be an stationary function for f which may be extended to B0 , keeping it stationary in B0 , where
B0 has been specified above in this section. Suppose we may define a field θ : D1 → RNn , also as specified
above in this section. Assume D1 ⊂ Rn+N contains an open set which contains C0 , where
where
Bδ (y0 ) = {y ∈ V : ky − y0 k∞ < δ }.
366 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Proof 16.2 From the hypotheses and from the exposed above in this section, there exists δ > 0 such that
θ (x, y(x)) is well defined for each y ∈ D such that
ky − y0 k∞ < δ .
so that, Z
F(y) − F(y0 ) = E (x, y(x), θ (x, y(x)), ∇y(x)) dx ≥ 0, ∀y ∈ Bδ (y0 ) ∩ D.
Ω
The proof is complete.
SECTION III
APPLICATIONS TO MODELS IN
PHYSICS AND ENGINEERING
Chapter 17
17.1 Introduction
In this chapter, in a first step, we develop a new existence proof and a dual variational formulation for the
Kirchhoff-Love thin plate model. Previous results on existence in mathematical elasticity and related models
may be found in [29, 30, 31].
At this point we refer to the exceptionally important article “A contribution to contact problems for a
class of solids and structures” by W.R. Bielski and J.J. Telega [11], published in 1985, as the first one to
successfully apply and generalize the convex analysis approach to a model in non-convex and non-linear
mechanics.
The present work is, in some sense, a kind of extension of this previous work [11] and others such as
[10], which greatly influenced and inspired my work and recent book [14].
Here we highlight that such earlier results establish the complementary energy under the hypothesis of
positive definiteness of the membrane force tensor at a critical point (please see [11, 10, 39] for details).
We have obtained a dual variational formulation which allows the global optimal point in question not
to be positive definite (for related results see F. Botelho [14]), but also not necessarily negative definite.
The approach developed also includes sufficient conditions of optimality for the primal problem. It is worth
mentioning that the standard tools of convex analysis used in this text may be found in [33, 14, 64], for
example.
368 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
It is worth emphasizing that the boundary conditions here specified refer to a clamped plate.
We define the operator Λ : U → Y ×Y , where Y = Y ∗ = L2 (Ω; R2×2 ), by
1 1
Z Z
(G ◦ Λ)(u) = Nαβ (u)γαβ (u) dx + Mαβ (u)καβ (u) dx (17.4)
2 Ω 2 Ω
where P, P1 , P2 ∈ L2 (Ω) are external loads in the directions a3 , a1 and a2 respectively. The potential energy,
denoted by J : U → R is expressed by:
Finally, we also emphasize from now on, as their meaning are clear, we may denote L2 (Ω) and
L2 (Ω; R2×2 ) simply by L2 , and the respective norms by k · k2 . Moreover derivatives are always understood
in the distributional sense, 0 may denote the zero vector in appropriate Banach spaces and, the following and
relating notations are used:
∂ 2w
w,αβ = ,
∂ xα ∂ xβ
∂ uα
uα,β = ,
∂ xβ
∂ Nαβ
Nαβ ,1 = ,
∂ x1
and
∂ Nαβ
Nαβ ,2 = .
∂ x2
Remark 17.1 Let {Pα } ∈ L∞ (Ω; R2 ). We may easily obtain by appropriate Lebesgue integration {T̃αβ }
symmetric and such that
T̃αβ ,β = −Pα , in Ω.
Indeed, extending {Pα } to zero outside Ω if necessary, we may set
Z x
T̃11 (x, y) = − P1 (ξ , y) dξ ,
0
Z y
T̃22 (x, y) = − P2 (x, ξ ) dξ ,
0
and
T̃12 (x, y) = T̃21 (x, y) = 0, in Ω.
Thus, we may choose a C > 0 sufficiently big, such that
About the other existence result for plates, its proof through the tensor well specified {(T0 )αβ } is also
new, even though the final part of such a proof is also performed through a standard procedure.
A similar remark is valid for the existence result for the model of shells, which is also established through
a tensor T0 properly specified.
Finally, the duality principles and concerning optimality conditions are established through new func-
tionals. Similar results may be found in [14].
Remark 17.2 Specifically about the existence of the tensor T0 relating Theorem 17.3.1, we recall the
following well known duality principle of the calculus of variations
1 2
inf kT k2
T ={Tαβ }∈B∗ 2
1
Z
= sup − ∇uα · ∇uα dx + huα , Pα iL2 (Ω) + huα , Pαt iL2 (Γt ) . (17.6)
{uα }∈Ũ 2 Ω
Here
B∗ = {T ∈ L2 (Ω; R4 ) : Tαβ ,β + Pα = 0, in Ω, Tαβ nβ − Pαt = 0, on Γt },
and
Ũ = {{uα } ∈ W 1,2 (Ω; R2 ) : uα = 0 on Γ0 }.
We also recall that the existence of a unique solution for both these primal and dual convex formulations
is a well known result of the duality theory in the calculus of variations. Please, see related results in [33].
A similar duality principle may be established for the case of Theorem 17.6.1.
Theorem 17.2.1 Let Ω ⊂ R2 be an open, bounded, connected set with a Lipschitzian boundary denoted by
∂ Ω = Γ. Suppose (G ◦ Λ) : U → R is defined by
where
1
Z
G1 (γu) = Hαβ λ µ γαβ (u)γλ µ (u) dx,
2 Ω
and
1
Z
G2 (κu) = hαβ λ µ καβ (u)κλ µ (u) dx,
2 Ω
where
Λ(u) = (γ(u), κ(u)) = ({γαβ (u)}, {καβ (u)}),
uα,β + uβ ,α w,α w,β
γαβ (u) = + ,
2 2
καβ (u) = −w,αβ ,
and where
U = {u = (u1 , u2 , w) ∈ W 1,2 (Ω; R2 ) ×W 2,2 (Ω) : u1 = u2 = w = 0 on ∂ Ω}.
We also define,
where
f = (Pα , P) ∈ L∞ (Ω; R3 ).
Global Existence Results and Duality for Non-Linear Models of Plates and Shells 371
Let J : U → R be defined by
J(u) = G(Λu) − F1 (u), ∀u ∈ U.
Assume there exists {cαβ } ∈ R2×2 such that cαβ > 0, ∀α, β ∈ {1, 2} and
an also such that {Tαβ } is positive definite and symmetric (please, see Remark 17.1).
Thus defining
uα,β + uβ ,α 1
vαβ (u) = + w,α w,β , (17.8)
2 2
we obtain
Therefore, there exists w0 ∈ W 2,2 (Ω) such that, up to a subsequence not relabeled,
and thus, from this, (17.8) and (17.10), we may infer that there exists K3 > 0 such that
From this and Korn’s inequality, there exists K4 > 0 such that
So, up to a subsequence not relabeled, there exists {(u0 )α } ∈ W 1,2 (Ω, R2 ), such that
inf J(u) = α1
u∈U
= lim inf J(un )
n→∞
≥ J(u0 ). (17.11)
Thus,
J(u0 ) = min J(u).
u∈U
17.3 Existence of a minimizer for the plate model for a more general
case
At this point we present an existence result for a more general case.
Theorem 17.3.1 Consider the statements and assumptions concerning the plate model described in the last
section.
More specifically, consider the functional J : U → R given, as above described, by
where,
mΓ (Γ0 ∩ Γt ) = 0,
and
mΓ (Γ0 ) > 0.
Let T0 be such that,
kT0 k22 = min {kT k22 },
T ∈L2 (Ω;R2×2 )
subject to
Tαβ ,β + Pα = 0 in Ω,
(T0 )αβ nβ − Pαt = 0, on Γt .
Assume kPα k∞ and kPαt k∞ are small enough so that
where
1
J1 (u) = G2 (κ(u)) + h(T0 )αβ , w,α w,β iL2
2
−hP, wiL2 − hPt , wiL2 (Γt ) . (17.15)
J(u0 ) = min{J(u)}.
u∈U
Proof 17.2
Observe that defining
uα ,β + uβ ,α 1
vαβ (u) = + w,α w,β , (17.16)
2 2
374 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
we have,
From this and the hypothesis (17.14), J is bounded below. So, there exists α1 ∈ R such that
α1 = inf J(u).
u∈U
and thus, from this, (17.16) and (17.18), we may infer that there exists K3 > 0 such that
From this and Korn’s inequality, there exists K4 > 0 such that
∀α ∈ {1, 2}, as n → ∞.
Moreover, the boundary conditions satisfied by the subsequences are also satisfied for w0 and u0 in a
trace sense, so that
u0 = ((u0 )α , w0 ) ∈ U.
From this, up to a subsequence not relabeled, we get
inf J(u) = α1
u∈U
= lim inf J(un )
n→∞
≥ J(u0 ). (17.19)
Thus,
J(u0 ) = min J(u).
u∈U
The proof is complete.
Remark 17.3 In the proofs relating our duality principles we apply a very well known result found in
Toland [78].
Indeed, for
{Nαβ } ∈ L2 (Ω; R2×2 ),
assume
F5 (w) − G5 ({wα }) > 0, ∀w ∈ W02,2 (Ω) such that w 6= 0,
where here
1 K
Z
F5 (w) = hαβ λ µ w,αβ w,λ µ dx + hw,α , w,α iL2 ,
2 Ω 2
and
1 K
Z
G5 ({wα }) = − Nαβ w,α w,β dx + hw,α , w,α iL2 ,
2 Ω 2
where K > 0 is supposed to be sufficiently big so that G5 is convex in w.
Thus,
1
Z
F5 (w) − G5 ({wα }) = hαβ λ µ w,αβ w,λ µ dx
2 Ω
1
Z
+ Nαβ w,α w,β dx > 0, (17.20)
2 Ω
Therefore,
1
Z
= −F5∗ (z∗ ) + (−Nαβ )K z∗α z∗β dx ≥ 0, (17.22)
2 Ω
∀z∗ ∈ L2 .
Indeed, from the general result in Toland [78], we have
1
Z
inf −F5∗ (z∗ ) + (−Nαβ )K z∗α z∗β dx
z∗ ∈L2 2 Ω
= inf {F5 (w) − G5 ({w,α })
2,2
w∈W0
≤ F5 (w) − G5 ({w,α })
1
Z
= h w w dx
2 Ω αβ λ µ ,αβ ,λ µ
1
Z
+ N w,α w,β dx, ∀w ∈ W02,2 (Ω). (17.23)
2 Ω αβ
Theorem 17.4.1 Let Ω ⊂ R2 be an open, bounded, connected set with a Lipschitzian boundary denoted by
∂ Ω = Γ. Suppose (G ◦ Λ) : U → R is defined by
where
1
Z
G1 (γ(u)) = Hαβ λ µ γαβ (u)γλ µ (u) dx,
2 Ω
and
1
Z
G2 (κ(u)) = hαβ λ µ καβ (u)κλ µ (u) dx,
2 Ω
where
Λ(u) = (γ(u), κ(u)) = ({γαβ (u)}, {καβ (u)}),
uα ,β + uβ ,α w,α w,β
γαβ (u) = + ,
2 2
καβ (u) = −wαβ .
Here,
u = (u1 , u2 , w) = (uα , w) ∈ U = W01,2 (Ω; R2 ) ×W02,2 (Ω).
Global Existence Results and Duality for Non-Linear Models of Plates and Shells 377
We also define,
where
f = (Pα , P) ∈ L2 (Ω; R3 ).
Let J : U → R be defined by
J(u) = G(Λu) − F1 (u), ∀u ∈ U.
Under such hypotheses,
inf J(u)
u∈U
≥ sup { inf {−F ∗ (z∗ , Q) + G∗ (z∗ , N)}},
∗ ∗
(17.25)
v∗ ∈A∗ z ∈Y2
where,
K
F(u) = G2 (κ(u)) + hw,α , w,α iL2 , ∀u ∈ U.
2
Moreover, F ∗ : [Y2∗ ]2 → R is defined by,
Also,
(v2 )α (v2 )β
1 (v2 )λ (v2 )µ
Z
G(v) = − H (v1 )αβ + (v1 )λ µ + dx
2 Ω αβ λ µ 2 2
K
+ h(v2 )α , (v2 )α iL2 , (17.26)
2
G∗ (z∗ , N)
= sup { inf { hNαβ , (v1 )αβ iL2 + hQα + z∗α , (v2 )α iL2 − G(v)}}
v2 ∈Y2 v1 ∈Y1
1
Z
= (−Nαβ )K z∗α z∗β dx
2 Ω
1
Z
− H αβ λ µ Nαβ Nλ µ dx, (17.27)
2 Ω
if v∗ = (Q, N) ∈ A3 , where
−N11 + K −N12
{(−Nαβ )K } = , (17.28)
−N21 −N22 + K
and
(−Nαβ )K = {(−Nαβ )K }−1 .
Moreover,
A∗ = A1 ∩ A2 ∩ A3 ∩ A4 ∩ A5 ,
where
A1 = {v∗ = (N, Q) ∈ Y ∗ : Nαβ ,β + Pα = 0, in Ω, ∀α ∈ {1, 2}}
378 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
A2 = {v∗ ∈ Y ∗ : Qα,α + P = 0, in Ω}.
Also,
and
∗ ∗ K
A5 = v = (Q, N) ∈ Y : {Nαβ + Kδαβ } ≥ {δαβ } ,
2
where K > 0 is supposed to be such that, in an appropriate matrix sense,
2
{H αβ λ µ } > {δ }.
K αβ
Furthermore,
and
J˜∗ (v∗ ) = ∗inf ∗ J ∗ (v∗ , z∗ ), ∀v∗ ∈ A∗ ,
z ∈Y2
suppose there exist v∗0 = (N0 , Q0 ) ∈ A∗ , z∗0 ∈ Y2∗ and u0 ∈ U such that
δ {J ∗ (v∗0 , z∗0 ) − h(u0 )α , (N0 )αβ ,β + Pα iL2 − hw0 , (Q0 )α,α + PiL2 } = 0.
Proof 17.3 Observe that, from the general result in Toland [78], we have
Thus,
= inf J ∗ (v∗ , z∗ )
z∗ ∈Y2∗
∀v∗ ∈ A∗ , u ∈ U.
Summarizing,
J(u) ≥ J˜∗ (v∗ ), ∀u ∈ U, v∗ ∈ A∗ ,
so that,
Finally, assume v∗0 = (N0 , Q0 ) ∈ A∗ , z∗0 ∈ Y2∗ and u0 ∈ U are such that
δ {J ∗ (v∗0 , z∗0 ) − h(u0 )α , (N0 )αβ ,β + Pα iL2 − hw0 , (Q0 )α,α + PiL2 } = 0.
380 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Hence,
F ∗ (z∗0 , Q0 ) = h(z∗0 )α + (Q0 )α , (w0 ),α iL2 − F(u0 ).
Also, from (17.35) we have,
Therefore, from such a last equality and (17.34), we may infer that
∂r
aα = , ∀α ∈ {1, 2},
∂ ξα
aαβ = aα · aβ .
Let
aα × aβ
n= ,
|aα × aβ |
be the unit normal to S, so that we define the covariant components bαβ of the curvature tensor
b = {bαβ },
by
bαβ = n · aα,β = n · r,αβ .
Observe that
n · aα = 0,
382 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
nβ · aα + n · aα,β = 0,
and thus we obtain,
bαβ = n · aα,β = −nβ · aα .
The Christofell symbols relating S, would be,
1
Γαβ γ = (aαβ ,γ + aαγ,β − aβ γ,α ), ∀α, β , γ ∈ {1, 2}
2
and
Γλαβ = aλ γ Γγαβ , ∀α, β , λ ∈ {1, 2},
where
{aαβ } = {aαβ }−1 .
Let us denote with a bar the quantities relating the deformed middle surface.
So, the middle surface strain tensor γ = {γαβ } is given by
(āαβ − aαβ )
γαβ = ,
2
while the tensor relating change in curvature is given by
We also denote,
where, in order to simplify the analysis, the boundary conditions in question refer to a clamped shell and
where ∂ S denotes the boundary of S.
Also from reference [10], for moderately large rotations around tangents, the strain displacements rela-
tions are given by,
1.
1
γαβ (u) = θαβ (u) + ϕα (u)ϕβ (u),
2
2.
where,
uα |β = uα,β − Γλαβ uλ ,
β
ϕα (u) = w,α + bα uβ , (17.42)
and
β
bα = bαλ aλ β .
The primal shell inner energy is defined by
1
Z
W (γ(u), κ(u)) = H αβ λ µ γαβ (u)γλ µ (u) dS
2 S
1
Z
+ hαβ λ µ καβ (u)κλ µ (u) dS, (17.43)
2 S
where
αβ λ µ Eh αλ β µ αµ βλ 2ν αβ λ µ
H = a a +a a + a a ,
2(1 + ν) 1−ν
h2 αβ λ µ
hαβ λ µ =
H ,
12
and E denotes the Young’s modulus and ν the Poisson ratio.
The constitutive relations are,
N αβ = H αβ λ µ γλ µ (u),
and
M αβ = hαβ λ µ κλ µ (u),
where {N αβ } is the membrane force tensor and {M αβ } is the moment one.
We assume
H = {H αβ λ µ }
to be positive definite in the sense that there exists c0 > 0 such that
H αβ λ µ tαβ tλ µ ≥ c0tαβ tαβ ≥ 0, ∀t = {tαβ } ∈ R4 .
Finally, the primal variational formulation for this model will be given by
J :U →R
where,
J(u) = W (γ(u), κ(u)) − hu, fiL2 ,
W (γ(u), κ(u)) = G1 (γ(u)) + G2 (κ(u)),
1
Z
G1 (γ(u)) = H αβ λ µ γαβ (u)γλ µ (u) dS,
2 S
1
Z
G2 (κ (u)) = hα β λ µ καβ (u)κλ µ (u) dS,
2 S
and Z
hu, fiL2 = (Pα uα + Pw) dS.
S
Here
f = (Pα , P) ∈ L2 (S; R3 ),
are the external loads distributed on S, Pα relating the directions aα and P relating the direction n, respec-
tively.
Moreover, generically for f1 , f2 ∈ L2 (S), we denote,
Z
h f1 , f2 iL2 = f1 f2 dS,
S
√
where dS = a dξ1 dξ2 , and a = det{aαβ }.
384 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
subject to √
( gTαβ ),β
√ + Γαλ β Tλ β + Pα = 0 in D.
g
Assume
J1 (u) → +∞, as hw,αβ , w,αβ iL2 + huα,β , uα,β iL2 → +∞, (17.45)
where
1
J1 (u) = G2 (κ(u)) + h(T0 )αβ , ϕα (u)ϕβ (u)i − h(T0 )αβ bαβ + P, wiL2 .
2
Under such hypotheses, there exists u0 ∈ U such that,
J(u0 ) = min{J(u)}.
u∈U
Proof 17.4
Observe that
Z √
( g(T0 )αβ ),β
α √
hPα , uα iL2 = − √ + Γλ β (T0 )λ β uα g dξ
D g
uα,β + uβ ,α
λ
= (T0 )αβ , − Γαβ uλ
2 L2
= h(T0 )αβ , θαβ (u) + bαβ wiL2 , (17.46)
so that defining
1
vαβ (u) = θαβ (u) + ϕα (u)ϕβ (u), (17.47)
2
we obtain
1
hPα , uα iL2 = (T0 )αβ , vαβ (u) − ϕα (u)ϕβ (u) + bαβ w .
2 L2
Thus,
From this and hypothesis (17.45), J is bounded below. So, there exists α1 ∈ R such that
α1 = inf J(u).
u∈U
(un )α,β * (u0 )α,β weakly in L2 (S), ∀α, β ∈ {1, 2}, (17.50)
(un )α → (u0 )α strongly in L2 (S), ∀α ∈ {1, 2}. (17.51)
Also from (17.48), there exists K2 > 0 such that,
and thus, from this, (17.42), (17.47), (17.49) and (17.51), we may infer that there exists K3 > 0 such that
inf J(u) = α1
u∈U
= lim inf J(un )
n→∞
≥ J(u0 ). (17.52)
Thus,
J(u0 ) = min J(u).
u∈U
Theorem 17.7.1 Consider the statements and assumptions concerning the shell model described in the last
two sections.
More specifically, consider the functional J : U → R given, as above described by,
inf J(u)
u∈U
≥ sup { ∗inf ∗ {−F ∗ (z∗ , Q) + G∗ (z∗ , N)}}, (17.54)
v∗ ∈A∗ z ∈Y2
where,
K
F(u) = G2 (κ(u)) + hϕα (u), ϕα (u)iL2 , ∀u ∈ U.
2
Moreover, F ∗ : [Y2∗ ]2 → R is defined by,
Also,
(v2 )α (v2 )β
1 (v2 )λ (v2 )µ
Z
G(v) = − H (v1 )αβ + (v1 )λ µ + dS
2 S αβ λ µ 2 2
K
+ h(v2 )α , (v2 )α iL2 , (17.55)
2
G∗ (z∗ , N ) = sup { inf {hN αβ , (v1 )αβ iL2 + hz∗α , (v2 )α iL2 − G(v)}}
v2 ∈Y2 v1 ∈Y1
1
Z
= (−N α β )K zα∗ z∗β dS
2 S
1
Z
− H αβ λ µ N αβ N λ µ dS, (17.56)
2 S
if v∗ = (Q, N) ∈ A3 , where
−N 11 + K −N 12
αβ
{(−N )K } = , (17.57)
−N 21 22
−N + K
Global Existence Results and Duality for Non-Linear Models of Plates and Shells 387
and
(−N αβ )K = {(−N αβ )K }−1 .
Moreover, defining
Y ∗ = Y = L2 (S; R2 ) × L2 (S; R2×2 ),
Y1∗ = Y1 = L2 (S; R2×2 ),
and
Y2∗ = Y2 = L2 (S; R2 ),
also,
A∗ = A1 ∩ A2 ∩ A3 ∩ A4 ∩ A5 ,
where,
Moreover,
{H αβ λ µ } = {H αβ λ µ }−1 ,
and,
{hαβ λ µ } = {hαβ λ µ }−1 .
Also,
and
∗ ∗ K
A5 = v = (Q, N) ∈ Y : {Nαβ + Kδαβ } ≥ {δαβ } ,
2
where K > 0 is supposed to be such that, in an appropriate matrix sense,
2
{H αβ λ µ } > {δ }.
K αβ
Furthermore,
suppose there exist v∗0 = (N0 , Q0 ) ∈ A∗ , z∗0 ∈ Y2∗ and u0 ∈ U such that
Proof 17.5
Observe that, from the general result in Toland [78], we have
Thus,
= inf J ∗ (v∗ , z∗ )
z∗ ∈Y2∗
∀v∗ ∈ A∗ , u ∈ U.
Global Existence Results and Duality for Non-Linear Models of Plates and Shells 389
Summarizing,
J(u) ≥ J˜∗ (v∗ ), ∀u ∈ U, v∗ ∈ A∗ ,
so that,
inf J(u) ≥ sup J˜∗ (v∗ ). (17.66)
u∈U v∗ ∈A∗
Finally, assume v0∗ = (N0 , Q0 ) ∈ A∗ , z∗0 ∈ Y2∗ and u0 ∈ U are such that
Hence,
Therefore, from such a last equality and (17.66), we may infer that
17.8 Conclusion
In this article, in a first step, we have developed new proofs of global existence of minimizers for the
Kirchhoff-Love plate and a shell model presented in [10].
In a second step, we have developed duality principles for these same models. In [10], the authors devel-
oped a duality principle valid for the special case in which the membrane force tensor at a critical point is
positive definite. We have generalized such a result, considering that in our approach, such a previous case
is included but here we do not request the optimal membrane force to be either positive or negative definite.
Thus, in some sense, we have complemented the important work developed in [10]. We would emphasize,
sufficient optimality conditions are presented and the results here developed are applicable to a great variety
of problems, including other shell models.
Chapter 18
18.1 Introduction
In this chapter we develop a new primal dual variational formulation for the Kirchhoff-Love non-linear plate
model. We emphasize the results here presented may be applied to a large class of non-convex variational
problems.
At this point we start to describe the primal formulation.
Let Ω ⊂ R2 be an open, bounded, connected set which represents the middle surface of a plate of thickness
h. The boundary of Ω, which is assumed to be regular (Lipschitzian), is denoted by ∂ Ω. The vectorial basis
related to the cartesian system {x1 , x2 , x3 } is denoted by (aα , a3 ), where α = 1, 2 (in general Greek indices
stand for 1 or 2), and where a3 is the vector normal to Ω, whereas a1 and a2 are orthogonal vectors parallel
to Ω. Also, n is the outward normal to the plate surface.
The displacements will be denoted by
It is worth emphasizing that the boundary conditions here specified refer to a clamped plate.
392 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where P, P1 , P2 ∈ L2 (Ω) are external loads in the directions a3 , a1 and a2 respectively. The potential energy,
denoted by J : U → R is expressed by:
J(u) = (G ◦ Λ)(u) − F1 (u)
Finally, we also emphasize from now on, as their meaning are clear, we may denote L2 (Ω) and
L2 (Ω; R2×2 ) simply by L2 , and the respective norms by k · k2 . Moreover derivatives are always understood
in the distributional sense, 0 may denote the zero vector in appropriate Banach spaces and, the following and
relating notations are used:
∂ 2w
w,αβ = ,
∂ xα ∂ xβ
∂ uα
uα,β = ,
∂ xβ
∂ Nαβ
Nαβ ,1 = ,
∂ x1
and
∂ Nαβ
Nαβ ,2 =
.
∂ x2
Here we emphasize the general Einstein convention of sum of repeated indices holds throughout the text,
unless otherwise indicated.
Remark 18.1 About the references, details on the Sobolev involved may be found in [1]. Mandatory
references are the original results of Telega and his co-workers in [10, 11, 76, 77]. About convex analysis,
the results here developed follow in some extent [14], for which the main references are [33, 78].
We emphasize, our results complement, in some sense, the original ones presented in [10, 11, 76, 77].
Finally, existence results for models in elasticity including the plate model here addressed are developed
in [29, 30, 31]. Similar problems are addressed in [39, 43].
A Primal Dual Formulation and a Multi-Duality Principle for a Non-Linear Model of Plates 393
1 1
γαβ (u) = (uα,β + uβ ,α ) + w,α w,β ,
2 2
if
div Q + div z∗ − P = −hαβ λ µ w,αβ λ µ
and
w ∈ U2 .
Define also G∗2 : Y2∗ ×Y2∗ → R, by
G∗2 (z∗ , Q)
= sup{hw,α , z∗α + Qα iL2 − G2 (κ(u)) + hP, wiL2 }
w∈U
1
Z
= {(hαβ λ µ D∗λ µ Dαβ )−1 ( div Q + div z∗ − P)}{( div Q + div z∗ − P)} dx,
2 Ω
G̃∗1 (−Q, N) =
sup −h(v2 )α , Qα iL2 + h(v1 )αβ , Nαβ iL2
(v1 ,v2 )∈Y1 ×Y2
1 1
Z
−G1 (v1 )αβ + (v2 )α (v2 )β − Kαβ (v2 )α (v2 )β dx
2 2 Ω
1
Z
= Nαβ + Kαβ Qα Qβ dx
2 Ω
1
Z
+ H αβ λ µ Nαβ Nλ µ dx (18.6)
2 Ω
394 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
if
{Nαβ + Kαβ }
is positive definite.
Here we have denoted,
{Nαβ + Kαβ } = {Nαβ + Kαβ }−1 .
We denote also,
1
Z
F ∗ (z∗ ) = sup h(v2 )α , (z∗2 )α iL2 − K (v2 )α (v2 )β dx
v2 ∈Y2 2 Ω αβ
1
Z
= K (z∗ )α (z∗2 )β dx
2 Ω αβ 2
1
Z
= −Nαβ + εδαβ (z∗2 )α (z∗2 )β dx , (18.7)
2 Ω
if {Kαβ } is positive definite, where
{K αβ } = {Kαβ }−1 ,
{Kαβ } = {−Nαβ + εδαβ },
{H αβ λ µ } = {Hαβ λ µ }−1 ,
and
1
Z
F({wα }) = Kαβ w,α w,β dx,
2 Ω
for some
ε > 0.
At this point we also define,
B∗ = {N ∈ Y1∗ : {Kαβ } is positive definite and Jˆ∗ (N, z∗ ) > 0, ∀z∗ ∈ Y2∗ such that z∗ =
6 0}
where,
Jˆ∗ (N, z∗ ) = F ∗ (z∗ ) − G∗2 (z∗ , 0)
1
Z
= (−Nαβ + εδαβ )z∗α z∗β dx
2 Ω
1
Z
− [(h D∗ D )−1 ( div z∗ )][ div z∗ ] dx. (18.8)
2 Ω αβ λ µ λ µ αβ
Moreover, we denote,
C∗ = {N ∈ Y1∗ : Nαβ ,β + Pα = 0, in Ω}
and define
A∗ = B∗ ∩C∗ .
Assume u0 ∈ U is such that δ J(u0 ) = 0 and N0 ∈ B∗ , where
{(N0 )αβ } = {Hαβ λ µ γλ µ (u0 )}.
Under such hypotheses,
J(u0 ) = inf J(u)
u∈U
= sup J˜∗ (Q, N)
(Q,N)∈Y2∗ ×A∗
= J˜∗ (Q0 , N0 )
= J ∗ (z∗0 , Q0 , N0 ), (18.9)
A Primal Dual Formulation and a Multi-Duality Principle for a Non-Linear Model of Plates 395
where
J ∗ (z∗ , Q, N) = F ∗ (z∗ ) − G∗2 (z∗ , Q) − G̃∗1 (−Q, N),
J˜∗ (Q, N) = ∗inf ∗ J ∗ (z∗ , Q, N),
z ∈Y2
and where
(z∗0 )α = (−(N0 )αβ + εδαβ )(w0 ),β ,
and
Q0 = −ε∇w0 .
inf J ∗ (z∗ , Q, N)
z∗ ∈Y2∗
Therefore,
inf J ∗ (z∗ , Q, N)
z∗ ∈Y2∗
1 1
Z Z
≤ hαβ λ µ w,αβ w,λ µ dx + Nαβ w,α w,β dx
Ω2 2
Ω Z
uα,β + uβ ,α
1
+ , Nαβ − H N N dx
2 L 2 2 Ω αβ λ µ αβ λ µ
−hw, PiL2 − huα , Pα iL2
uα,β + uβ ,α 1
1
Z
≤ sup + w,α w,β , Nαβ − H αβ λ µ Nαβ Nλ µ dx
N∈Y1∗ 2 2 L2 2 Ω
1
Z
+ h w w dx
2 Ω αβ λ µ ,αβ ,λ µ
−hw, PiL2 − huα , Pα iL2
1 1
Z Z
= γαβ (u)γλ µ (u) dx + h κ (u)κλ µ (u) dx
2 Ω 2 Ω αβ λ µ αβ
−hw, PiL2 − huα , Pα iL2
= J(u), (18.11)
∀u ∈ U, Q ∈ Y2∗ ,
N ∈ A∗ .
Summarizing,
J(u) ≥ inf J ∗ (z∗ , Q, N)
z∗ ∈Y2∗
so that
[(hαβ λ µ D∗λ µ Dαβ )−1 ( div(z∗0 + Q0 ) − P)],ρ + ((−N0 )ρβ + εδρβ )(z∗0 )β = 0, in Ω,
that is,
∂ Jˆ1∗ (z∗0 , Q0 , w0 , u0 )
= 0,
∂ z∗
where
Jˆ1∗ (z∗ , Q, N, u) = J ∗ (z∗ , Q, N) + huα , Nαβ ,β + Pα iL2 .
Also, from (18.15) and (18.16) we obtain
Furthermore,
(Q0 )ρ
= −(w0 ),ρ
ε
= [(hαβ λ µ D∗λ µ Dαβ )−1 ( div(z∗0 + Q0 ) − P)],ρ . (18.19)
H αβ λ µ (N0 )λ µ
(u0 )α,β + (u0 )β ,α
=
2
1
+ (−(N0 )αρ + εδαρ )(z∗0 )ρ (−(N0 )β η + εδβ η )(z∗0 )η , (18.20)
2
which means
∂ Jˆ1∗ (z∗0 , Q0 , N0 , u0 )
= 0.
∂ Nαβ
Finally, from
Nαβ ,β + Pα = 0, in Ω,
we get
∂ Jˆ1∗ (z∗0 , Q0 , N0 , u0 )
= 0.
∂ uα
398 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
= J˜∗ (Q0 , N0 )
= J ∗ (z∗0 , Q0 , N0 ), (18.24)
The proof is complete.
A Primal Dual Formulation and a Multi-Duality Principle for a Non-Linear Model of Plates 399
= J˜∗ (Q0 , N0 )
= J ∗ (z∗0 , Q0 , N0 )
= J3 (w0 , N0 )
= sup J3 (w, N) (18.26)
(w,N)∈U×A∗
where,
1
Z
J3 (w, N) = − h w w dx
2 Ω αβ λ µ ,αβ ,λ µ
1
Z
− (N − εδαβ )w α w,β dx
2 Ω αβ
1
Z
(−∇2 )−1 hαβ λ µ w,αβ λ µ − [(Nαβ − ε δα β )w,β ],α − P
−
2ε Ω
× hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )w,β ],α − P dx
1
Z
− H N N dx, (18.27)
2 Ω αβ λ µ αβ λ µ
400 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
∀Q ∈ Ŷ2∗ , N ∈ A∗ .
Also, such an infimum is attained through the equation
that is,
− div ∇(hαβ λ µ D∗λ µ Dαβ )−1 ( div (z∗ + Q) − P)
= [(−Nαβ + εδαβ )(z∗2 )β ],α
= ∇2 w, (18.29)
where
w = −(hαβ λ µ D∗λ µ Dαβ )−1 ( div (z∗ + Q) − P).
Hence,
1
F ∗ (z∗ ) = hw,α , z∗α iL2 − h(−Nαβ + εδαβ )w,α w,β iL2 . (18.30)
2
From
− div (z∗ + Q) = hαβ λ µ w,αβ λ µ − P,
and
(z∗α ),α = [(−Nαβ + εδαβ )(w,β )],α ,
we obtain
−∇2 v = − div (∇v) = − div Q = hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )(w,β )],α − P,
so that
v = −(∇2 )−1 (hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )(w,β )],α − P)
A Primal Dual Formulation and a Multi-Duality Principle for a Non-Linear Model of Plates 401
Thus,
Q = {[−(∇2 )−1 (hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )(w,β )],α − P],ρ }
so that
1
Z
δ Qα Qβ dx
2ε Ω αβ
1
Z
(−∇2 )−1 hαβ λ µ w,αβ λ µ − [(Nα β − εδαβ )w,β ],α − P
=
2ε Ω
× hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )w,β ],α − P dx. (18.32)
Thus,
1
Z
J˜∗ (Q, N) = − h w w dx
2 Ω αβ λ µ ,αβ ,λ µ
1
Z
− (N − εδαβ )w,α w,β dx
2 Ω αβ
1
Z
(−∇2 )−1 hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )w,β ],α − P
−
2ε Ω
× hαβ λ µ w,αβ λ µ − [(Nαβ − εδαβ )w,β ],α − P dx
1
Z
− H N N dx
2 Ω αβ λ µ αβ λ µ
= J3 (w, N). (18.33)
hαβ λ µ (w0 ),αβ λ µ − [((N0 )αβ − εδαβ )(w0 ),β ],α − εδαβ (w0 )αβ − P = 0, in Ω,
so that
ŵ0 ≡ w0 = (∇2 )−1 (hαβ λ µ (w0 ),αβ λ µ − [((N0 )αβ − εδαβ )(w0 ),β ],α − P)/ε
and therefore
Also,
(u0 )α,β + (u0 )β ,α
1 1
H αβ λ µ (N0 )λ µ = + (w0 ),α (w0 ),β ,
2 2 2
so that
1 (u0 )α,β + (u0 )β ,α
δ J3 (w0 , N0 ) + , (N0 )αβ − h(u0 ),α , Pα iL2 = 0.
2 2 L2
From these last results and from the last theorem, we may obtain
J3 (w0 , N0 ) = J(w0 )
= J ∗ (z0 , Q0 , N0 )
= J˜∗ (Q0 , N0 ). (18.35)
402 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
From this, also from the last theorem and from (18.33), we finally get
= J˜∗ (Q0 , N0 )
= J ∗ (z∗0 , Q0 , N0 )
= J3 (w0 , N0 )
= sup J3 (w, N). (18.36)
(w,N)∈U×A∗
Theorem 18.4.1 Considering the notation and statements of the plate model addressed in the last sections,
assuming a not relabeled finite dimensional approximate model, in a finite elements or finite differences
context, let J1 : U ×Y1∗ ×Y2∗ → R be a functional where
1
Z
J1 (u, Q, N) = hαβ λ µ w,αβ w,λ µ dx − hP, wiL2
2 Ω
1
Z
+ (−NαβK )Q Q dx − hw , Q i
α β ,α α L2
2 Ω
K 1
Z Z
+ w,α w,α dx − H N N dx
2 Ω 2 Ω αβ λ µ αβ λ µ
−hNαβ ,β + Pα , uα iL2 , (18.37)
and where
K } = {−N −1
{−Nαβ αβ + Kδαβ } .
Define also,
K
C∗ = N ∈ Y1∗ : {−Nαβ + Kδαβ } > δαβ ,
2
where
1
Z
HK∗ (Q, N) = sup hw,α , Q,α iL2 − (−Nαβ + Kδαβ )w,α w,β dx .
u∈U 2 Ω
Moreover, we also define,
A∗+ = B∗ ∩C∗ ∩ D+
A∗− = B∗ ∩C∗ ∩ D− ,
and
E ∗ = B∗ ∩C∗ .
Let u0 ∈ U be such that δ J(u0 ) = 0 and define
and
(Q0 )α = ((N0 )αβ + Kδαβ )(w0 ),β .
Under such hypotheses,
1. if δ 2 J(u0 ) > 0 and N0 ∈ E ∗ , defining
and
J˜∗ (Q) = inf J2 (u, Q),
u∈Br1 (u0 )
2. If N0 ∈ A∗+ , defining
J3 (u, Q) = sup J1 (u, Q, N),
N∈A∗+
and
J˜3∗ (Q) = inf J3 (u, Q),
u∈U
then
δ J˜3∗ (Q0 ) = 0,
δ 2 J˜3∗ (Q0 ) ≥ 0,
404 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
1
Z
Jˆ∗ (Q, N) = −F̂K∗ (Q) + HK∗ (Q, N) − H αβ λ µ Nαβ Nλ µ dx,
2 Ω
where
1 K
Z Z
F̂K∗ (Q) = sup hw,α , Qα iL2 − h w w dx − w,α w,α dx + hw, PiL2 .
u∈U 2 Ω αβ λ µ ,αβ ,λ µ 2 Ω
we have that
Jˆ∗ (Q0 , N0 ) = J(u0 ),
δ {Jˆ∗ (Q0 , N0 ) − h(N0 )αβ ,β + Pα , (u0 )α iL2 } = 0,
and there exist r, r1 , r2 > 0 such that
∂ J1 (u0 , Q0 , N0 )
= 0.
∂N
Moreover, there exists r > 0, r2 > 0 such that
and (we justify that the first infimum Q in this equation (18.41) is well defined in the next lines)
∂ J1 (u0 , Q0 , N0 )
= 0.
∂Q
Hence, from
∂ J(u0 )
= 0,
∂u
from the implicit function theorem and chain rule, we get
∂ J(u0 )
0 =
∂u
∂ J1 (u0 , Q0 , N0 )
=
∂u
∂ J1 (u0 , Q0 , N0 ) ∂ Q0
+
∂Q ∂u
∂ J1 (u0 , Q0 , N0 ) ∂ N0
+
∂N ∂u
∂ J1 (u0 , Q0 , N0 )
= . (18.42)
∂u
Therefore,
∂ J˜∗ (Q0 )
∂Q
∂ J1 (u0 , Q0 , N0 )
=
∂Q
∂ J1 (u0 , Q0 , N0 ) ∂ u0
+
∂u ∂Q
∂ J1 (u0 , Q0 , N0 ) ∂ N0
+
∂N ∂Q
= 0. (18.43)
∂ J1 (u0 , Q0 , N0 )
= 0,
∂N
that is,
that is,
so that
∂ (N0 )αβ
∂ Qρ
= H αβ λ µ − (w0 )λ (−N0 )Kαβ (w0 )µ ((−N0 )Kλ µ (w0 )ρ ). (18.47)
δ 2 J˜∗ (Q0 ) ≥ 0.
J(u0 ) = J3 (u0 , Q0 )
= inf ∗ J3 (u, Q)
(u,Q)∈U×Y2
≤ inf J3 (u, Q)
Q∈Y2∗
= J(u), ∀u ∈ U. (18.49)
Moreover,
Hence,
J(u0 ) = J˜3∗ (Q0 ) ≤ inf J3 (u, Q) = J˜3∗ (Q), ∀Q ∈ Y2∗ .
u∈U
δ 2 J˜3∗ (Q0 ) ≥ 0.
Denoting,
1
Z
Jˆ(u, N) = hαβ λ µ w,αβ w,λ µ dx
2 Ω
1 1
Z Z
+ N w,α w,β dx − H αβ λ µ Nαβ Nλ µ dx
2 Ω αβ 2 Ω
−hNαβ ,β − Pα , uα iL2 , (18.52)
and
18.5 Conclusion
In this chapter we have developed a new primal dual variational formulation and a multi-duality principle
applied to a non-linear model of plates.
About the primal dual formulation, we emphasize such a formulation is concave so that it is very inter-
esting from a numerical analysis point of view.
Finally, the results here presented may be also developed in a similar fashion for a large class of problems,
including non-linear models in elasticity and other non-linear models of plates and shells.
Chapter 19
19.1 Introduction
In this chapter, we develop duality principles applicable to primal variational formulations found in the
non-linear elasticity theory. As a first application, we establish the concerning results in details for one and
three-dimensional models. We emphasize such duality principles are applicable to a larger class of variational
optimization problems, such as non-linear models of plates and shells and other models in elasticity. Finally,
we formally prove there is no duality gap between the primal and dual formulations, in a local extremal
context.
About the references, this article in some sense extends and complements the original works of Telega,
Bielski and their co-workers [11, 10, 76, 77]. In particular in [11], published in 1985 and in [77], for three-
dimensional elasticity and related models, the authors established duality principles and concerning global
optimality conditions, for the special case in which the stress tensor is positive definite at a critical point. In
this specific sense, the present work complements such previous ones, considering we establish a sufficient
condition for local minimality which does not require the stress tensor to be either positive or negative defined
along the concerning domain. Such an optimality condition is summarized by the condition kux k∞ < 1/4 at
a critical point.
The tools of convex analysis and duality theory here used may be found in [14, 33, 64]. Existence of
results in non-linear elasticity and related models may be found in [29, 30, 31].
Finally, details on the function spaces addressed may be found in [1].
At this point, we start to describe the primal variational formulation for the one-dimensional model.
Let Ω = [0, L] ⊂ R be an interval which represents the axis of a straight bar of length L and constant cross
section area A.
We denote by u : [0, L] → R the field of axial displacements for such a bar, resulting from the application
of an axial load field P ∈ C([0, L]).
We also denote
U = {u ∈ C1 ([0, L]) : u(0) = u(L) = 0},
410 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
Û = {u ∈ U : kux k∞ < 1/4}.
The energy for such a system, denoted by J : U → R, is expressed as
EA L 1 2 2
Z Z L
J(u) = ux + ux dx − Pu dx, ∀u ∈ U,
2 0 2 0
where
kvkU = max {|v(x)| + |vx (x)|}, ∀v ∈ U.
x∈[0,L]
Moreover, defining V = C([0, L]), for z∗ ∈ V and r1 > 0, we shall generically also denote
where
kvkV = kvk∞ = max |v(x)|, ∀v ∈ V.
x∈[0,L]
Similar corresponding standard notations are valid for V ×V and the 3-dimensional model.
and
(v∗1 )2
Z L Z L
1 1
= dx + (v∗2 + z∗ )2 dx, (19.2)
2 0 z + v∗2 + K
∗ 2E A 0
if v∗2 + z∗ + K > 0, in Ω.
Define also,
A∗ = {v∗ = (v∗1 , v∗2 ) ∈ V ×V : (v∗1 )x + (v∗2 )x + P = 0, in Ω},
K = EA/2,
ẑ∗ = K(u0 )x ,
1
v̂2 = E A (u0 )x + (u0 )x − ẑ∗ ,
∗ 2
2
v̂∗1 = (ẑ∗ + v̂∗2 + K)(u0 )x .
Under hypotheses and definitions, we have
Also,
EA 1 EA
v̂∗2 + zˆ∗ + = 2
EA (u0 )x + (u0 )x +
2 2 2
EA 1 1 1
> − EA +
2 4 2 16
1 1
= EA −
4 32
7
= EA . (19.6)
32
412 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
∂ 2 J ∗ (v̂∗ , ẑ∗ ) 1 1 32
> −
∂ (z∗ )2 EA 16 (7 EA)
1 2
= 1−
EA 7
5
=
7EA
> 0, in Ω. (19.7)
Thus, we may infer that there exists r1 , r2 > 0 such that J ∗ (v∗ , z∗ ) is convex in z∗ and concave in v∗ , on
Now, denoting
Jˆ(v∗ , z∗ , u) = J ∗ (v∗ , z∗ ) − hu, (v∗1 )x + (v∗2 )x + PiL2
we obtain
∂ Jˆ∗ (v̂∗ , ẑ∗ , u0 ) z∗ 1 (v̂∗1 )2 v̂∗2 + ẑ∗
= + −
∂ z∗ K 2 (v̂∗2 + ẑ∗ + K)2 EA
EA (u0 )x + 12 (u0 )2x
1
= (u0 )x + (u0 )2x −
2 EA
= 0, in Ω. (19.8)
Also,
and
∂ Jˆ∗ (v̂∗ , ẑ∗ , u0 ) 1 (v∗1 )2 v̂∗2 + ẑ∗
= − + (u0 )x
∂ v∗2 2 (v̂∗2 + ẑ∗ + K )2 EA
EA (u0 )x + 21 (u0 )2x
1
= (u0 )x + (u0 )2x −
2 EA
= 0, in Ω. (19.10)
Finally,
from this, the last result and from the min-max theorem, we have
J ∗ (v̂∗ , ẑ∗ ) = Jˆ∗ (v̂∗ , ẑ∗ , u0 )
( )
∗ ∗ ∗
= sup inf Jˆ (v , z , u0 )
v∗ ∈Br2 (v̂∗ ) z∗ ∈Br1 (ẑ∗ )
( )
= sup inf Jˆ∗ (v∗ , z∗ ) . (19.12)
v∗ ∈Br2 (v̂∗ )∩A∗ z∗ ∈Br1 (ẑ∗ )
From the first equation in such a result we may also obtain the standard second order necessary condition
indicated in (19.3).
The proof is complete.
and Γ = Γ0 ∪ Γ1 , Γ0 ∩ Γ1 = 0.
/ We assume |Γ0 | > 0 where |Γ0 | denotes the Lebesgue measure of Γ0 .
The stress tensor is denoted by {σi j }, where
1
σi j = Hi jkl (uk,l + ul,k + um,k um,l ) , (19.18)
2
where Z
hu, f iL2 (Ω;R3 ) = fi ui dx.
Ω
Remark 19.3.1 By a regular Lipschitzian boundary Γ of Ω we mean regularity enough so that the standard
Gauss-Green formulas of integrations by parts to hold. Also, we denote by 0 the zero vector in appropriate
function spaces.
About the references, similarly as for the one-dimensional case, we refer to [76, 77, 10, 11] as the first
articles to deal with the convex analysis approach applied to non-convex and non-linear mechanics models.
Indeed, the present work complements such important original publications, since in these previous results
On Duality Principles for One and Three-Dimensional Non-Linear Models in Elasticity 415
the complementary energy is established as a perfect duality principle for the case of positive definiteness of
the stress tensor (or the membrane force tensor, for plates and shells models) at a critical point.
We have relaxed such constraints, allowing to some extent, the stress tensor to not be necessarily either
positive or negative definite in Ω. Similar problems and models are addressed in [14].
Moreover, we highlight again that existence results for models in elasticity are addressed in [29, 30, 31].
Finally, the standard tools of convex analysis here used may be found in [33, 78, 64, 14].
and
and
A2 = {v∗ = (v∗1 , v∗2 ) ∈ V ×V : (v∗1 )i j n j + (v∗2 )i j n j − fˆi = 0, on Γt },
and let K > 0 be such that
Di jkl 3
M= − δi j − H i jkl
K 32K
is a positive definite tensor, where
1, if i = k and j = l,
Di jkl = (19.24)
0, otherwise
Hence,
∂ Jˆ∗ (v̂∗ , ẑ∗ , u0 )
= δ J(u0 ) = 0.
∂u
These last four results may be summarized by the equation
from this, the last result and from the min-max theorem, we have
∀u ∈ U, z∗ ∈ Br1 (ẑ∗ ). n o
u +u
In particular, there exists r > 0 such that if u ∈ Br (u0 ) then {z∗i j } = K i, j 2 j,i ∈ Br1 (ẑ∗ ), so that from
this and (19.34), we obtain
Z
∗ ∗ ∗ K ui, j + u j,i ui, j + u j,i
J (v̂ , zˆ ) ≤ − dx + G(u)
2 Ω 2 2
K ui, j + u j,i ui, j + u j,i
Z
+ dx − hui , fi iL2 (Ω) − hui , fˆi iL2 (Γ1 )
2 Ω 2 2
= G(u) − hui , fi iL2 (Ω) − hui , fˆi iL2 (Γ ) 1
= J(u), ∀u ∈ Br (u0 ). (19.35)
19.5 Conclusion
In this chapter we have developed some theoretical results on duality for a class of non-convex optimization
problems in elasticity. In this first approach we have developed in details duality principles and sufficient
optimality conditions for local minimality for one and three-dimensional models in elasticity. It is worth
mentioning the results may be extended to other models in elasticity and to other models of plates and shells.
Chapter 20
20.1 Introduction
In this article we develop a new primal dual variational formulation suitable for a large class of non-convex
problems in the calculus of variations.
The results are obtained through basic tools of convex analysis, duality theory, the Legendre transform
concept and the respective relations between the primal and dual variables. The novelty here is that the dual
formulation is established also for the primal variables, however with a large domain region of concavity
about a critical point.
We formally prove there is no duality gap between the primal and dual formulations in a local extremal
context.
We emphasize, our work, in some sense, generalizes, extends and complements the original Telega, Biel-
ski and their co-workers results in the articles [10, 11, 77, 76].
The convex analysis results here used may be found in [33, 78, 64, 14], for example. Similar results for
other problems may be found in [14].
Finally, details on the function spaces addressed may be found in [1].
At this point we start to describe the primal formulation.
Let Ω ⊂ R3 be an open, bounded and connected set with a regular (Lipschitzian) boundary denoted by
∂ Ω.
Consider the functional J : V → R where
where Z
γ
G0 (∇u) = ∇u · ∇u dx,
2 Ω
A Primal Dual Variational Formulation Suitable for a Large Class of Non-Convex Problems in Optimization 421
Z
α
G1 (u) = u4 dx,
4 Ω
(−β + K − ε)
Z
G2 (u) = u2 dx,
2 Ω
K
Z
F(u) = u2 dx
2 Ω
and Z
ε
G3 (u) = u2 dx
2 Ω
so that
Z Z
γ α
J (u) = ∇u · ∇u dx + u4 dx
2 Ω 4 Ω
Z
β
− u2 dx − hu, f iL2 , ∀u ∈ V. (20.2)
2 Ω
Here dx = dx1 dx2 dx3 , α > 0, β > 0, γ > 0, ε > 0, K > β + ε, f ∈ C(Ω) and
V = {u ∈ C2 (Ω) : u = 0, on ∂ Ω}.
kukV = max{|u(x)| + |ux (x)| + |uy (x)| + |uz (x)| + |uxy (x)| + |uxz (x)| + |uyz (x)|
x∈Ω
+|uxx (x)| + |uyy (x)| + |uzz (x)|}, ∀u ∈ V, (20.3)
where
1
Z
F ∗ (− div z∗0 + z∗1 + z∗2 ) = (− div z∗0 + z∗1 + z∗2 )2 dx,
2K Ω
Also,
and
G∗3 ( div v∗0 − v∗1 − v∗2 + f ) = sup{−h∇u, v∗0 iL2 − hu, v∗1 iL2
u∈V
−hu, v∗2 iL2 + hu, f iL2 − G3 (u)}
= sup{−h∇u, v∗0 iL2 − hu, v∗1 iL2
u∈V
Z
ε
−hu, v∗2 iL2 + hu, f iL2 − u2 dx}
2 Ω
1
Z
= ( div v∗0 − v∗1 − v∗2 + f )2 dx. (20.8)
2ε Ω
Specifically from
∂ J ∗ (v∗ , z∗ )
= 0,
∂ z∗0
we get
∂ G∗0 (v∗0 + z∗0 )
∗
∂ F (− div z∗0 + z∗1 + z∗2 )
− +∇ = 0,
∂ z∗0 ∂ w∗0
where
w∗0 = − div z∗0
so that
3α β +ε
Z Z Z
γ
Jˆε (û) = − |∇û|2 dx − û4 dx + û2 dx
2 Ω 4 Ω 2 Ω
1
Z
− (−γ∇2 û + αû3 − (β + ε)û − f )2 dx. (20.15)
2ε Ω
Assume u0 ∈ V is such that
δ J(u0 ) = 0,
and
δ 2 J(u0 ) > 0.
Under such hypotheses,
δ Jˆε (u0 ) = 0,
J(u0 ) = Jˆε (u0 )
and for ε > 0 sufficiently small,
(δ 2 J(u0 ) − ε )2
δ 2 Jˆε (u0 ) = −(δ 2 J (u0 ) − ε) −
ε
1
≈ −O
ε
< 0, (20.16)
A Primal Dual Variational Formulation Suitable for a Large Class of Non-Convex Problems in Optimization 425
−γ∇2 u0 + αu30 − β u0 − f = 0, in Ω.
Hence
−γ∇2 u0 + αu30 − (β + ε)u0 − f
uˆ = − .
ε
From such an expression for û we obtain
20.3 Conclusion
In this chapter we have developed a primal dual variational formulation for a large class of problems in the
calculus of variations.
We emphasize the dual functional obtained has a large domain region of concavity about a critical point,
which makes such a formulation very interesting from a numerical analysis point of view.
Finally, it has been formally proven there is no duality gap between the primal and dual formulations in
a local extremal context.
Chapter 21
21.1 Introduction
Consider an elastic solid which the volume corresponds to an open, bounded, connected set, denoted by
Ω ⊂ R3 with a regular (Lipschitzian) boundary denoted by ∂ Ω = Γ0 ∪ Γt where Γ0 ∩ Γt = 0/. Consider also
the problem of minimizing the functional Jˆ : U × B → R where
1 1
Jˆ(u,t) = hui , fi iL2 (Ω) + hui , fˆi iL2 (Γt ) ,
2 2
subject to
(Hi jkl (t)ekl (u)), j + fi = 0 in Ω,
(21.1)
Hi jkl (t)ekl (u)n j − fˆi = 0, on Γt , ∀i ∈ {1, 2, 3}.
Z
B = t : Ω → [0, 1] measurable : t(x) dx = t1 |Ω| ,
Ω
where
0 < t1 < 1
and |Ω| denotes the Lebesgue measure of Ω.
Moreover, u = (u1 , u2 , u3 ) ∈ W 1,2 (Ω; R3 ) is the field of displacements relating the cartesian system
(0, x1 , x2 , x3 ), resulting from the action of the external loads f ∈ L2 (Ω; R3 ) and fˆ ∈ L2 (Γt ; R3 ).
428 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
1
ei j (u) = (ui, j + u j,i ), ∀i, j ∈ {1, 2, 3}.
2
Finally,
{Hi jkl (t)} = {tHi0jkl + (1 − t)Hi1jkl },
where H 0 corresponds to a strong material and H 1 to a very soft material, intending to simulate voids along
the solid structure.
The variable t is the design one, which the optimal distribution values along the structure are intended to
minimize its inner work with a volume restriction indicated through the set B.
The duality principle obtained is developed inspired by the works in [11, 10]. Similar theoretical results
have been developed in [14], however we believe the proof here presented, which is based on the min-max
theorem is easier to follow (indeed we thank an anonymous referee for his suggestion about applying the min-
max theorem to complete the proof). We highlight throughout this text we have used the standard Einstein
sum convention of repeated indices.
Moreover, details on the Sobolev spaces addressed may be found in [1]. In addition, the primal variational
development of the topology optimization problem has been described also in [14].
The main contributions of this work are to present the detailed development, through duality theory,
for such a kind of optimization problems. We emphasize that to avoid the check-board standard and obtain
appropriate robust optimized structures without the use of filters, it is necessary to discretize more in the load
direction, in which the displacements are much larger.
Finally, it is worth mentioning the numerical examples presented have been developed in a Finite Element
(FE) context, based on the work of [70].
Theorem 21.2.1 Consider the statements and assumptions indicated in the last section, in particular those
refereing to Ω and the functional Jˆ : U × B → R.
Define J1 : U × B → R by
where
1
Z
G(e(u),t) = Hi jkl (t)ei j (u)ekl (u) dx,
2 Ω
and where
dx = dx1 dx2 dx3 .
Define also J ∗ : U → R by
and
Hi1jkl zi j zkl > c1 zi j zi j , ∀z = {zi j } ∈ R3×3 , such that z 6= 0.
Finally, define J : U × B → R ∪ {+∞} by
where
if (u,t) ∈ A∗ ,
0,
Ind(u,t) = (21.3)
+∞, otherwise ,
where A∗ = A1 ∩ A2 ,
and
A2 = {(u,t) ∈ U × B : σi j (u)n j − fˆi = 0, on Γt , ∀i ∈ {1, 2, 3}}.
Under such hypotheses, there exists (u0 ,t0 ) ∈ U × B such that
where
C1 = {σ ∈ Y ∗ : σi j, j + fi = 0, in Ω, ∀i ∈ {1, 2, 3}}
and
C2 = {σ ∈ Y ∗ : σi j n j − fˆi = 0, on Γt , ∀i ∈ {1, 2, 3}}.
430 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Also, from this and the min-max theorem, there exist (u0 ,t0 ) ∈ U × B such that
inf J(u,t) = inf sup J1 (u,t)
(u,t)∈U×B t∈B û∈U
= sup inf J1 (u,t)
u∈U t∈B
= J1 (u0 ,t0 )
= inf J1 (u0 ,t)
t∈B
= J ∗ (u0 ). (21.7)
∂ J1 (u0 ,t0 )
=0
∂u
we obtain
(Hi jkl (t0 )ekl (u0 )), j + fi = 0 in Ω,
and
Hi jkl (t0 )ekl (u0 )n j − fˆi = 0 on Γt , ∀i ∈ {1, 2, 3},
so that
1 1
G(e(u0 )) = h(u0 )i , fi iL2 (Ω) + h(u0 )i , fˆi iL2 (Γt ) .
2 2
Hence, (u0 ,t0 ) ∈ A∗ so that Ind(u0 ,t0 ) = 0 and σ (u0 ) ∈ C∗ .
A Duality Principle for a Class of Optimal Design Problems 431
Moreover
J ∗ (u0 ) = −G(e(u0 )) + h(u0 )i , fi iL2 (Ω) + h(u0 )i , fˆi iL2 (Γt )
= G(e(u0 ))
= G(e(u0 )) + Ind(u0 ,t0 )
= J(u0 ,t0 )
= G∗ (σ (u0 ),t0 ). (21.8)
This completes the proof.
4. Obtain tn+1 by
tn+1 = arg min J1 (un ,t).
t∈B
H(t)U = f, (21.10)
where H(t) is the global stiffness matrix, U is the global deflection and f is the global forces vector.
Thus, for such a FE models (N elements where e ∈ {1, ..., N}), the primal optimization problem can be
written in matrix form as
1
min Jˆ(u,t) = UT H(t)U
2
1 N
= ∑ (te ) p uTe He ue
2 e=1
s.t. (te ) p He ue = fe (21.11)
N
∑ teVe = t1 |Ω|
e=1
0≤t ≤1
e = 1, 2, 3, ..., N,
432 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where u = (u, v) ∈ W 1,2 (Ω; R2 ), ex (u) = ux , ey (u) = vy and exy (u) = 21 (uy + vx ).
Moreover, the stress tensor σ (e(u)) is given by
σ (e(u)) = He(u),
where
1 ν 0
E(t)
H= ν 1 0 (21.13)
1 − ν2 1
0 0 (1 −
2 ν)
and
E(t) = tE0 + (1 − t)E1 ,
where E0 = 210 ∗ 109
(the modulus of Young) and E1 E0 . Moreover, ν = 0.33.
As previously mentioned, we present four numerical simulations.
Case 1. For the first case see Figure 21.1, on the left, for the concerning case, Figure 21.1, in the middle,
for the optimal topology for this case with no filter, Figure 21.1, on the right, for the optimal topology for
this first case with filter. The objective function as function of iteration numbers also for such a case with no
filter, Figure 21.2, on the left, and the objective function as function of iteration numbers also for this first
case with filter, Figure 21.2, on the right.
A Duality Principle for a Class of Optimal Design Problems 433
0.5 0.5
0 0
1 1
Figure 21.1: On the left, a clamped beam at x = 0 (cantilever beam). In the middle, the optimal topology for t1 = 0.5, for
the case with no filter. On the right, the optimal topology for t1 = 0.5, for the case with filter. The FE mesh was 60 x 50.
35.2 200
35.1
150
35
34.9 100
34.8
50
34.7
34.6 0
0 2 4 6 8 10 12 0 20 40 60 80 100
Figure 21.2: On the left, the objective function by iteration numbers for t1 = 0.5, for the case with no filter. On the right,
the objective function by iteration numbers for t1 = 0.5, for the case with filter.
0.5 0.5
0 0
1 1
Figure 21.3: On the left, a simply supported beam at x = 0 and x = 1. In the middle the optimal topology for t1 = 0.5,
for the case with no filter. On the right the optimal topology for t1 = 0.5, for the case with filter. The FE mesh was 40 x
50.
Case 2. For the second case see Figure 21.3, on the left, for the concerning case, Figure 21.3, in the middle,
for the optimal topology for this case with no filter, Figure 21.3, on the right, for the optimal topology for this
second case with filter. The objective function as function of iteration numbers also for such a case with no
filter, Figure 21.4, on the left, and the objective function as function of iteration numbers also for this second
case with filter, Figure 21.4, on the right.
Case 3. For the third case see Figure 21.5, on the left, for the concerning case, Figure 21.5, in the middle,
for the optimal topology for this case with no filter, Figure 21.5, on the right, for the optimal topology for
this third case with filter. The objective function as function of iteration numbers also for such a case with no
434 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
×10 4
8040 4.5
8020 4
3.5
8000
3
7980
2.5
7960
2
7940
1.5
7920 1
7900 0.5
0 2 4 6 8 10 12 0 20 40 60 80 100
Figure 21.4: On the left the objective function by iteration numbers for t1 = 0.5, for the case with no filter. On the right
the objective function by iteration numbers for t1 = 0.5, for the case with filter.
0.6 0.6
0 0
1 1
Figure 21.5: On the left a beam with a hole clamped at x = 0. In the middle the optimal topology for t1 = 0.5, for the
case with no filter. On the right the optimal topology for t1 = 0.5, for the case with filter. The FE mesh was 50 x 40.
76.5 600
76.4 500
76.3 400
76.2 300
76.1 200
76 100
75.9 0
0 2 4 6 8 10 0 20 40 60 80 100
Figure 21.6: On the left the objective function by iteration numbers for t1 = 0.5, for the case with no filter. On the right
the objective function by iteration numbers for t1 = 0.5, for the case with filter.
filter, Figure 21.6, on the left, and the objective function as function of iteration numbers also for this third
case with filter, Figure 21.6, on the right.
Case 4. For the fourth case see Figure 21.7, on the left, for the concerning case, Figure 21.7, in the middle,
for the optimal topology for this case with no filter, Figure 21.7, on the right, for the optimal topology for this
fourth case with filter. The objective function as function of iteration numbers also for such a case with no
A Duality Principle for a Class of Optimal Design Problems 435
1 1
0.5 0.5
0 0
0 0.5 1 0.5 1
Figure 21.7: On the left a L shape beam clamped at y = 1. In the middle the optimal topology for t1 = 0.5, for the case
with no filter. On the right the optimal topology for t1 = 0.5, for the case with filter. The FE mesh was 40x60.
×10 5 ×10 6
5.7 14
5.65
12
10
5.6
8
5.55
6
5.5
4
5.45
2
5.4 0
0 2 4 6 8 10 0 20 40 60 80 100
Figure 21.8: On the left the objective function by iteration numbers for t1 = 0.5, for the case with no filter. On the right
the objective function by iteration numbers for t1 = 0.5, for the case with filter.
filter, Figure 21.8, on the left, and the objective function as function of iteration numbers also for this fourth
case with filter, Figure 21.8, on the right.
We emphasize to have obtained in both optimized structures, without filter and with filter, robust topology
from a structural point of view. One can note also in the figures that in all cases the objective functions,
without filter and with filter, have similar final values, which indicates the results obtained are consistent.
We emphasize, to obtain good and consistent results, it is necessary to discretize more in the direction
y, that is, the load direction, in which the displacements are much larger.
If we do not discretize enough in the load direction, for the software with no filter, a check-board
standard in the material distribution is obtained in some parts of the concerning structure.
Summarizing, with no filter, the check-board problem is solved by increasing the discretization in
the load direction.
Moreover, with the OC optimizer with filter, the volume fraction of material is kept constant in 0.5
at each iteration during the optimization process, whereas for the case with no filter we start with a
volume fraction of 0.95 which is gradually decreased to the value 0.5, using as the initial solution for
a iteration with a specific volume fraction, the solution of the previous one.
We also highlight the result obtained with no filter is indeed a critical point for the original optimiza-
tion problem, whereas there is some heuristic in the procedure with filter.
Once more we emphasize to have obtained more robust and consistent shapes by properly discretizing
the approximate model in a FE context.
Finally, it is also worth mentioning, we have obtained similar final objective function values without
and with filter in all examples, even though without filter such values have been something smaller,
as expected. The qualitative differences between the graphs without and with filter, for the objective
function as function of the number of iterations, refer to the differences between the optimization
processes, where in the case with filter the volume fraction is kept 0.5 and without filter it is gradually
decreased from 0.95 to 0.5, as above described.
We highlight the results obtained may be applied to other problems, such as other models of plates, shells
and elasticity.
Chapter 22
22.1 Introduction
In this work we present three theorems which represent duality principles suitable for a large class of non-
convex variational problems.
At this point we refer to the exceptionally important article, “A contribution to contact problems for a
class of solids and structures” by Bielski and Telega [11], published in 1985, as the first one to successfully
apply and generalize the convex analysis approach to a model in non-convex and non-linear mechanics.
The present work is, in some sense, a kind of extension of this previous work [11] and others such as
[10], which greatly influenced and inspired my work and recent book [14].
We extend and generalize the approaches in [11] and [78] and develop two multi-duality principles
through which we classify qualitatively the critical points.
Thus, we emphasize the first multi-duality principle generalizes some Toland results found in [78] and in
some appropriate sense, such a work complements the results presented in [11], now applied to a Ginzburg-
Landau type model context.
On the other hand, the conclusions of the second multi-duality principle may be qualitatively found in
similar form in the triality approach found in [83] and other references therein, even though the construction
of the present result and the respective proofs be substantially different.
About the model in physics involved, we recall that about the year 1950, Ginzburg and Landau introduced
a theory to model the super-conducting behavior of some types of materials below a critical temperature Tc ,
which depends on the material in question. They postulated the free density energy may be written close to
Tc as
h̄ α(T ) β (T )
Z Z Z
Fs (T ) = Fn (T ) + |∇ψ|22 dx + |ψ|4 dx − |ψ|2 dx,
4m Ω 4 Ω 2 Ω
where ψ is a complex parameter, Fn (T ) and Fs (T ) are the normal and super-conducting free energy densi-
ties, respectively (see [4, 55] for details). Here Ω ⊂ R3 denotes the super-conducting sample with a boundary
denoted by ∂ Ω = Γ. The complex function ψ ∈ W 1,2 (Ω; C) is intended to minimize Fs (T ) for a fixed tem-
perature T .
438 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Denoting α(T ) and β (T ) simply by α and β , the corresponding Euler-Lagrange equations are given by:
h̄
− 2m ∇2 ψ + α|ψ|2 ψ − β ψ = 0, in Ω
(22.1)
∂ψ
∂n = 0, on ∂ Ω.
This last system of equations is well known as the Ginzburg-Landau (G-L) one in the absence of a magnetic
field and respective potential.
Remark 22.1 About the notation, for an open bounded subset Ω ⊂ R3 , we denote the L2 (Ω) norm by
k · kL2 (Ω) or simply by k · k2 . Similar remark is valid for the L2 (Ω; R3 ) norm, which is denoted by k · kL2 (Ω;R3 )
or simply by k · k2 , when its meaning is clear. On the other hand, by | · |2 we denote the standard Euclidean
norm in R3 or C3 .
Moreover derivatives are always understood in the distributional sense. Also, by a regular Lipschitzian
boundary ∂ Ω = Γ of Ω, we mean regularity enough so that the standard Sobolev Imbedding theorems, the
trace theorem and Gauss-Green formulas of integration by parts to hold. Details about such results may be
found in [1].
´
Finally, in general δ F(u, v) will denote the Frechet derivative of the functional F(u, v) at (u, v),
∂ F(u, v)
δu F(u, v) or
∂u
´
denotes the first Frechet derivative of F relating the variable u and
∂ 2 F(u, v)
δ 2 Fu,v (u, v) or
∂ u∂ v
denotes the second one relating the variables u and v, always at (u, v).
At some point of our analysis, we shall assume a finite-dimensional approximate model version. In such
a context, we remark that generically in a matrix sense the notation
1
K + γ∇2
will indicate the inverse
(KId + γ∇2 )−1 ,
where Id denotes the identity matrix and ∇2 is the matrix originated by a discretized version of the Laplace
operator. We also emphasize that, as the meaning is clear, other similar notations may be used to indicate the
inverse of matrices or operators.
Remark 22.2 For an appropriate set Ω ⊂ R3 and a space U, our primal functional J : U → R is specified
by Z Z
γ α
J(u) = ∇u · ∇u dx + (u2 − β )2 dx − hu, f iL2
2 Ω 2 Ω
and
K
Z Z
α
G(u, v) = (u2 − β + v)2 dx + u2 dx − hu, f iL2 ,
2 Ω 2 Ω
so that
J(u) = G(u, 0) − F(u), ∀u ∈ U.
Let F ∗ : Y ∗ → R denote the polar functional related to F, that is,
2v∗0 + K > 0, in Ω,
we also define
G∗ (v∗1 , v∗0 ) = sup {hu, v∗1 iL2 + hv, v∗0 iL2 − G(u, v)},
(u,v)∈U×Y
Defining w = u2 − β + v, so that
v = w − u2 + β ,
we may write
1 (v∗1 + f )2 1
Z Z
G∗ (v∗1 , v∗0 ) = ∗ dx + (v∗0 )2 dx
2 Ω 2v0 + K 2α Ω
Z
+β v∗0 dx. (22.5)
Ω
Theorem 22.2.1 Let Ω ⊂ R3 be an open, bounded, connected set with a regular (Lipschitzian) boundary
denoted by ∂ Ω. Suppose J : U → R is a functional defined by
Z Z
γ α
J(u) = ∇u · ∇u dx + (u2 − β )2 dx − hu, f iL2
2 Ω 2 Ω
δ J(u0 ) = 0
Here {u0 (i)2 } denotes the diagonal matrix which the diagonal is given by the vector [u0 (i)2 ].
Also, from now on, as the meaning is clear, we shall denote such a second Frec´ het derivative simply by
Define
K
Z Z
γ
F(u) = − ∇u · ∇u dx + u2 dx,
2 Ω 2 Ω
Existence and Duality Principles for the Ginzburg-Landau System in Superconductivity 441
and G∗ : Y ∗ ×Y ∗ → R = R ∪ {+∞} by
G∗ (v∗1 , v∗0 ) = sup {hv, v∗0 iY + hv1 , v∗1 iY − G(v1 , v)}
(v1 ,v)∈Y ×Y
1 (v∗1 + f )2 1
Z Z
= ∗ dx + (v0∗ )2 dx
2 Ω 2v0 + K 2α Ω
Z
+β v∗0 dx, (22.7)
Ω
Define
v̂∗0 = α(u20 − β ),
and
v̂∗1 = (2v̂∗0 + K)u0 − f .
Suppose v̂∗0 ∈ A∗ .
Under such hypotheses,
δ J˜∗ (v̂∗1 ) = 0,
and
J˜∗ (v̂∗1 ) = J(u0 ).
Moreover,
1. if δ 2 J(u0 ) > 0, then
δ 2 J˜∗ (v̂∗1 ) > 0,
so that there exist r > 0 and r1 > 0 such that
J(u0 ) = min J(u)
u∈Br (u0 )
= J˜∗ (v̂∗1 )
= J ∗ (v̂∗1 , v̂∗0 ). (22.8)
442 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
we have
δ J2∗ (v̂∗1 ) = 0,
δ 2 J2∗ (v̂∗1 ) > 0
and
= J2∗ (v̂∗1 )
= J ∗ (v̂∗1 , v̂∗0 ), (22.9)
where
J2∗ (v∗1 ) = sup J ∗ (v∗1 , v∗0 ).
v∗0 ∈A∗ ∩B∗
= J˜∗ (v̂∗1 )
= J ∗ (v̂∗1 , v̂∗0 ). (22.10)
and thus
∂ J ∗ (v̂∗1 , v̂∗0 )
= 0.
∂ v∗0
From this, (22.11), from the definition of v̂∗0 and the concavity of J ∗ (v̂∗1 , v∗0 ) in v∗0 , we obtain
= J ∗ (v̂∗1 , v̂∗0 )
= −G∗ (v̂∗1 , v̂∗0 ) + F ∗ (v̂∗1 )
= −hu0 , v̂∗1 iL2 − h0, v̂∗0 iL2 + G(u0 , 0)
+hu0 , v̂∗1 iL2 − F(u0 )
= G(u0 , 0) − F(u0 )
= J(u0 ). (22.12)
(v̂∗1 + f )2 v̂∗
∗ 2
− 0 − β = 0,
(2v̂0 + K) α
so that taking the variation in v∗1 of this equation in both sides, we obtain
and thus,
2(v̂∗1 + f )
∂ v̂∗0 (2v̂∗0 +K)2
= , (22.17)
∂ v∗1 1 4(v̂∗ + f )2
+ (2vˆ∗1+K)3
α 0
Therefore, denoting
4(v̂∗1 + f )2 4αu20
H = 1+α ∗ = 1 + ,
(2v̂0 + K)3 2v̂∗0 + K
we have
∂ 2 J˜∗ (v̂∗1 )
∂ (v∗1 )2
4(v̂∗1 + f )2
1 1
= 1+α − /H
(2v̂∗0 + K)3 K + γ∇2 2v̂∗0 + K
4αu2
1 1
= 1+ ∗ 0 − /H
2v̂0 + K K + γ∇2 2v̂∗0 + K
= (2v̂∗0 + K + 4αu02 − K − γ∇2 )/[(K + γ∇2 )(v̂∗0 + K)H]
= [−γ∇2 + 6αu20 − 2αβ ]/[(K + γ∇2 )(v̂∗0 + K)H]
δ 2 J(u0 )
= . (22.19)
[(K + γ∇2 )(v̂∗0 + K)H]
Existence and Duality Principles for the Ginzburg-Landau System in Superconductivity 445
= J˜∗ (v̂∗1 )
= J ∗ (v̂∗1 , v̂∗0 ). (22.20)
Assume now −γ∇2 + 2v̂∗0 > 0, so that δ 2 J(u0 ) > 0.
Observe that if v∗0 ∈ A∗ and −γ∇2 + 2v∗0 > 0, then
∂ 2 J ∗ (v1 , v∗0 ) 1 1
∗ 2
= 2
− ∗ > 0,
∂ (v1 ) K + γ∇ 2v0 + K
so that J ∗ (v∗1 , v∗0 ) is convex in v∗1 , ∀v∗0 ∈ A∗ ∩ B∗ .
Hence
J2∗ (v∗1 ) = sup J ∗ (v∗1 , v∗0 ),
v∗0 ∈A∗ ∩B∗
Moreover,
J2∗ (v̂∗1 ) = min J2∗ (v∗1 )
v∗1 ∈Y ∗
≤ J2∗ (v∗1 )
= sup J ∗ (v∗1 , v∗0 )
v∗0 ∈A∗ ∩B∗
(v∗1 )2
Z
1
= sup dx
∗ ∗
v ∈A ∩B∗ 2 Ω (K + γ∇2 )
0
(v∗1 + f )2 (v∗0 )2
1 1
Z Z Z
− ∗ dx − dx − β v∗0 dx
2 Ω 2v0 + K 2 Ω α Ω
(v∗1 )2
Z
1
≤ sup 2
dx
v∗ ∈A∗ ∩B∗ 2 Ω (K + γ∇ )
0
u2 (v∗0 )2
1
Z Z Z
−hu, v∗1 + f iL2 + (2v∗0 + K) dx − dx − β v∗0 dx
Ω 2 2 Ω α Ω
(v∗1 )2
Z
1
≤ sup 2
dx
v∗ ∈Y ∗ 2 Ω (K + γ∇ )
0
u2 (v∗0 )2
1
Z Z Z
−hu, v∗1 + f iL2 + (2v∗0 + K) dx − dx − β v∗0 dx
Ω 2 2 Ω α Ω
1 (v∗1 )2
Z
= dx
2 Ω (K + γ∇2 )
K
Z Z
α
−hu, v∗1 iL2 + (u2 − β )2 dx + u2 dx
2 Ω 2 Ω
−hu, f iL2 , ∀u ∈ U, v∗1 ∈ Y ∗ . (22.21)
446 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Hence,
(v∗1 )2
Z
1
J2∗ (v̂∗1 ) ≤ inf ∗ dx
∗
v1 ∈Y 2 Ω (K + γ∇2 )
K
Z Z
α
−hu, v∗1 iL2 (u − β ) dx +
+ 2 2 2
u dx − hu, f iL2
Ω 2 2 Ω
K K
Z Z Z Z
γ α
= ∇u · ∇u dx − u2 dx + (u2 − β )2 dx + u2 dx − hu, f iL2
2 Ω 2 Ω 2 Ω 2 Ω
= J(u), ∀u ∈ U. (22.22)
= J2∗ (v̂∗1 )
= J ∗ (v̂∗1 , v̂∗0 ). (22.23)
Theorem 22.3.1 Let Ω ⊂ R3 be an open, bounded, connected set with a regular (Lipschitzian) boundary
denoted by ∂ Ω. Suppose J : U → R is a functional defined by
Z Z
γ α
J(u) = ∇u · ∇u dx + (u2 − β )2 dx − hu, f iL2
2 Ω 2 Ω
δ J(u0 ) = 0
Define,
K
Z Z
γ
F(u) = − ∇u · ∇u dx + u2 dx,
2 Ω 2 Ω
where K > 0 is such that
F(u) > 0, ∀u ∈ U such that u =
6 0,
and
K
Z Z
α 2 2
G(u, v) = (u − β + v) dx + u2 dx − hu, f iL2 ,
2 Ω 2 Ω
so that
J(u) = G(u, 0) − F(u), ∀u ∈ U.
Define also Y = Y∗ = L2 (Ω), F∗ : Y ∗ → R by
F ∗ (v∗1 ) = sup{hu, v∗1 iL2 − F(u)}
u∈U
1
Z
= v∗1 [(K + γ∇2 )−1 v∗1 ] dx, (22.24)
2 Ω
and G∗ : Y ∗ ×Y ∗ → R = R ∪ {+∞} by
G∗ (v∗1 , v∗0 ) = sup {hv, v∗0 iY + hv1 , v∗1 iY − G(v1 , v)}
(v1 ,v)∈Y ×Y
1 (v∗1 + f )2 1
Z Z
= ∗ dx + (v∗0 )2 dx
2 Ω 2v0 + K 2α Ω
Z
+β v∗0 dx, (22.25)
Ω
= J1∗ (v̂∗0 )
= J ∗ (v̂∗1 , v̂∗0 ), (22.26)
448 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
where
J1∗ (v∗0 ) = ∗inf ∗ J ∗ (v∗1 , v∗0 ).
v1 ∈Y
δ J2∗ (v̂∗0 ) = 0
and
δ 2 J(u0 )
δ 2 J2∗ (v̂∗0 ) = − > 0,
α(−γ∇2 + 2v̂∗0 )
so that there exist r, r1 > 0 such that
= J2∗ (v̂∗0 )
= J ∗ (v̂∗1 , v̂∗0 ), (22.27)
where
J2∗ (v∗0 ) = sup J ∗ (v∗1 , v∗0 ).
v∗1 ∈Y ∗
δ J2∗ (v̂∗1 ) = 0
and
δ 2 J(u0 )
δ 2 J2∗ (v̂∗0 ) = − < 0,
α(−γ∇2 + 2v̂∗0 )
so that there exist r, r1 > 0 such that
= J2∗ (v̂∗0 )
= J ∗ (v̂∗1 , v̂∗0 ), (22.28)
where
J2∗ (v∗0 ) = sup J ∗ (v∗1 , v∗0 ).
v∗1 ∈Y ∗
so that
whereas if
−γ∇2 + 2v∗0 < 0,
then
J˜(v∗0 ) = J ∗ (ṽ∗1 , v∗0 ) = sup J ∗ (v∗1 , v∗0 ).
v∗1 ∈Y ∗
v̂∗1 (v̂∗1 + f )
− = 0,
K + γ∇2 (2v̂∗0 + K)
so that taking the variation in v∗0 of this equation in both sides, we obtain
∂ v̂∗1 ∂ v̂∗1
∂ v∗0 ∂ v∗0 2(v̂∗1 + f )
− ∗ + = 0, (22.34)
(K + γ ∇2 ) (2v̂0 + K) (2v̂∗0 + K)2
Therefore,
∂ 2 J˜∗ (v̂∗0 )
∂ (v∗0 )2
1 4u2
= − − ∗ 0
α 2v̂0 + K
K + γ∇2 4u20
−
2v∗0 + K (2v̂∗0 − γ∇2 )
4u20 K + γ ∇2
1
= − − 1+ ∗
α (2v̂0 + K) 2v̂0 − γ∇2
1 4u20
= − − ∗
α (2v̂0 − γ∇2 )
γ∇2 − 2v̂∗0 − 4αu20
=
α(−γ∇2 + 2v̂∗0 )
γ∇2 − 6αu20 + 2αβ
=
α(−γ∇2 + 2v̂∗0 )
δ 2 J(u0 ) (22.37)
= − .
α (−γ∇2 + 2v̂∗0 )
= J1∗ (v̂∗0 )
= J ∗ (v̂∗1 , v̂∗0 ). (22.38)
452 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
We also denote, for a finite dimensional discretized version of this problem, in a finite elements or finite
differences context,
Also,
G∗1 (z∗ , v∗1 ) = sup {hz∗ , uiL2 + hv∗1 , viL2
(u,v)∈U ×L2
K
Z Z
α
+hu, f iL2 − (u2 − 1 + v)2 dx − u2 dx}
2 Ω 2 Ω
1 (z∗ + f )2 1
Z Z
= dx + (v∗ )2 d x
2 Ω 2v∗1 + K 2α Ω 1
Z
+ v∗1 dx
Ω
≡ G∗1L (z∗ , v∗1 ), (22.41)
if v∗1 ∈ B1 , where
B1 = {v∗1 ∈ Y ∗ : 2v∗1 + K > 0, in Ω}
and G∗1L stands for the Legendre transform of G1 .
We also denote,
B2 = {v∗1 ∈ Y ∗ :
Z Z
γ
∇u · ∇u dx + v∗1 u2 dx > 0,
2 Ω Ω
∀u ∈ U such that u =
6 0}, (22.42)
C∗ = B1 ∩ B2 ,
where
Y = Y ∗ = L2 (Ω).
Under such hypotheses,
inf J(u) ≥ sup { inf {J ∗ (z∗ , v∗1 )}}
∗ ∗
u∈U v∗1 ∈C∗ z ∈Y
where,
J ∗ (z∗ , v∗1 ) = F ∗ (z∗ ) − G∗1 (z∗ , v∗1 )
and
J˜(v∗1 ) = ∗inf ∗ J ∗ (z∗ , v∗1 ).
z ∈Y
Moreover, if there exists a critical point (z∗0 , (v∗0 )1 ) ∈ C∗ ×Y ∗ , so that
∗
δJ (z∗0 , (v∗0 )1 ) = 0,
then, denoting
z∗0 + f
u0 =
2(v∗0 )1 + K
we have that
J(u0 ) = min J(u)
u∈U
= max J˜∗ (v∗1 )
v∗1 ∈C∗
= J˜∗ ((v∗0 )1 )
= J ∗ (z∗0 , (v∗0 )1 ). (22.44)
454 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
∀v∗1 ∈ C∗ , u ∈ U, v ∈ Y, z∗ ∈ Y ∗ .
Thus,
∀u ∈ U, v∗1 ∈ C∗ , z∗ ∈ Y ∗ , so that
∀u ∈ U, v∗1 ∈ C∗ , z∗ ∈ Y ∗ .
Hence,
∀u ∈ U, v∗1 ∈ C∗ .
Thus,
inf J(u) ≥ sup J˜∗ (v∗1 ). (22.49)
u∈U v∗1 ∈C∗
δ J ∗ (z∗0 , (v∗0 )1 ) = 0.
so that
z∗0 = (KId + γ∇2 )u0 ,
and
z∗0 + f = (2(v∗0 )1 + K)u0 . (22.50)
Thus,
F ∗ (z∗0 ) = hz∗0 , u0 iL2 − F(u0 ). (22.51)
On the other hand, from the variation in v∗1 , we have,
[z∗0 + f ]2 (v∗ )1
∗ 2
− 0 − 1 = 0,
[2(v0 )1 + K ] α
Existence and Duality Principles for the Ginzburg-Landau System in Superconductivity 455
so that
(v∗0 )1 = α(u20 − 1),
and hence, from this and (22.50) we have,
and
G∗1 (z∗0 , (v∗0 )1 ) = hz∗0 , u0 iL2 − G1 (u0 , 0). (22.52)
From (22.51) and (22.52), we obtain
Now, let
v∗1 ∈ C∗ .
Observe that, in such a case,
Z Z
γ
∇u · ∇u dx + v∗1 u2 dx > 0,
2 Ω Ω
∀u ∈ U, such that u 6= 0.
Denoting
Z Z
γ ∗ 2
α1 = inf ∇u · ∇u dx + v1 u dx − hu, f iL2 , (22.54)
u∈U 2 Ω Ω
we have
K
Z Z
v∗1 u2 dx + u2 dx − hu, f iL2 − hz∗ , uiL2
Ω 2 Ω
K
Z Z
γ
≥ − ∇u · ∇u dx + u2 dx − hz∗ , uiL2 + α1 , (22.55)
2 Ω 2 Ω
∀u ∈ U, so that,
Z
K
Z
inf v∗1 u2 dx +
u2 dx − hu, f iL2 − hz∗ , uiL2
u∈U Ω Ω 2
K
Z Z
γ
≥ inf − ∇u · ∇u dx + u2 dx − hz∗ , uiL2 + α1 , (22.56)
u∈U 2 Ω 2 Ω
and hence,
1 (z∗ + f )2
Z
− ∗ ≥ −F ∗ (z∗ ) + α1 ,
2 Ω 2v1 + K
1 (z∗ + f )2
Z
F ∗ (z∗ ) − ∗ dx ≥ α1 ,
2 Ω 2v1 + K
∀z∗ ∈ Y ∗ .
456 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
(z∗ + f )2
1
Z
∗ ∗
inf F (z ) − d x = α1 ∈ R.
z∗ ∈Y ∗ 2 Ω 2v∗1 + K
From this, since (v∗0 )1 ∈ C∗ and the optimization in z∗ in question is quadratic, we may infer that,
= J˜∗ ((v∗0 )1 )
= J ∗ (z∗0 , (v∗0 )1 ). (22.57)
Ω ⊂ Ω1 .
We assume the boundaries ∂ Ω and ∂ Ω1 to be regular (Lipschitzian). Here, again im denotes the imaginary
unit and γ, α, β and ρ are positive constants. Also,
kφn k∞ ≤ K, ∀n ∈ N
Existence and Duality Principles for the Ginzburg-Landau System in Superconductivity 457
Proof 22.4
Define
α1 = inf {J(φ , A)} ∈ R.
(φ ,A)∈U
kcurl(An )k22 ≤ K1 , ∀n ∈ N.
Given (φ , A) ∈ U, define (φ 0 , A0 ) ∈ U by
φ 0 = φ eim ρϕ ,
and
A0 = A + ∇ϕ,
where ϕ will be specified in the next lines.
Observe that,
Moreover,
curl(A0 ) = curl(A) + curl(∇ϕ) = curl(A).
Also,
|φ 0 | = |φ eim ρϕ | = |φ |.
From these last calculations, we may infer the system gauge invariance, that is,
J(φ , A) = J(φ 0 , A0 ).
div(A0 ) = div(A) + ∇2 ϕ = 0,
A0 · n = A · n + ∇ϕ · n = 0, on ∂ Ω1
that is,
∇2 ϕ = −div(A), in Ω1 ,
∇ϕ · n = −A · n, on ∂ Ω1 .
458 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
However,
J(φn0 , A0n ) = J(φn , An ) → α1 , as n → ∞,
so that
inf J(φ , A) = inf J(φ 0 , A0 ).
(φ ,A)∈U (φ 0 ,A0 )∈U
k∇φn0 k k2 → +∞, as k → ∞.
Existence and Duality Principles for the Ginzburg-Landau System in Superconductivity 459
which contradicts
J(φn0 , A0n ) → α1 , as n → +∞.
Therefore, there exists K4 > 0 such that
k∇φn0 k2 ≤ K4 ∈ R+ , ∀n ∈ N.
Hence, from the Rellich- Krondrachov theorem, there exists φ0 ∈ W 1,2 (Ω; C) such that, up to a not
relabeled subsequence,
∇φn0 * ∇φ0 , weakly in L2 ,
and
φn0 → φ0 , strongly in L2 .
Also, since
kcurl(A0n )k2 ≤ K1 , ∀n ∈ N,
there exists v0 ∈ L2 (Ω1 ; R3 ) such that
Also, since
kA0n k2 ≤ K4 , ∀n ∈ N,
there exists
A0 ∈ L2 (Ω1 ; R3 ),
such that, up to a not relabeled subsequence,
Now fix
φ̂ ∈ Cc∞ (Ω1 ; R3 ).
Thus, we have
v0 = curl(A0 ),
in distributional sense.
At this point we shall prove that, up to a not relabeled subsequence, we have,
Observe that
kφn0 k∞ < K, ∀n ∈ N,
so that
|φn0 v − φ0 v|22 ≤ 2K 2 |v|22 ∈ L1 (Ω; R).
Thus, from the Lebesgue dominated convergence theorem, we obtain
kφn0 v − φ0 vk22 → 0, as n → ∞.
Hence, since
A0n * A0 , weakly in L2 (Ω1 ; R3 ),
we have,
Z
(A0n · φn0 v − A0 · φ0 v) dx
Ω
Z
= (A0n · φn0 v − A0n · φ0 v + A0n · φ0 v − A0 · φ0 v) dx
Ω
Z
≤ kA0n k2 kφn0 v − φ0 vk2 + A0n · φ0 v − A0 · φ0 v) dx
Ω
→ 0, as n → ∞. (22.63)
Since v ∈ L2 (Ω; C3 ) is arbitrary, we may infer that
A0n φn0 * A0 φ0 , weakly in L2 (Ω; C3 ).
From this we obtain
∇φn0 − im ρA0n φn0 * ∇φ0 − im ρA0 φ0 , weakly in L2 (Ω; C3 ),
so that
Z Z
lim inf |∇φn0 − im ρA0n φn0 |22 dx ≥ |∇φ0 − iρA0 φ0 |22 dx, (22.64)
n→∞ Ω Ω
Also, from
φn0 * φ0 , weakly in W 1,2 (Ω; C)
and
curl(A0n ) * curl(A0 ), weakly in L2 (Ω1 , R3 ),
from the convexity of the functional involved, we obtain,
Z
1
Z
0 2 α 0 4
lim inf |curl(An ) − B0 |2 dx + |φ | dx
n→∞ 8π Ω1 4 Ω n
1
Z Z
α
≥ |curl(A0 ) − B0 |22 dx + |φ0 |4 dx, (22.65)
8π Ω1 4 Ω
so that, from these last results and from
φn0 → φ0 , strongly in L2 (Ω; C),
we get,
inf J(φ , A) = α1
(φ ,A)∈U
Theorem 22.6.1 Let Ω, Ω1 ⊂ R3 be open, bounded, connected sets with regular (Lipischtzian) boundaries
denoted by ∂ Ω and ∂ Ω1 respectively, where Ω ⊂ Ω1 and Ω corresponds to a super conducting sample.
Consider the Ginzburg-Landau energy given by J : V1 ×V2 → R where,
Z
γ
J(φ , A) = |∇φ − im ρAφ |22 dx
2 Ω
Z
α
+ (|φ |2 − β )2 dx − hφ , f iL2
2 Ω
1
Z
+ |curl(A) − B0 |22 dx, (22.67)
8π Ω1
∀h, g ∈ L2 (Ω; C), where Re[a], Im[a] denote the real and imaginary parts of a, ∀a ∈ C, respectively.
We also denote,
J(φ , A) = G0 (φ , ∇φ , A) + G1 (φ , 0) + G2 (A),
Z
γ
G0 (φ , ∇φ , A) = |∇φ − im ρAφ |22 dx,
2 Ω
Z
α
G1 (φ , v3 ) = (|φ |2 − β + v3 )2 dx − hφ , f iL2 ,
2 Ω
and,
1
Z
G2 (A) = | curl(A) − B0 |22 dx.
8π Ω1
Moreover, we define
1 |div(v∗1 ) + im ρA · v∗1 − f |2 1
Z Z
= dx + (v∗3 )2 dx
2 Ω 2v3∗ 2α Ω
Z
+ β (v∗3 ) dx, (22.69)
Ω
if v∗ ∈ B1 , where
B1 = {v∗ ∈ Y ∗ ×Y1∗ : v∗3 > 0 in Ω}.
We also denote
B2 = {v∗ ∈ Y ∗ :
1 1 |ρv∗1 · A|2
Z Z
|curlA|22 dx − dx > 0,
8π Ω1 2 Ω 2 v∗3
∀A ∈ D∗ , such that A 6= 0}, (22.70)
D∗ = {A ∈ V2 : div(A) = 0, in Ω1 , and A · n = 0 on ∂ Ω1 },
C∗ = B1 ∩ B2 ,
and
Y = Y ∗ = L2 (Ω; C3 ) and Y1 = Y1∗ = L2 (Ω).
Under such assumptions, we have,
where
J ∗ (v∗ , A) = −G∗0 (v∗1 ) − G∗1 (v∗1 , v∗3 , A).
and
J˜∗ (v∗ ) = inf ∗ {J ∗ (v∗ , A) + G2 (A)}.
A∈D
Moreover, assume there exists a critical point (v∗0 , A0 ) ∈ C∗ × D∗ such that
δ {J ∗ (v∗0 , A0 ) + G2 (A0 )} = 0.
∀φ ∈ V1 , A ∈ D∗ , that is,
G0 (φ , ∇φ , A) + G1 (φ , 0) + G2 (A)
≥ −G∗0 (v∗1 ) − G∗1 (v∗1 , v∗3 , A) + G2 (A), (22.74)
so that
Thus,
inf J(φ , A) ≥ sup J˜∗ (v∗ ). (22.76)
(φ ,A)∈V1 ×D∗ v∗ ∈C∗
so that,
G∗0 ((v∗0 )1 ) = h(v∗0 )1 , ∇φ0 − im ρA0 φ0 iL2 − G0 (φ0 , ∇φ0 , A0 ). (22.79)
From the variation in v∗3 we obtain
that is,
(v∗0 )3 = α(|φ0 |2 − β ),
so that
G∗1 ((v∗0 )1 , (v∗0 )3 , A0 ) = −h(v∗0 )1 , ∇φ0 − im ρA0 φ0 iL2 − G1 (φ0 , 0). (22.80)
From (22.79) and (22.80), we obtain
J ∗ (v∗0 , A0 ) + G2 (A0 )
= −G∗0 ((v∗0 )1 ) − G∗1 ((v∗0 )1 , (v∗0 )3 , A0 ) + G2 (A0 )
= G0 (φ0 , ∇φ0 , A0 ) + G1 (φ0 , 0) + G2 (A0 )
= J(φ0 , A0 ) (22.81)
464 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
22.7 Conclusion
In the present chapter, we have developed duality principles applicable to a large class of variational non-
convex models.
In a second step we have applied such results to a Ginzburg-Landau type equation. We emphasize the
main theorems here developed are a kind of generalization of the main results found in Toland [78], published
in 1979 and [11, 10].
Following the approach presented in [11], we also highlight the duality principles obtained may be ap-
plied to non-linear and non-convex models of plates, shells and elasticity.
Finally, as above mentioned, in the last sections we present a global existence result, a duality principle
and respective optimality conditions for the complex Ginzburg-Landau system in superconductivity in the
presence of a magnetic field and concerned magnetic potential.
Chapter 23
23.1 Introduction
This work develops an existence result for an optimal control problem closely related to the Ginzburg-Landau
system in superconductivity. First, we recall that about the year 1950 Ginzburg and Landau introduced a
theory to model the super-conducting behavior of some types of materials below a critical temperature Tc ,
which depends on the material in question. They postulated the free density energy may be written close to
Tc as
h̄ α(T ) β (T )
Z Z Z
Fs (T ) = Fn (T ) + |∇ψ |22 d x + |ψ |4 dx − |ψ |2 dx,
4m Ω 4 Ω 2 Ω
where ψ is a complex parameter, Fn (T ) and Fs (T ) are the normal and super-conducting free energy densities,
respectively (see [4] for details). Here Ω ⊂ R3 denotes the super-conducting sample with a boundary denoted
by ∂ Ω = Γ. The complex function ψ ∈ W 1,2 (Ω; C) is intended to minimize Fs (T ) for a fixed temperature T .
Denoting α(T ) and β (T ) simply by α and β , the corresponding Euler-Lagrange equations are given by:
h̄
− 2m ∇2 ψ + α|ψ|2 ψ − β ψ = 0, in Ω
(23.1)
∂ψ
∂ n = 0, on ∂ Ω.
This last system of equations is well known as the Ginzburg-Landau (G-L) one. In the physics literature is
also well known the G-L energy in which a magnetic potential here denoted by A is included. The functional
in question is given by:
466 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
2
1 h̄2 2ie
Z Z
J (ψ, A) = | curl A − B0 |22 dx + ∇ψ − Aψ dx
8π R3 4m Ω hc
¯ 2
Z Z
α β
+ |ψ |4 dx − |ψ|2 dx (23.2)
4 Ω 2 Ω
and
curl (curl A) = curl B0 + 4π
˜ in Ω
c J,
(23.4)
curl (curl A) = curl B0 , in R3 \ Ω,
where
ieh̄ ∗ 2e 2
J˜ = − (ψ ∇ψ − ψ∇ψ ∗ ) − |ψ|2 A.
2m mc
and
B0 ∈ L2 (R3 ; R3 )
is a known applied magnetic field.
Existence of a global solution for a similar problem has been proved in [22].
with (φ , A, u) subject to the satisfaction of the Ginzburg-Landau equations, indicated in (23.5) and (23.6) in
the next lines.
For such a problem, the control variable is u ∈ L2 (∂ Ω; C) and the state variables are the Ginzburg-Landau
order parameter φ ∈ W 1,2 (Ω, C) and the magnetic potential A ∈ W 1,2 (Ω1 , R3 ).
Our main existence result is summarized by the following theorem.
C = {(φ , A, u) ∈ W 1,2 (Ω, C) ×W 1,2 (Ω1 , R3 ) × L2 (∂ Ω; C) : such that (23.5) and (23.6) hold },
Existence for an Optimal Control Problem in Superconductivity 467
where 2
1
2m −ih̄∇ − 2e 2
c A φ + α|φ | φ − β φ = 0, in Ω,
(23.5)
2e
· n = u,
ih̄∇φ + c Aφ on ∂ Ω,
and
curl curl A = curl B0 + 4π ˜
c J, in Ω,
curl curl A = curl B0 , in Ω1 \ Ω,
(23.6)
div A = 0, in Ω1 ,
A · n = 0, on ∂ Ω1
where,
ieh̄ 2e 2
J˜ = − (φ ∗ ∇φ − φ ∇φ ∗ ) − |φ |2 A,
2m mc
and where ε > 0 is a small parameter, K1 > 0 and K2 > 0.
Under such hypotheses, there exists (φ0 , A0 , u0 ) ∈ C such that
J(φ0 , A0 , u0 ) = min J(φ , A, u).
(φ ,A,u)∈C
Proof 23.1 Let {(φn , An , un )} be a minimizing sequence (such a sequence exists from the existence result
for u = 0 in [22], and from the fact that J is lower bounded by 0).
Thus, such a sequence is such that
J(φn , An , un ) → η = inf J(φ , A, u).
(φ ,A,u)∈C
From the well known Friedrichs Inequality and the Sobolev Spaces Imbedding theorem for appropriate
constants indicated, we obtain
2
kAn k20,4,Ω1 ≤ K3 kAn k21,2,Ω1 ≤ K4 k div An k0,2,Ω1 + k curl An k0,2,Ω
= K4 k curl An k20,2,Ω1 , (23.8)
div An = 0, in Ω1 , ∀n ∈ N.
2 1
0 = ρ1,n ≥ K5 kAn k0,4,Ω 1
+ k curl An k20,2,Ω1
2
−k curl An k0,2,Ω1 kB0 k0,2,Ω1 − γkAn k0,4,Ω1 K 2
+γ1 |φ |2 , An · An L2 (Ω;R3 )
≡ ρ2,n . (23.9)
Now, suppose to obtain contradiction there exists a subsequence {nk } ⊂ N such that
kAnk k0,4,Ω1 → ∞, as k → ∞.
so that from this, the Friedrichs inequality and the London Gauge hypothesis, we obtain K8 > 0 such that
So from such a result and the Rellich-Kondrashov Theorem there exists a not relabeled subsequence and
A0 ∈ W 1,2 (Ω1 , R3 ) such that
An * A0 weakly ∈ W 1,2 (Ω1 , R3 )
An → A0 in norm in L2 (Ω1 , R3 ) and L4 (Ω1 , R3 ).
Moreover, from the Sobolev Imbedding Theorem, there exist real constants K̂ > 0, K̂1 > 0 such that
Thus, from this and the first equation in (23.5), there exist real constants K̂2 > 0, . . . , K̂6 > 0, such that
Let
ϕ ∈ Cc∞ (Ω, C), ϕ1 ∈ Cc∞ (Ω, R3 ) and ϕ2 ∈ Cc∞ (Ω1 \ Ω, R3 ).
From the last results, we may easily obtain the following limits
1.
h∇2 φn , ϕiL2 → h∇2 φ0 , ϕiL2 ,
2.
h∇φn , ∇ϕiL2 → h∇φ0 , ∇ϕiL2 ,
3.
hAn · ∇φn , ϕiL2 → hA0 · ∇φ0 , ϕiL2 ,
4.
h|An |2 φn , ϕiL2 → h|A0 |2 φ0 , ϕiL2 ,
5.
h|φn |2 φn , ϕiL2 → h|φ0 |2 φ0 , ϕiL2 ,
6.
h curl An , curl ϕ1 iL2 → h curl A0 , curl ϕ1 iL2 ,
7.
hφn∗ ∇φn , ϕ1 iL2 → hφ0∗ ∇φ0 , ϕ1 iL2 ,
8.
hφn ∇φn∗ , ϕ1 iL2 → hφ0 ∇φ0∗ , ϕ1 iL2 ,
9.
h|φn |2 An , ϕ1 iL2 → h|φ0 |2 A0 , ϕ1 iL2 .
470 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
The other boundary condition may be dealt similarly. Thus, from these last results we may infer that in
the distributional sense,
1
2
2m −ih̄∇ − 2e 2
c A0 φ0 + α|φ0 | φ0 − β φ0 = 0, in Ω,
(23.14)
ih̄∇φ0 + 2e
·
A
c 0 0φ n = u0 , on ∂ Ω,
Existence for an Optimal Control Problem in Superconductivity 471
and 4π ˜
curl curl A0 = curl B0 + c J0 ,
in Ω,
curl curl A0 = curl B0 , in Ω1 \ Ω,
(23.15)
div A0 = 0, in Ω1 ,
A0 · n = 0, on ∂ Ω1
where
ieh̄ 2e 2
J˜0 = − (φ0∗ ∇φ0 − φ0 ∇φ0∗ ) − |φ0 |2 A0 .
2m mc
Hence, (φ0 , A0 , u0 ) ∈ C .
Finally, from φn → φ0 in L2 and L4 , φn * φ0 weakly in W 1,2 , un * u0 weakly in L2 (∂ Ω), by continuity
in φ and the convexity of J in ∇φ and u, we have,
−ε∇2 u + αu3 − β u = f , in Ω,
(23.16)
u = 0, on ∂ Ω,
d2 d2
un+1 − 2un + un−1 + T (un ) + fn = 0, (23.17)
ε ε
where
∂ 2 un
T (un ) = − αu3n + β un , ∀n ∈ {1, . . . , N − 1}.
∂ y2
472 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
d2 d2
u2 − 2u1 + T (u1 ) + f1 = 0,
ε ε
so that
u2 1 d2 1 d2
u1 = + T (u1 ) + f1 ,
2 2 ε 2 ε
that is,
d2 d2
u1 = a1 u2 + b1 T (u1 ) + c1 + E1 ,
ε ε
where
1 1 1
a1 = , b1 = , c1 = f1 and E1 = 0.
2 2 2
Reasoning inductively, having
d2 d2
un−1 = an−1 un + bn−1 T (un−1 ) + cn−1 + En−1 ,
ε ε
from this and (23.17), we have
d2 d2
un+1 − 2un + an−1 un + bn−1 T (un−1 ) + cn−1 + En−1
ε ε
d2 d2
+T (un ) + fn = 0, (23.18)
ε ε
so that
d2 d2
un = an un+1 + bn T (un ) + cn + En
ε ε
where
1
an = ,
2 − an−1
bn = an (bn−1 + 1),
cn = an (cn−1 + fn ),
and
d2
En = an En−1 + an bn−1 (T (un−1 ) − T (un )) ,
ε
∀n ∈ {2, . . . , N − 1}.
In particular, for n = N − 1, recalling that uN = 0, we may obtain uN−1 as the solution of the approximate
ordinary differential equation,
d2 d2
uN −1 ≈ aN −1 uN + bN −1 T (uN−1 ) + cN−1 (23.19)
ε ε
that is,
∂ 2 uN −1 d2 d2
uN−1 ≈ bN −1 − α uN3 −1 + β uN−1 + cN−1 ,
∂ y2 ε ε
with the boundary conditions uN −1 (0) = uN −1 (1) = 0.
Existence for an Optimal Control Problem in Superconductivity 473
d2 d2
uN−2 ≈ aN−2 uN−1 + bN −2 T (uN−2 ) + cN−2 ,
ε ε
that is,
∂ 2 uN −2 d2 d2
uN −2 ≈ aN −2 uN −1 + bN −2 − αu3N −2 + β uN−2 + cN−2 ,
∂ y2 ε ε
and so on, up to finding u1 .
The problem is thus approximately solved. We present numerical results for α = β = 1 and N = 500
lines (mesh 500 × 500).
For ε = 1, 0.1, 0.01 and 0.001, please see the Figures 23.1, 23.2, 23.3, 23.4 and 23.5, for the respective
graphs.
We observe that for small values of ε and in particular for ε = 0.0001, the solution u(x, y) is close to the
constant value 1.32 on the almost whole domain, which is the approximate solution of equation u3 − u − 1 =
0.
23.4 Conclusion
In this chapter we have developed a global existence result for a control problem related to the Ginzburg-
Landau system in superconductivity. We emphasize the control variable u acts on the super-conducting sam-
ple boundary, whereas the state variables, namely, the order parameter φ and the magnetic potential A are
defined on Ω and Ω1 , respectively. The problem has non-linear constraints and the cost functional is non-
convex. Finally, we highlight the London Gauge assumption and the Friedrichs Inequality have a fundamental
role in the establishment of the main results.
Chapter 24
24.1 Introduction
This chapter develops a dual variational formulation for a semi-linear model in micro-magnetism. For the
primal formulation we refer to references [14, 15] for details. In particular we refer to the original results
presented in [15], emphasizing the present work is their natural continuation and extension. We also highlight
the present work develops real relevant improvements relating the previous similar results in [14].
At this point we start to describe the primal formulation.
Let Ω ⊂ R3 be an open bounded set with a a regular (lipschitzian) boundary denoted by ∂ Ω. By a
regular lipschitzian boundary ∂ Ω we mean regularity enough so that the Sobolev imbedding theorem and
relating results, the trace theorem and the standard Gauss-Green formulas of integration by parts to hold. The
corresponding outward normal to ∂ Ω is denoted by n = (n1 , n2 , n3 ). Also, we denote by 0 either the zero
vector in R3 or the zero in an appropriate function space.
Under such assumptions and notations, consider the problem of finding the magnetization m : Ω → R3 ,
which minimizes the functional
Z Z Z
α
J(m, f ) = |∇m|22 dx + ϕ(m(x)) dx − H(x) · m dx
2 Ω Ω Ω
1
Z
+ | f (x)|22 dx, (24.1)
2 R3
where
m = (m1 , m2 , m3 ) ∈ W 1,2 (Ω; R3 ) ≡ Y1 , |m(x)|2 = 1, in Ω (24.2)
and f ∈ L2 (R3 ; R3 ) ≡ Y2 is the unique field determined by the simplified Maxwell’s equations
The term Z
α
|∇m|22 dx
2 Ω
is called the exchange energy, where v
u 3
u
|m|2 = t ∑ m2k
k=1
and
3
|∇m|22 = ∑ |∇mk |22 .
k=1
Finally, ϕ(m) represents the anisotropic contribution and is given by a multi-well functional whose minima
establish the preferred directions of magnetization.
Remark 24.1.1 Here some brief comments on the references. Relating and similar problems are addressed
in [14]. The basic results on convex and variational analysis used in this text may be found in [33, 14, 64, 78].
About the duality principles, we have been greatly inspired and influenced by the work of J.J. Telega and W.R.
Bielski. In particular, we would refer to [11], published in 1985, as the first article to successfully apply the
convex analysis approach to non-convex and non-linear mechanics.
Finally, an extensive study on Sobolev spaces may be found in [1].
Remark 24.1.2 At some points of our analysis we refer to the problems in question after discretization. In
such a case we are referring to their approximations in a finite element or finite differences context.
Theorem 24.2.1 Considering the previous statements and notations, define J : Y1 ×Y2 × B → R = R ∪ {+∞}
by
K
J(m, f ,t) = G0 (m) − hmi , mi iL2 + G1 (m,t) + G2 ( f )
2
+Ind0 (m) + Ind1 (m, f ) + Ind2 ( f ), (24.5)
where
α
G0 (m) = h∇mi , ∇mi iL2 ,
2
K
Z
G1 (m,t) = (tg1 (m) + (1 − t)g2 (m)) dx − hHi , mi iL2 + hmi , mi iL2 ,
Ω 2
1
Z
G2 ( f ) = | f (x)|2 dx,
2 R3
g1 (m) = β (1 + m · e),
g2 (m) = β (1 − m · e),
478 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
0, if |m(x)|2 = 1, in Ω
Ind0 (m) = (24.6)
+∞, otherwise,
if div(− f + mχΩ ) = 0, in R3
0,
Ind1 (m, f ) = (24.7)
+∞, otherwise,
if curl f = 0, in R3
0,
Ind2 ( f ) = (24.8)
+∞, otherwise.
We recall the present case refers to a uniaxial material with exchange of energy, that is α > 0.
Here, e = (e1 , e2 , e3 ) ∈ R3 is a unit vector.
Under such hypotheses, we have,
inf {J(m, f ,t)} ≥ sup {J˜∗ (λ )},
(m, f ,t)∈Y1 ×Y2 ×B λ ∈A∗
where
J˜∗ (λ ) = inf J ∗ (λ , z∗ ,t),
(z∗ ,t)∈Y4∗ (λ )×B
1
G∗2 (λ ) = k∇λ2 − curl ∗ λ1 k22 .
2
Also,
A1 = {λ ∈ Y3 : λ3 + K > 0, in Ω},
and from the standard second order sufficient optimality condition for a local minimum in m, we define
A2 = {λ ∈ Y3 :
3
G0 (m) + hλ3 , ∑ mi2 iL2 > 0,
i=1
∀m ∈ Y1 , such that m =
6 0} (24.12)
Duality for a Semi-Linear Model in Micro-Magnetism 479
where
A∗ = A1 ∩ A2 ,
Y = Y ∗ = L2 (Ω; R3 ) = L2 ,
λ = (λ1 , λ2 , λ3 ) ∈ Y3 = W 1,2 (R3 ; R3 ) ×W 1,2 (R3 ) × L2 (Ω),
Y1 = W 1,2 (Ω; R3 ),
Y2 = W 1,2 (R3 ; R3 ),
Y4∗ (λ ) = {z∗ ∈ [Y ∗ ]3 : z∗i · n + λ2 ni = 0, on ∂ Ω, ∀i ∈ {1, 2, 3}},
B = {t measurable : 0 ≤ t ≤ 1, in Ω}.
Finally, suppose there exists (λ0 , z∗0 , t0 ) ∈ A∗ × Y4∗ (λ0 ) × B such that for an appropriate λ4 ∈ L2 (Ω) we
have
δ J ∗ (λ0 , z∗0 ,t0 ) + hλ4 ,t02 − t0 iL2 = 0,
= ˜∗
sup J (λ )
λ ∈A∗
˜∗
= J (λ0 )
= J ∗ (λ0 , z∗0 ,t0 ). (24.13)
Therefore,
( )
∗ ∗
inf J(m, f ,t) ≥ sup inf J (λ , z ,t)
(m, f ,t)∈Y1 ×Y2 ×B λ ∈A∗ (z∗ ,t)∈Y4∗ (λ )×B
div(m0 χΩ − f0 ) = 0, in R3 , (24.22)
where
f0 = curl(λ1 )0 − ∇(λ0 )2 . (24.23)
From the variation in λ1 , we obtain,
curl f0 = 0, in R3 . (24.24)
From (24.20), we also have
K
F̃ ∗ (z∗0 ) = h(z∗0 )i , ∇(m0 )i iL2 + G0 (m0 ) − h(m0 )i , (m0 )i iL2 . (24.25)
2
Duality for a Semi-Linear Model in Micro-Magnetism 481
= ˜∗
sup J (λ )
λ ∈A∗
˜∗
= J (λ0 )
= J ∗ (λ0 , z∗0 ,t0 ). (24.28)
24.3 Conclusion
In this chapter we have developed a duality principle for a non-convex semi-linear model in micro-
magnetism.
In a second step we present an optimality criterion for the dual formulation. The results are obtained
through standard tools of convex analysis and duality theory.
Chapter 25
25.1 Introduction
This chapter develops numerical methods for a large class of non-linear ordinary and partial differential
equations.
More specifically, such a chapter is concerned with a kind of matrix version of the Generalized Method
of Lines. Applications are developed for models in physics and engineering.
where
An = f 0 (ũn ),
and
Bn = f (ũn ) − f 0 (ũn )ũn + gn .
In particular, for n = 1 we get
u2 − 2u1 + u0 + A1 u1 d 2 + B1 d 2 = 0.
Solving such an equation for u1 , we get
u1 = a1 u2 + b1 u0 + c1 ,
where
a1 = (2 − A1 d 2 )−1 , b1 = a1 , c1 = a1 B1 d 2 .
Reasoning inductively, having
and
un+1 − 2un + un−1 + An un d 2 + Bn d 2 = 0,
we get
and in particular,
uN−1 = HN −1 (u f ),
so that we may calculate,
uN−2 = HN−2 (uN−1 ),
uN−3 = HN −3 (uN−2 ),
and so on, up to finding,
u1 = H1 (u2 ).
484 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
0.14
0.12
0.1
0.08
0.06
0.04
0.02
0
0 0.2 0.4 0.6 0.8 1
1.4
1.2
0.8
0.6
0.4
0.2
0
0 0.2 0.4 0.6 0.8 1
The next step is to replace {ũn } by the {un } calculated, and repeat the process up to the satisfaction of an
appropriate convergence criterion. We present numerical results for the equation,
3
u00 − uε + εu + g = 0, in [0, 1]
(25.2)
u(0) = 0, u(1) = 0,
where,
1
g(x) = ,
ε
The results are obtained for ε = 1.0, ε = 0.1, ε = 0.01 and ε = 0.001. Please see Figures 25.1, 25.2, 25.3
and 25.4 respectively.
About Numerical Methods for Ordinary and Partial Differential Equations 485
1.4
1.2
0.8
0.6
0.4
0.2
0
0 0.2 0.4 0.6 0.8 1
1.4
1.2
0.8
0.6
0.4
0.2
0
0 0.2 0.4 0.6 0.8 1
where α , β , ε > 0, u ∈ U = W01,2 (Ω), and f ∈ L2 (Ω). The corresponding primal variational formulation is
represented by J : U → R, where
Z Z Z Z
ε α β
J(u) = ∇u · ∇u dx + u4 dx − u2 dx + f u dx.
2 Ω 4 Ω 2 Ω Ω
486 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
3αd 2
un+1 − 2un + un−1 + M̃2 un − (u0 )2n un
ε
2α β d2 d2
+ (u0 )3n d 2 + un − fn = 0, (25.5)
ε ε ε
2
where M̃2 = M2 dd 2 and
1
−2 1 0 0 ··· 0
1 −2 1 0 ···
0
0 1 −2 1 ··· 0
M2 = , (25.6)
... ..
.
..
.
..
.
..
.
.
..
0 0 ··· 1 −2 1
0 0 ··· ··· 1 −2
with N1 lines corresponding to the discretization in the y axis. Furthermore d1 = 1/N1 .
In particular, for n = 1 we get
3αd 2
u2 − 2u1 + M̃2 u1 − (u0 )21 u1
ε
2α β d2 d2
+ (u0 )31 d 2 + u1 − f1 = 0. (25.7)
ε ε ε
Denoting
αd 2 β d2
M12 [1] = 2Id − M̃2 + 3 (u0 )12 Id − Id ,
ε ε
where Id denotes the (N1 − 1) × (N1 − 1) identity matrix,
2αd 2 d2
Y0 [1] = (u0 )31 − f1 ,
ε ε
and M50 [1] = M12 [1]−1 , we obtain
u1 = M50 [1]u2 + z[1].
where
z[1] = M50 [1] ·Y0 [1].
Now for n = 2 we get
3αd 2
u3 − 2u2 + u1 + M̃2 u2 − (u0 )22 u2
ε
2α β d2 d2
+ (u0 )23 d 2 + u2 − f2 = 0, (25.8)
ε ε ε
About Numerical Methods for Ordinary and Partial Differential Equations 487
that is,
3αd 2
u3 − 2u2 + M50 [1]u2 + z[1] + M̃2 u2 − (u0 )22 u2
ε
2α β d2 d2
+ (u0 )32 d 2 + u2 − f2 = 0, (25.9)
ε ε ε
so that denoting
αd 2 β d2
M12 [2] = 2Id − M̃2 − M50 [1] + 3 (u0 )22 Id − Id ,
ε ε
2αd 2 d2
Y0 [2] = (u0 )32 − f2 ,
ε ε
and M50 [2] = M12 [2]−1 , we obtain
u2 = M50 [2]u3 + z[2],
where
z[2] = M50 [2] · (Y0 [2] + z[1]).
Proceeding in this fashion, for the line n we obtain
3αd 2
un+1 − 2un + M50 [n − 1]un + z[n − 1] + M̃2 un − (u0 )n2 un
ε
2α β d2 d2
+ (u0 )3n d 2 + un − fn = 0, (25.10)
ε ε ε
so that denoting
αd 2 β d2
M12 [n] = 2Id − M̃2 − M50 [n − 1] + 3 (u0 )n2 Id − Id ,
ε ε
and also denoting
2αd 2 d2
Y0 [n] = (u0 )3n − fn ,
ε ε
and M50 [n] = M12 [n]−1 , we obtain
un = M50 [n]un+1 + z[n],
where
z[n] = M50 [n] · (Y0 [n] + z[n − 1]).
Observe that we have
uN = θ ,
where θ denotes the zero matrix (N1 − 1) × 1, so that we may calculate
and
uN −2 = M50 [N − 2] · uN −1 + z[N − 2],
and so on, up to obtaining
u1 = M50 [1] · u2 + z[1].
The next step is to replace {(u0 )n } by {un } and thus to repeat the process until convergence is achieved.
This is the Newton’s Method, what seems to be relevant is the way we inverted the big matrix ((N1 −
1) · (N − 1)) × ((N1 − 1) · (N − 1)), in fact instead of inverting it directly we have inverted N − 1 matrices
(N1 − 1) × (N1 − 1) through an application of the generalized method of lines.
488 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
through the algorithm specified in the last section. We consider α = β = 1. For ε = 1.0 see Figure 25.5, for
ε = 0.0001 see Figure 25.6.
0.08
0.06
0.04
0.02
0
1
1
0.8
0.5 0.6
0.4
0.2
0 0
1.6
1.4
1.2
0.8
1
1
0.8
0.5 0.6
0.4
0.2
0 0
1
L1 (λ , xk ) = − [( f 00 (xk ) + KId )−1 ( f 0 (xk ) + λ 2 g0 (xk ))] · ( f 0 (xk ) + λ 2 g0 (xk ))
2
+ f (xk ) + λ 2 g(xk ) (25.12)
From
∂ L1 (λ , xk )
= 0,
∂λ
we get
[( f 00 (xk ) + KId )−1 ( f 0 (xk ) + λ 2 g0 (xk ))] · g0 (xk )λ − λ g(xk ) = 0, (25.13)
so that we have two solutions,
λ1 = 0
and
[( f 00 (xk ) + KId )−1 f 0 (xk )] · g0 (xk ) − g(xk )
(λ21 )2 (xk ) = − . (25.14)
[( f 00 (xk ) + KId )−1 g0 (xk )] · g0 (xk )
Observe that if (λ21 )2 (xk ) < 0 then λ21 (xk ) is complex so that, from the condition λ 2 ≥ 0, we obtain
Also, from the generalized inverse function theorem λ 2 (x) is locally Lipschtzian (see [54, 57, 78, 60] for
details). Hence, we may infer that for a given x0 ∈ Rn there exists r > 0 and K̂3 > 0 such that
∀x, y ∈ Br (x0 ). With such results in mind, for such an x0 ∈ Rn , define {xk } by
so that
1
( f 00 (x) + KId )−1 ≤ Id , (25.19)
K − K̂1
and
K3 α1
|( f 00 (x) + KId )−1 |K3 ≤ = , ∀x ∈ Br (x0 ). (25.20)
|K − K̂1 | 2
Assume K > 0 is such that
x1 ∈ Br(1−α0 ) (x0 )
and suppose the induction hypotheses
x2 , . . . , xk+1 ∈ Br (x0 ).
where 0 < α0 < 1 is specified in the next lines.
Note that,
xk+2 − xk+1 = −( f 00 (xk+1 ) + KId )−1 ( f 0 (xk+1 ) + λ 2 (xk+1 )g0 (xk+1 )),
and
xk+1 − xk = −( f 00 (xk ) + KId )−1 ( f 0 (xk ) + λ 2 (xk )g0 (xk )),
so that,
( f 00 (xk+1 ) + KId )(xk+2 − xk+1 ) = −( f 0 (xk+1 ) + λ 2 (xk+1 )g0 (xk+1 )),
and
( f 00 (xk ) + KId )(xk+1 − xk ) = −( f 0 (xk ) + λ 2 (xk )g0 (xk )).
About Numerical Methods for Ordinary and Partial Differential Equations 491
Therefore,
so that
so that
(( f 00 (xk+1 ) + K Id )−1 )( f 00 (x̃k ) + λ 2 (xk+1 )g00 (x̃k )) ≥ α1 Id .
Hence, from this, (25.17), (25.19) and (25.18), we obtain
α1
Id 1 + − α1 Id
4
≥ H (xk+1 , xk ) − (( f 00 (xk+1 ) + K Id )−1 )( f 00 (x̃k ) + λ 2 (xk+1 )g00 (x̃k ))
α1
≥ Id 1 − − (K Id − K̂1 Id )−1 )( f 00 (x̃k ) + λ 2 (xk+1 )g00 (x̃k ))
4
α1 α1
≥ Id 1 − − Id 1 −
4 2
α1
= Id
4
≥ 0, (25.23)
and therefore,
3α1
|H(xk+1 , xk ) − ( f 00 (xk ) + K Id )−1 ( f 00 (x̃k ) + λ 2 (xk+1 )g00 (x̃k ))| ≤ 1 − .
4
On the other hand, from (25.20) we have,
α1
|( f 00 (xk ) + K Id )−1 |K3 ≤ .
2
492 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
λ̃ 2 g(x̃) = 0.
Remark 25.1 For the more general case with m1 equality scalar constraints
h j (x) = 0, ∀ j ∈ {1, . . . , m1 }
gl (x) ≤ 0, ∀l ∈ {1, . . . , m2 },
Linearizing L p , we propose the following procedure for looking for a critical point of such a function:
Consider
1
L̃ p (x, λ , xk ) = f (xk ) + f 0 (xk ) · (x − xk ) + [ f 00 (xk )(x − xk )] · (x − xk )
2
m1
+ ∑ (λh ) j (h j (xk ) + h0j (xk ) · (x − xk ))
j=1
m2
K
+ ∑ (λg )2l (gl (xk ) + g0l (xk ) · (x − xk )) + |x − xk |2 .
l =1 2
Hence, from
∂ L̃ p (x, λ , xk )
= 0,
∂x
we obtain,
m1 m2
f 00 (xk )(x − xk ) + K(x − xk ) + f 0 (xk ) + ∑ (λh ) j h0j (xk ) + ∑ (λg )2l g0l (xk ) = 0,
j=1 l=1
494 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
that is, !
m1 m2
00 −1 0
x − xk = −( f (xk ) + KId ) f (xk ) + ∑ (λh ) j h0j (xk ) + ∑ (λg )2l g0l (xk ) ,
j=1 l=1
and therefore,
!
m1 m2
x(λ , xk ) = xk − ( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j h0j (xk ) + ∑ (λg )2l g0l (xk ) , (25.28)
j=1 l=1
" !
m1 m2
1
L1 (λ , xk ) = − ( f 00 (xk ) + KId )−1 0
f (xk ) + ∑ (λh ) j h0j (xk ) + (λg )2l g0l (xk )
∑
2 j=1 l=1
!#
m1 m2
0
· f (xk ) + ∑ (λh ) j h0j (xk ) + ∑ (λg )2l g0l (xk )
j=1 l=1
m1 m2
+ f (xk ) + ∑ (λh ) j h j (xk ) + ∑ (λg )2l gl (xk ). (25.29)
j=1 l=1
From
∂ L1 (λ , xk )
= 0,
∂ (λg )l
we get
"
m1
( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j h0j (xk )
j=1
!#
m2
+ ∑ (λg2 )l g0l (xk ) · g0l (xk )(λg )l − (λg )l gl (xk ) = 0, (25.30)
l=1
From
∂ L1 (λ , xk )
= 0,
∂ (λh ) j
we have
"
m1
( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j h0j (xk )
j=1
!#
m2
+ ∑ (λg )2l g0l (xk ) · h0j (xk ) − h j (xk ) = 0, (25.31)
l =1
∀ j ∈ {1, . . . , m1 }. Solving the linear system which comprises these last m1 equations and the m2 equations
"
m1
( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j h0j (xk )
j=1
!#
m2
+ ∑ (λg )l2 g0l (xk ) · g0l (xk ) − gl (xk ) = 0, (25.32)
l=1
About Numerical Methods for Ordinary and Partial Differential Equations 495
1. Choose x0 ∈ Rn , Kmax ∈ N (Kmax is the maximum number of iterations), set k = 0 and e1 ≈ 10−5 .
2. Obtain a solution
(λh ) j (xk ), (λg1 )2l (xk )
by solving the linear system (in (λh ) j and (λg )2l ) indicated in (25.31) and (25.32).
Observe that if (λg1 )l2 < 0 then (λg1 )l is complex.
To up-date λh and λg proceed as follows:
3. For each l ∈ {1, . . . , m2 } if (λg )2l (xk ) ≤ 0, then set (λg )l (xk ) = 0.
4. Define J = {l ∈ {1, . . . , m2 } such that (λg )2l (xk ) > 0}.
5. Recalculate (λh ) j (xk ) and the non-zero (λg )l2 (xk ) for l ∈ J through the solution of the linear system
(in (λh ) j and (λg )2l )
"
m1
( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j h0j (xk )
j=1
!#
+ ∑ (λg )2l g0l (xk ) · h0j (xk ) − h j (xk ) = 0, (25.33)
l∈J
∀ j ∈ {1, . . . , m1 } and
"
m1
( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j h0j (xk )
j=1
!#
+ ∑ (λg )2l g0l (xk ) · g0l (xk ) − gl (xk ) = 0, (25.34)
l∈J
∀l ∈ J.
6. If (λg )2l (xk ) ≥ 0, ∀l ∈ {1, . . . , m2 }, then go to 7, otherwise go to item 3.
7. Up-date xk through the equation
m1
xk+1 = xk − ( f 00 (xk ) + KId )−1 f 0 (xk ) + ∑ (λh ) j (xk )h0j (xk )
j=1
!
m2
+ ∑ (λg )2l (xk )g0l (xk ) . (25.35)
l=1
Remark 25.4.1 In this section we have developed an algorithm for constrained optimization in Rn . We prove
the main result only for the special case of a single scalar inequality constraint. However, we highlight the
proof of a more general result involving equality and inequality constraints may be developed in a similar
fashion, as indicated in remark 25.1. We postpone the presentation of the formal details for such a more
general case for a future work.
Also in the context of this last result, let us consider the following example of application.
Let Ω ⊂ R2 be an open, bounded, connected set with a regular boundary denoted by ∂ Ω. Consider the
real Ginzburg-Landau type equation, given by
ε∇2 u − αu3 + β u = f , in Ω
(25.36)
u = 0, on ∂ Ω,
4. Compute z∗n+1 by
Z Z
ε α
z∗n+1 = argminz∗ ∈L2 (Ω) ∇un · ∇un dx + u4 dx
2 Ω 4 Ω n
1
Z Z
−hun , z∗ iL2 + (z∗ )2 dx + f un dx , (25.39)
2β Ω Ω
that is,
z∗n+1 = β un .
Remark 25.4.2 Observe that for each n, the procedure of evaluating un stands for the solution of a convex
optimization problem with unique solution, given by the one of equation
which may be easily obtained, due to convexity, through the generalized method of lines (matrix version)
associated with Newton’s method as above described.
25.5 Conclusion
In this chapter we have develop numerical methods for large class of ODEs.
We have also introduced the matrix version of the generalized method of lines for PDEs. Further, we
have developed a convergent algorithm suitable for equations that present strong variational formulation, and
in particular, suitable for Ginzburg-Landau type equations. The results are rigorously proven and numerical
examples are provided. We emphasize that even as the parameter ε is very small, namely ε = 0.0001, the
results are consistent and the convergence is very fast.
Chapter 26
26.1 Introduction
In this chapter we develop an algorithm to solve a class of first order non-linear ordinary differential equa-
tions.
We start by presenting a general procedure for solving the linearized equations, and in a second step, we
apply it to solve a problem in flight mechanics in a Newton’s method context. In fact, a sequence of linear
problems is solved intending to obtain a solution for the original non-linear problem. We emphasize the
method here proposed has a performance considerably better than those so far known, particularly concerning
the computation time.
At this point we present a remark on the references.
Remark 26.1 We highlight that a similar problem is addressed in [14] for a nuclear physics model. The
main difference is that now our results are more general and applicable to a much larger class of problems.
Specifically in the present work, we apply them to a flight mechanics model found in [80].
For the numerical results we have used finite differences. Details about finite differences schemes may
be found in [73].
Finally, details on the Sobolev spaces in which the original problem is established may be found in [1].
where
(uk )n , (ak j )n , (gk )n ∈ R, ∀ j, k ∈ {1, 2, 3, 4}, n ∈ {0, ..., N − 1}.
Assume the following boundary conditions are intended to be satisfied:
(u1 )0 = h0 ,
(u3 )0 = V0
(26.2)
(u4 )0 = x0
(u1 )N = h f .
that is,
(u1 )1 − (g1 )0 − (a11 )0 (u1 )0 − (a13 )0 (u3 )0 − (a14 )0 (u4 )0
(u2 )0 = ,
(a12 )0
so that we write
(u2 )0 = m2 [0](u1 )1 + z̃2 [0], (26.3)
where
1
m2 [0] = ,
(a12 )0
and
−(g1 )0 − (a11 )0 (u1 )0 − (a13 )0 (u3 )0 − (a14 )0 (u4 )0
z̃2 [0] = .
(a12 )0
Replacing (26.3) into
4
(u2 )1 = ∑ (a2 j )0 (u j )0 + (g2 )0 ,
j=1
we get
(u2 )1 = m2 [1](u1 )1 + z2 [1],
where
m2 [1] = (a22 )0 m2 [0],
and,
z2 [1] = (a21 )0 (u1 )0 + (a22 )0 z̃2 [0] + (a23 )0 (u3 )0 + (a24 )0 (u4 )0 + (g2 )0 .
Also, replacing (26.3) into
4
(u3 )1 = ∑ (a3 j )0 (u j )0 + (g3 )0 ,
j=1
we obtain
(u3 )1 = m3 [1](u1 )1 + z3 [1],
where
m3 [1] = (a32 )0 m2 [0],
and
z3 [1] = (a31 )0 (u1 )0 + (a32 )0 z̃2 [0] + (a33 )0 (u3 )0 + (a34 )0 (u4 )0 + (g3 )0 .
500 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
we may obtain
(u4 )1 = m4 [1](u1 )1 + z4 [1],
where
m4 [1] = (a42 )0 m2 [0],
and
z4 [1] = (a41 )0 (u1 )0 + (a42 )0 z̃2 [0] + (a43 )0 (u3 )0 + (a44 )0 (u4 )0 + (g4 )0 .
Reasoning inductively, having for k ∈ {2, 3, 4},
for n ≥ 2, replacing these last equations into (26.1) for k = 1, we may obtain
where
m̃1 [n − 1]
= {(a11 )n−1 + (a12 )n−1 m2 [n − 1] + (a13 )n−1 m3 [n − 1]
+(a14 )n−1 m4 [n − 1]}−1 , (26.6)
and
z̃1 [n − 1]
= −m̃1 [n − 1]{(a12 )n−1 z2 [n − 1] + (a13 )n−1 z3 [n − 1]
+(a14 )n−1 z4 [n − 1] + (g1 )n−1 }. (26.7)
∀k ∈ {2, 3, 4},
where
m̃k [n − 1] = mk [n − 1]m̃1 [n − 1],
z̃k [n − 1] = mk [n − 1]z̃1 [n − 1] + zk [n − 1].
Replacing (26.5) and (26.8) into the system (26.1), we get
where
mk [n] = (ak1 )n m̃1 [n − 1] + (ak2 )n m̃2 [n − 1] + (ak3 )n m̃3 [n − 1] + (ak4 )n m̃4 [n − 1],
and
4
zk [n] = ∑ (ak j )n z̃ j [n − 1] + (gk )n .
j=1
and
(u1 )N −1 = m̃N −1 (u1 )N + z̃1 [N − 1].
Having (u1 )N−1 , we may obtain
and
(u1 )N−2 = m̃1 [N − 2](u1 )N −1 + z̃1 [N − 2],
and so on, up to finding (uk )1 , ∀k ∈ {1, 2, 3, 4}, and finally,
h˙ = V sin γ,
γ̇ = 1 (T sin(e3 ) + L) − g cos γ,
mfV V
1 (26.9)
V̇ = mf (T cos(e 3 ) − D) − g sin γ
ẋ = V cos γ,
where t f = 90s, h is the airplane altitude, V is its speed, γ is the angle between its velocity and the
horizontal axis, and finally x denotes the horizontal coordinate position.
For numerical purposes, we assume (Air bus 320)
m f = 120, 000Kg, S f = 260m2 , a = 0.17/10 rad, g = 9.8m/s2 , ρ(h) = 1.225(1−0.0065h/288.15)4.225 Kg/m3 ,
CL = 4.95a,
CD = 0.0175 + 0.06CL2 ,
1
L = ρ(h)V 2CL S f ,
2
1
D = ρ(h)V 2CD S f ,
2
T0 = 10000
502 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
T = D + m f g sin γ + T0 ,
Thus, denoting u = (u1 , u2 , u3 , u4 ) ∈ U = W 1,2 ([0,t f ]; R4 ), the system above indicated may be expressed
by
u̇1 = f1 (u)
u̇2 = f2 (u)
(26.12)
u̇3 = f3 (u)
u̇4 = f4 (u),
where
f1 (u) = u3 sin(u2 ),
f2 (u) = 1 (T (u) sin(e3 ) + L(u)) − g cos(u2 ),
m f u3 u3
(26.13)
f3 (u) = m1f (T (u) cos(e3 ) − D(u)) − g sin(u2 )
f4 (u) = u3 cos(u2 ).
Finally,
1
L(u) = ρ0 e−u1 /B9 u23CL S f ,
2
1
D(u) = ρ0 e−u1 /B9 u23CD S f ,
2
here d = 90/N, we N refers to the number of nodes concerning the discretization in t (in our numerical
example N = 10000).
Intending to apply the Newton’s method we linearize the system indicated in (26.14) about a initial guess
where
∂ fk (ũn )
(ak j )n = d, for j 6= k
∂uj
∂ f j (ũn )
(a j j )n = 1 + d, for j = k,
∂uj
and
4
∂ fk (ũn )
(gk )n = fk (ũn ) d − ∑ (ũ j )n d .
j=1 ∂ u j
We have obtained {un }. In a Newton’s method context, the next step is to replace ũn by {un } and thus to
repeat the process up to the satisfaction of an appropriate convergence criterion.
We have obtained the following solutions for h, γ,V and x. Please see Figures 26.1, 26.2, 26.3 and 26.4,
respectively.
12000
10000
8000
6000
4000
2000
0
0 10 20 30 40 50 60 70 80 90
1.4
1.2
0.8
0.6
0.4
0.2
−0.2
0 10 20 30 40 50 60 70 80 90
156
155
154
153
152
151
150
0 10 20 30 40 50 60 70 80 90
Linearizing the equations about the first solutions ũ, and ṽ, we obtain,
∂ f1 (ũ, ṽ)
u0 + f1 (ũ, ṽ) + (u − ũ)
∂u
∂ f1 (ũ, ṽ)
+ (v − ṽ) + g1 = 0, (26.18)
∂v
On the Numerical Solution of First Order Ordinary Differential Equation Systems 505
7000
6000
5000
4000
3000
2000
1000
0
0 10 20 30 40 50 60 70 80 90
∂ f2 (ũ, ṽ)
v0 + f2 (ũ, ṽ) + (u − ũ)
∂u
∂ f2 (ũ, ṽ)
+ (v − ṽ) + g2 = 0. (26.19)
∂v
In finite differences, we could write,
∂ f1 (ũn−1 , ṽn−1 )
un − un−1 + f1 (ũn−1 , ṽn−1 )d + (un−1 − ũn−1 )d
∂u
∂ f1 (ũn−1 , ṽn−1 )
+ (vn−1 − ṽn−1 )d + (g1 )n−1 d = 0, (26.20)
∂v
∂ f2 (ũn−1 , ṽn−1 )
vn − vn−1 + f2 (u˜n−1 , ṽn−1 )d + (un−1 − ũn−1 )d
∂u
∂ f2 (ũn−1 , ṽn−1 )
+ (vn−1 − ṽn−1 )d + (g2 )n−1 d = 0. (26.21)
∂v
Hence, we may write,
un = an un−1 + bn vn−1 + cn ,
vn = dn un−1 + en vn−1 + fn ,
Where
∂ f1 (ũn−1 , ṽn−1 )
an = − d + 1,
∂u
∂ f1 (ũn−1 , ṽn−1 )
bn = − d,
∂v
∂ f1 (u˜n−1 , ṽn−1 )
cn = − f1 (ũn−1 , ṽn−1 )d + ũn−1 d
∂u
∂ f1 (ũn−1 , ṽn−1 )
+ ṽn−1 d − (g1 )n−1 d, (26.22)
∂v
506 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
∂ f2 (ũn−1 , ṽn−1 )
dn = − d,
∂u
∂ f2 (ũn−1 , ṽn−1 )
en = − d + 1,
∂v
∂ f2 (ũn−1 , ṽn−1 )
fn = − f2 (ũn−1 , ṽn−1 )d + ũn−1 d
∂u
∂ f2 (ũn−1 , ṽn−1 )
+ ṽn−1 d − (g2 )n−1 d. (26.23)
∂v
In particular, for n = 1, we get
u1 = a1 u0 + b1 v0 + c1 , (26.24)
and
v1 = d1 u0 + e1 v0 + f1 . (26.25)
From this last equation,
v0 = (v1 − d1 u0 − f1 )/e1 ,
so that from this and equation (26.24), we get,
u1 = a1 u0 + b1 (v1 − d1 u0 − f1 )/e1 + c1 = F1 v1 + G1 ,
where
F1 = b1 /e1 , G1 = a1 u0 − b1 (d1 u0 + f1 )/e1 + c1 .
Reasoning inductively, having,
and
Gn = an (Fn−1 Ln + Gn−1 ) + bn Ln + cn−1 .
Thus,
un = Fn vn + Gn ,
so that, in particular,
uN = FN v f + GN ,
vN−1 = HN v f + LN ,
and hence,
uN−1 = FN−1 vN−1 + GN−1 ,
vN−2 = HN−1 vN −1 + LN−1 ,
and so on, up to finding,
u1 = F1 v1 + G1 ,
and
v0 = H0 v1 + L0 ,
where H0 = 1/e1 and L0 = −(d1 u0 + f1 )/e1 .
The next step is to replace {ũn } and {ṽn } by {un } and {vn } respectively and then to repeat the process up
the satisfaction of an appropriate convergence criterion.
where n(t) is the neutron population, C(t) is the concentration of delayed neutrons, T (t) is the core tempera-
ture, ρ(T ) is the reactivity (which depends on the temperature T ), β is the delayed neutron fraction, L is the
prompt reactors generation time, λ is the average decay constant of the precursors and H is the inverse of the
reactor thermal capacity.
For our numerical examples we consider T (0s) = 300K and T (100s) = T f = 350K. Moreover we assume
the relation,
1 (β − ρ (0))
C(0) = n(0),
λ L
where n(0) is unknown (to be numerically calculated by our method such that we have T (100s) = T f ).
Also we consider
ρ(T ) = ρ(0) − α(T − T (0)).
The remaining values are: β = 0.0065, L = 0.0001s, λ = 0.00741s−1 , H = 0.05K/(MW s), α = 5 ·
10−5 K −1 , and ρ(0) = 0.2β .
First we linearize the system in question about (ñ, T̃ ) obtaining (in fact its a first approximation)
ρ(T̃ ) − β ρ(T ) − β
n0 (t) = n(t) + ñ(t)
L L
ρ(T̃ ) − β
− ñ(t) + λC(t), (26.27)
L
508 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
β
C0 (t) = n(t) − λC(t),
L
T 0 (t) = Hn(t),
where ρ(T ) = ρ(0) − α(T − T (0)).
Discretizing such a system in finite differences, we get
ρ(T̃i ) − β ρ(Ti ) − β
(ni+1 − ni )/d = ni + ñi
L L
ρ(T̃i ) − β
− ñi + λCi , (26.28)
L
β
(Ci+1 −Ci )/d = ni − λCi ,
L
(Ti+1 − Ti )/d = Hni ,
where d = 100s/N, where N is the number of nodes.
Hence, we may write
ni+1 = ai ni + bi Ti + diCi + ei , (26.29)
Ci+1 = f ni + gCi , (26.30)
Ti+1 = hTi + mni , (26.31)
where
ρ(T̃i ) − β
ai = 1 + d,
L
−α
bi = ñi d,
L
di = λ d,
(ρ(0) + αT (0) − β ) (ρ(T̃i ) − β )
ei = ñi d − ñi d,
L L
β
f = d,
L
g = 1 − λ d,
h = 1,
m = Hd.
Observe that
C0 = α̃n0 ,
where
β − ρ(0)
α̃ = .
Lλ
For i = 1 from (26.31) we obtain
T1 − hT0
n0 = = α1 T1 + β1 , (26.32)
m
where α1 = 1/m and β1 = −(h/m)T0 .
Therefore,
C0 = α̃n0 = α̃(α1 T1 + β1 ).
On the Numerical Solution of First Order Ordinary Differential Equation Systems 509
Still for i = 1, replacing this last relation and (26.32) into (26.29), we get
n1 = a1 (α1 T1 + β1 ) + b1 T0 + d1 α̃(α1 T1 + β1 ) + e1 ,
so that
n1 = α̃1 T1 + β̃1 , (26.33)
where
α̃1 = a1 α1 + d1 α̃α1 ,
and
β̃1 = a1 β1 + b1 T0 + d1 α̃β1 + e1 .
Finally, from (26.30),
C1 = f (α1 T1 + β1 ) + gα̃(α1 T1 + β1 )
= α̂1 T1 + β̂1 , (26.34)
where
α̂1 = f α1 + gα̃α1 ,
and
β̂1 = f β1 + gα̃β1 .
Reasoning inductively, having
ni = α̃i Ti + β̃i , (26.35)
ni−1 = αi Ti + βi , (26.36)
Ci = α̂i Ti + β̂i , (26.37)
we are going to obtain the corresponding relations for i + 1, i ≥ 1. From (26.31) and (26.35) we obtain
so that
Ti = ηi Ti+1 + ξi , (26.38)
where
ηi = (h + mα̃i )−1 ,
and
ξi = −(mβ̃i )ηi .
On the other hand, from (26.29), (26.35) and (26.37) we have
12
11
10
6
0 20 40 60 80 100
350
345
340
335
330
325
320
315
310
305
300
0 20 40 60 80 100
26.5 Conclusion
In this chapter, we have developed a method for solving a class of first order ordinary differential equations.
The results are applied to a flight mechanics problem which models the in plane climbing of an airplane. It
is worth mentioning the algorithm obtained is of easy implementation and very efficient from a computational
point of view.
Finally, we would highlight the numerical results obtained are perfectly consistent with the physical
problem context. In future works we intend to apply the method to solve relating optimal control problems.
Chapter 27
27.1 Introduction
This chapter develops two improvements relating to the generalized method of lines. In our previous publi-
cations [19, 18], we highlight the method there addressed may present a relevant error as a parameter ε > 0
is too small, that is, as ε is about 0.01, 0.001 or even smaller.
In the present section we develop a solution for such a problem through a proximal formulation suitable
for a large class of non-linear elliptic PDEs.
At this point we reintroduce the generalized method of lines, originally presented in F. Botelho [19].
In the present context we add new theoretical and applied results to the original presentation. Specially the
computations are all completely new. Consider first the equation
Remark 27.1.2 Observe that if kT 0 (x)kU ≤ α < 1, on a convex set C then T is a contraction mapping, since
by the mean value inequality,
The next result is the base of our generalized method of lines. For a proof see Theorem 15.7.3.
Theorem 27.1.3 (Contraction mapping theorem) Let C be a closed subset of a Banach space U. Assume
T is contraction mapping on C, then there exists a unique x̃ ∈ C such that x̃ = T (x̃). Moreover, for an arbitrary
x0 ∈ C defining the sequence
x1 = T (x0 ) and xk+1 = T (xk ), ∀k ∈ N
we have
xk → x̃, in norm, as k → +∞.
To obtain a fixed point for each Tn indicated in (27.5) is perfectly possible if ε ≈ O(1). However, if ε > 0
is small, the error in this process may be relevant.
To solve this problem, firstly we propose the following algorithm,
(un − un−1 ) 1
un+1 − 2un + un−1 + f2 (θ )d 2
d tn
∂ (un − un−1 ) 1 ∂ 2 un f4 (θ ) 2
+ f3 (θ )d 2 + d
∂θ tn d ∂ θ 2 tn2
d2 d2
+ f5 (θ ) g(un ) + fn
ε ε
d2
−K(un − (u0 )n )
ε
= 0. (27.6)
Such an equation is solved through the Banach fixed point theorem, that is, defining
(un − un−1 ) 1
Tn (un , un+1 , un−1 ) = un+1 + un + un−1 + f2 (θ )d 2
d tn
∂ (un − un−1 ) 1 ∂ 2 un f4 (θ ) 2
+ f3 (θ )d 2 + d
∂θ tn d ∂ θ 2 tn2
d2 d2
+ f5 (θ ) g(un ) + fn
ε ε
2 d2
d
+K(u0 )n / 3+K (27.7)
ε ε
On the Generalized Method of Lines and its Proximal Explicit and Hyper-Finite Difference Approaches 515
u2 = T2 (u1 , u2 , u3 ),
uN−2 = gN −2 (uN−1 ),
Remark 27.1.4 Here we consider some points concerning the convergence of the method.
In the next lines the norm indicated refers to the infinity one for C([0, 2π]; RN −1 ). In particular for n = 1
from above we have:
u1 = T1 (0, u1 , u2 ),
that is,
d2
2
d
u2 − 2u1 − K u1 + O K = 0.
ε ε
Hence, denoting
d2
a[1] = 1/ 2 + K
ε
and
d2
a[n] = 1/ 2 + K − a[n − 1] , ∀n ∈ {2, . . . , N − 1},
ε
for N sufficiently big we may obtain
2
d
ku1 − a[1]u2 k = O K ,
ε
and by induction 2
d
kun − a[n]un+1 k = nO K ,
ε
so that we would have
d
kun − a[n]un+1 k ≤ O K ), ∀n ∈ {1, . . . , N − 1}
ε
This last calculation is just to clarify that the procedure of obtaining the relation between consecutive lines
through the contraction mapping theorem is well defined.
where, for a Ginzburg-Landau type equation (see [4, 55] for the corresponding models in physics),
g(u) = −u3 + u,
u1 = 0, on Γ0 = {(1, θ ) : 0 ≤ θ ≤ 2π},
u1 = u f (θ ), on Γ1 = {(2, θ ) : 0 ≤ θ ≤ 2π}.
Through the generalized method of lines, for N = 10 (10 lines), d = 1/N in polar coordinates and finite
differences (please see [73] for general schemes in finite differences), equation (27.8), stands for
1 1 ∂ 2 un 2 3 d2 d2
(un+1 − 2un + un−1 ) + (un − un−1 )d + 2 d + (−un + un ) + = 0,
rn rn ∂ θ 2 ε ε
On the Generalized Method of Lines and its Proximal Explicit and Hyper-Finite Difference Approaches 517
∀n ∈ {1, . . . , N − 1}.
At this point we present, through the generalized method of lines, the concerning algorithm which may
be for the softwares maple or mathematica.
In this software, x stands for θ .
m8 = 10; (number of lines)
d = 1.0/m8 ; (thickness of the grid)
e1 = 0.01; (ε = e1 )
K = 70.0;
Clear[d1 , u, a, b, h];
For[i = 1, i < m8 , i + +,
z1 [i] = 0.0]; ( vector which stores Ku0 (i))
For[k1 = 1, k1 < 180, k1 + +,
Print[k1 ];
a = 0.0;
For[i = 1, i < m8 , i + +,
Print[i];
t = 1.0 + i ∗ d;
b[x− ] = u[i + 1][x];
b12 = 2.0;
A18 = 5.0;
k = 1;
W hile[ b12 > 10−4 ,
k = k + 1;
z = (u[i + 1][x] + b[x] + a + 1/t ∗ (b[x] − a) ∗ d ∗ d12 + 1/t 2 ∗ D[b[x], {x, 2}] ∗ d 2 ∗ d12
−b[x]3 ∗ d 2 ∗ d12 /e1 + b[x] ∗ d 2 ∗ d12 /e1 + 1.0 ∗ d 2 /e1 +
z1 [i] ∗ d 2 /e1 )/(3.0 + K ∗ d 2 /e1 );
z = Series[z, {d1 , 0, 2}, {u f [x], 0, 3}, {u0f [x], 0, 1}, {u00f [x], 0, 1}, {u000f [x], 0, 0}, {u0000
f [x], 0, 0}];
z = Normal[z];
z = Expand[z];
b[x− ] = z;
u[i + 1][x− ] = 0.0;
u f [x− ] = 0.0;
d1 = 1.0;
A19 = z;
b12 = Abs[A19 − A18 ];
A18 = A19 ;
Clear[u, u f , d1 ]];
a1 = b[x];
Clear[b];
u[i + 1][x− ] = b[x];
h[i] = a1 ;
a = a1 ];
b[x− ] = u f [x];
d1 = 1.0;
(27.9)
518 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
For[i = 1, i < m8 , i + +,
W1 [m8 − i] =
Series[h[m8 − i], {u f [x], 0, 3}, {u0f [x], 0, 1}, {u00f [x], 0, 1}, {u000f [x], 0, 0}, {u0000
f [x], 0, 0}];
W = Normal[W1 [m8 − i]];
b[x− ] = Expand[W ];
v[m8 − i] = Expand[W ]];
For[i = 1, i < m8 , i + +,
z1 [i] = K ∗ v[i]];
d1 = 1.0;
Print[Expand[v[m8 /2]]];
Clear[d1 , u, b]]
(27.10)
At this point we present the expressions for 10 lines, firstly for ε = 1 and K = 0. In the next lines x stands
for θ .
For each line u[n] we have obtained,
u[1] = 0.0588308 + 0.167434u f [x] − 0.00488338u f [x]2 − 0.00968371u f [x]3 + 0.0126122u00f [x]
−0.000641358u f [x] u00f [x] − 0.00190065u f [x]2 u00f [x] + 0.0000660286u f [x]3 u00f [x]
u[2] = 0.101495 + 0.316995u f [x] − 0.00919963u f [x]2 − 0.0182924u f [x]3 + 0.0225921u00f [x]
−0.00113308u f [x]u00f [x] − 0.00336691u f [x]2 u00f [x] + 0.000122652u f [x]3 u00f [x]
u[3] = 0.1295 + 0.450424u f [x] − 0.0127925u f [x]2 − 0.0257175u f [x]3 + 0.0294791u00f [x]
−0.00142448u f [x] u00f [x] − 0.00428071u f [x]2 u00f [x] + 0.000160933u f [x]3 u00f [x]
u[4] = 0.143991 + 0.568538u f [x] − 0.0153256u f [x]2 − 0.0315703u f [x]3 + 0.0331249u00f [x]
−0.0014821u f [x] u00f [x] − 0.00456619u f [x]2 u00f [x] + 0.000168613u f [x]3 u00f [x]
u[5] = 0.146024 + 0.672307u f [x] − 0.0164357u f [x]2 − 0.0352883u f [x]3 + 0.0336371u00f [x]
−0.00131323u f [x] u00f [x] − 0.00421976u f [x]2 u00f [x] + 0.000141442u f [x]3 u00f [x]
u[6] = 0.136541 + 0.762571u f [x] − 0.0158578u f [x]2 − 0.0361974u f [x]3 + 0.0312624u00f [x]
−0.000974635u f [x] u00f [x] − 0.00333176u f [x]2 u00f [x] + 0.0000901069u f [x]3 u00f [x]
u[7] = 0.116389 + 0.840008u f [x] − 0.0135378u f [x]2 − 0.0336098u f [x]3 + 0.0263271u00f [x]
−0.000565842u f [x] u00f [x] − 0.00210507u f [x]2 u00f [x] + 0.0000375075u f [x]3 u00f [x]
u[8] = 0.0864095 + 0.905032u f [x] − 0.00970893u f [x]2 − 0.0269167u f [x]3 + 0.0192033u00f [x]
−0.000206117u f [x] u00f [x] − 0.000856701u f [x]2 u00f [x] + 5.78758 ∗ 10−6 u f [x]3 u00f [x]
u[9] = 0.0473499 + 0.958203u f [x] − 0.00491745u f [x]2 − 0.0157466u f [x]3 + 0.0102907u00f [x].
On the Generalized Method of Lines and its Proximal Explicit and Hyper-Finite Difference Approaches 519
In the next lines we present the results relating to the software indicated, with ε = 0.01 and K = 70.
For each line u[n] we have obtained,
u[1] = 1.08673 + 4.6508 ∗ 10−7 u f [x] − 1.11484 ∗ 10−7 u f [x]2 + 3.25552 ∗ 10−8 u f [x]3
+5.13195 ∗ 10−9 u00f [x] − 2.24741 ∗ 10−9 u f [x] u00f [x]
+8.97477 ∗ 10−10 u f [x]2 u00f [x] − 8.86957 ∗ 10−11 u f [x]3 u00f [x]
u[2] = 1.27736 + 1.51118 ∗ 10−6 u f [x] − 4.39811 ∗ 10−7 u f [x]2 + 1.55883 ∗ 10−7 u f [x]3
+1.33683 ∗ 10−8 u00f [x] − 7.50593 ∗ 10−9 u f [x] u00f [x]
+3.77548 ∗ 10−9 u f [x]2 u00f [x] − 4.9729 ∗ 10−10 u f [x]3 u00f [x]
u[3] = 1.30559 + 6.91602 ∗ 10−6 u f [x] − 2.21813 ∗ 10−6 u f [x]2 + 8.89891 ∗ 10−7 u f [x]3
+4.85851 ∗ 10−8 u00f [x] − 3.22542 ∗ 10−8 u f [x] u00f [x]
+1.90602 ∗ 10−8 u f [x]2 u00f [x] − 3.20439 ∗ 10−9 u f [x]3 u00f [x]
Remark 27.1 Observe that since ε = 0.01 the solution is close to the constant value 1.3247 along the
domain, which is an approximate solution of equation −u3 + u + 1.0 = 0.
520 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
∂ 2u 1 ∂u
+ f2 (θ )
∂t 2 t ∂t
1 ∂ 2u f4 (θ ) ∂ 2 u
+ f3 (θ ) + 2
t ∂ θ ∂t t ∂θ2
+ f5 (θ )(g(u) + f ) = 0, (27.11)
un = Tn (un−1 , un , un+1 ),
where
(un − un−1 ) 1
Tn (un−1 , un , un+1 ) = un+1 + un + un−1 + f2 (θ )d 2
d tn
∂ (un − un−1 ) 1 ∂ 2 un f4 (θ ) 2
+ f3 (θ )d 2 + d
∂θ tn d ∂ θ 2 tn2
d2 d2
+ f5 (θ ) g(un ) + fn /3.0. (27.13)
ε ε
1. Choose N = 30 − 100 and N1 = 10 − 12 (specifically for the example in the next lines). Divide the
interval domain in the variable t into N1 equal pieces (for example, for the interval [1, 2] through
a concerning partition {t0 = 1,t1 , ...,tN1 = 2}, where tk = 1 + k/N1 and d = 1/(N N1 ) is the grid
thickness in t).
2. Through the generalized method of lines, solve the equation in question on the interval [tk ,tk+1 ] as
function of u(tk ) and u(tk+1 ) and the domain shape.
To calculate uk = {ukn } on [tk ,tk+1 ], proceed as follows. First observe that the equation in question
stands for
(ukn − ukn−1 ) 1
ukn+1 − 2ukn + ukn−1 + f2 (θ )d 2
d tnk
∂ (ukn − unk −1 ) 1 ∂ 2 ukn f4 (θ ) 2
+ f3 (θ )d 2 + d
∂θ k
tn d ∂ θ 2 (tnk )2
d2 d2
+ f5 (θ ) g(ukn ) + fnk
ε ε
= 0, (27.14)
where
tnk = 1 + (k − 1)/N1 + nd, ∀k ∈ {1, . . . , N1 }, n ∈ {1, . . . N − 1}.
Such an equation is solved through the Banach fixed point theorem, that is, defining
(ukn − ukn−1 ) 1
Tnk (ukn , ukn+1 , ukn−1 ) = ukn+1 + ukn + ukn−1 + f2 (θ )d 2
d tnk
∂ (ukn − ukn−1 ) 1 ∂ 2 ukn f4 (θ ) 2
+ f3 (θ )d 2 + d
∂θ tn d ∂ θ 2 tn2
d2 d2
+ f5 (θ ) g(ukn ) + fnk /3 (27.15)
ε ε
equation (27.14) stands for
ukn = Tnk (ukn−1 , ukn , un+1
k
),
so that for n = 1 we have
uk1 = T1 (u(tk ), uk1 , u2k ).
We may use the Contraction Mapping Theorem to calculate uk1 as a function of uk2 and u(tk ). The
procedure would be,
(a) set x1 = uk2 ,
(b) obtain recursively
x j+1 = T1k (u(tk ), x j , uk2 ),
(c) and finally get
uk1 = lim x j = g1 (u(tk ), u2 ).
j→∞
which correspond to the partial differential equation in question on the line k N, where
(un − un−1 ) 1
T̃k (un , un+1 , un−1 ) = un+1 + un + un−1 + f2 (θ )d 2
d tk
∂ (un − un−1 ) 1 ∂ 2 un f4 (θ ) 2
+ f3 (θ )d 2 + d
∂θ tk d ∂ θ 2 tk2
d2 d2
+ f5 (θ ) g(un ) + fn /3. (27.16)
ε ε
Here may use the Banach fixed point theorem for the final calculation as well.
The problem is then solved.
g(u) = −u3 + u,
On the Generalized Method of Lines and its Proximal Explicit and Hyper-Finite Difference Approaches 523
u1 = 0, on Γ0 = {(1, θ ) : 0 ≤ θ ≤ 2π},
u1 = u f (θ ), on Γ1 = {(2, θ ) : 0 ≤ θ ≤ 2π}.
Through the generalized method of lines, for N = 30 (30 lines in which sub-domain), N1 = 10 (10 sub-
domains) and d = 1/(N N1 ), in polar coordinates and finite differences (please see [73] for general schemes
in finite differences), equation (27.17) stands for
1 1 ∂ 2 un 2 d2 d2
(un+1 − 2un + un−1 ) + (un − un−1 )d + 2 2
d + (−u3n + un ) + = 0,
rn rn ∂ θ ε ε
∀n ∈ {1, . . . , N − 1}.
At this point we present, through the generalized method of lines, the concerning algorithm which may
be for the softwares mathematica or maple.
In this software, x stands for θ .
ClearAll;
m8 = 30; (number of lines for each sub-domain)
N1 = 10; (number of sub-domains)
d = 1.0/m8 /N1 ; (grid thickness)
e1 = 0.01; (ε = 0.01)
Clear[d1 , u, a, b, h,U];
For[k1 = 1, k1 < N1 + 1, k1 + +,
Print[k1 ];
a = U[k1 − 1][x];
For[i = 1, i < m8 , i + +,
t = 1.0 + (k1 − 1)/N1 + i ∗ d;
Print[i];
b[x− ] = u[i + 1][x];
For[k = 1, k < 35, k + +, (here we have fixed the number of iterations for this example)
z = (u[i + 1][x] + b[x] + a + 1/t ∗ (b[x] − a) ∗ d ∗ d12 + 1/t 2 ∗ D[b[x], {x, 2}] ∗ d 2 ∗ d12
+(−b[x]3 ∗ d 2 ∗ d12 /e1 + b[x] ∗ d 2 /e1 ∗ d12 ) + 1.0 ∗ d 2 /e1 ∗ d12 )/(3.0);
z = Series[z, {d1 , 0, 2}];
z = Normal[z];
z = Expand[z];
b[x− ] = z];
a1 = b[x];
Clear[b];
u[i + 1][x− ] = b[x];
h[k1 , i] = Expand[a1 ];
Clear[d1 ];
a = a1 ]; b[x− ] = U[k1 ][x];
For[i = 1, i < m8, i + +,
W1 [k1 , m8 − i] = Series[h[k1 , m8 − i], {d1 , 0, 2}];
W [k1 , m8 − i] = Normal[W1 [k1 , m8 − i]];
524 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
At this point we present the expressions for N1 = 10 and ε = 0.01. In the next lines x stands for θ (0 ≤ θ ≤
2π).
For each line u[tk ] = U[k] we have obtained:
U [1] = 1.11698 + 3.02073 ∗ 10−9U f [x] − 8.9816 ∗ 10−10U f [x]2 + 1.85216 ∗ 10−10U f [x]3
+8.284 ∗ 10−11U f00 [x] − 4.75816 ∗ 10−11U f [x] U f00 [x]
+1.35882 ∗ 10−11U f [x]2 U f00 [x] − 1.13972 ∗ 10−12U f [x]3 U f00 [x]
U[2] = 1.3107 + 7.41837 ∗ 10−9U f [x] − 3.8971 ∗ 10−9U f [x]2 + 1.58347 ∗ 10−9U f [x]3
+1.71087 ∗ 10−10 U f00 [x] − 1.71274 ∗ 10−10U f [x] U f00 [x]
+1.01077 ∗ 10−10U f [x]2 U f00 [x] − 8.08804 ∗ 10−12U f [x]3 U f00 [x]
U[3] = 1.32397 + 6.40836 ∗ 10−8U f [x] − 4.12903 ∗ 10−8U f [x]2 + 1.91826 ∗ 10−8U f [x]3
+1.13579 ∗ 10−9U f00 [x] − 1.42494 ∗ 10−9U f [x] U f00 [x]
+9.88305 ∗ 10−10U f [x]2 U f00 [x] − 9.82218 ∗ 10−11U f [x]3 U f00 [x]
On the Generalized Method of Lines and its Proximal Explicit and Hyper-Finite Difference Approaches 525
U[4] = 1.32468 + 7.82478 ∗ 10−7U f [x] − 5.26904 ∗ 10−7U f [x]2 + 2.48884 ∗ 10−7U f [x]3
+1.05836 ∗ 10−8U f00 [x] − 1.39954 ∗ 10−8U f [x] U f00 [x]
+1.00186 ∗ 10−8 U f [x]2 U f00 [x] − 1.26942 ∗ 10−9U f [x]3 U f00 [x]
U[5] = 1.32471 + 0.0000107587U f [x] − 7.21035 ∗ 10−6U f [x]2 + 3.33153 ∗ 10−6U f [x]3
+1.08485 ∗ 10−7U f00 [x] − 1.42477 ∗ 10−7U f [x] U f00 [x]
+1.00683 ∗ 10−7U f [x]2 U f00 [x] − 1.65819 ∗ 10−8U f [x]3 U f00 [x]
Remark 27.2 Observe that since ε = 0.01 the solution is close to the constant value 1.3247 along the
domain, which is an approximate solution of equation −u3 + u + 1.0 = 0. Finally, the first output of the
method is the solution on the N1 − 1 = 9 nodes U[1], . . . ,U[9] which, in some sense, justify the terminology
hyper-finite differences, even though the solution in all the N · N1 = 300 lines have been obtained.
27.3 Conclusion
In this chapter we have developed two improvements concerning the generalized method of lines. For a large
class of models, we have solved the problem of minimizing the error as the parameter ε > 0 is small. In a first
step we present a proximal formulation through the introduction of a parameter K > 0 and related equation
part properly specified. In a second step, we develop the hyper-differences approach which corresponds to
a domain division in smaller sub-domains so that the solution on each sub-domain is obtained through the
generalized method of lines.
We highlight the methods here developed may be applied to a large class of problems, including the
Ginzburg-Landau system in superconductivity in the presence of a magnetic field and respective magnetic
potential.
We intend to address this kind of model and others such as the Navier-Stokes system in a future research.
Chapter 28
28.1 Introduction
In the first part of this article, we obtain a linear system whose the solution solves the time-independent
incompressible Navier-Stokes system for the special case in which the external forces vector is a gradient. In
a second step we develop approximate solutions, also for the time independent incompressible Navier-Stokes
system, through the generalized method of lines. We recall that for such a method, the domain of the partial
differential equation in question is discretized in lines and the concerning solution is written on these lines as
functions of the boundary conditions and boundary shape. Finally, we emphasize these last main results are
established through applications of the Banach fixed point theorem.
At this point we describe the system in question.
Consider Ω ⊂ R2 an open, bounded and connected set, whose regular (Lipschitzian) internal boundary
is denoted by Γ0 and the regular external one is denoted by Γ1 . For a two-dimensional motion of a fluid on
Ω, we denote by u : Ω → R the velocity field in the direction x of the Cartesian system (x, y), by v : Ω → R,
the velocity field in the direction y and by p : Ω → R, the pressure one. We define P = p/ρ, where ρ is the
constant fluid density. Finally, ν denotes the viscosity coefficient and g denotes the gravity field. Under such
notation and statements, the time-independent incompressible Navier-Stokes system of partial differential
equations is expressed by,
ν∇2 u − u∂x u − v∂y u − ∂x P + gx = 0, in Ω,
ν∇2 v − u∂x v − v∂y v − ∂y P + gy = 0, in Ω, (28.1)
∂x u + ∂y v = 0, in Ω,
On the Generalized Method of Lines Applied to the Time-Independent Incompressible Navier-Stokes System 527
(
u = v = 0, on Γ0 ,
(28.2)
u = u∞ , v = 0, P = P∞ , on Γ1
In principle we look for solutions (u, v, P) ∈ W 2,2 (Ω) × W 2,2 (Ω) × W 1,2 (Ω) despite the fact that less
regular solutions are also possible specially concerning the weak formulation. Details about such Sobolev
spaces may be found in [1]. General results on finite differences and existence theory for similar systems
may be found in [73] and [75], respectively.
Thus, in such a sense, the next result complements this previous one, by introducing a new function w2 in
the solution expressions, which makes the concerning boundary conditions perfectly possible to be satisfied.
Theorem 28.2.1 For h = (∂x f , ∂y f ) ∈ C1 (Ω; R2 ), consider the Navier-Stokes system similar as above indi-
cated, that is,
ν∇2 u − u∂x u − v∂y u − ∂x P + ∂x f = 0, in Ω,
ν∇2 v − u∂x v − v∂y v − ∂y P + ∂y f = 0, in Ω, (28.5)
∂x u + ∂y v = 0, in Ω,
and P is a solution of the system indicated in the first two lines of (28.5) with boundary conditions indicated
in the third line of (28.6).
Proof 28.1 For w0 , w1 , w2 such that ∇2 w2 + 2∂xy w1 = 0 in Ω, u and v as indicated above and defining
h1 = u∂x u + v∂y u,
h2 = u∂x v + v∂y v
and ϕ ≡ ∇2 w2 + 2∂xy w1 = 0, we have (you may check it using the softwares MATHEMATICA or MAPLE)
∂ h1 ∂ h2
−
∂y ∂x
= (−∂yy w1 + ∂yy w0 − ∂xy w2 )ϕ
+(−∂y w1 + ∂y w0 − ∂x w2 )∂y ϕ
+(∂xy w2 + ∂xx w1 + ∂xx w0 )ϕ
+(∂y w2 + ∂x w1 + ∂x w0 )∂x ϕ
= 0, in Ω. (28.9)
∂ Ω = ∂ Ω0 ∪ ∂ Ω1 ,
∂ Ω0 = {(r(θ ), θ ) ∈ R2 : 0 ≤ θ ≤ 2π}
and
∂ Ω1 = {(2r(θ ), θ ) ∈ R2 : 0 ≤ θ ≤ 2π}.
For ν = 1, neglecting the gravity effects, the corresponding Navier-Stokes homogeneous system, in func-
tion of the variables (t, θ ) where t = r/r(θ ), is given by
r(θ )2 ∂ 2u
1 ∂u
L(u) = 2 + f2 (θ )
f0 (θ ) ∂t t ∂t
1 ∂ 2u f4 (θ ) ∂ 2 u
+ f3 (θ ) + 2 , (28.16)
t ∂ θ ∂t t ∂θ2
in Ω. Here f0 (θ ), f2 (θ ), f3 (θ ) and f4 (θ ) are known functions.
More specifically, denoting
−r0 (θ )
f1 (θ ) = ,
r(θ )
we have
f0 (θ ) = 1 + f1 (θ )2 ,
f10 (θ )
f2 (θ ) = 1 + ,
1 + f1 (θ )2
2 f1 (θ )
f3 (θ ) = ,
1 + f1 (θ )2
530 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and
1
f4 (θ ) = .
1 + f1 (θ )2
Also d1 and d2 are expressed by
∂u ∂u
d1 u = fˆ5 (θ ) + ( fˆ6 (θ )/t) ,
∂t ∂θ
∂u ∂u
d2 u = fˆ7 (θ ) + ( fˆ8 (θ )/t)
∂t ∂θ
Where
fˆ5 (θ ) = cos(θ )/r(θ ) + sin(θ )r0 (θ )/r2 (θ ),
fˆ6 (θ ) = − sin(θ )/r(θ ),
fˆ7 (θ ) = sin(θ )/r(θ ) − cos(θ )r0 (θ )/r2 (θ ),
fˆ8 (θ ) = cos(θ )/r(θ ).
We also define
f0 (θ )
h3 (θ ) = ,
r(θ )2
r(θ )2 ˆ
f5 (θ ) = f5 (θ ),
f0 (θ )
r(θ )2 ˆ
f6 (θ ) = f6 (θ ),
f0 (θ )
r(θ )2 ˆ
f7 (θ ) = f7 (θ ),
f0 (θ )
and
r(θ )2
f8 (θ ) = fˆ8 (θ ),
f0 (θ )
Observe that t ∈ [1, 2] in Ω.
From equations (28.13) and (28.14) we may write
Remark 28.1 Who taught me how to obtain this last approximate system was Professor Alvaro de Bortoli
of Federal University of Rio Grande do Sul, UFRGS, Porto Alegre, RS-Brazil.
At this point, discretizing only in t (in N lines), defining d = 1/N and tn = 1 + nd, ∀n ∈ {1, . . . , N − 1},
we represent such a concerning system in partial finite differences.
∂ un ∂ 2 un
and
∂θ ∂θ2
also in a finite differences context, so that in such a case we may also consider them as bounded operators.
Denoting
(un − un−1 ) f6 (x) ∂ un
dˆ1 (un , un−1 ) = f5 (x) + ,
d tn ∂ x
and
(un − un−1 ) f8 (x) ∂ un
dˆ2 (un , un−1 ) = f7 (x) + ,
d tn ∂ x
where x stands for θ , in partial finite differences, equation (28.18) stands for
where
Hence,
vn = (T2 )n (un+1 , un , un−1 ),
where
Hence,
Pn = (T3 )n (un+1 , un , un−1 ),
where
∀n ∈ {1, . . . , N − 1}.
Therefore, for n = 1 we obtain
u1 = T̂1 (u2 , u1 , u0 ).
We solve such an equation through the Banach fixed point theorem.
1. First set
(u1 )1 = u2 .
3. Finally obtain
u1 = lim uk1 ≡ F1 (u2 , u0 ).
k→∞
3. Finally obtain
un = lim ukn ≡ Fn (un+1 , u0 ).
k→∞
Line 2
u2 (x) = −0.081 f5 (x)Pf (x) + 0.081 f5 (x)P0 (x)
+0.799u0 (x) − 0.034 f2 (x)u0 (x) + 0.059 f5 (x)u0 (x)2
+0.048 f7 (x)u0 (x)v0 (x) − 0.022 f6 (x)Pf0
−0.036 f6 (x)P00 (x) − 0.059 f3 (x)u00 (x)
−0.025 f6 (x)u0 (x)u00 (x) − 0.025 f8 (x)v0 (x)u00 (x)
+0.028 f4 (x)u000 (x)
534 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Line 3
u3 (x) = −0.106 f5 (x)Pf (x) + 0.106 f5 (x)P0 (x)
+0.698u0 (x) − 0.075 f2 (x)u0 (x) + 0.060 f5 (x)u0 (x)2
+0.060 f7 (x)u0 (x)v0 (x) − 0.031 f6 (x)Pf0
−0.044 f6 (x)P00 (x) − 0.075 f3 (x)u00 (x)
−0.029 f6 (x)u0 (x)u00 (x) − 0.029 f8 (x)v0 (x)u00 (x)
+0.033 f4 (x)u000 (x)
Line 4
u4 (x) = −0.121 f5 (x)Pf (x) + 0.121 f5 (x)P0 (x)
+0.597u0 (x) − 0.084 f2 (x)u0 (x) + 0.064 f5 (x)u0 (x)2
+0.064 f7 (x)u0 (x)v0 (x) − 0.037 f6 (x)Pf0
−0.046 f6 (x)P00 (x) − 0.084 f3 (x)u00 (x)
−0.029 f6 (x)u0 (x)u00 (x) − 0.029 f8 (x)v0 (x)u00 (x)
+0.034 f4 (x)u000 (x)
Line 5
u5 (x) = −0.126 f5 (x)Pf (x) + 0.126 f5 (x)P0 (x)
+0.497u0 (x) − 0.086 f2 (x)u0 (x) + 0.062 f5 (x)u0 (x)2
+0.062 f7 (x)u0 (x)v0 (x) − 0.041 f6 (x)Pf0
−0.044 f6 (x)P00 (x) − 0.086 f3 (x)u00 (x)
−0.026 f6 (x)u0 (x)u00 (x) − 0.026 f8 (x)v0 (x)u00 (x)
+0.032 f4 (x)u000 (x)
Line 6
u6 (x) = −0.121 f5 (x)Pf (x) + 0.121 f5 (x)P0 (x)
+0.397u0 (x) − 0.080 f2 (x)u0 (x) + 0.056 f5 (x)u0 (x)2
+0.056 f7 (x)u0 (x)v0 (x) − 0.041 f6 (x)Pf0
−0.031 f6 (x)P00 (x) − 0.069 f3 (x)u00 (x)
−0.017 f6 (x)u0 (x)u00 (x) − 0.017 f8 (x)v0 (x)u00 (x)
+0.022 f4 (x)u000 (x)
Line 7
u7 (x) = −0.105 f5 (x)Pf (x) + 0.105 f5 (x)P0 (x)
+0.297u0 (x) − 0.069 f2 (x)u0 (x) + 0.045 f5 (x)u0 (x)2
+0.045 f7 (x)u0 (x)v0 (x) − 0.037 f6 (x)Pf0
−0.031 f6 (x)P00 (x) − 0.069 f3 (x)u00 (x)
−0.017 f6 (x)u0 (x)u00 (x) − 0.017 f8 (x)v0 (x)u00 (x)
+0.022 f4 (x)u000 (x)
On the Generalized Method of Lines Applied to the Time-Independent Incompressible Navier-Stokes System 535
Line 8
u8 (x) = −0.080 f5 (x)Pf (x) + 0.080 f5 (x)P0 (x)
+0.198u0 (x) − 0.051 f2 (x)u0 (x) + 0.032 f5 (x)u0 (x)2
+0.032 f7 (x)u0 (x)v0 (x) − 0.029 f6 (x)Pf0
−0.021 f6 (x)P00 (x) − 0.051 f3 (x)u00 (x)
−0.011 f6 (x)u0 (x)u00 (x) − 0.011 f8 (x)v0 (x)u00 (x)
+0.015 f4 (x)u000 (x)
Line 9
u9 (x) = −0.045 f5 (x)Pf (x) + 0.045 f5 (x)P0 (x)
+0.099u0 (x) − 0.028 f2 (x)u0 (x) + 0.016 f5 (x)u0 (x)2
+0.016 f7 (x)u0 (x)v0 (x) − 0.017 f6 (x)Pf0
−0.022 f6 (x)P00 (x) − 0.012 f3 (x)u00 (x)
−0.006 f6 (x)u0 (x)u00 (x) − 0.006 f8 (x)v0 (x)u00 (x)
+0.008 f4 (x)u000 (x)
For the field of velocity v, we have obtained the following expressions for the lines:
Line 1
v1 (x) = −0.045 f7 (x)Pf (x) + 0.045 f7 (x)P0 (x)
+0.899v0 (x) − 0.034 f2 (x)v0 (x) + 0.029 f5 (x)u0 (x)v0 (x)
+0.029 f7 (x)v0 (x)2 − 0.011 f8 (x)Pf0 (x)
−0.022 f8 (x)P00 (x) − 0.034 f3 (x)v00 (x)
−0.016 f6 (x)u0 (x)v00 (x) − 0.016 f8 (x)v0 (x)v00 (x)
+0.018 f4 (x)v000 (x)
Line 2
v2 (x) = −0.081 f7 (x)Pf (x) + 0.081 f7 (x)P0 (x)
+0.799v0 (x) − 0.059 f2 (x)v0 (x) + 0.048 f5 (x)u0 (x)v0 (x)
+0.048 f7 (x)v0 (x)2 − 0.022 f8 (x)Pf0 (x)
−0.036 f8 (x)P00 (x) − 0.059 f3 (x)v00 (x)
−0.025 f6 (x)u0 (x)v00 (x) − 0.025 f8 (x)v0 (x)v00 (x)
+0.028 f4 (x)v000 (x)
Line 3
v3 (x) = −0.106 f7 (x)Pf (x) + 0.106 f7 (x)P0 (x)
+0.698v0 (x) − 0.075 f2 (x)v0 (x) + 0.060 f5 (x)u0 (x)v0 (x)
+0.060 f7 (x)v0 (x)2 − 0.031 f8 (x)Pf0 (x)
−0.044 f8 (x)P00 (x) − 0.075 f3 (x)v00 (x)
−0.029 f6 (x)u0 (x)v00 (x) − 0.029 f8 (x)v0 (x)v00 (x)
+0.033 f4 (x)v000 (x)
536 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Line 4
v4 (x) = −0.121 f7 (x)Pf (x) + 0.121 f7 (x)P0 (x)
+0.597v0 (x) − 0.084 f2 (x)v0 (x) + 0.064 f5 (x)u0 (x)v0 (x)
+0.064 f7 (x)v0 (x)2 − 0.037 f8 (x)Pf0 (x)
−0.046 f8 (x)P00 (x) − 0.084 f3 (x)v00 (x)
−0.029 f6 (x)u0 (x)v00 (x) − 0.029 f8 (x)v0 (x)v00 (x)
+0.034 f4 (x)v000 (x)
Line 5
v5 (x) = −0.126 f7 (x)Pf (x) + 0.126 f7 (x)P0 (x)
+0.497v0 (x) − 0.086 f2 (x)v0 (x) + 0.062 f5 (x)u0 (x)v0 (x)
+0.062 f7 (x)v0 (x)2 − 0.041 f8 (x)Pf0 (x)
−0.044 f8 (x)P00 (x) − 0.086 f3 (x)v00 (x)
−0.026 f6 (x)u0 (x)v00 (x) − 0.026 f8 (x)v0 (x)v00 (x)
+0.032 f4 (x)v000 (x)
Line 6
v6 (x) = −0.121 f7 (x)Pf (x) + 0.121 f7 (x)P0 (x)
+0.397v0 (x) − 0.080 f2 (x)v0 (x) + 0.056 f5 (x)u0 (x)v0 (x)
+0.056 f7 (x)v0 (x)2 − 0.022 f8 (x)Pf0 (x)
−0.041 f8 (x)P00 (x) − 0.080 f3 (x)v00 (x)
−0.022 f6 (x)u0 (x)v00 (x) − 0.022 f8 (x)v0 (x)v00 (x)
+0.028 f4 (x)v000 (x)
Line 7
v7 (x) = −0.105 f7 (x)Pf (x) + 0.105 f7 (x)P0 (x)
+0.297v0 (x) − 0.069 f2 (x)v0 (x) + 0.045 f5 (x)u0 (x)v0 (x)
+0.045 f7 (x)v0 (x)2 − 0.037 f8 (x)Pf0 (x)
−0.031 f8 (x)P00 (x) − 0.069 f3 (x)v00 (x)
−0.017 f6 (x)u0 (x)v00 (x) − 0.017 f8 (x)v0 (x)v00 (x)
+0.022 f4 (x)v000 (x)
Line 8
v8 (x) = −0.080 f7 (x)Pf (x) + 0.080 f7 (x)P0 (x)
+0.198v0 (x) − 0.051 f2 (x)v0 (x) + 0.032 f5 (x)u0 (x)v0 (x)
+0.032 f7 (x)v0 (x)2 − 0.029 f8 (x)Pf0 (x)
−0.021 f8 (x)P00 (x) − 0.051 f3 (x)v00 (x)
−0.011 f6 (x)u0 (x)v00 (x) − 0.011 f8 (x)v0 (x)v00 (x)
+0.015 f4 (x)v000 (x)
On the Generalized Method of Lines Applied to the Time-Independent Incompressible Navier-Stokes System 537
Line 9
v9 (x) = −0.045 f7 (x)Pf (x) + 0.045 f7 (x)P0 (x)
+0.099v0 (x) − 0.028 f2 (x)v0 (x) + 0.016 f5 (x)u0 (x)v0 (x)
+0.016 f7 (x)v0 (x)2 − 0.017 f8 (x)Pf0 (x)
−0.011 f8 (x)P00 (x) − 0.029 f3 (x)v00 (x)
−0.057 f6 (x)u0 (x)v00 (x) − 0.057 f8 (x)v0 (x)v00 (x)
+0.008 f4 (x)v000 (x)
Finally, for the field of pressure P, we have obtained the following lines:
Line 1
P1 (x) = 0.101Pf (x) + 0.034 f2 (x)Pf (x)
+0.899P0 (x) − 0.034 f2 (x)P0 (x) + 0.046h3 (x) f5 (x)2 u0 (x)2
+0.092h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.046h3 (x) f7 (x)2 v0 (x)2
+0.034 f3 (x)Pf0 (x) − 0.034 f3 (x)P00 (x)
−0.045h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.045h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.014h3 (x) f6 (x)2 u00 (x)2 − 0.045h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.045h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.027h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.014h3 (x) f8 (x)2 v00 (x)2 + 0.008 f4 (x)Pf00 (x)
+0.018 f4 (x)P000 (x)
Line 2
P2 (x) = 0.201Pf (x) + 0.059 f2 (x)Pf (x)
+0.799P0 (x) − 0.059 f2 (x)P0 (x) + 0.082h3 (x) f5 (x)2 u0 (x)2
+0.163h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.082h3 (x) f7 (x)2 v0 (x)2
+0.059 f3 (x)Pf0 (x) − 0.059 f3 (x)P00 (x)
−0.074h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.074h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.020h3 (x) f6 (x)2 u00 (x)2 − 0.074h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.074h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.041h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.020h3 (x) f8 (x)2 v00 (x)2 + 0.015 f4 (x)Pf00 (x)
+0.028 f4 (x)P000 (x)
Line 3
P3 (x) = 0.302Pf (x) + 0.075 f2 (x)Pf (x)
+0.698P0 (x) − 0.075 f2 (x)P0 (x) + 0.107h3 (x) f5 (x)2 u0 (x)2
+0.214h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.107h3 (x) f7 (x)2 v0 (x)2
+0.075 f3 (x)Pf0 (x) − 0.075 f3 (x)P00 (x)
−0.089h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.089h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.023h3 (x) f6 (x)2 u00 (x)2 − 0.089h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.089h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.045h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.023h3 (x) f8 (x)2 v00 (x)2 + 0.021 f4 (x)Pf00 (x)
+0.033 f4 (x)P000 (x)
538 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Line 4
P4 (x) = 0.403Pf (x) + 0.084 f2 (x)Pf (x)
+0.597P0 (x) − 0.084 f2 (x)P0 (x) + 0.122h3 (x) f5 (x)2 u0 (x)2
+0.245h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.122h3 (x) f7 (x)2 v0 (x)2
+0.084 f3 (x)Pf0 (x) − 0.084 f3 (x)P00 (x)
−0.094h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.094h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.022h3 (x) f6 (x)2 u00 (x)2 − 0.094h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.094h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.045h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.022h3 (x) f8 (x)2 v00 (x)2 + 0.025 f4 (x)Pf00 (x)
+0.034 f4 (x)P000 (x)
Line 5
P5 (x) = 0.503Pf (x) + 0.086 f2 (x)Pf (x)
+0.497P0 (x) − 0.086 f2 (x)P0 (x) + 0.127h3 (x) f5 (x)2 u0 (x)2
+0.255h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.127h3 (x) f7 (x)2 v0 (x)2
+0.086 f3 (x)Pf0 (x) − 0.086 f3 (x)P00 (x)
−0.089h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.089h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.020h3 (x) f6 (x)2 u00 (x)2 − 0.089h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.089h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.040h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.020h3 (x) f8 (x)2 v00 (x)2 + 0.026 f4 (x)Pf00 (x)
+0.032 f4 (x)P000 (x)
Line 6
P6 (x) = 0.603Pf (x) + 0.080 f2 (x)Pf (x)
+0.397P0 (x) − 0.080 f2 (x)P0 (x) + 0.122h3 (x) f5 (x)2 u0 (x)2
+0.244h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.122h3 (x) f7 (x)2 v0 (x)2
+0.080 f3 (x)Pf0 (x) − 0.080 f3 (x)P00 (x)
−0.079h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.079h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.017h3 (x) f6 (x)2 u00 (x)2 − 0.079h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.079h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.033h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.017h3 (x) f8 (x)2 v00 (x)2 + 0.026 f4 (x)Pf00 (x)
+0.028 f4 (x)P000 (x)
On the Generalized Method of Lines Applied to the Time-Independent Incompressible Navier-Stokes System 539
Line 7
P7 (x) = 0.703Pf (x) + 0.069 f2 (x)Pf (x)
+0.297P0 (x) − 0.069 f2 (x)P0 (x) + 0.107h3 (x) f5 (x)2 u0 (x)2
+0.213h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.107h3 (x) f7 (x)2 v0 (x)2
+0.069 f3 (x)Pf0 (x) − 0.069 f3 (x)P00 (x)
−0.063 f5 (x) f6 (x)u0 (x)u00 (x) − 0.063 f5 (x) f8 (x)v0 (x)u00 (x)
+0.013h3 (x) f6 (x)2 u00 (x)2 − 0.063h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.063h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.025h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.013h3 (x) f8 (x)2 v00 (x)2 + 0.023 f4 (x)Pf00 (x)
+0.022 f4 (x)P000 (x)
Line 8
P8 (x) = 0.802Pf (x) + 0.051 f2 (x)Pf (x)
+0.198P0 (x) − 0.051 f2 (x)P0 (x) + 0.081h3 (x) f5 (x)2 u0 (x)2
+0.162h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.081h3 (x) f7 (x)2 v0 (x)2
+0.051 f3 (x)Pf0 (x) − 0.051 f3 (x)P00 (x)
−0.043h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.043h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.009h3 (x) f6 (x)2 u00 (x)2 − 0.043h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.043h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.017h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.009h3 (x) f8 (x)2 v00 (x)2 + 0.018 f4 (x)Pf00 (x)
+0.015 f4 (x)P000 (x)
Line 9
P9 (x) = 0.901Pf (x) + 0.028 f2 (x)Pf (x)
+0.099P0 (x) − 0.028 f2 (x)P0 (x) + 0.045h3 (x) f5 (x)2 u0 (x)2
+0.191h3 (x) f5 (x) f7 (x)u0 (x)v0 (x) + 0.045h3 (x) f7 (x)2 v0 (x)2
+0.028 f3 (x)Pf0 (x) − 0.028 f3 (x)P00 (x)
−0.022h3 (x) f5 (x) f6 (x)u0 (x)u00 (x) − 0.022h3 (x) f5 (x) f8 (x)v0 (x)u00 (x)
+0.004h3 (x) f6 (x)2 u00 (x)2 − 0.022h3 (x) f6 (x) f7 (x)u0 (x)v00 (x)
−0.022h3 (x) f7 (x) f8 (x)v0 (x)v00 (x) + 0.009h3 (x) f6 (x) f8 (x)u00 (x)v00 (x)
+0.004h3 (x) f8 (x)2 v00 (x)2 + 0.010 f4 (x)Pf00 (x)
+0.008 f4 (x)P000 (x)
Ω = {(r, θ ) | 1 ≤ r ≤ 2, 0 ≤ θ ≤ 2π},
∂ Ω0 = {(1, θ ) | 0 ≤ θ ≤ 2π},
and
∂ Ω1 = {(2, θ ) | 0 ≤ θ ≤ 2π}.
540 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
u = u0 (θ ), v = v0 (θ ), on ∂ Ω0 ,
u = v = 0, on ∂ Ω1 ,
so that in these examples we do not have boundary conditions for the pressure.
Through the generalized method of lines, neglecting gravity effects and truncating the series up to the
terms in d 2 , where d = 1/N is the mesh thickness concerning the discretization in r, we present numerical
results for the following approximation of the Navier-Stokes system,
ν∇2 u − u∂x u − v∂y u − ∂x P = 0, in Ω,
ν∇2 v − u∂x v − v∂y v − ∂y P = 0, in Ω, (28.25)
ε∇2 P + ∂x u + ∂y v = 0, in Ω,
where ε > 0 is a very small parameter. We highlight, since ε > 0 must be very small, the results obtained
through the generalized of lines, as indicated in the last section, may have a relevant error. Anyway, we may
use such a procedure to obtain the general line expressions, which we expect to be analytically suitable to
obtain a numerical result by calculating numerically the concerning coefficients for such lines, through a
minimization of the L2 norm of equation errors, in a finite differences context.
Thus, from such a method, the general expression for the velocity and pressure fields on the line n, are
given by (here x stands for θ and P0 must be calculated numerically in the optimization process):
un (x) = a1 [n] cos(x) + a2 [n]u0 (x) + a3 [n] cos[x]u0 (x)2
a4 [n] sin(x)u0 (x)v0 (x) + a5 [n] sin(x)u0 (x)u00 (x) + a6 [n] cos(x)v0 (x)u00 (x)
a7 [n]u000 (x) + a8 [n] sin(x) + a9 [n] sin(x)P0 (x)
a10 [n]P0 (x) cos(x) + a11 [n], (28.26)
Pn (x) = c1 [n] + c2 [n] cos(x)u0 (x) + c3 [n] sin(x)v0 (x) + c4 [n] sin(x)u00 (x)
+c5 [n] cos(x)v00 (x) + c6 [n]u000 (x) + c7 [n]v000 (x)
+c9 [n]P0 (x) + c10 [n]P000 (x). (28.28)
First example: For the first example,
u0 (x) = −1.5 sin(x)
and
v0 (x) = 1.5 cos(x).
Denoting
Z 2
J(u, v, P) = ν∇2 u − u∂x u − v∂y u − ∂x P dΩ
Ω
Z 2
+ ν∇2 v − u∂x v − v∂y v − ∂y P dΩ
ZΩ
+ (∂x u + ∂y v)2 dΩ, (28.29)
Ω
On the Generalized Method of Lines Applied to the Time-Independent Incompressible Navier-Stokes System 541
as, above mentioned, the coefficients {ai [n]}, {bi [n]}, {ci [n]} have been obtained through the numer-
ical minimization of J({un }, {vn }, {Pn }), so that for the mesh in question, we have obtained
For this first example: J({un }, {vn }, {Pn }) ≈ 9.23 10−12 for ν = 0.1,
We have plotted the fields u, v and P, for the lines n = 1, n = 5, n = 10, n = 15 and n = 19, for
a mesh 20 × 150 corresponding to 20 lines. Please, see the Figures from 28.1 to 28.6 for the case
ν = 0.1. For all graphs, please consider units in x to be multiplied by 2π/150.
1.5 1
0.8
1
0.6
0.4
0.5
0.2
0 0
−0.2
−0.5
−0.4
−0.6
−1
−0.8
−1.5 −1
0 50 100 150 0 50 100 150
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
0 50 100 150
Figure 28.1: First example, from the left to the right, fields of velocity u1 (x), u5 (x), u10 (x) for the lines n = 1, n = 5
and n = 10.
0.3 0.06
0.2
0.04
0.1
0.02
0
0
−0.1
−0.02
−0.2
−0.04
−0.3
−0.4 −0.06
0 50 100 150 0 50 100 150
Figure 28.2: First example, from the left to the right, fields of velocity u15 (x), u19 (x) for the lines n = 15, and n = 19.
542 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
1.5 1
0.8
1
0.6
0.4
0.5
0.2
0 0
−0.2
−0.5
−0.4
−0.6
−1
−0.8
−1.5 −1
0 50 100 150 0 50 100 150
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
0 50 100 150
Figure 28.3: First example, from the left to the right, fields of velocity v1 (x), v5 (x), v10 (x) for the lines n = 1, n = 5 and
n = 10.
0.3 0.06
0.2
0.04
0.1
0.02
0
0
−0.1
−0.02
−0.2
−0.04
−0.3
−0.4 −0.06
0 50 100 150 0 50 100 150
Figure 28.4: First example, from the left to the right, fields of velocity v15 (x), v19 (x) for the lines n = 15, and n = 19.
0.063 0.285
0.0625
0.284
0.062
0.283
0.0615
0.282
0.061
0.0605 0.281
0.06
0.28
0.0595
0.279
0.059
0.278
0.0585
0.058 0.277
0 50 100 150 0 50 100 150
0.388
0.387
0.386
0.385
0.384
0.383
0.382
0.381
0.38
0 50 100 150
Figure 28.5: First example, from the left to the right, fields of pressure P1 (x), P5 (x), P10 (x) for the lines n = 1, n = 5
and n = 10.
0.414 0.403
0.413 0.402
0.412
0.401
0.411
0.4
0.41
0.399
0.409
0.398
0.408
0.397
0.407
0.406 0.396
0.405 0.395
0 50 100 150 0 50 100 150
Figure 28.6: First example, from the left to the right, fields of pressure P15 (x), P19 (x) for the lines n = 15, and n = 19.
Again the coefficients {ai [n]}, {bi [n]}, {ci [n]} have been obtained through the numerical minimiza-
tion of J({un }, {vn }, {Pn }), so that for the mesh in question, we have obtained
For this second example: J({un }, {vn }, {Pn }) ≈ 6.0 10−7 for ν = 1.0.
In any case, considering the values obtained for J, it seems we have got good first approximations
for the concerning solutions.
544 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
For the second example, for the field of velocities u and v, and pressure field P, for the lines n = 1,
n = 5, n = 10, n = 15 and n = 19, please see Figures 28.7 to 28.12. Once more, for all graphs, please
consider units in x to be multiplied by 2π/150.
1.5 1
0.8
1
0.6
0.4
0.5
0.2
0 0
−0.2
−0.5
−0.4
−0.6
−1
−0.8
−1.5 −1
0 50 100 150 0 50 100 150
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
0 50 100 150
Figure 28.7: Second example, from the left to the right, fields of velocity u1 (x), u5 (x), u10 (x) for the lines n = 1, n = 5
and n = 10.
0.3 0.15
0.2
0.1
0.1
0.05
0
−0.1
0
−0.2
−0.05
−0.3
−0.4 −0.1
0 50 100 150 0 50 100 150
Figure 28.8: Second example, from the left to the right, fields of velocity u15 (x), u19 (x) for the lines n = 15, and n = 19.
On the Generalized Method of Lines Applied to the Time-Independent Incompressible Navier-Stokes System 545
2 1.4
1.2
1.5
1
0.8
1
0.6
0.5 0.4
0.2
0
0
−0.2
−0.5
−0.4
−1 −0.6
0 50 100 150 0 50 100 150
0.8
0.6
0.4
0.2
−0.2
−0.4
0 50 100 150
Figure 28.9: Second example, from the left to the right, fields of velocity v1 (x), v5 (x), v10 (x) for the lines n = 1, n = 5
and n = 10.
0.4 0.15
0.3 0.1
0.2 0.05
0.1 0
0 −0.05
−0.1 −0.1
−0.2 −0.15
−0.3 −0.2
0 50 100 150 0 50 100 150
Figure 28.10: Second example, from the left to the right, fields of velocity v15 (x), v19 (x) for the lines n = 15, and n = 19.
28.4 Conclusion
In the first part of this chapter, we obtain a linear system whose solution solves the time-independent in-
compressible Navier-Stokes system. In the second part, we develop solutions for two-dimensional examples
also for the time-independent incompressible Navier-Stokes system, through the generalized method of lines.
546 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
0.5 0.8
0.4 0.7
0.3
0.6
0.2
0.5
0.1
0.4
0
0.3
−0.1
−0.2 0.2
−0.3 0.1
0 50 100 150 0 50 100 150
0.9
0.8
0.7
0.6
0.5
0.4
0 50 100 150
Figure 28.11: Second example, from the left to the right, fields of pressure P1 (x), P5 (x), P10 (x) for the lines n = 1, n = 5
and n = 10.
1.1 4
1 3
0.9
2
0.8
1
0.7
0
0.6
−1
0.5
0.4 −2
−3
0 50 100 150 0 50 100 150
Figure 28.12: Second example, from the left to the right, fields of pressure P15 (x), P19 (x) for the lines n = 15, and n = 19.
Considering the values for J obtained, we have got very good approximate solutions for the model in ques-
tion, in a finite differences context. The extension of such results to R3 , compressible and time dependent
cases is planned for a future work.
Chapter 29
29.1 Introduction
In this chapter we develop a numerical method to compute the solution of an inverse problem through the
generalized method of lines.
More specifically, we consider a Laplace equation on a domain Ω ⊂ R2 with an internal boundary denoted
by ∂ Ω0 and an external one denoted by ∂ Ω1 . We prescribe boundary conditions for both ∂ Ω0 and ∂ Ω1 and a
third boundary condition for ∂ Ω1 (the external one) and consider the problem of finding the optimal shape for
the internal boundary ∂ Ω0 for which such a third boundary condition for the external boundary is satisfied.
The idea is to discretize the domain in lines (in fact curves) and write the solution of Laplace equation on
these lines as functions of the unknown internal boundary shape, through the generalized method of lines.
The second step is to minimize a functional which corresponds to the L2 norms of Laplace equation and
concerning third boundary condition.
Remark 29.1 About the references, this and many other similar problems are addressed in [49]. The gen-
eralized method of lines has been originally introduced in [19], with additional results in [18, 14]. Moreover,
about finite differences schemes we would cite [73]. Finally, details on the function spaces here addressed
may be found in [1].
where uo , u f and w ∈ C2 ([0, 2π]) are known periodic functions with period 2π, n denotes the outward normal
field to ∂ Ω,
∂ Ω0 = {(r(θ ), θ ) : 0 ≤ θ ≤ 2π},
∂ Ω1 = {(R, θ ) : 0 ≤ 0 ≤ 2π},
and R > 0.
The main idea is to discretize the domain in lines and through the generalized method of lines to obtain
the solution un (r(θ )) on each line n as a function of r(θ ). The final step is to compute the optimal r(θ ) in
order to minimize the cost functional J(r(θ )) defined by
29.3 About the generalized method of lines and the main result
At this point we start to describe the main result.
Consider firstly a Laplace equation in polar coordinates, that is,
∂ 2u 1 ∂ u 1 ∂ 2u
+ + = 0.
∂ r2 r ∂ r r2 ∂ θ 2
In order to apply the generalized method of lines, we a define a variable t, through the equation
r − r(θ )
t=
R − r(θ )
∂u ∂ u ∂t ∂u
= +
∂θ ∂t ∂θ ∂θ
∂u t −1 ∂u
= r0 (θ ) +
∂t R − r(θ ) ∂θ
∂u ∂u
= ( f1 (θ ) + t f2 (θ )) + , (29.2)
∂t ∂θ
where
−r0 (θ )
f1 (θ ) = ,
R − r(θ )
and
r0 (θ )
f2 (θ ) = .
R − r(θ )
A Numerical Method for an Inverse Optimization Problem through the Generalized Method of Lines 549
Also,
∂u ∂ u ∂t
=
∂r ∂t ∂ r
∂u 1
= , (29.3)
∂t R − r(θ )
so that
2
∂ 2u ∂ 2u
∂t
=
∂ r2 ∂t 2 ∂r
∂ 2u 1
=
∂t 2 (R − r(θ ))2
∂ 2u
= f3 (θ ) , (29.4)
∂t 2
where
1
f3 (θ ) = .
(R − r(θ ))2
Moreover, we may also obtain
∂ 2u ∂ 2u ∂u
= f4 (t, θ ) + f5 (t, θ )
∂θ2 ∂t 2 ∂t
∂ 2u ∂ 2u
+ f6 (t, θ ) + , (29.5)
∂t∂ θ ∂ θ 2
where,
f4 (t, θ ) = ( f1 (θ ) + t f2 (θ ))2 ,
f5 (t, θ ) = f10 (θ ) + t f20 (θ ) + f2 (θ )( f1 (θ ) + t f2 (θ )),
f6 (t, θ ) = 2( f1 (θ ) + t f2 (θ )).
Thus, for the new variables (t, θ ), dropping the bar in u, the Laplace equation is equivalent to
∂ 2u ∂u ∂ 2u ∂ 2u
+ f 7 (t, θ ) + f 8 (t, θ ) + f 9 (t, θ ) = 0,
∂t 2 ∂t ∂t∂ θ ∂θ2
in Ω̂ where
Ω̂ = {(t, θ ) ∈ R2 : 0 ≤ t ≤ 1, 0 ≤ θ ≤ 2π},
r = t(R − r(θ )) + r(θ ),
1 f4 (t, θ )
f0 (t, θ ) = + ,
(R − r(θ ))2 r2
f˜7 (t, θ )
f7 (t, θ ) = ,
f0 (t, θ )
f˜8 (t, θ )
f8 (t, θ ) = ,
f0 (t, θ )
f˜9 (t, θ )
f9 (t, θ ) =
f0 (t, θ )
550 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
and where
1 f5 (t, θ )
f˜7 (t, θ ) = +
r(R − r(θ )) r2
f6 (t, θ )
f˜8 (t, θ ) =
r2
and
1
f˜9 (t, θ ) = 2 .
r
So, discretizing the interval [0, 1] into N ∈ N pieces, that is defining,
n
tn = , ∀n ∈ {0, . . . , N }
N
and d = 1/N, in partial finite differences, the concerning equation stands for
un+1 − 2un + un−1 un − un−1
+ f7 (tn , θ )
d2 d
∂ 2 un
∂ un − un−1
+ f8 (tn , θ ) + f9 (tn , θ ) = 0, (29.6)
∂θ d ∂θ2
To solve this equation we apply the Banach fixed point theorem as it follows:
1. Set u11 = u2 .
2. Define
uk+1
1 = T1 (u2 , uk1 , u0 ), ∀k ∈ N.
3. Obtain
u1 = lim uk1 ≡ F1 (u2 , u0 ).
k→∞
A Numerical Method for an Inverse Optimization Problem through the Generalized Method of Lines 551
un = Tn (un+1 , un , u0 ),
where
Again, to solve this equation, we apply the Banach fixed point theorem, as it follows:
1. Set u1n = un+1 .
2. Define
uk+1
n = Tn (un+1 , ukn , u0 ), ∀k ∈ N.
3. Obtain
un = lim ukn ≡ Fn (un+1 , u0 ), ∀n{2, . . . , N − 1}.
k→∞
uN −1 = FN −1 (uN , u0 ) = FN −1 (u f , u0 ).
Anyway, we remark to properly run the software we have to make the approximation tn = t, ∀n ∈
{1, . . . , N − 1}. Thus, we have used the method above described just to find a general expression for un ,
which is approximately given by (here x stands for θ )
un (x) ≈ an [1]u f (x) + an [2]u0 (x) + an [3] f7 [tn , x]u f (x) + an [4] f7 [tn , x]u0 (x)
+an [5] f8 [tn , x]u0f (x) + an [6] f8 [tn , x]u00 (x)
+an [9] f9 [tn , x]u00f (x) + an [10] f9 [tn , x]u000 (x). (29.10)
Indeed this a first approximation for the series representing the solution on each line obtained by considering
terms up to order d 2 (in d).
Moreover the coefficients an [k] and r(θ ) expressed in finite differences are calculated through the mini-
mization of J(r(θ ), {an [k]}) given by
where
∂ Ω0 = {(θ , r(θ )) ∈ R2 : r(θ ) = 5.5(1.4 + cos(θ ))/1.5, and 0 ≤ θ ≤ 2π},
and as above indicated
∂ Ω1 = {(θ , R) ∈ R2 : 0 ≤ θ ≤ 2π}.
Having obtained u = {un }Nn=1 for N = 26 lines we define W = (u f − un−1 )/d, where d = 1/N. We have
considered N1 = 42 nodes for the discretization in θ .
In a second step, we calculate the optimal solution r(θ ), for the boundary conditions
We present numerical results for R = 20, K = 100. Please see Figure 29.1 for the exact solution and
Figure 29.2 for the optimal shape which minimizes J, calculated through the generalized method of lines (we
have used a very different initial solution for the iterative minimization procedure).
Finally, we have obtained,
k(∇u) · n − wk∞ ≈ 3.1218 10−5 .
and
k∇2 uk∞ ≈ 1.0626 10−6
These last results indicate the method proposed has been successful to compute such a problem.
−2
−4
−6
−8
−2 0 2 4 6 8 10
−2
−4
−6
−8
−2 0 2 4 6 8 10
Figure 29.2: Optimal shape calculated through the generalized method of lines for the internal boundary ∂ Ω0 of Ω.
29.4 Conclusion
In this chapter we have used the generalized method of lines to obtain an approximate solution for an inverse
optimization problem.
We emphasize the numerical results obtained indicate the numerical procedure proposed is an interesting
possibility to compute such a type of problem.
We also highlight the method here developed may be applied to a large class of similar models.
Chapter 30
ˆ
This chapter has been published in a similar form by the Journal Ciencia e Natura, from the Federal University
of Santa Maria, Santa Maria, RS - Brazil (reference [23]).
30.1 Introduction
In this chapter we develop a variational formulation suitable for the relativistic quantum mechanics approach
in a free particle context. The results are based on fundamental concepts of Riemannian geometry and suitable
extensions for the relativistic case. Definitions such as vector fields, connection, Lie Bracket and Riemann
tensor are addressed in the subsequent sections for the main energy construction.
Indeed, the action developed in this article, in some sense, generalizes and extends the one presented
in the Weinberg book [81], in Chapter 12 at page 358. In such a book, the concerned action is denoted by
I = IM + IG , where IM , the matter action, for N particles with mass mn and charge en , ∀n ∈ {1, . . . , N}, is
given by
s
N µ
dxn (p) dxnν (p)
Z ∞
IM = − ∑ mn −gµν (x( p)) dp
n=1 −∞ dp dp
1 √
Z
− gFµν F µν d 4 x
4 Ω
N µ
dxn (p)
Z ∞
+ ∑ en Aµ (x(p)) d p, (30.1)
n=1 −∞ dp
A Variational Formulation for Relativistic Mechanics 555
where {xn (p)} is the position field with concerning metrics {gµν (x(p))} and
∂ Aν ∂ Aµ
Fµν = −
∂ xµ ∂ xν
where
R = gµν Rµν .
Here,
Rµν = Rσµσ ν ,
where
Rηµσ ν
are the components of the well known Riemann curvature tensor.
According to [81], the Euler-Lagrange equations for I correspond to the Einstein field equations,
1
Rµν − gµν R + 8πGT µ ν = 0,
2
where the energy-momentum tensor T µν is expressed by
ρ(û(x,t))
|φ (û(x,t))|2 = ,
m
where m denotes the total system mass at rest. We emphasize it seems to be clear that in the previous book
the parametrization of the position field, through the parameter p, is one-dimensional.
In this work we do not consider the presence of electromagnetic fields.
556 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Anyway, the final expression of the related new action here developed is given by
J(φ , r, û, E )
r
T ∂ ui ∂ u j √
Z Z
= mc −gi j |φ (û(x,t))|2 −g | det(û0 (x,t))| dx dt
0 Ω ∂t ∂t
γ T ∂φ ∂φ∗ √
Z Z
+ g jk −g | det(û0 (x,t))| dxdt
2 0 Ω ∂ u j ∂ uk
γ T ∂φ∗ √
Z Z
∂φ
+ g jk φ ∗ +φ Γljk −g | det(û0 (x,t))| dxdt
4 0 Ω ∂ ul ∂ ul
l
!
Z TZ l
∂ Γl k ∂ Γ jk √
γ p l p
+ 2 jk
|φ | g − l
+ Γlk Γ jp − Γ jk Γl p −g | det(û0 (x, t ))| dxdt
2 0 Ω ∂uj ∂ ul
2√
Z T Z
0
− E(t) |φ (û(x,t))| −g | det(û (x,t))| dx − 1 dt. (30.3)
0 Ω
where
R̂ = g jk R̂ jk ,
R̂ jk = R̂ljlk
and
∂ Γljk ∂ Γlik
R̂li jk = − + Γ pjk Γlip − Γikp Γljp
∂ ui ∂uj
represent the Riemann curvature tensor, corresponds tho the Hilbert-Einstein one, as specified in the subse-
quent sections.
In the last section, we show how such a formulation may result, as an approximation, the well known rela-
tivistic Klein-Gordon one and the respective Euler-Lagrange equations. We believe the main results obtained
may be extended to more complex mechanical systems, including in some extent, the quantum mechanics
approach.
Finally, about the references, details on the Sobolev Spaces involved may be found in [1, 14]. For standard
references in quantum mechanics, we refer to [13, 48, 55] and the non-standard [12].
Definition 30.2.1 (Surface in Rn , the respective tangent space and the dual one) Let D ⊂ Rm be an
open, bounded, connected set with a regular (Lipschitzian) boundary denoted by ∂ D. We define a m-
dimensional C1 class surface M ⊂ Rn , where 1 ≤ m < n, as the range of a function r : D ⊂ Rm → Rn ,
where
M = {r(u) : u = (u1 , . . . , um ) ∈ D}
A Variational Formulation for Relativistic Mechanics 557
and
r(u) = X̂1 (u)e1 + X̂2 (u)e2 + · · · + X̂n (u)en ,
where X̂k : D → R is a C1 class function, ∀k ∈ {1, . . . , n}, and {e1 , . . . , en } is the canonical basis of Rn .
Let u ∈ D and p = r(u) ∈ M. We also define the tangent space of M at p, denoted by Tp (M), as
∂ r(u) ∂ r(u)
Tp (M) = α1 + · · · + αm : α1 , . . . , αm ∈ R .
∂ u1 ∂ um
We assume
∂ r(u) ∂ r(u)
,··· ,
∂ u1 ∂ um
to be a linearly independent set ∀u ∈ D.
Finally, we define the dual space to Tp (M), denoted by Tp (M)∗ , as the set of all continuous and linear
functionals (in fact real functions) defined on Tp (M), that is,
∂ r(u)
Tp (M)∗ = f : Tp (M) → R : f (v) = α · v, for some α ∈ Rn , ∀v = vi ∈ Tp (M) .
∂ ui
M = {r(u) ∈ Rn : u ∈ D ⊂ Rm }.
∀v = vi ∂∂r(u)
ui ∈ Tp (M).
Under such hypotheses,
d f ∈ Tp (M)∗ .
Reciprocally, let F ∈ Tp (M)∗ .
Under such assumption, there exists f ∈ C1 (M) such that
where
α = ∇ f (r(u)),
so that d f ∈ Tp (M)∗ .
Reciprocally, assume F ∈ Tp (M)∗ , that is, suppose there exists α ∈ Rn such that
F(v) = α · v,
558 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
∀v = vi ∂∂r(u)
ui ∈ Tp (M).
Define f : M → R by
f (w) = α · w, ∀w ∈ M.
In particular,
f (r(u)) = α · r(u) = α j X̂ j (u), ∀u ∈ D.
Therefore,
F(v) = d f (v), ∀v ∈ Tp (M).
The proof is complete.
At this point, we present the tangential vector field definition, to be addressed in the subsequent results
and sections.
Definition 30.2.3 (Vector field) Let M ⊂ Rn be a m-dimensional C1 class surface, where 1 ≤ m < n. We
define the set of C1 class tangential vector fields in M, denoted by X (M), as
∂ r(u)
X (M) = X = Xi (u) ∈ T (M) = {Tp (M) : p = r(u) ∈ M} ,
∂ ui
(X · f )(p) = d f (X(u))
( f ◦ r)({ui } + ε{Xi (u)}) − ( f ◦ r)({ui })
= lim
ε→0 ε
∂ ( f ◦ r)(u)
= Xi (u). (30.7)
∂ ui
The next definition is also very important for this work, namely, the connection one.
Definition 30.2.4 (Connection) Let M ⊂ Rn be a m-dimensional C1 class surface, where
M = {r(u) ∈ Rn : u ∈ D ⊂ Rm }
and
r(u) = X̂1 (u)e1 + · · · + X̂n (u)en .
We define an affine connection on M, as a map ∇ : X (M) × X (M) → X (M) such that
1.
∇ f X+gY Z = f ∇X Z + g∇Y Z,
A Variational Formulation for Relativistic Mechanics 559
2.
∇X (Y + Z) = ∇X Y + ∇X Z
and
3.
∇X ( fY ) = (X · f )Y + f ∇X Y,
∀X,Y, Z ∈ X (M ), f , g ∈ C∞ (M).
About the connection representation, we have the following result.
Theorem 30.2.5 Let M ⊂ Rn be a m-dimensional C1 class surface, where
M = {r(u) ∈ Rn : u ∈ D ⊂ Rm }
and
r(u) = X̂1 (u)e1 + · · · + X̂n (u)en .
Let ∇ : X (M) × X (M) → X (M) be an affine connection on M. Let u ∈ D, p = r(u) ∈ M and X,Y ∈ X (M)
be such that
∂ r(u)
X = Xi (u) ,
∂ ui
and
∂ r(u)
Y = Yi (u) .
∂ ui
Under such hypotheses, we have
!
m m
∂ r(u)
∇X Y = ∑ X ·Yi + ∑ Γijk X jYk ∈ Tp (M ), (30.8)
i=1 j,k=1 ∂ ui
∂ r(u) ∂ r(u)
∇ ∂ r(u) = Γijk (u) .
∂uj ∂ uk ∂ ui
where
u0 (x,t) = ct,
and
u(x,t) = (u1 (x,t), u2 (x,t), u3 (x,t)),
∀(x,t) = ((x1 , x2 , x3 ),t) ∈ Ω × [0, T ].
At this point, we recall to have defined φ (û(x,t)) as a complex function such that
ρ(û(x,t))
|φ (û(x,t))|2 = ,
m
where m denotes the total system mass at rest. Also, we assume φ to be a C2 class function and define
ρ (û(x,t)) √
dm = q −g | det(û0 (x,t))| dx
v2
1 − c2
m|φ (û(x,t))|2 √
= q −g | det(û0 (x,t))| dx, (30.10)
v2
1 − c2
where dx = dx1 dx2 dx3 and û0 (x,t) denotes the Jacobian matrix of the vectorial function û(x,t).
Also,
∂ r(û)
gi = , ∀i ∈ {0, 1, 2, 3},
∂ ui
gi j = gi · g j , ∀i, j ∈ {0, 1, 2, 3},
and
g = det{gi j }.
Moreover,
{gi j } = {gi j }−1 .
and s 2 2 2
dX1 (u(x,t)) dX2 (u(x,t)) dX3 (u(x,t))
v= + + .
dt dt dt
The semi-classical kinetics energy differential is given by
dr(û) dr(û)
dEc = · dm
dt dt
2
dτ
= − dm
dt
= −(c2 − v2 ) dm, (30.12)
562 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
so that
(c2 − v2 ) √
dEc = −m q |φ (û)|2 −g | det(û0 (x,t))| dx
2
1 − vc2
r
v2 √
= −mc 1 − 2 |φ (uˆ )|2 −g | det(û0 (x,t))| dx
2
c
p √
= −mc c2 − v2 |φ (û)|2 −g | det(û0 (x,t))| dx
r
∂ ui ∂ u j √
= −mc −gi j |φ (û)|2 −g | det(û0 (x,t))| dx, (30.13)
∂t ∂t
and thus, the semi-classical kinetics energy Ec is given by
Z TZ
Ec = dEc dt,
0 Ω
that is, r
√
Z TZ
∂ ui ∂ u j
Ec = − mc −gi j |φ (û)|2 −g | det(û0 (x,t))| dx dt.
0 Ω ∂t ∂t
30.3.2 The energy part relating the curvature and wave function
At this point we define an energy part, related to the Riemann curvature tensor, denoted by Eq , where
√
Z TZ
γ
Eq = g jk R jk −g | det(û0 (x,t))| dx dt .
2 0 Ω
and
R jk = Re[Rijik (φ )].
Also, generically Re[z] denotes the real part of z ∈ C and Rli jk (φ ) is such that
∗ ∂ r(û) ˆ)
∗ ∂ r(u
∇ ∂ r(û) ∇ ∂ r(û)φ − ∇ ∂ r(û) ∇ ∂ r(û) φ
φ ∂u
i ∂uj ∂ uk φ ∂u ∂ ui ∂ uk
j
∂ r(û) ∂ r(û)
−∇ ∂ r(û) ∂ r(û) φ ∗ = Ril jk (φ ) . (30.14)
φ ∂u , ∂u ∂ uk ∂ ul
i j
A Variational Formulation for Relativistic Mechanics 563
∂ 2φ ∗ ∂ r(û) ∂ φ ∗ l ∂ r(û)
= φ δkl +φ Γ
∂ ui ∂ u j ∂ ul ∂ ui jk ∂ ul
∂ (φ ∗ Γlik ) ∂ r(û)
+φ
∂uj ∂ ul
∂ r(û)
+|φ |2 Γikp Γlj p . (30.16)
∂ ul
Moreover,
∗ ∂ r(û)
∇ ∂ r(û) ∂ r(û) φ
φ ∂u , ∂u ∂ uk
i j
∂ r(û)
= ∇ ∂ r(û) ∂ r(û) φ ∗
− ∂ u ·φ ∂ u ∂ uk
j i
∂ r(uˆ )
= −∇ ∂ φ ∂ r(û) φ ∗
∂ u j ∂ ui
∂ uk
∂φ ∂ r(û)
= − ∇ ∂ r(û) φ ∗
∂ u j ∂ ui ∂ uk
∗
∂ φ ∂ φ ∂ r(û) ∂ φ ∗ ∂ r(û)
= − − φ ∇ ∂ r(û)
∂ u j ∂ ui ∂ uk ∂uj ∂ ui ∂ uk
∗
∂ φ ∂ φ ∂ r(û) ∂ φ ∗ l ∂ r(û)
= − − φ Γik
∂ u j ∂ ui ∂ uk ∂uj ∂ ul
∂φ ∂φ ∗ ∂ r(û) ∂ φ ∗ l ∂ r(û)
= − δkl − φ Γik . (30.17)
∂ u j ∂ ui ∂ ul ∂uj ∂ ul
564 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
Thus,
∂ 2φ ∗ ∂φ∗ l ∂ (φ ∗ Γljk )
Rli jk (φ ) = φ δkl + φ Γik + φ + |φ |2 Γ pjk Γlip
∂ ui ∂ u j ∂uj ∂ ui
∂ 2φ ∗ ∂φ∗ l ∂ (φ ∗ Γik
l )
−φ δkl − φ Γ jk − φ − |φ |2 Γikp Γljp
∂ ui ∂ u j ∂ ui ∂uj
∂φ ∂φ∗ ∂φ ∗ l
+ δkl + φ Γik . (30.18)
∂ u j ∂ ui ∂uj
where
∂ Γljk ∂ Γlik
R̂li jk = − + Γ pjk Γlip − Γikp Γljp
∂ ui ∂uj
represents the Riemann curvature tensor.
At this point, we recall to have defined this energy part by
√
Z TZ
γ
Eq = R −g | det(û0 (x,t))| dxdt,
2 0 Ω
where
R = g jk R jk ,
and as above indicated, R jk = Re[Rijik (φ )].
Hence the final expression for the energy (action) is given by
More explicitly, the final action (the generalized Einstein-Hilbert one), would be given by
J(φ , r, û, E )
r
√
Z TZ
∂ ui ∂ u j
= mc −gi j |φ (û(x,t))|2 −g | det(û0 (x,t))| dx dt
0 Ω ∂t ∂t
∗
γ T jk ∂ φ ∂ φ √
Z Z
+ g −g | det(û0 (x,t))| dxdt
2 0 Ω ∂ u j ∂ uk
γ T ∂φ∗ √
Z Z
∂φ
+ g jk φ ∗ +φ Γljk −g | det(û0 (x,t))| dxdt
4 0 Ω ∂ ul ∂ ul
l
!
γ T ∂ Γllk ∂ Γ jk √
Z Z
p l p
+ 2 jk
|φ | g − l
+ Γl k Γ jp − Γ jk Γl p −g | det(û0 (x,t))| dxdt
2 0 Ω ∂uj ∂ ul
2√
Z T Z
0
− E(t) |φ (û(x,t))| −g | det(û (x,t))| dx − 1 dt. (30.21)
0 Ω
so that
dr(û(x,t))
≈ (c, 0, 0, 0),
dt
we would obtain
γ T 1 ∂ φ (x,t) ∂ φ ∗ (x, t )
Z Z
Eq /c ≈ − 2
2 0 Ω c ∂t ∂t
!
3
∂ φ (x,t) ∂ φ ∗ (x,t)
+∑ dxdt, (30.22)
k=1 ∂ xk ∂ xk
and
√
Z TZ q Z TZ
Ec /c = m c2 |φ |2 1 − v2 /c2 −g| det(û0 (x,t))| dx dt/c ≈ mc2 |φ (x,t)|2 dxdt.
0 Ω 0 Ω
566 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
!
3
γ 1 ∂ 2 φ (x,t ) ∂ 2 φ (x,t)
− ∑
2 c 2 ∂t 2
k=1 ∂ xk2
+mc2 φ (x,t) − E(t)φ (x,t) = 0, in Ω, (30.24)
where we assume the space of admissible functions is given by C1 (Ω × [0, T ]; C) with the following time and
spatial boundary conditions,
φ (x, 0) = φ0 (x), in Ω,
φ (x, T ) = φ1 (x), in Ω,
φ (x,t) = 0, on ∂ Ω × [0, T ].
Equation (30.24) is the relativistic Klein-Gordon one.
For E(t) = E ∈ R (not time dependent), at this point we suggest a solution (and implicitly related time
iEt
boundary conditions) φ (x,t) = e− h̄ φ2 (x), where
φ2 (x) = 0, on ∂ Ω.
in Ω.
Denoting
γE 2
E1 = − + mc2 − E ,
2c2 h¯ 2
the final eigenvalue problem would stand for
γ 3 ∂ 2 φ2 (x)
− ∑ ∂ x2 + E1 φ2 (x) = 0, in Ω
2 k=1 k
iE t
Moreover, from (30.24), such a solution φ (x,t) = e− φ2 (x) is also such that
h̄
!
3
γ 1 ∂ 2 φ (x,t ) ∂ 2 φ (x,t)
−∑
2 c2 ∂t 2 k=1 ∂ xk2
∂ φ (x,t)
+mc2 φ (x,t) = ih̄ , in Ω. (30.25)
∂t
At this point, we recall that in quantum mechanics,
γ = h̄2 /m.
¨
Finally, we remark this last equation (30.25) is a kind of relativistic Schrodinger-Klein-Gordon equation.
Hence, already including the Lagrange multipliers, considering r and {g jk } as independent variables,
such a functional again denoted by J(r, {g jk }, λ ) is expressed as
γ T √
Z TZ
∂r ∂r
Z Z
J(r, {g jk }, λ ) = R̂ −g dxdt + λ jk · − g jk dxdt.
2 0 Ω 0 Ω ∂ u j ∂ uk
568 Functional Analysis, Calculus of Variations and Numerical Methods for Models in Physics and Engineering
√
1
γ R jk − g jk R̂ −g − λ jk = 0, in Ω.
2
∂ 2 Xl (u) ∂ Xl (u) ∂ λ jk
λ jk + = 0, in Ω, (30.27)
∂ u j ∂ uk ∂ u j ∂ uk
so that √
∂ 2 Xl (u) √ ∂ Xl (u) ∂ [(R jk − 12 g jk R̂) −g]
1
[R jk − g jk R̂] −g + = 0, in Ω, (30.28)
∂ u j ∂ uk 2 ∂uj ∂ uk
∀l ∈ {1, 2, 3}.
Observe that the condition R jk = 0 in Ω × [0, T ], ∀ j, k ∈ {0, 1, 2, 3}, it is sufficient to solve the system
indicated in (30.28) but it is not necessary.
The system indicated in (30.28) is the Einstein field one. It is my understanding the actual variable for
this system is r not {g jk }.
However in some situations, it is possible to solve (30.28) through a specific metric {(g0 ) jk }, but one
question remains, how to obtain a corresponding r.
With such an issue in mind, given a specific metric {(g0 ) jk }, we suggest the following control problem,
3 2
∂ r(u) ∂ r(u)
Find r ∈ U which minimizes J1 (r) = ∑ · − (g0 ) jk ,
j,k=0 ∂uj ∂ uk 2
subject to
√
∂ 2 Xl (u) √ ∂ Xl (u) ∂ [(R jk − 12 g jk R̂) −g]
1
[R jk − g jk R̂] −g + = 0, in Ω, (30.29)
∂ u j ∂ uk 2 ∂uj ∂ uk
∀l ∈ {1, 2, 3}.
30.6 Conclusion
This work proposes an action (energy) suitable for the relativistic quantum mechanics context. The Riemann
tensor represents an important part of the action in question, but now including the density distribution of
mass in its expression. In one of the last sections, we obtain the relativistic Klein-Gordon equation as an
approximation of the main action, under specific properly described conditions.
We believe the results obtained may be applied to more general models, such as those involving atoms
and molecules subject to the presence of electromagnetic fields.
Anyway, we postpone the development of such studies for a future research.
References
[1] R.A. Adams, Sobolev Spaces, Academic Press, New York, 1975.
[2] R.A. Adams and J.F. Fournier, Sobolev Spaces, second edition, Elsevier, 2003.
[3] G. Allaire, Shape Optimization by the Homogenization Method, Springer-Verlag, New York, 2002.
[4] J.F. Annet, Superconductivity, Superfluids and Condensates, Oxford Master Series in Condensed Matter
Physics, Oxford University Press, Reprint, 2010.
[5] H. Attouch, G. Buttazzo and G. Michaille, Variational Analysis in Sobolev and BV Spaces, MPS-SIAM
Series in Optimization, Philadelphia, 2006.
[6] G. Bachman and L. Narici, Functional Analysis, Dover Publications, Reprint, 2000.
[7] J.M. Ball and R.D. James, Fine mixtures as minimizers of energy, Archive for Rational Mechanics and Anal-
ysis, 100: 15–52, 1987.
[8] M.P. Bendsoe and O. Sigmund, Topology Optimization, Theory Methods and Applications, Springer, Berlin,
2003.
¨
[9] F. Bethuel, H. Brezis and F. Helein, Ginzburg-Landau vortices, Birkhauser, Basel, 1994.
[10] W.R. Bielski, A. Galka and J.J. Telega, The Complementary Energy Principle and Duality for Geometrically
Nonlinear Elastic Shells. I. Simple case of moderate rotations around a tangent to the middle surface. Bulletin
of the Polish Academy of Sciences, Technical Sciences, 38: 7–9, 1988.
[11] W.R. Bielski and J.J. Telega, A contribution to contact problems for a class of solids and structures, Arch.
Mech., 37(4-5): 303-320, Warszawa, 1985.
[12] D. Bohm, A suggested interpretation of the quantum theory in terms of hidden variables I, Phys. Rev. 85: 2,
1952.
[13] D. Bohm, Quantum Theory (Dover Publications INC., New York, 1989).
[14] F. Botelho, Functional Analysis and Applied Optimization in Banach Spaces, Springer Switzerland, 2014.
[15] F. Botelho, Variational Convex Analysis, Ph.D. thesis, Virginia Tech, Blacksburg, VA -USA, 2009.
[16] F. Botelho, Variational Convex Analysis, Applications to non-Convex Models, Lambert Academic Publishing,
Berlin, June, 2010.
[17] F. Botelho, Dual variational formulations for a non-linear model of plates, Journal of Convex Analysis,
17(1): 131-158, 2010.
570 References
[18] F. Botelho, Existence of solution for the Ginzburg-Landau system, a related optimal control problem and its
computation by the generalized method of lines, Applied Mathematics and Computation, 218: 11976–11989,
2012.
[19] F. Botelho, Topics on Functional Analysis, Calculus of Variations and Duality, Academic Publications, Sofia,
2011.
[20] F. Botelho, On duality principles for scalar and vectorial multi-well variational problems, Nonlinear Anal-
ysis, 75: 1904–1918, 2012.
[21] F. Botelho On the Lagrange multiplier theorem in Banach spaces, Computational and Applied Mathematics,
32: 135–144, 2013.
[22] F. Botelho, A Classical Description of Variational Quantum Mechanics and Related Models, Nova Science
Publishers, New York, 2017.
[23] F.Botelho, A variational formulation for relativisticmechanics based on Riemannian geom-etry and its appli-
cation to the quantum mechanics context, arXiv:1812.04097v2[math.AP], 2018.
[25] L.D. Carr, C.W. Clark and W.P. Reinhardt, Stationary solutions for the one-dimensional nonlinear
Schroding
¨ er equation. I—Case of Repulsive Nonlinearity, Physical Review A, Volume 62, 063610, 2000.
[26] I.V. Chenchiah and K. Bhattacharya, The relaxation of two-well energies with possibly unequal moduli, Arch.
Rational Mech. Anal., 187: 409–479, 2008.
[27] M. Chipot, Approximation and oscillations, Microstructure and Phase Transition, The IMA Volumes in
Mathematics and Applications, 54: 27–38, 1993.
[28] R. Choksi, M.A. Peletier and J.F. Williams, On the Phase Diagram for Microphase Separation of Diblock
Copolymers: an Approach via a Nonlocal Cahn-Hilliard Functional, to appear in SIAM J. Appl. Math.,
2009.
[29] P. Ciarlet, Mathematical Elasticity, Vol. I—Three Dimensional Elasticity, North Holland Elsivier, 1988.
[30] P. Ciarlet, Mathematical Elasticity, Vol. II—Theory of Plates, North Holland Elsevier, 1997.
[31] P. Ciarlet, Mathematical Elasticity, Vol. III—Theory of Shells, North Holland Elsevier, 2000.
[33] I. Ekeland and R. Temam, Convex Analysis and Variational Problems, North Holland, 1976.
[34] L.C. Evans, Partial Differential Equations, Graduate Studies in Mathematics, 19, AMS, 1998.
[35] U. Fidalgo and P. Pedregal, A general lower bound for the relaxation of an optimal design problem in con-
ductivity with a quadratic cost functional and a general linear state equation, Journal of Convex Analysis
19(1): 281–294, 2012.
[36] N.B. Firoozye and R.V. Khon, Geometric parameters and the relaxation for multiwell energies, Microstruc-
ture and Phase Transition, the IMA volumes in mathematics and applications, 54: 85–110, 1993)
[37] I. Fonseca and G. Leoni, Modern Methods in the Calculus of Variations, L p Spaces, Springer, New York,
2007.
[38] D.Y. Gao and G. Strang, Geometric nonlinearity: Potential energy, complementary energy and the gap func-
tion, Quartely Journal of Applied Mathematics, 47: 487–504, 1989a.
References 571
[39] D.Y. Gao, On the extreme variational principles for non-linear elastic plates. Quarterly of Applied Mathe-
matics, XLVIII(2): 361–370, June 1990.
[40] D.Y. Gao, Finite deformation beam models and triality theory in dynamical post-Buckling analysis, Interna-
tional Journal of Non-linear Mechanics 35: 103–131, 2000.
[41] D.Y. Gao, Pure complementary energy principle and triality theory in finite elasticity, Mech. Res. Comm.,
26(1): 31–37, 1999.
[42] D.Y. Gao, General analytic solutions and complementary variational principles for large deformation non-
smooth mechanics, Meccanica 34: 169–198, 1999.
[43] D.Y. Gao, Duality Principles in Nonconvex Systems, Theory, Methods and Applications, Kluwer, Dordrecht,
2000.
[44] M. Giaquinta and S. Hildebrandt, Calculus of variations I, a series of comprehensive studies in mathematics,
vol. 310, Springer, 1996.
[45] M. Giaquinta and S. Hildebrandt, Calculus of variations II, a series of comprehensive studies in mathematics,
vol. 311, Springer, 1996.
[46] T. Giorgi and R.T. Smits, Remarks on the existence of global minimizers for the Ginzburg-Landau energy
functional Nonlinear Analysis, Theory Methods and Applications, 53, 147: 155, 2003.
[47] E. Giusti, Direct Methods in the Calculus of Variations, World Scientific, Singapore, Reprint, 2005.
[48] B. Hall, Quantum Theory for Mathematicians (Springer, New York, 2013).
[49] V. Isakov, Inverse Problems for Partial Differential Equations, Series Applied Mathematical Sciences 127,
second edition, Springer, New York, 2006.
[52] R.D. James and D. Kinderlehrer Frustration in ferromagnetic materials, Continuum Mech. Thermodyn., 2:
215–239, 1990.
[53] H. Kronmuller and M. Fahnle, Micromagnetism and the Microstructure of Ferromagnetic Solids, Cambridge
University Press, 2003.
[54] K. Ito and K. Kunisch, Lagrange Multiplier Approach to Variational Problems and Applications, Advances
in Design and Control, SIAM, Philadelphia, 2008.
[55] L.D. Landau and E.M. Lifschits, Course of Theoretical Physics, Vol. 5—Statistical Physics, part 1,
Butterworth-Heinemann, Elsevier, Reprint, 2008.
[56] E.M. Lifschits and L.P. Pitaevskii, Course of Theoretical Physics, Vol. 9—Statistical Physics, part 2,
Butterworth-Heinemann, Elsevier, Reprint, 2002.
[57] D.G. Luenberger, Optimization by Vector Space Methods, John Wiley and Sons, Inc., 1969.
[58] G.W. Milton, Theory of composites, Cambridge Monographs on Applied and Computational Mathematics.
Cambridge University Press, Cambridge, 2002.
[59] A. Molter, O.A.A. Silveira, J. Fonseca and V. Bottega, Simultaneous piezoelectric actuator and sensor place-
ment optimization and control design of manipulators with flexible links using SDRE method, Mathematical
Problems in Engineering, 2010: 1-23.
572 References
[60] P. Pedregal, Parametrized measures and variational principles, Progress in Nonlinear Differential Equations
and their Applications, 30, Birkhauser, 1997.
[61] P. Pedregal and B. Yan, On two dimensional ferromagnetism, Proceedings of the Royal Society of Edinburgh
139A: 575–594, 2009.
[62] M. Reed and B. Simon, Methods of Modern Mathematical Physics, Volume I, Functional Analysis, Reprint
Elsevier (Singapore, 2003).
[63] S M. Robinson, Strongly regular generalized equations, Math. of Oper. Res., 5: 43–62, 1980.
[64] R.T. Rockafellar, Convex Analysis, Princeton Univ. Press, 1970.
[65] R.C. Rogers, A nonlocal model of the exchange energy in ferromagnet materials, Journal of Integral Equa-
tions and Applications, 3(1), Winter 1991.
[66] R.C. Rogers, Nonlocal variational problems in nonlinear electromagneto-elastostatics, SIAM J. Math, Anal.,
19(6), November 1988.
[67] W. Rudin, Functional Analysis, second edition, McGraw-Hill, 1991.
[68] W. Rudin, Real and Complex Analysis, third edition, McGraw-Hill, 1987.
[69] H. Royden, Real Analysis, third edition, Prentice Hall, India, 2006.
[70] O. Sigmund, A 99 line topology optimization code written in Matlab, Struc. Muldisc. Optim., 21: 120–127
Springer-Verlag, 2001.
[71] J.J.A. Silva, A. Alvin, M. Vilhena, C.Z. Petersen and B. Bodmann, On a closed form solution of the point
cinetics equations with reactivity feedback of temprerature International Nuclear Atlantic Conference-INAC-
ABEN, Belo Horizonte, MG, Brazil, October 24–28, 2011, ISBN: 978-85-99141-04-05.
[72] E.M. Stein and R. Shakarchi, Real Analysis, Princeton Lectures in Analysis III, Princeton University Press,
2005.
[73] J.C. Strikwerda, Finite Difference Schemes and Partial Differential Equations, SIAM, second edition (2004).
[74] D.R.S. Talbot and J.R. Willis, Bounds for the effective contitutive relation of a nonlinear composite, Proc. R.
Soc. Lond., 460: 2705–2723, 2004.
[75] R. Temam, Navier-Stokes Equations, AMS Chelsea, Reprint, 2001.
[76] J.J. Telega, On the complementary energy principle in non-linear elasticity. Part I: Von Karman plates and
three dimensional solids, C.R. Acad. Sci. Paris, Serie II, 308: 1193–1198; Part II: Linear elastic solid and
non-convex boundary condition. Minimax Approach, ibid, pp. 1313–1317, 1989.
[77] A. Galka and J.J. Telega Duality and the complementary energy principle for a class of geometrically non-
linear structures. Part I. Five parameter shell model; Part II. Anomalous dual variational priciples for com-
pressed elastic beams, Arch. Mech., 47(677-698): 699–724, 1995.
[78] J.F. Toland, A duality principle for non-convex optimisation and the calculus of variations, Arch. Rath. Mech.
Anal., 71(1): 41–61, 1979.
[79] J.L. Troutman, Variational Calculus and Optimal Control, second edition, Springer, 1996.
[80] N.X. Vinh, Flight Mechanics of High Performance Aircraft, Cambridge University Press, New York, 1993.
[81] S. Weinberg, Gravitation and Cosmology, Principles and Applications of the General Theory of Relativity,
Wiley and Sons, Cambridge, Massachusetts, 1972.
[82] E. Kreyszig, Introductory Functional Analysis with Applications, John Wiley and Sons, New York, 1989.
[83] D.Y. Gao and H.F. Yu, Multi-scale modeling and canonical dual finite element method in phase transition
in solids, Int. J. Solids Struct. 45: 3660–3673, 2008.
Index
A compact set 10, 11, 21, 34, 36, 92, 120, 125, 170, 181,
184, 186–188, 198, 207, 215, 255, 278, 309, 310,
absolute continuous measures 145 317
adjoint operator 87, 94, 96, 98, 99, 108, 110, 116, 117, compactness 19, 20, 34, 77, 238–240, 251, 317
128, 130, 306 completeness 39, 49, 446, 456, 556
affine continuous 298, 316 complex Ginzburg-landau system 461, 464
Arzela-Ascoli theorem 24, 218 complex Hilbert space 57, 103, 106, 107, 109, 110,
116–118, 125, 132
B
composites 239
Baire category theorem 43–46 cone 91, 328–330, 332, 338, 342
balanced sets 30, 38, 195, 197 connected set 282, 294, 356, 359, 360, 365, 368, 370,
Banach space 1, 38, 40, 42, 43, 45, 46, 48, 66, 68, 69, 376, 381, 391, 393, 414, 420, 427, 440, 446, 452,
71–73, 75–77, 79–82, 84, 86–92, 113, 114, 202, 206, 456, 461, 466, 485, 496, 526, 547, 556, 560, 567
208, 241, 252–258, 262, 276–278, 282, 284, 296, constrained optimization 488, 496
298–301, 304, 306–309, 312, 322, 323, 328, 329, continuity 24, 25, 28–31, 41, 42, 69, 71, 78, 123, 196,
332, 333, 336, 338, 342, 344–346, 350, 369, 392, 223, 250, 279, 283, 295, 296, 299, 306, 320, 471
421, 514 continuous function 2, 28, 119–122, 176, 183, 206, 210,
bi-polar functional 298 220, 222, 253, 294, 301, 309, 313, 482
Borel set 137, 183, 189, 194 continuous operator 42, 43, 48, 64, 86, 87, 89, 100, 119,
bounded function 3, 41 125, 126, 132, 198, 205, 210, 222, 303, 306, 351
bounded operator 45, 86, 100, 109, 117, 531 control 465, 466, 475, 511, 568
bounded self-adjoint operator 94, 117 convergence 31, 120, 135, 140–142, 148, 157–161, 203,
bounded set 30, 41, 80, 92, 197, 199, 210–212, 214, 215, 206, 239, 244, 284, 431, 460, 484, 487, 496, 497,
219, 220, 222, 231, 235, 238, 239, 241, 243–245, 503, 507, 510, 515, 516, 520
250, 476 convex analysis 296, 298, 316, 323, 367, 381, 392, 409,
414, 415, 420, 437, 461, 477, 481
C convex envelop 298
convex function 260, 299
calculus of variations 245, 252, 273, 274, 282, 283, 295,
convex set 30, 60, 62, 63, 70, 72, 231, 296, 329, 331,
306, 320, 356, 360, 369, 370, 420, 426
332, 340, 344, 514
Cauchy sequence 15–19, 24, 39, 40, 45, 50, 53, 54, 58,
countably compact set 36
60, 97, 111, 131, 197, 203, 208, 223–225, 232, 235,
237, 240, 253–255, 321, 336, 345, 492 D
Cauchy-Schwartz inequality 52, 53
closed graph theorem 48 dense set 24, 31, 44, 60, 81, 176
closed set 6, 7, 9, 10, 19, 27, 28, 78, 167, 169, 170, 175, distribution 195, 198, 199, 208, 428, 436, 568
176, 189, 196 dual space 68, 76, 198, 557
closure 9, 27, 46, 80, 128, 130, 164, 180, 206, 210 dual variational formulation 367, 375, 391, 408, 420,
compact imbedding 238 426, 476
compact operator 92, 102 duality 296, 304, 320, 323, 331, 332, 367, 370, 375, 376,
381, 386, 390, 391, 393, 399, 402, 408–410, 415,
574 < Index
419, 420, 424, 426–428, 435, 437, 440, 446, 452, Gram-Schmidt orthonormalization 59, 60
456, 461, 464, 476, 477, 481 graph 48, 111, 127, 130, 258, 313
duality principle 370, 375, 376, 381, 386, 390, 391, 393,
399, 402, 408, 410, 415, 424, 427, 428, 435, 437, H
440, 446, 452, 456, 461, 464, 477, 481
Hölder continuous 206
du-Bois Reymond lemma 271, 274, 293
Hahn Banach theorem 66, 70–73, 75, 90, 210, 243, 300,
341
E
Hahn Banach theorem, geometric form 341
Egorov theorem 175 Hahn decomposition 144
eigenvalue 110, 113, 116, 566 Hausdorff space 29, 35, 180–183
Ekeland variational principle 320 Helly lemma 80
elasticity 367, 392, 408, 409, 414, 415, 419, 427, 435, Hilbert space 52, 53, 55–60, 62, 63, 87, 92, 93, 103,
436, 464 106–110, 115–118, 125, 127, 132, 146, 208
ε-net 20, 22 hyper-plane 69, 70, 72, 73, 75, 300, 328, 329
epigraph 296, 298
Euler system 438, 465, 527, 555, 556 I
inequality 4, 5, 52, 53, 57, 58, 61, 62, 74, 116, 141, 145,
F
169, 182, 201, 202, 204, 205, 208, 209, 214, 216,
Fatou’s lemma 141, 142 227, 228, 230, 231, 234, 236, 237, 240, 241, 252,
finite dimensional space 27 292, 331, 333, 334, 336, 338, 340, 341, 344, 372,
Fréchet derivative 284, 438, 440 374, 458, 467–469, 475, 493, 496, 514
Fréchet differentiability 284, 332, 350 inner product 52, 56, 65, 104, 145, 146, 208, 260
Fubini theorem 214, 216, 231 integral 139, 142, 201, 358, 359, 364
function 1, 3, 15, 28, 32, 33, 38–42, 52, 53, 64, 75, integration 136, 139, 178, 283, 292, 295, 369, 414, 438,
79, 80, 113, 115, 122, 125, 126, 136–141, 145, 476
149, 151, 152, 156, 158–160, 171, 173–175, 177, interior 6–10, 28, 30, 38–40, 43–46, 48, 72, 75–77, 114,
178, 181, 182, 191, 202, 206, 210, 212, 216, 219, 161, 164, 297, 329, 340
220, 222, 250, 252, 255, 256, 259–261, 263, 277, inverse mapping theorem 48, 89, 333
279, 282, 287, 289, 292, 294, 301–304, 306, 309, inverse operator 100
313, 314–316, 319, 320, 332, 333, 337, 338, 346,
351, 352, 355–357, 359, 361, 365, 381, 405, 409, J
414, 420, 432–437, 443, 449, 465, 466, 476, 482,
Jordan 145
488, 489, 493, 512, 515, 521, 527, 529, 547, 548,
555–558, 560–562 K
functional 26, 55, 66, 68, 69, 71, 87, 90, 103–106, 145,
146, 183, 198, 199, 205, 209, 210, 243, 255–258, Kakutani theorem 80–82
260, 262, 276–278, 282, 283, 288, 294, 296,
298–301, 303, 307–309, 312, 320, 322, 329, 331, L
332, 350, 372, 384, 386, 402, 420, 426–428, 435, Lagrange multiplier 329, 331, 332, 338, 339, 342, 343,
438–440, 446, 456, 460, 465, 466, 475, 476, 477, 351, 564, 567
547, 548, 567, 568 Lebesgue decomposition 148
functional analysis 26 Lebesgue dominated convergence theorem 159, 203,
244, 284, 460
G Lebesgue measurable function 171
Gâteaux derivative 299 Lebesgue measurable set 161, 166
Gâteaux differentiability 299, 350 Lebesgue monotone converge theorem 141, 157, 158,
Gâteaux variation 258, 276, 279, 283, 352 160
Generalized Method of Lines 344, 471, 482, 486, 487, Lebesgue points 191, 193, 194
497, 512, 514, 516, 517, 520, 521, 523, 525, 526, Lebesgue space 68, 201
529, 539, 540, 545, 547, 548, 550, 552, 553 Legendre functional 301
Ginzburg-Landau system 437, 456, 461, 464, 465, 475, Legendre Hadamard condition 284
525 Legendre transform 301, 420, 453
global existence 367, 390, 464, 475
Index < 575
limit 6–9, 11, 13, 14, 21, 32, 34–36, 39, 40, 58, 92, 97, normed space 38, 39, 48, 100, 210, 253
99, 102, 126, 131, 191, 203, 214, 232, 256, 258, 276, null space 41, 89, 333, 337
285, 291, 292, 469 numerical 408, 426, 428, 432, 435, 471, 473, 482, 484,
limit point 6–9, 11, 13, 21, 32, 34–36, 39, 40, 102, 131 485, 488, 497, 498, 501, 502, 507, 510, 511, 516,
linear function 41 522, 539–541, 543, 547, 552, 553
linear functional 55, 66, 68, 71, 87, 90, 146, 183, 198,
199, 205, 209, 210 O
linear mapping 40
open covering 10–12, 34
linear operator 42, 43, 46, 48, 86–91, 96, 100, 103, 106,
open mapping theorem 46
110, 113, 128, 130, 210, 222, 235, 303, 306, 351
open set 6, 8, 27, 28, 32, 35, 38–40, 136, 137, 161,
local minimum 277, 283, 289, 293, 332, 338, 339, 342,
167–169, 180–183, 185, 186, 188, 190, 192, 195,
343, 406, 445, 451, 478
201, 206, 218, 219, 221, 222, 229, 243, 246, 301,
locally convex space 30, 198
313–315, 317, 359, 364, 365
lower semi-continuous 181, 182, 191, 245, 296, 298,
operator 42, 43, 45, 46, 48, 64, 86–94, 96, 98–100, 102–
309, 312, 321, 322
104, 106, 108–110, 113, 116, 117, 119, 125–130,
132, 198, 205, 210, 222, 235, 239, 303, 306, 344,
M
351, 368, 392, 438, 440, 446, 514, 531
matrix version of generalized method of lines 482, 497 operator topology 86
maximal 58, 67, 298 optimal control 465, 466, 511
maximize 304 optimal design 427
maximum 451, 495 optimal shape 547, 552, 553
measurability 149, 150, 166–168 optimality conditions 338, 370, 390, 409, 419, 464
measurable function 136–141, 145, 148, 158, 171–175, optimization 304, 328, 329, 342, 343, 351, 402, 409, 419,
178, 301 420, 428, 431, 435, 436, 439, 452, 456, 481, 488,
measurable set 136, 138, 142–144, 149–151, 154, 155, 496, 497, 540, 547, 553
159, 161, 166, 169–171, 173, 175, 194, 209, 314 orthogonal complement 53
measure 136, 138, 142–145, 147–156, 159–161, 165, orthonormal basis 57–60, 92
166, 173, 175, 177, 178, 183, 184, 188, 191, 192, orthonormal set 57–59
194, 201, 291, 313, 373, 414, 427 outer measure 149, 151–154, 161, 166
measure space 138, 145, 148, 156, 160, 161
metric 1–11, 13–17, 19–21, 23, 24, 38–40, 42, 44, 52, 81, P
82, 86, 320, 322, 568
plate model 367, 369, 372, 375, 391, 392, 402
metric space 1–3, 6–11, 13–15, 19–21, 23, 24, 38–40, 44,
positive function 314
52, 81, 320
positive operator 93, 98, 119
micromagnetism 481
positive set 142–144
Milman-Pettis theorem 84
projection 60, 77, 78, 93, 122, 126, 132
minimizer 306, 320, 369, 372, 384, 390
minimum 4, 257, 267, 277, 283, 288, 289, 293, 309, 332, Q
338, 339, 342, 343, 406, 445, 451, 478
multi-well problems 477 quantum mechanics 456, 554, 556, 560, 567, 568
N R
natural set 294, 476 Radon-Nikodym theorem 145, 148, 194
Navier-Stokes system 525–527, 529, 540, 545 range 32, 41, 100, 110, 116, 133, 137, 189, 337, 556
necessary conditions 277, 283, 338, 342, 351 real analysis 26
neighborhood 6, 7, 28–32, 34, 38, 41, 42, 72, 74, 76, 77, real set 13, 282
80, 82–85, 180, 198, 296, 297, 299, 302, 306, 313, reflexive spaces 76
316, 319, 351, 355, 443 relative 20, 237
nested sequence 19 relative compactness 20
Newton’s method 482, 484–487, 497, 498, 502–504 Relativistic 554, 556, 560, 565–568
norm 38–40, 48, 53, 65, 68, 73, 80, 82, 86, 92, 93, 96, 97, resolvent 100, 101, 113
99, 105, 111, 114, 120, 135, 201, 202, 206, 208, 210, Riesz 55, 64, 65, 101, 102, 128, 146, 183, 205, 243
212, 224, 225, 243, 252–255, 298, 314, 336, 345, Riesz lemma 55, 128
358, 364, 369, 421, 438, 467, 469, 514, 516, 540 Riesz representation theorem 64, 65, 146, 183, 205, 243
normable space 40
576 < Index