You are on page 1of 283

~ TEXTS AND READINGS 52

~ IN MATHEMATICS

Functional Analysis

s. Kesavan

I1,Jr,i1@HINDUSTAN
UillJ UBOOK AGENCY
TEXTS AND READINGS
IN MATHEMATICS 52
Functional Analysis
Texts and Readings in Mathematics

Advisory Editor
C. S. Seshadri, Chennai Mathematical Institute, Chennai.

Managing Editor
Rajendra Bhatia, Indian Statistical Institute, New Delhi.

Editors
R. B. Bapat, Indian Statistical Institute, New Delhi.
V. S. Borkar, Tata Inst. of Fundamental Research, Mumbai.
Probal Chaudhuri, Indian Statistical Institute, Kolkata.
V. S. Sunder, Inst. of Mathematical Sciences, Chennai.
M. Vanninathan, TIFR Centre, Bangalore.
Functional Analysis

s. Kesavan
Institute of Mathematical Sciences
Chennai

fl:[gUgl HINDUSTAN
U U2J UBOOKAGENCY
Published in India by

Hindustan Book Agency (India)


P 19 Green Park Extension
New Delhi 110 016
India

email: info@hindbook.com
http://www.hindbook.com

Copyright © 2009, Hindustan Book Agency (India)

No part of the material protected by this copyright notice may be


reproduced or utilized in any form or by any means, electronic or
mechanical, including photocopying, recording or by any information
storage and retrieval system, without written permission from the
copyright owner, who has also the sole right to grant licences for
translation into other languages and publication thereof.

All export rights for this edition vest exclusively with Hindustan Book
Agency (India). Unauthorized export is a violation of Copyright Law
and is subject to legal action.

ISBN 978-81-85931-87-6 ISBN 978-93-86279-42-2 (eBook)


DOI 10.1007/978-93-86279-42-2
Preface

Functional analysis studies linear spaces provided with suitable topo-


logical structures and (continuous) linear transformations between such
spaces. It has far reaching applications in several disciplines. For in-
stance, the modern theory of (partial) differential equations and the
numerical approximation of their solutions rely a lot on functional ana-
lytic techniques.

Functional analysis is now an integral part of the curriculum in any


post graduate course in Mathematics. Ideally it should be taught after
having covered courses in linear algebra, real and complex analysis, and
topology. An introduction to the theory of measure and integration is
also helpful since the richest examples in functional analysis come from
function spaces whose study demands such a knowledge.

The present book grew out of notes prepared by myself while lectur-
ing to graduate students at the Tata Institute of Fundamental Research
(Bangalore Centre) and the Institute of Mathematical Sciences, Chen-
nai. The material presented in this book is standard and is ideally suited
for a course which can be followed by masters students who have cov-
ered the necessary prerequisites mentioned earlier. While covering all
the standard material, I have also tried to illustrate the use of various
theorems via examples taken from differential equations and the calcu-
lus of variations, either through brief sections or through the exercises.
In fact, this book is well suited for students who would like to pursue a
research career in the applications of mathematics. In particular, famil-
iarity with the material presented in this book will facilitate studying
my earlier book, published nearly two decades ago, 'Topics in Func-
tional Analysis and Applications' (Wiley Eastern, now called New Age
International), which serves as a functional analytic introduction to the
theory of partial differential equations.

The first chapter of the present book gives a rapid revision of linear
algebra, topology and measure theory. Important definitions, examples
and results are recalled and no proofs are given. At the end of each
section, the reader is referred to a standard text on that topic. This
chapter has been included only for reference purposes and it is not in-
tended that it be covered in a course based on this book.
vi

Chapter 2 introduces the notion of a normed linear space and that


of continuous linear transformations between such spaces.

Chapter 3 studies the analytic and geometric versions of the Hahn-


Banach theorem and presents some applications of these.

Chapter 4 is devoted to the famous 'trinity' in functional analysis


- the Banach-Steinhaus, the open mapping and the closed graph theo-
rems, which are all consequences of Baire's theorem for complete metric
spaces. Several applications are discussed. The notion of an 'unbounded
linear mapping' is introduced.

In my opinion, most texts do not emphasize the importance of weak


topologies in a course on functional analysis. That a bounded sequence
in a reflexive space admits a weakly convergent sequence is the corner
stone of many an existence proof in the theory of (partial) differential
equations. These topologies also provide nice counter-examples to show
the inadequacy of sequences in a general topological space. For instance,
we will see that two topologies on a set could be different while having
the same convergent sequences. We will also see an example of a com-
pact topological space in which a sequence does not have any convergent
subsequence. Chapter 5 deals with weak and weak* topologies and their
applications to the notions of reflexivity, separability and uniform con-
vexity.

Chapter 6 introduces the Lebesgue spaces and also presents the the-
ory of one of the simplest classes of Sobolev spaces.

Chapter 7 is devoted to the study of Hilbert spaces.

Chapter 8 studies compact operators and their spectra.

Much of the fun in learning mathematics comes from actually doing


it! Every chapter from Chapters 2 through 8 has a fairly large collection
of exercises at the end. These illustrate the results of the text, show
the optimality of the hypotheses of the various theorems via various ex-
amples or counterexamples, or develop simple versions of theories not
elaborated upon in the text. They are of varying degrees of difficulty.
vii

Occasionally, some hints for the solution are provided. It is hoped that
the students will benefit by solving them.

Since this is meant to be a first course on functional analysis, I have


kept the bibliographic references to a minimum, merely citing important
texts where the reader may find further details of the topics covered in
this book.

No claim of originality is made with respect to the material presented


in this book. My treatment of this subject has been influenced by the
writings of authors like Simmons (whose book was my first introduction
to Functional Analysis) and Rudin. These works figure in the bibliogra-
phy of this book. I would also like to mention a charming book, hardly
known outside the francophonic mathematical community, viz. 'Analyse
Fonctionnelle' by H. Brezis.

The preparation of this manuscript would not have been possible


but for the excellent facilities provided by the Institute of Mathematical
Sciences, Chennai, and I wish to place on record my gratitude. I also
thank the Hindustan Book Agency and the editor of the TRIM Series,
Prof. R. Bhatia, for their kind cooperation in bringing out this volume.
I must thank the anonymous referees who painstakingly went through
the first draft of the manuscript. I have tried to incorporate many of
their constructive suggestions in the current version.

Finally, I thank my family for its constant support and encourage-


ment.

Chennai, s. Kesavan
May 2008.
Notations
Certain general conventions followed throughout the text regarding
notations are described below. All other specific notations are explained
as and when they appear in the text.

• The set of natural numbers is denoted by the symbol N, the in-


tegers by Z, the rationals by Ql, the reals by lR and the complex
numbers by C.

• Sets (including vector spaces and their subspaces) and also linear
transformations between vector spaces are denoted by upper case
latin letters.

• Elements of sets (and, therefore, vectors as well) are denoted by


lower case latin letters.

• Scalars are denoted by lower case greek letters.

• To distinguish between the scalar zero and the null (or zero) vector,
the latter is denoted by the zero in boldface, i. e. o. This is also
used to denote the zero linear transformation and the the zero
linear functional.

• Column vectors in Euclidean space are denoted by lower case latin


letters in boldface and matrices are denoted by upper case latin
letters in boldface.

• Elements of £P spaces (cf. Chapter 6) are equivalence classes of


functions under the equivalence relation of 'equality almost every-
where'. To emphasize this fact, all elements of £P spaces (and
hence of Sobolev spaces as well) are denoted by lower case latin
letters in the sanserif font. A generic representative of that equiva-
lence class is denoted by the same lower case latin letter (in italics).
Thus, if f E Ll(O, 1), a generic representative of this class will be
denoted by f and will feature in all computations involving this
element. For instance, we have

Ilflll = fal If( t) I dt.

• The norm in a normed linear space V will be denoted by 11.11, or


by 11.llv, if we wish to distinguish it from other norms that may
x

be entering the argument. Similarly the innerproduct in a Hilbert


space H will, in general, be denoted by (.,.) (or by (., .)H in case
we wish to stress the role played by H).
Contents

1 Preliminaries 1
1.1 Linear Spaces . . . . . . 1
1.2 Topological Spaces . . . 11
1.3 Measure and Integration 18

2 Normed Linear Spaces 26


2.1 The Norm Topology 26
2.2 Examples . . . . . . 28
2.3 Continuous Linear Transformations. 36
2.4 Applications to Differential Equations 52
2.5 Exercises 57

3 Hahn-Banach Theorems 69
3.1 Analytic Versions . . . 69
3.2 Geometric Versions .. 77
3.3 Vector Valued Integration 81
3.4 An Application to Optimization Theory 85
3.5 Exercises 92

4 Baire's Theorem and Applications 97


4.1 Baire's Theorem . . . . . . . . . . 97
4.2 Principle of Uniform Boundedness 98
4.3 Application to Fourier Series . . . 101
4.4 The Open Mapping and Closed Graph Theorems 104
4.5 Annihilators . . . . . . . . . . . . 108
4.6 Complemented Subspaces . . . . 113
4.7 Unbounded Operators, Adjoints . 116
4.8 Exercises 125
xii

5 Weak and Weak* Topologies 132


5.1 The Weak Topology · 132
5.2 The Weak* Topology . · 139
5.3 Reflexive Spaces · .. · 145
5.4 Separable Spaces · .. · 147
5.5 Uniformly Convex Spaces · 152
5.6 Application: Calculus of Variations · 156
5.7 Exercises ............. · 158

6 LP Spaces 162
6.1 Basic properties . · 162
6.2 Duals of LP Spaces · 168
6.3 The Spaces LP(n) .170
6.4 The Spaces W1,P(a, b) .178
6.5 Exercises ...... · 188

7 Hilbert Spaces 195


7.1 Basic Properties · .......... · 195
7.2 The Dual of a Hilbert Space . . . . . .202
7.3 Application: Variational Inequalities .207
7.4 Orthonormal Sets . .210
7.5 Exercises .223

8 Compact Operators 231


8.1 Basic Properties · ......... .231
8.2 Riesz-Fredholm Theory . . . . . . .238
8.3 Spectrum of a Compact Operator . .244
8.4 Compact Self-Adjoint Operators · 251
8.5 Exercises . . . . . . . . . . . . . .258

Bibliography 265

Index 266
Chapter 1

Preliminaries

1.1 Linear Spaces

Functional Analysis is the study of vector spaces endowed with topo-


logical structures (that are compatible with the linear structure of the
space) and of (linear) mappings between such spaces. Throughout this
book we will be working with vector spaces whose underlying field is the
field of real numbers IR or the field of complex numbers C.
For completeness, we will recall some basic definitions and some im-
portant results.
Definition 1.1.1 A vector space or a linear space over a field IF
(whose elements are called scalars) is a set V, whose elements are called
vectors, on which two operations - addition and scalar multiplica-
tion - are defined such that the following properties hold:
Addition: (x, y) E V x V I---t X + Y E V such that
(i) (commutativity) for all x and y E V, we have

x+y = y+x;
(ii) (associativity) for all x, y and Z E V, we have

x + (y + z) = (x + y) + z;
(iii) there exists a unique vector 0 E V, called the zero or the null
vector, such that, for every x E V,

x+o = x;

(iv) for every x E V , there exists a unique vector -x E V such that

x + (-x) = o.
2 1 Preliminaries

Scalar Multiplication: (a, x) E IF x V I-t ax E V such that


(v) for every x E V, Ix = x where 1 is the multiplicative identity in IF;
(vi) for all a and f3 E IF and for every x E V,
a(f3x) = (af3)x;
(vii) for all a and f3 E IF and for every x E V,
(a + (3)x = ax + f3x;
(viii) for every a E IF and for all x and Y E V,
a(x+y) = ax+ay . •
Remark 1.1.1 The conditions (i) - (iv) above imply that V is an abelian
group with respect to vector addition. The conditions (vii) and (viii)
above are known as the distributive laws. •

Example 1.1.1 Let N 2: 1 be a positive integer. Define


~N = {X=(Xl, ... ,XN)lxiE~ foralll:Si:SN}.
We define addition and scalar multiplication componentwise, i. e. if x =
(Xl, ... , XN) and Y = (Yl, ... , YN) are elements of ~N and if a E ~, we
define
X+y= (Xl+Yl, ... ,XN+YN)
and
ax = (axl, ... ,axN).
It is now easy to see that ~N is a vector space over ~ with the zero
vector being that element in ~N with all its components zero. In the
same way, we can define cP as a vector space over C.
Setting N = 1, we see that ~ (respectively q is a vector space over
itself, the scalar multiplication being the usual multiplication operation .

Definition 1.1.2 Let V be a vector space and let W c V. Then W is
said to be a subspace of V if W is a vector space in its own right for
the same operations of addition and scalar multiplication. •

Definition 1.1.3 Let V be a vector space and let Xl, ... , Xn be vectors
in V. A linear combination of these vectors is any vector of the form
alxl + ... +anxn , where the ai, 1 :S i :S n are scalars. A linear relation
between these vectors is an equation of the form
alXl + ... + anXn = 0.•
1.1 Linear Spaces 3

Definition 1.1.4 A finite set of vectors in a vector space is said to be


linearly independent if there does not exist any linear relation between
them other than the trivial one, i. e, when all the scalar coefficients are
zero. If there exists a non-trivial linear relation, then the set of vectors
is said to be linearly dependent. An infinite set of vectors is said to
be linearly independent if there does not exist any finite linear relation
amongst vectors in that set. •

Given a vector space V and a set S of vectors, the collection of all


finite linear combinations of vectors from S will form a subspace of V.
In fact, it is clear that this subspace is the smallest subspace containing
S and is called the linear span of the set S or the subspace generated by
S. This subspace is denoted by span{S}.

Definition 1.1.5 A maximal linearly independent subset of a vector


space is called a basis . •

In other words, a basis is a linearly independent subset such that, if


any other vector is adjoined to the set, the enlarged set becomes linearly
dependent. This implies that every vector in the space can be expressed
as a (finite) linear combination of the members of the basis. Thus, the
vector space is generated by its basis.

Proposition 1.1.1 (i) Every vector space has a basis.


(ii) Any two bases of a vector space have the same cardinality . •

The above proposition leads us to the following definition.

Definition 1.1.6 A vector space V is said to be finite dimensional if


it admits a basis with a finite number of elements. Otherwise, it is said
to be infinite dimensional. The dimension of a vector space is the
number of elements in a basis, if it is finite dimensional, and infinity if
it is infinite dimensional and is denoted dim(V) . •

Example 1.1.2 The space ]RN (respectively, eN) has a basis which
is defined as follows. Let 1 ::; i ::; N. Let ei be the vector whose i-th
component is unity and all other components are zero. Then {el' ... , eN}
is a basis for]RN (respectively, eN) and is called the standard basis . •

Example 1.1.3 Let P denote the collection of all polynomials in one


variable with real coefficients. Let p(x) = "L~=o aixi and q(x) = "L~o bix i
4 1 Preliminaries

where the ai and bi are real numbers and x is the variable. Let a E R
Assume, without loss of generality, that m ::; n. Define
n
(p + q)(x) = I)ai + bdxi
i=O

where we set bi = 0 for m < i ::; n if m < n. Define


n
(ap)(x) = L aaixi.
i=O

With these operations, P becomes a vector space over R It is easy


to check that the collection of monomials {p ~o where Po(x) == 1 and
Pi(X) = xi for i > 1, forms a basis for P. Thus, P is an infinite dimen-
sional vector space. •

We will come across numerous examples of infinite dimensional vec-


tor spaces in the sequel (cf. Section 2.2, for instance).
Let V be a vector space and let Wi, 1 ::; i ::; n be subspaces. The
span of the Wi is the subspace of all vectors of the form

v= WI + ... + Wn
where Wi E Wi for each 1 :::; i :::; n. The spaces are said to be independent
if an element in the span is zero if, and only if each, Wi = O. In particular,
it follows that, if the Wi are independent, then for all 1 ::; i, j ::; n such
that i f. j, we have Wi n Wj = {O}. Further, every element in the span
will have a unique decomposition into vectors from the spaces Wi'
Definition 1.1.7 Let V be a vector space and let Wi, 1 ::; i ::; n be
subspaces. Then, V is said to be the direct sum of the Wi if the spaces
Wi are independent and their span is the space V. In this case we write

We now study mappings between vector spaces which preserve the


linear structure.

Definition 1.1.8 Let V and W be vector spaces (over the same base
field). A linear transformation, or linear operator, is a mapping
T : V -> W such that for all x and y E V and for all scalars a and (3,
we have
T(ax + (3y) = aT(x) + (3T(y).
1.1 Linear Spaces 5

If W is the base field (which is a vector space over itself), then a linear
transformation from V into W is called a linear functional on V. •

Definition 1.1.9 Let V and W be vector spaces and let T : V W


-t

be a linear transformation. The image of T is a subspace of Wand is


called the range of T. The dimension of the range is called the rank of
T. The set
{x E V IT(x) = O}
is a subspace of V and is called the null space or kernel of T . •

Definition 1.1.10 Let V and W be vector spaces and T a linear trans-


formation between them. The transformation is said to be invertible if
T is a bijection. •

It is easy to see that a linear transformation which is an injection


maps a linearly independent set onto a linearly independent set. In
particular, if T : V - t W is an injection, then, necesarily, dim(V) ~
dim(W). On the other hand, if T : V - t W is a surjection, clearly,
dim(V) ::::: dim(W). Thus, if T is invertible, then the two spaces must
have the same dimension.
We now focus our attention on finite dimensional spaces. Let V
be a space of dimension n with basis {VI, ... , vn } and W a space of
dimension m with basis {WI, ... , w m }. A linear map T : V - t W is
completely defined, once it is defined on a basis. So let us write
m

T(vj) = L tijWi, 1 ~ j ~ n. (1.1.1)


i=1

The coefficients (tij) in the above relation form a matrix with m rows
and n columns. Such a matrix is referred to as an m x n matrix. The
j-th column of the matrix represents the coefficients in the expansion
of T(vj) in terms of the basis { ~1 of W. Of course, if we change
the bases for V and W, the same linear transformation will be given by
another matrix. In particular, let dim(V) = n and let T : V - t V be a
linear operator. Let T be represented by the n x n matrix (also known
as a square matrix of order n) T = (tij) with respect to a given basis. If
we change the basis, then T will be represented by another n x n matrix
T = (tij) and the two will be connected by a relation of the form:
6 1 Preliminaries

where P is called the change of basis matrix and represents the linear
transformation which maps one basis to another. The matrix p-l rep-
resents the inverse of this change of basis m p~n and is the inverse
matrix of P. In this case, the matrices T and T are said to be simi-
lar. The identity matrix I represents the identity mapping x 1-+ x for
all x E V for any fixed basis of V. For a given basis, if T : V ~ V is
invertible, then the matrix representing T- 1 will be the inverse of the
matrix representing T.
A square matrix is said to be diagonal if all its off-diagonal entries
are zero. A n x n square matrix A = (aij) is said to be upper trian-
gular (repectively, lower triangular) if aij = 0 for all 1 :S j < i :S n
(repectively, aij = 0 for all 1 :S i < j :S n). It can be shown that every
matrix is similar to an upper triangular matrix. A matrix is said to be
diagonalizable if it is similar to a diagonal matrix.
Given an m x n matrix and two vector spaces of dimensions nand m
respectively along with a basis for each of them, the matrix can be used,
as in relation (1.1.1), to define a linear transformation beteween these
two spaces. Thus, there is a one-to-one correspondence between ma-
trices and linear transformations between vector spaces of appropriate
dimension, once the bases are fixed.
Definition 1.1.11 If T = (tij) is an m x n matrix, then the n x m
matrix T' = (tji), formed by interchanging the rows and the columns of
the matrix T, is called the transpose of the matrix T. If T = (tij) is
an m x n matrix with complex entries, then the n x m matrix T* = (tTj )
where tTj = tji (the bar denoting complex conjugation), is called the
adjoint of the matrix T . •
If x and y E ]Rn (respectively, en), then y'x (respectively, y*x) repre-
sents the 'usual' scalar product of vectors in ]Rn (respectively, en) given
by L~~l XiYi (respectively, L~=l x/fh). If the scalar product is zero, we
say that the vectors are orthogonal to each other and write x ~ y. If
W is a subspace and x is a vector orthogonal to all vectors in W, we
write x ~ W.
Definition 1.1.12 Let T be an m x n matrix. Then its row rank is
defined as the number of linearly independent row vectors of the matrix
and the column rank is the number of independent column vectors of
the matrix. •
The column rank is none other than the rank of the linear transfor-
mation defined by T and the row rank is the rank of the transformation
1.1 Linear Spaces 7

defined by the transpose. We have the following important result.

Proposition 1.1.2 For any matrix, the row and column ranks are equal
and the common value is called the rank of the matrix. •

Definition 1.1.13 The nullity of a matrix is the dimension of the null


space of the linear transformation defined by the matrix. •

Proposition 1.1.3 Let T be an m x n matrix. The the sum of the rank


of T and the nullity of T is equal to n.

Corollary 1.1.1 Ann x n matrix is invertible if, and only if, its nullity
is zero or, equivalently, its rank is n. Equivalently, a linear operator on
a finite dimensional space is one-to-one if, and only if, it is onto . •

n
Definition 1.1.14 Let T be an n x matrix with complex entries and
T* its adjoint. The matrix is said to be normal if

TT* = T*T.

The matrix is said to be unitary if

TT* = T*T = I.

The matrix is said to be self-adjoint or hermitian if T = T* . •

Definition 1.1.15 Let T be an n x n matrix with complex entries. It is


said to be positive semi-definite if for every n x 1 matrix with complex
entries, i.e, a column vector, x, we have

x*Tx ~ O.

The matrix T is said to be positive definite if, in addition, the above


inequality is strict if xi- O. •

Remark 1.1.2 A hermitian matrix is equal to its adjoint and the inverse
of a unitary matrix is its adjoint. A matrix T, with real entries, which
is equal to its transpose is called symmetric and one whose inverse is
its own transpose is called orthogonal. In case the matrix T has real
entries, we can still define positive semi-definiteness (or positive definite-
ness) by considering real column vectors x in the above definition. •
8 1 Preliminaries

We now introduce an important notion, viz. that of the determinant.


Before we do this, we need some notation. Let Sn denote the set of all
permutations of n symbols. A transposition is a permutation wherein
two symbols exchange places with each other and all other symbols are
left invariant. It is known that every permutation is the product (in the
sense of composition of mappings) of transpositions. A permutation is
even if it is the product of an even number of transpositions and odd if
it is the product of an odd number of transpositions. The signature of a
permutation CT, denoted sgn CT, is +1 if the permutation is even and -1
if it is odd.
Definition 1.1.16 Let T = (tij) be an n x n matrix. The determinant
of T, denoted det(T), is given by the formula

det(T) = L (sgn CT)t1,a(1) ... tn,a(n)' •


aESn

We list below the important properties of the determinant.


Proposition 1.1.4 (i) If I is the identity matrix of order n, then det(l) =
l.
(ii) If T and S are two n x n matrices, then

det(ST) = det(S).det(T).
In particular, T is invertible if, and only if, det(T) =I- 0 and
det(T- 1 ) = (det(T))-l.
(iii) If T is an n x n matrix, then

det(T') = det(T) . •

Definition 1.1.17 An invertible matrix is also said to be a non-singular


matrix. Otherwise, it is said to be singular. •

If T is a non-singular matrix of order n, then, given any n x 1 column


vector b, there exists a unique n x 1 column vector x such that

Tx = b.
This is because the corresponding linear transformation is invertible and
hence onto (which gives the existence of the solution) and one-to-one
(which gives the uniqueness of the solution). If T is singular, this is no
longer the case and we have the following result.
1.1 Linear Spaces 9

Proposition 1.1.5 (FredhOlm Alternative) Let T be a singular ma-


trix of order n and let b be an n x 1 column vector. Then, either the
system of n linear equations in n unknowns (written in matrix notation)

Tx = b
has no solution or has an infinite number of solutions. The latter pos-
sibility occurs if, and only if

b'u = 0

for all (column) vectors u such that T/u = o.•


We now come to a very important notion in linear algebra and func-
tional analysis.

Definition 1.1.18 Let T be a square matrix of order n with complex


entries. A complex number A is said to be an eigenvalue of T if there
exists an n x 1 vector u =1= 0 such that

Tu = AU.

Such a vector u is called an eigenvector of T associated to the eigen-


value A. The set of all eigenvectors associated to an eigenvalue A is a
subspace of en and is called the eigenspace associated to A.•

From the above definition, we see that A E e is an eigenvalue of a


square matrix T if, and only if the matrix T - ),1 is not invertible. Thus,
A will be an eigenvalue of T if, and only if,

det(T - ),1) = O. (1.1.2)

The expression on the left-hand side of (1.1.2) is a polynomial of degree


equal to the order of T and is called the characteristic polynomial
of T. Thus, every eigenvalue is a root of the characteristic polynomial
and so every matrix of order n has a non-empty set of at most n distinct
eigenvalues. Counting multiplicity, there are exactly n eigenvalues. The
equation (1.1.2) is called the characteristic equation of the matrix T.

Definition 1.1.19 Let A be an eigenvalue of a matrix T. Its algebraic


multiplicity is its multiplicity as a root of the characteristic polyno-
mial. Its geometric multiplicity is the dimension of the eigenspace
associated to it. •
10 1 Preliminaries

Proposition 1.1.6 The geometric multiplicity of an eigenvalue does


not exceed its algebraic multiplicity . •

Definition 1.1.20 The set of all eigenvalues of a matrix is called its


spectrum. The maximum of the absolute values of the eigenvalues is
called its spectral radius . •

The sum of the diagonal entries of a square matrix T is called its


trace and is denoted by tr(T). It is easy to see from the characteristic
equation that the trace is the sum of all the eigenvalues taking into
account their multiplicities. Similarly the determinant of a matrix is the
product of all its eigenvalues (again counting mUltiplicity).

Proposition 1.1.7 (i) The eigenvalues of the adjoint of a matrix are


the complex conjugates of the eigenvalues of the original matrix.
(ii) The eigenvalues of a hermitian matrix are all real.
(iii) The eigenvalues of a hermitian and positive definite matrix are pos-
itive.
(iv) The eigenvalues of a unitary matrix lie on the unit circle of the
complex plane. •
The eigenvalues of a hermitian matrix admit a 'variational charac-
terization'. Let T be a hermitian matrix of order n. Its eigenvalues are
all real and let them be numbered in increasing order as follows:

Let Vi, 1 :s: i :s: n be a collection of eigenvectors, where Vi is associated


to Ai. Set Vo = {O} and

for 1 :s: i :s: n.


We define the Rayleigh quotient associated to the matrix T as follows:
x*Tx
RT(X) = --,
x*x
x 1= 0,
which is real valued, since T is hermitian.

Proposition 1.1.8 (i) The eigenvectors Vi can be chosen such that

if i = j,
if i 1= j.
1.2 Topological Spaces 11

(ii) For each 1 :::; i :::; n, we have

Ai RT( Vi)
RT(X)
maxxE\!;
mind Vi-I RT(X)
minwcc n , dim(W)=i maxxEW RT(X).

In particular,

By analogy, if T is a real symmetric matrix, a similar result holds


with the adjoint being replaced by the transpose in the definition of the
Rayleigh quotient and en
being replaced by]Rn in the formulae above.

Remark 1.1.3 A matrix is diagonalizable if, and only if, it admits a


basis of eigenvectors. In particular, if all the eigenvalues of a matrix are
distinct, it is diagonalizable. All normal matrices are diagonalizable. In
particular, hermitian matrices are diagonalizable. •

For more details on linear spaces, the reader is referred to, for in-
stance, Artin [1].

1.2 Topological Spaces

In this section, we recall the important definitions and results of topology


which will be used in the sequel.

Definition 1.2.1 A topology on a set X is a collection T of subsets


of X such that
(i) X and (/) are in T;
(ii) the union of any collection of members in T is a member of T;
(iii) the intersection of any finite collection of members ofT is a member
ofT.
The pair {X, T} is said to be a topological space and the members of
the topology T are called open sets. The complements of open sets are
called closed sets. If x E X, then a neighbourhood of x is any open
set containing x. •

It is clear from the above definition that X and 0 are both open and
closed. Further, a finite union and an arbitrary intersection of closed
12 1 Preliminaries

sets is closed. It then follows that given any set A eX, there is a
smallest closed set containing it. This is called the closure of the set
A and is usually denoted by A. If A = X, we say that A is dense in
X. Similarly, given any set A eX, there is a largest open set contained
in A. This set is called the interior of A and is denoted by A o . A set
A c X is said to be nowhere dense if (A)O =.0.
If {X, T} is a topological space and if Y eX, then Y inherits a
natural topology from that of X. The open sets are those of the form
Un Y, where U is open in X.

Definition 1.2.2 A topological space {X, T} is said to be Hausdorff


if for every pair of distinct elements x and y in X, there exist disjoint
open sets U and V such that x E U and y E V. •

Definition 1.2.3 A metric on a set X is a function d : X x X ----t [0,00)


such that
(i) d(x, y) = 0 if, and only if, x = y;
(ii) for all x and y EX, we have

d(x, y) = d(y, x);

(iii) for all x, y and Z E X, we have

d(x, z) ::; d(x, y) + d(y, z). (1.2.1)

The pair {X, d} is called a metric space . •

Remark 1.2.1 The notion of a metric generalizes that of a distance, as


we know it, in the Euclidean spaces JR2 and JR3. The inequality (1.2.1)
is just an abstract version of a familiar theorem from Euclidean plane
geometry which states that the sum of the lengths of two sides of a tri-
angle is greater than the length of the third side. For this reason, it is
called the triangle inequality. Of all the conditions on a metric, this one
will need the most non-trivial verification . •

If {X, d} is a metric space, then it is easy to see that we have a


topology (called the metric topology) induced on X by the metric d
which is defined as follows. A non-empty set U C X is open if, and only
if, for every x E U, there exists r > 0 such that

B(x;r) ~f {y E X I d(x,y) < r} C U.


1.2 Topological Spaces 13

The set B(x; r) described above is called the (open) ball centred at x and
of radius r. It is a simple exercise to check that open balls themselves
are open sets. It is also immediate to see that this topology is Hausdorff.
On]R or e, we have the 'usual' metric defined by

d(x, y) = Ix - YI·
The topology induced by this metric will be called the 'usual' topology
on ]R or e, as the case may be. Similarly, on ]RN (or eN), we have
the 'usual' Euclidean distance which defines a metric on that space: if
x = (x, ... , XN) and Y = (YI, ... , YN) are vectors in]RN (respectively, eN),
then

The topology induced by this metric will be referred to as the 'usual'


topology on ]RN (respectively, eN).
In the metric topology defined above, we see that every open set is
the union of open balls. In a general topological space a collection B
of open sets is called a base for the topology if every open set can be
expressed as the union of members of B.

Definition 1.2.4 Let {X, T} be a topological space and let S be a col-


lection of open sets in X. We say that S is a subbase for the topology
T if every open set can be expressed as unions of finite intersections of
members of s .•

Clearly any topology containing S will have to contain T. Thus T


is the smallest topology containing S. The set of finite intersections of
members of S form a base for the topology.

Definition 1.2.5 Let {X, T} be a topological space and let A be an


arbitrary subset of X. A point x E X is said to be a limit point of A
if every neighbourhood of x contains a point of A (different from x, in
case x E A) . •

Definition 1.2.6 Let {X, T} be a topological space and let {x n } be a


sequence of elements in X. We say that the sequence converges to a
point x E X if for every neighbourhood U of x, we can find a positive
integer N (depending on U) such that Xk E U for all k ~ N. In this
case, we write Xn --t x in X . •
14 1 Preliminaries

Definition 1.2.7 Let {Xi, 'Ii}, i = 1,2, be two topological spaces and
let f : Xl -+ X 2 be a given function. We say that f is continuous
if f-l(U) is an open set in Xl for every open set U in X 2 • If f is
a bijection such that both f and f- l are continuous, then f is said to
be a homeomorphism and the two topological spaces are said to be
homeomorphic to each other. •

The following propositions are easy to prove.

Proposition 1.2.1 Let {X,d} be a metric space and let {x n } be a se-


quence in X. Then Xn -+ x in X if, and only if, for every E > 0, there
exists a positive integer N such that

d(Xk' x) < E for every k 2: N . •

In particular, every convergent sequence is bounded, i. e., it can be


contained in a (sufficiently large) ball.
Proposition 1.2.2 Let {Xi, dd, i = 1,2, be metric spaces and let f :
Xl -+ X 2 be a given function. The following are equivalent:
(i) f is continuous.
°
(ii) For every x E X and for every E > 0, there exists <5 > such that
whenever d l (x, y) < <5, we have d2 (f(x),j(y)) < E.
(iii) If Xn -+ x in Xl, then f(xn) -+ f(x) in X 2 . •
Remark 1.2.2 When property (ii) (or, equivalently, (iii)) holds for a
particular point x E X, we say that f is continuous at x. Thus, f is
continuous if, and only if, it is continuous at each point of X. If f is
°
continuous and, for a given E > 0, the <5 > described in statement (ii)
above does not depend on the point x, then the function is said to be
uniformly continuous on X . •

Let {X, d} be a metric space and let E be an arbitrary subset of X.


Let x EX. Define
d(x, E) = inf d(x, y).
yEE

This is called the distance of the point x from the set E. The following
proposition is easy to prove.

Proposition 1.2.3 Let {X, d} be a metric space and let E eX. Then
(i) for all x and y E X, we have

Id(x, E) - d(y, E)I < d(x, y).


1.2 Topological Spaces 15

Thus, the function x f-> d(x, E) is a uniformly continuous function on


X;
(ii) if E is a closed set, then, d(x, E) = 0 if, and only if, x E E; more
generally, if E is any subset of X, we have

E = {x E X I d(x, E) = o}.
Definition 1.2.8 Let {X,d} be a metric space. A sequence {x n } in X
is said to be Cauchy if, for every E > 0, there exists a positive integer
N such that

for every k 2: N, l 2: N . •

It is simple to verify that every Cauchy sequence is bounded. It is also


easy to see that every convergent sequence in a metric space is Cauchy.
The converse is not true and this leads to the following important defi-
nition.

Definition 1.2.9 A metric space is said to be complete if every Cauchy


sequence is convergent . •

With their usual metric, the spaces lR and C are complete.


We now introduce an important notion which we will study in detail
in later chapters.

Definition 1.2.10 Let I be an arbitrary indexing set. Let X be a set


and {Xi, 'Iilia be topological spaces. Let li : X -* Xi be given functions.
The weak topology generated by the functions fi, i E I, is the smallest
topology on X such that all the li are continuous. •

From the above definition it follows that a subbase for the weak
topology generated by the li is the collection of all sets of the form
fi-l(U) where U is an arbitrary open set in Xi and the index i ranges
over the indexing set I. A typical neighbourhood of a point x E X will
therefore be a finite intersection of sets of the form f i- 1 (Ui) where Ui is
a neighbourhood of Ji(x) in Xi.

Definition 1.2.11 Let I be an indexing set and let {Xi, 'Ii}, i E I be


topological spaces. Set X = IIiEIXi . Let x = (Xi)iEI. Let Pi : x E X f->
xi E Xi be the i-th coordinate projection. The product topology on
X is the weak topology generated by the coordinate projections, i. e. it is
the smallest topology such that the projections are all continuous. •
16 1 Preliminaries

Thus, sets of the form IIi EI Ui , where Ui = Xi for all i -I- io (an
arbitrary element of I) and Uio is open in X io ' form a subbase for the
product topology. A base for the topology is the collection of all sets
of the form IIiEIUi where Ui = Xi for all but a finite number of indices
and, for those indices, Ui is an open set in Xi.

Definition 1.2.12 Let {X, T} be a topological space and let 0 -I- K c


X. A collection of open sets F is said to be an open cover of K if
the union of the members of F contains K. A sub cover of F is a
subcollection of members of F which is also an open cover of K . •

Definition 1.2.13 Let {X, T} be a topological space and let 0 -I- K c


X. The set K is said to be a compact set if every open cover of K
admits a finite subcover. •

If X is itself a compact set, we say that it is a compact space. We


can also describe compactness via closed sets.

Definition 1.2.14 A collection A of subsets of a set X is said to have


finite intersection property if every finite subcollection has non-
empty intersection. •

The following proposition is easily proved.

Proposition 1.2.4 A non-empty subset K of a topological space is com-


pact if, and only if, every collection of closed sets in K having finite
intersection property has non-empty intersection . •

Definition 1.2.15 Let {X, T} be a topological space and let 0 -I- K c


X. The set K is said to be sequentially compact if every sequence in
K has a convergent subsequence. •

We list important facts about compact sets in the following propo-


sition.

Proposition 1.2.5 (i) Every continuous image of a compact set is com-


pact.
(ii) The product of compact sets is compact.
(iii) Compact subsets of Hausdorff spaces are closed.
(iv) A compact subset of a metric space is closed and bounded.
(v) For the usual topology on ]RN, a subset is compact if, and only if, it
is closed and bounded.
1.2 Topological Spaces 17

(vi) Every continuous real valued function on a compact space is bounded


and attains its maximum and minimum values in that set.
(vii) Every continuous real valued function on a compact metric space is
uniformly continuous . •
Compact metric spaces are very special. In order to characterize
them, we need the following notion.
Definition 1.2.16 A metric space {X, d} is said to be totally bounded
if, for every E > 0, there exists finite set of points {xi 7~ { such that

X C 7~ { B(Xi; E) .•
Proposition 1.2.6 Let {X, d} be a metric space. The following state-
ments are equivalent:
(i) X is compact.
(ii) X is sequentially compact.
(iii) Every infinite subset of X has a limit point.
(iv) X is complete and totally bounded. •

We conclude this section with one final important topological notion.


Definition 1.2.17 Let {X, T} be a topological space. We say that X
is connected if there do not exist non-empty open sets U and V such
that X = U U V and U n V = 0. A subset A c X is said to be
connected if there do not exist disjoint open sets U and V such that
A = An (U U V), An U of 0, A n V of 0.•

Definition 1.2.18 A non-empty subset A of a topological space is said


to be path connected if given any pair of points x and y in A, there
exists a continuous function, : [0,1] --+ A such that ,(0) = x and
,(1) = y.•

Proposition 1.2.7 (i) Every continuous image of a connected set is


connected.
(ii) The product of connected sets is connected.
(iii) Every path connected set is connected. •
In particular, every ball in a metric space is path connected, and
hence, connected.
The only connected sets in lR. are intervals.
For a detailed study of topological spaces, the reader is referred to,
for instance, Simmons [9].
18 1 Preliminaries

1.3 Measure and Integration

In this section, we will recall basic facts and results of Lebesgue's theory
of measure and integration.

Definition 1.3.1 Let X be a set. A a-algebra is a collection S of


subsets of X such that
(i) XES;
(ii) if A E S, then AC E S, where AC denotes the complement of A in X;
(iii) if Ai E S for i E N, then

The pair (X, S) is then called a measurable space. The members of


S are called measurable sets . •

In other words, a a-algebra on a set X is collection of subsets of


X containing X and which is closed under the set theoretic operations
of complementation and countable unions.

Definition 1.3.2 Let (X, S) be a measurable space. Let f : X - t JR be


a given function. It is said to be a measurable function if, for every
a E lR, we have

f-1((a,00)) = {x E X I f(x) > a} E S.

A complex valued function is said to be measurable if its real and imag-


inary parts are measurable. •

Proposition 1.3.1 Let (X, S) be a measurable space and let f : X -t JR


be a given function. Then, the following are equivalent:
(i) f-1((a,00)) E S, for all a E R
(ii) f-1([a,00)) E S, for all a E JR.
(iii) f-1((-00,a)) E S, for all a E R
(iv) f-1(( -00, a]) E S, for all a E JR . •

Given any collection of subsets of a set X, there is a smallest a-


algebra containing the collection and it is called the a-algebra generated
by the given collection of sets. If (X, T) is a topological space, the the
a-algebra generated by the open sets is called the Borel a-algebra and
its members are called Borel sets.
1. 3 Measure and Integration 19

Definition 1.3.3 Let (X, S) be a measurable space. A measure on X


is a function J-l : S -----+ [0,00] such that, if Ai, i E N are members of S
which are pairwise disjoint, i. e. Ai n Aj = 0 if i -I j, then
00

p(U~l i) = :LJ-l(Ai ).
i=l
The triple (X, S, J-l) is called a measure space. •

Definition 1.3.4 Let (X,S,J-l) be a measure space. If p(X) < 00, then
J-l is said to be a finite measure. If X can be covered by a countable
union of measurable sets, each with finite measure, then J-l is said to be
a a-finite measure . •

Example 1.3.1 Let X be any non-empty set and let S be the collection
of all subsets of X, which is obviously a a-algebra. If A c X, define
J-l(A) to be the number of elements in A, if A is a finite set and to be 00,
otherwise. This defines a measure and is called the counting measure
onX . •

Example 1.3.2 Let (X, S) be as in the preceding example and let Xo E


X be a given point. let A c X. Define

if Xo E A
J-l(A) = { 1,
0, otherwise.
This defines a measure on X which is called the Dirac measure con-
centrated at the point Xo. •
The most important example of a measure is the Lebesgue mea-
sure defined on lRN.
Consider the real line lR endowed with its usual topology. Then, it is
possible to construct a a-algebra M on lR which contains all the Borel
sets and to define a measure p such that for any interval of the form
[a, b) (or [a, b], (a, b], (a, b)), where a, bE lR, we have

J-l([a, b)) = b-a.

This measure also has the following additional properties:


(i) (Completeness) If E E M and J-l(E) = 0, then, for any FeE, we
have F E M and, a fortiori, J-l(F) = 0.
(ii) (Translation Invariance) If E E M and if a E lR, then

a + E = {a + x Ix E E} E M
20 1 Preliminaries

and, further, J-l(a + E) = J-l(E).


(iii) (Regularity) If E E M and if E > 0, then there exists an open set
V and a compact set K such that K c E c V and J-l(V\K) < E. In
fact,
J-l(E) = inf{J-l(W) lEe W, W open}
= sup{J-l(F) IF c E,F compact}.
Properties (i)-(iii) above determine the measure J-l upto a multiplica-
tive constant. The fact that the measure of an interval is its length
determines the measure uniquely and it is called the Lebesgue mea-
sure on lR and the members of M are called Lebesgue measurable
sets.
In the same way, it is possible to construct the Lebesgue measure on
lR N , N ~ 1, with the properties (i)-(iii) and such that for any cell of the
form E = II~ i bi ), we have

Definition 1.3.5 Let (X, S, J-l) be a measure space. Let E E S. The


characteristic function of E, denoted XE is defined by

XE
( ) _{I,
X - 0,
if x E E,
if x (j. E.

A simple function is a function S : X ---+ lR such that


m

s(x) = I>l:iXAi(X),
i=l

where ai E lR and Ai E S for 1 ::; i ::; m . •

We are now in a position to define the Lebesgue integral. Let


(X, S, J-l) be a measure space. The integral of a non-negative simple
functon s = 2::1 aiXi is defined by

(Since the Ai may have infinite measure, we insist on the non-negativity


of the function (so that ai ~ 0 for all 1 ::; i ::; m) in order to avoid
anomalous situations where infinite quantities may need to be subtracted
from each other.)
1. 3 Measure and Integration 21

Proposition 1.3.2 Let (X, S) be a measurable space and let f : X -+ lR


be a non-negative measurable function. Then there exists a sequence
{sn} of non-negative simple functions suc/, that, for all n,

and
lim sn(x) f(x)
n-->oo

for all x E X . •

In view of the above proposition, we may now define the integral of


any non-negative measurable function as follows:

L f d/1 ~ sup {L s d/1 Is simple, 0:::; s :::; f } .

In the case of a simple function, it is easy to see that both these


definitions coincide.
Now, let f : X -+ lR be any measurable function. Set

f+ = max{f,O}, and f- = -min{f,O}.


Then, f+ and f- are non-negative measurable functions and

Definition 1.3.6 Let (X, S, /1) be a measure space and let f : X -+ lR


be a measurable function. Then f is said to be integrable if

LIf I d/1 < 00 . •

It is easy to see that

so that, if f is integrable, then both the integrals on the right-hand side


of the above relation are finite. Then, we can unambiguously define the
integral of f as follows:
22 1 Preliminaries

Integration is a linear operation: if f and 9 are integrable functions and


if 0: and f3 E JR, we have

Notation: If we wish to specify the variable of integration as well, we


will write

in place of Jx f dJL.

Example 1.3.3 If we define the counting measure on the set of natural


numbers N, then any real valued function f on N is measurable and can
be identified with a sequence {an}, where f (n) = an. It can be seen
that
1N
f dJL = fan.
n=l

An integrable function corresponds to an absolutely convergent series. •

Example 1.3.4 If X is any non-empty set and if <5xo is the Dirac measure
concentrated at Xo EX, then

f(xo)

for any function f :X ---+ R •

We now list some of the main results from the theory of the Lebesgue
integral, which behaves very nicely with respect to limit processes.
Theorem 1.3.1 (Monotone Convergence Theorem) Let (X, S, JL)
be a measure space. let {fn} be an increasing sequence of non-negative
measurable functions on X. Let f n (x) ---+ f (x) for all x EX. Then f is
measurable and
lim { fn dJL ---+ { f dJL . • (1.3.1)
n->oojx jx
Theorem 1.3.2 (Fatou's Lemma) Let (X, S, JL) be a measure space.
Let {fn} be a sequence of non-negative measurable functions on X. Then

{ liminf fn dJL ::::; liminf { f~ dJL . •


}x n->oo n->oo } X
1.3 Measure and Integration 23

Theorem 1.3.3 (Dominated Convergence Theorem) Let (X, S, fJ-)


be a measure space and let Un} be a sequence of measurable functions
such that fn(x) -+ f(x) for all x E X. Assume that Ifni:::; g for all n,
where g is integrable. Then, f is also integrable and

lim
n----+oo ixr Ifn - fl dfJ- = O.

In particular, {1.3.1} holds . •

A property is said to hold almost everywhere (abbreviated as a.e.) in


a measure space if it holds on the complement of a set of measure zero.
The above results are valid even if we replace convergence for all x E X
by convergence a .e. for the given sequences of functions.
Let (X, S, fJ-) and (Y, T,),) be two measure spaces. Consider the
product set X x Y. A measurable rectangle is a subset of the product
set of the form A x B, where A E Sand BET. An elementary set is
a finite disjoint union of measurable rectangles. The product O"-algebra
S x T on X x Y is the smallest O"-algebra generated by elementary sets.

Definition 1.3.1 Let Q c X x Y. Then the x-section of Q is the


subset of Y defined by

Qx = {y E Y I (x, y) E Q}.

Similarly, the y-section of Q is a subset of X and is given by

QY = {x E X I (x, y) E Q}.

If f : X x Y -+ 1R is a function, then its x- and y-sections are functions


fx : Y -+ 1R and fY : X -+ 1R respectively, given by

fx(y) = f(x, y) = fY(x)

for all x E X and y E Y. •

Proposition 1.3.3 (i) IfQ E S x T, then, for every x E X and y E Y,


we have Qx E T and QY E S.
(ii) If f : X x Y -+ Y is a S x T -measurable function, then, for every
x E X and y E Y, we have that f x is T -measurable on Y and fY is
S -measurable on X •.
24 1 Preliminaries

Theorem 1.3.4 Let (X, S, 1-") and (Y, T, A) be a-finite measure spaces.
Let Q E S x T. For x E X and y E Y, define

'P(x) = A(Qx), and 'IjJ(y) = I-"(QY).


Then 'P is S -measurable, 'IjJ is T -measurable and, further,

i 'P d/l' = i '1/) dA. • (1.3.2)

We can now use the above theorem to define the measure on the
product of two a-finite measure spaces.

Definition 1.3.8 Let (X, S, 1-") and (Y, T, A) be two a -finite measure
spaces. The product measure I-" x A on X x Y is defined by

(I-" x A)(Q) = i A(Qx) dl-"(x) = i I-"(QY) dA(Y)

for every Q E S x T. •

The following result helps us to evaluate integrals over the product


space via iterated integrals and is very useful.

Theorem 1.3.5 (Fubini's Theorem) Let (X, S, 1-") and (Y, T, A) be


a-finite measure spaces. Let f : X x Y ~ lR be S x T -measurable.
(i) Let f be non-negative. Set

( 1.3.3)

for x E X and y E Y. Then 'P is S -measurable, 'IjJ is T -measurable and

i 'P dl-" = i 'IjJ dA. (1.3.4)

(ii) If f is an arbitrary measurable function such that

verifies
i'P dl-" < 00,

then f is integrable with respect to I-" x A.


(iii) Let f be integrable with respect to I-" x A. Then fx is integrable
1.3 Measure and Integration 25

with respect to ,X for almost every x EX; similarly fY is integrable with


respect to JL for almost every y E Y; the functions 'P and 'lj; defined by
{1.3.3} are integrable with respect to J-L and ,X respectively and {1.3.4}
holds.

Remark 1.3.1 The 17-finiteness of the measures is essential. •

Remark 1.3.2 The relation (1.3.4) may be written in the form of iter-
ated integrals as follows:

and the common value is equal to IxxY f d(J-L x ,X) whenever Fubini's
theorem is applicable. •

Remark 1.3.3 In (ii), f is integrable if the analogous result for the


y-section of If I holds. Thus, if either of the iterated integrals for If I is
finite, then f is integrable over the product space and (1.3.4) will hold . •

For further details, the reader is referred to Halmos [2], Royden [5]
or Rudin [7J.
Chapter 2

Normed Linear Spaces

2.1 The Norm Topology

In order to do analysis on vector spaces, we need to endow these spaces


with a topological structure which is compatible with the linear struc-
ture. This is made precise in the following definition.

Definition 2.1.1 A topological vector space is a vector space V


that is endowed with a Hausdorff topology such that the maps

(x, Y) E V x V t--+ X +y EV and (a, x) E IF x V t--+ ax E V

are continuous, each product space being endowed with the appropriate
product topology using the given topology of V and the usual topology on
the scalar field IF (= lR or C) . •

We will restrict our attention to a class of topological vector spaces


called normed linear spaces, which we now proceed to define.

Definition 2.1.2 A norm on a vector space V is a function 11.11 : V -t

[0,(0) such that


(i) Ilxll = 0 if, and only if, x = 0;
(ii) Ilaxll = lalllxli for every a E IF and every x E V;
(iii) (Triangle Inequality) for every x and y E V, we have

Ilx + yll ~ Ilxll + Ilyll· • (2.1.1)

Associated to a norm on a vector space V, we have a metric defined


by
d(x, y) = Ilx - YII·
2.1 The Norm Topology 27

It is immediate to verify that this defines a metric. The triangle inequal-


ity (2.1.1) yields the inequality (1.2.1) (which is also called by the same
name) via the relation

x- z = (x - y) + (y - z).
(Thus, the norm of a vector is its distance from the origin and is a
generalization of the notion of the length of a vector, as we know it, in
Euclidean space.)
Thus, V is endowed with a metric topology. In this topology, a
sequence {x n } converges to x in V if, and only if

Now if {x n } and {Yn} are sequences in V and {an} a sequence in IF,


we have, for x, Y E V and a E IF,

II(xn + Yn) - (x + y)11 ::; IIxn - xII + llYn - YII,


lIanxn - axil ::; lanlllxn - xII + Ian - alllxll·
Thus, if Xn ----t x, Yn ----t Y in V and if an ----t a in IF, it immediately
follows that Xn + Yn ----t X + y and that anxn ----t ax in V. Thus addition
and scalar multiplication are continuous and so V becomes a topological
vector space with this metric topology.
Definition 2.1.3 A normed linear space is a vector space V endowed
with a norm. The metric topology induced by the norm is called its norm
topology . •
The norm itself is a continuous function with respect to this topology.
Indeed, if x and y E V, then, since x = (x-y) +y, the triangle inequality
yields
IIxll ::; IIx - YII + lIylI
which we rewrite as
IIxll - lIyll ::; IIx - YII·
Interchanging the roles of x and Y we finally obtain

I IIxll - lIylI I ::; IIx - YII


from which the continuity of the functiop x E V f--t IIxll E lR follows.

Definition 2.1.4 A normed linear space is said to be a Banach space


if it is complete under the norm topology . •
28 2 N armed Linear Spaces

2.2 Examples

We will now look at several examples of normed linear spaces. Essen-


tially, they can be classified into three groups - finite dimensional spaces,
sequence spaces and function spaces. In the examples that follow, we
will set F = R The reader can easily make the necessary changes to
cover the case when lR is replaced by Co

Example 2.2.1 We can consider lR as a vector space over itself. The


map x E lR 1--+ Ixl is easily seen to define a norm which generates the
usual topology on R Since lR is complete, it thus becomes a Banach
space. •

Example 2.2.2 Let 1:::; p < 00. For x = (Xl,X2, ... ,XN) E lR N , define

It is easy to see that conditions (i)-(ii) of Definition 2.1.2 are verified.


We will presently prove the triangle inequality and thus lRN with the
norm 11.ll p will become a normed linear space. It is, again, immediate to
see that a sequence {x(n)} in lRN converges in this norm to x E lRN if,
and only if, for every 1 :S i :S N, we have x~n) -+ Xi. Similarly, {x(n)} is
Cauchy in this norm if, and only if, for every 1 :::; i :::; N, the sequences
{x~n) are Cauchy in R Since lR is complete, it now follows that lRN
is also complete with respect to each of the norms 11.llp defined above.
Thus for each of these norms, lRN is a Banach space . •

We now proceed to prove the triangle inequality for each of the norms
11.llp for1 :::; p < 00.
Definition 2.2.1 Let 1 :::; p :::; 00. If p = 1, set p* = 00 and vice-versa.
Otherwise, let 1 < p* < 00 be such that
1 1
- +- = 1 (2.2.1)
p p*
The number p* defined thus is called the conjugate exponent of p . •
Lemma 2.2.1 Let 1 < p < 00. Let p* be its conjugate exponent. Then,
if a and b are non-negative real numbers, we have

(2.2.2)
2.2 Examples 29

Proof: Let t 2:: 1 and consider the function

f (t) = k (t - 1) - t k +1
for some k E (0,1). Then f'(t) = k(l - t k - 1 ) 2:: 0 since k < 1. Thus, f
is an increasing function on [1, (0) and, since f(l) = 0, we immediately
deduce that
t k :s; k( t - 1) + 1 (2.2.3)
for t 2:: 1 and 0 < k < 1.
Now, if a or b is zero, then (2.2.2) is obviously true. So let us assume
that a 2:: b > O.
The inequality (2.2.2) now follows by setting t = alb and k = lip in
(2.2.3) and using the definition of p* . •

Lemma 2.2.2 (HOlder's inequality) Let 1 < p < 00. Let p* be its
conjugate exponent. Then, for x, Y E ]RN,

N
L IXiYil :s; IlxllpIIYllp·· (2.2.4)
i=l

Proof: Since the result is trivially true for x = 0 or Y = 0, we can as-


sume, without loss of generality, that both x and yare non-zero vectors.
Then, set

for a fixed 1 :s; i :s; N. Then (2.2.2) yields


IXiYi I 1 IXi IP
< - - - +1-IYi
-IP·
-
IlxllpllYllp. - p Ilxll~ p* II II~:

Summing over the range of the index i, we get

which proves (2.2.4) . •

Lemma 2.2.3 (Minkowski's Inequality) Let 1 :s; p < 00. Let x, Y E


]RN. Then
(2.2.5)
30 2 Normed Linear Spaces

Proof: The proof is obvious if p = 1. Let us, therefore, assume that


1 < p < 00. Let p* be the conjugate exponent. Then,

2:!llxi + Yil P < 2:!llxi + YilP-Ilxil + 2:!llxi + YiIP-IIYil1

< (1lxllp + IIYllp) (2:!1 IXi + Yil(P-I)P·) p.


by a simple application of Holder's inequality (2.2.4). But (p - l)p* = P
by definition and so,

Since the result is obviously true when x + Y = 0, we can assume, with-


out loss of generality, that x + y#-O and so, dividing both sides of the
above inequality by Ilx + yIIP/p· and using, once again, the definition of
p*, we get (2.2.5) . •

Since Minkowski's inequality is exactly the triangle inequality for the


norm 11.llp, our proof that ]RN is a Banach space for each of these norms
is complete.

Remark 2.2.1 The inequalities of Holder and Minkowski are clearly


true when x and Y E eN and so eN is also a Banach space for each of
the norms 11.llp, 1::; p < 00 . •

Remark 2.2.2 When p = 2, we have p* = 2 as well. In this case Holder's


inequality is known as the Cauchy-Schwarz inequality. The inequal-
ity (2.2.2), in this case, turns out to be the familiar inequality relating
the arithmetic and geometric means of two positive real numbers. The
norm 11.112 is also called the Euclidean norm since it corrsponds to the
usual Euclidean distance in ]RN . •

Example 2.2.3 For x = (Xl, ... , x N) E ]RN, define

It is easy to verify that this also defines a norm on ]RN. Again con-
vergence and the Cauchy criterion hold if and only if they hold compo-
nentwise and so ]RN is a Banach space for this norm as well. Again all
these assertions hold for eN as well. It is immediate to see that Holder's
inequality is true when p = 1 as well. •
2.2 Examples 31

Remark 2.2.3 The spaces ffi.N (or C N when the base field is C) with the
norm 11.llp, where 1 ~ p ~ 00 are usually denoted by C!; in the literature .

Remark 2.2.4 The notation 11.1100 for the norm defined in Example 2.2.3
can be 'justified' as follows. Let x E ffi.N. Assume that the maximum
for IXil is attained for a single index, say, io. Then,

Thus, since IXil/lxiol < 1 for i =I- io, we get that

when p -+ 00 . •

Example 2.2.4 We now consider sets of real (or complex) sequences

Let 1 ~ p < 00. We define the space

We define vector addition and scalar multiplication (over the correspond-


ing field) componentwise, i.e. if x = (Xi) and y = (yd are sequences in
Cp and if a is a scalar, we set

We also define

We will prove the triangle inequality for 1I.llp (which will also simulta-
neously show that Cp is closed under vector addition and hence that it
is a vector space). Since properties (i) and (ii) of Definition 2.1.2 are
obvious, it will follow that II_lip defines a norm on Cp .
32 2 N ormed Linear Spaces

r
Let x and Y E Pp . Then, for any positive integer N, we have

2:i':1 IXi + Yil P < [(2:i':1 IXiI P ) ~ + (2:i':1 IYiIP) ~


< [llxllp + IIYllpjP
using Minkowski's inequality (2.2.5) for the integer N. Thus, since N
was arbitrary, we deduce that

L IXi + Yil
00

P :S [llxllp + IIYllpjP < 00


i=l

which shows that x + Y E Pp and also proves the triangle inequality for
1I·llp· •
Remark 2.2.5 The triangle inequality

for x and Y E Pp is again referred to as Minkowski's inequality. We can


also prove Holder's inequality: if x E Pp and Y E Pp' where p* is the
conjugate exponent, then
00

L IXiYil :::; IIxllpIlYllp···


i=l

Proposition 2.2.1 Let 1 :S p < 00. Then Pp is a Banach space.


Proof: We just need to prove the completeness of the space. Let {x(n)}
be a Cauchy sequence in Pp , i.e. given c > 0, there exists N such that

L Ix~m) -
00

x~l) IP < c (2.2.6)


i=l

for all m ~ N, I ~ N. Thus, it is clear that for each fixed subscript i,


the sequence {x~n) is Cauchy in lR (or C, as the case may be). Thus,
there exists Xi such that x~n) --t Xi for each i. Set

We will first show that x E Pp. Since {x(n)} is a Cauchy sequence, it


is bounded. Thus, there exists a C > 0 such that

IIx(n) II~ :S C, for all n.


2.2 Examples 33

Let k be any fixed positive integer. Then,


k
L Ix;n)I P < C
i=l

which implies that


k
L IXil P < c.
i=l

Since k is arbitrary, this shows that

L IXi IP
00

< C < 00.


i=l

This shows that x E f p .


Now, for any positive integer k and all m, l > N, it follows from
(2.2.6) that
k
L Ix~m) - x~l) IP < E.
i=l

Passing to the limit as l ----+ 00, we get that for any m ~ N and for any
k,
k
L Ix~m) - xil P :S E.
i=l

Since k is arbitrary, we deduce that for m ~ N,

i.e. x(n) ----+ x in f p • This completes the proof. •

Example 2.2.5 Set

i.e. the space of all bounded real (or complex) sequences. This is clearly
a vector space under componentwise addition and scalar multiplication.
Define
Ilxll oo = sup IXil.
l::;i<oo
34 2 JVorr,ned Linear Spaces

This makes Roo a Banach space (check!) . •

Remark 2.2.6 Once again, Holder's inequality holds for p = 1 as well.



Remark 2.2.7 For those readers who are acquainted with measure the-
ory, the spaces R: and Rp , for 1 ::; p ::; 00 are particular cases of the
Lebesgue spaces LP(p,) where {X,S,p,} is a measure space. In the case
of R:, we have X = {I, 2, ... , N} and in the case of Rp we have X = N,
the set of all natural numbers. In either case, the (j-algebra is the col-
lection of all subsets of X and the measure p, is the counting measure.
We will study LP spaces in detail in Chapter 6. •

Our final example is that of a function space.

Example 2.2.6 Let e[O, 1] denote the set of all continuous real valued
functions on the closed interval [0, 1]. This becomes a vector space under
the operations of addition and scalar multiplications defined pointwise,
i.e.
(f + g)(x) = f(x) + g(x) and (af)(x) = af(x)
for f and 9 E e[O, 1], a E ffi. and for x E [0,1]. Define

Ilfli = sup If(x)1


XE[O,l]
(= max If(x)l) .
XE[O,l]

This is well defined since [0,1] is compact and so every continuous func-
tion is bounded and attains its maximum. The verification that this
defines a norm on e[O, 1] is routine and is left to the reader.
Let Un} be a Cauchy sequence in e[O, 1]. This implies that for every
c > 0, there exists a positive integer N such that, for all x E [0,1] and
for all n ~ Nand m ~ N, we have

(2.2.7)

Thus the pointwise sequences Un(x)} are all Cauchy and hence conver-
gent. Define
f(x) = lim fn(x).
n->oo

'Ne will show that the function f thus defined is in e[o, 1] and that
Ilfn - fll ~ 0. This will show that C[0,1], with the given norm, is a
Banach space.
2.2 Examples 35

Let E and IV be as above. Then, keeping n 2: IV fixed and passing


to the limit as m ----+ 00 in (2.2.7), we get

Ifn(x) - f(x)1 ::::; E (2.2.8)

°
for all n 2: IV and for all x E [0,1]. Fix a point Xo E [0,1]. Since fN is
continuous, there exists 8 > such that for all Ixo - yl < 8, we have

Thus, if Ixo - yl < 8, we get, using the above inequality and also the
inequality (2.2.8),

This proves that f is continuous and from (2.2.8), we see that Ilfn - fll ----+
0. •

Remark 2.2.8 The convergence described in the preceding example is


what is known in the literature as uniform convergence. The norm is
often referred to as the 'sup-norm' . •

We conclude this section by showing a standard method of producing


new normed linear spaces from existing ones.Let V be a normed linear
space and let W be a closed subspace of V, i. e. W is a linear subspace
of V and is closed under the norm topology. We define an equivalence
relation on V by
x""'y {::;>x-yEW.
The equivalence class containing a vector x E V is called a coset and is
denoted as x + W. It consists of all elements of the form x + w where
w E W. The set of all cosets is called the quotient space and is denoted
V/W. Addition and scalar multiplication on V/W are defined by

(x+W)+(y+W) = (x+y)+W and a(x+W) = ax+W.

If x ""' x' and y ""' y', then, clearly, x + y ""' x' + y' and ax ""' ax', since
W is a linear subspace of V. Thus, addition and scalar multiplication
are well defined. Thus the quotient space becomes a vector space. On
this, we define
Ilx + Wllvjw = inf Ilx + wll·
wEW

In other words, the 'norm' defined above is the infimum of the norms of
all the elements in the coset and so, clearly, it is well defined.
36 2 N ormed Linear Spaces

Proposition 2.2.2 Let V be a normed linear space and let W be a


closed subspace. Then, 11.llvjw defined above is a norm on the quotient
space V /W. Further, if V is a Banach space, so is V /W.
Proof: Clearly Ilx+Wllvjw 2 ofor all x E V. Ifx+W = O+Win V/W,
we have x E W; then -x E Wand so 0:::; Ilx+Wllvjw :::; Ilx+(-x)11 = 0
and so Ilx + Wllvjw = O. Conversely, if Ilx + Wllvjw = 0, then, by
definition, there exists a sequence {w n } in W such that Ilx + Wn II -+ O.
This means that Wn -+ -x in V and, since W is closed, it follows that
-x E Wand so x E W as well. This means that x rv 0, i.e. x + W is
the zero element of V /W.
If a =1= 0, then ax + w = a(x + w') where w' = a-1w E W. From
this it is easy to see that Ilax + Wllvjw = lallix + Wllvjw. The case
a = 0 is obvious.
Finally, we prove the triangle inequality.
Ilx + Y + Wllvjw inf{llx + Y + wll I wE W}
= inf {II x + Y + w + w'll I w, w' E W}
< inf{llx + wll + Ily + w'll I w, w' E W}
inf{llx + wll I wE W} + inf{lly + w'll I w' E W}
Ilx + WllVjW + Ily + Wllvjw·
Thus, V /W is a normed linear space. Now assume that V is com-
plete. Let {xn + W} be a Cauchy sequence in V/W. Then, we can find
a subsequence such that
1
II(x nk + W) - (X nk + 1 + W)IIVjW < 2k · (Why?)

Now choose Yk E x nk + W such that IIYk - Yk+lll < 1/2k. Then the
sequence {yd is Cauchy (why?) and so, since V is complete, Yk -+ Y in
V. Thus
II (xnk + W) + W)llvjw
- (y :::; IIYk - yll -+ O.
Thus, the Cauchy sequence {x n + W}
has a convergent subsequence
{xnk + W} and so the Cauchy sequence itself must be convergent and
converge to the same limit (why?). Hence V/W is complete. •

2.3 Continuous Linear Transformations

An important aspect of functional analysis is to study mappings be-


tween normed linear spaces which 'respect' the linear and topological
structures. We make this notion precise in the following definition.
2.3 Continuous Linear Transformations 37

Definition 2.3.1 Let V and W be normed linear spaces. A linear trans-


formation T : V ---t W is said to be a continuous linear transforma-
tion or, a continuous linear operator, if it is continuous as a map
between the topological spaces V and W (endowed with their norm topolo-
gies). If W is the base field, then a continuous linear transformation is
called a continuous linear functional. •

Definition 2.3.2 A subset of a normed linear space is bounded if it


can be contained in a ball. •

The following proposition gives an important characterization of con-


tinuous linear transformations.

Proposition 2.3.1 Let V and W be normed linear spaces and let T :


V ---t W be a linear transformation. The following are equivalent:
(i) T is continuous.
(ii) T is continuous at O.
(iii) There exists a constant K > 0 such that, for all x E V,

IIT(x)llw :S Kllxllv (2.3.1)

where 11.11 V and 11·11 w denote the respective norms in the spaces V and
W.
(iv) If B = {x E V I IIxliv :S I} is the (closed) unit ball in V, then T(B)
is a bounded set in W.

Proof: (i) ¢} (ii) If T is continuous, then, clearly, it is continuous at


o E V. Conversely, let T be continuous at 0 E V. Let x E V be arbi-
trary and let Xn ---t x in V. Then Xn - X ---t 0 in V and so, T (x n - x) ---t 0
in W, i.e. T(x n ) ---t T(x) in W. Thus, T is continuous.

(ii) ¢} (iii) If T is continuous at 0 E V, there exists a <5 > 0 such that


IIxliv < <5 implies that IIT(x)lIw < 1. For any x E X, set y = 211~lIv x so
that IIYllv = <5/2 < <5 and so IIT(y)lIw < 1. By linearity, it follows that

2
IIT(x)lIv :S Jllxllv

which proves (2.3.1) with K = 2/<5. Conversely, if (2.3.1) is true, then


whenever Xn ---t 0 in V, it follows that T(x n ) ---t 0 in W, i.e. T is con-
tinuous at O.
38 2 N ormed Linear Spaces

(iii) {:} (iv) By virtue of (2.3.1), it follows that IIT(x)llw ~ K for all
x E B Thus, T(B) is bounded in W. Conversely, if T(B) is bounded in
W, there exists a K > 0 such that IIT(x)llw ~ K for all x E B. Now, if
0-# x E V is arbitrary, set y = x/llxli. Then IIT(y)llw ~ K from which
(2.3.1) follows, by linearity. •

Remark 2.3.1 Continuous linear transformations are also known as


bounded linear transformations since they map bounded sets into
bounded sets. •

The above proposition inspires the following definition.

Definition 2.3.3 Let V and W be normed linear spaces and let T :


V --+ W be a continuous linear transformation. Let B be the closed unit
ball in V. The norm of T, denoted IITII, is given by

IITII = sup IIT(x)llw . • (2.3.2)


xEB

The following proposition gives alternative characterizations of the


norm of a continuous linear transformation.

Proposition 2.3.2 Let V and W be normed linear spaces and let T :


V --+ W be a continuous linear transformation. Then

IITII sup{IIT(x)llw Illxllv = I}


sup{IIT(x)llwlllxllv 10 -# x E V}
inf{K > 0 I IIT(x)llw ~ Kllxllv for all x E V}.

Proof: Let us set

a sup{IIT(x)llw I Ilxllv = I},


(3 sup{IIT(x)llw Illxllv I 0 -# x E V} and
, inf{K > 0 I IIT(x)llw ~ Kllxllv for all x E V}.

Clearly, a ~ (3. If x is a non-zero vector in V, then x/llxliv has unit


norm and IIT(x)llwlllxllv' = IIT(x/llxllv)llw. This shows that we also
have (3 ~ a. If K > 0 is any number in the set defining" then it follows
immediately that (3 ~ K and so, a fortiori, we have (3 :S f. Now, we
also have that IIT(x)llw ~ (3l1xllv for all x E V and so, by definition,
, :S (3. Thus, we have
a = (3 = ,.
2.3 Continuous Linear Transformations 39

Clearly IITII ~ 0: by definition. If K is in the set defining " then, for


all x E V such that Ilxllv :::; 1, we have IIT(x)llw :::; K andso IITII :::; K.
Thus, we get that IITII :'S , = 0:. Thus we get that

IITII = 0: = f3 = , .•

Corollary 2.3.1 If V and Ware normed linear spaces and if T : V ~


W is a continuous linear transformation, then

IIT(x)llw :::; IITllllxllv (2.3.3)

for all x E V. •

Let V and W be normed linear spaces. Let us denote by £(V, W),


the set of all continuous linear maps from V into W. If Tl and T2 are
such maps, let us define Tl + T2 by

for all x E V. Clearly, Tl + T2 is also a linear transformation. Now,

by virtue of the triangle inequality and the above corollary. Thus, it


follows that Tl + T2 is also a continuous linear transformation and that

Similarly, if T is a continuous linear transformation and if 0: is a s l ~,


we define
(o:T) (x) = o:T(x)
for all x E V. It is then easy to see that o:T is also continuous and that

II00TII = 100111TII·
The zero element of £(V, W) is the trivial map which maps every element
of V into the null vector of W. The element - T is defined by ( - T) (x) =
-T(x). Thus, £(V, W) is a vector space; in fact, it is a normed linear
space for the norm of a continuous linear transformation defined above.

Proposition 2.3.3 Let V and W be normed linear spaces. If W is


complete, then £(V, W) is also complete.
40 2 N ormed Linear Spaces

Proof: Let {Tn} be a Cauchy sequence in .c(V, W). Then, given c > 0,
we can find a positive integer N such that, for all m and n ;::: N , we
have

Let x E V. Then,

and so it follows that the sequence {Tn(x)} is Cauchy in W. Since W is


complete, this sequence is convergent. Let us define

T(x) = lim Tn(x).


n-+oo

Clearly, the map x t-t T(x) is linear. We will show that it is continuous
and that II Tn - Til ~ O. This will complete the proof.
Since the sequence {Tn} is Cauchy, it is bounded, i.e. there exists
M > 0 such that, for all positive integers n, we have IITnl1 ::; M. Now,
since for any x E V, we have IITn(x)lIw ::; IITnllllxliv ::; Mllxllv, it
follows, on passing to the limit as n ~ 00, that, for all x E V,

IIT(x)llw ::; Mllxllv.


Thus, T is continuous and so T E C(V, W).
Let c > 0 and let N be as defined earlier by the Cauchy property of
the given sequence. Let B be the closed unit ball in V. For all x E B,
we have

Keeping n fixed and letting m tend to infinity, we get

IITn(x) - T(x)llw ::; c

for all x E B. This shows that, for n ;::: N, we have IITn - Til::; c, i.e.
Tn ~ T in .c(V, W). This completes the proof. •

In particular, since the scalar field is a Banach space over itself (cf.
Example 2.2.1), the set of all continuous linear functionals .c(V,JR) (or,
.c(V, C), as the case may be) is always a Banach space.
Definition 2.3.4 Let V be a normed linear space. The space of all
continuous linear functionals on V is a Banach space and is called the
dual space of V. It is denoted by V* . •
2.3 Continuous Linear Transformations 41

Another particular case is when W = V. In this case we write


.c(V) for the space of all continuous linear operators instead of .c(V, V).
This space is Banach if V is Banach. On this space, we have a third
operation (after addition and scalar mUltiplication) namely, composition
of operators: if Tl and T2 are continuous linear operators, we define Tl T2
by

Now, for any x E V, we have

Thus, Tl T2 is also a continuous linear operator and, further,

(2.3.4)

Further, multiplication is a continuous operation. Indeed, if Tn --t T


and ~ --t T' in .c(V), we have

II n ~ - TT'" ~ " nllll ~ - T'II + liT' II II Tn - Til.

Since IITnil is bounded independent of n, it follows that n ~ --t TT'.


Finally, if I is the identity mapping i.e. I(x) = x for all x E V, we have

11111 = 1.

Definition 2.3.5 A Banach space V on which we have a multiplication


operation (x, y) E V x V f---? xy E V such that addition and multiplication
make it a ring and such that

IIxYil ~ IIxlillyil and 11111 = 1

where 1 is the multiplicative identity in V, is called a Banach algebra .



Thus, .c(V), where V is a Banach space, is a Banach algebra.

Let us now study various examples of continuous linear transforma-


tions.

Example 2.3.1 Any linear transformation T : ]RN --t ]R.M is given by an


M x N matrix. Assume that ]R.N is the space ff. Then, for any norm
on ]R.M, any linear transformation is continuous. Let {el' ... , eN} be the
42 2 lVorr,ned Linear Spaces

standard basis of]RN (cf. Example 1.1.2). If x = (Xl, ... , XN) E ]RN,
then X = I:[:l Xiei· Then T(x) = I:[:l xiT(ei). Thus,

N
IIT(x)IIIRM < KL IXil Kllxlh
i=l

where

Example 2.3.2 Let al, ... ,aN be scalars. For X = (XI, ... ,XN) E ]RN,
define
N
f(x) = L aixi·
i=l

Then f is a linear functional on ]RN which is continuous if]RN = tf .•


We will see later that these transformations and functionals are con-
tinuous for any norm defined on ]RN.

Example 2.3.3 Let x = (Xi) E C2. Define

( Xl X2 Xi )
T(x) = 1'2'''''i'''' .
Then, since

we have that T is a continuous linear operator on C2 and that IITII :S 1.


The map T is not onto. In fact, the range of T consists of all square
summable sequences (Yi) such that
00

Li 2 1Yil 2 < 00 . •
i=l

Example 2.3.4 Let 1 :S p :S 00 and let p* be its conjugate exponent.


For X E Cp and Y E Cp ., define

L XiYi·
00

fy(x)
i=l
2.3 Continuous Linear Transformations 43

Then, by Holder's inequality, we have


00

!fy(x)! ::; 2: !XiYi! ::; I\xl\pl\yl\p'.


i=1
Thus fy defines a continuous linear functional on £p and

We will see later that, when 1 :::; P < 00, all continuous linear functionals
on £p occur only in this way and that the last inequality is, in fact, an
equality. •

Example 2.3.5 Consider an infinite matrix (aij )0=1. This can be used
to define a linear mapping on £p as follows. Let x = (Xi) E £p. Define a
sequence A(x) by
00

A(X)i = L aijXj.
j=1
Showing that A defines a continuous linear map of £p into itself is usually
a non-trivial problem and some examples are given in the exercises at
the end of this chapter. We now give an example (due to Schur).
Assume that aij ~ 0 for all i and j. Assume further there exists a
sequence {Pi} of positive real numbers and {3 > 0 and, > 0 such that
00

L aijPi :::; {3Pj


i=l
for all j E N and also such that
00

2: aijpj :::; ,Pi


j=1
for all i E N. Then A E £(£2) and I\A\!2 :::; {3,.
To see this, let x = (Xi) E £2. We write
00 00 y?iijx j
LaijXj = Ly7iijyPj ..
j=1 j=1 . JPj
Applying the Cauchy-Schwarz inequality, this yields
44 2 N armed Linear Spaces

It follows from the hypotheses that

IA( ) 12 aiil x il 2
",,00
L..ti=l X i < ",,00
L.Ji=1 '"'(P2 L.Jj=1
_",,00
Pi
",,00 Ix _1 2 ",,00
= r L...Jj==l ~ L.-i=l aijPi
< ",,00 Ix _12
'"'( L.Jj=1 ~f3Pj
= f3'"'( Ilx II§
which establishes the claim.
An interesting particular case is that of the Hilbert matrix. Set
1
a-- -
2J - i +j +1

for 0 ~ i,j ~ 00 . Set Pi = 1/ Vi


+ ~. Since the matrix is symmetric, it
suffices to check one of the two conditions. Now,

roo dx
< Jo (x+J+!)y'X

= 2 roo dt
Jo t 2 +J+!

Thus, by Schur's test, the matrix defines a continuous linear operator A


on £2 whose norm is less than, or equal to 1r. (In fact, it has been shown
by Hardy, Littlewood and Polya that the norm is exactly Jr.) •

Example 2.3.6 (Cesaro Operator) Let x = (Xi) E fp where 1 < P < 00 .


Define
Xl + ... + Xn
(T(x))n =
n
We show that T E £(fp) and that

P
IITII ~ p-l

Indeed,
2.3 Continuous Linear Transformations 45

Set An = IX11+·· ·+Ixnl and an = An/n, for n 2: 1 and set Xo = Ao = o.


Then
c1n - rtrc1n-1Ixnl = c1n - rtrc1n-1(nan - (n - l)a n -d

( 1 _ 2:!E..)
p-1
~
n
+ (n-1)p p-1
p-1 an an-1

~) ~n + (n
( 1 p-1 p-11)p(~)7(
n .n-1 )~.

Recall that p/(p-1) = p*, the conjugate exponent of p. Thus by Lemma


2.2.1, we get

( 1 - 2:!E..)
p-1 ~n + (n-1)p
p-1 (E.=
p ~n +!d?.
p n-1 )

Fix a positive integer N. Then summing both sides over n running


between 1 and N, and noticing that the right-hand side is a telescoping
sum, we get

By an application of Holder's inequality, we now deduce that

N N
~ ~ :s; p~l~ ~ llxnl :s; P~l ~ ~
(N )7(N~lxnlP )~
Dividing both sides by (2:;;=1 c1n)«p-1)/p), which is strictly positive for
non-zero x, we get

Since N was arbitrarily chosen, we deduce that

which establishes our claim. (In fact, Hardy, Littlewood and Polya also
show that IITII = p/(p - 1).) •
46 2 o~ Linear Spaces

Example 2.3.7 (Volterra integral operator) Let K : [0, 1] x [0, 1] --t lR


be a continuous function. For f E C[O, 1] and s E [0, 1], define

T(f)(s) = fos K(s, t)f(t)dt.


Since K is continuous on the compact set [0,1] x [0, 1], it is bounded and
uniformly continuous. Assume that, for all sand t in [0,1]' we have

IK(s, t)1 ::; K.

Further, given E > 0, there exists 8> a such that, whenever lSI -S21 < 8,
we have
IK(sl' t) - K(S2' t)1 < E

for all t E [0,1], by virtue of the uniform continuity. Without loss of


generality, we can assume that 8::; E. Thus,

T(f)(Sl)-T(f)(S2) = tl (K(Sl' t)-K(S2' t))f(t)dt+l s1


K(82' t)f(t)dt
~ ~
If Ilfll denotes the norm of f E e[O, 1] as defined in Example 2.2.6, we
have

whenever lSI - 821 < 8. This shows that T(f) is a continuous function.
The mapping T being clearly linear, it thus defines a linear operator on
e[o, 1]. Further,
IT(f)(S)1 ::; Kllfl18 ::; Kllfll·
Thus T is a continuous linear operator on e[O, 1] and IITII ::; K. •

So far, we have been seeing examples of continuous linear transfor-


mations. We now give an example of a linear transformation which is
not continuous.

Example 2.3.8 Consider the space C1 [0, 1] of continuous functions on


[0,1] which are continuously differentiable on (0,1) and whose derivatives
can be extended continuously to [0,1]. This is a subspace of C[O, 1]. Let
both these spaces be endowed with the 'sup-norm' (cf. Example 2.2.6).
Then, the map T : e1 [0, 1] --t e[o, 1] defined by T(f) = ff, where ff
denotes the derivative of f, is not continuous. To see this, consider the
2.3 Continuous Linear Transformations 47

sequence of functions Un} defined by fn(t) = t n for n 2: 1. Then, it


is easy to see that Ilf~11 = n while Ilfnll = 1. Hence there can be no
constant C > a such that IIT(f)11 :s Gllfll for all f E el[a, 1]. Thus, T
is not continuous. •

Definition 2.3.6 Let V be a normed linear space and let T E .c(V) be


a bijection. If T- l is also continuous, then T is said to invertible or
an isomorphism . •

Note: When dealing with normed linear spaces, the word isomorphism
is understood in the topological sense: not only is it an isomorphism in
the usual algberaic sense i. e. it is linear and is a bijection, but it also
implies that both the mapping and its inverse are continuous.

Definition 2.3.7 Two norms defined on the same vector space are said
to be equivalent if the topologies induced by these two norms coincide .

Proposition 2.3.4 Let V be a vector space and let 11.11(1) and 11.11(2) be
two norms defined on it. The two norms are equivalent if, and only if,
there exist two constants Gl > a and C2 > a such that, for all x E V,
we have

Proof: The topologies induced by the two norms coincide if, and only
if, the identity mapping

is an isomorphism. This is equivalent to saying that there exist two


constants K 1 > a and K 2 > a such that

for all x E V. This proves the proposition on setting C l = Kll and


G2 = K2·.

Example 2.3.9 Let x = (Xl, ... , XN) E]RN (or eN). Then, clearly,

Ilxll oo :s; IlxllI :s; Nllxll oo .


Thus these two norms are equivalent and the topologies induced on ]RN
(respectively, eN) by the norms 11.111 and 11.1100 coincide. It is a simple
48 2 lVorr,ned Linear Spaces

matter to check that the topology induced by 11.1100 is none other than
the product topology on ]RN (respectively CP) when]R (respectively q
is given its usual topology for every component . •

We will, in fact, now prove a much stronger result.


Proposition 2.3.5 Any two norr,ns on a finite dimensional vector space
are equivalent.
Proof: Let V be a finite dimensional normed linear space with dimen-
sion N. We will show that V is isomorphic to the space.el(. Thus, given
two norms on V, it will be isomorphic to .ef for each of those norms and
from this we will deduce the equivalence of the norms.

Step 1. Let {el, ... , eN} be the standard basis of .ef. Fix a basis
{Vl, ... ,VN} for V. Define T:.ef ----t V by setting T(ei) = Vi for all
1 :S i :S N and then extending T linearly to all of .ef. Clearly T is a
bijection and an identical argument as in Example 2.3.1 shows that T
is continuous.

Step 2. Assume, if possible, that T- 1 is not continuous. Then the


continuity must fail at 0 and so we can find a sequence {Yn} in V and
a real number E: > 0 such that IIT-1(Yn)lh ;::: c > 0 while Yn ----t o. Set
Zn = Yn/IIT-1(Yn)lll. Then, Zn ----t 0 and IIT-1(zn)lll = 1. Now, the set
B = {xE.ef Illxlli :S I}
is compact. To see this, observe that B is a closed set and is contained in
the set ~ 1, 1], which, being the product of compact sets, is compact
in the topology induced by the norm 11.1100' and hence, in the topology
induced by the norm 11.lh as well (cf. Example 2.3.9). Thus, B is also
compact. Consequently, it is also sequentially compact and so, there
exists a subsequence {znk} such that {T-1(znk)} is convergent. Let
T-1(Znk) ----t x, where Ilxlli = 1. Since T is continuous, we then deduce
that znk ----t T(x) which then implies that T(x) = o. But T is a one-one
map and Ilxlli = 1 implies that xt=-O which shows that T(x) t=- 0 as
well. This gives us a contradiction. Hence T- 1 must also be continuous.

Step 3. Thus, whatever be the norm on V, the same map T is always an


isomorphism between .ef and V which implies in turn that the identity
map on V, considered as a map of normed linear spaces when V is pro-
vided with two different norms, must be an isomorphism as well. Hence
2.3 Continuous Linear Transformations 49

any two norms on V are equivalent . •

Remark 2.3.2 We mentioned in Proposition 1.2.5 that a set in ]RN (or


eN) is compact if, and only if, it is closed and bounded. However, the
topology given there is the 'usual' topolgy, viz. that of £f1. It is for
this reason that we needed to give a separate argument to show that S
(which is also closed and bounded) is compact in £1(. However, thanks
to the preceding proposition, we now know that the topology is the same
for all the spaces £~ or, for any other norm on ]RN (respectively, eN)
and so a subset thereof will be compact if, and only if, it is bounded and
closed . •
Corollary 2.3.2 Any finite dimensional normed linear space is com-
plete. In particular, any finite dimensional subspace of a normed linear
space is closed. •
Example 2.3.10 Let f E C[O, 1]. Define

IIfl11 = 10 1
If(t)1 dt.

It is simple to check that this defines a norm on C[O, 1]. Consider the
sequence {fn} defined by

1- nx, °
for < - x <
for .1n < x
- .1
<
n
0, - - 1.

Clearly, Ilfnll = 1 for all n (where 11.11 denotes the usual 'sup' norm)
while Ilfnlh = ~ fn(t) dt = 1/2n which tends to zero as n tends to in-
finity. Thus it is clear that these two norms cannot be equivalent. Thus
in infinite dimensional spaces, two norms are not, in general, equivalent .

Since all norms on ~N (respectively, eN) generate the same topology,
it is now clear that any matrix generates a continuous linear transforma-
tion, whatever the norm on that space may be. Thus, if T is an N x N
matrix and if 11.11 is a norm on ~N (respectively, eN), we can define

ITII
I sup IITxll (2.3.5)
= iixii::;1, #0 Ilxll
or, via any of the other equivalent formulations as in Proposition 2.3.2.
Since the unit ball (and hence the unit sphere) is compact, the 'sup'
50 2 lVorTined Linear Spaces

above is, in fact, a 'max'. If T and S are matrices of order lV, then TS
represents the composition of the corresponding linear transformations
and so we also have
IITSII ::; IITII·IISII· (2.3.6)
Let MN denote the set of all matrices of order lV with entries from the
corresponding field. This itself (under the operations of matrix addition
and scalar mUltiplication) is a vector space (of dimension N 2 ). Any
norm on this space which satisfies (2.3.6) is called a matrix norm. If
such a norm were induced by a vector norm on ]RN (respectively, (CN)
via (2.3.5), then we always have

IIIII 1

for the identity matrix I.


In particular, for the vector norms 11.llp defining the spaces .e!J for
1 ::; p ::; 00, we denote the induced matrix norms by 11.llp,N.

Example 2.3.11 Since MN is a vector space of dimension N 2 over the


corresponding field, we can string out its rows to form a vector of that
dimension and define the usual Euclidean norm. Thus, if T = (tij), then
define

This obviously defines a norm on MN. It is also a matrix norm. For


this, we only need to check the validity of (2.3.6). It T = (tij) and
S = (Sij), then

II II~ 2:~=112:~=1 tik S kjl2


< 2:~=1 (2:~=1ItikI2) (2:~=1I kjI2)
by the Cauchy-Schwarz inequality. Thus,we get

N N
II II~ ::; L Itikl2 L ISkjl2 = II II~II II~
i,k=l k,j=l

which shows that II· liE is indeed a matrix norm. This is also known as
the Hilbert-Schmidt nOrTin.
2.3 Continuous Linear Transformations 51

However, notice that this norm is not induced by any vector norm
when N ~ 2. Indeed to see this, observe that lilliE = VN # 1. •

The proof of the Proposition 2.3.5 depended crucially on the fact


that the unit ball in a finite dimensional space is compact. In fact this
property characterizes finite dimensional spaces, which we now proceed
to show. We begin with a very useful technical result.
Let V be a normed linear space. If E c V, we define the distance of
a vector x E V from E as

d(x, E)= inf


yEE
IIx - YII.

This is the same notion of the distance of a point from a subset in a


metric space if we look at Vasa metric space for the metric d( x, y) =
IIx-yll ·
Lemma 2.3.1 (Riesz' Lemma) Let V be a normed linear space and
let W c V be a closed and proper subspace. Then, for every E > 0, we
can find a vector u E V (depending on E) such that

lIuli = 1 and d(u, W) ~ 1 - E.

Proof: Since W is a proper subspace, there exists v E V\ W, so that


8 = d(v, W) > o. Now, choose w E W such that

8
8 :S IIv-wll :S -1-·
-E

Set u = (v - w)/liv - wll so that lIuli = 1. Let z E W be an arbitrary


element. Then

lIu - zll = IIv - (w + IIv - wllz)II/liv - wll ~ 8/(8/(1 - E)) = 1 - E

by the definition of 8 (since w + IIv - wllz E W) and the choice of w.


This completes the proof. •

Proposition 2.3.6 A normed linear space V is finite dimensional if,


and only if, the closed unit ball in V, i. e. the set

B = {x E V IlIxll ~ I},

is compact.
52 2 N armed Linear Spaces

Proof: Assume that V is finite dimensional, with dimension N. Let


T : f'( - t V be the canonical mapping as defined in Proposition 2.3.5.
We have seen that T is an isomorphism. It then follows that T-1(B)
is bounded and closed in £'( and so it is compact (cf. Remark 2.3.2).
Consequently B = T(T-l(B)) is compact as well.
Conversely, let us suppose that B is compact. Then, there exists a
positive integer n and points Xi E B, 1 ::; i ::; n, such that

(2.3.7)

where B(Xi' 1/2) = {x E V Illx-xili < 1/2} is the open ball centered at
Xi and of radius 1/2. Set W = span{xl' ... , x n }. We claim that W = V
and this will prove that the dimension of V is less than, or equal to,
n and so V has to be finite dimensional. Assume the contrary. Then
W will be a proper and closed (since it is finite dimensional) subspace
of V. Now, by the preceding lemma of Riesz, we have the existence of
u E V such that Ilull = 1 (and so u E B) and such that d(u, W) ~ 2/3.
In particular, it follows that u E B is such that Ilu - xiii ~ 2/3 for all
1 ::; i ::; n which contradicts (2.3.7). This completes the proof. •

Example 2.3.12 Consider the space £2 of all square summable se-


quences. Consider the sequence ei E £2 which has its i-th component
equal to unity and all other components equal to zero. Then, since
Ileil12 = 1, it belongs to the closed unit ball in that space. Now, if i =I j,
we have
Ilei - ej 112 = V2
and so the the sequence {ei} can never have a convergent subsequence.
Thus, in the infinite dimensional space £2, we directly see that the unit
ball is not sequentially compact and hence it is not compact. •

2.4 Applications to Differential Equations

One of the famous results in analysis is Banach's contraction mapping


theorem (also known as Banach's fixed point theorem), which is stated
as follows.
Theorem 2.4.1 (Contraction Mapping Theorem) Let (X, d) be a
complete metric space and let F : X - t X be a contraction, i. e. there
exists a constant 0 < c < 1 such that, for all x and y EX, we have

d(F(x), F(y)) ::; c d(x, y).


2.4 Applications to Differential Equations 53

Then F has a unique fixed point, i. e. there exists a unique point x* E X


such that
F(x*) = x*.

Further, given any Xo EX, the sequence {x n } defined by

converges to x* . •

Proof: Obviously, F is continuous. Let Xo E X and let the Xn be as


defined in the statement of the theorem. Then, by hypothesis,

and, proceeding recursively, we deduce that

Thus, if n < m, we have

which can be made arbitrarily small for large nand m since the geometric
series 2:%"=1 ck is convergent for 0 < c < 1. Thus {x n } is a Cauchy
sequence and, since X is complete, it converges to some x* EX. The
continuity of F and the definition of the Xn now imply that x* = F(x*).
If there were two distinct fixed points of F, say, x and y, then, we
have that
o < d(x, y) = d(F(x), F(y)) :s: cd(x, y)
which is a contradiction, since 0 < c < 1. This completes the proof. •

We now give a well known application of this result.

Theorem 2.4.2 (picard's Theorem) Let R be a closed rectangle in


the plane]R2 whose sides are parallel to the coordinate axes. Let f : R ---t
]R be a function which is continuous and which is such that U
exists and
54 2 Normed Linear Spaces

is continuous on R. Let (xo, YO) be a point in the interior of R. Then,


there exists h > 0 such that the initial value problem

~; = f(x, y)
y(xo) = Yo
has a unique solution in the interval (xo - h, xo + h).
Proof: Since R is compact and f and U are continuous on R, there
exist K > 0 and M > 0 such that

If(x,y)1 :=:; K and 1~~(x )1 :=:; M

for all (x, y) E R. Then, by the mean value theorem, it follows that, for
all (x, yd and (x, Y2) in R, we have

It is easy to see that a function y = y(x) is a solution of the initial


value problem above if, and only if, it satisfies the following:

y(x) = Yo + r f(t, y(t)) dt


Jxo
for all x.
Now, choose h > 0 such that M h < 1. Consider the rectangle R'
defined as follows:

R' = {(x,y) I Ix-xol:=:;h, ly-yol:=:;Kh}.

By choosing h small enough, we can ensure that R' c R. Now consider

x = {g E C[xo - h,xo + h]llg(x) - yol :=:; Kh for all x}.

Then, we see that X is a closed subspace of C[xo - h, Xo + h] and so is


a complete metric space (with the distance induced by the 'sup-norm').
Let 9 EX. Define

F(g)(x) = Yo + r
Jxo
f(t, g(t)) dt

for all x E [xo - h, Xo + h]. Then, clearly, F(g) is continuous on that


interval and
IF(g)(x) - yol :=:; Klx - xol :=:; Kh.
2.4 Applications to Differential Equations 55

Further, if g1 and g2 are in X, we have

< Mhllg1 - g211


from which it follows that IIF(g1) - F(g2)11 ~ Mhllg 1 - g211 which shows
that F maps the complete metric space X into itself and is a contrac-
tion. Thus F has a unique fixed point y E X which solves the initial
value problem. This completes the proof. •

We now prove a corollary of the contraction mapping theorem which


will also be useful in proving the existence of solutions to higher order
initial value problems.
Corollary 2.4.1 Let (X, d) be a complete metric space and let F : X -+
X be a mapping such that, for some positive integer n, the map F n :
X -+ X is a contraction. Then F has a unique fixed point.

Proof: Since F n : X -+ X is a contraction, this mapping has a unique


fixed point x* by the preceding theorem. Now,

F(x*) = F(Fn(x*)) = Fn+1(x*) = Fn(F(x*))

and thus, F(x*) is also a fixed point for Fn. By the uniqueness of the
fixed point, it follows that

F(x*) = x*
and so F has a fixed point, viz. x*.
On the other hand, any fixed point y of F is also a fixed point of F n
since

F(y) y.

Thus F also must have a unique fixed point . •

Example 2.4.1 Consider the Volterra integral operator T : C[O, 1] -+


C[O, 1] defined in Example 2.3.7, viz.

T(f)(s) = fos K(s, t)f(t) dt


56 2 Normed Linear Spaces

where K : [0, 1] x [0, 1] -----t lR. is a continuous function. Consider the


following problem: find u E e[O, 1] such that

u(s) = w(s) + A 10 K(s, t)u(t) dt


8
(2.4.1)

where wE e[O, 1] is a given function and A E R The equation (2.4.1) is


called a Volterra integral equation. It is clear that a solution u of (2.4.1)
is a fixed point of the (affine linear) mapping F: e[O, 1] -----t qo, 1] defined
by
F(u) = w + AT(u).
Now, for any s E [0,1] and for any U1 and U2 E e[O, 1],

IF( U1)(S) - F( U2)( s) I IA I; K( s, t)( (U1 (t) - U2(t)) dtl


< I I~ll 1 - u211s

where
~ max IK(s, t)l·
[O,1]x[O,1]
Hence, we get

Again,

IF2(U1)(S) - F2(U2)(s)1 IAI; K(s,t)(F(U1)(t) - F(U2)(t)) dtl


< I I~ I; IF(ud(t) - F(U2)(t)1 dt
< I 12~211 1 - u211 I; t dt
I 12~211 1 - 211~
whence we deduce that

Proceeding in this way, we get, for any positive integer n,

Since I ln~n In!


is the general term of the convergent exponential series
xp(I I~), it tends to zero as n tends to infinity and so, for sufficiently
large n, we have
2.5 Exercises 57

and hence F n is a contraction. Thus, by the preceding corollary, F has a


unique fixed point. In other words, the Volterra integral equation (2.4.1)
has a unique solution. •

We now show that the study of certain differential equations can be


reduced to the study of the Volterra integral equation.
Consider the equation

x"(s) + p(s)x'(s) + q(s)x(s) = f(s) (2.4.2)

for s E (0,1), where p, q and f are given continuous functions. Consider


the initial conditions

x(O) = 0: and x'(O) = (3 (2.4.3)

where 0: and (3 are two given real numbers. Set u(s) = x"(s). Then,

x'(s) = (3 + 18
u(t) dt.

Again,
x(s) 0: + J; x'(t) dt
0: + (3s + J; ~ U(T) dT dt

0: + (3s + J; J: U(T) dt dT
0: + (3s + J; U(T)(S - T) dT.

Thus, (2.4.2) can be written in the form (2.4.1) with

w(s) = f(s) - [(3p(s) + o:q(s) + (3sq(s)] ,


A = 1 and
K(s, t) = - [P(s) + (s - t)q(s)].
Hence, the initial value problem (2.4.2)-(2.4.3) has a unique solution.

2.5 Exercises

Note: In all the function spaces which occur below, it is assumed that
vector addition and scalar multiplication are defined pointwise.

2.1 Let f E C[O, 1]. Define

1 1.
Ilfllp (folf(t)IP dt) P
58 2 N ormed Linear Spaces

where 1 ::; p < 00. Show that this defines a norm on C[O, 1J.

2.2 Show that the space C[O, 1J with the norm 11.111 defined in the previ-
ous exercise is not complete by producing a Cauchy sequence which is
not convergent.

2.3 Let f E C(JR). The support of f is the closure of the set of points
where f does not vanish. Let Cc(JR) denote the space of all continuous
real valued functions on JR whose support is a compact subset of JR. Show
that it is a normed linear space with the 'sup-norm' and that it is not
complete.

2.4 Let Ca(JR) denote the space of all continuous real valued functions
on JR which vanish at infinity, i.e if f E Ca(JR), then, given any c > 0,
there exists a compact subset K c JR such that
If(x)1 < c
for all x E JR\K. Show that Ca(JR) is a Banach space with the 'sup-
norm'. Show also that the space Cc(JR) defined in the previous exercise
is dense in Ca(JR).

2.5 Let Cl[O, 1J denote the space of all continuous real valued functions
on [0, 1J which are continuously differentiable on (0,1) and whose deriva-
tives can be continuously extended to [0, 1J. For fECI [0, 1]'define
Ilfll = m[ax]{lf(t)I,I!'(t)l}
tE a,l

where f' denotes the derivative of f. Show that Cl[O, 1J is a Banach


space for this norm. State and prove an analogous result for Ck [0, 1J, the
space of all continuous real valued functions on [0, 1J which are k times
continuously differentiable on (0,1) and all those derivatives possessing
continuous extensions to [0,1J.

2.6 Let fECI [0, 1J and let f' denote its derivative. Define

(1
1

1
Ilfllt = (If(t)1 2 + 1!'(tW)dt) 2
Show that 11.111 defines a norm on Cl[O, 1J. If we define
2.5 Exercises 59

does 1.11 define a norm on Cl[O, 1]7

2.7 Let
v = {J E Cl[O, 1]1 f(O) = a}.
Show that 1.11 defines a norm on V.

2.8 Let V be a Banach space with norm 1I.lIv. Set

x = C([O, 1]; V)

to be the space of all continuous functions from [0,1] into the space V.
Define, for f EX,

IIflix = sup IIf(t)lIv = (max IIf(t)lIv)


tE[O,l] tE[O,l]

Show that 1I.lIx is well defined and that it defines a norm on X. Show
also that, under this norm, X is a Banach space.

2.9 Let V and W be normed linear spaces and let T : V ~ W be a


linear transformation. Show that T is continuous if, and only if, T maps
Cauchy sequences in V into Cauchy sequences in W.

2.10 Let Cl[O, 1] be endowed with the norm as in Exercise 2.5 above. Let
qO,I] be endowed with the usual 'sup-norm'. Show that T : Cl[O, 1] ~
qO,I] defined by T(f) = l' is a continuous linear transformation and
that IITII = 1.

2.11 Let C[O, 1] be endowed with its usual norm. For f E qo, 1], define

T(f)(t) = lot f(s) ds, t E [0,1].

For every positive integer n, show that

1:
2.12 Consider the space CcOR) defined in Exercise 2.3 above. For E
Cc(lR), define
<p(f) = f(t) dt.
60 2 N ormed Linear Spaces

Show that <p is well defined and that it is a linear functional on this
space. Is it continuous?

2.13 Let {tdi=l be given points in the closed interval [0,1]. Let {wi}i=l
be given real numbers. Let f E C[O, 1]. Define
n
<p(f) = Lwd(td.
i=l

Show that <p defines a continuous linear functional on qo, 1] and that
n

11<p11 = L IWil·
i=l

2.14 Let Mn denote the set of all n x n matrices with complex entries.
Let 11.llp,n denote the matrix norm induced by the vector norm 11.llp on
en, for 1 ::; p::; 00. If A = (aij) E M n , show that

State and prove an analogous result for IIAlloo,n.


2.15 With the notations introduced in the preceding exercise, show that

IIAI12,n = y'p(A*A)

where p(T) denotes the spectral radius of a matrix T. (Hint: Use Propo-
sition 1.1.8). If A is a normal matrix, show that IIAI12,n = p(A).

2.16 With the notations introduced above, show that, for any matrix
A E M n , we have

where II.IIE is the norm introduced in Example 2.3.11.


2.17 If 11.11 defines a matrix norm on M n, show that p(A) ::; IIAII for all
AEMn·
2.5 Exercises 61

2.18 Let A E Mn be invertible and let 11.11 be a matrix norm. The


condition number of A is defined as

cond(A) = IIAII.IIA -111·

Show that
(a) cond(A) 2: 1 for any invertible matrix A E Mn;
(b) cond(o:A) = cond(A) for any invertible matrix A and for any scalar
0: # 0;
(c) for any invertible and normal matrix A,

where {Ai(A)}r=1 are the eigenvalues of A and cond2,n(A) denotes the


condition number of A with respect to the norm 11.112,n.

2.19 For what class of matrices does cond2,n attain its minimum value?

2.20 Let A = (aij) be a 2 x 2 matrix which is invertible. Show that

where
2::;,j=1 laij 12
(Y =
2Idet(A)1
2.21 Let Mn be endowed with the topology generated by any matrix
norm. Let GLn(CC) denote the set of all invertible matrices in Mn. Show
that GLn(tC) is an open and dense set. Is it connected?

2.22 (a) Let Vn be the subset of all n x n matrices with distinct eigen-
values. Show that Vn is dense in Mn (endowed with any matrix norm).
(b) Prove the Cayley-Hamilton theorem for any diagonalizable matrix:'Every
n x n matrix satisfies its characteristic equation'.
(c) Deduce the Cayley-Hamilton theorem for all n x n matrices.

2.23 Let A E Mn be an invertible matrix. Show that

. 1
mf II A - B 112 n = 1 .
B is singular ' IIA - 112,n
62 2 lVorr,ned Linear Spaces

2.24 Show that the set of all orthogonal matrices in the space of all
n x n real matrices (endowed with any norm topology) is compact.

2.25 Let 1 ::; p < q ::; 00. Show that lp C lq and that, for all x E lp,

2.26 Consider an infinite matrix (aij), i,j EN of scalars. Let x = (Xi) E


lp,l ::; p ::; 00. Define a sequence A(x) whose i-th component is given
by
00

LaijXj.
j=l

(a) Assume that


00

a = sup L laijl < 00.


t j=l

Show that A E £(ll) and that IIAII = a.


(b) Assume that
00

a = sup L laijl < 00.


J i=l

Show that A E £(£00) and that IIAII = a.


(c) Assume that
00 00
a = LL
laijl2 < 00.
i=l j=l

Show that A E £(l2) and that IIAI12 ::; a.


(d) Assume that A E £(lp) and that A E £(.eq) where 1 ::; p < q < 00.
Let () E (0,1). Set r = ()p + (1 - ())q. Show that A E £(lr) and that

IIAIIL:(fr) ::; II II~(fp)II II~( )


2.27 Let { k ~o be a sequence ofreal numbers such that 2:::'0 lakl <
00.Consider the infinite lower triangular matrix

ao 0 0
a1 ao 0
a2 a1 ao
a3 a2 a1
2.5 Exercises 63

Let A be the linear map defined on £2 by this matrix (as in the preceding
exercise). Show that A E £(£2) and that
00

IIAII ~ L lakl·
k=O

2.28 Let V be a Banach space. Let {An} be a sequence of continuous


linear operators on V. Let

If {Sn} is a convergent sequence in £ (V), we say that the series

is convergent and the limit of the sequence {Sn} is called the sum of the
series. If 2:~1 IIAkl1 < 00, we say that the series 2:~1 Ak is absolutely
convergent. Show that any absolutely convergent series in convergent.

2.29 Let V be a Banach space. If A E £(V) is such that IIAII < 1, show
that the series
00

1+ LAk
k=1

is convergent and that its sum is (I - A)-I.

2.30 (a) Let V be a Banach space and let A E £(V). Show that the
series
00 Ak

I+Lkf
k=1

is convergent. The sum is denoted exp(A).


(b) If A and B E £(V) are such that AB = BA, show that

exp(A + B) = exp(A) exp(B).

(c) Deduce that exp(A) is invertible for any A E £(V).


(d) Let
A = [a w
-w]
a '
64 2 N armed Linear Spaces

where 0: and ware real numbers. Show that, for any t E JR,

exp (t A) -_ e
ad [coswt
. - sinwt ] .
smwt coswt

2.31 Let V be a Banach space. Show that g, the set of invertible linear
operators in £(V) is an open subset of £(V) (endowed with its usual
norm topology).

2.32 (a) Define T : £2 -t £2 and S : £2 -t £2 by

T(x) (O,X1,X2, •.. )


S(x) (X2' X3, ... )

where x = (X1,X2, ... ) E £2. Show that T and S define continuous linear
operators on £2 and that ST = I while TS =1= I. (Thus, T and S, which
are called the right and left shift operators respectively, are not invert-
ible) .
(b) If A is a continuous linear operator on £2 such that IIA - Til < 1,
show that A is also not invertible. Deduce that, in general, g, defined
in Exercise 2.31 above, is not dense in £(V), if V is infinite dimensional.
(Compare this with the finite dimensional case, cf. Exercise 2.21).

2.33 Let P denote the space of all polynomials in one variable with
real coefficients. Let pEP and let p = 2:~1 aixi, where ai E JR for
1 :S i :S n. Define
n

Show that 11.111 and 11.11 00 define norms on P and that they are not equiv-
alent.

2.34 Let V be a normed linear space and let W be a finite dimensional


(and hence, closed) subspace of V. Let x E V. Show that there exists
w E W such that
IIx+ WII = IIx+wll·
2.35 Let Ei be Banach spaces for 1 :S i :S 3. Let A E £(E1' E 2 ) and BE
£(E1' E3). Assume, further, if K is any bounded set in E 1, then B(K)
is compact in E3. Finally, assume that x E E1 1--+ IIA(x)IIE2 + IIB(x)II E3
defines a norm on E1 which is equivalent to the norm 1I.IIE1.
2.5 Exercises 65

(a) Show that Ker(A), the kernel of A, is finite dimensional.


(b) Let R(A) denote the range of A. Show that the canonical mapping
A: El/(Ker(A)) ---+ R(A) defined by A(x + Ker(A)) = A(x) for x EEl,
is an isomorphism.
(c) Deduce that R(A) is a closed subspace of E 3 .

2.36 Let V and Y be normed linear spaces and let W be a dense subspace
of V. Let A E £(W, Y). Show that there exists a unique continuous ex-
tension A E £(V, Y) (i.e. Alw = A) and that IIAllc(V,y) = IIAllc(w,Y)'

2.37 (Completion of a normed linear space) Let V be a normed linear


space. We say that two Cauchy sequences {xn} and {Yn} in V are equiv-
alent if Ilxn - Ynll ---+ 0 as n ---+ 00.
(a) Show that this defines an equvalence relation. Let the set of all
equivalence classes be denoted by V .
(b) Let x and y denote the equivalence classes of the Cauchy sequences
{xn} and {Yn} respectively. Let 0 denote the equivalence class of the
sequence all of whose terms are zero. Let a be a scalar. Define x + y
to be the equivalence class of the sequence {xn + Yn} and ax to be that
of the sequence {ax n }. Show that these operations are well defined and
make V a vector space.
(c) With the above notations, define Ilxllv = limn-+oo Ilxnllv. Show that
this is well defined and that it defines a norm on V.
(d) Define i : V ---+ V by setting i(x) to be the equivalence class of the
sequence all of whose terms are equal to x, for any x E V. Show that
i E £(V, V) and that it is an injection. Show also that the image i(V) is
dense in V.
(e) Show that V is complete (and this space is called the completion of
V). (Hint: Given a Cauchy sequence {x(n)} in V, choose Xn E V such
that Ilx(n) - i(xn)IIV < lin. Show that {xn} is a Cauchy sequence in V
and if x denotes its equivalence class, show that x(n) ---+ x in V.)

2.38 Let V and W be normed linear spaces and let U c V be an open


subset. Let J : U ---+ W be a mapping. We say that J is (Frechet)
differentiable at u E U if there exists T E £(V, W) such that

. IIJ(u
11m + h) - J(u) - T(h)11
O.
h-+O Ilhll
66 2 lVorTined Linear Spaces

(Equivalently,

J(u + h) - J(u) - T(h) = c(h) lim Ilc(h)11 = 0.)


'h--+O IIhll
(a) If such a T exists, show that it is unique. (We say that T is the
(Frechet) derivative of J at u E U and write T = J'(u).)
(b) If J is differentiable at u E U, show that J is continuous at u E U.

2.39 Let V and W be normed linear spaces and let U c V be an open


subset. Let J : U ---t W be a mapping. We say that J is Gateau
differentiable at u E U along a vector w E V if
. 1
lIm -(J(u + tv) - J(u))
t--+O t

exists. (We call the limit the Gateau derivative of J at u along w.)
If J is Fhkhet differentiable at u E U, show that it is Gateau differ-
entiable at u along any vector w E V and that the corresponding Gateau
derivative is given by J'(u)w.

2.40 Let V and W be normed linear spaces and let A E £(V, W). Let
Wo E W be given. define J : V ---t W by J(u) = A(u) + wo0 Show that
J is differentiable at every u E V and that J' (u) = A.

2.41 Let U = GLn(C) C Mn (cf. Exercise 2.21). Define J(A) = A-I


for A E U. Show that J is differentiable at every A E U and that, if
H E M n , we have

J'(A)(H) = -A -lHA -1.

2.42 (a) Let A E GLn(C). Show that

det(I + A) = 1 + tr(A) + c(A)


where
. Ic(A)1
f~o lAf =0
(for any matrix norm).
(b) Deduce that if we define J(A) = det(A) for A E GLn(C), then J is
differentiable at all such A and that, if HE M n , then

J'(A)(H) = det(A)tr(A -lH).


2.5 Exercises 67

2.43 (Chain Rule) Let V, Wand Z be normed linear spaces and let
j : V ~ Wand 9 : W ~ Z be mappings such that f is differentiable at
a point v E V and 9 is differentiable at j (v) = w E W. Show that the
map 9 0 j : V ~ Z is differentiable at v E V and that

(g 0 f)'(v) = g'(f(v)) 0 j'(v).

2.44 (a) Let V be a real normed linear space and let J : V ~ lR be a


given mapping. A subset K of V is said to be convex if, for every u and
v E K and for every t E [0,1]' we have that

tu + (1 - t)v E K.

Let K c V be a closed convex set. Assume that J attains its minimum


over K at u E K. If J is differentiable at u, then show that

J'(u)(v - u) ~ a
for every v E K.
(b) Let K = V. If J attains its minimum at u E V and if J is differen-
tiable at u, show that J' (u) = o.

2.45 Let V be a real normed linear space. A mapping J : V ~ lR is said


to be convex if, for every u and v E V and for every t E [0,1]' we have

J(tu + (1 - t)v) ::; tJ(u) + (1 - t)J(v).


(a) If J : V ~ lR is convex and differentiable at every point, show that

J (v) - J (u) ~ J' (u) (v - u)

for every u and v E V.


(b) Let J : V ~ lR be convex and differentiable at every point of V. Let
K C V be a closed convex set. Let u E K be such that

J'(u)(v - u) > 0

for every v E K. Show that

J(u) = minJ(v).
vEK

(c) If J : V ~ lR is convex and differentiable at every point of V, and if


u E V is such that J' (u) = 0, show that J attains its minimum (over all
68 2 N ormed Linear Spaces

of V) at u.

Remark 2.5.1 Exercise 2.44 gave necessary conditions for a differen-


tiable function J to attain a minimum at a point u. The preceding
exercise shows that these conditions are also sufficient in the case of
convex functions. •

2.46 Let m > n. Let A be an m x n matrix and let b E ~m. Consider


the linear system of equations

Ax = b.

This system may not have a solution since the number of equations
exceeds the number of unknowns. A least squares approximate solution
is a vector Xo E ~n such that

Show that such a solution must satisfy the linear system

A*Axo = A*b

and that this system has a unique solution if the rank of A is n.


Chapter 3

Hahn-Banach Theorems

3.1 Analytic Versions

The analytic form of the Hahn-Banach theorem concerns the extension


of linear functionalsdefined on a subspace of a normed linear space to
the entire space, preserving the norm of the functional. We will prove a
slightly more general result in this direction.

Theorem 3.1.1 (Hahn-Banach Theorem) Let V be a vector space


over JR. Let p : V ~ JR be a mapping such that

p(ax) ap(x) } (3.1.1)


p(x + y) < p(x) + p(y)
for all x and y E V and for all a > 0 in JR. Let W be a subspace of V
and let 9 : W ~ lR be a linear map such that

g(x) ~ p(x)

for all x E W. Then, there exists a linear extension f : V ~ JR of g (i. e.


f(x) = g(x) for all x E W) which is such that

f(x) ~ p(x)

for all x E V.

Proof: Step 1. Let P denote the collection of all pairs (Y, h), where Y
is a subspace of V containing Wand h : Y ~ lR a linear map which is
an extension of 9 and which is also such that

h(x) ~ p(x)
70 3 Hahn-Banach Theorems

for all x E Y. Clearly P is non-empty, since (W, g) E P. Consider the


partial order defined on P by

(Y, h) -< (Y, h)

if Y c 17 and h is a linear extension of h. Let Q = {(Yi, hi) liE I} be


a chain in P. Define
Y = UiElYi
and let h : Y -----t lR be defined by h(x) = hi(x) if x E Yi. Since Q is a
chain, it is immediate to see that h is well defined and also that it is a
linear extension of each of the hi. Also h(x) ~ p(x) for all x E Y. Thus,
(Y, h) E P and (Yi, hi) ~ (Y, h) for each i E I and thus every chain has an
upper bound. Hence, by Zorn's lemma, P has a maximal element (Z, f).

Step 2. We will show that Z = V, which will complete the proof.


Assuming the contrary, let Xo tt Z. Consider the linear subspace of V
given by
Y = {x + txo I x E Z, t E lR}.
We will define a linear extension h : Y -----t lR of f such that (Y, h) E P,
thus contradicting the maximality of f. Define

h(x + txo) = f(x) + o:t


where 0: will be suitably determined. In order that (Y, h) E P, we need
that
f(x) + o:t ~ p(x + txo)
for all x E Z and for all t E RIft> 0, this reduces to (in view of
(3.1.1))

or, equivalently,
f(x) + 0: ~ p(x + xo) (3.1.2)
for all x E Z. Similarly, by considering t < 0, we deduce that
f(x) - 0: ~ p(x - xo) (3.1.3)

for all x E Z. In other words, it is necessary that 0: be chosen such that

sup[f(x) - p(x - xo)] ~ 0: ~ inf [P(x + xo) - f(x)]. (3.1.4)


xEZ xEZ
3.1 A nalytic Versions 71

But, for all x and y E Z, we have

f(x) + f(y) = f(x + y) :s: p(x + y) < p(x + xo) + p(y - xo)

or, equivalently,

f(y) - p(y - xo) :s: p(x + xo) - f(x).

Hence, it is possible to choose a such that (3.1.4) is true, giving the


desired contradiction, which completes the proof. •

Theorem 3.1.2 (Hahn-Banach Theorem) Let V be a normed linear


space over R Let W be a subspace of V and let 9 : W ~ lR be a
continuous linear functional on W. Then there exists a continuous linear
extension f : V ~ lR of 9 such that

Ilfllv* = Ilgllw*.
Proof: Set p(x) = Ilgllw*llxll. Then p verifies (3.1.1). Also g(x) :s: p(x)
for all x E W. Thus, there exists a linear extension of g, viz. f : V ~ lR
such that, for all x E V, f(x) :s: Ilgllw*lIxll. This implies that f is con-
tinuous and that Ilfllv* :s: Ilgllw*. But, since f = 9 on W, it follows that
we do have equality of the norms of f and g. This completes the proof. •

We will now prove the same result for complex vector spaces.

Proposition 3.1.1 Let V be a normed linear space over Co Let f :


+ ih where 9 and h
V ~ C be a continuous linear functional. Let f = 9
are real valued linear functionals. Then

f(x) = g(x) - ig(ix)

for all x E V and, further, Ilfll = Ilgll.


Proof: Let x E V. Then f(ix) = if(x). Expressing this in terms of the
real and imaginary parts of f, we get

g(ix) + ih(ix) = ig(x) - h(x)

which shows that h(x) = -g(ix). Now, let f(x) = eit9 lf(x)1 where
() E [0, 27T). Then,

If(x)1 e- it9 f(x)


72 3 Hahn-Banach Theorems

since the left extreme of the above relation is real. Thus If (x) I :S: II 9 II II x I
which implies that Ilfll :S: Ilgll. On the other hand,

If(x)12 = Ig(x)12 + Ih(x)12


which yields Ig(x)1 :S: If(x)1 :S: Ilfll Ilxll whence we get Ilgll :S: Ilfll· This
completes the proof. •

Theorem 3.1.3 (Hahn-Banach Theorem) Let V be a normed linear


space over C. Let W be a subspace of V and let 9 : W -+ <C be a
continuous linear functional on W. Then there exists a continuous linear
extension f : V -+ <C of 9 such that

Ilfllv* = Ilgllw*.
Proof: Let 9 = h(x) - ih(ix) where h is the real part of g. We consider
Vasa real normed linear space by st i ti~ ourselves to scalar multi-
plication by reals only. Then, !here exists h : V -+ lR. which is a linear
extension of h and such that Ilhllv* = Ilhllw*. Now set

f(x) = h(x) - ih(ix)

for all x E V. Then, clearly, f (x + y) = f (x) + f (y) and, for real scalars
a, f(ax) = af(x). Now,

f(ix) = h(ix) - ih( -x) = i(h(x) - ih(ix)) = if(x)

and thus f is complex linear as well. Further, by the preceding propo-


sition,
Ilfllv* = Ilhllv* = Ilhllw* = Ilgllw*.
This completes the proof. •

Corollary 3.1.1 Let V be a normed linear space and Xo E V a non-zero


vector. Then, there exists f E V* such that :Ifll = 1 and f(xo) = Ilxoll.
Proof: Let W be the one-dimensional space spanned by Xo. Define
g(axo) = allxoll· Then Ilgllw* = 1. Hence, there exists f E V* such
that Ilfllv* = 1 and which extends g. Hence f(xo) = g(xo) = Ilxoll .•

Remark 3.1.1 If V is a normed linear space and if x and yare distinct


points in V, then, clearly, there exists f E V* such that f(x) i= f(y)
(consider Xo = x - y i= 0). We say that V* separates points of V. •
3.1 Analytic Versions 73

Corollary 3.1.2 Let V be a normed linear space. Let x E V. then

Ilxll = sup If(x)1 = max If(x)l· (3.1.5)


lEV',11/119 lEV', 111119

Proof: Clearly, If(x)1 :S Ilfll Ilxll :S Ilxll when Ilfll :S 1. On the other
hand, by the preceding corollary, there exists f E V* such that Ilfll = 1
and f(x) = IIxll when x is non-zero. Thus the result is established for
non-zero vectors and is trivially true for the null vector. •

Compare the relation

Ilfll sup If(x)l, (3.1.6)


xEV, IixllS:l

which is a definition, with the relation (3.1.5), which is a result of the


theory. In the former, the supremum need not be attained, while in the
latter the supremum is always attained and hence is a maximum.
This is the starting point for the investigation of a very nice property
of Banach spaces called reflexivity.
Let x E V and define

Jx(f) = f(x)

for f E V*. Then, by virtue of (3.1.5), it follows that J x E (V*)* = V**


and that, in fact,
IIJxllv•• = Ilxllv.
Thus J : V --> V** given by x f--+ J x is a norm preserving linear transfor-
mation. Such a map is called an isometry. The map J is clearly injective
and maps V isometrically onto a subspace of V**.

Definition 3.1.1 A Banach space V is said to be reflexive if the canon-


ical imbedding J : V --> V**, given above, is surjective. •

Since V**, being a dual space, is always complete, the notion of re-
flexivity makes sense only for Banach spaces. By applying Corollary
3.1.2 to V*, it is readily seen that the supremum in (3.1.6) is attained
for reflexive Banach spaces. A deep result due to R. C. James is that
the converse is also true: if V is a Banach space such that the supremum
is attained in (3.1.6) for all f E V*, then V is reflexive. We will study
reflexive spaces in greater detail in Chapter 5.
74 3 Hahn-Banach Theorems

Example 3.1.1 Let 1 < p < 00. Let p* be the conjugate exponent of
p (cf. Definition 2.2.1). Let Y E Cp •. We already saw that (cf. Example
2.3.4) the linear functional fy defined on Cp by

2:
00

fy(x) = X iYi
i=l

for x = (Xi) E Cp , is continuous and that, in fact,

Now, let f E C;. Define

where ei is the sequence whose i-th entry is unity and all other entries
are zero. Set f = (h).
Let n be any positive integer. Define

0, if 1::; i ::; nand h = 0,


Xi = { IhI P• / fi, if 1::; i ::; nand h i= 0,
0, if i > n.

Then, since it is a finite sequence, X = (Xi) E Cp and X = 2:~=1 Xiei.


Thus,
n n
f(x) = Lxdi = L IfiI P••
i=l i=l
Consequently
n n
L IhI Ilfllllxll p = Ilfll L
1

P• ::; (lfiI P·) p


i=l i=l

using the definition of the Xi and that of p*. This yields

Since n was arbitrary, we deduce that f E Cp • and that Ilfllp· ::; Ilfll·
For any X = (Xi) E Cp , we have 2:~1 Xiei ----+ X in Cp and so, by the
continuity of f, it follows that

2:
00

f(x) X di
i=l
3.1 A nalytic Versions 75

or, in other words, f = if. Hence,


Ilfll p ' ~ Ilfll < Ilfll p '
as already observed.
Thus every element of the dual space of £p occurs in this fashion and
the map y f--+ fy is an isometry of £p' onto Thus, we can write £;.

using this isometry.


Similarly, we can write

It is easy to see that using these identifications of the dual spaces, the
canonical isometry from £p into £;*
= £p is nothing but the identity map,
which is onto. Thus, the spaces £P' for 1 < p < 00 are all reflexive . •

Remark 3.1.2 Though in the above example, we have not dealt with
the case of real and complex sequence spaces separately, it is customary
to identify the dual of the complex sequence space £p with £p' via the
following relation: if y E £p" then we define fy E by £;
00
fy(x) = L XiYi·
i=l

The mapping y E £p' f--+ fy E £;


satisfies fy+z = fy + fz and fay = "lify.
Such a mapping is called conjugate linear. This identification will be es-
pecially useful in the case of £2 which will be made clear when we study
Hilbert spaces (cf. Chapter 7). •

Example 3.1.2 Proceeding in a manner similar to that in the preced-


ing example, we can show that £i = £00' In other words, given any
continuous linear functional f on £1, we have
00
f(x) = Lxiii
i=l

for all x = (Xi) E £1, where fi = f(ej). Further, setting f = (Ii), we have

Ilfll oo = Ilfll·
76 3 Hahn-Banach Theorems

We now show that there is no similar identification of t~ with £1.


Of course, if Y E £1, the functional fy defined by
00
fy(x) = LXiYi
i=l

for all x = (xd E £00 is continuous and Ilfyll = Ilylh. But there exist
continuous linear functionals on £00 which do not arise this way. So the
identity mapping of £1 which is still the canonical imbedding of £1 into
£i*, is not surjective. Thus the space £1 is not reflexive.
To see this, let G be the subspace of all convergent sequences in £00.
For x = (Xi) E G, define
g(x) = .lim Xi.
2--"00
Then 9 : G -+ lR. is linear and Ig(x)1 ~ Ilxll oo . Thus, 9 is continuous as
well and so, by the Hahn-Banach theorem, can be extended to a con-
tinuous linear functional f on £00' preserving the norm. We claim that
this continuous linear functional cannot be obtained from an element of
£1 by the above outlined procedure.
Assume the contrary and let Y = (Yi) E £1 be such that f = fy.
Consider the sequence {x(n)} in £00 given by
x(n) = {O, 0, ... , 0,1,1,1, ... }
where the 1's start from the n-th entry. Then Ilx(n) 1100 = 1 and x(n) E G.
We have 00

i=n
which is impossible since Y E £1 implies that
00

i=n
as n -+ 00 . •

Remark 3.1.3 Since the canonical map J : V -+ V** is an isometry, it is


injective. Thus, if V is finite dimensional, then dim(V**) = dim(V*) =
dim(V) and so, by dimension considerations, J has to be surjective as
well. Thus, every finite dimensional space is automatically reflexive.
In particular, for any positive integer N, we have that (£~) = £l(,
unlike what happens in the case of sequence spaces, as seen above. The
interested reader can try to prove this directly. •
3.2 Geometric Versions 77

3.2 Geometric Versions

In this section we will study the geometric versions of the Hahn-Banach


theorem which concerns the separation of convex sets by means of hy-
perplanes.

Definition 3.2.1 Let V be a real normed linear space. An affine hy-


perplane is a set of the form

H = {x E V I f(x) = a},
denoted by [f = a], where f is a non-zero linear functional on V . •

Proposition 3.2.1 A hyperplane [f = a] is closed if, and only if, f is


a continuous linear functional.

Proof: Clearly, if f is continuous, then [f = a] is closed. Conversely,


assume that the hyperplane H, given by [f = a], is closed. Then, its
complement He is open and, since f ¢. 0, it is non-empty.(For, if a of- 0,
then 0 E He; if a = 0, and f ¢. 0, there exists x E V such that f(x) of-
and so x E He.)
°
Without loss of generality, assume that Xo E He is such that f(xo) <
°
a. Since He is open, there exists r > such that the open ball centered
at Xo and of radius 2r, denoted B(xo; 2r), is contained in He. Now, for
all x E B(xo; 2r), we have f(x) < a. (If not, there exists Xl E B(xo; 2r)
such that f(xd > a. Let

f(xo) - a
t
f(xo) - f(XI)·

Then 0< t < 1 and, if Xt = tXI +(l-t)xo, then f(xt) = a. But B(xo; 2r)
is a convex set and so Xt E B(xo; 2r), which is a contradiction.)
Thus, for any z E V such that Ilzll ::; 1, we get f(xo + rz) ::; a or,
equivalently,
f(z) ::; a - f(xo) .
r
Thus the image of the unit ball is bounded and so f is continuous. •

Proposition 3.2.2 Let C be an open and convex set in a real normed


linear space V such that 0 E C. For x E V, set

p(x) °
= inf{a > I a-Ix E C}.
78 3 Hahn-Banach Theorems

(The function p is called the Minkowski functional of C.) Then, there


exists M > 0 such that

Os p(x) s lVIllxl1 (3.2.1)

for all x E V. We also have

C = {x E V I p( x) < I}. (3.2.2)

Further, p satisfies (3.1.1).

Proof: Since 0 E C and C is open, there exists an open ball B(O; 2r),
centered at 0 and of radius 2r, contained in C. Now, if x E V, we have
rx/llxli E C and so, by definition, p(x) S ~ Ilxll which proves (3.2.1).
Let x E C. Since C is open, and since 0 E C, there exists c > 0 such
that (1 + c)x E C. Thus, p(x) S (1 + c)-l < 1. Conversely, let x E V
such that p(x) < 1. Then, there exists 0 < t < 1 such that tx
E C.
Then, as C is convex, we also have ttx + (1 - t)O E C, i.e. x E C. This
proves (3.2.2).
If 0: > 0, it is easy to see that p( o:x) = o:p( x). This is the first
relation in (3.1.1). Now, let x and y E V. Let c > O. Then

1 1
--,--,---x E C and () Y E C.
p(x) + c P Y +c

Set
t = p(x) + c
p(x) + p(y) + 2c
so that 0 < t < 1. Then, as C is convex,
1 1 1
t p (x)+c x +(I-t)p(y)+c Y = p(x)+p(y)+2c(x+ Y )EC

which implies that

p(x + y) S p(x) + p(y) + 2c


from which the second relation in (3.1.1) follows since c was chosen
arbitrarily. •

Proposition 3.2.3 Let C be a non-empty open convex set in a real


normed linear space V and assume that Xo ¢ C. Then, there exists
f E V* such that f(x) < f(xo) for all x E c.
3.2 Geometric Versions 79

Proof: Without loss of generality, we can assume that 0 E C. (If 0 ¢ C,


let Xl E C. Then we consider the convex set C - {Xl} which contains
the origin and does not contain Xo - Xl; if f is as in the proposition, we
have f(x - Xl) < f(xo - Xl) for all X E C which yields f(x) < f(xo) for
allxEC.)
Let W be the one-dimensional space spanned by Xo. Define 9 : W ---+
lR by
g(txo) = t.

By definition of the Minkowski functional, since ttxo = Xo ¢ C, we have


that
g(txo) = t 5: p(txo)

for t > O. Since the Minkowski functional is non-negative, this inequality


holds trivially for t 5: 0 as well. Thus, by the Hahn-Banach theorem (cf.
Theorem 3.1.1), there exists a linear extension f of 9 to the whole of V
such that, for all X E V,

f(x) 5: p(x) 5: M\\x\\

(cf. (3.2.1)) which yields \f(x)\ 5: M\\x\\, and so f is continuous as well.


Now, if X E C,

f(x) 5: p(x) < 1 = g(xo) f(xo)

by (3.2.2) and this completes the proof. •

Theorem 3.2.1 (Hahn-Banach Theorem) Let A and B be two non-


empty and disjoint convex subsets of a real normed linear space V. As-
sume that A is open. Then, there exists a closed hyperplane which sep-
arates A and B, i. e. there exists f E V* and 0: E lR such that

f(x) 5: 0: 5: f(y)

for all X E A and y E B.

Proof: Let C = A - B = {x - y \ X E A, y E B}. Since

C = UyEB(A-{y}),

we see immediately that C is both open and convex. Since A and Bare
disjoint, it also follows that 0 ¢ C. Hence, by the preceding proposition,
80 3 Hahn-Banach Theorems

there exists f E V* such that f(z) < 0 for all z E C = A-B. In other
words, f(x) < f(y) for all x E A and y E B. Choose 0: E lR such that

sup f(x) ~ 0: ~ inf f(y).


xEA yEB

This completes the proof. •

Theorem 3.2.2 (Hahn-Banach Theorem) Let A and B be non-


empty and disjoint convex sets in a real normed linear space V. Assume
that A is closed and that B is compact. Then A and B can be separated
strictly by a closed hyperplane, i. e. there exists f E V* , 0: E lR and c > 0
such that
f(x) ~ O:-c and f(y) 2: O:+c
for all x E A and y E B.

Proof: Let TJ > O. Then, if B(O; TJ) is the open ball of radius TJ > 0
centered at 0, then A+B(O; TJ) and B+B(O; TJ) are non-empty, open and
convex. Further, if TJ > 0 is sufficiently small, the two sets are disjoint
as well. If not, there exists a sequence TJn ---t 0 and Xn E A, Yn E B such
that IIx n - Ynll ~ 2TJn . Since B is compact, there exists a subsequence
Ynk which converges to Y E B. This implies then that x nk ---t Y and,
since A is closed, YEA, i. e. yEA n B, which is a contradiction.
Thus, we can choose TJ > 0 such that A + B(O; TJ) and B + B(O; TJ)
are disjoint. Then, by the preceding theorem, there exists f E V* and
0: E lR such that for all x E A, y E Band Zl and Z2 in the closed unit
ball, we have

This implies that

f(x) + ~ Ilfll ~ 0: ~ f(y) - ~llfll.


This proves the result if we set c = 111fll .•
The following corollary is very useful in testing whether a given sub-
space of a normed linear space is dense or not.

Corollary 3.2.1 Let W be a subspace of a real normed linear space V.


Assume that Wi-V. Then, there exists f E V* such that f t=. 0 and
such that f(x) = 0 for all x E W.
3.3 Vector Valued Integration 81

Proof: Let Xo E V\W. Let A = Wand B = {xo}. Then A is closed,


B is compact and they are non-empty and disjoint convex sets. Thus,
there exists f E V* and a E JR such that for all x E W,

f(x) < a < f(xo).


Since W is a linear subspace, it follows that for all >. E JR, we have
>.f(x) < a for all x E W. Now, since 0 E W, we have a > O. On the
other hand, setting>. = n, we get that, for any x E W,
a
f(x) < -
n
whence we see that f(x) ~ 0 for all x E W. Again, if x E W, we also
have -x E Wand so f( -x) ~ 0 as well and so f(x) = 0 for all x E W
and f(xo) > a > O. •

Remark 3.2.1 In case of normed linear spaces over C, the conclusions


of Theorems 3.2.1 and 3.2.2 hold with f being replaced by Re(f), the
real part of f. This follows from Proposition 3.1.1. It is now easy to see
that Corollary 3.2.1 is also valid for complex spaces. For another proof
of this result, see Exercise 3.4. •

Remark 3.2.2 A topological vector space is said to be locally convex


if every point has a local basis made up of convex sets, i. e. every open
neighbourhood of each point contains a convex open neighbourhood of
that point.
The proofs of the geometric versions of the Hahn-Banach theorems
go through mutatis mutandis in the case of locally convex spaces. In
particular, Corollary 3.2.1 is also true for such spaces. For details, see
Rudin [8] . •

3.3 Vector Valued Integration

In this section we will apply the Hahn-Banach theorem to give a meaning


to integration of vector valued functions.
Let us consider the unit interval [0, 1] endowed with the Lebesgue
measure. Let V be a normed linear space over lEt Let 'P : [0,1] ---+ V be
a continuous mapping. We would like to give a meaning to the integral
82 3 Hahn-Banach Theorems

as a vector in V in a manner that the familier properties of integrals are


preserved.
Using our experience with the integral of a continuous real valued
function, one could introduce a partition

a= Xo < Xl < ... < Xn = 1

and form Riemann sums of the form


n
Z)Xi - xi p(~i)
i=l

where ~i E [Xi-I, Xi] for 1 ::; i ::; n, and define the integral (if it exists)
as a suitable limit of such sums. Assume that such a limit exists and
denote it by y E V. Let f E V*. Then, by the continuity and linearity
of f, it will follow that f (y) will be the limit of the Riemann sums of
the form
n
Z)Xi - xi )f( p(~i)).
i=I

But since f 0 <p : [0,1] --? lR. is continuous, the above limit of Riemann
sums is none other than

Thus the integral of <p must satisfy the relation

(3.3.1)

for all f E V*.


Notice that since V* separates points of V (cf. Corollary 3.1.1 and
Remark 3.1.1), such a vector, if it exists, must be unique. We use this
to define the integral of a vector valued function.

Definition 3.3.1 Let V be a real normed linear space and let <p : [0,1] --?

lR. be a continuous mapping. The integral of <p over [0,1], denoted

is that vector in V which satisfies (3.3.1) for all f E V* . •


3.3 Vector Valued Integration 83

Proposition 3.3.1 Let r.p : [0,1] --? V be a continuous mapping into a


real normed linear space V. Then the integral of r.p over [0, 1] exists.

Proof: Since [0,1] is compact, the set H which is the closure (in V) of
the set H which is the convex hull of r.p([0, 1]) (i.e. the smallest convex
set containing r.p([0, 1])), is compact.
Let L be an arbitrary finite collection of continuous linear functionals
on V. Define

EL = {Y E HI f(y) = 10 1
f(r.p(t)) dt for all f E L}.
It is immediate to see that EL is a closed set.

Step 1: For any such finite collection L of continuous linear functionals,


EL # 0. To see this, let L = {II,··· ,Jk}. Define A : V --? ffi.k by

A(x) = (lI(x),···, Jk(x)).


Then A is a continuous linear transformation and so K = A(H) is a
compact and convex set. If (tl,··· ,tk) tf. K, then, by the Hahn-Banach
theorem (cf. Theorem 3.2.2), we can find constants Cl,···, Ck such that
k k
L CiUi < L Citi
i=l i=l

for all (Ul,· .. , Uk) E K. In particular, for all t E [0,1]' we have


k k
LCdi(r.p(t)) < LCiti.
i=l i=l

Integrating this inequality over [0, 1], we get


k k

L qmi < L qti


i=l i=l

where
mi = 10 1
Ji(r.p(t)) dt.

In other words, if (tl,···,tk) tf. K, then (tl,···,tk) # (ml,···,mk).


Thus, (ml,···, mk) E K. Thus, there exists y E H such that, for
1 :s: i :s: k, we have
mi = Ji(y).
84 3 Hahn-Banach Theorems

This means that y EEL, i.e. EL is non-empty.

Step 2. Let I be a finite indexing set and let Li be finite collections


of elements in V* for each i E I. Then L = UiEIL i is still finite and
further, since it is easy to see that

it follows from the previous step that the class of closed sets

{EL I L a finite subset of V*}

has finite intersection property. Since H is compact, it now follows that

nL , finite subset of v. EL =I- 0.


In particular, there exists y such that y E E{f} for every f E V*, i. e. y
satisfies
f(y) = 10 1
f('P(t)) dt

for every f E V*. Thus y = fol 'P(t) dt. This completes the proof. •
Proposition 3.3.2 Let V be a real normed linear space and let 'P
[0,1] --+ V be continuous. Then

(3.3.2)

Proof: By Corollary 3.1.1, there exists f E V* such that Ilfll = 1 and


f(y) = Ilyll where
y = 10 1
'P(t) dt.

Thus,

IIfo 'P(t) dtll


1
= f (fo1'P(t) dt) fol f('P(t)) dt
1
:S fo If('P(t))1 dt < fol 11'P(t)11 dt.
This completes the proof. •

Remark 3.3.1 Let 'P : [a, b] --+ V be a continuous mapping. Define


'lj; : [0,1] --+ V by 'lj;(t) = 'P(a + t(b - a)). Then we can define

lb 'P(t) dt = (b - a) 10 1
'lj;(t) dt.
3.4 An Application to Optimization Theory 85

It is easy to verify that for all f E V*, we have

Again the result of Proposition 3.3.2 remains valid in this case as well.
Assume now that cp : [0, (Xl) ----t V is continuous. Assume further that
the limit

ior <p(t) dt
A
lim
A--+OO

exists, i. e., for any sequence An ----t (Xl (as n ----t (Xl), we have that the
limit
An
lim
n--+oo
lo0 cp(t) dt

exists and is independent of the sequence chosen. Then we define

roo cp(t) dt
io
= lim
A--+OO
r cp(t) dt
io
A

and again it follows that, for all f E V*, we have

The result of Proposition 3.3.2 continues to hold. We can define integrals


over other infinite intervals, if they exist, in a similar manner . •

3.4 An Application to Optimization Theory

We conclude this chapter with an application of the Hahn-Banach the-


orem to optimization theory.

Definition 3.4.1 A cone in a real vector space V is a set C such that:


(i) 0 E C;
(ii) if x E C and A ~ 0, then AX E C. •

Lemma 3.4.1 Let Vi, 1:S; i :s; n be elements in a normed linear space
V. Define

Then C is a closed convex cone.


86 3 Hahn-Banach Theorems

Proof: Step 1. Clearly, C is convex since, for any 0 < t < 1, we have
n n n

i=l i=l J=l

and so, if Ai 2: 0 and f.,li 2: 0 for all i, we also have tAi + (1 - t)f.,li 2: o.
Also if x E C, it is obvious that AX E C for any A 2: O. Thus, C is a
convex cone.

Step 2. Assume that the Vi are linearly independent. Let {w n } be a


sequence in C and assume that Wm ----+ W in V. If

w = span{v1,···, vn },

then W is a finite dimensional subspace of V and is hence closed as well.


Thus W E W. If Wm = L:~1 Aivi with Ai 2: 0 for all 1 ::; i ::; n and for
all m, then Ai ----+ Ai 2: 0 for each 1 ::; i ::; nand W = L:~1 Aivi. Thus
wEe and so C is closed.

Step 3. Assume now that the Vi are linearly dependent. Then there
exists a linear relation between them and so we can find scalars CYi such
that L:~=1 CYiVi = 0 and such that the set

J = {i I 1 ::; i ::; n, CYi < O}


is non-empty.
Let vEe be such that V = L:~=1 AiVi with Ai 2: 0 for all 1 ::; i ::; n.
Then v = L:~=1 (Ai + tCYi)Vi for any t E R Define

t = min {- Ai} 2: O.
tEJ CYi

Then, for all 1 ::; i ::; n, we have Ai + tCYi 2: 0 and at least one of them
must vanish. Then

C= UjEJ {v = L AiVi I Ai 2: 0 for all i E I\{j}}


iEJ\ {j}

where I = {1, 2,· .. ,n}. Each set in the union described on the right-
hand side is a cone but generated by fewer elements from V. Iterating
this procedure, we can ultimately write C as the finite union of cones
3.4 An Application to Optimization Theory 87

each generated by a linearly independent set of vectors and hence, by


the preceding step, each of these cones will be closed as well. Hence C,
being the finite union of closed sets, is closed. This completes the proof.

Theorem 3.4.1 (Farkas-Minkowski Lemma) Let V be a real reflex-
ive Banach space and let {fo, ft" .. ,fn} be elements of V* such that if
for some x E V we have fi(X) 2: 0 for all I :S i :S n, then fo(x) 2: 0 as
well. Then, there exists scalars .\i 2: 0, I:S i :S n such that
n

Proof: Let

which is a closed convex cone in V* by the preceding lemma. Assume


that fo ¢ C. Then, by the Hahn-Banach Theorem (cf. Theorem 3.2.2)
there exist <p E V** and a E lR such that

<pUo) < a < <pU)

for all f E C. Since 0 E C, it follows that a < O. Thus <pUo) < 0 as


well.
Now, since V is reflexive, there exists x E V such that <p = J x and
so fo(x) < O. On the other hand, since C is a cone, for all .\ > 0, and
for all f E C, we have .\f E C and so <p(.\f) > a or, <pU) > al.\ whence
we deduce, on letting .\ tend to infinity, that <pU) 2: 0, i.e. f(x) 2: 0
for all f E C. In particular fi(X) 2: 0 for all I :S i :S n while fo(x) < 0,
which is a contradiction. Thus fo E C and the proof is complete . •
The Farkas-Minkowski lemma is a key step in the proof of the Kuhn-
Tucker conditions which play the same role in characterizing minima in
the presence of constraints in the form of inequalities as that played by
Lagrange multipliers in characterizing minima in the presence of con-
straints in the form of equalities. While the Kuhn-Tucker conditions
are necessary in general 'nonlinear programming', they are necessary
and sufficient in 'convex programming' i.e. when the functional to be
minimized and the constraints are all given by convex functions.
Let V be a real normed linear space and let J : V ---+ lR be a given
functional. Let K c V be a closed and convex subset. Then, if J attains
88 3 Hahn-Banach Theorems

a minimum over K at u E K and if J is differentiable at u, a necessary


condition is that
J'(u)(v - u) ~ 0
for all v E K (cf. Exercise 2.44). We would like to generalize this to
sets K which are not necessarily convex. To this end, we introduce the
following definition.

Definition 3.4.2 Let V be a real normed linear space and let U c V


be a non-empty subset. Let u E U. Then, the tangent cone, denoted
C(u), at u is the union of the origin and the set of all vectors W E V
such that
(i) there exists a sequence {ud in U, Uk i- u for all kEN and
limk---+oo Uk = u;
(ii)
.
11m Uk - U
-;-:--'----:-;-
k---+oo iiuk - uii
Remark 3.4.1 The second condition in the above definition may be
written, in an equivalent fashion, as follows:

where 15k ----+ 0 as k ----+ 00 . •

Remark 3.4.2 It is clear that C(u) is a cone (cf. Definition 3.4.1) since
W E C(u) implies that AW E C(u) as well, for any A > 0 (with the
same associated sequence {Uk})' This is a cone (which is not necessarily
convex) with its vertex at the origin. Its translate

u+C(u) = {u+wiWEC(u)},

is a cone with vertex at u. This translated cone contains the (half)


tangents at u of all curves in U which pass through u, as it is easy to see
from the Taylor expansion (at u) of the function describing the curve .

Proposition 3.4.1 Let U be a non-empty subset of a real normed linear
space V and let u E U. Then, C(u) is a closed cone.

Proof: Let Wn E C(u) and let Wn ----+ W in V. Without loss of gener-


ality, we can assume that wi- 0 (since the origin is always in C(u), by
3.4 An Application to Optimization Theory 89

definition) and hence that Wn I- 0 for all n. There exist uk E U such


that uk I- u for all n and such that uk ---t u as k ---t 00. Further,

where 15k ---t 0 as k ---t 00.


Choose a sequence {En} of positive reals which converges to zero as
n ---t 00. Then, we can find positive integers k(n) such that

Iluk(n) - ull < En, and Ili5k(n) I :S en·


Consider the sequence {uk(n)}. Clearly, uk(n) ---t u and uk(n) I- u for all
n. Further,

Uk(n) = u + Iluk(n) - ull [11:11 + (i5 k(n) + II:: II - 11:11)]


which shows that wE C(u) since

i5 n
+ IIWn II - Ilwll
( Wn W )
''In = ken) ---t 0

given that Wn ---t w. This completes the proof. •


Proposition 3.4.2 Let J : U c V ---t lR be a functional defined on a
set U of a real normed linear space V. Assume that J attains a relative
minimum at u E U and that J is differentiable at u. Then
J'(u)(v - u) ~ 0
for all v E u + C(u).
Proof: Let v E u + C(u). Then W = v - u E C(u). Let {ud be a
sequence in U associated to W as in the definition of C(u). Then, since
J is differentiable at u, we have

Iluk - ull (II!II J'(u)w + J'(u)i5k + ek)


where 15k ---t 0 and ek ---t 0 as k ---t 00. Setting "lk = Ilwll(J'(u)i5 k + ek),
we see that "lk ---t 0 as well. Since J attains a relative minimum at u, it
follows that

o< = Iluk - ull ,


J(Uk) - J(u) Ilwll (J (u)w + "lk)
90 3 Hahn-Banach Theorems

from which it immediately follows that J'(u)w ~ 0, which completes the


proof. •

The above result can be used to derive optimality conditions when a


functional J is being minimized under constraints. We consider a finite
set of functionals 'Pi : V ---+ lR for 1 ~ i ~ m and set

u = {v E V I 'Pi(U) ~ 0, 1 ~ i ~ m}. (3.4.1)

Of particular interest is the case when the functionals 'Pi are affine linear,
z. e. there exist fi E V* and vectors di E V for 1 ~ i ~ m such that

(3.4.2)

for 1 ~ i ~ m. In this case, we have a simple characterization of the


tangent cone.

Proposition 3.4.3 Let U be as given by {3.4.1} and let the constraints


'Pi be affine linear, given by {3.4.2}. Then, for any U E U,

C(U) = {w E V I fi(W) ~ 0, i E J(u)} (3.4.3)

where
J(u) = {i 11 ~ i ~ m, 'Pi(U) = O}.

Proof: Notice that (cf. Exercise 2.40) P~( ) = Ii- If i E J(u), then

°
'Pi attains its maximum over U at u. Then, by Proposition 3.4.2,
P~( )( ) ~ for all W E C(u).
°
Conversely, if w =1= 0 satisfies fi (w) ~ for all i E J (u), set Uk =
u + EkW where {Ed is a sequence of positive reals converging to zero.

° °
then Uk =1= u and Uk ---+ u. If i ¢ J (u), then 'Pi ('U) < 0, and so, by
continuity, 'Pi(Uk) < for large enough k. If i E J(u), then 'P(u) =
and so

Thus, for k large enough, we have that Uk E U. Finally we see immedi-


ately that
Uk -u W

Iluk - ull Ilwll'


Thus, it follows that WEe ( u). This completes the proof. •
3.4 An Application to Optimization Theory 91

Theorem 3.4.2 (Kuhn-Tucker Conditions) Let V be a real, reflex-


ive Banach space. Let 'Pi, 1 :S i :S m be as in {3.4.2} and let U be
as in {3.4.1}. Let J : V ---+ lR be a functional which attains a relative
minimum at u E U. Assume that J is differentiable at u. Then, there
exist constants Ai (u) such that

o (3.4.4)

Proof: By Propositions 3.4.2 and 3.4.3, we have that for all w such
that P~ (u)w :S 0, i E I (u), we have J' (u)w 2: O. Thus, by the Farkas-
Minkowski lemma, there exist Ai (u) 2: 0 for i E I (u) such that

J' (u ) = - L Ai (U ) P~ (u ) .
iEI(u)

Setting Ai(U) = 0 for all i ~ I(u), we get (3.4.4). This completes the
proof. •

The above theorem can be generalized to cases when the 'Pi are not
affine. In this situation, in addition to differentiability at u, we need to
assume another technical condition of 'admissibility' on the constraints
at u. In particular, when the constraints 'Pi, 1 :S i :S m are all convex,
the admissibility condition reads as follows:

• either, all the 'Pi are affine and the set U given by (3.4.1) ·is non-
empty;

• or, there exists an element v* E V such that 'Pi (v*) :S 0 for all
1 :S i :S m and 'Pi (v*) < 0 whenever 'Pi is not affine linear.

If J is differentiable at u and the constraints are differentiable and


admissible (at u), then (3.4.4) is a necessary condition for u to be a
relative minimum of J at u. In addition, if J and the constraints 'Pi are all
convex, then (3.4.4) is both necessary and sufficient. Interested readers
can find further details in the book Introduction a l'analyse numerique
matricielle et a l'optimisation by P.G. Ciarlet (Masson, Paris, France,
1982; English translation, Cambridge University Press, Cambridge, UK,
1989).
92 3 Hahn-Banach Theorems

3.5 Exercises

3.1 Give an example to show that the functional whose existence is


guaranteed by Corollary 3.1.1, is not unique.

3.2 A normed linear space V is said to be strictly convex if for x and


y E V such that x i= y, Ilxll = Ilyll = 1, we have

Show that the spaces £1 and £00 are not strictly convex, while £2 IS
strictly convex.

3.3 If V is a normed linear space such that V* is strictly convex, show


that given a subspace W of V and a continuous linear functional f on
W, its Hahn-Banach extension to all of V is unique. In particular, show
also that the functional whose existence is guaranteed by Corollary 3.1.1,
is unique.

3.4 Use Corollary 3.1.1 to prove Corollary 3.2.1. (Hint: consider the
quotient space V jW.)

3.5 Let V be a normed linear space and let W be a subspace of V. Let


X be a finite dimensional space and let T : W ---* X be a continuous
linear transformation. Show that there exists a continuous linear trans-
formation T: V ---* X which extends T.

3.6 Let V be a normed linear space and let W be a finite dimensional


subspace. Show that there exists a closed subspace Z of V such that

V = WEBZ.
3.7 (a) Let CO denote the space of all real sequences which converge to
zero, equipped with the norm 11.1100. Prove that

(b) Show that


£! = foo.
(c) For any positive integer N, show that

(£~) = ff.
3.5 Exercises 93

3.8 Let V be a normed linear space and let W be a subspace of V.


Define
~ = {g E V* I g(x) = 0 for all x E W} .
(a) Show that W ~ is a closed subspace of V*.
(b) Let f E V*. Show that

d(j, ~) = Ilfl w Il w *
where fl w is the restriction of f to Wand

d(j, ~) = inf
gEW.l
Ilf - gllv*.

(c) Let f E W* and let f E V* be an extension of f preserving its norm.


Define (T : W* -+ V* /W ~ by

Show that (T is well defined and that it is an isometric isomorphism of


W* onto ~.
(d) Let 7r : V -+ V/W be the canonical quotient map x ........ x + W.
For f E (V /W)*, define T(j) = f 0 7r E V*. Show that the range of T
is equal to ~ and that the map T : (V/W)* -+ ~ is an isometric
isomorphism.

3.9 Let V be a normed linear space and let Z be a subspace of V*.


Define
~ = {x E V I g(x) = 0 for all 9 E Z}.
(a) Show that ~ is a closed subspace of V.
(b) Show that, if W is a subspace of V, then

(c) If Z is a subspace of V*, show that

(d) If V is reflexive and if Z is a subspace of V*, show that

(Z ~)~ =
-
Z.
94 3 Hahn-Banach Theorems

3.10 Let r.p and 'ljJ be continuous mappings from [0,1] into a real normed
linear space V. For arbitrary scalars a and {3, show that

11 (ar.p(t) + {3'ljJ(t)) dt = a 11 r.p(t) dt + {3 11 'ljJ(t) dt.

3.11 Let V and W be normed linear spaces and let A E £(V, W). Let
r.p: [0,1] -----> V be a continuous mapping. Show that

3.12 Let r.p : [a, b] -----> V be a continuous mapping and let c E (a, b).

lb l lb
Show that
r.p(t) dt =
c
r.p(t) dt + r.p(t) dt.

3.13 Let r.p : lR -----> V be a continuous mapping. Let a, b, hER Show


that

I a
b
r.p(t) dt = Ib+h 'ljJ(t) dt
a+h

where 'ljJ(t) = r.p(t - h).

3.14 Let r.p : [0,1] -----> V be a continuous mapping and let t E [0,1). Show
that
lim -h
~O
11t+h
t
r.p(s) ds = r.p(t).

3.15 Let V be a normed linear space and let T E £(V) such that IITII ::;
°
1. Let >. > and let x E V. Define

R(x) = 1 00
e-AtT(x) dt.

Show that R E £(V) and that

1
IIRII < ~.

3.16 (a) Let V be a real Banach space. Let k > 0. Define

x = {u E C([O, (0); V) I supe-ktllu(t)llv <


t~O
oo}.
3.5 Exercises 95

Define
Ilullx = supe-ktllu(t)llv.
t20
Show that X is a Banach space with this norm.
(b) Let f : V --) V be a mapping. Assume that there exists L > 0 such
that
Ilf(u) - f(v)llv ::; Lllu - vllv
for all u and v E V. For u E X, define

F(u)(t) = Uo + fat f(u(s)) ds

where Uo E V is a fixed vector. Show that F(u) E X and that, for any
u and v EX, we have
L
IIF(u) - F(v)llx ::; "k 11u - vllx.
(c) Deduce that, for any Uo E V, there exists a unique u E C([O, (0); V)
such that
u(t) = Uo + fat f(u(s)) ds

which is also the solution of the initial value problem

~~(t) = f(u(t)), t > 0


u(O) = Uo.

3.17 Let V be any vector space and let fo, h, ... , f n be linear functionals
on V. Let Ker(fi) denote the kernel of h 0::; i ::; n. Assume that

Show that there exist scalars ai, 1::; i ::; n such that
N
fo = I:adi.
i=l

(Hint: Consider the image of the map A as defined in Section 3.3 and
apply Corollary 3.2.1 to its image.)

3.18 Let V be a real normed linear space and let K c V be a compact


and convex subset. Let C c V* be a convex cone. Assume that for
96 3 Hahn-Banach Theorems

each f EC, there exists a vector x E K (depending on f), such that


f(x) :::: O. Show that there exists x E K such that f(x) :::: 0 for all
f E C. (Hint: For f E C, consider
Kf = {x E K I f(x) :::: O}

which is a non-empty closed set. Show that this collection of closed sets
has finite intersection property.)
Chapter 4

Baire's Theorem and Applications

4.1 Baire's Theorem

Baire's theorem is a result on complete metric spaces which will be used


in this chapter to prove some very important results on Banach spaces.

Theorem 4.1.1 (Baire's Theorem) Let (X, d) be a complete metric


space. Let { n ~=l be a collection of open dense sets. Then

is also dense.
Proof: Let W be any non-empty open set in X. We need to show that
the intersection nn Vn has a point in W.
Since VI is dense, it follows that W n VI t- 0. Thus, we can find a
point Xl in this intersection (which is also an open set). Hence, there
exists rl > 0 such that the open ball B(XI; rd, with centre at Xl and
radius rl, is contained in W n VI. By shrinking r if necessary, we may
also assume that the closure of this ball B(XI; rd is also a subset of
W n Vi and that 0 < rl < l.
If n ?: 2, assume that we have chosen Xn-l and rn-l suitably. The
denseness of Vn shows that Vn n B (Xn-l; r n-l) I- 0 and so we can choose
Xn and rn such that 0 < rn < l/n and

Thus we now have a sequence of points {xn} in X. If i > nand j > n,


it is clear that both Xi and Xj both lie in B(xn; rn) and so d(xi' Xj) <
2rn < 2/n and thus, the sequence {xn} is Cauchy. Since X is complete,
it follows that there exists X E X such that Xn ---t x.
98 4 Baire's Theorem and Applications

Now, for i > n, it follows that Xi E B(xn; rn) and so x E B(xn; rn)
for all n and so x E Vn for all n. Also x E B(X1; r1) and so x E W as
well. This completes the proof. •

Remark 4.1.1 The main significance of this theorem is that, in partic-


ular, the intersection of a countable collection of non-empty open dense
sets in a (non-empty) complete metric space is, again, non-empty. •

Remark 4.1.2 An equivalent way of stating this theorem is that a com-


plete metric space cannot be the countable union of nowhere dense sets.
In the literature, countable unions of nowhere dense sets are said to be
of the first category and all other sets are said to be of the second cate-
gory. Thus, Baire's theorem states that every complete metric space is
of second category and so, often in the literature, it is referred to as the
Baire category theorem . •

A countable intersection of open sets in a topological space is l~


a G8 set. Since the countable union of countable sets is again countable,
the following corallary is an immediate consequence of Baire's theorem.
Corollary 4.1.1 In a complete metric space, the intersection of any
countable collection of dense G8 sets is again a dense G8 set . •

Corollary 4.1.2 In a complete metric space which has no isolated points,


a countable dense set can never be a G8 set.

Proof: Let E = {x ~l be a countable dense set in a complete metric


space X. If it is a G8 set, then, there exist open sets Vn such that

Clearly, since E C Vn for each n, each set Vn is dense as well. Set

Then each Wn is also dense and open, since X has no isolated points.
But n~=l Wn = 0, which contradicts Baire's theorem. Hence the result .

4.2 Principle of Uniform Boundedness

As a first application of Baire's theorem we prove the following result.


4.2 Principle of Uniform Boundedness 99

Theorem 4.2.1 (Banach-Steinhaus Theorem) Let V be a Banach


space and let W be a normed linear space. Let I be an arbitrary indexing
set and, for each i E I, let Ti E C(V, W). Then, either there exists
M > 0 such that
IITi I ::; M, for all i E I
or,
sup IITi(X)11 = 00
iEI

for all x belonging to some dense G8 set in V.

Proof: For each x E V, set

'P(x) sup IITi(X)II.


iEI

Let
Vn = {xEVI'P(x»n}.
Since each Ti is continuous and since the norm is a continuous function,
it is easy to see that Vn is open for each n.
Assume now that there exists N such that VN fails to be dense in V.
Then, there exists Xo E V and r > 0 such that x + Xo rt VN if Ilxll < r.
(In other words, there is an open ball B(xo; r), centered at Xo and of
radi us r, which does not intersect V N .) This implies that 'P (x + xo) ::; N
for all such x and so, for all i E I,

Thus, if Ilxll::; r/2, we have, for all i E I,

It follows from this that, for all i E I,

4N
IITil1 ::; - r
and so the first alternative holds with M = 4N/r.
The other possibility is that each Vn is dense, and so, V being com-
plete, by Baire's theorem, nn Vn is a dense G8 and for each x E nn Vn ,
we have that 'P(x) = 00. This completes the proof. •

An immediate consequence of the above result is the following.


100 4 Baire's Theorem and Applications

Corollary 4.2.1 If V is a Banach space and W is a normed linear


space and if Ti E £(V, W) for an indexing set I such that

sup IITi(X)11 < 00


iEI

for every x E V, then there exists M > 0 such that


IITil1 :::; M for each i E I . •
In other words, if the Ti are all pointwise bounded, then they are
uniformly bounded in norm. For this reason, the Banach-Steinhaus
theorem is also referred to as the principle of uniform boundedness.
Corollary 4.2.2 Let V be a Banach space and let W be a normed linear
space and let {Tn} be a sequence of continuous linear transformations
from V into W such that, for each x E V, the sequence {Tnx} is
convergent in W. Define

T(x) = n-->CXl
lim Tn(x).

Then T E £(V, W) and

IITII :::; liminf


n-->CXl
IITnll· (4.2.1)

Proof: It is clear that T is linear. By the Banach-Steinhaus theorem,


it follows that {IITn II} is a bounded sequence. Let IITn II:::; C for all n.
Then, for each x E V and for all n, we have

Passing to the limit as n --+ 00, we deduce that

IIT(x)11 :::; Cllxll


for each x E V and so T E £(V, W). The relation (4.2.1) follows from
the inequality

for each x E V. •
Corollary 4.2.3 Let V be a Banach space and let B c V be a subset.
Assume that
f(B) = {f(x) I x E B}
is a bounded subset of the scalar field for each f E V*. Then B is a
bounded subset of V.
4.3 Application to Fourier Series 101

Proof: For x E B, consider the functional J x E V** defined by Jx(f) =


f(x) for f E V*. Then, we know that (cf. Corollary 3.1.2)

IIJxllv** = Ilxllv.
Taking B as the indexing set and V* as the Banach space, it follows
from the Banach-Steinhaus theorem that IIJxl1 is uniformly bounded in
V**, which is the same as saying that B is bounded in V. •

Remark 4.2.1 To check the bounded ness of a set V in a Banach space,


it thus suffices to verify that its image under each continuous linear
functional is bounded. In finite dimensional spaces, this is what we
precisely do. We check that the image under each coordinate projection
is bounded and these form a basis for the dual space. In the language
of weak topologies (to be studied later), the conclusion of the preceding
corollary is read as 'weakly bounded implies bounded' . •

4.3 . Application to Fourier Series

Let f : [-7f, 7f] ---t lR be an integrable function. We can write its formal
Fourier series (in exponential form) as follows:

2:
00

f(t) rv j(n) exp(int)


n=-oo

where
j(n) -1 j7f
f(s) exp( -ins) ds (4.3.1)
27f -7f
are the Fourier coefficients of f. The first question that springs to the
mind is "in what sense does the Fourier series of a function represent
the function?" In particular, does the Fourier series of a continuous 27f-
periodic function f converge to f (t) at every point t E [-7f, 7f l? This is
relevant since each term in the Fourier series is a continuous 27f-periodic
function. Unfortunately, the answer is 'No!'. It was Dirichlet who first
established (around 1829, nearly seven decades after a lengthy contro-
versy began in Europe - about the validity of representing a function
in terms of sines and cosines - and raged through the latter half of the
eighteenth century) the sufficient conditions for the Fourier series of a
function to converge to its value at a point. This was later strengthened
by Jordan. In fact the study of the validity of Fourier expansions led to
102 4 Baire's Theorem and Applications

a lot of mathematical development such as making precise the notion of


a function, Cantor's theory of infinite sets, the theories of integration by
Riemann and by Lebesgue and theories of summability of series.
In this section, we will use the Banach-Steinhaus theorem to show
that there exists a very large class of continuous 27r-periodic functions
whose Fourier series fail to converge on a very large set of points.
To study the convergence of Fourier series, we need to study its
partial sums:
m

sm(f)(t) = L j(n) exp(int).


n=-m
Using the formula for the Fourier coefficients (4.3.1), this becomes

sm(f)(t) = - 1 17r f(s)Dm(t - s) ds (4.3.2)


27r -7r
where
m

Dm(t) = L exp(int).
n=-m
The function Dm is called the Dirichlet kernel. If we multiply Dm(t) suc-
cessively by exp(it/2) and exp( -it/2) and subtract, we see immediately
that

sin(m+~)t ·f t r~ 2k 7r, lor


c ..
mteger k
D m (t) -_ { . I ,I
sm 2
some non-negatIve
2m + 1, if t = 2k7r for some non-negative integer k.

Proposition 4.3.1 We have

(4.3.3)

Proof: For t E JR, we have I sin tl :S It I and so

1 71"
-71" IDn(t)1 dt 2: 4 Jo
r Isin(nt+ ~)tl
dt
(n+~)71" Isin tl
= 4l - - dt
o t

> 4Ln lk7r


k=l (k-l)71"
Isin tl
--dt
t
4.3 Application to Fourier Series 103

> 42: jk1r


n

k=l (k-l)7I"
Isin tl
--dt
k1r
8 1
-2:-
n

7r k
k=l

from which (4.3.3) follows immediately. •


Proposition 4.3.2 Let V = Cper [-7r,7rJ, the space of continuous 27r-
periodic functions with the usual sup-norm (denoted II . 1100) and define
cPn : V --t IR by

where sn(f) is the n-th partial sum of the Fourier series of f. Then cPn
is a continuous linear functional on V and

1 171"
IlcPnll = 27r -71" IDn(t)1 dt. (4.3.4)

Proof: On one hand,

cPn(f) = -1 171" f(t)Dn(t) dt


27r -71"
(cf. (4.3.2); Dn is an even function). Thus,

and so
1 171"
IlcPnll ::; 27r -71" IDn(t)1 dt.
Now, let En = {t E [-7r,7rll Dn(t) ~ o} . Define

fm(t) = 1 - md(t, En)


1 + md(t, En)

where d(t, A) = inf{lt-sll sEA} is the distance of t from a set A. Since


d(t, A) is a continuous function (cf. Proposition 1.2.3), fm E Cper [-7r, 7r],
(it is periodic since Dn is even and so En is a symmetric set about the
origin). Also IIfmlloo ::; 1 and fm(t) --t 1 if tEEn while fm(t) --t -1 if
t E E~. By the dominated convergence theorem, it now follows that
104 4 Baire's Theorem and Applications

from which (4.3.4) follows . •

We now apply the Banach-Steinhaus theorem to the space V =


Cper [-1f, 1f] and the collection of continuous linear functionals {<Pn}.
Since, by Propositions 4.3.1.and 4.3.2, we have II<Pnll ---> 00 as n ---> 00, it
follows there exists a dense G 8-set (of continuous 21f-periodic functions)
in V such that the Fourier series of all these functions diverge at t = O.
We could have very well dealt with any other point in the interval [-1f, 1f]
in the same manner.
By another application of Baire's theorem, we can strengthen this
further.
Let Ex be the dense G8-set of continuous 21f-periodic functions in V
such that the Fourier series of these functions diverge at x. Let {Xi} be
a countable set of points in [-1f, 1f] and let

Then, by Baire's theorem, E is also a dense G8-set (cf. Corollary 4.1.1).


Thus for each fEE, the Fourier series of f diverges at Xi for all 1 S
i S 00. Define
s*(f; x) = sup ISn(f)(x)l·
n
Hence {x I s*(f;x) = oo} is a G,,-set in (-1r,1r) for each f. If we choose
the Xi above so that {Xi} is dense (take all rationals, for instance in
( -1f, 1f)) then we have the following result.
Proposition 4.3.3 The set E c V is a dense G8-set such that for all
fEE, the set Qf C (-1f, 1f) where its Fourier series diverges, 'is a dense
G8-set in (-1f, 1f) . •
By virtue of Corollary 4.1.2, it follows that there exist uncountably many
21f-periodic continuous functions on [-1f, 1f] whose Fourier series diverge
on an uncountable subset of ( -1f, 1f).

4.4 The Open Mapping and Closed Graph Theorems

In this section, we will study two more important consequences of Baire's


theorem which will have a lot of applications.
We begin by setting up some notation. If A and B are subsets in a
vector space V, we set

A+B {x + y I x E A, y E B}.
4.4 The Open Mapping and Closed Graph Theorems 105

Similarly, if ,\ is a scalar, we set

,\A = {Ax Ix E A}.

If A is a convex set, then we have A + A = 2A, for if x E A, then


2x = x + x and so 2A C A + A. On the other hand, if x and yEA, by
convexity, we have ~(x + y) E A and so x + Y E 2A. Thus, A + A c 2A.

Proposition 4.4.1 Let V and W be Banach spaces. Let T E £(V, W)


be surjective. Then, there exists a constant c > 0 such that

Bw(O; c) c T(Bv(O; 1)). (4.4.1)

where Bv and Bw denote open balls in the spaces V and W respectively.

Proof: Step 1. We will first show that there exists a constant c > 0
such that
Bw(O; 2c) c T(Bv(O; 1)). (4.4.2)
Set Xn = nT(Bv(O; 1)). Then each Xn is a closed set. Since T is linear
and surjective, it follows immediately that W = U~=l X n . Hence, W
being complete, it follows from Baire's theorem (cf. Remark 4.1.2) that
there exists n such that Xn has non-empty interior. Hence by change of
scale, it follows that
(T(Bv(O; l))t 1= 0.
Thus, there exists Yo E Wand c > 0 such that
Bw(Yo;4c) c T(Bv(O; 1)).

In particular Yo E T(Bv(O; 1)) and, by symmetry, so does -Yo. Since


any element of Bw(yo; 4c) may be written as yo+z where z E Bw(O; 4c),
any such z can be written as z = (Yo + z) + (-yo) and so it follows from
the above that

Bw(O; 4c) c T(Bv(O; 1)) + T(Bv(O; 1)).


But T(Bv(O; 1)) is convex and so the set on the right-hand side in the
above inclusion is, in fact, 2T(Bv(O; 1)) and so we have (4.4.2).

Step 2. We now prove (4.4.1). Let y E Bw(O; c). We need to find


x E Bv(O; 1) such that T(x) = y. Let E > O. There exists z E V such
that Ilzllv < 1/2 and Ily - T(z)llw < E by virtue of (4.4.2) (applied
106 4 Baire's Theorem and Applications

to 2y). Set E = c/2 and let Zl E V be such that IlzIilv < 1/2 and
IIT(Zl) - yllw < c/2.
We can iterate this procedure. By another application of (4.4.2) ( to
4(T(zI) - y)) we can find Z2 E V such that
1 c
IIz211v < 4' IIT(Zl + Z2) - yllw < 4·
Thus, we can find, by repeated use of (4.4.2), a sequence {zn} in V such
that
1 c
Ilznllv < 2n ' IIT(Zl + ... + zn) - Yllw < 2n ·
Then, it follows that the sequence {Zl + ... + zn} is Cauchy in V, and,
since V is complete, it will converge to an element Z E V such that
Ilzllv < 1 and we will also have T(z) = y. This completes the proof. •
Theorem 4.4.1 (Open Mapping Theorem) Let V and W be Ba-
nach spaces and let T E £(V, W) be surjective. Then T is an open map.
Proof: We need to show that T maps open sets in V onto open sets
in W. Let G be an open set in V. Let y E T(G). Then, there exists
x E G such that y = T(x). Since G is open, there exists r > 0 such that
x + Bv(O; r) c G. Hence, y + T(Bv(O; r)) c T(G). But by the previous
proposition, there exists s > 0 such that Bw(O;s) c T(Bv(O;r)) and
so y + Bw(O; s) c T(G) which means that T(G) is open . •
Corollary 4.4.1 Let V and W be Banach spaces and let T E £(V, W)
be a bijection. Then T is an isomorphism.
Proof: Since T is a continuous bijection, in particular it is onto and
so, by the open mapping theorem, it is an open map, which means that
T- 1 is also continuous. Thus T is an isomorphism. •
Corollary 4.4.2 Let V be a Banach space with respect to two norms,
11.111 and 11·112. Assume that there exists c > 0 such that

for all x E V. Then, the two norms are equivalent.


Proof: The identity map I : {V, 11.112} --> {V, 11.lld is a linear bijection
which is also continuous by the given hypothesis. Hence the inverse map
is also continuous and this means that there exists a constant C > 0 such
that
4.4 The Open Mapping and Closed Graph Theorems 107

for all x E V. Hence the norms are equivalent . •

If V and Ware normed linear spaces and T : V ----- W is any mapping


define the graph of T, denoted G(T) as follows:

G(T) = {(x,y) I y = T(x)} c V xW

If V and Ware normed linear spaces and T : V ----- W is continuous,


then G(T) is closed in V x W. This follows from the following general
topological result.

Lemma 4.4.1 Let X and Y be topological spaces, with Y being Haus-


dorff· Let f : X ----- Y be a given mapping. Then, if f is continuous, the
graph
G(f) = {(x,y) E X x Y I y = f(x)}
is closed in the product space X x Y.

Proof: Let (x, y) E (X x Y)\G(f). Thus, y i- f(x) and so; since Y is


Hausdorff, there exist open sets U and V in Y such that f (x) E U, Y E V
and unv = 0. Now, f-l(U) X V is open in X x Y and contains the point
(x,y). Further, this set does not intersect G(f). (For, if (u,v) were in
the intersection, we would have v = f(u),u E f-l(U) and v E V. Thus,
f(u) = v E Un V, which is a contradiction.) Thus, the complement of
G(f) contains an open neighbourhood of each of its points and is hence
open, i.e. G(f) is closed in X x Y. •

If V and Ware Banach spaces and T is linear, then the converse is


also true as the following theorem shows.

Theorem 4.4.2 (Closed Graph Theorem) Let V and W be Banach


spaces and let T : V ----- W be a linear mapping. Assume that G(T), the
graph ofT, is closed in V x W. Then T is continuous.

Proof: For x E V, define

IlxliI = Ilxllv + IIT(x)llw.


Then 11.111 defines a norm on V. If {x n } is a sequence in V which is
Cauchy with respect to this norm, then, evidently it is Cauchy with
respect to the norm 11.llv as well. Then, since V is complete for the
norm 11.11 v, we have that Xn ----- x in V (in the topology of the norm
11.llv). Again, since {x n } is Cauchy for the norm 11.111, it also follows
108 4 Baire's Theorem and Applications

that {T(x n )} is Cauchy in Wand so, since W is complete, T(x n ) ---t y


in W. Since G(T) is closed, it follows that y = T(x). Thus, it follows
that Xn ---t x in the topology of the norm 11.111 as well. Thus {V, 11·11 d
is also complete. Hence, by the preceding corollary, 11.11 v and 11.111 are
equivalent. Hence, there exists C > 0 such that Ilxlli :S Cllxllv for all
x E V. In particular, IIT(x)llw :S Cllxllv for all x E V and so T is
continuous. •

Remark 4.4.1 The closed graph theorem gives a convenient way for
verifying the continuity of linear maps between Banach spaces V and
W. We just have to verify that if Xn ---t x in V and if T(x n ) ---t y in W,
then y = T(x) . •

Remark 4.4.2 In the Banach-Steinhaus theorem, it was sufficient that


the domain space V was complete. The range space W could be any
normed linear space. In the case of the Open Mapping and Closed
Graph theorems, however, it is essential that both the spaces V and W
are complete. •

4.5 Annihilators

In the remaining sections of this chapter, we will apply the open mapping
theorem to obtain results about the range and the kernel of linear maps.
In order to do this, we need to introduce an important notion.
Definition 4.5.1 Let V be a Banach space and let W be a subspace of
V. Let Z be a subspace of V*. The annihilator of W is the subspace
of V* given by

W1- = {f E V* I f(x) = 0 for all x E W}.

The annihilator of Z is the subspace of V given by

Z1- = {x E V I f(x) = 0 for all fEZ} . •

It is easy to see that W 1- is a closed subspace of V* and that Z 1- is


a closed subspace of V. Further,

W (4.5.1)

and
(4.5.2)
4.5 Annihilators 109

(cf. Exercises 3.8 and 3.9). Also, if G and Hare subspaces of V such
that G c H, then H..l c G..l.

Proposition 4.5.1 Let G and H be closed subspaces of a Banach space


V. Then
(4.5.3)

and
(4.5.4)

Proof: Clearly, G n H c (G..l + H..l)..l. Also, G..l c G..l + H..l and


H..l C G..l + H..l. Hence,

using (4.5.1) and the fact that G is closed. Similarly, we have

and (4.5.3) follows immediately. The relation (4.5.4) is obvious . •

Combining the result of the preceding proposition with the relations


(4.5.1)-(4.5.2), we deduce the following corollary.

Corollary 4.5.1 Let G and H be closed subspaces of a Banach space


V. Then
(4.5.5)
and
(4.5.6)

Proposition 4.5.2 Let G and H be two closed subspaces of a Banach


space V such that G + H is also closed. Then, there exists a constant
C > 0 such that, for every z E G + H, there exist x E G and y E H
satisfying
z = x + y, Ilxll ::; Cllzll, Ilyll ::; Cllzll· (4.5.7)

Proof: Consider the product space G x H with the norm

II(x, y)llcxH = Ilxll + Ilyll·


110 4 Baire's Theorem and Applications

This is a Banach space since G and H are closed. Consider the map
G x H --> G + H given by (x, y) t----t X + y. This map is clearly continuous,
linear and onto. Since G + H is a closed subspace of the Banach space
V, it is also a Banach space and so, by the open mapping theorem, there
exists a constant c> 0 such that if z E G + H with Ilzll < c, then there
exists (x , y) E G x H such that z = x + y and such that Ilxll + Ilyll < 1.
Given any z E G + H, by considering, for instance, the vector 211zl1 z we
see immediately that z can be written as z = x + y with x E G, y E H
such that
2
Ilxll ::; -llzll,
c
Ilyll < ~ll
c
ll
which proves the result with C = 2/c . •

Corollary 4.5.2 Let G and H be closed subspaces of a Banach space


V such that G + H is also closed. Then, there exists a constant C > 0
such that
d(x, G n H) ::; C [d(x, G) + d(x, H)] (4.5.8)
for all x E V.

Proof: Let x E V and let E > 0 be arbitrary. Then, there exist a E G


and b E H such that

Ilx - all ::; d(x, G) +E and Ilx - bll ::; d(x, H) + E.


set z = a - bEG + H. Then, there exists c > 0 (which depends only on
G and H), a' E G and b' E H such that

a- b = a' + b', Ila'll ::; clla - bll, 11b'11 < clla - bll·
Now, a - a' = b' + bEG n H. Thus,

d(x, G n H) < Ilx - all + Ila'll


< Ilx - all + clla - bll
< (1 + c)llx - all + cllx - bll
< (1 + c)d(x, G) + cd(x, H) + (1 + 2C)E
< (1 + c)(d(x, G) + d(x, H)) + (1 + 2C)E
which completes the proof (with C = (1 + c)), since E is an arbitrarily
small quantity. •

We are now in a position to prove a deeper result on annihilators.


4.5 Annihilators 111

Theorem 4.5.1 Let G and H be closed subspaces of a Banach space V.


The following are equivalent:
(i) G + H is closed in V.
(ii) G.1 + H.1 is closed in V*.
(iii)

(iv)

Proof: Since an annihilator is always closed, the implication (iv) => (ii)
is trivial. The equivalence (i) {:} (iii) is an immediate consequence of
(4.5.6). To complete the proof, we need to show that (i) => (iv) and that
(ii) => (i), which we now proceed to do.

Step 1. (i) => (iv). Assume that G + H is closed. Now, by (4.5.5),


we already know that (G n H).1 ~ G.1 + H.1. Hence, to prove (iv), it
suffices to prove the reverse inclusion. Let f E (G n H).1. Define a linear
functional <p on G + H as follows: let x = a + bEG + H with a E G
and b E H; set
<p(x) = f(a).
If x = a' + b' is another decomposition of x with a' E G and b' E H,
it follows that a - a' = b' - bEG n H and so f(a) = f(a'). Thus the
definition of <p(x) is independent of the decomposition chosen, and so <p
is a well defined linear functional. Further, since G + H is closed, we
can choose a decomposition x = a + b such that Iiall :S Gllxll (where G
depends only on G and H). Consequently, 11<p(x)11 :S Gllfll Ilxll and it
follows that <p is a continuous linear functional on G + H. Hence, we can
extend it to a continuous linear functional rp on V, by the Hahn-Banach
theorem. Now, f = <p = rp on G and so f - rp E G.1. Also if x E H, then
since <p( x) = 0, it follows that rp E H.1. Hence we have that

which proves the reverse inclusion that we sought to establish.

Step 2. (ii) => (i) . For any f E V*, we have that (cf. Exercise (3.8)(b))

d(f, G.1) = Ilfl Gil = sup If(x)l,


xEG,llxlI9
112 4 Baire's Theorem and Applications

d(f,H.l) = Ilfl HII sup If(x)l,


xEH,lIxll9

and (in view of (4.5.4)) that

d(f,G.lnH.l) = d(f,(G+H).l) = Ilfl G+HII = sup If(x)l ·


xEC+H, IIxll9

By Corollary 4.5.2, there exists a constant e > 0 such that, for every
f E V*,
d(f,G.l n H.l) :::; e [d(f,G.l) + d(f,H.l)]
since we are assuming that G.l + H.l is closed in V*. In other words,
we have, for every f E V*,

sup If(x)l:::; e[ sup If(x)1


sup + If(X)lj.
xEC+H ,IIxll9 xEC, IIxll9 IIxll9 xEH,
(4.5.9)
Step 3. We now claim that the above relation implies that

(4.5.10)

where Bc(O; 1) denotes the (open) unit ball in G and so on. If not, let
Xo E G + H with Ilxoll < lie such that

Xo ¢ Bc(O; 1) + BH(O; 1).

Assume that V is a real Banach space. Then, by the Hahn-Banach


theorem, there exists f E V* and a real number 0: such that, for all
Z E Bc(O; 1) + BH(O; 1), we have

f(z) < 0: < f(xo). (4.5.11)

In particular, f(xo) > 0: > O. Further, if z = x+y E Bc(O; 1)+BH(O; 1),


then so does -z = -x - y. Thus

SUPxEC, 114::;1 If(x)1 + sUPxEH, IIxI19If(x)1 < 0:


< f(xo)
Ilxollf (lI x1ollx O)
< hf (lI}ollxO)
< hSUPxEG+H If(x)1
IIxll9
4.6 Complemented Subspaces 113

which contradicts (4.5.9). This establishes (4.5.10).


In case V is a complex Banach space, then (4.5.11) holds for the real
part of a continuous linear functional f and the preceding sequence of
inequalities hold with If(x)1 being replaced by IRe(f)(x)l. However (cf.
Proposition 3.1.1) since the norm Ilfll is the same as the norm IIRe(f)11
(the latter being considered as a real linear functional), and since the
supremum over the unit ball gives the norm of the functional concerned,
the same conclusion holds with If(x)1 as well and so we again get a con-
tradiction to (4.5.9).

Step 4. Now consider the spaces E = G x H with the norm II(x,y)IIE =


max{llxll, Ilyll} and F = G + H with the norm from V. Both are Banach
spaces. Define T : E ~ F by T((x, y)) = x + y. Then T is continuous
and linear. Further, in view of (4.5.10), we also have that

1
T (BE(O; 1)) =:J CBF(O; 1).

Now, this implies that (cf. Step 2 of the proof of Proposition 4.4.1)

But then, it is now immediate to see that T must be onto. In other


words, G + H = G + H, i. e. G + H is closed. This completes the proof.

4.6 Complemented Subspaces

Let V be a vector space. If W is a subspace of V, then by completing a


basis of W to get a basis of V, we can easily produce a subspace Z of
V such that V = W EEl Z. The question which we wish to address now
is that if W is a closed subspace of a Banach space V, whether there
exists a closed subspace Z as above.

Definition 4.6.1 Let G be a closed subspace of a Banach space V. A


closed subspace H of V is said to be a complement of G if V = G EEl H,
i.e. GnH={O} andV=G+H . •

Remark 4.6.1 If G has a complement H in V, then every x E V has a


unique decomposition z = x + y with x E G and y E H. By Proposition
4.5.2, it follows that the maps z f-----> x and z f-----> yare continuous. Thus,
114 4 Baire's Theorem and Applications

we have continuous projections from V onto G and H. If G and H were


not closed, these projections need not be continuous . •

Example 4.6.1 Let G be a finite dimensional subspace of a Banach


space V. Then G has a complement (cf. Exercises 3.5 and 3.6) . •

Example 4.6.2 Every closed subspace G with finite co dimension has


a complement. This is trivial since any algebraic complement, i. e. any
subspace H such that V = G EB H, is finite dimensional and is hence
closed . •

Remark 4.6.2 Subspaces of finite codimension typically occur in the


following way. Let Z be a subspace of V* of dimension d. Then
its annihilator ~ will be a subspace of V with co dimension d. To
see this, let {h,···, id} be a basis for Z. Define <p : V --t ]Rd by
<p(x) = (h(x),···,id(X)). This map is onto. If not, by the Hahn-Banach
theorem, there exist scalars {a1, ... , ad}, not all zero, such that

Ladi(X) = 0
i=l

for every x E V. But this implies that 'Lt=l adi = 0 in V*, which con-
tradicts the linear independence of the Ud. Thus, we can find vectors
{e1,··· ,ed} in V such that

. {I,
0,
if i = j,
if i 1= j.

It is now easy to verify that {e1,···, ed} are linearly independent and
that their span (which has dimension d) is a complement to ~ . •

Remark 4.6.3 We will see later (cf. Chapter 7) that in a Hilbert space,
every closed subspace is complemented. In fact, a deep result of Lin-
denstrauss and Tzafriri states that any Banach space which is not iso-
morphic to a Hilbert space will always have uncomplemented closed
subspaces. Since Hilbert spaces are always reflexive, it follows that, in
particular, non-reflexive spaces will always have uncomplemented closed
subspaces. For example de Vito has shown that the space Co (cf. Exer-
cise 3.7) is uncomplemented in £00 . •
4.6 Complemented Subspaces 115

Definition 4.6.2 Let V and W be Banach spaces and let T E £(V, W)


be surjective. A linear transformation S E £(W, V) is said to be a right
inverse of T if To S = Iw, the identity operator on W . •

Proposition 4.6.1 Let V and W be Banach spaces and let T E £(V, W)


be surjective. The following are equivalent:
(i) T has a right inverse.
(ii) The subspace N = Ker(T) is complemented .in V.

Proof: Step 1. (i) =} (ii). Let S E £(W, V) be a right inverse for T.


Consider the image S(W) C V of S. Clearly, S(W) n N = {O}. Also,
if x E V, then x - S(T(x)) E N. Thus V = S(W) EEl N. We now show
that S(W) is closed. Indeed, if we have a sequence {Yn} in W such that
S(Yn) -----7 x in V, then Yn = T(S(Yn)) -----7 T(x) in W. Then, it follows
that
x = lim S(Yn) = S(T(x))
n-+oo

i.e. x E S(W), which proves that S(W) is closed. Thus N is comple-


mented in V.

Step 2. (ii) =} (i). Assume that a closed subspace M of V is a comple-


ment to N in V. Then, we have a continuous projection P : V -----7 M.
Let YEW. Let x E V such that T(x) = Y (T is surjective). If x' E V
also satisfies T(x') = y, then x-x' E N and so P(x) = P(x'). Thus, the
map Y ~ P(x) is well defined. Define S(y) = P(x). Then T(S(y)) =
T(P(x)) = T(x) = y (since x - P(x) E N and so T(x) = T(P(x))).
Thus To S = Iw. We now show that S is continuous as well. Since T is
onto, there exists a constant C > 0 such that Bw(O; C) c T(Bv(O; 1)).
Thus, for any YEW, 21131w Y E Bw(O; C) and so there exists x' E V
with Ilx'llv < 1 such that

T(x') = 211~l y.
Hence, if we set x = tllyllwx', we get that T(x) = y and that
2
Ilxllv < C Ilyilw.
Thus,
2
IIS(y)llv = IIP(x)llv ::; 1IPllllxliv ::; CllPllllyllw
which proves the continuity of S. Thus S is a right inverse for T . •
116 4 Baire's Theorem and Applications

4. 7 Unbounded Operators, Adjoints

In this section, we will look at linear transformations between Banach


spaces which are not necessarily defined on all of the space, but only
on a subspace. Further, they may not map bounded sets into bounded
sets. Such transformations are said to be unbounded and bounded linear
transformations form a special subclass.

Definition 4.7.1 Let V and W be Banach spaces. An unbounded


linear operator (or, transformation) from V into W is any linear
map defined on a subspace of V taking values in W. The domain of
definition of the transformation A is called the domain of A and is
denoted D(A). Thus,

A : D(A) cV -+ W.

The image of A is a subspace of Wand is called the range of A and is


denoted R(A). The operator A is said to be bounded if there exists a
constant C > a such that

IIA(x)llw :S Gllxllv (4.7.1)

for all x E D(A). The operator A is said to be densely defined if


D(A) = V. The graph of a linear operator A is denoted G(A) and is
given by
G(A) = {(x, A(x)) E V x W I x E D(A)}.
The operator A is said to be closed if the graph G(A) is closed in V x
W. •

To define a linear operator, we thus need to specify its domain and


then its action on vectors in the domain. Given a linear operator A :
D(A) c V -+ W, we denote its kernel by N(A). Thus,

N(A) = {x E D(A) I A(x) = O}.


Remark 4.7.1 If A : D(A) c V -+ W is closed, then N(A) is a closed
subspace in V. •

Example 4.7.1 Let V = W = c[a, 1]. Let D(A) = Cl[a, 1]. Define
A : D(A) c V -+ V by A(u) = u' where u' stands for the derivative of
the function u. Clearly, A is densely defined and N(A) is the subspace
of all constant functions. It follows from the fundamental theorem of
4.7 Unbounded Operators, Adjoints 117

calculus that A is surjective. If Un -+ U uniformly and if ~ -+ v uni-


formly, we know that u is differentiable and that u' = v. Thus, A is a
closed operator. Finally, A is unbounded. This, in fact, is the content
of Example 2.3.8 . •

Notation: Let V be a Banach space. Let x E V and f E V*. We


introduce the duality bracket < f, x >, which will also be denoted
< f, x >v*,v in case we need to specify the spaces involved, via the
relation
< f,x > = f(x).
Let V and W be Banach spaces and let A: D(A) c V -+ W be a densely
defined linear operator. Let

z = {v E W*
there exists C > 0 such that for all x E D(A)
I I < v, A(x) >w',w I :s; Cllxllv }.
(4.7.2)
Note that the constant C mentioned above depends on v. Clearly, Z is
a subspace of W*. If v E Z, define, for x E D(A),

g(x) = < v,A(x) >w',w .


By the definition of Z, it follows that 9 defines a continuous linear func-
tional on D(A). Hence, by the Hahn-Banach theorem, we can extend it
to a continuous linear functional gv on all of V. Since D(A) is dense in
V, such an extension is unique.
To summarize, we have a map v f--t gv from Z into V*. It is also easy
to see that this map is linear.

Definition 4.7.2 Let V and W be Banach spaces and let A : D(A) c


V -+ W be a densely defined linear operator. Let Z be as defined above
and for each v E Z, let gv E V* be as defined above. We set D(A*) = Z
and define A*(v) = gv for v E D(A*). The linear operator A* : D(A*) c
W* -+ V* is called the adjoint of the operator A . •

Thus, the adjoint is defined for densely defined linear operators. N<r
tice that there is no reason for A * to be densely defined. We have
the following important duality relationship: for all u E D(A) and all
v E D(A*), we have

< A*(v), u >v*,v = < v, A(u) >w',w . (4.7.3)


118 4 Baire's Theorem and Applications

Example 4.7.2 In a finite dimensional space, every subspace is closed.


Thus, any linear transformation is closed; it is densely defined if, and
only if, it is defined over the entire space. Hence, every linear transfor-
mation defined on the entire space has an adjoint and, since every linear
transformation is bounded, we also have that the adjoint is defined on
the entire dual space of the range. The dual of en is identified with en
with the duality product being given by
n
< y,X > = LXiYi
i=l

where x = (x!,···, xn) is in the base space and Y = (YI,···, Yn) IS III
the dual space. If A : en --+ em is a linear transformation which is
represented by the m x n matrix A, then we have, for all y E em and
for all x E en,

<A*y,x>=<y,Ax>= f(taijXj)Yi = tXj(faijYi)


i=l j=1 j=1 i=1

from which it follows that the matrix representing A * is none other than
the adjoint matrix A * (cf. Definition 1.1.11).
In the same way, if the matrix A represents a linear transformation
A from ]Rn into ]Rm, the adjoint of this transformation will be repre-
sented by the transpose matrix A'. •

Example 4.7.3 Consider the space £2 of square summable real se-


quences. Then its dual space can be identified with itself (cf. Example
3.1.1). Consider the linear operators on £2 given by T(x) = (0, Xl, X2,···)
and S(x) = (X2,X3,···) where X = (X1,X2,···) E £2. Then it is easy to
check that T* = S and that S* = T . •
Proposition 4.7.1 Let V and W be Banach spaces and let A: D(A) c
V --+ W be a densely defined linear operator. Then A * is closed.
Proof: We need to show that G(A*), the graph of A*, is closed in
W* x V*. Let Vn --+ v in W* and let A*(vn ) --+ j in V*. Let U E D(A).
Then
< A*(vn),u >v*,v = < vn,A(u) >w*,w·
Passing to the limit as n --+ 00, we get

< j, u >v*,v = < v, A(u) >w*,w (4.7.4)


4.7 Unbounded Operators, Adjoints 119

for all u E D(A). Thus,

I < v, A(u) >w-,w I :s; Ilfllv- Iluliv


which shows that v E D(A*). It also follows from (4.7.4) that f = A*(v),
thus proving that G(A*) is closed . •

The graphs of A and A * are connected by a simple relation. Let V


and W be Banach spaces. Define

.:J : W* x V* -+ V* x W*

by
.:J(v,j) = (-f,v).
Proposition 4.1.2 Let V and W be Banach spaces and let A: D(A) c
V -+ W be a densely defined linear operator. Let.:J be as defined above.
Then
.:J(G(A*)) = ( ( ))~.

Proof: Let u E D(A) be an arbitrary element so that (u,A(u)) is an


arbitrary element of G(A).

(v, j) E G(A*) {:} < f, u >v*,v = < v, A(u) >w*,w


{:} < - f, u >v*,v + < v, A(u) >w*,w = 0
{:} .:J(v,j) E ( ( ))~

which completes the proof. •

The following result characterizes densely defined and closed opera-


tors that are bounded.

Proposition 4.1.3 Let V and W be Banach spaces and let A : D(A) c


V -+ W be a densely defined and closed linear operator. The following
are equivalent:
(i) D(A) = V.
(ii) A is bounded.
(iii) D(A*) = W*.
(iv) A * is bounded.
In this case, we also have

IIAII IIA*II· (4.7.5)


120 4 Baire's Theorem and Applications

Proof: (i) =} (ii). If D(A) = V and A is closed, then it is continuous


by the closed graph theorem, and hence is bounded.

(ii) =} (iii). If A is bounded, then it follows from the definition of A*


that D(A*) = W*.

(iii) =} (iv). Since G(A*) is always closed, the result again follows from
the closed graph theorem.

(iv) =} (i). First of all, we show that D(A*) is closed. Indeed, if {v n } is


a sequence in D(A*) converging to v E VV*, then,

since A* is bounded. Thus, it follows that {A*(v n )} is Cauchy in V*.


Let A*(vn ) -- f in V*. Since G(A*) is always closed, it follows then
that v E D(A*) and that A*(v) = f. Thus D(A*) is closed.
Now set G = G(A) c V x Wand H = {O} x W c V x W. Both
are closed subspaces. Further, G + H = D(A) x W. On the other hand,
since (G(A))l- = J'(G(A*)), we have Gl- + Hl- = V* x D(A*), which is
closed. Thus, by Theorem 4.5.1, it follows that G + I-1 is closed as well,
which implies that D(A) is closed. Thus, D(A) = D(A) = V.
This proves the equivalence of all the four statements. Under these
conditions, we now prove (4.7.5). For all u E V and for all v E W*, we
have
< v,A(u) >w*,w = < A*(v),u >v*,v (4.7.6)
which yields

1< v,A(u) >w*,w I :S IIA*llllvllw*llullv


whence we deduce that (cf. Corollary 3.1.2)

IIA(u)llw :S IIA*II Iluliv


which implies that IIAII :S IIA* II·
Again, by virtue of (4.7.6), it follows that

IIA*(v)llv* = sup 1< A*(v),u >v*,v I :s IIAllllvllw*


uEv,llull9

which implies that IIA*II :s IIAII. This completes the proof. •


4.1 Unbounded Operators, Adjoints 121

Proposition 4.7.4 Let V and W be Banach spaces and let A: D(A) c


V ---+ W be a densely defined linear operator. Let G = G(A) c V x W
and let H = V x {O} c V x W. Then
(i) N(A) x {O} = GnH.
(ii) V x R(A) = G + H.
(iii) {O} x N(A*) = G.L n H.L.
(iv) R(A*) x W* = G.L + H.L.

Proof: The proof is an immediate consequence of the definitions and


the relation G(A).L = .J(G(A*)). The details are left as an exercise . •

Corollary 4.7.1 Let V and W be Banach spaces and let A : D(A) C


V ---+ W be a closed and densely defined linear operator. Then

N(A) = R(A*).L, (4.7.7)


N(A*) = R(A).L, (4.7.8)
N(A).L => R(A*) (4.7.9)
and
N(A*).L = R(A). (4.7.10)

Proof: Let G and H be as in the preceding proposition. We know that


(cf. Proposition 4.5.1) G n H = (G.L + H.L).L. Thus, we get that

N(A) x {O} = R(A*).L X {O} .

This proves (4.7.7). Again, by Proposition 4.5.1, we know that G.L n


H.L = (G + H).L which yields

{O} x N(A*) = {O} x R(A).L

which proves (4.7.8). The remaining two relations are proved from these
two and applying (4.5.1) and (4.5.2) . •

Theorem 4.7.1 Let V and W be Banach spaces and let A : D(A) C


V ---+ W be a closed and densely defined linear operator. Then, the
following are equivalent:
(i) R(A) is closed in W.
(ii) R(A*) is closed in V*.
(iii) R(A) = N(A*).L.
(iv) R(A*) = N(A).L.
122 4 Baire's Theorem and Applications

Proof: Using the same notations as in the preceding proposiition and


its corollary, we have:
(i) {::;> G + H is closed in V x W.
(ii) {::;> G.l + H.l is closed in V* x W*.
(iii) {::;> G + H = (G.l n H.l).l.
(iv) {::;> G.l + H.l = (G n H).l.

The equivalence of these statements follows from Theorem 4.5.1. •

Remark 4.7.2 As already remarked (cf. Example 4.7.2), all linear


transformations on finite dimensional spaces are closed and densely de-
fined means defined on all the space. The statement that R(A) =
N(A*).l is none other than the well known Fredholm alternative (cf.
Proposition 1.1.5) . •

We conclude with a useful characterization of surjective maps.


Theorem 4.7.2 Let V and W be Banach spaces and let A : D(A) c
V -+ W be a closed and densely defined linear operator. The following
are equivalent:
(i) A is onto, i.e. R(A) = W.
(ii) There exists a constant C > 0 such that, for all v E D(A*),

Ilvllw* ::; CIIA*(v)llv*. (4.7.11)

(iii) N(A*) = {O} and R(A*) is closed in V*.


Proof: (i) =? (iii). If A is onto, then R(A) = Wand is hence closed.
Thus by the preceding theorem, R(A*) = N(A).l, which is also closed.
Further (cf. Corollary 4.7.1), N(A*) = R(A).l = {O}.

(iii) =? (i). If N(A*) = {O}, and R(A*) is closed, it follows from the
preceding theorem that R(A) = N(A*).l = W. Thus A is onto.

(ii) =? (iii). By virtue of (4.7.11), it follows that N(A*) = {O}. Let {v n }


be a sequence in D(A*) such that A*vn -+ f in V*. Then, again, by
(4.7.11), it follows that

and so {v n } is a Cauchy sequence. Let Vn -+ v in W*. Since G(A*) is


closed, it follows then that v E D(A*) and that A*(v) = f. Thus R(A*)
4.7 Unbounded Operators, Adjoints 123

is closed.

(iii) (ii). Using the notation established in Proposition 4.7.4, N(A*) =


=}

{O} implies that C.lnH.l = {O} and R(A*) closed implies that C.l+H.l
is closed. Hence, by Proposition 4.5.2, there exists a constant C > 0
such that for every z E C.l + H.l = R(A *) x W*, there exist a E C.l
and b E H.l such that z = a + b, and lIallv*xw* ~ Cllzllv*xw* and
Ilbllv*xw* ~ Cllzllv*xw*. Further, this decomposition is unique, since
C.l n H.l = {O}. Let v E D(A*). Set z = (A*(v),O) E R(A*) x W* =
C.l + H.l . Then a = (A*(v), -v) E C.l and b = (0, v) E H.l and
a + b = z. The inequality(4.7.11) now follows immediately. •

Remark 4.7.3 A similar result can be stated and proved for the sur-
jectivity of A* . •

Remark 4.7.4 If V and Ware finite dimensional, then the ranges of A


and A * are automatically closed. In this case we have:

A onto {:? A* one - one,


A * onto {:? A one - one.

However, in infinite dimensional Banach spaces, if we do not have infor-


mation on the range being closed, we can only say:

A onto =} A * one - one,


A * onto =} A one - one.

For instance, if V = W = £2, and if

then A = A * (check!) and A is clearly one-one, but not onto (cf. Exam-
ple 2.3.3) . •

Remark 4.7.5 To show that A is onto, we usually use the relation


(4.7.11). We assume that A*(v) = f and show that Ilvllw* ~ Cllfllv*.
This is called the method of a priori estimates. We do not worry about
the existence of solutions to the equation A * (v) = f, for a given f; if a
solution exists, we obtain estimates for its norm. •
124 4 Baire's Theorem and Applications

Example 4.7.4 Let V and W be real Banach spaces and assume that
W is reflexive. Let
a(.,.) : V X W --t lR
be a bilinear form such that:
(i) a(.,.) is continuous, i.e. there exists M > 0 such that, for all v E V
and w E W, we have

la(v, w)1 ::; Mllvllvllwllw;

(ii) there exists a > 0 such that, for all w E W,

la(v, w)1 II II .
sup
vEV, v~O
Ilvllv 2: a w W,

(iii) there exists a constant (3 > 0 such that, for all v E V, we have

la(v, w)1
sup
wEW, ~o
II W II W 2: (3llv I v·
Then, given f E V* and 9 E W*, there exist unique elements Vo E V
and Wo E W such that

a(vo,w) < g,w >w*,w for all wE W;


a(v) wo) = < f,v >v*,v for all v E V.

To see this, define A: V --t W* by < A(v),w >w*,w= a(v,w) for all
w E W. By the continuity of the bilinear form, A is a well defined and
bounded linear operator. Since W is reflexive, we have A * : W --t V*,
which is also a bounded linear operator (cf. Proposition 4.7.3) and it is
easy to see that < A*(w),v >v*,v= a(v,w). By (ii), we have

IIA*(w)llv* 2: allwllw.

Thus, by Theorem 4.7.2, we deduce that A is onto. By (iii), we have

IIA(v)llw* 2: (3llvliv
whence A is one-one as well. In the same way, A * is one-one and onto
as well. Thus, there exist unique solutions to the equations

A(vo) = g, and A*(wo) = f

which proves the result . •


4.8 Exercises 125

4.8 Exercises

4.1 Show that a Banach space cannot have a basis whose elements form
a countable set. Deduce that the space P of all polynomials in one vari-
able cannot be complete for any norm.

4.2 (a) Let {an} be a sequence of real numbers such that for any given
real sequence {xn} such that Xn ---> 0 as n ---> 00, the series

converges. Show that the series L:~=l an is absolutely convergent. (Hint:


Use Exercise 3.7(a)).
(b) Let 1 < P < 00. Let {an} be a sequence of real numbers such that
for all x = (xn) E fp, the series L:~=l anxn is convergent. Show that
a = (an) E f p •.

4.3 (Numerical quadrature) Let V = C[0,1]. For each positive integer


n, define
Pn
'Pn(J) = L ~f(x~)
m=O

for all f E V, where {x~ ~=o are given points in [0,1] and { ~ ~=o
are real numbers (called weights). Let

'P(J) = 11 f(t)dt

for f E V.
(a) Show that 'Pn(J) ---> 'P(J) for every f E V, as n ---> 00, if, and only if,
the following conditions are verified:
(i) 'Pn(Jj) ---> 'P(Jj) as n --4 00, for every integer j 20, where fj(t) = t j ;
(ii)

(cf. Exercise 2.13.)


(b) If w;; 2 0 for all n and for all 0 ~ m ~ Pn, show that the condition
(ii) above is redundant.
126 4 Baire's Theorem and Applications

(c) (Trapezoidal rule) Set Pn = n and x~ = min for 0 ::; m ::; n. Let

wn = {~ if m i= 0, n
m 2~ if m = 0 or n.

Show that CPn(f) -+ cP(f) for all f E V, as n -+ 00.

4.4 Let V be a Banach space and let {8 (t) k::o be a family of continuous
linear operators on V. Assume that the following conditions hold:
(i) 8(0) = I, the identity operator on V.
(ii) For all tl ~ 0 and t2 ~ 0, we have

(iii) For all x E V,


lim8(t)(x) = x.
tlO

Then, we say that {8(t)h::::o is a co-semigroup of operators on v.


(a) Let A E .c(V). Define 8(t) = exp(tA) (cf. Exercise 2.30) for t ~ o.
Show that {8 (t) h::::o forms a co-semigroup of operators on V.
(b) Let V denote the space of all bounded and uniformly continuous real
valued functions on lR provided with the usual 'sup-norm'. For t ~ 0,
define 8(t) by
8(t)(f)(r) = f(t + r)
for r E R Show that 8(t) E .c(V) for each t ~ 0 and that {8(t)h>o is
a co-semigroup of operators on V.

4.5 Let V be a Banach space and let {8(t)h>o be a co-semigroup of


operators on V.
(a) Show that there exists M > 0 (which, without loss of generality, can
be chosen to be greater than, or equal to, unity) and TJ > 0 such that,
for all 0 ::; t ::; TJ, we have

118(t)11 ::; M.

(b) Deduce that if w = TJ-1logM ~ 0, then

118(t) II ::; M ewt

for all t ~ o.
(c) The semi group is said to be exponentially stable if we can find ]v! >0
4.8 Exercises 127

and w < 0 such that the preceding inequality is true. Show that a C()-
semigroup {S (t) h2:o is exponentially stable if, and only if, there exists
a to> 0 such that IIS(to)11 < l.
(d) Prove that for every x E V, fixed, the mapping t 1---+ S(t)x is contin-
uous from the interval [0, (0) into V.
(e) Prove that

lim -h
hiO
11 t
t +h
S(T)(X) dT S(t)(x)

for all t 2: 0 and for every x E V.

4.6 Let V be a real Banach space and let a(.,.) : V x V -+ ~ be a


continuous bilinear form (cf. Example 4.7.4). Assume that for every
x E V, x#- 0,
a(x,x) > O.
Let T: V -+ V be a linear map such that, for all x E V and y E V, we
have
a(T(x), y) = a(x, T(y)).
Show that T E £(V).

4.7 Let V and W be Banach spaces and let {fihEI (where I is an in-
dexing set) be a collection of continuous linear functionals on W which
separates points in W. Let T : V -+ W be a linear map. If Ii 0 T is
continuous for each i E I, show that T E £(V, W).

4.8 Let X, Y and Z be Banach spaces. Let T E £(X, Z) and A E


£(Y, Z). Assume that for every x E X, there exists a unique y E Y
such that A(y) = T(x). Define B : X -+ Y by Bx = y. Show that
BE £(X, Y).

4.9 Let V and W be Banach spaces. let T E £(V, W). We say that
S E £(W, V) is a left inverse of T if SoT = Iv, where Iv is the iden-
tity operator on V. Show that T has a left inverse if, and only if, T is
injective and R(A) is closed and complemented in W.

4.10 (a) Let W be a Banach space and let T : D(T) c W -+ W be a


closed and densely defined linear operator. Set V = D(T) and define,
for x E V,
Ilxll = Ilxllw + IIT(x)llw.
128 4 Baire's Theorem and Applications

Show that V is a Banach space for this norm.


(b) If V is also a Banach space for some other norm 11.11 v, and if this
norm is such that both the inclusion map of V into Wand the map T
are in £(V, W), show that there exists a constant C > 0 such that, for
allxEV,
Ilxllv ~ C (1lxllw + IIT(x)llw) .
4.11 Let V and W be Banach spaces and let T E £(V, W) be surjective.
Show that T is injective if, and only if, there exists a constant c > 0
such that, for all v E V, we have

IIT(v)llw 2: cllvllv.
4.12 Let Cp be the set of all real sequences such that all but a finite
number of terms are zero provided with the norrnll.lloo. Define T: cp --t
Cp by
T(x) = (Xl, ~2, ... , ~, ... )

where X = (x n ) E Cp. Show that T is a bijection and that T E £(cp).


Show, however, that T is not an isomorphism. Why does this not con-
tradict Corollary 4.4.1?

4.13 Let V be a Banach space and let {S (t) ~o be a co-semigroup of


operators on V. Define

D(A) = {x E V I ~N l(S(h)(X) - x) eXists}

and, for x E D(A),


. 1
A(x) = hm -h (S(h)(x) - x).
hlO

The operator A : D (A) c V --t V is called the infinitesimal generator of


the semigroup { (t) ~o. If A E £(V), show that it is the infinitesimal
generator of the semigroup {exp( tA) ~o.

4.14 Let V and {S (t) ~o be as in Exercise 4.4(b). Show that the


infinitesimal generator of the semigroup is given by:

D(A) = {f E V I l' exists and f' E V},

and
AU) = f'
4.8 Exercises 129

where I' denotes the derivative of I·


4.15 Let {S(t)h>o be a CQ-semigroup of operators on a Banach space V.
Let A : D(A) c V ----7 V be its infinitesimal generator. For any x E V,
show that
lot S(T)(X) dT E D(A)

and that
A (lot S(T)(X) dT) = S(t)(x) - x.

4.16 Let {S(t)h>o be a co-semigroup of operators on a Banach space

°
V. Let A : D(A) C V ----7 V be its infinitesimal generator.
(a) Let x E D(A). Show that S(t)(x) E D(A) for all t > and that
d
dt (S(t)(x)) = A(S(t)(x)) = S(t)(A(x)).
(b) If x E D(A) and if °:s; t2 < tl, show that

S(tl)(X) - S(t2)(X) = i h
t2
S(T)(A(x)) dT =
iti
t2
A(S(T)(X)) dT.

4.17 Let {S (t) h>o be a co-semigroup of operators on a Banach space


V. Let A : D(A) C V ----7 V be its infinitesimal generator. Show that A
is densely defined and closed.

4.18 Let V be a Banach space and let A : D(A) C V ----7 V be a linear


operator. We say that a map t r--+ u(t) from [0,00) into V is a solution
to the initial value problem:

~~t) = A(u(t)), t > 0,


u(o) = x
°
if u(t) E D(A) for all t > and if it verifies the above equations.
(a) If x E D(A), and if A is the infinitesimal generator of a co-semigroup
of operators {S(t)h::::o on V, show that the only solution to the above
initial value problem is given by u(t) = S(t)(x) for t 2: 0. (Hint: Clearly
u(t) defined thus is a solution by the Exercise 4.16; to show uniqueness,
differentiate the map T r--+ S(t - T)(U(T)), T E (0,00).)
(b) If V is a Banach space and if {SI(t)}t>O - and {S2(t)h>o
- are two
CQ-semigroups of operators on V which have the same infinitesimal gen-
erator, show that Sl (t) = S2 (t) for all t 2: 0,
130 4 Ba'ire's Theorem and Applications

(c) Deduce that the only semigroups on V whose infinitesimal genera-


tors are in £(V) are of the form {exp(tA) ~o with A E £(V).

4.19 Let { (t) ~o be a co-semigroup of operators on a Banach space V.


Let A : D(A) c V ---t V be its infinitesimal generator. Assume further
that, for all t :2 0, IIS(t)11 :S 1. (Such a semi group is called a semigroup
of contractions).
(a) Let A > O. For x E V, define

R(A)(X) = 10 00
e-,xtS(t)(x) dt

(cf. Exercise 3.15). If x E D(A), show that R(A)(X) E D(A) for all
A> 0 and that
R(A)(A(x)) = A(R(A)(X)).
(b) Show that for all A > 0,

(AI - A)(R(A)(X)) x
for all x E V and that

R(A)((AI - A)(x)) = x

for all x E D(A), where I is the identity map on V.

Remark 4.8.1 Exercise 4.19 shows that if A is the infinitesimal genera-


tor of a semi group of contractions, then the (unbounded) linear opeartor
AI - A is invertible and that its inverse is R(A) which is a bounded lin-
ear operator defined on V. Thus, together with Exercise 4.17, we see
that in order that an (unbounded) linear operator A be the infinitesimal
generator of a semigroup of contractions, it has to be densely defined,
closed and for all A > 0, II(AI - A)-III :S l/A. In fact these conditions
are also sufficient for A to be the infinitesimal generator of a semigroup
of contractions. This is the content of the famous Hille- Yosida theorem.
Generalizations to other co-semigroups also exist. The usefulness of this
result stems from the fact that many partial differential equations of the
evolution type (for instance, the heat, wave and Schrodinger equations)
can be cast in the form of an initial value problem as stated in Exer-
cise 4.18 involving an unbounded linear operator and so the existence of
uniqueness of solutions will follow from the the fact that the operator is
the infinitesimal generator of a co-semigroup. For more details see, for
4.8 Exercises 131

instance Kesavan [3] . •

4.20 Let V and W be Banach spaces and let A : D(A) c V -+ W be a


closed operator. Let BE £(V, W). Show that (A+B) : D(A) cV -+ W
is also closed.
Chapter 5

Weak and Weak* Topologies

5.1 The Weak Topology

In this chapter, we will study topologies on Banach spaces which are


weaker (i. e. coarser) than the norm topology.

Definition 5.1.1 Let V be a Banach space. The weak topology on


V is the coarsest (i.e. smallest) topology such that every element of V*
is continuous. Open (respectively, closed) sets in the weak topology will
be called weakly open (respectively, weakly closed) sets . •

We have already encountered the notion of the weak topology on a


given set such that a family of functions is continuous (cf. Definition
1.2.10). The weakly open sets are precisely the class of all arbitrary
unions of finite intersections of sets of the form f- 1 (U) where f E V*
and U is an open set in lR (or te, in the case of complex Banach spaces).
A basic neighbourhood system for the weak topology is, therefore,
the collection of sets of the form

U = {x E V I Ifi (x - xo) I < E: for all i E I}

where Xo E V, E: > 0, I is a finite indexing set and fi E V* for all i E I.


The set U described above forms a weakly open neighbourhood of the
point Xo E V (cf. the discussion following Definition 1.2.10).

Proposition 5.1.1 The weak topology is Hausdorff.

Proof: Let x and y be distinct points in V. Then, since V* sepa-


rates points in V (cf. Remark 3.1.1), there exists f E V* such that
f(x) i= f(y)· Choose disjoint open neighbourhoods U of f(x) and V
5.1 The Weak Topology 133

of f(y). Then f-l(U) and f-l(V) are disjoint weakly open neighbour-
hoods of x and y respectively. This completes the proof. •

Notation: Given a sequence {xn} in V, we write Xn --> x if the


sequence converges to x E V in the norm topology, i.e. if Ilxn - xii --> 0
as n --> 00. If the sequence converges to x in the weak topology, we write
Xn ----'- X.

Proposition 5.1.2 Let V be a Banach space and let {xn} be a sequence


in V.
(i) Xn ----'- x in V if, and only if, f(xn) --> f(x) for all f E V*.
(ii) If Xn --> x in V, then Xn ----'- x.
(iii) If Xn ----'- x in V, then {llxnll} is bounded and

Ilxll :::; liminf


n---+oo
Ilxnll·
(iv) Ifx n ----'- x in V and fn --> fin V*, then fn(xn) --> f(x).

Proof: (i) This is a direct consequence of the definition of the weak


topology.

(ii) Let f E V* be an arbitrary element. Then

If(xn) - f(x)1 :::; Ilfll Ilxn - xii --> O.

The result now follows from (i).

(iii) This follows from (i) and the Banach-Steinhaus theorem (cf. Corol-
lary 4.2.2 applied to the sequence {JXn } in V**).

(iv) We have

Ifn(x n ) - f(x)1 < Ifn(x n ) - f(xn)1 + If(xn) - f(x)1


< Ilfn - fll Ilxnll + If(xn) - f(x)l·
The first term on the right-hand side tends to zero since Ilxnll is bounded
(by (iii)) and Ilfn - fll --> O. The second term also tends to zero (by (i)).
This completes the proof. •

Example 5.1.1 Consider £2 the space of all square summable real se-
quences. We can identify £2 with £2 (cf. Example 3.1.1). Consider the
134 5 Weak and Weak * Topologies

sequence {en} E f2 (cf. Example 3.1.1). If x E f2 = f2' with x = (xd,


we have that
< x, en >£2,(2 = Xn
which tends to zero since L:~llxiI2 < 00. Thus, the sequence {en} con-
verges weakly to 0 in f 2 . Notice that this sequence has no subsequence
which converges in the norm topology since

for all n i= m. Thus, while norm convergence implies weak convergence,


nothing can be said about the reverse implication.
We can similarly prove that for all 1 < p < 00, the sequence {en}
converges weakly to zero in f p . •

Proposition 5.1.3 If V is a finite dimensional space, then the norm


and weak topologies coincide.

Proof: Since the weak topology is coarser than the norm topology,
every weakly open set is also open in the norm topology. We thus have
to prove the converse. Let U be open in the norm topology and let
Xo E U. There exists r > 0 such that the open ball B(xo; r) c U. Let
dim(V) = n and let {VI, ... , vn } be a basis for V such that, without loss
of generality, Ilvill = 1 for all 1 :::; i :::; n. If x E V, then x = L:~=1 XiVi
and define fi to be the i-th coordinate projection, i.e. fi(X) = Xi. Then
n n

i=1 i=1

Define
w = {X E V I Ifi(X - xo)1 < ~, 1~ i ~ n} .
Then W is a weakly open neighbourhood of Xo and it is clear from the
above computations that W C B(xo;r) C U. Thus U is open in the
weak topology as well and this completes the proof. •

Thus, in a finite dimensional space the weak and norm open (respec-
tively, closed) sets are the same. However, in infinite dimensional spaces,
the weak topology is strictly coarser than the norm topology. We will
presently see examples of norm closed (respectively, open) sets which are
not closed (respectively, open) in the weak topology (cf. Examples 5.1.2
and 5.1.3 below). However, for convex sets, the situation is different.
5.1 The Weak Topology 135

Proposition 5.1.4 Let C be a convex and (norm) closed subset of a


Banach space V. Then C is also weakly closed. (The converse is always
true, even without the convexity hypothesis).

Proof: We will assume that V is a real Banach space, for simplicity.


Let C be a closed and convex set in V and let Xo tJ C. Then, by
the Hahn-Banach theorem, there exists f E V* and a E JR such that
f(xo) < a < f(x) for all x E C. Then the set

U = {xEVlf(x)<a}

is a weakly open neighbourhood of Xo which does not meet C. Thus the


complement of C is weakly open and so C is weakly closed . •

Definition 5.1.2 Let X be a topological space and let f : X ----7 JR be a


given function. We say that f is lower semi-continuous if, for every
a E JR, the set

{x E X I f(x)::; a}
is closed in X . •

Clearly, every continuous map is lower semi-continuous. If Xn ----7 x


in X, and if f : X ----7 JR is lower semi-continuous, then

f(x) ::; liminf f(x n ). (5.1.1 )


n--->oo
For, if a = liminfn--->= f(x n ), then, given any c > 0, there exists a

°
subsequence x nk such that f(x nk )::; a+c for all k. Since f- 1 ((-00,a+
cD is closed, it follows that f(x) ::; a + c and since c > was arbitrarily
chosen, (5.1.1) follows.

Corollary 5.1.1 Let V be a Banach space and let 'P : V ----7 JR be convex
and lower semi-continuous (with respect to the norm topology). Then
'P is also lower semi-continuous with respect to the weak topology. In
particular, the map x I--t Ilxll, being continuous, is also lower semi-
continuous with respect to the weak topology and, if Xn - ' x in V, we
have
Ilxll ::; liminfllxnll· (5.1.2)
n--->oo
Proof: For every a E JR, the set 'P- 1 (( -00, aD is closed (in the norm
topology) and is convex. Hence it is weakly closed. This completes the
136 5 Weak and Weak* Topologies

proof. •

Notation: Let V be Banach space. We will use the following notations.


D {x E V Illxll < I} (open unit ball).
B = {x E V Illxll :::; I} (closed unit ball).
S = {x E V Illxll = I} (unit sphere).
Example 5.1.2 Let V be an infinite dimensional Banach space. Let S
be the unit sphere in V. Then S is never weakly closed, though it is
closed in the norm topology. To see this, let Xo E V such that IIxoll < 1.
Consider any weakly open neighbourhood U of Xo of the form

U = {x E V Ilfi(x - xo)1 < E, 1 :::; i :::; n}

where E > 0 and fi E V* for 1 :::; i :::; n. Consider the map A : V ---+ ~n
defined by
A(x) = (h (x), .. · ,fn(x)).
This map cannot be injective (otherwise, we will have dim(V) :::; n, which
is a contradiction). Thus, there exists Yo i- 0, such that fi(yo) = 0 for
all 1 :::; i :::; n. Then Xo + tyo E U for all t E R Set g(t) = IIxo + tyoli.
Then g(O) < 1 while g(t) ---+ +00 as t ---+ +00. Hence, there exists to
such that g(to) = 1. Thus, Xo + toYo E Un S. We have thus proved that
every weakly open neighbourhood of every point in the open unit ball,
D, intersects the unit sphere, S. Hence the closed unit ball, B, must
lie in the weak closure of S. But B being closed (in the norm topol-
ogy) and convex, is itself weakly closed. Thus the weak closure of the
unit sphere, S, is the closed unit ball, B. Thus, S is not weakly closed . •

Example 5.1.3 Let V be an infinite dimensional Banach space. Then


the open unit ball D is not weakly open. As seen in the preceding ex-
ample, every weakly open neighbourhood of a point Xo E D contains
an affine subspace of the form {xo + tyo It E ~ where Yo is chosen as
before. Thus D cannot contain a weakly open neighbourhood of any of
its points and hence D cannot be weakly open. •

Thus, in an infinite dimensional Banach space, the weak topology is


strictly coarser than the norm topology.
Proposition 5.1.5 (Schur's Lemma) In the space fl' a sequence is
convergent in the weak topology if, and only if, it converges in the norm
topology.
5.1 The Weak Topology 137

Proof: By Proposition 5.1.2, every sequence which converges in norm,


also converges weakly. Conversely, let {xn} be a weakly convergent
sequence. Without loss of generality, assume that Xn ~ o. Let

Xn = (x;,x~, ... ,x~, .. .).

Consider the functional Ii which is the projection to the i-th coordi-


nate. Then, since the sequence weakly converges to zero, it follows that
fi(X n ) -+ 0, i.e.
lim xin = 0
n-oo

for very positive integer i. Assume, if possible, that {xn} does not
converge to zero in norm. Then, there exist E > 0 such that, for infinitely
many n,
00

Llx~1 2 E.
k=l

Thus, working with a suitable subsequence if necessary, we may assume


that this is true for all n.

Set no = mo = 1. Define, for k 2 1, nk and mk inductively as follows .

• nk is the smallest integer greater than nk-l such that


mk-l

Lj=l
Ixtl < :

(This is possible since we know that each coordinate sequence tends


to zero.)

• Now choose mk to be the smallest integer greater than mk-l such


that
00

L IX~kl < ;
j=mk+ 1

(This is possible since the sequence (x nk ) E £d

Now define y = (yj) E £00 = £i as follows:


138 5 Weak and Weak* Topologies

if xt = 0,
otherwise.

By varying k over all positive integers, yj will be defined for all positive
integers j. Clearly I/ylloo = 1. Also

00 mk-l 00
4E
L (X~kyj -Ixtl) < 2L IX~kl + 2 L IX~kl < -
j=1 5
j=1 j=mk+1
Thus,
00
4E E
LX~kyj > E-- -
5 5
j=1
which contradicts the weak convergence of {x nk } to zero. Hence the
result . •

Remark 5.1.1 The space £1 being infinite dimensional, the weak and
norm topologies are different. Nevertheless, we see from the preced-
ing proposition that the convergent sequences for these topologies are
the same. While two metric spaces which have the same convergent
sequences are equivalent (i. e. their topologies are the same), two topo-
logical spaces with the same convergent sequences need not be the same.
This illustrates the inadequacy of considering just sequences in a general
topological space. We also conclude that the weak topology on £1 is not
metrizable. •

Definition 5.1.3 Let V and W be Banach spaces and let T : V ~ W


be a linear mapping. We say that T is weakly continuous if T is
continuous as a mapping from V into W, each space being endowed with
its weak topology. •

Lemma 5.1.1 Let T : V ~ W be a linear mapping. Then T is weakly


continuous if, and only if, for every f E W*, the map x f-t f(T(x)) is a
weakly continuous map from V into lR (or C, in case of complex Banach
spaces).

Proof: If T is weakly continuous, then, clearly its composition with any


f E W* will also be weakly continuous.
5.2 The Weak* Topology 139

Now let j E W*. Let U be open in lR (or C). Then j-l(U) is weakly
open in W, by definition. If joT is weakly continuous, we also have
that T- 1 (1-1 (U) is weakly open in V. But by the definition of the weak
topology, every weakly open set in W is the union of finite intersections
of sets of the form j-l(U), where U is open in lR (or C) and j E W*.
Thus, it follows from the above that the inverse image of every weakly
open set in W is weakly open in V, i. e. T is weakly continuous . •

Our final result in this section shows that as far as continuity of


linear maps is concerned, the topology really does not matter.
Theorem 5.1.1 Let V and W be Banach spaces and let T: V -+ W be
a linear map. then T E £(V, W) ij, and only ij, T is weakly continuous.
Proof: Let T E £(V, W). If j E W*, then the map joT E V* and so
is weakly continuous as well. Thus by lemma 5.1.1, it follows that Tis
weakly continuous.

Conversely, if T is weakly continuous, since the weak topology is


Hausdorff, it follows that the graph G(T) is closed when V x W is given
the product topology induced by the weak topologies (cf. Lemma 4.4.1).
But this is clearly the weak topology of V x W (why?) and so G(T) is
weakly closed in V x Wand so is closed for its norm topology as well.
The continuity of T (between the norm topologies of V and W) is now
a consequence of the closed graph theorem. •

5.2 The Weak* Topology

Let V be a Banach space. Then its dual space, V*, has its natural norm
topology. It also is endowed with its weak topology, viz. the coarsest
topology such that all the elements of V** are continuous. We now
define an even coarser topology on V*.
Definition 5.2.1 The weak* topology on V* is the coarsest topology
such that the functionals {Jx I x E V} are all continuous, where J : x I-t
Jx is the canonical imbedding oj V into V** . •
Clearly, the weak* topology is coarser than the weak topology on
V*. Thus if S, Wand W* denote the norm, weak and weak* topologies,
respectively, on V*, we have

W* eWe S.
140 5 Weak and Weak* Topologies

Remark 5.2.1 It is clear that if V is a reflexive Banach space, then the


weak and weak* topologies on V* coincide. •

Proposition 5.2.1 Let V be a Banach space. The weak* topology on


V* is Hausdorff.

Proof: Let JI and 12 be distinct elements of V*. Then, there exists


x E V such that JI(x) 0:1 12(x). Choose disjoint neighbourhoods Ul
of JI(x) and U2 of 12(x) in lR (or C, as the case may be). Then, by
definition, the sets

are both weak* open sets and are clearly disjoint and contain JI and 12
respectively. This completes the proof. •

As in the case of the weak topology, we can describe the weak*


open neighbourhoods of elements of V* as follows. Let I be a finite
indexing set and let Xi E V for i E I. Let c > O. Then, a weak* open
neighbourhood of io E V* can be written as

U E V* 1IU - iO)(Xi)1 < c, i E I}.

Notation: Let {in} be a sequence in V*. If in converges to i in


V* in the norm topology, we write in -> i. If it converges to i in the
weak topology of V*, we will write, as before, in ---'- i. If the sequence
converges in the weak* topology of V*, we will write in ~ i.
The proof of the following proposition is easy and is left to the reader
as an exercise.

Proposition 5.2.2 Let V be a Banach space and let Un} be a sequence


in V*.
(i) in ~ i in V* if, and only if, in(x) -> f(x) for every x E v.
(ii) fn -> f => in ---'- i => in ~ i·
(iii) If fn ~ f in V* and Xn -> x in V, then fn(x n ) -> f(x) . •
The next proposition shows that the functionals {Jx 1 x E V} are
the only ones which are continuous with respect to the weak* topology.

Proposition 5.2.3 Let 'P be a linear functional on V* which is contin-


uous with respect to the weak* topology. Then, there exists x E V such
that 'P = J x .
5.2 The Weak* Topology 141

Proof: Let D be the open unit ball in lR (or C, in case of complex Ba-
nach spaces). Since cp is weak* continuous, there exists a weak* neigh-
bourhood of the origin in V*, say, U, such that cp(U) c D. Assume
that

where c > 0 and Xi E V for 1 ~ i ~ n. Thus, for every 1 E U, we have


that
IcpU)1 < 1.
Assume that for some 1 E V*, we have I(Xi) = 0 for all 1 ~ i ~ n.
Then 1 E U. Further, for any real number k, we have that kl(Xi) = 0
for all 1 ~ i ~ n and so kl E U as well. Thus, for all positive integers k,

1
I'PU)I < k

and so 'PU) = O. It then follows (cf. Exercise 3.17) that there exist
scalars ai for 1 ~ i ~ n such that
n
'P = :2:= aiJXi·
i=l

This proves the result with X = E~=l aixi . •

Corollary 5.2.1 A weak* closed hyperplane must be of the form

H = {f E V* j f(x) = a}

where x E V and a is a scalar.

Proof: For simplicity, we will assume that the base field is R Since H
is a weak* closed hyperplane, it is closed in the norm topology as well
and so (cf. Proposition 3.2.1) there exists 'P E V** such that

H = {f E V* 1 'PU) = a}

for some real number a. Let 10 E H C , the complement of H. Since H is


weak* closed, there exists a weak* open neighbourhood of the form

U = {f E V* IIU - 10)(Xi)1 < c, 1 ~ i ~ n}

(where c > 0 and Xi E V for 1 ~ i ~ n) of 10 contained in H C • Now, U


is a convex set. Thus, it is easy to see that either 'PU) < a for all fEU
142 5 Weak and Weak * Topologies

or rp(J) > a for all 1 E U. Assume the former (the proof in the latter
case will be similar). Let W = U - Uo} = U - 10 lIE U}. Then,

W = {g E V* I 9 + 10 E U}

and so 9 E W if, and only if, -g E W. Thus W -W. Now, if


rp(J) < a for all 1 E U, it follows that

rp(g) < a - rp(Jo)

for all 9 E W. Since W = - W, it then follows that


Irp(g)1 < la - rp(Jo) I

for all 9 E W. Since we can always find 10 E He such that la-rp(Jo) I < 'r},
for any 'r} > 0, and since W is weak* open, it follows that rp is weak*
continuous at the origin, and so, by linearity, weak* continuous every-
where. Then, by the preceding proposition, rp = J x for some x E V.
This completes the proof. •

In a finite dimensional space, we have dim(V) = dim(V*) = dim(V**)


and so the canonical imbedding J : V - t V** is onto. Thus every finite
dimensional space V is reflexive and the weak and weak* topologies on
V* coincide.
However, the above corollary shows that, in infinite dimensional and
nonreflexive spaces, the weak* topology is strictly coarser than the weak
topology. If rp E V**\J(V), then the hyperplane [rp = a] is a convex and
(norm) closed set and hence is weakly closed but it is not weak* closed.
One might wonder the purpose of impoverishing the norm topologies
on Banach spaces and their duals to produce the weak and weak* topolo-
gies. One important off shoot of this is process is that by decreasing the
number of open sets, we increase the chances of a set being compact,
which is a very useful and topological property. We saw, in Chapter
2, that in infinite dimensional spaces, the closed unit ball cannot be
compact. The ball becomes compact in the weak* topology.

Theorem 5.2.1 (Banach-Alaoglu Theorem) Let V be a Banach


space. Then, B*, the closed unit ball in V*, is weak* compact.

Proof: Consider the product space

x = IIxEv[-llxll,llxll]
5.2 The Weak* Topology 143

with the usual product topology inherited from R This space is clearly
compact since each bounded and closed interval in lR is compact. Let
f E B*. Then, for each x E V, we have f(x) E [-llxll, Ilxlll. Thus, the
map f t----+ <p(f) = (f(X))XEV is a bijection from B* onto its image in X.
If B is endowed with the topology induced by the weak* topology of V*,
then the definitions of this topology and the product topology on X tell
us that <p is a homeomorphism. We thus just need to show that <p(B*)
is closed in X, which will prove <p(B*), and hence B*, to be compact.
Let (fx)xEV E <p(B*). Define, for x E V, f(x) = fx. The proof will
be complete if we can show that f is linear; since If(x)1 ~ Ilxll for all
x E V, it will then follow that f E B*, i.e. <p(B*) = <p(B*).
Let E > 0. Then, given x and y E V, we can find 9 E B* such that
E E E
Ig(x) - f(x)1 < 3' Ig(y) - f(y)1 < 3' Ig(x + y) - f(x + y)1 < 3
Thus,
If(x + y) - f(x) - f(y)1 < E

and, since E was arbitrarily chosen, we deduce that

f(x + y) = f(x) + f(y)


for every x and y E V. Similarly, we can show that if a is a scalar and
if x E V,
f(ax) = af(x).
Thus f is linear and the proof is complete. •
Lemma 5.2.1 Let V be a Banach space and let Ii E V*, 1 ~ i ~ n.
Let ai, 1 ~ i ~ n be scalars. Then, the following are equivalent:
(i) For every E > 0, there exists Xc E V with Ilxc II ~ 1 and such that
Ifi(X c ) - ail < E

for all 1 ~ i ~ n.
(ii) For all scalars f3i, 1 ~ i ~ n, we have
n
"f3'a'
~ ~ ~ <
-
i=l

Proof: (i) =? (ii). Let s = 2:~11f3il. By (i),


n
2:(f3il'i(xc ) - f3iai) < ES
i=l
144 5 Weak and Weak * Topologies

which implies that


n
~ ES + Lf3di
i=l

from which (ii) follows, since we can choose E > 0 to be arbitrarily small.

(ii) * (i). Let a = (al,"', an) E ]Rn (the proof when the scalar field
is C is similar). Define A : V ----> ]Rn by A(x) = (h(x),"', In(x)). We
then need to show that a E A(B) where B is the closed unit ball in V. If
not, by the Hahn-Banach theorem, we can find scalars A and f31, ... ,f3n
such that, for every x E B,
n n
L a if3i > A > Lf3ili(x)
i=l i=l

which implies that


n
< A < Laif3i
i=l

which contradicts (ii). This completes the proof. •


Proposition 5.2.4 Let V be a Banach space. Let B be the closed unit
ball in V and B** the closed unit ball in V**. Let J : V ----> V** be the
canonical imbedding. Then, B** is the weak* closure of J(B) in V**.

Proof: Since B** is weak* compact, it is weak* closed. Let <Po E B**.
Consider a weak* open neighbourhood of <Po of the form

u = {<p E V** 11(<p - <po) (fi) 1< E, 1 ~ i ~ n}

where E > 0 and Ii E V*, 1 ~ i ~ n. Let ai = <po(li), 1 ~ i ~ n. Then,


for scalars f3i, 1 ~ i ~ n, we have,
n

Then, by the preceding lemma,there exists x E B such that J x E U.


Thus U intersects J(B) and this shows that J(B) is weak* dense in B**
which completes the proof. •
5.3 Reflexive Spaces 145

Remark 5.2.2 Let V be a Banach space. Since the map J : V -+ V** is


an isometry, it follows that J(B) is closed in the norm topology of V**.
Thus, either J(B) = B**, which is true if, and only if, V is reflexive, or
J(B) is a closed and proper subset of B**. Thus, in the non-reflexive
case, J(B) is not dense in B** for the norm topology. •

5.3 Reflexive Spaces

Let us recall that a Banach space V is said to be reflexive if the canonical


imbedding J : V -+ V** is surjective (cf. Definition 3.1.1). We also saw
that the spaces £p for 1 < p < 00 are examples of reflexive spaces while
£1 is not reflexive. In Chapter 7, we will see that every Hilbert space is
reflexive.
In this section, we will study some important properties of reflexive
spaces.

Notation: Given a Banach space V, we will denote the closed unit balls
in V, V* and V** by B, B* and B**, respectively.

Theorem 5.3.1 A Banach space V is reflexive if, and only if, B is


weakly compact.

Proof: Assume that B is weakly compact. Since J : V -+ V** is an


isometry, it is continuous and hence weakly continuous as well (cf. Theo-
rem 5.1.1) and so J(B) is weakly compact. Hence it is weak* compact as
well. The weak* topology being Hausdorff, it follows that J(B) is weak*
closed. But then (cf. Proposition 5.2.4) it follows that J(B) = B**. This
immediately implies that J is surjective, i. e. V is reflexive.

Conversely, let V be reflexive. Then the weak and weak* topologies


on V* coincide. Hence, by the Banach-Alaoglu theorem, B* is weakly
compact. Then, by the preceding arguments, it follows that V* is reflex-
ive. Then, just as we saw earlier, it follows that B** is weakly compact.
Since V is reflexive, we have B = J-1(B**). Also, since J- 1 : V** -+ V
is continuous, it is weakly continuous as well and so B is weakly compact .

Corollary 5.3.1 Let V and W be Banach spaces and let T : V -+ W
be an isometric isomorphism. Then, if V is reflexive, so is W.
146 5 Weak and Weak * Topologies

Proof: Let Bv and Bw be the closed unit balls in V and W, respec-


tively. Since T is an isometric isomorphism, we have that T(Bv) = Bw.
Now, T being continuous, it is weakly continuous as well. Since V is
reflexive, we have that Bv is weakly compact and so Bw = T(Bv) is
also weakly compact, which implies that W is reflexive . •

Corollary 5.3.2 Let V be a reflexive Banach space and let W be a


closed subspace of V. The W is also reflexive.

Proof: It is easy to see that the weak topology on W is none other than
the topology induced on W by the weak topology of V (cf. Exercise 5.1).
Since V is reflexive, it follows that B is weakly compact. The unit ball
in W is none other than W n B. But W being a closed subspace, it is
weakly closed and since B is weakly compact, it follows that W n B is
weakly compact as well. Thus, it follows that W is reflexive. •

Corollary 5.3.3 Let V be a Banach space. Then, V is reflexive if, and


only if, V* is reflexive.
Proof: We already saw in the proof of Theorem 5.3.1 that if V is
reflexive, then V* is reflexive.
Conversely, let V* be reflexive. Then, as before, V** is reflexive.
Now, J(V) is a closed subspace of V** and so, by the preceding corollary,
it follows that J(V) is reflexive. But then J- 1 : J(V) - t V is an
isometric isomorphism and so V is reflexive by Corollary 5.3.1. •
Corollary 5.3.4 Let V be a reflexive Banach space. Let K c V be a
closed, bounded and convex subset. Then, K is weakly compact.
Proof: Since K is bounded, there exists a positive integer m such that
K c mB. Then, since K is convex and closed, it is weakly closed and
since mB is weakly compact, it follows that K is weakly compact . •

Proposition 5.3.1 Let V and W be Banach spaces, with W being re-


flexive, and let A : D(A) c V - t W be a linear transformation which is
closed and densely defined. Then A* is also densely defined.

Proof: Let c.p E W** which vanishes on D(A*). It suffices to show that
c.p = O. Since W is reflexive, there exists yEW such that

< c.p, v >w**,w* = < v, y >w*,w


for all v E W*. Thus, we need to show that y = 0 given that <
w, y >w*,w= 0 for all W E D(A*). If not, then (0, y) ~ G(A), the graph
5.4 Separable Spaces 147

of A. Since G(A) is closed, by hypothesis, there exists (f, v) E V* x W*


such that
< f,u >v*,v + < v,A(u) >w*,w = ° (5.3.1)

for all u E D(A) and such that < v, y >w*,w# 0, by virtue of the
Hahn-Banach theorem (d. Corollary 3.2.1). It follows from (5.3.1) that
°
v E D(A*) which then implies that < v, Y >w*,w= which is a contra-
diction . •

Thus, in the circumstances of the above proposition, we can define


the second adjoint A ** = (A *) * from V** into W**. If we now assume
that both V and Ware reflexive, then we can identify V with V** and
W with W** via their respective canonical imbeddings. In this case, we
will then have A** : D(A**) c V ---+ W.

Theorem 5.3.2 Let V and W be reflexive Banach spaces and let A :


D(A) C V ---+ W be a closed and densely defined linear transformation.
Then, A** = A.

Proof: It suffices to show that the graphs G(A) and G(A**) are the
same. Recall that if we define J : W* x V* ---+ V* x W* by J(v, f) =
(-f, v), we then have J(G(A*)) = ( )~ (d. Proposition 4.7.2). Then
( ( )~)~ = ( ( ( ))~ c V x W. Thus, ( ( )~)~ consists of all
(v, w) E V x W such that

< -A*('P), v >v*,v + < 'P, w >w*,w= 0

for all <p E D(A*). This is equivalent to saying that v E D(A**) and
that w = A**(v). Thus,

G(A**) = ( ( )~)~ = G(A) = G(A)

since G(A) is closed and this completes the proof. •

5.4 Separable Spaces

In this section, we will study the relationship between separable spaces


and weak topologies.

Definition 5.4.1 A topological space is said to be separable if it con-


tains a countable dense set. •
148 5 Weak and Weak* Topologies

Proposition 5.4.1 Let V be a Banach space. II V* is separable, then


so is V.

Proof: Let {In} be a countable dense set in V*. Choose {xn} in V such
that
1
Ilxnll = 1 and In(xn) > 2111nll.
Assume, for simplicity, that the base field is R Let W be the linear
subspace generated by the sequence {xn} and let Wo be the set of all
finite linear combinations of the {xn} with rational coefficients. Then
Wo is countable and it is dense in W. So it suffices to show that W is
dense in V. Let I E V* which vanishes on W. We need to show then
that I vanishes on all of V (i. e. I is identically zero). Let E > O. Then,
there exists 1m such that 111-Imll < E, by the density of the {In}. Since
I(xn) = 0 for all n, we have
1
2111mll < Im(xm) (fm - J)(xm) < 111m - III·
Thus,
IIIII ~ 111m - III + Illmll < 3E
from which it follows that I = 0 since E was arbitrarily chosen. •

Example 5.4.1 The converse of the above proposition is not true. We


know that fi = f oo . While fl is separable (the set of all sequences with
only a finite number of non-zero components, all of which are rational,
forms a countable dense set of f 1 ), Coo is not separable. To see this, we
prove that no countable set in foo can be dense. Indeed, let {In} be a
countable set in foo where In = (f~). Define I = (fi) by

Ii = {O, ~f II{I ~ 1,
2, If Iii I < 1.
Then I E foo and III - Inlloo ~ 1 for all n. Thus {In} cannot be dense
in f oo .•
Corollary 5.4.1 Let V be a Banach space. Then V is both separable
and reflexive if, and only ii, V* is both separable and reflexive.
Proof: If V* is both separable and reflexive, then so is V. Conversely, if
V is separable and reflexive, so is J(V) = V**, where J is the canonical
imbedding of V into V**. Thus, it now follows that V* is separable and
reflexive . •
5.4 Separable Spaces 149

Theorem 5.4.1 Let V be a Banach space. Then, V is separable if,


and only if, the weak* topology on B*, the closed unit ball in V*, is
metrisable.

Proof: Assume that V is separable. Let {xn} be a countable dense set


in V. We may assume, without loss of generality, that Xn =Ie 0 for all n
(why?). For f and g E B*, define

(5.4.1)

It is easy to check that d(.,.) is well defined and that it defines a metric
on B*. Let U be a weak* open neighbourhood of fo E B* of the form

U = {f E B* 11(f - fO)(Yi) 1< c, 1:S: i :s: k}


where c > 0 and Yi E V for 1 :s: i :s: k. Since {xn} is dense, there exists
x ni such that IIYi - xnJ < c/4 for each 1 :s: i :s: k. Now choose r > 0
such that
r2 ni IlxnJ < ~ for all 1:S: i :s: k.

Consider the ball Bd(fo; r) in B* provided with the metric defined in


(5.4.1). If f belongs to this ball, i.e. d(f, fo) < r, then for each 1 :s: i :s:
k, we have

l(f - fO)(Yi) 1 < l(f -


fO)(Yi - xnJI + l(f - fo)(xnJI
< 2.~ + 2ni Ilxni Ilr
< c.
Thus Bd(fo; r) c U and so every weak* open set is also open in the
metric topology.
On the other hand, consider a ball Bd(fo; r). Consider the weak*
open neighbourhood of fo given by

Choose c < r /2 and k such that

1 1 r
2:
00

2n 2k < "4
n=k+l
150 5 Weak and Weak * Topologies

If f E Uk' then

Thus, Uk c Bd(fO; r) which shows that every open set in the metric
topology is also weak* open. Thus, the weak* and the metric topologies
on B* are the same.

Conversely, assume that the weak* topology on B* is metrisable.


Consider, for each positive integer n, the ball Bd(O; ~), where d is the
metric defined on B*. This ball then contains a weak* open neighbour-
hood of zero, say Un which can be written in the form

Un = U E B* I If(x)1 < En, for all x E <I>n}

where En > 0 and <I>n is a finite set in V. The set

is then countable and the set E of all finite rational linear combinations of
the elements of D is a countable set which will be dense in the subspace
generated by D. If f E V* is such that f(x) = 0 for all x in the subspace
generated by D, then clearly, f E Un for each n. Thus,

Thus the subspace generated by D is itself dense in V and so the count-


able set E is also dense in V and hence V is separable.
This completes the proof. •

Corollary 5.4.2 Let V be a separable Banach space. Then, every bounded


sequence in V* has a weak* convergent subsequence.

Proof: A bounded sequence in V* is contained in some ball, which is


weak* compact. Since V is separable, the weak* topology on this ball
is metrisable and so the ball is weak* sequentially compact as well. •

In a metric space compactness and sequential compactness are equiv-


alent (d. Proposition 1.2.6); this is not true in a general topological
5.4 Separable Spaces 151

space. Thus a sequence in a compact topological space may fail to have


a convergent subsequence, as the following example shows.

Example 5.4.2 Consider the space £00. Define

for x = (xn) E £00. Then clearly in E £~ and, further, Ilinll£~ = 1,


for all n. The unit ball in £~ is weak* compact, by the Banach-Alaoglu
theorem. Assume, if possible, that there exists a weak* convergent sub-
sequence {Ink} for this sequence. This, in view of Proposition 5.2.2
(i), imples that {x nk } is convergent for every x = (xn) E £00' which
is clearly absurd. Thus {in} cannot have any weak* convergent subse-
quence, event hough it lies in a weak* compact set . •

Example 5.4.3 We know that (cf. Exercise 3.7) Co = £1 and that


£i = £00· Consider the sequence {en} in £1, where en is the sequence
whose n-th entry is unity, all other entries being zero. If this sequence
has a weakly convergent subsequence, then the weak limit has to be zero.
To see this, fix a positive integer l. Then el E £00 and so if we have a
subsequence {e nk } weakly converging to x E £1, it follows that

lim
k --+00
< el, enk >£=, £1 = o.

Thus Xl = 0 for all l, i.e. x = o. But if {e nk } converges weakly in


£1, it also converges in norm by Schur's lemma (cf. Proposition 5.1.5)
and so this subsequence should converge in norm to zero which is im-
possible, since IIen l11 = 1 for all n. Thus {en} has no weakly convergent
subsequence in £1. But since £1 is separable and so Co is also separable
(cf. Proposition 5.4.1) and, by the preceding corollary, {en} must have
a weak* convergent subsequence. In fact, if x = (xn) E CO, we have

which converges to zero. Thus {en} weak* converges to zero. Thus the
weak and weak* convergent sequences in £1 are not the same (while its
norm and weakly convergent sequences are the same). •

Theorem 5.4.2 Let V be a reflexive Banach space. Then every bounded


sequence has a weakly convergent subsequence.
152 5 Weak and Weak* Topologies

Proof: Let {xn} be a bounded sequence in V. Let W = span{ {x n }},


i.e. the smallest closed subspace containing the sequence in question.
Then, W is also reflexive and, by construction, it is separable (why?).
So W* is also reflexive and separable. Then, every bounded sequence in
W** has a weak* convergent subsequence and since W** is also reflexive,
the weak and weak* topologies are the same. In particular, {J(xn)} has
a weakly convergent subsequence in tv'**, where J : W --t W** is the
canonical imbedding, and since J- 1 : W** --t W is an isometry and
hence weakly continuous, {xn} has a weakly convergent subsequence in
W. Since the weak topology in W is the topology induced on W by the
weak topology on V (cf. Exercise 5.1), it follows that this subsequence
converges weakly in V as well. •

Remark 5.4.1 The converse of the above theorem is also true and it is
a deep result due to Eberlein and Smulian: if every bounded sequence
admits a weakly convergent subsequence in a Banach space, then the
space is reflexive. •

5.5 Uniformly Convex Spaces

We know that the unit ball in a normed linear space is convex. How-
ever, the nature of the boundary of this ball depends on the norm. For
instance, in ]R2, with the euclidean metric (i.e. ]R2 = ~), the unit ball is
a very symmetric object which 'bulges uniformly' in all directions. On
the other hand, if we consider ]R2 as CT or as ~, then the unit ball will
be, the rhombus bounded by the lines (±xI) + (±X2) = 1 or the unit
square, respectively. In both these cases, the boundary has a lot of 'flat'
portions. Uniform convexity makes precise the notion of the boundary
'bulging uniformly' in all directions. This is a condition describing the
'geometry' of the norm, but has an important 'analytic' consequence,
which will be the main theorem of this section. It also has important
consequences in the calculus of variations, which we will see in the next
section.

Definition 5.5.1 A normed linear space is said to be uniformly con-


vex if for every E > 0, there exists a 8 > 0 such that whenever we have
x and y E V satisfying

Ilxll :S 1, Ilyll < 1 and Ilx - yll > E,


5.5 Uniformly Convex Spaces 153

it follows that

In other words, given two points on the boundary which are at a


distance of E from each other, then, irrespective of the position of these
points, the mid-point of the chord joining them should lie in the interior,
at a minimum distance away from the boundary, the minimum distance
being prescribed uniformly.
Uniform convexity is stronger than the notion of strict convexity (cf.
Exercise 3.2).

Example 5.5.1 The spaces €l{ and ~ are not uniformly convex. In
fact, they are not even strictly convex. •

Example 5.5.2 The space €!j is uniformly convex. Let x and y E €!j.
Then it is easy to verify that

(5.5.1)

If IIxl12 :::; 1, IIyl12 :::; 1 and Ilx - Yl12 > E, with E sufficiently small, we see
that
1
11 2(x + y) 1122 < 1- 4
E2
= (1 - 8) 2

where

8=I-V 1- E
: .•

Remark 5.5.1 When N = 2, the relation (5.5.1) is the familiar paral-


lellogram law or Apollonius' theorem in plane geometry. The relation
(5.5.1) is also valid for the space €2 and so €2 is also uniformly convex.
In fact, we will see, in Chapter 7, that this relation is valid in any Hilbert
space and so every Hilbert space will be uniformly convex. •

Remark 5.5.2 We will see in Chapter 6, that a relation similar to


(5.5.1) is also valid for the spaces €i/ and €p whenever 2 :::; p < 00
(cf. Proposition 6.2.1) and so all these spaces are uniformly convex. A
similar inequality also holds for 1 < p < 2 but the proof is more difficult.
Thus, all these spaces are uniformly convex. •
154 5 Weak and Weak * Topologies

Theorem 5.5.1 Let V be a uniformly convex Banach space. Then V


is reflexive.

Proof: If J : V ----t V** is the canonical mapping, it is enough to show


that the image of the closed unit ball B in V under J is the closed unit
ball B** in V**. Since V is a Banach space, J(B) is a closed set in V**
and so it suffices to show that it is dense in B** (cf. Remark 5.2.2).
Let 'P E B**. Assume that 11'Pllv** = 1. Let c > o. We will show
that there exists x E B such that

II'P - J(x)llv** < c.


The same will then be true for all elements of B** as well (why?).
Let 8 > 0 correspond to c in the definition of uniform convexity.
Choose f E V*, with Ilfllv* = 1 and such that

(5.5.2)

Define
U = {~E V** I I(~ - 'P)(J) I < 8/2}.
Then U is a weak* open neighbourhood of 'P in V**. Since J(B) is
weak* dense in B** (cf. Proposition 5.2.4), it follows that there exists
x E B such that J(x) E U.
"Assume now that IIJ(x) - 'Pllv** > c. In other words, 'P tf. J(x) +
cB**. Since EB** is weak* compact (by the Banach-Alaoglu theorem),
it is weak* closed and so is its translation by J(x). Thus, there exists
a weak* open neighbourhood UI of'P such that for all ~ E UI , we still
have II~ - J(x)llv** > E. Again, as before, there exists Xl E B such that
J(XI) E un UI by the weak* density of J(B) in B**. Thus,

< 1
{)
I'P(J) - f(x)1
I'P(J) - f(XI)1 < 2"
and so

By virtue of (5.5.2), it follows from the above that


5.5 Uniformly Convex Spaces 155

which contradicts the uniform convexity since J(xd E U1 and so

ilx - xiliv = IIJ(x) - J(xl)llv" > E

while Ilxllv = Ilxlllv ~ 1. Thus, it follows that IIJ(x) - 'f'llv" ~ E which


shows that J(B) is dense in B** as already observed . •

Remark 5.5.3 The converse of this theorem is not true. A reflexive


space need not be uniformly convex. For instance, ef
is not uniformly
convex, but since it is finite dimensional, it is reflexive . •

Proposition 5.5.1 Let V be a uniformly convex Banach space. Let


Xn ~x in V. Assume that

lim sup Ilxnll < Ilxll. (5.5.3)


n->oo

Then Xn - t x in V.

Proof: We already know that (d. Proposition 5.1.2 (iii))

liminf
n->CXl
Ilxnll ~ Ilxll. (5.5.4)

Thus, by (5.5.3) and (5.5.4), we deduce that Ilxnll - t Ilxll. If x = 0,


this completes the proof. Assume now that x -# o. Then, from the
convergence of the norms, we deduce that (for large n), Xn -# o. Set
Yn = xn/llxnli and Y = xllixil. Observe then that, by hypothesis and
the convergence of the norms of x n , it follows that Yn ~ Y in V. The
proof will be complete if we show that Yn - t y.
Since Yn ----'- y, we have

Thus we have

Hence, by uniform convexity, if E > 0 is an arbitrary number, we must


have llYn - yll ~ E for n sufficiently large. This proves that Yn - t Y and
hence tha.t Xn - t x in V as already observed. •
156 5 Weak and Weak* Topologies

5.6 Application: Calculus of Variations

In this section, we will apply the results of the preceding sections to


obtain some important results in the calculus of variations, which can
be described as the theory of optimization in infinite dimensional spaces.
Proposition 5.6.1 Let V be a reflexive Banach space and let K c V
be a non-empty, closed and convex subset. Let rp : K ~ lR be a convex
and lower semi-continuous function. Assume further that

lim rp(x) = +00. (5.6.1)


Ilxll--too
Then, rp attains a minimum in K.
Proof: Fix Xo E K and let AO = rp(xo) < 00. Set

K= {x E K I rp(x) :s Ao}.
Since rp is lower semi-continuous and convex, it follows that K is closed
and convex. Further, by (5.6.1), it follows that K is bounded as well.
Thus, K is weakly closed (d. Proposition 5.1.4). Now, let {xn} be
a minimizing sequence in K for rp, i.e., rp(x n ) ~ infxEK rp(x). Then,
since the sequence is bounded and since V is reflexive, it has a weakly
convergent ~ s n , say, {x nk }, onv in~ weakly to some x E V.
But since K is weakly closed, we have x E K. Further, by the lower
semi-continuity and convexity of rp, it follows that rp is also weakly lower
semi-continuous (d. Corollary 5.1.1) and so

in[ rp(y).
yEK

Thus,
rp(x) = migrp(y).
yEK

If Z E K\K, then rp(z) > AO :::: rp(x) so that

rp(x) = minrp(y).
yEK

This completes the proof. •

Remark 5.6.1 The condition (5.6.1) is usually called the condition of


coercivity of the function(al) rp. Thus, a coercive, convex and lower semi-
continuous functional defined on a non-empty, closed and convex subset
5.6 Application: Calculus of Variations 157

of a reflexive Banch space always attains a minimum. The method of


proof used above is usually known as the direct method of the calculus
of variations. A minimizing sequence is shown to have a convergent
subsequence (in a suitable topology) and the limit is shown to be the
desired minimum. •

Remark 5.6.2 In the proof of the preceding proposition, the coercivity


condition was really needed only when K was not bounded.
In a metric space, the lower semi-continuity of a function 'P is equiv-
alent to the condition that if Xn -+ x, then

'P(x) :::; liminf 'P(x n ). (5.6.2)


n-+oo

However, in a general topological space, the lower semi-continuity im-


plies the above relation but the converse is not true. A function which
satisfies the above relation for all convergent sequences is called sequen-
tially lower semi-continuous. In the context of the weak topology, we can
thus say that 'P : V -+ lR is weakly sequentially lower semi-continuous
if whenever Xn -->. X in V, we have that (5.6.2) is true. Thus, the pre-
ceding proposition holds even when 'P is only weakly sequentially lower
semi-continuous, since the coercivity condition would imply that every
minimizing sequence is bounded and the rest of the proof follows as be-
fore . •

The following result is an immediate consequence of the preceding


proposition.

Theorem 5.6.1 Let V be a reflexive Banach space and let K be a closed


convex subset of V. Then, for any x E V, there exists y E K such that

Ilx - yll = min Ilx


zEK
- zll· (5.6.3)

Further, if V is also uniformly convex, then such a y is unique.

Proof: The functional z f--t Ilx - zll is clearly coercive, convex and
weakly lower semi-continuous. Thus the existence of y follows from the
preceding proposition. Assume that V is uniformly convex. Assume
that there exist Yi E K, i = 1,2 such that

a = Ilx - Ylil = Ilx - Y211 = min Ilx


z EK
- zll·
158 5 Weak and Weak * Topologies

Let us assume that IIYl - Y211 > E > 0. Then

II(x - Yl) - (x - Y2)11 > E.

Then, by the uniform convexity, we have that

for some J > 0. Since K is convex, we have ~( l + Y2) E K and so the


above relation contradicts the minimality of Yl and Y2. Thus it follows
that Yl = Y2 and the proof of the uniqueness is complete . •

Example 5.6.1 In general we have non-uniqueness of the minimizer


for the problem (5.6.3) if the space is not uniformly convex. Consider
the space £r (which is ffi.2 with the norm 11.lld. It is reflexive since it
is finite dimensional, but it is not uniformly convex. Consider the set
K = B, the closed unit ball in .er,
which is a closed and convex set. Let
x = (1,1). Let Y = (a, b) E K. Then,

Ilx - ylh 11 - al + 11 - bl
> 1- lal + 1- Ibl
2- (Ial + Ibl)
> 1.
However, if a + b = 1, a 2: 0, b 2: 0, then, for all such points Y = (a, b),
we have
Ilx - ylll = 1- a + 1- b = 1.
Thus we have uncountably many Y which satisfy (5.6.3) . •

If V were not reflexive, then we can guarantee neither the existence,


nor the uniqueness.

5.7 Exercises

5.1 Let V be a Banach space and let W be a closed subspace of V.


Show that the weak topology on W is the topology induced on W by
the weak topology on V.

5.2 Let V be a Banach space and let W be a subspace of V. Show that


the closure of W under the weak topology coincides with W, the closure
5.7 Exercises 159

of W in the norm topology.

5.3 Let V be a Banach space and let W be a subspace of V. Show that


W l.. C V* is weak* closed.

5.4 Let V be a Banach space. Show that V with its weak topology and
V* with its weak* topology are both locally convex topological vector
spaces (cf. Remark 3.2.2).

5.5 Use the preceding exercise and Remark 3.2.2 to show that if V is a
Banach space and if W is a subspace of V*, then the weak* closure of
W is (W l..)l... (Compare this with Exercise 3.9.)

5.6 Let V be a Banach space.


(a) Show that Xn -' x in V if, and only if,
(i) {llxnll} is bounded and (ii) f(xn) - t f(x) for all f E S, where S is a
subset of V* whose span is dense in V*.
(b) Show that f n ~ f in V* if, and only if,
(i) {llfnll} is bounded and (ii) fn(x) - t f(x) for all XES, where S is a
subset of V whose span is dense in V.

5.7 Let 1 < p < 00. Let Xn = (xh) and x = (xj) be elements of €p. Show
that Xn -' x in €p if, and only if, xh - t xj for every positive integer j.

5.8 Let V and W be Banach spaces and let T : V -t W be a linear map.


Show that the following are equivalent:
(i) If Xn - t x in V, then T(xn) - t T(x) in W.
(ii) If Xn -' x in V, then T(xn) -' T(x) in W.
(iii) If Xn - t x in V, then T(xn) -' T(x) in W.

5.9 Let V and W be Banach spaces. Let one of them be provided


with the norm topology and the other with the weak topology. Let
T : V - t W be a linear map. Show that T is continuous if, and only if,
T E C(V, W).

5.10 Define T : CO -t €1 as follows: let x = (xn) E CO and

T(x) = (~~).
Show that T E C(CO,€1)' If B is the closed unit ball in CO, show that
160 5 Weak and Weak * Topologies

T(B) is not closed in £1.

5.11 Let V be a reflexive Banach space. If W is a Banach space and


if T E £(V, W), show that T(B) is closed in W, where B is the closed
unit ball in V.

5.12 Show that £1 does not contain an infinite dimensional subspace


that is reflexive.

5.13 Let 1 ::; p < 00. Show that £p is separable.

5.14 Show that e[O, 1] is separable.

5.15 Let V = C[O, 1] and let K cV be defined by

K ~ {/ E V I la' /(t) dt - {/(t) dt 1} .

(a) Show that K is a closed and convex subset of V.


(b) Show that
inf
JEK
Ilfll = 1.

(c) Show that K does not admit an element with minimal norm.
(d) Deduce that C[O, 1] is not reflexive.

5.16 Let V be a reflexive real Banach space and let a(.,.) : V x V ~ lR


be a continuous bilinear form (cf. Example 4.7.4). Assume that a(.,.) is
°
V-elliptic (or, coercive), i. e. there exists a > such that, for all x E V,

Let x E V. Define A(x) : V ~ lR by

A(x)(y) = a(x, y).

(a) Show that A(x) E V* for every x E V.


(b) Show that A E £(V, V*).
(c) Show that for every x E V,

IIA(x)11 > allxll·


5.7 Exercises 161

(d) Show that A : V ~ V* is surjective.


(e) Deduce that, for every f E V*, there exists a unique x E V such that

a(x, y) = f(y) (5.7.1)

for all y E V.

5.17 In the preceding exercise, assume further that a(.,.) is symmetric


i.e. a(x,y) = a(y,x) for all x and y E V. Let f E V*. Define, for x E V,
1
J(x) = 2"a(x, x) - f(x).

(a) For any closed convex subset K c V, show that there exists x E K
such that
J(x) = min J(y). (5.7.2)
yEK

(b) Show that x E K satisfies (5.7.2) if, and only if,

a(x,y-x) 2: f(y-x) (5.7.3)

for every y E K.
(c) Show that the solution x E K of (5.7.3) (and hence, that of (5.7.2))
is unique.
(d) If K is a closed convex cone (cf. Definition 3.4.1), show that the
solution x E K of (5.7.3) is characterized by

a(x, x) = f(x) and a(x, y) > f(y)

for all y E K.
(e) If K = V, show that the solution x of (57.3) is the solution of
(5.7.1).
Chapter 6

LP Spaces

6.1 Basic properties

The Lebesgue spaces, also known as the V spaces, constitute a rich


source of examples and counter-examples in functional analysis. They
also form an important class of function spaces when studying the ap-
plications of mathematical analysis. In this chapter, we will study the
important properties of these spaces from the functional analytic point
of view.
Let (X,S,J.-L) be a measure space (cf. Section 1.3). Let f : X ~ lR
be a real valued measurable function defined on X. Let 1 ~ p < 00. We
define

(L Ifl
1

Ilfllp = P dJ.-L) P (6.1.1)

and we say that f is p-integrable (integrable, if p = 1 and square


integrable, if p = 2) if Ilfllp < 00. Next, let M > O. We set

{If I > M} = {x E X Ilf(x)1 > M}.


We now define

Ilflloo = inf{M > 0 I J.-L( {If I > M}) = O} (6.1.2)

and we say that f is essentially bounded if Ilflloo < 00.


Proposition 6.1.1 (HOlder's Inequality) Let 1 ~ p < 00 and let p*
be the conjugate exponent. If f is p-integrable and g is p* -integrable
(essentially bounded, if p = 1), then

(6.1.3)
6.1 Basic Properties 163

Proof: If p = 1, then p* = 00. Then

If(x)g(x)1 :S If(x)I·llglloo
for almost every x E X and then (6.1.3) follows on integrating this
inequality over X.
Let us now assume that 1 < p < 00 so that 1 < p* < 00 as well. The
relation (6.1.3) is trivially true if Ilfllp (respectively, Ilgllp') equals zero,
for then f (respectively, g) will be equal to zero almost everywhere. So
we assume further that Ilfllp i- 0 and that Ilgllp' i- O. Then (cf. Lemma
2.2.1)
1 1 •
If(x)g(x)1 :S -If(x)I P + -lg(x)IP
p p*
for all x E X. Assume now that Ilfllp = Ilgllp' = 1. Then, integrating
the above inequality over X, we get

ixr Ifgl dp, :S ! +~


P p*
= 1.

For the general case, apply the preceding result to the functions f Illfllp
and g/llgilp' to get (6.1.3) . •

Remark 6.1.1 When p = p* = 2, once again (6.1.3) is known as the


Cauchy-Schwarz inequality . •
Proposition 6.1.2 (Minkowski's Inequality) Let 1 :S p :S 00. Let f
and 9 be p-integrable. Then f +9 is also p-integrable and
(6.1.4)
Proof: We assume that Ilf + gllp i- 0, since, otherwise, the result is
trivially true. Since the function t f--t Itl P is convex for 1 :S p < 00, we
have that
If(x) + g(x)IP :S 2P- 1 (lf(x)IP + Ig(x)IP)
from which it follows that f + 9 is also p-integrable. Thus, if 1 <p< 00,
we have

i If + glP dp, :S Ix If + glP-1lfl dp, + Ix If + glP-1lgl dp,.

We apply Holder's inequality to each of the terms on the right-hand side.


Notice that If(x) + g(x)l(p-l)p' = If(x) + g(x)IP by the definition of p* .
Thus If + glP-l is p* -integrable and
164 6 £P Spaces

Thus,

..P...-
Dividing both sides by Ilf + gil$' and using, once again, the definition
of p*, we get (6.1.4). The cases where p = 1 and p = 00 follow trivially
from the inequality

If(x) + g(x)1 < If(x)1 + Ig(x)l·


This completes the proof. •

It is now easy to see that the space of all p- integrable functions


(1 ~ p < (0) and that of all essentially bounded functions are vector
spaces and that the map f f---+ Ilfllp for 1 ~ p ~ 00 verifies all the
°
properties of the norm, except that Ilfllp = does not imply that f = 0,
but that f = 0 almost everywhere (a.e. ; cf. Section 1.3).
Given two measurable functions f and g, we say that f rv 9 if f = 9
almost everywhere, i.e. f(x) = g(x) everywhere, except over a subset
of measure zero. This defines an equivalence relation. If f rv g, then for
1 ~ p ~ 00 , we have that Ilfllp = Ilgllp. Further the set of all equiva-
lence classes forms a vector space with respect to pointwise addition and
scalar multiplication defined via arbitrary representatives of equivalence
classes. In other words, if 11 rv 12 and if gl rv g2, then 11 + gl "" 12 + g2
and, for any scalar a, we also have al1 rv ah and so on. Since 11.llp is
also constant on any equivalence class , we can define the 'norm' of an
equivalence class via any representative function of that class. Further,
if Ilfllp = 0, then f will belong to the equivalence class of the function
which is identically zero. Thus the set of all equivalence classes, with
11.llp, becomes a normed linear space.
Definition 6.1.1 Let (X, S, p,) be a measure space. Let 1 ~ p < 00.
The space of all equivalence classes, under the equivalence relation de-
fined by equality of functions almost everywhere, of all p-integrable func-
tions is a normed linear space with the norm of an equivalence class
being the 11.llp-'norm ' of any representative of that class. This space is
denoted £P (p,). The space of all equivalence classes of all essentially
bounded functions with the norm of an equivalence class being defined as
the 11.1100 -'norm' of any representative of that class, is denoted Loo(p,) . •

While we may often talk of ' £P - functions' we must keep in mind that
we are really talking about equivalence classes of functions and that we
6.1 Basic Properties 165

carry out computations via representatives of those equivalence classes.

Notation We will denote elements of V(J.L) by lower case Roman letters


in sanserif font and a generic representative of the equivalence class it
represents by the same lower case Roman letter in italicised font. Thus if
we have f E V(J.L), a generic representative will be f and so, for instance,

for 1 :S p < 00.


Similarly, the equivalence class of a function f will be denoted by f.

Notation If X = 0, an open set of lRn provided with the Lebesgue


measure, then the corresponding spaces V(J.L) will be denoted V(O).
In particular, if lR is provided with the Lebesgue measure and if (a, b) is
an interval, where -00 S a < b :S +00, then the LP spaces on (a, b) will
be denoted V(a, b).

Example 6.1.1 Let X = {1, 2,· .. ,n}. Let S be the collection of all sub-
sets of X and let J.L be the counting measure (cf. Example 1.3.1). Then
a measurable function can be identified with an n-tuple (aI, a2,··· ,an).
In this case LP(J.L) = e;. Notice that in this example, equality almost
everywhere is the same as equality everywhere and so every equivalence
class is a singleton. •

Example 6.1.2 Let X = N, the set of all natural numbers and let S
be the collection of all subsets of X. Let J.L be the counting measure. In
this case, functions are identified with real sequences and LP(J.L) = ep .
Again, in this example, equivalence classes are just singletons . •

Proposition 6.1.3 Let (X,S,J.L) be a finite measure space, i.e. J.L(X) <
00. Then

with the inclusion being continuous, whenever 1 :S q :S p.

Proof: The result is trivial if p = 00. Let 1 :S q <p < 00 and let
166 6 LP Spaces

f E LP(Jl')' Then, by Holder's inequality, we have

Ilfl q dJi- < (Ix(lfl )~ i )~ (Ix i )l ~


(Ix Ifl P dJi-) ~ (Ji-(X))l-!
IlflIZ. (Ji-(X)) 1-!
which yields

where

This completes the proof. •

Example 6.1.3 No such inclusions hold in infinite measure spaces. For


instance, the sequence (~) belongs to £2 but not to £1 . •

Example 6.1.4 Nothing can be said about the reverse inclusions. For
example, if f(x) = 1/ y'X, then f E L1(0, 1) but f tj L2(0, 1). However,
we know that (cf. Exercise 2.25) £p C £q for all 1 :S p < q :S 00 . •

Theorem 6.1.1 Let (X, 5, Ji-) be a measure space. Let 1 :S p :S 00.


Then LP(Ji-) is a Banach space.

Proof: Case 1. Let 1 :S p < 00. Let {fn } be a Cauchy sequence in


LP(Ji-). It is enough to show that there exists a convergent subsequence
(why?). Choose a subsequence such that

1
Ilfnk - fnk+lllp :S 2k ·

Set
n
gn(x) L Ifnk+Jx) - fn,,'(x) I
k=l
and
00

g(x) L Ifnk+l (x) - fnk (x)l·


k=l
Then
Ilgnllp < 1.
6.1 Basic Properties 167

It follows that gn(x) ----+ g(x) and, by the monotone convergence theorem
(cf. Theorem 1.3.1), we see that Ilgllp :S 1. In particular, g(x) < 00
almost everywhere. Further, if k 2 I 2 2, we have

Ifnk(X) - fnl(X)1 < Ifnk(X) - fnk-l (x)1 + ... + Ifnl+l (x) - fnl(X)1
< g(x) - gZ-l(X).
Thus, it follows that, for almost every x EX, {fnk (x)} is a Cauchy
sequence and converges almost everywhere to a finite limit f(x) and
that, for such x,
If(x) - fnk (x)1 :S g(x)
for k 2 2. Set f 0 elsewhere, which is a set of measure zero. It
then follows that f is p- integrable. Further, Ifnk (x) - f (x)IP ----+ 0 almost
everywhere and is bounded by Ig(x)IP which is integrable. Hence, by the
dominated convergence theorem (cf. Theorem 1.3.3), we deduce that
Ilfnk - flip ----+ O. Thus we have that

in V(p,).

Case 2. p = 00. Let {fn} be Cauchy in LOO(p,). Then, for each k, there
exists a positive integer Nk such that

for all m, n 2 N k . Thus, there exists a set Ek of measure zero, such that

for all m, n 2 Nk and for all x E X\Ek. Setting E = U~l E k , we see


that E is of measure zero and for all x E X\E, the sequence {fn(x)} is
a Cauchy sequence in R Thus, for all such x, fn(x) ----+ f(x). Passing to
the limit as m ----+ 00, we see that, for all x E X\E, and for all n :2 Nk,

1
If(x) - fn(x)1 :S k·
Hence, it follows that f is essentially bounded and that fn ----+ f in LOO(p,).

This completes the proof. •


168 6 LP Spaces

Corollary 6.1.1 Let (X,5,J.L) be a measure space and let fn - t f in


LP(J.L) for some 1 S p S 00. Then, there exists a subsequence fnk such
that
(i) fnk (x) - t f(x) almost eveTywhere.
(ii) Ifnk(X)1 S h(x) almost everywhere for some h E LP(J.L).
Proof: The result is obvious in the case p = 00. Let 1 S p < 00. Then,
as in the case of the preceding theorem, we have a subsequence {fnk }
which converges to a function f in LP(J.L)and ~so such that [nk (x) - t f(x)
almost everywhere. It is then clear that f = f, i.e. f = f almost
everywhere and this proves (i). To see (ii), take h = + g, where 9 is 1
as in the proof of the preceding theorem. •

6.2 Duals of £P Spaces

In Chapter 3, we identified the dual of the space Cp with Cp ' where


1 S p < 00 and p* is the conjugate exponent of p. Similar results are
true in more general LP spaces.
Proposition 6.2.1 (Clarkson's Inequality) Let (X, 5, J.L) be a mea-
sure space and let 2 S p < 00. Then if f and g E LP (J.L) ,

11~(f + g)l[ + 11~(f -g)l[ ~ ~ (1Ifll~ + Il ll~). (6.2.1)

Proof: Consider the function


'1'( x) = (x 2 + 1) ~ - x P - 1
for x ~ O. Then it is simple to check that '1'(0) = 0 and that 'P'(x) >0
for x > 0 when p ~ 2. Thus, it follows that for all x ~ 0,
(x 2 + 1) ~ ~ xP + 1,
when p ~ 2. Hence, if Q: and {3 are positive real numbers, we have
(Q:2 + (32) ~ ~ Q:P + {3P.
Combining this with the fact that the function t f--t t~ is convex on the
set {t E lR I t ~ O}, we get, for any x E X and for any f and g E LP(J.L),

1f(x)~ (x) P + 1f(x);g(x)


I IP < (I f(x)~ (x) 12 + 1 f(x);g(x) 12) ~
p.
(If(x)l2~l (X)12) 2

< 1(lf(x)IP + Ig(x)IP)


6.2 Duals of LP Spaces 169

which yields (6.2.1) on integration over X . •

Corollary 6.2.1 Let (X,S,J.L) be a measure space. Then, the spaces


LP(J.L) are reflexive when 2 ::; p < 00.

Proof: Arguing as in Example 5.5.2, it is easy to see that (6.2.1) implies


that V(J.L) is uniformly convex when 2 ::; p < 00. The reflexivity now
follows from Theorem 5.5.1. •

Theorem 6.2.1 (Riesz Representation Theorem) Let (X, S, J.L) be


a measure space and let 1 < p < 00. Let p* be the conjugate expo-
nent. Then the dual of V (J.L) is isometrically isomorphic to LP* (J.L). In
particular, the spaces LP(J.L) are reflexive for all 1 < p < 00.

Proof: Step 1. Let g E V* (J.L). Define Tg : V (J.L) ---+ lR by

Tg(f) = Ix fg dJ.L

for f E V(J.L). Clearly, Tg is a linear functional, and, by Holder's in-


equality, it is continuous as well. In fact, we have

Now, consider the function

f(x) = { Ig(xW*-2 g (x), if g(x) i= 0


0, if g(x) = O.
Then Ifl P = Igl(p*-l)p = IgIP* so that f is p-integrable. Also

Tg(f) = Ix IgI P * dJ.L

from which we deduce that

Thus, the map g 1---+ Tg is an isometry from V* (J.L) into LP(J.L)*. Hence
its image is closed. It is enough now to show that the image is dense.

Step 2. We now show that V(J.L) is reflexive for all 1 < p < 00. This
has already been proved for 2 ::; p < 00. Thus LP(J.L)* is also reflexive
for such p and so is every closed subspace of this dual space. Thus, by
170 6 V Spaces

the preceding step, LP' (J.L) which is isometrically isomorphic to a closed


subspace of the dual of LP(J.L) is also reflexive for 2 S p < 00. But then
1 < p* S 2. This proves that V(J.L) is also reflexive when 1 < P S 2.
This establishes the claim.

Step 3. We are now in a position to show that the isometry g f-..t Tg


from V' (J.L) to the dual space LP(J.L)* is onto. As already observed, the
image is a closed subspace and we now show that it is dense. Indeed,
let <p E V(J.L)** vanish on the image. We need to show that <p is the
zero functional. Since all the V(J.L) are reflexive, this means that there
exists f E V(J.L) such that, for all g E V' (J.L), we have

ixfgdJ.L = o.

Once again, choosing 9 = Ifl p - 2 f (and equal to zero where f vanishes)


we deduce that f = O. This completes the proof. •

Remark 6.2.1 We have seen earlier that ti = too. In the same way, it
is true that for O"-finite measure spaces, we have L 1 (J.L)* = Loo(J.L). How-
ever, the proof of this result relies on very measure theoretic arguments
and we shall omit it. Nevertheless, in the next section, we will prove it
for a very important class of L1 spaces . •

6.3 The Spaces V(O)

In this section, we will study the properties of a very important class of


V spaces defined on open sets in the Euclidean spaces ]RN.
Let n c ]RN be an open set. Consider the Lebesgue measure on this
set. Then, as mentioned in Section 6.1, we will denote the corresponding
V spaces by V(n).

In the sequel, if we say that a certain function space is contained in


(respectively, is dense in), V(n), we will understand that we are talking
about the set of all equivalence classes of functions in that space being
contained in (respectively, being dense in) V(n).

Proposition 6.3.1 Let S be the set of all simple functions which vanish
outside a set of finite measure. Then S is dense in V(n) for 1 S p < 00.
6.3 The Spaces V(n) 171

Proof: Let tp E S. Since tp vanishes outside a set of finite measure,


it is automatically p-integrable for 1 ::; p < 00. Let f 2: 0 be a p-
integrable function. Then, there exists a sequence {tpn} of non-negative
simple functions which increase to f (d. Proposition 1.3.2). Since f is
p-integrable, so is tpn and so tpn will also vanish outside a set of finite
measure. Further
Itpn(x) - f(x)jP ::; 2Plf(x)jP
for x E n and, since Ifl P is integrable, it follows from the dominated
convergence theorem that

In Itpn - flP dx -t 0

as n - t 00. If f is an arbitrary p-integrable function, then we have


sequences {tpn} and {'¢n} of simple functions vanishing outside sets of
finite measure and such that

Thus Xn = tpn - '¢n is a simple function which vanishes outside a set of


finite measure and
In IXn - flP dx -t 0

as n - t 00. This proves the result . •


Theorem 6.3.1 Let 1 ::; p < 00. Let n c
~N be open. Then, Cc(n),
the space of all continuous functions with compact support contained in
n, is dense in V(n).
Proof: By the preceding proposition, we know that S is dense in LP(n).
Thus, given tp E S, it is enough to show that it can be approximated (in
the V-norm) as closely as we wish by a continuous function with com-
pact support. Indeed, let E > O. By Lusin's theorem (d. Royden [5]),
we can find a continuous function g, with compact support, such that
9 = tp except possibly on a set whose measure is less than E and also
such that Ig(x)1 ::; Iltplloo. Then

In Ig - tplP dx ::; 2Plltpll~E.


This shows that Cc(n) is dense (with respect to the norm II· lip) in S
which, in turn, is dense in V(n). This proves the result . •
172 6 V Spaces

Remark 6.3.1 In fact it can be shown that the space of infinitely dif-
ferentiable functions with compact support contained in 0 is dense in
V(O) for 1 :S p < 00. For this we need to develop the theory of convolu-
tion of functions (cf. Theorem 6.3.3 below). For details, see Kesavan [3] .

Corollary 6.3.1 Let 0 C jRN be an open set. Let 1 :S p < 00. Then,
V(O) is separable.

Proof: Recall that, by the Weierstrass approximation theorem, a con-


tinuous functiuon on a compact set can be uniformly approximated by
means of a polynomial and hence, by a polynomial with rational coeffi-
cients and such polynomials form a countable set.
We can write
o = U~=lOn
where On = OnB(O; n); here B(O; n) is the ball centred at the origin and
with radius n in jRN. Notice that On is bounded and is hence relatively
compact, i. e. On is compact.
Let c > 0 and let! be p-integrable over O. Then, by the preceding
theorem, we can find a continuous function g, with compact support
such that II! - gllp < c. Since the support of 9 is compact, its support
will lie in some On. Thus, we can find a polynomial p with rational
coefficients such that, for all x EOn,

Ig(x) - p(x)1 <

where IOnl denotes the (Lebesgue) measure of On. Setting p = 0 outside


On, we then see that Ilg - pllp < c so that II! - pllp < 2c. Thus any
p-integrable function can be approximated in the norm 11.llp by means of
a function which vanishes outside some On and is equal to a polynomial
with rational coefficients inside On. The collection of all such functions
being countable, we deduce that V(O) is separable for 1 :S p < 00 . •

Proposition 6.3.2 Let 0 C jRN be an open set. Then, LOO(O) is not


separable.

Proof: Let x E O. Let r r(x) > 0 be chosen such that the ball
B(x; r) c O. Define

I, if Y E B(x;r)
Xx(y) {
0, otherwise.
6.3 The Spaces LP (0,) 173

Set
Ux = {f E LOO(n) I Ilf - Xxlloo < 1/2}.
Then, for each x E 0" Ux is a non-empty open subset of LOO(n). Further,
if x i- y, then Ux n Uy = 0. For, if f belonged to their intersection,
then on one hand, since f E Uy, we have that If(z)1 < 1/2 in a small
neighbourhood of x. On the other hand, since f belongs to Ux, it follows
that If(z)1 > 1/2 in a small neighbourhood of x, which is a contradiction.
Now let E = {fn} be any countable set in LOO(n). If such a set were
dense, then En Ux i- 0 for each x E n. However, any fn can belong
to at most one such open set Ux since the sets Ux are pairwise disjoint.
This is a contradiction since the number of open sets Ux is uncountable.
Thus, no countable set in LOO(n) can be dense . •

Definition 6.3.1 Let 0, C ]RN be an open set and let f : 0, ---t ]R be a


measurable function. We say that f is locally integrable if JK If I dx <
00 for every compact set Ken . •

We denote the set of all locally integrable functions on 0, by LfoJn).

Proposition 6.3.3 Let f E Lfoc(n) be such that

In fg dx = 0

for all 9 E Cc(n). Then f = 0 almost everywhere in n.

Proof: Step 1. We first assume that f is integrable on 0, and that 10,1,


the measure of 0" is finite. Let c > O. Then, there exists a continuous
function h, with compact support, such that II f - hill < c (cf. Theorem
6.3.1). Thus, if 9 E Cc(n), we have

lin hgdXI = lin (h - f)g dxl < cllglloo. (6.3.1)

Let
Kl = {xEnlh(x)2:c}
K2 = {x En I h(x) :s; -c}.

Then, Kl and K2 are disjoint and compact sets (since h is a continu-


ous function with compact support) and by Urysohn's lemma, we can
construct a continuous function h, also with compact support such that
174 6 LP Spaces

h == 1 on KI and h == -Ion K2. Further, we can also have Ih(x)1 :S 1


for all x E !1. Set K = KI U K 2. Then

in
r hh dx = r
in\K
hh dx + r hh dx,
iK
whence, in view of (6.3.1), we have

r Ihl dx
iK
=
iK
r hh dx :S E + r
in\K
Ihhl dx < E + r
in\K
Ihl dx.

Since Ih(x)1 :S E on !1\K, we deduce that

In Ihl dx IK Ihl dx + In\K Ihl dx


< E + 2 In\K Ihl dx
< E + 2EI!1I.
Thus,
IIfll1 :S IIf - hill + IIhlll :S 2E + 2EI!1I·
Since E is arbitrary, it follows that f(x) = 0 almost everywhere in !1.

Step 2. In the general case, we again write !1 = U~=I 1n where !1 n =


!1 n B(O;'n). Then applying the result of Step 1 to the restriction of
f to !1 n , denoted finn' we get that finn = 0 almost everywhere in !1 n
from which it immediately follows that f = 0 almost everywhere in!1. _

Let us now turn our attention to the space LI(!1).

Theorem 6.3.2 (Riesz Representation Theorem) Let !1 C ]RN be


an open set. The dual of the space LI(!1) is isometrically isomorphic to
Loo(!1).

Proof: Step 1. There exists wE L2(!1) such that w(x) ~ EK > 0 for all
x E K for every compact subset K of!1. Indeed, define w(x) = an > 0
on the set
En = {x E !1 I n :S Ix I < n + I},
where Ixl denotes the Euclidean norm of the vector x E ]RN. Now choose
the constants an such that
00

L ~IEnl < 00,


n=O
6.3 The Spaces V(n) 175

where IEnl denotes the (Lebesgue) measure of the set En. Then w has
the required properties.

Step 2. Let <P E L1(n)*. Consider the mapping f f--+ <p(wf) from L2(n)
into R Clearly, this defines a linear functional which, by Holder's in-
equality, is also continuous. Thus, by the Riesz representation theorem
(cf. Theorem 6.2.1) applied to the case p = 2, there exists v E L2(n)

In
such that
<p(wf) = fv dx

for all f E L2(n). Thus, we have

lin fv dxl s 11<p11·llwfI11. (6.3.2)

Step 3. Set u(x) = v(x)/w(x) for x E n. Since w never vanishes, this


is well defined and u is measurable. We claim that U E DX)(n) and that
s
Ilull oo 11<p11. To see this it is sufficient to show that, for any constant
C> 11<p11, we have that the set

A = {x E 0. Ilu(x)1 > C}

is of measure zero.
Assume the contrary for some such C > 11<pII.Then, there exists a
subset B of A of finite and positive measure. Consider the function

+1, if x E Band u(x) > 0,


f(x) = { -1, if x E Band u(x) < 0,
0, if x E n\B.

Clearly, f is square integrable (since the measure of B is finite) and we


can use it in (6.3.2). We then get

l'u,w dx S "<p'll w dx

and, using the definition of A, which contains B, we get

C l w dx s 1'<p'll w dx

which is a contradiction to the choice of C, since IE w dx > 0.


176 6 LP Spaces

Step 4. Thus we now have u E DXl(O) with Iluli co :S 11<p11 such that, for
all f E L2(0),
<p(wf) = in fuw dx.

Let 9 E Cc(O). Then, by choice of w, f = glw is square integrable and


so, we get, for all 9 continuous with compact support in 0,

<p(g) = in ug dx. (6.3.3)

Since Cc(O) is dense in L1(0), the above relation also holds for all g E
L1(0). Further, it follows that

for all g E L1(0) from which we deduce that 11<p11 :S Iluli co .

Step 5. Thus, for every <p E L1(0)*, we have u E LCO(O) such that
11<p11 = Ilull oo and such that (6.3.3) holds for all g E L1(0). Such a u is
unique as well. Indeed if we have two essentially bounded functions U1

l
and U2 such that
g(U1 - U2) dx = 0

for all g E L1(0), then it is in particular true for all 9 E Cc(O) and, since
essentially bounded functions are locally integrable, it follows that (cf.
Proposition 6.3.3) U1 -U2 = 0 almost everywhere, i.e. U1 = U2 in Loo(O).

Step 6. If u E L 00 (0), then if we define Tu as a linear functional on L1 (0)


via the right-hand side of (6.3.3), then we have just seen that u I-t Tu is
surjective and that it is an isometry between Loo(O) and L1(0)*. This
completes the proof. •

Proposition 6.3.4 Let 0 C ]RN be an open set. Then, L1(0) is not


reflexive.

Proof: Without loss of generality, assume that 0 contains the origin.


Let n be sufficiently large so that the ball centred at the origin and of
radius lin, denoted B n , is contained in O. Let an = IBnl-1, where, as
usual, IBnl denotes the (Lebesgue) measure of Bn. Let fn(x) = an for
all x E Bn and let it vanish on O\Bn . Then fn E L1(0) and IIfnl11 = 1
for all n. If L1(0) were reflexive, then the sequence {fn} would contain
6.3 The Spaces LP(n) 177

a weakly convergent subsequence (cf. Theorem 5.4.2), say {f nk }. Let f


be its weak limit. Then, for every h E vx>(n) we must have

(6.3.4)

Now choose h E Cc(n\ {O}). Then, for sufficiently large k, we have that

l/nkh dx = 0

(since the two functions in the integrand will then have disjoint sup-
ports) and so, it follows that, for all such h, we have In
fh dx = O. By
Proposition 6.3.3, it then follows that f(x) = 0 almost everywhere in
0, \ {O} and so f (x) = 0 almost everywhere on 0, as well. On the other
hand, if we choose h(x) = 1 for all x E 0, in (6.3.4), we get In
f dx = 1,
which is a contradiction. Thus, L1 (0,) is not reflexive . •
Corollary 6.3.2 Let 0, C ]RN be an open set. Then, LOO(n) is not
reflexive.
Proof: Since L1(n)* ~ vx>(n), the result follows immediately from the
preceding proposition (cf. Corollary 5.3.3) . •

To sum up, we have that V(n)* ~ LP' (0,) for 1 'S p < 00. The
spaces V(n) are separable for 1 'S p < 00 and reflexive for 1 < p < 00.
The space LOO(n) is neither separable nor reflexive.
We conclude by proving an important inequality.
Theorem 6.3.3 (young's Inequality) Let 1 < p < 00. Let f E
L1(]RN) and let g E LP(]RN). Then the map

x r-4 r~N f(y)g(x - y) dy

is well defined almost everywhere in ]RN. The function thus defined is


denoted f * g and is called the convolution of f and g. Further, f * g E
LP(]RN) and we also have
(6.3.5)
Proof: Let h E V* (]RN) , where p* is the conjugate exponent of p. The
function (x, y) r-4 f(y)g(x - y)h(x) is measurable on]RN x ]RN; consider
the iterated integral

I = r r
~ j ~f
If(y)g(x - y)h(x)1 dy dx.
178 6 LP Spaces

Since the Lebesgue measure is translation invariant, we get that

I In~ : (In~~ Ig(x - y)h(x)1


If(y)1 dX) dy
< Ilgllpllhllp* JIRN If(y)1 dy
Ilflh Ilgllpllhllp* < 00.

Thus by Fubini's theorem, the integral

ilR
r f(y)g(x - y)h(x) dy
N

exists for almost all x E ]RN. Let us choose h E y* (]RN) such that
h(x) i- 0 for all x E ]RN. For example, we can choose h(x) = exp( -lxI 2 ).
Thus, it follows that the integral

ilR
r f(y)g(x - y) dy
N

exists for almost all x E ]RN and so the convolution f * g is well defined.
Further, by the above computation it follows that the map

h r--+ r h(x)(f * g)(x) dx


ilR N

is a continuous linear functional on y* (]RN) whose norm is bounded


byllfllIilgllp. It follows from the Riesz representation theorem that
f * g E Y(]RN) and that (6.3.5) holds . •

Remark 6.3.2 By a simple change of variable it is easy to see that we


can also write the convolution of f and g as

(f * g)(x) = r f(x - y)g(y) dy.


ilR N

Remark 6.3.3 The result of Theorem 6.3.3 is valid for the case p = 1
as well. The proof of this fact is left as an exercise (cf. Exercise 6.16) .

6.4 The Spaces W1,P(a, b)

In this section, we will study a very special case of a class of function


spaces called Sobolev spaces.
6.4 The Spaces W 1 ,P(a, b) 179

Throughout this section, we assume that (a, b) is a finite interval in


lR and that 1 ::; p < 00. We will denote by V(a, b) the space of infinitely
differentiable functions with compact support contained in the interval
(a, b). Recall that (cf. Remark 6.3.1) V(a, b) is dense in V(a, b) for
1 ::; p < 00.

Lemma 6.4.1 Let f E V(a, b). Assume that there exists g E V(a, b)
such that, for all rp E V( a, b), we have

(6.4.1)

Then such a g is unique.

Proof: If there were two functions 91 and 92 satisfying (6.4.1) for a


given f, then
lb (91 - 92)rp dx = 0

for all rp E V( a, b). Since 91 - 92 is locally integrable, it now follows that


91(X) = 92(X) almost everywhere (cf. Proposition 6.3.3) . •

Definition 6.4.1 Let (a, b) c lR be a finite interval and let 1 ::; p < 00.
The Sobolev space W 1,P( a, b) is 9iven by

W 1 ,P(a, b) = {f E LP(a, b) I there exists g E V(a, b) satisfyin9 (6.4.1)}.

Further, we define
1
Ilfl11,p = (1Ifll~ + Il ll~)p •
It is a routine verification to see that 11.111,p defines a norm on W 1 ,P(a, b)
and this is left to the reader. Thus, W 1 ,P(a, b) is a normed linear space.

Example 6.4.1 Let f E e 1 [a,bJ. Clearly f E LP(a, b). If f' denotes the
derivative of f, then f' E era, bJ and so f' E V(a, b) as well. Further,
if <p E V(a, b), then since <p(a) = <p(b) = 0, we have, by integration by
parts,

lb frp' dx = -l b
!,rp dx.

Thus, f E W 1,P(a, b) and it satisfies (6.4.1) with 9 = f' .•


180 6 V Spaces

By analogy with the preceding example, if f E W1,P(a, b), and if


g is the associated function as in (6.4.1), then we denote g by f'. In
particular, we have
1
Ilflh,p = (1Ifll~ + IIf' II~) p.
In the literature, f' is known as the generalised or distributional derivative
off.

Proposition 6.4.1 Let 1 :S p < 00 and let (a, b) c IRn be a finite


interval. Then, W1,P(a, b) is a Banach space.

Proof: We just need to prove the completeness. Let {fn } be a Cauchy


sequence in W1,P(a,b). Then {fn} and {f~ are both Cauchy sequences
in V(a, b). Let fn - t f and f~ - t 9 in V(a, b). Now, if r.p E V(a, b), we
have

for all n. Passing to the limit as n - t 00, we deduce that the pair (f, g)
satisfies (6.4.1). Thus f E W1,P(a, b) and f' = g. Further, it follows that
fn - t fin W1,P(a, b). This completes the proof. •

Proposition 6.4.2 The space W1,P( a, b) is reflexive if 1 <p< 00 and


separable if 1 :S p < 00.

Proof: Since the space V(a, b) is reflexive if 1 < p < 00, so is the space
(V(a, b))2 (why?). Similarly, (V(a, b))2 is separable if 1 :S p < 00.
Now, the space W1,P(a, b) is isometric to a subspace of (LP(a, b))2 via
the mapping f f---t (f, f'). Since W1,P(a, b) is complete, the image is a
closed subspace of (LP (a, b))2 and so it inherits the reflexivity and sep-
arability properties from that space. This completes the proof. •

We will now study some finer properties of these Sobolev spaces.

Lemma 6.4.2 Let r.p E V(a, b). Then, there exists 7jJ E V(a, b) such
that 7jJ' = r.p if, and only if,

lab r.p(t) dt O.
6.4 The Spaces W1,P(a, b) 181

Proof: Assume that cp = 'Ij;' for some 'Ij; E V(a, b). Then, since 'Ij;(a) =
'Ij;(b) = 0, it follows that

1b cp(t) dt = 1b 'Ij;'(t) dt = 'Ij;(b) - 'Ij;(a) = O.

Conversely, let J:
cp(t) dt = O. Let the support of cp be contained in
[c, dJ c (a, b). Now, define

'Ij;(t) = 1t cp(s) ds.

Clearly 'Ij; is infinitely differentiable since 'Ij;' = cpo Further, 'Ij; vanishes
on the interval (a, c) and, by hypothesis, on the mterval (d, b) as well.
Thus the support of'lj; is also contained in [c, dJ and so 'Ij; E V( a, b). This
completes the proof. •

Corollary 6.4.1 Let f E £P(a, b) where 1 ::; p < 00. Assume that

for all cp E V( a, b). Then f is equal to a constant almost everywhere in


(a, b).

Proof: Choose CPo E V(a, b) such that


be an arbitrary element. Set
J: CPo(t) dt = 1. Let cp E V(a, b)

Then J: ¢>(t) dt = 0 and so ¢> = 'Ij;' for some 'Ij; E V(a, b). Thus,
J: f ¢> dt = 0 which yields

Setting c = J: fcpo dt, we get

1b (J - c) cp dt 0
182 6 LP Spaces

for all 'P E V(a,b), whence, by Proposition 6.3.3, it follows that f(x) = c
almost everywhere in (a, b). This completes the proof. •

Remark 6.4.1 If 'PI were another function in V(a, b) such that J: 'PI (t)dt
= 1, then since J: ('Po - 'PI) dt = 0, it follows that 'Po - 'PI = 1/J' for some
1/J
E V( a, b). Therefore, by hypothesis, J: f ('Po - 'PI) dt = O. Thus, the
constant c defined in the above proof does not depend on the choice of
the function 'Po whose integral is unity. •

Let us denote by Coo [a, bJ the space of all functions which are in-
finitely differentiable in the open interval (a, b) and such that the func-
tions and all their derivatives possess continuous extensions to [a, bJ.
Proposition 6.4.3 Let 1 :s: p < 00. Then COO [a, b] is dense in WI,P(a, b).
Proof: It is clear that if f E Coo [a, b], then f E WI,P( a, b) and its
distributional derivative is just its classical derivative. Now, let f E
WI,P(a, b). Since f' E V(a, b), choose 'Pn E V(a, b) such that 'Pn --t f' in

l
V(a, b). Define
x
1/Jn(x) = 'Pn(t) dt.

Then 1/Jn E COO [a, b]. Further, for x E [a, b],

l1/Jn(x) -7Pm(x)1 :s: lb l'Pn(t) - 'Pm(t)1 dt < (b - a) pl. II'Pn - 'Pmllp

by HOlder's inequality. Thus,

It then follows that {1/Jn} is Cauchy in V( a, b) (since {'Pn}is Cauchy)


and so let 1/Jn --t h in V (a, b). Since 1 ~ = 'Pn, it is now easy to verify
that h E WI,P(a, b) and that h' = f'. By the preceding corollary, it fol-
lows that f - h is equal to a constant, say c. Thus, if we set Xn = 7Pn + c,
then Xn E COO [a, b], Xn --t f in LP(a, b) and X~ --t f' in V(a, b). This
completes the proof. •

We now briefly digress to recall some facts about absolutely contin-


uous functions.

Definition 6.4.2 A function f : [a, b] --t IR is said to be absolutely


continuous on [a, b] if, for every E > 0, there exists 8 > 0 such that
6.4 The Spaces Wl,P( a, b) 183

whenever we have a finite collection of disjoint intervals {(Xi, xD }f=l


contained in (a, b) satisfying
n
I)x~ - Xi) < 8,
i=l

we have
n
L If(xD - f(Xi)1 < E• •
i=l
Clearly, any absolutely continuous function is uniformly continuous.
It can also be shown (cf. Royden [5]) that an absolutely continuous
function is differentiable almost everywhere and that its derivative is an
integrable function. The following two results are very important (cf.
Royden [5]).

Theorem 6.4.1 A function f : [a, b] ~ lR can be expressed as an in-


definite integral of an integrable function if, and only if, it is absolutely
continuous. In this case we have

f(x) = f(a) + l x
f'(t) dt . •

Theorem 6.4.2 (Integration by parts) Let f and 9 be absolutely


continuous functions on [a, b]. Then

lb f(t)g'(t) dt = f(b)g(b) - f(a)g(a) -lb f'(t)g(t) dt . •

If f E el[a, b], then it is absolutely continuous. In particular, if


cp E D( a, b), it is absolutely continuous. Consequently, by virtue of
intergation by parts, it follows that if f is absolutely continuous on [a, b]
and if cp E D(a, b), then

so that the distributional derivative of f is f'.


Proposition 6.4.4 Let 1 :S p < 00. Let f E W1,P(a, b). Then f is ab-
solutely continuous, i. e. f is equal, almost everywhere, to an absolutely
continuous function.
184- 6 LP Spaces

Proof: Let us define

1'(X) = l x
f'(t) dt.

Since f' E LP (a, b) and since p 2: 1, it follows that f' is integrable on (a, b)
and so u is ail absolutely continuous function. Further, since integration
by parts is valid for absolutely continunus functions, it follows that for
all cp E V(a, b), we have

lb Ucp' dx = _1" f'cp dx .

Thus U E W1,P(a, b) and u' = f'. Then, as bebre, it follows that f - u


is equal to a constant almost everywhere. Tl1Us f(x) = u(x) + c almost
everywhere in x and the latter function is absolutely continuous . •

The above proposition states that W1,P( a, b) consists of abiJolutely


continuous functions (upto equality almost everywhere). In particu-
lar, we can say that W1,P(a, b) is contained in C[a, b], i.e. every ele-
ment of W1 ,P(a, b) is represented by means of an (absolutely) continuous
function. Such a representative must be unique, for, if two continuous
functions are equal almost everywhere, then they are equal everywhere
(why?).
Theorem 6.4.3 (Sobolev's Theorem) The inclusion map from W1 ,P(a, b)
into C[a, b] is continuous.
Proof: Let fn, f be in W1,P(a, b) with absolutely continuous representa-
tives fn, f· Assume that fn -----+ fin W1,P(a, b). Then Ilfn - flip -----+ 0 and
Ilf~ - f'llp -----+ O. Now, by absolute continuity, we have

(6.4.2)

and
f(x) = f(a) + l x
f'(t) dt. (6.4.3)

We claim that {fn(a)} is Cauchy. If not, there exists E > 0 such


that, for every N, there exist m, n 2: N satisfying Ifm(a) - fn(a)1 2: E.
Then, it follows from (6.4.2) that
6.4 The Spaces W1,P(a, b) 185

by an application of Holder's inequality. Choose N large enough such


that, for all n, m ~ N, we have
1 E
Ilf:n - f~llp( - a)p' < 2'
Then, for all x E (a, b) we have

> ~
2
whence it would follow that

whch contradicts the fact the {fn } is cauchy in LP(a, b).


Thus {fn(a)} is Cauchy and now, for any x E [a, b],

by another application of (6.4.2) and Holder's inequality. This shows


that {In} is uniformly Cauchy and so it converges to a continuous func-
tion f on [a, b]. But since Ilfn - flip ---t 0, it follows that (cf. Corollary
6.1.1), at least for a subsequence, we have fnk(X) ---t f(x) almost every-
where, from which we deduce that f == 1.
Thus, fn ---t f in C[a, b] which
completes the proof. •

Theorem 6.4.4 (Rellich's Theorem) The unit ball in W1,P(a, b) is


relatively compact in LP (a, b) .
Proof: The inclusion map W1,P(a, b) c V(a, b) is the composition of
the following inclusion maps:

W1,P(a, b) c C[a, b] c LP(a, b).


The first inclusion above is continuous by the preceding theorem. The
space C[a, b] is a subspace of LOO(a, b) and the 'sup-norm' is the same
as 11.1100' Now it follows that the second inclusion is also continuous by
Proposition 6.1.3.
Let B be the unit ball in W1,P(a, b). Thus, if fEB, then

Ilfll~ + 11f 11~ ::; 1.

Then, by the preceding theorem, it follows that B is bounded in C[a, b]


as well, since the inclusion map is continuous (cf. Proposition 2.3.1 (iv)).
186 6 V Spaces

Further, let x, y E [a, bj. Assume, without loss of generality, that x ::; y.
Then

It now follows immediately that B is equicontinuous as well since for


E > 0, if we choose 6 < EP·, then Ix-yl < 6 implies that If(x)- f(y)1 < E
for all fEB. Thus, by the theorem of Ascoli, it follows that B is
relatively compact in C[a, bj.
Thus, any sequence in B will have a subsequence which is convergent
in era, b], which will also, a fortiori, converge in LP(a, b). This proves
that B is relatively compact in V(a, b) . •

Definition 6.4.3 Let (a, b) C R be a finite interval and let 1 ::; p < 00.
The closure of V(a, b) in W1,P(a, b) is denoted ~ P( , b) . •

Theorem 6.4.5 Let f E W1,P(a, b) with f absolutely continuous. Then


f E ~ P( , b) if, and only if, f(a) = f(b) = O.

Proof: Let f E ~ P( , b). Then there exists a sequence {'Pn} in V(a, b)


such that 'Pn -+ f in W1,P(a, b). Then 'Pn -+ f uniformly on [a, bj and so
it follows immediately that f(a) = f(b) = O.
Conversely, let f(a) = f(b) = O. Then (cf. Theorem 6.4.1) we have

f(x) = l x
f'(t) dt

and so it follows that J: f'(t) dt = O. Let 'Pn E V(a, b) such that


II'Pn - f'llp -+ O. Then

lib 'Pn dt - ib l' dtl < II'Pn - f'llp(b - a) pl. -+ 0

and so
ib 'Pn dt -+ O.

Let 'Po E V( a, b) such that J: 'Po dt = 1. Then if

7/Jn = 'Pn - (i b
'Pn dt) 'Po,
6.4 The Spaces W1,P(a, b) 187

we also have that II'lf!n - f'llp - t 0 and J:


'If!n dt = O. Thus 'If!n = X~
where Xn E V(a, b) as well (cf. Lemma 6.4.2). Since

it follows that Xn converges to f uniformly and so IIXn - flip - t 0 as


well. Thus Xn E V(a, b) and Xn - t f in W1,P(a, b). This proves that
f E ~ P( , b) . •

Notice that if f == 1, then f' == 0 so that, in W1,P(a, b), the map


f I--t 1If'IIp does not define a norm, but only a semi-norm, i.e. while
1If'IIp = 0 does not imply that f = 0, all other properties of a norm are
satisfied. However, in the space ~ P( , b), we have the following result.

Theorem 6.4.6 (poincare's Inequality) Let f E ~ P( , ). Then

Ilfllp ::; (b - a)llf'llp· (6.4.4)

Thus the function f I--t 1If'IIp defines a norm on ~ P( , b) equivalent to


the usual norm on this space.
Proof: Let f be absolutely continuous and represent f. Iff E ~ P( , b),
then, since f(a) = 0, we have

f(x) = l x
f'(t) dt.

Then, by Holder's inequality, we have


1
If(x)1 ::; 11f'llp(b - a)p*.
Thus,
IIfllp ::; 11f llp( )~+pl = (b-a)llf'llp
which proves (6.4.4).
In particular, if 1If'IIp = 0, it follows that Ilfllp = 0 and so f = 0 in
~ P( , b). The other properties of a norm are easily verified. Thus we
have two norms on ~ P( , b):
def
Ilfl11,p and Ifl1,p Ilf lip·
I

Clearly,
1
Ifh,p < Ilflh,p < [(b - a)P + 1] p IfI1,p'
188 6 IJ> Spaces

Thus the two norms are equivalent. •

Sobolev spaces can also be defined when p = 00. The definition can
also be extended to cover functions defined on arbitrary open sets 0 C
jRN. It is also possible to define 'higher order distributional derivatives'
and define Sobolev spaces Wm,P(O), mEN, based on these derivatives.
All these spaces have properties similar to those proved in this section,
with or without additional hypotheses. For a detailed study of Sobolev
spaces, see Kesavan [3]. See also the Exercises 6.24-6.26 below.

6.5 Exercises

6.1 Let (X, S, JL) be a measure space. let 1 :S p, q, r < 00 such that

1 1 1
- + -q
p
= -.
r

6.2 Let (X, S, JL) be a measure space and let 1 :S p < 00. Define, for
t > 0,

Show that
Ilfll~ = p fooo tp-1hf(t) dt.
(Hint: Write hf as an integral over a subset of X and apply Fubini's
theorem (cf. Theorem 1.3.5).)

6.3 (a) Let (X, S, JL) be a measure space. Let In, 9n'!, 9 be measurable
functions such that In -. I and 9n -. 9 almost everywhere in X. Assume
further that IIn(x)1 :S 9n(X) for all x E X and that

Ix 9n dJL -. Ix 9 dJL < 00

as n -. 00. Show that


6.5 Exercises 189

as n ----t 00. (Hint: Apply Fatou's lemma (cf. Theorem 1.3.2) to


gn + fn 2: 0 and to gn - fn 2: 0.)
(b) Let 1 ~ p < 00. Let fn and f E LP(I1,) and assume that fn(x) ----t f(x)
almost everywhere in X. Show that fn ----t f in LP(f.L) if, and only if,
Ilfn lip ----t Ilfll p·
6.4 Let (X, S, f.L) be a measure space and let 1 ~ p < 00. Let fn ----t f
in LP(f.L). Let gn be a sequence of measurable functions converging to a
measurable function 9 almost everywhere in X. Assume further that gn
and 9 are all uniformly bounded by a constant M > 0 in X. Show that
fng n ----t fg in LP(f.L).

6.5 Let (X, S, f.L) be a measure space. A sequence of measurable func-


tions fn is said to converge in measure in X to a measurable function f
if, for every c > 0,

lim f.L({lfn - fl2: c}) = O.


n---too

In this case, we write fn £. f. If 1 ~ P < 00 and if fn ----t f in LP(f.L) ,


show that fn £. f·

6.6 Let (X, S, f.L) be a measure space and let 1 < p < 00. Let f :
X x X ----t JR be such that for almost every y EX, the section fY (cf.
Definition 1.3.7) is p-integrable. Define, for x E X,

g(x) = Ix f(x, y) df.L(Y)·

Show that g E LP (f.L) and that

6.7 Let 9 E Cc(JR). Define c.p(g) = g(O). Then c.p can be extended to a
continuous linear functional on L 00 (JR). Show that there does not exist
f E Ll(JR) such that

c.p(g) = l gf dx

for all 9 E Loo(JR). (This gives another proof that Loo(JR) is not reflexive.)
190 6 LP Spaces

6.8 Let h E IR.N. For a (Lebesgue) measurable function f defined on


IR. N , define its translation by h by

fh(X) = f(x + h).

as h -t 0 in IR.N for any 1 ~ p < 00.

6.9 Let
fn = X[n,rz+11'
the characteristic function of the closed interval [n, n + 1] for n E N (cf.
Definition 1.3.5). Then {fn} is a bounded sequence in L1(1R.). Show that
it does not have a weakly convergent subsequence. (In view of Theorem
5.4.2, this p;ives another proof that Ll (IR.) is not reflexive.)

6.10 Let f E C[O, 1] be such that, for all n 2: 0,

11 xnf(x) dx = 0.

Show that f : ;: : 0.

6.11 (Hardy's inequality) Let f E LP(O, 00), where 1 < p < 00. Define

g(x) = - 11
x 0
X
f(t) dt

for x E (0,00 r Show that g E LP(O, 00) and that


p
Ilgllp ::; p _ Illfllp"
(Hint: Prove it first for f E Cc(O,oo),f 2: 0.)

6.12 A function c.p : (0,00) -t IR. is said to be a step function if


n
c.p(x) = I ~jXlj (x)
j=1
6.5 Exercises 191

where I j , 1 ::; j ::; n are intervals contained in (0,00) and, as usual, XE


denotes the characteristic function of a set E. Show that step functions
in (0,00) are dense in L1(0, 00).

6.13 (Riemann-Lebesgue lemma) Let h be a bounded and measurable


function on (0,00) such that

lim -
c---.oo C
1i 0
c
h(t) dt = 0.

(a) Let f = X[c,d) , where [c,d] C (0,00). Show that

lim roo f(t)h(wt) dt = 0. (6.5.5)


w---.oo io
(b) Deduce that (6.5.5) is true for all f E L1(0, 00).
(c) Iff E L1(a,b) where (a, b) C (0,00), show that

lim
n---.oo
lb
a
f(t) cos nt dt = lim
n---.oo
lb
a
f(t) sin nt dt = 0.

6.14 (a) Let (a,b) C (0,00) be any finite interval. Let fn(t) = cosnt
and let gn(t) = sinnt. Show that fn ----' 0 and gn ----' 0 in £P(a, b) for any
1 ::; p < 00.
(b) What is the weak limit of hn in £P(a, b) for 1 ::; p < 00 where
hn(t) = cos 2 nt?

6.15 (a) Consider the trigonometric series


00

~o + L(an cos nt + bn sin nt).


n=1
Show that it can written in the amplitude-phase form

Write down the relations between an, bn and dn , ¢n.


(b) (Cantor-Lebesgue theorem) Show that if a trigonometric series as in

° °
(a) above converges over a set E whose measure is strictly positive, then
an - ? and bn - ? as n - ? 00. (Hint: Use the amplitude-phase form of
the series.)
192 6 LP Spaces

6.16 If f and g E L1(]RN), show that

f*g = r
J{N
f(y)g(x-y)dy

is well defined for almost all x E ]RN. Show also that f * g E L1 (]RN) and
that
Ilf * gill : : : Ilfllt Ilglh·

°
6.17 Let {Pc}c>o be a family of Coo functions in ]RN such that for each
E > 0, we have that Pc(x) 2 for all x E ]RN, the support of Pc is
contained in the closed ball with centre at the origin and radius E, and

r
JlR N
Pc(x) dx = l.

(a) Let 'P E Cc(]RN). Show that Pc * 'P converges uniformly to 'P on ]RN
as E -> 0.
(b) Deduce that, if U E V(]RN), then Pc * u converges to u in V(]RN)
as E -> 0.

6.18 Let {fn} be a bounded sequence in LP(a, b), where (a, b) is an open
interval in ]R and 1 ::::: p ::::: 00. Show that fn ~ f in V(a, b) when
1 ::::: p < 00 and fn ~ f in LOO(a, b) if, and only if, for every 'P E V(a, b),

lb lb
we have
fn'P dx -> f'P dx.

6.19 Let f : [O,IJ -> ]R be a continuous function such that f(O) = f(I).
Define the sequence Un} as follows. Let fn(x) = f(nx) on [0, ~ and
extend this periodically to each subinterval k~l, ~ for 2 ::::: k ::::: n. Let
m = Jo1 f(t) dt. Show that fn ~ fin V(O, 1) for 1 ::::: p < 00 and that
fn ~ fin LOO(O, 1), where f(t) = m for all t E [0,1].

6.20 Let (a, b) c ]R be a finite interval and let f : [a, b] be a


°
-> ]R
Lipschitz continuous function i. e. there exists K > such that for all
x, y E [a, b], we have

If(x) - f(y)1 ::::: Klx - YI·


Show that f E W 1,P(a, b) for alII::::: p < 00.
6.5 Exercises 193

6.21 Let a < c < b in JR. Let f : [a, b] -+ JR be continuous. Assume that
f E W1,P(a, c) and that f E W1,P(b, c). Show that f E W1,P(a, b).

6.22 Let f: [-1,1] -+ JR be given by

f(x) = {O, if x E [-1,0)


1, ifxE[O,l]

Show that f tf. W1,P( -1,1) for 1 :::; p < 00. (Thus, continuity is essential
in the previous exercise.)

6.23 (a) Let f : [a, b] -+ JR be absolutely continuous and assume that

lb f(t) dt = 0.

Let 1 :::; p < 00. Show that


1
If(x)1 :::; 2(b - a)p* Ilf'llp
for all x E [a, b].
(b) (Poincare- Wininger Inequality) Deduce that, for all f E W1,P(a, b)
such that J: f(t) dt = 0, we have

6.24 (a) Define W1,P(JR) exactly as in Definition 6.4.1, with IR replacing


the interval (a, b) in that definition as well as in relation (6.4.1). Let ( be
a Coo function on JR with compact support. Show that, if f E W1,P(JR),
then (f E W1,P(JR) as well, where (f)(x) = «(x)f(x).

°: :;
(b) Let ( be a Coo function on JR with compact support contained in
[-2,2] such that «(x) :::; 1 for all x E JR and such that ( == 1 on
[-1,1]. Define (m(x) = «(x/m) for all x E R Show that if u E W 1,P(IR)
for 1 :::; p < 00, then (mu -+ U in W1,P(JR) as m -+ 00.

6.25 Let (a, b) c JR be a finite open interval. Let m > 1 be a positive


integer. Define, for 1 :::; m < 00,

there exist gi E LP(a, b), 1:::; i :::; m }


{ f E LP(a, b) I rb di(O such that r b •
Ja fdtr dx = (_l)t Ja gi'P dx
for all 'P E D(a, b)
194 6 £P Spaces

The functions gi are called the generalized successive derivatives of f


and we denote f( i) = gi. Define

for f E Wm,P(a, b).


(a)Show that II.llm,p defines a norm on Wm,P(a, b) which makes it into a
Banach space which is separable if 1 :S p < 00 and reflexive if 1 < p < 00.
(b) Show that iff E Wm,P(a,b), then f E Cm-1[a,b].

6.26 Let ~ P( , b) denote the closure of D(a, b) in Wm,P(a, b).


(a) Show that f E Wm,P(a, b) belongs to ~ P( , b) if, and only if, f(a) =
f(b) = 0 and f(i)(a) = f(i)(b) = 0 for all 1 :S i :S m - l.
(b) Show that
f t-> Iflm,p ~f Ilf(m) lip
defines a norm on ~ P( , b) which is equivalent to the usual norm
11·llm,p.
Chapter 7

Hilbert Spaces

7.1 Basic Properties

Hilbert spaces form a special class of Banach spaces with the geometric
notion of orthogonality of vectors, or more generally, the notion of an
angle between vectors, built into them.
Consider the space JR2. If x = (Xl, X2) and Y = (YI, Y2) are vectors
in JR2, then we define the scalar product of these vectors by

X.y = XIYI + X2Y2 = Ixl .IYI cos ()


where Ixi = IIx1I2' lyl = lIyII2 and () is the angle between the two vectors.
The scalar product is linear in each of the two variables. It is symmetric
in these variables and x.x = IIxll~. It turns out that these properties are
crucial and we generalize these to other vector spaces.

Definition 7.1.1 Let V be a real normed linear space. An inner prod-


uct on V is a form (., .) : V x V -+ JR such that
(i) it is symmetric, i.e. for all X and Y E V,
(x,y) = (y,x)j

(ii) it is bilinear: in particular, if x , Y and z E V and if a and {3 E JR,


then
(ax + (3y, z) = a(x, z) + (3(y, z);
(iii) for all x E V,

Remark 7.1.1 The linearity with respect to the second variable is,
clearly, a consequence of conditions (i) and (ii) above . •
196 7 Hilbert Spaces

Remark 7.1.2 In case the base field is e, then the inner product is a
sesquilinear form. If x and Y E V, we have

(y, x) = (x, y).

Thus, we have conjugate linearity with respect to the second variable,


z.e. if x, Y and z E V and if a and /3 E e, then

(x, ay + /3z) = a(x, y) + !3(x, z) . •


In view of condition (iii), we say that the norm comes from the inner
product.

Definition 7.1.2 A Hilbert space is a Banach space whose norm


comes from an inner product. •

Example 7.1.1 Consider the space ]R.n. For x = (Xl,··· ,xn ) and y =
(YI,··· ,Yn), we define
n
(x,y) = LXiYi.
i=l

This defines an inner product and the norm associated to it is the norm
11.112. Thus f~ is a Hilbert space. In the case of en, the inner product is
given by
n
(x, y) L XiYi·
i=l

Again the norm is 11.112 . •

Example 7.1.2 Consider the space £2. For x and Y E £2, define
ex)
(X,y) = LXiYi
i=l

where x = (Xi) and Y = (Yi) are real sequences. Again, if the base field
is e, then we define
00

(x,y) = LXiYi.
i=l

This makes £2 into a Hilbert space. •


7.1 Basic Properties 197

As we have seen in the previous chapter, these are particular cases


of the Lebesgue spaces L2.

Example 7.1.3 Let (X, S, J-l) be a measure space. If f and g E L 2 (J-l),


and if f and g represent these classes respectively, then

(f, g) = L fg dJ-l

defines an inner product which makes L2 (J-l) as a Hilbert space. •

Example 7.1.4 Let (a, b) C JR. be a finite interval. We denote by


Hl(a, b) the space W 1 ,2(a, b) and by H6(a, b) the space ~,2( , b). Then
both these spaces are Hilbert spaces with the inner product given by

(f,g) = l bUg +1'l) dx.

By virtue of the Poincare inequality (cf. Theorem 6.4.6), the space


HJ (a, b) is also a Hilbert space with the inner product
(f, gh = lb l' g' dx . •

Let H be a Hilbert space and let x and y E H. Then

Ilx + Yl12 = (x + y, x + y) = IIxl12 + 2(x, y) + IIyl12 (7.1.1)

in the real case; if the field is <C then the middle term on the right will
be replaced by 2Re(x, y), where Re z denotes the real part of a complex
number z. Writing a similar expression for Ilx - Yl12 and adding the two,
we get

(7.1.2)

This is known as the parallelogram identity. In case of JR.2 = £§, this


is the familiar result from plane geometry which relates the sum of the
squares of the lengths of the diagonals of a parallellogram to that of the
sides. It is also known as Apollonius' theorem.

Remark 7.1.3 A theorem of Frechet, Jordan and von Neumann states


that a Banach space whose norm satisfies the parallelogram identity
198 7 Hilbert Spaces

(7.1.2) is a Hilbert space, i.e. the norm comes from an inner product .

Example 7.1.5 The space C[-l, 1] cannot be made into a Hilbert space.
To see this, consider the functions

u(x) = min{x,O}, and v(x) = x

defined on [-1,1]. Then Ilull = Ilvll = 1 while we have

and the parallelogram identity is not satisfied by this pair of functions .



Proposition 7.1.1 Every Hilbert space is uniformly convex and hence
, is reflexive.

Proof: The proof is of the uniform convexity follows from the par-
allelogram identity (7.1.2) exactly as described in Example 5.5.2; the
reflexivity now follows from Theorem 5.5.1. •

We now prove a fundamental ineqaulity for Hilbert spaces.


Theorem 7.1.1 (Cauchy-Schwarz Inequality) Let H be a Hilbert
space and let x and y E H. Then

I(x, y)1 :S Ilxll Ilyll· (7.1.3)

Equality occurs in this inequality if, and only if, x and yare scalar
multiples of each other.

Proof: Let () be a complex number such that I()I 1 and ()(x, y)


I(x, Y)I. Let t E R We have

°< II()x - tyl12


IIxl1 2 - 2tRe(()x, y) + t 211Yl12
IIxl1 2 - 2tl(x, y)1 + t 211Y112.

Since we have a quadratic polynomial which is always of constant sign,


the roots of this polynomial must be complex. Thus, we deduce that
7.1 Basic Properties 199

which yields (7.1.3).


Equality occurs in (7.1.3) if, and only if, the polynomial has two
coincident roots. Thus, there exists to such that ex = toY, or, in other
words x = ay where a = e-1to .•

Corollary 7.1.1 Let H be a Hilbert space. Let y E H. Define

fy(x) = (x, y)

for all x E H. Then fy E H* and Ilfyll = lIyll·


Proof: Clearly fy is a linear functional. By the Cauchy-Schwarz in-
equality, we have
Ify(x)1 ::; Ilxll·llyll
which shows that fy E H* and that Ilfyll ::; Ilyll· If y =f. 0, then set
x = y/llyll. Then fy(x) = Ilyll, which shows that Ilfyll = Ilyll· •

Remark 7.1.4 We will see in the next section that all continuous linear
functionals on a Hilbert space occur in this manner. •

Corollary 7.1.2 Let H be a Hilbert space and let Xn ----'- x and Yn ----+ Y
in H. Then

Proof: Observe that


I(xn, Yn) - (x, y)1 ::; I(x n , Yn - y)1 + I(xn - x, y)1
::; Ilxnll·IIYn - yll + Ify(xn - x)1
by the Cauchy-Schwarz inequality and the preceding corollary. Now,
since any weakly converging sequence is bounded and since Yn ----+ y, the
first term on the right-hand side tends to zero. The second term also
tends to zero by virtue of the preceding corollary, since Xn ----'- x in H . •

Remark 7.1.5 Since norm convergence implies weak convergence, it


follows a fortiori that if Xn ----+ x and Yn ----+ Y in H, then (xn, Yn) ----+
(x, y) .•
Theorem 7.1.2 Let H be a Hilbert space and let K c H be a closed
and convex subset of H. Then, for every x E H, there exists a unique
element PK(X) E K such that

= min Ilx - YII· (7.1.4)


yEK
200 7 Hilbert Spaces

Further, if H is a real Hilbert space, then PK(x) E K is characterized


by the following relations:
(7.1.5)
for every y E K.
Proof: Since H is uniformly convex, the existence and uniqueness of
PK(X) has been proved in Theorem 5.6.1. Let y E K. For any 0 < t < 1,
set z = (1 - t)PK(X) + ty which belongs to K by convexity. Then, by
virtue of (7.1.4),

Squaring both sides, we get

Ilx-PK(X)11 2 ::; Ilx-PK(X)11 2 -2t(x-PK(X), y-PK(X))+t 2 1Iy-PK(X)11 2 .


Cancelling the common term viz. Ilx - PK (x)11 2 , dividing throughout by
t and letting t - t 0, we get (7.1.5).
Conversely, if PK(X) E K is an element satisfying (7.1.5), then, for
any y E K, we have
Ilx - PK(x)11 2 II(x - y) + (y - PK(x)11 2
Ilx - Yl12 + 2(x - y, y - PK(X)) + Ily - PK(x)11 2
Ilx - Yl12 + 2(x - PK(X),y - PK(X)) -Ily - PK(x)11 2
< Ilx - Y112.
Thus PK(X) also satisfies (7.1.4) . •

Remark 7.1.6 If H is a complex Hilbert space, then (x - PK(X), y -


PK(X)) is replaced by its real part in (7.1.5) . •

Remark 7.1.7 The element PK(X), which is closest to x in K, is called


the projection of x onto K. In general, the mapping x f--t PK(X) is not
linear. In]R2, the condition (7.1.5) means that, for all y E K, the lines
joining x to PK(X) and y to PK(X) will always make an obtuse angle . •

We now study some properties of the mapping PK : H -t K.


Prgposition 7.1.2 Let H be a Hilbert space and let K be a closed and
convex subset of H. Let PK : H - t K be as defined by the preceding
theorem. Then, for all x and y E H, we have
7.1 Basic Properties 201

Proof: Assume, for simplicity, that H is a real Hilbert space. By virtue


of (7.1.5), we have

and

Adding these two inequalities, we get

Thus,

and the result now follows from the Cauchy-Schwarz inequality being
applied to the term on the right-hand side . •

Corollary 7.1.3 Let M be a closed subspace of a Hilbert space H. Then


the projection PM is a continuous linear mapping. Further, for x E H,
the element PM (x) E M is characterized by

(PM(X), y) = (x, y) (7.1.6)

for every y E M.

Proof: If (7.1.6) holds, then (7.1.5) holds trivially. Conversely, if


PM(X) E M is the projection of x onto M, then PM(X) satisfies (7.1.5).
Let y E M. Set z = y + PM(X) E M, since M is a subspace. Then
(7.1.5) yields

for all y EM. Since we also have -y EM, we get the reverse inequality
as well and this proves (7.1.6). It now follows from (7.1.61 that PM is
a linear map and it is continuous by the preceding proposition. This
completes the proof. •

Remark 7.1.8 If M is a closed subspace of a Hilbert space H, the


vector X-PM(X) is orthogonal to every vector in M. Thus, PM is called
the orthogonal projection of H onto M . •

Theorem 7.1.3 Let H be a Hilbert space and let M be a closed sub-


space. then M is complemented in H.
202 7 Hilbert Spaces

Proof: Set

~ = {y E H I (x, y) = 0 for all x EM}.

It is immediate to check that ~ is a subspace. It is also closed. For,


let {Yn} be a sequence in ~ and let Yn ---t yin H. If x E M is arbi-
trary, then since (x,Yn) = 0 for all n, we get that (x,y) = 0 as well and
so Y E ~ which establishes our claim. If x E H, then PM(X) E M
and x - PM (x) E M ~ by the preceding corollary. Thus H = M + M ~ .
Further, if x E M n ~, we then have that IIxl1 2 = (x, x) = 0 and so
M n ~ = {O}. Thus H = M EEl ~ and the proof is complete . •

Remark 7.1.9 The subspace ~ consisting of all vectors orthogonal


to all elements of M is called the orthogonal complement of M. The
notation is not accidental. We will see in the next section that (at least
in the case of real Hilbert spaces) the orthogonal complement can be
identified with the annihilator of M . •

7.2 The Dual of a Hilbert Space

Earlier, we saw that every vector in a Hilbert space gave rise to a con-
tinuous linear functional. The main result of this section is to show that
all continuous linear functionals arise in this way.

Theorem 7.2.1 (Riesz Representation Theorem) Let H be a Hilbert


space. Let 'P E H*. Then, there exists a unique vector Y E H such that

'P(x) = (x, y) (7.2.1)

for all x E H. Further, II'PII = Ilyll·


Proof: We saw that (cf. Corollary 7.1.1), given y E H, the functional
fy defined by
fy(x) = (x, y)
is in H* and that Ilfyll = Ilyli. Thus, the mapping q, : H ---t H* defined
by q,(y) = fy is an isometry of H into H* and so its image is closed in
H*. If we show that the image is dense in H*, then it will follow that
H* = q,(H), or, in other words, that q, is onto and this will complete
the proof.
Consider a linear functional 'P on H* which vanishes on q,(H). Since
every Hilbert space is uniformly convex and hence, reflexive, this means
7.2 Dual of a Hilbert Space 203

that there exists x E H such that fy(x) = 0 for all y E H. This implies
that (x, y) = 0 for all y E H. In particular,

IIxl12 = (x, x) = 0
which shows that x = 0, i.e. 'P is zero. This shows that iI>(H) is dense
in H* and the proof is complete. •

Remark 7.2.1 It is also possible to directly prove this theorem without


using the reflexivity of H. This will be outlined in the exercises at the
end of this chapter. •

Remark 7.2.2 Let H be a Hilbert space. Then every element of the


dual, H*, can be represented as fx, where x E H. We can then define
an inner product on H* by

It is easy to see that this defines an inner product which gives rise to
the usual norm on H*. Thus, H* also becomes a Hilbert space in its
own right. In the same way, H** also becomes a Hilbert space. Now,
we have two natural mappings from H into H**. The first is the usual
canonical imbedding x f-+ J(x). The second is the mapping x f-+ iJx'
i. e. the composition of the Riesz map H ---> H* and that of H* ---> H**.
We will show that these are the same. The latter map, by the Riesz
representation theorem, is onto and so J will be onto, giving another
proof of the reflexivity of a Hilbert space, provided we prove the Riesz
representation theorem independently, as suggested in Remark 7.2.1. To
see that the maps are the same, observe that if f = fy E H*, then

iJx (f) = (f, fx)* (x,y)


J(x)(f) = f(x) (x, y).
This establishes the claim. •

Remark 7.2.3 As a consequence of the Riesz representation theorem,


the map y f-+ fy is an isometry of H onto H*. It is linear if H is real
and conjugate linear if H is complex. Thus, at least in the real case, we
can identify a Hilbert space with its own dual via the Riesz isometry. •

Remark 7.2.4 In the case of real Hilbert spaces, while we can identify
a Hilbert space with its dual, we have to be careful in doing so and
204 7 Hilbert Spaces

we cannot do it to every space under consideration at a time. A typical


example of such a situation is the following. Let V and H be real Hilbert
spaces. Let V cHand let V be dense in H. Let us assume further that
there exists a constant C > 0 such that

for every v E V.
Let us now identify H* with H via the Riesz representation theorem.
Let f E H. Then the map v 1----+ (v, f) H defines a continuous linear
functional on V since

for all v E V. Let us denote this linear functional by T(f). Thus


T E £(H, V*) and

IIT(f)ILC(H,v*) :S CllfllH.
If T(f) = 0, then (v, f) = 0 for all v E V and so, by density, we have
f = 0. Thus T is one-one as well. Finally, we claim that the image of T

° °
is dense in V*. Indeed, if'P E V** vanishes on T(H), then, by reflexivity,
there exists v E V such that T(f)(v) = for all f E H i.e. (v, f) =
for all f E H. Since V c H, it follows that (v, v) = 0, i.e. v = 0, which
means that 'P is idertically zero, which establishes the claim.
Thus we have the following scheme:

V c H ~ H* c V*
where both the inclusions are dense. It would now be clearly absurd
for us to identify V with V* as well. Thus we cannot simultaneously
identify V and H with their respective duals. The space H in this case
is called the pivot space and is identified with its dual, whereas the other
spaces, though they are also Hilbert spaces, will not be identified with
their respective duals. This situation typically arises when we have a
parametrized family of Hilbert spaces as in the case of the Sobolev spaces
(cf. Kesavan [3]). In particular, we can set V = HJ(a, b) (cf. Example
7.1.4) and H = L2(a, b). We identify L2(a, b) with its dual while we
denote the dual of HJ(a,b) by H- 1 (a,b) and we have the inclusions
7.2 Dual of a Hilbert Space 205

Let H be a Hilbert space and let A E £(H). For a fixed y E H,


the map x f--t (A(x), y) clearly defines a continuous linear functional on

H and so, by the Riesz representation theorem, this functional can be


written as the inner product of x with a vector (which depends on y).
This leads us to the following definition.

Definition 7.2.1 Let H be a Hilbert space and let A E £(H). We


define the adjoint of A as the mapping A * : H --+ H given by

(x, A*(y)) = (A(x), y) (7.2.2)

for all x and y E H . •

Remark 7.2.5 In the case of real Hilbert spaces, since Hand H* can
be identified via the Riesz isometry, the map A * is just the adjoint in
the sense of Definition 4.7.2 . •

The following proposition lists the propoerties of the adjoint map.

Proposition 7.2.1 Let H be a Hilbert space. Let Ai, i = 1,2 and A be


continuous linear operators on H. Let 0: be a scalar. Then
(i) IIAII = IIA*II;
(ii) IIA* All = IIAI12;
(iii) A** = A;
(iv) (AI + A2)* = At + A 2;
(v) (AIA2)* = A2Ai;
(vi) (o:A) * = a-A * (to be interpreted as o:A * in the real case).

Proof: By the Cauchy-Schwarz inequality, we have for any x E H,

Ilxll = sup I(x, y)l·


IIyl19

It then follows that if A E £(H), then

IIAII = sup IIA(x)11 = sup sup I(A(x), Y)I·


IIxll9 IIxl1911yl19

Then, using (7.2.2) and the Cauchy-Schwarz inequality, we get

II All = sup sup I(A(x), y)1 sup sup l(x,A*(y))1 :::; IIA*II·
IIxl1911yll9 IIxl1911yl19
206 7 Hilbert Spaces

Similarly,

IIA*II = sup sup I(x, A*(y))1 = sup sup I(A(x), y)1 < IIAII·
iiyii9iixli9 iiyii9iixii9
This proves (i). Again, if x and y E H, then

I(A*A(x),y)1 = I(A(x),A(y))1 :S IIA11 211xll·llyll

from which, we deduce that

On the other hand,

IIA(x)112 = (A(x), A(x)) = (A* A(x), x) :s IIA* AII.llx11 2

by the Cauchy-Schwarz inequality and we deduce that

This proves (ii). The other relations follow trivially from (7.2.2) . •

Remark 7.2.6 A Banach algebra, B, is said to be a *-algebra if there


exists a mapping x f-+ x* from B into itself satisfying the properties
analogous to (iii) - (vi) of the above proposition. Such a mapping is said
to be an involution. If, in addition, properties (i) and (ii) are also true,
it is said to be a B* -algebra. Thus, if H is a Hilbert space, the £(H) is a
B* -algebra with the involution being given by the adjoint mapping . •
Definition 7.2.2 Let H be a Hilbert space and let A E £(H). A is said
to be self-adjoint if A* = A. It is said to be normal if AA* = A* A. It
is said to be unitary if AA* = A* A = I, where I is the identity operator
onH. •
Example 7.2.1 Any orthogonal projection in a Hilbert space is self-
adjoint. If P : H - t M is the orthogonal projection of a Hilbert space
H onto a closed subspace M, then, for any x and y E H, we have

(P*(x), y) = (x, P(y)) = (P(x), P(y)) = (P(x), y)

by repeated application of Corollary 7.1.3. Since x and yare arbitrary


elements of H, it follows that P = P* . •
7.3 Application: Variational Inequalities 207

Example 7.2.2 In £~, the operator defined by a hermetian matrix is


self-adjoint, that defined by a normal matrix is normal and that defined
by a unitary matrix (orthogonal matrix, if the base field is JR.) is unitary
(cf. Definition 1.1.14) . •

Remark 7.2.7 If A : D(A) c H -+ H is a densely defined linear


transformation in a Hilbert space H, it is easy to see how to define the
adjoint A* : D(A*) c H -+ H. Again, we have for u E D(A) and
v E D(A*),
(A(u),v) = (u,A*(v)).

All the results of Section 4.7, in particular, Proposition 4.7.3 and The-
orem 4.7.1, are true . •

7.3 Application: Variational Inequalities

Let H be a real Hilbert space and let a(., .) : H x H -+ JR. be a continuous


bilinear form (cf. Example 4.7.4). Let M > 0 such that

la(x, y)1 ::; Mllxll·IIYII (7.3.1)

for all x and y E H. Assume further that a(.,.) is H-elliptic (or, coercive;
cf. Exercise 5.16). Let 0: > 0 such that

a(x, x) ~ 0:1Ixl12 (7.3.2)

for all x E H.

Example 7.3.1 The inner product of a real Hilbert space is a sym-


metric, continuous and coercive bilinear form. Conversely, if a(.,.) is a
symmetric, continuous and coercive bilinear form, then

(x, Y)a ~f a(x, y)

defines a new inner product on H. The associated norm is

Ilxll a = via(x, x).


Thanks to the continuity and coercivity of the bilinear form, we have

v/allxll ::; Ilxll a ::; VMllxll·


208 7 Hilbert Spaces

Thus the two norms on H are equivalent. •

Example 7.3.2 Let H = £~. Let A be an n x n matrix. If x and


y E lRn = £!j are vectors, define

a(x, y) = y' Ax

where y' is the transpose of the column vector y. Then a(.,.) defines
a continuous bilinear form on £~. If A is symmetric, then the bilinear
form is symmetric as well. If A is positive definite, then the bilinear
form is coercive . •

Theorem 7.3.1 (Stampacchia's Theorem) Let H be a real Hilbert


space and let a(.,.) : H x H -+ lR be a continuous and coercive bilinear
form on H (satisfying (7.3.1) and (7.3.2)). Let K be a closed and convex
subset of H. Let f E H. Then, there exists a unique x E K such that,
for all y E K,
a(x,y-x) :2 (f,y-x). (7.3.3)

Proof: Let u E H be fixed. The map v f--t a( u, v) is a continuous linear


functional on H, by the continuity of the bilinear form. Thus, by the
Riesz representation theorem, there exists A( u) E H such that

(A(u), v) = a(u, v)

for all v E H. Clearly, the map u f--t A(u) is linear. Further, by (7.3.1)
and (7.3.2) we have

IIA(u)11 ::; Mllull and (A(u), u) :2 allul1 2


for all u E H. Thus A E .c(H). Now, (7.3.3) is equivalent to finding
x E K such that
(A(x),y-x) :2 (f,y-x)
for all y E K. If p > 0 is any constant (to be determined suitably), this
is equivalent to finding x E K such that

(-pA(x)+pf+x-x,y-x) < 0
for all y E K. In other words (cf. Theorem 7.1.2),

x = PK(x - pA(x) + pJ) ~ S(x).


7.3 Application: Variational Inequalities 209

Thus, we seek a fixed point of the mapping S : K ----t K. Let Xl and


X2 E K. Then, by Proposition 7.1.2, we have

Squaring both sides, we get

IIS(xd - S(x2)11 2 :'S Ilxl - x211 2 - 2p(XI - X2, A(Xl) - A(X2))


+p21IA(xt) - A(X2)11 2

using (7.3.1) and (7.3.2). Now, choosing p such that


2a
o < p < Af2'

we have 1 - 2pa + p2 Af2 < 1 so that S : K ----t K is a contraction. Since


K is closed, by the contraction mapping theorem (cf. Theorem 2.4.1)
we deduce that there exists a unique fixed point X E K for S which
completes the proof. •

Corollary 7.3.1 (Lax-Milgram Lemma) Let H be a Hilbert space


and let a(., .) : H x H ----t lR be a continuous and coercive bilinear form.
Let f E H. Then, there exists a unique x E H such that

a(x, y) = (f, y)
for every y E H.
Proof: Applying the preceding theorem with K = H, there exists a
unique x E H satisfying (7.3.3). Replacing y by y + x, we get

a(x,y) ~ (f,y)
for every y E H. Since -y E H as well, we also get the reverse inequal-
ity. Hence the result . •

Remark 7.3.1 The Lax-Milgram lemma was already proved in Exercise


5.16. If, in addition a(.,.) is symmetric, then the preceding results have
been proved via Exercise 5.17. In that case, the solution x has a varia-
tional characterization, viz. x E K is the minimizer of the functional
1
J(y) = 2a(y, y) - (f, y)
210 7 Hilbert Spaces

over K. For this reason, (7.3.3) is called a variational inequality. In the


terminology of the calculus of variations, (7.3.3) is the equivalent of the
Euler-Lagrange condition for the minimization of a functional. In the
case of unconstrained minimization i. e. K = H, this becomes an equa-
tion instead of an inequality, as seen in the Lax-Milgram Lemma, and
corresponds to the vanishing of the 'first variation' of J (cf. Kesavan [4]).
Indeed, it is easy to see that J is Frechet differentiable (cf. Exercise
2.38) and that
J'(x)(y) = a(x,y) - (f,y)
for any x and y E H. Thus (7.3.3) and the Lax-Milgram lemma are just
the results of Exercise 2.44 when a(.,.) is symmetric.
The Lax-Milgram lemma forms the basis of a wide class of numeri-
cal methods, known as finite element methods, to solve boundary value
problems for elliptic partial differential equations (cf. Kesavan [3]) . •

Remark 7.3.2 In the symmetric case, as explained in Example 7.3.1,


a(.,.) defines a new inner product whose norm is equivalent to the usual
norm. Thus the dual space remains the same and so the Lax-Milgram
lemma is just a restatement of the Riesz representation theorem. •

7.4 Orthonormal Sets

As mentioned earlier, orthogonality is a very important notion special to


Hilbert spaces. In this section, we will take a closer look at this property.

Definition 7.4.1 Let H be a Hilbert space and let I be an indexing set.


A subset S = {Ui E H liE I} is said to be orthonormal if

Iluill = 1 for all iEI

and

Remark 7.4.1 If we use the Kronecker symbol, viz. 8ij which equals
unity if i = j and equals zero if i ¥: j, then the above relations can be
written as

for all i and j E I. •


1.4 Orthonormal Sets 211

Remark 7.4.2 An orthonormal set of vectors is automatically linearly


independent. For, if we have a linear relation of the form
n
LakUik = 0,
k=l
then, taking the inner product with Uij and using the orthonormality of
the vectors, we get aj = 0 for any 1 :S j :S n . •

Example 7.4.1 The sequence {en} in £2 (cf. Example 2.3.12) forms an


orthonormal set. Similarly, the standard basis in £2' (cf. Example 1.1.2)
forms an orthonormal set . •

Example 7.4.2 Consider the interval X = [0,1] endowed with the


Lebesgue measure. The corresponding space L 2(/1) is denoted L2(0, 1)
(cf. Section 6.3). The sequence {fn } where fn is the equivalence class
represented by the function f n ( t) = V2 sin mrt, forms an orthonormal
set . •
Proposition 7.4.1 (Gram-Schmidt Orthogonalizationj Let H be
a Hilbert space and let {Xl, ... , xn} be a set of linearly independent vec-
tors in H. Then there exists an orthonormal set of vectors {el' ... ,en}
in H such that, for each 1 :S i :S n, the vector ei is a linear combination
of the vectors Xl, ... , Xi.
Proof: Clearly, none of the Xi can be the null vector. Define
1
el = ~XI

Next, consider the vector X2 - (X2' el)el. This vector cannot vanish
since Xl and X2 are linearly independent and el is a scalar multiple of
Xl. Thus, we can define

It is now immediate to check that IleI II = IIe211 = 1 and that (el' e2) = O.
Further, e2 is a linear combination of Xl and X2, since el is just a scalar
multiple of Xl.
We can now proceed inductively. Assume that we have constructed
the vectors el, ... , ek, for 1 :S k :S n - 1. We then define

ek+1 = II
xk+l
_ 2: i=l
k\ .) .11 [Xk+l - I)Xk+l,ei)eij.
xk+l,e l el i=l
212 7 Hilbert Spaces

It is now easy to verify that the set {el,··· , en} verifies the conditions
mentioned in the statement of the proposition. •

In the exercises at the end of this chapter, we will see important ex-
amples of the Gram-Schmidt orthogonalization process leading to vari-
ous well known special functions of mathematical physics.

Remark 7.4.3 Consider the space IR n = £2. For any 1 :'S j :'S n, the
sets {Xl, ... , Xj} and {el' ... , ej} span the same subspace. Thus, we can
write
j

Xj = L rijei·
i=l

Let A be the matrix whose columns are the Xj, and Q the matrix whose
columns are the ej. Let R be the matrix whose entries are the rij. For
any j, we have that rij = 0 if i > j. Thus, R is an upper triangular
matrix. Further, we see that

A =QR.

Since the columns of A are linearly independent, the matrix A is in-


vertible. Since the columns of Q are orthonormal, the matrix Q is
orthogonal. Thus, the Gram-Schmidt orthogonalization process proves
the following result from matrix theory: every invertible matrix can be
decomposed into the product oj an orthogonal matrix and an upper tri-
angular matrix. •

Remark 7.4.4 The process of producing orthonormal vectors from lin-


early independent ones is quite useful in several contexts. For instance,
let us consider a continuous function on the interval [0, 1]. We wish to
approximate it by a polynomial. Amongst several ways of doing this,
one is the least squares apP""Oximation. We look for a polynomial p of
degree at most n such that

l
roIIJ(t) - p(t)12 dt = min r IJ(t) - q(tW dt
in qEPn io
where P n is the space of all polynomials (in one variable) of degree less
than or equal to n. In other words, we are looking for the projection of
J onto the subspace of polynomials of degree less than , or equal to, n
7.4 Orthonormal Sets 213

in the space L2(0, 1). We know that such a P exists uniquely and that
it is characterized by (cf. Corollary 7.1.3)

(p, q) = (f, q)

for all q E Pn . By linearity, it is sufficient to check the above relation


for just the basis elements of Pn . The standard basis of Pn consists of
the functions Po, PI,'" , Pn where

Po(t) == 1 and Pk(t) tk

for t E [0, 1] and for all 1 ::; k ::; n. Writing

p(t) = ao+alt+a2t2+ ... +antn,

we then derive the following linear system:

Ax = f

where, A = (aij) is the (n + 1) x (n + 1) matrix given by

a··
tJ
= (p.J, p.)
t
= 1e+
0
1 . .
J dt =
1
---'
i + j + 1'

x is the (n + 1) x 1 column vector whose components are the unknown


coefficients of P, ao, a1,'" , an; f is the (n + 1) x 1 column vector whose
i-th component is

Solving this linear system yields p. However, especially when n is large,


the matrix A is known to be very difficult to invert numerically; it is an
example of what is known as a highly ill-conditioned matrix, i.e. even
small errors in the data can lead to very large errors in the solution of
any linear system involving this matrix.
On the other hand, if we replace the standard basis by a basis con-
sisting of orthonormal polynomials qo, q1,' .. ,qn, then we can write
214 7 Hilbert Spaces

Now,
n n
(p,qd = L (Jj(qj, qi) L {Jjbji {Ji
j=O j=O
and so
(Ji = fal f(t)qi(t) dt.
Thus, without solving any linear system, we can directly compute t~
least squares approximation. •

Example 7.4.3 Let us compute some of the elements of the orthonormal


set obtained from the standard basis of the space of polynomials of
degree at most n in L2( -1,1). Recall that Pi(t) = t i for 0 ~ i ~ n.
1
([1 dt)
1
IIPol12 = 2 = J2.
Thus qo (t) 1/ v'2 for all t E [-1, 1]. Now consider the function

ql(t) = t - ~ ([11 t dt) qo(t) = t.

Then IIqIil2 = v'2/v'3. Thus,


J3t
-
v'2
for all t E [-1,1]. Next, we consider

q2(t) = t 2 - v'3
v'2
(1 1
-1
t 3 dt) v'3 t __1_
v'2 v'2
(1 1
-1
t 2 dt) ~
v'2
= t2 _ ~
3
Then, a straight forward computation yields that IIq2112 = 2v'2/3J5.
Thus, for all t E [-1,1], we get

Similarly, we can show that

and so on.
An easier way of computing these polynomials will be seen in the
exercises at the end of this chapter. •
7.4 Orthonormal Sets 215

Proposition 7.4.2 Let {el,···, en} be a finite orthonormal set in a


Hilbert space H. Then, for any x E H,
n
L l(x,eiW ~ Ilx112. (7.4.1)
i=l
Further, x - L~=l (x, ei)ei is orthogonal to ej for all 1 ~ j ~ n.

Proof: We know that Ilx - L~=l (x, ei)eil1 2 2: O. Expanding this, and
using the orthonormality of the set, we get (7.4.1) immediately. Further,

This completes the proof. •

Proposition 7.4.3 Let H be a Hilbert space. Let I be an indexing set


and let {ei liE I} be an orthonormal set in H. Let x E H. Define

S = {i E I I (x, ed # O}. (7.4.2)

Then, S is atmost countable.

Proof: Define

By (7.4.1), it follows that Sn has at most n -1 elements for any positive


integer n. Since
S = U~=l n
it follows that S is at most countable. •

The preceding proposition helps us to define (infinite) sums over


arbitrary orthonormal sets. Let {ei liE I} be an orthonormal set in H
for an indexing set I. Let x E H. We wish to define the sum

L l(x,ei)1 2.
iEI

Let S be the set defined by (7.4.2). If S = 0, we define the above sum


to be zero. If it is a finite set, then the above sum is just the finite sum
of the corresponding non-zero terms. If it is countably infinite, then we
216 7 Hilbert Spaces

choose a numbering el, e2,··· ,en,··· for the elements in the orthonormal
set whose inner product with x is non-zero. Then we define the above
sum to be
00

The sum is independent of the numbering chosen since this is a series


of positive terms and so any rearrangement thereof will yield the same
sum.
We are now in a position to generalize (7.4.1).
Theorem 7.4.1 (Bessel's Inequality) Let H be a Hilbert space and
let {ei liE I} be an orthonormal set in H, for some indexing set I.
Let x E H. Then
LI(x, eiW :'S Ilx11 2 . (7.4.3)
iEI

Proof: Let S be defined by (7.4.2). If S is empty, there is nothing to


prove. If it is finite, the result is the same as (7.4.1), which has already
been proved. If S is count ably infinite, then, since (7.4.1) establishes
the result for all partial sums, (7.4.3) follows . •

Let {ei liE I} be an orthonormal set in a Hilbert space H. Given a


vector x E H, we now try to give a meaning to the sum

L(X, ei)ei
iEI

as a vector in H. Once again, let S be the set defined by (7.4.2). If it is


empty, we define the above sum to be the null vector. If it is finite, then
we define it to be the (finite) sum of the corresponding terms. Let us,
therefore, assume now that S is a count ably infinite set. Let us number
the elements E = {ei liE S} as {el' e2, ... ,en,· .. }. Define
n
Yn L(x, edei.
i=l

If m > n, then
111

IIYm - Ynl1 =
2 L I(x, ei)1 2
i=n+l

using the orthonormality of the set. But the sum on the right-hand side
can be made arbitrarily small for large nand m since it is part of the tail
7.4 Orthonormal Sets 217

of a convergent series (cf. (7.4.3)). Thus, the sequence {Yn} is Cauchy


and hence converges to a limit, say, Y in H.
Assume now that the elements of the set E above are rearranged so
that
E = {e'l , e'2 , ... 'n'
e' ... }
where each ~ is equal to a unique ej. Once again, we define
n
y~ = I)x, ~.
i=l

As before {y~ is Cauchy and will converge to an element y' E H.


We claim that y = y' so that, whatever the manner in which we
number the elements of E, we get the same limit vector, which we will
unambiguously define as the required infinite vector sum.
Let E > O. Choose N sufficiently large such that, for all n 2:: N, we
have

L
00

llYn - yll < E, Ily~ - y'll < E and I(x, edl 2 < E2.
i=N+I

Fix n 2:: N . Then we can find m 2:: N such that

{el,"',e n } C { ~, , ~ .
Then, the difference y~ - Yn will consist of a finite number of terms of
the form (x, ei)ei where all the i concerned are greater than n(2:: N).
Hence, it follows that

L
00

Ily~ - Ynl1 2 < I(x, ei)1 2 < E2.


i=N+I

Thus,

Ily - y'll :S Ily - Ynll + llYn - y~11 + Ily~ - y'll < 3E


which proves that y = y' since E > 0 can be chosen arbitrarily small.
To sum up, we choose an arbitrary numbering of E and write E =
e ... } and define
{ e1 , e2 , . . . ,n,

n
lim ~(x, j) j.
n +oo~
j=l

The following result is now an immediate consequence of this defini-


tion and of Proposition 7.4.2.
218 7 Hilbert Spaces

Proposition 7.4.4 Let H be a Hilbert space and let {ei liE I} be an


orthonormal set in H. let x E H. Then

x - I:(x, ei)ei
iEI

is orthogonal to every ej, j E I . •

Definition 7.4.2 An orthonormal set in a Hilbert space is said to be


complete if it is maximal with respect to the partial ordering on or-
thonormal sets induced by set inclusion. A complete orthonormal set is
also called an orthonormal basis . •

Proposition 7.4.5 Every Hilbert space admits an orthonormal basis.

Proof: Given any chain (with respect to the partial ordering induced
by set inclusion on orthonormal sets), the union of its members gives
an upper bound. Hence, by Zorn's lemma, there exists a maximal or-
thonormal set. •

Theorem 7.4.2 Let H be a Hilbert space and let {ei liE I} be an


orthonormal set in H. The following are equivalent:
(i) The orthonormal set is complete.
(ii) If x E H is such that (x, ed = 0 for all i E I, then x = o.
(iii) If x E H, then
x = I:(x, edei. (7.4.4)
iEI

(iv) If x E H, then
IIxl12 = I: I(x, edl 2 (7.4.5)
iEI

(This is known as Parseval's identity.)

Proof: (i) :::::} (ii). Assume that the orthonormal set is complete and
that (x, ed = 0 for all i E I. If x -# 0, then the set

is also an orthonormal set strictly larger than the given set which con-
tradicts the maximality of the given set.
7.4 Orthonormal Sets 219

(ii) =} (iii). We know that x - LiEI(x, edei is orthogonal to every


ej, j E I. This immediately gives (7.4.4).

(iii) =} (iv). Set Yn = L~=l(x, i) i where {el,e2,··· ,en···} is a num-


bering of the elements of the set E explained earlier when defining the
sum LiEI(x, ei)ei. Then a straight forward computation yields
n
IIYnl1 2 .= L I(x, eiW·
i=l

This immediately yields (7.4.5) on passing to the limit as n -+ 00.

(iv) =} (i). If the given set were not complete, then there exists e E H
such that Ilell = 1 and (e, ei) = 0 for every i E I. But then, this will
contradict (7.4.5) (applied to the vector e) . •
Corollary 1.4.1 Let H be a Hilbert space and let {ei liE I} be an
orthonormal set. It is complete if, and only if, the subspace of all (finite)
linear combinations of the ei is dense in H.
Proof: If the orthonormal set is complete, then by the preceding theo-
rem, every element of H is the limit of finite linear combinations of the
ei by (7.4.4) and so the subspace spanned by the ei is dense in H.
Conversely, if the subspace spanned by the ei is dense in H, then if
x E H is such that it is orthogonal to all the elements of this subspace,
then x = o. In particular, if (x, ei) = 0 for all i E I, then, clearly, x
is orthogonal to the subspace spanned by the ei and so it must vanish.
Thus, statement (ii) of the preceding theorem is satisfied and so the
orthonormal set is complete. •
Corollary 1.4.2 Let H be a Hilbert space and let {el' e2, ... , en ... } be
a sequence in H which is also an orthonormal basis for H. Then en ----'" o.
Proof: Let x E H. Then, by (7.4.5), it follows that (x, en) -+ 0 as
n -+ 00. Then, by the Riesz representation theorem, it follows that
en ----'" 0 . •

Remark 1.4.5 Notice that if {en} is an orthonormal sequence which is


also complete in a Hilbert space H, then it weakly converges to the null
vector while it does not have a norm convergent subsequence, since

for all n i= m . •
220 7 Hilbert Spaces

Theorem 7.4.3 A Hilbert space has a countable orthonormal basis if,


and only if, it is separable.
Proof: For simplicity, assume that the space is a real Hilbert space. If
the space has a countable orthonormal basis {en}, then the set of all
finite linear combinations of the {en} is dense in H, by (7.4.4). The
set of all finite linear combinations of the {en} with rational coefficients
then forms a countable dense subset.
Conversely, assume that the space is separable. Let {xn} be a count-
able dense subset. Consider the balls Bn = B (xn; J2 / 4). If {ei liE I}
is an orthonormal set, then, since I ei - ej II = J2 for i =I- j, it follows that
each ball can contain at most one element from the orthonormal basis.
But the set {xn} being dense, each ball B(ei; J2/4) must contain some
X n , or, in other words, each ei must lie in one of the balls Bn. Thus, the
orthonormal set can be at most countable. Thus an orthonormal basis
can also only be at most countable. •

Example 7.4.4 By virtue of (7.4.5), the orthonormal sets in £2 and £2


described in Example 7.4.1 are orthonormal bases of those spaces . •

Example 7.4.5 (Fourier series) Consider the space L2( -7r, 7r). The
set

where

for t E (-7r, 7r), forms an orthonormal set. By Theorem 6.3.1, the space
of continuous functions with compact support contained in (-7r, 7r) is
dense in L2( -7r, 7r). Such functions vanish at the end points of the in-
terval [-7r,7r] and so they are 27r-periodic on the interval [-7r,7r]. Con-
sider the space spanned by the orthonormal set mentioned above. By
an application of the Stone-Weierstrass theorem (cf. Rudin [6]), it fol-
lows that this space is dense in the space of all 27r-periodic continuous
functions with respect to the sup norm, which is nothing but the norm
11.1100. Since the interval (-7r,7r) has finite measure, this implies that
this space is dense with respect to the norm 11.112 as well (cf. Proposition
6.1.3). This shows that the space spanned by this orthonormal set is
dense in L2( -7r, 7r) as well and so, by Corollary 7.4.1, it follows that it
is a complete orthonormal set in L2( -7r, 7r).
7.4 Orthonormal Sets 221

Thus if f E L2( -Jr, Jr), we have that

f(t) 7r f(t)_l dt)


= (J-7r _1 + ,",00_ (J7r-7r f(t)cosnt dtJ ~
"j2; "j2; L.m-1 fo y7r
+ ,",00_
un-1
(J-7r7r f(t) sin nt
fo
dt) sin nt
fo

by virtue of (7.4.4). This can be rewritten as


00

f(t) = ~o + l:)a n cos nt + bn sin nt)


n=l
where
1j7r
ao = -; -7r f(t) dt
and
1j7r 1j7r
an = -; -7r f(t) cos nt dt, bn = -; -7r f(t) sin nt dt.

This is nothing but the classical Fourier series of a function f and the
°
an, n 2 and bn , n 2 1 are the usual Fourier coefficients of f. The
above series expansion means that the partial sums of the Fourier series
converge to f in the 11.112 norm. In other words if
N
fN(t) ~o + l:)a n cos nt + bn sin nt),
n=l
for t E (-Jr, Jr), then

as N --t 00. The analogue of the Parseval identity (7.4.5) reads as


follows:

Example 7.4.6 (Fourier sine series) Consider the space L2(0, Jr).
Consider the set

. This is an orthonormal set in L2(0, Jr) as one can easily verify. Let
f E L2(0, Jr) be orthogonal to every element of this set. Extend f as an
222 7 Hilbert Spaces

odd function to all the interval (-7r, 7r). Thus, we set f(x) = - f( -x) if

I: I:
x E [-7r,0). Since f is an odd function, it follows that

f(t) dt = f(t) cos nt dt = °


for all n EN. Since we also have that

j-7r f(t) sin nt dt


7r = 2 {7r f(t) sin nt =
Jo
°
for all n E N, it follows that f = 0 in L2( -7r, 7r) and so f = 0 in L2(0, 7r)
as well. Thus, by Theorem 7.4.2 (ii), it follows that the given set is
complete in L2(0,7r). In particular, if f E L 2(0,7r), we can write the
series expansion

L b sin nt
00

f(t) n
n=l

where
bn = -217r f(t) sin nt dt.
7r 0
This is called the Fourier sine series of the function f. •

By analogy, if H is a separable Hilbert space with an orthonormal


basis {en I n E N} and if x E H, we call
00

x = L(x, en)en
n=l

as its Fourier expansion and the coefficients (x, en) are called its Fourier
coefficients.

Remark 7.4.6 Let V be a Banach space and let {en} be a sequence of


vectors in V such that every vector x E V can be written as

Then {en} is called a Schauder basis for V. Thus, in every separable


Hilbert space, an orthonormal basis forms a Schauder basis. The set
{en} in fp (cf. Example 3.1.1) is a Schauder basis for fp for 1 ~ p < 00.
7.5 Exercises 223

In the literature, a usual basis of the vector space, i. e. a set of


linearly independent elements such that every vector is a finite linear
combination of vectors from the set, is called a Hamel basis. Notice
that, by Baire's theorem (cf. Exercise 4.1), a Banach space cannot have
a countable Hamel basis, while it may have a (countable) Schauder ba-
sis . •

In the next chapter we will see how orthonormal bases occur very
naturally in Hilbert spaces.

7.5 Exercises

7.1 Let V be a real Banach space and assume that the parallelogram
identity holds in V. Define

Show that this defines an inner product which induces the given norm
and hence that V is a Hilbert space.

7.2 Let V be a complex Banach space and assume that the parallelogram
identity holds in V. Define

Show that this defines an inner product which induces the given norm
and hence that V is a Hilbert space.

7.3 Let H be a Hilbert space and let M be a closed subspace of H. Let


P : H ---+ M be the orthogonal projection of H onto M. Show that
IIPII = 1.
7.4 Let H = £~. Let J be the n x n matrix all of whose entries are equal
to lin. Show that

III 112,n = III - J 112,n 1

where I is the n x n identity matrix.


224 7 Hilbert Spaces

7.5(a) Let H be a Hilbert space and let cp be a non-zero continuous linear


functional on H. Let M = Ker(cp). Show that M has co dimension one.
(b) Let gEM..L be a unit vector such that any y E H can be written as

y = >..g + z

where z E Nl. Define x = cp(g)g. Show that x is such that

cp(y) = (y, x)

for all y E H. (This gives a direct proof of the Riesz representation


theorem.)

7.6 Let H be a Hilbert space and let U : H --t H be a unitary operator.


Show that U is an isometry, i.e. IIUxl1 = Ilxll for all x E H.

7.7 Let H be a real Hilbert space and let a(.,.) : H x H --t lR be a


continuous and H-elliptic bilinear form (cf. Section 7.3) with constants
M > 0 (for continuity) and 0: > 0 (for ellipticity). Let f E H.
(a) Let W c H be a closed subspace. Show that there exists a unique
w E W such that
a(w, v) = (f, v) (7.5.1)
for all v E W.
(b) Show that, if w E W is as above, then

Ilwll ::; ~llfll.


0:

(c) Let u E H be the unique vector such that

a(u,v) = (f,v)

for all v E H. Show that

Ilu - wll ::; M inf Ilu -


0: vEW
vii.
(d) Let H be separable and let { n ~=l be an orthonormal basis for
H. Let Wn = span{ul,··· ,un}. Let Wn be the solution of (7.5.1) when
W = W n . Show that Wn --t U as n --t 00.
7. 5 Exercises 225

7.8 Consider the space L2(0, 1). Define ro(t) == 1 and


2n
rn(t) = "'(-1)i-1 X [i_ 1 .i..j(t)
~ 2n '2 n
i=l

where XE denotes the characteristic function of a set E.


(a) Show tha.t

°
where sgn(t) equals 1 when t 2: and equals -1 when t < 0.
(b) Show that { n(t) ~o is orthonormal in L2(0, 1) but that it is not
complete.

7.9 Let (a, b) C lR be a finite interval and let {cPn }nEN be an orthonormal
basis for L2(a, b). Define

for (t,s) E (a,b) x (a,b). Show that {<Pij}(i,j)ENxN forms an orthonormal


basis for L2((a, b) x (a, b)).

7.10 Show that the sets

is a complete orthonormal set in L2(0, 7r). (Thus, a function in L2(0, 7r)


can be expanded as a series of cosines and this is called its Fourier cosine
series.)

7.11 Consider L2 (-1, 1) and the linearly independent set of functions


Pn where Pn(t) = tn. Applying the Gram-Schmidt orthogonalization
procedure, we obtain an orthonormal sequence {qn} of polynomials (cf.
Example 7.4.3).
(a) Define

These are the Legendre polynomials. Show that Pn(t) consists only of
even powers of t when n is even, and of only odd powers of t, when n is
226 7 Hilbert Spaces

odd.
(b) Show that Po(t) == 1, PI(t) = t and that, for n;::: 1,

(n + l)Pn+l (t) = (2n + l)tPn(t) - nPn-l(t)

for t E [-1,1]' given that Pn (1) = 1 for all n ;::: O.


(This gives a simple recursive formula to compute the Legendre polyno-
mials.) (c) Prove Rodrigues' Formula:

7.12 (a) Consider the space

and let H be the space of all equivalence classes (with respect to equality
almost everywhere) of functions in ii. Define the inner-product

1 /00 2f(x)g(x) dx.


(f, g) = fi -00 e- X

Show that H is a Hilbert space.


(b) Show that if fn(x) = x n , then fn belongs to H for every n E {O}UN.
(c) Apply the Gram-Schmidt process to the linearly independent set
{fn} to obtain an orthonormal set hn . Define

These are the Hermite polynomials. Compute Ho and HI.


(d) Prove Rodrigues' Formula:

2 ~ 2
Hn(x) = (-lte x _ e - x .
dxn

7.13 Let f, g E L2 (-1f, 1f) and let their Fourier series be given by

f(t) ~ + L:~=I (an cos nt + bn sin nt)


g(t) = ~ + L:~=I (en cosnt + dn sin nt).
Show that

1j7r
;: -7r f(t)g(t) dt
7.5 Exercises 227

7.14 Compute the Fourier series of the function:

f(t) = {-I -7r::; t < 0


10<t::;7r.
7.15 Compute the Fourier cosine series of the function f(t) = sin t on
[0,7r].

7.16 (a) Compute the Fourier sine series and the Fourier cosine series
of the function f(t) = ton [0,7r].
(b) Evaluate:

using Parseval's identity.

7.17 (a) Let f E L2( -7r, 7r). Let its Fourier series be given by
00

f(t) = ~o + 2:)an cos nt + bn sin nt).


n=l
Extend the function f to all oflR by periodicity, i.e. such that f(t+27r) =
f(t) for all t E R Define

F (t) = fat (i (s) - ~O) ds.

Show that F : lR --t lR is also 27r-periodic.


(b) Show that its Fourier series is given by

F(t) = ~~ + ~ (~sinnt ~ osnt).


(c) Show that the above series converges to F uniformly on R

7.18 Let f E H6( -7r, 7r). Show that if its Fourier series expansion is
given by
00
ao ~ .
f(t) = 2 + L.,...(an cos nt + bn sm nt),
n=l
then the Fourier series expansion of f' is given by

z:= (nbn cos nt -


00

f' (t) = nan sin nt).


n=l
228 7 Hilbert Spaces

7.19 Let H be a Hilbert space and let A : D(A) c H ~ H be a linear


operator. We say that it is dissipative if (A(u), u) ~ 0 for all u E D(A).
We say that it is maximal dissipative if, in addition R(I - A) = H,
where I denotes the identity operator on H. Let A be the infinitesimal
generator of a co-semigroup of contractions (cf. Exercises 4.4, 4.13 and
4.19) on H. Show that it is maximal dissipative.

7.20 Let H be a Hilbert space and let A: D(A) c H ~ H be a maximal


dissipative operator. Show that if B : D(B) c H ~ H is dissipative
and is an extension of A, i.e. D(A) C D(B) and BID(A) = A, then
D(B) = D(A). (This justifies the adjective 'maximal').

7.21 Let H be a Hilbert space and let A : D(A) C H ~ H be a maximal


dissipative operator. Show that it is closed and densely defined.

7.22 (a) Let H be a Hilbert space and let A : D(A) C H ~ H be a


dissipative operator. Let A> o. IfR(AI -A) = H, show that (AI _A)-1
exists in 'c(H) and that

(b) If A is a dissipative operator and if R(AoI - A) = H for some AO > 0,


show that R(AI - A) = H for all 0 < A < 2Ao.
(c) Deduce that if A is a maximal dissipative operator, then R(AI -A) =
H for all A > O.

Remark 7.5.1 Comparing the results of the above exercises with the
comments made in Remark 4.8.1, we deduce that an operator A: D(A) C
H ~ H will be the infinitesimal generator of a co-semigroup of contrac-
tions if, and only if, it is maximal dissipative. Unlike Banach spaces,
where the Hille-Yosida theorem involves verification of infinitely many
conditions, one for each A > 0, this is much easier to verify in Hilbert
spaces. The dissipativity is usually very easy to check. Further, it
is enough to verify that the equation (AI - A)x = f has a solution
x E D(A) for every f E H just for one fixed A > O.

7.23 Let Hi, i = 1,2,3 be Hilbert spaces with norms II· Iii, i = 1,2,3
respectively. Let T : D(T) C HI ~ H2 and S : D(S) C H2 ~ H3 be
closed and densely defined linear transformations. Assume that R(T) c
N(S). Assume further, that there exists a constant C > 0 such that,
7.5 Exercises 229

for all x E D(S) n D(T*), we have

IIT*xlli + II xll~ ~ llxll~


0) Let H2 = NiS) and let T* denote the adjoint of T : D(T) C HI -+
H 2 . Show that T* has closed range.
(b) If P : H2 -+ N(S) is the orthogonal projection, show that

T*(x) = T*(Px)

for all x E D(T*).


(c) Deduce that T has closed range.
(d) Show that R(T) = N(S).

1.24 Let II be a Hilbert space and let GL(H) be the set of all invertible
continuous linear operators on H. Then GL(H) is a group with respect
to the binary operation defined via composition of operators. Consider
the unit circle Sl C ]R2 with its usual topology inherited from ]R2. Repre-
senting a point g E Sl as (cos 0, sinO) where 0 E [0,211"), we have that Sl
is a group under the operation defined via (0 1 , ( 2 ) I----t (0 1 + (2)mod(211").
A representation of Sl is a group homomorphism 7i' : Sl -+ GL(H)
which is also continuous. For simplicity, we will denote the image of
g = (cosO, sinO) by 7i'(O).

°
(a) Show that every representation is uniformly bounded, i.e. there
exists a constant C > such that

117i'(O) II < C

for all 0 E [0,211").


(b) Define

< u, v> = -1 1211" (7i'(211" - O)(u), 7i'(211" - O)(v)) dO.


211" 0

Show that < .,. > defines a new inner product on H whose induced
norm is equivalent to the original norm on H.
(c) With respect to the inner-product < .,. > on H, show that 7i'(O)
is a unitary operator on H for every 0 E [0, 211"). (We say that every
representation of Sl is equivalent to a unitary representation.)

1.25 Let V = ef
and let {An} be a sequence of N x N matrices such
that An = ~ for each n. Assume further that, for each v E V, we have
230 7 Hilbert Spaces

that the sequence {(Anv, v)} decreases to zero as n - t 00. Show that
there exists a matrix A = A * such that (A v, v) 2: 0 for all v E V and
such that An -) A in £(V).
Chapter 8

Compact Operators

8.1 Basic Properties

In this chapter we will study a special class of linear transformations be-


tween Banach spaces which generalize several properties of linear trans-
formations between finite dimensional spaces.

Definition 8.1.1 Let V and W be Banach spaces and let T E £(V, W).
If dim(R(T)) is finite, then we say that T is of finite rank. If the image
of every bounded set in V is relatively compact in W, we say that T is
compact . •

Remark 8.1.1 For compactness, it is sufficient to verify that T(B) is


relatively compact in W when B is the (closed) unit ball in V. Equiv-
alently, T E £(V, W) is compact if, and only if, given any bounded
sequence {xn} in V, the sequence {T(xn)} admits a convergent subse-
quence in W . •

Example 8.1.1 Since any bounded set in a finite dimensional space is


relatively compact, it follows that every continuous linear transforma-
tion of finite rank is compact . •

Example 8.1.2 Let V be finite dimensional. Then any T E £(V, W),


where W is an arbitrary Banach space, is of finite rank and hence is
compact . •

Example 8.1.3 Let V be an infinite dimensional Banach space and let


I : V --t V be the identity map. Since the unit ball in V is not compact
(cf. Propositon 2.3.6), it follows that the identity map is not compact . •
232 8 Compact Operators

Example 8.1.4 Let X, Y and Z be Banach spaces and let S E C(X, Y)


and let T E C(Y, Z). If one of these is compact, it is easy to see that
their composition is also compact. In particular, since the identity map
in an infinite dimensional Banach space cannot be compact, it follows
that compact maps on infinite dimensional spaces are not invertible. •

Example 8.1.5 Let i : C1 [0, 1J ---+ C[O, 1J be the canonical inclusion map.
Let us denote the usual norm in C1 [0, 1J (cf. Exercise 2.5) by 11.111,00 and
the usual norm in C[O, 1J by 11.1100. Consider the set of all f E C1[0, 1J
such that 111111,00 ~ c. Then, clearly, Ilflloo ~ c as well. Further, if x
and yare points in [0, 1], we have, by the mean value theorem,

If(x) - f(y)1 ~ (sup If'(t)I)lx - yl ~ clx - YI·


tE[O,l]

It follows from this that the set of all functions considered here is uni-
formly bounded and equicontinuous in C[O, 1J. Thus, by Ascoli's theo-
rem, the set is relatively compact and so the inclusion map is compact . •

Example 8.1.6 Let K E C([O, 1J x [0,1]). Let

K, sup IK(x,y)l.
(x,Y)E[O, 1] x [o,lJ

Let f E C[O, 1J. Define

T(f)(x) = 10 1
K(x, y)f(y) dy.

Then, it is easy to see that T(f) is continuous and that IIT(f) 1100 ~
K,llflloo for all f E C[O,lJ and so T is a continuous linear operator on
C[O,lJ. Thus, if we look at the set of all f E C[O, 1J such that Ilflloo ~ c,
it follows that the set of all T(f) is uniformly bounded. Further, if x
and yare points in [0, 1], we have

IT(f)(x) - T(f)(y)1 < ~ IK(x, t) - K(y, t)I·lf(t)1 dt


< CSUPtE[O,l]IK(x, t) - K(y, t)l·

°
Now, since K is uniformly continuous (since [O,lJ x [O,lJ is compact),
given E > 0, there exists 8 > such that, for all t E [0,1]' we have

IK(x, t) - K(y, t)1 < E


8.1 Basic Properties 233

whenever Ix-YI < 5. This shows that the set of all T(f) where Ilflloo :S c
is equicontinuous and so T is a compact operator. •

Example 8.1.7 By virtue of Rellich's theorem (cf. Theorem 6.4.4),


it follows that the canonical inclusion map i: W 1 ,P(a,b) ----t V(a,b) is
compact, where (a, b) c lR is a finite interval and 1 :S p < 00 . •

Example 8.1.8 Let f E £2(a, b) be given. The map v r---t f01 vf dx is ob-
viously a continuous linear functional on HJ(O, 1). Since, by Poincare's

11
inequality,
(v, wh = v'w' dx

defines an inner product on this space (cf. Example 7.1.4), it follows from
the Riesz representation theorem that there exists a unique U E HJ
(a, b)

11 11
such that
u'v' dx = fv dx

for all v E HJ(O, 1). Further, setting v = U in the above relation and
using Poincare's inequality (cf. Theorem 6.4.6), we get

from which we get that

Thus, the map


f r---t U ~f Gf

is continuous. Composing this with the canonical inclusion of HJ(O, 1)


into £2(0, 1), we get that f r---t Gf = u is a compact operator on £2(0, 1) .•

Example 8.1.9 This is a generalization of the preceding example. Let


0: : (0,1) ----t lR be a function such that °
< c :S o:(x) :S C for all
x E (0,1), where c and C are fixed positive constants. Define a(.,.) :
HJ(O, 1) x HJ(O, 1) ----t lR by

a(v,w) = 11 o:(x)v'(x)w'(x) dx

for v and w in HJ(O, 1). Then, it is easy to check that a(.,.) defines a
continuous, symmetric and elliptic bilinear form on HJ
(0, 1). Thus, if
234 8 Compact Operators

f E L2(0, 1) is given, there exists a unique u E H6(O, 1) such that

a(u, v) = 10 1
fv dx

for all v E HJ(O, 1), by virtue of the Lax-Milgram lemma (cf. Corollary
7.3.1). Set u = G(f). Again, it is easy to check that

and so the mapping G : L2(0, 1) - H6(O , I) is continuous. Composing


this with the canonical inclusion of HJ(O, 1) into L2(0, I), we get that G
defines a compact operator on L2(O, 1) . •

Remark 8.1.2 In the preceding example, since u E HJ(O,I), we see


that u(O) = u(l) = 0 (cf. Theorem 6.4.5) . Assuming that u and a are
sufficiently smooth, we get, on integrating by parts, that

10 1
-(a(x)u'(x))'v(x) dx = 10 1
f(x)v(x) dx

for all v E HJ(O,I). In particular, this is true for all v represented by


Coo functions with compact support in (0,1). For this reason, we say
that
-(au')' = f
in the sense of distributions (cf. Kesavan [3]) and u is called a weak (or
generalized) solution of the boundary value problem:

fin (0, 1)
O.
The case of Example 8.1.8 corresponds to the case when a == 1 and so
the differential operator is - £.x
in the above boundary value problem.
The operator G mapping the data f to the solution u is sometimes called
the Green's operator. •

Example 8.1.10 Let a : [0,1] - lR be a continuous function which is


not identically zero. Define A : L2(O, 1) - L2(0, 1) by

A(f)(t) = a(t)f(t).
8.1 Basic Properties 235

It is easy to verify that A E £(L2(0, 1)). Let a(to) =I- 0 for some to E
(0,1). Then, there exists a compact interval J = [to - 0:, to + 0:] such
that
1
la(t)1 ~ 2Ia(to)1 > °
for all t E J. Let {fn } be an orthonormal basis for L2(J). Define

fn(t) = {In(t), if t E J
0, if t E [0, l]\J.

Then IIfnl12 = 1 for all n and, if n =I- m, we have

Thus the sequence {A(fn )} does not have a convergent subsequence and
so A is not compact. •

Proposition 8.1.1 Let V and W be Banach spaces and let K(V, W) be


the set of all compact linear operators from V into W. Then, K(V, W)
is a closed subspace of £(V, W).

Proof: It is immediate to check that a linear combination of compact


operators is compact. Thus K(V, W) is a subspace of £(V, W). We thus
have to check that if Tn E K(V, W) and if Tn -+ T in £(V, W), then
T E K(V, W). Let B be a ball in V. We need to show that its image
T(B) in W is relatively compact. Since W is a complete metric space, it
suffices to show that given c > 0, we can cover T(B) by a finite number
of balls of radius c (cf. Proposition 1.2.6). Since IITn - Til -+ 0, choose
n sufficiently large such that IITn - Til < c/2. Since Tn is compact, we
have

where I is a finite indexing set and fi E W. (Here Bw denotes a ball in


W.) We then have

Thus, T is compact . •

Corollary 8.1.1 If Tn : V -+ Ware all of finite rank and if IITn -Til -+


0, then T is compact. •
236 8 Compact Operators

Thus, a continuous linear transformation which can be approximated


by transformations of finite rank is compact.An interesting question is
the converse. Can every compact linear transformation be approximated
by those of finite rank. In general, this is not true. However, we have
the following resulti (see also Exercise 8.9).
Proposition 8.1.2 Let V be a Banach space and let W be a Hilbert
space. Let T E K(V, W). Then T is the limit of transformations of
finite rank.

°
Proof: Let B be the closed unit ball in V. Let K = T(B), which is
compact, by hypothesis. Let E > and let fi E W for i E I, a finite
indexing set, such that

K C UiEIBW(fi; E).

Let G = span{1i : i E I} which is a finite dimensional subspace of W.


Let P: W -+ G be the orthogonal projection onto G, which exists since
W is closed. Then PoT is of finite rank.
Now, let x E B. Then, there exists lio such that IIT(x) - fioll < E.
Since the norm of a projection is unity (cf. Exercise 7.3), we have

II(PoT)(x)-fioll = II(PoT)(x)-P(lio)11 < E

which implies that, for all x E B,

II(P 0 T)(x) - T(x)11 < 2E


z.e. II(P 0 T) - Til::; 2E. This completes the proof. •

Example 8.1.11 Let k E L2((0, 1) x (0,1)). For f E L2(0, 1), define

K(f)(t) = 10 1
k(t, s)f(s) ds.

Then, by an application of the Cauchy-Schwarz inequality,

I; IK(f)(tW dt I01 II01 k(t, s )f(s) dsl2 dt


< I01 (I01 Ik(t, s)12 dS) (I; If(sW dS) dt
Ilfll~ I01 Ik(t, s Wds dt
which shows that K E £(L2(0, 1)) and that

IIKII ::; Il k I L2«0,1)X(0,1))·


8.1 Basic Properties 237

Let {<Pn} be an orthonormal basis for L2(0, 1). Then {4>ij}(i,j)ENXN


is an orthonormal basis for L2((0, 1) x (0,1)) where

(cf. Exercise 7.9). Thus

L (k, 4>ij )4>ij.


00

k
i,j=1

If we set
n
kn(t,s) = L (k,4>ij)4>ij(t,S),
i,j=1
then Ilk - kn l1 2 ---+ 0. Consequently, if we define

Kn(f)(t) = 10 1 kn(t,s)f(s) ds
for f E L2(0, 1), we have that IIKn - KII :S Ilkn - kl12 ---+ 0. Clearly, the
image of K n is contained in the span of {<PI,' .. , <Pn} and so K n is of
finite rank. Hence K is a compact operator on L2(0, 1). (The operator
K is called the Hilbert-Schmidt operator induced by k.) •

Proposition 8.1.3 Let V and W be Banach spaces and let T E £(V, W).
Then T is compact if, and only if, Tot : W* ---+ V* is compact.
Proof: Assume that T is compact. Set K = T(Bv), where Bv is the
closed unit ball in V. Then, K is compact and Ilwll :S IITII for all wE K.
Now, let {v n } be a sequence in Bw*. Consider the sequence {l.{Jn} defined
by I.{Jn (w) = Vn (w) for w E K. Then, this is a sequence of continuous
functions on the compact metric space K. We havell.{Jn(w)1 :S liT II for all
wE K. Further Il.{Jn(wd-l.{Jn(W2)1 :S IlwI-W211 and so the sequence {l.{Jn}
is uniformly bounded and equicontinuous in C(K). By Ascoli's theorem,
it has a uniformly convergent subsequence {I.{Jnlj. Thus, in particular

sup Il.{Jnk(T(x)) -l.{Jnl(T(x))1


xEBv

can be made arbitrarily small for large k and l. In other words,

sup IT*(vnk)(x) - T*(vnl)(x)1


xEBv
238 8 Compact Operators

can be made arbitrarily small for large k and l, i.e. the sequence
{T*(vnkn is a Cauchy sequence and is hence convergent. This shows
that T* is compact.
Now assume that T* is compact. Then, by the preceding arguments,
it follows that T** : V** --+ W** is compact. Let Jv : V --+ V** and
Jw : W --+ W** be the canonical imbeddings. Let x E Bv. Then,
for any v E W*, we have T**(Jv(x))(v) = Jv(x)(T*(v)) = T*(v)(x) =
v(T(x)) = Jw(T(x))(v). Thus T**(Jv(Bv)) = Jw(T(Bv)). Conse-
quently, Jw (T( Bv)) is relatively compact in W** and since Jw is an
isometric isomorphism of W onto Jw(W), it follows that T(Bv) is rel-
atively compact in W. Thus, T is compact and the proof is complete .

8.2 Riesz-FredhOlm Theory

In this section we will be interested in operators of the form 1 - T on a


Banach space V, where 1 is the identity operator on V and T E £(V)
is a compact operator. Such operators are called compact perturbations
of the identity. Such operators have properties very similar to those of
linear operators on finite dimensional spaces.
Given a continuous linear operator A on a Banach space V, we will
denote its range by R(A) and its kernel (or null space) by N(A).

Theorem 8.2.1 (FredhOlm Alternative) Let V be a Banach space


and let T : V --+ V be compact. Let T* : V* --+ V* denote its adjoint.
Then:
(a) N(I - T) is finite dimensional;
(b) R(1 - T) is closed and

R(I - T) = N(I - T*)-L;

(c) N(I - T) = {O} if, and only if, R(I - T) = V;


(d) dim(N(1 - T)) = dim(N(I - T*)).

Proof: (a) Let Bl be the closed unit ball in N(I - T). If x E Bl, then
Ilxll :::;
1 and, further, x = T(x). Thus, Bl c T(B), where B is the
closed unit ball in V. Since T is compact, it follows that Bl is compact
and so N(I - T) is finite dimensional (cf. Proposition 2.3.6).

(b) Let fn E R(1 - T) and assume that fn --+ f in V. Let fn


Un - T(u n ). Since N(1 - T) is finite dimensional, it follows that there
8.2 Riesz-Fredholm Theory 239

exists Vn E N (1 - T) such that

(cf. Exercise 2.34). We can now write

In = (Un - v n ) - T(u n - vn )·
We now claim that {un - Vn } is a bounded sequence in V. If not, for a
subsequence, we have Ilu nk - v nk II --+ 00. Set
1
wnk = II u nk ._ v nk II (unk - v nk )·

Then Ilwnk II = 1. Further,


1
Ilu nk - vnk II Ink
and so it follows that wnk -T(wnk ) --+ o. Since T is compact, {w nk } has
a subsequence {Wnkl } such that T( Wnkl ) --+ Z E V. Then, it follows from
the preceding arguments that Wnkl --+ Z as well. Then T(W nkl ) --+ T(z)
and so we deduce that z = T(z), i.e. z E N(1 - T).
Now, since Vnkl E N(I - T), it follows that

Thus, on one hand z E N(I - T), while on the other hand, the above
considerations imply that d(z,N(1 - T)) = 1, which is a contradiction.
This establishes the claim.
Since {un - V n } is a bounded sequence, and since T is compact, we
have that, for a subsequence, T(u nk - vnk ) --+ 9 E V. Thus u nk - vnk --+
1+ 9 = h, say. It then follows that T(h) = T(f) + T(g) = g. This
implies that
h-T(h) = I
i.e. R(1 - T) is closed. Then, by virtue of Theorem 4.7.1, we deduce
that
R(I - T) = N(I - T*)-L.
(c) Assume that N (1 - T) = {O}. Assume further that VI = R( 1 - T) =1=
V. Since VI is a closed subspace of V, it is also a Banach space. Further,
if x E V, we have

T(x - T(x)) = (1 - T)(T(x))


240 8 Compact Operators

and so T(VI ) C VI. Thus, T : VI --+ VI is also compact and so 1 - T


restricted to VI also has a closed range. Let V2 = (I - T)(VI ). Again,
it follows that V2 is a proper subspace of VI. (If not, for every x E V, it
will follow that there exists y E V such that
(I - T)(x) = (I - T)2(y)
from which it will follow that x = (1 - T)y, since 1 - T is assumed to be
injective. This will then imply that V = VI, contrary to our assumption.)
Inductively, we thus get a decreasing sequence of closed subspaces {Vn }
such that Vn = (I - T)n(v) and Vn+ I is a proper subspace of Vn for all
n. Then, by Riesz' lemma (cf. Lemma 2.3.1), there exists Un E Vn such
that Ilunll = 1 and d(un , Vn+d ~ 1/2.
Now,

Assume that n > m. Then Un - T(u n ) E Vn+ I , Urn - T(urn) E Vrn+l,


Un E Vn and

Thus,
T( Urn - un) = Urn - w
where w E Vrn + l . Thus, if m i= n, by construction,
1
IIT(urn) - T(un)11 ~ 2".
Hence, {T( un)} cannot have a convergent subsequence while {un} is
bounded, contradicting the fact that T is compact. Thus, it follows that
R(I - T) = V.
Conversely, if R(1 - T) = V, then N(I - T*) = R(I - T).L = {O}.
Since T* is also compact, it now follows that R(I - T*) = V*. Then,
again, N(1 - T) = R(I - T*).L = {O}.

(d) Let d = dim(N(I - T)) and let d* = dim(N(I - T*)). Assume


that d < d*. Since N(1 - T) is finite dimensional, it is complemented
and so there exists a continuous projection P : V --+ N(I - T). Also
R(I -T) = N(1 -T*).L and so the range of (1 -T) has finite codimension
d*. Let W be a d* -dimensional subspace of V which is a complement to
R(1 - T). Since we have assumed that d < d*, there exists a linear map
A : N(I - T) --+ W which is injective but not surjective. Define

S = T+AoP'
8.2 Riesz-Fredholm Theory 241

Since T is compact and A 0 P is of finite rank, it follows that S is also


compact. Let u - S('I1,) = O. Then

o = (u - T(u)) + A(P(u))
where the first term belongs to R(I - T) while the second belongs to
its complement, W. Thus u - T(u) = 0 and so u E N(I - T). Thus,
P(u) = u. Further, 0 = A(P(u)) = A(u) and so u = 0 since A is
injective. Thus N(I - S) = {O} and so R(I - S) = V.
Now choose I E W\R(A). Let u E V be such that u - S(u) = I,
i.e. (u - T(u)) - A(P(u)) = I. Since both I and A(P(u)) belong to
W, which is the complement of R(I - T), it follows that u - T(u) = 0
which implies that I = -A(P(u)) E R(A), a contradiction. Thus,

d* < d.

Similarly,

dim(N(I - T**)) ::; dim(N(I - T*)) = d*.


But, if J : V ---+ V** is the canonical imbedding, then it is easy to see
that
J(N(I - T)) c N(I - T**)
and so d ::; dim(N(I - T**)) ::; d*. Thus, we deduce that d = d* and
this completes the proof. •

Remark 8.2.1 The conclusion (c) holds for all operators on finite di-
mensional spaces and is referred to in short as 'one-one if, and only if,
onto'. This property carries over to compact perturbations of the iden-
tity in infinite dimensional Banach spaces. In general it is not true for
a general operator on a Banach space. If x = (Xi) E £2 and we define
T E £(£2) by
T(x) = (O,Xl,X2,"'),
then, clearly T is one-one but not onto. •

Remark 8.2.2 The Fredholm alternative is stated as follows: let V be


a Banach space and let T E £(V) be a compact operator. Let I E V.
Then
• either - the equation u - T( u) = 0 has only the zero solution
and so the equation w - T( w) = I has a unique solution for all
IE V,
242 8 Compact Operators

• or the equation u - T(u) = 0 has d linearly independent


solutions and the equation w- T( w) = f will have a solution if, and
only if, f satisfies d orthogonality relations i.e. f E N(I - T*)l...
(In the case of finite dimensional spaces, these are the familiar
consistency conditions to be satisfied by the right-hand side vector
in case of singular matrices.) •

Remark 8.2.3 Theorem 8.2.1 is the starting point of the theory of


Fredholm operators. A Fredholm operator A : V ~ W between Banach
spaces V and W is one which is such that N(A) is finite dimensional
and R(A) is closed with finite codimension. The (Fredholm) index of A
is given by
i(A) = dim(N(A)) - codim(R(A)).
Compact perturbations of the identity on a Banach space are thus
Fredholm operators with index zero. It can be shown that the class
of Fredholm operators is open in £(V) and that the index is a continu-
ous function in this space. Since it is integer valued, the index will be
constant on connected components of this class. •
We conclude this section by proving a useful result which could be
considered as a generalization of the Lax-Milgram lemma (cf. Corollary
7.3.1).

Proposition 8.2.1 Let V and H be Hilbert spaces such that V c H,


the inclusion being dense and compact. Let a(.,.) : V X V ~ lR be a
continuous bilinear form such that a(v, v) = 0 if, and only if, v = O.
Assume, further, that there exist constants 0: > 0 and {3 > 0 such that,
for every v E V,
a(v,v) ~ o:llvll~ - {3"v"~. (8.2.1)
Let f E H be given. Then, there exists a unique u E V such that

a(u, v) = (f, V)H (8.2.2)

for every v E V.

Proof: Consider the bilinear form A(.,.) : V x V ~ lR given by

A(v,w) = a(v,w) + {3(V,W)H.


Then, A(.,.) is clearly continuous and, by virtue of (8.2.1), we have

A(v, v) ~ o:llv"~.
8.2 Riesz-Fredholm Theory 243

Further, the map v I--t (f, v) H is clearly a continuous linear functional


on V. Hence, by the Lax-Milgram lemma, there exists a unique u E V
such that
A(u, v) = (f, V)H
for every v E V. Set u = G(f). Thus, G E £(H, V). Since the inclusion
of V in H is compact, we have that G E £(H) is compact.
Now, if u E V is a solution of (8.2.2), then

a(u, v) + (3(u, V)H = (f + (3u, V)H


for every v E V, and vice-versa. Thus, u E V satisfies (8.2.2) if, and
only if, u = G(f + (3u). This can be rewritten as

z - (3G(z) = j, (8.2.3)

where z = j + (3u. Thus, (8.2.2) has a unique solution for given j if,
and only if, the same is true for (8.2.3).
Now, I - (3G is a compact perturbation of the identity and so, it is
surjective if, and only if, it is injective. Assume that (I - (3G)w = O.
Thus, w = G((3w), and thus, w E V and, for every v E V, we have

a(w,v) + (3(W,V)H = ((3w,V)H.

We hence deduce that a(w,v) = 0 for every v E V and so, in particular,


we have a(w, w) = 0, whence, we deduce that w = o. Thus, 1- (3G is
injective and, therefore, surjective as well. This completes the proof. •

Example 8.2.1 Let V = H1(0, 1) and H = L 2 (0, 1). Let k > o. Set

1
a(u, v) = 10 u'(x)v'(x) dx + k[u(l)v(l) + u(O)v(O)],

for u and v E H1(0,1). If a(v, v) = 0, we see that v' = 0 and v(l) =


v(O) = O. This implies that v = O. Further,

a(v, v) ~ 101Iv'(x)12 dx = Ilvlli,2 - Ilvll~


where
1 1
Ilv111,2 (10 (Iv'(x)12 + Iv(x)12) dX) 2
244 8 Compact Operators

is the norm in HI (0, 1). Thus, all the hypotheses of the preceding propo-
sition are satisfied and so, given f E L2 (0, 1), we have a unique solution
u E Hl(O, 1) satisfying

a(u, v) = fol f(x)v(x) dx (8.2.4)

for every v E Hl(O, 1).


It can be shown that (8.2.4) is the weak formulation (cf. Remark
8.1.2) of the following boundary value problem:

~ = fin (0,1) (8.2.5)


ku(O) - u'(O) = ku(l) + u'(l) = o.
The boundary condition in (8.2.5) is called a Robin boundary condition.

8.3 Spectrum of a Compact Operator

In this section, we will assume that all Banach spaces are complex.

Definition 8.3.1 Let V be a Banach space and let T E .c(V). The


spectrum of T is the set O'(T) of scalars defined by

O'(T) = {A Eel T - >"1 is not invertible}.

The resolvent of T is the set p(T) defined by

p(T) = {A Eel T - >"1 is invertible} . •

If T E .c(V) is such that IITII < 1, then (cf. Exercise 2.29) 1- T is


invertible and its inverse is given by the Neumann series

and, it is easy to see that

In particular, if T is any invertible operator on V and if 8 E .c(V) such


that
1
11811 < liT-III'
8.3 Spectrum of a Compact Operator 245

then it follows that T - S is also invertible. We deduce from this that the
set of all invertible operators is open in £(V). In particular, if A E p(T),
then A + <5 E p(T) as well, if <5 is sufficiently small. Thus, it follows that
p(T) is an open subset of C and so the spectrum is a closed subset of C.
Now assume that IAI > IITII. Then

T - AI = -A(I - A-IT)

and, since IIA -ITII = IAI-lIITII < 1, it follows that T - AI is invertible


and so A E p(T). Thus,

a(T) c {A Eel IAI ~ IITII}·


Thus, the spectrum is a compact subset of the complex plane.
Definition 8.3.2 Let B(O; r) denote the closed ball in the complex plane
with centre at the origin and radius r. Let V be a Banach space and let
T E £(V). The spectral radius ofT, denoted r(T), is defined by

r(T) = inf{r > 0 I a(T) c B(O; r)} . •

We now show that the spectrum of a continuous linear operator on


a (complex) Banach space is always a non-empty set. Let A E p(T). Set
T(A) = (T - A1)-l. Since we can write T(A) = A-l((l/A)T - 1)-1, it
follows that, for IAI > IITII, we have

and so IIT(A) II is bounded and tends to zero as IAI ~ 00.


Now let A and f-t belong to p(T). Then

T(A) T(A)(T - f-t1)T(f-t)


T(A)(T - AI + (A - f-t)1)T(f-t)
(I + (A - f-t)T(A))T(f-t)
which implies that

T(A) - T(f-t) = (A - f-t)T(A)T(p,). (8.3.1)


This is usually known as the resolvent equation.
Now, let f E £(V)*. Define

<p(A) = f(T(A))
246 8 Compact Operators

for A E p(T). Then,


I'P(A)I ~ Ilfll·IIT(A)II· (8.3.2)
Further, by (8.3.1), it follows that

lim 'P(A) - 'P(J.L) = f(T(J.L)2). (8.3.3)


A - J.L
A---->J.L

We conclude from (8.3.2) that 'P(A) is bounded and vanishes as A ap-


proaches infinity, and from (8.3.3) that 'P is differentiable at every point
of p(T). Thus, if p(T) = C, then 'P will be a bounded and entire func-
tion and hence, by Liouville's theorem, a constant. Since it vanishes at
infinity, it will follow that 'P == O. Since f can be arbitrarily chosen, it
follows then that T(A) = 0 for all A E C = p(T) which is absurd since
it is the inverse of the map T - AI. Thus p(T) cannot be equal to the
entire complex plane. In other words, a(T) is non-empty.
Thus, the spectrum of a continuous linear operator,T, is a non-empty
compact set of the complex plane lying inside the ball centered at the
origin and having a radius equal to IITII.

Remark 8.3.1 The non-emptiness of the spectrum is crucially depen-


dent on the fact that the scalar field is the field of complex numbers.
Consider the real Banach space]R2 (with any norm). Consider the linear
operator, T, defined via the matrix

Then T - AI is not invertible if, and only if A2 + 1 = 0 which has no real


solution. Thus the spectrum of T is empty, if the space is ]R2, while it
is the set {i, -i} when the space is C 2 . •
Definition 8.3.3 An element A of the spectrum a(T) of a continuous
linear operator T on a Banach space V is called an eigenvalue of T if
N(T-AI) =1= {O}. In this case dim(N(T-AI)) is called the geometric
multiplicity of the eigenvalue A. The non-zero elements of the space
N(T - AI) are called the eigenvectors of T associated to the eigenvalue
A. •
Remark 8.3.2 In finite dimensions, the spectrum coincides with the set
of all eigenvalues. This need not be the case in infinite dimensions. For
example, consider the (complex) space £2 and the map
8.3 Spectrum of a Compact Operator 247

Then, since T is not onto, it is not invertible. Thus 0 E cr(T). However,


since T is injective, A = 0 is not an eigenvalue. •

Proposition 8.3.1 Let H be a Hilbert space and let T E .c(V). Then


A E cr(T) if, and only if, XE cr(T*).
Proof: Observe that the adjoint of T - AI is T* - XI. The result is now
an immediate consequence of Theorem 4.7.1. •

Example 8.3.1 Let V = £2. Consider the map

x ~ S(x) = (X2, X3,···)

where x = (Xl,X2,X3,···) E £2. Then, clearly, IIS(x)112 ~ IIxI12 and so


IISII ~ 1. (In fact IISI12 = 1; why?) The operator S is called the left shift
operator. Thus,
cr(S) C {AEC IIAI ~ I}.
Let A of.
0 and assume that S(x) = AX. Then, for all i 2:: 1, we have
Xi+l = AXi which yields Xi = Ai-Ixl. Since X E £2, we have that
Xi --t 0 as i --t 00, and so it follows that IAI < 1. On the other hand, if
0< IAI < 1, the vector

x = (1,A,A 2,A3, ... ) E £2

and, indeed, S(x) = AX. Further, S(el) = O. Thus, it follows that every
A such that IAI < 1 is an eigenvalue of S and so is in the spectrum of S.
Since the spectrum is closed, it now follows that

cr(S) = {A E C IIAI ~ I}.


If IAI = 1, then though it is in the spectrum, it is not an eigenvalue.
It is now immediate to see that the adjoint of S is the mapping T
on £2 given by
T(x) = (O,XI,X2,X3,···).
This is called the right shift operator. By Proposition 8.3.1, it follows
that the spectrum of T is also the closed unit disc in the complex plane.
Now, if T(x) = AX, then AXI = 0 and, for all i 2:: 2, we have AXi = Xi-I.
If A = 0, then Xi = 0 for all i and so N(T) = {O}. If A of. 0, then Xl = 0
and again, it follows that Xi = 0 for all i. Thus, there are no eigenvalues
for T . •
248 8 Compact Operators

Thus, we see that the spectrum, in infinite dimensional spaces, can


be of various kinds. In the case of the operator S above, there is a
continuum of eigenvalues and a continuum of elements in the spectrum
which are not eigenvalues, while the operator T has a spectrum which is
a continuum of complex numbers but without any eigenvalues. We will
now see that compact operators behave more or less like operators on
finite dimensional spaces.

Theorem 8.3.1 Let V be an infinite dimensional Banach space and let


T E C(V) be a compact operator. Then,
(a) a E O"(T);
(b) every non-zero element of the spectrum is an eigenvalue of T with
finite (geometric) multiplicity;
(c) if {An} is a sequence of distinct non-zero eigenvalues of T, and if
An ---t A, then A = a (in particular, the non-zero elements of the spectrum
are all isolated);
(d) one of the following alternatives holds:

• O"(T) = {a},
• O"(T) \ {a} is finite,

• O"(T)\{a} consists of a sequence of eigenvalues which converges to


zero.

Proof: (a) We have already seen (cf. Example 8.1.4) that a compact
operator is not invertible. Thus a E O"(T).

(b) Let A i= a. Now T-AI = -A(1 _A-IT) and the Riesz-F'redholm the-
ory applies. Thus, if N (I - A-IT) = {O}, then T - A-I I is onto as well.
Then, by the open mapping theorem, it is also invertible. Thus, if A i= a
is in the spectrum of T, it follows that N(T - AI) = N(1 - A-IT) i= {O}
and that it is finite dimensional (cf. Theorem 8.2.1).

(c) Since An is an eigenvalue for each n, let Un i= 0 be such that T(u n ) =


An Un· The set {Un} is a linearly independent set. Indeed, assume that
the set {U 1, ... , Un} is linearly independent (this is true for n = 1 since
Ul i= 0) and that we can write

n
Un+l LajUj.
j=I
8.3 Spectrum of a Compact Operator 249

Then,
n n
L O jA n+1 U j LCXjAjUj.
j=l j=l

Since the set {U1,'" ,Un} is linearly independent and since Aj =1= An+1
for all such j, we deduce that CXj = 0 for 1 ::; j ::; n, whence Un+1 = 0,
which is a contradiction. This completes the induction and proves the
linear independence of the set of eigenvectors {un}.
Set Vn = span {U1, ... , un}. Then we have an increasing sequence of
subspaces such that Vn is a proper subspace of Vn+1 for all n. These
subspaces are all finite dimensional and hence, closed. By Riesz' lemma
we can find Vn E Vn such that Ilvnll = 1 and d(vn, Vn-1) ~ 1/2 for all
n ~ 2.
Let An - t A =1= O. Then tL v n } is a bounded sequence and so, since
T is compact, the sequence

must have a convergent subsequence. Now, if 2::; m < n, we. have

Thus,

Now, T(Vn)-AnVn E Vn- 1 (why?) and, similarly, T(Um)-AmVm E Vm- 1.


Thus

where wE Vn -1 and so, by construction,

which contradicts the fact that the sequence {(l/A n )T(vn )} has a con-
vergent subsequence. This proves that A = O.
250 8 Compact Operators

(d) In view of (c) we deduce that, for all positive integrs n, the set

is either empty, or finite (since u(T) is compact). Thus u(T)\{O} is


empty, finite or countably infinite. In the last case, by the compactness
of u(T), every subsequence of this set has a convergent subsequence
which must converge to zero. Hence the entire sequence in u(T)\{O}
converges to zero. •

Example 8.3.2 If T : V ~ V is an operator of finite rank, then obvi-


ously it can only have a finite number of eigenvalues. Now, consider the
map T : £2 ~ £2 defined by

where x = (Xl,X2,·· .). Then, Tn+l(x) = 0 for all x E £2. Then it is


clear that the only eigenvalue of T is zero. •

Example 8.3.3 Let {an} be a sequence of complex numbers such that


an ~ O. Define T E £(£2) by

where x = (Xl,X2,·· .). Define

Then Tn is of finite rank. Further, for any x = (Xi) E £2, we have

from which we immediately see that liT - Tn" ~ O. Thus T is com-


pact. Clearly each an is an eigenvalue with associated eigenvector en·
If an -I- 0 for any n, then zero is not an eigenvalue of T (since T will
then be injective). If a finite number of the an are zero, then zero will
be an eigenvalue of finite multiplicity. If infinitely many an are zero,
then zero will be an eigenvalue with infinite multiplicity. Thus while
zero is always in the spectrum of a compact operator, it need not be an
eigenvalue or it could be an eigenvalue of finite or infinite multiplicity,
8.4 Compact Self-Adjoint Operators 251

unlike non-zero elements of the spectrum which are always eigenvalues


of finite multiplicity. •

Example 8.3.4 Let T : £2 ---+ £2 be given by

where x = (Xl, X2, ... ), and where {an} is a sequence of complex num-
bers. We saw that if an ---+ 0, then T is compact. Conversely, if T
is compact, it follows that an ---+ 0. Indeed, since an are all in the
spectrum, by Theorem 8.3.1 (c), we deduce that an ---+ 0. •

8.4 Compact Self-Adjoint Operators

In this section, we will study the spectrum of a compact self-adjoint


operator on a Hilbert space. If H is a Hilbert space and T E £( H) is
self-adjoint, recall that

(T(x), y) = (x, T(y))

for all x and y E H.

Example 8.4.1 Consider the operator G : L2(0, 1) ---+ L2(0, 1) defined in


Example 8.1.9. Then G is self-adjoint. To see this, let f and g E L2(0, 1).
Set u = G(f) and v = G(g). Then (cf. Example 8.1.9)

(G(f), g) = 10 1 gu dx = 10 1 a(x)v'(x)u'(x) dx = 10 1 fv dx = (f, G(g))

which establishes our claim. In particular, the operator G considered


in Example 8.1.8 is also self-adjoint. Both these are thus examples of
compact and self-adjoint operators on L2(0, 1) . •

Proposition 8.4.1 Let H be a (complex) Hilbert space and let T E


£(H) be a compact self-adjoint operator. Then, (J(T) C JR.
Proof: Let Ai-
so there exists x
°i-
be an element of (J(T). Then, it is an eigenvalue and
0 such that T(x) = AX. Thus

A(X, x) = (T(x), x) = (x, T(x)) = ~(x, x).

Since x is non-zero, it follows that A = ~. This completes the proof. •


252 8 Compact Operators

Remark 8.4.1 Since the spectrum of a compact self-adjoint operator


is a subset of the reals, the results which follow will also apply to real
Hilbert spaces. •

Proposition 8.4.2 Let T : H --t H be a self-adjoint compact operator.


Set

m = inf (T(u), u) and M = sup (T(u), u).


uEHillull=l uE H ill u ll=l

Then, a(T) c [m, M] and, further, both m and M belong to a(T).

Proof: Let A > M. Then, for any u E H, we have

Thus, if we set a(u, v) = (Au - T(u), v), we have

a(u,v) = (u,Av-T(v)) = a(v,u)

since A is real and T is self-adjoint. Thus, a(.,.) defines a new inner


product whose norm is equivalent to the original one. Hence, by the
Riesz representation theorem, it follows that for any f E H, there exists
a unique u E H such that

(Au - T(u), v) = (j, v)

for all v E H, i. e. AI - T is onto; since it is also one-one (why?)


and continuous, it is invertible, by the open mapping theorem. Thus
A E p(T).
Again, consider a(u,v) = (Mu - T(u), v). This almost defines an
inner product on H; observe that a(Mu - T(u), u) ~ 0 for all u E H
but this quantity being zero does not imply that u = O. Nevertheless,
we can still prove the Cauchy-Schwarz inequality for this 'inner product'
using the same arguments as in the proof of Theorem 7.1.1. Thus, we
deduce that
1 1
I(Mu - T(u), v)1 < (Mu - T(u), u)'i (Mv - T(v), v)'i

for all u and v E H. Thus,


1 1
I(Mu - T(u), v)1 :S: IIMI - T11211vll(Mu - T(u), u)2.
8.4 Compact Self-Adjoint Operators 253

This implies that

IIMu -
1
T(u)11 ~ C(Mu - T(u), U)2

where C = 11M! - T112.


1

Let {un} be a sequence in H such that Ilunll = 1 for all n and such
that (T(u n ), un) - t M as n - t 00. Then, it follows that MU n -T(u n ) - t
O. This proves that M E aCT). (If not, then M! - T would be invertible
and we would have

while Ilunll = 1, which is a contradiction.)


The results for m will now follow from the above by considering -T
instead of T. •
Corollary 8.4.1 Let T E £(H) be a compact and self-adjoint operator
such that aCT) = {O}. Then T = o.
Proof: Let Re(z) and Im(z) stand for the real and imaginary parts
of a complex number z. By the preceding proposition, it follows that
(T(u), u) = 0 for all u E H. Thus, if u and v are arbitrary vectors in H,
we have

2Re((T(u), v)) = (T(u + v), u + v) - (T(u), u) - (T(v), v) = O.


Then, it also follows that 0 = Re((T(iu), v)) = -Im((T(u), v)). Thus
(T(u), v) = 0 for all u and v E H. It is clear from this that T(u) = 0
for all u E H, which completes the proof. •
Theorem 8.4.1 Let H be a separable Hilbert space and let T E £(H) be
a compact self-adjoint operator on H. Then H admits an orthonormal
basis consisting of eigenvectors of T.
Proof: Let a(T) = {O} U {An I n EN}. Set AO = O. The numbers An
for n E N are all distinct eigenvalues of T. Thus, if we set Eo = N(T)
and En = N(T - AnI) for n E N, it follows that

o ~ dim(Eo) ~ 00 and 0 < dim(En) < 00

for n E N.
The spaces En, n 2::': 0 are all mutually orthogonal. To see this, if
u E En and v E Em, where n i=- m, we have
An(U,V) = (T(u), v) = (u,T(v)) = Am(U,V)
254 8 Compact Operators

which implies that (u, v) = 0 since An i- Am. Let F be the linear


subspace generated by the En, n 2: O. We will show that F is dense in
H. In other words, we will show that Fl.. = {O}.
Clearly T(F) c F. If u E Fl.. and v E F, we have

(T(u), v) = (u, T(v)) = o.


Thus T(Fl..) c Fl.. as well. Let To be the restriction of T to Fl... Then
To E £(l{l..) is compact and self-adjoint. Hence, if To has a non-zero
element A in its spectrum, then it must be an eigenvalue. But then, it
will mean that there exists u i- 0 in Fl.. such that >:'u = Tou = Tu which
implies that u E F, which is impossible. Thus O"(To) = {O}, whence, by
the preceding corollary, To = o. Thus

Fl.. c N(T) c F

~. . Fl.. = {O}.
Now, if Eo is a non-zero subspace, it is either finite dimensional or
is an infinite dimensional separable subspace of H and so it has an or-
thonormal basis. Each En, n 2: 1, is finite dimensional and admits an
orthonormal basis. The union of all these bases forms an orthonormal
basis of H, since F is dense in H (cf. Corollary 7.4.1) . •

We now give a variational characterization of the non-zero eigenval-


ues of a compact self-adjoint operator, T, on a separable Hilbert space,
H.
Let us number the non-negative eigenvalues of T in decreasing order
of magnitude and the non-positive eigenvalues in increasing order of
magnitude. Each eigenvalue is repeated as many times as its (geometric)
multiplicity. Thus, we have

At > At > > 0


Al < A2 < < O.

Let u;i:;, be a normalized eigenvalue of A;i:;, and u:;;, a normalized eigenvalue


of A:;;' such that the set of all the u;i:;, and u:;;, form an orthonormal basis
of H. Let Vn+ be the finite dimensional space generated by the vectors
{ ut , ... , u;t} and let Vo = {O}. Similarly we can define Vn- for n 2: O.
With these notations, we can now prove the following result.
8.4 Compact Self-Adjoint Operators 255

Theorem 8.4.2 For each m 2: 1, we have

(T(v),v)
m xv~o; ..v~ l JiViT2""
(T(v),v)
minVCH; dim(V)=m-l m xv~o; vl.V JiViT2"".
Proof: Since ~ is an eigenvector corresponding to the eigenvalue .~,
and since Il ~1I = 1, it follows immediately that

(8.4.1)

On the other hand, if v E H, we have (cf. Theorem 7.4.2)

v = 2)v, ut)ut + I)v, u;;)u;;


k n

and so
T(v) = I:>.t(v,ut)ut + I:>.;;(v,u;;)u;;.
k n

We then deduce that

(T(v), v) = I: >.tl(v, ut)1 + I: >.;;I(v, u;;)1


2 2 < I: >.tl(v, utW·
k n k

If, in addition, v ~ Vm - 1, then,

by repeated use of Parseval identity (cf. Theorem 7.4.2). Since ~ ~


Vm - 1 , the preceding computation and (8.4.1) imply that

(T(v), v)
>.+ = max (8.4.2)
m v~O;vl. m l IIvll 2
Finally, let V be a (m - 1)-dimensional subspace of H. Then, there
exists a non-zero vector u in the m-dimensional space v;t such that
u ~ V (why?). Thus

(T(v), v) > (T(u), u)


sup
v~O;vl. IIvll 2 lIull 2
256 8 Compact Operators

(T(u),u) = La;,Xt 2: ,X;t;lluI1 2 .


i=l

Combining these inequalities, we get

(T(v), v)
sup
V#Ojvl.V IIvl1 2
In fact, since T is compact, the 'sup' above is a 'max' (why?). Further,
in view of (8.4.2), we deduce that

,X+ = min (T(v),v) (8.4.3)


m VCHjdim(V)=m-l v j~ IIvl1 2 .

This completes the proof. •

Remark 8.4.2 We can prove analogous relations for the eigenvalues


,X~ .•
Corollary 8.4.2 With the preceding notations, we have

Example 8.4.2 Let us consider, once again, the operator G defined on


£2(0,1) as in Example 8.1.9 (and also Example 8.4.1). Consider the
following 'eigenvalue problem': find (u,'x) E (HJ(O,l)\{O}) x lR such
that
a(u, v) = 'x(u, v) (8.4.4)
for all v E HJ(O, 1). In view of the definition of G, it follows that

G('xu) = u.

Since a(.,.) is coercive, there is no solution corresponding to ,X = 0. If


,X i= 0, we can write
G(u) = ,X -lu.

adjoint compact operator. Since (G(u), u) 2: °


Thus the problem reduces to finding the spectrum of G which is a self-
for all u E £2(0,1),
it follows that all eigenvalues ,X are strictly positive. Since G has a
sequence of eigenvalues /-Ln - t 0, the problem (8.4.4) admits a sequence
8.4 Compact Self-Adjoint Operators 257

of eigenvalues An = /1;;1 which tend to infinity. In view of the preceding


corollary, we get that

(G(u), u)
All = /11 = max
~O; EL2(O,1) Il ll~
Notice that the equation (8.4.4) is the 'weak formulation' (cf. Re-
mark 8.1.2) of the following two-point boundary value problem:

_ -sL
dx
(o:du)
dx AU in (0,1)
u(O) = u(l) 0.

Thus, eigenvectors of such boundary value problems generate, in a nat-


ural way, orthonormal bases for Hilbert spaces like L2(0, 1). Several
special functions of mathematical physics are, in fact, eigenvectors of
certain boundary value problems for second order differential equations.
For instance, we cite the cases of the Bessel functions, Legendre poly-
nomials (cf. Exercise 7.11) and the Hermite polynomials (cf. Exercise
7.12). For details, the reader is referred to Simmons [10] . •

Remark 8.4.3 Notice that, in the preceding example, all the eigenvec-
tors will, in fact, belong to HJ(O, 1). Using arguments very similar to
those in the proof of Theorem 8.4.2, it can also be shown that if the
eigenvalues {An} are numbered in increasing order of magnitude (tak-
ing into account the geometric multiplicities), then they can be given
the following variational characterization in HJ(O, 1). Let {w n } be the
orthonormal basis of eigenvectors (in L2(0, 1)) with Wn corresponding
to An for each n. Let Vn be the finite dimensional space spanned by
{WI,"" W n }. Set
a(v,v)
R(v)
TvTIr
for v E HJ(O, 1), v i= O. Then

maxv,to; vEVn R(v)

minv~o; ~ n l R(v)

minVCHJ(O,I); dim(V)=n maxv,to; VEV R(v).


258 8 Compact Operators

In particular, we have

Al = min R(v).
v;fo; vEHJ(O,I)

For details, see Kesavan [3]. (A proof is outlined in Exercise 8.20 in


the next section.) This result is an infinite dimensional version of the
characterization of eigenvalues of a hermetian positive definite matrix
(cf. Proposition 1.1.8) .•

Example 8.4.3 A particular case of the preceding example is the case


where a(x) == 1. The corresponding differential equation is

~ AU in (0,1)
u(O) u(l) = O.
A simple computation yields that the only possible solutions to this
equation are given by

where An is an arbitrary constant. Choosing An = J2, we get an or-


thonormal sequence in £2(0, 1). We have already seen that it is complete
(cf. Example 7.4.6). In particular, we have

2 . f01 l'\7vl 2 dx
Al = 7r = mIn "-"-'1--'---
40; VEHJ(O,l) fo Ivl 2 dx
Thus, for every v E HJ(O, 1), we deduce that

This is just Poincare's inequality (cf. Theorem 6.4.6), except that we


have a better constant in the estimate compared to the inequality (6.4.4).
In fact, we have shown in this case that 1/7r is the best possible constant
and equality is attained in the above inequality for the function x t---t
sin 7rX • •

8.5 Exercises

8.1 Let V and W be Banach spaces and let T E £(V, W) be compact.


Show that if Xn ->. X in V, then T(xn) - t T(x) in W.
8.5 Exercises 259

8.2 (a) Let V and W be Banach spaces and let V be reflexive. Let T E
£(V, W). Assume that whenever Xn -->. X in V, we have T(x n ) -+ T(x)
in W. Show that T is compact.
(b) Show, by means of an example, that the conclusion is not true, in
general, if we drop the assumption that V is reflexive.

8.3 (a) Let V be a reflexive Banach space and let T E £(V,RI). Show
that T is compact.
(b) Let W be a reflexive Banach space and let T E £(CO, W). Show that
T is compact.

8.4 Let H be a Hilbert space and let A E £(H) be compact. Assume


that (A(x), x) 2: 0 for all x E H. Show that there does not exist a
constant 0: > 0 such that

(A(x), x) > 0:1Ixl1 2


for all x E H.

8.5 Let V be a Banach space and let T E £(H) be compact. Show that
if 8 E £(V), then 8(1 - T) = 1 if, and only if, (1 - T)8 = 1. In this
case, deduce that 1 - (1 - T)-I is compact.

8.6 Let H be a Hilbert space. Let T E £(H) be compact. Show that

IITII = max
IlxlI=I
XEH;
IIT(x)ll·

8.7 Let Hi, i = 1,2,3 be Hilbert spaces such that

with continuous inclusions. Assume, further, that the inclusion of HI


in H2 is compact. Show that for every E: > 0, there exists a constant
C£ > 0 such that

for all u E HI.


260 8 Compact Operators

8.8 Let { j ~l be a sequence of numbers such that L::~ll jl < 00.


Consider the infinite matrix

Let A be the linear mapping defined on £2 by this matrix (cf. Exercise


2.26). Show that A E £(£2) and that it is compact.

8.9 (a) Let W be a Banach space. Assume that there exists a sequence
of operators of finite rank, {Pn }, in £(W) such that Pn(Y) ~ Y for all
YEW. Show that, if V is any Banach space, and if T E £(V, W) is
compact, then T is the limit (in £(V, W)) of operators of finite rank.
(b) Deduce that, if V is any Banach space, and if T E £(V,£p), where
1 :S: p < 00, is compact, then T is the limit of operators of finite rank.

8.10 Let x = (Xl, X2,···) E £2. Define


S(X) = (0,Xl,X2,···)
M(x) = (Xl,X2/2,X3/3,···).
Set T = MS. Show that T E £(P-2) is compact and compute its spec-
trum.

8.11 Let L 2 (0, 1) be the space of complex valued square integrable func-
tions. Let K be the Hilbert-Schmidt operator (cf. Example 8.1.11)
defined on this space by the function ik where
I if s < t
k(t,s) = { -1 if s;: t.

Show that all the eigenvalues of K are given by

A - 2 kEZ
k - (2k + 1)7r'
with corresponding eigenvector t ~ exp(i(2k + l)7rt).
8.12 Consider the operator K E £(L2(0, 1)) defined by

K(f)(t) = fat f(s) ds.


8.5 Exercises 261

Show that it is compact and compute its spectrum.

8.13 Let V be a Banach space and let T E £(V) be compact. Let L


and M be closed subspaces of V such that M is strictly contained in L.
Assume that (I - T)L c M. Show that there exists x E L such that
IIxll = 1 and IIT(x) - T(y) II ~ 1/2 for all y E M.
8.14 Let V be a Banach space and let T E £(V) be compact.
(a) Show that for all n E N, the operator (I - T)n is a compact pertur-
bation of the identity.
(b) Let Nk = N((I - T)k) for kEN. Show that Nk is an increasing se-
quence of finite dimensional spaces which is stationary, i.e. Nk C Nk+l
for all kEN and there exists lEN such that Nk = Nl for all k > land
l is the least positive integer with that property. (The dimension of Nl
is called the algebraic multiplicity of T.) (Hint: use Exercise 8.13.)
(c) If V is a Hilbert space and if T is self-adjoint, show that l = 1, i.e.
the algebraic and geometric multiplicities coincide.
(d) Let Fk = R((I - T)k). Show that {Fd is a decreasing sequence of
closed subspaces of V and that Fk = Fl for all k > l.
(e) Show that V = Nl EB Fl.
(f) Show that T(NI ) c Nl and that T(Fl) c Fl.
(g) Show that I - T is an isomorphism of Fl onto itself.

8.15 (a) Let V be a Banach space and let T E £(V) be compact. Let
A 1= O. Show that the results of the previous exercise are valid when
I - T is replaced by AI - T. (If A is an eigenvalue of T, then N(AI - T)
is called the eigenspace associated to A while the space Nl obtained in
section (b) of the preceding exercise is called the generalized eigenspace
associated to A and is denoted N (A). The corresponding complement Fl
is denoted F(A).
(b) Show that if A and fJ, are disctinct non-zero eigenvalues of T, then
N(fJ,) C F(A).

8.16 (a) Let H be a Hilbert space and let Sand T E £(H). Let A =1= 0 be
an eigenvalue of ST with eigenvector u. Show that T(u) is an eigenvector
ofTS.
(b) Show that the non-zero eigenvalues of ST and T S are the same and
that
T(N(ST - AI)) = N((TS - AI))
262 8 Compact Operators

for each such eigenvalue A.

8.17 Let H be a Hilbert space and let T E £(H) be self-adjoint. Show


that
IITII = sup I(T(x), x)l·
xEH; Ilxll=1

8.18 Let H be a Hilbert space and let T E £(H) be compact and self-
adjoint. Show that either IITII or -IITII is an eigenvalue of T.

8.19 Let H be a Hilbert space and let T E £(H) be self-adjoint. If T2 is


compact, show that T is compact as well. Generalize this result to the
case when Tn is compact for some positive integer n.

8.20 Let V and H be real Hilbert spaces with inner products (., .)v and
(., ')H respectively; let the corresponding norms be denoted by 11.llv and
II.IIH. Assume that V c H, the inclusion being continuous and compact.
Assume, further, that V is a dense subspace of H. Let a (., .) : V X V ~ lR
be a continuous, symmetric and coercive bilinear form.
(a) Let f E H. Show that there exists a unique vector u E V such that

a(u, v) = (f, V)H

for every v E V.
(b) Define G(f) = u. Show that G E £(H) and that it is self-adjoint
and compact. Show, further, that

(G(u), U)H > 0

for every u E H, u 1= o.
(c) Show that the image of G is dense in V.
(d) Show that there exists a sequence of positive real numbers {An} and
an orthonormal basis {un} of H such that each Un E V and

for every v E V.
(e) Set Vn = (l/A)u n . Show that {v n } forms an orthonormal basis of
V (for the inner product induced on V by the bilinear form a(., .)).
(f) Show that
, . a(v,v)
"'I = mIn 2 .
vEV; #0 IlvllH
8.5 Exercises 263

(g) If U E V is such that Il ll~ = 1 and a(u, u) = A1, then show that
a(u,v) = A1(U,V)H
for every v E V.
(h) Let Vn be the space spanned by {u 1, u2, ... , un}. Set

R( ) = a(v,v)
v Ilvll~
for v E V, v I- O. Let

Vn..l = {v E V I (v, w) H = °for all w E Vn}.


Show that

minwcv; dim(W)=n maxvEW; vlO R(v).


Remark 8.5.1 Exercise 8.20 above is an abstract version of the situa-
tion described in Remark 8.4.3. In that case we have V = HJ(O, 1) and
H = L2(0, 1) . •

8.21 Let V and H be as in the preceding exercise. Let a(., .) : V x V ---> lR


be a continuous and coercive bilinear form. For 9 E H, denote by u g E V,
the unique solution such that
a(u g , v) = (g, V)H

°
for every v E V.
(a) Show that there exists a constant C > such that

(b) Let K c H be a closed convex subset. Let f E H be given. For


o E K, let u(O) E V be given by u(O) = Uf+f) = uf + Uf)o Show that if
On -" 0 weakly in H, then u(On) ---> u(O) in H (i.e. in norm).
(c) Let, further, Uo E H be given. Define
1 2 1 2
J(O) = "2 llu (O) - uollH + "21101IH'
264 8 Compact Operators

Show that there exists a unique 0* E K such that

J(O*) = min J(O).


()EK

(d) Show that J is differentiable; if hE H, show that

J'(O)(h) = (p + 0, h)H
where p E V satisfies

a(v,p) (u(O) - UO,V)H

for every v E V.
(e) Deduce that if 0* minimizes J over K, then

(0* + p*,O - O*)H ~ 0

where p* E V is the unique solution of the problem

a(v,p*) = (u(O*) - UO,V)H


for every v E V.
(f) Conversely, if (O* ,u*,p*) E K x V x V satisfies the system:

a(u*,v) = (f + 0* , v) H for every v E V


a(v,p*) (u* - Uo, V)H for every v E V
(0* + p*,O - O*)H > o for every 0 E K,

show that 0* minimizes J over K.

Remark 8.5.2 The preceding exercise is an example of an abstract


optimal control problem. The set K is the set of admissible controls
and H is the control space. The equation defining u( 0) is called the
state equation and u( 0) is the state corresponding to the control O. The
state Uo is the desirable state and so we seek to find a control such
that the corresponding state is as close to the desired state as possible.
We thus seek to minimize J which also takes into account the 'cost' of
exercising the control. The minimizer 0* is called the optimal control.
The corresponding state u* is the optimal state. The element p defined
in (d) above is called the adjoint state and the equation defining it is
the adjoint state equation. The system in (f) above, characterizing the
optimal solution, is called the optimality system. •
Bibliography

[1] Artin, M. Algebra, Prentice-Hall of India, 2000.

[2] Halmos, P. R. Measure Theory, van Nostrand.

[3] Kesavan, S. Topics in Functional Analysis and Applications,


New Age International (formerly Wiley-Eastern), 1989.

[4] Kesavan, S. Nonlinear Functional Analysis-A First Course,


Texts and Readings in Mathematics (TRIM) No. 28, Hindustan
Book Agency, 2004.

[5] Royden, H. 1. Real Analysis, Macmillan.

[6] Rudin, W. Principles of Mathematical Analysis, Interna-


tional Student Edition, McGraw-Hill.

[7] Rudin, W. Real and Complex Analysis, McGraw-Hill.

[8] Rudin, W. Functional Analysis, McGraw-Hill, 1973.

[9] Simmons, G. F. Introduction to Topology and Modern


Analysis, International Student Edition, McGraw-Hill, 1963.

[10] Simmons, G. F. Differential Equations - with Applications


and Historical Notes, McGraw Hill, 1972 (TMH Edition, 1974).
Index

B* -algebra, 206 continuous function, 14


(I-algebra, 18 convex
Borel, 18 function, 67
set, 67
absolutely continuous, 182 convolution, 177
affine hyperplane, 77
almost everywhere, 23, 164 dense set, 12
annihilator, 108 dimension, 3
direct sum, 4
Banach algebra, 41, 206 Dirichlet kernel, 102
Banach space, 27 dual space, 40
basis, 3 duality bracket, 117
Hamel, 223
Schauder, 222 eigenspace, 261
standard,3 generalized, 261
boundary value problem, 234 eigenvalue, 246
bounded set, 37 eigenvector, 246
equivalent norms, 47
Cesaro operator, 44 essentially bounded, 162
characteristic equation, 9 Euclidean space, 12
characteristic function, 20
closed sets, 11 finite intersection property, 16
closure (of a set), 12 Fourier
coercive functional, 156 coefficients, 101
compact set, 16 cosine series, 225
complement series, 101, 220
of a subspace, 113 sine series, 222
orthogonal, 202 Frechet
complete metric space, 15 derivative, 66
cone, 85 differentiable, 65
tangent, 88 Fredholm
conjugate exponent, 28 alternative, 9, 122, 238, 241
connected space, 17 index, 242

266
267

operator, 242 Lax-Milgram, 209


Riemann-Lebesgue, 191
Gateau Riesz, 51, 240, 249
derivative, 66 Schur, 136
differentiable, 66 limit point, 13
Gram-Schmidt orthogonalization, linear
211 dependence, 3
Hausdorff space, 12 independence, 3
Hermite polynomials, 226 operator, 4
Hilbert matrix, 44 space, 1
Hilbert space, 114, 145, 153, 196 span, 3
Hilbert-Schmidt operator, 237 transformation, 4
homeomorphism, 14 linear functional, 5
continuous, 37, 40
Inequality linear operator
Bessel, 216 adjoint, 117, 205
Cauchy-Schwarz, 30, 163, 198 bounded, 116
Clarkson, 168 closed, 116
Holder, 29, 30, 32, 162 compact, 231
Hardy, 190 continuous, 37, 41
Minkowski, 29, 32, 163 densely defined, 116
Poincare, 187 dissipative, 228
Poincare-Wirtinger, 193 domain, 116
Young, 177 finite rank, 231
initial value problem, 54, 57, 95, graph, 116
129 invertible, 47
inner product, 195 maximal dissipative, 228
integrable function, 21 normal, 206
interior (of a set), 12 range, 116
involution, 206 self-adjoint, 206
isomorphism, 47
unbounded, 116
Kuhn-Thcker conditions, 91 unitary, 206
linear subspace, 2
least squares approximation, 68, linear transformation
212 bounded, 38
Legendre polynomials, 225 continuous, 37
Lemma invertible, 5
Farkas-Minkowski, 87 kernel, 5
Fatou, 22 range, 5
268

rank, 5 finite, 19
locally convex space, 81 Lebesgue, 19
locally integrable, 173 product, 24
lower semi-continuous, 135 regular, 20
space, 19
matrix, 5 translation invariant, 19
adjoint, 6 metric space, 12
characteristic polynomial, 9 Minkowski functional, 78
condition number, 61 multiplicity
definite, 7 algebraic, 9, 261
determinant, 8 geometric, 9, 246
diagonalizable, 6, 11
eigenspace, 9 neighbourhood, 11
eigenvalue, 9 norm, 26,38
eigenvector, 9 norm topology, 27
hermitian, 7 normed linear space, 27
lower triangular, 6 completion, 65
non-singular, 8 now here dense set, 12
norm, 50 null space, 5
normal, 7
nullity, 7 open sets, 11
rank, 7 orthogonal vectors, 6
self-adjoint, 7 orthonormal
singular, 8 basis, 218
spectral radius, 10 set, 210
spectrum, 10
parallellogram identity, 197
trace, 10
Parseval's identity, 218
transpose, 6
path connected, 17
unitary, 7
product measure, 24
upper triangular, 6
product topology, 15
measurable
projection, 200
function, 18
orthogonal, 201
Lebesgue, 20
sets, 18 quotient space, 35
space, 18
measure, 19 Rayleigh quotient, 10
O"-finite, 19 reflexive space, 73, 145
complete, 19 resolvent, 244
counting, 19 right inverse, 115
Dirac, 19 Rodrigues' formula, 226
269

scalar multiplication, 1 Rellich, 185


scalars, 1 Riesz representation, 169, 174,
semi group 202,224
exponentially stable, 126 Sobolev, 184
infinitesimal generator, 128 Stampacchia, 208
of contractions, 130 topological space, 11
of operators, 126 topological vector space, 26
separable space, 147 totally bounded, 17
sequence triangle inequality, 12, 26
Cauchy, 15
convergent, 13 uniform boundedness principle,
sequentially compact set, 16 100
sesquilinear form, 196 uniform convergence, 35
shift operator, 64, 247 uniform convexity, 152
simple function, 20 uniformly continuous, 14
Sobolev spaces, 178
variational inequality, 210
spectral radius, 245
vector, 1
spectrum, 244
addition, 1
matrix, 10
space, 1
step function, 190
Volterra integral operator, 46, 55
strictly convex, 92
subbase, 13 weak topology, 132
weak* topology, 139
Theorem weakly continuous, 138
Baire, 97
Banach's fixed point, 52
Banach-Alaoglu, 142
Banach-Steinhaus, 99
Cantor-Lebesgue, 191
Cayley-Hamilton, 61
closed graph, 107
contraction mapping, 52
dominated convergence, 23
Fubini,24
Hahn-Banach, 69, 71, 72, 79,
80
Hille-Yosida, 228
monotone convergence, 22
open mapping, 106
Picard, 53
Texts and Readings in Mathematics

1. R. B. Bapat: Linear Algebra and Linear Models (Second Edition)


2. Rajendra Bhatia: Fourier Series (Second Edition)
3. C. Musili: Representations of Finite Groups
4. H. Helson: Linear Algebra (Second Edition)
5. D. Sarason: Complex Function Theory (Second Edition)
6. M. G. Nadkarni: Basic Ergodic Theory (Second Edition)
7. H. Helson: Harmonic Analysis (Second Edition)
8. K. Chandrasekharan: A Course on Integration Theory
9. K. Chandrasekharan: A Course on Topological Groups
10. R. Bhatia (ed.): Analysis, Geometry and Probability
11. K. R. Davidson: C* - Algebras by Example
12. M. Bhattacharjee et al.: Notes on Infinite Permutation Groups
13. V. S. Sunder: Functional Analysis - Spectral Theory
14. V. S. Varadarajan: Algebra in Ancient and Modern Times
15. M. G. Nadkarni: Spectral Theory of Dynamical Systems
16. A. Borel: Semisimple Groups and Riemannian Symmetric Spaces
17. M. Marcolli: Seiberg - Witten Gauge Theory
18. A. Bottcher and S. M. Grudsky: Toeplitz Matrices, Asymptotic
Linear Algebra and Functional Analysis
19. A. R. Rao and P. Bhimasankaram: Linear Algebra (Second Edition)
20. C. Musili: Algebraic Geometry for Beginners
21. A. R. Rajwade: Convex Polyhedra with Regularity Conditions
and Hilbert's Third Problem
22. S. Kumaresan: A Course in Differential Geometry and Lie Groups
23. Stef Tijs: Introduction to Game Theory
24. B. Sury: The Congruence Subgroup Problem
25. R. Bhatia (ed.): Connected at Inf;nity
26. K. Mukherjea: Differential Calculus in Normed Linear Spaces
(Second Edition)
27. Satya Deo: Algebraic Topology: A Primer (Corrected Reprint)
28. S. Kesavan: Nonlinear Functional Analysis: A First Course
29. S. Szab6: Topics in Factorization of Abelian Groups
30. S. Kumaresan and G. Santhanam: An Expedition to Geometry
31. D. Mumford: Lectures on Curves on an Algebraic Surface (Reprint)
32. J. W. Milnor and J. D. Stasheff: Characteristic Classes (Reprint)
33. K. R. Parthasarathy: Introduction to Probability and Measure
(Corrected Reprint)
34. A. Mukherjee: Topics in Differential Topology
35. K. R. Parthasarathy: Mathematical Foundations of Quantum
Mechanics
36. K. B. Athreya and S. N. Lahiri: Measure Theory
37. Terence Tao: Analysis I
38. Terence Tao: Analysis II
39. W. Decker and C. Lossen: Computing in Algebraic Geometry
40. A. Goswami and B. V. Rao: A Course in Applied Stochastic
Processes
41. K. B. Athreya and S. N. Lahiri: Probability Theory
42. A. R. Rajwade and A. K. Bhandari: Surprises and Counterexamples
in Real Function Theory
43. G. H. Golub and C. F. Van Loan: Matrix Computations (Reprint of the
Third Edition)
44. Rajendra Bhatia: Positive Definite Matrices
45. K. R. Parthasarathy: Coding Theorems of Classical and Quantum
Information Theory
46. C. S. Seshadri: Introduction to the Theory of Standard Monomials
47. Alain Connes and Matilde Marcolli: Noncommutative Geometry,
Quantum Fields and Motives
48. Vivek S. Borkar: Stochastic Approximation: A Dynamical Systems
Viewpoint
49. B. J. Venkatachala: Inequalities: An Approach Through Problems
50. Rajendra Bhatia: Notes on Functional Analysis
51. A. Clebsch (ed.) Jacobi's Lectures on Dynamics
(Second Revised Edition)

You might also like