Professional Documents
Culture Documents
Functional Analysis
University of Regina
Department of Mathematics & Statistics
Regina, Saskatchewan S4S 0A2
Canada
Preface
1
incomplete in several places
Contents
1 Banach Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.1 Banach Space Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.2 Finite-Dimensional Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Subspaces and Quotients of Banach Spaces . . . . . . . . . . . . . . . . . 9
1.4 Two Useful Lemmas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.5 Baire Category Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.6 Linear and Schauder Bases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.7 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3 Duality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3.1 Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
3.2 The Hahn–Banach Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
3.3 Complementation of Finite-Dimensional Spaces . . . . . . . . . . . . . 49
3.4 Weak Topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.5 The Second Dual . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
3.6 Weak∗ Topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
3.7 Subspaces of C(X) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
6 Contents
3.8 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
5 Convexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
5.1 Convex Combinations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
5.2 Extreme Points and Faces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
5.3 Geometry of the Closed Unit Ball . . . . . . . . . . . . . . . . . . . . . . . . . 66
5.4 Separation of Convex Sets by Linear Functionals . . . . . . . . . . . . 69
5.5 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
6 Hilbert Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
6.1 Inner Products and Euclidean Norms . . . . . . . . . . . . . . . . . . . . . . 75
6.2 Distance to Convex Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
6.3 Orthogonal Complements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
6.4 Orthonormal Bases for Hilbert Spaces . . . . . . . . . . . . . . . . . . . . . . 81
6.5 Examples of Separable Hilbert Spaces . . . . . . . . . . . . . . . . . . . . . . 85
6.6 Hilbert Space Duality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
6.7 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
Part II Operators
13 C∗ -algebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
13.1 C∗ -algebra Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
13.2 Adjoining a Unit to a Nonunital C∗ -algebra . . . . . . . . . . . . . . . . . 158
13.3 Gelfand Theory for Unital Abelian C∗ -algebras . . . . . . . . . . . . . . 160
13.4 C∗ -subalgebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
13.5 Continuous Functional Calculus . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
13.6 Positive Elements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
13.7 Ideals and Quotients . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
13.8 C∗ -algebra Homomorphisms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
13.9 States . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172
13.10Representations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
13.11Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
Contents 1
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
Part I
Banach Spaces
1
Banach Spaces
Proof. Exercise 1.
Definition 1.4. A subset K of a topological space X is compact S if for every
collection {Uα }α of open sets Uα ⊂ X that cover X (ie., KS⊂ α Uα ), there
n
are finitely many Uα1 , . . . , Uαn of these sets for which K ⊂ j=1 Uαj .
Definition 1.5. Suppose that V is a normed vector spaces and that {vk }k∈N
is a sequence of vectors vk ∈ V .
1. {vk }k∈N is a Cauchy sequence if for every ε > 0 there exists Nε ∈ N such
that
kvn − vm k < ε , ∀ m, n ≥ Nε .
2. {vk }k∈N is convergent with limit v ∈ V if for every ε > 0 there exists
Nε ∈ N such that
kv − vn k < ε , ∀ n ≥ Nε .
∞
X
3. The series vk converges to v ∈ V if
k=1
n
X
lim
v − vk
= 0 .
n→∞
k=1
∞
X
In this case we shall write v = vk .
k=1
Because the modulus |α| of α ∈ C satisfies |α|2 = (<α)2 + (=α)2 , where <α
and =α are the real and imaginary parts of α, the Euclidean space Cn may
be identified with the real Euclidean space R2n .
The Euclidean space Rm has several strong features, of which one of the
most important is given by the Heine–Borel Theorem.
for all α1 , . . . , αn ∈ C.
P
Proof. We will first relate the norm of any element v = j αj vj ∈ V to the
length of the vector a in the Euclidean space Cn determined by the coefficients
αj for v. Let
X = {a ∈ Cn | kak2 = 1} .
The set X is closed and bounded in Cn ; hence, X is a compact set. The
continuous function f : X → R, defined by
n
X
f(a) =
αj vj
,
j=1
1/2
X n
X n
2
∀ α ∈ Cn .
c |αj | ≤
αj vj
(1.2)
j=1
j=1
P 1/2
n 2
Now let C = j=1 kvj k . Then
1/2 1/2
n
n n n
X X X X
|αj |2 kvj k2
≤
αj vj
|αj | kvj k ≤ ,
j=1
j=1 j=1 j=1
where the inequality on the right is the Cauchy–Schwarz inequality (that is,
the Minkowski inequality [Theorem 2.13] with p = 2) in Rn . Thus,
1/2
n
n
X X
|αj |2
≤ C
αj vj
. (1.3)
j=1
j=1
Pn
Thus, the countable set of all j=1 βj vj , where each βj ∈ (Q + iQ), is dense
in V .
α 1 w1 + α 2 w2 ∈ L , ∀ α1 , α2 ∈ C, w1 , w2 ∈ L .
k[v]kq = inf{kv − yk | y ∈ L} ,
k[v]kq ≤ kvk , ∀v ∈ V .
j
X
=
[vki − vki−1 ] − [y]
i=2 q
j
X
=
[yi − y]
i=2 q
j
X
≤
yi − y
.
i=2
P∞
Because i=2 yi converges to y ∈ V , the sequence {[vkj ]}j∈N converges to
[y + vk1 ] in V /L. However, as {[vkj ]}j∈N is a convergent subsequence of the
Cauchy sequence {[vk ]}k∈N, the sequence {[vk ]}k∈N must also converge in V /L
to [y + vk1 ].
The use of quotient spaces leads to the following two useful lemmas which
will be invoked from time to time when convenient.
kvk − vj k ≥ δ > 0 , ∀ k 6= j .
This shows that no subsequence of {vn }n∈N can be a Cauchy sequence. Thus,
the sequence {vn }n∈N does not admit a convergent subsequence, in contradic-
tion to the fact that the closed unit ball of V is compact. Hence, it must be
that V has finite dimension.
12 1 Banach Spaces
Corollary 1.17. A Banach space V has finite dimension if and only if every
closed, bounded subset of V is compact.
is also dense in V .
Proof. Let G = ∩k∈N Uk . Choose v0 ∈ V and let ε > 0. We aim to prove that
Bε (v0 ) ∩ G 6= ∅.
By hypothesis, U1 is open and dense in V . Thus, there is a δ1 ∈ (0, 1)
and an element v1 ∈ Bε (v0 ) ∩ U1 such that Bδ1 (v1 ) ⊂ Bε (v0 ) ∩ U1 . Likewise,
U2 is open and dense in V , and so there is a δ2 ∈ (0, 21 ) and an element
v2 ∈ Bδ1 (v1 ) ∩ U2 such that Bδ2 (v2 ) ⊂ Bδ1 (v1 ) ∩ U2 .
It is clear that this process may be continued by induction to obtain se-
quences {vn }n∈N in V and {δn }n∈N in R such that
1
δn ∈ (0, ) and Bδn (vn ) ⊂ Bδn−1 (vn−1 ) ∩ Un .
2n−1
By construction,
Hence, v ∈ Bδn (vn ) ⊂ Un and v ∈ Bδn (vn ) ⊂ Bε (v0 ), which proves that
v ∈ G ∩ Bε (v0 ).
Definition 1.20. A subset F ⊂ X is nowhere dense in a topological space X
if the topological interior of the closure F of F is the empty set.
Corollary 1.21. In a Banach space V ,
1. the intersection of a countable family of dense Gδ -sets is a dense Gδ -set,
and
2. the union of a countable family of nowhere dense Fσ sets is a nowhere
dense Fσ set.
Proof. Exercise 5.
1.7 Exercises
1. Prove that, in a normed vector space V , every open ball Bρ (v0 ) is an open
set whose topological closure is Bρ (v0 ) and whose topological boundary
∂Bρ (v0 ) is Sρ (v0 ).
2. Assuming that C is a complete metric space with respect to the metric
induced by the modulus, give a detailed proof that the vector space Cn ,
in the Euclidean norm
1/2
Xn
kξk = |ξj |2 , ξ ∈ Cn ,
j=1
This chapter introduces and studies a number of the important Banach spaces
that arise in classical analysis.
2. ϑ is a concave function if
Likewise,
y
dϑ dϑ
Z
ϑ(y) − ϑ(ζ) = dt ≥ [ζ] (y − ζ) .
ζ dt dt
Hence,
dϑ
λϑ(ζ) ≤ λϑ(x) + [ζ] λ(1 − λ)(y − x)
dt
dϑ
(1 − λ)ϑ(ζ) ≤ (1 − λ)ϑ(y) − [ζ] λ(1 − λ)(y − x) .
dt
y − λx λ
z = and z − y = (y − x) .
1−λ 1−λ
Because ϑ is a convex function, ϑ(y) ≤ λϑ(x) + (1 − λ)ϑ(z), and so
That is,
1
ϑ(z) − ϑ(x) ≤ (ϑ(z) − ϑ(y)) .
λ
Hence,
ϑ(y) − ϑ(x) λϑ(x) + (1 − λ)ϑ(z) − ϑ(x)
≤
y − x y −x
(1 − λ) (ϑ(z) − ϑ(x))
=
y −x
(1−λ)
λ (ϑ(z) − ϑ(x))
≤
y −x
ϑ(z) − ϑ(y)
= λ
1−λ (y − x)
ϑ(z) − ϑ(y)
= .
z −y
This completes the proof.
If one views each side of inequality (2.1) as a difference quotient, then one
can interpret inequality (2.1) as saying that the derivative of ϑ (if it exists) is
an increasing function. See Exercise 2.
implies that Z Z Z
a = a dµ < f dµ < b dµ = b .
X X X
18 2 Banach Spaces in Classical Analysis
R
Let ζ = X
f dµ and let
ϑ(ζ) − ϑ(z)
β = sup .
z∈(a,ζ) ζ−z
Hence,
ϑ(z) ≤ ϑ(ζ) + β(z − ζ) , ∀ z ∈ (a, ζ) .
Because ϑ is a convex function, Proposition 2.4 implies that
ϑ(y) − ϑ(ζ)
β ≤ , ∀ y ∈ (ζ, b) .
y−ζ
Thus,
ϑ(y) ≥ β(y − ζ) + ϑ(ζ) , ∀ y ∈ (ζ, b) .
Conclusion:
ϑ(t) ≥ ϑ(ζ) + β(t − ζ) , ∀ t ∈ (a, b) .
In particular,
That is, Z Z
ϑ f dµ ≤ ϑ ◦ f dµ ,
X X
which completes the proof.
1 p1 1 pn
α1 · · · αn ≤ α + ··· + α .
p1 pn n
2.2 Classical Inequalities 19
Hence,
n n
1/n 1 X βk 1X
(α1 α2 · · · αn ) = eβ1 /n+···+βn /n ≤ e = αk .
n n
k=1 k=1
Proof. Exercise 5.
Proof. Because f p and gq are integrable, so are |f|p and |g|q . Let F, G : X → R
be the functions defined by
|f(x)| |g(x)|
F (x) = R 1/p and G(x) = R 1/q .
p |g|q dµ
X |f| dµ X
Thus, Z Z
p
F dµ = Gq dµ = 1 .
X X
That is, Z
|fg| dµ
X
R 1/p R 1/q ≤ 1 ,
X |f|p dµ X |g|q dµ
which completes the proof.
Proof. The theorem is true for p = 1 because the sum of integrable functions is
integrable and because inequality (2.4) is simply a consequence of the triangle
inequality in real and complex numbers.
2.2 Classical Inequalities 21
Hence, (|f| + |g|)p−1 is q-integrable. Therefore, one can apply Hölder’s In-
equality to obtain
Z Z 1/p Z 1/q
|f|(|f| + |g|)p−1 dµ ≤ |f|p dµ (|f| + |g|)(p−1)q dµ
X X X
and
Z Z 1/p Z 1/q
p−1 p (p−1)q
|g|(|f| + |g|) dµ ≤ |g| dµ (|f| + |g|) dµ .
X X X
Because |f|p + |g|p = |f|(|f| + |g|)p−1 + |g|(|f| + |g|)p−1 , summing the two
inequalities above yields a new inequality whose left hand side is
Z
(|f| + |g|)p dµ
X
1/q
(|f| + |g|)(p−1)q dµ
R
Divide the new inequality through by X
and use that
p = (p − 1)q to obtain
Z 1−1/q Z 1/p Z 1/p
p p p
(|f| + |g|) dµ ≤ |f| dµ + |g| dµ .
X X X
Because p1 = 1 − 1
q and |f + g|p ≤ (|f| + |g|)p , the inequality above implies
inequality (2.4).
22 2 Banach Spaces in Classical Analysis
Theorem 2.13. Suppose that {αk }k∈N and {βk }k∈N are sequences in C. As-
sume that p ≥ 1.
1. (Hölder) If p, q ∈ (1, ∞) are conjugate, and if {αk }k∈N is p-summable and
{βk }k∈N is q-summable, then {αk βk }k∈N is summable and
!1/p !1/q
X X X
p q
|αk βk | ≤ |αk | |βk | .
k∈N k∈N k∈N
2. (Minkowski) If {αk }k∈N and {βk }k∈N are p-summable, then {αk + βk }k∈N
is p-summable and
!1/p !1/p !1/p
X X X
p p p
|αk + βk | ≤ |αk | + |βk | .
k∈N k∈N k∈N
Proof. Apply Theorems 2.10 and 2.11 to the case where the measure space
(X, Σ, µ) is given by X = N, Σ = P(N), and µ is counting measure.
Note that the Hölder and Minkowski inequalities are nontrivial even in
the cases where the sequences {αk }k∈N and {βk }k∈N have only finitely-many
nonzero elements.
2. ρ(αv) = |α|ρ(v),
3. ρ(v + w) ≤ ρ(v) + ρ(w).
d(v, w) = ρ(v − w) , v, w ∈ V ,
Proof. Exercise 7.
Seminorms ρ and norms k · k differ in the following way: the equation
ρ(v) = 0 can hold for nonzero v whereas kvk = 0 holds only for v = 0. One
obtains a norm from a seminorm as follows.
v ∼ w if ρ(v − w) = 0 ,
[v] = {w ∈ V | w ∼ v} ,
then:
1. the set V / ∼ of equivalence classes is a vector space under the operations
[v] + [w] = [v + w] , v, w ∈ V ,
α [v] = [αv] , α ∈ C, v ∈ V ;
2. the function
k[v]k = ρ(v) , v∈V ,
is a norm on V / ∼.
Proof. Exercise 8.
The next sections develop examples of Banach spaces.
24 2 Banach Spaces in Classical Analysis
Proof. It is clear that αf ∈ Lp(X, Σ, µ), for every α ∈ C and f ∈ Lp(X, Σ, µ).
If f, g ∈ Lp(X, Σ, µ), then f + g ∈ Lp (X, Σ, µ), by Minkowski’s inequality.
Hence, Lp (X, Σ, µ) is a vector space.
To verify that ρ is a seminorm, the only nontrivial item to verify is that the
triangle inequality holds. But this is true because of Minkowski’s inequality:
Z 1/p
p
ρ(f + g) = |f + g| dµ
X
Z 1/p Z 1/p
≤ |f|p dµ + |g|p dµ
X X
= ρ(f) + ρ(g) .
Hence, ρ is a seminorm.
The seminorm ρ of Proposition 2.18 need not be a norm. For example, if
f is the characteristic function on the set Q of rational numbers, and if m
denotes Lebesgue measure on R, then f 6= 0 yet
Z 1/p
ρ(f) = |f|p dm = m(Q)1/p = 0 .
R
On the other hand, Proposition 2.17 demonstrates that a bona fide normed
vector space can be obtained by passing to equivalence classes.
That is, a + b = {αk + βk }k∈N ∈ `p (N). Hence, with pointwise sum and scalar
multiplication, `p (N) is a vector space over C. The seminorm ρ of Proposition
2.18 is in fact a norm on this space.
Proposition 2.20. On the vector space `p (N), the function
!1/p
X
p
kak = |αk | (2.6)
k∈N
is a norm.
Proof. We know already that k · k is a seminorm. If a = {αk }k∈N ∈ `p (N), then
!1/p
X
0 = kak = |αk |p
k∈N
if and only if |αk | = 0 for all k. Hence, kak = 0 if and only if a = 0 ∈ `p (N).
Notational Convention. The notation `p (N) refers to the normed vector
space of p-summable sequences, where p ≥ 1, under the norm (2.6). If n ∈ N,
then `p (n) denotes the normed vector space of sequences
a = {αk }nk=1
Thus, Z
gip dµ ≤ 1 , ∀i ∈ N.
X
Considering gi as a function, the converse to the Monotone Convergence The-
orem implies that limi gi (x)p exists for almost all x ∈ X; that is, limi gi (x)
exists for almost all x ∈ X and the limit function—call it g—is p-integrable.
Let L ⊆ X denote the set of points x in which limi gi (x) exists; thus,
i
X
lim gi (x) = lim |fkj+1 (x) − fkj (x)| . (2.7)
i→∞ i→∞
j=1
that is, Z
p
lim |f − fki | dµ = 0 .
i→∞ X
Hence, kf − fki k → 0 as i → ∞. To show that the entire Cauchy sequence
{fk }k∈N converges in Lp (X, Σ, µ) to f, let ε > 0. Since {fk }k∈N is a Cauchy
sequence, there exists Nε ∈ N such that lim inf kfki − fNε k < ε. Using this
i∈N
and the fact that f = limi fki almost everywhere, Fatou’s Lemma yields the
desired conclusion: namely, for any m ≥ Nε ,
Z
kf − fm kp = |f − fm |p dµ
X
Z
≤ lim inf |fki − fm |p dµ (Fatou’s Lemma)
i∈N X
< εp .
This completes the proof that Lp (X, Σ, µ) is a Banach space.
Corollary 2.22. `p (N) is a Banach space, for every p ≥ 1.
where f(E) denotes the closure in C of f(E) = {f(t) | t ∈ E}, and where
E ∈ Σ. The essential supremum of f is the quantity
ess-sup f = sup {|λ| | λ ∈ ess-ran f} .
28 2 Banach Spaces in Classical Analysis
a = {αk }k∈N
a = {αk }nk=1
Let Cb (X) denote the set of all functions f : X → C that are continuous
and bounded. The continuity of f ∈ Cb (X) means that for each x ∈ X and
ε > 0 there is an open subset U ⊂ X containing x such that |f(x) − f(y)| < ε
for all y ∈ U . The boundedness of f ∈ Cb (X) means that there is a R > 0
such that |f(x)| < R for all x ∈ X.
Proof. It is elementary that Cb (X) is a vector space and that (2.10) defines a
norm on Cb (X). Thus, it remains only to show that every Cauchy sequence
in Cb (X) is convergent in Cb (X).
Let {fk }k∈N ⊂ Cb (X) denote a Cauchy sequence. For each x ∈ X,
Since {fk }k∈N is a Cauchy sequence in Cb (X), {fk (x)}k∈N is a Cauchy se-
quence in C for each x ∈ X. Because C is complete, limk fk (x) exists for every
x ∈ X. Therefore, define f : X → C by f(x) = limk fk (x), for each x ∈ X.
We aim to show (i) that f is continuous and bounded, and (ii) that {fk }k∈N
converges to f in Cb (X).
Let ε > 0. Because {fk }k∈N is a Cauchy sequence, there exists Nε ∈ N
such that kfn − fm k < ε for all n, m ≥ Nε . Assume that n ≥ Nε . Choose any
x ∈ X; thus,
≤ 0 + ε.
This right hand side of the inequality above is independent of the choice of
x ∈ X. Hence, if n ≥ Nε is fixed, then f − fn is a bounded function X → C
and
sup |f(x) − fn (x)| ≤ ε .
x∈X
{x ∈ X | |f(x)| ≥ ε}
f (x) = f(x) , ∀x ∈ X .
1 − gδ (x) ∈ [0, 1 − δ] , ∀x ∈ X .
Theorem 2.39. C([a, b]) is a separable Banach space, for all a < b in R.
Proof. Let C be the set {1, t, t2, t3 , . . .}, considered as functions on [a, b]. Note
that C is a countable set. Let Q + iQ denote the set of all ζ = r + is ∈ C for
which r, s ∈ Q, and note that Q + iQ is countable and dense in C. Now let P
be the set of all polynomials with coefficients coming from the subring Q + iQ
of C. Note that P is in bijective correspondence with the set
[
(Q + iQ)n ,
n∈N
n
!1/2
X
k
ψ(t) = |t| .
k=0
36 2 Banach Spaces in Classical Analysis
n n
!1/2 n
!1/2
X X X
k 2 k
|f(t) − p(t)| ≤ |αk − βk | |t| ≤ |αk − βk | |t| < ε.
k=0 k=0 k=0
Definition 2.40. For any [a, b] ⊂ R, Lp ([a, b]) denotes Lp (X, Σ, µ), where
p ≥ 1, X = [a, b], Σ = {E ∩ [a, b] | E ∈ M(R)}, and µ = m (Lebesgue
measure).
Theorem 2.41. Lp ([a, b]) is separable, for every p ≥ 1 and all a < b in R.
Proof. As in the proof of Theorem 2.39, let P be the countable set of all poly-
nomials with coefficients coming from the subring Q + iQ of C. We shall prove
that P is dense in Lp ([a, b]). The first step is to show that any characteristic
function can be approximated in Lp ([a, b]) by polynomials g ∈ P.
Suppose that E ⊆ [a, b] is a Lebesgue-measurable set and let ϕ be the
characteristic function on E. Assume that ε > 0. If E is one of [a, b], (a, b],
[a, b), or (a, b), then ϕ = 1 ∈ P. Thus, we assume that E is a proper subset of
(a, b). The regularity of Lebesgue measure implies that, there are K, U ⊆ R
such that K is closed, U is open, K ⊆ E ⊆ U , m(E\K) < 2ε , and m(U \E) <
ε
2
. Because E is a proper subset of (a, b), U may be assumed to be a subset
of (a, b) as well. Thus,
Let F = [a, b]\U = [a, b] ∩ U c , which is a closed set in [a, b] disjoint from the
closed set K. By Urysohn’s Lemma, there is a continuous function h : [a, b] →
[0, 1] such that h(K) = {1} and h(F ) = {0}. Because E ⊆ K ⊆ U ,
Thus,
2.8 Separable Banach Spaces 37
Z
kϕ − hkp = |ϕ − h|p dm
[a,b]
Z Z Z
= |ϕ − h|p dm + |ϕ − h|p dm + |ϕ − h|p dm
K U \K F
Z
= |ϕ − h|p dm
U \K
Z
≤ (1 + 1)p dm
U \K
< 2p ε .
p
2 ε
Because h can be uniformly approximated to within b−a by some g ∈ P, we
p p+1
have that kϕ−gk < 2 ε. That is, characteristic functions are approximated
in Lp ([a, b]) by elements g of the countable set P.
Suppose now that ϕ is a simple function of the form
n
X
ϕ = αk χEk .
k=1
√
Let ε > 0. There exist βk ∈ Q + iQ such that |αk − βk | < ε/n for every 1√≤
k ≤ n. By the paragraph above, there are gk ∈ P such that kχEk − gk k < ε
for all 1 ≤ k ≤ n. Let g = β1 g1 + . . . + βn gn ∈ P. Then
n
X
kϕ − gk ≤ |αk − βk | kχEk − gk k < ε .
k=1
This proves that every simple function is approximated in Lp ([a, b]) by ele-
ments g of P.
Next, suppose that f ∈ Lp ([a, b]) is such that f(x) ≥ 0 for all x ∈ [a, b].
Since every nonnegative measurable function can be approximated pointwise
by a monotone-increasing sequence of simple functions, there is a monotone-
increasing sequence {ϕk }k∈N of simple functions ϕk : [a, b] → [0, ∞) such
that
lim ϕk (x) = f(x) , ∀ x ∈ [a, b] .
k→∞
Because, 0 ≤ f(x) − ϕk (x) ≤ f(x), for all x ∈ [a, b], it is also true that |f(x) −
ϕk (x)|p ≤ f(x)p for all x ∈ [a, b]. Therefore, by the Dominated Convergence
Theorem, each (f − ϕk ) is an element of Lp ([a, b]) and
Z Z
p
lim |f − ϕk | dm = lim |f − ϕk |p dm = 0 .
k→∞ [a,b] [a,b] k→∞
38 2 Banach Spaces in Classical Analysis
Corollary 2.42. C([a, b]) is a dense subset of Lp ([a, b]). That is, for every
p-integrable function f : [a, b] → C and ε > 0 there is a continuous function
ϑ : [a, b] → C such that kf − ϑk < ε.
S
If one notes that R = n∈N [−n, n], then the following theorem can be
established from the previous one.
2.9 Lp(T)
This fact implies that the elements of {un }n∈Z are linearly independent (Ex-
ercise 20). Hence, {un }n∈Z is a basis for the vector space T of trigonometric
polynomials. In turn, T is dense in Lp (T), as shown by the following theorem.
Proof. Theorem 2.38 shows that Span {un }n∈Z is uniformly dense in the linear
submanifold of Lp (T) consisting of all 2π-periodic continuous functions; hence,
this is also true with respect to the norm of Lp (T). Corollary 2.42 shows that
2.10 Exercises 39
C([−π, π]) is dense in Lp ([−π, π]). Therefore, it is sufficient to show that every
f ∈ C([−π, π]) can be approximated to within ε in (the norm of) Lp (T) by a
2π-periodic continuous function h.
To this end, choose f ∈ C([−π, π]) and let ε > 0. Let M = max {|f(t)| |, t ∈
εp
[−π, π]} and choose δ > 0 such that δ < 2p+1 M p . Let h ∈ C([−π, π]) be the
function that agrees with f on [−π + δ, π − δ], is a straight line from the point
(−π, 0) to the point (−π + δ, f(−π + δ)), and is a straight line from the point
(π −δ, f(π −δ)) to the point (π, 0). Thus, |f(t)−h(t)| = 0 for t ∈ [−π +δ, π −δ]
and |f(t) − h(t)| ≤ 2M for all t 6∈ [−π + δ, π − δ]. Hence,
Z Z
p p
kf − hk = |f − h| dm + |f − h|p dm ≤ 2p+1 M p δ .
[−π,−π+δ] [π−δ,π]
π
1
Z
p̂(k) = √ p(t)e−i kt dt = αk , for all n ≤ k ≤ m .
2π −π
2.10 Exercises
1/n 1
(α1 · · · αn ) = (α1 + · · · + αn )
n
if and only if α1 = . . . = αn .
5. Let α1 , . . . , αn be positive real numbers and let p1 , . . . pn be positive real
numbers that satisfy p11 + . . . + p1n = 1.
a) Prove Young’s inequality:
n
X αpk k
α1 . . . αn ≤ .
pk
k=1
d(v, w) = ρ(v − w) , v, w ∈ V ,
is a pseudo-metric on V .
b) With respect to the topology on V induced by the seminorm ρ, prove
that the vector space operations (scalar multiplication and vector ad-
dition) are continuous. That is, prove that V is a topological vector
space.
8. If ρ is a seminorm on a topological vector space, and if ∼ is the relation
on V defined by
v ∼ w if ρ(v − w) = 0 ,
2.10 Exercises 41
[v] = {w ∈ V | w ∼ v} ,
14. Let T denote the group R/2πZ, equipped with the quotient topology, and
for√each n ∈ Z let γn : T → C be defined by γn ([t]) = eint , where i refers
to −1. Prove that functions γn are well defined and continuous.
15. Consider the Banach space `p (N), where 1 ≤ p < ∞.
a) Prove that the closed unit ball of `p (N) is not compact.
42 2 Banach Spaces in Classical Analysis
3.1 Operators
Definition 3.1. If V and W are normed vector spaces, then an operator from
V to W is a function T : V → W such that
1. T is linear—that is, T (α1 v1 +α2 v2 ) = α1 T (v1 )+α2 T (v2 ), for all v1 , v2 ∈ V
and α1 , α2 ∈ C—and
2. T is continuous.
The continuity of a linear transformation can be rephrased in terms of the
boundedness of the transformation.
Definition 3.2. If V and W are normed vector spaces, then a linear trans-
formation T : V → W is bounded if there is a constant M > 0 such that
kT (v)k ≤ M kvk , ∀v ∈ V .
kT (v) − T (v0 )k = kT (v − v0 )k ≤ M kv − v0 k .
Thus, if ε > 0 and if δ = ε/M , then kv − v0 k < δ implies kT (v) − T (v0 )k < ε.
Hence, T is continuous at v0 .
Assume that T is continuous. In particular, T is continuous at 0. Thus, for
ε = 1 there is a δ > 0 such that kwk < δ implies kT (w)k < 1. Let M = 2/δ.
δ
If v ∈ V is nonzero, then let w = 2kvk v. Thus, kwk < δ and so kT (w)k < 1.
That is,
44 3 Duality
Proposition 3.5. If V and W are normed vector spaces, then the set B(V, W )
of all operators T : V → W is a vector space under the operations
kT (v)k
kT k = sup (3.1)
06=v∈V kvk
is a norm on B(V, W ).
Proof. It is clear that B(V, W ) carries the structure of a vector space, and so
it is sufficient to prove that (3.1) defines a norm on B(V, W ).
To this end, note that (using the fact that every linear transformation
sends zero to zero) equation (3.1) is equivalent to
and so
kT1 + T2 (v)k
kT1 + T2 k = sup ≤ kT1 k + kT2 k .
06=v∈V kvk
This completes the proof.
Notational Convention. If V and W are normed vector spaces, then
B(V, W ) denotes the normed vector space of Proposition 3.5—that is, it is
assumed implicitly that B(V, W ) is normed by definition (3.1) (as there could
be other norms on the set of all operators V → W ). If W = V , then B(V ) is
to denote B(V, V ).
v 7→ T (v)
≤ 0 + ε kvk .
Definition 3.7. If V and W are normed vector spaces, then a linear trans-
formation T : V → W is called an isometry if kT vk = kvk, for every v ∈ V .
kT v − wk ≤ kT (v − vn )k + kT vn − wk = kv − vn k + kwn − wk , ∀n ∈ N.
Hence, w = T v. That is, the range of T contains all of its limit points, implying
that the range of T is closed.
Definition 3.9. Two normed vector spaces V and W are said to be isomet-
rically isomorphic if there is an isometry T : V → W such that T is a
surjection.
|ϕ(v)| ≤ M kvk , ∀v ∈ V .
dimension, then it is not obvious a priori that V ∗ has any nonzero elements
whatsoever. To show that V ∗ has (many) nonzero elements, we require the
Hahn–Banach Theorem, which is possibly the most important fundamental
theorem in functional analysis.
To prove the Hahn–Banach Theorem, it shall be convenient to consider
normed vector spaces V as real vector spaces. Note that if ϕ ∈ V ∗ , then the
norm of ϕ is given by
Proof. For every ζ ∈ C, ζ = <(ζ) − i<(iζ); thus, the formula ϕ(v) = ψ(v) −
iψ(iv) implies that ψ = <(ϕ). It is straightforward to show, for all v, v1 , v2 ∈ V
and r, s ∈ R, that ϕ(v1 + v2 ) = ϕ(v1 ) + ϕ(v2 ), ϕ(rv) = rϕ(v), and ϕ(isv) =
isϕ(v). Thus, ϕ is C-linear. The verification of kϕk = kψk is the same as in
Lemma 3.11.
We are now ready to prove the Hahn–Banach theorem.
With respect to this partial order, let F be any linearly ordered subset of S.
Hence, there is a linearly ordered set Λ such that
F = {(Mλ , ϑλ) | λ ∈ Λ} .
Define M ⊆ V by [
M = Mλ ,
λ∈Λ
Hence,
sup (ϑ(x) − kv0 − xk) ≤ inf (ϑ(y) + kv0 − yk) .
x∈M y∈M
Let δ0 ∈ R satisfy
therefore,
|δ0 − ϑ(x)| ≤ kv0 − xk , ∀x ∈ M .
Next, consider the subspace M0 = SpanC {v0 }. Let M1 = M0 + M , which is a
linear submanifold of V which properly contains M . Because M0 ∩ M = {0},
3.3 Complementation of Finite-Dimensional Spaces 49
1
≤ |α| kv0 + xk
α
= kαv0 + xk .
Thus, kϑ1 k ≤ 1 as an R-linear map M1 → R. Because ϑ1 is an extension of
ϑ from M to M1 , 1 = kϑk ≤ kϑ1 k ≤ 1 implies that kϑ1 k = 1, which shows
that (M1 , ϑ1) ∈ S. However, because M1 properly contains M , the relation
(M, ϑ) ≤ (M1 , ϑ1) contradicts the maximality of (M, ϑ) in S. Therefore, it
must be that M = V . Hence, a maximal element of S has the form (V, ϑ). To
complete the proof, let Ψ = ϑ.
Proof. Exercise 6.
2. M + N = V .
Thus, V = M + N .
and let TF (V ) denote the collection of all sets formed by arbitrary unions of
U ∈ B. The set TF (V ) satisfies the axioms for a topology on V . One can think
of it as being the coarsest (or smallest, weakest) topology on V that makes
each function f ∈ F continuous.
ϕn (v)
Note that ker Φ = L. The First Isomorphism Theorem in Linear Algebra
asserts that the quotient space V /L is isomorphic to the range of Φ, which is
a subspace of Cn . Hence, L cannot have finite-dimension (because dim V ≤
n + dim L).
52 3 Duality
Definition 3.22. The second dual of a normed vector space V is the dual
space (V ∗ )∗ of V ∗ , and is denoted by V ∗∗ .
(The last equality follows from Exercise 4.) Thus, T is bounded, norm pre-
serving, and injective.
It remains to show that if V is a Banach space, then the range of T is a
subspace of V ∗∗ . This is simply the statement of Proposition 3.8.
Proposition 3.23 shows that V can be viewed as sitting inside the dual of V ∗ .
Namely, every vector v ∈ V is a linear functional on V ∗ via
ϕ 7→ ϕ(v) , ϕ ∈V∗.
and
|ϕ(λ1 v1 + λ2 v2 ) − ζ(λ1 v1 + λ2 v2 )| < ε .
Therefore,
Theorem 3.27. For every Banach space V there is a compact space X such
that V is isometrically isomorphic to a subspace of C(X).
Proof. Let X be the unit ball of V ∗ , which is compact when endowed with the
weak∗ topology (Theorem 3.26). By Proposition 3.23, there is an isometric
embedding T : V → V ∗∗ whereby T v(ϕ) = ϕ(v), for all ϕ ∈ X and v ∈ V .
Thus we need only show that T v ∈ C(X) for every v ∈ V .
Let v ∈ V . To show that T v is (weak∗ ) continuous at ϕ0 ∈ X, let ε > 0
and consider the open ε-ball Ω centered at T v(ϕ0 ), namely
Ω = {λ ∈ C | |λ − ϕ0 (v)|} < ε .
Proof. Let Tw∗ denote the weak∗ topology of K. We will show below that
there is a metric d on K such that Td = Tw∗ , where Td is the topology on K
induced by the metric d.
Because V is a separable, there is a countable dense subset {vn }n∈N ⊂ V .
Define a function d : K × K → R+ by
∞
X 1
d(ϕ1 , ϕ2 ) = n
|ϕ1 (vn ) − ϕ2 (vn )| , ∀ ϕ1 , ϕ2 ∈ K .
n=1
2
|ϕ1 (v) − ϕ2 (v)| = |ϕ1 (v) − ϕ1 (vn ) + ϕ2 (vn ) − ϕ2 (v)| < ε (kϕ1 k + kϕ2 k) .
{ϕ ∈ K | d(ϕ, ϕ0 ) < ε} ,
which is an open set in (K, Td), is an open set in (K, Tw∗ ). Thus, Td ⊆ Tw∗ .
Conversely, if one shows that every weak∗ closed set in K is closed in the
metric topology of K, then it follows that every weak∗ open set in K is open
in the metric topology—that is, Tw∗ ⊆ Td . To this end, if F ⊆ K is weak∗
closed, then F is necessarily weak∗ compact (because the space (K, Tw∗ ) is
compact). Since Td ⊆ Tw∗ , every covering of F by open sets in (K, Td ) admits,
therefore, a finite subcovering. Hence, F is compact in (K, Td). Because the
topology of every metric space is Hausdorff, F is a compact, Hausdorff subset
of the metric space (K, Td ). In any topological space, “compact + Hausdorff”
implies “closed” [9, Theorem 3.5.3]; hence F is a closed set in the metric space
(K, Td).
3.8 Exercises
6. Prove that if V and W are nonzero Banach spaces, then there exists a
nonzero operator T : V → W .
7. Suppose that M, N ⊂ V are nonzero subspaces of a Banach space V . If
N has finite dimension, then prove that
M + N = {u + w | u ∈ M, w ∈ N }
is subspace of V .
8. Prove that the following Banach spaces are reflexive:
a) Every Hilbert space
b) Every finite-dimensional Banach space
9. Let V be a normed vector space and suppose that ϕ1 , . . . , ϕn ∈ V ∗ . Let
n
\
L = ker ϕj .
j=1
ϕn (v)
there is a linear functional ψ : Cn → C such that ϕ(v) = ψ (Φ(v)), for
every v ∈ V .
b) Use the Riesz Representation Theorem (Theorem 6.28) for Cn to show
X n
that there are λ1 , . . . , λn ∈ C such that ϕ = λj ϕj .
j=1
10. Prove that the closed unit ball of an infinite-dimensional Hilbert space is
weakly compact.
11. Let X be the unit ball of a Banach space V such that X is endowed with
the weak∗ topology. Is X a necessarily a Hausdorff space ?
12. Prove that the vector space operations on V ∗ are continuous in the weak∗
topology, for every normed vector space V .
4
Dual Spaces in Classical Analysis
This [incomplete] chapter considers the dual spaces of some of the Banach
spaces in classical analysis.
= |ϕ(f)|
≤ kϕk kfk
Z 1/p
= kϕk |f|p dµ
X
Z 1/p
q
= kϕk |g| dµ
X
= kϕk kgkq/p .
q
Hence, the equation q − p = 1 shows that kgk ≤ kφk.
There is a converse to Proposition 4.1, which is a (remarkable) theorem of
F. Riesz and is stated below but not proved.
p
Corollary 4.3. L (X, Σ, µ) is a reflexive Banach space for every 1 ≤ p < ∞.
Proof. Exercise 1.
Proof. Let λ = µ + ν. For notational simplicity, L2 (λ) and L2 (µ) shall denote,
respectively, the Hilbert spaces L2 (X, Σ, λ) and L2 (X, Σ, µ). Because µ(X)
and ν(X) are finite, so is λ(X) and the constant (measurable) function 1 is a
element of both L2 (λ) and L2 (µ).
Without loss of generality, we may assume that µ(X) = 1. If ψ : X → R
is a nonnegative simple function with canonical form
n
X
ψ = αj χEj ,
j=1
then
Z n
X Z Z
ψ dλ = αj (µ(Ej ) + ν(Ej )) = ψ dµ + ψ dν .
X j=1 X X
Therefore, on the Hilbert space L2 (λ) one can define a (linear) function
ϕ : L2 (λ) → C by
Z
ϕ(h) = h dµ , ∀ h ∈ L2 (λ) .
X
2
Observe that if f ∈ L (µ) and f ≥ 0 almost everywhere, then ϕ(f) ≥ 0. By
the Cauchy-Schwarz inequality and the fact that 1 ∈ L2 (µ), for any h ∈ L2 (λ)
we have Z
|ϕ(h)| = h dµ
X
= hh, 1iL2(µ)
Z 1/2 Z 1/2
≤ |h|2 dµ 12 dµ
X X
Z 1/2
|h|2 dµ
p
= µ(X)
X
Z 1/2
2
≤ |h| dλ
X
= khk .
Thus, ϕ is nonnegative, linear, bounded, and of norm kϕk ≤ 1. Hence, Theo-
rem 6.28 states that there is a unique q ∈ L2 (λ) such that ϕ(h) = hh, qiL2 (λ)
for all h ∈ L2 (λ). That is,
Z Z
h dµ = hq dλ , ∀ h ∈ L2 (λ) .
X X
In particular, using h = χE ,
Z
µ(E) = q dλ , ∀E ∈ Σ . (4.3)
E
It is also the case that λ(E) = µ(E)+ν(E) (by definition). By simple algebra,
we have another expression for λ(E):
Z Z Z Z
λ(E) = dλ = q + (1 − q) dλ = q dλ + (1 − q) dλ .
E E E E
Hence, Z
ν(E) = (1 − q) dλ , ∀E ∈ Σ . (4.4)
E
By Exercise ??, equations (4.3) and (4.4) show that it may be assumed that
0 ≤ q(x) ≤ 1 for all x ∈ X.
4.2 The Radon–Nikodým Theorem 61
If E ∈ Σ and ψ = χE , then
Z Z Z
ψ dµ = µ(E) = q dλ = ψ q dλ . (4.5)
X E X
1
Z Z Z Z
dµ = ψ dµ = ψ q dλ = dλ . (4.7)
Fc q X X Fc
R
Moreover, µ(F ) = X q dλ = 0 implies, because ν is absolutely continuous
with respect to µ, that ν(F ) = 0 and λ(F ) = 0. Therefore, for any E ∈ Σ,
A similar equation holds for µ: namely, µ(E) = µ(E ∩F c). Therefore, equation
(4.7) yields, for every E ∈ Σ,
1 1
Z Z Z Z
dµ = dµ = dλ = dλ . (4.8)
E q E∩F c q E∩F c E
1
Z Z
= dµ − dµ
E q E
1
Z
= − 1 dµ
E∩F c q
Z
= g dµ ,
E
which completes the proof of the theorem.
62 4 Dual Spaces in Classical Analysis
4.3 Exercises
ˆ ,
ϕn (f) = f(n)
To this point in our study, Banach spaces have been considered solely on the
basis of their analytical and topological features. In this chapter, a geometrical
aspect of Banach spaces—namely, convexity—is examined.
If u and v are vectors in a normed vector space V , then the set of all vectors
of the form τ u + (1 − τ )v, where τ ∈ [0, 1] ⊂ R, comprise a line segment in
V connecting v to u. A subset of V that contains the line segments between
any of its points is called a convex set. That is, a subset C of a normed vector
space V is convex if τ u + (1 − τ )v ∈ C, for every τ ∈ [0, 1] and all u, v ∈ C.
n
X
where each τj ∈ [0, 1] and τj = 1.
j=1
2. If, in (5.1), each τj 6= 0, then the convex combination is said to be proper.
3. If S ⊂ V , then the set of all convex combinations of elements of S is called
the convex hull of S and is denoted by Conv S.
Proposition 5.2. A subset C ⊂ V is convex if and only if C contains all
convex combinations of its elements.
Proof. Exercise 2.
Corollary 5.3. The convex hull Conv S of any S ⊆ V is a convex subset of
V.
64 5 Convexity
Proof. Exercise 3.
Proposition 5.7 implies that the extreme points of a convex set C must lie
on the topological boundary of C. However, some convex sets do not include
their boundaries—the open unit disc D in C is one such example—and so
they do not possess extreme points. On the other, in finite-dimensional spaces,
compact convex sets are closed an bounded, and so in these cases there is a
well defined boundary. But do these sets necessarily have extreme points ?
This answer lies in the Kreı̌n–Milman Theorem below.
Therefore, the family L of compact sets has the finite intersection property,
and so F0 6= ∅, where \
F0 = G.
G∈L
γ = <ϕ (τ v1 + (1 − τ )v2 )
= τ <ϕ(v1 ) + (1 − τ )<ϕ(v2 )
≤ τ γ + (1 − τ )γ
= γ,
Proposition 5.10. The closed unit ball of the dual space V ∗ of any normed
vector space has an extreme point.
Proof. Let X ⊂ V ∗ be the closed unit ball of V ∗ . Endow V ∗ with the weak∗
topology. As the closed unit ball X is convex (by Exercise 1) and weak∗
compact (by the Banach–Alaoglu Theorem), X has an extreme point, by
Theorem 5.9.
5.3 Geometry of the Closed Unit Ball 67
Proposition 5.10 provides a useful device for showing that certain Banach
W spaces are not (isometrically isomorphic to) dual spaces; one need only
show the closed unit ball of such W has no extreme points. See Exercise 8.
For a second example of the influence of the norm on the geometry of the
closed unit ball, consider V = R2 as an `1 -space and an `2 -space, and let B1
and B2 denote the closed unit balls in these spaces. The ball B2 is simply a
circular disc; however, the ball B1 is a diamond-shaped disc:
@
@
@
@
@
@
@
@
@
@
which implies that cos(α − β) = −1; that, is α = β + (2k + 1)π for some k ∈ Z.
Thus, λ = µ = ζ.
eτa+(1−τ)b = τ ea + (1 − τ )eb ,
then a = b.
Proof. The function ϑ(t) = et on R is convex and has strictly positive second
derivative. Thus, eτa+(1−τ)b = τ ea + (1 − τ )eb if and only if ea = eb .
Proof of Theorem 5.11. Only the nontrivial implication is shown here. That
is, we show that if v ∈ `p (N) has norm kvk = 1, then v is an extreme point of
B.
Suppose that u, w ∈ B and τ ∈ (0, 1) satisfy v = τ u + (1 − τ )w. Let q > 1
be such that 1/p + 1/q = 1. By Corollary 3.14, there exists a unit vector
ϕ ∈ `p (N)∗ = `q (N) such that kvk = ϕ(v). Hence,
which implies that |ϕ(v)| = kϕk kvk, |ϕ(u)| = kϕk kuk, and |ϕ(w)| = kϕk kwk.
This first of these equalities yields
∞ ∞ ∞
!1/q ∞
!1/p
X X X X
1 = ϕn vn ≤ |ϕn vn | = |ϕn |q |vn |p = 1.
n=1 n=1 n=1 n=1
1 1 1 an 1
e p an + q bn = e + ebn ,
p q
where an = p log |vn | and bn = q log |ϕn | (in nonzero cases). By Lemma 5.13,
|vn |p = |ϕn |q for all n ∈ N. Likewise, |un |p = |wn |p = |ϕn |q for all n ∈ N.
Hence, |un| = |wn | = |vn |, for each n ∈ N. But vn = τ un + (1 − τ )wn , and so
Lemma 5.12 yields un = wn = vn ; that is, u = w = v.
5.4 Separation of Convex Sets by Linear Functionals 69
The proof of Theorem 5.14 requires several lemmas and the introduction
of the notion of a sublinear functional. The Hahn–Banach Extension Theorem
is the basis of the proof of Theorem 5.14.
Proof. Exercise 9.
The sublinear functional p in Proposition 5.16 is sometimes called the
Minkowski functional associated with C.
The Hahn–Banach Extension Theorem, in its most general form, is below.
Its proof is a fairly straightforward adaptation of the proof of Theorem 3.13.
Definition 5.18. Let V be a normed vector space and let VR be the real
normed vector space obtained from V by restricting the base field from C to
R.
1. A real subspace of V is a closed linear submanifold (over R) of VR .
2. A hyperplane of V is a subspace M such that V /M is isometrically iso-
morphic to C.
3. A real hyperplane of V is a real subspace N such that VR /N is isometrically
isomorphic to R.
70 5 Convexity
By Exercise 12, the interval µ(C1 ) is open, since C1 is open. Hence, γ 6∈ µ(C1 )
and so the inequalities (5.5) hold.
However, as C1 is compact, <ϕ(C1 ) is compact and <ϕ (C1 + Bε (0)) has com-
pact closure, disjoint from C2 . Therefore, there are γ1 , γ2 ∈ R such that
Proof. By Theorem 5.9, K has at least one extreme point. Let C = Conv (ext K)
and consider the compact convex subset C of K. If C 6= K, then let w0 ∈ K\C .
72 5 Convexity
Kϕ = {w ∈ K | <ϕ(w) = δ}
5.5 Exercises
satisfies |ϕ(v)| ≤ p(v), for every v ∈ V , if and only if |µ(v)| ≤ p(v), for
every v ∈ V .
12. Let V be a normed vector space and assume that C ⊂ V is an open set.
Let ϕ ∈ V ∗ and µ = <ϕ. Prove that if µ 6= 0, then µ(C) is an open subset
of R.
13. Proof that if K ⊂ Rn is compact and convex, and if w ∈ K, then w is a
convex combination of at most n + 1 extreme points of K.
14. A cone in a finite-dimensional Banach space V is a convex subset C ⊂ V
such that τ v ∈ C, for every τ ∈ R+ and v ∈ C. Let C † = {ϕ ∈ V ∗ | ϕ(v) ≥
0 , ∀ v ∈ C}.
a) Prove that C † is a cone in the dual space V ∗ .
b) Prove that C †† = C.
6
Hilbert Spaces
Among Banach spaces, the so-called Hilbert spaces are of special importance,
for these spaces carry a geometry that is a formal abstraction of the usual
Euclidean geometry of Cn .
If η 6= 0, then |hξ, ηi| = hξ, ξi1/2 hη, ηi1/2 if and only if ξ = αη for some α ∈ C.
Proof. The result is trivially true if hξ, ηi = 0. Assume, therefore, that hξ, ηi =
6
0. For any λ ∈ C,
For
76 6 Hilbert Spaces
hξ, ξi
λ = ,
hη, ξi
the inequality above becomes
kξ + ηk2 = hξ + η, ξ + ηi
Inner products allow one to carry intuitions and concepts from Euclidean
geometry, such as magnitude and angle, to the study of vectors.
= 2kξ0 − ηn k2 + 2kξ0 − ηm k2
2 1 1
< 4 (dist (ξ0 , K)) + n + m ,
where the second equality above follows from the parallelogram law. On the
other hand,
2
4 (dist (ξ0 , K)) ≤ 4kξ0 − 12 (ηn + ηm )k2
= kξ0 − ηk + ηk − ηk
≤ kξ0 − ηk k + kηk − ηk
q
2 1
< (dist (ξ0 , K)) + k + kηk − ηk .
Thus,
kη − η 0 k2 = 4kξ0 − ηk2 − 4kξ0 − 12 (η + η 0 )k2
2
= 4 (dist (ξ0 , K)) − 4kξ0 − 12 (η + η 0 )k2
2 2
≤ 4 (dist (ξ0 , K)) − 4 (dist (ξ0 , K))
= 0.
That is, η 0 = η.
S ⊥ = {η ∈ H | hη, ξi = 0, ∀ ξ ∈ S} .
2. ξ − η ∈ M ⊥ .
Proof. Assume that dist (ξ, M ) = kξ − ηk. Let η 0 ∈ M . We aim to prove
that hξ − η, η 0 i = 0. To this end, consider any α ∈ C. Because η + αη 0 ∈ M ,
kξ − (η + αη 0 )k ≥ dist (ξ, M ) = kξ − ηk. Thus,
kξ − ηk2 ≤ k(ξ − η) − αη 0 k2
whence
2< (αhη 0 , ξ − ηi) ≤ |α|2kη 0 k2 . (6.5)
The complex number hη 0 , ξ − ηi is either zero or it is not. If it is zero, then
ξ − η is orthogonal to η 0 , as desired. We shall now show that the assumption
that hη 0 , ξ − ηi =
6 0 will lead to a contradiction.
Assume that hη 0 , ξ − ηi = 6 0 and let α = t hη 0 , ξ − ηi, for some t > 0.
Inequality (6.5) becomes
Because hη 0 , ξ − ηi =
6 0 and t > 0, the inequality above implies that
2 ≤ t kηk2 ,
= kξ − ηk2 + kη − η 0 k2
≥ kξ − ηk2 .
Thus,
kξ − ηk ≤ inf{kξ − η 0 k | η 0 ∈ M } = dist (ξ, M ) ,
which proves that kξ − ηk = dist (ξ, M ).
To conclude the discussion of Hilbert space geometry, we shall consider
decompositions of a Hilbert space as a direct sum of two subspaces that are
orthogonal to each other.
Recall from linear algebra that if M and N are linear submanifolds of H,
then M + N denotes the set
M + N = {ω + δ | ω ∈ M and δ ∈ N }
Thus, the sequences {ωk }k∈N ⊂ M and {δk }k∈N ⊂ N are Cauchy sequences.
Let ω ∈ M and δ ∈ N be the limits, respectively, of these sequences. (The
limits exist because H is a Hilbert space; the limits lie in M and N , respec-
tively, because M and N are closed.) The verification that ξ = ω + δ is left to
the reader. Hence, M + N is closed.
This situation described in Proposition 6.13 is formalised by the following
definition.
Proof. Suppose that Span B is dense in H and let M be the closure of Span B.
Therefore, if ξ ∈ H is orthogonal to every φ ∈ B, then ξ is orthogonal to
Span B. By continuity of the inner product, this means that ξ is orthogonal
to M : ξ ∈ M ⊥ = H ⊥ = {0}. Thus, there is no orthonormal set B0 in H that
properly contains B.
The proof of the converse is straightforward.
One passes from linearly independent vector to orthonormal vectors via
the Gram–Schmidt Process.
Span{φ1 , . . . , φk } = Span{v1 , . . . , vk } .
Proof. Exercise 7.
The cardinality of the basis is related to the topology of the Hilbert space
by way of the following proposition.
W = SpanQ+iQ {φk | k ∈ N}
W = SpanC {ω | ω ∈ S}
is dense in H. The spanning set S must contain an algebraic basis for the
vector space W . Thus, W has a countable basis and to this basis one can
apply the Gram–Schmidt process to obtain a countable set {φk }k∈N of or-
thonormal vectors whose linear span O is dense in the closure W of W . But
W = H, which indicates that O⊥ = {0}. In other words, there are no nonzero
vectors orthogonal to {φk }k∈N, which proves that {φk }k∈N is a maximal set
of orthonormal vectors. That is, {φk }k∈N is an orthonormal basis of H.
If H has finite dimension n, then n is an algebraic invariant of the space
H: all linear bases of H must have the same cardinality, whence any two
orthonormal bases of H are in bijective correspondence. If H has infinite
dimension and is separable, then all orthonormal bases of H are countably
infinite and are, therefore, in bijective correspondence with one another.
It is also true that if H is a nonseparable Hilbert space, then all orthonor-
mal bases of H are in bijective correspondence. The proof of this requires
some delicate and substantial cardinal arithmetic, however. We will not need
to use such a theorem in what follows.
Hilbert spaces with a countable orthonormal basis are especially easy to
analyse. The following theorem, which is an abstraction of classical Fourier
series, illustrates this fact.
Observe that
n
X
0 ≤ kξ − ξn k2 = hξ − ξn , ξ − ξn i = kξk2 − |hξ, φk i|2 .
k=1
84 6 Hilbert Spaces
Hence,
n
X
lim |hξ, φk i|2 ≤ kξk2 ,
n→∞
k=1
which implies that the sequence {ξn }n∈N is a Cauchy sequence. Because H is
a Hilbert space, this sequence has a limit ξ 0 ∈ H. We shall prove that ξ 0 = ξ.
Choose k ∈ N. Direct computation shows that hξ − ξn , φk i = 0 for every
n ≥ k. Hence, if n ≥ k, then
| h(ξ − ξ 0 ), φk i | = | h(ξ − ξn + ξn − ξ 0 ), φk i |
= | h(ξ − ξn ), φk i + h(ξn − ξ 0 ), φk i |
= | h(ξn − ξ 0 ), φk i |
≤ kξn − ξ 0 k .
Proposition 6.23. Assume that {φk }k∈N is an orthonormal basis for a sep-
arable Hilbert space H.
6.5 Examples of Separable Hilbert Spaces 85
2. For every ξ ∈ H, X
kξk2 = |hξ, φk i|2 .
k∈N
and consider
n
X
ξn = hξ, φk i φk .
k=1
Because limn→∞ kξn − ξk = 0, Lemma 6.5 shows that
hξ, ηi = lim hξn , ηi
n→∞
n
!
X
= lim hξ, φk i hφk , ηi
n→∞
k=1
X
= hξ, φk ihη, φk i .
k∈N
Definition 6.24. The Legendre polynomials in L2 ([a, b]) are the elements of
the countable infinite set of polynomials that result from applying the Gram–
Schmidt process in L2 ([a, b]) to the set {1, t, t2, t3 , . . .}.
As a concrete example, in L2q([−1, 1]) the Legendre polynomials are the
polynomials φk given by φ0 (t) = 12 and
r
2k + 1 1 dk 2
(t − 1)k ,
φk (t) = k k
k ∈ N.
k 2 k! dt
Theorem 6.25. The set of Legendre polynomials in L2 ([a, b]) forms an or-
thonormal basis of L2 ([a, b]).
Proof. Let L denote the set of Legendre polynomials and consider the linear
submanifold Span L of L2 ([a, b]). Because the Gram–Schmidt orthogonalisa-
tion process does not alter the span of the vectors that it is applied to,
Span L = Span {1, t, t2, t3 , . . .} .
Therefore, by the Weierstrass Approximation Theorem, if ϑ : [a, b] → C is con-
tinuous and ε > 0, there is a element f ∈ Span L such that |ϑ(t) − f(t)| < ε
for all t ∈ [a, b]. Furthermore, Corollary 2.42 states that the set of all contin-
uous functions ϑ is dense in L2 ([a, b]). Thus, a simple calculation shows that
Span L is dense in L2 ([a, b]), which proves that L is an orthonormal basis of
L2 ([a, b]).
In some instances the Legendre polynomials are not the best orthonormal-
basis vectors to work with. This is especially true if one is interested in periodic
phenomena. In such cases, classical Fourier series offers a better tool.
For every k ∈ Z, consider the 2π-periodic function φk : [−π, π] → C given
by
ei kt
φk (t) = √ , (6.9)
2π
√
where i = −1 and where ei kt = cos(kt) + i sin(kt). If n, m ∈ Z, then
Z π
hφn , φm i = φn (t)φm (t) dt
−π
Z π
= ei (n−m)t dt
−π
= 0, if n 6= m, or 1, if n = m .
Thus, {φn }n∈Z is a set of orthonormal vectors in L2 ([−π, π]) that span the
ring T of trigonometric polynomials. Indeed, {φn }n∈Z is an orthonormal basis
of L2 ([−π, π]).
Notation. L2 (T) shall denote L2 ([−π, π], m).
6.6 Hilbert Space Duality 87
Proof. Theorem 2.38 shows that Span {φn }n∈Z is uniformly dense in the lin-
ear submanifold of LL2 (T) consisting of all 2π-periodic continuous functions;
hence, this is also true with respect to the norm of L2 (T). Corollary 2.42 shows
that C([−π, π]) is dense in L2 (T). Therefore, it is sufficient to show that every
f ∈ C([−π, π]) can be approximated to within ε in (the norm of) L2 (T) by a
2π-periodic continuous function h.
To this end, choose f ∈ C([−π, π]) and let ε > 0. Let M = max {|f(t)| |, t ∈
ε2
[−π, π]} and choose δ > 0 such that δ < 8M 2 . Let h ∈ C([−π, π]) be the
function that agrees with f on [−π + δ, π − δ], is a straight line from the point
(−π, 0) to the point (−π + δ, f(−π + δ)), and is a straight line from the point
(π −δ, f(π −δ)) to the point (π, 0). Thus, |f(t)−h(t)| = 0 for t ∈ [−π +δ, π −δ]
and |f(t) − h(t)| ≤ 2M for all t 6∈ [−π + δ, π − δ]. Hence,
Z Z
kf − hk2 = |f − h|2 dm + |f − h|2 dm ≤ 8M 2 δ .
[−π,−π+δ] [π−δ,π]
Definition 6.27. The classical Fourier coefficients of f ∈ L2 (T) are the com-
ˆ
plex numbers f(k) defined by
fˆ(k) = hf, φk i .
fˆ(k) ei kt
X
k∈Z
for each ξ ∈ H there exist a unique vector ν ∈ ker ϕ and scalar λ ∈ C for
which
ξ = ν + λw . (6.10)
Thus,
ϕ(ξ) = ϕ(ν) + ϕ(λ w) = λ ϕ(w) .
Now take η ∈ H to be the vector
ϕ(w)
η= w,
||w||2
which depends only on the vector w and the value of ϕ at w. Then, for an
arbitrary ξ ∈ H having form (6.10),
* +
ϕ(w) hw, wi
hξ, ηi = ν + λ w, 2
w = λ ϕ(w) = ϕ(ξ) .
||w|| ||w||2
6.7 Exercises
for all ξ, η ∈ H.
3. Let K be the closed unit ball of a Hilbert space H. Prove that ξ ∈ K is
an extreme point of K if and only if kξk = 1.
4. Prove that in a Hilbert space H, S ⊥ is a subspace for every S ⊆ H.
5. In a Hilbert space H, let M1 and M2 be closed subspaces.
a) Prove that (M1 + M2 )⊥ = M1⊥ ∩ M2⊥ .
b) Prove or find a counterexample to (M1 ∩ M2 )⊥ = M1⊥ + M2⊥ .
6. Assume that {φk }k∈N is an orthonormal basis for a Hilbert space H, and
let {δk }∞
k=1 be a sequence of nonnegative real numbers. Hilbert’s cube is
the set (∞ )
X
∆ = αk φk : |αk | ≤ δk , ∀ k .
k=1
7. Prove that every nonzero Hilbert space has an orthonormal basis. In par-
ticular, if O ⊂ H is a set of orthonormal vectors, then prove that H has
an orthonormal basis B that contains O.
90 6 Hilbert Spaces
φk = (0, . . . , 0, 1, 0, . . . ) ,
where the 1 occurs in the k-th term of the sequence. Prove that {φk }k∈N
is an orthonormal basis of `2 (N).
12. Suppose that in a Hilbert space H, φ1 , . . . , φn are orthonormal vectors.
Prove that if ξ1 , . . . , ξn ∈ H satisfy kξj − φj k < n−1/2 for each j, then
ξ1 , . . . , ξn are linearly independent.
13. Consider the Fourier series expansions of f1 (t) = t and f2 (t) = eαt in
L2 ([−π, π]). Calculate the Fourier series of each fj and use Parseval’s
identity to establish each of the following formulae:
∞
X 1 π2
= ;
n=1
n2 6
∞
X 1 π
= coth(απ) .
n=1
n2 + α 2 α
Part II
Operators
7
Operators on Banach Spaces
kS ◦ T k ≤ kSk kT k ,
sup kT vk = ∞ ∀v ∈ G.
T ∈Λ
fT (v) = kT vk , v∈V.
Hence,
2n
kT k ≤ , ∀T ∈ Λ,
ε
which proves that Λ is uniformly bounded.
Case #2: Un is dense in V for every n ∈ N. In this case, let G ⊆ V be the
Gδ -set defined by \
G = Un .
n∈N
sup kT vk = ∞ ,
T ∈Λ
that is,
2kηk
T u − γ
η
< ε.
γ
2kηk
2kηk 2kηk
Now let ξ = γ
u to obtain kξk < γ
(2n) = δ −1 kηk and kT ξ − ηk < ε.
Lemma 7.5. If V and W are Banach spaces and if T ∈ B(V, W ) is a sur-
jection, then there is a δ > 0 with the following property: if ε is given and if
η ∈ W satisfies kηk < δ, then there exists ξ ∈ V such that
1. kξk < 1 + ε, and
2. T ξ = η.
Proof. Let δ > 0 be the number determined by Lemma 7.4. Suppose that
ε > 0 is given and that η ∈ W satisfies kηk < δ. By Lemma 7.4, there is a
ξ1 ∈ V with kξ1 k < δ −1 kηk (whence, kξ1 k < 1) and kT ξ1 − ηk < δε 2 .
Apply Lemma 7.4 again using T ξ1 − η in place of η: thus, there is ξ2 ∈ V
with kξ2 k < δ −1 kT ξ1 − ηk (thus, kξ2 k < ε/2) and kT ξ2 − (T ξ1 − η)k < 2δε2 .
Repeated use of Lemma 7.4 produces, inductively, a sequence {ξk }k∈N in
V such that, for every k ∈ N,
k
ε X δε
kξk k < and kη − T ξj k < .
2k−1 j=1
2k
k
X
Hence, { ξj }k∈N is a Cauchy sequence in V ; denote its limit ξ ∈ V by
j=1
∞
X
ξ = ξj .
j=1
Thus,
∞ ∞
X X 1
kξk ≤ kξj k ≤ kξ1 k + ε < 1 + ε.
j=1 j=2
2j−1
k
X
By continuity of T , kT ξ − T ( ξj )k → 0. Hence, T ξ = η.
j=1
Proof of the Open Mapping Theorem. Let δ > 0 be the number given by
δ
Lemma 7.5. Suppose that ε > 0. Choose any η ∈ W such that kηk < 1+ε .
Hence, k(1+ε)ηk < δ, and so by Lemma 7.5 there is an ξ ∈ V with kξk < 1+ε
and T ξ = (1 + ε)η. Therefore, ξ˜ = (1 + ε)−1 ξ is in the open unit ball of V and
T ξ̃ = η. Because ε is an arbitrary positive number, this proves that
[ δ
{w ∈ W | kwk < δ} = η ∈ W | kηk <
ε>0
1+ε
⊆ {T ξ | ξ ∈ V, kξk < 1} ,
7.3 Invertible Operators 97
Proof. Exercise 4.
Proposition 7.9. If V and W are Banach spaces, then the following state-
ments are equivalent for T ∈ B(V, W ).
1. T is a bijection;
2. T is bounded below and has dense range.
Lemma 7.16. If T ∈ B(V ) and λ ∈ C are such that |λ| > kT k, then (1 −
1
λ
T )−1 exists,
−1
−1 ∞
1 X
−k k
1
|λ|
1− T = λ T , and
1− T
≤ .
λ
λ
|λ| − kT k
k=0
∞
|λ|
X
Hence, the series λ−k T k converges to some S ∈ B(V ) with kSk ≤ |λ|−kT k
.
k=0
As
1 1 1 1
1− T 1+ T + ... + k Tk = 1− T k+1 ,
λ λ λ λk+1
we conclude that S = (1 − λ1 T )−1 .
kT k
k(1 − T )−1 k ≤ .
1 − kT k
100 7 Operators on Banach Spaces
∞
X
Proof. Lemma 7.16 shows that (1 − T )−1 exists, equals T k , and satisfies
k=0
1
k(1 − T )−1 k ≤ 1−kT k . Further,
∞
X
X∞
k k
k1 − (1 − T )−1 k =
1 − T
=
T
k=0 k=1
∞
!
X
k
=
T − T
= k − T (1 − T )−1 k
k=0
≤ kT k k(1 − T )−1 k .
kT k
Hence, k1 − (1 − T )−1 k ≤ 1−kT k .
Lemma 7.18. If T ∈ B(V ) and λ0 6∈ σ(T ), then (T − λ1)−1 exists for all
λ ∈ C for which |λ − λ0 | < k(T − λ0 1)−1 k−1 .
Proof. Lemma 7.17 asserts that 1 + (λ0 − λ)(T − λ0 1)−1 is invertible, for
all λ ∈ C for which |λ − λ0 | < k(T − λ0 1)−1 k−1 . However,
Thus, (T − λ1)−1 exists for all λ ∈ C for which |λ − λ0 | < k(T − λ0 1)−1 k−1 .
Lemma 7.19. Suppose that T ∈ B(V ), ϕ ∈ B(V )∗ , and λ0 6∈ σ(T ). There is
an ε > 0 such that if Ω = {λ ∈ C | |λ − λ0 | < ε}, then
Ω ∩ σ(T ) = ∅ and the
function f : Ω → C defined by f(λ) = ϕ (T − λ1)−1 is differentiable at λ0 .
Proof. Let ε = k(T − λ0 1)−1 k−1 and Ω = {λ ∈ C | |λ − λ0 | < ε}.By Lemma
7.17, Ω ∩ σ(T ) = ∅. Define f : Ω → C by f(λ) = ϕ (T − λ1)−1 . For every
λ ∈ Ω,
≤ 2k(T − λ0 1)−1 k3 |λ − λ0 |2 .
Hence,
7.5 The Spectrum 101
f(λ) − f(λ0 )
lim = ϕ((T − λ0 1)−2 ) ,
λ→λ0 λ − λ0
which implies that f is differentiable at λ0 ∈ C.
We are now prepared to prove that every operator on a Banach space has
a nonempty compact spectrum.
Proof of Theorem 7.15. If λ ∈ C satisfies |λ| > kT k, then (1 − λ1 T )−1 exists,
by Lemma 7.16. Therefore, −λ(1 − λ1 T ) = (T − λ1) is invertible. Hence,
σ(T ) ⊆ {ζ ∈ C | |ζ| ≤ kT k}. This proves that σ(T ) is bounded. Furthermore,
C \ σ(T ) is an open set, by Lemma 7.18. Let λ0 ∈ C \ σ(T ). Thus, σ(T ) is
bounded and closed, and hence σ(T ) is compact. It remains to show that σ(T )
is nonempty.
Assume, contrary to what we aim to prove, that σ(T ) = ∅. Choose any
ϕ ∈ B(V )∗ and let f : C → C be defined by f(λ) = ϕ((T −λ1)−1 ). By Lemma
7.19, f is differentiable at each λ0 ∈ C. Hence, f is holomorphic on C.
Suppose that λ ∈ C satisfies |λ| > kT k. Then
−1
−1 1
(T − λ1)−1 = 1− T
λ λ
and so
−1
1
1
kϕk
|f(λ)| ≤ kϕk
1− T
≤ ,
|λ|
λ
|λ| − kT k
by Lemma 7.16. Hence, lim f(λ) = 0.
|λ|→∞
On the compact set {ζ ∈ C | |ζ| ≤ kT k}, the continuous map ϕ is bounded.
kϕk
Therefore, the inequality |f(λ)| ≤ |λ|−kT k
for |λ| > kT k implies that ϕ is
bounded on its entire domain C. But the only bounded entire functions are
the constant functions. Thus, there is a α ∈ C such that f(λ) = α, for all
λ ∈ C. Because lim f(λ) = 0, the constant α must in fact be zero. This,
λ→∞
therefore, proves that ϕ((T − λ1)−1 ) = 0 for all ϕ ∈ B(V )∗ and λ ∈ C. By the
Hahn–Banach Theorem, this implies that (T − λ1)−1 = 0, for each λ, which
is impossible since 0 is not invertible. Therefore, it must be that σ(T ) 6= ∅.
Although σ(T ) lies in the closed disc of radius kT k and centre 0 ∈ C, there
could be a smaller disc with the same centre that contains σ(T ). The radius
of the smallest of such discs is called the spectral radius.
Definition 7.20. The spectral radius of T ∈ B(V ) is the quantity spr T de-
fined by
spr T = max |λ| .
λ∈σ(T )
f(λ) = ϕ (1 − λT )−1 .
The final general result of this section concerns the relationship between
the spectra of ST and T S.
f (σ(T )) = σ(f(T ))
Proof. Let f be a complex polynomial and suppose that λ ∈ σ(T ). Let g(t) =
f(t) − f(λ). As g(λ) = 0, there is a polynomial h such that g(t) = (t − λ)h(t).
Hence, f(T ) − f(λ)1 = g(T ) = (T − λ1)h(T ) = h(T )(T − λ1). If f(T ) − f(λ)1
were invertible, then these equations imply that T − λ1 has a left and right
inverse, from which we would conclude that T − λ1 is invertible (Exercise 8).
Therefore, as T − λ1 is not invertible, it must be that f(T ) − f(λ)1 is not
invertible. That is, f(λ) ∈ σ(f(T )). This proves that f (σ(T )) ⊆ σ(f(T )).
Conversely, suppose that ω 6∈ f (σ(T )). Thus, ω 6= f(λ), for all λ ∈ σ(T ).
Let h(t) = f(t)−ω and factor h: h(t) = (t−ω1 )n1 · · · (t−ωm )nm . Since h(λ) 6= 0
for all λ ∈ σ(T ), λ 6= ωj for every j and λ ∈ σ(T ). That is, ωj 6∈ σ(T ) for each
j. The factorisation h leads to the following expression for h(T ):
δkvk ≤ k(T −λ1)vk = k(T −µ1)v +(µ−λ)vk ≤ k(T −µ1)vk + |λ−µ| kvk ,
δkvk ≤ k(T −λ1)vk = k(T −µ1)v +(µ−λ)vk ≤ k(T −µ1)vk + |λ−µ| kvk ,
k(T − λ1) − (T − λn0 1)k = |λn0 − λ| < 1/γ < k(T − λn0 1)−1 k−1 .
This would imply that T −λ1 is invertible, contrary to the fact that λ ∈ σ(T ).
Hence, it must be that {k(T − λn 1)−1 k}n∈N is an unbounded sequence; that
is k(T − λn 1)−1 k−1 → 0.
By definition of norm, for each n ∈ N there is a unit vector wn ∈ V such
that
1
k(T − λn 1)−1 k < k(T − λn 1)−1 wn k + .
n
106 7 Operators on Banach Spaces
Let
1
vn = (T − λn 1)−1 wn , ∀n ∈ N.
k(T − λn 1)−1 wn k
Thus, kvn k = 1 and
1
(T − λ1)vn = wn + (λn − λ)vn .
k(T − λn 1)−1 wn k
Hence,
1
k(T − λ1)vn k ≤ + |λ − λn | ,
k(T − λn 1)−1 k−1 − (1/n)
7.8 Examples
Multiplication Operators
Tψ (f) = ψf , ∀ f ∈ C(X) .
= kψk kfk .
Thus, kTψ (f)k ≤ kψk kfk, for every f ∈ V . This proves that Tψ is bounded
and kTψ k ≤ kψk. The following facts will be shown below:
1. kTψ k = kψk;
2. σ(Tψ ) = σap (Tψ ) = {ψ(t) | t ∈ X};
3. λ ∈ σp (Tψ ) if and only if {t ∈ X | ψ(t) 6= λ} is not dense in X.
7.8 Examples 107
To show that kTψ k = kψk, let f(t) = 1 and note that f ∈ C(X), kfk = 1,
and Tψ f = ψ. Thus, kTψ (f)k = kψk kfk.
If λ 6∈ {ψ(t) | t ∈ X}, then ψ(t) − λ 6= 0 for all t ∈ X and so g ∈ C(X),
1
where g(t) = ψ(t)−λ . Since Tg (Tψ − λ1) = (Tψ − λ1)Tg = 1, λ 6∈ σ(Tψ ). Hence,
σ(T ) ⊆ {ψ(t) | t ∈ X}.
Suppose now that t0 ∈ X and λ = ψ(t0 ). We shall show that λ ∈ σap (Tψ ).
Let ε > 0 be given and let U = {t ∈ X | |ψ(t) − ψ(t0 )| < ε}, which is an
open subset of X (since ψ is continuous). Because X is Hausdorff, there is an
open subset W ⊂ U such that t0 ∈ W ⊂ W ⊂ U [9, Lemma 3.8.2]. Hence, W
and X \ U are disjoint closed sets in X. Because compact Hausdorff spaces
are normal, Urysohn’s Lemma implies that there is a continuous function
f : X → [0, 1] such that f(W ) = {1} and f(X \ U ) = {0}. Hence, kfk = 1
and |(ψ(t)−λ)f(t)| < ε, for all t ∈ X. That is, k(Tψ −λ1)fk < ε, which proves
that λ ∈ σap (Tψ ). The assertion about σp (Tψ ) is left as an exercise (Exercise
9). ♦
Assume that p, q ∈ (1, ∞) satisfy 1/p + 1/q = 1. Thus, the dual of `p (N)
is isometrically isomorphic to `q (N). The linear transformation S : `p (N) →
`p (N) defined by
v1 0
v2 v1
Sv = S v3 = v2 , ∀v ∈ `p (N) ,
.. ..
. .
Because
108 7 Operators on Banach Spaces
∞
X
ϕ(Sv) = vn+1 ϕn ,
n=1
it must be that
ϕ2
ϕ3
S ∗ ϕ = ϕ4 , ∀ ϕ ∈ `p (N)∗ .
..
.
Therefore, S ∗ ϕ = λϕ if and only if ϕn+1 = λϕn for all n ∈ N. Hence, for every
λ ∈ D, S ∗ ϕ = λϕ, where the action of ϕ ∈ `p (N)∗ is given by
∞
X
ϕ(w) = wn λn−1 , ∀ w ∈ `p (N) .
n=1
D ⊂ σ(S) ⊆ D ,
Integral Operators
Now because
Z 1
ϕ(Tκ f) = Tκ f(t) ϕ(t) dm(t)
0
Z 1Z 1
= κ(t, s)f(s) ϕ(t) dm(s)dm(t)
0 0
Z 1Z 1
= κ(t, s)f(s) ϕ(t) dm(t)dm(s) [Fubini’s Theorem]
0 0
Z 1 Z 1
= f(s) κ(t, s) ϕ(t) dm(t) dm(s) ,
0 0
7.9 Exercises
1. Let V be the linear submanifold of the Banach space C([0, 1]) that consists
of all f ∈ C([0, 1]) for which the derivative df/dt exists on (0, 1) and is
continuous on [0, 1]. Let Φ : V → C([0, 1]) be the linear transformation
Φ(f) = df/dt. Show that Φ is unbounded.
2. Let H be a Hilbert space and suppose that ξ, ξk ∈ H, k ∈ N, satisfy
T ∗ ψ(v) = ψ(T v) , ∀ ψ ∈ W ∗, v ∈ V ,
11. Consider the unit square X = [0, 1] × [0, 1] ⊂ R2 , and let κ ∈ C(X). Fix
1 ≤ p < ∞ and let Lp denote Lp ([0, 1], m) (Lebesgue measure). Consider
the integral operator Tκ ∈ B(Lp ).
a) Prove that if κ is a polynomial, which means κ has the form
m X
X n
κ(t, s) = αij ti sj , for some αij ∈ C ,
i=0 j=0
Proof. Let BV denote the closed unit ball of V . Thus, λ1(BV ) = λBV , and
so λ1 is a compact operator if and only if BV is a compact set. But BV is a
compact set if and only if V has finite dimension (Proposition 1.16).
Theorem 8.3. K(V ) is a nonzero norm closed ideal of B(V ). If V has infinite
dimension, then K(V ) 6= B(V ).
114 8 Compact Operators on Banach Spaces
+ k(K − Kp )vm,m k
{w − Kw | w ∈ M }
is a subspace of V .
Proof. Let M ⊆ V be a subspace and assume that M ∩ ker(1 −K) = {0}. (The
case where M ∩ ker(1 − K) 6= {0} will be handled at the end of the proof.)
Let
L = {w − Kw | w ∈ M }
and suppose that y ∈ L. We aim to prove that y ∈ {w − Kw | w ∈ M }. Thus,
there is a sequence of vectors yn ∈ {w − Kw | w ∈ M } with limit y; that is,
there is a sequence of vectors wn ∈ M such that
ky − (wn − Kwn )k → 0 .
Assume that the sequence {wn }n∈N admits a subsequence of vectors wnk in
which kwnk k → ∞. If this is the case, then let vk ∈ M denote the unit vector
kwnk k−1 wnk . The compactness of K implies that the sequence {Kvk }k∈N
admits a convergent subsequence {Kvkj }j∈N with limit, say, z ∈ V . Note that
1
kvkj − Kvkj k = kwnkj − Kwnkj k .
kwnkj k
Proof. Assume, contrary to what we aim to prove, that 1−K is not a surjection.
Set M0 = V and let
Proof. Let BV ∗ and BV denote the closed unit balls of V ∗ and V . Suppose
that {ϕn }n∈N is a sequence in BV ∗ . Because BV ∗ is weak∗ compact, by the
8.2 Properties of Compact Operators 117
< 3ε .
Hence,
As the choice of ε > 0 and j ∈ N are arbitrary, this proves that there is a
subsequence {ϕnj }j of {ϕn }n such that {K ∗ ϕnj }j is convergent (to K ∗ ϕ).
Hence, K ∗ is a compact operator.
Proof. Propositions 8.6 and 8.8 show that 1 − K is injective if and only if
1 − K is surjective. Therefore, if λ ∈ C is nonzero, then replacing K by the
compact operator λ1 K and using the fact that σ(K) = σap (K) ∪ σd (K), we
obtain λ ∈ σp (K) or λ 6∈ σ(K).
The Fredholm Alternative has implications for what properties the spec-
trum of a compact operator may exhibit.
n−1
X
= αj (λj − λn )vj .
j=1
Therefore, if ` < n, then [(K − λn 1)vn − Kv` ] ∈ Mn−1 . Hence, for any u ∈
Mn−1 ,
kKvn − Kv` k = kλn vn + (K − λn 1)vn − Kv` k
≤ kλn vn − uk
1
= |λn | kvn − λn uk
≥ δ2 > 0 .
This means that {Kvn }n does not admit a convergent subsequence, in con-
tradiction to the compactness of K. Therefore, it must be that 0 is a cluster
point of σ(K). The same argument shows that no nonzero λ ∈ σ(K) could
possibly be a cluster point of σ(K).
8.4 Examples
An Integral Operator
Consider the unit square [0, 1] × [0, 1] ⊂ R2 and denote L2 ([0, 1], m) by L2 .
Define a function K : L2 → L2 by
Z t Z 1
Kf (t) = f(s) dm(s) dm(t) , f ∈ L2 .
0 s
120 8 Compact Operators on Banach Spaces
As the left hand side of the equation above is twice differentiable almost
everywhere. The first differentiation leads to
Z 1
df
f(u) dm(u) = λ .
t dt
The second derivative yields
d2 f
−f = λ .
dt2
Evaluation of the two differential equations above at the boundary values for
t gives f(0) = f 0 (1) = 0.
Notice that hKg, gi ≥ 0, for every g ∈ L2 , and so 0 ≤ hKf, fi = λkfk2
implies√
that λ ≥ 0.√Therefore, the general solution of λf 00 + f = 0 is f(t) =
α1 eit/ λ + α2 e−it/
√
λ
. But f(0) = f 0 (1) = 0 implies that α2 = −α1 . Hence,
f(t) = 2i sin(t/ λ). To satisfy both f 6= 0 and f 0 (1) = 0, it is necessary and
sufficient that √1λ = π2 (2k − 1) for some k ∈ N. Hence, the eigenvalues of K
are
4
λk = , k ∈ N,
(2k − 1)2 π 2
√
and the corresponding eigenvectors are fk (t) = 2i sin(t/ λk ).
Now to show that K is a compact operator, first note that K = K2 K1 ,
where K1 , K2 : L2 → L2 are defined by
Z 1 Z t
K1 f(t) = f(s) dm(s) and K2 g(t) = g(s) dm(s) .
t 0
To be completed . . .
8.5 Exercises
The natural notion of adjoint for Hilbert space operators is slightly different
from the adjoint of Banach space operators. In the Hilbert space setting, one
needs to account for the fact that the inner product is not bilinear—rather,
it is conjugate linear in the second variable.
hT ξ, ηi = hξ, T ∗ ηi , ∀ ξ, η ∈ H . (9.1)
Definition 9.2. The operator T ∗ defined by equation (9.1) is called the ad-
joint of the operator T and the map T 7→ T ∗ on B(H) is called the canonical
involution, or more simply the involution, on B(H).
Proposition 9.3. The involution on B(H) has the following properties for
all S, T ∈ B(H) and α ∈ C:
1. T ∗∗ = T ;
2. (α T )∗ = α T ∗ ;
3. (S + T )∗ = S ∗ + T ∗ ;
4. (ST )∗ = T ∗ S ∗ .
hξ, T ∗ ηi = hT ∗∗ ξ, ηi , ∀ ξ, η ∈ H .
But of course we know that it is always the case that hξ, T ∗ηi = hT ξ, ηi; if ξ
is fixed, then for every η ∈ H,
hT ξ − T ∗∗ ξ, ηi = 0 .
Furthermore, kT ∗ k = kT k and kT ∗ T k = kT k2 .
kT ∗ T k ≤ kT ∗ k kT k = kT k kT k = kT k2
in particular. Conversely, if ξ, η ∈ H are unit vectors, then the Cauchy–
Schwarz inequality yields
|hT ξ, ηi|2 ≤ kT ξk2 kηk2 = hT ξ, T ξi = hT ∗ T ξ, ξi ≤ kT ∗ T k .
Thus, kT k2 ≤ kT ∗ T k.
Now we turn to the issue of representing adjoint transformations with
matrices.
Example 9.5. Matrix representations of T and T ∗ . Suppose that T ∈ B(H)
and that B = {φk }k∈N is an orthonormal basis of a separable Hilbert space
H. If T = [τij ] is the (infinite) matrix representation of T with respect to the
orthonormal basis B, then the (i, j)-entry of T is determined via
τij = hT φj , φi i .
In using this for T ∗ in place of T we conclude that the (p, q)-entry of the
matrix representation of T ∗ with respect to B is hT ∗ φq , φp i. Furthermore,
hT ∗ φq , φpi is given by
hT ∗ φq , φp i = hφp , T ∗ φq i = hT φp , φq i = τqp .
Thus, the matrix representation T ∗ of T ∗ is determined by transposing T , the
matrix representation of T , and then conjugating each entry. In other words,
T ∗ is the conjugate transpose of T . ♦
Proposition 9.6. If T ∈ B(H), then
1. ker T = (ran T ∗ )⊥ and
2. ran T = (ker T ∗ )⊥ .
Proof. For the proof of the first assertion, assume that ξ ∈ ker T . Any vector
in ran T ∗ has the form T ∗ η, for some η ∈ H. Since
hξ, T ∗ ηi = hT ξ, ηi = 0 ,
we conclude that ξ ∈ (ran T ∗ )⊥ .
Conversely, suppose that ξ ∈ (ran T ∗ )⊥ . Thus, for every η ∈ H, 0 =
hξ, T ∗ηi. In particular, if η = T ξ, then
0 = hξ, T ∗ηi = hξ, T ∗ T ξi = hT ξ, T ξi = kT ξk2 ,
and so ξ ∈ ker T .
The proof of the second assertion is left to the reader (Exercise 2).
Another aspect of the Hilbert space adjoint to be aware of—especially in
light of what has come before in the study of operators on Banach spaces—is
that the defect spectrum is characterised as follows:
Proposition 9.7. If T ∈ B(H), then λ ∈ σd (T ) if and only if λ ∈ σp (T ∗ ).
Proof. Exercise 3.
124 9 Operators on Hilbert Spaces
9.2 Examples
Multiplication Operators
where ψ(E) denotes the closure in C of ψ(E) = {ψ(t) | t ∈ E}, and where E
is a Borel subset of X. The essential supremum of ψ is the quantity
Mψ (f) = ψf , ∀ f ∈ L2 ,
Z 1/2
≤ ess-sup |ψ| |f|2 dµ
X
= kψk∞ kfk .
Thus, kMψ fk2 ≥ (γ − ε)2 kfk2 , and so kMψ k ≥ kψk∞ − ε. Hence, kMψ k ≥
kψk∞ .
The multiplication operator Mψ satisfies
Z Z
hf, Mψ gi = fψg dµ = fψfg dµ = hMψ f, gi , ∀ f, g ∈ L2 .
X X
Let L2 (T) denote the Hilbert space L2 ([−π, π], m). By Theorem 6.26, an or-
thonormal basis of L2 (T) is given by the continuous functions φk : [−π, π] →
C, k ∈ Z, defined by
1
φk (t) = √ ei kt , t ∈ [−π, π] .
2π
fˆ(k) φk ,
X
f =
k∈Z
Recall that the Hardy space H 2 (T) is the subspace of L2 (T) defined by
Z π
H 2 = f ∈ L2 (T) | f(t)e−ikt dm(t) = 0, ∀ k < 0 .
−π
The bilateral shift operator B on L2 (T) has the property that Bf ∈ H 2 (T)
for every f ∈ H 2 (T). (To verify this, one need only check that the Fourier
coefficients of Bf vanish for negative k; this is clear because the Fourier coef-
ficients of Bf are obtained from f by a forward shift of the Fourier coefficients
of f.) Thus B induces an operator on H 2 (T) which is denoted by S and is
called the unilateral shift operator.
The adjoint of S is not the restriction of B ∗ to H 2 (T) because B ∗ φ0 =
φ−1 6∈ H 2 (T) even though φ0 ∈ H 2 (T). However, it is not difficult to verify
that S ∗ satisfies Sφk = φk−1 , for all k ∈ N and S ∗ φ0 = 0 (Exercise 5). ♦
Integral Operators
Consider the unit square X = [0, 1]×[0, 1] ⊂ R2 , and let κ ∈ L2 (X, m2 ), where
m2 denotes Lebesgue measure on R2 . Let L2 denote L2 ([0, 1], m) (Lebesgue
measure). For each f ∈ L2 , let Kκ f denote a function whose value at t ∈ [0, 1]
is given by given by
Z 1
Kκ f (t) = κ(t, s)f(s) dm(s) .
0
Because
Z 1
hKκ f, gi = Kκ f(t) g(t) dm(t)
0
Z 1 Z 1
= κ(t, s)f(s) g(t) dm(s)dm(t)
0 0
Z 1 Z 1
= κ(t, s)f(s) g(t) dm(t)dm(s) [Fubini’s Theorem]
0 0
Z 1 Z 1
= f(s) κ(t, s)g(t) dm(t) dm(s) ,
0 0
9.3 Hermitian Operators 127
Much of the theory of Hilbert space operators is devoted to the ways in which
T and T ∗ interact. The first case of interest occurs when T and T ∗ are in fact
the same. Such operators are probably the most important in all of operator
theory and its applications.
and therefore hT ξ, ξi ∈ R.
Conversely, suppose that hT ϑ, ϑi ∈ R for all ϑ; then hT ϑ, ϑi = hϑ, T ϑi.
Let λ ∈ C be arbitrary and set ϑ = ξ + λη. Let <z and =z denote the real
and complex parts of a complex number z. Because
In other words,
= λhT ξ, ηi = = λhξ, T ηi .
128 9 Operators on Hilbert Spaces
√
With λ = −1, the equation above is <hT ξ, ηi = <hξ, T ηi, whereas with
λ = 1 the equation becomes =hT ξ, ηi = =hξ, T ηi. This proves, then, that
hT ξ, ηi = hξ, T ηi for all vectors ξ, η ∈ H. Hence, T ∗ = T .
The next set of propositions reveals some striking features of the spectra
of hermitian operators.
λ = λhξ, ξi = hλξ, ξi = hT ξ, ξi ∈ R .
Hence, σd (T ) = σp (T ) ⊂ R and
Hence, k(T − λ1)ξk ≥ 21 |λ − λ| kξk for all ξ ∈ H. This proves that (T − λ1) is
bounded below and, hence, λ 6∈ σap(T ). This concludes the proof of σ(T ) =
σap(T ) ⊂ R.
= ν 4 − ν 2kT ξk2 .
9.4 Normal Operators 129
Proof. Exercise 9.
Hence, λ ∈ σp (N ) ⊆ σap (N ).
In particular,
1
spr N = lim kN 2k k 2k . (9.5)
k
Theorem 9.18. Assume that T ∈ B(H) is hermitian and that σ(T ) ⊆ [a, b].
Then for any continuous function f : [a, b] → C there is a hermitian operator
denoted by f(T ) ∈ B(H) with the property that
kf(T ) − qn (T )k → 0
Proof. The existence of the sequence {qn }n∈N is assured by the Weierstrass
Approximation Theorem. Since this sequence is assume to converge uniformly
on [a, b] to f, the sequence is a Cauchy sequence in C([a, b]). Note that qm (T )−
qn (T ) is hermitian, for all m, n ∈ N, and that the Spectral Mapping Theorem
shows that
Proposition 9.19. If T ∈ B(H) is hermitian and σ(T ) ⊆ [a, b], then for all
continuous functions f, g : [a, b] → C and α ∈ C we have:
1. kf(T )k = maxλ∈σ(T ) |f(λ)|,
2. αf(T ) = α(f(T )),
3. (f + g)(T ) = f(T ) + g(T ), and
4. fg(T ) = f(T )g(T ).
Proof. By Proposition 9.6, ker P = (ranP ∗ )⊥ . Thus, ker P = (ranP )⊥, because
P ∗ = P . Furthermore, ran (1 − P ) = ran P because
Hence, P ∗ = P .
Finally, by the definition of the action of P on H, it is obvious that M =
ran P and M ⊥ = ker P = ran(1 − P ).
Why the term “lattice”? The answer is given by the following definition
and proposition.
From the point of view of invariant subspace theory, what makes projec-
tions of interest is that, for a fixed operator T , the issue of whether a subspace
is invariant or not can be reformulated as an equation involving operators.
T − T P = (1 − P )T (1 − P ) = T − P T − T P + P T P = T − P T ,
and so T P = P T .
Conversely, if P T = T P , then P T P = T P 2 = T P ; thus ran P ∈ Lat T by
1. Moreover,
(1 − P )T (1 − P ) = T − T P − P T + P T P = T − T P = T (1 − P ) ,
T11 : L → L , T12 : M → L ,
T21 : L → M , T22 : M → M .
If P is the projection with range L and kernel M , then the operators Tij are
recovered from T and P via
P T P = T11 : ran P → ran P ,
P T (1 − P ) = T12 : ran (1 − P ) → ran P ,
(1 − P )T P = T21 : ran P → ran (1 − P ) ,
(1 − P )T (1 − P ) = T22 : ran (1 − P ) → ran (1 − P ) .
and that
T11 0
L, M ∈ Lat T if and only if T = .
0 T22
To show this, note T ∗ = T and σ(T ) = {0, 1, 2}; thus, T is indeed positive.
The normalised eigenvectors φ1 , φ2 , φ3 ∈ C 3 corresponding to the eigen-
values λ1 = 0, λ2 = 1, and λ3 = 2 of T are:
√1 √1
1
1 2 1 1 2
φ1 = √ −1 , φ2 = √ 0 , φ3 = √ 1 .
2 √1 2 −1 2 √1
2 2
These vectors not only span C3 , but form an orthonormal basis. Therefore,
1 1 1
√
4 2 2 4
1 1 1
P3 = φ3 φ∗3 = √ √ .
2 2 2 2 2
1 1
√ 1
4 2 2 4
The operators Pj = φj φ∗j are rank-1 projections with range ker(T −λj 1). Thus
P1 + P2 + P3 = 1 and Pi Pj = Pj Pi = 0 for i 6= j. Because T = P2 + 2P3 , we
see that √ √
1 2 1 1 2
√ 2 + 4 2 − 2 + 4
1/2 1 √1 1
T = P2 + 2P3 = ,
2 √ 2 2√
− 12 + 42 21 12 + 42
as claimed in (9.8).
L ⊆ Span{e1 , . . . , ek } .
k
X
Now let v ∈ L be such that l(v) = k; so, v = αj ej and αk 6= 0. Note that
j=1
S m v ∈ L for all m ∈ Z+ n
0 and that S = 0. When m = k − 1, the result is
k−1
S v = αk e1 . Thus,
e1 = αk −1 S k−1 v ∈ L .
Now with m = k − 2 we have S k−2 v = αk e2 + αk−1e1 , and so
e2 = αk −1 S k−2 v − αk−1 e1 ∈ L .
Inductively, the same type of argument reveals that for any m such that
1 ≤ m ≤ k,
m−1
X
em = αk −1 S m v − αj ej ∈ L .
j=0
9.10 Exercises
1. Prove that T, S ∈ B(H) are equal if and only if hT ξ, ξi = hSξ, ξi, for all
ξ ∈ H.
2. Prove that ran T = (ker T ∗ )⊥ , for every T ∈ B(H).
3. For T ∈ B(H), prove that λ ∈ σd (T ) if and only if λ ∈ σp (T ∗ ).
4. Assume that T ∈ B(H) is an operator of rank m ∈ N.
a) Prove that if m = 1, then there are unit vectors γ, η ∈ H such that
T ξ = hξ, γi η, for all ξ ∈ H.
b) Prove that the rank of T ∗ is m.
5. Prove that if {φk }∞ k=0 is the canonical orthonormal basis of the Hardy
space H 2 (T) and if S ∈ B(H 2 (T)) is the unilateral shift operator, then
S ∗ satisfies Sφk = φk−1 , for all k ∈ N and S ∗ φ0 = 0.
6. With respect to the canonical orthonormal basis {φk }∞ k=0 of the Hardy
space H 2 (T), find the matrix representation S of the unilateral shift op-
erator S ∈ B(H 2 (T)). Viewing S as acting on `2 (N ∪ {0}), determine the
action of the matrix S on a vector ξ ∈ `2 (N ∪ {0}).
7. Prove that if T ∈ B(H) is hermitian, then
c) V ∗ V = 1.
12. Prove that if T ∈ B(H) is invertible, then T = U |T | for some unitary
operator U ∈ B(H).
13. Prove that every projection is a positive operator.
14. Let P ∈ B(H) be a projection other than 0 or 1. Compute σ(P ) and kP k.
15. Assume that V ∈ B(H).
a) If V is a partial isometry. Prove that V ∗ V and V V ∗ are projections
and determine their ranges.
b) If V ∈ B(H) is an operator for which V ∗ V and V V ∗ are projections,
then prove that V is a partial isometry and determine the initial and
final spaces of V .
16. Suppose that T1 , T2 ∈ B(H) are hermitian operators such that T1 T2 =
T2 T1 = 0. Prove that hξ, ηi = 0 for all ξ ∈ ran T1 and η ∈ ran T2 .
17. Prove that Lat T is a lattice, for every T ∈ B(H).
18. Assume that T ∈ B(H) and λ ∈ C. Prove that
20. Let B denote the bilateral shift operator on L2 (T). Prove that the Hardy
space H 2 (T) is invariant under B and that, with respect to the decompo-
sition L2 (T) = H 2 (T) ⊕ H 2 (T)⊥ , B is represented by an operator matrix
of the form
S B12
,
0 B22
where S is the unilateral shift operator on H 2 (T) and B12 6= 0.
21. Prove that the following statements are equivalent for an operator T and
projection P on a Hilbert space H:
a) ran P is reducing for T ;
b) (1 − P )T ∗ P = (1 − P )T P = 0;
c) T P = P T .
22. A subspace L ⊂ H is said to be nontrivial if L is neither {0} nor H.
Suppose that T ∈ B(H).
a) Prove that if H has finite dimension, then T has a nontrivial invariant
subspace. (Hint: think about eigenvectors.)
b) Prove that if H is nonseparable, then T has a nontrivial invariant
subspace. (Hint: if ξ ∈ H is nonzero, consider the subspace generated
by T k ξ for k ∈ N.)
23. Let ψ : [0, 1] → [0, 1] be given by ψ(t) = t and consider the (hermitian)
multiplication operator Mψ on L2 ([0, 1], m).
9.10 Exercises 143
To be completed ...
Part III
Algebras
11
Banach Algebras
Some Banach spaces, such as B(V ) and C0 (X), carry the structure of a
normed algebra in addition to that of a normed vector space. The abstract
properties of such algebras are studied in this chapter. Examples of Banach
algebras are examined in the following chapter.
In this study, all algebras with identity that arise will satisfy k1k = 1.
GL(A) = {x ∈ A | x is invertible in A}
Lemma 11.6. Assume that A is a unital Banach algebra and that x ∈ GL(A).
If h ∈ A satisfies khk < kx−1 k−1 , then x + h ∈ GL(A) and
khk2 kx−1 k2
k(x + h)−1 − x−1 + x−1 hx−1 k < .
kx−1 k−1 − khk
11.2 Invertible Elements and Spectra 151
Proof. If khk < kx−1 k−1 , then kx−1 hk < 1 and so Lemma 11.5 implies that
(1 + x−1 h) ∈ GL(A) and
∞
X
(1 + x−1 h)−1 = (−1)k (x−1 h)k .
k=0
and
∞
!
X
(1 + x−1 h)x−1 − 1 + x−1 h x−1 = (−1)k (x−1 h)k x−1 .
k=2
Thus,
∞
!
X
−1 k
k(x + h)−1 − x−1 + x−1 hx−1 k ≤ kx hk kx−1 k
k=0
khk2 kx−1 k2
≤ ,
kx−1 k−1 − khk
which completes the proof.
Corollary 11.7. GL(A) is an open set in the norm topology of A.
The remaining results below are algebraic versions of theorems in §7.5 that
were proved for Banach space operators. The proofs of these results in fact
did not use in any way the fact that the operators were acting on a Banach
space; rather, the proofs depended only upon analytic and algebraic features
of (the unital Banach algebra) B(V ). For this reason, the proofs of theorems
below are exactly the same as the proofs of the correspnding results in §7.5.
Lemma 11.10. Assume that A is a unital Banach algebra and that x ∈ A.
152 11 Banach Algebras
f (σ(x)) = σ(f(x))
The following theorem is inspired by the fact that the only complex finite-
dimensional associative algebra that is also a division algebra is 1-dimensional
algebra (and field) C.
11.3 Subalgebras
11.5 exercises
Some Banach spaces, such as B(V ), carry the structure of a normed algebra
in addition to that of a normed vector space. The abstract properties of such
spaces are studied in this chapter.
12.6 Exercises
13
C∗-algebras
If H is a Hilbert space, then the Banach algebra B(H) differs from algebras
of the form B(V ), where V is a Banach space, by the fact that the adjoint
operator T ∗ is also an operator on H. Therefore, B(H) is an algebra with
involution T 7→ T ∗ . Banach algebras A with involution have a special theory;
an even richer theory arises if the involution satisfies kx∗xk = kxk2 , for all
x ∈ A. Such algebras are called C∗ -algebras.
kxk2 = kx∗ xk ≤ kx∗k kxk and kx∗ k2 = kx∗∗ x∗ k ≤ kx∗∗k kx∗ k = kxk kx∗k ,
kzbk2 = k(zb)∗ (zb)k = kb∗ (z ∗ z)bk = kb∗(z ∗ z)bk0 ≤ kbk2 kz ∗ zk0 . (13.2)
To show that kz ∗ zk0 ≥ (kzk0 )2 , note that for each ε > 0 there is a b ∈ A with
kbk ≤ 1 such that kzbk > (1 −ε)kzk0 . Thus, kz ∗ zk0 > (1 −ε)2 (kzk0 )2 by (13.2).
160 13 C∗ -algebras
As this is true for every θ ∈ R, ω(h) must be a real number. That is, the
continuous function Γ (h) on sp (A) is real-valued for every hermitian h ∈ A.
Now suppose that x ∈ A is arbitrary. Write x = h + ig, where h, g ∈ A are
hermitian. Thus, x∗ = h − ig and so
∗ ∗
Γ (x∗) = Γ (h) − iΓ (g) = (Γ (h) + iΓ (g)) = (Γ (x)) .
13.4 C∗ -subalgebras 161
Proposition 13.9. If X and Y are compact Hausdorff spaces such that X and
Y are homeomorphic, then the C∗ -algebras C(X) and C(Y ) are isometrically
isomorphic.
13.4 C∗-subalgebras
Definition 13.10. If B is a C∗-algebra, then a subset A ⊆ B is a C∗-
subalgebra of B if A is a C∗ -algebra with respect to the sum, product, involu-
tion, and norm of B. If B is unital and if the multiplicative identity 1 ∈ B
belongs to a C∗ -subalgebra A of B, then A is said to be a unital C∗ -subalgebra
of B.
Definition 13.11. If F ⊂ A is a subset of a C∗ -algebra A, then the C∗-
algebra generated by F is the smallest C∗ -subalgebra of A that contains F
and is denoted by C ∗ (F ).
Recall from the spectral theory of Banach algebras that if C is a unital Banach-
subalgebra of a unital Banach algebra D, then σD (x) ⊆ σC (x) and ∂σC (x) ⊆
∂σD (x). In contrast, the following (extremely important) theorem shows that
the spectrum of an element x in a unital C∗ -algebra is independent of the
unital C∗ -algebra that contains x.
σ(x) = {λ ∈ C | x − λ1 6∈ GL(A1 )} .
The fact that sp A and σ(x) are homeomorphic implies that the C∗ -
algebras C(sp A) and C (σ(x)) are isometrically isomorphic, by Proposition
13.9. The isomorphism ρ : C(sp A) → C (σ(x)) given by Proposition 13.9
satisfies
ρ(f)[λ] = f(ω) , where λ = ω(x) ∈ σ(x) .
Furthermore, the Gelfand transform is an isometric isomorphism C ∗(x, 1) →
C(sp A). Hence, composing the inverses of these two isomorphisms produces
an isometric isomorphism of Φ : C (σ(x)) → C ∗ (x, 1); that is, Φ = Γ −1 ◦ ρ−1 .
Let ι ∈ C (σ(x)) denote the function ι(t) = t; thus, ι∗ ∈ C (σ(x)) is
the function ι∗ (t) = t. To show that Φ(ι) = x, it is sufficient to show that
Φ−1 (x) = ι. Note that Φ−1 = ρ ◦ Γ . If λ ∈ σ(x), then there is a unique
ω ∈ sp A such that ω(x) = λ—equivalently, ψ−1 (λ) = ω. Therefore, by the
following calculation, the function Φ−1 (x) sends λ to λ:
Note that the spectrum of the element given in (13.3) is the range of the
continuous function
Xm X n
Γ αkj xk (x∗ )j ∈ C(sp A) ,
k=0 j=0
namely,
m X n m X
n
j
X X
j
αkj ω(x)k ω(x) | ω ∈ sp A = αkj λk λ | λ ∈ σ(x) .
k=0 j=0 k=0 j=0
Remark. The algebra C ∗(h) does not necessarily contain the identity of A;
thus, the conclusion f(h) ∈ C ∗(h) is sharper than the conclusion of Theorem
13.16, namely that f(h) ∈ C ∗ (h, 1).
Proof. By Lemma 13.17, the condition f(0) = 0 implies that there is a sequence
of polynomials fn for which fn (0) = 0 and |f(t) − fn (t)| → 0 uniformly on X
(and, thus, on σ(h) as well). Since fn (h) ∈ C ∗(h),
lim kf(h) − fn (h)k = lim max |f(t) − fn (t)| = 0 ,
n→∞ n→∞ t∈σ(h)
A+ = {h ∈ A | h is a positive element of A} .
Proof. Let β > 0 be large enough so that σ(h) ∪ σ(b) ⊆ [0, β]. Therefore, for
any g ∈ C([0, β]),
σ(aa∗ ) ⊂ R . (13.6)
Proposition 11.16 asserts that σ(aa∗ ) ∪ {0} = σ(a∗ a) ∪ {0}. Therefore, (13.5)
and (13.6) combine to give
Therefore, the spectral radius of a∗ a is 0. Since the spectral radius and norm
coincide for hermitian elements, a∗ a = 0. That is, 0 = ka∗ ak = kak2 , which
proves that a = 0. Since b2− = −a∗ a = 0 and b− is positive, we obtain
b− = [b2−]1/2 = 01/2 = 0.
h ≤ k denotes k − h ∈ A+ .
The relation “≤” on Asa has the following properties (Exercise 14). If
a, b, c ∈ Asa , then:
1. a ≤ a;
2. If a ≤ b and b ≤ a, then b = a; and
3. If a ≤ b and b ≤ c, then a ≤ c.
That is, “≤” is a partial order on the R-vector space Asa .
Proof. If x ∈ A, then
where z = (k − h)1/2 x.
Via the partial order, the real vector space Asa mirrors some of the prop-
erties of R. Through the notion of modulus, A mirrors some of the properties
of C.
[x] = {y ∈ A | y − x ∈ J} ,
and let
A/J = {[x] | x ∈ A} .
By general ring theory, A/J is a complex algebra under the operations
[x]∗ = [x∗] .
Banach space theory shows that A/J is a Banach algebra under the quotient
norm
k[x]k = inf {kx − bk | b ∈ J} .
The new fact that is proved here is that the quotient norm satisfies the C∗ -
norm axiom k[x]k2 = k[x]∗[x]k.
Proof. As mentioned in the preamble to the statement of the theorem, the only
point to verify is that the quotient norm on A/J satisfies k[x]k2 = k[x]∗[x]k.
To this end, fix x ∈ A and define
and so
The infimum of the left hand side of the inequality above over all b ∈ J yields
k[x]k.
Hence, k[x]k = inf {kx − ek | e ∈ E}, for every x ∈ A, and therefore
= k[x]∗[x]k .
Thus,
k[x]k2 ≤ k[x]∗[x]k ≤ k[x]∗k k[x]k . (13.9)
Conversely, k[x]∗k = inf{kx∗ − b∗ k | b ∈ J} = inf{kx − bk | b ∈ J} = k[x]k,
since J is ∗-closed. Therefore, inequality (13.9) is an equality.
Proof. By Exercise 33, spr ρ(x∗ x) ≤ spr (x∗ x), for all x ∈ A. Thus,
Corollary 13.33. If two C∗ -algebras are isomorphic, then they are isometri-
cally isomorphic.
13.9 States
is a state on C(X).
2. States on B(H). The positive cone of B(H) is
For every unit vector ξ ∈ H, the linear map that sends x ∈ B(H) to
hxξ, ξi ∈ C is a state on B(H). However, not all states on B(H) are of
13.9 States 173
this form. For example, if ξ1 , . . . , ξm ∈ H are unit vectors, then the maps
ϕ : B(H) → C defined by
m
1 X
ϕ(x) = hxξj , ξj i
m
j=1
is a state on B(H).
States on A necessarily map Asa onto R. This can be seen via expressing
h ∈ Asa as h = h+ − h− , where h+ , h− ∈ A+ . Therefore, by expressing any
x ∈ A in terms of its real and imaginary parts, we obtain
For
[x, x]
λ = ,
[y, x]
the inequality above becomes
[x, x]2[y, y]
0 ≤ −[x, x] + ,
|[x, y]|2
since ϕ(1) ≤ 1. Hence |ϕ(x)| ≤ 1, for all x ∈ A with kxk ≤ 1, implies that
kϕk ≤ ϕ(1). By hypothesis, kϕk = 1; therefore, ϕ(1) = 1.
Conversely, suppose that ϕ(1) = 1; that is, kϕk = ϕ(1) = 1. It must
happen that ϕ(Asa ) = R, for if not then there is a hermitian element h ∈ Asa
such that ϕ(h) = α + iβ, where α, β ∈ R and β 6= 0. Therefore, with k =
β −1 (h − α1) ∈ Asa , we would have that ϕ(k) = i and, for each γ ∈ R,
≤ kϕk2 kk + γi1k2
= kk 2 + γ 2 1k
= kk 2 k + γ 2 .
13.10 Representations
13.11 Exercises
then show that the following properties hold for all z, z1 , z2 ∈ A1 and
α ∈ C:
a) kαzk = |α| kzk0,
b) kz1 + z2 k0 ≤ kz1 k0 + kz2 k0 ,
c) kz1 z2 k0 ≤ kz1 k0 kz2 k0 , and
d) kz ∗ k0 = kzk0 .
2. Assume that A is a nonunital C∗ -algebra and that A1 = A × C is its
unitisation. Prove that in the norm k · k0 , A1 is a Banach algebra.
3. Assume that A is a unital C∗ -algebra.
176 13 C∗ -algebras
∞
X
a) Prove that the series z n /(n!) converges in A, for each z ∈ A.
n=0
(Suggestion: show that the partial sums of the series form a Cauchy
sequence.)
∞
X
b) If ez denotes the limit of z n /(n!), then prove that ex+y = ex ey if
n=0
x, y ∈ A commute (xy = yx).
4. Let X be a compact Hausdorff space and let f ∈ C(X). Determine nec-
essary and sufficient conditions on the range of f so that:
a) f is invertible;
b) f is hermitian;
c) f is unitary.
5. If X and Y are locally compact Hausdorff spaces and if X and Y are
homeomorphic, then prove that the C∗ -algebras C0 (X) and C0 (Y ) are
isometrically isomorphic.
6. Show that C0 (R) is nonunital.
7. Prove that if A is a nonunital C∗ -algebra, then 0 ∈ σ(x), for all x ∈ A.
8. Suppose that A is a C∗ -algebra with norm k · k. Prove that if k · k0 is a
norm on A that satisfies all of the axioms of a C∗ -norm, then kxk0 = kxk
for all x ∈ A.
9. If A is a unital C∗ -algebra and if u ∈ A is unitary, then prove that σ(u) ⊆
∂D, where D is the open unit disc of the complex plane.
10. If A is a unital C∗ -algebra and if x ∈ A (not necessarily normal), then
prove or find a counterexample to each of the following statements.
a) σ(x∗ ) = {λ | λ ∈ σ(x)}.
b) x∗x is invertible if x is invertible.
c) x is invertible if x∗ x is invertible.
11. Suppose that X ⊂ R is a compact set such that 0 ∈ X. Prove that if
f ∈ C(X) satisfies f(0) = 0 and if ε > 0, then there is a polynomial p
such that p(0) = 0 and |f(t) − p(t)| < ε for all t ∈ X.
12. In a unital Banach algebra A, an element x ∈ A is quasinilpotent if σ(x) =
{0}, and x is properly quasinilpotent if σ(xy) = {0} for all y ∈ A. Prove
that if A is a unital C∗ -algebra, then the only properly quasinilpotent
element x ∈ A is x = 0.
13. Suppose that A is a unital C∗ -algebra, γ ∈ R+ , and h, k ∈ A+ . Prove that
a) γ h ∈ A+ ,
b) h + k ∈ A+ , and
c) −h ∈ A+ only if h = 0.
14. Suppose that a, b, c ∈ Asa . Prove the following assertions.
a) a ≤ a.
b) If a ≤ b and b ≤ a, then b = a.
13.11 Exercises 177
c) If a ≤ b and b ≤ c, then a ≤ c.
15. Suppose that A is a C∗ -algebra with x ∈ A, h ∈ A+ , and xh = hx. Prove
that xh1/2 = h1/2 x. (Suggestion: show that xf(h) = f(h)x for every
polynomial f.)
16. If A is a C∗ -algebra and if a, b ∈ A+ satisfy a ≤ b and ab = ba, then prove
that a2 ≤ b2 . Show by example that a ≤ b does not always imply a2 ≤ b2
if ab 6= ba.
17. If A is a unital C∗ -algebra, prove that 1 +x∗x is invertible for every x ∈ A.
18. Prove that σ(ab) ⊂ R+ for all positive elements a and b in a C∗ -algebra
A.
19. Prove that if A is a unital C∗ -algebra and if x ∈ A is invertible, then there
is a unitary u ∈ A such that x = u|x|.
20. Prove or find a counterexample to the following assertion in a C∗ -algebra
A: kxk = k |x| k, for every x ∈ A.
21. Prove that if h ∈ Asa , where A is a unital C∗ -algebra, and if α ≥ khk,
then
a) h ≤ α1, and
b) kα1 − hk ≤ α.
22. Prove that if X is a locally compact Hausdorff space, then
23. Consider the 2-dimensional Hilbert space C2 with its canonical inner prod-
uct
ξ1 η1
, = ξ1 η1 + ξ2 η2 .
ξ2 η2
Let x : C2 → C2 be the operator given by the 2 × 2 complex matrix
αβ
x = .
γ δ
permissible operations on sets are. Of all the ZF-axioms, the choice axiom is
possibly the most subtle.
The Axiom of Choice. If {Xα }α∈Λ is a family of subsets of a set X, then
there is a subset Y of X such that Y ∩ Xα is a singleton set, for every α ∈ Λ.
If, in the statement of the Axiom of Choice, the singleton set Y ∩ Xα
is denoted by {xα }, then the axiom asserts that a set Y can be formed by
selecting exactly one element xα from each of the sets Xα . This justifies the
use of the word “choice.”
In the ZF-axioms for set theory, Zorn’s Lemma is a theorem. However, if
one replaces the Axiom of Choice in the ZF-axioms and poses Zorn’s Lemma
as an axiom, then the Axiom of Choice is a theorem. Thus, the Axiom of
Choice and Zorn’s Lemma are logically equivalent. For a detailed discussion,
see [9].
References