Professional Documents
Culture Documents
and
Integral
Jaroslav Lukeš
Jan Malý
matfyzpress
PRAGUE 2005
All rights reserved, no part of this publication may be reproduced or transmitted in any
form or by any means, electronic, mechanical, photocopying or otherwise, without the
prior written permission of the publisher.
ISBN 80-86732-68-1
ISBN 80-85863-06-5 (First edition)
Motto: Everybody writes and nobody reads
L. Fejér
Preface
This text is based on lectures in measure and integration theory given by the
authors during the past decade at Charles University, and on preliminary lecture
notes published in Czech.
It is impossible to thank individually all colleagues and students who assisted
in the preparation of this manuscript, but we will just mention Michal Kubeček
who helped with the translation and TEX processing.
The authors wish to express their deep gratitude to Professor Stylianos Negre-
pontis, who was the chief coordinator of TEMPUS project JEP–1980. Without
support from him and the Tempus programme the manuscript would never have
appeared.
The preparation of this manuscript was partially supported by the grant
No. 201/93/2174 of the Czech Grant Agency and by the grant No. 354 of the
Charles University.
We have carried out only minor corrections. We wish to thank all who con-
tributed by suggestions and comments.
In the history, people were engaged in the problem of measuring lenghts, areas
and volumes. In mathematical formulation the task was, for a given set A, to
determine its size (”measure”) λA. It was required that the volume of a cube or
the area of a rectangle or a circle should agree with the well-known formulae. It
was also clear by intuition that this measure should be positive and additive, i.e.
it should satisfy the equality
λ Aj = λAj
The value λ∗ A (which can also be +∞) is called the outer Lebesgue measure of a
set A.
1.2. Properties of the Outer Lebesgue Measure. One can see immediately
that λ∗ A ≤ λ∗ B if A ⊂ B and that the measure of a singleton is 0, and without
much effort it becomes clear that λ∗ I is the length of I in case of I interval of any
type (see Exercise 1.6). Then it is relatively easy to prove that the outer Lebesgue
measure is translation invariant: If A ⊂ R and x ∈ R, then λ∗ A = λ∗ (x + A).
Another important property is the σ-subadditivity:
∞
∞
λ∗ ( Aj ) ≤ λ∗ Aj .
j=1 j=1
In mathematical terminology, the prefix σ usually relates to countable unions and δ to countable
intersections.
The question of whether λ∗ is an additive set function has a negative answer:
There are disjoint sets A, B with
λ∗ (A ∪ B) < λ∗ A + λ∗ B
(cf. 1.8), and we need to find a family of sets (as large as possible) on which the
measure λ∗ is additive. This task will be solved later in Chapter 4 in a much more
general case. Now we just briefly indicate one of its possible solutions in case of
the Lebesgue measure.
A. Measures and Measurable Functions 3
λ∗ A := λI − λ∗ (I \ A)
In the next part of this chapter we introduce some significant sets on the real
line.
1.8. A Nonmeasurable Set. Now we prove the existence of a nonmeasurable subset of R
and consequently prove that the outer Lebesgue measure cannot be additive.
Set x ∼ y if x − y is a rational number. It is easy to see that ∼ is an equivalence relation
on R. Therefore R splits into an uncountable collection of pairwise disjoint classes. A set V
belongs to this collection if and only if V = x + Q for some x ∈ R. By the axiom of choice,
4 1. The Lebesgue Measure
there exists a set E ⊂ (0, 1) that shares exactly one point with each set V ∈ . We show that
E is not in M.
Let {qn } be a sequence containing all rational numbers from the interval (−1, +1). It is not
very difficult to show that the sets En := qn + E are pairwise disjoint and that
[
(0, 1) ⊂ En ⊂ (−1, 2).
n
S P
Assuming that E ∈ M, then also En ∈ M and Theorem 1.4 gives λ En = λEn . Distin-
n n
guishing two cases λE = 0 and λE > 0 we easily obtain the contradiction.
1.9. Remarks. 1. The proof of the existence of a nonmeasurable set is not a constructive
one (it uses the axiom of choice for an uncountable collection of sets). We return to the topic
of nonmeasurable sets in Notes 1.22.
2. By a simple argument, an even stronger proposition can be proved: Any measurable set
M ⊂S R of a positive measure contains a nonmeasurable subset. It is sufficient to realize that
M = q∈Q M ∩ (E + q) where E is the nonmeasurable set from 1.8 and that any measurable
subset of E is of zero (Lebesgue) measure.
3. Van Vleck [1908] “constructed” a set E ⊂ [0, 1] for which λ∗ E = 1 and λ∗ E = 0.
1.10. Exercise. Show that every countable set S is of measure zero.
S
∞
Hint. Consider covers (rj − ε2−j , rj + ε2−j ) where {rj } is a sequence of all elements of the
j=1
set S. The assertion also follows from Theorem 1.4 if you realize that singletons have measure
zero.
1.11. Examples of Sets of Measure Zero. (a) The set Q of all rational numbers is
countable, thus by Exercise 1.10 it has Lebesgue measure zero.
(b) It can be seen from the hint to the exercise that for every k ∈ N there is an open set
T
∞
Gk such that Q ⊂ Gk and λ∗ Gk ≤ 1/k. The set Gk has also Lebesgue measure zero, it is
k=1
dense and uncountable (even residual).
1.12. Cantor Ternary Set. Consider the sequence { n } of finite collections of intervals
defined in the following way: 0 = {[0, 1]}, 1 = {[0, 13 ], [ 23 , 1]}. In each step we construct n
from n−1 as the collection of all closed intervals which are the left or right third of an interval
from the collection n−1 (the middle thirds are omitted). Then n is a collection of 2n disjoint
−n
T 3 . Let Kn denote the union of the collection n . The
closed intervals, each of them of length
Cantor ternary set 1 C is defined as Kn . It is not difficult to verify that C consists precisely
n
P
∞
of points of the form ai 3−i where each ai is 0 or 2. Roughly speaking, in the Cantor set
i=1
there are exactly those points of the interval [0, 1] whose ternary expansions do not contain the
digit 1. The Cantor set has the following properties:
(a) C is a compact set without isolated points;
(b) C is a nowhere dense (and totally disconnected) set;
(c) C is an uncountable set;
(d) the Lebesgue measure of C is zero.
1.13 Discontinua of a Positive Measure. If we construct a set D ⊂ [0, 1] like the Cantor
set except that we always omit intervals of length ε3−n where ε ∈ (0, 1) (note that their centres
are not the same as those in the construction of the Cantor set), we get a closed nowhere dense
set, for which λD = 1 − ε. Sets having this property are called the discontinua of a positive
measure. Another construction: If G is an open subset of the interval (0, 1), containing all
rational points of this interval and λG = ε < 1 then [0, 1] \ G is a discontinuum of measure 1 − ε.
1.14. Exercise. Prove that there exists a non-Borel subset of the Cantor set and realize that
this set is Lebesgue measurable.
Hint. The cardinality argument shows that the set of all Borel subsets of the Cantor set has
cardinality of the continuum while the set of all its subsets has greater cardinality.
Instead of this, the following idea can be used. Define
where f is the Cantor singular function from 23.1. Show that κ is increasing on the interval [0, 1],
and therefore it is a Borel function. Suppose E is a nonmeasurable subset of [0, 1], B := κ(E).
Then B (as a subset of the Cantor set) is a measurable set. But since κ−1 (B) = E (and κ is a
Borel function), B cannot be a Borel set.
In the same way we define vol I for intervals of other types. Given an arbitrary
set A ⊂ Rn , define the outer Lebesgue measure of A as the quantity
∞
∞
λ∗ A = inf{ vol Ik : Ik ⊃ A, Ik is an open interval}.
k=1 k=1
∞ ∞
λ∗ Aj ≤ λ∗ Aj .
j=1 j=1
Since the difference vol I − vol J can be arbitrarily small, the assertion follows.
1.19. Theorem. (a) Any open subset of Rn is measurable.
(b) If λ∗ A = 0, then A is measurable.
Proof. The proof of part (b) is obvious; we will prove (a). First we prove that each
interval H which is a halfspace (e.g. of the form (−∞, c) × Rn−1 ) is measurable.
Choose a “test” set T , λ∗ T < ∞, and ε > 0. There exist open intervals {Qj }
with
Qj ⊃ T and vol Ij < λ∗ T + ε.
j j
The reader should compare the following theorem and Exercise 15.19.
A. Measures and Measurable Functions 7
Proof. The implication (i) =⇒ (ii) is trivial. Assuming (ii), fix ε > 0 and denote
Ik = (−k, k)n . By Theorem 1.20 we can find open sets Gk and Hk such that
Ik ∩M ⊂ Gk , Ik \M ⊂ Hk , λGk ≤ λ∗ (Ik ∩M )+2−k ε and λHk ≤ λ∗ (Ik \M )+2−k ε.
We can assume that Gk and Hk are subsets of Ik . Then we have Gk \M ⊂ Gk ∩Hk .
Using (ii) and the measurability of open sets we obtain
λIk +λ(Gk ∩Hk ) = λGk +λHk ≤ λ∗ (Ik ∩M )+λ∗ (Ik \M )+2−k+1 ε ≤ λIk +2−k+1 ε.
Set G = Gk . Then
k
∞
λ∗ (G \ M ) ≤ λ(Gk ∩ Hk ) ≤ 2ε
k=1
so that (iii) holds. That (iii) implies (iv) is evident. It is not very difficult to
prove the implication (iv) =⇒ (v). If M satisfies (v), then M = Bi ∪ (M \ Bi )
where the sets Bi and M \ Bi are measurable by Theorem 1.19 (each one for a
different reason), so that (v) =⇒ (i).
1.22. Notes. Originally, H. Lebesgue defined the outer measure on the real line using
countable covers formed by intervals, exactly as explained in the text. He defined measurability
as in Exercise 1.7.
At the end of the last century, various attempts to define the length or area of geometrical
figures appear; in the works of G. Peano [1887] and C. Jordan [1892] even the “measures” of
more complicated sets are considered.
The existence of a Lebesgue nonmeasurable set is very closely connected to the axiom of
choice (for uncountable collections of sets) and the assertion that such sets exist was first proved
by G. Vitali [*1905]. Solovay’s result [1970] says that there exist models of the set theory (of
course not satisfying the axiom of choice) in which every subset of real numbers is Lebesgue mea-
surable. The existence of a nonmeasurable set can be proved (assuming various set conditions)
in other ways as well. Constructions of Bernstein’s sets (still assuming the axiom of choice) as
examples of nonmeasurable sets are also interesting. Another construction of a nonmeasurable
set (the axiom of choice again) based on results of the graph theory comes from R. Thomas
[1985]. Using nonstandard methods, it is possible to prove the existence of a nonmeasurable set
assuming the existence of ultrafilters (a weaker form of the axiom of choice; cf. M. Davis [*1977]).
Recently, M. Foreman and F. Wehrung [1991] proved that the existence of a nonmeasurable set
follows from the Hahn-Banach Theorem (which is again a weaker assumption than the axiom
of choice).
Let us note that the Lebesgue measure can be extended to a “translation invariant” mea-
sure defined on a wider σ-algebra than is the collection of all Lebesgue measurable sets. The
construction can be found e.g. in S. Kakutani and J.C. Oxtoby [1950]. However, the Lebesgue
measure cannot be extended in a reasonable way to the collection of all subsets of Rn .
It is interesting that in R or R2 there exist finitely additive extensions of the Lebesgue
measure to the collection of all subsets which can also be invariant with respect to translations
8 2. Abstract Measures
and rotations. This was first proved by S. Banach [1923]. However, this result cannot be
transferred to spaces of higher dimensions as follows from the famous result of S. Banach and
A. Tarski [1924]:
If U and V are arbitrary (!) bounded and open sets in the space Rn , n ≥ 3, then there S exist
S
sets E1 , . . . , Ek and F1 , . . . , Fk such that Ei ∩ Ej = ∅ = Fi ∩ Fj for i
= j, U = Ei , V = Fi
and Ej are isometric copies of Fj .
In this theorem, which is known as the Banach-Tarski paradox, in general all the sets Ei and
Fi cannot be measurable; realize that U , V can be of different measures. More information is
contained in S. Wagon [*1985].
2. Abstract Measures
In this chapter we study an abstract notion of measure which stands as a basis
for modern integration theory. Also in probability theory, the notion of measure
(termed a “probability” there) plays a crucial role. Among many fields of analysis
which employ measures in an essential way, let us mention e.g. functional analysis,
theory of function spaces and theory of distributions, or mathematical modelling
of physical quantities.
So far we have the only nontrivial example of the Lebesgue measure. Further
important examples of measures will be introduced later.
Remember that the Lebesgue measure is not defined on the collection of all
subsets of R but on its subcollection which is closed under countable operations.
We start with the following definition.
2.1. σ-algebras. A collection S of subsets of a given set X is called a σ-algebra
if
(a) X ∈ S ;
(b) if A ∈ S , then X \ A ∈ S ;
∞
(c) if An ∈ S , then An ∈ S .
n=1
The pair (X, S ) is called a measurable space.
Clearly every σ-algebra is also closed under countable intersections, under dif-
ferences and contains the empty set.
Not every collection of sets is a σ-algebra. However, if T is an arbitrary family
of subsets of X, then there exists the smallest σ-algebra σ(T ) which contains T .
Such a σ-algebra is simply the intersection of all σ-algebras (in X) which contain
T . It surely exists, since there is at least one such a σ-algebra (the σ-algebra
P(X) of all subsets of X) and the intersection of any collection of σ-algebras is
again a σ-algebra. The collection σ(T ) is called the σ-algebra generated by T .
2.2. Examples. One of the most important examples is the σ-algebra M of all Lebesgue
measurable sets on the real line. Another important class of examples yields Borel σ-algebras
from 2.3. For illustration, we add a few simple examples. Suppose X is an arbitrary set. Then
(a) {∅, X} is a σ-algebra;
(b) the collection (X) of all subsets of X is a σ-algebra;
(c) {A ⊂ X : A is countable or X \ A is countable} is a σ-algebra.
2.3. Borel Sets. Let P be a topological space. The σ-algebra B(P ) generated
by the family of all open subsets of P is called the Borel σ-algebra of P ; its
A. Measures and Measurable Functions 9
elements are called Borel sets. The Borel σ-algebra B(P ) contains all closed
sets, all countable intersections of open sets (these sets are called Gδ sets), all
countable unions of closed sets (Fσ sets), all countable unions of Gδ sets (Gδσ
sets), all countable intersections of Fσ sets (Fσδ ) and so on. Let us note that for
the complete description of all possible “types” we would have to use (in nontrivial
cases) all countable ordinal numbers.
2.4. Measures. Let S be a collection of subsets of a set X. A nonnegative set
function μ : S → [0, ∞] is called a measure if
(a) S is a σ-algebra;
(b) μ∅ = 0;
∞
such that μMn < +∞ and X = Mn . If μX = 1, then we say that μ is a
n=1
probability measure. A measure μ is said to be complete if whenewer B ∈ S is a
null set and A ⊂ B, then also A ∈ S (and μA = 0).
If μ is a measure on X and E ∈ S , we define μE A = μ(A ∩ E) for A ∈ S .
A related notion is the restriction μ|A of a measure μ to the set A ∈ S : If
we denote by SA the σ-algebra {M ∈ S : M ⊂ A} of subsets of A, we define
μ|A (M ) = μ(M ) for M ∈ SA . Finally, if T ⊂ S is a σ-algebra of subsets of X,
then the symbol μ|T denotes the measure E → μE, E ∈ T .
2.5. Examples. (a) The Lebesgue measure on the collection of all Lebesgue measurable sets
in Rn . This measure is complete, σ-finite, but not finite.
(b) Counting measure. Let X be an arbitrary set, = (X) the collection of all subsets
of X and (
the number of elements of A if A is finite,
μA =
+∞ if A is infinite.
The counting measure is complete; it is σ-finite if and only if X is countable, and finite just
when X is finite.
(c) The Dirac measure. Again, let X be an arbitrary set, x ∈ X, = (X). We define
(
1, if x ∈ A,
μA =
0, if x ∈ X \ A.
The measure μ is called the Dirac measure at x and it is denoted by εx . The Dirac measure is
a complete probability measure.
(d) Trivial measures. If is an arbitrary σ-algebra of subsets of X and μA = 0 for all
A ∈ , then μ is an example of a finite measure which is not complete provided
= (X).
(b) if A1 , A2 , · · · ∈ S , A1 ⊃ A2 ⊃ . . . , and μA1 < ∞, then μ ( An ) =
lim μAn ;
(c) if A1 , A2 , · · · ∈ S , then μ ( An ) ≤ μAn .
B1 = A1 , B2 = A2 \ A1 , B3 = A3 \ (A1 ∪ A2 ), . . . ,
μ(M ∪ N ) = μM, M ∈ S, N ∈ N .
Clearly the value of μE does not depend on the choice of M and N . The set
function μ on S is called the completion of μ.
2.8. Theorem. S is a σ-algebra containing S and μ is a complete measure
on S which coincides with μ on S .
Proof. Obviously, S ⊂ S . Since both S and N are closed under countable
unions, the same is true for S . If E ∈ S , there are M ∈ S , N ∈ N and B ∈ S
so that N ⊂ B, μB = 0 and E = M ∪N . Then X \E = (X \ (M ∪ B))∪(B \N ) ∈
S . It is easy to verify that μ is a complete measure on S and μ = μ on S .
2.9. Remark. Note that there is a variety of extensions of a measure to a complete measure.
The completion described above is uniquely determined and in some sense minimal (cf. Exercise
2.13).
A. Measures and Measurable Functions 11
2.15. Exercise (Darboux property). Let (X, , μ) be a measure space. If μ does not have
an atom, show that {μA : A ∈ } = [0, μX]. (A set A ∈ is called an atom if μA > 0 and for
each set B ∈ , B ⊂ A, either μB = 0 or μ(A \ B) = 0.)
Hint. Fix 0 < α < μX and set = {A ∈ : μA ≥ α}. There exists a set C ∈ such that
μA = μC for all A ∈ , A ⊂ C. In a similar way find D ∈ , D ⊂ C, μD ≤ α with the
following property: μB = μD for each set B ∈ for which D ⊂ B ⊂ C and μB ≤ α. Because
C \ D cannot be an atom, it follows μC = μD = α.
2.16. Notes. In [1895] and [*1898], E. Borel extended the length of intervals to a set function
(measure) defined on the collection of all Borel sets. However, H. Lebesgue was the first who
created the theory of the integral with the help of this measure. J. Radon in [1913] defined a
general notion of (Borel) measures in Euclidean spaces. A.N. Kolmogorov introduced in [*1933]
the axiomatic theory of probability measures.
The Darboux property of real-valued measures is a special version of the general Lyapunov
theorem (A. Lyapunov [1940]) which says that the range of a finite-dimensional nonatomic vector
measure is a convex compact set.
12 3. Measurable Functions
3. Measurable Functions
As mentioned in the first chapter, there are serious reasons why the Lebesgue
measure is not defined on the collection all subsets of Rn . Likewise, one cannot
expect a reasonable theory of integration on the class of all functions; it will be
necessary to confine to some reasonable family of functions. This class of course
should contain the indicator functions of all measurable sets and should be closed
under all common (algebraic as well as limit) operations. As we will see, these
requirements are satisfied by the following natural definition.
In what follows, we assume that S is a σ-algebra of subsets of a given set X.
3.1. Measurable Functions. Suppose D ∈ S . A function f : D → R is said
to be S -measurable on D if {x ∈ D : f (x) > α} ∈ S for each α ∈ R.
A complex function on D is S -measurable if its real and imaginary parts are
S -measurable.
3.2. Examples. (a) If is a σ-algebra of all subsets of X, then every function on X is
-measurable.
(b) Only the constant functions are -measurable if = {∅, X}.
(c) If is the σ-algebra of all Borel sets of a topological space X, then -measurable
functions are called shortly Borel functions on X.
3.3. Remark. Since any σ-algebra is closed for complementation, a function f is -
measurable if and only if {f ≤ α} ∈ for each α ∈ R. Because
∞ j
\ ff
1
{f ≥ α} = f >α− ,
n=1
n
Proof. We will just indicate the ideas of some of the proofs leaving the others as
an exercise for the reader. The assertion (c) is obvious.
(a) Suppose α ∈ R. Then {f + g > α} = {f > α − g} (and the function α − g
is S -measurable). The functions |f |, f 2 and 1/g are S -measurable by (c). Then
we can use the formulae
1
max(f, g) = (f + g + |f − g|),
2
1
fg =((f + g)2 − f 2 − g 2 ).
2
(b) As a hint let us just note that
{sup fn > α} = {fn > α}.
n
3.6. Exercise. (a) Show that a real-valued function f is -measurable if and only if
f −1 (G) ∈ for each open set G ⊂ R or, if and only if f −1 (B) ∈ for each Borel set B ⊂ R.
S
Hint. Since each open set G ⊂ R can be expressed in the form G = (an , bn ) (where (a, b) =
(−∞, b) ∩ (a, +∞)), we can see that a function f is -measurable if and only if f −1 (G) ∈
for each open set G ⊂ R (note that {f > α} = f −1 ((α, +∞))!). Then consider the collection
{B ⊂ R : B Borel, f −1 (B) ∈ },
and show that it is a σ-algebra containing all open sets.
(b) The characterization given in (a) cannot be used for functions having infinite values
unless we introduce the notions of open or Borel subsets of R.
3.7. Exercise. Let {fn } be a sequence of -measurable functions. Show that the sets
{x ∈ X : lim fn (x) exists } and {x ∈ X : lim fn (x) exists and is finite }
are in .
3.8. Simple Functions. By a simple function on X we understand a (finite)
linear combination of indicator functions of sets from S . In other words, a real-
(or complex-)valued function f is simple if f is S -measurable and f (X) is a finite
n
set. Thus every simple function is of the form λi cAi where λi are numbers and
i=1
Ai ∈ S . Note that this expression is not uniquely determined!
3.9. Theorem. Let f ≥ 0 be an S -measurable function on X. Then there
exists a sequence {fn } of nonnegative simple functions on X such that fn f .
Proof. For n ∈ N and k = 1, 2, . . . , n2n set
k−1 k
Fn,k = x ∈ X : ≤ f (x) < n
2n 2
and define k−1
2n , if x ∈ Fn,k ,
fn (x) =
n if x ∈ X \ Fn,k .
k
It is easily seen that fn are simple and fn f .
3.10. Exercise. Suppose f , fn have the same meaning as in the previous theorem. Show
that fn ⇒ f on every set on which f is bounded.
14 3. Measurable Functions
In this chapter, we will construct measures from the so-called outer measures.
This is a very useful method used already before when we introduced the Lebesgue
measure in Rn . We also fulfil our promise to prove propositions from the first
chapter.
4.1. Outer Measure. By an outer measure on a set X we understand a set
function γ which assigns to every set A ⊂ X a nonnegative number γA (real or
+∞) such that the following conditions are satisfied:
(a) γ∅ = 0;
⊂ B, then
(b) if A
γA ≤ γB;
(c) γ ( An ) ≤ γAn .
The property (c) is called the σ-subadditivity of an outer measure.
4.2. Examples of Outer Measures. (a) The outer Lebesgue measure in Rn .
(b) The Hausdorff outer measure from Chapter 36.
(c) The counting measure.
(d) If (X, , μ) is a measure space, then the set function μ∗ : A → inf{μM : M ∈ :M ∈
, M ⊃ A} is an outer measure. See also Exercise 4.8.
Then
Gjn ⊃ A and ν̃An ≥ ν̃A − ε.
n,j n
γT = γ(T ∩ M ) + γ(T \ M )
for each “test” set T ⊂ X (in other words, if M splits additively each set in X).
The collection of all γ-measurable sets will be denoted by M(γ). To prove that a
set M is γ-measurable, it is sufficient to verify the inequality
γT ≥ γ(T ∩ M ) + γ(T \ M )
γT = γ(T ∩ M ) + γ(T \ M )
and
γ(T ∩ M ) = γ(T ∩ M ∩ N ) + γ((T ∩ M ) \ N ).
Now we use the test set T \ (M ∩ N ) and γ-measurability of M to get
Since M(γ) is closed under complements and finite intersections, it is closed also
under finite unions.
A. Measures and Measurable Functions 17
and since the reverse inequality always holds we reach the conclusion.
∞
(c) Let now Mn ∈ M(γ) be pairwise disjoint. Our aim is to show that Mn ∈
n=1
M(γ). Choosing a test set T ⊂ X, we have
∞
k k
k
γT = γ T \ Mn + γ T ∩ Mn ≥ γ T \ Mn + γ(T ∩ Mn )
n=1 n=1 n=1 n=1
This chapter will be devoted to a study of various classes of sets and set func-
tions from the point of view of measure theory. We prove theorems on extensions
of set functions to larger families of sets as important steps in constructing mea-
sures.
18 5. Classes of Sets and Set Functions
∞
(d3) if An ∈ A are pairwise disjoint, then An ∈ A ;
n=1
(b) if {A
j } ⊂ A
is a sequence of pairwise disjoint sets and Aj ∈ A , then
μ ( Aj ) = μAj .
A premeasure is in fact a σ-additive set function defined on a ring of sets. We
say that a premeasure μ on X is σ-finite
if there exists a sequence Xj of sets from
A such that μXj < ∞ and X = Xj .
5.3. Examples. (a) Denote by the collection of all intervals on R including the degenerated
ones (i.e. the empty set and the singletons) and by b the collection of all bounded intervals
from . Further, let l be the collection of all intervals of the form [a, b) together with ∅, R
and the intervals of the form (−∞, b). Finally, let lb = l ∩ b .
(a1) The collections , b, b
l, l form semirings which is not true for the collection of all
open (or closed) intervals.
(a2) Finite unions of sets from b or lb form a ring. Finite unions of sets from or l
form even an algebra or l . The set function I → vol I can be (uniquely) extended to a
premeasure on or l .
(a3) Closing or l under countable unions, we do not get even a semiring: Substracting
a countable union of open intervals from [0,1] we get the Cantor set which is not a countable
union of intervals.
(b) If and are semirings on X, then the collection of all sets of the form A × B where
A∈ and B ∈ forms a semiring on X × X.
(c) Let be a semiring. Then the set of all finite disjoint unions of elements from is a
ring (in fact, the smallest ring containing ).
(d) Consider the family of all subsets of the set {1, 2, . . . , 20} with an even number of
elements and show that is a Dynkin class but not a semiring.
(e) The collection of all countable subsets of a set X is a σ-ring which for is not a σ-algebra
unless X is countable.
A. Measures and Measurable Functions 19
(f) The collection of all Lebesgue measurable sets of finite measure forms a δ-ring.
(g) Suppose that is the collection of all unions of a finite number of intervals (including
the degenerated ones) and = {A ∩ Q : A ∈ }. Then is an algebra of subsets of Q and
ν : A ∩ Q → λA is a finitely additive set function which is not a premeasure.
(h) Let be the algebra of all finite unions of intervals on (0, 1). Define a set function ν on
by the formula
(
1 if A contains some interval of the form (0, ε), ε > 0,
μ(A) =
0 in other cases.
so that νG ≤ ν̃G.
Now suppose G ∈ G . Choose a test set T ⊂ X, ν̃T < +∞ and ε > 0. There exist
Gn ∈ G such that
T ⊂ Gn and νGn ≤ ν̃T + ε.
n n
Then
ν̃T + ε ≥ νGn = (ν(Gn ∩ G) + ν(Gn \ G)) ≥ ν̃(T ∩ G) + ν̃(T \ G),
n n
thus G ∈ M(ν̃).
5.5. Hopf ’s Extension Theorem. Let μ be a premeasure on a ring A of
subsets of a set X. Then there exists a measure μ̃ on σ(A ) which equals μ on A .
This extension of μ is unique provided μ is σ-finite.
Proof. The existence of a measure μ̃ is an immediate consequence of Theorems 4.3
and 5.4 (notice that σ(A ) ⊂ M(μ̃)). To prove uniqueness, under the σ-finiteness
assumptions, let ν be another measure on σ(A ), ν = μ on A . One can easily find
out (from the construction
of the outer measure μ̃) that ν ≤ μ̃ on σ(A ). Then
for Aj ∈ A , A = Aj we have
j
⎛ ⎞ ⎛ ⎞
n
n
μA = lim ν ⎝ Aj ⎠ = lim μ ⎝ Aj ⎠ = μ̃A.
n n
j=1 j=1
20 5. Classes of Sets and Set Functions
So if
E ∈ σ(A ), μ̃E < ∞, then for a given ε > 0 there exist sets Aj ∈ A ,
A = Aj such that E ⊂ A and μ̃A < μ̃E + ε. Hence
j
thus μ̃E = νE. Finally suppose X = Xj , μXj < +∞; one can assume that Xj
j
are pairwise disjoint. If E ∈ σ(A ), then
μ̃E = μ̃(E ∩ Xj ) = ν(E ∩ Xj ) = νE.
j j
5.6. Exercise. Let be a family of subsets of a set X. Show that there exists the smallest
Dynkin class ( ) containing .
5.7. Exercise. Show that every σ-algebra is a Dynkin class. A Dynkin class is a σ-algebra if
and only if it is a π-system.
5.8. Exercise. Prove that if is a π-system, then ( ) = σ( ).
5.9. Exercise. The assumptions of Hopf’s extension theorem can be weakened. Indeed, show
that the assertion of Theorem 5.4 is still true if we only assume that μ is finitely additive and
σ-subadditive on the semiring and that μ∅ = 0.
Hint. First note that μ is monotone. Then proceed as in Theorem 5.4 and show that σ( ) ⊂
j
(μ̃). In the essential step use the existence of sets Cn ∈ with the property Gn \ (G ∩ Gn ) =
S j
Cn .
j
where C is the Cantor set and f the Cantor function from 23.1.
Show that is a semiring and that ν is finitely additive but not σ-additive on .
5.12. Exercise. (a) Let X be an arbitrary set and n ∈ N. Consider the set function ν which
is restriction of the counting measure to the collection
and construct the outer measure ν̃ as in Theorem 4.3. Investigate the relationship between the
collections n and M(ν̃) and compare with Theorem 5.4.
(b) Investigate the same problem in case of X = (0, 1), = (X) and
X
k [
k
νA := inf{ (bi − ai ) : (ai , bi ) ⊃ A}
i=1 i=1
μ∗ A := μ∗ X − μ∗ (X \ A).
Show that A ∈ M(μ∗ ) if and only if μ∗ A = μ∗ A (compare also with Theorem 5.4).
5.14. Capacity on Compact Sets. Let X be a locally compact topological space. A
real-valued nonnegative function defined on the collection (X) of all compact subsets of X
is called a Choquet capacity if it satisfies the following conditions:
(a) if K1 ⊂ K2 , then (K1 ) ≤ (K T2 );
(b) if K1 ⊃ K2 ⊃ K3 . . . , then ( Kn ) = lim (Kn );
(c) (K1 ∩ K2 ) + (K1 ∪ K2 ) ≤ (K1 ) + (K2 ) whenever Kn ∈ (X) (so-called strong
subadditivity).
An important example is the Newtonian capacity in Rn , n ≥ 3. If we define the Newtonian
potential of a Radon measure μ on Rn by
Z
μ dμ(t)
(x) := n−2
,
Rn |x − t|
The proof that this capacity satisfies the conditions (a), (b), (c) can be found for instance in
[KNV].
5.15. Outer Capacity. Let X be a topological space. A mapping c : (X) → [0, +∞] is
called an outer capacity provided it satisfies:
(a) if A ⊂ B, then cA ≤ cB; S
(b) if A1 ⊂ A2 ⊂ A3 ⊂ . . . , then c ( An ) = sup cAn ; T
(c) if K1 ⊃ K2 ⊃ K3 ⊃ . . . and Kn are compact, then c( Kn ) = inf c Kn .
A set A ⊂ X is said to be c-capacitable if
cA = sup{cK : K ⊂ A, K compact}.
An investigation of the theory of capacities in the classical potential theory during the period
1920–1950 is connected with the names of Ch. de la Vallée Poussin, N. Wiener, O. Frostman or
M. Brelot among others. These authors studied mainly the Newtonian capacity and examined
the role of sets of a “small” capacity. One could say that the capacity theory is a “younger
sister” of Lebesgue measure theory. In the 50’s the general capacity theory was developed and
G. Choquet proved his famous capacitability theorem. An interested reader is referred to a nice
article written by Choquet himself [1986] or [1989].
In this chapter, we will investigate “measures” assuming also negative (or even
complex) values.
6.1. Signed Measures. Let S be a σ-algebra of subsets of X. A set function
μ : S → R is said to be a signed measure on S if
(a) μ∅= 0;
∞
∞
(b) μ An = μAn whenever An ∈ S are pairwise disjoint.
n=1 n=1
6.2. Remarks. 1. A signed measure can assume at most one of the values +∞ and
−∞. Indeed, if μE = +∞ and μF = −∞ for E, F ∈ , then μ(E ∩ F ) is finite. Therefore
μ(E \ F ) = +∞, μ(F \ E) = −∞ and the equality (b) does not hold for (disjoint) sets E \ F
and F \ E.
S P
2. If μ( An ) is finite, then the series μAn in the condition (b) converges absolutely (indeed,
this series converges — to the same value — when its terms are rearranged arbitrarily, which is
equivalent to the absolute convergence).
3. Some of basic properties of positive measures hold even for signed measures. Show that the
following assertions are true:
(a) if An ∈ , An A , then μAn → μA,
(b) if An ∈ , An A, |μA1 | < +∞ , then μAn → μA.
4. A signed measure μ is not necessarily monotone: If A ⊂ B, we can no longer assert that
μA ≤ μB.
6.3. Hahn Decomposition. We say that a set P ∈ S is positive for a signed
measure μ if μE ≥ 0 for each set E ∈ S , E ⊂ P . Analogously we define negative
sets for μ.
An ordered pair of sets (P, N ) is called a Hahn decomposition of X for μ if
(a) P ∪ N = X, P ∩ N = ∅;
(b) P is positive and N is negative (for μ).
6.4. Examples. (a) The empty set is both positive and negative for any signed measure. The
pair (X, ∅) is a Hahn decomposition for μ if and only if μ is positive.
(b) The pairs (R, ∅) , (R \ Q, Q) , (R \ {5}, {5}) are Hahn decompositions of R for the
Lebesgue measure.
R
(c) If a measure μf has a density f with respect to μ (i.e. if μf A = A f dμ where f ∈ ∗ (μ)
— cf. 8.19), then the set P := {f ≥ 0} is positive for μf and ({f ≥ 0}, {f < 0}) is a Hahn
decomposition for μf .
Proof. We start with proof of the uniqueness assertion. Suppose that (P1 , N1 )
and (P2 , N2 ) are Hahn decompositions of X for μ. Then evidently
1
kn = min k ∈ N : there exists E ⊂ An with μ(E) ≤ −
k
∞
and find An+1 ⊂ An such that μ(An \ An+1 ) ≤ − k1n . Then we set B = An .
n=1
The sets An \ An+1 are pairwise disjoint and their union is A \ B. Thus
∞
∞
1
+∞ > μ(B) = μ(A) − μ(An \ An+1 ) ≥ μ(A) + .
n=1
k
n=1 n
μ+ E = μ(E ∩ P ) = ν1 (E ∩ P ) − ν2 (E ∩ P ) ≤ ν1 (E ∩ P ) ≤ ν1 E.
An important characterization of the total variation of a signed measure, often used as its
definition, is contained in the following theorem.
∞
μ Ak = μAk whenever {Ak } is a sequence of pairwise disjoint sets from
k=1 k=1
S . Let us note that, as in Remark 6.2.2, the appearing series must converge
absolutely or definitely diverge.
Each complex measure μ can be expressed uniquely in the form μ = μr + iμi
where μr and μi are (finite!) signed measures on (X, S ). In particular, each finite
signed measure can be understood as a complex measure.
If μ is a complex measure, we define its total variation |μ| by
n
n
|μ| (E) = sup |μAk | : Ak ∈ S pairwise disjoint, Ak = E .
k=1 k=1
An appeal to Theorem 6.9 reveals that this definition agrees with the previous
one for signed measures.
6.11. Theorem. Let μ be a complex measure on (X, S ). Then the total
variation |μ| is a finite positive measure on (X, S ).
A. Measures and Measurable Functions 25
Proof. Apparently, |μ| (∅) = 0 and it is simply to verify that |μ| is finitely additive.
If μ = μ1 − μ2 + i(μ3 − μ4 ) where μ1 − μ2 is the Jordan decomposition of μr and
μ3 − μ4 is the Jordan decomposition of μi (i.e. μj are positive finite measures),
then
|μ| (A) ≤ μ1 A + μ2 A + μ3 A + μ4 A,
so that |μ| (A) < +∞. If An ∈ S , An ∅, then lim μj An = 0 for each
j = 1, 2, 3, 4, and therefore |μ| (An ) → 0. A routine argument now shows that μ
is σ-additive.
6.12. Remark. Note that the range of any complex measure is always a bounded subset of
the complex plane C.
6.13. Exercise. Let μ be a signed measure on (X, ) and E ∈ . Show that μ+ E =
sup{μB : B ∈ , B ⊂ E}.
6.14.
j ∞Exercise. Let μ be a complex measure on (X, ff ) and E ∈ . Prove that |μ| (E) =
P S
∞
sup |μAk | : Ak ∈ pairwise disjoint, Ak = E .
k=1 k=1
6.15. Exercise. Let μ be a complex measure on (X, ). Prove that |μ| is the smallest
positive measure ν satisfying νA ≥ |μ| A for all A ∈ .
q
6.16 Exercise. Investigate whether |μ| = |μr | + |μi |, or |μ| = |μr |2 + |μi |2 .
6.17. Exercise. We denote by M( ) the set of all finite signed or complex measures on
(X, ). If μ ∈ M( ), let μ = |μ| (X). Show that (M( ), ·) is a Banach space (i.e. M( )
is a linear space, · becomes a norm and M( ) is complete with respect to it).
Hint. Only the completeness requires a proof. But if μn is a Cauchy sequence in M( ), then
there exists lim μn A for each A ∈ . Defining μA = lim μn A, it is not hard to show that μ is
σ-additive and μn − μ → 0.
S
n P
If A = [ai , bi ), define νA = (f (bi ) − f (ai )). Show that neither the “Jordan decomposition
i=1
theorem”, nor the “Hahn decomposition theorem” do hold for ν. Proceed via the following
steps:
S
(a) the definition of νA does not depend on the particular representation of A as [ai , bi );
(b) ν is a charge on ;
(c) νA ≤ 0 for A ⊂ (0, 1);
(d) ν([0, 1)) = 1;
(e) ν + ([0, 1)) = ν − ([0, 1)) = +∞.
6.23. Exercise. Let ν be a charge on . Show that ν = ν + − ν − if and only if ν is bounded
(i.e. if sup{|νA| : A ∈ } < +∞).
6.24. Notes. Signed measures were investigated by H. Lebesgue already in [1910]; he was
concerned mostly with measures given by a density (see 8.19).
The existence of a Hahn decomposition of the space and the Jordan decomposition of signed
measures were first established in a full generality by Hahn [*1921].
The monograph by K.P.S. Bhaskar Rao and M. Bhaskar Rao [*1983] is devoted to the theory
of charges.
B. The Abstract Lebesgue Integral 27
(A function f is said to be piecewise constant on [a, b] if there exist ξj such that a = ξ0 < ξ1 <
· · · < ξm = b and f is constant on each interval (ξj−1 , ξj ).)
(b) If f is a bounded function on an interval (a, b), then the function
f ∗ := inf{g : g ≥ f, g continuous}
is called the upper Baire function of f . The definition of the lower Baire function f∗ of f is
analogous.
Show that the function f ∗ is upper semi-continuous and that f is continuous at a point x if
and only if f ∗ (x) = f∗ (x).
(c) Show that
Z b
∗
f = inf{R g : g ≥ f, g continuous }.
a
(d) Let f be a bounded function on an interval (a, b). Show that the following conditions are
equivalent:
(i) f is Riemann integrable;
R
(ii) for each ε > 0 there exist continuous functions t, s such that t ≤ f ≤ s and R ab (s−t) <
ε;
(iii) f∗ = f ∗ almost everywhere;
(iv) there exist functions u, v; u upper semi-continuous, v lower semi-continuous such that
v ≤ f ≤ u and u = v almost everywhere;
(v) the set of all points where f is discontinuous is of Lebesgue measure zero.
Hint. Use (b) and (c). For (ii) =⇒ (iii), according to 7.8, λ∗ {f ∗ − f∗ ≥ 1/k} ≤ λ∗ {s − t ≥
1/k} ≤ kε for each k ∈ N and ε > 0.
(e) Show that any Riemann integrable function is measurable.
Hint. Any semi-continuous function is measurable. Now use (d) and Theorem 3.14.b.
(f) The above condition (v) permits to construct bounded Newton integrable functions which
are not Riemann integrable. An example of Volterra type functions constructed with the aid of
closed nowhere dense sets of positive measure (Example 1.13) can be found in A.M. Bruckner
[*1978].
7.10. Notes. The origins of the integral calculus are connected with the names of I. Newton
and G.W. Leibniz. The modern integration theory has been developed from the beginning of
the 19th century by A.L. Cauchy, L. Dirichlet, B. Riemann, C. Jordan, E. Borel, H. Lebesgue,
G. Vitali and others. Some historical treatments on integration are mentioned in 8.25.
B. The Abstract Lebesgue Integral 29
m
Proof. To simplify the proof define α0 = β0 = 0, A0 = X \ Ai and B0 =
i=1
n
X\ Bj (then the collections {Ai }, {Bj } form “partitions” of X). For i ∈
j=1
{0, . . . , m} and j ∈ {0, . . . , n} either Ai ∩ Bj = ∅ or αi ≤ βj . Thus
m
m
n
m
n
n
αi μAi ≤ αi μ(Ai ∩ Bj ) ≤ βj μ(Ai ∩ Bj ) ≤ βj μBj .
i=0 i=0 j=0 i=0 j=0 j=0
if the integral on the right side is defined. It is immediate that this value does
not depend on the choice of D.
The symbol L ∗ or L ∗ (μ) denotes the family of all μ-measurable functions
defined μ-almost everywhere on X for which the abstract Lebesgue integral is
defined.
B. The Abstract Lebesgue Integral 31
Further denote
∗
L = L (μ) =
1 1
f ∈ L (μ) : f dμ ∈ R .
X
8.5. Theorem.
If fn ≥ 0 are μ-measurable functions on X, fn f , then
X
fn dμ → X
f dμ.
Proof. It is clear that the sequence X fn dμ is nondecreasing, and therefore
there exists α := lim X fn dμ. Since fn ≤ f , we have α ≤ X f dμ and the
assertion is obvious for α = +∞. Assume α < +∞, fix a simple function s,
0 ≤ s ≤ f , and prove that X s dμ ≤ α. The proof will be given in several steps:
(a) Let τ ∈ (0, 1). Define En = {x ∈ X : fn (x) ≥ τ s(x)}. Then En ∈ S , En ⊆
∞
En+1 and En = X (if f (x) = 0, then x ∈ E1 ; if f (x) > 0, then τ s(x) <
n=1
lim fn (x) ). Thus μ(En ∩ A) → μA for every set A ∈ S .
k
(b) Let s = βj cAj , where Aj are pairwise disjoint. Then
j=1
k
k
fn dμ ≥ fn dμ ≥ τ s dμ = τ ( βj cAj ) dμ = τ βj μ(Aj ∩ En ).
X En En En j=1 j=1
k
α≥τ βj μAj = τ s dμ .
j=1 X
Since τ ∈ (0, 1) is arbitrary, we get α ≥ X
s dμ, as needed.
32 8. The Abstract Lebesgue Integral
m
n
m
n
X= Ai = Bj , f1 = αi cAi and f2 = βj cBj . Then
i=0 j=0 i=0 j=0
m
n
m
n
(f1 + f2 ) dμ = (αi + βj )μ(Ai ∩ Bj ) = αi μAi + βj μBj
X i=0 j=0 i=0 j=0
= f1 dμ + f2 dμ.
X X
For the general case, find sequences s1n and s2n of simple functions with
sin fi and use the first part and Theorem 8.5.
8.9. Theorem (properties of L 1 (μ)). The following propositions hold:
(a) If f ∈ L 1 , then f is finite almost everywhere.
(b) If f, g ∈ L 1 , α, β ∈ R, then αf + βg ∈ L 1 and (αf + βg) dμ =
α X f dμ + β X g dμ (an appeal to (a) reveals that αf + βg is defined
almost everywhere).
(c) If f ∈ L 1 , then |f | ∈ L 1 and X f dμ ≤ X |f | dμ (i.e. the integral is
“absolutely convergent”).
(d) L 1 is a lattice (if f, g ∈ L 1 , then max(f, g), min(f, g) ∈ L 1 ).
(e) If f is μ-measurable, g ∈ L 1 , |f | ≤ g, then f ∈ L 1 .
Proof. (a) Suppose that f ∈ L 1 , A:= {x ∈ X : f (x) = ∞}. Then
A is measurable
and 0 ≤ ncA ≤ f + for every n ∈ N. Thus 0 ≤ X ncA dμ ≤ X f + dμ and we get
μA ≤ n1 X f + dμ < ∞ for every n ∈ N. Hence μA = 0.
(b) It is plain to see that X αf dμ = α X f dμ for f ∈ L 1 and α ∈ R. Let
f, g ∈ L 1 and write f = f + − f − , g = g + − g − and h = f + g (by (a), h is defined
almost everywhere). Then
h+ − h − = f + − f − + g + − g − ,
B. The Abstract Lebesgue Integral 33
(d) The assertion follows immediately from (b) and (c) using
1
max(f, g) = (f + g + |f + g|).
2
(e) If f is a μ-measurable function, f + is μ-measurable as well and
0≤ f + dμ ≤ g dμ < ∞
X X
(0 ≤ f ≤ |f | ≤ g), thus f
+ +
∈ L . Analogously f − ∈ L 1 , and finally f =
1
f + − f − ∈ L 1.
8.10. Remarks. 1. Proposition (b) of the previous theorem shows that 1 satisfies almost
all axioms for a linear space but it is not a true linear space. To get a linear space, the set of
all finite everywhere defined functions from 1 or the space L1 (which will be introduced in
In the theory of Lebesgue integration, limit theorems play a crucial role. They
concern either monotone convergence or dominated convergence.
34 8. The Abstract Lebesgue Integral
∞
for all n ∈ N and that the series hj converges almost everywhere. Then
j=1
∞ ∞
hj dμ = hj dμ.
X j=1 j=1 X
Proof. Set gk = inf{fn : k ≤ n}. Then gk lim inf fn and infer on Levi’s
theorem again.
B. The Abstract Lebesgue Integral 35
8.16. Theorem. Let f ≥ 0 be a μ-measurable function. If X f dμ = 0, then
f = 0 almost everywhere.
∞
Proof. Denote An = {x ∈ X : g(x) ≥ n1 }. Then An = {x ∈ X : f (x) > 0}
n=1
and the inequality 0 ≤ cAn ≤ nf yields μ(An ) = 0.
8.17. Theorem. Let f ∈ L 1 . If E f dμ = 0 for any measurable set E, then
f = 0 almost everywhere.
Proof. If E := {f ≥ 0}, then E f dμ = E f + dμ = 0 and using the previous the-
orem we get f + = 0 almost everywhere. Analogously f − = 0 almost everywhere.
8.23. Image of a Measure. Let (X, , μ) be a measure space, (Y, ) be a measurable space
and f : X → Y a measurable mapping (i.e. f −1 (E) ∈ whenever E ∈ ). The set function
E → μ(f −1 (E)), E∈ ,
is called the image of the measure μ under the mapping f and it is denoted by f (μ).
(a) Show that f (μ) is a measure on (Y, ).
(b) Let g : Y → R be a -measurable function. Show that g is f (μ)-integrable if and only
if g ◦ f ∈ 1 . In this case Z Z
g df (μ) = g ◦ f dμ.
Y X
8.24. Exercise. Let f be an increasing differentiable function on R, lim f (x) = ±∞. Let
x→±∞
μ be a measure on (R, (R)) defined by
Z
μE = f dx.
E
The Lebesgue dominated convergence theorem has simple but very important
consequences on continuity and differentiation of integrals depending on a param-
eter.
9.1. Theorem. Let (X, S , μ) be a measure space, P a metric space and U a
neighbourhood of a point a ∈ P . Suppose that a function F : U × X → R has the
following properties:
(a) there exists a set N ⊂ X of measure zero such that for each x ∈ X \ N
the function F (·, x) is continuous at a;
(b) for each t ∈ U , the function F (t, ·) is μ-measurable;
(c) there exists a function g ∈ L 1 (X) such that |F (t, ·)| ≤ g almost every-
where for all t ∈ U .
B. The Abstract Lebesgue Integral 37
is continuous at a.
Proof. The proof that
lim F (t, ·) dμ = F (a, ·) dμ
t→a X X
for each sequence tj → a, tj ∈ U . To prove the last assertion it suffices to use the
Lebesgue dominated convergence theorem.
9.2. Theorem. Let (X, S , μ) be a measure space, N ⊂ X a set of measure
zero and I ⊂ R an open interval. Suppose that a function F : I × X → R has the
following properties:
(a) For each x ∈ X \ N , F (·, x) is differentiable on I;
(b) for each t ∈ I, F (t, ·) is μ-measurable;
d
(c) there exists a function g ∈ L (X) such that F (t, x) ≤ g(x) for each
1
dt
x ∈ X \ N and t ∈ I;
(d) there is a t0 ∈ I such that F (t0 , ·) ∈ L 1 (X).
Then F (t, ·) ∈ L 1 (X) for all t ∈ I, the function
f : t → F (t, ·) dμ
X
is differentiable on I and
d
f (t) = F (t, ·) dμ.
X dt
F (b, x) − F (a, x)
x →
b−a
38 10. The Lp Spaces
9.3 Remarks. 1. Notice that in the proofs of the last theorems we made heavy use of the
Lebesgue dominated convergence theorem. Of course we could state analogous results based on
Levi’s theorem.
2. Since the notions of continuity and derivative are local, it suffices to verify the assumption (c)
only locally.
3. A number of exercises and clarifying examples can be found in [L-Př].
The Lebesgue spaces Lp are important means linking measure theory and func-
tional analysis, and they are essential tools in differential equations theory, prob-
ability theory and other branches of modern analysis.
In this chapter, (X, S , μ) denotes a fixed measure space and p a number from
the interval [1, ∞].
10.1. The Set L p . (a) Suppose p < ∞. We denote by L p = L p (X, S , μ) the
set of all μ-measurable functions on X such that
p
|f | dμ < ∞.
X
The value
1/p
p
f p := |f | dμ
is called the Lp -norm of a function f ∈ L p . We will show soon that ·p has all
important properties of a norm.
(b) We denote by L ∞ = L ∞ (X, S , μ) the set of all μ-measurable functions
f on X such that |f | ≤ M μ-almost everywhere for some constant M . The least
constant M having this property is called the L∞ -norm of f and is denoted by
f ∞ . Roughly speaking, the difference between f ∞ and supX |f | is that f ∞
omits values of f on null sets.
Sometimes it is useful to emphasize the measure or the space X. In this case,
abbreviated symbols L p (X) or L p (μ) will be used as well.
B. The Abstract Lebesgue Integral 39
1/p 1/q
f g dμ ≤ |f |
p
dμ |g|
q
dμ .
X X X
Proof. Denote
1/p 1/q
p q
s= |f | dμ , t= |g| dμ .
X X
p q
f (x)g(x) |f (x)| |g(x)| |f (x)| |g(x)|
≤ ≤ +
st st psp qtq
With these trivial cases out of the way, there remains the case 1 < p < ∞.
Hölder’s inequality yields
p−1
|f | |f + g| dμ
X
1/p 1/q
p (p−1)q
≤ |f | dμ |f + g| dμ
X X
1/p 1−1/p
p p
= |f | dμ |f + g| dμ .
X X
Analogously
1/p 1−1/p
p−1 p p
|g| |f + g| dμ ≤ |g| dμ |f + g| dμ .
X X X
This entails
p p−1 p−1
|f + g| dμ ≤ |f | |f + g| dμ + |g| |f + g| dμ
X
X 1/p
X
1/p 1−1/p
p p p
≤ |f | + |g| |f + g| dμ .
X X X
[f ] = {g ∈ L p : g = f μ-almost everywhere on X}
and define
Lp = Lp (X, S , μ) = {[f ] : f ∈ L p }.
Then Lp is a linear space equipped with operations
Proof. The easy case p = ∞ is left to the reader. Suppose p < ∞. First we choose
a subsequence {gj } of {fi } such that
∞
S := gj − gj+1 p < ∞.
j=1
∞
Denote h = |gj − gj+1 |. Then by Levi’s Theorem 8.12 and Minkowski’s in-
j=1
equality
⎛ ⎞p ⎛ ⎞p
∞
k
hp dμ = ⎝ |gj − gj+1 |⎠ dμ = lim ⎝ |gj − gj+1 |⎠
X X k→∞ X
j=1 j=1
⎛ ⎞p
k
≤ lim ⎝ gj − gj+1 p ⎠ ≤ S p < ∞.
k→∞
j=1
By Theorem 8.9.a there exists a set M ⊂ X such that μ(X \ M ) = 0 and h < ∞
on M . We can also assume that |g1 | < ∞ on M . For each x ∈ M , {gj (x)} is a
Cauchy sequence in R, and therefore it is convergent. Thus we can define
f (x) = lim gj (x)
j→∞
2. For a counting measure μ on a set X we write lp (X) := Lp (X, (X), μ). In particular, for
X = N we get well known spaces of sequences:
„ «1/p
P
– the space p , 1 ≤ p < ∞ of all sequences x = {xn } such that xp := |xn |p <
n
∞,
– the space ∞ of all bounded sequences x = {xn } with the norm x∞ := supn |xn |.
10.8. Exercise. Suppose fn , f ∈ ∞ . Show that fn − f ∞ → 0 if and only if there exists
a set E ∈ , μE = 0, such that fn ⇒ f on X \ E.
10.9. Exercise. Let p ∈ [1, +∞) and {fn } be a Cauchy sequence in p . A close inspection of
the proof of 10.6 shows that there is a subsequence of {fn } that converges μ-almost everywhere
in X. Compare with Theorem 12.4.
10.10. Exercise. (a) Suppose that p ∈ [1, +∞). Show that the set
X
:= { βj cBj : Bj ∈ , μBj < ∞}
j
10.11. Exercise. Consider p ∈ (0, 1). Define .p and the spaces p, Lp in the same way as
in the beginning of this chapter. Show that:
(a) Lp is a linear space.
(b) The triangle inequality for .p does not hold.
Hint. Employ the indicator functions of two disjoint sets of a positive measure.
R
(c) The distance function dp (f, g) := |f − g|p dμ is a metric on Lp and (Lp , dp ) is a
complete metric linear space.
10.12. Notes. The Hölder inequality was first proved by A.L. Cauchy [*1821] in the case of
p = 2 for finite sums (so, in fact, for the counting measure on a finite set). V.Y. Bunyakowski
[1859] proved it (still for p = 2) for Riemann integrable functions and the same result was
obtained by H.A. Schwarz [1885]. For general p but still for finite sums, the inequality was
proved by L.J. Rogers [1888] and by O. Hölder [1889]. The definitive form given in 10.3 comes
from F. Riesz [1910], where Minkowski’s inequality is proved as well. The original result for
finite sums is due to H. Minkowski [1907].
The theory of Lp -spaces was originated by F. Riesz [1906] who defined the L2 -metric and
E. Fischer [1907] who proved the completeness of L2 . F. Riesz then defined Lp for other values
of p as well and proved their completeness. The notion of a Banach space has its roots in papers
by S. Banach [1922] and N. Wiener [1922]. This period culminated when Banach’s fundamental
monograph [*1932] appeared.
In this chapter, we will consider the following problem: Given two measure
spaces (X, S , μ) and (Y, T , ν), we wish to define a product measure τ on an
appropriate σ-algebra U of subsets of the Cartesian product X ×Y . If M = S ×T
where S ∈ S , T ∈ T (such sets are called measurable rectangles), we require
τ M = μS · νT .
An important example is that of Lebesgue measure in R2 which should arise as
the product of one-dimensional Lebesgue measures.
B. The Abstract Lebesgue Integral 43
(a) if M1 ⊂ M2 ⊂ M3 ⊂ . . . , Mn ∈ M , then Mn ∈ M ;
(b) if M1 ⊃ M2 ⊃ M3 ⊃ . . . , Mn ∈ M , then Mn ∈ M .
Every monotone class which is simultaneously an algebra forms a σ-algebra.
The following idea, used already in the proof of the previous lemma, will repeat
in the proofs of subsequent theorems:
If A is the family of sets from S ⊗ T for which a certain proposition holds
and if A contains all measurable rectangles, then A = S ⊗ T provided A is a
σ-algebra. It is not always so easy to verify that A a σ-algebra. Often it is easier
to prove that A is only a monotone class and an algebra (mostly using Levi’s
theorem). But even in this case A = S ⊗ T as follows from the next theorem.
11.4. Theorem. If R is an algebra of sets, then the smallest monotone class
containing R is σ (R).
Proof. Let M denote the smallest monotone class containing R (it does exist!).
It is sufficient to show that M is an algebra. To this end, it is enough to prove
that M ⊂ KE := {B : E \ B, B \ E, B ∪ E ∈ M } for each E ∈ M . We fix a set
F and proceed in the following steps:
(a) KF is a monotone class;
(b) R ⊂ KF for F ∈ R;
(c) M ⊂ KF for F ∈ R;
(d) if E ∈ M , then R ⊂ KE . (Let F ∈ R; by (c), E ∈ KF , and so F ∈ KE .)
Therefore R ⊂ M ⊂ KE by the definition of M .
The basis for a definition of the product measure is in the following proposition.
44 11. Product Measures and the Fubini Theorem
Proof. Let A be the collection of all sets E ∈ S ⊗ T such that all assertions
hold for E. If R denotes the family of all finite disjoint unions of measurable
rectangles, it is straightforward to check that R is an algebra. The assertion will
follow from the preceding theorem, provided we can prove that A is a monotone
class containing R. We merely outline the main steps and invite the reader to fill
in the details.
(a) A contains all measurable rectangles.
n
(b) R ⊂ A (it suffices to show that Ej ∈ A whenever Ej ∈ A are pairwise
j=1
disjoint).
τ (A × B) = μA · νB
whenever A ∈ S and B ∈ T .
Proof. Uniqueness is almost obvious. Indeed, if τ1 and τ2 are measures on S ⊗ T
satisfying requirements of the theorem, then the family D = {E ∈ S ⊗T : τ1 E =
τ2 E} contains all measurable rectangles, is closed under finite disjoint unions and
is a monotone class. Therefore D = S ⊗ T and we see that τ1 = τ2 on S ⊗ T .
To prove the existence of a product measure, define
τE = νE x dμ
X
for E ∈ S ⊗ T . We know that τ E = Y μEy dν according to Lemma 11.5.
A routine argument shows that τ is a measure on S ⊗T and τ (A×B) = μA·νB
for all measurable rectangles A × B.
B. The Abstract Lebesgue Integral 45
11.8. Lemma. Let (X, S , μ), (Y, T , ν) be σ-finite measure spaces and f a
S ⊗T -measurable function on X ×Y . Then the function f (x, ·) is T -measurable
on Y for each x ∈ X.
Proof. Select α ∈ R, x ∈ X and observe that
x
{y ∈ Y : f (x, y) > α} = {(t, y) ∈ x × Y : f (t, y) > α} .
11.9. Fubini’s Theorem. Let (X, S , μ), (Y, T , ν) be σ-finite measure spaces,
and h ∈ L ∗ (μ ⊗ ν). Then
h dμ ⊗ ν = h(x, y) dν) dμ = h(x, y) dμ dν .
X×Y X Y Y X
11.11. Theorem. Let (X, S , μ), (Y, T , ν) be complete σ-finite measure spaces,
τ the product of μ and ν and τ the completion of τ . Let h ∈ L ∗ (τ ). Then the
function h(x, ·) is ν-measurable for μ-almost all x ∈ X, h(·, y) is μ-measurable
for ν-almost all y ∈ Y , and
h dτ = h(x, y) dν dμ = h(x, y) dμ dν .
X×Y X Y Y X
Proof. As usual, it is sufficient to prove the theorem for indicator functions. Let
M be a set of the form A ∪ N , where A ∈ S ⊗ T and N ⊂ B for B ∈ S ⊗ T ,
τ B = 0. By Fubini’s theorem
cB (x, ·) dν = 0
Y
46 12. Sequences of Measurable Functions
11.12. Exercise. Show that every Dynkin class is a monotone class and find a counterexample
that the reverse is not true.
11.13. Notes. The product measures and the reduction of multi-dimensional integration to
one-dimensional one were originally studied in case of the Lebesgue measure in R2 and appear
already in H. Lebesgue [*1904]. G. Fubini [1907] claims that the order of integration may be
reversed. A complete proof was done by L. Tonelli [1909]. Approximately in the 30’s a number
of works concerning the abstract product measure theory appeared. The method explained in
this chapter is due to H. Hahn [1933].
Let us point out that there are theories concerning products of measures which are not
necessarily σ-finite. The theory of infinite products of measures (we understand the infinite
number of factors) is also very important.
In this section we will study the relationship among various kinds of conver-
gences of sequences of measurable functions on a measure space (X, S , μ). We
will consider
(a) the convergence almost everywhere;
(b) the convergence in the norm of Lp spaces;
(c) the convergence in measure;
(d) the μ-uniform convergence.
∞ 1
∞
If Aj := {x ∈ X : |f (x) − fnk (x)| ≥ } and B := Aj , then A1 ⊃ A2 ⊃
k=j k j=1
1
A3 ⊃ . . . , μA1 ≤ < +∞, and consequently
2k
∞
1 1
μB = lim μAj ≤ lim = lim j−1 = 0
2k 2
k=j
Hint. For one of the implications use Theorem 12.4. If {fn } satisfies the “subsequences condi-
tion” and does not converge in measure, then there exists an ε > 0 and a subsequence {gn } of
{fn } such that μ{|gn − f | > ε} > ε for all n and gn → f almost everywhere. By Lebesgue’s
theorem of 12.6 we obtain a contradiction.
12.8. Exercise. Let (X, , μ) be a measure space, ϕ an increasing, bounded and continuous
function on [0, ∞], ϕ(0) = 0, and ϕ(s + t) ≤ ϕ(s) + ϕ(t) for all s, t ≥ 0 (for instance the function
t → t/(1 + t)). Define
∞
X
ρ(f, g) = 2−j ϕ(μ{|f − g| ≥ 1/j}).
j=1
Show that
(a) ρ is a pseudometric on the space of all the μ-measurable functions on X;
(b) fn converge to f in measure if and only if ρ(f, fn ) → 0.
12.9. Exercise (Vitali’s theorem). Let 1 ≤ p < +∞. If fn ∈ p, fn → f almost everywhere
and fn p → f p , then fn − f p → 0.
Hint. Let ϕn = 2p−1 (|fn |p + |f |p ) − |fn − f |p . Then ϕn → 2p |f |p almost everywhere. Since
|fn − f |p ≤ (|fn | + |f |)p ≤ 2p−1 (|fn |p + |f |p ), we have ϕn ∈ 1 , ϕn ≥ 0. By Fatou’s lemma
Z Z Z Z
2p |f |p ≤ lim inf ϕ n = 2p |f |p − lim sup |fn − f |p .
X X X X
R
Hence lim sup X |fn − f |p = 0, finishing the proof.
(b) Show that the assumption of σ-finiteness of the measure μ in (a) cannot be dropped.
Hint. Let X be the set of all convergent sequences x = {xk } of real numbers and μ the counting
measure on X. Define a sequence {fn } of functions on X by fn (x) = xn . Choose a sequence
{Ej } of subsets of X on which {fn } converges uniformly and find a convergent sequence {yk }
which does not belong to any of the sets Ej (because it converges more slowly than all sequences
from Ej ).
(c) Show that the μ-uniform convergence implies convergence almost everywhere, or conver-
gence in measure, without the assumption that μX < ∞.
12.11. Exercise. Find a sequence {fn } of functions on [0, 1] which converges to zero in
(Lebesgue) measure whereas the sequence {fn (x)} fails to converge for any x ∈ [0, 1].
Hint. Consider the indicator functions of intervals [ k−1
m
, k
m
] ordered into a suitable sequence.
B. The Abstract Lebesgue Integral 49
p
12.12. Weak Convergence in Lp . Let 1 ≤ p < +∞ and q = p−1 (q = +∞ if p = 1).
Suppose f , fj ∈R p
R that a sequence {fj } converges
. We say weakly to f in Lp (denoted
f = w-lim fj ) if X fj g dμ → X f g dμ for every function g ∈ q .
(a) If f = w-lim fj , g = w-lim fj , then g = f almost everywhere.
(b) If f − fj p → 0, then f = w-lim fj .
(c) Suppose f, fj ∈ p . Let be a set of p -functions, the linear span of which is dense
in Lq . Then the following assertions are equivalent:
(i) f = w-lim fj ; R R
(ii) the sequence {fj p } is bounded and X gfj dμ → X gf dμ for every g ∈ .
Hint. A typical application of the Banach-Steinhaus theorem from functional analysis.
(d) We can take as the set of all indicator functions of sets from in case p = 1, or the
set of all indicator functions of sets from of finite measure if p > 1; in the special case of the
Lebesgue measure, other possibilities are shown in Theorem 31.4.
(e) If p > 1, then any norm-bounded sequence of functions from p has a weakly convergent
subsequence.
Hint. If the space Lp is separable, proceed in a similar way as in the proof of Theorem 17.10.
The nonseparable case is more difficult. The idea is that for 1 < p < ∞ the space Lp is reflexive
and reflexive spaces are characterized by this property, cf. for instance L. Mišı́k [*1989], Theorem
33.4 or R.B. Holmes [*1975], p.149.
(f) Let p > 1. If fj → f almost everywhere and if {fj p } is a bounded sequence, then
f = w-lim fj . The proposition holds also if the assumption fj → f almost everywhere is
replaced by fj → f in measure.
(g) The Radon-Riesz theorem. Let p > 1, f = w-lim fj and f p = lim fj p . Then
f − fj p → 0. (What proposition do we obtain by sticking (f) and (g) together?)
Hint. The proof makes substantial use of uniform convexity of Lp spaces for 1 < p < ∞, see
M.M. Rao [*1987], Proposition 5.5.3.
(h) The sequence fj : x → sin jx converges weakly to zero for any p on each bounded interval.
None of its subsequences is convergent almost everywhere.
(i) Suppose fj = j 1/p on (0, 1/j) and fj (x) = 0 on (1/j, 1). The sequence {fj } is bounded
in the norm of Lp (0, 1) and tends to zero almost everywhere. If p = 1, then it is not weakly
convergent (nor any of its subsequences). If p > 1, then it converges weakly but not in norm.
12.13. Weak* Convergence. Assume μ is a σ-finite measure on X and f, fj ∈ L∞ . We say
R R
that a sequence {fj } converges weakly* to f (denoted f = w* -lim fj ) if X fj g dμ → X f g dμ
for each function g ∈ 1 .
(a) If f = w* -lim fj , g = w* -lim fj , then g = f almost everywhere.
(b) If f − fj ∞ → 0, then f = w* -lim fj .
(c) Suppose f, fj ∈ ∞ . Let be a set of 1 -functions, the linear span of which is dense
in L1 . Then the following are equivalent:
(i) f = w* -lim fj ; R R
(ii) the sequence {fj ∞ } is bounded and X gfj dμ → X gf dμ for each function g ∈ .
(d) If fj → f almost everywhere and if the sequence fj ∞ is bounded, then f = w* -lim f j.
(e) Any norm-bounded sequence {fn } of ∞ -functions contains a weakly* convergent sub-
sequence.
S
Hint. Write X = Xk where X1 ⊂ X2 ⊂ . . . and μXk < ∞. Suppose {fn } is a norm-bounded
sequence of functions from L∞ and
(
fn on Xk ,
f˜n,k =
0 on X \ Xk .
50 13. The Radon-Nikodým Theorem and the Lebesgue Decomposition
Then {f˜n,k }n is bounded even in the L2 -norm and there exists its subsequence {f˜ni ,k }n which
R R
converges weakly in L2 to a function f˜k . But this means that lim X fni u dμ = X fk u dμ for
i
each function u ∈ L2 vanishing on the complement of Xk . The diagonal R method (cf.
R 17.10)
provides a subsequence {hn } of {fn } and a limit function f such that X hn u dμ → X f u dμ
for each function u ∈ L2 vanishing on the complement of some of the sets Xk . Since the set of
all such functions u is dense in L1 , in light of (c) we conclude that f = w* -lim fn .
(f) The sequence fj : x → sin jx converges weakly* to zero (cite the Riemann-Lebesgue
lemma 31.10).
(g) Define fj = 0 on (0, 1/j) and fj (x) = 1 on (1/j, 1). The sequence {fj } is bounded in
L∞ -norm and converges both almost everywhere and weakly* to the constant function 1. We
have lim fj ∞ = lim fj ∞ = 1 but lim f − fj ∞
= 0.
12.14. Remarks. It may be worth while to take for the interested reader a slightly deeper
look from the point of view of functional analysis (see also [Zap]).
(1) Let X is a Banach space with (topological) dual X ∗ , xn , x ∈ X, Fn , F ∈ X ∗ . We say
that
(a) {xn } converges weakly to x if F (xn ) → F (x) for any functional F ∈ X ∗ ;
(b) {Fn } converges weakly* to F if Fn (z) → F (z) for any z ∈ X.
Taking into account 13.17, the previous definitions of the weak and weak* convergence in
the Lp spaces now are easier to understand (at least if 1 < p < ∞ or if μ is σ-finite).
2. We say that a Banach space is uniformly convex if for any ε > 0 there is a δ > 0 such that,
‚ ‚
‚1 ‚
if x, y ∈ X, x = y = 1 and ‚ ‚
‚ 2 (x + y)‚ > 1 − δ, then y − x < ε.
Now, if μ and ν are measures on S , one might ask whether there is a nonnegative
function f ∈ L 1 (μ) such that ν = μf . Immediately we see that such a function in
general does not exist. Indeed, every measure of the form μf satisfies μf (E) = 0,
provided μ(E) = 0. However, we show that if νE = 0 whenever μE = 0, and
μ and ν are σ-finite, then a desired function f , called the density or the Radon-
Nikodým derivative of ν with respect to μ, can be found. Then the density of ν
with respect to μ is unique except for a null set and it is sometimes denoted by
dν
.
dμ
We say that a measure ν on S is absolutely continuous with respect to μ, and
write ν μ, if νE = 0 for every E ∈ S with μE = 0.
There are several methods how to prove the existence of Radon-Nikodým derivatives. Our
proof is based on the following “variational” approach. Assuming that such a function f exists
and is in 2 , consider the functional
Z
J0 : g → |g − f |2 dμ
X
on the space L2 (μ). The function f evidently represents the only element of the space L2 (μ) at
which J0 attains its minimum. For all g ∈ L2 (μ)
Z
J0 (g) = J(g) + f 2 dμ
X
where Z Z Z
J(g) = (g 2 − 2f g) dμ = g 2 dμ − 2 g dν.
X X X
Since the functionals J and J0 differ only by a constant, J attains its minimum at f . This is
also the idea of our proof of the following lemma.
for any A ∈ S .
Proof. For g ∈ L 2 (σ) denote
J(g) = g 2 dσ − 2 g dν .
X X
Let
c := inf{J(g) : g ∈ L2 (σ)}.
Since νA ≤ σA for each A ∈ S , we have
J(g) ≥ (g 2 − 2 |g|) dσ = (|g| − 1)2 dσ − σ(X) ≥ −σ(X),
X X
52 13. The Radon-Nikodým Theorem and the Lebesgue Decomposition
holds, we have
1 2
|fj − fk | dσ ≤ J(fj ) + J(fk ) − 2c → 0
2 X
By the Riesz representation theorem on continuous linear functionals on Hilbert spaces there is
an element f ∈ L2 (σ) such that
Z
F (g) = f g dσ
X
for each g ∈ L2 (σ). Now it is enough to apply the last equality to g = cA , where A runs over
.
We see (for simple functions using linearity, then passing to limits) that
gf dμ = g(1 − f ) dν
X X
for all E ∈ S ;
(iii) for any ε > 0 there is a δ > 0 such that νE < ε whenever E ∈ S and
μE < δ.
54 13. The Radon-Nikodým Theorem and the Lebesgue Decomposition
Proof. The implication (i) =⇒ (ii) was established in Theorem 13.4, (ii) =⇒ (iii)
follows by Exercise 8.22.b and (iii) =⇒ (i) is obvious.
13.6. Remarks. 1. The Radon-Nikodým theorem can be generalized, its variants hold even
for signed or complex measures. Let us state the version for σ-finite measures:
Let μ, ν be σ-finite measures on (X, ), ν μ. Then there is a nonnegative -measurable
function h (not necessarily from 1 (μ)) such that
Z
νA = h dμ
A
dμ+
On the other hand, if μ is a signed measure, f = and P = {f = 1}, N = X \ P , then
d |μ|
(P, N ) is a Hahn decomposition of X with respect to μ.
13.7. Integration with Respect to Signed or Complex Measures. Suppose μ is a
signed or complex measure on (X, ). Then we define
Z Z
f dμ = af d |μ|
X X
Proof. (See also Exercise 13.12.) We first reduce the general case to the case
when ν is a finite positive measure. There are B
j ∈ S such that μBj = 0 and
lim νBj = sup{νB : B ∈ S , μB = 0}. If M = Bj , then μM = 0 and for the
measures
νa A := ν(A \ M ), νs A := ν(A ∩ M )
the equality ν = νa + νs holds. Clearly νs ⊥μ. If now μB = 0, then νa (B) =
ν(B \ M ) = 0. This is so because νB = 0 for each set B ∈ S , B ⊂
X \ M with
μB = 0. (Otherwise μ(M ∪ B ) = 0 and ν(B ∪ M ) > νM .) If X = Xk where
νXk < +∞, find sets Mk ⊂ Xk in the same way as in the first part of the proof
and set
M= Mk , νa A = ν(A \ M ), νs A = ν(A ∩ M ).
Having the proof for positive measures, the general case easily follows.
Uniqueness remains. Suppose that ν = νa + νs is another decomposition with
the required properties. Then there is a set M ∈ S for which μM = 0 and
νs (X \ M ) = 0. Fix A ∈ S and denote C = A ∩ (M ∪ M ), D = A \ (M ∪ M ).
We have μC ≤ μM + μM = 0, so νa C = νa C = 0 and νs C = νs C. Further,
νs D = νs D = 0 and therefore νa D = νa D. Since A = C ∪ D and C ∩ D = ∅, we
get νs A = νs A and νa A = νa A.
13.11. Lebesgue Decomposition. The decomposition ν = νa + νs is called
the Lebesgue decomposition of ν relatively to μ, and the measures νa and νs are
called the absolutely continuous part and the singular part of ν. Notice that there
exists a set M ∈ S such that νs = νM and νa = νX\M .
13.12. Exercise. Give the details of an alternative proof of the Lebesgue decomposition
theorem: Existence (assuming σ-finitness of both measures) using the Radon-Nikodým theorem
and uniqueness from the following exercise (part (c)).
Hint. (a) Existence: Suppose μ, ν are positive and set λ = μ + ν, f = dμ/dλ. Then the
Lebesgue decomposition has the form νs = νM , νa = νX\M , where M = {f = 0}.
(b) Uniqueness: If ν = νa + νs = νa + νs , then νa − νa = νs − νs is at the same time
absolutely continuous and singular.
13.13. Exercise. Let μ and ν be positive, signed or complex measures on a measurable space
(X, )).
(a) Prove that the following are equivalent:
(i) μ⊥ν;
(ii) |μ| ⊥ |ν|;
(iii) (μr )+ ⊥(νr )+ , (μr )− ⊥(νr )− ; (μi )+ ⊥(νi )+ , (μi )− ⊥(νi )− .
(b) If ν1 ⊥μ, ν2 ⊥μ, then ν1 + ν2 ⊥μ.
(c) If ν⊥μ, ν μ, then ν = 0.
P
13.14. Exercise. Suppose S ⊂ R is countable, f ≥ 0 and μ = f (s)εs (εs are the Dirac
s∈S
measures, cf. Exercise 2.10). Describe the Lebesgue decomposition of μ with respect to the
Lebesgue measure.
13.15. Exercise. Suppose μ ∈ M(X) (see Exercise 6.17), V = {ν ∈ M(X) : ν⊥μ}. Show
that V is a closed linear subspace of the Banach space M(X).
13.16. Exercise. Let μ, ν be finite (positive) measures on (X, ). Let sup and inf be defined
as in Exercise 6.19 and let σ = μ + ν.
56 13. The Radon-Nikodým Theorem and the Lebesgue Decomposition
such that f (u) = X uv dμ for all u ∈ Lp . Moreover vq = f (here, f = sup{f (u) : u ∈
Lp , up ≤ 1}).
Hint. ByRthe Radon-Nikodým theorem there exists a Rμ-measurable finite function v such that
f (cE ) = E v dμ for each set E ∈ . Thus, f (u) = X uv dμ for each u ∈ Lp . It remains to
show that v ∈ Lq and vq = f . There is an increasing sequence {Ej } of sets from of
S
finite measures such that v is bounded on each Ej and Ej is X. Set uj = |v|q−2 vcEj . Then
R R R
uj ∈ Lp and uj pp = X uj v dμ = f (uj ) ≤ f uj p and therefore E |v|q dμ = X |uj |p dμ ≤
“ ”q j
uj p−1
p ≤ f q . An appeal to Levi’s theorem reveals that v ∈ Lq and vq ≤ f . The
reverse inequality is as easy consequence of Hölder’s inequality.
13.18. Remark. The previous theorem which describes duals of Lp for 1 < p < ∞ was
proved using the Radon-Nikodým theorem, so that we had to confine ourselves to the case of
σ-finite measures. However, the theorem holds for arbitrary measures.
In a similar way, it can be proved that any element of L∞ represents a continuous linear
functional on L1 (by the same formula as in 13.17), but not every element of the dual space
to L1 is of this form. In order to be able to characterize elements of the dual space to L1 by
elements of L∞ , it is necessary to confine ourselves, e.g. to the case of σ-finite measures. One
possible description of the dual space of L1 (μ) in the case of arbitrary measure μ can be found
in J. Schwartz [1951].
The assumptions imposed on measures in the Radon-Nikodým theorem can be weakened to
the case of so called localizable measures. This notion, containing σ-finite case as well as the
case of Radon measures, was introduced by I.E. Segal in [1954]. For more information we refer
to M.M. Rao [*1987].
Let us remark that the dual spaces of L∞ can be described using (bounded) finitely additive
measures, cf. E. Hewitt and K. Stromberg [*1965].
13.19. Notes. The Radon-Nikodým theorem was first proved by H. Lebesgue [1910] for
measures which are absolutely continuous with respect to the Lebesgue measure. Later, it was
generalized by M.J. Radon [1913] to Radon measures and by O. Nikodým [1930] to measures
on abstract spaces. The Lebesgue decomposition in the case of arbitrary measures is in Saks’
monograph [*1937]. There are different proofs of the Radon-Nikodým theorem. One of them is
based on the Hahn decomposition and a newer one, using the Riesz theorem of representation
of functionals on Hilbert spaces, comes from J. von Neumann. Our proof uses the variational
principle and is near to von Neumann’s.
In the classical case of the Lebesgue measure, the characterization of dual spaces of Lp spaces
is due to F. Riesz [1910] for p > 1 and to H. Steinhaus [1919] for p = 1.
C. Radon Integral and Measure 57
εa : f → f (a) , f ∈ c (P )
for each f ∈ ([a, b]). The functional Aϕ is again a Radon integral and it is called the Riemann-
Stieltjes integral. If ϕ(x) = x, it is the Riemann integral.
(d) The Riemann integral or the Riemann-Stieltjes integral can be defined for functions from
R R
c (R). Indeed, observe that ab f dϕ = cd f dϕ whenever f ∈ c (R) and supt f ⊂ [a, b] ∩ [c, d].
Rb
Thus, if f ∈ c (R), set Aϕ f = a f dϕ where [a, b] is an arbitrary interval containing the
support of f . The functional Aϕ is then a Radon integral on R.
(e) Another example of a Radon integral is the functional
Z
f → (f ◦ ψ) g ,
G
f → Af (x)
is a Radon integral on P .
The Poisson integral is used when solving the Laplace (partial differential) equation Δh = 0.
Indeed, given a continuous function f on P , the function h : x → Af (x) is harmonic in the unit
disc U := {x ∈ R2 : |x| < 1} (i.e. h is a solution to the Laplace equation) and
for each z ∈ P .
and hence b = 0.
14.4. Semicontinuous Functions. Remember that a function f : P → R is
said to be lower semicontinuous if {x ∈ P : f (x) > c} is open for each c ∈ R.
Upper semicontinuous functions are defined in a similar way.
Denote by Cc↑ (P ) the set of all lower semicontinuous functions on P which are
nonnegative outside a compact set and do not attain the value −∞. Analogously
we define Cc↓ (P ).
14.5. Extension of a Radon Integral. Let A be a Radon integral on P . We
extend A for larger classes of functions in the following steps.
1. For f ∈ Cc (P ) let f = Af .
2. If f ∈ Cc↑ (P ), then we define f = sup{ g : g ∈ Cc (P ), g ≤ f }.
C. Radon Integral and Measure 59
To prove the additivity of |A|, note that f1+ +f2+ +(f1 +f2 )− = f1− +f2− +(f1 +f2 )+
and use the previous step. Since |A| (γf ) = γ |A| (f ) for every f ∈ Cc (P ) and
every γ ∈ R, the functional |A| is linear.
14.12. Decomposition of Signed and Complex Radon Integrals. Every
complex Radon integral can be decomposed into its real and imaginary part which
are signed Radon integrals. If A is a signed Radon integral, then A can be
expressed in the form A = A+ − A− where A+ := 12 (|A| + A) (the positive
variation) and A− := 12 (|A| − A) (the negative variation) are positive Radon
integrals. The representation of a signed Radon integral as a difference of positive
Radon integrals is not unique; however, the decomposition to the positive and
negative parts is “minimal” in a similar sense as the Jordan decomposition of a
measure.
14.13. Exercise (product of Radon integrals). Let A1 , A2 be Radon integrals on locally
compact spaces P1 , P2 . Show that there exists exactly one Radon integral A on P1 × P2 such
that
A f = A1 f1 · A2 f2
whenever f1 ∈ c (P1 ), f2 ∈ c (P2 ) and f (x1 , x2 ) = f1 (x1 )f2 (x2 ), x1 ∈ P1 , x2 ∈ P2 . Prove the
similar assertion also for signed and complex Radon integrals.
Hint. According to the Stone-Weierstrass theorem, the set of all linear combinations of functions
of the form f1 (x) · f2 (y), where fi ∈ c (Pi ), is dense in c (P1 × P2 ).
14.14. Notes. When building up the integration theory, some authors do not start from
the original notion of a measure; they consider linear functionals on spaces of functions defined
on an arbitrary set and then they extend these functionals to larger collections of functions.
More precisely, it is possible to start with a Riesz lattice of functions on a set X (which is
a linear space of functions closed under formations of finite maxima and minima) and with a
positive linear functional on which satisfies “Daniell’s condition”. As in 14.5, this functional
is extended, a measure is defined in a natural way and its properties are derived. This method
was worked out by P.J. Daniell [1918] (for the extension he used sequences of functions) and
M.H. Stone in the series of articles [1948] and [1949] (using generalized sequences). Let us note
C. Radon Integral and Measure 61
that many authors use different forms of a similar approach (W.H. Young [1904], H.H. Golstine
[1941], J. Mařı́k [1952] and others).
The idea of extending the Radon integral is also due to many authors. However, it seems
that Bourbaki’s group was the first who worked it out for general locally compact spaces.
Also many authors confine themselves to the integration theory on locally compact spaces.
Kakutani’s representation theorem (S. Kakutani [1941]) says that any “abstract” functional A
on a Riesz lattice can be represented as a Radon integral AK on a locally compact space PK
so that the corresponding spaces of “integrable” functions are isomorphic. However, Kakutani’s
representation also has some disadvantages — when changing the original functional, the space P
changes as well and if the original space is locally compact, Kakutani’s representation can lead to
a different space. Supposing that the Riesz lattice satisfies Stone’s condition (min(1, f ) ∈
if f ∈ ), H. Bauer in [1957] constructed a different representation of (PB , AB ) which removes
mentioned disadvantages; the space PB does not depend on A and if is the space of all
continuous functions with a compact support on a locally compact space P , PB can be identified
with P , c (PB ) with and AB with A.
Of course, no such representation can save the topology on the original space if it is not
locally compact.
Many works are also concerned with integration theories in topological spaces without the
assumption of local compactness (for instance in separable metric spaces) but in this case the
correspondence between measure and the topological structure is not so fruitful.
for every open set U , then is a σ-algebra containing all open sets. Therefore (d) holds as
well.
Throughout, μ is a Radon measure on (P, S ).
15.3. Lemma. If E ∈ S , μE < ∞, then
Proof. Given ε > 0, there exist an open set G ⊃ E and a compact set K ⊂ G
such that μ(G \ E) < ε and μK > μG − ε. Let V be an open set containing G \ E
with μV < ε. If H := K \ V , then H is compact,
E \ H ⊂ (E ∩ K \ H) ∪ (E \ K) ⊂ V ∪ (G \ K)
Proof. (a) Choose ε > 0 and find a sequence {sk } of simple functions, 0 ≤ sk
∞
f . Since f = s1 + (sk − sk−1 ), there exist Aj ∈ S and αj ≥ 0 so that
k=2
∞
f = αj cAj . As f is μ-integrable, we have μAj < ∞ for all j and we can find
j=1
∞
open sets Gj ⊃ Aj so that μ(Gj \ Aj ) < 2−j ε. The function g := αj cGj is
j=1
nonnegative, lower semicontinuous and satisfies
g dμ ≤ f dμ + ε.
P P
(b) To prove the asssertion it suffices to show it for simple functions. Let
n
f = αj cMj be a μ-integrable function (α1 , . . . , αn ≥ 0, M1 , . . . , Mn ∈ S and
j=1
C. Radon Integral and Measure 63
μMj < ∞). Choose again an ε > 0. By virtue of Lemma 15.3 there are compact
sets Kj (j = 1, . . . , n) such that Kj ⊂ Mj and
ε
μKj ≥ μMj − .
nαj
n
Then h := αj cKj is an upper semicontinuous function with a compact support,
j=1
0 ≤ h ≤ f and
h dμ ≥ f dμ − ε.
P P
Proof. First assume that f ∈ L (μ). Let ε > 0 be given. Cite Lemma 15.4 to
1
15.7. Exercise. Show that a finite positive measure μ on (P, ) is Radon if and only if for
every E ∈ and for every ε > 0 there exist a compact set K and an open set U such that
K ⊂ E ⊂ U and μ(U \ K) < ε.
15.8. Signed and Complex Radon Measures. A signed measure on (P, ) is said to be
Radon if its positive and negative variations are Radon measures. A complex measure on is
Radon if its real and imaginary parts are signed Radon measures.
Prove the following proposition: For a locally compact space P and a complex measure μ on
(P, ), the following assertions are equivalent:
(i) μ is Radon;
(ii) |μ| is Radon;
(iii) if E ∈ and ε > 0, then there exist a compact set K and an open set U so that
K ⊂ E ⊂ U and |μA| < ε for each -measurable set A ⊂ U \ K.
Hint. Use the inequality
To prove this inequality, use the decomposition as in the proof of Theorem 6.11.
15.9. Exercise. Let μ be a Radon measure on P . Prove that the union G of an arbitrary
collection of μ-null open sets is again a μ-null open set.
Hint. Suppose K ⊂ G is compact. Then K can be covered by a finite union of sets from ,
whence μK = 0. Now, the definition of a Radon measure (property (c)) shows that μG = 0.
15.10. Support of a Radon Measure. Let μ be a Radon measure on P . Define the support
of μ as [
supt μ = P \ {G : G open, μG = 0}.
In other words, supt μ is the smallest closed set whose complement si of measure zero. According
to the previous exercise, such a set always exists. If μ is a signed or complex measure, its support
is defined as the support of its total variation |μ|.
R
(a) Let μ be a positive measure. Show that z ∈ supt μ if and only if P f dμ > 0 for every
nonnegative function f ∈ c (P ) with f (z) > 0, which happens if and only if μU > 0 for every
open neighbourhood U of z.
(b) If μ1 , μ2 are Radon measures on P , then supt(μ1 + μ2 ) ⊂ supt μ1 ∪ supt μ2 with the
equality if μ1 and μ2 are positive.
15.11. Exercise. Let μ be a Radon measure and E a Borel set. If E is open or of finite
measure, then μE (see 2.4) is a Radon measure.
15.12. Exercise. Let μ be a Radon measure and f a nonnegative μ-measurable function on
P . Show that μf (see 8.19) is a Radon measure in the following cases:
(a) f is a Borel function and μf is finite on compact sets;
(b) f is continuous on P .
Hint. Use the part (a).
15.13. Exercise. Suppose that μ is a Radon measure and f ∈ L1 (μ). Show that μf is a
signed Radon measure.
15.14. Exercise. Let μ0 be a σ-finite Radon measure and μ = μa + μs the Lebesgue
decomposition of a finite (signed or complex) Radon measure μ with respect to μ0 . Show that
μa and μs are Radon measures.
Hint. Use Exercise 15.11.
C. Radon Integral and Measure 65
15.15. Exercise. (a) Show that the assertion of Lemma 15.3 continues to hold, provided E
is a countable union of sets of finite measures.
(b) If a Radon measure μ is σ-finite, then
μE = sup{μK : K ⊂ E, K compact}
for every E ∈ , in particular for every Borel set E ⊂ P . (Recall that on σ-compact spaces
Radon measures are σ-finite. Every locally compact space with a countable base of open sets is
metrizable and σ-compact.)
(c) Let P be the Cartesian product R × Rd where Rd is R equipped with the discrete
topology. Show that P is a locally compact space which is not σ-compact.
P
For every open set G ⊂ P set μG = λGy (see the notation in 11.1) and extend μ to a
y∈Rd
Radon measure on P .
If B := {0} × Rd , then B is a Borel set, μB = ∞ and μK = 0 for every compact set K ⊂ P .
15.16. Exercise. (a) Let μ be a Radon measure on P and K ⊂ P compact. Show that
{x ∈ K : μ{x} > 0} is countable. In particular, if μ is a Radon measure on a σ-compact space
P , then {x ∈ P : μ{x} > 0} is countable.
(b) Let μ be a Radon measure on Rn . Then the set {r > 0 : μ{x ∈ Rn : |x| = r} > 0} is
countable.
15.17. Exercise. Let μ be a Radon measure on (P, ), p ∈ [1, ∞). Show that the set Cc (P )
is dense in p (μ).
Hint. With the help of Exercise 10.10.a, it is sufficient to approximate every set E ∈ of finite
measure by functions from K (P ) in the Lp -norm. If ε > 0, then there exist an open set G and
a compact set K (Lemma 15.7) so that K ⊂ E ⊂ G and μ(G \ K) < ε. Now, use Urysohn’s
lemma in order to find a function ϕ ∈ c (P ) with cK ≤ ϕ ≤ cG .
15.18. Exercise. A Radon measure μ on P is called discrete (often also atomic) if there
exists a set S ⊂ P such that μ(P \ S) = 0 and μ{x}
= 0 for all x ∈ S. A measure μ is called
continuous if μ{x} = 0 for each point x ∈ P .
(a) Show that a complex Radon measure μ on P P is discrete if and P
only if there exists a
sequence of numbers {cj } and points xj ∈ P so that |cj | < ∞ and μ = cj εxj . Cf. Exercise
j j
2.10.
(b) Show that every Radon measure μ can be uniquely expressed in the form μ = μd + μc
where μd is discrete and μc continuous.
Hint. Set S := {x ∈ P : μ{x} > 0}, μd A = μ(A ∩ S) and μc A = μ(A \ S) (use Exercise 15.16.a).
(c) Show that each Radon measure μ on Rn can be written uniquely in the form
μ = μd + μa + μs
(b) Show that there exists an Fσ set S and a Gδ set D such that S ⊂ E ⊂ D and μ(D \S) = 0
(compare also with Theorem 1.21).
15.20. Exercise. Let μ be a Radon measure on (P, ) and consider the outer measure
In this chapter, let us direct our attention to the relation between Radon
measures and Radon integrals on a locally compact space P .
Suppose μ is a Radon measure on P . Then Cc (P ) ⊂ L 1 (μ) and the mapping
f → f dμ , f ∈ Cc (P )
P
is a Radon integral on P .
On the other hand, every Radon integral A on P can be understood as an
integral with respect to a Radon measure. In Chapter 14 we indicated a method
of proving this proposition, and now we propose to show another method and
provide complete proofs.
Symbols μA and μ∗A of this chapter will be used to denote the same objects as
previously in Chapter 14, but the definitions are revised.
In the sequel, A denotes a Radon integral on a locally compact space P .
16.1. Outer Radon Measure. If G is an open subset of P , we set
{Vi1 }i , {Vi2 }i such that Vij ⊂ Gj and Vi1 ∪ Vi2 ⊃ K. Set Kj = K ∩ Vij ,
i i i
j = 1, 2. Then Kj are compact, Kj ⊂ Gj and K = K1 ∪ K2 . Thus μ∗ K ≤
μ∗ K1 + μ∗ K2 ≤ μ∗ G1 + μ∗ G2 and it readily follows that μ∗ is subadditive on
open sets.
∞
Now, let {Gn } be a sequence of open sets. Choose a compact set K ⊂ Gi .
i=1
n
Then K ⊂ Gi for some n ∈ N, whence
i=1
∞
n
n
∗ ∗
μ K≤μ Gi ≤ μ∗ Gi ≤ μ∗ Gi ,
i=1 i=1 i=1
68 16. Riesz Representation Theorem
∞
∗
∞
and (b) yields that μ Gi ≤ μ∗ Gi .
i=1 i=1
∗
Finally, let En ⊂ P be arbitrary. We wish to show that μ∗ ( En ) ≤ μ En .
It would be clearly sufficient to assume that μ∗ En < ∞ for all n. Given ε > 0,
we can find open sets Gn such that En ⊂ Gn and μ∗ Gn < μ∗ En + 2−n ε. Then
μ∗ En ≤ μ∗ Gn ≤ μ∗ En + ε .
μ∗ (T ∩ G) + μ∗ (T \ G) ≤ μ∗ (V ∩ G) + μ∗ (V \ G) < μ∗ W + ε + μ∗ W0
= μ∗ (W ∪ W0 ) + ε ≤ μ∗ V + ε < μ∗ T + 2ε.
(In fact, the idea of the proof is quite simple, T ∩ G and T \ G are approximated
by disjoint open sets.)
16.4. Measure μA . Any Radon integral A on a locally compact space P defines
a corresponding outer Radon measure μ∗A . Carathéodory’s construction yields the
σ-algebra MA on which μ∗A is a complete Radon measure. The restriction of μ∗A
to MA will be denoted by μA .
16.5. Riesz Representation Theorem. Let A be a Radon integral on P .
Then there exists a complete Radon measure μ on P such that Cc (P ) ⊂ L 1 (μ)
and Af = P f dμ for each f ∈ Cc (P ). The measure μ is unique on B(P ).
Proof of the existence. If μ = μA , then μ is a complete Radon measure. To
complete the proof we only have to show that
Af = f dμ, f ∈ Cc (P ).
P
Then using the definition of μ∗ G and the properties of integral it is clear that
1 1 1 1
μGk ≤ A(fk − fk−1 ) ≤ μGk−1 and μGk ≤ (fk − fk−1 ) dμ ≤ μGk−1
n n n P n
n
1 1 1
≤ (μGk−1 − μGk ) = μG0 = μ{x ∈ P : f (x) > 0}.
n n n
k=1
Since μ{x ∈ P : f (x) > 0} < +∞, we get Af = P
f dμ and we are done.
Proof of the uniqueness. If ν is a complete Radon measure on P obeying the
assumptions of the theorem, we have
Af = f dν ≤ cG dν = νG
P P
f = sup |f (x)| .
x∈P
The space
(b) Let ν be a finite signed or complex Radon measure on P . Then the mapping
Z
A : f → f dν
P
16.11. Notes. The famous Riesz Representation Theorem 16.5 was proved for P = [0, 1] by
F. Riesz [1909b] but continuous linear functionals on ([0, 1]) were represented by functions of
bounded variation using Riemann-Stieltjes integrals. Other theorems of this type were proved by
J. Radon [1913] (for compact subsets of Rn ), S. Banach in Appendix to Saks’ monograph [*1937],
S. Saks [1938] (for compact metric spaces) and S. Kakutani [1941]. The Riesz Representation
Theorem for compact spaces and the method of construction of Radon measures directly from
Radon integrals is due to J. von Neumann [1934]. Concerning locally compact spaces, A. Weil
[1940] was aware about the result. However, the final version for locally compact spaces was
established by the Bourbaki group (A. Weil was also its member) in the 40’s and appeared in
[*1952].
for each f ∈ F . Notice that the F -weak limit, if exists, is uniquely determined.
The most important case is the Cc (P )-weak convergence which is called the
v
vague convergence and denoted by μn → μ.
72 17. Sequences of Measures
From the point of view of functional analysis, the vague convergence is the weak* convergence.
In case of spaces of measures, it is common to omit the asterisk. Different spaces of measures
are duals to different spaces of continuous functions. If such a space is equipped with a norm
(or, more generally, with a locally convex topology), then it is possible to introduce a -weak
topology on its dual ∗ so that the -weak convergence of measures is in fact the convergence
in this ( -weak) topology. Notice that the space ∗ is not (except a few not very interesting
cases) metrizable. Hence the -weak topology cannot be described by convergence of sequences
and, in general, nets have to be used instead.
If P is a compact metric space, then the set + (P ) is metrizable in the (P )-weak topology.
In functional analysis, besides weak convergences we meet also strong conver-
gences. The most important example of a strong type convergence of measures is
the convergence μn − μ → 0 on the space Mb (P ) of all finite signed (or com-
plex) Radon measures. The norm μ defined as |μ| (P ) (like in Exercise 6.17)
is the dual norm to the norm of C0 (P ) and Mb (P ) with this norm is a Banach
space.
17.2. Comparison of Weak Convergences. Now we will touch the question
v
whether the vague convergence μn → μ implies the F -weak convergence for a
wider space of “test” functions. Proofs of next propositions require the Banach-
Steinhaus theorem of functional analysis.
v
(a) A sequence μn converges C0 (P )-weakly to μ if and only if μn → μ and the
sequence {μn } is bounded.
(b) As usually, by a weak convergence we understand the Cb (P )-weak conver-
gence where Cb (P ) denotes the set of all bounded continuous functions on P . A
sequence μn of complex measures from Mb (P ) converges weakly to a measure
v
μ ∈ Mb (P ) if and only in μn → μ and for every ε > 0 there exists a compact set
K ⊂ P so that |μn | (P \ K) < ε for all n.
A sequence μn of positive measures from Mb+ (P ) converges weakly to a measure
v
μ ∈ Mb+ (P ) if and only if μn → μ and μn → μ.
(c) Note that the weak convergence implies the C0 (P )-weak convergence and
this one implies the vague convergence. If the space P is compact, then Cb (P ) =
Cc (P ) = C (P ) and there is no difference between these convergences.
v
17.3. Examples. (a) If xn → x, then εxn → εx .
(b) Suppose that {xn } is a sequence having no convergent subsequence (for example, take
xn = n for P = R) and {αn } a sequence of real numbers. Then the sequence {αn εxn } converges
vaguely to the null measure. This sequence converges (to the null measure) 0 (P )-weakly if and
only if αn is a bounded sequence, and weakly if and only if αn → 0.
v
(c) Suppose xn → z, yn → z, xn
= yn . Then εxn − εyn → 0 but εxn − εyn → 2
= 0.
(d) If f is a continuous function on [0, 1], then
Z
1 X i
1 k
f (x) dx = lim f( ) .
0 k→+∞ k i=1 k
As if often the case, this equality can become the basis for a definition of the Riemann integral,
but it cannot be used to describe the set of Riemann integrable functions. In the language of
the weak convergence of measures it can be understood as
1X
k
v
ε i → λ[0,1] .
k i=1 k
C. Radon Integral and Measure 73
The above examples show that the weak or vague convergence (unlike the
strong convergence, cf. Exercise 6.18) of μn to μ does not imply μn (A) → μ(A)
for all (Borel) sets. Nevertheless, we can state the following theorem for compact
spaces. Its modifications hold in locally compact spaces as well.
v
17.4. Theorem. Let P be a compact space and μn , μ ∈ M + (P ), μn → μ.
(a) For any lower semicontinuous lower bounded function u on P we have
u dμ ≤ lim inf u dμn .
P n→∞ P
To prove (c), (d) and (e) we can apply (a) and (b) to indicator functions.
17.5. Remark. Compare Theorem 17.4 with the well-known result that a bounded function
f is Riemann integrable if and only if the set of all points of discontinuity of f is of measure zero
(see 7.9.d). It is possible to prove that a bounded Borel function f is Riemann integrable on
R R
[0, 1] if and only if [0,1] f dμn → 01 f dλ for every sequence μn of positive measures converging
weakly to λ on [0, 1].
v
17.6. Exercise. Suppose μn , μ ∈ + (P ). Show that μ → μ if and only lim sup μ (K) ≤
n n
μ(K) for each compact set K ⊂ P and lim inf μn (G) ≥ μ(G) for each open set G ⊂ P .
17.7. Molecular Measures. A Radon measure ν is called molecular if
k
k
ν = αi εxi where x1 , . . . , xk ∈ P and α1 , . . . , αk are positive and αi = 1.
i=1 i=1
As we have seen in Example 17.3.d, the definition of the Riemann integral is
closely related to an approximation of “continuous” Lebesgue measure by “dis-
crete” molecular measures. In a similar way, more general measures can be ap-
proximated as the next theorem shows.
17.8. Theorem. Let μ be a (positive) Radon measure on a compact metric
space P . Then there exists a sequence {μn } of molecular measures such that
v
μn → μ.
74 18. Luzin’s Theorem
Proof. We sketch the main idea of the proof. Thanks to the compactness of P ,
for every k ∈
N ithere is a finite collection {Mki } of pairwise disjoint nonempty
−k
sets so that Mk = P and diam Mk ≤ 2 . Choose points xik ∈ Mki and set
i
i
μk = μ(Mki )εxik .
i
17.9. Remark. The previous theorem can be proved as a nice application of the Krejn-
Milman theorem or of the bipolar theorem. However, both of them belong to deeper theorems
of functional analysis.
The following theorem has also its interpretation in the language of functional analysis: On
dual spaces bounded sets are relatively sequentially compact in the weak* topology.
μ(f ) = lim νn (f ).
n→∞
Proof. (i) =⇒ (ii): Let {Uj } be a countable base for the topology on R (for
instance, a sequence of all intervals with rational endpoints). Fix an ε > 0 and
a compact set K ⊂ P . There exist open sets Gj ⊂ P and compact sets Fj ⊂ P
such that
Fj ⊂ K ∩ f −1 (Uj ) ⊂ Gj and μ(Gj \ Fj ) < 2−j ε.
ϕn → f μ-almost everywhere on P.
18.6. Notes. Luzin’s theorem for the case of the Lebesgue measure was proved by H. Lebesgue
[1903] and by N.N. Luzin [1912].
Most of this chapter is devoted to a more general problem. Since the object
belongs to elements of harmonic analysis and the reader can consult many text-
books (e.g. G. Bachman [*1964] or K. Ross [*1963]), we merely outline the main
ideas and invite the reader to fill in the details.
19.2. Topological Group. Let us start with basic notions. By a topological
group we understand a group G together with a (Hausdorff) topology such that
the group operations (x, y) → xy and x → x−1 are continuous.
In the sequel, G stands for a topological group whose topology is locally com-
pact. The unit element of G will be denoted by e and the σ-algebra of all Borel
subsets of G by B(G). Finally, by S we will denote σ-algebras which appear as
domains of measures in consideration.
For x ∈ G and A ⊂ G, set
where a ∈ (0, +∞) and b ∈ R. There exists a one-to-one mapping of G onto (0, +∞) × R given
by „ «
a b
F: → (a, b).
0 1
Consider on G the (locally compact) topology determined by F from R2 . For A ∈ (G), set
Z Z
1 1
μA = 2
dx dy, νA = dx dy.
A x A x
(a) Show that μ is a left Haar measure and ν a right Haar measure on G.
(b) Show that νA = μ(A−1 ).
78 19. Measures on Topological Groups
19.7. Theorem (existence and uniqueness of left Haar measure). (a) On any
locally compact group there exists a left Haar measure.
(b) If μ and ν are complete left Haar measures on G, then there exists c > 0
such that μ = cν.
Proof of this theorem is quite complicated and takes a lot of effort. There are var-
ious existence proofs, some of them as applications of deep theorems of functional
analysis. Here we outline a rough idea of an “elementary” existence proof.
Let V be an open neighbourhood of the unit element e of G. If E ⊂ G is a
compact set, denote by HV (E) the smallest n such that there exist x1 , . . . , xn ∈ G
n
so that E ⊂ xj V (existence of a finite cover follows from the compactness of
j=1
E). Let K be a fixed compact set with a nonempty interior. The required Haar
measure is then obtained by some limit procedure for “V → {e}” and by extending
set functions
HV (E)
E → , E compact.
HV (K)
A proper and complete description of the entire construction is not easy.
The proof of uniqueness will be given under an additional assumption that μ
is even a Haar measure. The general case is similar, but the technical details
are more difficult. First, there exists a nonnegative function h ∈ Cc (G) so that
h(e) = 0 and h(x) = h(x−1 ) for x ∈ G (if g ∈ Cc (G), g(e) = 0, g ≥ 0, set
h(x) = g(x) + g(x−1 )). Then G h dμ > 0 (see Exercise 19.14.b) and for an
arbitrary function f ∈ Cc (G) we get
h dν f dμ = h(y) f (xy) dμ(x) dν(y)
G G
G G
To give reasons for particular steps when using Fubini’s theorem notice that f
and h have compact supports and R that μ and ν are Radon measures. We can
h dμ
finish the proof by setting c := RG h dν .
G
Haar measures have many interesting properties. Let us state some of them.
19.8. Theorem. Let μ be a left Haar measure on a locally compact group G.
Then:
(a) μU > 0 for any nonempty open set U ⊂ G;
(b) μG < +∞ if and only if G is compact.
n
of K yields the existence of x1 , . . . , xn ∈ K such that K ⊂ xi U . Then
i=1
n
0 ≤ μK ≤ μ(xi U ) = nμ(U ).
i=1
n
but all (“remaining”) x ∈ G satisfy xK ∩ ( xi K) = ∅. (Indeed – consider finite
i=1
sequences x1 , . . . , xn for which x1 K, . . . , xn K are pairwise disjoint. Their number
n
is bounded for instance by μG/μK. If x ∈ G, then xK ∩ ( xi K) = ∅ and it
n i=1
−1
follows that G = xi K K is compact.
i=1
19.14. Exercise. Let μ be a left Haar measure. Prove the following assertions.
R R
(a) If f ∈ c (G) and y ∈ G, then G f dμ = G f (yx) dμ(x). The last equality holds if f is a
nonnegative μ-measurable function on G or if f ∈ 1 (μ).
R
(b) If f ∈ c (G) is nonnegative and f (e) > 0, then G f dμ > 0.
(c) The measure μ is σ-finite if and only if G is σ-compact.
(d) The topology on G is discrete if and only if μ{x}
= 0 for some (and thus for all) x ∈ G.
19.15. Exercise. Calculate the modular function of the group from Example 19.5.
19.16. Exercise . Let Δ be a modular function on a locally compact group G. Show that:
(a) Δ is continuous;
(b) Δ(xy) = Δ(x)Δ(y) for all x, y ∈ G;
(c) Δ(e) = 1.
19.17. Exercise. Let μ be a left Haar measure
R on G. Show that
R the set function μ̃, where
μ̃E := μE −1 , is a right Haar measure and G f (x)Δ(x−1 ) dμ = G f dμ̃ for every f ∈ c (G).
dμ̃
In other words, the Radon-Nikodým derivative equals Δ(x−1 ).
dμ
R
Hint. Show that the function ν on (G) defined by νA := A Δ(x−1 ) dμ is a right Haar
measure. Thus ν = K μ̃ for some K. Now use the fact that Δ is a continuous function attaining
the value 1 at the unit element e.
19.18. Exercise. Show that the left and right Haar measures on G are mutually absolutely
continuous.
Hint. Use Exercise 19.17.
19.22. Exercise. Show that the Banach algebra (L1 (χ), ∗) has a multiplicative unit if and
only if G is endowed with the discrete topology. As an example consider the group Z of all
integers with the discrete topology and counting measure.
19.23. Involution. If f ∗ (x) := f (x−1 )(Δ(x))−1 , then the mappingR f ˛→ f ∗ of˛the space L1 (χ)
onto itself (which is called the involution) is an isometry. (Show that G ˛f (x−1 )˛ (Δ(x))−1 dχ =
R ∗
G |f (x)| dχ for f ∈ c (G); hence it follows that f = f if f ∈
1 (χ).)
hence Z Z
h dμ∗ν = h(xy) dμ⊗ν.
G G×G
for any bounded Borel function h on G. Therefore, for f, g ∈ 1 (χ) we have
d(χf ∗ χg )
f ∗g = ,
dχ
R
where χf is the complex Radon measure given by χf (E) = E f dχ. We see that the “new”
definition of convolution of functions agrees with the “former” one.
19.29. Exercise. Give a definition of a convolution of a (complex) function f ∈ 1 (χ) and a
complex Radon measure μ ∈ b (G). Prove that f ∗ μ ∈ 1 (χ) and that f ∗ μ1 ≤ f 1 μ.
In fact, L1 (χ) is not only a subalgebra of b (G) but even an ideal.
19.30. Notes. The translation invariant measures (or integrals) on compact Lie groups
were studied by F. Peter and H. Weyl [1927]. Remarkable progress was made by proving the
existence of a left Haar measure for separable locally compact groups by A. Haar [1933] and
by J. von Neumann in [1934] (the existence and uniqueness) for arbitrary compact groups, and
in [1936] (uniqueness) for separable locally compact groups and also by A. Weil especially in
[*1940]. Their proofs used the axiom of choice. H. Cartan [1940] and G.E. Bredon [1963] then
gave proofs without using the axiom of choice. A relatively short proof of the uniqueness result
can be found in S. Kakutani [1948]. More detailed historical notes are given by E. Hewitt and
K.A. Ross in [*1963]. It is said that Example 19.5. is due to J. von Neumann [1936]. Note that
Haar measures are special cases of invariant measures which can be studied in a more general
setting, cf. Banach’s appendix of Saks’ monograph [*1937] or H. Federer [*1969].
82
D. Integration on R
20. Integral and Differentiation
Since the last inequality holds for all ε > 0, we are done.
20.2. Lemma. Let f be a real function on an interval I and E ⊂ I. If |f | ≤ K
on E for some K > 0, then
λ∗ f (E) ≤ Kλ∗ E.
Proof. Choose K > K and denote
Ek = {x ∈ E : |f (x) − f (y)| ≤ K for all y ∈ E ∩ (x − k1 , x + k1 )}.
is measurable.
To prove the required inequality, it is no restriction to assume that E is
bounded. For an ε > 0 denote
λ∗ f (E) ≤ λ∗ f (Ek ) ≤ kε ≤ |f | + ελEk
k k k Ek
= |f | + ελE.
E
20.5. Exercise. Under the assumptions of Theorem 20.4 prove that f (E) is a measurable set.
S
Hint. Use the following Exercise 20.6 and the fact that E = N ∪ Kn where λN = 0 and Kn
n
are compact (Theorem 1.21).
20.7. Luzin’s (N)-property. We have seen in 3.18 that a continuous image of a (Lebesgue)
measurable set does not need to be measurable. On the other hand, if f is differentiable and E
measurable, Exercise 20.5 yields that f (E) is measurable. We now consider briefly the question,
under what conditions images of measurable sets are again measurable.
We say that a real function f defined on an interval I ⊂ R has Luzin’s (N)-property if the
image f (N ) of any (Lebesgue) null set N ⊂ I is again a null set.
(a) (Rademacher) Let f be a continuous function on an interval I. Then the following
conditions are equivalent:
(i) f has Luzin’s (N)-property;
(ii) f (M ) is measurable whenever M ⊂ I is measurable.
Hint. In light of Theorem 1.21 a measurable set is a countable union of compact sets and a null
set. For the converse, use Remark 1.9.2 which says that every set of a positive measure contains
a nonmeasurable subset.
(b) Show that the assertion of (a) remains true if f is measurable only (use Luzin’s Theorem
18.2). Likewise, the definition of Luzin’s (N)-property and other ideas can be generalized to the
case when I is a measurable subset of Rn .
(c) Let f be a function having a finite derivative everywhere on an interval I. Then f has
Luzin’s (N)-property.
Hint. Exercise 20.6.
(d) Every Lipschitz (even locally Lipschitz) function on a measurable set M has Luzin’s
(N)-property.
20.8. Exercise. Let f be an arbitrary function on an interval I ⊂ R. If D is the set of points
at which f has a finite derivative, then D is a Borel set and the function x → f (x) is a Borel
function on D.
20.9. Notes. Luzin’s (N)-property was introduced by Luzin in [*1915]. Corollary 20.3 which
we called the one-dimensional version of Sard’s lemma is also often called Luzin’s theorem.
In this chapter we discuss two classes of functions without using measure the-
ory. Important results concerning these functions will be introduced in following
chapters.
21.1. Functions of Finite Variation. Let f be a real-valued function on an
interval I. For any interval [a, b] ⊂ I and any partition D : a = x0 < x1 < · · · <
xm = b of [a, b] denote
b
m
V(f, D) = |f (xj ) − f (xj−1 )| .
a
j=1
b
is called the variation of f over [a, b]. If V f < ∞ for every interval [a, b] ⊂ I,
a
then f is termed a function of finite variation. In this case there exists a function
v on I such that
b
v(b) − v(a) = V f
a
D. Integration on R 85
we say that f is of bounded variation on I. For a compact interval [a, b], notions
of finite and bounded variation agree and they can be characterized simple by
b
V f < ∞.
a
In case of an arbitrary interval I we could say that f is of “locally bounded
variation” instead of “finite variation”. Indeed, a function f is of finite variation
on I if and only if it has bounded variation on each compact subinterval of I.
In a similar way, we are going to “localize” notions of absolute continuity and
integrability.
21.2 Jordan Decomposition Theorem. A function f is of finite variation
on I if and only if f is a difference of two nondecreasing functions.
Proof. Monotone functions are of finite variation, and thus differences of mono-
tone functions are also of finite variation. For the converse, if v is an indefinite
variation of a function f of finite variation, then v and v − f are nondecreasing
and f is their difference.
21.3. Absolutely Continuous Functions. A real-valued function f is said
to be absolutely continuous on an interval I if given ε > 0, there exists δ > 0 that
m
|f (bj ) − f (aj )| < ε
j=1
m
whenever a1 <b1 ≤ a2 <b2 ≤ · · · ≤ am <bm are points of I with (bj − aj ) < δ.
j=1
The family of all absolutely continuous functions on an interval I is a vector space
which contains all Lipschitz functions.
We say that a function f is locally absolutely continuous on an interval I if f
is absolutely continuous on every compact subinterval of I.
Any locally absolutely continuous function is continuous and of bounded vari-
ation.
21.4. Theorem. Any absolutely continuous function f on I is the difference
of two nondecreasing absolutely continuous functions.
Proof. It is enough to show that the indefinite variation v of f is absolutely
continuous. To this end let an ε > 0 be given. Find δ > 0 such that
m
|f (bj ) − f (aj )| < ε
j=1
86 22. Theorems on Almost Everywhere Differentiation
m
whenever a1 <b1 ≤ a2 <b2 ≤ · · · ≤ am <bm are points of I with (bj − aj ) < δ.
j=1
p
Let A1 <B2 ≤ A2 <B2 ≤ · · · ≤ Ap <Bp be points of I with (Bj − Aj ) < δ. Find
j=1
partitions
mj
Aj = a0j < b0j = a1j < · · · < bj = Bj
of intervals [Aj , Bj ] for which
mj
i
v(Bj ) − v(Aj ) < f (bj ) − f (aij ) + 1 ε.
i=1
p
i
Since (bj − aij ) < δ, we have
j,i
|v(Bj ) − v(Aj )| < f (bij ) − f (aij ) + ε < 2ε ,
j j,i
as needed.
21.5 Exercise. Show that the product of two absolutely continuous functions (or functions of
finite variation) on a bounded interval is again an absolutely continuous function (or a function
of finite variation).
21.6. Exercise. Prove that every absolutely continuous function has Luzin’s (N)-property.
Hint. If N is a null set, find an open set M of a small measure containing N and then use the
definition of absolute continuity.
21.7. Notes. C. Jordan discovered functions of finite variation in [1881] and proved the
decomposition theorem 21.2.
In this chapter we show that every function of finite variation (in particular,
every nondecreasing or Lipschitz function) has a finite derivative almost every-
where. The usual proof of this deep theorem uses Vitali’s covering theorem. Here
we present a rather elementary proof. Let start with a simple lemma which is due
to F. Riesz.
22.1. F. Riesz’s Rising Sun Lemma. Let h be a continuous function on an
interval [a, b]. Denote
Since β ∈
/ E, it follows that h ≤ h(β) in [β, b). By hypothesis M = ∅ and
sup M = β. Thus h(x) ≤ h(β) and letting x → α+, we complete the proof.
22.2. Remarks. 1. If (α, β) is a maximal open interval contained in E and α > a, then even
h(α) = h(β).
2. The “mirror” version of the lemma: Let h be a continuous function on an interval [a, b] and
denote
E = {x ∈ (a, b) : there exists a ξ ∈ (a, x) with h(ξ) > h(x)}.
Then E is the union of a sequence (aj , bj ) of pairwise disjoint open intervals with h(aj ) ≥ h(bj ).
22.3. Extreme Derivatives. Let f be a function defined on a neighborhood
of a point x ∈ R. Denote
1
D+ f (x) = lim sup (f (x + t) − f (x)),
t→0+ t
1
D+ f (x) = lim inf (f (x + t) − f (x))
t→0+ t
and analogously D− f (x) and D− f (x) for t → 0−. These extended real numbers
are called the Dini derivatives of f at x. A function f is differentiable at x if all
Dini derivatives of f agree at x.
Finally set
1
Df (x) = lim sup (f (x + t) − f (x))
t→0 t
and call the function Df the upper derivative of f . The lower derivative Df is
defined in a similar way.
22.4. Lemma. Every nondecreasing Lipschitz function f on an interval [a, b]
has a finite derivative almost everywhere on [a, b].
Proof. It is no restriction to assume that f is 1-Lipschitz. Lipschitz functions
cannot have infinite derivatives. To prove the assertion it suffices to show that
D+ f ≤ D− f almost everywhere. Then analogously D− f ≤ D+ f almost every-
where, and
0 ≤ D+ f ≤ D− f ≤ D− f ≤ D+ f ≤ D+ f ≤ 1
almost everywhere as needed. To this end let 0 < p < q < 1 be given and set
and
f (bj ) − pbj ≤ f (aj ) − paj .
88 22. Theorems on Almost Everywhere Differentiation
Now apply Lemma 22.1 to f (x) − qx on each interval [ak , bk ]. Again there are
pairwise disjoint intervals (ak,j , bk,j ) ⊂ (ak , bk ) such that
{x ∈ (a, b) : D+ f (x) > q} ⊂ {x ∈ (ak , bk ) : there exists ξ ∈ (ak , x)
k
with f (ξ) − qξ < f (x) − qx} = (ak,j , bk,j )
k,j
and
f (bk,j ) − qbk,j ≥ f (ak,j ) − qak,j .
Hence
1 1
(bk,j − ak,j ) ≤ (f (bk,j ) − f (ak,j )) ≤ (f (bk ) − f (ak ))
q q
k,j k,j k
p p
≤ (bk − ak ) ≤ (b − a).
q q
k
Now we show that also monotone functions are differentiable almost every-
where. The situation is more difficult since monotone functions can be discon-
tinuous. The heart of the proof lies in the observation that, for a nondecreasing
function f , the inverse function to x → x + f (x) (whose domain is not necessarily
connected) can be extended to a nondecreasing Lipschitz function on an interval.
22.5. Theorem (Lebesgue). Every monotone function f on an interval I has
a finite derivative at almost all points of I.
Proof. It would be clearly sufficient to assume that I = [a, b]. There exists an
interval [A, B] and a function g on [A, B] so that g is Lipschitz and nondecreasing,
and x + f (x) ∈ [A, B], g(x + f (x)) = x for all x ∈ [a, b]. (If f is continuous, then
g is simply the inverse function to x + f (x)). A moment’s thought will convince
the reader that
{x ∈ (a, b) : f (x) does not exist } ⊂ g(E) ∪ g(N )
where
E = {y ∈ (A, B) : g (x) does not exist } and N = {y ∈ (A, B) : g (x) = 0}.
By the previous lemma λE = 0. Hence λg(E) = 0 by Lemma 20.1. According to
Sard’s lemma (Corollary 20.3) also λg(N ) = 0 and the proof is complete.
D. Integration on R 89
Proof. For x > b set f (x) = f (b) and define a sequence of functions
1
fk (x) = k f (x + ) − f (x)
k
for x ∈ [a, b]. Then fk are nonnegative measurable functions on [a, b] (f is measur-
able!) and lim fk = f almost everywhere. Thus f is measurable and nonnegative
almost everywhere. Fatou’s lemma 8.15 and a quick computation establish that
1
b b b+ k b
f ≤ lim inf fk = lim inf k f− f
1
a a a+ k a
1 1
b+ k a+ k
1 1
= lim inf k f− f ≤ lim inf k f (b) − f (a)
b a k k
= f (b) − f (a).
where the integral is understood as Lebesgue’s one and the derivative in the
“almost everywhere” sense. First, let us show that formula does not always hold
even if f is monotone by considering the following example.
23.1. Cantor Singular Function. We define the Cantor function f : [0, 1] → [0, 1] in the
following way: Let f (0) = 0, f (1) = 1. For x ∈ [ 13 , 23 ] set f (x) = 12 . Further set f (x) = 14 for
x ∈ [ 19 , 29 ] and f (x) = 34 for x ∈ [ 79 , 89 ]. Each succesive step is essentially the same. If (a, b) is a
maximal interval in which f is not yet defined, we subdivide it into thirds and define the value
of f in the closed middle third as the arithmetical mean of f (a) and f (b). Then f is defined
90 23. Indefinite Lebesgue Integral and Absolute Continuity
and uniformly continuous on a dense subset of [0, 1]. The last step of the definition consists in
the continuous extension of f to the whole interval [0, 1]. There is also an arithmetic definition
of the Cantor function. Suppose x ∈ [0, 1] is written in the form
∞
X
x= 3−j xj
j=1
where (
1 if there is i < j with xi = 1,
yj =
xj otherwise.
Whence
b
f (b) − f (a) = λf ([a, b]) ≤ λ∗ f ([a, b] \ G) + λf (G) ≤ f + ε.
a
If c ∈ I and ⎧ x
⎪
⎪ x ≥ c,
⎨ ϕ,
Φ(x) = c
c
⎪
⎪
⎩ − ϕ, x < c,
x
for any open set E, and consequently for any measurable set E as well. Then
Theorem 8.17 gives the desired conclusion.
23.5. Corollary. For a real-valued function f on an interval [a, b], the following
properties are equivalent:
(i) f is absolutely continuous on [a, b]; x
(ii) there exists ϕ ∈ L 1 ([a, b]) such that f (x) = f (a) + a ϕ for all x ∈ [a, b];
f xis differentiable almost everywhere, f ∈ L ([a, b]) and f (x) = f (a) +
1
(iii)
a
f for all x ∈ [a, b].
make this definition reasonable, we should add the assumption that f is continuous in order to
guarantee that all “antiderivatives” of f differ up a constant. An analogous problem appears
when assuming f = ϕ only almost everywhere. Now, the example of the Cantor function shows
that continuity of f does not guarantee that the increment f (b)−f (a) does not depend on choice
of the “generalized antiderivative”. However, it is possible to give an alternative definition of
the Lebesgue integral of a function ϕ as the increment of an absolutely continuous function f
on [a, b] with f = ϕ almost everywhere. (Notice that this definition does not lead to a true
generalization — some antiderivative are not absolutely continuous, see Example 25.1). The
definitions of various integrals based on the idea of (in some way) generalized antiderivatives
are called the descriptive ones. Let us note that these generalizations may consist in omitting
“small sets” or in a “generalized differentiation”. Furthermore, in Chapter 25 we mention
Perron’s method which is also included among descriptive approaches.
23.8. Lebesgue Points. Let I ⊂ R be an interval and x ∈ I. We say that x
is a Lebesgue point for a locally integrable function f if
h
1
lim |f (x + t) − f (x)| dt = 0.
h→0 2h −h
but x does not need to be a Lebesgue point for f . However, it is clear that F = f
at each Lebesgue point for f . The following theorem is thus a strengthening of
Theorem 23.4.
23.9. Lebesgue Differentiation Theorem. Let f be a locally integrable
function on an interval I. Then almost every point of I is a Lebesgue point for
f.
Proof. For a fixed r ∈ R, the function x → |f (x) − r| is locally integrable on I.
By virtue of Theorem 23.4 there exists a set Er ⊂ I of Lebesgue measure zero
such that h
1
lim |f (x + t) − r| dt = |f (x) − r|
h→0 2h −h
|f (x + t) − f (x)| ≤ |f (x + t) − r| + ε
and h
1
lim sup |f (x + t) − f (x)| dt ≤ |f (x) − r| + ε ≤ 2ε.
h→0 2h −h
23.11. Banach-Zarecki Theorem. Let f be a real-valued function on an interval [a, b]. The
following assertions are equivalent:
(i) f is absolutely continuous on [a, b];
(iv) f is continuous on [a, b], f is of bounded variation and has Luzin’s (N)-property.
23.14. Notes. Fundamental theorem (23.5) of “the calculus for Lebesgue integrals” was proved
by H. Lebesgue [*1904]. The implication (iv) =⇒ (i) of Theorem 23.11 is due to S. Banach [1925].
Gν (x) := ν(−∞, x] ,
Proof. (a) As for the uniqueness, any two such measures agree on all open (or
compact) sets in R. The existence can be proved in various ways, depending on
what kind of general theorem we wish to use. For instance, we can use Hopf’s
extension theorem 5.5 or start from the covering collection {(a, b] : a < b, a, b ∈ R}
and the set function νF (a, b] := F (b) − F (a), create the outer measure and to
restrict it to measurable sets. In any case, we have to prove the following property:
∞
∞
If (a, b] ⊂ (an , bn ], then F (b) − F (a) ≤ (F (bn ) − F (an )). To see this, given
n=1 n=1
ε > 0, let δn > 0, δ > 0 be so that F (bn +δn ) < F (bn )+ε2−n , F (a+δ) < F (a)+ε.
Then consider the cover {(an , bn + δn )} of the interval [a + δ, b].
(b) Setting ⎧
⎪
⎨ ν(0, x] for x > 0,
Fν (x) := 0 for x = 0,
⎪
⎩
−ν(x, 0] for x < 0,
Fν is nondecreasing, right continuous, Fν (0) = 0 and ν(a, b] = Fν (b) − Fν (a) for
every interval (a, b] ⊂ R.
24.3. Theorem. The following conditions are equivalent for a measure ν
defined on B(R):
(i) ν is a Radon measure;
(ii) there exists a distribution function F such that ν(a, b] = F (b) − F (a) for
every bounded interval (a, b];
(iii) there exists a nondecreasing function ϕ on R such that ν is the Lebesgue-
Stieltjes measure λϕ of Example 14.8.b.
Proof. Use previous results and the uniqueness part of the Riesz representation
theorem 16.5.
24.4. Remarks. 1. According to Remark 15.2 any finite measure on (R) is a Radon
measure. However,Pthere exist σ-finite measures on (R) which are not Radon measures (for
example, the sum δ1/k of Dirac measures). Hence, these measures are not Lebesgue-Stieltjes
k
measures and have not distribution functions.
2. The function ϕ from the last theorem need not be right continuous. However, as a monotone
function it has a right limit at each point. If ϕ̃(x) := limt→x+ ϕ(t), then ϕ̃ is a distribution
function of ν and determines the same Lebesgue-Stieltjes measure as ϕ. The functions ϕ̃ and F
of Theorem 24.3.ii differ by a constant (cf. Exercise 24.6).
24.5. Exercise. Let F be a distribution function of a Radon measure μ on R and let μF
correspond to F as in Theorem 24.2. Show that μ = μF on (R).
24.6. Exercise. Show that any two distribution functions of the same measure differ by a
constant.
24.7. Exercise. Let F be a distribution function of a Radon measure μ. Show that:
(a) F is locally absolutely continuous on R if and only if μ is absolutely continuous with
respect to the Lebesgue measure λ. In this case, F is the Radon-Nikodým derivative dμ/ dλ.
(b) Measures μ and λ are mutually singular if and only if F = 0 almost everywhere.
(c) μ({x}) = F (x) − lim F (t) for every x ∈ R. In particular, F is continuous at x if and
t→x−
only if μ({x}) = 0.
D. Integration on R 95
In some sense, the Lebesgue integral has the best properties among all integrals
which can be defined on any measure space. However, its universality can also be
a disadvantage. If we are engaged in integration of functions of one real variable
(or several variables, but in what follows we restrict to the real line in order to
simplify the explanation), then we can find wider collections of functions on which
a reasonable notion of an integral can be introduced. Let us present an illustrative
example.
25.1. Example. The function
8“ ”
<
x2 cos2 1
x2
for x
= 0,
f (x) =
:
0 for x = 0
1 1
is not Lebesgue integrable on [−1, 1]. For aj = q and bj = √ we have
1
(j + 2 )π jπ
Z bj Z bj 1
|f (x)| dx = f (x) dx = ,
aj aj πj
R
and therefore 01 |f (x)| dx = +∞. On the other hand, f has an antiderivative on R and the
R
Newton integral 01 f (x) dx exists.
96 25. Henstock–Kurzweil Integral
There are also simple examples of functions which are Lebesgue integrable but
not Newton integrable (for example, the function sign x on [−1, 1]). A generaliza-
tion of both Newton and Lebesgue integrals leads to a nonabsolutely convergent
integral which can be introduced in several ways. Here we use an approach due
to Henstock and Kurzweil; Perron’s one is outlined in 25.10.
25.2. Henstock–Kurzweil integral. For the sake of simplicity we will define
the integral for bounded intervals only.
If [a, b] is an interval, denote the set of all (strictly) positive functions on [a, b] by
Δ and label functions from Δ as gauges. A partition is a pair D = ([aj , bj ], ξj )m j=1 ,
where
for each δ-fine partition D of [a, b]. The number K is then uniquely determined,
b
it is called the Henstock–Kurzweil integral of f and denoted by K a f . Where
no confusion can result, we will drop the prefix “K”. This definition is similar
to original Riemann’s one, the “only” difference being that instead of constants
δ in Riemann’s definition, a gauge function δ appears in Henstock–Kurzweil’s
definition.
The set of all Henstock–Kurzweil integrable functions is a linear space on which
the Henstock–Kurzweil integral is a monotone linear functional.
We are not going to study details of the theory of nonabsolutely convergent
integrals; we will concentrate on relations to the Newton and the Lebesgue integral
only.
25.3 Theorem. Let F be a continuous function on [a, b], F = f on (a, b). Then
the Henstock–Kurzweil integral of f on [a, b] exists and equals to F (b) − F (a).
D. Integration on R 97
Proof. Given an ε > 0 and x ∈ (a, b), there is δ(x) > 0 such that
F (y) − F (x)
− f (x)<ε
y−x
whenever y ∈ [a, b], 0 < |y − x| < δ(x). Next, we can find δ(a) > 0 and δ(b) > 0
such that |f (a)δ(a)| < ε, |f (b)δ(b)| < ε, |F (y) − F (a)| < ε for all y ∈ [a, a + δ(a))
and |F (y) − F (a)| < ε for all y ∈ (b − δ(b), b]. Let D = ([aj , bj ]), ξj )m
j=1 be a δ-fine
partition. Then
(we get strict inequalities by adding small constants). For every x ∈ [a, b] find
δ(x) > 0 with t < f (x) < s on (x − δ(x), x + δ(x)) ∩ [a, b]. Let D = ([aj , bj ], ξj )m
j=1
be a δ-fine partition of [a, b]. Then
bj bj
t ≤ f (ξj )(bj − aj ) ≤ s
aj aj
b b
Since a
t≤L≤ a
s, we have |s(f, D) − L| < 2ε, and we are done.
98 25. Henstock–Kurzweil Integral
1
Ln := {x ∈ [a, b] : DF (x) < f (x) − } and
n
1
Un := {x ∈ [a, b] : DF (x) > f (x) + }
n
have measure zero. For this purpose we will fix n ∈ N and estimate the (outer)
measure of the set L := Ln . We are now in a position to invoke Saks–Henstock’s
lemma 25.6: Given ε > 0, there is a gauge function δ ∈ Δ such that
F (bj ) − F (aj ) − f (ξj )(bj − aj ) < ε
j∈M
100 25. Henstock–Kurzweil Integral
F (b ) − F (a ) 1
< f (x) − .
b − a n
Then V is a Vitali cover of L, and so, by Corollary 27.3, there exists a finite,
pairwise disjoint subfamily {[ai , bi ]}i of V with points ξi ∈ [ai , bi ] such that
λ∗ (L \ [ai , bi ]) < ε.
i
Now, use Cousin’s lemma in 25.2 to extend the collection ([ai , bi ], ξi )i to a fam-
ily ([a∗j , b∗j ], ξj∗ ) subordinated to δ. If M := {j : there exists an i with [a∗j , b∗j ] =
[ai , bi ]}, the property of gauge δ yields that
(f (ξi )(bi − ai ) − (F (bi ) − F (ai ))) < ε ,
i
25.10. Perron Integral. There are also other ways how to define an integral which is
equivalent to the Henstock–Kurzweil integral (i.e. it provides the same class of “integrable”
functions on which it equals to the Henstock–Kurzweil integral). In the following, we outline
Perron’s approach to generalizing both the Newton and the Lebesgue integrals on intervals. Let
us start with two notes:
D. Integration on R 101
(a) A descriptive definition of the Newton integral of a function f requires the existence of
an antiderivative (a function F with F = f ). This is a very strong condition as it reduces
the class of Newton integrable functions. Among others, this condition implies that f shares
the Darboux property and is of the Baire class one. A descriptive definition of the Lebesgue
integral requires the existence of an absolutely continuous function F with F = f only almost
everywhere.
(b) The following particular case of Theorem 15.5 is known as the Vitali-Carathéodory theo-
rem: A function f is Lebesgue integrable on an interval I if for any ε > 0 there exists an upper
semicontinuous
R function g and a lower semicontinuous function h with g ≤ f ≤ h on I and
I (h − g) < ε.
Now we are in the position to introduce Perron’s integral. Let f be a function on an interval
[a, b]. A function M is termed to be a majorant of f if f (x) ≤ DM (x)
= −∞ for every x ∈ [a, b].
If f ≥ Dm
= +∞, we say that m is a minorant of f .
A function f is said to be Perron integrable on an interval [a, b] if for any ε > 0 there exists
a majorant M and a minorant m so that
Since D(M − m) ≥ DM − Dm ≥ 0 (and since every function with a nonnegative lower derivative
is nonnegative) we may define the Perron integral of f as
Z b
P f = inf{M (b) − M (a) : M is a majorant of f }
a
= sup{m(b) − m(a) : m is a minorant of f }.
We may also define upper and lower Perron integrals of f , and f will be Perron integrable when
they are equal and finite.
25.11 Remark. Lebesgue integrable functions may be characterized in the following way:
A function f on an interval [a, b] is Lebesgue integrable if and only if for any ε > 0 there is
a majorant M and a minorant m, both of them absolutely continuous, so that
To prove this proposition, one can use the Vitali-Carathéodory characterization (Theorem
15.5) again (the indefinite integrals of semicontinuous functions serve as majorants and mino-
rants).
It is quite interesting to notice that in the definition of the Perron integral we can restrict to
continuous majorants:
A function f on an interval [a, b] is Perron integrable if and only if for any ε > 0 there is a
majorant M and a minorant m, both of them continuous, so that
25.12. Exercise. Find the gauge function δ from the definition of the Henstock–Kurzweil
integral if f is:
(a) Newton integrable;
(b) Lebesgue integrable;
(c) an indicator function of an open set;
(d) an indicator function of a null set.
102 25. Henstock–Kurzweil Integral
25.13 Exercise. Give an example to show that the indefinite Henstock–Kurzweil integral
need not be an absolutely continuous function.
25.14. Historical Notes. H. Lebesgue already knew that his integral on the real line does not
integrate all derivatives. The task was, to find a wider class of “integrable functions” containing
both the Lebesgue and Newton integrable functions. The problem was solved by A. Denjoy [1912]
who used a constructive method based on a transfinite approach, and by N. N. Luzin [1912] using
a descriptive definition based on further generalization of the notion of absolute continuity. The
method of envelopes was developed by O. Perron [1914]; it was proved that his and Denjoy’s
integrals are equivalent. Nowadays, this integral is usually called the Denjoy-Perron integral or
the restricted Denjoy integral. Another approach was developed independently by R. Henstock
[1961] and J. Kurzweil [1957] who returned to a Riemann type definition. The integral is often
called the complete Riemann or the Henstock-Kurzweil integral. Simple definitions of Henstock’s
and Kurzweil’s approach and simple proofs allowed a deeper study of the Henstock–Kurzweil
integral and surprising applications. Among recent publications let us mention R. Henstock
[*1988], Peng Yee Lee [*1989], Š. Schwabik [*1992] and Š. Schwabik and Guoju Ye [*2005].
E. Integration on Rn 103
E. Integration on Rn
26. Lebesgue Measure and Integral on Rn
and the Lebesgue measure λn is defined as the restriction of λ∗n to the σ-algebra
Mn = Mn (λ∗n ) of all Lebesgue measurable sets. Where no confusion can result,
the subscript “n” will be omitted.
26.1. Theorem. The Lebesgue measure λn is a complete Radon measure.
Proof. Use Theorem 1.21 and Remark 15.2.4 to prove that λn is a Radon measure.
Completeness is a consequence of Caratheodory’s construction, see Theorem 4.5.
For the first counterexample consider the Cartesian product H × H, where H is a measurable
non-Borel subset of R (Exercise 1.14.b) and use Lemma 11.2. The second one can be find in
Remark 11.10.
26.4. Exercise. Show that the Lebesgue measure is the only complete Radon measure on Rn
that assigns to each interval its volume.
26.5. Exercise. The Lebesgue measure is translation-invariant: If A ∈ Mn and x ∈ Rn , then
A + x ∈ Mn and λn (A + x) = λn A.
26.6. Exercise. Let ν be a nontrivial translation-invariant complete Radon measure on Rn .
Show that there is c > 0 such that ν = cλn .
Hint. Let c = ν(0, 1)n . Evaluate the ν-measure of arbitrary intervals and use Exercise 26.4 to
compare λn and 1c ν.
104 26. Lebesgue Measure and Integral on Rn
In other words,
f (x, y) dx dy = f (x, y) dy dx.
M Rn Mx
Proof. To prove the assertion it suffices to use Theorems 11.11. and 26.2.
26.10. Remarks. 1. The assumption f ∈ ∗ (M ) is satisfied if, for instance, f ∈ 1 (M )
(Tonelli’s theorem), or if f is a non-negative measurable function (Fubini’s theorem in a narrow
sense). When applying Fubini’s theorem, we usually start with Fubini’s theorem in a narrow
sense in order to show that |f | is integrable, and then we use Tonelli’s theorem.
2. Let πM denote the projection of M into Rn (i.e. πM is the set of all points x for which M x
is nonempty). Then Z Z „Z «
f (x, y) dx dy = f (x, y) dy dx
M πM Mx
provided πM is measurable. In general, the measurability of M does not imply the measurability
of πM . Observe that πM is measurable whenever M is open (then πM is also open), or compact
(then πM is compact), or a countable union of compact sets (e.g. a closed set). Projections of
Borel sets are also measurable but the proof is difficult. The study of projections of Borel sets
led to the notion of analytic sets (see D.L. Cohn [*1980]).
R
26.11. Example. We evaluate the integral B f (z) dz, where f is an integrable function on
the ball B := {z ∈ R : |z| < 1}. Let usq
3 write z in the form [x, y], where x = [z1 , z2 ] ∈ R2 and
y = z3 ∈ R. Then B x = {y ∈ R : |y| < 1 − |x|2 } and πB = {x ∈ R2 : |x| < 1}. Hence
Z Z Z √1−|x|2 !
f (z) dz = √ f (x, y) dy dx
B πB − 1−|x|2
0 q 0 q 1 1
Z 1 Z 1−z2 Z 1−z2 −z2
1 1 2
= @ q @ q f (z1 , z2 , z3 ) dz3 A dz2 A dz1 .
−1 − 2
1−z1 − 2 −z 2
1−z1 2
E. Integration on Rn 105
j=1,...,k
We say that ϕ is (Fréchet) differentiable at z if there exists a linear mapping
L : Rk → Rn such that
then ϕ(L) = M ∩ {[x, y] : x > 0, y > 0}. By the change of variable formula 26.13 we get
Z Z π
4
λ2 (M ) = 4 r dr dt = 4a2 cos 2t dt = 2a2 .
L 0
106 26. Lebesgue Measure and Integral on Rn
Hint. Consider first the case when ϕ is piecewise constant and then approximate ϕ by piecewise
constant functions.
n/2
(b) Show that κn = Γ(πn +1) .
2
R 2
Hint. Write In = Rn e−|x| dx. Using Fubini’s theorem we obtain In = I1n . By the previous
part (a) Z ∞
2 n n n
In = nκn rn−1 e−r dr = κn Γ( ) = κn Γ( + 1) .
0 2 2 2
It is easy to compute I2 = κ2 Γ(2) = π. Hence In = π n/2 and we are done.
and consequently
r
f ∗ gr =
r f (x − y)g(y) dy dx ≤ f p gq
r−p r−q
(f˜ ∗ g̃)(x) dx
R n Rn Rn
r−p r−q p q r r
≤ f p gq f p gq = f p gq .
Proof. (a) Let 1 < p < +∞. The Hölder inequality gives (for a fixed x)
p1
p−1
p
n f (x − y) dμ(y) ≤ |f (x − y)| d |μ| (y) (|μ| (Rn )) p ,
R n R
Prove that:
(a) ∗E ≤ λ∗ E ≤ λ∗ E ≤ ∗E for any set E ⊂ Rn ;
(b) ∗ G = λG for any open set G;
(c) ∗ K = λK for every compact set K;
(d) for a bounded set E, ∗E = ∗ E if and only if λ(∂E) = 0.
26.25. Exercise. Let G ⊂ Rn be an open set. Show that there exists a disjoint collection of
balls in G whose union covers G except on a null Lebesgue set.
Hint. The assertion is an easy consequence of Vitali’s covering theorem 27.2. Here we indicate
an elementary proof. We can assume that G is of a finite measure. Set G0 = G. By virtue of
Exercise 26.24 there are closed cubes Q1 , . . . , Qp whose interiors are pairwise disjoint and whose
union has a measure greater than 12 λG. Set c = 2−n λB(0, 1/2). Then in the interior of every
cube having a measure α there is a closed ball of measure cα. Hence we get closed pairwise
disjoint balls Bj ⊂ Qj whose union F1 has a measure greater than 2c λG. Set G1 = G0 \ F1 .
Inductively, there are Fi+1 ⊂ Gi := Gi−1 \ Fi which are unions of pairwise disjoint closed balls
with λGi ≤ (1 − 2c )λGi−1 ≤ · · · ≤ (1 − 2c )i λG.
E. Integration on Rn 109
Hint. It may be supposed that λ∗ E < +∞. Choose an ε > 0 and find an open set G ⊃ E with
λG < λ∗ E + ε. Thanks to Exercise 26.25 there exists a countable disjoint collection {Vj } of
open balls so that {Vj } covers G except on a null set N and
X [
λVj = λ Vj ≤ λG + ε .
j j
S P
Find a countable family {Ij } of open intervals such that Ij ⊃ N and λIj ≤ ε. According
S mj P jm
to Exercise 26.24 there exist closed cubes Qj with Qj ⊃ Ij and
m λQj < 2λIj . For each
m m
cube Qm m
j find an open ball Wj with the same centre and with radius equal to the diameter
of this cube (so that Wjm contains Qm
j ). It is not hard to check that there exists a constant c
(depending only on the dimension n) such that λWjm ≤ cλQm j . Whence
X X X
λWjm ≤ c j ≤ 2c
λQm λIj ≤ 2cε.
j,m j,m j
To finish the proof, it is enough to consider the union of all balls from collections {Vj } and
{Wjm }.
!k−1 "
sk = sup r > 0 : B(x, r) ∈ V , B(x, r) ∩ Bj = ∅, B(x, r) ⊂ H .
j=1
The Vitali property and the additional assumptions ensure that sk > 0. Further
+∞ > s1 ≥ s2 ≥ . . . since H is of finite measure. Now choose a ball Bk =
B(xk , rk ) ∈ V with rk > sk /2 and Bk ∩ Bj = ∅ for j < k.
We get a sequence A := {Bk , k ∈ N} of pairwise disjoint sets from V . We
show that A covers
A except on a set of Lebesgue measure zero. To this end let
p ∈ N and x ∈ A \ A be given. We now invoke the Vitali property and find a
ball B = B(y, s) ∈ V with x ∈ B
and B ∩ Bj = ∅ for j = 1, . . . , p − 1. Clearly
s ≤ sp . Since λH < ∞, we have rkn < ∞ and lim sk = lim rk = 0. Therefore
k
we can find q ≥ p with sq+1 < s ≤ sq . By definition of sq+1 there exists i ≤ q
with Bi ∩ B = ∅. If z denotes a point of Bi ∩ B, then
∞
A\ A ⊂ B(xi , 5ri ) ,
i=p
and thus
∞
λ(A \ A)≤ 5n λB(xi , ri ) .
i=p
N
A⊂ {U (x, Δ(x)) : x ∈ Dk } .
k=1
inf{Δ(xi ) : i ∈ N} ≤ inf{|xi − xj | : i, j ∈ N, i = j} = 0.
Pj = {i ∈ {1, . . . , j − 1} : Bi ∩ Bj = ∅} ,
We shall suppose that the last assertion does not hold, and derive a contradic-
tion. Select z := xj ∈ D. If number of balls Bi , i < j intersecting Bj is ≥ N ,
then there exist z1 , z2 , z3 ∈ D and t ∈ S so that
zk = xjk for j1 < j2 < j3 < i
and
z − zk 1
|z − zk | − t < 40 for k = 1, 2, 3 .
For k = 1, 2, 3 denote
r = Δ(z), rk = Δ(zk ), Rk = |z − zk | .
We will show that r, rk and Rk satisfy the following system of 16 inequalities
(I) r > 0,
(IIk ) Rk ≤ rk + r,
8
(IIIk ) r < rk ,
7
8
(IV1 ) r3 < r2 ,
7
8
(IV2 ) r3 < r1 ,
7
8
(IV3 ) r2 < r1 ,
7
(Vk ) rk ≤ R k ,
1
(VI1 ) r2 ≤ |R2 − R3 | + r3 ,
8
1
(VI2 ) r1 ≤ |R1 − R3 | + r3 ,
8
1
(VI3 ) r1 ≤ |R1 − R2 | + r2 ,
8
which does not have any solution. Therewith we get the required contradiction.
Now, we will derive the system of inequalities (I) – (VI3 ). The inequality (IIk )
says that the ball B(zk , rk ) intersects B(z, r). The inequalities (III), (IV) follow
from the following inequalities
8
Δ(xj ) ≤ si < Δ(xi ) for i < j .
7
Since Δ(xi ) < |xj − xi | for i < j, we get (V) and (VI). For instance, to obtain
(VI1 ) we use the definition of N , (II3 ) and (III3 )) in order to show that
|z3 − z| |z3 − z|
r2 ≤ |z2 − z3 | ≤ z2 − z −
(z2 − z) + z3 − z −
|z2 − z| |z2 − z|
z3 − z z2 − z 1
≤ |R2 − R3 | + R3 − ≤ |R2 − R3 | + 2R3
|z3 − z| |z2 − z| 40
1 15 1
≤ |R2 − R3 | + (r3 + r) ≤ |R2 − R3 | + r3 ≤ |R2 − R3 | + r3 .
20 20 · 7 8
E. Integration on Rn 113
Now we show that the system of inequalities (I)–(VI) does not possess a solution.
First, note that rk and Rk are positive by (I), (III) and (V). By (II1 ), (II2 ), (V1 ),
(V2 ), (IV3 ) and (III2 ) we have
7 3
|R1 − R2 | ≤ r+|r1 − r2 | = r+r1 +r2 −2 min(r1 , r2 ) ≤ r+r1 +r2 − r2 < r1 + r2 .
4 7
On the other hand, (VI1 ), (VI2 ) and (IV1 ) yield
1 1 5
|R1 − R3 | + |R2 − R3 | ≥ r1 − r3 + r2 − r3 ≥ r1 + ,
8 8 7
whence
|R1 − R2 | < |R1 − R3 | + |R3 − R2 | .
Similarly we obtain
and
|R2 − R3 | < |R1 − R3 | + |R2 − R1 | .
We see that the system (I)–(VI3 ) has no solution and this contradiction yields
the required conclusion.
For the general case when A is unbounded, let K > 2 sup Δ(x), and denote
x∈A
N
that for D := Dk and k ∈ {1, 2, . . . , N }, the collection of balls
k=1
{B(x, Δ(x)) : x ∈ Dk }
114 27. Covering Theorems
Denote
Ek = B(x, Δ(x)), U= U (x, Δ(x)).
x∈Dk x∈D
We may find and index q ∈ {1, . . . , } with μEq ≥ N1 μU and a finite set F ⊂ Dq
such that 1
μ B(x, Δ(x)) > μU.
2N
x∈F
Proof. We will construct recursively sequences of sets {Gk } and {Ak } according
to the following rule: Set G0 = H, A0 = A. Suppose that in the k th step the open
sets G0 ⊃ · · · ⊃ Gk−1 and sets A0 ⊃ · · · ⊃ Ak−1 have been chosen. By Lemma
27.5 there is an open set Uk , Ak−1 ⊂ Uk ⊂ Gk−1 and a closed set Mk ⊂ Gk−1 so
that Mk is the union of a finite pairwise disjoint collection of balls of V , and
1
μ(Uk \ Mk ) < 1− μUk .
2N
Set
Ak = Ak−1 \ Mk , Gk = Uk \ Mk .
Our construction guarantees that A \ Gk is covered by the selected balls (which
constitute a finite pairwise disjoint subcollection of V whose union is M1 ∪· · ·∪Mk )
and k
1 1
μGk < 1 − μGk−1 ≤ · · · ≤ 1 − μG0 .
2N 2N
Taking sufficiently large k, we get the desired conclusion.
E. Integration on Rn 115
in such a way that the balls B(x, r), (x, r) ∈ Sk are pairwise disjoint and
[
B(x, r) ∩ {B(y, x) : (y, s) ∈ Sk }
= ∅
Since [
A⊂ B(u, r),
(u,r)∈W
there is (y, s) ∈ W with x ∈ B(y, s). Further find k ∈ N satisfying τ −k < s ≤ τ −k+1 . Then
either (y, s) ∈ Sk (and there is nothing to prove), or B(y, s) intersects some of the “previous”
balls. More precisely, there exists z ∈ B(u, r) ∩ B(y, s), where (u, r) ∈ S1 ∪ · · · ∪ Sk . Obviously
s < τ r, and therefore
S
27.10. Exercise. Let be a collection of open balls in Rn , M = , c < λM . Prove that
there exist disjoint balls U1 , . . . , Uk ∈ such that
X
k
c
λUj > .
j=1
3n
Hint. Find a compact set K ⊂ M and a finite subcollection ⊂ so that λK > c and
S ⊃ K. Select the ball U ∈ with the greatest radius. Again, select among the balls of
1
which do not intersect U1 the ball U2 with the greatest radius. Suppose the balls U1 , . . . , Uj−1
have been chosen and find among the balls of which do not intersect U1 , . . . , Uj−1 the ball
Uj with the greatest radius. After a finite number of steps, we get a sequence {U1 , . . . , Uk } such
that we cannot add another ball. If x ∈ K, then x ∈ U for some U ∈ and the construction
implies that U ∩ Ui
= ∅ for some i. Let i be the smallest of such indices. Then the radius R of
the ball U is not greater than the radius of Ui , so that x lies in the ball with the same center as
Ui and radius 3R. Hence
Xk
c < λK ≤ 3n λUj .
j=1
27.11. Exercise. Prove Vitali’s covering theorem 27.2 using Exercise 27.10 in a similar way
as Theorem 27.6 was derived from Lemma 27.5.
27.12. Notes. The famous covering theorem is due to G. Vitali [1908] who proved it for closed
intervals and the Lebesgue measure. Later on, Vitali’s covering theorem was generalized in var-
ious directions by many authors (H. Lebesgue [1910], S. Banach [1924], C. Carathéodory (second
edition of the book [*1918] in 1927)). Another step forward was made by A.S. Besicovitch [1945],
[1946] and by A.P. Morse [1947]. The origin of the simple covering theorem which appeared in
Exercise 27.10 is in Wiener’s article [1939].
μB(x, r) μB(x, r)
Dμ (x) := lim sup and Dμ (x) := lim inf .
r→0+ λB(x, r) r→0+ λB(x, r)
(If μ is absolutely continuous with respect to λ, we can also use Vitali’s covering
theorem 27.2.) Then
μA ≤ μB(xi , ri ) ≤ (a + ε) λB(xi , ri ) ≤ (a + ε)λG ≤ (a + ε)(λA + ε) ,
i i
and
{x ∈ I : Dμ (x) < Dμ (x)} = A(r, s).
r,s∈Q+
28.5. Remark. Consider now the simple case when F is the distribution function of a Radon
measure μ on R. We know that F has a (finite) derivative F almost everywhere (Lebesgue’s
R R
theorem 22.5), that ab F dλ ≤ F (b) − F (a) (Theorem 22.7), and that ab F dλ = F (b) − F (a)
for any interval [a, b] if and only if μ λ (Corollary 23.5 and Exercise 24.7). In the last case,
F agrees almost everywhere with the Radon-Nikodým derivative dμ dλ
. An analogous assertion
holds for differentiation of measures and it is contained in the next Theorem 28.6.
Proof of the following theorem is based on Lemma 28.3. A proof based on the
covering theorem of Exercise 27.10 can be found in W. Rudin [*1974].
118 28. Differentiation of Measures
+∞
≤β μBk (β) ≤ βμB ,
k=−∞
When taking the limit as β → 1+, we get both (a) and (b).
28.7. Theorem. The following statements about a Radon measure μ on Rn
are equivalent:
(i) μ λ;
(ii) μB = B Dμ dλ for every Borel set B ⊂ Rn ;
(iii) Dμ is the Radon-Nikodým derivative of μ with respect to λ;
(iv) Dμ < +∞ μ-almost everywhere on Rn .
Proof. The previous theorem says that (i) =⇒ (ii), thus the equivalence (i) ⇐⇒
(ii) ⇐⇒ (iii) is obvious. To show that (i) implies (iv), use Theorem 28.6. Assume
now (iv) and let A ⊂ Rn , λA = 0. Using Lemma 28.3.a, we get
μ{x ∈ A : Dμ (x) ≤ k} ≤ k λA = 0
28.10. Notes. We could define the (symmetric) derivative of a measure μ at x using cubes
with the center at x whose lenght of edges converges to zero.
Furthermore, we could consider more general sequences of sets “shrinking” in a suitable way
to x. We could consider, for instance, all sequences of intervals containing x whose diameters
converge to zero. Let us note that for these general “derivatives of measures” analogous theorems
to those of this paragraph fail to hold, and even analogies of Theorem 29.2 (which is their direct
consequence) do not hold. An example can be found in M. de Guzmán [*1975], Chapter V., §2.
On the topic of differentiation of measures the reader is invite to consult for instance,
W. Rudin [*1974], M. de Guzmán [*1975] or J. Lukeš, J. Malý and L. Zajı́ček [*1986].
λ(U (z, r) \ Mj )
< 2−j−k
λU (z, r)
∞
for all j = 1, . . . , k and r ∈ (0, rk ). Set Aj = U (z, rj ) \ Mj and M = U \ j=1 Aj .
We have
λ(U (z, r) ∩ Aj )
≤ 2−j
λU (z, r)
for all r > 0. Choose k ∈ N and r ∈ (0, rk ). Then
λ(U (z, r) ∩ Aj ) 2−j−k for j ≤ k ,
≤
λU (z, r) 2−j for j > k .
Whence
λ(U (z, r) \ M )
≤ 2−k+1 ,
λU (z, r)
and z is a density point of M . If x ∈ M ∩ U (z, rk ), then x ∈ Mk and consequently
|f (x) − f (z)| < k1 . Therefore lim f (x) = f (z). The reverse implication is
x→z,x∈M
obvious.
29.9. Denjoy’s Theorem. A function f : Rn → R is Lebesgue measurable if
and only if f is approximately continuous at almost all points of Rn .
Proof. Let f be approximately continuous at almost all points. Denote by N
the set of all points of approximate discontinuity of f . Take any c ∈ R. We
will show that the set M := {x ∈ R : f (x) > c} is measurable. Since M is a
E. Integration on Rn 121
30.1. Lipschitz Mappings. Let (P1 , ρ1 ), (P2 , ρ2 ) be metric spaces and β > 0.
Recall that a mapping f : P1 → P2 is said to be β-Lipschitz if
ρ2 (f (x), f (y)) ≤ βρ1 (x, y)
for every x, y ∈ P1 . We say that f is Lipschitz on P1 if there exists β > 0 for
which f is β-Lipschitz. A mapping f : P1 → P2 is called locally Lipschitz if for
any z ∈ P1 there exists its neighbourhood U such that f is Lipschitz on U (in
this case, the constant β can vary from point to point).
30.2. Lemma. Let f be a continuous function on an open set G ⊂ Rn and
i ∈ {1, . . . , n}. Then the set of those points at which ∂x
∂f
i
fails to exist is a Borel
set.
Proof. To simplify the proof, we can assume that G = Rn . Denote
f (x + tei ) − f (x) 1 1 1
um,k (x) = sup : t ∈ (− , ), |t| ≥ ,
t m m k
f (x + tei ) − f (x) 1 1 1
vm,k (x) = inf : t ∈ (− , ), |t| ≥ ,
t m m k
u(x) = inf sup um,k (x) , v(x) = sup inf vm,k (x).
m k>m m k>m
Then, for k > m, um,k and vm,k are continuous. Since the set of points where
∂xi (x) exists equals {x ∈ R : −∞ < u(x) = v(x) < +∞}, the assertion follows.
∂f n
122 30. Lipschitz Functions
Proof. Assume that f is a β-Lipschitz function. Let E be the set of all points
where f fails to have some of the partial derivatives. Using Fubini’s theorem, the
one-dimensional theorem on differentiability of Lipschitz functions (Lemma 22.4)
easily implies that E is of measure zero (notice that E is measurable by Lemma
30.2).
Now, for p, q ∈ Qn and m ∈ N denote
f (x + tei ) − f (x)
Sp,q,m = x ∈ G \ E : pi < < qi for all i = 1, . . . , n
t
1 1
and for t ∈ − , \ {0} .
m m
Let S̃p,q,m be the set of all density points of Sp,q,m . By the Lebesgue density
theorem 29.2 λ(Sp,q,m \ S̃p,q,m ) = 0. If
N := (Sp,q,m \ S̃p,q,m ) ,
p,q,m
∂f
qi − ε < pi < (x) < qi , i = 1, . . . , n.
∂xi
Then there exists m ∈ N with x ∈ S := Sp,q,m . Since x is outside !of "N , x is even
n
a density point of S. Find δ ∈ (0, m 1
) such that λ(U (x, r) \ S) ≤ 2ε λ(U (x, r))
for all r ∈ (0, 2δ). In particular, notice that U (x, (1 + ε)τ ) \ S does not con-
tain any ball of radius ετ if τ ∈ (0, δ). Choose y ∈ U (x, δ) and denote y i =
(y1 , . . . , yi , xi+1 , . . . , xn ). For any i ∈ {1, . . . , n}, let Ui be the ball with center y i
and radius ε |y − x|. The choice τ = |y − x| yields that S ∩ Ui = ∅ for all i. If
z i ∈ S ∩ Ui and wi := z i−1 + (yi − xi )ei , then
i
w − y i = z i−1 − y i−1 ≤ ε |y − x| ,
f (wi ) − f (z i−1 ) ∂f
pi < < qi and pi < (x) < qi .
yi − xi ∂xi
Whence
f (wi ) − f (z i−1 ) − ∂f (x)(yi − xi ) ≤ (qi − pi )|yi − xi | ≤ ε |y − x| .
∂xi
E. Integration on Rn 123
Summarizing, we get
∂f
f (y) − f (x) − (x)(yi − xi )
∂x i
i
∂f
≤ f (w ) − f (z ) −
i i−1
(x)(yi − xi )
∂x i
i
! "
+ f (wi ) − f (y i ) + f (z i−1 ) − f (y i−1 )
i
≤ ε(n + 2βn) |y − x| .
30.6. Remark. Let f be a β-Lipschitz mapping from E ⊂ Rn into Rk . Then the coordinates
of f can be extended separately by the previous theorem and we get a kβ-Lipschitz extension
f ∗ : Rn → Rk of f . However, this extension fails to be β-Lipschitz. There exists a stronger
extension theorem for mappings due to Kirszbraun which guarantees the existence of a β-
Lipschitz extension. The proof of this assertion is more difficult.
30.7. Exercise. Suppose E ⊂ Rn and f : E → Rn is a β-Lipschitz mapping. Prove that
λ∗ f (E) ≤ β n λ∗ E.
Hint. We can assume that λ∗ E < +∞. Fix an ε > 0. By Lemma 26.26 find a sequence
S P
{U (xj , rj )} of open balls so that E ⊂ U (xj , rj ) and λU (xj , rj ) < λ∗ E + ε. Since
j
f (U (xj , rj )) ⊂ U (f (xj ), βrj ) for every j,
X X
λ∗ (f (E)) ≤ λU (f (xj ), βrj ) = β k λU (xj , rj ) ≤ β k (λ∗ E + ε).
j j
124 31. Approximation Theorems
31.1. Space D(Ω). Let Ω ⊂ Rn be an open set. We denote by D(Ω) the linear
space of all infinitely differentiable functions on Ω with compact support in Ω.
Our aim is to prove that D(Ω) is often dense in other function spaces. The first
task is to construct a nontrivial infinitely smooth function on Rn with compact
support.
Set 1/(|x|2 −1)
αe if |x| < 1,
χ1 (x) =
0 if |x| ≥ 1 ,
where the constant α is chosen in such a way that Rn χ1 (x) dx = 1. Denote
χk (x) = k n χ1 (kx).
Then Rn χk (x ± y) dy = 1 for any k ∈ N and x ∈ Rn . A simple calculation
shows that the functions χk are infinitely differentiable (first reduce the proof to
a problem of differentiability of functions of one variable).
Now, if f is a locally integrable function, then Theorem 9.2 gives immediately
that χk ∗f are infinitely differentiable functions as well. Likewise, the convolutions
χk ∗ μ are infinitely differentiable if μ is a Radon measure.
31.2. Lemma. Suppose p ∈ [1, ∞), f ∈ L p (Rn ) and k ∈ N. Then χk ∗ f ∈
L p (Rn ) and χk ∗ f p ≤ f p .
Proof. According to generalized Young’s inequality 26.20,
χk ∗ f p ≤ f p · χk 1 ≤ f p
fj and this set, according to Denjoy’s theorem 29.9, is of full measure. By the
Lebesgue convergence theorem with dominating function (2j)p cB(0,j+1) we have
lim fj − χk ∗ fj p = 0.
k→0
Hence,
f − χk ∗ f p ≤ f − fj p + fj − χk ∗ fj p + χk ∗ fj − χk ∗ f p < 3ε ,
provided k is large enough.
31.4. Theorem. Let Ω ⊂ Rn be an open set and p ∈ [1, ∞). Then D(Ω) is a
dense subset of L p (Ω).
Proof. Set Ωj = {x ∈ Ω : |x| < j and dist(x, ∂Ω) > 1j }. Take a function
f ∈ L p (Ω) and denote gj = f cΩj . As in the previous proof, it is clear that
f − gj p → 0. For all k > j we get χk ∗ gj ∈ D(Ω). Now the estimate
f − χk ∗ gj p ≤ f − gj p + gj − χk ∗ gj p
yields the assertion.
31.5. Theorem. Let f be a continuous function with compact support contained
in an open set Ω ⊂ Rn . Then χk ∗ f ⇒ f on Ω.
Proof. We can assume that Ω = Rn . Since f is uniformly continuous, given ε > 0
there is k0 ∈ N such that |f (x) − f (y)| < ε whenever x, y ∈ Rn , |x − y| < k10 . If
k > k0 and x ∈ Rn , then
|f (x) − χk ∗ f (x)| = (f (x) − f (y))χk (x − y) dy
R
n
= (f (x) − f (y))χk (x − y) dy
B(x,1/k)
≤ε χk (x − y) dy = ε .
Rn
We can see that (χk ∗ f )(x) → f (x) at all points x, where the derivative f (x)
exists and is approximately continuous. But the Rademacher theorem 29.9 and
Denjoy’s theorem 30.3 tell us that this happens almost everywhere.
31.8. Exercise. Give an alternative proof of the fact that (Ω) is dense in p (Ω) for
1 ≤ p < ∞.
Hint. If f ∈ c (Ω), then by Theorem 31.5 the functions χk ∗ f converge uniformly to f and
their supports are contained in a compact set (not depending on k). This yields that χk ∗ f
converge to f in the p -norm as well. Now it is sufficient to use the density of c (Ω) in p (Ω)
(Exercise 15.17).
31.9. Exercise. Use Theorem 31.4 to prove that the convolution f ∗ g is uniformly continuous
provided f ∈ p and g ∈ q , p1 + 1q = 1 and p, q > 1. (This will prove the proposition stated
in Remark 26.22.)
31.10. Riemann-Lebesgue Lemma. Suppose f ∈ 1 (Rn ). Then
Z
lim eix·t f (t) dt = 0.
|x|→∞ Rn
Z Z Xn
∂2g
− |x|2 eix·t g(t) dt = eix·t (t) dt
Rn Rn j=1
∂t2j
whence ˛Z ˛
˛ ˛
˛ eix·t g(t) dt˛˛ ≤ |x|−2 g1 .
˛
Rn
Now we invoke the fact that (Rn ) is dense in 1 (Rn ): Given f ∈ 1 (Rn ) and ε > 0, by
Theorem 31.4 there is g ∈ (Rn ) (g depends on f and ε) such that f − g1 < ε. Then
˛Z ˛
˛ ˛
˛ eix·t (f (t) − g(t)) dt˛˛ ≤ ε ,
˛
Rn
whence ˛Z ˛
˛ ˛
˛ eix·t f (t) dt˛˛ ≤ |x|−2 g1 + f − g1 .
˛
Rn
E. Integration on Rn 127
32. Distributions
for every open ball U containing the origin. Thus the “average density of the
δ-function” in the ball U (x, r) is
1
λU (x,r) if x ∈ U (0, r),
fr (x) :=
0 otherwise.
(Notice that x ∈ U (0, r) if and only if 0 ∈ U (x, r).) We can see that
∂ |α|
∂xα
1
1
. . . ∂xα
n
n
by the symbol Dα . Finally, recall that D(Ω) denotes the linear space of all
infinitely differentiable functions with compact support contained in Ω (see 31.1).
32.1. Notion of a Distribution. If {fk } is a sequence of functions from D(Ω),
D
we say that fk → 0 if there exists a compact set K ⊂ Ω such that supt fk ⊂ K
for every k and Dα fk ⇒ 0 on K for every multiindex α. Every linear functional
T on D(Ω) satisfying
D
T (fk ) → 0 whenever fk → 0
is called a distribution (on Ω). Thus every distribution is determined by its values
on D(Ω).
Let f be a locally integrable function on Ω (i.e. integrable on each compact
subset of Ω, in particular, f can be a continuous function). Set
Tf (ϕ) = f ϕ dλ
Ω
for ϕ ∈ D(Ω) (notice that the integral exists). It is not very difficult to prove that
D
Tf is a distribution: Suppose ϕk ∈ D(Ω) and ϕk → 0. If K is a compact set on
whose complement all ϕk ’s vanish, then
|Tf (ϕk )| ≤ max |ϕk (t)| |f | dλ
t∈K K
32.3. Examples. 1. In 32.1 we “embedded” locally integrable functions into the space
of distributions. Another class of objects contained in the space of distribution are Radon
measures. Let μ be a (positive) Radon measure on Ω. The functional Tμ defined as
Z
Tμ (ϕ) = μ(ϕ) (= ϕ dμ)
Ω
is again a distribution. In this sense, we can understand every Radon measure as a distribution.
For example, the Dirac measure (the “δ-function”) defined by Tδ (ϕ) = ϕ(0) is a distribution.
It is not regular. Indeed, if there were a locally integrable function f with Tf = Tδ , we could
easily deduce that f = 0 almost everywhere. But Tδ is not the zero distribution.
2. It is not true that every distribution is regular or is defined by some Radon measure. As
the example consider the functional ϕ → ϕ (0). However, there is a simple criterion which
enables to decide whether a distribution T is equal to Tμ . Namely, if μ is a Radon measure,
then the distribution Tμ is a positive functional. Conversely, if T is a distribution with T (ϕ) ≥ 0
whenever ϕ ∈ (Ω) is nonnegative, then there exists a Radon measure μ such that T = Tμ .
The last proposition is an easy consequence of the Riesz representation theorem 16.5.
3. For ε > 0, denote O(ε) = (−∞, −ε) ∪ (ε, +∞) and
Z
ϕ(x)
Rε (ϕ) = dx for ϕ ∈ (R)
x
O(ε)
(notice that the integral exists). It is not difficult to see that Rε is a distribution on R. Further,
for every ϕ ∈ (R) there is a finite limit lim Rε (ϕ). If we denote it by T 1 (ϕ), then T 1 is a
ε→0+ x x
distribution. The reader may find it instructive to prove this assertion directly, or using Remark
32.2.2. Notice also that the function x1 is not locally integrable on R. Thus, the distribution
T 1 which is often identified with the function x1 is not regular and also cannot be determined
x
by a Radon measure.
The set of all distributions is a linear space which is the topological dual to
D(Ω) (cf. Remark 32.2.1).
As mentioned above, every distribution is differentiable. To motivate the exact
definition, consider a function f with continuous derivative f on R (notice that
both f and f are locally integrable). In this case it is natural to require that Tf
should be the derivative of Tf . Since,
Tf (ϕ) = f ϕ = −Tf (ϕ )
R
In particular,
∂T ∂ϕ
(ϕ) = −T .
∂xj ∂xj
In the sense of identification of regular distributions and locally integrable func-
tions, we often speak about distributional derivatives of functions provided the
result of differentiation is a function.
130 32. Distributions
whenever ϕ ∈ (Ω). We see that the derivative of the distribution TY is the Dirac “δ-function”.
In a similar way we find that
Show that U is a distribution on R which is neither regular nor determined by a Radon measure.
E. Integration on Rn 131
P
n ∂2
10. Let := 2
be the Laplace operator and u(x) := |x|2−n . Our aim is to find u in the
i=1 ∂xi
∂u xi
distributional sense. First, we have that = (2 − n) n is a locally integrable function. Now
∂xi |x|
∂2u
the distributional derivative is a complicated distribution containing an “integral average”
∂x2i
similar to the distribution of Example 32.3.3. But we have an easier task, namely to find out
P
n ∂2u
the sum 2
of these distributions. If κn denotes the volume of the unit ball in Rn (cf.
i=1 ∂xi
Exercise 26.17), then we prove that
Z Xn
xi ∂ϕ
n (x) = −nκn ϕ(0)
Rn i=1
|x| ∂xi
u = n(2 − n)κn δ0 .
To this end let γ be a nonincreasing infinitely smooth function on (0, ∞), γ(t) = 1 for t < 1,
γ(t) = 0 for t > 2, |γ (t)| ≤ 2 and ηr (x) = γ(x/r). We use the decomposition
Then Z Xn
xi ∂ϕ
n (x) = Ar + Br + Cr + Dr ,
Rn i=1
|x| ∂xi
where
Z Xn
xi ∂ηr
Ar = ϕ(0) n (x) dx
Rn i=1
|x| ∂xi
Z Z 2r
= ϕ(0) |x|1−n r−1 γ (|x| /r) dx = ϕ(0)nκn r −1 γ (t/r) dt = −nκn ϕ(0)
Rn 0
and
Z Xn
xi ∂ (ϕ(1 − ηr ))
Dr = n (x) dx.
Rn i=1
|x| ∂xi
Set ( xi
|x|n
ϕ(x)(1 − ηr (x)) if x
= 0;
ψi (x) :=
0 if x = 0.
An easy calculation shows that ψi ∈ (Rn ) and
Xn
∂ψi Xn
xi ∂ (ϕ(1 − ηr ))
(x) = n (x).
i=1
∂xi i=1
|x| ∂xi
132 32. Distributions
Thus Z Xn
∂ψi
Dr = dx = 0.
Rn i=1
∂xi
The proof will be finished by estimating the integrals Br and Cr and taking the limit as r → 0.
Since ϕ ∈ (Rn ), there exists a constant K such that |∇ϕ(x)| ≤ K and |ϕ(x) − ϕ(0)| ≤ K |x|
for all x ∈ U (0, 1). Obviously, |∇ηr | ≤ r2 . We obtain
Z n ˛
X
˛ ˛ ˛ Z
˛ xi ˛ ˛ ∂ηr ˛
|Br | ≤ ˛ ˛ ˛ ˛ dx ≤ 4nKr |x|1−n dx → 0
˛ |x|n ˛ |(ϕ(x) − ϕ(0)| ˛ ∂x (x) ˛ r
B(0,2r) i=1 i B(0,2r)
and Z ˛ ˛ ˛ ˛ Z
X
n
˛ xi ˛ ˛ ∂ϕ ˛
|Cr | ≤ ηr (x) ˛˛ n ˛˛ ˛ ˛
˛ ∂x (x)˛ dx ≤ nK |x|1−n dx → 0.
B(0,2r) i=1 |x| i B(0,2r)
and we see that the multiplication in the space of distributions is almost the same as the
“pointwise” multiplication.
0 = 0 · T x1 = (xTδ )T x1 = Tδ (xT x1 ) = Tδ · 1 = Tδ .
32.10. Examples. (a) If Tk := Tsin kx on R, then {Tk } converges to zero (i.e. to the
distribution T0 ).
kx
(b) Let Tk be the regular distribution on R determined by the function sin πx
. Show that
Tk → Tδ (when keeping the convention of 32.1, the sequence of functions { sin
πx
kx
} converges to
the Dirac measure δ in the sense of distributions).
Hint. Choose ϕ ∈ (R). If ϕ = 0 outside the interval [−A, A], then
Z A Z A
(ϕ(x) − ϕ(0)) sin kx
πTk (ϕ) = sin kx dx + ϕ(0) dx.
−A x −A x
By the Riemann-Lebesgue Lemma 31.10, the limit of the first term is zero. Further it is known
that
ZkA
sin t
lim dt = π ,
k→∞ t
−kA
P
∞
We see that we can assign to the divergent series cos kx a “sum” but this one is a distribution
k=1
and not a function.
134 33. Fourier Transform
32.13. Notes. The theory of distributions in the framework of duality between topological
vector spaces was created by L. Schwartz in the late 1930’s and was fully published in [1947]
(because of the scientific isolation during the second world war he published his theory after-
wards). He also introduced the fundamental space (Rn ) suitable for the Fourier transform of
distributions.
Some related considerations were implicitly contained in earlier works of other authors. In
fact, L. S. Sobolev [1936] also introduced basic notions of the theory of distributions. Since he
was led by a special question concerning the solution of the Cauchy problem he did not realize
the power of his own ideas.
Concerning the history of the theory of distributions, we suggest also Lützen’s monograph
[*1982]. Other material can be found in J. Horváth [1970] or J. Horváth [*1966].
Proof. The proof will follow using Fubini’s theorem easily. It only needs to be
observed that f#g ∈ L 1 whenever f, g ∈ L 1 .
The following theorem plays a key role in applications of the Fourier transform
to the theory of partial differential equations.
33.3. Theorem. Suppose f ∈ L 1 (Rn ) and 1 ≤ j ≤ n.
(a) If xj f ∈ L 1 (Rn ), then
∂ f#
= (−ixj f ).
∂xj
E. Integration on Rn 135
lim f (x) = 0.
|x|→∞
Then
∂f /∂xj (x) = ixj f#(x).
leads to
∂ f#
(x) = −(2π)−n/2 e−ix·t itj f (t) dt.
∂xj Rn
, then #
2
33.4. Lemma. If h(x) = e−|x| /2
h = h.
Proof. Assume first that n = 1. By the previous lemma both h and # h are solutions
to the differential equation
u + xu = 0 .
Since h(0) = #
h(0) (see Exercise 26.17.b), we get # h = h. Assuming we had already
proved the theorem for n = 1, we get
2 2 2
e−ix·t e−|t| /2 dt = e−ix1 t1 −t1 /2 . . . e−ixn tn −tn /2 dt1 . . . dtn
Rn Rn
2 2 2
−ix1 t1 −t1 /2
e dt1 · · · e−ixn tn −tn /2 (2π)n/2 e−|x| /2 .
R R
Observe also that the Fourier transform of a function from L 1 needs not to be
an element of L 1 . If f is the indicator function of the interval [−1, 1], then
$
2 sin x
f#(x) = .
π x
We know that in the case of L 1 the answer is positive as we can take g = f#.
On the other hand, Exercise 33.9 shows that the Fourier transform of a constant
function (which is a function of L ∞ (Rn )) is not a function.
Let us note that even in the case when the Fourier transform of a function f
is a function g (in the sense mentioned above), the function g is not determined
pointwise (as in the original definition of the Fourier transform for f ∈ L 1 (Rn ))
but only except a null set.
In what follows, we restrict our attention to the case p = 2 and we state the
main theorem which guarantees a good theory for the Hilbert space L2 (Rn ). First
we define the inverse Fourier transform by the formula
f˜(y) = (2π)−n/2 eix·y f (x) dx , f ∈ L 1 (Rn );
Rn
T̃ (ϕ) = T (ϕ̃) , ϕ ∈ S (Rn ), T is a tempered distribution
˜ ˆ % %
fˆ = f , f˜ = f , %f#% = f
2 2
for f ∈ S (Rn ).
Proof. In Theorem 31.4 we proved even that D(Rn ) is a dense subset of L2 (Rn ).
˜ 2
Now we are going to prove the formula fˆ = f . Denote h(x) = e−|x| /2 . Re-
member, according to Lemma 33.4, that #h = h. Using the multiplication formula
33.2.b we obtain
# t t
f (t)h( ) dt = f ( )#
h(t) dt.
Rn k Rn k
When taking the limit as k → ∞ it follows that
f#(t) dt = f (0)#
h(t) dt = (2π)n/2 f (0).
Rn Rn
ˆ
The formula f = f˜ can be proved in a similar way. Notice that for the Fourier
¯
transform of the complex conjugated function we have fˆ¯ = fˆ and thus the mul-
tiplication formula yields
2
2 ¯
˜ ˆ¯ ¯ ˆ
|f | dx = f f¯ dx = f fˆ dx = f fˆ dx = fˆfˆ dx = f dx.
Rn Rn Rn Rn Rn Rn
33.8. Remark. The Plancherel theorem says that the (original) Fourier transform can be
uniquely extended from (Rn ) to L2 (Rn ). The result is a continuous linear isometry of the
space L2 (Rn ) onto itself. Thus is a unitary mapping and one can easily find out that also
the inner product in L2 (Rn ) is invariant with respect to it.
33.9. Exercise. Let δ be the Dirac δ-function. Prove that δb is the constant (2π)−n/2 and
that the Fourier transform of this constant is δ.
b is a solution to the equation
33.10. Exercise. Find a function u whose Fourier transform u
− b b=δ
u+u
in the sense of distribution (here is the Laplace operator, cf. 32.5.10).
33.11. Fourier Transform of a Function in L1 (Rn ). (a) Suppose f ∈ 1 (Rn ). Then fb
is a uniformly continuous function and lim fb(x) = 0 (in other words, fb ∈ 0 (R
n )).
|x|→∞
Hint. The assertion concerning the limit at infinity is a consequence of the Riemann-Lebesgue
lemma 31.10. As for the proof of uniform continuity it is no restriction to assume that f is from
the space (Rn ) since we know that (Rn ) is dense in L1 (Rn ) (Theorem 31.4). Suppose that
the support of f is contained in U (0, R). We use the inequality
˛ iy·t ˛
˛e − eix·t ˛ ≤ |t| |y − x|
to get
˛ ˛ ˛˛Z ˛
˛
˛b ˛
˛f (y) − fb(x)˛ = ˛˛ (eiy·t − eix·t )f (t) dt˛˛ ≤ R |y − x| f 1
Rn
Show that f is an indefinite Lebesgue integral of the (locally integrable) function fb.
33.13. Remark. Define the “Fourier transform” F on 1 (Rn ) by the formula
Z
1
Ff = e−ibx·t f (t) dt
a Rn
F (f ∗ g) = a F f F g;
F (F f (x)) = c f (−x);
∂(F f )
(x) = −ibxj F f (x);
∂xj
F f 2 = c f 2 .
(the sum of the convergent trigonometrical series with coefficients ck ). The (formal) series
X
b(k)eikx
u
k∈Z
(c) If u ∈ L2 (T ), then {b
u(k)} ∈ l2 (Z) (this follows from the Bessel inequality). Conversely, for
any sequence {ck } ∈ l2 (Z) there exists a unique function u ∈ L2 (T ) with ck = u b(k) (the Riesz-
Fischer theorem). This function can be obtained as the limit of the sequence {sk } ⊂ L2 (T ),
where
Xk
sk (x) = ck eikx , x ∈ T.
j=−k
√
Moreover, u2 = 2π {b
u(k)}2 (the Parseval identity). Compare with the Plancherel theorem
33.7.
(d) The mapping u → {b u(k)} and {ck } → c̃ can be also generalized to wider classes of
functions or sequences. We are not going to mention the details. Just notice that the mapping
{ck } → u, {ck } ∈ l2 (Z) given by the Riesz-Fischer theorem is in fact an extension of the mapping
{ck } → c̃, {ck } ∈ l1 (Z). However, in the case of l2 , the function u is determined only as an
element of L2 (T ), i.e. except of a null set.
(e) If u ∈ 1 (R) is a 2π-periodic function and v = u , then
b(k) = ikb
v u(k) , k ∈ Z.
The theory of Fourier series can be including as a particular case since series could be
considered as integrals with respect to the counting measure.
F. Change of Variable and k-dimensional Measures 141
σ ∗ E = {inf σS : S ⊃ E, S ∈ Σ}.
μ∗ E = inf{μF : F ∈ Σ, F ⊃ E}
as the sets of the form ϕj (Gj ) are measurable (Gj is a union of a sequence of
j
compact sets and each continuous image of a compact set is compact). According
to Exercise 30.7 we obtain that μ is a k-dimensional measure.
34.10. Volume. Let W be a k-dimensional linear space with an inner product
and L : Rk → W a linear mapping. Let Q : R → W be an isometry and
k
a := det(Q−1 L). Then λ(Q−1 L(E)) = a λE for any Borel set E ⊂ Rk . The
constant a is independent of the choice of the isometric mapping Q and has a
144 34. Change of Variable Theorem
σ(L(E)) = vol L λE
Since
det(aβi ,i , bβi ,j )ki,j=1 = 0
whenever any index in β is repeated, we obtain
det AT B = det(aβi ,i , bβi ,j )ki,j=1
β∈{1,...,m}k
= det(aβi ,i , bβi ,j )ki,j=1 = det(ATα Bα ).
α∈I β∈{α1 ,...,αk }k α∈I
Now the promised application to the volume follows: We set both A and B to be
the matrices of the mapping L. From the above computation we obtain
(vol L)2 = det AT A = det ATα Aα = (det Aα )2 .
α∈I α∈I
(e) σ ∗ ϕ(E) ∗
% −1≤% β vol L λ E for any set E ⊂ F ;
k
%
(f) if ε L % < 1, then
β −k α−k vol ϕ (t) dt ≤ σ ∗ ϕ(E) ≤ β k αk vol ϕ (t) dt
E E
|ϕ(s) − ϕ(t)| ≤ |ϕ(s) − ϕ(t) − L(s − t)| + |Ls − Lt| ≤ ε |s − t| + |Ls − Lt|
% %
≤ (ε %L−1 % + 1) |Ls − Lt|
and
∗ −k ∗ −k −k
σ ϕ(E) ≥ α vol L λ E ≥ β α vol ϕ (t) dt.
E
Now we will deal with locally Lipschitz mappings. We will frequently and
tacitly use the fact that (in view of the Rademacher Theorem 30.3) any Lipschitz
mapping ϕ has its derivative ϕ (t) almost everywhere, and that this is measurable
(as a function of t).
F. Change of Variable and k-dimensional Measures 147
{M ∈ F : M − L < ε(L)},
where L runs over F , is an open cover of F . We can thus find its countable
subcover and divide E ∩ {ϕ ∈ F } into a disjoint union of measurable sets E i ,
where for each i there is L ∈ F such
that for all t ∈ E i we have ϕ (t) − L <
i i
ε(L). Let i be fixed. We have E = Ep , where
p
Epi := t ∈ E i : for each s ∈ U (t, p2 ) we have
|ϕ(s) − ϕ(t) − L(s − t)| < ε(L) |s − t| .
Mn,k , it is a countable union of compact sets and we use the preceding part.
34.17. Sard Theorem. Let σ be a complete k-dimensional measure on Rn .
Let G ⊂ Rk be an open set and ϕ : G → Rk a locally Lipschitz mapping. Let
Z := {t : vol ϕ (t) = 0}. Then σ(ϕ(Z)) = 0.
Proof. It is enough to show that σ(ϕ(E)) = 0 for any set E := Z ∩ {ϕ ≤ m}.
If K := {M ∈ Mn,k : vol M = 0 a M ≤ m}, then K is a closed subset
of Mn,k . Choose δ > 0. With each L ∈ K we associate ε(L) > 0 such that
k−1
2k L ε(L) < δ and |Lt| ≥ ε(L) |t| for every t perpendicular to Ker L := {s ∈
k
R : Ls = 0}. According to Lemma 34.16 there is a disjoint partition E into
a countable union of measurable sets Dj such that for each j there is L ∈ K
satisfying
|ϕ(t) − ϕ(s) − L(s − t)| < ε(L) |s − t|
for each s, t ∈ Dj . Let j be fixed. The dimension d of the space Ker L is less
than k. Let P be the orthogonal projection of Rk onto Ker L and W be a (k-d)-
dimensional subspace of Rn orthogonal to L(Rk ). We find an isometric mapping
Q of the space Ker L to W and set M t%= Lt %+ ε(L)QP t. It is easily verified that
for ε = ε(L) we have M − L ≤ ε, %M −1 % ≤ 1/ε and vol M = εk−d vol L ≤
εk−d md ≤ εmk−1 . By Lemma 34.15 we get
% %
σ ∗ ϕ(Dj ) ≤ (2ε %M −1 %)k vol M λD ≤ 2k mk−1 ε λDj ≤ δ λDj .
For each compact set K ⊂ Dj the set ϕ(K) is compact and thus measurable.
There is a
sequence {Ki } of compact subset Dj such that λKi → λDj . Then the
set Fj := ϕ(Ki ) is a σ-measurable subset ϕ(Dj ) and
i
vol ϕ (t) dt ≤ τ σFj .
Dj
τ −1 σ ∗ ϕ(E) ≤ vol ϕ (t) dt ≤ τ σϕ(F ) .
E
Since F ⊂ E, it follows
σϕ(E) = vol ϕ (t) dt ,
E
which is the desired formula in the case of the indicator function of the set E.
Now, if ϕ is not
one-to-one on E, the same procedure as above yields a disjoint
partition E = Ej , where Ej are measurable sets and ϕ is one-to-one on Ej .
j
According to the preceding part of the proof, it follows that
cEj dσ = σϕ(Ej ) = vol ϕ (t) dt.
ϕ(G) Ej
g : Wx → Rn−k of class C such that the matrix g (x) has rank n−k and Wx ∩Ω =
Wx ∩ {g = 0}.
(It is an easy exercise to use the Implicit Function Theorem in order to get a parametrization.
Conversely, if ϕ : G → Ω is a parametrization of class
and x = ϕ(t), then there exists the
projection Π of the space Rn onto Rk such that (Π ◦ ϕ) (t)
= 0. Then by the Inverse Mapping
Theorem there exists (after an eventual restriction of the domain of ϕ) the inverse mapping
` ´
(Π ◦ ϕ)−1 . The equation g(x) = 0, where g(x) = x − ϕ (Π ◦ ϕ)−1 (Π(x)) , is the desired implicit
description of Ω on the neighborhood of x.)
Let Ωbe a k-dimensional surface and σ a k-dimensional measure on Rn . The
integral Ω f dσ is sometimes labelled as the curve or surface integral of f .
34.25. Example (helix). Let X = {[x, y, z] ∈ R3 : x = cos z, y = sin z}. We may parametrize
X by the mapping
ϕ(t) = [cos t, sin t, t] , t ∈ R.
Then
ϕ (t) = [− sin t, cos t, 1]
and q √
vol ϕ (t) = (− sin t)2 + cos2 t + 1 = 2.
and p
vol ϕs (t, a) = cos2 a sin2 a + sin2 t cos4 a + cos2 t cos4 a = cos a.
(b) Let S, X, N be as in (a). For the parametrization of X we may also use the “cylindrical
coordinates” ϕc = [x, y, z], where
x(t, h) = r cos t,
p
y(t, h) = r sin t, (r denotes 1 − h2 ),
z(t, h) = h
and q
vol ϕc (t, h) = h2 + (1 − h2 )(sin2 t + cos2 t) = 1.
152 34. Change of Variable Theorem
(c) Let S+ be the hemisphere {[x, y, z] ∈ S : z > 0}. Then S+ can be parametrized by
its projection into the plane determined by the axes x and y. Consider the parametrization
ϕp = [x, y, z], where
x(s, t) = s,
y(s, t) = t,
p
z(s, t) = 1 − s2 − t2 ,
on {(s, t) ∈ R2 : s2 + t2 < 1}. Then
0 1
1, 0
B C
ϕp (s, t) = @ 0, 1 A
√ −s √ −t
,
1−s2 −t2 1−s2 −t2
and
1
vol ϕp (t) = √ .
1 − s2 − t2
34.27. Example. By means of the parametrization of Example 34.26.b we will compute
two-dimensional surface measure of the unit sphere. Let S, X, G have the same meaning as in
the quoted example. Then
Z Z 1 Z 2π
σ(S) = σ(X) = vol ϕ dt dh = ( dt)dh = 4π.
G −1 0
34.28.
p Example. Let σ be a two-dimensional measure on S− := {(x, y, z) ∈ R3 : z =
− 1 − x2 − y 2 }. The integral Z
z 2 dσ
S−
posseses (up to constants) the physical meaning of the force by which (under the unit gravitation
acceleration) a ball with the unit density half immersed in a liquid is lifted (computed by the
integration of the gravitation forces).
Use spherical coordinates on G = {(t, a) ∈ R2 : t ∈ (0, 2π), a ∈ (−π/2, 0)}. Then
Z Z Z 0
2
z 2 dσ = sin2 a cos a dt da = 2π sin2 a cos a da = π ,
S− G −π/2 3
which is the half of the volume of the unit sphere. Therefore, we have verified Archimedes
principle in our particular case.
34.29. Example (the length of the graph of a function). Let Γ be the graph of a Lipschitz
function f : [0, 1] → R and σ a 1-dimensional measure on R2 . Let G = (0, 1) and ϕ(t) = (t, f (t)),
t ∈ G. Then the assumptions of Theorem 34.18 are satisfied and
Z 1 Z 1q
σ(Γ) = vol ϕ (t) dt = 1 + (f (t))2 dt.
0 0
34.30. Exercise. Derive the formula
Z b q
σ(Ω) = 2π f (t) 1 + (f (t))2 dt
a
in the case of two-dimensional measure of the surface Ω drawn in R3 by the rotation of the graph
of a function f around the z-axis. Suppose that the function f : (a, b) → (0, +∞) is Lipschitz
and set p
Ω = {[x, y, z] ∈ R3 : z ∈ (a, b), x2 + y 2 = f (z)}.
34.31. Rectifiable Sets. A set H ⊂ Rn is said to be k-rectifiable if there is a Lipschitz
mapping of a bounded measurable set E ⊂ Rk to H. It is a consequence of the Change of
Variable Formula 34.19 that on the σ-algebra generated by k-rectifiable sets all k-dimensional
measures coincide. On the other hand, there exist closed sets on which k-dimensional measures
may differ (for example, the k-dimensional measure of 34.9 may be different from the Hausdorff
measure of Chapter 36). The constructions of such sets are rather complicated.
F. Change of Variable and k-dimensional Measures 153
Notice that n-rectificable sets in Rn are just bounded Lebesgue measurable sets.
34.32. Notes. The concept of k-dimensional measures comes from A. Kolmogorov [1932]. Our
exposition follows the monograph by H. Federer [*1969] (see also L. C. Evans and R. E. Gariepy
[*1992]) and ideas due to D. Preiss. The classical Sard theorem (or, Morse-Sard theorem) for
1 –mappings was proved by A.P. Morse [1939] and A. Sard [1942].
where N (x, ϕ, G) is the number of points of the set ϕ−1 (x) (the Banach indica-
trix ). It is natural to ask what is the meaning of the integral
Jϕ (t) dt.
G
This problem leads to the notion of the degree of a mapping which turns out
to have many applications. Let us mention, for instance, its importance for the
topology of Euclidean spaces and solvability problems for nonlinear “algebraic”
equations.
First we need to prepare several auxiliary computational results.
35.1. Lemma. Let G ⊂ Rk be an open set and ψ1 , . . . , ψk twice continuously
differentiable functions on G. Then we have
(a)
k
∂ ! ∂ψi "
(−1)q+1 det i=2,...,k = 0.
q=1
∂tq ∂tj j=1,...,q−1,q+1,...,k
(b)
∂ ! ∂ψi "
k
(−1)q+1 ψ1 det i=2,...,k = det(∇ψ1 , . . . , ∇ψk ).
q=1
∂tq ∂tj j=1,...,q−1,q+1,...,k
k
k
(−1)q+1 det(ai,j,p,q ) i=2,...,k ,
q=1 p=1 j=1,...,q−1,q+1,...,k
where ⎧
⎪
⎪
∂ψi
if j = p = q ,
⎨ ∂tj
2
ai,j,p,q =
⎪
∂ ψi
if j = p = q ,
⎪
⎩
∂tq ∂tj
0 if p = q.
154 35. The Degree of a Mapping
Now it remains to realize that using the interchange of the order of differentiation
we get
(−1)q+1 det(ai,j,p,q ) i=2,...,k = (−1)p det(ai,j,q,p ) i=2,...,k
j=1,...,q−1,q+1,...,k j=1,...,p−1,p+1,...,k
for all p, q.
(b) If we apply the chain rule to the left-hand side of the equality, we obtain
∂ ! ∂ψi "
k
(−1)q+1 ψ1 det i=2,...,k
q=1
∂tq ∂tj j=1,...,q−1,q+1,...,k
k
∂ψ1 ! ∂ψi "
= (−1)q+1 det i=2,...,k
q=1
∂tq ∂tj j=1,...,q−1,q+1,...,k
k
∂ ! ∂ψi "
+ ψ1 (−1)q+1 det i=2,...,k .
q=1
∂tq ∂tj j=1,...,q−1,q+1,...,k
The first term on the right-hand side is just the expansion of the determinant
det(∇ψ1 , . . . , ∇ψk ) by the first row, the second one vanishes by the part (a).
35.2. Lemma. Let ψ1 , . . . , ψk be Lipschitz functions on Rk vanishing outside
a compact subset of Rk . Then
det(∇ψ1 , . . . , ∇ψk ) dt = 0.
Rk
Proof. We proceed in two steps. First, we assume that the functions ψ1 . . . ψk are
of class C 2 . By Lemma 35.1.b there are functions η1 , . . . , ηk vanishing outside
compact subsets of Rk such that
k
∂ηj
det(∇ψ1 , . . . , ∇ψk ) = .
j=1
∂tj
Further,
k
∂g
det(∇(g ◦ ϕ), ∇ϕ2 , . . . , ∇ϕk ) dt = ( ◦ ϕ) det(∇ϕi , ∇ϕ2 , . . . , ∇ϕk ) dt.
G i=1 G ∂xi
Among the terms of the sum on the right, only the first one may differ from
zero and it equals G (f ◦ ϕ) Jϕ dt. If we combine this result with an analogous
computation for ψi , we obtain the desired equality.
35.5. Lemma. Let G ⊂ Rk be a bounded open set and ϕ, ψ : G → Rk Lipschitz
mappings. Let B(y, r) be a closed ball of Rk which does not intersect any of the
segments joining ϕ(t) and ψ(t), t ∈ ∂G. Let f be an integrable function on Rk
with support in U (y, r). Then
f (ϕ(t)) Jϕ (t) dt = f (ψ(t)) Jψ (t) dt.
G G
156 35. The Degree of a Mapping
Proof. Denote
where ψ(t) = ϕ(t) + y − z. For y close enough to z the functions ϕ, ψ satisfy the
hypotheses of Lemma 35.5 and, in view of connectedness of Uρ , it follows that the
function
y → cK(y,ρ) (ϕ(t)) Jϕ (t) dt
G
is constant on Uρ . Hence
cK(y,ρ) (ϕ(t)) Jϕ (t) dt = a(ρ)λK(y, ρ).
G
Proof. The assertions (a) and (b) are obvious. Combining Change of Variable
Formula in 35.8 and 34.19, we get (c).
158 35. The Degree of a Mapping
(d) Assume that the equation ϕ(t) = y has no solution. Since ϕ(G) is a compact
set, there is an open neighborhood U of y such that U ∩ϕ(G) = ∅. Find a Lipschitz
function ψ close enough to ϕ such that U ∩ ψ(G) = ∅ and deg(y, ψ, G) = 0. Then
G ∩ ψ −1 (U ) = ∅ and by 35.8
0= Jψ = deg(y, ψ, G) λU = 0 ,
G∩ψ −1 (U )
which is a contradiction.
(e) We may assume that ϕ is a Lipschitz mapping. The set K := ϕ(G \ (G1 ∪
· · · ∪ Gm )) is compact and does not contain y. We find an open neighborhood U
of y whose closure does not intersect K. Then by 35.8
deg(y, ϕ, G) = Jϕ = Jϕ + · · · + Jϕ
G∩ϕ−1 (U ) G1 ∩ϕ−1 (U ) Gm ∩ϕ−1 (U )
= deg(y, ϕ, G1 ) + · · · + deg(y, ϕ, Gm ).
(f) Let s ∈ ϕ−1 (y). Since Jϕ (s) = 0, there is a neighborhood V of s such that
ϕ is one-to-one on V , ϕ(V ) is an open set (the Inverse Mapping Theorem) and
Jϕ has a constant sign on V . Then
λϕ(V ) deg(y, ϕ, V ) = Jϕ dt = sign Jϕ (s) |Jϕ | dt = sign Jϕ (s) λϕ(V ) .
V V
It is also clear that the set ϕ−1 (y) is isolated, and therefore ϕ−1 (y) is a finite
subset of G. We complete the proof using (e).
(g) We can assume that ϕ is of class C 1 . Let E = {t ∈ G : Jϕ (t) = 0}. By
the Sard Theorem 34.17, λ(ϕ(E)) = 0. Choose y ∈ Rk \ ϕ(∂G). Let U be a
neighborhood of y which does not intersect ϕ(∂G). Then there is x ∈ U \ ϕ(E).
Since deg(y, ϕ, G) = deg(x, ϕ, G), according to (f) deg(y, ϕ, G) is an integer.
35.10. Open Mapping Theorem. Let ϕ : G0 → Rk be a bilipschitz mapping
on an open set G0 ⊂ Rk . Then ϕ(G0 ) is an open set.
Proof. Let t ∈ G0 and y = ϕ(t). Let G be a bounded open set, t ∈ G ⊂ G ⊂ G0 ,
and U be an open neighborhood of y which does not intersect ϕ(∂G). If we prove
that deg(y, ϕ, G) = 0, then by 35.9.d the equation ϕ(s) = x has a solution for
any x ∈ U . Now, there is an open neighborhood V of t such that ϕ(V ) ⊂ U .
Denote V + = {s ∈ V : Jϕ(s)>0 } and V − = {s ∈ V : Jϕ(s)<0 }. Notice that by the
Rademacher Theorem 30.3, ϕ exists almost everywhere. From the definition of
a bilipschitz mapping it is clear that Jϕ (t) = 0 is impossible, and at least one
+ −
sets V , V contains a compact set K of positive measure. We obtain
of the
0 = K Jϕ = deg(y, ϕ, G) λϕ(K). Hence deg(y, ϕ, G) = 0.
35.11. Theorem on Orientation. Let ϕ be a locally bilipschitz mapping of a
connected open set G0 ⊂ Rk into Rk . Then Jϕ > 0 almost everywhere in G0 , or
Jϕ < 0 almost everywhere in G0 .
F. Change of Variable and k-dimensional Measures 159
The set function A → Hp (A) is called the p-dimensional (outer) Hausdorff mea-
sure.
As we show later, if P = Rn and k ≤ n is a nonnegative integer, there is a
constant κk such that Hk /κk is a k-dimensional measure on Rn in the sense of
definition 34.8. The measure Hk /κk is called the normalized Hausdorff measure.
160 36. Hausdorff Measures
The next series of theorems shows that Hp is a metric outer measure. There-
fore, we can apply Carathodory’s method and to derive that each Borel set is
Hp -measurable, and that the restriction of Hp to the Borel σ-algebra is a mea-
sure.
36.4. Theorem. Let γ be an metric outer measure on P . Then each Borel
subset of P is γ-measurable.
Proof. It would be clearly sufficient to prove that closed sets are γ-measurable.
To this end let a closed set F ⊂ P be given. Choose a test set T ⊂ P , γT < +∞
and denote
1 1
Pj = x ∈ T : ≤ dist(x, F ) < , j = 1, 2, . . . ,
j+1 j
P0 = x ∈ T : dist(x, F ) ≥ 1 .
m
!
m
"
γP2j = γ P2j ≤ γT
j=0 j=0
m
∞
for all m ∈ N. Similarly γP2j+1 ≤ γT , and we see that the series γPj
j=0 j=0
m
is convergent. Since for each m ∈ N, the distance between Pj and T ∩ F is
j=0
positive we have
∞ ∞
!
m
" ! "
γ(T \ F ) ≤ γ Pj + γ Pj ≤ γT − γ(T ∩ F ) + γPj .
j=0 j=m+1 j=m+1
F. Change of Variable and k-dimensional Measures 161
Letting m → ∞ we obtain
γ(T \ F ) ≤ γ(T ) − γ(T ∩ F ).
36.5. Remark. If γ is an outer measure on P for which every Borel set is γ-measurable, then
γ is already a metric outer measure.
∞
Proof. For each δ > 0 and each sequence {Aj } of subsets of P with Aj ⊃ E,
j=1
diam Aj < δ we have
∞
∞
Hp (f (E), βδ) ≤ (diam f (Aj )) ≤ β
p p
(diam Aj )p .
j=1 j=1
∞
If Aj is a sequence of subsets of K, Aj = K, then
j=1
∞
∞
(diam Aj )k ≥ 2−k λAj ≥ 2−k λK = 2−k .
j=1 j=1
Whence taking the infimum we obtain the desired lower estimate for Hk (K).
36.13. Normalized Hausdorff Measures on Rn . Let k be a nonnegative
integer, k ≤ n and κk := Hk ([0, 1]k × 0n−k ). Then Hk /κk is a k-dimensional
measure on Rn which will be labelled as the normalized Hausdorff measure.
The constant κk is equal to the number (4/π)k/2 Γ(1 + k2 ). The computation
is not easy, see C.A. Rogers [*1970].
36.14. Remarks. 1. The hints for computation k-dimensional measures of concrete rectifiable
sets are given in Chapter 34. We will not present the (difficult) examples of nonrectifiable sets.
2. Without essential changes of proofs a similar theory to that of this chapter can be built also
for the case of spherical measures (cf. Remark 36.3.4). The computation of the corresponding
constant analogous to κk is much easier, in fact it follows immediately from Exercise 26.26.
3. In terms of the Hausdorff measure the following version of the Sard theorem can be proved:
Let G ⊂ Rk be an open set and f : G → Rn an arbitrary mapping. Let E be the set of all
points t ∈ G at which the derivative f (t) exists and vol f (t) = 0 (which means that the rank
of the matrix f (t) is less than k). Then k (f (E)) = 0.
36.15. Notes. C. Carathéodory developed in [1914] the theory of the one-dimensional (“lin-
ear”) measure in n-dimensional Euclidean spaces. In the same paper he mentions the pos-
sibility of introducing k-dimensional measures in an n-dimensional space (for an integer k).
The k-dimensional measure for arbitrary positive k > 0 on Rn was introduced by F. Hausdorff
[1919]. The interesting Theorem 36.4 is due to C. Carathodory [*1918]. The theory of Haus-
dorff measures was developed very intensively, particularly A.S. Besikovitch published a great
amount of papers devoted to this topic. From monographs on Hausdorff measures we recommend
C.A. Rogers [*1970], K. J. Falconer [*1986] and P. Mattila [*1995].
G. Surface and Curve Integrals 163
In this chapter we continue a study of curve and surface integrals and state
a change of variable formula. Moreover, we derive formulae concerning relations
between the integration over the “interior” (of a set or a surface) and the integra-
tion over the “boundary”. Later on we will see that these formulae are particular
cases of general Stokes’ Theorem of the next chapter. They are, in fact, multi-
dimensional generalizations of the famous Leibniz formula
b
f (x) dx = f (b) − f (a) .
a
The reader perhaps appreciates that we include the explanation of three-dimen-
sional (and therefore most important from the point of view of the classical
physics) situations without a deeper excursion to multilinear algebra.
Recall, that the integral calculus on k-dimensional surfaces in Rn is based on
the notion of a k-dimensional measure (34.8) which exists (Theorem 34.9) and on
k-dimensional surfaces is uniquely determined (Remark 34.31). Underline that for
a basic understanding of the topic it is not essential to know how k-dimensional
measures were constructed.
Since all theorems of this chapter are only special cases of more general results
of Chapter 38, we will not disturb the presentation by their proofs.
For the one-dimensional measure on Rn we will use the notation s, while S
will be reserved for the (n − 1)-dimensional measure on Rn .
37.1. Vector Field, Gradient, Divergence, Curl. By a vector field on a set
X we understand a mapping f of a set X into Rn .
Let g be a function of class C 1 on an open set U ⊂ Rn . Then its gradient
grad g on U is defined as the vector field x → [ ∂x ∂g
1
∂g
(x), . . . , ∂xn
(x)]. (The difference
between the gradient and the derivative for functions of class C 1 consists only in
the convention that the gradient is a vector while the derivative is a linear form.)
Let f = [f1 , . . . , fn ] be a continuously differentiable (i.e. of class C 1 ) vector
field on an open set U ⊂ Rn . Then the divergence of the field f is the function
div f on U defined as
n
∂fi
div f = .
i=1
∂xi
If f is a continuously differentiable vector field on an open set U ⊂ R2 , its curl
is the function curl f defined by the formula
∂f2 ∂f1
curl f = − .
∂x1 ∂x2
Finally, if f is a continuously differentiable vector field on an open set U ⊂ R3 ,
we introduce its curl curl f as the vector field on U defined by
' ∂f3 ∂f2 ∂f1 ∂f3 ∂f2 ∂f1 (
curl f = − , − , − .
∂x2 ∂x3 ∂x3 ∂x1 ∂x1 ∂x2
164 37. Integral Calculus in Vector Analysis
∂x1 ∂x2
2 ∂x3 x1
37.3. Example (in R3 ). Let f (x) = [x1 , x22 , x3 x1 ]. Then div f = ∂x1
+ ∂x2
+ ∂x3
=
∂x2 ∂x2
1 + 2x2 + x1 , curl f = [ ∂x3 x1
∂x2
− 2
∂x3
, ∂x1
∂x3
− ∂x3 x1
∂x1
, 2
∂x1
− ∂x1
∂x2
] = [0, −x3 , 0].
Notice that the vector product is a binary operation if and only if n = 3. The
vector product is always perpendicular to its factors and its norm is
h „ « „ « „ «i
1, 1 1, 1 1, 1
[1, 1, 1] × [1, 2, 3] = det , − det , det = [1, −2, 1].
2, 3 1, 3 1, 2
37.7. Example. In R2 , the vector “product” of the vector [1, 2] is the vector [2, −1].
37.8. Exercise. Compute in R4 the vector product [1, 0, 1, 0] × [0, 0, 0, 1] × [0, −2, 0, 0].
37.12. Example. Let γ = {[x, y, z] ∈ R3 : x = cos z, y = sin z, 0 < z < 2π} be a helix (see
Example 34.25) and f (x, y, z) = [0, x, 3z 2 ]. We orient γ in such a way that the unit tangent
vector would have a positive z-coordinate. Parametrizing ϕ(t) = [x(t), y(t), z(t)] = [cos t, sin t, t]
we compute
1 1
t = √ [− sin z, cos z, 1] = √ [−y, x, 1] ,
2 2
and therefore
Z Z 2π Z 2π
f · t ds = (x y + 3z 2 z ) dt = (cos t(sin t) + 3t2 ) dt = π + 8π 3 .
γ 0 0
w1 × · · · × wn−1
n(x) = ,
|w1 × · · · × wn−1 |
∂ϕ
where wj = ∂t j
(t). Again, the vector n(x) is defined except on a S-null set and
using different parametrizations yields the same result outside a S-null set (for
more details see 38.14).
The field n(x) obtained in this way determines the orientation. A parametriza-
tion ϕ : G → Γ is positive provided for almost all t ∈ G we have
∂ϕ
∂ϕ ∂ϕ ∂ϕ
(t) × · · · × (t) = n(ϕ(t)) (t) × · · · × (t),
∂t1 ∂tn−1 ∂t1 ∂tn−1
and negative if the above equality holds having the opposite sign on the right-hand
side.
The field of unit normal vectors will be briefly called the normal field .
The integral Γ f · n dS is called the surface integral of the vector field f . It
satisfies the following Change of Variable Formula.
37.14. Theorem. Let Γ be an (n − 1)-dimensional oriented surface. Let
G ⊂ Rn−1 be an open set and ϕ : G → Γ a positive parametrization. Then for
any vector field f = [f1 , . . . , fn ], where fj ∈ L 1 (Γ, S), we have
∂ϕ ∂ϕ
f · n dS = f (ϕ(t)) · ( × ··· × ) dt.
ϕ(G) G ∂t1 ∂tn−1
G. Surface and Curve Integrals 167
where
Γ = {[x, y, z] ∈ R3 : z 2 = x2 + y 2 , 0 < z < 1}
and n is supposed to be oriented “outwards from the cone”
on G := {[r, t] : r ∈ (0, 1) and t ∈ (0, 2π)}. Obviously ϕ(G) differs from Γ only in a set of
measure zero, hence there is no difference between integration over Γ and over ϕ(G). We have
0 1
cos t, −r sin t
ϕ (r, t) = @ sin t,
r cos t A ,
1, 0
so that
∂ϕ ∂ϕ
× = [−r cos t, −r sin t, r].
∂r ∂t
In case of a positive parametrization this vector would be a positive multiple of the unit nor-
mal vector and thus it would be directed outwards from Ω. Since it is directed inwards, the
parametrization ϕ is negative (like in 37.21 it is possible to precise “outwards” and “inwards”).
The formula will differ only in the sign. We have
Z Z
[x, y, z 2 ] · n dS = [r cos t, r sin t, r2 ] · [r cos t, r sin t, −r] dr dt
Γ
Z G
π
= (r 2 − r 3 ) dr dt = .
G 6
37.16. Example. Let Γ := {[x, y, z] ∈ R3 : x2 + y 2 = z < 1}. Let the unit normal vector
1
n(x, y, z) = ± √ [2x, 2y, −1])
4z + 1
be oriented by the choice of its sign +. Then [t, r] → [x(t, r), y(t, r), z(t, r)] := [r cos t, r sin t, r2 ],
t ∈ (0, 2π), r ∈ (0, 1), is a positive parametrization and its range differs from Γ only in a set of
measure zero. Let f (x, y, z) = [2x, 0, 0]. Then
Z Z
f · n dS = 2x det(∇y, ∇z) dt dr
Γ (0,2π)×(0,1)
Z
= 2r cos t det(∇(r sin t), ∇(r2 )) dt dr
(0,2π)×(0,1)
Z
= 4r 3 cos2 t dt dr = π.
(0,2π)×(0,1)
Denote by Hk+ , Hk− the halfspaces (0, ∞) × Rk−1 and (−∞, 0) × Rk−1 , respec-
tively. Further denote by i the mapping of Rk−1 onto ∂Hk− defined as
x−a
t(a) = x→a
lim .
x∈γ
|x − a|
37.20. Example. Let h be a Lipschitz function on [−1, 1], h(−1) = h(1) = 0. Let γ =
{[x, y] : y = h(x), |x| < 1}. If we choose an orientation of the unit tangent vector t to γ so that
its x-coordinate is positive, then
1
t(x, y) = p [1, h (x)] .
1 + (h (x))2
G. Surface and Curve Integrals 169
R
We have to evaluate the integral γ f · t ds, where f (x, y) = [3x2 cos y, −x3 sin y]. A direct com-
putation does not seem to be very hopeful, particularly if the function h is rather complicated.
Nevertheless, since f = grad g for g = x3 cos y, by the Curve Integral Theorem we get
Z
f · t ds = g([1, 0]) − g([−1, 0]) = 2.
γ
Then the mapping ϕ(s, t, a) = ψ(s + 1, t, a), [s, t, a] ∈ (−1, 0] × (−π, π) × (− 12 π, 12 π) satisfies
requirements of 37.17 for [x, y, z] ∈ ψ(H). (If [x0 , y0 , z0 ] is not in ψ(H), we can use mappings
[s, t, a] → ψ(s + 1, t − t0 , a − a0 ) for suitable t0 and a0 .) After verifying that det ψ = r 2 cos a > 0
and computing the outer normals we get
Finally
w2 × w 3
n(x, y, z) = = [cos t cos a, sin t cos a, sin a] = [x, y, z].
|w2 × w3 |
37.24. Example. (a) By means of the Gauss Theorem we will evaluate (not for the first
time) the surface measure of the sphere Γ := {(x, y, z) ∈ R3 : x2 + y 2 + z 2 = 1}. We utilize
the fact that Γ is a Lipschitz boundary of the ball Ω = {(x, y, z) ∈ R3 : x2 + y 2 + z 2 < 1}.
We compute the two-dimensional measure of the sphere by integrating the constant 1 which we
express in the form 1 = f · n, where we choose e.g. f (x, y, z) = [x, y, z]. Actually, [x, y, z] · n =
[x, y, z] · [x, y, z] = x2 + y 2 + z 2 = 1 (for a different set it would be necessary to find a different
f ). We obtain
Z Z Z Z
1 dS = [x, y, z] · n dS = div[x, y, z] dx dy dz = 3 dx dy dz = 4π.
Γ Γ Ω Ω)
37.25. Example (the cube). Let Ω be a cube (0, 1)3 and Γ its boundary oriented by the
outer normal (e.g. on {0} × (0, 1)2 the unit normal vector is [−1, 0, 0]). The condition 37.17
is clearly satisfied for z lying on this side, and less obvious (but still satisfied) if z lies on an
edge or even at a vertex. Let, for example, z be the vertex [0, 0, 0]. Then a mapping ϕ with the
desired properties can be found in the form ϕ(t) ` = [−t1 +max(0, −t2 , −t3 ), −t1 +max(t2 , 0, t2 −
t3 ), −t1 + max(t3 , t3 − t2 , 0)], t ∈ (− 21 , 0] × − 12 , 12 )2 ∩ {|t2 − t3 | < 12 }.
37.26. Introduction to Green Theorem. Consider the situation described in
the Gauss theorem for the two-dimensional case. Then Γ is a curve, the orientation
of which can be expressed not only by means of the normal field, but also by means
of the tangent field. The vector n(x) is s-almost everywhere the “vector product of
the vector” t(x), i.e. (n(x), t(x)) is a positive orthonormal basis of R2 . (Roughly
speaking, t(x) circulates around Ω anti-clockwise.)
37.27. Green Theorem. Let g = [g1 , g2 ] be a continuously differentiable vector
field on a neighborhood of Ω. Then
g · t ds = curl g dx.
Γ Ω
37.28. Exercise. Compute the unit tangent vector and the unit normal vector to the unit
circle oriented as the (Lipschitz) boundary to the unit disc.
37.29. Example. Using the Green Theorem we compute the contents of the figure Ω :=
x2 y2
{(x, y) ∈ R2 : 2 + 2 < 1}. The boundary (ellipse) Γ can be (up to a set of one-dimensional
a b
measure zero) parametrized by the mapping ϕ(t) = [x(t), y(t)] := (a cos t, b sin t), t ∈ (0, 2π).
Let f be a vector field on Rn whose curl is 1, e.g. f = [0, x], f = [−y, 0], or f = [− 12 y, 12 x]. The
the Green theorem yields
Z Z Z 2π Z 2π
1 dx dy = [0, x] · t(x, y) ds = xy dt = ab cos2 t dt = πab.
Ω Γ 0 0
37.30. Exercise. Using the Green theorem evaluate the content of the figure surrounded by
the asteroide {(a cos3 t, b sin3 t) : t ∈ [0, 2π)}.
G. Surface and Curve Integrals 171
37.33. Example. Let h be p a positive even function of class 1 on [−1, 1], h(1) = 0. Let
Γ = {[x, y, z] : z = h(r)}, r = x2 + y 2 . Let the unit normal vector have the orientation for
which its z-coordinate is positive, i.e.
1 x y
n(x, y, z) = p [−h (r) , −h (r) , 1].
1 + (h (r))2 r r
R
Let us evaluate Γ g · n dS for g(x, y, z) = [−xez , −yez , 2ez ]. Since g = curl f for f (x, y, z) =
[−yez , xez , 0] and f = [−y, x, 0] on γ := {[x, y, z] ∈ R3 : r = 1, z = 0} is a Lipschitz 2-boundary
of Γ, the Stokes formula gives
Z Z Z Z 2π
g · n dS = f · t ds = [−y, x] · t(x, y) ds = (sin2 t + cos2 t) dt = 2π.
Γ γ γ 0
For the evaluation of the integral over γ we used polar coordinates: x = cos t, y = sin t, t ∈
(0, 2π).
37.44. Notes. Main formulas of the calculus of curve and surface integral were established
in 19th century. The divergence theorem was discovered by K. F. Gauss, G. Green and M. V.
Ostrogradski. The Stokes theorem is due to G. G. Stokes. See also 39.24.
In this chapter we present the theorem which contains as special cases the
Curve integral theorem, the Gauss theorem and Stokes’ theorem of Chapter 37.
For this purpose we need more advanced tools of exterior algebra.
38.1. k-covectors and k-vectors. Let V be a n-dimensional vector space and
V∗ its dual space. The value of a linear form V ∈ V∗ at a vector u ∈ V will be
denoted by V, u.
A mapping W : Vk → R is called a k-linear form (on V) if it is linear in each
variable separately. As a special case we get linear forms for k = 1, or bilinear
forms for k = 2.
A k-linear form W on V is said to be a k-covector if W is antisymmetric in the
following sense: If π is a transposition (a permutation, which transposes exactly
one pair of elements) of the set {1, . . . , k}, then
) *
W, (uπ(1) , . . . , uπ(k) ) = − W, (u1 , . . . , uk ) .
172 38. Integration of Differential Forms
for any matrix A = (ai,j )ki,j=1 and any ordered k-tuple (u1 , . . . , uk ) of vectors of
V.
Dually, a k-vector on V will be defined as a k-covector on V∗ . Thus a k-vector
assigns a real number to a k-tuple of linear forms on V. In particular, a 1-covector
is the same as a linear form; a 1-vector u on V will be identified with a vector
v ∈ V, if u assigns to each linear form V ∈ V∗ the value V, v. A real number
c will be identified with a 0-form, or a 0-vector, which assigns the number c to
each 0-tuple of vectors or forms, respectively. The vector space of all k-covectors
on V will be denoted by Λk (V) and the vector space of all k-vectors on V by
Λk (V). So Λk (V) = Λk (V∗ ), Λk (V) = Λk (V∗ ), Λ1 (V) = V, Λ1 (V) = V∗ , and
Λ0 (V) = Λ0 (V) = R.
38.2. Example. A bilinear form is representable by a matrix. For the sake of simplicity
assume V = Rn , then (ai,j )i,j is a matrix of a bilinear form V if
X
V, (u, v) = ai,j ui vj
i,j
for each u, v ∈ Rn . Among bilinear forms we select 2-covectors as bilinear forms with an
antisymmetric matrix, i.e. ai,j = −aj,i (in particular ai,i = 0). An example of a matrix of a
2-covector in R3 is 0 1
0, 2, −3
@ −2, 0, 5 A.
3, −5, 0
The (canonical) inner product represented by the unit matrix is also a bilinear form but it fails
to be a 2-covector.
38.3. Exterior Product. By an exterior product of an ordered k-tuple
(V1 , . . . Vk ) of linear forms on V we mean the k-covector V1 ∧ · · · ∧ Vk defined
by the formula
Notice that the exterior product is invariant with respect to even permutations of
factors. An odd permutation changes (only) the sign. The exterior product van-
ishes if the factors are linearly dependent (in particular, if two factors coincide).
Now we will investigate coordinates of k-vectors and k-covectors in Rn . Let Xi
denote the i-th coordinate form [u1 , . . . , un ] → ui . Recall that in the context of
these chapters by a multiindex we understand an ordered k-tuple α = [α1 , . . . , αk ]
G. Surface and Curve Integrals 173
of indices from {1, . . . , n}, and that the set of all such multiindices is denoted by
{1, . . . , n}k . A multiindex α is called increasing if α1 < · · · < αk . The set of all
increasing multiindices from {1, . . . , n}k is denoted by I(k, n). Each k-covector
W posseses (uniquely determined) real coordinates Wα , α ∈ {1, . . . , n}k , namely
Wπ(α) = −Wα
so that the basis of Λk (Rn ) is {Xα1 ∧ · · · ∧ Xαk : α ∈ I(k, n)}. Dually, we have
w= wα eα1 ∧ · · · ∧ eαk ,
α∈I(k,n)
Notice that not every k-covector is an exterior product of linear forms and not
every k-vector is an exterior product of vectors, see Example 38.7.
174 38. Integration of Differential Forms
38.5. Example (in R3 , notice how the sign is changed under the transposition).
X2 ∧ (X1 + X3 ) ∧ (X1 − X3 ) = X2 ∧ X1 ∧ X1 − X2 ∧ X1 ∧ X3
+ X2 ∧ X3 ∧ X1 − X2 ∧ X3 ∧ X3
= X1 ∧ X2 ∧ X3 − X3 ∧ X2 ∧ X1 = 2X1 ∧ X2 ∧ X3 .
X1 ∧ X2 + X2 ∧ X3 + X3 ∧ X4 , 2e1 ∧ e2 − e1 ∧ e4 + e2 ∧ e4 = 2.
W = (W(1,2) X1 − W(2,3) X3 ) ∧ X2 .
If W1,3
= 0, then
W(2,3)
W =( X2 + X1 ) ∧ (W(1,3) X3 + W(1,2) X2 ).
W(1,3)
where dxi denotes the constant differential 1-form x → Xi and ωα are functions
on E. We say that a differential form ω is of class C if all its coordinates ωα
are of class C . Similarly we define other properties of differential forms (e.g.
measurability) using coordinates .
38.10. Example. We illustrate the calculation with differential forms by the following
example in R4 :
(dx2 + dx4 ) ∧ (x4 dx1 − x1 dx4 ) ∧ (dx2 + dx3 ) = (x4 dx2 ∧ dx1 + x4 dx4 ∧ dx1 − x1 dx2 ∧ dx4
− x1 dx4 ∧ dx4 ) ∧ (dx2 + dx3 )
= (−x4 dx1 ∧ dx2 − x4 dx1 ∧ dx4 − x1 dx2 ∧ dx4 ) ∧ (dx2 + dx3 )
= −x4 dx1 ∧ dx2 ∧ dx2 − x4 dx1 ∧ dx4 ∧ dx2 − x1 dx2 ∧ dx4 ∧ dx2
− x4 dx1 ∧ dx2 ∧ dx3 − x4 dx1 ∧ dx4 ∧ dx3 − x1 dx2 ∧ dx4 ∧ dx3
= x4 dx1 ∧ dx2 ∧ dx4 − x4 dx1 ∧ dx2 ∧ dx3 − x4 dx1 ∧ dx4 ∧ dx3 + x1 dx2 ∧ dx3 ∧ dx4 .
G. Surface and Curve Integrals 175
d(x2 dx1 − x1 sin x2 dx2 ) = dx2 ∧ dx1 − sin x2 dx1 ∧ dx2 − x1 cos x2 dx2 ∧ dx2
= −dx1 ∧ dx2 − sin x2 dx1 ∧ dx2 = (−1 − sin x2 ) dx1 ∧ dx2 .
(a) d(x1 dx1 + x1 x3 dx2 ) = dx1 ∧ dx1 + x3 dx1 ∧ dx2 + x1 dx3 ∧ dx2
= x3 dx1 ∧ dx2 − x1 dx2 ∧ dx3 .
(b) d(x1 x2 dx1 ∧ dx3 ) = x2 dx1 ∧ dx1 ∧ dx3 + x1 dx2 ∧ dx1 ∧ dx3
= −x1 dx1 ∧ dx2 ∧ dx3 .
We will not use the normal n−k-covector in full generality, nevertheless for the
sake of completeness we will outline the definition: It associates with a n−k-tuple
u1 , . . . , un−k of vectors in Rn the number
and negative if the above stated equality holds with the opposite sign.
38.15. Example. Let Ω := {x ∈ R4 : x1 > 0, x3 = x1 cos x2 , x4 = x1 sin x2 }. We have to
find the unit tangent 2-vector field on Ω knowing that the parametrization ϕ : (0, ∞)×(−π, π) →
Ω,
ϕ(t1 , t2 ) = [t1 , t2 , t1 cos t2 , t1 sin t2 ]
is positive. We have
∂ϕ
w1 := (t) = [1, 0, cos t2 , sin t2 ],
∂t1
∂ϕ
w2 := (t) = [0, 1, −t1 sin t2 , t1 cos t2 ],
∂t2
and thus
vol2 (w1 , w2 ) = 1 + t21 cos2 t2 + t21 sin2 t2 + cos2 t2 + sin2 t2 + t21 (cos2 t2 + sin2 t2 )2 = 2 + 2t21
and
[1, 0, x3 /x1 , x4 /x1 ] ∧ [0, 1, −x4 , x3 ]
ξ(x) = q .
2 + 2 x21
The 2-vector ξ(x) has six coordinates, namely ξ(1,2) (x), ξ(1,3) (x), ξ(1,4) (x), ξ(2,3) (x), ξ(2,4) (x)
and ξ(3,4) (x). For example,
„ «
1, 0
det
x3 /x1 , −x4 −x4
ξ(1,3) (x) = q = q .
2 + 2 x21 2 + 2 x21
G. Surface and Curve Integrals 177
provided the right-hand side makes sense. Here, the symbol Ω represents all the
structure (Ω, orientation): Without the knowledge of ξ it would be impossible to
determine the sign of the integral.
38.17. Particular Cases. Notions of the preceding chapter can be now
revisited from the point of view of the general approach using differential forms
and k-vector fields. Let us start with trivial cases.
If k = 0, then Ω is a finite set, ξ(x) = ±1 on Ω and the integral is only a sum
of values. A differential 0-form ω is a function and
ω= ξ(x)ω(x).
Ω x∈Ω
Finally the case of a normal field is also included: Let k = n − 1. Then the
vector field n(x) and the (n − 1)-vector field ξ(x) are linked by the relation
ξ = n1 e2 ∧ · · · ∧ en − n2 e1 ∧ e3 ∧ · · · ∧ en + (−1)n nn e1 ∧ · · · ∧ en−1 .
[t, a] ∈ G := (0, 2π) × (− 12 π, 12 π) and the orientation is supposed to make ϕ positive. Recall,
that the uncovered part of the sphere has the (n−1)-dimensional measure zero. We have
Thus
Z Z
z dx ∧ dy = sin a(− sin t cos a dt − cos t sin a da) ∧ (cos t cos a dt − sin t sin a da)
Ω
ZG
= sin2 a cos a sin2 t dt ∧ da − sin2 a cos a cos2 tda ∧ dt
G
Z Z π/2
= sin2 a cos a(sin2 t + cos2 t) dt da = 2π sin2 a cos a da
G −π/2
Z 1 4
= 2π u2 du = π.
−1 3
38.20. Example. Let Ω := {x ∈ R4 : x21 + x22 = x23 + x24 = 1, x1 > 0, x3 > 0}. Consider
the parametrization ϕ(t) = [ cos t1 , sin t1 , cos t2 , sin t2 ], t ∈ G := (− π2 , π2 )2 . Suppose that the
orientation makes ϕ positive. Then
0 1
− sin t1 , 0
B cos t1 , 0 C
B
∇ϕ(t) = @ C
0, − sin t2 A
0, cos t2
G. Surface and Curve Integrals 179
so that
∂ϕ ∂ϕ
∧ (t) = sin t1 sin t2 e1 ∧ e3 − sin t1 cos t2 e1 ∧ e4
∂t1 ∂t2
− cos t1 sin t2 e2 ∧ e3 + cos t1 cos t2 e2 ∧ e4
and
` ∂ϕ ∂ϕ ´ p
vol , (t) = sin2 t1 sin2 t2 + sin2 t1 cos2 t2 + cos2 t1 sin2 t2 + cos2 t1 cos2 t2 = 1.
∂t1 ∂t2
∂ϕ ∂ϕ
(The expression under the root is the sum of squares of coordinates of the 2-vector ∂t1
∧ ∂t2
.)
It follows that
ξ(x) = x2 x4 e1 ∧ e3 − x2 x3 e1 ∧ e4 − x1 x4 e2 ∧ e3 + x1 x3 e2 ∧ e4
is a tangent 2-vector field. We can compute (taking into account the sign of ϕ only) for example
Z Z „ « Z
cos t1 , 0
x1 dx2 dx4 = cos t1 det = cos2 t1 cos t2 dt1 dt2 = π.
Ω G 0, cos t2 G
ω = ωα dxα1 ∧ · · · ∧ dxαk−1 ,
The existence of these functions follows from McShane’s theorem 30.5. Applying
Corollary 35.3 to ηψ1 , ψ2 , . . . , ψk we obtain
0= det(∇(η ψ1 ), ∇ψ2 , . . . , ∇ψk ) dt
R
k
= ψ1 det(∇η, ∇ψ2 , . . . , ∇ψk ) dt + η det(∇ψ1 , ∇ψ2 , . . . , ∇ψk ) dt.
Rk Rk
The first integrand can differ from zero only on the strip {0 ≤ t1 ≤ 1}, otherwise
∇η = 0. Fubini’s theorem yields
1 ! ∂ψi "k
− ψ1 det(∇η, ∇ψ2 , . . . , ∇ψk ) dt = (ψ1 dt1 ) det dt2 . . . dtk
Rk ∂tj i,j=2
R 0
k
! ∂ψi "k
= ψ1 det dt2 . . . dtk
G∩∂H−k ∂tj i,j=2
= χq ω.
Γ
The second integrand vanishes on Hk− \G (where ψ1 = 0), on the strip {0 < t1 < 1}
(where the partial derivatives ∂ψi /∂t1 are zero) and for t1 ≥ 1 (where ∇η = 0).
Thus
η det(∇ψ1 , ∇ψ2 , . . . , ∇ψk ) dt = det(∇ψ1 , . . . , ∇ψk ) dt
Rk G∩Hk
−
= d(χq ω).
Ω
∂ϕ ∂ϕ
Hence we easily compute that vol( ∂t (t), ∂t3
(t)) = 1 (similarly as in Example 38.20) and
2
` ∂ϕ ∂ϕ ∂ϕ ´ √
vol (t), (t), )(t) = 2(1 − t1 )2 .
∂t1 ∂t2 ∂t3
Let the orientations of Ω and Γ be chosen in such a way that the parametrization ϕ is positive.
We easily verify that the unit tangent 3-vector field ξ on Ω has the form
1 ` ´
ξ(x) = q x1 e2 ∧ e3 ∧ e4 − x2 e1 ∧ e3 ∧ e4 − x3 e1 ∧ e2 ∧ e4 + x4 e1 ∧ e2 ∧ e3
2 2 2 2
x1 + x2 + x3 + x4
1
n(x) = q [x1 , x2 , −x3 , −x4 ],
x21 + x22 + x23 + x24
where a equals −1/4 for x21 + x22 = 4 and 1 for x21 + x22 = 1.
p
38.23. Example. Let Ω := {[x, y, z] ∈ R3 : z = x2 + y 2 < 35 x + 1}, Γ := {[x, y, z] ∈
p 3
R3 : z = x2 + y 2 = 5 x + 1}. (A part of the conical surface surrounded by an ellipse .) Choose
the orientation RΩ in such a way that the z-coordinate of the normal is positive. The evaluation
of the integral Ω dy ∧ dz is simplified when applying Stokes’ formula. We use the mapping
r
15 25 5 15 25 5
ϕ(r, t) = [ + r cos t, r sin t, ( + r cos t)2 + ( r sin t)2
16 16 4 16 16 4
which is a positive parametrization; ϕ((0, 1) × (0, 2π)) covers Ω up to a set of two-dimensional
measure zero. The mapping ψ : t → ϕ(1, t), t ∈ (0, 2π) is then a positive parametrization of
Γ \ {[ 52 , 0, 52 ]}. Since z = 35 x + 1 on Γ, we have ψ3 = 35 ψ1 + 1 and dψ3 = 35 dψ1 = − 35 16
25
sin t dt.
Thus Z Z Z 2π
5 · 3 · 25 75
dy ∧ dz = y dz = − sin2 t dt = − π.
Ω Γ 0 4 · 5 · 16 64
38.24. Notes. The exterior multiplication was invented by H. Grassmann in the 19th century.
See also 39.24.
√
(d) Consider the mapping g : R2 → R3 defined as g(x) = [x21 , x22 , 2 x1 x2 ] and denote its
restriction to the unit circle Ω2 in R2 by f . Then f maps Ω2 to the unit sphere Ω3 in R3 .
The derivative f (x) maps a vector u ∈ Tx (Ω2 ) ⊂ R2 to the vector f (x)(u) ∈ Tf (x) (Ω3 ), and
√ √
f (x)(u) = g (x)u = u1 ∂x
∂g ∂g
(x) + u2 ∂x (x) = [2u1 x1 , 2u2 x2 , 2u1 x2 + 2u2 x1 ].
1 2
39.6. Example. Let 0 < r < R, and
p
{[x, y, z] ∈ R3 : ( x2 + y 2 − R)2 + z 2 = r 2 }
be an anuloid. We will use the parametrization ϕq (s, t) = [x, y, z], [s, t] ∈ Gq , where
x = (R + r cos s) cos t,
y = (R + r cos s) sin t,
z = r sin s,
G1 = (0, 2π) × (0, 2π),
G2 = (0, 2π) × (−π, π),
G3 = (−π, π) × (0, 2π),
G4 = (−π, π) × (−π, π).
on Ω.
39.7. Example. Let Ω = ϕ(G), where G = (−1/2, 1/2) × (−2 π, 2 π) and
t t t
ϕ(s, t) = [(1 + s cos ) cos t, (1 + s cos ) sin t, s sin ].
2 2 2
Then Ω has the shape of the Möbius strip. Let be the atlas of all 1 charts on Ω. We will
prove that the manifold (Ω, ) is not orientable. We have
0 1
cos t cos 2t , − sin t − s sin t cos 2t − 2s cos t sin 2t
B C
ϕ (s, t) = @ sin t cos 2t , cos t + s cos t cos 2t − 2s sin t sin 2t A .
t s t
sin 2 , 2
cos 2
Assume that there exists an oriented atlas ⊂ . Let H = {[s, t] ∈ G : det ∇(μ ◦ ϕ)(s, t) > 0
whenever μ ∈ and ϕ(s, t) ∈ Uμ }. Then using the connectedness of G it follows that H = G
or H = ∅. Let μ ∈ , [−1, 0, 0] ∈ Uμ . Then det(μ ◦ ϕ) should have the same sign at the point
[0, −π] as at the point [0, π], which leads to a contradiction.
39.8. Differential Forms. A differential k-form on an n-dimensional manifold
Ω is defined as a mapping ω : x ∈ Ω → ω(x) ∈ Λk (Tx (Ω)). The calculation with
differential forms on manifolds is transferred into a calculation with differential
forms in Rk by means of a pullback. Let G ⊂ Rm be an open set and ϕ : G → Ω
a C 1 mapping. Let ϕ be a differential k-form on Ω. Then the differential k-form
ϕ ω which is called the pullback of ϕ on G is defined as
) *
(ϕ ω)(t), (u1 , . . . , uk ) = ω(ϕ(t)), (ϕ (t)u1 , . . . , ϕ (t)uk ) , u1 . . . , uk ∈ Rm .
The pullback of a differential form on Ω
ω(x) = ωα (x)dμα1 ∧ · · · ∧ dμαk (x)
α∈I(k,n)
is a differential form on G
ϕ ω(t) = ωα (ϕ(t))d(μα1 ◦ ϕ) ∧ · · · ∧ d(μαk ◦ ϕ) (t),
α∈I(k,n)
where x̃i are the coordinate functions x → xi on Ω. Notice that ω(x) ∈ Λk (Rn )
while ω̃(x) ∈ Λk (Tx (Ω)). The space Tx (Ω) can be understood as a subspace of Rn .
The difference is immaterial from the point of view of integration. Nevetheless,
a certain carefulness is recommended. Indeed, two different elements of Λk (Rn )
can coincide on Tx (Ω) so that a differential form on Ω can have more distinct
descriptions in coordinates (related to Rn ). This phenomenon is demonstrated
by the next example.
39.9. Example. Let Ω be the unit circle {[x, y] ∈ R2 : x2 + y 2 = 1}. Then x dx + y dy induces
the zero differential form on Ω.
q
39.10. Example. Let Ω be the conical surface {x ∈ R3 : x3 = x21 + x22 }. Let μ be the chart
which is the inverse to the mapping ϕ : G → Ω, G = (0, 1)×(0, 2π), ϕ(t) = [ϕ1 (t), ϕ2 (t), ϕ3 (t)] =
[t1 cos t2 , t1 sin t2 , t1 ], and let ω(x) = x1 dx2 ∧ dx3 be the differential form on Ω. Then
ϕ ω = ϕ1 dϕ2 ∧ dϕ3 = t1 cos t2 d(t1 sin t2 ) ∧ dt1 = (−t21 cos2 t2 )dt1 ∧ dt2 .
hence f is an “isometric mapping”. On the other hand, Ω is not isometric to any open subset of
R2 (it is not possible to “make the hemisphere flat”). From the geometrical point of view, the
shape of the manifold expressed by the Riemannian metric is more important than the original
“underlying space”.
39.15. k-dimensional Measures on Riemannian Manifolds. Suppose that
(X , A , g) is a Riemannian manifold, x ∈ X and (u1 , . . . , uk ) ∈ (Tx (X ))k . We
define a volume of this k-tuple of vectors similarly as in 34.10:
vol(u1 , . . . , uk ) = det(gx (ui , uj ))ki,j=1 .
This immediately introduces also the volume of a linear mapping L : Rk →
Tx (X ). Let σ be a measure on the σ-algebra of all measurable subsets of X .
We say that σ is a k-dimensional measure on X if for each μ ∈ A and each
measurable set E ⊂ Uμ we have
σE = vol ϕ (t) dt , where ϕ = μ−1 .
ϕ−1 (E)
Since the integral does not depend on a particular choice of μ, the partition of
unity leads to the existence of a “k-dimensional measure” on a k-dimensional
Riemannian manifold.
G. Surface and Curve Integrals 187
u1 ∧ · · · ∧ uk
ξ(x) = ,
vol(u1 , . . . , uk )
where (u1 , . . . , uk ) is a positive basis of Tx (Ω) (the definition does not depend
on the choice of the base). Now, we can introduce the integration of differential
forms by means of integration by k-dimensional measure σ on Ω similarly as we
have proceeded in 38.16: Namely, if ω is an integrable differential form on Ω, then
ω= ω, ξ dσ.
Ω Ω
∂ϕ ∂ϕ
w= ∧ (s, t) = [cos t, sin t, 2s, 0] ∧ [−s sin t, s cos t, 0, 1]
∂s ∂t
x1 x 2
=[ , , 2r, 0] ∧ [−x2 , x1 , 0, 1]
r r
x1 x2
= r e1 ∧ e2 + 2rx2 e1 ∧ e3 + e1 ∧ e4 − 2rx1 e2 ∧ e3 + e2 ∧ e4 + 2r e3 ∧ e4 ,
r r
and
x21 x2
|w|2 = r 2 + 4r 2 x22 +
+ 4r 2 x21 + 22 + 4r2 = 4r4 + 5r2 + 1.
r2 r
For the unit tangent k-vector we have
w
ξ= ,
|w|
the integrand is expressed as
rx2 1
√ = dx1 ∧ dx3 , ξ ,
4r 4 + 5r 2 + 1 2
and thus
Z Z
rx2 1
√ dσ = dx1 ∧ dx3
Ω 4r4 + 5r2 + 1 2 Ω
Z Z
1 2
= (cos t ds − s sin t dt) ∧ 2s ds = s2 sin t ds dt = .
2 G G 3
inner product was quite obvious. Nevertheless, for the purpose of integration of
differential forms on manifolds neither Riemannian structure nor a k-dimensional
measure are needed. Indeed, we can realize that the integral expressions given
by the Change of Variable Formula do not depend on these structures. In the
general case we can proceed as follows: Let (Ω, A ) be a k-dimensional oriented
manifold. Let ω be a measurable
differential k-form on A and E a measurable
subset of Ω. The integral E ω is defined in two steps: First, assume E ⊂ Uμ for
some μ ∈ M . Then we define
ω= (μ−1 ) ω.
E μ(E)
From the Change of Variable Formula it follows that this integral (if it makes
sense) does not depend on the choice μ.
The second step is based on the partition of unity. Let {χτ }τ ∈T be a partition
of unity on (Ω, A ).
For any measurable set E ⊂ Ω denote
I(E) = χτ ω
τ ∈T E∩{χτ >0}
(the expression I(E) does not necessarily makes sense). We say, that the integral
E
ω converges, or that the differential form ω is integrable on E if for any mea-
surable set E ⊂ Ω, the expression I(E ) makes sense and it is a finite number,
and for any sequence {Eq } of parwise disjoint measurable sets Eq ⊂ E, the series
∞
I(Eq )
q=1
converges (absolutely). If the integral E
ω converges, we set
ω = I(E).
E
Since we were careful enough, the value of such a defined integral depends neither
on the “partition” of E, nor on the partition of unity. On the other hand, it
depends on the orientation of Ω: The reverse orientation forces the converse of
the sign of the integral.
Let G ⊂ Rk be an open set and ϕ : G → Ω a diffeomorphism. We say, that
ϕ is a positive parametrization if all superpositions μ ◦ ϕ, μ ∈ A , have a positive
Jacobian. For positive parametrizations the following change of variable formula
is valid:
ω= ϕ ω
ϕ(G) G
39.19. Example. Let Ω be the set of all decreasing solution of the differential equation
(x (s))2
x (s) − x(s) = 0 satisfying the condition 0 < x(0) < 1. Let = {μ : Ω → R2 : there are a,
b ∈ R, such that a < b and μ(x) = [x(a), x(b)] for all x ∈ Ω}. Then (Ω, ) is a two-dimensional
oriented manifold and ϕ : t → et2 s+t1 , t ∈ (−∞, 0)2 is a positive parametrization of Ω. The
tangent space Tx (Ω), x = ϕ(t), is representable as a two-dimensional vector space of functions
∂ϕ ∂ϕ
generated by the functions ∂t (t) : s → et2 s+t1 and ∂t (t) : s → s et2 s+t1 . For each τ ∈ R,
1 2
let ετ be the function on Ω defined as ετ (x) = x(τ ). Then ω := dε0 ∧ dε1 is a differential form
which associates with u1 , u2 ∈ Tx (Ω) the number
„ «
u1 (0), u2 (0)
det .
u1 (1), u2 (1)
We have
ϕ
ω = d(et1 ) ∧ d(et1 +t2 ) = et1 dt1 ∧ (et1 +t2 dt1 + et1 +t2 dt2 )
= e2t1 +t2 dt1 ∧ dt2 ,
so that Z Z
1
ω= e2t1 +t2 dt1 dt2 = .
Ω (−∞,0)2 2
In the next step let K ⊂ W be subsets of Ω, K compact, W open. We show that there is
an admissible family of function (even a finite one) whose sum is positive on K and vanishing
outside W . For each point a ∈ K we find μa ∈ and a radius ra > 0 so that a ∈ Uμa and
B(μa (a), 2ra ) ⊂ μa (Uμa ∩ W ). Now set
( 2 2
e1/(|μa (x)−μa (a)| −ra ) if x ∈ μ−1
a (U (μa (x), ra )),
fa (x) =
0 in remaining cases.
Then {{fa > 0} : a ∈ K} is a covering of K and taking into account the compactness of K we
can select a finite set {fa1 , . . . , fam } forming an admissible family of functions, whose sum is
positive on K. We have yet solved the existence of the partition of unity provided Ω is compact.
Recall that, by our definitions, Ω is a metrizable space. If Ω is connected, then the Topological
Lemma 39.23 yields the existence of compact sets Kq and open sets Wq (q ∈ N) such that
S
∞
Kq ⊂ Wq , Ω = Kq , and each point has a neighborhood which intersects only a finite
q=1
number of sets Wq . For each couple (Kq , Wq ) we find an admissible family of functions by the
preceding procedure. The union with respect to q will be an admissible family of functions
whose sum is positive on Ω. This solves the existence problem if Ω is connected. If Ω is not
connected, then its topological components are connected submanifolds Ω. By a simple “union”
of partitions of unity on components we obtain the partition of unity on Ω.
39.23. Topological Lemma. Let (P, ρ) be a connected locally compact metric space. Then
there exists a sequence {Kq } of compact subsets of P and a sequence {Wq } of open subsets of
S
∞
X such that X = Kq and each point P has a neighborhood intersecting only finitely many
q=1
sets Wq .
Proof. We may assume that P is not compact, for otherwise there is nothing to prove. Further,
we consider an equivalent metric in which P is bounded. Given x ∈ P , there is a radius r(x)
such that B(x, 2r(x)) is compact and B(x, 4r(x)) is not compact. We construct recursively a
sequence {Vq } of open relatively compact subsets of P . Choose x0 ∈ P and set V1 = U (x0 , 2r0 ).
Assume that V1 , . . . , Vq were already constructed. Thanks to compactness of Vq it follows that
there is a finite system {U (xj , rj )} of balls selected from {U (x, r(x)) : x ∈ V q } such that it
S
covers V q . Set Vq+1 = U (xj , 2rj ). The resulting sequence satisfies V q ⊂ Vq+1 . We prove
j
S
∞
by a contradiction that V := Vq = P . Suppose V
= P . Since P is connected, there exists
q=1
z ∈ ∂V . Set R = r(z)/3, find x ∈ U (z, R) ∩ V and q such that x ∈ Vq . Further find y ∈ Vq and
r = r(y) such that x ∈ U (y, r) and U (y, 2r) ⊂ Vq+1 . Since z ∈
/ V , it follows that ρ(z, y) ≥ 2r.
We have
2r ≤ ρ(y, z) ≤ ρ(y, x) + ρ(x, z) ≤ r + R,
thus r ≤ R. If t ∈ B(y, 4r), then
so that B(y, 4r(y)) ⊂ B(z, 2r(z)). This is a contradiction, because the ball B(y, 4y) is not
compact and the ball B(z, 2r(z) is compact. We have proved that V = P . To finish the proof it
is enough to set K1 = V 1 , W1 = V2 , W2 = V3 , Kq = Vq \ Vq−1 for q ≥ 2 and Wq = Vq+1 \ V q−2
for q ≥ 3.
39.24. Notes. The modern theory of manifolds is based on ideas of G. F. B. Riemann.
The roots of the topics of Chapter G go back to the 19th century and are connected with
famous names of outstanding mathematicians. From an extensive bibliography we recommend
M. Berger and B. Gostiaux [*1988], L. Boček [Boč], I. Černý and J. Mařı́k [ČM II], H. Federer
[*1969], W. Fleming [*1965], O. Kowalski [Kow], L. Krump, V. Souček and J. A. Těšı́nský [KST],
F. Moran [*1988], R. Sikorski [Sik], L. Simon [*1983].
H. Vector Integration 191
H. Vector Integration
40. Measurable Functions
– l∞ will denote the space of all bounded sequences x = {xn } equipped with
the norm x∞ := supn |xn |,
– l2 ([0, 1]) is the Hilbert space of all real functions f on [0, 1] vanishing off
2
a countable set such that t∈[0,1] |f (t)| < ∞, equipped with the inner
product (f, g) = t f (t)g(t).
40.4. Example. Consider X = l2 ([0, 1]) and the measure space ([0, 1], M, λ). Let {et : t ∈
[0, 1]} be the usual orthonormal base of X (et (x) = 1 for x = t, et (x) = 0 otherwise). Define
the mapping f : [0, 1] → X as f (t) = et . If ϕ ∈ X ∗ , the Riesz-Fréchet representation theorem
on Hilbert spaces implies the existence of a uniquely determined a ∈ X such that ϕ(x) = (x, a)
for every x ∈ X. Hence ϕ(f (t)) = (et , a) = a(t) and the set {t ∈ [0, 1] : (et , a)
= 0} is countable.
We can see that ϕ ◦ f = 0 almost√everywhere and therefore f is a weakly measurable function.
On the other hand, et − es = 2 for t
= s. It follows that f ([0, 1] \ E) = {et : t ∈ [0, 1] \ E}
is separable if and only if the set [0, 1] \ E is countable. Thus, there is no set E ⊂ [0, 1] of
measure zero for which f ([0, 1] \ E) would be separable and Pettis’ theorem implies that f is
not measurable. (Further examples can be found in Exercises 43.7.e and f.)
40.5. Notes. The theory of vector measures and integration was developed in an essential
way during 1930’s. Famous Theorem 40.3 appears in B.J. Pettis [1938].
In this chapter we touch briefly on measures whose values are in a given Banach
space X. Before doing this we take notice of a convergence of series in Banach
spaces.
41.1. Absolute and Unconditional Convergence. Let {xn } be a sequence
∞
of elements of a Banach space X. We say that a (formal) series xi = xi is
i=1
n
∞
n
– convergent if there is a limit lim xi (we write xi = lim xi ),
n i=1 i=1 n i=1
∞
– absolutely convergent if xi < +∞ ,
i=1
– unconditionally convergent to x ∈ X if n xP (n) = x whenever P is a
one-to-one mapping of N onto N.
Each absolutely convergent series converges (X is a complete space !) even
unconditionally. On the other hand, every unconditionally convergent series
converges absolutely provided dim X < +∞ (Riemann’s theorem) while in in-
finite dimensional spaces this assertion is no longer true (consider the example
xn = (0, . . . , 0, n1 , 0, 0, . . . ) ∈ c0 ).
In what follows, (Ω, S , μ) will stand for a fixed measure space and X a given
Banach space.
41.2. Vector Measures. A vector-valued set function F : S → X is called an
additive (σ-additive) vector measure if F (∅) = 0 and
F ( En ) = F (En )
n n
the sense as above. Realize that this convergence is even an unconditional one.
Any σ-additive vector measure will be shortly called a vector measure.
41.3. Examples. In all examples, (Ω, , μ) stands for the measure space ([0, 1], M, λ).
1. If X = Lp ([0, 1]), 1 ≤ p ≤ ∞, and F : E → cE for E ∈ M, then F is a σ-additive vector
measure.
2. Let L be a continuous linear operator from L1 [0, 1] into a Banach space X. If F (E) := L(cE )
for E ∈ M, then F is again a σ-additive vector measure. Indeed, a moment’s reflection shows
that
∞
[ Xn ∞
[ ∞
[
F ( Ej ) − F (Ej ) = F ( Ej ) ≤ λ( Ej ) L .
j=1 j=1 j=n+1 j=n+1
˘X
n [
n
¯
|F (E)| := sup F (Ak ) : Ak ∈ , Ai ∩ Aj = ∅ for i
= j , Ak = E .
k=1 k=1
A few remarks should be now added. Recall (cf. the proof of Pettis’ theorem, or
Exercise 42.5)
that (real) functions f − fn are measurable. Further, the above
limit lim B fn dμ exists for any B ∈ S since X is complete and the estimates
% %
% %
% fn − %
fk % ≤ fn − fk ≤ fn − f + fk − f
%
B B B Ω
hold. Note also that B ϕ dμ := xi μ(Ei ∩ B) when ϕ = xi cEi is a
simple
function and that this definition does not depend on the expression of ϕ as xi cEi
which is not unique.
Moreover, if B f − fn → 0 and B f − gn → 0, B ∈ S (where fn , gn
are simple functions) then lim B fn = lim B gn (consider the sequence f1 , g1 , f2 ,
g2 , . . . ).
Now if f is Bochner
integrable, B ∈ S and {fn } is a sequence of simple func-
tions satisfying B f − fn → 0, the limit lim B hn dμ exists and is independent
of the sequence {hn }. This limit (which is an element of our Banach space X) is
called the Bochner integral of f and it is denoted by B f dμ. The fundamental
characterization of Bochner integrable functions is given in the next theorem.
42.2. Theorem (Bochner). A measurable function f : Ω → X is Bochner
integrable if and only if Ω f dμ < ∞ (i.e. exactly when the (real) function f
is Lebesgue integrable).
Proof. If f is Bochner integrable, then f is measurable (cf. Exercise 42.5). Now
the assertion follows from the estimates
f ≤ f − fn + fn
where {fn } is a sequence of simple functions satisfying f − fn → 0.
For the converse, suppose f is integrable. There are simple functions fn
tending to f μ- almost everywhere. Set
fn (ω) if fn (ω) ≤ 2 f (ω) ,
gn (ω) =
0 otherwise in Ω.
Obviously, gn are again simple functions. Moreover,
∞ ∞
%
k
% % %
%F ( En ) − F (En )% = %F ( En )% .
n=1 n=1 n=k+1
!
∞ "
The assertion now follows since lim μ En = 0 and the measure E →
k
n=k+1
E
f dμ is absolutely continuous with % to μ (Exercise 8.22.b). Indeed,
% respect
given ε > 0 there is δ > 0 such that % E f % ≤ E f < ε whenever μE < δ.
R
42.4. Remark. We have just seen that the indefinite Bochner integral E → E f dμ is a
σ-additive vector measure which is absolutely continuous with respect to μ. Moreover, it is
simply checked that this measure is of bounded variation. As in the real case, a question arises
whether each σ-additive X-valued vector measure of bounded variation which is absolutely
continuous with respect to μ can be expressed as an indefinite Bochner integral of a Bochner
integrable function. Thus there is a question whether or not the Radon-Nikodým theorem holds
for vector measures. The answer is negative. The vector measure of Example 41.3.1 is absolutely
continuous with respect to Lebesgue measure, in case of p = 1 it is of bounded variation and
still it is not the indefinite Bochner integral of any Bochner integrable function.
We say that a Banach space X has the Radon-Nikodým property (shortly, RNP) if the Radon–
Nikodým theorem holds for any X-valued vector measure. More precisely, whenever (Ω, , μ)
is a probability measure space and ν : → X is a vector measure of bounded variation which
is absolutely continuous with respect to μ, then ν is an indefinite Bochner integral of a Bochner
integrable function.
Consequently, the space L1 [0, 1] does not have the RNP. Neither do the spaces c0 , l∞ , (K)
(K infinite compact) have the RNP. On the other hand, any reflexive Banach space has the
RNP.
42.5. Exercise. If f : Ω → X is measurable, then the real function f is measurable. (In
fact, we proved this assertion in the course of the proof of Pettis’ theorem 40.3.) Prove this
assertion directly.
Hint. The assertion is obvious for simple functions. Now, if fn are simple and fn → f μ-almost
everywhere, it follows that fn → f μ-almost everywhere.
H. Vector Integration 197
42.6. Exercise. Let f, g be measurable functions, f ≤ g μ-almost everywhere and let g
be Bochner integrable. Show that f is Bochner integrable.
42.7. Exercise. Show that the indefinite Bochner integral is a vector measure of bounded
variation.
42.8. Notes. The Bochner integral was studied by S. Bochner [1933] and N. Dunford [1935].
Today, it is used as a tool in function spaces theories, when examining evolution PDE’s and
differential equations in Banach spaces, or in some problems of the geometry of Banach spaces.
R
we see that (D) 01 f = {1, 1, 1, , . . . }. Therefore the Dunford integral of f is an element of
P
l = (l ) = (c0 )∗∗ determined as a functional ϕ →
∞ 1 ∗ αn . Consequently, f is not Pettis
n
integrable.
Show that
Z 1/n Z 1/n Z 1/n X 1 2 n
(D) f = ϕ◦f = αi ic[0,1/i] = α1 + α2 + · · · + αn + αn+1 + . . . .
0 0 0 i
n n n
43.4. Remark. Having in mind the last example, notice that the indefinite Dunford integral
Z
F : E → (D) f , E∈M
E
is not
‚ absolutely
R ‚ continuous with respect to the Lebesgue measure (λ[0, 1/n] = 1/n → 0 in spite
‚ ‚
of ‚(D) 01/n f ‚ = 1). Moreover, the vector measure F is not σ-additive.
The next theorem characterizes Pettis integrable functions among those which
are Dunford integrable.
43.5. Pettis’ Theorem. For a measurable Dunford integrable function f : Ω →
X the following assertions are equivalent:
(i) f is Pettis integrable,
(ii) the indefinite Dunford integral of f is a σ-additive vector measure,
(iii) the indefinite Dunford integral of f is absolutely continuous with respect
to μ.
H. Vector Integration 199
The proof of this theorem makes use of deeper theorems of functional analysis and
is beyond the scope of this manuscript.
Many problems dealing with vector integration are subtle and need a deeper knowledge of
Banach spaces theory. This is also the case of the next remarks. The reader is referred, for
example, to J. Diestel and J.J. Uhl [*1977] or L. Mišı́k [*1989].
43.6. Remarks. 1. Any Bochner integrable function is also Pettis integrable and both
integrals coincide on sets of .
2. Moreover, the following theorem holds:
Let f : Ω → X be a measurable Pettis integrable function. Then f is Bochner integrable if
and only if the indefinite Pettis integral of f is a vector measure of bounded variation.
3. The space c0 is exceptional: If X does not contain a copy of c0 (i.e. there is no subspace of
X topologically and algebraically isomorphic with c0 ), then any measurable Dunford integrable
function is Pettis integrable.
4. Let f : Ω → X be a measurable function. There are xn ∈ X and En ∈ such that
P
∞
f = xn cEn almost everywhere. Then
n=1
P
f is Pettis integrable if and only if the series xn μ(En ∩ E) is unconditionally convergent
for any E ∈ ,
P
f is Bochner integrable if and only if the series xn μ(En ∩ E) is absolutely convergent for
any E ∈ .
5. Let C be a compact subset of a Banach space X. Let μ ∈ 1 (C) be a probability Radon
measure on C. (For any A ⊂ X we denote by coA the smallest closed convex subset of X
containing A. It is simply checked that RcoA equals the closure of the convex hull of A.) There
exists a unique z ∈ coC so that ϕ(z) = ϕ dμ for any ϕ ∈ X ∗ . This z is called the barycenter
C
of μ. How is this relatedR to the Pettis integral?
R The answer is simple. When defining f (x) = x
for x ∈ C then z = (P ) f dμ, or z = (P ) x dμ. Illustrate for the case X = R, C = [0, 1] and
C C
μ=λ!
6. There is a generalization of the previous example giving a criterion on the existence of the
Pettis integral.
Let K be a compact metric space, X a Banach space and μ a probability Radon measure on
K. RIf the mapping f : K → X is continuous, then there exists the Pettis integral of f and
(P ) K f dμ ∈ cof (K).
43.7. The Graves Integral. There is a straightforward analogy of Riemann integration
theory for functions having their values in a Banach space. Remember that the Riemann integral
can be defined using Darboux upper and lower sums (and upper and lower integrals), or for its
definition an original Riemann’s approach can be used. It is clear that any definition (like
Darboux’s one) making use of the ordering of the real line (and the notion of the least upper
bound) cannot be immediately carried over for a vector case. Of course, in general Banach
spaces there is no ordering. Nevertheless, in the sequel we will touch briefly analogues of both
Riemann’s and Darboux’s approaches.
Let X again be a Banach space, f a mapping from [0, 1] into X. Given a partition D :=
{0 = x0 < x1 < ... < xn = 1} of [0, 1], I(D) = {ξ = {ξi } : ξi ∈ [xi−1 , xi ]} set
X
n
Ξ(f, D, ξ) := f (ξi )(xi − xi−1 ).
i=1
The real number Ξ(f, D, ξ) is called the Riemann sum of f . Further, define the norm of a
partition D as νD := max{xi − xi−1 : i = 1, ..., n}.
200 43. The Dunford and Pettis Integrals
(a) We will say that f is Riemann integrable provided there exists z ∈ X with the following
property: For any ε > 0 there is δ > 0 such that
Ξ(f, D, ξ) − z < ε
Ξ(f, D, ξ) − w < ε
whenever a partition D is finer than D0 (i.e. D ⊂ D0 ) and ξ ∈ I(D). Of course, w is then the
Graves integral of f .
(b) Realize that the Graves integral is a (Moore-Smith) limit of a “generalized” sequence
{Ξ(f, D, ξ)} when “ordered” either by a norm or an inclusion.
(c) Any Riemann integrable X-valued function f is bounded (there exists K > 0 so that
f (t) < K whenever t ∈ [0, 1]).
In what follows, we make use of the vector integration theory for the case of the special measure
space ([0, 1], M, λ).
(d) If f : [0, 1] → X is Riemann integrable and ϕ ∈ X ∗ , then the (real) function ϕ ◦ f is
Riemann integrable. In particular, f is weakly measurable and Pettis integrable.
R
Hint. Almost all assertions are obvious, even that f is Dunford integrable and (RG) 01 =
R1 R
(D) 0 . Since f is bounded it follows that (D) I f ∈ X for any interval I ⊂ [0, 1] and one can
see that f is Pettis integrable.
(e) Define a (vector) function f : [0, 1] → l∞ [0, 1] (=the space of all bounded functions on
[0, 1] equipped with the sup -norm) as f (t) = c[0,t] . With the aid of the “Bolzano–Cauchy
condition” show that f is Riemann integrable. Now (d) implies that f is weakly measurable.
(Check the last assertion also directly: If ϕ is a continuous linear form on l∞ [0, 1], then it
is easy to see that the function t → ϕ(f (t)) is of bounded variation, and consequently it is
measurable.) On the other hand, f is not measurable (use Pettis’ theorem 40.3 and realize that
f (s) − f (t) = 1 for s
= t).
(f) Another example. Let E ⊂ [0, 1] and define gE : [0, 1] → l∞ [0, 1] as follows: Put gE (t) =
c{t} if t ∈ E and gE (t) equals zero function for t ∈
/ E. Show that gE is Riemann integrable. If
E is a nonmeasurable set, then the function t → gE (t) cannot be measurable. Accordingly,
in this case gE is not measurable (cf. Exercise 42.5 or Pettis’ theorem 40.3). State conditions
on a set E under which gE will be measurable.
(g) Any measurable Riemann integrable X-valued function f is Bochner integrable (and both
integrals equal).
Hint. Recall that a (real) function f is bounded and measurable. Now use Bochner’s charac-
terization 42.2.
(h) Given again a partition D := {0 = x0 < x1 < ... < xn = 1} of [0, 1], set
X
n
(f, D) = sup{f (s) − f (t) : s, t ∈ [xi−1 , xi ]}(xi − xi−1 ).
i=1
We say that f : [0, 1] → X is Darboux integrable if for any ε > 0 there exists δ > 0 such that
(f, D) < ε whenever D is a partition and νD < δ. Show again that instead of “ordering”
given by a norm an equivalent definition can be formulated using an “ordering” determined by
an inclusion. Prove also that any Darboux integrable function is Riemann integrable.
H. Vector Integration 201
(i) A function f : [0, 1] → X is Darboux integrable if and only if f is bounded and continuous
in almost all points of [0, 1] (cf. 7.9.d).
Hint. Define the oscillation of a X-valued function g as
First show that f is continuous at t0 if and only if ωf (t0 ) = 0. Further prove that the set
{t ∈ [0, 1] : ωf (t) ≥ α} is always closed.
Any Darboux integrable function is obviously bounded. To complete the proof check that
1
λ{t ∈ [0, 1] : ωf (t) ≥ }=0
n
for any n.
For the converse, fix ε > 0 and choose an open set G ⊂ [0, 1] such that λG sup f < ε and f is
continuous at all points of the compact set K := [0, 1]\G. For each t ∈ K find the greatest δt > 0
for which f (s) − f (t) ≤ 12 ε for any s ∈ [0, 1] ∩ (t − δt , t + δt ). A routine compactness argument
establishes the existence of δ > 0 with the property: If D := {0 = x0 < x1 < ... < xn = 1}
is a partition of [0, 1] and νD < δ, then each interval [xi−1 , xi ] is either contained in G or, it
satisfies f (s) − f (t) ≤ ε for any couple s, t ∈ [xi−1 , xi ]. Having such a partition D, we get
(f, D) ≤ 3ε.
(j) It follows from the above considerations that any Darboux integrable function is mea-
surable (use Pettis’ theorem 40.3 and the fact that a continuous image of a separable space is
separable), and therefore with the aid of Bochner’s theorem 42.2 it is also Bochner integrable.
(k) Each bounded function which is continuous at almost all points is Riemann integrable.
When X is a general Banach space, the converse may not be the‚ case.
‚
Choose E = Q in (f). Then gQ is Riemann integrable while ‚gQ ‚ is the Dirichlet function of
Example 7.5 which is nowhere continuous. Nor is gQ continuous at any point of [0, 1] (remember
that the norm is a continuous function!).
(l) We can see that the classes of Darboux and Riemann integrable functions can differ. It
seems that there is no known reasonable characterization of Banach spaces where these classes
coincide.
43.8. Exercise. Let X = c0 , (Ω, , μ) = ([0, 1], M, λ) and define
Show that:
(a) g is Pettis integrable ,
(b) f is Dunford but not Pettis integrable,
(c) h is not even Dunford integrable but it is measurable,
(d) f (t) = g(t) = h(t) for any t ∈ [0, 1] .
43.9. Exercise. Define the mapping f from the interval (0, 1) (Lebesgue measure in consid-
eration) into the Hilbert space l2 as
1 1 1
f : x → ( , , , ...) , x ∈ (0, 1).
x+1 x+2 x+3
R1
Show that (P ) 0 f = {log(1 + 1
)} .
n n
202 43. The Dunford and Pettis Integrals
43.10. Exercise. Consider again the measure space ([0, 1], M, λ). Let en := {0, ..., 0, 1, 0, ...}
e0 = {0, 0, ...}.
(a) Put f (rn ) = en and f = e0 otherwise (here {rn } is a sequence of all rational numbers
of [0, 1]). Show that f : [0, 1] → c0 is measurable and Bochner integrable (f is even Riemann
integrable).
(b) Examine the measurability and integrability of the mapping f of (a) replacing the space
c0 by lp , 1 ≤ p < ∞.
(c) Let α ∈ R and define
Appendix on Topology
In this appendix, we mention briefly some topological notions used in the man-
uscript which may not be familiar to the reader.
Let us remind that by a topology we always mean a family τ of subsets of a set
X, designated as open or τ -open sets, which has the following properties:
(a) τ contains ∅ and X;
A ∩ B ∈ τ for any A, B ∈ τ ;
(b)
(c) Aα ∈ τ if Aα ∈ τ .
α
Complements of open sets are called closed sets.
A family B ⊂ τ of open sets is a base for the topology τ if every open set can
be expressed as a union of members of B.
Since a topology is often defined with
help of a base B by the formula τ = { {B : B ∈ Z } : Z ⊂ B}, it is useful to
know when a family of sets determines a topology (in this way). The answer is
as follows:
Proposition.
A collection of sets B is a base for a topology on X if and only
if X = B, and if for any z ∈ B1 ∩ B2 (B1 , B2 ∈ B) there is B ∈ B with
z ∈ B ⊂ B1 ∩ B2 .
An important example is the topology of a metric space formed by the family
of all open sets. A base for this topology is, for instance, the set of all open balls.
The discrete topology on X consists of all subsets of X and the singletons form
a base for this topology.
General topological spaces may enjoy a very complicated structure. In whole
of this manuscript we assume that all spaces are Hausdorff: (X, τ ) is a Haus-
dorff space whenever x, y are distinct points of X, there exist disjoint open sets
Gx , Gy ∈ τ with x ∈ Gx , y ∈ Gy .
A neighborhood of a point z ∈ X is every set whose interior contains z. The
interior of a set M is defined as the largest open set contained in M . It is the
union of all open sets contained in M .
The closure A of a set A is the smallest closed set containing A (it exists!). A
set E is dense in X if E = X and nowhere dense if the interior of its closure is
empty.
A topological space is said to be separable if it contains a countable dense
subset. A metric space X is separable if and only if it has a countable base for
its topology, and this is the case exactly when X has the Lindelöf property: Any
open cover of X contains a countable subcover.
If X, Y are topological spaces, a mapping f : X → Y is continuous if the
pre-images of open sets in Y are open in X. In a usual way, we can define the
continuity at a point. A mapping is continuous if and only if it is continuous at
each point.
A function f is continuous on X if and only if the level sets
K ⊂ L ⊂ U, 0 ≤ f ≤ 1, f = 1 on K, f = 0 on X \ L.
Let P be a metric compact space. Then there exists a countable base {Vn } for
the topology on P . Put fn (x) = dist(x, P \Vn ) and consider the algebra generated
by {fn }. The Stone-Weierstrass theorem yields the following proposition.
Theorem. If P is a metric compact space, then the space C (P ) is separable.
206 References
References
Czech Books and Lecture Notes
[FM] Matematická analýza II. Diferenciálnı́ počet funkcı́ vı́ce proměnných, lecture notes,
SPN Praha 1975.
[Pr] Problémy z matematické analýzy, lecture notes, SPN Praha 1972 (1974, 1977, 1982).
[NV] Přı́klady z matematické analýzy. Mı́ra a integrál, lecture notes, UK Praha 1982.
[Sik] Diferenciálnı́ a integrálnı́ počet. Funkce vı́ce proměnných, Academia Praha 1973.
[SJ] Funkcionálnı́ analýza. Nelineárnı́ úlohy, lecture notes, SPN Praha 1986.
Other books
[*1968] Convergence of probability measures, John Wiley 1968, Russian translation 1977.
[*1979] Probability and measure, Wiley.
[*1918] Vorlesungen über reelle Funktionen, Teubner Leipzig 1918, 2nd edition 1927, 3rd
edition Chelsea 1948.
[*1985] Integration theory, Vol. 1: Measure and integral, John Wiley & Sons.
[*1992] Measure theory and fine properties of functions, CRC Press, Boca Raton.
[*1995] Degree theory in analysis and applications., The Clarendon Press, Oxford University
Press, New York.
[*1973] Spectral analysis of nonlinear operators, Lecture Notes in Math. 346, Springer–Verlag
1973.
[*1994] The integrals of Lebesgue, Denjoy, Perron, and Henstock, American Mathematical
Society, Graduate studies in mathematics, vol. 4.
[*1921] Theorie der reellen Funktionen, ”I. Band”, Julius Springer, Berlin.
[*1970] Lebesgue’s theory of integration (Its origins and development), Wisconsin Press.
[*1963] Abstract harmonic analysis I, Academic Press 1963 (Russian translation 1975).
[*1970] Abstract harmonic analysis II, Academic Press 1970 (Russian translation 1975).
[*1904] Leçons sur l’intégration et la recherche des fonctions primitives, Paris, 2nd edition
1928.
[*1988] An introduction to the theory of real functions, John Wiley & Sons.
[*1986] Fine topology methods in real analysis and potential theory, Lecture Notes in Math.
1189, Springer–Verlag.
[*1987] Measure theory and integration, John Wiley & Sons. (Second revised edition Marcel
Dekker, Inc., New York, 2004).
[*2005] Topics in Banach space integration., World Scientific Publishing Co. Pte. Ltd., Hack-
ensack.
References 211
[*1983] Lectures on geometric measure theory, Proc. of the Centre for mathematical analysis,
Australian National University, vol.3.
[*1905] Sul problema della misura dei gruppi di punta di una retta, Bologna.
[*1940] L’Intégration dans les Groupes Topologiques et ses Applications, Hermann et Cie,
Paris 1940, 2nd edition 1965.
Papers
[1922] Sur les opérations dans les ensembles abstraits et leurs applications aux équations
intégrales, Fund. Math. 3, 133-181.
[1923] Sur le problème de la mesure, Fund. Math. 4, 7-33.
[1924] Sur un theorème de M. Vitali, Fund. Math. 5, 130-136.
[1925] Sur les lignes rectifiables et les surfaces dont l’aire est finie, Fund. Math. 7, 225-237.
212 References
[1957] Sur l’équivalence des théories de l’intégration selon N. Bourbaki et selon M.H. Stone,
Bull. Soc. math. France 85, 51-75.
[1945] A general form of the covering principle and relative differentiation of additive func-
tions, Proc. Cambridge Philos. Soc. 41, 103-110.
[1946] A general form of the covering principle and relative differentiation of additive func-
tions, Proc. Cambridge Philos. Soc. 42, 1-10.
[1933] Integration von Funktionen deren Werte die Elemente eines Vectorraumes sind,
Fund. Math. 20, 262-276.
[1895] Sur quelques points de la théorie des fonctions, Ann. Ecole Normale Sup. 12, 9-55.
[1963] A new treatment of the Haar integral, Michigan Math. J. 10, 365-373.
[1914] Über das lineare Mass von Punktmengeneine Verallgemeinerung des Längenbegriffs,
Nach. Ges. Wiss. Göttingen, 404-426.
[1990] A parametrical example of Dunford, Pettis and Bochner integration, Stud. Cerc.
Mat. 42, 405-418.
[1986] La naissance de la théorie des capacités: réflexion sur une expérience personelle, La
Vie des Sciences, Comptes rendus, sér. générale 3,4, 385-397.
[1989] Vznik teorie kapacit: zamyšlenı́ nad vlastnı́ zkušenostı́, Pokroky matematiky, fyziky
a astronomie 34, 71-83.
[1980] The Hahn decomposition theorem, Proc. Amer. Math. Soc. 80, 377.
[1935] Integration in general analysis, Trans. Amer. Math. Soc. 37, 441-453.
[1911] Sur les suites de fonctions mesurables, C. R. Acad. Sci. Paris 152, 244-246.
[1936] Integration and linear operation, Trans. Amer. Math. Soc. 40, 474-494.
[1907] Sur la convergence en moyenne, Comptes Rendus Acad. Sci. Paris 144, 1022-1024.
[1991] The Hahn-Banach theorem implies the existence of a non Lebesgue-measurable set,
Fund. Math. 138, 13-19.
[1915] Sur l‘intégrale d‘une fonctionnelle étendue à un ensemble abstrait, Bull. Soc. Math.
France 43, 248-265.
[1924] Des familles et fonctions additives d’ensembles abstraits, Fund. Math. 5, 206-251.
[1907] Sugli integrali multipli, Rendiconti Accad. Nazionale dei Lincei (Roma) 16, 608-614.
[1915] Sulla derivazione per serie, Rendiconti Accad. Nazionale dei Lincei (Roma) 24, 204-
206.
[1936] Sur un lemme de la théorie des espaces linéaires, Comm. Ins. Sci. Math. Méc. Univ.
de Kharkov et Soc. Mat. Kharkov 13, 35-40.
[1941] Linear functionals and integrals in abstract spaces, Bull. Amer. Math. Soc. 47, 615-
620.
[1927] Riemann integration and Taylor’s theorem in general analysis, Trans. Amer. Math.
Soc. 29, 163-177.
[1933] Der Maßbegriff in der Theorie der kontinuierlichen Gruppen, Ann. of Math. 34,
147-169.
[1933] Über die multiplikation total-additiver Mengenfunktionen, Annali Scuola Norm. Sup.
Pisa 2, 429-452.
[1957] Note sur la mesurabilité B de la dérivée supérieure, Fund. Math. 44, 238-240.
[1983] The Riesz representation theorem revisited, Amer. Math. Monthly 90, 277-280.
[1961] Definitions of Riemann type of the variational integrals, Proc. London Math. Soc.
13,3, 305-321.
[1988] A short history of integration theory, SEA Bull. Math. 12, 75-95.
[1953] Integration in abstract spaces, Bull. Amer. Math. Soc. 59, 111-139.
[1889] Über einen Mittelwerthssatz, Nachr. Akad. Wiss. Göttingen Math.-Phys., 38-47.
[1881] Sur la série de Fourier, Comptes Rendus Acad. Sci. Paris 92, 228-230.
214 References
[1985] Note on generalized multiple Perron integral, Čas. Pěst. Mat. 110, 371-374.
[1901] Sur une généralisation de l’intégrale définie, Comptes Rendus Acad. Sci. Paris 132,
1025-1028.
[1902] Integrale, longuer, aire, Annali Mat. Pura Appl. 7, 231-359.
[1903] Sur une propriétè des fonctions, Comptes Rendus Acad. Sci. Paris 137, 1228-1230.
[1910] Sur l’intégration des fonctions discontinues, Ann. Sci. Ecole Norm. Sup. 27, 361-450.
[1906] Sopra l’integrazione delle serie, Rend. Instituto Lombardo di Sci. e Lett. 39, 775-780.
[1940] Sur les fonctions–vecteurs complètement additives, Bull. Acad. Sci. URSS 6, 465-478.
[1912] Sur les propriétés des fonctions mesurables, C. R. Acad. Sci. Paris 154, 1688-1690.
[1952] Základy theorie integrálu v Euklidových prostorech, Časopis Pěst. Mat. 77, 1-51,
125-145, 267-301.
[1934] Extension of range of functions, Bull. Amer. Math. Soc. 40, 837-842.
[1975] The work of Henri Lebesgue on the theory of functions (on the occasion of his cen-
tenary) Transl. from Uspechi Mat. Nauk, 30(1975), 227-238, Russian Math.Surveys
30, 179-191.
[1976] Côntrole dans les inéquations variationelles elliptiques, J. Functional Analysis 22,
130-185.
[1939] The behaviour of a function on its critical set, Ann. of Math. 40, 62–70.
[1947] Perfect blankets, Trans. Amer. Math. Soc. 6, 418-442.
References 215
[1988] A simple proof of the Rademacher theorem, Časopis Pěst. Mat. 113, 337-341.
[1930] Sur une généralisation des intégrales de M.J. Radon, Fund. Math. 15, 131-179.
[1914] Über den Integralbegriff, Sitzungsber. Heidelberg Akad. Wiss. A16, 1-16.
[1938] On integration in vector spaces, Trans. Amer. Math. Soc. 44, 277-304.
[1910] Contributions à l’étude de la représentation d’une fonction arbitrire par des intǵrales
définies, Rend. Circ. mat. Palermo 30, 289-335.
[1919] Über partielle und totale Differenzierbarkeit, Math. Ann. 89, 340-359.
[1913] Theorie und Anwendungen der absolut additiv Mengenfunktionen, S.-B. Math. Nat-
ur. Kl. Kais. Akad. Wiss. Wien 122.IIa, 1295-1438.
[1906] Sur les ensembles de fonctions, Comptes Rendus Acad. Sci. Paris 143, 738-741.
[1909a] Sur les suites de fonctions mesurables, Comptes Rendus Acad. Sci. Paris 148, 1303-
1305.
[1909b] Sur les opérations fonctionnelles linéaires, Comptes Rendus Acad. Sci. Paris 149,
974-977.
[1910] Untersuchungen über Systeme integrirbarer Funktionen, Math. Annalen 69, 449-497.
[1912] Sur quelques points de la théorie des fonctions sommables, Comptes Rendus Acad.
Sci. Paris 154, 641-643.
[1920] Sur l’intégrale de Lebesgue, Acta Math. 42, 191-205.
[1930-32] Sur l’existence de la dérivée des fonctions monotones et sur quelques problèmes qui
s’y rattachent, Acta Sci. Math. Szeged 5, 208-221.
[1942] The measure of the critical set values of differentiable mappings, Bull. Amer. Math.
Soc. 48, 883–890.
[1951] A note on the space L∗p , Proc. Amer. Math. Soc. 2, 270-275.
[1948] Thorie des distributions et transformation de Fourier (French), Ann. Univ. Greno-
ble. Sect. Sci. Math. Phys. (N.S.) 23, 7-24.
216 References
[1928] Un théorème général sur les familles d’ensembles, Fund. Math. 12, 206-210.
[1936] Méthode nouvelle à résoudre le problème de Cauchy pour les équations hyperboliques
normales, Mat. Sb. 1 (43), 39-71.
L. S. Sobolev
Méthode nouvelle à résoudre le problème de Cauchy pour les équations hyperboliques nor-
males
Mat. Sb. 1 (43) (1936), 39 – 71
[1970] A model of set theory in which every set of reals is Lebesgue measurable, Annals of
Math. 92, 1-56.
[1948] Notes on integration I - III, Proc. Nat. Acad. Sci. 34, 336-342, 447-455, 483-490.
[1949] Notes on integration IV, Proc. Nat. Acad. Sci. 35, 50-58.
[1909] Sull’integrazione per parti, Rendiconti Accad. Nazionale dei Lincei 18, 246-253.
[1908] On non-measurable sets of points with an example, Trans. Amer. Math. Soc. 9,
237-244.
[1922] Limit in terms of continuous transformation, Bull. Soc. Math. France 50, 119-134.
[1939] The ergodic theorem, Duke Math. J. 1-18.
[1911] On the existence of a differential coefficient, Proc. London Math. Soc. 9, 325-335.
[1904] On upper and lower integration, Proc. London Math. Soc. 2, 52-66.
Subject Index
Published by
MATFYZPRESS
publishing house
of the Faculty of Mathematics and Physics
Charles University in Prague
Sokolovská 83, CZ - 186 75 Praha 8
as the 162. publication
Reviewer: Prof. RNDr. Ivan Netuka, DrSc.
This volume was typeset by the authors using AMS-TE X
the macro system of the American Mathematical Society
Printed by Reproduction center UK MFF
Sokolovská 83, CZ - 186 75 Praha 8
Second edition
Prague 2005
ISBN 80-86732-68-1
ISBN 80-85863-06-5 (First edition)