Probability Notes Intro

CHAPTER 1: MEASURE THEORY
In this chapter we define the notion of measure µ on a space Ω, construct integrals

on this space, and establish their basic properties under limits. The measure µ(E) will
be defined for appropriate subsets E of Ω, called “measurable sets”, and generalizes the
familiar notions of length, area and volume. As such, it is a non-negative number which
may well be infinite. In general, not all subsets of Ω can be taken as measurable. To
require otherwise may sometimes be inconsistent with geometric properties of Ω, as in
the case of the real line R with the translation-invariant Euclidean measure (see Exercise
A.1 in Appendix A) – and sometimes not even desirable, as in the probabilistic notion of
conditional expectation (section 2.8). Thus, a family F of measurable subsets of Ω must
be given, and must have some specific properties consistent with the definition of measure,
namely those of a σ-algebra. The properties of σ-algebras and of measures are self-evident
enough if we consider only finite unions of measurable sets. It is their strengthening to
countable unions which accounts for the power and flexibility of the theory, but also for
the difficulties encountered in the construction of measures.
This strengthening is necessary for the development of a theory of integration with
powerful convergence properties, which allow the interchange of integrals and limits under
fairly general conditions. Such a theory was developed in the beginning of the twentieth
century by H. Lebesgue, who built on the more “classical” theory that culminated with
the Riemann integral in the middle of the nineteenth century.
The subject has its roots in the method of exhaustion, invented in antiquity by Eu-
doxos (4th century BC) and greatly developed by Archimedes (3rd century BC) for the
purpose of calculating areas and volumes of geometric figures. This method is the pre-
cursor of the modern concept of limit: according to it, a convex region is approximated
by inscribed (or circumscribed) polygons, whose areas are relatively easy to calculate, and
then the number of vertices of the polygons is increased until the area of the region has
been “exhausted”. The much later work of Newton and Leibniz in the late 17th century
made this method a systematic and powerful tool for such calculations. By the twentieth
century the main applications of this theory had shifted from geometry and elementary
mechanics to differential equations, convergence of orthogonal expansions, and probability
theory.
1.1. MEASURES AND INTEGRALS
Let Ω be a given non-empty set. A family F of subsets of Ω is said to be an algebra, if it

contains the empty set ∅ and is closed under complements and finite unions; that is, if
1
(i) E ∈ F implies E c ≡ Ω \ E ∈ F ;
(ii) for any sets E1 , · · · , EN in F and N ∈ N, we have ∪N
n=1 En ∈ F .
An algebra F is called σ-algebra, if it is closed under countable unions; that is, if

(iii) for any sequence {En }n∈N ⊆ F we have ∪n∈N En ∈ F .
Given a nonempty set Ω its power set P(Ω) , the set of all its subsets, is the largest
σ-algebra with which it can be endowed; and the trivial σ-algebra {∅, Ω} the smallest.
With Ω = R , the collection F = {A ⊂ R : A or Ac is countable } is easily checked
to be a σ-algebra. On the other hand, with Ω = (0, 1] the collection A that contains the
empty set and all finite disjoint unions of intervals of the form (a, b] with 0 ≤ a < b ≤ 1
is an algebra, but not a σ-algebra.
A pair (Ω, F) with F a σ-algebra of subsets of Ω will be called a measurable space.

Then a set function µ : F → [0, ∞] is called a measure, if µ(∅) = 0 and if µ is countably
additive in the sense that
X
µ(∪n∈N En ) = µ(En ) (1.1)
n∈N
holds for every sequence {En }n∈N of pairwise disjoint subsets in F. The triple (Ω, F, µ)
is said to be a measure space, and the subsets E in F are said to be measurable.
A set function µ : P(Ω) → [0, ∞] with µ(E) ≤ µ(F ) for E ⊆ F ⊆ Ω is called
finite, if µ(Ω) < ∞; it is called σ-finite, if there exists a sequence {En }n∈N ⊆ F such that
Ω = ∪n∈N En and µ(En ) < ∞ , ∀ n ∈ N . A measure µ on (Ω, F) is called a probability,
if µ(Ω) = 1; then (Ω, F, µ) is called probability space.
Occasionally we shall abuse notation and refer to Ω itself as the “measure space”,
unless we consider simultaneously several different σ-algebras F and/or measures µ. It
should be noted that we have the monotonicity property µ(E) ≤ µ(F ) for any sets
E ⊆ F in F, and that the countable subadditivity property
X
µ(∪n∈N En ) ≤ µ(En ) holds for any sequence {En }n∈N ⊆ F . (1.1)0
n∈N
Given a measurable space (Ω, F) , a function f : Ω → R will be said to be mea-

surable if f −1 ((a, ∞)) belongs to the σ-algebra F of measurable sets, for every a ∈ R .
Taking complements, countable unions and intersections we deduce that f −1 (I) ∈ F holds
for any interval I. Useful here are the observations [a, ∞) = ∩n∈N (a − 1/n , ∞) ,
\ [
[a, b] = (a − 1/n , b + 1/n) , (a, b) = [a + 1/n , b − 1/n]
n∈N n∈N
2
(which show that any σ−algebra that contains the open intervals also contains the closed
ones, and vice-versa), as well as
f −1 (∪α∈A Eα ) = ∪α∈A f −1 (Eα ) , f −1 (∩α∈A Eα ) = ∩α∈A f −1 (Eα ) .
These latter are valid for any function f : Ω → X and any collection {Eα }α∈A of
nonempty subsets of the (arbitrary) space X .
When f : Ω → R takes only a finite number of values, we say that f is a simple

function; we shall denote by S the class of all simple functions, and by S+ its subclass
∗
of non-negative simple functions. We shall also consider S+ , the class of functions f :
Ω → [0, ∞] that take only finite number of values, possibly +∞ . The integral of such a
∗
function f ∈ S+ is defined as
Z K
X ¡ ¢
I(f ) ≡ f dµ := yk · µ f −1 {yk } , (1.2)
Ω k=1
where f (Ω) := {y1 , · · · , yK } ⊆ [0, ∞] is the range of f . The right-hand side of (1.2) is
well-defined even when yk or µ(f −1 {yk }) is infinite, provided we set 0 · ∞ = 0 . Since f
is non-negative, no ambiguous expression of the form ∞ − ∞ can appear. We have also
I(cf ) = c I(f ) for any real constant c ≥ 0 and any f ∈ S+ . (1.2)0
• We define the Lebesgue integral of any measurable function f : Ω → [0, ∞) as

Z Z
I(f ) ≡ f dµ := sup g∈S g dµ . (1.3)
0≤g≤f
Ω Ω
Here the supremum on the right-hand side is taken over all non-negative simple functions
g ∈ S+ satisfying g ≤ f pointwise, that is, g(ω) ≤ f (ω) for every ω ∈ Ω . (The Eudoxian
method of “exhaustion” once again!) We have clearly the monotonicity property
I(f1 ) ≤ I(f2 ) , if 0 ≤ f1 ≤ f2 (1.3)0
for this integral, as well as the property
I(cf ) = c I(f ) for any c ∈ [0, ∞) and any measurable f : R → [0, ∞) . (1.2)00
Such a function f is said to be integrable, if the supremum of (1.3) is finite; to wit, if

I(f ) < ∞ .
3
• Now if f : Ω → R is any real-valued, measurable function, it can be seen easily (cf.
Exercise 1.9) that its positive and negative parts f ± , as well as its absolute value |f |,
namely
f + = max(f, 0), f − = max(−f, 0), |f | = f + + f − , (1.4)
are all measurable. The real-valued function f is then said to be integrable, if |f | is

integrable; in this case f ± are also clearly integrable, and we can set
Z Z Z
+
I(f ) ≡ f dµ := f dµ − f − dµ = I(f + ) − I(f − ). (1.5)
Ω Ω Ω
It is seen from (1.2)00 that we have
I(cf ) = c I(f ) for any real constant c , and any integrable f : Ω → R . (1.2)000
Here are some simple examples of measure spaces.

(a): The counting measure on a countable space Ω = {ω1 , · · · , ωN } with 1 ≤ N ≤ ∞,
with µ(E) the number of elements in E (which can be infinite); here all subsets E are
measurable.
(b): The normalized counting measure, defined on finite spaces Ω = {ω1 , · · · , ωN }; this
measure µ is the counting measure divided by N and satisfies µ(Ω) = 1.
(c): The measure space for n coin-tosses; this is just the normalized counting measure on
a space Ω with N = 2n elements, conveniently represented as n−tuples ω = (ω1 , · · · , ωn )
with ωi ∈ {0, 1}. Each ωi can be interpreted as the outcome of the i-th coin toss, with
ωi = 1 when the outcome is “heads” and ωi = 0 when the outcome is “tails”.
(d): The Dirac measure δc at the point c ∈ Rd , on the Euclidean space Ω = Rd with all
subsets of Ω measurable, and with δc (E) equal to either 1 or 0, depending on whether the
set E contains the point c or not.
To illustrate the subsequent discussion, we shall also assume for the moment that
(e): there exists a unique measure λ on the real line R, which is defined on a σ-algebra
that contains all intervals in R and coincides with the notion of length when restricted to
intervals, namely
λ(E) = b − a, when E = (a, b ]. (1.6)
This measure λ is called the Lebesgue measure on the real line. We have the following
generalization.
(f ): For every increasing, right-continuous function F : R → R there exists a unique
measure µF defined on a σ-algebra containing all intervals in R, so that
µF (E) = F (b) − F (a) when E = (a, b ]. (1.7)
4
This measure is called the Lebesgue-Stieltjes measure associated with F . It reduces
to the Lebesgue measure when F (x) = x. We shall construct this measure in Section 1.4,
and study it further in Appendix A.
We postpone to §1.4.C a discussion of measures on finite-dimensional Euclidean spaces,
and to §1.6.A the more delicate issue of constructing measures on infinite-dimensional
spaces.
1.1 Remark: The intersection of σ-algebras is a σ-algebra; thus, for any given family G
of subsets of Ω, there is a smallest σ-algebra, denoted by σ(G), that contains G. We say
then that the σ-algebra σ(G) is generated by the family G.
1.1 Definition: If Ω is a metric space, with O denoting the collection of its open sets,
we denote by B(Ω) := σ(O) the collection of Borel subsets; this is the smallest σ-
algebra that contains the open sets of Ω . A function f : Ω → R is then called Borel
measurable, if f −1 ((a, ∞)) ∈ B(Ω) holds for every a ∈ R .
Since any open set in R is a countable union of open intervals, the class of Borel
measurable functions includes all continuous functions; that is, all functions f : Ω → R
such that f −1 (U ) ∈ O holds for every open set U of R .
1.2 Remark: The pointwise limit of a sequence of continuous functions can easily fail
to be continuous. This is a big nuisance in classical analysis and in the theory of the
Riemann integral: taking the limit under the Riemann-integral sign is always a headache.
By contrast, as we shall see in the next section, the pointwise limit of a sequence of
measurable functions is always measurable, and the Lebesgue integral behaves very well
as far as interchange of limits and integrals is concerned.
1.3 Remark: In the case of the real line Ω ≡ R, the Borel σ-algebra is also generated by
intervals, for instance of the type (a, b] with −∞ < a < b < ∞. There exist subsets of the
real line that are not Borel (see Proposition A.2, Appendix A); such sets are necessarily
uncountable.
In the preceding examples (a)-(d), all functions f : Ω → R are measurable. When
Ω is finite and we consider the normalized counting measure, the integral of f is just the
average of the values of f . For a countable set Ω = {ωn }n∈N it is an easy exercise to
verify that Z X
f dµ = f (ωn ) · µ({ωn }) (1.8)
Ω n∈N
when f is non-negative, or when f is integrable – which in this case implies simply that
the series on the right-hand side is absolutely convergent.
R
We shall use the notation E f dµ ≡ I(f χE ) for any E ∈ F and any integrable
function f : Ω → R. It will be seen in §1.4.B that if f : R → R is a continuous function
5
and E is any bounded interval, then the function f χE is integrable (in the above sense)
with respect to Lebesgue measure λ on the real line. In this case the integral of f coincides
with the usual Riemann integral of f over the set E.
1.1 Exercise: Let (Ω, F, µ) be a finite measure space.

(i) : Show that E ∈ F, F ∈ F, µ(E4F ) = 0 imply µ(E) = µ(F ).
(ii) : We write E ∼ F if µ(E4F ) = 0 for E ∈ F, F ∈ F. Show that “∼” defines an
equivalence relation on F.
(iii) : Define ρ(E, F ) := µ(E4F ) for E ∈ F, F ∈ F. Establish the triangle inequality
ρ(D, F ) ≤ ρ(D, E) + ρ(E, F ) for any sets E, F, D in F, and argue that ρ is a metric
on the space of equivalence classes induced by “ ∼ ”.
1.2 Exercise: Consider a nonempty set Ω with all its subsets measurable, and for a
given function f : Ω → [0, ∞] define
Ã !
X X
µ(E) ≡ f (ω) := sup f (ω) , A := {ω ∈ Ω | f (ω) > 0}.
F ⊆E
ω∈E card(F )<∞ ω∈F
(i) : If A is uncountable, then µ(A) = µ(Ω) = ∞.

P
(ii) : If A is countable and g : N → A is any bijection, then µ(E) = g(n)∈E f (g(n)).
(iii) : The set-function µ is a measure.
(iv) : The measure µ is σ-finite, iff: A is countable, and f (ω) < ∞, ∀ ω ∈ Ω.
1.3 Exercise: Let the set-function µ : F → [0, ∞] be a finitely-additive measure, i.e.,

¡ ¢ Pn
suppose that µ(∅) = 0 and that µ ∪nj=1 Ej = j=1 µ(Ej ) holds for any finite collection
of pairwise disjoint measurable sets E1 , · · · , En . Then
(i) µ is a measure, iff: it is continuous from below (that is, µ (∪∞n=1 An ) = limn→∞ µ(An )
∞
holds for any increasing sequence {An }n=1 ⊆ F );
(ii) µ is a measure, iff: it is continuous from above at ∅ (that is, limn→∞ µ(An ) = 0 holds
for any decreasing sequence {An }n∈N ⊆ F with ∩n∈N An = ∅ ), and µ(A1 ) < ∞ .
1.4 Exercise: For any E ⊆ N , denote by γn (E) the number of integers in E that
do not exceed n. If D is the class of subsets E of N for which the limit
ν(E) = limn→∞ (γn (E)/n) exists, then show that ν is finitely additive, but not countably
additive, on D.
1.5 EXERCISE: Regular Measure. Suppose that Ω is a metric space, and that F is
a σ-algebra that contains the Borel sets: B(Ω) ⊆ F . A measure µ : F → [0, ∞] is called
regular, if for every E ∈ F and ε > 0 there exist an open set G and a closed set F such
that F ⊆ E ⊆ G and µ(G \ F ) < ε.
6
(i) If µ as above is regular, then an arbitrary set D ∈ F is “very near a Borel set”, in the
sense that we have
B1 ⊆ D ⊆ B2 , for some B1 , B2 in B(Ω) with µ(B2 \ B1 ) = 0
as well as
D = E ∪ F, for some E ∈ B(Ω), F ⊆ B ∈ B(Ω) with µ(B) = 0.
(ii) If a measure µ on B(Ω) is finite on bounded Borel sets, then it is regular.

(Hint: Start with the finite case µ(Ω) < ∞. Consider the collection E of subsets E of
Ω, such that for each ε there exist a closed set Fε and an open set Gε , with Fε ⊆ E ⊆
Gε , µ (Gε \ Fε ) < ε. Argue that E is a σ-algebra that contains the closed sets, hence
B(Ω) ⊆ E. Then try to remove the assumption µ(Ω) < ∞.)
1.6 Exercise: An increasing function f : R → R is Borel-measurable.

1.7 Exercise: The pointwise supremum of an uncountable family of measurable func-
tions fα : Ω → R , α ∈ A can fail to be measurable.
1.8 Exercise: Let the function f : R × Rd → R be such that f (x, · ) is
B(Rd )−measurable for every x ∈ R, and f ( · , y) is continuous for every y ∈ Rd .
(i) Show that f is then B(R × Rd )−measurable.
(ii) Argue, by induction, that every function on Rd which is continuous in each of its
variables separately, is B(Rd )−measurable.
(Hint: Consider first the case of g simple; then approximate.)
1.9 EXERCISE: Preservation of measurability under simple operations. Let

f, g be real-valued, measurable functions of (Ω, F). If c is a real number, show that the
functions
cf , f2 , f +g, f g, |f | , f±
are also measurable.

Probability Notes Intro

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Probability Notes Intro

Uploaded by

Copyright:

Available Formats

CHAPTER 1: MEASURE THEORY

In this chapter we define the notion of measure µ on a space Ω, construct integrals

1.1. MEASURES AND INTEGRALS

Let Ω be a given non-empty set. A family F of subsets of Ω is said to be an algebra, if it

An algebra F is called σ-algebra, if it is closed under countable unions; that is, if

A pair (Ω, F) with F a σ-algebra of subsets of Ω will be called a measurable space.

Given a measurable space (Ω, F) , a function f : Ω → R will be said to be mea-

f −1 (∪α∈A Eα ) = ∪α∈A f −1 (Eα ) , f −1 (∩α∈A Eα ) = ∩α∈A f −1 (Eα ) .

When f : Ω → R takes only a finite number of values, we say that f is a simple

I(cf ) = c I(f ) for any real constant c ≥ 0 and any f ∈ S+ . (1.2)0

• We define the Lebesgue integral of any measurable function f : Ω → [0, ∞) as

I(f1 ) ≤ I(f2 ) , if 0 ≤ f1 ≤ f2 (1.3)0

for this integral, as well as the property

Such a function f is said to be integrable, if the supremum of (1.3) is finite; to wit, if

are all measurable. The real-valued function f is then said to be integrable, if |f | is

It is seen from (1.2)00 that we have

Here are some simple examples of measure spaces.

µF (E) = F (b) − F (a) when E = (a, b ]. (1.7)

1.1 Exercise: Let (Ω, F, µ) be a finite measure space.

(i) : If A is uncountable, then µ(A) = µ(Ω) = ∞.

1.3 Exercise: Let the set-function µ : F → [0, ∞] be a finitely-additive measure, i.e.,

B1 ⊆ D ⊆ B2 , for some B1 , B2 in B(Ω) with µ(B2 \ B1 ) = 0

D = E ∪ F, for some E ∈ B(Ω), F ⊆ B ∈ B(Ω) with µ(B) = 0.

(ii) If a measure µ on B(Ω) is finite on bounded Borel sets, then it is regular.

1.6 Exercise: An increasing function f : R → R is Borel-measurable.

1.9 EXERCISE: Preservation of measurability under simple operations. Let

are also measurable.

You might also like