Professional Documents
Culture Documents
1
Preliminaries
Most people know intuitively what the word probability means. Many of us have
encountered probabilities in our daily activities.
Probability is used to model uncertainty (or chance) when dealing with experiments or
natural phenomena whose outcome is not predictable in advance. From now on, we will
simply refer to "random experiments" to include both experiments and phenomena.
Even if the outcome of the random experiment isn't known in advance, let us suppose
that the set of all possible outcomes is known.
2
Example " H finite
+Ñ If the random experiment consists in tossing a coin, the sample space contains two
outcomes, heads or tails, that is
H ={=2 , => }
,Ñ If the random experiment consists in throwing a die, the sample space contains six
outcomes, i.e.
Example # H countable
If the random experiment consists in flipping a coin until the first head is shown, the
sample space is given by
H œ Ö=3 ß 3 œ ",#,á ×
where =3 is the outcome which occurs if the first head occurs on exactly the 3-th flip
(3 œ ",#,á ).
3
Example $ H uncountable
Consider the random experiment which consists in recording the effective stock of
Apple quote tomorrow. The continuous sample space is given by
H œ Ö=B ß B !×
where =B is the outcome which occurs if the quote (US dollars) is exactly B (B !).
Note that the sample space H is the sure event since it will always occur, while the
empty set g, that is a subset of H, is called the impossible event since it will never occur.
Some notes about set theory are essential since the events are sets.
4
Example ".b
Consider the random experiment consisting in throwing a die.
The elementary events are Ö=" ×ß Ö=# ×ß Ö=$ ×ß Ö=% ×ß Ö=& ×ß Ö=' ×Þ
The event ‘even face’ is given by E œ Ö=# ß =% ß =' ×Þ
The event ‘odd face’ is given by F œ {=" , =$ , =& }.
Example 2.b
Consider the random experiment which consists in flipping a coin until the first head is
shown.
The elementary event Ö=3 × occurs if getting the first head on exactly the 3-th flip
(3 œ ",#,á ). The event ‘the first head appears after ten flips’ is given by
E œ Ö=3 ß 3 œ "", "#, á ×.
Example 3.b
Consider the random experiment consisting in recording tomorrow Apple stock quote.
The elementary event Ö=B × occurs if the quote (US dollars) is exactly B (B !).
The event ‘the quote is greater than 10 US dollars’ is given by E œ Ö=B ß B "!×Þ
5
Some elements of Set Theory
Venn diagrams: a rectangle is used to represent the sample space while the inner parts of
closed curves (ovals or rectangles for example) represent subsets.
Venn diagram of I
Venn diagram of E § F
B
A
6
Union event: E ∪ F œ {=: = − E or = − F }
It is the event 'either E or F occurs'ß i.e. E ∪ F occurs if E or F or both occur.
A B
E3
In general, if M is a set of indices, the union event
3−M
In every discrete sample space H, any event E is the union (at most countable) of its
E œ Ö A×
elementary events
=−E
7
Intersection event: E ∩ F œ {=: = − E , = − F }
It is the event 'both E and F occurs'ß i.e. E ∩ F occurs if A and F occur.
Venn diagram of the intersection of E and F
B
A
Events E and F are said to be mutually exclusive if they cannot occur together. This
means that the sets are disjoint.
Venn diagram of E and F mutually exclusive (disjoint)
B
A
E3
In general, the intersection event
3−M
8
Some properties of union and intersection
Let E, F , G subsets of H.
Associative property E ∪ ÐF ∪ GÑ œ ÐE ∪ FÑ ∪ G
E ∩ ÐF ∩ GÑ œ ÐE ∩ FÑ ∩ G
Distributive property E ∩ ÐF ∪ GÑ œ ÐE ∩ FÑ ∪ ÐE ∩ GÑ
E ∪ ÐF ∩ GÑ œ ÐE ∪ FÑ ∩ ÐE ∪ GÑ
9
Complement (or negation) of E: E- œ Ö= − H : =  E×
It is the event 'E doesn’t occur'.
Ac
A
E ∪ E- œ Hß E ∩ E- œ gß
10
Difference event: F Ï E œ Ö= : = − Fß =  E×
It is the event 'F occurs and E does not occur'.
B
A
In generale, note that F is the union of the two disjoint sets E ∩ F and F Ï E.
11
Partition: a collection of 8 events E" ß E# ß á ß E8 such that
3œ"
This means that each single outcome belongs to one and only one event of the partition
and thus one and only one event of the partition always occurs.
Note that the single events constitute a partition of the sample space H.
12
Probability space
For each event E © H, we would like to be able to associate a number between 0 and 1
which will be called the probability of E, denoted by T ÐEÑ.
Unfortunately, only if H is a discrete space, it's possible to consider the set of all
possible subsets of H, that is the power set of H denoted by c ÐHÑÞ
3œ" 3œ"
13
Associated with the sample space H there are many 5 -algebras.
Examples of 5 -algebra
Remark. When H is finite, i.e. H œ Ö=" ß =# ß á ß =R ×, the number of its possible subsets,
that is the cardinality of c ÐHÑ, is given by #R .
• There are R" œ R subsets of H with " outcome: Ö=" ×,Ö=# ×,á ,Ö=8 × .
14
Thus, the number of all the possible subsets of H is given by
15
With any experiment we may associate a pair ÐHß TÑ, where H is the sample space and
T is a 5 -algebra of subsets of HÞ
T ÐE3 Ñ œ T ÐE3 Ñ
∞ ∞
5-additivity property
3œ" 3œ"
The previous properties are usually referred to as the Kolmogorov Axioms. When H is
finite, the axiom 2 can be weakened with
16
Some properties of the probability measure T
In particular T ÐgÑ œ !
17
$ Given two events E, F
T ÐE ∪ FÑ œ T ÐEÑ T ÐFÑ T ÐE ∩ FÑ
Moreover since
E œ ÐE ∩ FÑ ∪ ÐE Ï FÑ
and E ∩ F and E Ï F are mutually esclusive
T ÐE Ï FÑ œ T ÐEÑ T ÐE ∩ FÑ
18
% Let ÐE3 Ñ3− § T be an increasing sequence of events (i.e. E3 § E3" ß 3 œ "ß á Ñß
then
T Ð E3 Ñ œ lim T ÐE3 Ñ
∞
3œ" 3Ä∞
T Ð F3 Ñ œ lim T ÐF3 Ñ
∞
3œ" 3Ä∞
For any sample space many different probability functions, which satisfy the axioms,
can be defined.
19
Example
If the experiment consists in tossing a coin, we can take
Note that the family of probability measures is uncountable even if H is a finite sample
space.
20
Finite sample space
When H œ Ö=" ß =# ß á ß =R × and T œ c ÐHÑ, any probability measure T is determined
by the values T Ð{=" }Ñ ß á ß T Ð{=R }Ñ since
that is, the probability of an event E is the sum of the probabilities of the single events
which are contained in it. Note that
T Ð{=3 }Ñ œ "
R
3œ"
:3 œ "ß
R
3œ"
T ÐEÑ œ :3
3 À =3 −E
21
In particular if :" œ á œ :R œ R" ß T is said the uniform probability on H and the
classical definition of probability is obtained since
#E
T ÐEÑ œ
R
that is the probability of an event E is the proportion of the outcomes that are in E.
Example
Consider the experiment which consists in throwing a die
H œ Ö=" ß =# ß =$ ß =% ß =& ß =' ×ß T œ c ÐHÑ
+Ñ The die is fair: the probability of any single event is
"
T (Ö=3 ×) œ 3 œ "ß á ß 'Þ
'
For any E § T, T ÐEÑ œ #E ' Þ
For example, the event ‘odd face ’ E œ {=" , =$ , =& } has probability
$
T ÐEÑ œ œ !Þ&.
'
22
,Ñ The die is unfair: the probability of getting the face with three dots is twice the
probability of getting any other face.
Let
T (Ö=3 ×) œ :3 3 œ "ß #ß $ß %ß &ß '
Denoting by : the probability of the faces with a number of dots different from three,
the die is unfair such that
:$ œ #: :1 œ :# œ :% œ :& œ :' œ :Þ
Since the sum of the probabilities of the elementary events must be "ß that is
:" :# :$ :% :& :' œ (: œ ",
"
Ê :œ (
23
Countable sample space
When H œ Ö=" ß =# ß á ×, T is still c ÐHÑ and any probability measure T is always
determined by the values T Ð{=" }Ñ ß T Ð{=# }Ñß á since it continues to hold
T Ð{=3 }Ñ œ "
∞
3œ"
Also in this case, given a sequence of non-negative numbers :" ß :# ß á such that
:3 œ "ß
∞
3œ"
T ÐEÑ œ :3
3 À =3 −E
24
Example
Consider the experiment which consists in throwing a die until an even face occurs
H œ Ö=3 ß 3 œ ",#,á ×ß T œ c ÐHÑ
where =3 is the outcome which occurs if an even face occurs for the first time at throw 3.
If the die is fair, the probability that an even face occurs is "# , in such a way that
T (Ö=3 ×) œ
" 3
3 œ ",#,á
#
Note that the sum of the series of probabilities of the elementary events is ", that is
T Ð{=3 }Ñ œ œ #
∞ ∞ "
" 3
"
œ"
3œ" 3œ"
# " #
3 À =3 −E 3 À =3 −E
If E is the event 'an even face occurs in three throws' that is E œ Ö=" ß =# ß =$ ×
" " " (
T ÐEÑ œ # % ) œ )
Probabilities are assigned only to the events in a suitable 5-algebra T § c ÐHÑ and it is
not possible to characterize all probability measures on T.
An important example - Z œ ‘
Let T be the smallest 5-algebra which contains all the events of the form
Ö=B ß B − Ð+ß ,Ñ×
for all real numbers + and ,. It can be straightforwardly proven that T automatically
contains the events
Ö=B ß B − Ò+ß ,Ó×ß Ö=B ß B − Ð+ß ,Ó×ß Ö=B ß B − Ò+ß ,Ñ×
The 5 -algebra formed by all the intervals Ð+ß ,Ñ is called the Borel 5-algebra of ‘ and its
usually denoted by U БÑ.
26
ÖB À 0 ÐBÑ Ÿ >× is a borelian set of U Ð‘Ñ for any real >Ñ, with ∞ 0 ÐBÑ .B œ ",
Given a non-negative continuous function or more in general borelian function 0 (i.e.
∞
Lebesgue ("*!&) proved that there exists a unique probability measure T on the Borel 5-
algebra U Ð‘Ñ such that
T À E È 0 ÐBÑ .B
E
is a probability measure on TÞ
27
Conditional probability
Given a probability space (Hß Tß T Ñß sometimes the probability T has to be updated
when new information is acquired. Supposing that the event F − T had occurred.
The probability of E − T has to be modified taking into account this information.
From the Venn diagram, it is at once apparent that, as only outcomes in F can have
occurred, in order to evaluate the probability of the event E, only the outcomes in
E ∩ F have to be considered.
H
B
A
Example
Consider the experiment which consists in throwing a fair die. Supposing that the event
F œ Ö=# ß =% ß =' × occurred, even if the precise outcome is still unknown. The probability
of the event 'odd face' has been reduced from "# to !.
28
The conditional probability given the event F is a probability measure.
• T ÐH ± FÑ œ T TÐHÐFÑ
∩FÑ
œ TT ÐFÑ
ÐFÑ
œ"
• Given two mutually exclusive events E and G , the events E ∩ F and G ∩ F are
mutually exclusive too in such a way that
T ÐÐE ∪ GÑ ∩ FÑ T ÐÐE ∩ FÑ ∪ ÐG ∩ FÑÑ
T ÐE ∪ G ± FÑ œ œ
T ÐFÑ T ÐFÑ
T ÐE ∩ FÑ T ÐG ∩ FÑ T ÐE ∩ FÑ T ÐG ∩ FÑ
œ œ
T ÐFÑ T ÐFÑ T ÐFÑ
œ T ÐE ± FÑ T ÐG ± FÑ
The previous results can be straightforwardly extended to the case when a sequence E3
of disjoint events are considered so that 5 -additivity property is satisfied.
29
Formally, if T ÐFÑ ! the conditional probability of event E given the event F is
defined by
T ÐE ∩ FÑ
TF ÐEÑ œ T ÐE ± FÑ œ
T ÐFÑ
Thus the probability space (Hß Tß TF Ñ is considered when F has occurred.
Moreover, if T ÐEÑ !, also the conditional probability of event F given the event E
can be defined as
T ÐE ∩ FÑ
TE ÐFÑ œ T ÐF ± EÑ œ
T ÐEÑ
From the previous two definitions, the product rules for probabilities of intersections are
derived
T ÐE ∩ FÑ œ T ÐE ± FÑT ÐFÑ œ T ÐF ± EÑT ÐEÑ
30
The product rules can be generalized to more than two events. In particular, if three
events E, F and G are considered
T ÐE ∩ F ∩ GÑ œ T ÐG ± E ∩ FÑT ÐF ± EÑT ÐEÑ
Generalizations for more than three events are straightforward.
It is worth noting that product rules for probabilities of intersections are frequently used
in random experiments that can be considered as combinations of sub-experiments.
Example
Consider an urn containing ' white balls and % red balls. The random experiment
consists in consecutively selecting three balls without replacement. The goal is to
determine the probability of obtaining three red balls.
Denote by E=< the event 'a red ball is selected at =-th selection', for = œ "ß #ß $Þ
Therefore, using the product rules for probabilities of intersections, it holds
T ÐE"< ∩ E#< ∩ E$< Ñ œ T ÐE$< ± E#< ∩ E"< ÑT ÐE#< ± E"< ÑT ÐE"< Ñ
# $ % "
œ † † œ
) * "! $!
31
Example
Consider the experiment which consists in selecting a ball from an urn containing *!
balls numbered with "ß á , *!:
Let E and F be the event 'a ball with number multiple of 10 is selected' and 'a ball with
even number is selected' respectively, that is
Ê The knowledge that F has occurred does not modify the probability of E.
The converse is also true: conditioning on E, the probability of F does not change.
&
T ÐF ∩ EÑ *! "
T ÐFlEÑ œ œ "
œ œ T ÐFÑ
T ÐEÑ *
#
Then, the knowledge that one of the two events has occurred does not change the
probability of the other event.
33
Example
Consider the random experiment which consists in two consecutive throws of a fair die
H œ ÖÐ=3 ß =4 Ñ À 3 œ "ß á ß 'ß 4 œ "ß á ß '×
Let E be the event "six dots in the first throw" and F be the event "six dots in the second
throw"
E œ ÖÐ=' ß =4 Ñ À 4 œ "ß á ß '× ß F œ ÖÐ=3 ß =' Ñ À 3 œ "ß á ß '×
Since T ÖÐ=3 ß =4 Ñ× œ "Î$', as the number of outcomes is $' and all the outcomes are
equally likely, the probabilities of E and F are given by
' "
T ÐEÑ œ T ÐFÑ œ œ
$' '
Moreover
T ÐE ∩ FÑ " T ÐE ∩ FÑ "
T ÐF ± EÑ œ œ œ T ÐFÑ, T ÐE ± FÑ œ œ œ T ÐEÑ
T ÐEÑ ' T ÐFÑ '
Ê The knowledge that E has occurred does not change the probability that F occurs
and the knowledge that F has occurred does not change the probability that E occurs .
34
Independent events
An event F is said to be independent from an event E if the probability that F occurs is
not influenced by the knowledge that E has occurred
T ÐE ∩ FÑ
T ÐF ± EÑ œ œ T ÐFÑ
T ÐEÑ
Note that if F is independent from E, then T ÐE ∩ FÑ œ T ÐEÑT ÐFÑ and hence
T ÐE ∩ FÑ T ÐEÑT ÐFÑ
T ÐE ± FÑ œ œ œ T ÐEÑß
T ÐFÑ T ÐFÑ
35
Let E and F be independent.
36
Note that the independence is a consequence of how the probability measure has been chosen.
Example
Consider the experiment which consists in throwing a die
H œ Ö=" ß =# ß =$ ß =% ß =& ß =' ×ß T œ c ÐHÑ
and consider the following two events
E œ Ö=# ß =% ß =' ×, F œ Ö=" ß =# ×
+Ñ The die is fair. E and F are independent since
" " "
T ÐE ∩ FÑ œ T ÐÖ=# ×Ñ œ œ † œ T ÐEÑT ÐFÑ
' # $
Remark. E- and Fß F - and E, E- and F - are independent.
,Ñ The die is unfair. The probability of getting the face with three dots is twice the
probability of getting any other face. E and F are not independent since
" $ #
T ÐE ∩ FÑ œ T ÐÖ=# ×Ñ œ ( Á ( † ( œ T ÐEÑT ÐFÑ
37
More generally, a family ÖE3 À 3 − M× is called independent if
Example
Consider the experiment which consists in selecting a ball from an urn containing % balls
numbered with "ß #ß $ß %ß i.e. H œ Ö=3 ß 3 œ ", #ß $ß %×ß T œ T ÐHÑß and consider the
following three events
E œ Ö=" ß =# ×, F œ Ö=" ß =$ ×, G œ Ö=" ß =% ×
We want to compute the probability that the light bulb will not be defective.
Note that
I œ E- ∩ F -
As the two events E and F can be supposed to be independent, also their complements
E- and F - are independent. Therefore
T ÐIÑ œ T ÐE- ∩ F - Ñ œ T ÐE- ÑT ÐF - Ñ œ !Þ*& ‚ !Þ** œ !Þ*%!&
39
Bayes rule
By the product rule of intersection: T ÐE ± FÑT ÐFÑ œ T ÐF ± EÑT ÐEÑ
Given a partition E" ß E# ß á ß E8 of H where T ÐE3 Ñ ! for any 3 œ "ß á ß 8, for any
F œ ÐF ∩ E3 Ñ
8
event F − T, it holds
3œ"
3œ"
40
Example
Consider two dice:
the first is a fair die
the second is such that the probability of getting a face with an odd number of dots is
twice the probability of getting a face with an even number of dots.
The random experiment consists in flipping a coin, if heads appears the fair die is
thrown, otherwise the manipulated die is thrown.
The aim is to compute the probability of getting the face with three dots and, given that
the face with three dots has appeared, the probability that the manipulated die has been
thrown.
As to the fair die, all the faces have the same probability of appearing.
As to the manipulated die, denoted by : the probability of getting a face with an even
number of dots, the probability of getting a face with an odd number is #:. Therefore
:" :# :$ :% :& :' œ #: : #: : #: : œ "
and : œ "Î*ß in such a way that the probability of getting a face with three dots is #Î*Þ
41
Denoted by E the event "the face with three dots has appeared" and by F the event "tails
has appeared", where T ÐFÑ œ "Î#Þ Since
E œ ÐE ∩ FÑ ∪ ÐE ∩ F - Ñ
it follows that
T ÐEÑ œ T ÐE ∩ FÑ T ÐE ∩ F - Ñ
# " " " (
œ T ÐE|FÑT ÐFÑ T ÐE|F - ÑT ÐF - Ñ œ † † œ
* # ' # $'
Moreover, given that the face with three dots has appeared, by using Bayes rule the
probability that the manipulated die has been thrown is given by
# "
T ÐE ∩ FÑ T ÐE|FÑT ÐFÑ * † # %
T ÐFlEÑ œ œ œ (
œ
T ÐEÑ T ÐEÑ $'
(
42
Example
An analyst predicts that there is a %!% chance that the U.S. economy performs well. If
the U.S. economy performs well, then there is an )!% chance that Asian countries will
also perform well. On the other hand, if the U.S. economy performs poorly, the
probability of Asian countries performing well goes down to !Þ$!Þ
43
To determine T ÐEÑ, note that E can be expressed as the union of disjoint events as
E œ ÐE ∩ FÑ ∪ ÐE ∩ F - Ñ
so that
T ÐEÑ œ T ÐE ∩ FÑ T ÐE ∩ F - Ñ œ T ÐElFÑT ÐFÑ T ÐElF - ÑT ÐF - Ñ
œ T ÐElFÑT ÐFÑ T ÐElF - ÑT ÐF - Ñ
œ !Þ)! † !Þ%! !Þ$! † Ð" !Þ%!Ñ œ !Þ$# !Þ") œ !Þ&
As to the probability that the U.S. economy will perform well, given that the Asian
countries perform well, using Bayes rule
44