Professional Documents
Culture Documents
Ω = {ω1 , . . . , ωn }
I toss n coins (n = 4)
Sample Space
Describe the sample spaces for the following experiments
I roll a regular die
{1, 2, 3, 4, 5, 6}
I toss n coins (n = 4)
I toss n coins (n = 4)
I toss n coins (n = 4)
1) E = {2, 4, 6}
2)
n−1 n−1
z }| { z }| {
E = {TTTTTT T , TTTTTT H}
3) E = (3, ∞) (using minutes as unit of time)
Sample Space and Events
A ∪ B = {ω ∈ Ω : ω ∈ A or ω ∈ B}
A ∩ B ≡ AB = {ω ∈ Ω : ω ∈ A and ω ∈ B}
Ā ≡ Ac = {ω ∈ Ω : ω ∈
/ A} = Ω \ A
B \ A = B ∩ Ā = {ω ∈ Ω : ω ∈ B and ω ∈
/ A}
Notation
When A is a countable set, we denote with |A| the number of
points/elements in A (the size or cardinality of A)
A Set-theory Recap
Properties
Commutativity
A ∪ B = B ∪ A, A∩B =B ∩A
Associativity
(A ∪ B) ∪ C = A ∪ (B ∪ C ), (A ∩ B) ∩ C = A ∩ (B ∩ C )
Distributivity
A∩(B ∪C ) = (A∩B)∪(A∩C ), A∪(B ∩C ) = (A∪B)∩(A∪C )
Idempotency
A ∪ A = A, A∩A=A
De Morgan’s laws
A ∪ B = Ā ∩ B̄, A ∩ B = Ā ∪ B̄
n
[ n
\ n
\ n
[
Ai = Āi , Ai = Āi
i=1 i=1 i=1 i=1
A Set-theory Recap. Poincaré Identities
Inclusion-Exclusion Principle (Poincaré Identity)
Let Ai ⊂ Ω, i = 1, . . . , n
n
[
| Ai | = |A1 ∪ A2 ∪ . . . ∪ An | =
i=1
n
X X
|Ai | − |Ai ∩ Aj |
i=1 1≤i<j≤n
X
+ |Ai ∩ Aj ∩ Ak |
1≤i<j<k≤n
− . . . + (−1)n−1 |A1 ∩ A2 ∩ . . . An |
+ . . . + (−1)n−1 |A1 ∩ A2 ∩ . . . An |
An Example
In a group of 21 (Arabic-speaking) Lebanese, 16 speak French, 13
English, 4 Armenian, 9 English and French, 2 French and
Armenian, 3 English and Armenian and 1 English, French and
Armenian. How many speak only Arabic?
An Example
In a group of 21 (Arabic-speaking) Lebanese, 16 speak French, 13
English, 4 Armenian, 9 English and French, 2 French and
Armenian, 3 English and Armenian and 1 English, French and
Armenian. How many speak only Arabic?
Let A be the set of Armenophones, E the set of Anglophones, F
the set of Francophones
The set of those who speak only Arabic is
An Example
In a group of 21 (Arabic-speaking) Lebanese, 16 speak French, 13
English, 4 Armenian, 9 English and French, 2 French and
Armenian, 3 English and Armenian and 1 English, French and
Armenian. How many speak only Arabic?
Let A be the set of Armenophones, E the set of Anglophones, F
the set of Francophones
The set of those who speak only Arabic is
Ā ∩ Ē ∩ F̄ = A ∪ E ∪ F = Ω \ (A ∪ E ∪ F )
|Ā ∩ Ē ∩ F̄ | = |Ω| − |A ∪ E ∪ F |
= |Ω| − (|A| + |E | + |F |) + |A ∩ E | + |A ∩ F | + |E ∩ F |
−|A ∩ E ∩ F |
= 21 − (4 + 13 + 16) + 3 + 2 + 9 − 1 = 1
Some Terminology
P(∅) = 0
Proof. If Ai = ∅, i ∈ N, then ∞
S
i=1 Ai = ∅
∞ ∞ ∞
!
[ X X
P(∅) = P Ai = P(Ai ) = P(∅)
i=1 i=1 i=1
2) Finite additivity
Let Ai , i = 1, . . . , n, be a finite family of disjoint events,
Ai ∩ Aj = ∅, i 6= j
n n
!
[ X
P Ai = P(Ai )
i=1 i=1
using 4)
Inclusion-Exclusion Formulae (Poincaré’s Identities)
Very often if one has to compute the probability that at least one
event occurs or the probability that no event occurs, the Poincaré
formula is most useful
Ω = {ω1 , ω2 , ω3 , ω4 } P(ωi ) = p, ∀i
A1 = {ω1 , ω2 }, A2 = {ω1 , ω3 }, A3 = {ω1 , ω4 }
Ω = {ω1 , ω2 , ω3 , ω4 } P(ωi ) = p, ∀i
A1 = {ω1 , ω2 }, A2 = {ω1 , ω3 }, A3 = {ω1 , ω4 }
|E |
P(E ) =
|Ω|
|E | = n1 · n2 · · ·
In how many ways can a woman that has 8 skirts, 4 pair of shoes
and 10 shirts be dressed?
Problem
In how many ways can a woman that has 8 skirts, 4 pair of shoes
and 10 shirts be dressed?
8 · 4 · 10 = 320
We have a succession of three choices (one per die) and each die
represents a multiple choice of six possibilities ni = 6, thus there
are
6 · 6 · 6 = 216
possible rolls
This is obvious if you think of tossing the dice one after the other
rather than simultaneously (although it does not matter) The set
of choices at each throw of the dice in this case does not change
based on what happens at the previous stage (it is always one of
the numbers {1, 2, 3, 4, 5, 6} (and thus the number does not
change which is only thing that matters for the multiplicative rule)
Problem
6·5
In this case there are 6 available possibilities for the first choice.
Five different possibilities for the second die. Which five depends
on the first choice (e.g. if 3 shows up on the first roll, the set of
available possibilities is {1, 2, 4, 5, 6}, if 2 shows up, such set is
{1, 3, 4, 5, 6}), but there are always 5 possibilities for the second
die, so the first rule of counting as formulated above still applies
Problem
7 · 2 = 14
Each year starts on one of the seven days (Sunday, Monday, ...,
Saturday). Each year is either a leap year (i.e., it includes February
29) or not
Standard Ways of Counting and Basic Probabilistic Models
We are going to see now two standard ways of counting which can
be used in the majority of combinatorial problems
I Sampling Methods
I Allocation Methods
Basic Probabilistic Models. Sampling Model
n1 = n
n2 = n
n1 · · · nk = nk
Ω = {(a1 , a2 , . . . , ak ), ai = 1, ..., n}
Ω = {(a1 , a2 , . . . , ak ), ai = 1, ..., n}
n1 = n
Sampling without replacement and with ordering
II. Sampling without replacement and with ordering
The first individual is chosen from a population with n individuals
n1 = n
n2 = n − 1
Notice that in this case the k choices are not independent, as the
earlier individuals we drew affect the set of possibilities of later
individuals, but the number of possibilities of a given later draw of
an individual is not affected by earlier draws, so the first rule still
applies
Sampling without replacement and with ordering
Ω = {(a1 , a2 , . . . , ak ), ai = 1, . . . , n, ai 6= aj if i 6= j}
Ω = {(a1 , a2 , . . . , ak ), ai = 1, . . . , n, ai 6= aj if i 6= j}
I How many ways are there to form a 3-letter word if the letters
must be different?
Some problems
I How many ways are there for 2 persons to sit in 7 chairs that
are in a row?
7 · 6 = (7)2
Sampling with ordering (it matters in a theater, say, where
you sit) and replacement (two people may not sit on the same
chair)
I How many ways are there to form a 3-letter word if the letters
must be different?
26 · 25 · 24 = (26)3
Sampling with ordering
(n)n = n!
Some problems
6!
6!
7!
2!2!
shuffling the two A’s and the two B’s does not change the
word
Sampling without Ordering
We now consider the case in which the order of the elements in the
sample is irrelevant/disregarded
In this case too we have to distinguish between sampling with or
without replacement
A word on terminology: we will always explicitly specify whether the elements in the samples are ordered or not,
unless it is clear from the context. Sometimes however the word sample is used to refer to what we have called an
ordered sample and the word population is used to refer to an aggregate of elements without regard to their order.
We have indeed used population in this sense, but when we consider sampling from a population, we will mostly
talk about unordered samples of size k instead of sub-populations of size k. In some books, you may find that the
problem of counting the number of possible unordered samples of size k from a population of size n is referred to
Ω = {[1, 2], [1, 3], [1, 4], [2, 3], [2, 4], [3, 4]}
Ω = {[a1 , . . . , ak ] : ai = 1, . . . , n, ai 6= aj , i 6= j}
Ω = {[1, 1], [2, 2], [3, 3], [4, 4], [1, 2], [1, 3], [1, 4], [2, 3], [2, 4], [3, 4]}
In this case again [1, 2] = [2, 1], thus it is counted once, and we
can have the same individual repeated in a sample (as many as k
times in a k-sample) because we replace the individual back in the
population before sampling next time
Sampling with replacement and without ordering. The
sample space
Ω = {[a1 , . . . , ak ] : ai = 1, . . . , n, }
What is |Ω|?
n−1+k n−1+k
with replacement Ω = {[a1 , a2 , . . . , ak ] : ai = 1, . . . , n} k
= n−1
n n!
without replacement Ω = {[a1 , a2 , . . . , ak ] : ai = 1, . . . , n, ai 6= aj if i 6= j} k
= k!(n−k)! ≡ Ck,n
n≥k (set of combinations)
[1, 1], [1, 2], [1, 3] [1, 2], [1, 3], [2, 3]
[2, 2], [2, 3], [3, 3]
Some Problems
Six letters are selected at random one after another from the
English alphabet (26 letters) with replacement
Find the probabilities that: a) the word formed is made up of
vowels (6, if we count Y as vowel); b) it is the word BEIRUT
Some Problems
Six letters are selected at random one after another from the
English alphabet (26 letters) with replacement
Find the probabilities that: a) the word formed is made up of
vowels (6, if we count Y as vowel); b) it is the word BEIRUT
a) |Ω| = 266 , |E | = 66 , p(E ) = (6/26)6
Some Problems
Six letters are selected at random one after another from the
English alphabet (26 letters) with replacement
Find the probabilities that: a) the word formed is made up of
vowels (6, if we count Y as vowel); b) it is the word BEIRUT
a) |Ω| = 266 , |E | = 66 , p(E ) = (6/26)6
Six letters are selected at random one after another from the
English alphabet (26 letters) with replacement
Find the probabilities that: a) the word formed is made up of
vowels (6, if we count Y as vowel); b) it is the word BEIRUT
a) |Ω| = 266 , |E | = 66 , p(E ) = (6/26)6
Six letters are selected at random one after another from the
English alphabet (26 letters) with replacement
Find the probabilities that: a) the word formed is made up of
vowels (6, if we count Y as vowel); b) it is the word BEIRUT
a) |Ω| = 266 , |E | = 66 , p(E ) = (6/26)6
|Ω| = (n)k
|E | = k(n − 1)k−1
(n − 1)k n−k k
P(E ) = 1 − P(Ē ) = 1 − =1− =
(n)k n n
Problem
(n − 1)k 1 k
P(E ) = 1 − P(Ē ) = 1 − =1− 1−
nk n
Standard Ways of Counting and Basic Probabilistic Models
I Sampling Model
I Allocation Model
(k1 , . . . , kn ), ki ≥ 0
k1 + k2 + . . . + kn = k
n+k−1
with replacement nk k
n! n
without replacement (n)k = (n−k)! k
ai 6= aj , i 6= j
n≥k
n+k−1
without exclusion nk k
ki = 0, . . . , k
n! n
with exclusion (n)k = (n−k)! k
ki = 0, 1
(⇒ n ≥ k)
distinguishable objects indistinguishable objects
order within the cell immaterial
Exercise
Let f (x1 , . . . , x12 ) be an analytic function of 12 variables. How
many partial derivatives of order 5 exist?
Exercise
Let f (x1 , . . . , x12 ) be an analytic function of 12 variables. How
many partial derivatives of order 5 exist?
Partial derivatives do not depend on the order of differentiations
Exercise
Let f (x1 , . . . , x12 ) be an analytic function of 12 variables. How
many partial derivatives of order 5 exist?
Partial derivatives do not depend on the order of differentiations
n = 12, k = 5
12 − 1 + 5
= 4368
5
∂5
f (x1 , x2 , . . . , x12 )
∂x2 ∂x3 ∂ 2 x6 ∂x10
or think of 12 boxes and k = 5 identical balls to be placed, with
each ball allocation in a box determining with respect to which
variable one derivative is taken
Exercise-Summer 2017
n = 5 (boxes=slidings), k = 15
(1, 2|3|4) (1, 3|2|4) (1, 4|2|3) (2, 3|1|4) (2, 4|1|3) (3, 4|1|2)
(1, 2|4|3) (1, 3|4|2) (1, 4|3|2) (2, 3|4|1) (2, 4|3|1) (3, 4|2|1)
However, if the groups (of the same size) are indistinguishable then
the second line is the same as the first, so the total number of
ways of splittings is 12/2! = 6
Multinomial Coefficients
the number of groups of the same size is the only thing that
matters since the groups are indistinguishable
two groups of different sizes are distinguishable in view of their
different sizes
Three observations:
1 If we allow ki to be zero (the multinomial coefficient does
make sense since 0! = 1) and we sum
k
X k!
k1 ! · · · kn !
k1 ,...,kn =0
k1 +...+kn =k
Three observations:
1 If we allow ki to be zero (the multinomial coefficient does
make sense since 0! = 1) and we sum
k
X k!
k1 ! · · · kn !
k1 ,...,kn =0
k1 +...+kn =k
Assume that there are 365 days in the year, and that the
probability that a student has the birthday in a given day is the
same for all days of the year
Assume that there are 365 days in the year, and that the
probability that a student has the birthday in a given day is the
same for all days of the year
P(E ) = 1 − P(Ē )
(365)t 365 t!
= 1− =1−
365t t 365t
365 364 365 − t + 1
= 1− · ···
365 365 365
n = 365 boxes (each labelled by a day of the year) and t
(distinguishable) balls (people are distinguishable)
|Ω| = 365t and |Ē | = (365)t since the t distinguishable balls may
not fall in the same box
Equivalently, to
365
compute |Ē |, choose the t boxes that should be
occupied, t , then place the t balls, all possible ways of placing
the t balls in the t boxes being (t)t = t!
Birthday Problem
Notice that already when t = 23, the probability of having at least
two people with the same birthday is already greater than 1/2
Since we are interested in the event that no two people have the
same birthday, but we are not really interested in which people,
why can’t we consider the people indistinguishable?
If we did so, the points in the corresponding sample space would
not have the same probability, so we might not use the formula
based on the counting of the outcomes to compute the probability.
Namely, we are assuming that 1) a person can be born with the
same probability any day of the year, i.e. 1/365 ; 2) this probability
does not depend on another person’s birthday (the events are
independent- see later here for a more precise description)
This implies the denominator must be 365t = nt (distinguishable
balls)
Birthday Problem
p1 p2 p1 p2 p1 p2
p2 p1 p2 p1 p2 p1
p p p p p p
whose n−1+t
t = 6 elements do not all have the same probability:
the states in the second line have twice the probability as those in
the first line (2/9 vs 1/9)
When this is the case, always label the balls even if the problem
says the balls are identical
Sample spaces
|E | =
Problem. Continued
Fill the other ñ = 8 − 3 states with one particle in each state (this
can be done in one way only, since the particles are
indistinguishable)
Notice that
n+k −1
k
counts all possible ways of allocating the k identical particles to
the n states, including those that leave states empty, which is why
we filled the states first with one particle
Notice also that this way of approaching the problem would not
work if the particles were distinguishable
Exercise
Consider 4 balls and three labelled boxes. If each ball has the same
probability of falling in any box and of doing so independently of
the others, what is the probability that one box is left empty?
Exercise
Consider 4 balls and three labelled boxes. If each ball has the same
probability of falling in any box and of doing so independently of
the others, what is the probability that one box is left empty?
Total number of ways of arranging k = 4 objects in n = 3 balls is
|Ω| = 34
that the balls are identical or not does not change anything in this case,
since they all have the same probability 1/3 of falling into any box and
they do so independently of each other (1/3)4 (see previous exercise
however when the assumptions are different)
For the numerator: choose the empty box: 31 ; place all the k = 4
|Ā1 ∩ A¯2 |
Care should be taken when considering the sample spaces and the
probability of its outcomes
It will become clearer with practice, but the following two slides
(taken from two textbooks) should give you further details on how
to assign probabilities to sample spaces
Ordered versus Unordered Samples
Sometimes the same collection of elements in different orders are treated as different samples, and sometimes the
same elements in different orders are treated as the same sample. ”In general, how can one tell which is the correct
way to count in a given problem? Sometimes, the problem description will make it clear which is needed. For
example, if we are asked to find the probability that the items in a sample arrive in a specified order, then we
cannot even specify the event of interest unless we treat different arrangements of the same items as different
outcomes. However, there are cases in which the problem description does not make it clear whether or not one
must count the same elements in different orders as different outcomes. Indeed, there are some problems that can
be solved correctly both ways... In general, this is the principle that should guide the choice of counting method. If
we have the choice between whether or not to count the same elements in different orders as different outcomes,
then we need to make our choice and be consistent throughout the problem. If we count the same elements in
different orders as different outcomes when counting the outcomes in the sample space” Ω, ”we must do the same
when counting the elements of the event E of interest. If we do not count them as different outcomes when
counting Ω, we should not count them as different when counting E” (De Groot)
Concerning ordering and sampling in practice
Sometimes ”one feels intuitively that the order within the sample should be irrelevant, and the beginner is therefore
prone to think of samples as not being ordered. But conclusions from a sample are possible only on the basis of
certain probabilistic assumptions, and for these it is necessary to have an appropriate model for the conceptual
experiment of obtaining a sample. Now such an experiment obviously involves choices that can be distinguished
from each other, meaning choices that are labelled in some way. For theoretical purposes it is simplest to use the
integers as labels, and this amounts to ordering the sample... In other words, even though the order within the
samples may be ultimately disregarded, the conceptual experiment involves ordered samples, and ... this affects the
134 4!48!
p(E ) =
52!
Problem (continued). Solution 2
134
p(E ) = 52
4
Problem (continued). Solution 3
Consider instead as sample space Ω the assignments of the 52
cards (distinguishable) to 4 (distinguishable) groups of equal size
52 52!
|Ω| = =
13, 13, 13, 13 13!4
(multinomial coefficient)
The groups of cards E we are interested in are those that contain
each an ace and 12 non aces
4 48 3 36 2 24 1 12 4!48!
|E | = =
1 12 1 12 1 12 1 12 12!4
(choose an ace and assign it to the first player, complete the set
choosing 12 cards from the 48 cards that are not aces, then do the
same for the other players using the remaining cards)
4!48!134
p(E ) =
52!
Hypergeometric
Consider an urn that contains N balls, K of which red and N − K
non-red. n balls are drawn (one after another without replacement
or at once) at random from the urn. What is the probability pk
that k red balls are drawn?
The
K
of ways of choosing k red balls and n − k non-red is
number
N−K
k · n−k
left-hand side: choose the k red balls, the n − k non-red balls and
count all possible orderings (n!)
Hypergeometric
You can look at the problem differently, since
K
N−K n
N−n
k · n−k k · K −k
N
= N
n K
Consider as sample space the set of all distinct words from a set of
K R’s and N − K R̄’s (as if you were drawing all balls from the
urn and placing them one after another on a row, with the balls
indistinguishable other than for their colors, R, R̄) thus (as the
BAALBEK example)
N! N
|Ω| = =
K !(N − K )! K
We are interested in the subset of such words whose first n letters
are k R’s and n − k R̄’s (our sample) and whose last N − n letters
are the remaining letters, K − k of which are
R’s. The total
number of such words is indeed kn · K N−n
−k
(There is nothing special to our sample being the first n letters,
however)
Exercise
Let us draw a ball from it. The probability that the ball is blue is
P(1st blue) =
Conditional Probability
Let us draw a ball from it. The probability that the ball is blue is
B
P(1st blue) =
B +R
Now, the conditional probability answers questions such as this:
what is the probability of drawing a red ball if (given that) the first
ball was blue?
E.g., in the case of sampling without replacement we have:
Let us draw a ball from it. The probability that the ball is blue is
B
P(1st blue) =
B +R
Now, the conditional probability answers questions such as this:
what is the probability of drawing a red ball if (given that) the first
ball was blue?
E.g., in the case of sampling without replacement we have:
R B −1
P(2nd red|1st blue) = , P(2nd blue|1st blue) =
B +R −1 B +R −1
P(B|B) = 1
P(∅|B) = 0
P(A|B) + P(A|B̄) 6= 1
Furthermore
P(A|B) = 1 A ⊇ B
and thus (first axiom)
Since the coin is fair, all the points in the sample space
Ω=
Exercise
A fair coin is tossed twice. What is the probability that both tosses
land on heads given that
1 the first toss lands head up
2 at least one of the tosses lands head up
Since the coin is fair, all the points in the sample space
A = {(H, H)}
Since the coin is fair, all the points in the sample space
A = {(H, H)}
2
P ((H, H) ∩ {(H, T ), (T , H), (H, H)})
P(A|B2 ) =
P ({(H, T ), (T , H), (H, H)})
P ((H, H))
=
P ((H, T )) + P ((T , H)) + P ((H, H))
1/4 1
= =
1/4 + 1/4 + 1/4 3
Interpreting the Conditional Probability
Since A = {(H, H)} ⊂ B1 = {(H, T ), (H, H)}, the sample space is
now B1 , thus we could have obtained P(A|B1 ) directly by simple
counting (because the space is uniform)
1
P(A|B1 ) =
2
Similarly, A = {(H, H)} ⊂ B2 = {(H, T ), (H, H), (T , H)}
1
P(A|B2 ) =
3
That is, another way of reading
P(A ∩ B)
P(A|B) =
P(B)
The factor
(K )k (N − K )n−k K K −1 K −k +1
= ···
(N)n N N −1 N −k +1
N −K N − K − (n − k) + 1
· ···
N −k N − k − (n − k) + 1
contributions.
Indeed kn is the number of ways of placing k red balls in n places
Hypergeometric Using Conditional Probability
For example, suppose the urn has N = 10 balls, K = 3 of which
are red and N − K = 7 non-red (without loss of generality, we can
color them all blue). We sample 3 balls without replacement. The
probability that one of the drawn balls is red is given by
3 7
1
2
10
3
There are 3 red possible patterns in a sample of size 3 with one red
ball
A1 = RBB A2 = BRB A3 = BBR
The hypergeometric counts
p(A1 ∪ A2 ∪ A3 ) = p(A1 ) + p(A2 ) + p(A3 )
3 7 6 7 3 6 7 6 3
= · · + · · + · ·
10 9 8 10 9 8 10 9 8
3 7
3·7·6
= 3 = 1 102
10 · 9 · 8 3
(the first line is because Ai are mutually exclusive)
Exercise
What is the probability of drawing k balls that are red from an urn
with N total balls, K of which red, if instead we sample with
replacement?
What is the probability of drawing k balls that are red from an urn
with N total balls, K of which red, if instead we sample with
replacement?
(binomial-see chapter 3)
Multiplication Rule
Now, from
P(A ∩ B)
P(A|B) =
P(B)
the following multiplication rule follows
P(A ∩ B) = P(A|B)P(B)
P(A ∩ B) = P(B|A)P(A)
Exercise
P(C 1 = A♦ ∩ C 2 = 2♠) =
Exercise
It does not matter the order with which we condition on the events
(which event we choose as the first to condition on, although of
course once a choice is made, we need to stick with it). In
practice, however, some choices make the computation much
easier than other choices
Law of Total Probability
Consider a complete set of disjoint events. Namely,
n
[
A1 , . . . , An , Ai = Ω, Ai ∩ Aj = ∅, i 6= j
i=1
exhaustive (at least one event will occur) and mutually exclusive,
then the following law of total probability holds
X n
P(B) = P(B|Ai )P(Ai )
i=1
Proof. B can be written as the union of disjoint sets
n n
!
[ [
B =B ∩Ω=B ∩ Ai = B ∩ Ai
i=1 i=1
(B ∩ Ai ) ∩ (B ∩ Aj ) = B ∩ Ai ∩ Aj = ∅
Hence
n
X n
X
P(B) = P(B ∩ Ai ) = P(B|Ai )P(Ai )
i=1 i=1
Law of Total Probability
Assume people can be born any day of the year with equal
probability. What is the probability that a randomly selected
person was born on January, 1?
Exercise
Assume people can be born any day of the year with equal
probability. What is the probability that a randomly selected
person was born on January, 1?
Ω = A1 ∪ A2 , A1 ∩ A2 = ∅ (A2 = Ā)
)
A1
P(
P(
A2
)
Graphical Representation
Suppose a new complete set of events is available, B1 , B2 , B3 , for
which the conditional probabilities P(Bi |Aj ) are known. Then we
can use each A-branch as the root of a new sub-tree, whose
branches correspond to the events Bi . These new branches will be
furnished with the conditional probabilities P(Bi |Aj ).
The probabilities of a given event that is the intersection of some
events, e.g. B2 ∩ A1 , can be read from the graph: one has to select
the unique path that contains the branches of the events of
interest, B2 and A1 , and multiply the probabilities one encounters,
because of the chain rule
P( )
A2
) |A 2
B1
P(
P(B2 |A2 )
P(B2 |A2 )P(A2 ) = P(B2 ∩ A2 )
P(
B
3 |A
2)
Graphical Representation
(of course in the case of sampling with replacement, the urn has
the same composition for any draw)
Thus
P(R2 ) = P(R1 )
General Case
What is the probability that the k-th draw is a red ball?
Ek =”the k-th draw is a red ball” We want to compute
P(Ek )
p(Ek ) = R/N
since before carrying out the k-th draw the urn contains the same
balls as at the start
with r = k − 1, m = R − 1, n = N − R.
with r = k − 1, m = R − 1, n = N − R.
which number to
R R (N − 1)! (N − 1)!
|Ek | = (N − 1)k−1 = =R
1 1 (N − 1 − (k − 1))! (N − k)!
(choose one ball among the red ones, place it in the k-th position,
consider an ordered (k − 1)-size sample from the population of all
balls, except one to be placed in k-th position)
|Ek | R
p(Ek ) = =
|Ω| N
General Case. Solution 3
1 k N
|Ek | R
p(Ek ) = =
|Ω| N
Bayes’ Theorem
Bayes’ Theorem
P(B|Ai )P(Ai )
P(Ai |B) = Pn
j=1 P(B|Aj )P(Aj )
Proof
P(Ai ∩ B) P(B|Ai )P(Ai )
P(Ai |B) = = Pn
P(B) j=1 P(B|Aj )P(Aj )
P(A ∩ B) = P(A)P(B)
the fact that one event occurs does not modify the probability of
the other
Independence. General Case
That is, we have to verify the property for any possible subset of
A1 , . . . , An
Pairwise independence
The events
1 1 1
· = p(A) · p(B) = p(A ∩ B) =
6 6 36
Yet they have something to do with each other, as it were.
Is A independent of A?
Problem
Is A independent of A?
That is
A ∩ B = ∅ ⇒ P(A ∩ B) 6= P(A)P(B)
mutually exclusive ⇒ dependence
Conditional Independence