Professional Documents
Culture Documents
Lecture 7
Introduction to Probability
Basic rules, chance,
probability tree
• Announcements
• Probability
• Introduction
• Addition rule
• Multiplication rule
• Conditional probability
• Tree diagram
• Notations
• More notions about chance
• Independent events
– The occurrence of one event has no effect on
the occurrence of another event
• Results of two coin tosses
• Exhaustive events
– A set that represents all possible outcomes that
can be produced
• Heads, tails in a coin toss
• Numbers “1” “2” “3” “4” “5” and “6” in a throw of a
dice
Theoretical vs. Empirical probability
• Theoretical probability
– aka “analytical vision”
– Based on the number of ways an event can
occur
• Ex: probability of drawing a King out of a pack:
• p(R) = N(R)/[N(R) + N(~R)] = 4/52 = 0,077
• Empirical probability
– “Relative frequency view”
– Conclusions about real past occurrences
• Ex: We drew 2 Kings in 20 attempts
• p(R) = 2/20 = 0,10
p(V) = 4/52
p(R) = 4/52
V, R are mutually exclusive, therefore
P(V or R) = 4/52 + 4/52 = 0,15
Addition rule
• Case 1: A is a subset of B
• Ex: What is the probability that a
card drawn from the deck is a Todas as cartas (52)
Jack (V) or a “figure” card (F)?
F (12)
p(V) = 4/52 V (4)
p(F) = 12/52
However, V and F are not mutually
exclusive:
V is a subset of F: all Jacks are
“figure” cards
We must avoid “double
counting” V & F (4)
p(V or F) = p(V) + p(F) – p(V & F)
= 4/52 + 12/52 – 4/52
12/52 = 3/13 = 0,23
Addition rule
• Caso 2: When A and B intersect
• Ex: What is the probability of a card
drawn from the deck being either a All cards (52)
Jack (V) or a spade (E)?
That is why:
p(A or B) = p(A) + p(B) – p(A & B) A B
P(B | A) != p(B)
P(B | A) = p(B)
So we can reduce
P(A & B) = p(A) * p(B | A) to
P(7) = 4/52
P(N | 7) = 39/51
P(7 & N) = 4/52 * 39/51 = 0.06
Combining probability rules
Subset
• Problems of probability generally require A B
the use of various rules.
A B
Mutually
exclusive
Combining probability rules
– Tree diagrams
• Efficient method for organizing
information
• Helps to correctly apply the addition
and multiplication rules
For marketing reasons, the provider needs to know: What is the probability
that a randomly chosen customer will use chat rooms?
Combining probability rules
An internet provider is interested in knowing how many people use chat rooms. It has the
following information about all the internet users: 29% are 18-29 yo; 47% are 30-49 yo; and
24% are 50 or more. They also interviewed some users of chat rooms: 47% of users between
18-29 yo use chat rooms; 21% between 30-49 yo; and 7% of those 50+ yo.
p(Chat | age1)
Age
0,93 No 0,2232
50+
Putting it into practice
A researcher is interested in how many people have a certain gene. She
has the following information about the tested persons:
• 60% are from Region A
• 40% are from Region B
What is the probability that a person randomly chosen from all those
tested has the gene?
Putting it into practice
A researcher is interested in how many people have a certain gene. She has the
following information about the tested persons: 60% are from Region A; 40% are from
Region B. She also knows that: 30% of people in Region A have the gene; 20% of
people in Region B have the gene. What is the probability that a person randomly
chosen from all those tested has the gene?
p(Gene | A)
p(Gene)
Tested = p(A&G) + p(B&G)
people = 0,18 + 0,08
0,40 = 0,26
0,20 Gene 0,08
• 90%?
Quiz – AIDS Test
• 3. Confusion with inverse conditionals
– Belief that p(B | A) = p(A | B), even
when p(A) != p(B) All people (n = 1000)
• Test of HIV/AIDS is designed.
p(test+ | AIDS) = 0,90. What is AIDS p(+ | AIDS)
p(AIDS | test+)? n = 10 = n(A & +)/n(AIDS)
– Many will say 0.90, confusing p(A) = 10/1000 = 0,01 = 9/10 = 0,90
inverse conditionals
– The correct answer depends on
p(AIDS) and p(test+)
+
– See example...
• Assume: population n = 1000
• AIDS is rare, so assume n(AIDS) = + TEST
10 and p(AIDS) = 0.01 n(A & +) = 9 n(+) = 800
• Assume: test is generally positive for p(A & +) = 9/1000 = 0,009 p(+) = 800/1000 = 0,80
all tested
– n(Test+) = 800
– p(Test+) = 800/1000 = 0.80
p(AIDS | + TEST)
– NB: p(A|B) = p(B|A) SE p(A) = p(B) = n(AIDS & +)/n(+)
• In this case, p(A) = 0.01; p(+) = 0.80
• Test+ does not reveal much = 9 / 800
= 0,01125
Chapters for the next lecture
• Larson & Farber: Chapters 3 & 4