Professional Documents
Culture Documents
Probability
Probability
Probability theory is the branch of math that deals with random
events, processes, and variables
Probability
Probability theory is the branch of math that deals with random
events, processes, and variables
Interpretations of Probability
• Classical or Analytic view
– Definition of probability in terms of an analysis of possible (and equally likely)
outcomes for a set of events. I.e., p=(favorable cases/all possible cases)
– Examples:
• Proportion of ways in which two 6-sided dice can yield 10 total dots
• Proportion of 5-card combinations that can form a royal flush
• Frequentist view
– Definition of probability in terms of past performance
• p=(f/N)
– Example:
• If we roll a die 1000 times and it comes up 6 250 times, then we estimate the probability
of rolling a 6 as 0.25 (Note: such a die is probably loaded)
Basic Terminology
• Trial: one of a number of repetitions of an experiment
– E.g, a coin flip, die roll, or measurement procedure
P 1
All Events
Set C
Basic Terminology
• Marginal probability: the probability of one event, ignoring
the occurrence or nonoccurrence of other (simultaneous)
events
– Denoted: P(event1)
– Examples: P(height>68”), P(suit=clubs)
All Events
P(A)
Basic Terminology
• Joint probability: the probability of (simultaneous)
occurrence of two or more events
– Denoted: P(event1,event2) or P(event1 ∩ event 2)
– Examples: P(height>68”, gender=female), P(suit = clubs, rank = king)
Basic Terminology
• Conditional probability: the probability that one event will
occur given the occurrence of some other event
– Denoted: P(event1 | event2)
– Examples: P(height>68” | gender=female), P(suit=clubs | color=black)
All Events
P(A|B) = P(A,B)/P(B)
P(A,B)
P(B)
Basic Terminology
• Independent events (variables): events (variables) related
such that the occurrence of one has no effect on the
probability of occurrence of the other
– E.g., successive coin flips, successive M&M draws from bag sampled
with replacement
– A special category of independent events is IID or independent and
identically distributed events
• Many of the techniques in this course depend on IID assumptions
A Note on “Replacement”
• Most of the tools that we use in statistics assume very large
populations
Example:
Bayes’ Rule
The most common test for lupus correctly identifies the disease in 98% of
people who have it and correctly rejects the disease in 74% of people who do
not have it.
• A patient tests positive for lupus. What is the probability that the
patient has the disease?
0.02(0.98)
P lupus | test 0.07
0.02(0.98) 0.98(0.26)