Professional Documents
Culture Documents
The content of this lecture roughly corresponds to Sec. 3.4 of Sheldon and Ross’ Introduction to Probability
and Statistics for Engineers and Scientists.
2.1 Probability
1
The intuitive notion of probability
Since experiments are repeatable (ad infinitum), when we have an event A, we can think of the probability
of A, written P (A), as the long-term relative frequency of the event; that is, we perform the experiment a
large number of times, say N , and count how many of these trails resulted in an outcome belonging to the
n
event A, say n, and take this ratio N as N → ∞.
n
You will notice N is always between 0 and 1 for every N . You’ll also notice that, if this limit tends to 0,
then the likelihood the event A occurs is rare; whereas if this limit tends to 1, the event A is very likely to
occur.
Three Axioms of Probability
Axiom (3) is called finite additivity. Axiom (3∗ ) is called countable additivity.
These axioms are equivalent if Ω is a finite set. However, if Ω is infinity, it turns out that axiom (3∗ )
implies axiom (3), but not necessarily the other way around. Therefore, when the 3 axioms are stated, it is
customary to take axioms (1), (2) and (3∗ ) as the axioms. (More on this in a bit, see remark ??)
The point here is no matter how we define a probability, it must satisfy these 3 axioms.
Here are consequences of the axioms (and consequently properties that all probability functions satisfy!)
This is called the complementary rule of probability. Also, since (Ac )c = A, we can write this as P (Ac ) =
1 − P (A).
1 This is discussed in §2.1 of the textbook (R. J. Larsen and M. L. Marx, An Introduction to Mathematical Statistics and
2-1
2-2 Lecture 2: Axioms of Probability
The proof is easy: since A and Ac are mutually exclusive and Ω = A ∪ Ac , we have
axiom(2) axiom(3)
1 = P (Ω) = P (A ∪ Ac ) = P (A) + P (Ac ).
Theorem 2.2
Remark 2.3 By taking A1 = A, A2 = B, and Ai = ∅ for i ≥ 3, the countable additivity axiom says
P (A ∪ B ∪ ∅ ∪ ∅ ∪ · · ·) = P (A ∪ B) = P (A) + P (B) + P (∅) + P (∅) + · · · = P (A) + P (B). This shows that
countable additivity implies finite additivity.
Point: B is the mutually exclusive union of A and B ∩ Ac and since P (B ∩ Ac ) ≥ 0 (axiom (1)), it follows
P (B) = P (A) + P (B ∩ Ac ) ≥ P (A) + 0
Corollary 2.5 Since A ⊆ Ω, it follows P (A) ≤ P (Ω) = 1. So every event A has the property P (A) ≤ 1.
Notice that knowing any 3 of the 4 probabilities in the above equation gives us the 4th . For instance, P (A ∩
B) = P (A) + P (B) − P (A ∪ B).
Here’s another proof by “picture”:
Lecture 2: Axioms of Probability 2-3
• On one hand,
P (A ∪ B) = P (R1 ∪ R2 ∪ R3 ) = P (R1 ) + P (R2 ) + P (R3 )
The inclusion-exclusion principle generalize to any finite number of events. For example, for 3 events:
P (A1 ∪ A2 ∪ A3 ) = P (A1 ) + P (A2 ) + P (A3 )
− P (A1 ∩ A2 ) − P (A1 ∩ A3 ) − P (A2 ∩ A3 )
+ P (A1 ∩ A2 ∩ A3 )
2-4 Lecture 2: Axioms of Probability
These formula can come in handy to give us a strategy for computing the union of a bunch of events.
Example 2. Suppose we have 3 events A, B and C that all have equal probability
P (A ∩ B) = P (A ∩ C) = P (B ∩ C) = 0.25
P (A ∩ B ∩ C) = 0.125.
Here’s another way by filling in probabilities into the mutually exclusive regions of a Venn diagram: