Chapter 4 Probability - MPH

Probability and Probability Distributions
BY: Haymanot N.
1
Learning out comes
• At the end of this chapter the student is
able to:
– Explain the concepts and characteristics of
probabilities
– Compute probabilities of events and conditional
probabilities
– Discuss different rules in probability
– Explain characteristics of probability distribution
– Differentiate between the different distributions
– Understand the concepts and uses of the standard
normal distribution
2
Introduction
 Probability is a measure of how likely it is for an event to
happen.
 Probability is the language of chance.
 Probability theory was developed out of attempting to

solve problems related to games of chance such as tossing
a coin, rolling a die etc.
 probability theory is used to understand:
 About probability distributions
 Sampling and sampling distributions
 Estimation
 Hypothesis testing
 Advanced statistical analysis
3
Definition of terms
Experiment = any process with an uncertain outcome.
Outcomes : results of each trial.

Sample Space(S): The set of all possible outcomes of an
experiment
Example : {H,T}
Probability of the sample space= P(S)=1
Event =Event is any subset of the sample space

e.g. {H }or {T} or {HT}
• Events are represented by uppercase letters such as A, B, C, etc

4
Definition
The definitions for probability
1.Classical probability
Objective probability
2.Relative frequency
3.Subjective probability
5
Definition:
1. Classical Probability: If an event can occur in N mutually

exclusive and equally likely ways, and if m of these posses a
characteristic, E, the probability of the occurrence of E = m/N.

 m = success & N= possible outcomes
Example:
Roll a fair die
If we toss a die, what is the probability of 4 coming up?
m=1(which is 4) and N=6
The probability of 4 coming up is1/6.
6
Definition cont….
2. Relative frequency probability:
• If a process is repeated a large number of times (n),
and if an event with the characteristic E occurs m
times, the relative frequency of E is approximately
the probability of E.
Probability of E = P(E) = m/n
• The probability of an event is the proportion of times

it occurs when exactly the same experiment is
repeated a very large number of times in independent
trials.
7
Example
number of times that the event occurs

Relative Frequency =
number of trials
No freq Relative freq

1 25 25/250 = 0.1
2 34 34/250 = 0.136
3 32 32/250 = 0.128
4 30 30/250 = 0.12
5 34 34/250 = 0.136
6 95 95/250 = 0.38
250 trials 8
Definition cont….
Example
•If you toss a coin 100 times and head comes up 40
times,
 P(H) = 40/100 = 0.4.
•If we toss a coin 10,000 times and the head comes up
5562,
 P(H) = 0.5562.
•Therefore, the longer the series and the longer
sample size, the closer the estimate to the true
value(0.5).
9
3. Subjective probability:
 A subjective probability is an individual’s degree of belief in the
occurrence of an event..
P(X)=the degree of belief that X is true
Example:
 If some one says that he is 95% certain that a cure for AIDS will
be discovered within 5 years, then he means that Pr(discovery of
cure of AIDS within 5 years) = 95%.
10
Mutually exclusive events and the additive law:
• Two events A and B are mutually exclusive if they have no
elements in common.
• If A and B are outcomes of an experiment they cannot both
happen at the same time.
Eg. One die is rolled. Sample space = S = {1,2,3,4,5,6}

• Let A = the event an odd number turns up, A = {1,3,5}
• Let B = the event 1,2 or 3 turns up; B = {1,2,3 }
• Let C = the event 2 turns up, C= {2}
11
A = {1,3,5} B = {1,2,3 } C= {2}
i.Are A and B; A and C; B and C mutually exclusive?
Answers:
A and B are not mutually exclusive. Because they have the
elements 1 and 3 in common
similarly, B and C are not mutually exclusive. They have the
element 2 in common.
 A and C are mutually exclusive. They don’t have any element
in common
12
Thus, if A and B are mutually exclusive events,
Pr(A or B) = Pr (A) + Pr(B).
When two events A and B are not mutually exclusive,
Pr(A or B) = Pr (A) + Pr(B) – Pr(A and B).
From the previous example One die is rolled. Sample space = S = {1,2,3,4,5,6}
•Let A = the event an odd number turns up, A = {1,3,5}
•Let B = the event 1,2 or 3 turns up; B = {1,2,3 }
•Let C = the event 2 turns up, C= {2}
Find Pr (A), Pr (B) and Pr (C)
what is the
 Pr (A or C)?
 Pr (A or B)?
13
Answers
Pr(A) = Pr(1) + Pr(3) + Pr(5) = 1/6+1/6+ 1/6 = 3/6 = 1/2
Pr(B) = Pr(1) + pr(2) + Pr(3) = 1/6+1/6+1/6 = 3/6 = ½
Pr ( C ) = Pr(2) = 1/6
 Pr (A or C)?
 Pr (A or B)?
•Pr(A or C) = Pr (A) + Pr(C)=1/2+1/6=2/3

•Pr(A or B) = Pr (A) + Pr(B) – Pr(A and B).
=1/2+1/2-2/6 =2/3
14
Example
• The weight of new born infants was found
from a large hospital is classified as follows
Weight of new Probability
born
1500- 2500g 0.201
2501-3500g 0.233
3501- 4500g 0.321
4501- 5500g 0.245
15
What is the probability that a randomly selected
newborn from this hospital has a weight less than or
equal to 3500g? Weight of new Probabilit
born y
P(< 3500g)=
1500- 2500g 0.201
P(1500-2500g)+P(2501-3500g)
2501-3500g 0.233
=0.201 +0.233 = 0.434
3501- 4500g 0.321
4501- 5500g 0.245
16
Conditional probabilities and the multiplicative law:
• Conditional probability refers to the chance of a particular event

depends on the outcome of some other event.
• The notation is Pr(B/A), which is read as “the probability event B

occurs given that event A has already occurred .”
• Let A and B be two events of a sample space S.
• The conditional probability of an event A given B denoted by

Pr ( A/B )= P(A n B) / P(B) , P(B) ≠ 0.
• Similarly, P(B/A) = P(A n B) / P(A) , P(A) ≠ 0.
• This can be taken as an alternative form of the multiplicative law.

17
Example
The following table shows the frequency of cocaine use by gender
among adult cocaine users
Life time frequency Male (M) Female(F) total

of cocaine use
1-9 times(A) 32 7 39
20-99 times(B) 18 20 38
100+ (C) 25 9 34
total 75 36 111
Suppose you pick person at random what is the probability of this

person is male?
P(M)=number of male /total number of subjects

=75/111 =0.676 18
Example(cont…
What is the probability that a person picked at random

will be male and used cocaine 100 or more times in his
life time?
P(C n M)=?
P(C n M)=25/111 =0.2252
19
Example(cont…
Suppose you pick person at random and find that he is

male. what is the probability that this male will be a
person who used cocaine 100 or more times during his
life time?
P(C/M) =? P(C/M)=P(C n M)/P(M)
=25/111 = 0.333
75/111 20
Example(cont…
Suppose you pick person at random and find that

she is female. what is the probability that this
female will be a person who used cocaine 20-
99 times during her life time?
21
Example(cont…
What is the probability that the person randomly selected will be

male or will have used cocaine 100 or more times during his life
time or both ?
P(M U C) =?
P(M U C) = P(M) +P( C ) –P(M n C)
=75/111+ 34/111-25/111
=84/111 = 0.757 22
Example 2
• High grade fever and typhoid
High grade Typhoid total
fever yes no
yes 80 40 120
No 10 30 40
total 90 70 160
• What is probability that in this group a child with
Typhoid will have high grade fever also is:
• PROB(High grade Fever | Typhoid) = 80/90 = 0.88
• the proportion of children with typhoid who have high
grade fever also.
23
Example 2(cont…
High grade Typhoid total
fever yes no
yes 80 40 120
No 10 30 40
total 90 70 160
• What is the probability that a child with high grade
fever will have Typhoid.
• PROB(typhoid| High grade fever) = 80/120 = 0.66
• the proportion of children with High grade fever who
have Typhoid also.
24
Example 3
Suppose in country X the chance that an infant lives to age 25 is .95,
whereas the chance that he lives to age 65 is .65
 For the later it is understood that to survive to age 65 means to
survive both from birth to age 25 and from age 25 to 65.
 What is the chance that a person 25 years of age survives to age
65?
P(A)=probability to survive to age 25

 Then, Pr(B/A) = Pr(A n B ) / Pr(A) = .65/.95 = .684 .
 That is, a person aged 25 has a 68.4 percent chance of living to

age 65. 25
Probability in Testing
• Sensitivity: probability of a positive test result given
that the individual tested has the disease
• Specificity :probability that test result is negative
given that the individual tested does not have the
disease
• Predictive value positive test: Among those persons
who test positive for disease, what is the relative
frequency of disease?
• Predictive value negative test– Among those persons
who test negative for disease, what is the relative
frequency of absence of disease?
– All are conditional probabilities
26
BAYES' THEOREM
• Follows from the multiplicative rule
• Often employed in diagnostic testing or
screening
Pr[A/B] = Pr[A n B] = Pr[B/A] Pr[A] provided Pr[B]
not = 0
Pr[B] Pr[B]
Pr[B/A] = Pr[A n B] = Pr[A/B] Pr[B] provided Pr[A] not =
0
Pr[A] Pr[A]
27
DISEASE STATUS
Test (Gold standard)
result PRESENT ABSENT
(D) ( D)
+(T) TP (a) FP (b) TP+FP

_ ( T ) FN (c) TN (d) FN+TN
Total TP +FN FP+TN TP+FP+FN
+TN
The sensitivity of the test

oThis is the probability of a positive result given
that the subject has the disease.
o It is denoted by P(T|D)
28
DISEASE STATUS
Test (Gold standard)
result PRESENT ABSENT
(D) ( D )
+(T) TP (a) FP (b) TP+FP

T
_ ( ) FN (c) TN (d) FN+TN
Total TP +FN FP+TN TP+FP+FN
+TN
The specificity of the test

oThis is the probability of negative result given
that the subject does not have the disease.
o It is denoted by P(T | D)
29
.
The predictive value positive of the test
This is the probability that a subject has the
disease given that the subject has a positive
screening test result
It is calculated using Bayes Theorem through the
following formula
P (T | D) P ( D )
P( D | T ) 
P (T | D ) P ( D )  P(T | D) P ( D)
Where P(D) is the rate of the disease which is

always given P( D)  1  P( D)
p(T | D )  1  P (T | D)
30
The predictive value negative of the test
This is the probability that a subject does not have the disease
given that the subject has a negative screening test result
It is calculated using Bayes Theorem through the following
formula
P (T | D) P ( D)
P( D | T ) 
P(T | D) P( D)  P(T | D) P( D)
where,
p(T | D)  1  P(T | D)
31
Example
A medical research team wished to evaluate a proposed screening test for

Alzheimer’s disease. The test was given to a random sample of 450 patients with
Alzheimer’s disease and an independent random sample of 500 patients without
symptoms of the disease. The two samples were drawn from populations of
subjects who were 65 years or older. The results are as follows.
Test Result Yes (D) ) No

(D Total
positive(T) 436 5 441
) (Negativ
T 14 495 509
Total 450 500 950
32
In the context of this example
a)What is a false positive?
A false positive is when the test indicates a positive result (T) when
the person does not have the disease D
b) What is the false negative?

A false negative is when a test indicates a negative result (T ) when
the person has the disease (D).
c) Compute the sensitivity of the symptom.

436
P(T | D)   0.9689
450
d) Compute the specificity of the symptom.
495
P(T | D)   0.99
500
33
e) Suppose it is known that the rate of the disease in the general population is
11.3%. What is the predictive value positive of the symptom and the predictive
value negative of the symptom
The predictive value positive of the screening is calculated as
P (T | D) P ( D)
P( D | T ) 
P(T | D) P( D)  P (T | D) P( D)
(0.9689)(0.113)
  0.925
(0.9689)(0.113)  (.01)(1 - 0.113)
The predictive value negative of the screening is calculated as

P(T | D) P ( D)
P( D | T ) 
P(T | D) P( D)  P(T | D ) P( D)
(0.99)(0.887)
  0.996
(0.99)(0.887)  (0.0311)(0.113)
34
Independent Events
• The occurrence or nonoccurrence of one event

does not in any way affect the occurrence or
nonoccurrence of the other event.
• Thus, if events A and B are independent,

Pr(B/A) = P(B) and Pr(A/B) = P(A).
35
E.g. n tosses of a coin and the chances that on each toss it lands heads.
These are independent events.

 The chance of heads on any one toss is independent of the number of
previous heads.
 No matter how many heads have already been observed, the chance of
heads on the next toss is ½.
 With independent events, the multiplicative law becomes :
Pr(A and B) = Pr(A) Pr(B)

 Hence, Pr(A) = Pr(A and B) / Pr(B) , where Pr(B) ≠ 0
 Pr(B) = Pr(A and B) / Pr(A) , where Pr(A) ≠ 0
36
Example
In certain high school class ,consisting of 60 girls and
40 boys ,it is observed that 24 girls and 16 boys wear
eye glasses. If a student is picked at random from this
class the probability that the student wears eyeglass
P(E) ,is 40/100 or 0.4.what is the probability that a
student picked at random wears eye glasses, given
that the student is a boy?
Ans
• P(E/B)=p(E n B)/P(B)=16/100/40/100=0.4
• P(E)=P(E/B)
37
Example cont…
• What is the probabillity that a student
picked at random is a boy given that the
student wears eye glasses?
Ans
• P(B/E)=p(E n B)/P(E)=16/100/40/100=0.4
• P(B)=P(B/E)
38
Properties of probability
1. Probabilities are real numbers on the interval from 0 to 1.
2. If an event is certain to occur, its probability is 1, and if the event is certain not
to occur, its probability is 0.
3. The sum of the probabilities of all mutually exclusive outcomes is equal to
P(E1) + P(E2 ) + .... + P(En ) = 1
4. If two events are mutually exclusive ,then Pr(A or B) = Pr(A) + Pr(B).
5. If A and B are two events, not mutually exclusive , then
Pr( A or B)= Pr (A) +Pr (B) – Pr( A and B).
6. The sum of the probabilities that an event will occur and that it will not occur is
equal to 1; hence, P(A’) = 1 – P(A)
7. If A and B are two independent events, then Pr ( A and B) = Pr (A) Pr (B)

39
Probability distribution
40
Probability distribution
41
Random Variable
 Random Variable = is a numerical description of the
outcome of a statistical experiment.
 When the values of a variable (height, weight, or age)

can’t be predicted in advance, the variable is called a
random variable.
 E.g. The weight of a child to be born
42
Random variable(cont….
 Random variables can be either discrete or

continuous
A discrete random variable: is able to assume

only a finite or countable number of outcomes
 A continuous random variable: can take on any

value in a specified interval
43
Probability distribution(cont….
• A probability distribution is a device used to
describe the behaviour that a random variable may
have by applying the theory of probability.
• The term probability distribution refers to the collection of all
possible outcomes along with their probabilities.
• The relationship between the values and their associated probabilities

= Probability Mass Function (Probability distribution)
• Every random variable has a corresponding probability distribution.
• A probability distribution of a random variable can be displayed by a

table or a graph or a mathematical formula. 44
Example
Toss a coin 3 times. Let x be the number of heads
obtained. Find the probability distribution of x .
f (x) = Pr (X = xi) , i = 0, 1, 2, 3.
Pr (x = 0) = 1/8 …………………………. TTT
Pr (x = 1) = 3/8 ……………………. HTT THT
TTH
Pr (x = 2) = 3/8 …………………..HHT THH
HTH
Pr (x = 3) = 1/8 ………………………HHH
X=xi 0 1 2 3
• The probability
Pr(X=xi) 1/8 distribution
3/8 3/8 of X 1/8
45
Probability distribution(cont…
Example:
The following table shows the number of non
prescription drugs used during pregnancy among
women who delivered at some hospital X.
Number of Frequency
drugs
0 220
1 200
2 140
3 55
4 50
5 20
6 10
Total 695
46
• The probability distribution for the number of non
prescription drugs used during pregnancy is as
follows.
Number of Frequency Number of P(x)
drugs drugs(X)
0 220 0 220/695 =0.317
1 200 1 200/695= 0.288
2 140 2 0.201
3 55 3 0.079
4 50 4 0.072
5 20 5 0.029
6 10 6 0.014
Total 695 total 1
47
• The probability distribution can also be displayed in
graphs. The above probability distribution on the
number of non prescribed drugs can be represented as
follows.
48
Examples
What is the probability that a randomly selected woman
will be one who used 2 non-prescription drug?
Solution: P(x=2) =? Number P(x)
of
P(x=2) = 0.201 drugs(X)
0 0.317
What is the probability that a 1 0.288
randomly selected woman is one 2 0.201
who used either one or two drugs? 3 0.079
4 0.072
P(1U2) = P(1) +P(2) 5 0.029
6 0.014
=0.288 + 0.201= 0.489
total 1
49
Exercise
What is the probability
of using 2 or more
Number P(x)
drugs? of
drugs(X)
0 0.317
P(X > 2)=? 1 0.288
2 0.201
P(X > 2)
3 0.079
=P(x=2)+P(X=3)+P(x=4)+P(x=5)
4 0.072
+P(X=6)
5 0.029
= 0.201+ 0.079+ 0.072 +0.029 +0.014 6 0.014
=0.395 total 1
50
The Cumulative Probability Distribution
Number P(x) F(x)= P(X≤ x)

of
It shows the drugs(X)
probability that the 0 0.317 0.317
1 0.288 0.605
variable X is less than 2 0.201 0.806
or equal to a certain 3 0.079 0.885
value, P(X  x). 4 0.072 0.957
5 0.029 0.986
6 0.014 1
total 1
51
• General rules which apply to any probability

distribution:
– They must be numbers in the interval from 0
to 1.
– Since a random variable has to take on one
of its values, the sum of all the values of a
probability distribution must be equal to 1.
52
Permutations and Combinations
Permutations
The number of permutations of n things taken k at a time is

P =n(n−1)×...×(n−k+1)
n k OR P =n!/(n−k)!
n k
It represents the number of ways of selecting k items of n,

where the order of selection is important.
Example:
Suppose 4 schizophrenic women and 7 eligible controls live in the same

community. How many ways are there of selecting 4 controls?
The number of ways = 7P4 = 7(6)(5)(4) = 840.

OR, 7P4 = 7!/3! = 5040/6 = 840.
53
Permutations and Combinations…
Combination
It represents the number of ways of selecting k
objects out of n where the order of selection does
not matter.
The number of combinations of n things taken k at
a time is
æn ö n!
æn ö n(n - 1)´ ...´ (n - k +1) n k ç ÷
C = =
n Ck =ç ÷=
OR èk ø k!(n - k)!
èk ø k!
54
The binomial distribution
• A process that has only two possible outcomes
• Is based on Bernoulli trial, James Bernouli (1654 –
1705).
– When a single trial of an experiment can result in only one of
two mutually exclusive outcomes (success or failure; dead or
alive; sick or well, male/female, etc)
 In statistics the two outcomes are frequently denoted as

success and failure.
 The probabilities of a success or a failure are denoted
by p and q, respectively. Note that p + q = 1.
55
Binomial distribution…
Binomial distribution assumptions:
1) The same experiment is carried out n times ( n trials are made).
2) Each trial has two possible outcomes ( usually these outcomes

are called “ success” and “ failure”.
 If P is the probability of success in one trial, then 1-p is the
probability of failure.
3) The result of each trial is independent of the result of any other

trial.
56
Definition: If the binomial assumptions are satisfied, the
probability of x successes in n trials is:
• Then Pr(X=x) = n! p x (1- p)n-x

x!(n-x)!
57
Example
 Suppose that in a certain population 52% of all recorded births are
males. If we select randomly 5 birth records What is the probability
that exactly 3 will be males?
 Let the five selected persons be in the following order
MFMMF this can be coded as 10110
Then the probability is p(1,0,1,1,0)= p q p p q = q2p3
But we are not only interested only in the above order.
58
• We can also have the following sequences with
probabilities q2p3 for each
• So to find the probability we multiply the above

probability by 10
• So the probability is 10(0.52) 3(0.48)2 =0.32
59
• What can we do when we have a large sample?
Represents the number of ways of

selecting x objects out of n where the
order of selection does not matter
(Combination)
• So the above example represents
Pr(X=x) = n! p x (1- p)n-x
x!(n-x)!
= 5! (0.52) 3(0.48)2 =10(0.52) 3(0.48)2
3!(5-3)!
=0.32
60
 Suppose that in a certain population 52% of all recorded births are
males. If we select randomly 10 birth records What is the probability
that exactly
 5 will be males?
Given n= 10 p=0.52
required Pr(X=5)=?
• Pr(X=x) = n! p x (1- p)n-x
x!(n-x)!
Pr(X=5) = 10! 0.52 5 (1- 0.52)10-5 =0.24
5!(10-5)!
61
 3 or more will be females?
Given n=10 p=0.48
Pr(X=0) = 10! 0.48 0 (1- 0.48)10-0 =0.0014

0!(10-0)!
Pr(X=1) = 10! 0.48 1(1- 0.48)10-1=0.0133

1!(10-1)!
Pr(X=2) = 10! 0.48 2(1- 0.48)10-2=0.111

2!(10-2)!
Pr(X≥3) = 1- Pr(X<3) = 1-[Pr(X=0)+Pr(X=1)+Pr(X=2) ]
=1-[0.001+0.013+0.111]= 1-0.125=0.875
62
Mean and variance
• The mean is the average value of the random

variable in the long run.
– It is also called the expected value or expectation
and
– For random variable X usually denoted by E(X).
• Any random variable which takes discrete
values the mean, is found by summing each
possible value multiplied by its probability.
63
Mean and variance(cont…
• The variance of a random variable is the average
squared difference from the mean.
• So for the previous example

• For the number of heads in tosses of two coins,
– 0 is 1 unit from the mean with probability 1/4
– 1 is 0 units from the mean with probability 1/2
– 2 is 1 unit from the mean and occurs for 1/4 of pairs.
• Variance =(0-1)2x1/4+(1-1)2x1/2+(2-1)2x1/4
=1/2
64
Mean and variance(cont…
• We can now find the mean and variance of the
Binomial distribution with parameters n and p. First
consider n = 1. Then the probability distribution is:
Value probability
0 1-p
1 p
• The mean is therefore 0(1-p)+1(p)=p

• The variance is :(0-p)2x(1-p)+(1-p)2xp
=p2(1-p)+(1-p)2p= p(1-p)
65
• Now, a variable from the Binomial distribution with
parameters n and p is the sum of n independent
variables from the Binomial distribution with
parameters n and p.
• If the true proportion of events of interest is P,

then in a sample of size n
the mean of the binomial distribution is: mean = np
the standard deviation is
66
Exercise
Suppose that in a certain malarious area past experience
indicates that the probability of a person with a high fever
will be positive for malaria is 0.7. Consider 3 randomly
selected patients (with high fever) in that same area.
1)What is the probability that no patient will be positive ?
2)What is the probability that exactly 1 patient will be
positive?
3) What is the probability that exactly 2 of the patients will
be positive?
4) What is the probability that all 3 patients will be positive?
5) Find the mean and the SD of the probability distribution
given above.
67
Answer
Given n= 3 p=0.7
required Pr(X=0)=? Pr(X=1)=? Pr(X=2)=? Pr(X=3)=?

Pr(X=x) = n! p x (1- p)n-x
x!(n-x)!
1) Pr(X=0) = 3! 0.7 0(1- 0.7)3-0=0.027
0!(3-0)!
2) Pr(X=1) = 3! 0.7 1 (1- 0.7)3-1=0.189

1!(3-1)!
3) Pr(X=2) = 3! 0.7 2 (1- 0.7)3-2= 0.441

2!(3-2)! 68
4) Pr(X=3) = 3! 0.7 3 (1- 0.7)3-3=0.343
3!(3-3)!
5) Mean =np = 3(0.7)=2.1

Standard deviation =√np(1-p)
= √3 (0.7)(1-0.7)= √0.63
=0.749
69
The Poisson Distribution
• Is a discrete probability distribution used to

model the number of occurrences of an event
that takes place infrequently in time or space.
• Applicable for counts of events over a given
interval of time, for example:
– number of patients arriving at an emergency
department in a day
– number of new cases of HIV diagnosed at a clinic
in a month
70
The Poisson Distribution(cont..
• In such cases, we take a sample of days and observe

the number of patients arriving at the emergency
department on each day, or a sample of months and
observe the number of new cases of HIV diagnosed at
the clinic.
• We are observing a count or number of events, rather

than a yes/no or success/failure outcome for each
subject or trial, as in the binomial.
71
• Three assumptions must be met for a Poisson
distribution to apply:
1. The probability that a single event occurs within a
given small subinterval is proportional to the length of
the subinterval
2. The rate at which the event occurs is constant over the

entire interval t.
3.Events occurring in an interval of time or space does
not affect the event in the same interval or other
interval.
72
• If x is the number of occurrences of some
random event in an interval of time or space (or
some volume of mater ),the probability that x
will occur is given by:
• λ (lambda) represents the rate at which the event

occurs, or the expected number of events per
unit time
• e =2.71828 73
Example
The daily number of new registrations of cancer
is 2.2 on average. What is the probability of
a) Getting no new cases
b) Getting 1 case
c) Getting 2 cases
d) Getting 3 cases
e) Getting 4 cases
74
Answer
A) Given X=0 ,λ= 2.2 and t=1
P(X=0)= =0.111
B) P(X=1) = 0.244
C) P(X=2) = 0.268
D)(X=3) = 0.197
E) P(X=4) = 0.108
75
Poisson distribution
76
Exercise
The monthly suicide in country x between 1977
and 1987 followed a Poisson distribution with
parameter λ= 2.75.what is the probability that a
randomly selected month will be one in which
three adolescent suicides occurred.(e=2.72)
77
Probability distribution of continuous variables
E.G Suppose, X represents the continuous variable ‘Height’; X can
assume an infinite number of intermediate values between 170 and 171
cm 170.1, 170.2, 170.3 etc.
rarely is an individual exactly equal to 170cm tall
Because a continuous random variable X can take on an infinite
number of values, the probability associated with any particular value is
almost equal to zero.
•Instead of assigning probabilities to specific outcomes of the

random variable X, probabilities are assigned to ranges of values
•The probability is the area under the graph enclosed by two points say x 1 and
x2
78
The Normal Distribution
• It is the probability distribution of continuous variables

and has an especially important role in statistics.
• It is also sometimes known as the Gaussian distribution.
79
The Normal Distribution(cont…
Characteristics of the normal distribution:
1.It extends from minus infinity( -∞) to plus infinity (+∞).
2.It is uni-modal, bell-shaped and symmetrical about x = μ.
3.It is determined by two quantities: its mean ( μ ) and SD (σ ).

 - determines the location of the curve.
 - Determines the shape of the curve(flatness or peaked

ness)
4. The height of the frequency curve cannot be taken as the

probability of a particular value
80
The Normal Distribution(cont…
• The formula that generates the normal
probability distribution is:
2
11xx 2
11
  
22  
ff ((xx)) ee for
for  xx

 22
7183  33..1416
ee 22..7183 1416
 andand arearethe
thepopulation
populationmean
meanand
andstandard
standarddeviation.
deviation.
f(x) represents the height of the curve at point x .
81
The Normal Distribution(cont….)
• For continuous random variable probabilities are
areas under the curve.
• So for normal distribution the area between two
points represents the probabilities.
• To find P(c < x < d), we need to find the area
under the appropriate normal curve.
1  x  μ 2
P(c < X < d) = d 1  
e 2  σ  dx

c 2 πσ
• So, rather than using such complex formula for

every normal distributions which is changing as
mean and standard deviation is changing
82
The standard normal distribution
 Since a normal distribution could be an infinite number of

possible values for its mean and SD, it is impossible to tabulate
the area associated for each and every normal curve.
 Instead only a single curve for which μ = 0 and σ = 1 is

tabulated.
 The curve is called the standard normal distribution (SND).
83
• To simplify the tabulation of these areas, we

standardize each value of x by expressing it
as a z-score, the number of standard deviations
it lies from the mean.
xx 
zz 

84
 Assume a distribution has a mean of 70 and a standard deviation

of 10.
 How many standard deviation units above the mean is a score of
80?
Answer: Z= ( 80-70) / 10 = 1
 How many standard deviation units above the mean is a score of
83?
Answer: Z = (83 - 70) / 10 = 1.3
85
The Standard Normal (z)
Distribution
• Mean = 0; Standard deviation = 1
• When x = mean , z = 0 Symmetric about z = 0

• Values of z to the left of center are negative
• Values of z to the right of center are positive
• Total area under the curve is 1.
86
Using normal table
The four digit probability in a particular row and column of Table 1

gives the area under the z curve to the left that particular value of z.
Area for z  1.36

87
Example
Use z Table to calculate these probabilities:
P(z1.36)
P(z 1.36)==??
P(z1.36)
P(z 1.36)==.9131
.9131
P(z >1.36) =?
P(z
P(z>1.36)
>1.36)=0.0869
=0.0869 or
or
==11--0.913
0.913==.0869
.0869
88
• Find P(Z< -1.20) =? P(Z>1.20) =? P(Z< 1.20) =?
P(z> -1.20) =?
• P(Z< -1.20) =?
P(Z< -1.20)= 0.1151
• P(Z> 1.20) =?
P(Z> 1.20) =0.1151
• P(Z< 1.20) =?
P(Z< 1.20) = 1-0.1151=0.8849
• P(z> -1.20) =?
P(Z> - 1.20) = 1-0.1151=0.8849
89
Example(cont…
P(-1.20 < z < 1.36)=?
90
Example
The weights of packages of ground beef are normally distributed with

mean 1 pound and standard deviation 0.10. What is the probability
that a randomly selected package weighs between 0.80 and 0.85
pounds?
P (.80  x  .85) 
P(2  z  1.5) 
.4772  .4332  .0440

91
Example
What is the weight of a package such that only 1% of all
packages exceed this weight?
PP((xx?) ?)..0101
??11
PP((zz  ))..01
01
..11
From
FromTableTable zzvalue valuefor
for0.01
0.01isis2.33
2.33
??11
22..3333
..11
??22..33
33(.(.11))1111..233
233
92
 From the symmetry properties of the stated normal distribution,
P(Z ≤ -x) = P(Z ≥ x) = 1– P(z ≤ x)
 Hence, P(-1 < Z < +1) = 0.6827
 P(-1.96< Z < + 1.96) = 0.95 and
 P(-2.576 < Z < + 2.576) = 0.99
93
Fig. Percentage of area under a normal distribution with mean μ and
standard deviation σ
For any normal distribution
 about 68% (most) of the observations is contained within one SD of the mean.
about 95% (majority) of the probability is contained within two SDs
and 99% (almost all) within three SDs of the mean. 94
Other Distributions
1. Student t-distribution
2. F- Distribution
3. Chi square Distribution
4. Uniform distribution
5. Exponential distribution
95
Exercise
Assume that among diabetics the fasting blood
level of glucose is approximately normally
distribute with a mean of 105 mg per 100 ml
and SD of 9 mg per 100 ml.
A)What proportions of diabetics have levels
between 90 and 125 mg per 100 ml?
B)What proportions of diabetics have levels
below 87.4 mg per 100 ml?
C) What level cuts of the lower 10% of diabetics?
96
Answer
A)P(90<X<125)
=P(90-105/9<X-µ/ <125-105/9)
= P(-1.67<Z<2.22)
=0.4525 +0.4868=0.9393
B)P(X<87.4)
=p(Z<87.4-105/9)=p(z<-1.96)=0.5-0.4750
=0.025
C)P(X<?)=0.1 P(Z<?-105/9)=0.1
from the table Z=-1.28
-1.28=x-105/9 X=93.48
97
Exercise
• Suppose the height of adult female is normally
distributed with a mean of 162.2 cm and a
standard deviation of 6.8cm.,what is the
probability of a randomly selected female is
taller than 170 cm .
• Ans
A)P(X>170)
=p(Z>170-162.2/6.8)=p(z>1.15)=0.1251
98
Exercise
• Find the probability of the following under the SND
– Above 1.96?
– Below –1.96?
– Between –1.28 and 1.28?
– Between –1.65 and 1.08?
99
100

Chapter 4 Probability - MPH

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Chapter 4 Probability - MPH

Uploaded by

Copyright:

Available Formats

Probability and Probability Distributions

 Probability theory was developed out of attempting to

Outcomes : results of each trial.

Event =Event is any subset of the sample space

• Events are represented by uppercase letters such as A, B, C, etc

1. Classical Probability: If an event can occur in N mutually

characteristic, E, the probability of the occurrence of E = m/N.

If we toss a die, what is the probability of 4 coming up?

m=1(which is 4) and N=6

The probability of 4 coming up is1/6.

• The probability of an event is the proportion of times

number of times that the event occurs

No freq Relative freq

P(X)=the degree of belief that X is true

Eg. One die is rolled. Sample space = S = {1,2,3,4,5,6}

• Let B = the event 1,2 or 3 turns up; B = {1,2,3 }

• Let C = the event 2 turns up, C= {2}

i.Are A and B; A and C; B and C mutually exclusive?

Pr(B) = Pr(1) + pr(2) + Pr(3) = 1/6+1/6+1/6 = 3/6 = ½

•Pr(A or C) = Pr (A) + Pr(C)=1/2+1/6=2/3

4501- 5500g 0.245

• Conditional probability refers to the chance of a particular event

• The notation is Pr(B/A), which is read as “the probability event B

• Let A and B be two events of a sample space S.

• The conditional probability of an event A given B denoted by

• Similarly, P(B/A) = P(A n B) / P(A) , P(A) ≠ 0.

• This can be taken as an alternative form of the multiplicative law.

Life time frequency Male (M) Female(F) total

Suppose you pick person at random what is the probability of this

P(M)=number of male /total number of subjects

What is the probability that a person picked at random

Suppose you pick person at random and find that he is

Suppose you pick person at random and find that

What is the probability that the person randomly selected will be

P(A)=probability to survive to age 25

 That is, a person aged 25 has a 68.4 percent chance of living to

+(T) TP (a) FP (b) TP+FP

The sensitivity of the test

+(T) TP (a) FP (b) TP+FP

The specificity of the test

Where P(D) is the rate of the disease which is

A medical research team wished to evaluate a proposed screening test for

Test Result Yes (D) ) No

positive(T) 436 5 441

Total 450 500 950

b) What is the false negative?

c) Compute the sensitivity of the symptom.

The predictive value negative of the screening is calculated as

• The occurrence or nonoccurrence of one event

• Thus, if events A and B are independent,

These are independent events.

Pr(A and B) = Pr(A) Pr(B)

 Pr(B) = Pr(A and B) / Pr(A) , where Pr(A) ≠ 0

1. Probabilities are real numbers on the interval from 0 to 1.

4. If two events are mutually exclusive ,then Pr(A or B) = Pr(A) + Pr(B).

5. If A and B are two events, not mutually exclusive , then

Pr( A or B)= Pr (A) +Pr (B) – Pr( A and B).

7. If A and B are two independent events, then Pr ( A and B) = Pr (A) Pr (B)

 When the values of a variable (height, weight, or age)

 E.g. The weight of a child to be born

 Random variables can be either discrete or

A discrete random variable: is able to assume