Professional Documents
Culture Documents
BY: Haymanot N.
1
Learning out comes
• At the end of this chapter the student is
able to:
– Explain the concepts and characteristics of
probabilities
– Compute probabilities of events and conditional
probabilities
– Discuss different rules in probability
– Explain characteristics of probability distribution
– Differentiate between the different distributions
– Understand the concepts and uses of the standard
normal distribution
2
Introduction
Probability is a measure of how likely it is for an event to
happen.
Probability is the language of chance.
1.Classical probability
Objective probability
2.Relative frequency
3.Subjective probability
5
Definition:
6
Definition cont….
2. Relative frequency probability:
• If a process is repeated a large number of times (n),
and if an event with the characteristic E occurs m
times, the relative frequency of E is approximately
the probability of E.
Probability of E = P(E) = m/n
9
3. Subjective probability:
A subjective probability is an individual’s degree of belief in the
occurrence of an event..
Example:
If some one says that he is 95% certain that a cure for AIDS will
be discovered within 5 years, then he means that Pr(discovery of
cure of AIDS within 5 years) = 95%.
10
Mutually exclusive events and the additive law:
• Two events A and B are mutually exclusive if they have no
elements in common.
• If A and B are outcomes of an experiment they cannot both
happen at the same time.
11
A = {1,3,5} B = {1,2,3 } C= {2}
Answers:
A and B are not mutually exclusive. Because they have the
elements 1 and 3 in common
similarly, B and C are not mutually exclusive. They have the
element 2 in common.
A and C are mutually exclusive. They don’t have any element
in common
12
Thus, if A and B are mutually exclusive events,
Pr(A or B) = Pr (A) + Pr(B).
When two events A and B are not mutually exclusive,
Pr(A or B) = Pr (A) + Pr(B) – Pr(A and B).
From the previous example One die is rolled. Sample space = S = {1,2,3,4,5,6}
•Let A = the event an odd number turns up, A = {1,3,5}
•Let B = the event 1,2 or 3 turns up; B = {1,2,3 }
•Let C = the event 2 turns up, C= {2}
Find Pr (A), Pr (B) and Pr (C)
what is the
Pr (A or C)?
Pr (A or B)?
13
Answers
Pr(A) = Pr(1) + Pr(3) + Pr(5) = 1/6+1/6+ 1/6 = 3/6 = 1/2
Pr ( C ) = Pr(2) = 1/6
Pr (A or C)?
Pr (A or B)?
14
Example
• The weight of new born infants was found
from a large hospital is classified as follows
Weight of new Probability
born
1500- 2500g 0.201
2501-3500g 0.233
3501- 4500g 0.321
4501- 5500g 0.245
15
What is the probability that a randomly selected
newborn from this hospital has a weight less than or
equal to 3500g? Weight of new Probabilit
born y
P(< 3500g)=
1500- 2500g 0.201
P(1500-2500g)+P(2501-3500g)
2501-3500g 0.233
=0.201 +0.233 = 0.434
3501- 4500g 0.321
16
Conditional probabilities and the multiplicative law:
yes 80 40 120
No 10 30 40
total 90 70 160
• What is probability that in this group a child with
Typhoid will have high grade fever also is:
• PROB(High grade Fever | Typhoid) = 80/90 = 0.88
• the proportion of children with typhoid who have high
grade fever also.
23
Example 2(cont…
High grade Typhoid total
fever yes no
yes 80 40 120
No 10 30 40
total 90 70 160
• What is the probability that a child with high grade
fever will have Typhoid.
• PROB(typhoid| High grade fever) = 80/120 = 0.66
• the proportion of children with High grade fever who
have Typhoid also.
24
Example 3
Suppose in country X the chance that an infant lives to age 25 is .95,
whereas the chance that he lives to age 65 is .65
For the later it is understood that to survive to age 65 means to
survive both from birth to age 25 and from age 25 to 65.
What is the chance that a person 25 years of age survives to age
65?
Pr[B] Pr[B]
Pr[B/A] = Pr[A n B] = Pr[A/B] Pr[B] provided Pr[A] not =
0
Pr[A] Pr[A]
27
DISEASE STATUS
Test (Gold standard)
result PRESENT ABSENT
(D) ( D)
28
DISEASE STATUS
Test (Gold standard)
result PRESENT ABSENT
(D) ( D )
29
.
The predictive value positive of the test
This is the probability that a subject has the
disease given that the subject has a positive
screening test result
It is calculated using Bayes Theorem through the
following formula
P (T | D) P ( D )
P( D | T )
P (T | D ) P ( D ) P(T | D) P ( D)
30
The predictive value negative of the test
This is the probability that a subject does not have the disease
given that the subject has a negative screening test result
It is calculated using Bayes Theorem through the following
formula
P (T | D) P ( D)
P( D | T )
P(T | D) P( D) P(T | D) P( D)
where,
p(T | D) 1 P(T | D)
31
Example
) (Negativ
T 14 495 509
32
In the context of this example
a)What is a false positive?
A false positive is when the test indicates a positive result (T) when
the person does not have the disease D
34
Independent Events
36
Example
In certain high school class ,consisting of 60 girls and
40 boys ,it is observed that 24 girls and 16 boys wear
eye glasses. If a student is picked at random from this
class the probability that the student wears eyeglass
P(E) ,is 40/100 or 0.4.what is the probability that a
student picked at random wears eye glasses, given
that the student is a boy?
Ans
• P(E/B)=p(E n B)/P(B)=16/100/40/100=0.4
• P(E)=P(E/B)
37
Example cont…
• What is the probabillity that a student
picked at random is a boy given that the
student wears eye glasses?
Ans
• P(B/E)=p(E n B)/P(E)=16/100/40/100=0.4
• P(B)=P(B/E)
38
Properties of probability
2. If an event is certain to occur, its probability is 1, and if the event is certain not
to occur, its probability is 0.
3. The sum of the probabilities of all mutually exclusive outcomes is equal to
P(E1) + P(E2 ) + .... + P(En ) = 1
6. The sum of the probabilities that an event will occur and that it will not occur is
equal to 1; hence, P(A’) = 1 – P(A)
40
Probability distribution
41
Random Variable
Random Variable = is a numerical description of the
outcome of a statistical experiment.
42
Random variable(cont….
43
Probability distribution(cont….
• A probability distribution is a device used to
describe the behaviour that a random variable may
have by applying the theory of probability.
• The term probability distribution refers to the collection of all
possible outcomes along with their probabilities.
45
Probability distribution(cont…
Example:
The following table shows the number of non
prescription drugs used during pregnancy among
women who delivered at some hospital X.
Number of Frequency
drugs
0 220
1 200
2 140
3 55
4 50
5 20
6 10
Total 695
46
Probability distribution(cont…
• The probability distribution for the number of non
prescription drugs used during pregnancy is as
follows.
Number of Frequency Number of P(x)
drugs drugs(X)
0 220 0 220/695 =0.317
1 200 1 200/695= 0.288
2 140 2 0.201
3 55 3 0.079
4 50 4 0.072
5 20 5 0.029
6 10 6 0.014
Total 695 total 1
47
Probability distribution(cont…
• The probability distribution can also be displayed in
graphs. The above probability distribution on the
number of non prescribed drugs can be represented as
follows.
48
Examples
What is the probability that a randomly selected woman
will be one who used 2 non-prescription drug?
Solution: P(x=2) =? Number P(x)
of
P(x=2) = 0.201 drugs(X)
0 0.317
What is the probability that a 1 0.288
randomly selected woman is one 2 0.201
who used either one or two drugs? 3 0.079
4 0.072
P(1U2) = P(1) +P(2) 5 0.029
6 0.014
=0.288 + 0.201= 0.489
total 1
49
Exercise
What is the probability
of using 2 or more
Number P(x)
drugs? of
drugs(X)
0 0.317
P(X > 2)=? 1 0.288
2 0.201
P(X > 2)
3 0.079
=P(x=2)+P(X=3)+P(x=4)+P(x=5)
4 0.072
+P(X=6)
5 0.029
= 0.201+ 0.079+ 0.072 +0.029 +0.014 6 0.014
=0.395 total 1
50
The Cumulative Probability Distribution
51
Probability distribution(cont…
52
Permutations and Combinations
Permutations
Combination
It represents the number of ways of selecting k
objects out of n where the order of selection does
not matter.
The number of combinations of n things taken k at
a time is
æn ö n!
æn ö n(n - 1)´ ...´ (n - k +1) n k ç ÷
C = =
n Ck =ç ÷=
OR èk ø k!(n - k)!
èk ø k!
54
The binomial distribution
• A process that has only two possible outcomes
• Is based on Bernoulli trial, James Bernouli (1654 –
1705).
– When a single trial of an experiment can result in only one of
two mutually exclusive outcomes (success or failure; dead or
alive; sick or well, male/female, etc)
57
Example
Suppose that in a certain population 52% of all recorded births are
males. If we select randomly 5 birth records What is the probability
that exactly 3 will be males?
Let the five selected persons be in the following order
58
• We can also have the following sequences with
probabilities q2p3 for each
Given n= 10 p=0.52
required Pr(X=5)=?
• Pr(X=x) = n! p x (1- p)n-x
x!(n-x)!
Pr(X=5) = 10! 0.52 5 (1- 0.52)10-5 =0.24
5!(10-5)!
61
3 or more will be females?
=1-[0.001+0.013+0.111]= 1-0.125=0.875
62
Mean and variance
63
Mean and variance(cont…
• The variance of a random variable is the average
squared difference from the mean.
• Variance =(0-1)2x1/4+(1-1)2x1/2+(2-1)2x1/4
=1/2
64
Mean and variance(cont…
• We can now find the mean and variance of the
Binomial distribution with parameters n and p. First
consider n = 1. Then the probability distribution is:
Value probability
0 1-p
1 p
65
Binomial distribution…
• Now, a variable from the Binomial distribution with
parameters n and p is the sum of n independent
variables from the Binomial distribution with
parameters n and p.
66
Exercise
Suppose that in a certain malarious area past experience
indicates that the probability of a person with a high fever
will be positive for malaria is 0.7. Consider 3 randomly
selected patients (with high fever) in that same area.
1)What is the probability that no patient will be positive ?
2)What is the probability that exactly 1 patient will be
positive?
3) What is the probability that exactly 2 of the patients will
be positive?
4) What is the probability that all 3 patients will be positive?
5) Find the mean and the SD of the probability distribution
given above.
67
Answer
Given n= 3 p=0.7
69
The Poisson Distribution
70
The Poisson Distribution(cont..
71
The Poisson Distribution(cont..
• Three assumptions must be met for a Poisson
distribution to apply:
1. The probability that a single event occurs within a
given small subinterval is proportional to the length of
the subinterval
Example
The daily number of new registrations of cancer
is 2.2 on average. What is the probability of
a) Getting no new cases
b) Getting 1 case
c) Getting 2 cases
d) Getting 3 cases
e) Getting 4 cases
74
The Poisson Distribution(cont..
Answer
A) Given X=0 ,λ= 2.2 and t=1
P(X=0)= =0.111
B) P(X=1) = 0.244
C) P(X=2) = 0.268
D)(X=3) = 0.197
E) P(X=4) = 0.108
75
Poisson distribution
76
Exercise
The monthly suicide in country x between 1977
and 1987 followed a Poisson distribution with
parameter λ= 2.75.what is the probability that a
randomly selected month will be one in which
three adolescent suicides occurred.(e=2.72)
77
Probability distribution of continuous variables
E.G Suppose, X represents the continuous variable ‘Height’; X can
assume an infinite number of intermediate values between 170 and 171
cm 170.1, 170.2, 170.3 etc.
rarely is an individual exactly equal to 170cm tall
Because a continuous random variable X can take on an infinite
number of values, the probability associated with any particular value is
almost equal to zero.
78
The Normal Distribution
79
The Normal Distribution(cont…
Characteristics of the normal distribution:
2
11xx 2
11
22
ff ((xx)) ee for
for xx
22
7183 33..1416
ee 22..7183 1416
andand arearethe
thepopulation
populationmean
meanand
andstandard
standarddeviation.
deviation.
f(x) represents the height of the curve at point x .
81
The Normal Distribution(cont….)
• For continuous random variable probabilities are
areas under the curve.
• So for normal distribution the area between two
points represents the probabilities.
• To find P(c < x < d), we need to find the area
under the appropriate normal curve.
1 x μ 2
P(c < X < d) = d 1
e 2 σ dx
c 2 πσ
83
The standard normal distribution
xx
zz
84
The standard normal distribution
Answer: Z= ( 80-70) / 10 = 1
How many standard deviation units above the mean is a score of
83?
85
The Standard Normal (z)
Distribution
86
Using normal table
P(z1.36)
P(z 1.36)==.9131
.9131
P(z >1.36) =?
P(z
P(z>1.36)
>1.36)=0.0869
=0.0869 or
or
==11--0.913
0.913==.0869
.0869
88
• Find P(Z< -1.20) =? P(Z>1.20) =? P(Z< 1.20) =?
P(z> -1.20) =?
• P(Z< -1.20) =?
P(Z< -1.20)= 0.1151
• P(Z> 1.20) =?
P(Z> 1.20) =0.1151
• P(Z< 1.20) =?
P(Z< 1.20) = 1-0.1151=0.8849
• P(z> -1.20) =?
P(Z> - 1.20) = 1-0.1151=0.8849
89
Example(cont…
90
Example
PP((xx?) ?)..0101
??11
PP((zz ))..01
01
..11
From
FromTableTable zzvalue valuefor
for0.01
0.01isis2.33
2.33
??11
22..3333
..11
??22..33
33(.(.11))1111..233
233
92
The standard normal distribution
From the symmetry properties of the stated normal distribution,
P(Z ≤ -x) = P(Z ≥ x) = 1– P(z ≤ x)
Hence, P(-1 < Z < +1) = 0.6827
93
Fig. Percentage of area under a normal distribution with mean μ and
standard deviation σ
For any normal distribution
about 68% (most) of the observations is contained within one SD of the mean.
about 95% (majority) of the probability is contained within two SDs
and 99% (almost all) within three SDs of the mean. 94
Other Distributions
1. Student t-distribution
2. F- Distribution
3. Chi square Distribution
4. Uniform distribution
5. Exponential distribution
95
Exercise
Assume that among diabetics the fasting blood
level of glucose is approximately normally
distribute with a mean of 105 mg per 100 ml
and SD of 9 mg per 100 ml.
A)What proportions of diabetics have levels
between 90 and 125 mg per 100 ml?
B)What proportions of diabetics have levels
below 87.4 mg per 100 ml?
C) What level cuts of the lower 10% of diabetics?
96
Answer
A)P(90<X<125)
=P(90-105/9<X-µ/ <125-105/9)
= P(-1.67<Z<2.22)
=0.4525 +0.4868=0.9393
B)P(X<87.4)
=p(Z<87.4-105/9)=p(z<-1.96)=0.5-0.4750
=0.025
C)P(X<?)=0.1 P(Z<?-105/9)=0.1
from the table Z=-1.28
-1.28=x-105/9 X=93.48
97
Exercise
• Suppose the height of adult female is normally
distributed with a mean of 162.2 cm and a
standard deviation of 6.8cm.,what is the
probability of a randomly selected female is
taller than 170 cm .
• Ans
A)P(X>170)
=p(Z>170-162.2/6.8)=p(z>1.15)=0.1251
98
Exercise
– Above 1.96?
– Below –1.96?
99
100