FBA1202 Statistics W10

The Normal Distribution and
Other Continuous Distributions

Goals
 Describe the characteristics of the normal
distribution
 Translate normal distribution problems into
standardized normal distribution problems
 Find probabilities using a normal distribution table
 Evaluate the normality assumption
 Recognize when to apply the uniform and
exponential distributions
Probability Distributions
Probability
Distributions
Discrete Continuous
Probability Probability
Distributions Distributions
Binomial Normal
Poisson Uniform
Hypergeometric Exponential
Continuous Probability Distributions
 A continuous random variable is a variable that
can assume any value on a continuum (can
assume an uncountable number of values)
 thickness of an item
 time required to complete a task
 temperature of a solution
 height, in inches
 These can potentially take on any value,

depending only on the ability to measure
accurately.
 The graphical form of the probability distribution for a
continuous random variable forms a smooth curve.
 This curve, which is a function of X, is called a

probability density function (pdf). It is denoted by
f(x).
 Two requirements must be satisfied by all
probability distributions for continuous random
variables.
 f(y) > 0 for all possible intervals of y.



 . f ( y ) dy  1

Definition
 X is a continuous random variable if there is a

function f(x) so that for any constants a and b,
Example
Calculation of the expected value
Discrete case:
E( X ) = ∑ x i p(x i )
all x
Continuous case:
E( X )  
all x
xi p(xi )dx
Example
Solution
Example
 The random variable X is given by the following

PDF.
 Check that this is a valid PDF and calculate the
mean of X.
Variance
Discrete case:
Var ( X )   2
 (x  )
all x
i
2
p(xi )
Continuous case:

Var ( X )     ( xi   ) p( xi )dx
2 2

The Normal Distribution
Probability
Distributions
Continuous
Probability
Distributions
Normal
Uniform
Exponential
 The normal distribution was introduced by the French mathematician Abraham
DeMoivre in 1733. DeMoivre, who used this distribution to approximate
probabilities connected with coin tossing, called it the exponential bell-shaped
curve. Its usefulness, however, became truly apparent only in 1809, when the
famous German mathematician Karl Friedrich Gauss (1777-1855) used it as an
integral part of his approach to predicting the location of astronomical entities. As
a result, it became common after this time to call it the Gaussian distribution.
 During the mid-to late 19th century, it became accepted that it was “normal” for
any well-behaved data set to follow this curve. As a result, following the lead of
the British statistician Karl Pearson, people began referring to the Gaussian
curve by calling it simply the normal curve.
Normal probability distribution
• The normal distribution is the most important of all the
distributions.
• Its graph is bell-shaped.
• psychology, business, economics, natural sciences, nursing,
mathematics
• marks on a test
• IQ scores
• real estate prices
• the height of a man
• blood pressure
• errors in measurements
Properties of the Normal Distribution
 It is bell-shaped.
 The normal curve is symmetrical about the mean μ.

 Mean, Median and Mode are Equal.

 Total area under the curve is equal to 1.

(continued)
 Location is
determined by the
mean, μ f(x)
 Spread is determined σ
by the standard
deviation, σ x
μ

Mean
The random variable = Median
= Mode
has an infinite
theoretical range:
+  to  
 The normal distribution has two parameters

(two numerical descriptive measures), the mean
() and the variance (2). If X is a quantity to be
measured that has a normal distribution with
mean () and the standard deviation (2), we
designate this by writing
 NORMAL:XN (, 2)
 Or XN (, )
Figure 5.7 Several normal distributions with different
means and Standard deviations
Parameters μ and σ
 Normal pdfs have two parameters
μ - expected value (mean “mu”)
σ - standard deviation (sigma)
μ controls location σ controls spread

Shape
f(X) Changing μ shifts the

distribution left or right.
Changing σ increases
or decreases the
σ spread.
μ X
Many Normal Distributions
By varying the parameters μ and σ, we obtain different

normal distributions.
The Normal Probability
Density Function
 The formula for the normal probability density

function is
1 (1/2)[(X μ)/σ] 2
f(X)  e
2π
Where e = the mathematical constant approximated by 2.71828

π = the mathematical constant approximated by 3.14159
μ = the population mean
σ = the population standard deviation
X = any value of the continuous variable
Cumulative Normal Distribution
 For a normal random variable X with mean μ and

variance σ2, i.e., X~N(μ,σ2), cumulative distribution
function is
F(x 0 )  P(X  x 0 )
f(x)
P(X  x 0 )
0 x0 x
Finding Normal Probabilities
Probability is the
Probability is measured by the area
area under the
curve! under the curve
f(X) P (a ≤ X ≤ b)
= P (a < X < b)
(Note that the
probability of any
individual value is zero)
a b X
(continued)
F(b)  P(X  b)
a μ b x
F(a)  P(X  a)
a μ b x
P(a  X  b)  F(b)  F(a)
a μ b x
Areas to the left and Right of x
Probability as
Area Under the Curve
The total area under the curve is 1.0, and the curve is
symmetric, so half is above the mean, half is below
f(X) P(   X  μ)  0.5
P(μ  X   )  0.5
0.5 0.5
μ X
P(   X   )  1.0
The Standardized Normal
 Any normal distribution (with any mean and
variance combination) can be transformed into the
standardized normal distribution (Z), with mean 0
and variance 1
f(Z)
Z ~ N(0,1) 1
Z
0
 Need to transform X units into Z units by subtracting the
mean of X and dividing by its standard deviation
X μ
Z
σ
 Any normal distribution (with any mean and

standard deviation combination) can be
transformed into the standardized normal
distribution (Z)
 Need to transform X units into Z units

Translation to the Standardized
Normal Distribution
 Translate from X to the standardized normal

(the “Z” distribution) by subtracting the mean
of X and dividing by its standard deviation:
X μ
Z
σ
Z always has mean = 0 and standard deviation = 1
Probability Density Function
 The formula for the standardized normal

probability density function is
1 (1/2)Z 2
f(Z)  e
2π
Where e = the mathematical constant approximated by 2.71828

π = the mathematical constant approximated by 3.14159
Z = any value of the standardized normal distribution
The Standardized
Normal Distribution
 Also known as the “Z” distribution

 Mean is 0
 Standard Deviation is 1
f(Z)
1
Z
0
Values above the mean have positive Z-values,
values below the mean have negative Z-values
Definition
The standard normal distribution is a special case of the
normal distribution.
Figure 5.8 Standart normal distribution
  0,   1
 a μ b μ
P(a  X  b)  P Z 
 σ σ 
f(x)  b μ  a μ
 F   F 
 σ   σ 
a µ b x
a μ b μ
0 Z
σ σ
Table 5.1
P(0  Z  z)
Example
 If X is distributed normally with mean of 100

and standard deviation of 50, the Z value for
X = 200 is
X  μ 200  100
Z   2.0
σ 50
 This says that X = 200 is two standard
deviations (2 increments of 50 units) above
the mean of 100.
Normal Distribution: Empirical
Rule for Any Normal Curve
≈ 68% of the observations fall within one standard deviation of the mean.
≈ 95% of the observations fall within two standard deviations of the mean.
≈ 99.7% of the observations fall within three standard deviations of the mean.
The probability equals approximately 0.68 within 1 standard deviation of the mean,
approximately 0.95 within 2 standard deviations, and approximately 0.997 within 3
standard deviations. Question: How do these probabilities relate to the empirical rule?
Normal Distribution: Empirical
Rule for Any Normal Curve
Comparing X and Z units
100 200 X (μ = 100, σ = 50)
0 2.0 Z (μ = 0, σ = 1)
Note that the distribution is the same, only the

scale has changed. We can express the problem in
original units (X) or in standardized units (Z)
The Standardized Normal Table
 The Table gives the probability F(a) for any value of a.
.9772
0 2.00 Z
P(Z < 2.00) = 0.5 + P(0<Z<2)

= 0.5+0.4772
= 0.9772
 The Standardized Normal table gives the

probability less than a desired value for Z
(i.e., from negative infinity to Z)
.5 .4772
Example:
P(Z < 2.00) = .9772
0 2.00 Z
(continued)
The column gives the value of

Z to the second decimal point
Z 0.00 0.01 0.02 …
The row shows 0.0

the value of Z 0.1
. The value within the
to the first .
decimal point . table gives the
2.0 .4772 probability from Z = 0 up
to the desired Z value
2.0
P(Z < 2.00) = .5+.4772=.9772
Looking up probabilities in the
standard normal table
What is the area to

the left of Z=1.51 in a
standard normal
curve?
Area is 93.45%
Z=1.51
Z=1.51
(continued)
 For negative Z-values, use the fact that the

distribution is symmetric to find the needed
probability:
.9772
.0228
Example:
0 2.00 Z
P(Z < -2.00) = 1 – 0.9772
= 0.0228 .9772
.0228
-2.00 0 Z
General Procedure for
Finding Probabilities
To find P(a < X < b) when X is

distributed normally:
 Draw the normal curve for the problem in

terms of X
 Translate X-values to Z-values
 Use the Standardized Normal Table

 Suppose X is normal with mean 8.0 and

standard deviation 5.0
 Find P(X < 8.6)
X
8.0
8.6
(continued)
standard deviation 5.0. Find P(X < 8.6)
X  μ 8.6  8.0
Z   0.12
σ 5.0
μ=8 μ=0
σ = 10 σ=1
8 8.6 X 0 0.12 Z
P(X < 8.6) P(Z < 0.12)

Solution: Finding P(Z < 0.12)
Standardized Normal Probability P(X < 8.6)

Table (Portion) = P(Z < 0.12)
Z .00 .01 .02 =.5+.0478
0.0 .0000 .0040 .0080 =.5478
0.1 .0398 .0438 .0478

0.2 .0793 .0832 .0871
Z
0.00
0.3 .1179 .1217 .1255
0.12
Upper Tail Probabilities

standard deviation 5.0.
 Now Find P(X > 8.6)
X
8.0
8.6
Upper Tail Probabilities
(continued)
 Now Find P(X > 8.6)…

P(X > 8.6) = P(Z > 0.12) = 1.0 - P(Z ≤ 0.12)
= 1.0 - .5478 = .4522
.5478
1.000 1.0 - .5478
= .4522
Z Z
0 0
0.12 0.12
Probability Between
Two Values

standard deviation 5.0. Find P(8 < X < 8.6)
Calculate Z-values:
X μ 8 8
Z  0
σ 5
8 8.6 X
X  μ 8.6  8 0 0.12 Z
Z   0.12
σ 5 P(8 < X < 8.6)
= P(0 < Z < 0.12)
Solution: Finding P(0 < Z < 0.12)
Standardized Normal Probability P(8 < X < 8.6)

Table (Portion) = P(0 < Z < 0.12)
=.0478
Z .00 .01 .02
0.0 .0000 .0040 .0080 .0478
0.1 .0398 .0438 .0478

0.2 .0793 .0832 .0871
0.3 .1179 .1217 .1255 Z

0.00
0.12
Probabilities in the Lower Tail

 Now Find P(7.4 < X < 8)
X
8.0
7.4
Probabilities in the Lower Tail
(continued)
Now Find P(7.4 < X < 8)…

P(7.4 < X < 8)
= P(-0.12 < Z < 0) .0478
= .0478
The Normal distribution is

symmetric, so this probability
is the same as P(0 < Z < 0.12)
7.4 8.0 X
-0.12 0 Z
Areas under the standard normal curve
A1 = A2 = .4082
P(-1.33 < z < 1.33)=P(-1.33 < z < 0) + P( 0< z < 1.33)=A1 + A2
= .4082*2=.8164
Find the probability that a standard normal random
variable exceeds 1.96 in absolute value.
P(|z|> 1.96)=P(z<-1.96 or z>1.96)
Since the area between z=0 and z=1.96 is .4750,

P(|z|> 1.96)=A1 + A2 = .0250 + .0250=.05
Example 5.10: Find the value of z0 such that 95% of the standard
normal Z values lie between - z0 and + z0; that is, find P(- z0 ≤ Z ≤ z0) =
0.95.
This means that the area on each side of the mean will be equal to 0.95 / 2=0.475.
The area between z=0 and z0 is 0.475. We look up 0.475 in the body of Table IV to
find the value z0 = 1.96.
It means 95% of a normal distribution lies between

+1.96 and -1.96 standard deviations of the mean.
Finding the X value for a
Known Probability
 Steps to find the X value for a known

probability:
1. Find the Z value for the known probability
2. Convert to X units using the formula:
X  μ  Zσ
Finding the X value for a
Known Probability
(continued)
Example:
 Now find the X value so that only 20% of all
values are below this X
.2000
? 8.0 X
? 0 Z
Find the Z value for
20% in the Lower Tail
1. Find the Z value for the known probability

Standardized Normal Probability  20% area in the lower
Table (Portion) tail is consistent with a
Z … .03 .04 .05 Z value of -0.84
0.7 … .2673 .2704 .2734

.2000
0.8 … .2967 .2995 .3023
0.9 … .3238 .3264 .3289
? 8.0 X
-0.84 0 Z
Finding the X value
2. Convert to X units using the formula:
X  μ  Zσ
 8.0  ( 0.84 )5.0
 3.80
So 20% of the values from a distribution

with mean 8.0 and standard deviation
5.0 are less than 3.80
Question: What gestational length is smaller than
97.5% of gestations?
 Let X represent gestations length
 We know from prior research that
X ~ N(39, 2)
 A value that is smaller than .975 of gestations
has a cumulative probability of.025
z - percentile
Less than 97.5%
(right tail) = greater
than 2.5% (left tail)
z lookup:
z.025 = −1.96
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
–1.9 .0287 .0281 .0274 .0268 .0262 .0256 .0250 .0244 .0239 .0233
Unstandardize and sketch
x    z p  39  (1.96)(2)  35
The 2.5th percentile is 35 weeks

Practice problem
If birth weights in a population are normally

distributed with a mean of 109 oz and a standard
deviation of 13 oz,
a. What is the chance of obtaining a birth weight of
141 oz or heavier when sampling birth records at
random?
b. What is the chance of obtaining a birth weight of
120 or lighter?
Answer
a. What is the chance of obtaining a birth weight of

141 oz or heavier when sampling birth records at
random?
141  109
Z  2.46
13
From the chart or SAS  Z of 2.46 corresponds to a right tail (greater

than) area of: P(Z≥2.46) = 1-(.9931)= .0069 or .69 %
Answer
b. What is the chance of obtaining a birth weight of

120 or lighter?
120  109
Z  .85
13
From the chart or SAS  Z of .85 corresponds to a left tail area of:
P(Z≤.85) = .8023= 80.23%
Example 5.11: College Entrance Exam Application
 Suppose the scores X on a college entrance

examination are normally distributed with a mean of 550
and a standard deviation of 100.
 A certain prestigious university will consider for
admission only those applicants whose scores exceed
the 90th percentile of the distribution.
 Find the mininmum score an applicant must achieve in
order to receive consideration for admission to the
university.
Example 5.11: College Entrance Exam Application
 Solution: P(X ≤ x) = 0.90

 P(X ≤ x) = P(Z ≤ (x-µ)/σ)
=P(Z ≤ (x-550)/100)=0.90
z= (x-550)/100=1.28
X=550+1.28(100)=550+128=678
• The 90th percentile of the test score distribution

is 678.
Figure 5.18 Area under the normal curve
for Example 5.11
That is to say, an applicant must score at least 678 on the

entrance exam to receive consideration for admission by
the university.
Exercise 1
 A radar unit is used to measure speeds of cars

on a motorway.
 The speeds are normally distributed with a
mean of 90 km/hr and a standard deviation of
10 km/hr.
 What is the probability that a car picked at
random is travelling at more than 100 km/hr?
Exercise 2
 For a certain type of computers, the length of

time between charges of the battery is normally
distributed with a mean of 50 hours and a
standard deviation of 15 hours.
 John owns one of these computers and wants
to know the probability that the length of time
will be between 50 and 70 hours.
Exercise 3
 Entry to a certain University is determined by a
national test.
 The scores on this test are normally distributed
with a mean of 500 and a standard deviation of
100.
 Tom wants to be admitted to this university and
he knows that he must score better than at least
70% of the students who took the test.
 Tom takes the test and scores 585.
 Will he be admitted to this university?
Exercise 4
 The annual salaries of employees in a large

company are approximately normally distributed
with a mean of $50,000 and a standard
deviation of $20,000.
 What percent of people earn between $45,000
and $65,000?
Exercise 5
 The time taken to assemble a car in a certain

plant is a random variable having a normal
distribution of 20 hours and a standard
deviation of 2 hours.
 What is the probability that a car can be
assembled at this plant in a period of time
between 20 and 22 hours?
Exercise 6
 Let’s suppose that the income of families are

normally distributed with N($25000, $100002).
 If the poverty level is $10,000, what percentage
of the population lives in poverty?
Problems
1. If X is a random variable with a distribution of N(μ, σ), find:

p(µ−3σ ≤ X ≤ µ+3σ)
2. Calculate the value of a in a normal distribution with a mean of 4 and a

standard deviation of 2 for which:
P(4−a ≤ x ≤ 4+a) = 0.5934
3. In a city, it is estimated that the maximum temperature in June is normally

distributed with a mean of 23º and a standard deviation of 5°. Calculate the
number of days in this month in which it is expected to reach a maximum of
between 21° and 27°.
Answer 1:
 If X is a random variable with a distribution of

N(μ, σ), find:
p(µ−3σ ≤ X ≤ µ+3σ)
 Approximately 99.74% of the X values are less
than three standard deviations from the mean.
Answer 2:
 Calculate the value of a in a normal distribution

with a mean of 4 and a standard deviation of 2
for which:
 P(4−a ≤ x ≤ 4+a) = 0.5934
Answer 3
4. The mean weight of 500 college students is 70 kg and the standard
deviation is 3 kg. Assuming that the weight is normally distributed,
determine how many students weigh:
a. Between 60 kg and 75 kg.
b.More than 90 kg.
c.Less than 64 kg.
d.Exactly 64 kg.
e.64 kg or less.
5. It is assumed that the test results for a class follow a normal distribution
with a mean of 78 and a standard deviation of 36.
a. What is the probability of a student obtaining the a grade higher than
72?
b.Calculate the percentage of students who have scores that are at least
five points higher than the score that marks the border between pass and
fail (a failing grade is defined as 25% of the tests with the lowest scores) .
c. If you know that a student's grade is greater than 72, what is the
probability that it is greater than 84?
Answer 4
Answer 5
6. An IQ test shows that the scores follow a distribution of N(65, 18). It is
desirable to arrange the participants into three groups (low intelligence,
average intelligence and high intelligence) so that 20% of the partipants
are filled under the first group, 65% in the second and 15% in the third.
What are the scores that differentiate each intelligence group from one
another?
7. Several intelligence tests follow a normal distribution with a mean of 100

and a standard deviation of 15.
a. Determine the percentage of the population that would obtain a score
between 95 and 110.
b. What interval centered at a score of 100 contains 50% of the
population?
c. For a population of 2,500, how many are expected to have a score
above 125?
Answer 6
Answer 7

FBA1202 Statistics W10

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

FBA1202 Statistics W10

Uploaded by

Copyright:

Available Formats

The Normal Distribution and

Other Continuous Distributions

 These can potentially take on any value,

 This curve, which is a function of X, is called a

 f(y) > 0 for all possible intervals of y.

 X is a continuous random variable if there is a

 The random variable X is given by the following

 The normal curve is symmetrical about the mean μ.

 Mean, Median and Mode are Equal.

 Total area under the curve is equal to 1.

 The normal distribution has two parameters

μ controls location σ controls spread

f(X) Changing μ shifts the

By varying the parameters μ and σ, we obtain different

 The formula for the normal probability density

Where e = the mathematical constant approximated by 2.71828

 For a normal random variable X with mean μ and

P(a  X  b)  F(b)  F(a)

 Any normal distribution (with any mean and

 Need to transform X units into Z units

 Translate from X to the standardized normal

 The formula for the standardized normal

Where e = the mathematical constant approximated by 2.71828

 Also known as the “Z” distribution

 If X is distributed normally with mean of 100

100 200 X (μ = 100, σ = 50)

Note that the distribution is the same, only the

 The Table gives the probability F(a) for any value of a.

P(Z < 2.00) = 0.5 + P(0<Z<2)

 The Standardized Normal table gives the

The column gives the value of

The row shows 0.0

What is the area to

 For negative Z-values, use the fact that the

To find P(a < X < b) when X is

 Draw the normal curve for the problem in

 Translate X-values to Z-values

 Use the Standardized Normal Table

 Suppose X is normal with mean 8.0 and

P(X < 8.6) P(Z < 0.12)

Standardized Normal Probability P(X < 8.6)

0.1 .0398 .0438 .0478

 Suppose X is normal with mean 8.0 and

 Now Find P(X > 8.6)…

 Suppose X is normal with mean 8.0 and

Standardized Normal Probability P(8 < X < 8.6)

0.1 .0398 .0438 .0478

0.3 .1179 .1217 .1255 Z

 Suppose X is normal with mean 8.0 and

Now Find P(7.4 < X < 8)…

The Normal distribution is

P(|z|> 1.96)=P(z<-1.96 or z>1.96)

Since the area between z=0 and z=1.96 is .4750,

It means 95% of a normal distribution lies between

 Steps to find the X value for a known

1. Find the Z value for the known probability

0.7 … .2673 .2704 .2734

2. Convert to X units using the formula:

 8.0  ( 0.84 )5.0

So 20% of the values from a distribution

The 2.5th percentile is 35 weeks

If birth weights in a population are normally

a. What is the chance of obtaining a birth weight of

From the chart or SAS  Z of 2.46 corresponds to a right tail (greater