Professional Documents
Culture Documents
Normal Distribution:: - Probability - Characteristics and Application of Normal Probability Curve - Sampling Error
Normal Distribution:: - Probability - Characteristics and Application of Normal Probability Curve - Sampling Error
• Probability
• Characteristics and application of normal
probability curve
• Sampling error.
1
Probability
Probability theory developed from the study of games of
chance like dice and cards. A process like flipping a coin,
rolling a die or drawing a card from a deck is called a
probability experiment. An outcome is a specific result of
a single trial of a probability experiment.
2
Probability distributions
Probability theory is the foundation for statistical inference. A probability
distribution is a device for indicating the values that a random variable may
have.
There are two categories of random variables. These are:
discrete random variables, and
continuous random variables.
Discrete Random Variables:
The probability distribution of a discrete random variable specifies
all possible values of a discrete random variable along with their
respective probabilities
3
Examples can be
Frequency distribution
Probability distribution (relative frequency distribution)
Cumulative frequency
Examples of discrete probability distributions are the binomial
distribution and the Poisson distribution.
4
Binomial Distribution
6
The Poisson distribution is based on the Poisson process.
1. The occurrences of the events are independent in an interval.
2. An infinite number of occurrences of the event are possible in
the interval.
3. The probability of a single event in the interval is proportional to
the length of the interval.
4. In an infinitely small portion of the interval, the probability of
more than one occurrence of the event is negligible.
7
Continuous variable
A continuous variable can assume any value within a specified
interval of values assumed by the variable. In a general case, with
a large number of class intervals, the frequency polygon begins to
resemble a smooth curve.
A continuous probability distribution is a probability density
function.
The area under the smooth curve is equal to 1 and the frequency
of occurrence of values between any two points equals the total
area under the curve between the two points and the x-axis.
8
A continuous probability distribution is a probability density
function.
The area under the smooth curve is equal to 1 and the frequency of
occurrence of values between any two points equals the total area
under the curve between the two points and the x-axis.
The normal distribution is the most important distribution in
biostatistics. It is frequently called the Gaussian distribution.
The two parameters of the normal distribution are the mean (m) and
the standard deviation (s).
The graph has a familiar bell-shaped curve.
9
The Normal distribution is also known as the Gaussian Distribution
and the curve is also known as the Gaussian Curve, named after
German Mathematician-Astronomer Carl Frederich Gauss.
What is Normal (Gaussian) Distribution?
The normal distribution is a descriptive model that describes real
world situations.
It is defined as a continuous frequency distribution of infinite range
(can take any values not just integers as in the case of binomial and
Poisson distribution).
This is the most important probability distribution in statistics and
important tool in analysis of epidemiological data and management
science. 10
Characteristics of Normal Distribution
It links frequency distribution to probability distribution
Has a Bell Shape Curve and is Symmetric
It is Symmetric around the mean:
Two halves of the curve are the same (mirror images)
Hence Mean = Median
The total area under the curve is 1 (or 100%)
Normal Distribution has the same shape as Standard Normal
Distribution.
In a Standard Normal Distribution:
The mean (μ ) = 0 and Standard deviation (σ) =1 11
Z Score (Standard Score)3
Z = X – μ/ σ
Z indicates how many standard deviations away from the
mean the point x lies.
Z score is calculated to 2 decimal places.
The relationship between the normal variable X and Z score is
34.1 34.1
13.6 13.6
14
• The mean-mode-median is in the center.
• It is the mean because it is the ARITHMETIC average of
all the scores.
• It is the mode because of all the scores the mean score
happens MOST often.
• It is the median because when the scores are displayed
from lowest to highest, the mean is the MIDDLE score,
the median.
• The EXPECTED value is the mean.
15
The Data Is Centered About the Mean-Mode-Median
• The frequency curve is bell shaped. The bell shape has perfect
bilateral symmetry - the left balances exactly with the right.
• The score at -2 is balanced by a score at +2 and the frequencies from 0
to +2 and from 0 to -2 are equal.
• The area under the curve from 0 to +2 is exactly the same as the area
under the curve from 0 to -2.
16
The bell curve which represents a normal distribution of data
shows what standard deviation represents.
17
Diagram of Normal Distribution Curve
(z distribution)
13.6% 33.35%
2.2%
0.15
Modified from Dawson-Saunders, B & Trapp, RG. Basic and Clinical Biostatistics,
2nd edition, 1994.
-3 -2 -1 μ 1 2 3
18
Distinguishing Features
19
Positive Skewness (Tail to Right)
20
Negative Skewness (Tail to Left)
21