Professional Documents
Culture Documents
4 Variables and
Probability Distributions
Stat 4570/5570
Material from Devore’s book (Ed 8) – Chapter 4 - and Cengage
Continuous r.v.
A random variable X is continuous if possible values
comprise either a single interval on the number line or a
union of disjoint intervals.
2
Continuous r.v.
In principle variables such as height, weight, and
temperature are continuous, in practice the limitations of
our measuring instruments restrict us to a discrete (though
sometimes very finely subdivided) world.
3
Probability Distributions for Continuous Variables
5
Probability Distributions for Continuous Variables
6
Probability Distributions for Continuous Variables
7
Probability Distributions for Continuous Variables
Definition
Let X be a continuous rv. Then a probability distribution
or probability density function (pdf) of X is a function f (x)
such that for any two numbers a and b with a ≤ b,
P (a ≤ X ≤ b) =
8
Probability Distributions for Continuous Variables
9
Probability Distributions for Continuous Variables
10
Example
Consider the reference line connecting the valve stem on a
tire to the center point.
11
Example, cont cont’d
Figure 4.3
12
Example , cont cont’d
Clearly f(x) ≥ 0. How can we show that the area of this pdf
is equal to 1?
13
Probability Distributions for Continuous Variables
Definition
A continuous rv X is said to have a uniform distribution
on the interval [A, B] if the pdf of X is
14
Example
“Time headway” in traffic flow is the elapsed time between
the time that one car finishes passing a fixed point and the
instant that the next car begins to pass that point.
Figure 4.4
16
Example , cont cont’d
e–kx dx = (1/k)e–k a.
17
The Cumulative Distribution
Function
18
The Cumulative Distribution Function
The cumulative distribution function F(x) for a
continuous rv X is defined for every number x by
F(x) = P(X ≤ x) =
For each x, F(x) is the area under the density curve to the
left of x. This is illustrated in Figure 4.5, where F(x)
increases smoothly as x increases.
Figure 4.6
20
Example , cont cont’d
21
Example , cont cont’d
Figure 4.7
22
Using F (x) to Compute Probabilities
23
Percentiles of a Continuous Distribution
When we say that an individual’s test score was at the 85th
percentile of the population, we mean that 85% of all
population scores were below that score and 15% were
above.
24
Percentiles of a Continuous Distribution
Proposition
Let p be a number between 0 and 1. The (100p)th
percentile of the distribution of a continuous rv X, denoted
by η(p), is defined by
p = F(η(p)) = f(y) dy
25
Percentiles of a Continuous Distribution
Thus η(.75), the 75th percentile, is such that the area under
the graph of f(x) to the left of η(.75) is .75.
26
Example 9
The distribution of the amount of gravel (in tons) sold by a
particular construction supply company in a given week is a
continuous rv X with pdf
What is the cdf of sales for any x? How do you use this to
find the probability that X is less than .25? What about the
probability that X is greater than .75? What about
P(.25 < X < .75)?
27
Example 9 cont’d
Figure 4.11
28
Example 9 cont’d
29
Percentiles of a Continuous Distribution
Definition
The median of a continuous distribution, denoted by , is
the 50th percentile, so satisfies .5 = F( ) That is, half the
area under the density curve is to the left of and half is to
the right of .
30
Expected Values
Definition
The expected or mean value of a continuous rv X with
the pdf f (x) is:
µ x = E(X) = x f(x) dx
31
Example, cont
Back to the gravel example, the pdf of the amount of
weekly gravel sales X is:
32
Expected Values of functions of r.v.
33
Variance
The variance of a continuous random variable X with pdf
f(x) and mean value µ is
35
The Normal
4.3 distribution
37
The Normal Distribution
Definition
A continuous rv X is said to have a normal distribution
with parameters µ and σ (or µ and σ2), where <µ<
and 0 < σ, if the pdf of X is
38
The Normal Distribution
The statement that X is normally distributed with
parameters µ and σ2 is often abbreviated X ~ N(µ, σ2).
39
The Normal Distribution
Figure below presents graphs of f(x; µ, σ) for several
different (µ, σ) pairs.
40
The Standard Normal
Distribution
41
The Standard Normal Distribution
The normal distribution with parameter values µ = 0 and
σ = 1 is called the standard normal distribution.
<z<
43
The Standard Normal Distribution
Figure below illustrates the probabilities tabulated in Table
A.3:
44
Example cont’d
45
Example , cont. cont’d
b) P(Z ≥ 1.25) = ?
c) Why does P(Z ≤ –1.25) = P(Z >= 1.25)? What is
(–1.25)?
d) How do we calculate P(–.38 ≤ Z ≤ 1.25)?
46
Percentiles of the Standard Normal
Distribution
47
Example
The 99th percentile of the standard normal distribution is
that value of z such that the area under the z curve to the
left of the value is .99
Tables give for fixed z the area under the standard normal
curve to the left of z, whereas now –
we have the area and want the value of z.
48
Example cont’d
49
Percentiles of the Standard Normal Distribution
50
zα Notation
In statistical inference, we need the z values that give
certain tail areas under the standard normal curve.
51
zα Notation for z Critical Values
For example, z.10 captures upper-tail area .10, and z.01
captures upper-tail area .01.
Since α of the area under the z curve lies to the right of zα,
1 – α of the area lies to its left.
52
zα Notation for z Critical Values
Table below lists the most useful z percentiles and zα
values.
53
Example – critical values
z.05 is the 100(1 – .05)th = 95th percentile of the standard
normal distribution, so z.05 = 1.645.
54
Nonstandard Normal Distributions
When X ~ N(µ, σ 2), probabilities involving X are computed
by “standardizing.” The standardized variable is (X – µ)/σ.
Proposition
If X has a normal distribution with mean µ and standard
deviation σ, then
57
Using Normal to approximate the
Binomial Distribution
58
Approximating the Binomial Distribution
Figure below displays a binomial probability histogram for
the binomial distribution with n = 20, p = .6,
for which µ = 20(.6) = 12 and σ =
59
Approximating the Binomial Distribution
60
Exponential Distribution
61
The Exponential Distributions
The family of exponential distributions provides probability
models that are very widely used in engineering and
science disciplines. Examples?
Definition
X is said to have an exponential distribution with the rate
parameter λ (λ > 0) if the pdf of X is
62
The Exponential Distributions
Integration by parts give the following results:
CDF:
63
The Exponential Distributions
Another important application of the exponential distribution
is to model the distribution of lifetimes.
64
The Exponential Distributions
Suppose a light bulb’s lifetime is exponentially distributed
with parameter λ.
Say you turn the light on, and then we leave and come
back after t0 hours to find it still on. What is the probability
that the light bulb will last for at least additional t hours?
65
The Gamma Distribution
66
The Gamma Function
To define the family of gamma distributions, we first need
to introduce a function that plays an important role in many
branches of mathematics.
Definition
For α > 0, the gamma function is defined by
67
The Gamma Function
The most important properties of the gamma function are
the following:
3.
68
The Gamma Function
So if we let
69
The Gamma Distribution
Definition
A continuous random variable X is said to have a gamma
distribution if the pdf of X is
70
The Gamma Distribution
The Exp dist results from taking α = 1 and β = 1/λ.
72
Example
Suppose the survival time X (weeks) of a random mouse
has a gamma distribution with α = 8 and β = 15.
73
The Chi-Squared Distribution
74
The Chi-Squared Distribution
Definition
Let v be a positive integer. Then a random variable X is
said to have a chi-squared distribution with parameter v if
the pdf of X is the gamma density with α = v/2 and β = 2.
The pdf of a chi-squared rv is thus
(4.10)
76
The Weibull Distribution
The family of Weibull distributions was introduced by the
Swedish physicist Waloddi Weibull in 1939; his 1951 article
“A Statistical Distribution Function of Wide Applicability”
(J. of Applied Mechanics, vol. 18: 293–297) discusses a
number of applications.
Definition
A random variable X is said to have a Weibull distribution
with parameters α and β (α > 0, β > 0) if the pdf of X is
(4.11)
77
The Weibull Distribution
In some situations, there are theoretical justifications for the
appropriateness of the Weibull distribution, but in many
applications f (x; α, β) simply provides a good fit to
observed data for particular values of α and β.
78
The Weibull Distribution
Both α and β can be varied to obtain a number of different-
looking density curves, as illustrated in
79
The Weibull Distribution
β is called a scale parameter, since different values stretch
or compress the graph in the x direction, and α is referred
to as a shape parameter.
80
The Weibull Distribution
The integration is easily carried out to obtain
the cdf of X.
(4.12)
81
The Lognormal Distribution
82
The Lognormal Distribution
Definition
A nonnegative rv X is said to have a lognormal
distribution if the rv Y = ln(X) has a normal distribution.
83
The Lognormal Distribution
The parameters µ and σ are not the mean and standard
deviation of X but of ln(X).
84
The Lognormal Distribution
Figure below illustrates graphs of the lognormal pdf;
although a normal curve is symmetric, a lognormal curve is
skewed. Is the skew positive or negative?
85
The Lognormal Distribution
Because ln(X) has a normal distribution, the cdf of X can be
expressed in terms of the cdf φ(z) of a standard normal
rv Z.
(4.13)
86
The Beta Distribution
87
The Beta Distribution
So far, all families of continuous distributions (except for
the uniform distribution) had positive density over an infinite
interval.
88
The Beta Distribution
Definition
A random variable X is said to have a beta distribution
with parameters α, β (both positive), A, and B if the pdf of
X is
89
The Beta Distribution
Figure below illustrates several standard beta pdf’s.
91
Examples…
92
Example 1
Suppose the pdf of the magnitude X of a dynamic load on a
bridge (in newtons) is
What is F(x)?
93
Example 1 cont’d
94
Example 1 cont’d
95
Example 2
Two species are competing in a region for control of a
limited amount of a certain resource.
97
Example 3
The time that it takes a driver to react to the brake lights on
a decelerating vehicle is critical in helping to avoid rear-end
collisions.
99
Example 4
According to the article “Predictive Model for Pitting
Corrosion in Buried Oil and Gas Pipelines” (Corrosion,
2009: 332–342), the lognormal distribution has been
reported as the best option for describing the distribution of
maximum pit depth data from cast iron pipes in soil.
100
Example 4 cont’d
101