Professional Documents
Culture Documents
Lesson 14 Slides
Lesson 14 Slides
1 / 36
Review
2 / 36
Notation:
µ = population mean
σ = population standard deviation
X̄ = sample mean
s = sample standard deviation
Mean(X̄ )= mean of the sampling distribution of X̄
SD(X̄ )= standard deviation of the sampling distribution of X̄
3 / 36
The Sampling Distribution of X̄
4 / 36
Example 1: Population distribution of blood pressures (BP)
0.030
0.025
Percentage of Men in Population
0.020
µ = 125 mm Hg
0.015
σ = 14 mm Hg
0.010
0.005
0.000
5 / 36
Let’s do a simulation for n = 20
6 / 36
7 / 36
So we did this 500 times...
Let’s look at a histogram of the 500 sample means
Each based on a sample of size 20
8 / 36
Let’s do ANOTHER simulation for n = 50
9 / 36
Summary of simulation results for BP
10 / 36
Example 2: Population distribution of hospital length of stay (LOS)
0.20
0.15
Percentage
µ = 4 days
0.10
σ = 3 days
0.05
0.00
0 5 10 15 20 25 30
11 / 36
Let’s do a simulation for n = 16
12 / 36
13 / 36
So we did this 500 times...
Let’s look at a histogram of the 500 sample means
Each based on a sample of size 16
14 / 36
Let’s do ANOTHER simulation for n = 64
15 / 36
16 / 36
So we did this 500 times...
Let’s look at a histogram of the 500 sample means
Each based on a sample of size 64
17 / 36
Let’s do ANOTHER simulation for n = 256
18 / 36
19 / 36
So we did this 500 times...
Let’s look at a histogram of the 500 sample means
Each based on a sample of size 256
20 / 36
Simulation results for Hospital LOS
21 / 36
Variation in sample mean values tied to size of each
sample
NOT the number of samples ●
8
●
●
●
●
●
●
7
●
● ●
●
●
●
● ●
●
● ●
●
●
●
●
● ●
● ●
●
●
● ●
●
●
6
● ●
●
●
●
●
● ●
●
●
●
●
●
5
●
●
● ●
●
4
●
● ●
●
●
●
3
●
2
22 / 36
Sampling Distribution of X̄
23 / 36
CLT–R Shiny App example
24 / 36
• In real research it is impossible to estimate the sampling
distribution of a sample mean by actually taking multiple
random samples from the same population
• no research would ever happen if a study needed to be
repeated multiple times to understand this sampling
behavior
• Simulations are useful to illustrate a concept, but not to
highlight a practical approach!
• Luckily, there is some mathematical machinery that
generalizes some of the patterns we saw in the simulation
results
25 / 36
Amazing Result
26 / 36
Central Limit Theorem (CLT)
If you take a large SRS of size n from any population, then the
shape of the sampling distribution of x̄ is approximately normal
• shape gets more normal as n increases
• n > 30 is considered large
• CLT allows us to use the standard normal table to compute
approximate probabilities associated with x̄
27 / 36
Central Limit Theorem
Population
0 2 4 6 8 10
Sample Means
based on n = 16
0 2 4 6 8 10
Sample Means
based on n = 64
0 2 4 6 8 10
Sample Means
based on n = 256
0 2 4 6 8 10
28 / 36
Sampling Distribution of X̄
30 / 36
Why is the sampling distribution so important?
31 / 36
Why is the normal distribution so important in the
study of statistics?
32 / 36
Self-check
33 / 36
Self-check
33 / 36
Self-check
34 / 36
Self-check
34 / 36
Self-check
(a) x̄ = 7
(b) x̄ = 49
(c) x̄ = 80
(d) An x̄ close to 80
35 / 36
Self-check
(a) x̄ = 7
(b) x̄ = 49
(c) x̄ = 80
(d) An x̄ close to 80
35 / 36
Vocabulary
36 / 36