Professional Documents
Culture Documents
Statistical Methods For Business & Economics Sampling Distributions
Statistical Methods For Business & Economics Sampling Distributions
Sampling Distributions
Davide Raggi1
1 davide.raggi@unibo.it
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
1 / 28
Outline
▶ Random sampling
▶ The sampling average
▶ The Central Limit Theorem (CLT)
▶ Confidence Intervals
▶ Suggested readings:
▶ Chapter 4 of Agresti and Finlay (paragraphs 4.4 and 4.5)
▶ Chapter 5
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
2 / 28
Simple random sampling
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
3 / 28
Simple random sampling
Main Consequences:
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
4 / 28
Simple random sampling
Example:
1∑
n
y= yi
n
i=1
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
5 / 28
Simple random sampling (cont.)
Example:
.5
0
1
.5
D
1.5
Density
2
2.5
3
3.5
C
4
College
ollege
.5
ensity GPA
2
1.5
Density
1.5
0
2 2.5 3 3.5 4
College GPA
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
7 / 28
The sampling average (cont.)
▶ The random variable collecting all the possible averages
computed for all the possible samples of a given size n is
defined through capital letters as
1∑ n
Y = Yi
n i=1
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
8 / 28
The sampling average (cont.)
1.1
8
6
4
2
0
Density
10
2.9
3
3.1
3.2
3.3
Sample
0 size n=50
10
8 6
Density
4
2
0
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
9 / 28
The sampling average
Why the sampling average?
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
10 / 28
The sampling average (cont.)
Why the sampling average?
1 ∑∑
n n
= Cov(Yi , Yj )
n2 i=1 j=1
1 ∑ 1 ∑ ∑
n n n
= Var(Y i ) + Cov(Yi , Yj )
n2 i=1 n2 i=1
j=1,j̸=i
1 ∑ 2
n
σ2
= 2
σY = Y
n i=1 n
▶ The standard
√ 2 deviation of the sampling distribution, i.e.,
σY
σȲ = n is also known as standard error.
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
11 / 28
The sampling average
Summary of the results
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
12 / 28
The sampling average (cont.)
Summary of the results
.5
.5
1.5
2.5
3.5
s
D
2.6
2.8
3.4
3.6
8
6
4
2
2.9
3.2
3.3
1
5
0
Density
20
15
10
2.95
3
3.05
3.1
3.15
sample
ample
.5
.2
.1
0
.05
ensity size n=100
n=2
n=10
n=50
4
1 1.5 2
2 3
Density
Density
1
.5
0
0
2 2.5 3 3.5 4 2.6 2.8 3 3.2 3.4 3.6
sample size n=2 sample size n=10
4 6 8 10
20
10 15
Density
Density
5
2
0
0
2.9 3 3.1 3.2 3.3 2.95 3 3.05 3.1 3.15
sample size n=50 sample size n=100
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
13 / 28
The sampling average (cont.)
Summary of the results
Sample
1
5
0
Histograms
20
15
10
2
2.5
3
3.5
n
4
n=2
n=10
n=50
n=100
=2
0
.5 size n=50
20 n=100
n=2
n=10
2 2.5 3 3.5 4
n=2 n=10
n=50 n=100
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
14 / 28
Central Limit Theorem
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
15 / 28
Central Limit Theorem (cont.)
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
16 / 28
Central Limit Theorem
Example:
.6
.
Density
.5
.4
.3
.2
.1
-4
-2
0
2
S
4
Sample
2
1
ample size n=50
n=2
.6
.5
.4
.4
.3
Density
Density
.2
.2
.1
0
0
-4 -2 0 2 4 -4 -2 0 2 4
Sample size n=2 Sample size n=50
√ ( )
Figure 5 : GPA example: Sampling distribution of n Y −µ σ obtained
from 10,000 different samples of size n = 2, and n = 50. In red the
density of a Normal variable with mean 0 and variance 1.
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
17 / 28
Confidence intervals
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
18 / 28
Confidence intervals (cont.)
▶ An interval estimate or confidence interval is an interval of
numbers around the point estimate, within which the
parameter value is expected to fall.
▶ The probability the true parameter is not included in the
interval is called error probability and it is indicated with the
greek letter α. The length of the interval depends on the
confidence level 1 − α, that measures the probability that the
true parameter is included in the interval. In general 1 − α is
taken to be 0.95 or 0.99.
▶ In a nutshell, we would like to obtain an interval that likely is
going to include the true parameter of the population µY
(with probability 1 − α).
▶ In general a confidence interval has the form
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
20 / 28
Confidence intervals (cont.)
Ȳ −µY
▶ We know that σȲ ∼ N (0, 1). This result implies that
( )
Ȳ − µY
P −z α2 ≤ ≤ z α2 = 1 − α.
σȲ
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
21 / 28
Confidence intervals (cont.)
▶ In particular the interval defined as follows
Ȳ
|{z} ± z α2 σȲ
point estimate
| {z }
margin of error
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
22 / 28
Confidence intervals (cont.)
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
23 / 28
Confidence Intervals
Graphical illustration
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
24 / 28
Confidence Intervals
Example:
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
25 / 28
Example: Estimating a proportion
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
26 / 28
Example: Estimating a proportion (cont.)
▶ Let π be the proportion of Floridians who would respond yes.
An estimate of π, let say π̂ is given by
∑
1 1200 396
π̂ = (1 − yi ) = = 0.33
N i=1 1200
π̂ ± 1.96σπ̂
. . . . . . . . . . . . . . . . . . . .
.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. ..
27 / 28
Example: Estimating a proportion (cont.)
▶ In order to compute the confidence interval, we need to
compute σπ̂ . Note first that Yi is a Bernoulli random variable
such that P(Yi = 1) = 1 − π and P(Yi = 0) = π.
√
▶ The standard deviation of Yi is thus σ = π(1 − π).
▶ A
√ naı̈ve estimate of σ on the basis of this sample is then
(0.33) × (1 − 0.33) and then the estimated standard error
of the sample proportion is
√
π̂(1 − π̂)
sπ̂ = = 0.0136.
n
▶ A 95% confidence interval is thus