Professional Documents
Culture Documents
COMPLETE
BUSINESS
STATISTICS
by
AMIR D. ACZEL
&
JAYAVEL SOUNDERPANDIAN
6th edition (SIE)
5-2
Chapter 5
Degrees of Freedom
The Template
5-4
5 LEARNING OBJECTIVES
After studying this chapter you should be able to:
Take random samples from populations
Distinguish between population parameters and sample statistics
Apply the central limit theorem
Derive sampling distributions of sample means and proportions
Explain why sample statistics are good estimators of population
parameters
Judge one estimator as better than another based on desirable
properties of estimators
Apply the concept of degrees of freedom
Identify special sampling methods
Compute sampling distributions and related results using
templates
5-5
Unbiased
Sample
Unbiased,
representative sample
Democrats Republicans drawn at random from
Population
the entire population.
Biased Biased,
People who have Sample
phones and/or cars unrepresentative
and/or are Digest
readers. sample drawn from
Democrats Republicans people who have cars
Population and/or telephones
and/or read the Digest.
5-7
Estimators
X X X X X X X
X X X X X X X
X X X X
Sample points
Sample mean X
( )
5-11
P(X)
5 0.125 0.625 0.5 0.25 0.03125 0.1
6 0.125 0.750 1.5 2.25 0.28125
7 0.125 0.875 2.5 6.25 0.78125
8 0.125 1.000 3.5 12.25 1.53125
0.0
1.000 4.500 5.25000 1 2 3 4 5 6 7 8
X
E(X) = = 4.5
V(X) = 2 = 5.25
SD(X) = = 2.2913
5-14
P(X)
0.062500 0.156250 -2.0 4.00 0.250000
0.078125 0.234375 -1.5 2.25 0.175781 0.05
0.093750 0.328125 -1.0 1.00 0.093750
0.109375 0.437500 -0.5 0.25 0.027344
0.125000 0.562500 0.0 0.00 0.000000 0.00
0.109375 0.546875 0.5 0.25 0.027344 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
0.093750 0.515625 1.0 1.00 0.093750 X
0.078125 0.468750 1.5 2.25 0.175781
0.062500 0.406250 2.0 4.00 0.250000
0.046875
0.031250
0.328125
0.234375
2.5
3.0
6.25
9.00
0.292969
0.281250
E ( X ) X 4.5
0.015625 0.125000 3.5 12.25 0.191406 V ( X ) 2X 2.625
1.000000 4.500000 2.625000 SD( X ) X 1.6202
5-16
P(X)
0.1
symmetric. X
0.00
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
X
5-17
The expected value of the sample mean is equal to the population mean:
E( X )
X X
The variance of the sample mean is equal to the population variance divided by
the sample size:
2
V(X) 2
X
X
n
The standard deviation of the sample mean, known as the standard error of
the mean,
mean is equal to the population standard deviation divided by the square
root of the sample size:
SD( X ) X
X
n
5-18
compactly distributed around
that population mean
5-19
n=5
When sampling from a population 0.25
P(X)
0.10
P(X)
0.1
(n >30). Large n
0.4
0.3
f(X)
0.2
Population
n=2
n = 30
X X X X
5-21
217 220 217 220
P Z P Z
15 15
100 10
P ( Z 2) 0.0228
5-22
Example 5-2
15 4.00 - 4.49
4.50 - 4.99
10 5.00 - 5.49
5.50 - 5.99
5
6.00 - 6.49
0 6.50 - 6.99
7.00 - 7.49
Range
7.50 - 7.99
5-23
Student’s t Distribution
0 .4
P(X)
0 .2
number of trials, n.
0 .0
0 1 2
n=10,p=0.3
X 0.3
P(X)
0.1
0.0
0 1 2 3 4 5 6 7 8 9 10
X
0.2
P(X)
deviation
0.1
p(1 p)
n 0.0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 X
0 1 2 3 4 5 6 7 8 9 10 1112 13 1415
15 1515 15 15 15 15 15 151515 1515 15 1515 ^
p
5-25
Unbiasedness
{
Bias
Efficiency
Consistency
n = 10 n = 100
The sample variance (the sum of the squared deviations from the
sample mean divided by (n-1) is an unbiased estimator of the
population variance. In contrast, the average squared deviation
from the sample mean is a biased (though consistent) estimator of the
population variance.
E (s ) E
2 ( x x )
2
2
(n 1)
( x x )2
2
E
n
5-33
If only two data points and the sample mean are known:
12 14 x x4 56
3
5-35
s
2 (x x)
2
(n 1)
5-36
Example 5-4
A sample of size 10 is given below. We are to choose three different numbers from
which the deviations are to be taken. The first number is to be used for the first five
sample points; the second number is to be used for the next three sample points; and
the third number is to be used for the last two sample points.
Sample # 1 2 3 4 5 6 7 8 9 10
Sample 93 97 60 72 96 83 59 66 88 53
Point
Solution: Choose the means of the corresponding sample points. These are: 83.6,
69.33, and 70.5.
Solution: df = 10 – 3 = 7.
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Sample Means (Class Midpoints)