This action might not be possible to undo. Are you sure you want to continue?
https://www.scribd.com/doc/114416193/LevineDkkStatisticsforManagersUsingMicrosoftExcel5Edbab07
12/04/2012
text
original
Chap 71
Statistics for Managers
Using Microsoft® Excel
5th Edition
Chapter 7
Sampling and Sampling Distributions
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 72
Learning Objectives
In this chapter, you will learn:
To distinguish between different survey
sampling methods
The concept of the sampling distribution
To compute probabilities related to the
sample mean and the sample proportion
The importance of the Central Limit
Theorem
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 73
Why Sample?
Selecting a sample is less timeconsuming
than selecting every item in the population
(census).
Selecting a sample is less costly than
selecting every item in the population.
An analysis of a sample is less cumbersome
and more practical than an analysis of the
entire population.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 74
Types of Samples
Quota
Samples
NonProbability
Samples
Judgment Chunk
Probability Samples
Simple
Random
Systematic
Stratified
Cluster
Convenience
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 75
Types of Samples
In a nonprobability sample, items included
are chosen without regard to their probability
of occurrence.
In convenience sampling, items are selected
based only on the fact that they are easy,
inexpensive, or convenient to sample.
In a judgment sample, you get the opinions of
preselected experts in the subject matter.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 76
Types of Samples
In a probability sample, items in the sample
are chosen on the basis of known probabilities.
Probability Samples
Simple
Random
Systematic Stratified Cluster
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 77
Simple Random Sampling
Every individual or item from the frame has an
equal chance of being selected
Selection may be with replacement (selected
individual is returned to frame for possible
reselection) or without replacement (selected
individual isn’t returned to the frame).
Samples obtained from table of random numbers
or computer random number generators.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 78
Systematic Sampling
Decide on sample size: n
Divide frame of N individuals into groups of k
individuals: k=N/n
Randomly select one individual from the 1st group
Select every kth individual thereafter
For example, suppose you were sampling n = 9
individuals from a population of N = 72. So, the
population would be divided into k = 72/9 = 8 groups.
Randomly select a member from group 1, say
individual 3. Then, select every 8
th
individual
thereafter (i.e. 3, 11, 19, 27, 35, 43, 51, 59, 67)
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 79
Stratified Sampling
Divide population into two or more subgroups
(called strata) according to some common
characteristic.
A simple random sample is selected from each
subgroup, with sample sizes proportional to strata
sizes.
Samples from subgroups are combined into one.
This is a common technique when sampling
population of voters, stratifying across racial or
socioeconomic lines.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 710
Cluster Sampling
Population is divided into several “clusters,” each
representative of the population.
A simple random sample of clusters is selected.
All items in the selected clusters can be used, or items
can be chosen from a cluster using another probability
sampling technique.
A common application of cluster sampling involves
election exit polls, where certain election districts are
selected and sampled.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 711
Comparing Sampling Methods
Simple random sample and Systematic sample
Simple to use
May not be a good representation of the population’s
underlying characteristics
Stratified sample
Ensures representation of individuals across the entire
population
Cluster sample
More cost effective
Less efficient (need larger sample to acquire the same
level of precision)
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 712
Evaluating Survey Worthiness
What is the purpose of the survey?
Were data collected using a nonprobability
sample or a probability sample?
Coverage error – appropriate frame?
Nonresponse error – follow up
Measurement error – good questions elicit good
responses
Sampling error – always exists
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 713
Types of Survey Errors
Coverage error or selection bias
Exists if some groups are excluded from the frame
and have no chance of being selected
Non response error or bias
People who do not respond may be different from
those who do respond
Sampling error
Chance (luck of the draw) variation from sample to
sample.
Measurement error
Due to weaknesses in question design, respondent
error, and interviewer’s impact on the respondent
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 714
Sampling Distributions
A sampling distribution is a distribution of all of the
possible values of a statistic for a given size sample
selected from a population.
For example, suppose you sample 50 students from
your college regarding their mean GPA. If you
obtained many different samples of 50, you will
compute a different mean for each sample. We are
interested in the distribution of all potential mean GPA
we might calculate for any given sample of 50
students.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 715
Sampling Distributions
Sample Mean Example
Suppose your population (simplified) was
four people at your institution.
Population size N=4
Random variable, X, is age of individuals
Values of X: 18, 20, 22, 24 (years)
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 716
Sampling Distributions
Sample Mean Example
Summary Measures for the Population Distribution:
21
4
24 22 20 18
N
X
μ
i
=
+ + +
=
=
¯
2.236
N
μ) (X
σ
2
i
=
÷
=
¯
.3
.2
.1
0
18 20 22 24
A B C D
P(x)
x
Uniform Distribution
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 717
Sampling Distributions
Sample Mean Example
1
st
Obs.
2
nd
Observation
18 20 22 24
18 18,18 18,20 18,22 18,24
20 20,18 29,20 20,22 20,24
22 22,18 22,20 22,22 22,24
24 24,18 24,20 24,22 24,24
Now consider all possible samples of size n=2
1
st
Obs.
2
nd
Observation
18 20 22 24
18 18 19 20 21
20 19 20 21 22
22 20 21 22 23
24 21 22 23 24
16 Sample
Means
16 possible samples
(sampling with
replacement)
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 718
Sampling Distributions
Sample Mean Example
Sampling Distribution of All Sample Means
1
st
Obs
2
nd
Observation
18 20 22 24
18 18 19 20 21
20 19 20 21 22
22 20 21 22 23
24 21 22 23 24
16 Sample
Means
18 19 20 21 22 23 24
0
.1
.2
.3
P(X)
X
(no longer uniform)
Sample Means
Distribution
_
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 719
Sampling Distributions
Sample Mean Example
21
16
24 21 19 18
N
X
μ
i
X
=
+ + + +
= =
¯
1.58
16
21)  (24 21)  (19 21)  (18
N
) μ X (
σ
2 2 2
2
X
i
X
=
+ + +
=
÷
=
¯
Summary Measures of this Sampling Distribution:
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 720
Sampling Distributions
Sample Mean Example
Population
N = 4
1.58 σ 21 μ
X
= =
X
2.236 σ 21 μ = =
Sample Means Distribution
n = 2
18 20 22 24
A B C D
0
.1
.2
.3
P(X)
X 18 19 20 21 22 23 24
0
.1
.2
.3
P(X)
X
_
_
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 721
Sampling Distributions
Standard Error
n
σ
σ
X
=
Different samples of the same size from the same population
will yield different sample means.
A measure of the variability in the mean from sample to
sample is given by the Standard Error of the Mean:
Note that the standard error of the mean decreases as the
sample size increases.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 722
Sampling Distributions
Standard Error: Normal Pop.
μ μ
X
=
n
σ
σ
X
=
If a population is normal with mean μ and standard
deviation σ, the sampling distribution of the mean is also
normally distributed with
and
(This assumes that sampling is with replacement or sampling
is without replacement from an infinite population)
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 723
Sampling Distributions
Z Value: Normal Pop.
n
σ
μ) X (
σ
) μ X (
Z
X
X
÷
=
÷
=
Zvalue for the sampling distribution of the sample mean:
where: = sample mean
= population mean
= population standard deviation
n = sample size
X
μ
σ
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 724
Sampling Distributions
Properties: Normal Pop.
(i.e. is
unbiased )
Normal Population
Distribution
Normal Sampling
Distribution
(has the same mean)
μ μ
x
=
x
x
x
μ
x
μ
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 725
Sampling Distributions
Properties: Normal Pop.
For sampling with replacement:
As n increases,
decreases
x
σ
Larger sample
size
Smaller sample
size
x
μ
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 726
Sampling Distributions
NonNormal Population
The Central Limit Theorem states that as the sample
size (that is, the number of values in each sample)
gets large enough, the sampling distribution of the
mean is approximately normally distributed. This is
true regardless of the shape of the distribution of the
individual values in the population.
Measures of the sampling distribution:
μ μ
x
=
n
σ
σ
x
=
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 727
Sampling Distributions
NonNormal Population
Population Distribution
Sampling Distribution
(becomes normal as n increases)
x
x
Larger
sample
size
Smaller sample size
x
μ
μ
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 728
Sampling Distributions
NonNormal Population
For most distributions, n > 30 will give a
sampling distribution that is nearly normal
For fairly symmetric distributions, n > 15 will
give a sampling distribution that is nearly
normal
For normal population distributions, the
sampling distribution of the mean is always
normally distributed
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 729
Sampling Distributions
Example
Suppose a population has mean μ = 8 and standard
deviation σ = 3. Suppose a random sample of size n = 36
is selected.
What is the probability that the sample mean is between
7.75 and 8.25?
Even if the population is not normally distributed, the
central limit theorem can be used (n > 30).
So, the distribution of the sample mean is approximately
normal with
8 μ
x
=
0.5
36
3
n
σ
σ
x
= = =
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 730
Sampling Distributions
Example
5 . 0
36
3
8  8.25
5 . 0
36
3
8  7.75
= =
÷ = =
Z
Z
First, compute Z values for both 7.75 and 8.25.
0.3830 0.5) Z P(0.5 8.25) μ P(7.75
X
= < < = < <
Now, use the cumulative normal table to compute
the correct probability.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 731
Sampling Distributions
Example
= 2(.5000.3085)
= 2(.1915)
= 0.3830
Z
0.5 0.5
Standardized Normal
Distribution
0 μ
z
=
7.75 8.25
Sampling
Distribution
Sample
8 μ
X
=
x
Population
Distribution
8 μ = X
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 732
Sampling Distributions
The Proportion
size sample
interest of stic characteri the having sample in the of number
n
X items
p = =
The proportion of the population having some
characteristic is denoted π.
Sample proportion ( p ) provides an estimate of π:
0 ≤ p ≤ 1
p has a binomial distribution
(assuming sampling with replacement from a finite population or without
replacement from an infinite population)
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 733
Sampling Distributions
The Proportion
Standard error for the proportion:
n
) (1
σ
p
t t ÷
=
n
) (1
σ
Z
p
t t
t t
÷
÷
=
÷
=
p p
Z value for the proportion:
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 734
Sampling Distributions
The Proportion: Example
If the true proportion of voters who support
Proposition A is π = .4, what is the probability that
a sample of size 200 yields a sample proportion
between .40 and .45?
In other words, if π = .4 and n = 200, what is
P(.40 ≤ p ≤ .45) ?
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 735
Sampling Distributions
The Proportion: Example
.03464
200
.4) .4(1
n
) (1
σ =
÷
=
÷
=
t t
p
1.44) Z P(0
.03464
.40 .45
Z
.03464
.40 .40
P .45) P(.40
s s =

.

\

÷
s s
÷
= s s p
Find :
Convert to
standardized
normal:
p
σ
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 736
Sampling Distributions
The Proportion: Example
Use cumulative normal table:
P(0 ≤ Z ≤ 1.44) = P(Z ≤ 1.44) – 0.5 = .4251
Z
.45 1.44
.4251
Standardize
Sampling Distribution
Standardized
Normal Distribution
.40 0
p
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson PrenticeHall, Inc. Chap 737
Chapter Summary
Described different types of samples
Examined survey worthiness and types of survey
errors
Introduced sampling distributions
Described the sampling distribution of the mean
For normal populations
Using the Central Limit Theorem
Described the sampling distribution of a proportion
Calculated probabilities using sampling distributions
In this chapter, we have
This action might not be possible to undo. Are you sure you want to continue?