Professional Documents
Culture Documents
The
The purpose
purpose of
of statistical
statistical inference
inference is
is to
to obtain
obtain
information
information about
about aa population
population from
from information
information
contained
contained in
in aa sample.
sample.
A
A population
population is
is the
the set
set of
of all
all the
the elements
elements of
of interest.
interest.
A
A sample
sample is
is aa subset
subset of
of the
the population.
population.
The
The sample
sample results
results provide
provide only
only estimates
estimates of
of the
the
values
values of
of the
the population
population characteristics.
characteristics.
With
With proper
proper sampling
sampling methods
methods,, the
the sample
sample results
results
can
can provide
provide “good”
“good” estimates
estimates of
of the
the population
population
characteristics.
characteristics.
A
A parameter
parameter is
is aa numerical
numerical characteristic
characteristic of
of aa
population.
population.
In
In point
point estimation
estimation wewe use
use the
the data
data from
from the
the sample
sample
to
to compute
compute aa value
value ofof aa sample
sample statistic
statistic that
that serves
serves
as
as an
an estimate
estimate of
of aa population
population parameter.
parameter.
We
We refer to x as
refer to as the
the point
point estimator
estimator of
of the
the population
population
mean ..
mean
ss is
is the
the point
point estimator
estimator of
of the
the population
population standard
standard
deviation ..
deviation
p is
is the
the point
point estimator
estimator of
of the
the population
population proportion
proportion pp..
• Park visits
• Time spent and age of the visitors
Sampling distribution of x
E( x ) =
N n
x ( ) x
N 1 n n
• A population is infinite when n/N < .05.
Exercise:
The mean annual cost of automobile insurance
is $939. Assume that the standard deviation is
$245.
What is the probability that a sample of
automobile insurance policies will have a
sample mean within $25 of the population
mean for sample sizes: 30, 60 and 90?
What is your interpretation of different
sample size?
Example: St. Andrew’s
xi
990
900
Population Standard Deviation for SAT Score
i
( x ) 2
80
900
Population Proportion Wanting On-Campus Housing
648
p .72
900
3-Digit Applicant
Random Number Included in Sample
744 No. 744
436 No. 436
865 No. 865
790 No. 790
835 No. 835
902 Number exceeds 900
190 No. 190
836 No. 836
. . . and so on
x as Point Estimator of
x
x i
29 , 910
997
n 30
s as Point Estimator of
s
(x i x )2
163, 996
75.2
n1 29
pas Point Estimator of p
p 2 0 3 0 .6 8
Sampling
Distribution
80
of x x 14.6
n 30
x
E ( x ) 990
Sampling
Distribution x 1 4 .6
of x
Area = .7517
x
990 1000
Sampling
Distribution x 1 4 .6
of x
Area = .2483
x
980 990
Sampling
Distribution x 1 4 .6
of x
Area = .5034
x
980 990 1000
With n = 100,
x 8
With n = 30,
x 1 4 .6
x
E ( x ) 990
Sampling
Distribution x 8
of x
Area = .7888
x
980 990 1000
E ( p) p
N n p(1 p) p (1 p )
p p
N 1 n n
N n p(1 p) p (1 p )
p p
N 1 n n
p
E( p ) .72
Area = .4582
p
.67 .72 .77
45.82 %
© 2008 Thomson South-Western. All Rights Reserved Slide
70
Sampling Distribution of p
Sampling
Distribution .72(1 .72)
p .082
0.045
of p 100
30
Area = ???
p
0.67 0.72 0.77
Unbiased
Efficiency
Consistency
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
76
use.
Statistics for Business and Economics
(13e)
Stratified Random Sampling
• The population is first divided into groups of elements called strata.
• Each element in the population belongs to one and only one stratum
• Best results are obtained when the elements within
each stratum are as much alike as possible (i.e. a
homogeneous group).
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
77
use.
Statistics for Business and Economics
(13e)
Stratified Random Sampling
• A simple random sample is taken from each stratum.
• Formulas are available for combining the stratum
sample results into one population parameter estimate.
• Advantage: If strata are homogeneous, this method is
as “precise” as simple random sampling but with a
smaller total sample size.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
78
use.
Statistics for Business and Economics
(13e)
Cluster Sampling
• The population is first divided into separate groups of
elements called clusters.
• Ideally, each cluster is a representative small-scale
version of the population (i.e. heterogeneous group).
• A simple random sample of the clusters is then taken.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
79
use.
Statistics for Business and Economics
(13e)
Cluster Sampling
• Example: A primary application is area sampling, where
clusters are city blocks or other well-defined areas.
• Advantage: The close proximity of elements can be cost
effective (i.e. many sample observations can be
obtained in a short time).
• Disadvantage: This method generally requires a larger
total sample size than simple or stratified random
sampling.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
80
use.
Statistics for Business and Economics
(13e)
Systematic Sampling
• If a sample size of n is desired from a population
containing N elements, we might sample one element
for every n/N elements in the population.
• We randomly select one of the first n/N elements from
the population list.
• We then select every n/Nth element that follows in the
population list.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
81
use.
Statistics for Business and Economics
(13e)
Systematic Sampling
• This method has the properties of a simple random
sample, especially if the list of the population elements
is a random ordering.
• Advantage: The sample usually will be easier to identify
than it would be if simple random sampling were used.
• Example: Selecting every 100th listing in a telephone
book after the first randomly selected listing
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
82
use.
Statistics for Business and Economics
(13e)
Convenience Sampling
• It is a nonprobability sampling technique. Items are
included in the sample without known probabilities of
being selected.
• The sample is identified primarily by convenience.
• Example: A professor conducting research might use
student volunteers to constitute a sample.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
83
use.
Statistics for Business and Economics
(13e)
Convenience Sampling
• Advantage: Sample selection and data collection are
relatively easy.
• Disadvantage: It is impossible to determine how
representative of the population the sample is.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
84
use.
Statistics for Business and Economics
(13e)
Judgment Sampling
• The person most knowledgeable on the subject of the
study selects elements of the population that he or she
feels are most representative of the population.
• It is a nonprobability sampling technique.
• Example: A reporter might sample three or four
senators, judging them as reflecting the general opinion
of the senate.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
85
use.
Statistics for Business and Economics
(13e)
Judgment Sampling
• Advantage: It is a relatively easy way of selecting a
• sample.
Disadvantage: The quality of the sample results
depends on the judgment of the person selecting the
sample.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
86
use.
Statistics for Business and Economics
(13e)
Recommendation
• It is recommended that probability sampling methods
(simple random, stratified, cluster, or systematic) be
• used.
For these methods, formulas are available for evaluating
the “goodness” of the sample results in terms of the
closeness of the results to the population parameters
being estimated.
• An evaluation of the goodness cannot be made with
non-probability (convenience or judgment) sampling
methods.
© 2017 Cengage Learning. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a
license distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom
87
use.