Professional Documents
Culture Documents
• Snowball Sampling
Probability Sampling
k=7 ꝿ ꝿ ꝿ ꝿ ꝿ ꝿ ꝿ ꝿ ꝿ ꝿ
The Unbiased Property of the Sample Mean
The sample mean is unbiased because the mean of all the possible sample means (of a
given sample size, n) is equal to the population mean,
Suppose Best Buy, Inc. wants to estimate the mean age of buyers of high-definition
televisions. They select a random sample of 50 recent purchasers, determine the age
of each purchaser, and compute the mean age of the buyers in the sample. The mean
of this sample is a point estimate of the mean of the population.
The sample mean, is a point estimate of the population mean and the sample standard
deviation, is a point estimate of the population standard deviation.
A point estimate, however, tells only part of the story. While we expect the point estimate
to be close to the population parameter, we would like to measure how close it really is. A
confidence interval serves this purpose.
The 95 percent confidence interval is computed as follows, when the number of observations
in the sample is at least 30.
For reasonably large samples, the results of the central limit theorem allow us to
state the following:
1. Ninety-five percent of the sample means selected from a population will be within
1.96 standard deviations of the population mean.
2. Ninety-nine percent of the sample means will lie within 2.58 standard deviations
of the population mean.
The standard deviation discussed here is the standard deviation of the sampling
distribution of the sample mean. It is usually called the "standard error.“
Intervals computed in this fashion are called the 95 percent confidence interval and
the 99 percent confidence interval.
Can You Ever Know the Population Standard
Deviation?
To solve above equation , you must know the value for the population standard
deviation. To know implies that you know all the values in the entire population.
(How else would you know the value of this population parameter?) If you knew all
the values in the entire population, you could directly compute the population mean.
There would be no need to use the inductive reasoning of inferential statistics to
estimate the population mean. In other words, if you knew you really do not have
a need to use the above equation to construct a “confidence interval estimate of the
mean ( known).”
More significantly, in virtually all real-world business situations, you would never know
the standard deviation of the population. In business situations, populations are often
too large to examine all the values. So why study the confidence interval estimate of
the mean ( known) at all? This method serves as an important introduction to the
concept of a confidence interval because it uses the normal distribution. In the next
slides, you will see that constructing a confidence interval estimate when is not known
requires another distribution (the t distribution).
Reference Slide
Student’s t-Distribution
• In situation, where the population is normal
but its standard deviation is unknown, the
student’s t distribution should be used instead
of the normal z- distribution.
• This is particularly important when the sample
size is small. When is unknown, the formula
for a confidence interval resembles the
formula for known except that t replaces z
and s replace
At the start of the twentieth century, William S.
Gosset was working at Guinness in Ireland,
trying to help brew better beer less
expensively .As he had only small samples to
study, he needed to find a way to make inferences
about means without having to know Writing
under the pen name “Student,” Gosset solved this
problem by developing what today is known as
the Student’s t distribution, or the t distribution,
Sample Size Determination for the Mean
To develop an equation for determining the appropriate
sample size needed when constructing a confidence interval
estimate for the mean, recall equation
Solution: 97
A survey is planned to determine the mean annual family medical expenses of employees
of a large company. The management of the company wishes to be 95% confident that the
sample mean is correct to within of the population mean annual family medical
expenses. A previous study indicates that the standard deviation is approximately $400.
a. How large a sample is necessary?
b. If management wants to be correct to within how many employees need to be
selected?