Professional Documents
Culture Documents
Definitions
Population: set of all subjects or elements about which we are interested in making
inferences.
Frame: a list containing all members of the population.
Census: a survey that includes all the elements or units in the frame.
Population Parameters: facts about the population. Since parameters are descriptions
of the population, a population can have many parameters.
Sample: a subset of the population which is used to gain insight about the population.
Samples are used to represent a larger group, the population.
Statistic: a fact or characteristic about a sample.
Descriptive Statistics: the collection, organization, analysis, and presentation of data.
Inferential Statistics: make reasonable estimates about population characteristics using
sample data
Normal Distribution
Population Mean
Confidence Interval of the Population Mean
A 100(1 – α)% confidence interval for the population mean is given by:
x ± za 2
n
if either of the following conditions are true:
1. n ≥ 30 (uses as an approximation for σ), or
2. if σ is known and the population being studied is normally distributed.
Example: Construct 80%, 90%, 95%, and 99% confidence intervals for the population mean if
the standard deviation of the population is 900. Use the following sample data.
n =100
x =425
Solution:
900
4 2 5 ± 1 .2 8 × o r 3 1 0 to 5 4 0
80% Confidence Interval: 100
900
4 2 5 ± 1 .6 4 5 × o r 2 7 7 to 5 7 3
90% Confidence Interval: 100
900
4 2 5 ± 1 .9 6 × o r 2 4 9 to 6 0 1
95% Confidence Interval: 100
900
4 2 5 ± 2 .5 7 5 × o r 1 9 3 to 6 5 7
99% Confidence Interval: 100
Degrees of Freedom
The degrees of freedom for any t-distribution are computed in the following manner:
df = number of sample observations – 1 = n – 1
A manufacturing company is interested in the amount of time it takes to complete a certain stage
of the production process. The project manager randomly samples 10 products as they come
from the production line and notes the time of completion. The average completion time is 23.45
minutes with a sample standard deviation of 4.32 minutes. Based on this sample, construct a 95%
confidence interval for the average completion time for that stage in the production process.
Assume that the population distribution of the completion times is approximately normal.
s
x ± t 2 ,d f
n
4 .3 2
2 3 .4 5 ± 2 .2 6 2
10
Thus, we are 95% confident that the true average completion time of that stage of the process is
between 20.36 minutes and 26.54 minutes.
Chapter 10
However, should the frequency distribution show substantial deviation from normality, a
nonparametric procedure may be warranted. These methods are discussed in Chapter 16.
Is the sample size sufficiently large?
Fortunately, the central limit theorem enables us to define the following rule to make this
decision.
We will continue with steps 7 through 10 after we discuss the test statistic.
Formula
z-Test Statistic
If n ≥ 30 or if the standard deviation of the population, σ, is known and the sample is drawn from
a normal population, then by the central limit theorem the test statistic is approximately given by
x -
z = 0
, w h e re x = .
x n
z has a standard normal distribution, and if σ is unknown and n is greater than or equal
to 30, s can be used as an approximation of σ.
Table 10.1 – Critical Values of the z-Test Statistic for Two-Sided Alternatives
The sample mean is 3.46 standard deviations from the hypothesized value. Is the sample
mean too far away from the value of the mean specified in the null hypothesis for us to
believe that the null is true?
The test statistic z = 3.46 implies x̄ is 3.46 standard deviation units from the mean which
is substantially more than 1.96 standard deviations from the hypothesized value. The
decision must be to reject H0. The sample mean is too far from the hypothesized value
for us to believe the difference is caused by ordinary sampling variation. Essentially, x̄
exceeds the tolerance for rareness that we have imposed by setting α= 0.05.
Step 10: State the conclusion in terms of the original question. There is significant evidence at
the 0.05 level that the students at the local university do not spend, on average, $500 per
semester on textbooks. It would be tempting to conclude that the students at the local university
spend more since the sample mean is greater than the national average. However, we did not test
a hypothesis for spending more; we tested whether the students at the local university were
spending more or less than the national average, and the conclusion must be consistent with the
hypothesis tested.