Dr Ali Baştaş
• Intervals
IENG385 8-2
Section 9.2 Statistical Inference
• Statistical inference consists of those methods by which one
makes inferences or generalizations about a population.
Section 9.3 Classical Methods of Estimation
Parametric Point Estimation
• A point estimate of some population parameter θ is a single value ˆθ of a
statistic ˆΘ.
• E.g., the value ¯x of the statistic ¯X, computed from a sample of size n, is
a point estimate of the population parameter μ.
Properties of a good estimator and Unbiased estimator
• What are the desirable properties of a “good” decision function that would
influence us to choose one estimator rather than another?
Minimum Variance Unbiased Estimator (MVUE)
• If ˆΘ1 and ˆΘ2 are two unbiased estimators of the same population parameter
θ, we want to choose the estimator whose sampling distribution has the
smaller variance.
• Hence, if σ²_{ˆθ1} < σ²_{ˆθ2}, we say that ˆΘ1 is a more efficient estimator of θ than ˆΘ2.
Definition 9.2
Sampling distributions of different estimators of θ
• It is clear that only ˆΘ1 and ˆΘ2 are unbiased, since their distributions are centered at θ.
• The estimator ˆΘ1 has a smaller variance than ˆΘ2 and is therefore more efficient.
• Hence, our choice for an estimator (MVUE) of θ, among the three considered, would be ˆΘ1.
Estimation of the Population Mean – Mean or
the Median?
• For normal populations, one can show that both ¯X (sample mean) and ˜X
(sample median) are unbiased estimators of the population mean μ, but
the variance of ¯X is smaller than the variance of ˜X .
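The efficiency claim above can be checked empirically. The following is a minimal simulation sketch, not part of the original slides; the population values (μ = 10, σ = 2), the sample size, the number of repetitions, and the function name are all assumptions chosen for illustration.

```python
import random
import statistics


def compare_mean_and_median(mu=10.0, sigma=2.0, n=25, reps=4000, seed=1):
    """Draw `reps` normal samples of size `n` and return the empirical
    variances of the sample mean and of the sample median."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    means, medians = [], []
    for _ in range(reps):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        means.append(statistics.fmean(sample))
        medians.append(statistics.median(sample))
    return statistics.variance(means), statistics.variance(medians)


var_mean, var_median = compare_mean_and_median()
print(f"variance of sample mean:   {var_mean:.4f}")
print(f"variance of sample median: {var_median:.4f}")
```

With these assumed settings the variance of the sample mean should land near σ²/n = 0.16, and the variance of the sample median should come out noticeably larger.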
Interval Estimation
• Even the most efficient unbiased estimator is unlikely to estimate the
population parameter exactly.
• It is true that estimation accuracy increases with large samples, but there is
still no reason we should expect a point estimate from a given sample to be
exactly equal to the population parameter it is supposed to estimate.
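• An interval estimate of a population parameter θ is an interval of the form ˆθL < θ < ˆθU, where the endpoints ˆθL and ˆθU depend on the value of the statistic ˆΘ for the particular sample.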
• The interval ˆθL < θ < ˆθU, computed from the selected sample, is called a 100(1 − α)%
confidence interval; the fraction 1 − α is called the confidence coefficient or the
degree of confidence, and the endpoints, ˆθL and ˆθU , are called the lower and upper
confidence limits.
• Thus, when α = 0.05, we have a 95% confidence interval, and when α = 0.01, we
obtain a wider 99% confidence interval.
• The wider the confidence interval is, the more confident we can be that the interval
contains the unknown parameter.
Section 9.4 Single Sample: Estimating the Mean
P(−z_{α/2} < Z < z_{α/2}) = 1 − α
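The critical value in this probability statement can be computed rather than read from a table. The sketch below uses `NormalDist` from the Python standard library; the helper name `z_critical` is an assumption introduced for illustration.

```python
from statistics import NormalDist


def z_critical(confidence):
    """Return the standard normal value z with P(-z < Z < z) = confidence."""
    alpha = 1.0 - confidence
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)


for confidence in (0.95, 0.99):
    z = z_critical(confidence)
    # Recover the central probability to confirm the critical value.
    covered = NormalDist().cdf(z) - NormalDist().cdf(-z)
    print(f"{confidence:.0%}: z = {z:.3f}, P(-z < Z < z) = {covered:.4f}")
```

The 99% critical value is larger than the 95% one, which is why the 99% interval is wider for the same sample.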
Interval estimates of μ for different samples
Additional Example: An electrical firm manufactures light bulbs that have a length of life that is
approximately normally distributed with a standard deviation of 40 hours. If a sample of 30 bulbs has an
average life of 780 hours, find a 96% confidence interval for the population mean of all bulbs produced by
this firm.
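One way to work this example is with the usual known-σ interval, x̄ ± z_{α/2} σ/√n. The sketch below assumes that formula applies (normal population, σ known); the function name `z_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_interval(xbar, sigma, n, confidence):
    """Two-sided confidence interval for the mean when sigma is known."""
    alpha = 1.0 - confidence
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    half_width = z * sigma / sqrt(n)
    return xbar - half_width, xbar + half_width


# Light bulb data from the example: sigma = 40 h, n = 30, xbar = 780 h.
lower, upper = z_interval(xbar=780.0, sigma=40.0, n=30, confidence=0.96)
print(f"{lower:.1f} < mu < {upper:.1f}")
```

Under these assumptions the interval runs from roughly 765 to 795 hours.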
Error in estimating μ by ¯x
Theorem 9.1
• In Example 9.2, we are 95% confident that the sample mean ¯x = 2.6 differs from
the true mean μ by an amount less than (1.96)(0.3)/√36 = 0.098 ≈ 0.1.
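The error bound quoted for Example 9.2 follows the pattern z_{α/2} σ/√n. A small sketch of that arithmetic, with a hypothetical helper name `max_error`:

```python
from math import sqrt


def max_error(z, sigma, n):
    """Upper bound on |xbar - mu| at the confidence level implied by z."""
    return z * sigma / sqrt(n)


# Values quoted for Example 9.2: z = 1.96, sigma = 0.3, n = 36.
e = max_error(z=1.96, sigma=0.3, n=36)
print(f"{e:.3f}")
```

The result, 0.098, is what the slide rounds to 0.1.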
Theorem 9.2
Additional Example: A marketing agency wishes to determine the average time, in days, that
it takes to sell a product in various stores in a city. How large a sample will they need in order
to be 98% confident that their sample mean will be within 2 days of the true mean? Assume
that σ = 5 days.
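A sketch of the sample size calculation for this example, assuming the standard result n = (z_{α/2} σ / e)² rounded up to the next whole number. The function name is hypothetical.

```python
from math import ceil
from statistics import NormalDist


def required_sample_size(sigma, error, confidence):
    """Smallest n for which z * sigma / sqrt(n) does not exceed `error`."""
    alpha = 1.0 - confidence
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    return ceil((z * sigma / error) ** 2)


# Marketing example: sigma = 5 days, within 2 days, 98% confidence.
n = required_sample_size(sigma=5.0, error=2.0, confidence=0.98)
print(n)
```

Rounding up matters here: the unrounded value is a little under 34, and rounding down would not meet the 98% requirement.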
P(−t_{α/2} < T < t_{α/2}) = 1 − α
IENG385 9 - 25
IENG385 9 - 26
IENG385 9 - 27
Additional Example: A machine produces metal pieces that are cylindrical in shape. A sample of
pieces is taken, and the diameters are found to be 1.01, 0.97, 1.03, 1.04, 0.99, 0.98, 0.99, 1.01,
and 1.03 centimeters. Find a 99% confidence interval for the mean diameter of pieces from this
machine, assuming an approximately normal distribution.
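This example can be worked with the small-sample interval x̄ ± t_{α/2} s/√n with n − 1 degrees of freedom. The standard library has no t distribution, so the sketch below hard-codes the critical value t_{0.005} = 3.355 for 8 degrees of freedom as an assumed table lookup.

```python
from math import sqrt
from statistics import mean, stdev

# Diameters (cm) from the example.
diameters = [1.01, 0.97, 1.03, 1.04, 0.99, 0.98, 0.99, 1.01, 1.03]

# Assumption: t_{0.005} with 8 degrees of freedom, read from a t table.
T_0005_8DF = 3.355

n = len(diameters)
xbar = mean(diameters)
s = stdev(diameters)  # sample standard deviation (divisor n - 1)
half_width = T_0005_8DF * s / sqrt(n)
ci_lower, ci_upper = xbar - half_width, xbar + half_width
print(f"xbar = {xbar:.4f}, s = {s:.4f}")
print(f"{ci_lower:.4f} < mu < {ci_upper:.4f}")
```

The limits come out close to 0.978 and 1.033 cm, in line with the 99% confidence interval quoted in the case study later in the lecture.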
Concept of a Large-Sample Confidence Interval
• Statisticians recommend that even when normality cannot be assumed and σ is
unknown, s can replace σ provided n ≥ 30, and the confidence interval
¯x ± z_{α/2} s/√n may be used.
• The justification lies only in the presumption that with a sample as large as
30 and the population distribution not too skewed, s will be very close to the
true σ and thus the Central Limit Theorem prevails.
• This is only an approximation and the quality of the result becomes better as
the sample size grows larger.
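To see what "only an approximation" means in practice, one can estimate how often the large-sample interval actually covers the true mean. The sketch below is illustrative only: the exponential population (a deliberately skewed choice), n = 40, and the number of repetitions are all assumptions, not part of the slides.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean, stdev


def large_sample_coverage(n=40, reps=3000, confidence=0.95, seed=7):
    """Fraction of simulated intervals xbar +/- z * s / sqrt(n) that
    contain the true mean of an exponential population with rate 1."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    true_mean = 1.0  # mean of an exponential distribution with rate 1
    hits = 0
    for _ in range(reps):
        sample = [rng.expovariate(1.0) for _ in range(n)]
        half_width = z * stdev(sample) / sqrt(n)
        if abs(fmean(sample) - true_mean) < half_width:
            hits += 1
    return hits / reps


coverage = large_sample_coverage()
print(f"estimated coverage of the nominal 95% interval: {coverage:.3f}")
```

For a skewed population like this one the estimated coverage typically falls somewhat short of the nominal 95%, and it should move closer as n is increased.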
Section 9.6 Prediction Intervals
• The prediction interval provides a good estimate of the location of a future
observation, which is quite different from the estimate of the sample mean
value.
• It should be noted that the variation of this prediction is the sum of the
variation due to an estimation of the mean and the variation of a single
observation.
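The "sum of variations" shows up as the factor √(1 + 1/n) in the usual prediction interval for unknown σ, x̄ ± t_{α/2} s √(1 + 1/n). The sketch below applies it to the metal-piece diameters used elsewhere in this lecture; the critical value 3.355 (t_{0.005}, 8 degrees of freedom) is an assumed table lookup.

```python
from math import sqrt
from statistics import mean, stdev

# Diameters (cm) of the nine metal pieces from the earlier example.
diameters = [1.01, 0.97, 1.03, 1.04, 0.99, 0.98, 0.99, 1.01, 1.03]

# Assumption: t_{0.005} with 8 degrees of freedom, read from a t table.
T_0005_8DF = 3.355

n = len(diameters)
xbar = mean(diameters)
s = stdev(diameters)

# Variance of (new observation - xbar) is sigma^2 + sigma^2 / n,
# so the standard error is s * sqrt(1 + 1/n).
half_width = T_0005_8DF * s * sqrt(1.0 + 1.0 / n)
pi_lower, pi_upper = xbar - half_width, xbar + half_width
print(f"{pi_lower:.4f} < x0 < {pi_upper:.4f}")
```

The limits agree with the 99% prediction bounds 0.9186 and 1.0926 quoted in the case study to about three decimal places. The small difference comes from rounding x̄ and s before substituting.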
Use of Prediction Limits for Outlier Detection
• The majority of scientific investigators are keenly sensitive to the existence of
outlying observations or so-called faulty or “bad data.”
• The prediction interval produces a bound that “covers” a future single observation
with probability 1 − α if it comes from the population from which the sample was
drawn.
• As a result, for the prediction interval of Example 9.8, if a new pack of beef is
measured and its leanness is outside the interval (93.96, 98.44), that observation
can be viewed as an outlier.
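The outlier rule amounts to a simple range check against the prediction limits. A minimal sketch, using the interval (93.96, 98.44) from Example 9.8 as defaults; the function name and the two test readings are hypothetical.

```python
def is_outlier(value, lower=93.96, upper=98.44):
    """Flag a new observation that falls outside the prediction interval."""
    return not (lower < value < upper)


# Hypothetical leanness readings for two new packs of beef.
for reading in (96.0, 99.0):
    label = "outlier" if is_outlier(reading) else "consistent with the sample"
    print(f"{reading}: {label}")
```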
Section 9.7 Tolerance Limits
• Suppose that interest centers around the manufacturing of a component
part and specifications exist on a dimension of that part.
• But unlike in the scenario in Section 9.6, one may be less interested in a
single observation, and more interested in where the majority of the
population falls.
Tolerance Limits (cont.)
• One must attempt to determine bounds that, in probabilistic sense, “cover”
values in the population (i.e., the measured values of the dimension).
• A bound that covers the middle 95% of the population of observations is:
μ ± 1.96σ
Tolerance Limits (cont.)
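• However, in practice, μ and σ are seldom known; thus, the user must
apply ¯x ± ks, where k is chosen so that one can assert with 100(1 − γ)%
confidence that the limits contain at least the proportion 1 − α of the
measurements.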
• Table A.7 gives values of k for 1 − α = 0.90, 0.95, 0.99; γ = 0.05, 0.01; and
selected values of n from 2 to 300.
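As an illustration of tolerance limits of the form x̄ ± ks, the sketch below reuses the nine metal-piece diameters from this lecture. The factor k = 4.550 is an assumed table value for n = 9, γ = 0.01, and 1 − α = 0.95; check it against Table A.7 before relying on it.

```python
from statistics import mean, stdev

# Diameters (cm) of the nine metal pieces from the earlier example.
diameters = [1.01, 0.97, 1.03, 1.04, 0.99, 0.98, 0.99, 1.01, 1.03]

# Assumption: tolerance factor for n = 9, gamma = 0.01, 1 - alpha = 0.95.
K_FACTOR = 4.550

xbar = mean(diameters)
s = stdev(diameters)
tol_lower = xbar - K_FACTOR * s
tol_upper = xbar + K_FACTOR * s
print(f"tolerance limits: {tol_lower:.4f} to {tol_upper:.4f}")
```

These limits agree with the 99% tolerance limits 0.8937 and 1.1175 quoted in the case study to about three decimal places, and they are wider than both the confidence and the prediction limits for the same data.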
Distinction among Confidence Intervals, Prediction
Intervals, and Tolerance Intervals
• In the case of confidence intervals, one is attentive only to the population mean.
• For an engineering process that produces shearing pins, a specification will be set
on Rockwell hardness, below which a customer will not accept any pins.
• We should know where the majority of the values of Rockwell hardness are going to
be. Thus, tolerance limits should be used.
• Prediction intervals are applicable when it is important to determine a
bound on a single value.
• The mean is not the issue here, nor is the location of the majority of the
population.
Case study: the three types of limits can give different results even though all are 99% bounds.
• Confidence interval on the mean: 99% of such intervals cover the population mean diameter; that is, we
are 99% confident that the mean diameter produced is between 0.9781 and 1.0331 cm. The emphasis is on
the mean, not on a single reading or on the distribution of diameters in the population.
• Prediction limits: the bounds 0.9186 and 1.0926 are based on the distribution of a single “new” metal
piece taken from the process, and 99% of such limits will cover the diameter of a new measured piece.
• Tolerance limits: these give the engineer a sense of where the “majority,” here the central 95%, of the
diameters of measured pieces in the population reside. The 99% tolerance limits, 0.8937 and 1.1175, differ
from the other two bounds. If these bounds appear too wide to the engineer, this reflects negatively on
process quality. If, however, the bounds represent a desirable result, the engineer can conclude that a
majority (here 95%) of the diameters are in a desirable range.
Section 9.8 Two Samples: Estimating the Difference between Two Means
Section 9.12 Single Sample: Estimating the Variance
P(χ²_{1−α/2} < X² < χ²_{α/2}) = 1 − α
Example 9.72. The mean height of 24 undergraduate students studying industrial engineering in
a college is 170 centimeters, with a variance of 14 square centimeters. Assuming that the heights are
normally distributed, construct a 95% confidence interval for the population variance σ² of the student
heights in the college.
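A sketch of this calculation, assuming the usual interval (n − 1)s²/χ²_{α/2} < σ² < (n − 1)s²/χ²_{1−α/2} with n − 1 = 23 degrees of freedom. The two chi-squared critical values are hard-coded as assumed table lookups, and the function name is hypothetical.

```python
def variance_interval(s_squared, n, chi2_upper, chi2_lower):
    """Confidence interval for sigma^2 given the sample variance and the
    two chi-squared critical values for n - 1 degrees of freedom."""
    df = n - 1
    return df * s_squared / chi2_upper, df * s_squared / chi2_lower


# Assumptions: chi-squared table values for 23 degrees of freedom.
CHI2_0025_23DF = 38.076  # area 0.025 to the right
CHI2_0975_23DF = 11.689  # area 0.975 to the right

var_lower, var_upper = variance_interval(
    s_squared=14.0, n=24,
    chi2_upper=CHI2_0025_23DF, chi2_lower=CHI2_0975_23DF,
)
print(f"{var_lower:.3f} < sigma^2 < {var_upper:.3f}")
```

Note that the larger critical value produces the lower limit, because it appears in the denominator.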
Section 9.13 Two Samples: Estimating the Ratio of Two Variances
P[f_{1−α/2}(v1, v2) < F < f_{α/2}(v1, v2)] = 1 − α
Example 9.77. Construct a 98% confidence interval for
σ1/σ2 in Exercise 9.42 on page 315, where σ1 and σ2 are,
respectively, the standard deviations for the distances
traveled per liter of fuel by the Volkswagen and Toyota
mini-trucks.
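The data for Exercise 9.42 are not reproduced in these slides, so the sketch below only shows the shape of the calculation. It assumes the usual interval (s1²/s2²)/f_{α/2}(v1, v2) < σ1²/σ2² < (s1²/s2²)·f_{α/2}(v2, v1) and takes square roots to get limits for σ1/σ2. The function name is hypothetical, and the inputs are made-up round numbers, not the exercise data or real F-table values.

```python
from math import sqrt


def sd_ratio_interval(s1, s2, f_upper_v1_v2, f_upper_v2_v1):
    """Confidence limits for sigma1 / sigma2 from two sample standard
    deviations and the two upper-tail F critical values."""
    ratio = (s1 / s2) ** 2                 # s1^2 / s2^2
    var_lower = ratio / f_upper_v1_v2      # lower limit for sigma1^2 / sigma2^2
    var_upper = ratio * f_upper_v2_v1      # upper limit for sigma1^2 / sigma2^2
    return sqrt(var_lower), sqrt(var_upper)


# Hypothetical inputs chosen only to make the arithmetic easy to follow.
lower, upper = sd_ratio_interval(s1=2.0, s2=1.0,
                                 f_upper_v1_v2=4.0, f_upper_v2_v1=4.0)
print(f"{lower:.2f} < sigma1/sigma2 < {upper:.2f}")
```

For the actual exercise, substitute the two sample standard deviations and look up f_{0.01}(v1, v2) and f_{0.01}(v2, v1) for the 98% level.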
Lecture References
• Walpole RE, Myers R, Myers SL, Ye K, “Probability & Statistics for
Engineers & Scientists”, Global/9th ed., Pearson, (2016).