Professional Documents
Culture Documents
Statistical Estimation
CHAPTER CONTENTS
CHAPTER CONTENTS
5.1 Introduction 220
5.2 The Methods of Finding Point Estimators 221
5.3 Some Desirable Properties of Point Estimators 245
5.4 A Method of Finding the Confidence Interval: Pivotal Method 261
5.5 One Sample Confidence Intervals 269
5.6 A Confidence Interval for the Population Variance 284
5.7 Confidence Interval Concerning Two Population Parameters 289
5.8 Chapter Summary 298
5.9 Computer Examples 299
Projects for Chapter 5 303
5.1 Introduction
To estimate:
point estimation
interval estimation
1000 $
(700, 1200)
5.2The Methods of Finding Point Estimators
X1, . . ., Xn independent and identically distributed
(iid) random variables (in statistical
language, a random sample)
f (x, 1, . . ., l) Pdf or pmf of the population(?)
(1, . . ., l) the unknown population parameters
Point estimation :
to determine statistics gi(X1, . . ., Xn), i = 1, . . ., l,
which can be used to estimate the value of each of the
parameters
Estimator of i : i 1,, l
ˆ g X ,, X
i i 1 n
Capital letters such as X and S2 to represent the estimators;
Lowercase letters such as x and s2 to represent the estimates.
likelihood
Bayes’ method Chapter 11
Unbiased
An estimator, ˆ, is unbiased if the mean of its sampling distribution is the parameter .
Bias
consistency The estimator are said to satisfy the consistency property if the sample estimator has a high
E ˆ
probabilityBofbeing to the population value for a large sample size.
close
efficiency smaller variance
5.2.1 THE METHOD OF MOMENTS
k = E[(X - )k]
5.2.2 THE METHOD OF MAXIMUM LIKELIHOOD
Even though the method of moments is intuitive and easy to apply, it usually does
not yield “good” estimators.
For most cases of practical interest, the performance of MLEs is optimal for large
enough data.
This is one of the most versatile methods for fitting parametric statistical models
to data.
Maximum likelihood estimates give the parameter values for which the
observed sample is most likely to have been generated.
At times, the MLEs may be hard to calculate. It may be necessary to use numerical methods to
approximate values of the estimate.
5.3 Some Desirable Properties of Point Estimators
5.3.1 UNBIASED ESTIMATORS
The sample mean is always an unbiased estimator of the population mean.
Sample variance
Population variance:
Size of population = N
N
(1 / N ) ( X i )
Elements of population: X1, X2, ,… , XN
2 2
i 1
Unbiased estimators need not be unique.
If we have two unbiased estimators, there are infinitely many unbiased estimators.
It is better to have an estimator that has low bias as well as low variance.
For unbiased estimators,
5.3.2 SUFFICIENCY*
Skipped
5.4 A Method of Finding the Confidence Interval: Pivotal Method
Interval
Purpose: estimation
to have some degree of confidence of securing the true parameter.
Desirable properties: (1) P(L<<U) is high, that is, the true parameter is in (L, U) with high probability,
and (2) the length of the interval (L, U) should be relatively narrow on the average.
Thus, our objective is to find a narrow interval with high probability of enclosing the
true parameter, .
Randomness: For an interval estimator of a single parameter , we will use the random sample to
find two quantities L and U such that L< <U with some probability. Because L and U
depend on the sample values, they will be random.
2.5% 95% 2.5%
? 13.8 ?
The previous example (Example 5.4.1) was built on our knowledge of the
sampling distribution of the sample mean.
Suppose that = (X) is a point estimate of y, and let p(, ) be the pivotal
quantity.
Now, for a given value of α, (0<α<1), and constants a and b, with
(a<b), let
Pivot: z transform of
Pivot : Z
/2 /2
FIGURE 5.5 Probability density of the pivot.
The pivotal method may not be applicable in all situations.
The meaning of 95% C. I.
In an infinite series of trials in which repeated samples of size n are drawn from the same
population and 95% confidence intervals for m are calculated by the same method for
each of the samples, the proportion of intervals that actually include will be 95%.
Pivot : Z
/2 /2
5.5.1 LARGE SAMPLE CONFIDENCE INTERVALS
If the sample size is large, then by the Central Limit Theorem, certain sampling
distributions can be assumed to be approximately normal.
/ /
2 2
It is correct to say “We are 95% confident that the true mean will lie in the interval (74.1, 79.8).”
5.5.2 CONFIDENCE INTERVAL FOR PROPORTION, p
Question: “How do we determine the sample size that we have is sufficient
for the normal approximation that is used in the foregoing formula?”
“The CNN/USA Today/Gallup poll of 818 registered voters taken on June 27-30 showed that
if the election were held now, the president would beat his challenger 52% to 40%, with 8%
undecided. The poll had a margin of error of plus or minus 4%.”
Q: What is the minimum number n to attain a 5% margin of error under 95% confidence level?
Q: If 500 citizens were polled, what will be the margin of error under 95% confidence level.
The margin of error is nothing but a confidence interval.
2.5% 95% 2.5%
P = 50% minimal n
Formula-2
P = 50% maximal n
d: margin of error
Formula-3
Source: https://www.google.com.tw/url?
sa=i&rct=j&q=&esrc=s&source=images&cd=&cad=rja&uact=8&ved=2ahUKEwjS
6Iy20PfdAhUKOrwKHRvyCMsQjRx6BAgBEAU&url=http%3A%2F
%2Fwww.newworldencyclopedia.org%2Fentry
%2FDendrite&psig=AOvVaw1BZibMSRVu2svJadTOUX2Q&ust=15391149663295
90
5.5.3 SMALL SAMPLE CONFIDENCE INTERVALS FOR
The problem: To find a confidence interval for the true mean of a normal
population when the variance 2 is unknown and the sample size is small (< 30).
For n small (n<30) and 2 unknown, we can use the following theorem.
.
In practice, the first step in the previous procedure should include a test of normality.
Even when the data fail the normality test, most statistical software will produce a
confidence interval based on normality or give an error report.
In those cases, nonparametric methods (Chapter 12) such as the Wilcoxon rank sum
method or bootstrap methods (Chapter 13) will be more appropriate.
5.6 A Confidence Interval for the Population Variance
Note: Normal
population!
Note: Normal
population!
Note: Normal
population!
5.7 Confidence Interval Concerning Two Population Parameters