Professional Documents
Culture Documents
Statistical Methods
and Its Applications
Course Learning Objectives (CLO)
CLO1: Determine the parameter estimation; point and interval estimate for parameter mean, proportion
and variance. (C3, PLO1, MQF1)
CLO2: Analyze the problems that involved in estimating and hypothesis testing of real-life situations. (A3,
PLO3,MQF3)
CLO3: Analyze the problems using suitable methods; z-test, t-test, ANOVA, chi-square, regression and
correlation. (A3, PLO2,MQF2)
PARAMETER
ESTIMATION
(1.1)
3. The estimator should be a relatively efficient estimator. That is, of all the
statistics that can be used to estimate a parameter the relatively efficient
estimator has the smallest variance.
POINT AND INTERVAL ESTIMATES
1) A point estimate
• Definition : the value of a sample statistics that is used to estimate a population
parameter.
• Thus, the best point estimate of the population mean is the sample mean x .
Example 1.1 :
• Suppose that a college president wishes to estimate the average age of students
attending classes this semester. The president could select a random sample of
100 students and find the average age of these students, say, 22.3 years. This
type of estimate is called a point estimate.
• We use sample mean to estimate the population mean because the means of
samples vary less than other statistics (such as median and mode) when many
samples are selected from the same population. Therefore, the sample mean is
the best estimate of the population mean other than other measures of central
tendency.
2) An Interval estimate
• Definition:
An interval estimate of a parameter is an interval or a range of values used to
estimate the parameter. This estimate may or not contain the value of the
parameter being estimated.
• Example 1.2 : An interval estimate for the average age of all students might be
21.9 22.7 and 22.3 0.4 years.
•The question arises, what number should we subtract from and add a point
estimate to obtain an interval estimate? The answer to this question depend on
two considerations:
• Second, the quantity subtracted and added must be larger if we want to have
a higher confidence in our interval. We always attach a probabilistic statement
to the interval estimation which is given by confidence interval level. An interval
that is constructed based on this confidence interval is called confidence
interval.
2) Confidence Interval:
• A confidence interval is a specific interval estimate of a parameter
determined by using data obtained from a sample and by using the
specific confidence level of the estimate.
deviation , is not known, then we use the sample standard deviation, s, for .
. Consequently, we use s x s .
n
The value of z used here is read from the standard normal distribution table for
the given confidence level.
MAXIMUM ERROR OF ESTIMATE FOR .
• The term z 2 is called the maximum error of estimate. For a
n
specific value say, 0.05 , 95% of the sample means will fall within this
error value on either side of the population mean.
• Definition :
Maximum error of estimate is the maximum likely difference
between the point estimate of a parameter and the actual value of
the parameter.
How to find the z-value from the standard normal distribution table?
z
-1.96 0 1.96
• For a (1 )100% confidence level, the area between –z and z is 1
because the total area under the normal curve is 1.0 the total area under the
curve in the two tails is . This, as mentioned earlier, is called the significance
level.
• For a 95% confidence level, 1 0.95 0.05 . Therefore the area under the
curve in each of the two tails is 2 . The value of z associated with (1 )100%
confidence level is sometimes denoted by z 2 .
2 2
1
z
-z z
EXAMPLE 1.3:
The president of a large university wished to estimate the average age of the
students presently enrolled. From past studies, the standard deviation is
known to be 2 years. A sample of 50 students is selected, and the mean is
found to be 23.2 years. Find the 95% confidence interval of the population
mean.
SOLUTION:
Since the 95% confidence interval is desired, z 1.96 . Hence,
2
substituting in the formula
X z
2 n
X z X z
2 n 2 n
2 2
23.2 1.96 23.2
50 50
23.2 0.6 23.2 0.6
22.6 23.8
Or 23.2 0.6 years. Hence, the president can say, with 95% confidence, that
the average age of the students is between 22.6 and 23.8 years, based on
50 students.
EXERCISES 1.1
3. Find each.
a) z 2 for the 98% confidence interval
b) z 2 for the 90% confidence interval
c) z 2 for the 94% confidence interval
INTERVAL ESTIMATION OF A POPULATION MEAN: SMALL SAMPLES
1. It is bell-shaped.
2. It is symmetric about the mean.
3. The mean, median, and mode are equal to 0 and are located at the center
of the distribution.
4. The curve never touches the x-axis.
Find the value of t for 16 degrees of freedom and 0.05 area in the right tail of
a t-distribution curve.
SOLUTION:
0 1.746
df=16
The value of t for 16 df and
0.05
0.05 area in the left tail
1.746 0
CONFIDENCE INTERVAL FOR USING THE t-Distribution
s
X t Or X ts x sx
s
n Where
n
Ten randomly selected automobiles were stopped and the tread depth of the
right front tire was measured. The mean was 0.32 inch, and the standard
deviation was 0.08 inch. Find the 95% confidence interval of the mean depth.
Assume that the variable is approximately normally distributed.
SOLUTION:
Since is unknown and must s must replace it, the t-distribution must be
used for 95% confidence interval. Hence, with 9 degrees of freedom, t 2 2.262
s s
X t X t
2 n 2 n
0.08 0.08
0.32 2.262 0.32 2.262
10 10
0.08
0.32 0.057 0.32 2.262
10
0.26 0.38
Therefore, one can be 95% confident that the population mean thread depth of
all right front tires is between 0.26 and 0.38 inch based on a sample of 10 tires.
EXERCISES 1.2
1. When should the t distribution be used to find a confidence interval for the
mean?
Find the 98% confidence interval for the cigarette tax in all 50 states.