Professional Documents
Culture Documents
Inferential Statistics
Glyzel Grace M. Francisco
2nd Semester, 2022-2023
Point
Estimation
Estimation
Inferential Interval
Estimation
Statistics Hypothesis
Testing
Estimation of Parameter
to determine the approximate value of a population
parameter on the basis of a sample statistic
Test of Hypothesis
to make decisions regarding unknown population
parameter values based on sample data
The point estimate for the population mean month salary is 21,104.
𝑥 135 − 12
𝑝ො = = = 𝟎. 𝟗𝟏𝟏𝟏 𝑜𝑟 𝟗𝟏. 𝟏𝟏%
𝑛 135
the point estimate for the proportion of all PhD graduates who find
a job is 91.11%.
Error of Estimation
• difference between the estimate and the true value of the
parameter
Margin of Error
• also called the maximum error of the estimate, is the maximum
likely difference between the point estimate of a parameter and
the actual value of the parameter
(1-𝜶)
𝜶 𝜶/2 𝒁𝜶 /𝟐
100%
90% 0.10 0.05 1.645
95% 0.05 0.025 1.96
99% 0.01 0.005 2.575
Note: Get the average of the two z-scores if the probability is exactly at the
middle of the two probabilities. Otherwise, use the closer one.
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 19
DEPARTMENT of
STATISTICS Student t Distribution
A t-distribution is like a Z distribution, except has slightly fatter tails
to reflect the uncertainty added by estimating 𝜎
As the sample size gets larger, the Student t distribution gets closer to the
standard normal distribution.
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 23
DEPARTMENT of
STATISTICS Interval Estimation: z Critical Value
As n increases, the t approaches to z
Two-tailed
𝒁0.05/2
Note: s = 2733.3707
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 30
DEPARTMENT of
STATISTICS Confidence Interval of the Mean
22,130 18,465 25,616 22,440 19,869
n=5, 𝑥=21704,
ҧ 𝑠=2733.3707, 𝛼=0.05, 𝝈 is unknown
95% CI, 𝜶 = 𝟎. 𝟎𝟓, 𝒕𝟎.𝟎𝟓,𝒅𝒇=𝒏−𝟏=𝟒 = 𝟐. 𝟕𝟕𝟔 (see slide 32)
𝟐
𝑠
𝑥ҧ − 𝐸 < 𝜇 < 𝑥ҧ + 𝐸 , 𝑤ℎ𝑒𝑟𝑒 𝐸 = 𝑡𝛼 ,𝑛−1
2 𝑛
𝑠 2733.3707
𝐸 = 𝑡0.05 = 2.776 = 𝟑𝟑𝟗𝟑. 𝟑𝟖𝟑𝟗
2
,4 𝑛 5
21704 − 3393.3839 < 𝜇 < 21704 + 3393.3839
𝟏𝟖, 𝟐𝟏𝟎. 𝟔𝟏𝟔𝟏 < 𝝁 < 𝟐𝟓, 𝟎𝟗𝟕. 𝟑𝟖𝟑𝟗 𝑜𝑟 𝟏𝟖, 𝟐𝟏𝟎. 𝟔𝟏𝟔𝟏, 𝟐𝟓, 𝟎𝟗𝟕. 𝟑𝟖𝟑𝟗
We are 95% confident that the true value of the population mean entry level
monthly salary of random employees falls between 18,310.6161 and 25,097.3839.
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 31
DEPARTMENT of
STATISTICS Confidence Interval of the Mean
Alpha = 0.05
Tail = two-tailed
𝑝Ƹ 𝑞ො 𝑝Ƹ 𝑞ො
𝑝Ƹ − 𝑧𝛼 < 𝑝 < 𝑝Ƹ − 𝑧𝛼/2
2 𝑛 𝑛
Remarks:
1. p = population proportion and 𝑝Ƹ = sample proportion
𝑥
2. For a sample proportion, 𝑝Ƹ = and 𝑞ො = 1 − 𝑝Ƹ where X is the number of
𝑛
sample units that possess the characteristics of the interest and n is the
sample size.
3. 𝑛𝑝Ƹ and 𝑛𝑞ො are each greater than or equal to 5.
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 33
DEPARTMENT of
STATISTICS Confidence Interval for a proportion
Example: A survey was conducted with 1404 respondents and found that 323
students paid for their education by student loans. Find the 90% confidence interval
of the true proportion of students who paid for their education by student loans.
𝑥 323 (See slide 23)
𝑝ො = = = 𝟎. 𝟐𝟑, 𝑞ො = 1 − 𝑝ො = 1 − 0.23 = 𝟎. 𝟕𝟕, 𝑧𝛼=0.10/2 = 1.645
𝑛 1404
𝑝Ƹ 𝑞ො
𝑝Ƹ − 𝐸 < 𝑝 < 𝑝Ƹ + 𝐸 , 𝑤ℎ𝑒𝑟𝑒 𝐸 = 𝑍𝛼/2
𝑛 We are 90% confident
that the true
𝑝Ƹ 𝑞ො (0.23)(0.77) proportion of students
𝐸 = 𝑍0.10/2 = 1.645 = 𝟎. 𝟎𝟏𝟖𝟓
𝑛 1404 who paid for heir
0.23 − 0.0185 < 𝜇 < 0.23 + 0.0185 education by student
𝟎. 𝟐𝟏𝟏𝟓 < 𝝁 < 𝟎. 𝟐𝟒𝟖𝟓 𝑜𝑟 loans falls between
𝟎. 𝟐𝟏𝟏𝟓, 𝟎. 𝟐𝟒𝟖𝟓 𝑜𝑟 𝟐𝟏. 𝟏𝟓%, 𝟐𝟒. 𝟖𝟓% 21.15% and 24.85%
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 34
DEPARTMENT of
STATISTICS Confidence Interval for Variances and Standard Deviations
Example: Find the 95% confidence interval for the variance and
standard deviation of the nicotine content of cigarettes
manufactured if a random sample of 20 cigarettes has a standard
deviation of 1.6 milligrams. Assume the variable is normally
distributed.
2
The critical value for 1−𝛼 with df=19 is
2
(𝑛 − 1)𝑠 2 2 <
(𝑛 − 1)𝑠 2
< 𝜎
2𝛼/2 1−𝛼/2
2
(𝑛 − 1)𝑠 2 (𝑛 − 1)𝑠 2
<𝜎<
𝛼/2
2
1−𝛼/2
2
• Quite often you ask, How large a sample is necessary to make an accurate
estimate?
• The answer is not simple, since it depends on three things: the margin of
error, the population standard deviation, and the degree of confidence.
• For example, how close to the true mean do you want to be (2 units, 5
units, etc.), and how confident do you wish to be (90, 95, 99%, etc.)
Note: If there is any fraction or decimal portion in the answer, use the next whole
number for sample size n.
GGMFRANCISCO INTRODUCTION TO INFERENTIAL STATISTICS | 42
DEPARTMENT of
STATISTICS Sample Size Determination
Example: A scientist wishes to estimate the average depth of a river. He
wants to be 99% confident that the estimate is accurate within 2 feet. From a
previous study, the standard deviation of the depths measured was 4.33
feet.
𝑍0.01/2 = 2.575, 𝜎 = 4.33, 𝐸=2
2 2
𝑍𝛼/2 ∙ 𝜎 (2.575)(4.33)
𝑛= = = 31.2 ≈ 𝟑𝟐
𝐸 2
Therefore, to be 99% confident that the estimate is within 2 feet of the true
mean depth, the scientist needs a sample of at least 32 measurements.
Note: If there is any fraction or decimal portion in the answer, use the next whole
number for sample size n.