Professional Documents
Culture Documents
Statistika1 - Minggu 5 - DTMI
Statistika1 - Minggu 5 - DTMI
STATISTIKA I
Manufacturer
Is there, on average, exactly
660ml of water in each
bottle?
Volume of Water = 𝟔𝟔𝟔𝟔𝟔𝟔 𝒎𝒎𝒎𝒎
Company Claim :
The manufacturer is
making a claim it wishes
THE ASSUMPTION
The 660 ml on the bottle is
If the data indicates the bottles are being filled assumed to be true.
properly, then we failed to reject the null
(failed to reject our assumption). HYPOTHESIS
𝑯𝑯𝟎𝟎 ∶ 𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽 = 𝟔𝟔𝟔𝟔𝟔𝟔𝟔𝟔𝟔𝟔
We are not saying we have proven the null. 𝑯𝑯𝟏𝟏 ∶ 𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽 ≠ 𝟔𝟔𝟔𝟔𝟔𝟔 𝒎𝒎𝒎𝒎
Just that our assumption has held up.
Example : Bottled Water
THE ASSUMPTION
The 660 ml on the bottle is
assumed to be true.
HYPOTHESIS
𝑯𝑯𝟎𝟎 ∶ 𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽 = 𝟔𝟔𝟔𝟔𝟔𝟔𝟔𝟔𝟔𝟔
𝑯𝑯𝟏𝟏 ∶ 𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽𝑽 ≠ 𝟔𝟔𝟔𝟔𝟔𝟔 𝒎𝒎𝒎𝒎
Let’s say you are laying on your bed while scrolling your smart phone.
Suddenly you encounter a sudden smell of smoke.
You know that may mean a serious fire is taking place. Or it could be nothing
serious; maybe someone burned trash.
Actual Condition
𝑵𝑵𝑵𝑵 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔 𝑭𝑭𝑭𝑭𝑭𝑭𝑭𝑭 𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺𝑺 𝑭𝑭𝑭𝑭𝑭𝑭𝑭𝑭
Conclusion
*In general, the real-world consequences of a Type II Error are much greater.
In this case, Type II error may mean loss of property or even lives.
A. Hypothesis Testing
Probability of Type I Error
𝛼𝛼 = 𝑃𝑃 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝐼𝐼 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 = 𝑃𝑃(𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟 𝐻𝐻0 𝑤𝑤𝑤𝑤𝑤𝑤𝑤 𝐻𝐻0 𝑖𝑖𝑖𝑖 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡)
The type I error probability is called the significance level, the α-error
or size of the test.
Suppose :
Size of sample 𝑛𝑛 = 10 𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠
Standard deviation of the burning rate
𝜎𝜎 = 2.5 𝑐𝑐𝑐𝑐/𝑠𝑠𝑠𝑠𝑠𝑠
The sample mean burning rate 𝑥𝑥̅ is observed
If 48.5 ≤ 𝑥𝑥̅ ≤ 51.5, we will not reject the null
hypothesis.
An Ejection Seat and a Rocket Motor
The z-values that correspond to the critical values 48.5 and 51.5 are
Therefore,
𝜶𝜶 = 𝑷𝑷 𝒛𝒛 < −𝟏𝟏. 𝟗𝟗𝟗𝟗 + 𝑷𝑷 𝒛𝒛 > 𝟏𝟏. 𝟗𝟗𝟗𝟗 = 𝟎𝟎. 𝟎𝟎𝟎𝟎𝟎𝟎𝟎𝟎 + 𝟎𝟎. 𝟎𝟎𝟎𝟎𝟎𝟎𝟎𝟎 = 𝟎𝟎. 𝟎𝟎𝟎𝟎𝟎𝟎𝟎𝟎
An Ejection Seat and a Rocket Motor
This implies that 5.74% of all random samples would lead to rejection of
the hypothesis 𝐻𝐻0 : 𝜇𝜇 = 50𝑐𝑐𝑐𝑐/𝑠𝑠𝑠𝑠𝑠𝑠 when the true mean burning rate is really
50cm/sec.
An Ejection Seat and a Rocket Motor
Acceptance region
Question!
1. What will happen if 𝑛𝑛 = 16, 𝑛𝑛 = 20, 𝑛𝑛 = 50?
2. What will happen if we are widening the acceptance region?
An Ejection Seat and a Rocket Motor
The z-values that correspond to the critical values 48.5 and 51.5 when
𝜇𝜇 = 52 are
48.5 − 52 51.5 − 52
𝑍𝑍1 = = −4.43 𝑍𝑍2 = = −0.63
2.5/√10 2.5/√10
Therefore,
𝜷𝜷 = 𝑷𝑷 −𝟒𝟒. 𝟒𝟒𝟒𝟒 ≤ 𝒁𝒁 ≤ −𝟎𝟎. 𝟔𝟔𝟔𝟔 = 𝑷𝑷 𝒁𝒁 ≤ −𝟎𝟎. 𝟔𝟔𝟔𝟔 − 𝑷𝑷 𝒁𝒁 ≤ −𝟒𝟒. 𝟒𝟒𝟒𝟒
= 𝟎𝟎. 𝟐𝟐𝟐𝟐𝟐𝟐𝟐𝟐 − 𝟎𝟎. 𝟎𝟎𝟎𝟎𝟎𝟎𝟎𝟎 = 𝟎𝟎. 𝟐𝟐𝟐𝟐𝟐𝟐𝟐𝟐
An Ejection Seat and a Rocket Motor
Question!
What will happen to 𝜷𝜷 − 𝒆𝒆𝒆𝒆𝒆𝒆𝒆𝒆𝒆𝒆 if 𝒏𝒏 = 𝟏𝟏𝟏𝟏, 𝒏𝒏 = 𝟐𝟐𝟐𝟐, 𝒏𝒏 = 𝟓𝟓𝟓𝟓?
A. Hypothesis Testing
Important Points :
1. The size of the critical region, and consequently the probability of
a type I error 𝛼𝛼, can always be reduced by appropriate selection
of the critical values.
2. Type I and type II errors are related. A decrease in the probability
of one type of error always results in an increase in the
probability of the other provided that the sample size 𝑛𝑛 does not
change.
3. An increase in sample size reduces 𝛽𝛽 provided that 𝛼𝛼 is held
constant.
4. When the null hypothesis is false, 𝛽𝛽 increases as the true value of
the parameter approaches the value hypothesized in the null
hypothesis. The value of 𝛽𝛽 decreases as the difference between
the true mean and the hypothesized value increases
A. Hypothesis Testing
A widely used procedure in hypothesis testing is to use a type I error or
significance level of 𝛼𝛼 = 0.05 . This value has evolved through
experience and may not be appropriate for all situations.
Power
The power of a statistical test is the probability of rejecting
the null hypothesis 𝐻𝐻0 when the alternative hypothesis is
true.
𝑷𝑷𝑷𝑷𝑷𝑷𝑷𝑷𝑷𝑷 = 𝟏𝟏 − 𝜷𝜷
(the probability of correctly rejecting a false null hypothesis)
𝑃𝑃-Value
The 𝑷𝑷-value is the smallest level of significance that would
lead to rejection of the null hypothesis 𝐻𝐻0 with the given
data.
𝑃𝑃-Value (Observed Significance level)
Consider the two-sided hypothesis test for burning rate
𝑯𝑯𝟎𝟎 : 𝝁𝝁 = 𝟓𝟓𝟓𝟓 𝒄𝒄𝒄𝒄 𝒑𝒑𝒑𝒑𝒑𝒑 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔
𝑯𝑯𝟏𝟏 : 𝝁𝝁 ≠ 𝟓𝟓𝟓𝟓 𝒄𝒄𝒄𝒄 𝒑𝒑𝒑𝒑𝒑𝒑 𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔𝒔
with 𝑛𝑛 = 16 and 𝜎𝜎 = 2.5 𝑐𝑐𝑐𝑐/𝑠𝑠𝑠𝑠𝑠𝑠. Suppose that the observed sample mean is
𝑥𝑥̅ = 51.3 𝑐𝑐𝑐𝑐/𝑠𝑠𝑠𝑠𝑠𝑠.
� < 𝟓𝟓𝟓𝟓. 𝟑𝟑
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯 = 𝟏𝟏 − 𝑷𝑷 𝟒𝟒𝟒𝟒. 𝟕𝟕 < 𝑿𝑿
Test Statistic :
� −𝝁𝝁𝟎𝟎
𝑿𝑿
𝒁𝒁𝟎𝟎 =
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯 𝝈𝝈/ 𝒏𝒏
Probability above 𝑧𝑧0 and
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯 𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯
probability below − 𝑧𝑧0 ,
Probability above 𝑧𝑧0 Probability above 𝑧𝑧0
𝑃𝑃 = 2[1 − Φ 𝑧𝑧0 ]
𝑃𝑃 = 1 − Φ 𝑧𝑧0 𝑃𝑃 = Φ 𝑧𝑧0
Rejection Criterion for Fixed-
Rejection Criterion for Rejection Criterion for
Level Tests
Fixed-Level Tests Fixed-Level Tests
𝑧𝑧0 > 𝑧𝑧𝛼𝛼 or 𝑧𝑧0 < −𝑧𝑧𝛼𝛼 𝑧𝑧0 > 𝑧𝑧𝛼𝛼 𝑧𝑧0 < −𝑧𝑧𝛼𝛼
2 2
Example : Propellant Burning Rate
𝑧𝑧0 > 𝑧𝑧𝛼𝛼 or 𝑧𝑧0 < −𝑧𝑧𝛼𝛼 , where 𝑧𝑧𝛼𝛼 = 𝑧𝑧0.025 = 1.96
2 2 2
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯
𝑃𝑃 = 2 1 − Φ 𝑧𝑧0 = 2 1 − Φ 3.25 = 0.0012
Because of 𝑧𝑧0 = 3.25 > 𝑧𝑧0.025 = 1.96, then we reject 𝐻𝐻0 = 50 at the 0.05
level of significance.
Example : Propellant Burning Rate
Interpretation:
We conclude that the mean burning rate differs from 50
centimeters per second, based on a sample of 25
measurements. In fact, there is strong evidence that the
mean burning rate is not 50 centimeters per second.
Sample Size
Probability of a Type II Error for a Two-Sided Test on the Mean, Variance Known
𝜇𝜇 − 𝜇𝜇0 𝑛𝑛 𝜇𝜇 − 𝜇𝜇0 𝑛𝑛
𝛽𝛽 = Φ 𝑧𝑧𝛼𝛼/2 − − Φ −𝑧𝑧𝛼𝛼/2 −
𝜎𝜎 𝜎𝜎
𝑧𝑧𝛼𝛼 + 𝑧𝑧𝛽𝛽 2 𝜎𝜎 2
2
𝑛𝑛 ≈ 2
𝜇𝜇 − 𝜇𝜇0
2
𝑧𝑧𝛼𝛼 + 𝑧𝑧𝛽𝛽 𝜎𝜎 2
𝑛𝑛 ≈
𝜇𝜇 − 𝜇𝜇0 2
C. Tests on the Mean of a Normal Distribution, Variance Unknown
Test Statistic
� − 𝝁𝝁𝟎𝟎
𝑿𝑿
𝒕𝒕𝟎𝟎 =
𝑺𝑺/ 𝒏𝒏
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯
Probability above 𝑡𝑡0 and
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯 𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯
probability below − 𝑡𝑡0 ,
Probability above 𝑧𝑧0 Probability above 𝑧𝑧0
Rejection Criterion for Fixed-
Rejection Criterion for Rejection Criterion for
Level Tests
Fixed-Level Tests Fixed-Level Tests
𝑡𝑡0 > 𝑡𝑡𝛼𝛼,𝑛𝑛−1 or 𝑡𝑡0 < −𝑡𝑡𝛼𝛼,𝑛𝑛−1 𝑡𝑡0 > 𝑡𝑡𝛼𝛼,𝑛𝑛−1 𝑡𝑡0 > 𝑡𝑡𝛼𝛼,𝑛𝑛−1
2 2
Example : Golf Club Design
The increased availability of light materials with high strength has
revolutionized the design and manufacture of golf clubs, particularly
drivers. Clubs with hollow heads and very thin faces can result in much
longer tee shots, especially for players of modest skills. This is due
partly to the “spring-like effect” that the thin face imparts to the ball.
Firing a golf ball at the head of the club and measuring the ratio of the
ball’s outgoing velocity to the incoming velocity can quantify this
spring-like effect. The ratio of velocities is called the coefficient of
restitution of the club. An experiment was performed in which 15
drivers produced by a particular club maker were selected at random
and their coefficients of restitution measured. In the experiment, the
golf balls were fired from an air cannon so that the incoming velocity
and spin rate of the ball could be precisely controlled. It is of interest
to determine whether there is evidence (with 𝛼𝛼 = 0.05) to support a
claim that the mean coefficient of restitution exceeds 0.82. The
observations follow:
Example : Golf Club Design
Interpretation :
There is strong evidence
to conclude that the mean
coefficient of restitution
exceeds 0.82.
E. Tests on a Population Proportion
Test Statistic :
𝑿𝑿 − 𝒏𝒏𝒑𝒑𝟎𝟎
𝒁𝒁𝟎𝟎 =
𝒏𝒏𝒑𝒑𝟎𝟎 (𝟏𝟏 − 𝒑𝒑𝟎𝟎 )
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯
Probability above 𝑧𝑧0 and
𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯 𝑷𝑷 − 𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯𝐯
probability below − 𝑧𝑧0 ,
Probability above 𝑧𝑧0 Probability above 𝑧𝑧0
𝑃𝑃 = 2[1 − Φ 𝑧𝑧0 ]
𝑃𝑃 = 1 − Φ 𝑧𝑧0 𝑃𝑃 = Φ 𝑧𝑧0
Rejection Criterion for Fixed-
Rejection Criterion for Rejection Criterion for
Level Tests
Fixed-Level Tests Fixed-Level Tests
𝑧𝑧0 > 𝑧𝑧𝛼𝛼 or 𝑧𝑧0 < −𝑧𝑧𝛼𝛼
2 2 𝑧𝑧0 > 𝑧𝑧𝛼𝛼 𝑧𝑧0 < −𝑧𝑧𝛼𝛼
Example :
Example : Automobile Engine Controller
Exercise 1
Exercise 3
Assignment 1
Assignment 2
Cloud seeding has been studied for many decades as a weather modification procedure
(for an interesting study of this subject, see the article in Technometrics, “A Bayesian
Analysis of a Multiplicative Treatment Effect in Weather Modification,” 1975, Vol. 17, pp. 161–
166). The rainfall in acre-feet from 20 clouds that were selected at random and seeded
with silver nitrate follows:
18.0, 30.7, 19.8, 27.1, 22.3, 18.8, 31.8, 23.4, 21.2, 27.9, 31.9, 27.1, 25.0, 24.7, 26.9, 21.8, 29.2,
34.8, 26.7, and 31.6.
a. Can you support a claim that mean rainfall from seeded clouds exceeds 25 acre-
feet? Use α = 0.01. Find the P-value.
b. Check that rainfall is normally distributed.
c. Compute the power of the test if the true mean rainfall is 27 acre-feet.
d. What sample size would be required to detect a true mean rainfall of 27.5 acre-feet
if you wanted the power of the test to be at least 0.9?
e. Explain how the question in part (a) could be answered by constructing a one-sided
confidence bound on the mean diameter.
Assignment 3
It is believed that at least 60% of the residents in a certain
area favor an annexation suit by a neighboring city. What
conclusion would you draw if only 110 in a sample of 200
voters favored the suit? Use a 0.05 level of significance.