MAGA
ELEMENTARY
STATISTICS AND
PROBABILITY
Eto
hei On
WTA, 2. cr Oy
ayesLesson 17
HYPOTHESIS TESTING FOR A ME:
1. Perform hypothesis testing for a mean,
3 hours
module, exercise sheet, writing materials
iii
As a recall, selecting appropriate statistical test is one of the crucial moments in
hypothesis testing. Thus, at this point forward, we deal each of these statistical test here.
171. ¢ Test fora Mean
The ¢ test for a mean is a statistical test for the mean of a population. It is used to
answer questions on possible difference between the observed value (sample mean in this case)
and the hypothesized value (population mean in this case) per standard error of the given data.
Additionally, it can be only be used when population standard deviation ¢ is known.
Assumptions:
1. The sample is a random sample.
2. Either n 2 30 or the population is normally distributed if n <30.
X-u
The formula for z testis Zz
where: bg observed/computed sample mean
K hypothesized population mean
o population standard deviation
n sample size
Iitustrative Example 1.
Problem: SAT Tests. For a specific year, the average score on the NAT Math test was 515.
The variable is normally distributed, and the population standard deviation is 100. The same
superintendent in the previous example wishes to see if her students scored significantly below
the NAT Math average score. She randomly selected 36 student scores, as shown. At a= 0.10,
is there enough evidence to support the claim? Source: www.chacha.com
496 506 507 SOS 438 499 505 522 531 762 513 493 522 668 543 S19 349 506
519 516 714 517 S11 551 287 523 576 516 515 500 243 509 523 503 414 504
116Problem Question: Are the students’ scores significantly below the NAT Math average score?
Hypotheses:
‘Ho: The students’ scores are the same as the NAT Math average score, Hz w=515
‘Ih: The students’ scores significantly below the NAT Math average score. Hy: sr <515 (claim)
Critical Value: Since «= 0.10 and the test is a left-tailed test, the critical value is 1.28.
Refer to this arbitrary critical values of z in different «
fa =0.10, C9.
Hom = k J =0.05, CV.
[a= 0.01, C¥.--23
(2) Left-taited
(a= 0.05, CV. +1 65
Hem > k | q20.01, CV. 24233
Hou >t fis OV. =41.28
(b) Righ-taites
spp
(c) Two-tailed
‘Test Value:
In the problem, the population mean (515) population standard deviation (100) and sample
size (35) is given in the problem. We lack the sample mean,
Determine the sample mean first in case it is not given.
FADE _ 496450545224 +++ +504 _ 18,325 509008
1” 36 36
Since we had determined all the needed values, we can proceed to the z test for a mean,
__X-m _ 509.028-515 _
Bee O88
va N36
7Decision: Since the test value of -0.36 is not beyond
the critical value of —1.28, we fail to reject the null
hypothesis. In the figure on the left, it clearly shows
that the test value falls within the noncritical region,
clearly supporting our decision.
Conclusion: Therefore, the students” scores are the same as the NAT Math average score.
Mitustrative Example 2
Problem: Ages of Medical Doctors. A researcher believes that the mean age of medical doctors
in a large hospital system is older than the average age of doctors in the United States, which is
46. Assume the population standard deviation is 4.2 years. A random sample of 30 doctors from
the system is selected, and the mean age of the sample is 48.6. Test the claim at a= 0.05.
Source: American Averages.
Problem Question: Is the rrean age of medical doctors in a large hospital system is older than
the average age of doctors in the United States?
Fypatheses:
‘Hy: The mean age of medical doctors in a large hospital system is the same as the average
age of doctors in the United States
Hy "= 46
Hy: The mean age of medical doctors in a large hospital system is older than the average
age of doctors in the United States
Hy: > 46 (claim)
Critical Value: Since a = 0.05 and the test is a right-tailed test, the critical values are + 1.65.
Test Value:
In the problem, the saraple mean (48.6) the population mean (46) population standard
deviation (4.2), sample size (30) is given in the problem. Proceed to 2 test for a mean
43.39
Decision: Since the test value of +3.39 is
beyond the critical value of +1.65, we
reject the null hypothesis. Inthe figure on
the left, it clearly shows that the test value
falls within the critical region, clearly
supporting our decision.
Conclusion: Therefore, the mean age of medical doctors in a large hospital system is older than
the average age of doctors in the United States.
118Mlustrative Example 3
Problem: Cost of Rehabilitation. The Medical Rehabilitation Education Foundation reports
that the average cost of rehabilitation for stroke victims is $24,672. To see if the average cost
of rehabilitation is different at a particular hospital, a researcher selects a random sample of 35
stroke victims at the hospital and finds that the average cost of their rehabilitation is $26,343.
The standard deviation of the population is $3251. At a = 0.01, can it be concluded that the
average cost of stroke rehabilitation at a particular hospital is different from $24,672?
Source: Snapshot, USA TODAY
Problem Question: Is the average cost of stroke rehabilitation ata particular hospital different
fiom the average cost of rehabilitation for stroke victims of The Medical Rehabilitation
Education Foundation?
Hypotheses:
‘Hy: The average cost of stioke rehabilitation at a particular hospital is the same to the average
cost of rehabilitation for stroke victims of The Medical Rehab Education Foundation,
He: = 824,672
Hj: The average cost of stoke rehabilitation at a particular hospital is different to the average
cost of rehabilitation for stroke victims of The Medical Rehab Education Foundation,
Hur 41 2824,672 (claim)
Critical Value: Since «= 0.01 and the test is a two-tailed test, the critical values are +2.58
Test Value:
In the problem, the sample mean (26,343) the population mean (24,672) population
standard deviation (3251), sample size (35) is given in the problem. Proceed to z test for a mean,
X- _ 26,343-24,672
x 3251
vn 36
Decision: Since the test value of +3.04 is
beyond the critical value of +2.58, we
reject the null hypothesis. In the figure on
the lef, it clearly shows that the test value
falls within the critical region, clearly
supporting our decision.
a8 ° 4258-4204
Conclusion: Therefore, the average cost of stroke rehabilitation at a particular hospital is
different to the average cest of rehabilitation for stroke victims of The Medical Rehab
Education Foundation. It can also be gleaned that the average cost of stroke rehabilitation at a
particular hospital is more costly to the average cost of rehabilitation for stroke victims of The
Medical Rehabilitation Education Foundation.
11917.2. Test fora Mean
The f test for a mean is also a statistical test for the mean of a population. It is also
used to answer questions on possible difference between the observed value (sample mean in
this case) and the hypothesized value (population mean in this case) per standard error of the
given data. However, it is the test to be used when population standard deviation o is unknown,
thus, we use the sample standard deviation s.
Assumptions:
1. The sample is a random sample.
2. Either 1 > 30 or the population is normally distributed if m <30.
The formula for ¢test is
The degrees of freedom is d.f.="
where: x observed/computed sample mean
K hypothesized population mean
s sample standard deviation
n sample size
Mlustrative Example 4.
Problem: Hospital Infections. A medical investigation claims that the average number of
infections per week at a hospital in southwestem Pennsylvania is 16.3. A random sample of 10
weeks had a mean number of 17.7 infections. The sample standard deviation is 1.8. Is there
enough evidence to reject the investigator's claim at « = 0.05? Source: Based on information
obtained from Pennsylvania Health Care Cost Containment Council
Problem Question: Is the average number of infections per week at a hospital in southwestern
Pennsylvania is 16.3?
Hypotheses:
‘He: The average number of infections per week at a hospital in S.W. Pennsylvania is 16.3?
=163 (claim)
‘Hh: The average number of infections per week at a hospital in S.W. Peansylvania is not 16.3?
His #163
Critical Value: The testis two-tailed. Since «= 0.05 and df. = 10 ~ 1 =9, the ewv. is + 2.262,
Contdence
One aa D
«a Cwm S=
Se =
120Test Value:
In the problem, the sample mean (17.7), population mean (16.3) sample standard deviation
(1.8) and sample size (10) is given in the problem.
Since we had determined all the needed values, we can proceed to the f test for a mean,
Decision: Since the test value of 42.460 is,
beyond the critical values of +2.262, we
reject the null hypothesis. In the figure on ~°" "| Parl po
the left, it clearly shows that the test value a
falls within the critical region, clearly per
supporting our decision. ie t voautte
Conclusion: Therefore, the average number of infections per week at @ hospital in S.W.
Pennsylvania is more than 16.3.
Mlustrative Example 5.
Problem: Substitute Teachers’ Salaries. An educator claims that the average salary of
substitute teachers in school districts in Allegheny County, Pennsylvania is less than $60 per
day. A random sample of eight school districts is selected, and the daily salaries (in dollars) are
shown. Is there enough evidence to support the educator's claim at a= 0.102
DATA: 60 56 60 55 70 55 60 55 Source: Source: Pittsburgh Tribune-Review.
Problem Question: Is the average salary of substitute teachers in school districts in Allegheny
County, Pennsylvania is less than $60?
Hypotheses:
Hi: The average salary of substitute teachers in school districts in Allegheny County,
Pennsylvania is $60.
Ho: 1 =860
‘Hi, The average salary of substitute teachers in school districts in Allegheny County,
Pennsylvania is less than $60.
Hy: