1-Hypothesis Testing

Hypothesis Testing
Niniet Indah Arvitrida, ST, MT

niniet@ie.its.ac.id
SepuluhNopember Institute of Technology
INDONESIA
2008
A bit concept review...
Statistics is ...
Population is ...
Sample is ...
Variable is ...
Parameter is ...
Relation between statistics & parameters ...
Difference between statistics and probability
Statistics assumptions :
Population and its parameters are (unknown / known)
Sample is used to ..... parameters
Course 1 Objectives
Students able to
Formulate null and alternative hypotheses for
applications involving a single population mean or
proportion
 Formulate a decision rule for testing a hypothesis
 Know how to use the test statistic, critical value, and p-

value approaches to test the null hypothesis
 Know what Type I and Type II errors are
 Compute the probability of a Type II error
Review of Estimating
Population Value
Point and Interval Estimates
 A point estimate is a single number,

 a confidence interval provides additional
information about variability
Lower Upper
Confidence Confidence
Point Estimate
Limit Limit
Width of
confidence interval
Point Estimates
We can estimate a with a Sample

Population Parameter … Statistic
(a Point Estimate)
Mean μ x
Proportion p p
Confidence Intervals
 How much uncertainty is associated with a point
estimate of a population parameter?
 An interval estimate provides more information

about a population characteristic than does a
point estimate
 Such interval estimates are called confidence

intervals
Estimation Process
Random Sample I am 95%

confident that
Population μ is between
Mean 40 & 60.
(mean, μ, is x = 50
unknown)
Sample
Confidence Level
Confidence Level
Confidence in which the interval
will contain the unknown
population parameter
A percentage (less than 100%)
Confidence Interval for μ
(σ Known)
Assumptions
Population standard deviation σ is known
Population is normally distributed
If population is not normal, use large sample
Confidence interval estimate
σ
x  z α/2
n
Finding the Critical Value
Consider a 95% confidence interval: z α/2  1.96
1    .95
α α
 .025  .025
2 2
z units: z.025= -1.96 0 z.025= 1.96

Lower Upper
x units: Confidence Point Estimate Confidence
Limit Limit
Margin of Error
Margin of Error (e): the amount added and
subtracted to the point estimate to form the
confidence interval
Example: Margin of error for estimating μ, σ known:
σ σ
x  z /2 e  z /2
n n
Factors Affecting Margin of
Error
σ
e  z /2
n
Data variation, σ : e as σ
Sample size, n : e as n
Level of confidence, 1 -  : e if 1 - 
Chap 7-15
Example
 A sample of 11 circuits from a large normal
population has a mean resistance of 2.20 ohms.
We know from past testing that the population
standard deviation is .35 ohms.
 Determine a 95% confidence interval for the true

mean resistance of the population.
Chap 7-16
Example
(continued)
 A sample of 11 circuits from a large normal
population has a mean resistance of 2.20 ohms.
We know from past testing that the population
standard deviation is .35 ohms.
σ
Solution: x  z /2
n
 2.20  1.96 (.35/ 11 )
 2.20  .2068
1.9932 .......... ..... 2.4068
Chap 7-17
Interpretation
 We are 95% confident that the true mean resistance is
between 1.9932 and 2.4068 ohms
 Although the true mean may or may not be in this
interval, 95% of intervals formed in this manner will
contain the true mean
 An incorrect interpretation is that there is 95% probability that this

interval contains the true population mean.
(This interval either does or does not contain the true mean, there is
no probability for a single interval)
Chap 7-18
Student’s t Distribution
Note: t z as n increases
Standard
Normal
(t with df = )
t (df = 13)
t-distributions are bell-
shaped and symmetric, but
have ‘fatter’ tails than the t (df = 5)
normal
0 t
Chap 7-19
Approximation for Large Samples
Since t approaches z as the sample size increases,
an approximation is sometimes used when n  30:
Technically Approximation
correct for large n
s s
x  t /2 x  z /2
n n
Chap 7-20
Now we talk about
HYPOTHESIS
What is a Hypothesis?
A hypothesis is a claim
(assumption) about a
population parameter:
population mean
Example: The mean monthly cell phone bill
of this city is  = $42
population proportion
Example: The proportion of adults in this
city with cell phones is p = .68
The Null Hypothesis, H0
States the assumption (numerical) to be
tested
Example: The average number of TV sets in U.S.
Homes is at least three (
H0 : μ  3 )
Is always about a population parameter,
not about a sample statistic
H0 : μ  3 H0 : x  3
The Null Hypothesis, H0
(continued)
Begin with the assumption that the null

hypothesis is true
Similar to the notion of innocent until
proven guilty
Always contains “=” , “≤” or “” sign
May or may not be rejected
The Alternative Hypothesis, HA
Is the opposite of the null hypothesis
e.g.: The average number of TV sets in U.S. homes
is less than 3 ( HA:  < 3 )
Never contains the “=” , “≤” or “” sign
May or may not be accepted
Is generally the hypothesis that is believed (or
needs to be supported) by the researcher
Hypothesis Testing Process
Claim: the
population
mean age is 50.
(Null Hypothesis:
Population
H0:  = 50 )
Now select a
random sample
Is x  20 likely if  = 50?
If not likely, Suppose
the sample
REJECT mean age Sample
Null Hypothesis is 20: x = 20
Reason for Rejecting H0
Sampling Distribution of x
x
20  = 50
If H0 is true
If it is unlikely that ... then we
we would get a reject the null
sample mean of ... if in fact this were hypothesis that
this value ... the population mean…  = 50.
Level of Significance, 
Defines unlikely values of sample statistic if null
hypothesis is true
Defines rejection region of the sampling
distribution
Is designated by  , (level of significance)
Typical values are .01, .05, or .10
Is selected by the researcher at the beginning
Provides the critical value(s) of the test
Level of Significance
and the Rejection Region
Level of significance =  Represents
critical value
H0: μ ≥ 3  1-α
Rejection
HA: μ < 3 Lower tail test 0 region is
shaded
H0: μ ≤ 3 
1-α
HA: μ > 3
Upper tail test 0
H0: μ = 3 /2 /2

1-α
HA: μ ≠ 3
Two tailed test 0
Errors in Making Decisions
Type I Error
Reject a true null hypothesis
Considered a serious type of error
The probability of Type I Error is 

Called level of significance of the test
Set by researcher in advance
Errors in Making Decisions(continued)
Type II Error
Fail to reject a false null hypothesis
The probability of Type II Error is β

Outcomes and Probabilities
Possible Hypothesis Test Outcomes
State of Nature
Decision H0 True H0 False
Do Not
No error Type II Error
Key: Reject
(1 -  ) (β)
Outcome H0
(Probability) Reject Type I Error No Error
H0 () (1-β)
Type I & II Error Relationship
 Type I and Type II errors can not happen at
the same time
 Type I error can only occur if H0 is true
 Type II error can only occur if H0 is false
If Type I error probability (  ) , then

Type II error probability ( β )
Factors Affecting Type II Error
All else equal,
 β when the difference between hypothesized
parameter and its true value
 β when 
 β when σ
 β when n
Critical Value
Approach to Testing
Convert sample statistic (e.g.: x ) to test statistic ( Z
or t statistic )
Determine the critical value(s) for a specified
level of significance  from a table or computer
If the test statistic falls in the rejection region, reject

H0 ; otherwise do not reject H0
Lower Tail Tests
H0: μ ≥ 3
 The cutoff value,
-zα or xα , is called a HA: μ < 3
critical value 
Reject H0 Do not reject H0

-zα 0
xα μ
σ
x   μ  z
n
Upper Tail Tests
H0: μ ≤ 3
 The cutoff value,
zα or xα , is called a HA: μ > 3
critical value

Do not reject H0 Reject H0

0 zα
μ xα
σ
x   μ  z
n
Two Tailed Tests
 There are two cutoff values H0: μ = 3
(critical values):
HA: μ 
3
or ± zα/2
/2 /2
xα/2
Lower
Reject H0 Do not reject H0 Reject H0
xα/2 -zα/2 0 zα/2
Upper μ0
xα/2 xα/2
Lower Upper
σ
x /2  μ  z /2
n
Critical Value
Approach to Testing
Convert sample statistic ( x ) to a test statistic
( Z or t statistic )
Hypothesis
Tests for 
 Known  Unknown
Large Small
Samples Samples
Calculating the Test Statistic
Hypothesis
Tests for μ
The test statistic is:

Large Small
x μ
z  Samples Samples
σ
n
(continued)
Hypothesis
Tests for 
The test statistic is: But is sometimes

approximated Large Small
x μ using a z:
t n1  x μ
Samples Samples
s z 
σ
n n
(continued)
Hypothesis
Tests for 
The test statistic is:

Large Small
x μ
t n1  Samples Samples
s
n (The population must be
approximately normal)
Review: Steps in Hypothesis Testing
1. Specify the population value of interest
2. Formulate the appropriate null and
alternative hypotheses
3. Specify the desired level of significance
4. Determine the rejection region
5. Obtain sample evidence and compute the test
statistic
6. Reach a decision and interpret the result
Hypothesis Testing Example
Test the claim that the true mean # of
TV sets in US homes is at least 3.
(Assume σ = 0.8)
 1. Specify the population value of interest
 The mean number of TVs in US homes
 2. Formulate the appropriate null and alternative

hypotheses
 H : μ  3 HA: μ < 3 (This is a lower tail test)
0
 3. Specify the desired level of significance

 Suppose that  = .05 is chosen for this test
(continued)
4. Determine the rejection region
 = .05
-zα= -1.645 0
This is a one-tailed test with  = .05.

Since σ is known, the cutoff value is a z value:
Reject H0 if z < z = -1.645 ; otherwise do not reject H0
5. Obtain sample evidence and compute the test
statistic
Suppose a sample is taken with the following results:
n = 100, x = 2.84 ( = 0.8 is assumed known)
Then the test statistic is:
xμ 2.84  3  .16

z     2.0
σ 0.8 .08
n 100
(continued)
 6. Reach a decision and interpret the result
 = .05
z
-1.645 0
-2.0
Since z = -2.0 < -1.645, we reject the null
hypothesis that the mean number of TVs in US
homes is less than 3
(continued)
 An alternate way of constructing rejection region:
Now
expressed
 = .05 in x, not z
units
x
2.8684 3
2.84 σ 0.8
x α  μ  zα  3  1.645  2.8684
Since x = 2.84 < 2.8684, n 100
we reject the null
hypothesis
p-Value Approach to Testing
Convert Sample Statistic (e.g. x ) to Test Statistic
( Z or t statistic )
Obtain the p-value from a table or computer
Compare the p-value with 
If p-value <  , reject H0
If p-value   , do not reject H0

p-Value Approach to Testing
(continued)
p-value: Probability of obtaining a test

statistic more extreme ( ≤ or  ) than the
observed sample value given H0 is true
Also called observed level of significance
Smallest value of  for which H0 can be

rejected
p-value example
Example: How likely is it to see a sample mean of 2.84
(or something further below the mean) if the true
mean is  = 3.0?
 = .05
P( x  2.84 | μ  3.0)
p-value =.0228
 
 2.84  3.0 
 P z  
0.8 x
 
 100 
2.8684 3
 P(z  2.0)  .0228
2.84
p-value example (continued)
Compare the p-value with 
If p-value <  , reject H0

If p-value   , do not reject H0
 = .05
Here: p-value = .0228 p-value =.0228
 = .05
Since .0228 < .05, we reject
the null hypothesis
2.8684 3
2.84
Example: Upper Tail z Test
for Mean ( Known)
A phone industry manager thinks that customer
monthly cell phone bill have increased, and now
average over $52 per month. The company wishes
to test this claim. (Assume  = 10 is known)
Form hypothesis test:

H0: μ ≤ 52 the average is not over $52 per month
HA: μ > 52 the average is greater than $52 per month
(i.e., sufficient evidence exists to support the
manager’s claim)
Example: Find Rejection Region
(continued)
Suppose that  = .10 is chosen for this test
Find the rejection region: Reject H0
= .10

0 zα=1.28
Reject H0 if z > 1.28

Review:
Finding Critical Value - One Tail
Standard Normal
What is z given  = 0.10? Distribution Table (Portion)
.90 .10
Z .07 .08 .09
 = .10
1.1 .3790 .3810 .3830
.50 .40
1.2 .3980 .3997 .4015
z 0 1.28
1.3 .4147 .4162 .4177
Critical Value
= 1.28
Example: Test Statistic (continued)
Obtain sample evidence and compute the test statistic

Suppose a sample is taken with the following results:
n = 64, x = 53.1 (=10 was assumed known)
Then the test statistic is:
x μ 53.1  52
z    0.88
σ 10
n 64
Example: Decision (continued)
Reach a decision and interpret the result:
Reject H0
= .10

0
1.28
z = .88
Do not reject H0 since z = 0.88 ≤ 1.28
i.e.: there is not sufficient evidence that the
mean bill is over $52
p -Value Solution (continued)
Calculate the p-value and compare to 

p-value = .1894
P( x  53.1 | μ  52.0)
Reject H0
= .10  
 53.1  52.0 
 P z  
 10 
0  64 
1.28  P(z  0.88)  .5  .3106
z = .88  .1894
Do not reject H0 since p-value = .1894 >  = .10

Example: Two-Tail Test
( Unknown)
The average cost of a hotel
room in New York is said
to be $168 per night. A
random sample of 25
hotels resulted in
x = $172.50 and H0: μ= 168
s = $15.40. Test at the HA: μ
 = 0.05 level. 168
(Assume the population distribution is normal)
Example Solution: Two-Tail Test
H0: μ= 168 /2=.025 /2=.025
HA: μ
168
 = 0.05 Reject H0 Do not reject H0 Reject H0
-tα/2 0
tα/2
 n = 25 -2.0639 2.0639
1.46
  is unknown, so x μ 172.50  168
t n1    1.46
use a t statistic s 15.40
n 25
 Critical Value:
t24 = ± 2.0639 Do not reject H0: not sufficient evidence that

true mean cost is different than $168
Hypothesis Tests for Proportions
Involves categorical values

Two possible outcomes
“Success” (possesses a certain characteristic)
“Failure” (does not possesses that characteristic)
Fraction or proportion of population in the “success”

category is denoted by p
Proportions (continued)
Sample proportion in the success category is denoted
by p
x number of successes in sample

 p 
n sample size
When both np and n(1-p) are at least 5, p can be

approximated by a normal distribution with mean
and standard deviation

μP  p p(1  p)
σp 
n
Hypothesis Tests for Proportions
The sampling
distribution of p is Hypothesis
normal, so the test Tests for p
statistic is a z value:
np  5 np < 5
and or
pp
z n(1-p)  5 n(1-p) < 5
p(1  p)
Not discussed
n in this chapter
Example: z Test for Proportion
A marketing company
claims that it receives 8%
responses from its
mailing. To test this
claim, a random sample of
500 were surveyed with
Check:
25 responses. Test at the
 = .05 significance level. n p = (500)(.08) = 40

n(1-p) = (500)(.92) = 460
Z Test for Proportion: Solution
Test Statistic:
H0: p = .08
pp .05  .08
HA: p  . z   2.47
p(1  p) .08(1  .08)
08= .05
n = 500, p = .05
n 500
Critical Values: ± 1.96 Decision:
Reject Reject Reject H0 at  = .05
Conclusion:
.025 .025
There is sufficient
-1.96 0 1.96 z evidence to reject the
-2.47 company’s claim of 8%
response rate.
p -Value Solution (continued)
Calculate the p-value and compare to 
(For a two sided test the p-value is always two sided)
Do not reject H0
Reject H0 Reject H0 p-value = .0136:
/2 = .025 /2 = .025
P(z  2.47)  P(x  2.47)
.0068 .0068  2(.5  .4932)
 2(.0068)  0.0136
-1.96 0 1.96
z = -2.47 z = 2.47
Reject H0 since p-value = .0136 <  = .05

Type II Error
Type II error is the probability of
failing to reject a false H0
Suppose we fail to reject H0: μ  52
when in fact the true mean is μ = 50
50 52
Reject Do not reject
H0: μ  52 H0 : μ  52
Type II Error (continued)
Suppose we do not reject H0:   52 when in fact the
true mean is  = 50
This is the range of x where

This is the true H0 is not rejected
distribution of x if  = 50
50 52
H0:   52 H0 :   52
Type II Error (continued)
Suppose we do not reject H0: μ  52 when in
fact the true mean is μ = 50
Here, β = P( x  cutoff ) if μ = 50
 β
50 52
H0: μ  52 H0 : μ  52
Calculating β
Suppose n = 64 , σ = 6 , and  = .05
σ 6
cutoff  x   μ  z   52  1.645  50.766
(for H0 : μ  52) n 64
So β = P( x  50.766 ) if μ =
50
50 50.766 52
H0: μ  52 H0 : μ  52
Calculating β (continued)
Suppose n = 64 , σ = 6 , and  = .05
 
 50.766  50 
P( x  50.766 | μ  50)  P z    P(z  1.02)  .5  .3461  .1539
 6 
 64 
Probability of
type II error:
 β = .1539
50 52
H0: μ  52 H0 : μ  52
Chapter Summary
Addressed hypothesis testing methodology
Performed z Test for the mean (σ known)
Discussed p–value approach to hypothesis
testing
Performed one-tail and two-tail tests . . .
Chapter Summary (continued)
Performed t test for the mean (σ

unknown)
Performed z test for the proportion
Discussed type II error and computed its
probability

1-Hypothesis Testing

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

1-Hypothesis Testing

Uploaded by

Copyright:

Available Formats

Hypothesis Testing

Niniet Indah Arvitrida, ST, MT

 Know how to use the test statistic, critical value, and p-

 A point estimate is a single number,

We can estimate a with a Sample

 An interval estimate provides more information

 Such interval estimates are called confidence

Random Sample I am 95%

Confidence interval estimate

z units: z.025= -1.96 0 z.025= 1.96

Example: Margin of error for estimating μ, σ known:

 Determine a 95% confidence interval for the true

 2.20  1.96 (.35/ 11 )

 An incorrect interpretation is that there is 95% probability that this

Begin with the assumption that the null

H0: μ = 3 /2 /2

The probability of Type I Error is 

The probability of Type II Error is β

If Type I error probability (  ) , then

If the test statistic falls in the rejection region, reject

Reject H0 Do not reject H0

Do not reject H0 Reject H0

The test statistic is:

The test statistic is: But is sometimes

The test statistic is:

 2. Formulate the appropriate null and alternative

 3. Specify the desired level of significance

Reject H0 Do not reject H0

This is a one-tailed test with  = .05.

xμ 2.84  3  .16

Compare the p-value with 

If p-value <  , reject H0

If p-value   , do not reject H0

p-value: Probability of obtaining a test

Smallest value of  for which H0 can be

Compare the p-value with 

If p-value <  , reject H0

Form hypothesis test:

Find the rejection region: Reject H0

Do not reject H0 Reject H0

Reject H0 if z > 1.28

Obtain sample evidence and compute the test statistic

n = 64, x = 53.1 (=10 was assumed known)

Then the test statistic is:

Do not reject H0 Reject H0

Calculate the p-value and compare to 

Do not reject H0 since p-value = .1894 >  = .10

t24 = ± 2.0639 Do not reject H0: not sufficient evidence that

Involves categorical values

“Failure” (does not possesses that characteristic)

Fraction or proportion of population in the “success”

x number of successes in sample

When both np and n(1-p) are at least 5, p can be

Reject H0 since p-value = .0136 <  = .05

This is the range of x where

Performed t test for the mean (σ

You might also like