Professional Documents
Culture Documents
HYPOTHESIS TESTING A procedure based on sample evidence and probability theory to determine whether
the hypothesis is a reasonable statement.
TEST STATISTIC A value, determined from sample information, used to determine whether to reject the null
hypothesis.
CRITICAL VALUE The dividing point between the region where the null hypothesis is rejected and the region
where it is not rejected.
Important Things to Remember about H0 and H1
H0: null hypothesis and H1: alternate
hypothesis
H0 and H1 are mutually exclusive and
collectively exhaustive
H0 is always presumed to be true
H1 is the research hypothesis
A random sample (n) is used to “reject H0”
If we conclude 'do not reject H0', this does
not necessarily mean that the null
hypothesis is true, it only suggests that
there is not sufficient evidence to reject H0;
rejecting the null hypothesis then, suggests
that the alternative hypothesis may be true.
Equality is always part of H0 (e.g. “=” , “≥” ,
“≤”).
“≠” “<” and “>” always part of H1
In actual practice, the status quo is set up
as H0
In problem solving, look for key words and
convert them into symbols. Some key
words include: “improved, better than, as
effective as, different from, has changed,
etc.”
There are 4 major components of a test of hypothesis.
1. Null hypothesis
3. Test statistic
4. Rejection region
Null hypothesis
H0
H0 : 3
Alternative hypothesis
H1
H 1 : 50 ,000
Signs in the Tails of a Test
Test Statistic
continued
H 0 is true H 0 is false
Rejection Rejection
Region Acceptance Region
Region
One-tailed Test
Rejection
Region Acceptance
Region
Steps in hypothesis testing
Example
– A new billing system for a department store will
be cost- effective only if the mean monthly
account is more than $170.
– A sample of 400 accounts has a mean of $178.
– If accounts are approximately normally
distributed with = $65, can we conclude that
the new system will be cost effective?
Testing the Population Mean ( is Known)
Example – Solution
– The population of interest is the credit
accounts at the store.
– We want to know whether the mean account
for all customers is greater than $170.
H1 : > 170
– The null hypothesis generally specifies a
single value of the parameter
H0 : = 170
Approaches to Testing
The Rejection Region Method
x xL
The standardized test statistic
– Instead of using the statistic x , we can
use the standardized value z.
x
z
n
– Then, the rejection region becomes
One tail test
z z
Critical Value for the Rejection Region
Set the probability of committing a Type I
error be (also called the significance
level).
The standardized test statistic
Example - continued
– We re-do this example using the
standardized test statistic.
Recall: H0: = 170
H1: > 170
– Test statistic:
x 178 170
z 2.46
n 65 400
– Rejection region: z > z.051.645.
The standardized test statistic
Example - continued
Conclusion
Since Z = 2.46 > 1.645, reject the null
hypothesis in favor of the alternative
hypothesis.
p-value Method
– The p-value provides information about the
amount of statistical evidence that supports
the alternative hypothesis.
H 0 : x 170
H1 : x 170
x 178
Interpreting the p-value
Describing the p-value
– If the p-value is less than 1%, there is
overwhelming evidence that supports the
alternative hypothesis.
– If the p-value is between 1% and 5%, there is a
strong evidence that supports the alternative
hypothesis.
– If the p-value is between 5% and 10% there is a
weak evidence that supports the alternative
hypothesis.
– If the p-value exceeds 10%, there is no evidence
that supports the alternative hypothesis.
The p-value and the Rejection
Region Methods
– The p-value can be used when making
decisions based on rejection region methods
as follows:
• Define the hypotheses to test, and the required
significance level
• Perform the sampling procedure, calculate the test
statistic and the p-value associated with it.
• Compare the p-value to Reject the null
hypothesis only if p-value <; otherwise, do not
reject the null hypothesis.
= 0.05
The p-value
x 170
x L 175.34 x 178
Steps in Hypothesis Testing using SPSS
State the null and alternative hypotheses
Define the level of significance (α)
Calculate the actual significance : p-
value
Make decision : Reject null hypothesis, if
p≤ α, for 2-tail test; and
if p*≤ α, for 1-tail test.(p* is p/2 when p is
obtained from 2-tail test)
Conclusion
Inference About a Population Mean When the
Population Standard Deviation Is Unknown or
When the Sample Size is Small
In practice, the population standard deviation will be
unknown.
Recall that when is known we use the following
statistic to estimate and test a population mean
x
z
When is unknown or when n the sample size is
x
t
s n
s2
x
2
i
i
n 1507.55.
n 1
s 1507.55 38.83
Testing when is unknown
Rejection region
• The test statistic is 1.676 1.89
x 460.38 450
t 1.89
s n 38.83 50
One-Sample Test
p̂ p
Z
p(1 p) / n
where np 5 and n(1 p) 5
Testing the Proportion
Example 12.6
– A pharmaceutical company claimed that its
medicine was 80% effective in relieving allergy
for a period of 15 hours. In a sample of 200
persons, who were given medicine, 150
persons had relief. Do you thank that the
company’s claim is justified? Use 0.05 level of
significance.
Testing the Proportion
Solution
– The problem objective is to test the
effectiveness of medicine.
– The data are nominal.
– The parameter to be tested is ‘p’.
– Success is defined as “having relief”.
– The hypotheses are:
H0: p = .8
H1: p < .8
Testing the Proportion
– Solution
• The rejection region is z < z = z.05 = -1.645.
• The sample proportion is pˆ 150 200 .75
• The value of the test statistic is
pˆ p .75 .8
Z 1.786
p (1 p ) / n .8(1 .8) / 200
Since calculated z is less than critical value, we
reject null hypothesis and conclude that the
claim of the company that its medicine is 80%
effective is not justified.
T-Tests : When sample size is small
(<30) or When the Population
Standard Deviation Is Unknown
Variable : Normal
Types of t-tests:
One-sample t-test
Paired or dependent
sample t-test Independent
samples t-test (Equal and
Unequal Variance)
One-sample t-test
H 0 : 0
H1 : 0
H1 : 0
H1 : 0
Paired sample t-test
H0 : d 0
H1 : d 0
H1 : d 0
H1 : d 0
Matched pairs
Test statistic:
xD D
t
s D nD
Degree of freedom = nD 1
Independent sample t-test
H 0 : 1 2
H 1 : 1 2
H 1 : 1 2
H 1 : 1 2
The sampling process.
Population 1 Population2
Parameters: Parameters:
1and 1 2
2 and 22
Statistics: Statistics:
2
x1 ands1 2
x2 ands 2
Sample size: n1 Sample size: n2
If the two population standard deviations are
unknown, then we can estimate the standard
error of the difference between two means.
ˆ ˆ2 2
ˆ x1 x2 1 2
n1 n2
Test statistic:
z
x1 x2
2 2
ˆ ˆ
1
2
n1 n2
If population variance unknown and the sample size
is small and the population variances are equal
1 1
x1 x2 s
2
p
n1 n2
Where:
s
2 n 1s n 1s
1
2
1 2
2
2
p
n1 n2 2
Test statistic:
t
x1 x2
2 1 1
s
p
n1 n2
Degree of freedom = n1 n2 2
One way
Analysis of Variance ( ANOVA )
The Null Hypothesis is that the population means are the same. The Alternative
Hypothesis is that at least one of the means is different.
H0: µ1 = µ2 =…= µk
H1: The means are not all equal
Reject H0 if F > F,k-1,n-k
The test statistic used to test the hypothesis is
F statistic
Assumptions:
H 0 : 1 2 3 ........
H1 : Not all means are same
ANOVA – Example (File Airlines.sav)
EXAMPLE
Recently a group of four major carriers
joined in hiring Brunner Marketing
Research, Inc., to survey recent
passengers regarding their level of
satisfaction with a recent flight. The
survey included questions on ticketing,
boarding, in-flight service, baggage
handling, pilot communication, and so
forth.
To compute SSE, find deviation between each observation and its treatment mean. Each of these
values is squared and then summed for all 22 observations.
SSE = {(94-87.25)2 + (90-87.25)2 + ……+ (80-87.25)2 } + {(75-78.20)2 + (68-78.20)2 + ……+ (88-
78.20)2 } + {(70-72.86)2 + (73-72.86)2 + ……+ (65-72.86)2 } + {(68-69)2 + (70-69)2 + ……+ (65-
69)2 } = 594.41
Finally, determine SST = SS total – SSE.
SST = 1485.10 – 594.41 = 890.69
ANOVA – Example
Step 3: Find the appropriate test statistic. Use the F statistic
Calculations: It is convenient to summarize the calculations of F statistic in an
ANOVA Table.
The computed value of F is 8.99, which is greater than the critical value of 5.09, so the
null hypothesis is rejected.
Conclusion: The mean scores are not the same for the four airlines; at this point we can
only conclude there is a difference in the treatment means. We cannot determine which
treatment groups differ or how many treatment groups differ.
ANOVA Example – SPSS Output
Test of Homogeneity of Variances
Satisfaction
Levene Statistic df1 df2 Sig.
.962 3 18 .432
ANOVA
Satisfaction
Sum of
Squares df Mean Square F Sig.
Between 890.684 3 296.895 8.991 .001
Groups
Within Groups 594.407 18 33.023
Total 1485.091 21
ANOVA Example – SPSS Output
Multiple Comparisons
Satisfaction
Tukey HSD
(I) Carrier (J) Carrier Mean 95% Confidence Interval
Difference (I- Lower Upper
J) Std. Error Sig. Bound Bound
TWA 9.050 3.855 .124 -1.85 19.95
Eastern Allegheny 14.393 *
3.602 .004 4.21 24.57
Ozark 18.250* 3.709 .001 7.77 28.73
Eastern -9.050 3.855 .124 -19.95 1.85
TWA Allegheny 5.343 3.365 .410 -4.17 14.85
Ozark 9.200 3.480 .071 -.63 19.03
Eastern -14.393 *
3.602 .004 -24.57 -4.21
Allegheny TWA -5.343 3.365 .410 -14.85 4.17
Ozark 3.857 3.197 .631 -5.18 12.89
Eastern -18.250 *
3.709 .001 -28.73 -7.77
Ozark TWA -9.200 3.480 .071 -19.03 .63
Allegheny -3.857 3.197 .631 -12.89 5.18
*. The mean difference is significant at the 0.05 level.
ANOVA Example – SPSS Output
Homogeneous Subsets
Satisfaction
Tukey HSDa,b
Carrier
Subset for alpha = 0.05
N 1 2
Ozark 6 69.00
Allegheny 7 72.86
TWA 5 78.20 78.20
Eastern 4 87.25
Sig. .078 .085
Means for groups in homogeneous subsets are displayed.
a. Uses Harmonic Mean Sample Size = 5.266.
b. The group sizes are unequal. The harmonic mean of the group
sizes is used. Type I error levels are not guaranteed.
One Way Analysis of Variance
Example 2
– An apple juice manufacturer is planning to
develop a new product -a liquid concentrate.
– The marketing manager has to decide how to
market the new product.
– Three strategies are considered
• Emphasize convenience of using the product.
• Emphasize the quality of the product.
• Emphasize the product’s low price.
One Way Analysis of Variance
Example 2- continued
– An experiment was conducted as follows:
• In three cities an advertisement campaign was
launched
• In each city only one of the three characteristics
(convenience, quality, and price) was emphasized.
• The weekly sales were recorded for twenty weeks
following the beginning of the campaigns.
One Way Analysis of Variance
Convnce Quality Price
529 804 672 See file
Weekly 658
793
630
774
531
443
JUICE.xls
sales 514 717 596
663 679 602
719 604 502
711 620 659
606 697 689
Weekly
461 706 675
529 615 512
sales
498 492 691
663 719 733
604 787 698
495 699 776
485
557
Weekly
572
523
561
572
353 sales
584 469
557 634 581
542 580 679
614 624 532
Defining the Hypotheses
• Solution
H0: 1 = 2= 3
H1: At least two means differ
O E
k
2
i i
i 1 Ei
Contingency table 2 test – Example
Example
Degree Accounting Finance Marketing
BA 31 13 16 60
BENG 8 16 7 31
BBA 12 10 17 60
Other 10 5 7 39
61 44 47 152
Example
Solution
– The hypotheses are:
H0: The two variables are independent
H1: The two variables are dependent
i 1 Ei
Calculation of the 2 statistic
• Solution – continued
Undergraduate MBA Major
Degree Accounting Finance Marketing
BA 31 24.08
31 (24.08)
k 13 (17.37) 2 16 (18.55) 60
(f e )
BENG 2 8 (12.44) 16 (8.97) 7 (9.58) 31
BBA 31 24.08
12 (15.65)
i i
10 (11.29) 17 (12.06) 39
Other
31 24.08
10 (8.83)
i61
1
e 55 6.39
(6.39) 77 6.80
i
(6.80) 22
44 47 152
5 6.39 7 6.80
31 24.08
The expected frequency
5 6.39 7 6.80
31 24.08
5 6.39 7 6.80
(31 - 24.08)2 (5 - 6.39)2 (7 - 6.80)2
2= 24.08 +….+ 6.39 +….+ 6.80
= 14.70
Contingency table test – 2
Example
• Solution – continued
– The critical value in our example is:
2 ,( r 1)( c 1) .205,( 4 1)( 31) 12.5916
• Conclusion:
Since 2 = 14.70 > 12.5916, there
is sufficient evidence to infer at 5% significance
Likelihood Ratio
13.781 6 .032
Linear-by-Linear Association
2.003 1 .157
N of Valid Cases
152
a. 0 cells (.0%) have expected count less than 5. The minimum expected count is
6.37.
Yates’ Correction for Continuity
Chi-square distribution is a continuous
distribution. Whenever the degrees of freedom
(in case of a 2x2 table), certain corrections for
continuity can be made
Required conditions –
the rule of five
The test statistic used to perform the test
is only approximately Chi-squared
distributed.
For the approximation to apply, the
expected cell frequency has to be at least
5 for all the cells (np 5).
If the expected frequency in a cell is less
than 5, combine it with other cells.