Professional Documents
Culture Documents
Applications
Hypothesis Testing
Objectives: At the end of the class, the students
shall be able to:
1. Differentiate when to use the independent t-test and paired t-test
2. Interpret the SPSS outputs for parametric test and non-
parametric test for 2 samples
3. Answer the midterm examination with accuracy, speed, and
honesty
Lecture Outline:
•What is Hypothesis Testing?
•Hypothesis Formulation
•Statistical Errors
•Effect of Study Design
•Test Procedures
•Test Selection.
Statistics
Descriptive Inferential
Organizing,
Correlational Generalising
summarizing &
describing data
Relationships
Significance
Sampling Error
A=B AB
We also need to establish:
1) How unequal are these observations?
2) Are these observations reflective of the general population?
Example Hypotheses: Isometric Torque
• Is there any difference in the length of time that males and females
can sustain an isometric muscular contraction?
mean♂ = mean♂
mean♀ mean♀
Example Hypotheses: Isometric Torque
• Is there any difference in the length of time that males and
females can sustain an isometric muscular contraction?
n♀
Number of People
100
80 n♂
60
40
20
n♀
Number of People
100
80 n♂
60
40
20
• Type 2 Errors
-Accepting H0 when it is actually false (e.g. previous slide)
-Concluding no difference when one does exist
Errors can occur due to biased/inadequate sampling, poor
experimental design or the use of inappropriate/non-
parametric tests.
Back to Study Design
• Independent Measures
• Individual scores in each data set are independent of one another
• Repeated Measures
• Individual scores in each data set are dependent/paired/correlated
Back to Study Design
• Independent Measures T O1
• Individual scores in each data set are independent of one another
2 Distinct
Groups
P
• Repeated Measures Oa
• Individual scores in each data set are dependent/paired/correlated
O1 T O2
Pre-Experimental
designs.
Same individuals
tested twice
Back to Study Design
• Independent Measures True-Experimental design.
• Individual scores in each data set are independent of one another
Random Group Assignment
O1 T O2
Depends on how
equivalent
groups were R O3
achieved P O4
• Repeated Measures
Cross-Over Design
Example Hypotheses: Isometric Torque
• Is there any difference in the length of time that males and
females can sustain an isometric muscular contraction?
Independent
80
60
40
20
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: Calculation
Step 2:
Calculate the Standard Error for the difference in means
Mean SD n
♀ 18.5 1.74 25
♂ 17.5 1.72 25
Independent t-test: Calculation
Step 3:
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: Calculation
Step 4:
Calculate the degrees of freedom (df)
df = (n♀ - 1) + (n♂ - 1) = 48
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: Calculation
Step 5:
Determine the critical value for t using a t-distribution table
n.b. Use 0.05
Degrees of Freedom Critical t-ratiofor 2 tailed test
44 2.015
46 2.013
48 2.011
50 2.009
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: Calculation
Step 6 finished:
Compare t calculated with t critical
Therefore,
Calculated t = 2.00
t calculated < t critical
Critical t = 2.01
Effect size n.s.
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: Calculation
Interpretation:
P > 0.05 Failed to reject the HO
Conclusion:
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: Calculation
Evaluation:
The wealth of available literature supports that females can sustain
isometric contractions longer than males. This may suggest that the
findings of the present study represent a type I error
Possible solution:
Increase n
Mean SD n
♀ 18.5 1.74 25
♂ 16.5 1.72 25
Independent t-test: SPSS Output
Group Statistics
160
i.e.
Number of Press-Ups
140
This data would have a large
120
Large SD associated with an
100
80
SD independent t-test simply
60
(variance) because some subjects
40 performed better than others
20
0
HOWEVER…
1 2
Week
Advantages of using Paired Data
• Data from independent samples is heavily
200
influenced by variance between subjects
180
160
…using the same participants
on two occasions allows us to
Number of Press-Ups
140
80
60
…now we can remove
40
between subject variance
from subsequent analysis…
20
1 2
Week
Paired t-test: Calculation
Subject Week 1 Week 2 Diff (D) Diff2 (D2)
1 10 12
2 50 52
3 20 25
4 8 10
5 115 120
6 75 80
7 45 50
8 170 175
Steps 1 & 2: Complete this table ∑D = ∑D2 =
Paired t-test: Calculation
Step 3:
Calculate the t statistic
∑D
t= n x ∑D2 – (∑D)2 =
√ (n - 1)
∑D = ∑D2 =
Paired t-test: Calculation
Step 3:
Calculate the t statistic
31
t= 8 x 137 – (31)2 = 7.06 √
7
∑D = ∑D2 =
Paired t-test: Calculation
Steps 4 & 5:
Calculate the df and use a t-distribution table to find t critical
Critical t-ratio Critical t-ratio
Degrees of Freedom (0.05 level) (0.01 level)
1 12.71 63.657
2 4.303 9.925
3 3.182 5.841
4 2.776 4.604
5 2.571 4.032
6 2.447 3.707
7 2.365 3.499
df = n8-1 2.306 3.355
9 2.262 3.250
Paired t-test: Calculation
Step 6 finished:
Compare t calculated with t critical
Therefore,
Calculated t = 7.06
t calculated > t critical
Critical t = 3.499
Effect size sig.
Mean SD n
Week 1 61.6 56.6 8
Week 2 65.5 57.5 8
Paired t-test: Calculation
Interpretation:
P < 0.05 Reject H0
Conclusion:
Std. Error
Mean N Std. Deviation Mean Push-up Data
Pair VAR00001 61.6250 8 56.64157 20.02582
1 VAR00002 65.5000 8 57.54005 20.34348 from lecture 3
Paired Differences
95% Confidence
Interval of the
Std. Error Difference
Mean Std. Deviation Mean Lower Upper t df Sig. (2-tailed)
Pair 1 VAR00001 - VAR00002 -3.87500 1.55265 .54894 -5.17305 -2.57695 -7.059 7 .000
Calculated t
Ignore sign
7.059 > 3.499
df 7 = critical t 2.365 (0.05) So P < 0.01
3.499 (0.01)
Parametric versus Non-Parametric
• Both the t-tests just shown are parametric tests
100
80
60
40
20
100
80
Mean
B
60
40
20
• Mann-Whitney Test
e.g. Exam grades (ordinal) from 14 students in 2 separate schools
• Repeated Measures
• Wilcoxon Test
Mann-Whitney U: Calculation
Step 1:
Rank all the data from both groups in one series, then total each
School A School B
Student Grade Rank Student Grade Rank
J. S. B- T. J. D
L. D. B- M. M. C+
H. L. A+ K. S. P. C+
M. J. D- S. R. B-
T. M. B+ M. P. E
T. S. A- W. A. C-
P. H. F F. A-
Median = B-;∑RA = Median = C+;∑RB =
Mann-Whitney U: Calculation
Step 2:
Calculate two versions of the U statistic using:
(nA + 1) x nA
U1 = (nA x nB) + - ∑RA
2
AND…
(nB + 1) x nB
U2 = (nA x nB) + - ∑RB
2
(nA + 1) x nA
U1 = (nA x nB) + - ∑RA
2
…OR to save time you can calculate U1 and then U2 as follows
U2 = (nA x- n
UB1 )
n 6 7 8 9
0.05 5 8 13 17
0.01 2 4 7 11
Test Statisticsb
VAR00001
Mann-Whitney U 17.500
Calculated U
Wilcoxon W 45.500
Z -.900
(lower value)
Asymp. Sig. (2-tailed) .368
Exact Sig. [2*(1-tailed a
.383
Sig.)]
17.5 > 8
a. Not corrected for ties .
b. Grouping Variable: VAR00002 So P > 0.05 n.s.
Non-Parametric Tests
• Independent Measures
• Mann-Whitney Test
• Repeated Measures
e.g. One group pre-test post-test, assumed non-normal
• Wilcoxon Test
Wilcoxon Signed Ranks: Calculation
Step 1:
Rank all the differences in one series (ignoring signs), then total each
Pre-training Post-training
Athlete Diff. Rank Signed Ranks
OBLA (kph) OBLA (kph) - +
J. S. 15.6 16.1 0.5 6 6
L. D. 17.2 17.5 0.3 4.5 4.5
H. L. 17.7 16.7 -1 -7 -7 4.
M. J. 16.5 16.8 0.3 4.5 5
T. M. 15.9 16.0 0.1 1.5 1.5
T. S. 16.7 16.5 -0.2 -3 -3 1.
P. H. 17.0 17.1 0.1 1.5 5
Medians = 16.7 16.7 ∑Signed Ranks =
Wilcoxon Signed Ranks: Calculation
Step 2:
The smaller of the T values is our test statistic (T+ = 18; T- = 10)
…now consult a table of critical values for the Wilcoxon test
n 6 7 8 9
0.05 0 2 3 5
Conclusion
Calculated T must be less than critical
T to conclude a significant difference Median A = Median B
Wilcoxon Signed Ranks: SPSS Output
Ranks
Test Statisticsb
VAR00002 -
VAR00001 10 > 2
Z -1.364 a
So P > 0.05 n.s.
Asymp. Sig. (2-tailed) .172
a. Bas ed on negative ranks.
b. Wilcoxon Signed Ranks Tes t
So which stats test should you use?
Nominal Q1. What is the LOM? Interval/Ratio
Ordinal
No Q2. Are the data ND?
Yes
• Induction/deduction - reproducibility