Professional Documents
Culture Documents
PART 5 - ANOVA
Analysis of Variance
A B C
ANOVA
A B C
ANOVA
A B C
ANOVA
A B C
ANOVA
A B C
GroupA GroupB GroupC
ANOVA 37 62 50
60 27 63
52 69 58
Let’s work with some data: 43 64 54
40 43 49
52 54 52
55 44 53
39 31 43
39 49 65
23 57 43
A B C
GroupA GroupB GroupC
ANOVA 37 62 50
60 27 63
52 69 58
First calculate the sample means 43 64 54
40 43 49
TOT 49
ANOVA
TOT 49
𝑆𝑆𝐺 = 420 GroupA GroupB GroupC
ANOVA 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2 37 62 50
60 27 63
𝑆𝑆𝐸 = 3300
52 69 58
Sum of Squares Error 43 64 54
(xA-A)2 (xA-A)2 (xB-B)2 (xB-B)2 (xC-C)2 (xC-C)2 40 43 49
49 64 144 16 9 1 (37-44)2 52 54 52
=(-7)2
256 121 529 36 100 0 =49 55 44 53
64 25 361 361 25 100 39 31 43
1 25 196 1 1 144 39 49 65
16 441 49 49 16 100 23 57 43
1062 1742 496 A,B,C 44 50 53
TOTAL 3300 TOT 49
𝑆𝑆𝐺 = 420 GroupA GroupB GroupC
ANOVA 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2 37 62 50
60 27 63
𝑆𝑆𝐸 = 3300
52 69 58
𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 27 43 64 54
Degrees of Freedom Error 40 43 49
52 54 52
𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 𝑛𝑟𝑜𝑤𝑠 − 1 ∗ 𝑛𝑔𝑟𝑜𝑢𝑝𝑠 55 44 53
= 10 − 1 ∗ 3 39 31 43
39 49 65
= 27 23 57 43
A,B,C 44 50 53
TOT 49
𝑆𝑆𝐺 = 420 GroupA GroupB GroupC
ANOVA 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2 37 62 50
60 27 63
𝑆𝑆𝐸 = 3300
52 69 58
𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 27 43 64 54
Plug these into our formula: 40 43 49
52 54 52
𝑆𝑆𝐺 420 55 44 53
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 2 210
𝐹= 𝑆𝑆𝐸 = 3300 = = 𝟏. 𝟕𝟏𝟖 39 31 43
122.22 39 49 65
𝑑𝑓𝑒𝑟𝑟𝑜𝑟 27
23 57 43
A,B,C 44 50 53
TOT 49
ANOVA with Excel Data Analysis
F Distribution
F-Distribution
shaded area = 𝛼
Fcritical
F-Distribution
faster payments? 14 10 16
10 16 21
ANOVA Exercise #1
null hypothesis!
ANOVA Exercise #1
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
variance in SSE
Block C 12 13 12.5
1,2 10 12 11
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
2
multiply by the number 𝑆𝑆𝐺 = 6
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
Two-Way ANOVA 𝐹=
𝑉𝑎𝑟. 𝐵𝑒𝑡𝑤𝑒𝑒𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
𝑉𝑎𝑟. 𝑊𝑖𝑡ℎ𝑖𝑛 𝐺𝑟𝑜𝑢𝑝𝑠
=
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠
𝑆𝑆𝐸
𝑑𝑓𝑒𝑟𝑟𝑜𝑟
14 col 12 17 16 15
Multiply by the number of 𝑆𝑆𝐺 = 70
items in each group:
14 × 5 = 70
ANOVA Exercise #2
2% 1% no block
disc disc disc
3. Degrees of Freedom Groups $50 16 23 21 20
$100 14 21 16 17
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 𝑛𝑔𝑟𝑜𝑢𝑝𝑠 − 1
$150 11 16 18 15
=3−1 $200 10 15 14 13
$250 9 10 11 10
=2
col 12 17 16 15
𝑆𝑆𝐺 = 70 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2
ANOVA Exercise #2
4. Sum of Squares Blocks 2%
disc
1%
disc
no block
disc
(𝜇50 −𝜇 𝑇𝑂𝑇 )2 = (20 − 15)2 = 25 $50 16 23 21 20
𝑆𝑆𝐺 = 70 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2
𝑆𝑆𝐵 = 174
𝑆𝑆𝑇 = 268
𝑆𝑆𝐸 = 24
ANOVA Exercise #2
2% 1% no block
7. Degrees of Freedom Error disc disc disc
$50 16 23 21 20
𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = (𝑛𝑏𝑙𝑜𝑐𝑘𝑠 − 1)(𝑛𝑔𝑟𝑜𝑢𝑝𝑠 − 1) $100 14 21 16 17
= (5 − 1)(3 − 1) $150 11 16 18 15
$200 10 15 14 13
=8 $250 9 10 11 10
col 12 17 16 15
𝑆𝑆𝐺 = 70 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2
𝑆𝑆𝐵 = 174 𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 8
𝑆𝑆𝑇 = 268
𝑆𝑆𝐸 = 24
ANOVA Exercise #2
2% 1% no block
8. Calculate F disc disc disc
$50 16 23 21 20
𝑆𝑆𝐺 70
$100 14 21 16 17
$150 11 16 18 15
𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 2 35
𝐹= = = = 𝟏𝟏. 𝟔𝟕 $200 10 15 14 13
𝑆𝑆𝐸 24 3 $250 9 10 11 10
𝑑𝑓𝑒𝑟𝑟𝑜𝑟 8 col 12 17 16 15
𝑆𝑆𝐺 = 70 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2
𝑆𝑆𝐵 = 174 𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 8
𝑆𝑆𝑇 = 268 F = 11.67
𝑆𝑆𝐸 = 24
ANOVA Exercise #2
2% 1% no block
9. Find Fcritical disc disc disc
$50 16 23 21 20
𝛼 = 0.05 $100 14 21 16 17
𝑑𝑓𝑛𝑢𝑚𝑒𝑟𝑎𝑡𝑜𝑟 = 2 $150 11 16 18 15
$200 10 15 14 13
𝑑𝑓𝑑𝑒𝑛𝑜𝑚𝑖𝑛𝑎𝑡𝑜𝑟 = 8 $250 9 10 11 10
𝐹𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 = 𝟒. 𝟒𝟔 col 12 17 16 15
𝑆𝑆𝐺 = 70 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2
𝑆𝑆𝐵 = 174 𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 8
𝑆𝑆𝑇 = 268 F = 11.67
𝑆𝑆𝐸 = 24 Fcritical = 4.46
ANOVA Exercise #1
𝑆𝑆𝐺 = 70 𝑑𝑓𝑔𝑟𝑜𝑢𝑝𝑠 = 2
𝑆𝑆𝐵 = 174 𝑑𝑓𝑒𝑟𝑟𝑜𝑟 = 8
𝑆𝑆𝑇 = 268 F = 11.67
𝑆𝑆𝐸 = 24 Fcritical = 4.46
2-way ANOVA in Excel
Two-Way ANOVA
with Replication
Without vs With Replication
without replication with replication
GroupA GroupB GroupC GroupA GroupB GroupC
Block1 16 23 21 Block1 16 23 21
Block2 14 21 16 14 21 16
Block3 11 16 18 11 16 18
Block4 10 15 14 Block2 10 15 14
Block5 9 10 11 9 10 11
Block6 8 8 10 8 8 10
14 − 15 2 = 2 Sample 13 19 16
Means
× 6 𝑖𝑡𝑒𝑚𝑠 𝑝𝑒𝑟 𝑐𝑜𝑙𝑢𝑚𝑛 = 𝟏𝟐 17 13 12
Column
15 16 14 15
Means
SSB = 18 SSC = 12
Fertilizer: A B C
Two-Way ANOVA Warm 13 21 18 B
l
14 19 15 16 o
c
● As before, calculate the 12 17 15 k
Cold 16 14 15 M
Degrees of Freedom 18 11 13
e
14 a
Columns 17 14 8
n
s
𝑑𝑓𝑐𝑜𝑙𝑢𝑚𝑛𝑠 = 3 − 1 = 2 Sample 13 19 16
Means 17 13 12
Column
15 16 14 15
Means
SSB = 18 SSC = 12 dfcolumns = 2
Fertilizer: A B C
Two-Way ANOVA Warm 13 21 18 B
l
14 19 15 16 o
c
● We have a new statistic: 12 17 15 k
Cold 16 14 15 M
SS Interactions 18 11 13
e
14 a
● For each sample mean, 17 14 8
n
s
17 − 14 − 15 + 15 2 + Sample 13 19 16
Means
13 − 14 − 16 + 15 2 + 17 13 12
Column
12 − 14 − 14 + 15 2 = 28 Means
15 16 14 15
Sample 13 19 16
Means 17 13 12
Column
15 16 14 15
Means
SSB = 18 SSC = 12 dfcolumns = 2
SSI = 84 SSE = 50 dferror = 12
SST = 164
Fertilizer: A B C
Two-Way ANOVA Warm 13 21 18 B
l
14 19 15 16 o
c
● A look at Interaction: 12 17 15 k
Cold 16 14 15 M
e
18 11 13 14 a
n
17 14 8 s
Sample 13 19 16
Means 17 13 12
Column
15 16 14 15
Means
SSB = 18 SSC = 12 dfcolumns = 2
SSI = 84 SSE = 50 dferror = 12
SST = 164
2-way with Replication in Excel
Next Up: REGRESSION