
SIMPLE ANALYSIS OF VARIANCE

Analysis of variance (ANOVA) is a technique in
inferential statistics designed to test whether
or not more than two samples (or groups) are
significantly different from each other.

ANOVA is a simultaneous test: it compares all the
samples at a single time.
The application of ANOVA requires
three basic assumptions:
• The various groups of measurements are
assumed to be obtained from a population
with a normal distribution.
• Each group is randomly selected and
independent of all other groups.
• The variables from each group come from
distributions with approximately the same
standard deviation.
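The third assumption can be checked informally before running the test. The sketch below uses Python's standard `statistics` module on hypothetical measurements (not taken from this text) and compares the largest and smallest sample standard deviations. The threshold of 2 is a common rule of thumb assumed here for illustration; it is not stated above.

```python
from statistics import stdev

def sd_ratio(groups):
    """Ratio of the largest to the smallest sample standard deviation."""
    sds = [stdev(g) for g in groups]
    return max(sds) / min(sds)

# Hypothetical measurements for three groups (illustration only).
groups = [[12, 15, 14, 10], [11, 13, 16, 12], [14, 18, 15, 17]]
ratio = sd_ratio(groups)

# Assumed rule of thumb: spreads are "approximately the same" if ratio < 2.
verdict = "similar spread" if ratio < 2 else "unequal spread"
print("SD ratio:", round(ratio, 2), "->", verdict)
```

A ratio close to 1 suggests the groups have comparable spread; a formal test of equal variances would be a stricter alternative.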
Steps Involved in Constructing the Analysis of
Variance
1. Group the data by treatments and calculate
the treatment totals, treatment means and
grand total.
2. Formulate the null and alternative
hypotheses.
3. Construct an outline of the ANOVA as
follows:
Source of variation    Degrees of    Sum of     Mean Sum of
                       Freedom       Squares    Squares
Between column                                  SSb/dfb
Within column                                   SSw/dfw
Total
4. Determine the degrees of freedom (df) for
each source of variation as follows:
dfT = NT – 1
dfTr = NTr – 1
dfE = dfT – dfTr
Where:
dfT – Total degrees of freedom
dfTr – Treatment degrees of freedom
dfE – Error degrees of freedom
NT – Total number of observations
NTr – Total number of treatments
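5. Calculate the Sum of Squares as follows:
CF = GT² / NT
Where:
CF – Correction Factor
GT – Grand Total
NT – Total number of Observations
TSS = ΣX² – CF
Where:
TSS – Total Sum of Squares
ΣX² = X1² + X2² + X3² + ... (the sum of the squares of all observations)
CF – Correction Factor
TrSS = (TA² + TB² + TC² + ...) / r – CF
Where:
TrSS – Treatment Sum of Squares
TA – Treatment total of group A
TB – Treatment total of group B
TC – Treatment total of group C
NTr – Total number of treatments (one squared total per treatment)
r – Number of observations in each treatment
CF – Correction Factor
ESS (or SSw) = TSS – TrSS
Where:
ESS – Error Sum of Squares
TSS – Total Sum of Squares
TrSS – Treatment Sum of Squares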
6. Calculate the Mean Squares (MS) by dividing each
SS by its corresponding degrees of freedom:
Treatment MS = Treatment SS / Treatment df
Error MS = Error SS / Error df
7. Calculate the observed F-value:
F = Treatment MS / Error MS
8. Obtain the tabular F-values using the treatment df
as f1 and the error df as f2
9. Fill in the ANOVA table with the values computed in
steps 4-8.
10. Compare the computed F-value with the tabular
values based on the following decision rules:
a. Reject Ho if the computed value is equal to or
greater than the tabular value. Hence, the
computed F-value is significant.
b. Accept Ho if the computed value is less than the
tabular value. Hence, the computed F-value is
not significant.
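The steps above can be sketched as a single Python function. This is a minimal illustration, not a definitive implementation: the function name `one_way_anova`, the dictionary keys, and the sample data are all hypothetical. The tabular F-value is passed in by hand, since it comes from an F table (here 5.14, the 5% value for f1 = 2 and f2 = 6).

```python
def one_way_anova(groups, f_tabular):
    """Sketch of steps 1 and 4-10 for a list of treatment groups."""
    # Step 1: treatment totals and grand total.
    totals = [sum(g) for g in groups]
    grand_total = sum(totals)
    n_total = sum(len(g) for g in groups)
    n_treatments = len(groups)

    # Step 4: degrees of freedom.
    df_total = n_total - 1
    df_treatment = n_treatments - 1
    df_error = df_total - df_treatment

    # Step 5: correction factor and sums of squares.
    cf = grand_total ** 2 / n_total
    tss = sum(x ** 2 for g in groups for x in g) - cf
    # Each squared total is divided by its own group size; with equal
    # group sizes this equals dividing the whole sum by that size once.
    trss = sum(t ** 2 / len(g) for t, g in zip(totals, groups)) - cf
    ess = tss - trss

    # Steps 6-7: mean squares and the observed F-value.
    ms_treatment = trss / df_treatment
    ms_error = ess / df_error
    f_value = ms_treatment / ms_error

    # Step 10: decision rule (reject when computed F >= tabular F).
    decision = "reject Ho" if f_value >= f_tabular else "accept Ho"
    return {
        "df": (df_total, df_treatment, df_error),
        "ss": (tss, trss, ess),
        "ms": (ms_treatment, ms_error),
        "F": f_value,
        "decision": decision,
    }

# Hypothetical data: three treatments, three observations each.
result = one_way_anova([[1, 2, 3], [2, 3, 4], [5, 6, 7]], f_tabular=5.14)
print(result)
```

For this made-up data the treatment sum of squares is 26 and the error sum of squares is 6, so F = 13 and Ho is rejected at the 5% level.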
Example:
1. Three groups of students (6 in each group)
were each taught using one of three teaching
methods. The grades of the students were taken at
the end of the semester and are listed
according to group.
                      Groups
Student     I (Method A)    II (Method B)    III (Method C)
1           84              70               90
2           90              75               95
3           92              90               100
4           96              80               98
5           84              75               88
6           88              75               90
Total       534             465              561

Squares of the grades:
Student     (Xa)²           (Xb)²            (Xc)²
1           7056            4900             8100
2           8100            5625             9025
3           8464            8100             10,000
4           9216            6400             9604
5           7056            5625             7744
6           7744            5625             8100
Total       47636           36275            52573

Ho: There is no significant difference among the three
types of teaching methods.
Set the desired level of significance: 1% or 5%.
∑x = 534 + 465 + 561 = 1560
∑x² = 47636 + 36275 + 52573 = 136,484
Get the total sum of squares (TSS):
CF = (∑x)² / N = (1560)² / 18 = 135,200
TSS = ∑x² – CF
= 136,484 – 135,200
= 1284
Get the between-column sum of squares (SSb), also called the
between-column variance. It is 1/r of the sum of
the squares of the column sums, minus the
correction term, where r refers to the number of
rows.
SSb = (1/r) Ʃ(sum of each column)² – CF
= (1/6)(534² + 465² + 561²) – 135,200
= (1/6)(816,102) – 135,200
= 817
The within column variance or within column sum of squares
is the difference between the total sum of squares and the
between column sum of squares
SSw = TSS –SSb
= 1284 – 817 = 467
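The arithmetic above can be re-checked with a few lines of Python. The sketch below uses the grades from the example; the variable names are illustrative choices, not part of the original.

```python
# Grades from the example, one list per teaching method.
grades = {
    "A": [84, 90, 92, 96, 84, 88],
    "B": [70, 75, 90, 80, 75, 75],
    "C": [90, 95, 100, 98, 88, 90],
}

column_sums = {method: sum(g) for method, g in grades.items()}
n_items = sum(len(g) for g in grades.values())  # 18 observations
r = 6                                           # rows per column

cf = sum(column_sums.values()) ** 2 / n_items   # correction factor
tss = sum(x ** 2 for g in grades.values() for x in g) - cf
ssb = sum(s ** 2 for s in column_sums.values()) / r - cf
ssw = tss - ssb

print(cf, tss, ssb, ssw)  # 135200.0 1284.0 817.0 467.0
```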

ANOVA TABLE ON THE THREE SAMPLES
SUBJECTED TO DIFFERENT TEACHING METHODS
Source of Variation    Sum of Squares    df            MSS = SS/df
Between-column         817               2 (k-1)       408.5
Within-column          467               15 k(n-1)     31.13
Total                  1284              17 (N-1)
Note: k = number of columns, k = 3
n = size of the sample per column, n = 6
Degrees of freedom:
Total df = total # of items – 1
N–1= 18 – 1 = 17
Equivalently, total df = rk – 1, where r = # of rows and k = # of columns:
6(3) – 1 = 17
Between-column df = # of columns – 1
dfb = k – 1 = 3 – 1 = 2
Within-column df = total df – between-column df
dfw = df – dfb = 17 – 2 = 15
The mean sum of squares (MSS): divide each
sum of squares by its corresponding
degrees of freedom.
a. MSSb = SSb / dfb = 817/2 = 408.5
b. MSSw = SSw / dfw = 467/15 = 31.13
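The degrees of freedom and mean sums of squares for this example can be reproduced as follows. This is a small sketch that starts from the sums of squares already obtained (817 and 467); the variable names are illustrative.

```python
k, n = 3, 6            # number of columns, sample size per column
ssb, ssw = 817, 467    # between- and within-column sums of squares

df_total = k * n - 1   # 17
dfb = k - 1            # 2
dfw = df_total - dfb   # 15

mssb = ssb / dfb
mssw = ssw / dfw

print(dfb, dfw, mssb, round(mssw, 2))  # 2 15 408.5 31.13
```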

Locate the tabular value of F by getting the
intersection value of f1 = 2 (df of the greater
MSS) and f2 = 15 (df of the smaller MSS):
5% significance: F = 3.68
1% significance: F = 6.36
Calculate the value of F:
F = MSSb / MSSw = 408.5 / 31.13 = 13.12 (computed value)
Compare the computed F value with the tabular value,
then state the decision:
If the computed value is less than the tabular value,
accept Ho (not significant).
If the computed value is equal to or greater than the tabular value,
reject Ho (significant).
13.12 is greater than 3.68 (5% level): reject Ho
13.12 is greater than 6.36 (1% level): reject Ho
Conclusion: There is a significant difference among the
teaching methods
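The final comparison can be sketched in Python as well. The tabular values 3.68 and 6.36 are the ones quoted in the example for 2 and 15 degrees of freedom; the dictionary layout is an illustrative choice.

```python
mssb = 817 / 2     # between-column mean sum of squares
mssw = 467 / 15    # within-column mean sum of squares
f_computed = mssb / mssw

# Tabular F-values for f1 = 2 and f2 = 15, as quoted in the example.
tabular = {"5%": 3.68, "1%": 6.36}

decisions = {
    level: "reject Ho" if f_computed >= value else "accept Ho"
    for level, value in tabular.items()
}

print(round(f_computed, 2), decisions)
# 13.12 {'5%': 'reject Ho', '1%': 'reject Ho'}
```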
2. The following are the mileages obtained after
several road tests were run using five different
kinds of gasoline on a Toyota car.
Road Test    Gas A    Gas B    Gas C    Gas D    Gas E
1st          35       61       38       65       56
2nd          31       63       54       60       69
3rd          42       50       47       57       70
4th          48       42       60       55       50
5th          40       49       55       60       48
3. Four groups of 4 patients each were subjected
to 4 different types of treatment for the same
ailment. The following data are the number of days
that elapsed before they were completely
cured. What conclusions may be drawn about
the four types of treatment?
Patient    A     B     C     D
1          10    11    3     6
2          9     11    4     10
3          6     18    5     8
4          7     6     7     11
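As a starting point for exercises 2 and 3, the data can be arranged by treatment and step 1 (treatment totals, treatment means and grand total) carried out in Python. This is only a sketch: the helper name `treatment_summary` and the variable names are hypothetical, and the remaining steps are left to the reader.

```python
def treatment_summary(groups):
    """Step 1: treatment totals, treatment means and grand total."""
    totals = {name: sum(values) for name, values in groups.items()}
    means = {name: totals[name] / len(values) for name, values in groups.items()}
    grand_total = sum(totals.values())
    return totals, means, grand_total

# Exercise 2: mileage per road test, grouped by kind of gasoline.
mileage = {
    "A": [35, 31, 42, 48, 40],
    "B": [61, 63, 50, 42, 49],
    "C": [38, 54, 47, 60, 55],
    "D": [65, 60, 57, 55, 60],
    "E": [56, 69, 70, 50, 48],
}

# Exercise 3: days until cured, grouped by type of treatment.
days_to_cure = {
    "A": [10, 9, 6, 7],
    "B": [11, 11, 18, 6],
    "C": [3, 4, 5, 7],
    "D": [6, 10, 8, 11],
}

for label, data in (("Exercise 2", mileage), ("Exercise 3", days_to_cure)):
    totals, means, grand_total = treatment_summary(data)
    print(label, totals, means, grand_total)
```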
