Professional Documents
Culture Documents
Unit10 ANOVA
Unit10 ANOVA
What is ANOVA?
The test of hypothesis concerning the equality of two
population means makes use of both the Z and t tests.
However, if there are more than two populations, the
test for the equality of means could be carried out by
considering two populations at a time. This would be a
very cumbersome procedure.
One easy way out could be to use the analysis of
variance (ANOVA) technique. The technique helps in
performing this test in one go and, therefore, is
considered to be important technique of analysis for
the researcher.
What is ANOVA?
The basic principle underlying the technique is that the total
variation in the dependent variable is broken into two parts—one
which can be attributed to some specific causes and the other that
may be attributed to chance.
The one which is attributed to the specific causes is called the
variation between samples and the one which is attributed to
chance is termed as the variation within samples.
Therefore, in ANOVA, the total variance may be decomposed into
various components corresponding to the sources of the variation.
For eg. the sales of chairs could differ because of the various
styles and sizes of stores selling them. The average telephone
bill of the households could be different because they belong to
different income groups and so on.
What is ANOVA?
In ANOVA, the dependent variable in question is
metric (interval or ratio scale), whereas the
independent variables are categorical (nominal
scale).
If there is one independent variable (one factor)
divided into various categories, we have one-way
or one-factor analysis of variance.
In the two-way or two-factor analysis of variance,
two factors each divided into the various
categories are involved.
Where,
xij = the jth observation of the ith sample
T•• = Grand total of all the data
k = Number of treatments (samples)
n = Number of observations in each sample
Where, N = n1 + n2 + . . . . + nk
The ANOVA table can be set up with the above information and if
computed F is greater than the table value of F, null hypothesis
could be rejected.
I Price
H0 : μA = μB = μC (Assuming three price level)
H1 : At least the two means are not same.
II (Blocks or Stores)
H0 : ν1 = ν2 = ν3 (Assuming three different sizes of stores)
H1 : At least two means are not same.
Factorial Design
In factorial design, the dependent variable is the
interval or the ratio scale and there are two or
more independent variables which are nominal
scale.
It is possible to examine the interaction between
the variables. If there are two independent
variables, each having three cells, there would be
a total of nine interactions.
The hypothesis to be tested would be the
following:
Factorial Design
Row-wise:
H0 : Average of rows are equal.
H1 : Average of rows are not equal
Column-wise:
H0 : Average of columns are equal.
H1 : Average of columns are not equal
Interaction:
H0 : Average of all interactions between rows and columns are
equal.
H1 : Average of all interactions between rows and columns are
not equal
Factorial Design
Total sum of squares (TSS), sum of squares due to rows (SSR)
& sum of squares due to columns (SSC) can be computed as
explained earlier. We also need to compute sum of squares due
to interaction (SSI).
Factorial Design