Chapter 5 (Part Ii)

CHAPTER 5 (PART II)
Statistical Inference
Prepared by: Nur Liyana Mohamed Yousop
ANOVA
ANALYSIS OF VARIANCE (ANOVA)
One Way ANOVA
Used to compare the means of two, three or more population groups.
ANOVA derives its name from the fact that we are analyzing variances in
the data (analysis of variances).
ANOVA measures variation between groups relative to variation within

groups.
Each of the population groups is assumed to come from a normally

distributed population.
ASSUMPTIONS OF ONE WAY ANOVA
The m groups or factor levels

being studied represent
populations whose outcome
measures: If these assumptions
1. are randomly and are violated, then the
independently obtained level of significance
and the power of the
2. are normally distributed test can be affected.
3. have equal variances.
T-TEST VS ANOVA
If we conduct
multiple samples, it
will have a
However, using t- compounded effect
test would not be on the error rate of
reliable in cases result.
When we have where there are
only two samples, more than 2
t-test and ANOVA samples
give the same
results.
ANALYSIS OF VARIANCE (ANOVA)
One Way ANOVA
F = VARIANCE BETWEEN SAMPLES

VARIANCE WITHIN SAMPLES
A one way ANOVA is used to compare two or more means from

two or more independent (unrelated) groups using the F-
distribution. The null hypothesis for the test is that the two or
more means are equal.
Excel Analysis ToolPak to test one way ANOVA is:

 Anova: Single Factor
HOW ANOVA WORKS
One Way ANOVA
Why ANOVA not ANOME?

While comparing means its analyses its Basically, ANOVA compares two or
more types of variances: the
variances variance within each sample and
the variance between different
samples.
The black dotted arrows show the
per-sample variation of the
individual data points around the
sample mean (the variance within).
The red arrows show the variation

of the sample means around the
grand mean (the variance
between).
Then calculate the F-Test

** Grand mean = Mean of overall samples
EXAMPLE 1
Difference in Insurance Survey Data
Determine whether any significant differences exist in satisfaction among

individuals with different levels of education.
The variable of interest is called a factor. In this example, the factor is the
educational level, and we have three categorical levels of this factor,
college graduate, graduate degree, and some college.
SOLUTION 1
Applying the Excel ANOVA Tool
Data Analysis tool: ANOVA: Single Factor

 The input range of the data must be in contiguous columns
0.025
SOLUTION 1
Applying the Excel ANOVA Tool
CHI-SQUARE
NON-PARAMETRIC METHOD
INTRODUCTION
σ2 is
unknown/known Nominal @ Categorical
scale data
(E.g. Gender, State of
Birth, Brand)
Case III
NON-PARAMETRIC
n<30 Population is not
normal METHOD
CHI-SQUARE TEST FOR INDEPENDENCE
Test for
• H0: two categorical variables independence
are independent of two
• H1: two categorical variables categorical
are dependent variables.
EXAMPLE 2
Independence and Marketing Strategy
Energy Drink Survey data. A key marketing question is whether the proportion
of males who prefer a particular brand is no different from the proportion of
females.
 If gender and brand preference are indeed independent, we would expect
that about the same proportion of the sample of female students would also
prefer brand 1.
 If they are not independent, then advertising should be targeted differently
to males and females, whereas if they are independent, it would not matter.
CHI-SQUARE TEST CALCULATIONS
 Step 1
◦ Using a cross-tabulation of the data, compute the expected frequency if
the two variables are independent.
 Step 2
◦ Compute a test statistic, called a chi-square statistic, which is the sum
of the squares of the differences between observed frequency, fo, and
expected frequency, fe, divided by the expected frequency in each cell:
CHI-SQUARE DISTRIBUTION
The sampling distribution of C2 is a special distribution called the chi-

square distribution.
 The chi-square distribution is characterized by degrees of freedom.
 Step 3
◦ Compare the chi-square statistic for the level of significance a to the
critical value from a chi-square distribution with (r – 1)(c – 1) degrees
of freedom, where r and c are the number of rows and columns in the
cross-tabulation table, respectively.
The Excel function CHISQ.INV.RT(probability, deg_ freedom) returns the
value of C2 that has a right-tail area equal to probability for a specified
degree of freedom.
By setting probability equal to the level of significance, we can obtain the

critical value for the hypothesis test.
The Excel function CHISQ.TEST(actual_range, expected_range) computes

the p-value for the chi-square test.
EXAMPLE 2
Conducting the Chi-square Test
Result
Test statistic = 6.49
d.f. = (2 – 1)(3 – 1) = 2
Critical value =
CHISQ.INV.RT(0.05,2) = 5.99
p-value =
CHISQ.TEST(F6:H7,F12:H13) =
0.0389
Reject H0
Test statistic
END OF CHAPTER 5

Chapter 5 (Part Ii)

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Chapter 5 (Part Ii)

Uploaded by

Copyright:

Available Formats

CHAPTER 5 (PART II)

Used to compare the means of two, three or more population groups.

ANOVA measures variation between groups relative to variation within

Each of the population groups is assumed to come from a normally

The m groups or factor levels

F = VARIANCE BETWEEN SAMPLES

A one way ANOVA is used to compare two or more means from

Excel Analysis ToolPak to test one way ANOVA is:

Why ANOVA not ANOME?

The red arrows show the variation

Then calculate the F-Test

Determine whether any significant differences exist in satisfaction among

Data Analysis tool: ANOVA: Single Factor

The sampling distribution of C2 is a special distribution called the chi-

By setting probability equal to the level of significance, we can obtain the

The Excel function CHISQ.TEST(actual_range, expected_range) computes

You might also like