You are on page 1of 2

PROBABILITY & STATISTICS

CHI - SQUARE TEST


Amaresh Baranwal - 2021301073
Pratham Maner - 2021301074

Introduction
The Chi-square test of independence (also known as the Pearson Chi-square test, or simply
the Chi square) is one of the most useful statistics for testing hypotheses when the variables
are nominal, as often happens in clinical research. Unlike most statistics, the Chi-square
(χ2) can provide information not only on the significance of any observed differences but
also provides detailed information on exactly which categories account for any differences
found. Thus, the amount and detail of information this statistic can provide renders it one
the most useful tools in the researcher’s array of available analysis tools. As with any
statistic, there are requirements for its appropriate use, which are called “assumptions” of
the statistic. Additionally, the χ2 is a significance test, and should always be coupled with an
appropriate test of strength.

Assumptions of the Chi-square


1) The data in the cells should be frequencies or counts of cases rather than percentages.
2) The levels (or categories) of the variables are mutually exclusive.
3) Each subject may contribute data to one and only one cell in the χ2.
4) The study groups must be independent. This means that a different test must be used if
the two groups are related.
5) The value of the cell expected should be 5 or more in at least 80% of the cells, and no
cell should have an expected value of less than one (3). This assumption is most likely to
be met if the sample size equals at least the number of cells multiplied by 5.
6) The value of the cell expected should be 5 or more in at least 80% of the cells, and no
cell should have an expected value of less than one (3). This assumption is most likely to
be met if the sample size equals at least the number of cells multiplied by 5.

The formula for Chi-Square Test:

Where, O is the observed frequency, E is the expected


frequency. χ2 is the cell Chi-square value. Σ χ2 is the formula
instruction to sum all the Chi-Square values. x2 i-j = i-j
Steps:
Step 1: Propose a null hypothesis (Ho) and an alternate hypothesis (Ha).
Step 2: Calculate the expected values, based on the hypothesis.
Step 3: Apply the chi square formula.
Step 4: Calculation of Degree Of Freedom. Step 5: Interpret the chi square value.
Applications of the Chi-square:
1. Testing the divergence of observed results from expected results when our expectations
are based on the hypothesis of equal probability.
2. Chi-square test when expectations are based on a normal distribution.
3. Chi-square test when our expectations are based on predetermined results.
4. Correction for discontinuity or Yates’ correction in calculating χ2 .
5. Chi-square test of independence in contingency tables.
Conclusion:
 The Chi-square is a valuable analysis tool that provides considerable
information about the nature of research data.
 It is a powerful statistic that enables researchers to test hypotheses about
variables measured at the nominal level. As with all inferential statistics, the
results are most reliable when the data are collected from randomly selected
subjects, and when sample sizes are sufficiently large that they produce
appropriate statistical power.
 The Chi-square is also an excellent tool to use when violations of assumptions
of equal variances and homoscedastic are violated and parametric statistics such
as the t-test and ANOVA cannot provide reliable results.
References:
 Chi-square test and its application in hypothesis testing Article in Journal of the
Practice of Cardiovascular Sciences · January 2015
 The Chi-square test of independence Mary L. McHugh
 The Chi-Square Test: Often Used and More Often Misinterpreted DOI:
10.1177/1098214011426594 American Journal of Evaluation 2012 33: 448
originally published online 8 November 2011 Todd Michael Franke, Timothy
Ho and Christina A. Christie

You might also like