Professional Documents
Culture Documents
Chi-Square test
A chi-square (χ2) statistic is a test that measures how a model compares to actual observed data. The data used
in calculating a chi-square statistic must be random, raw, mutually exclusive, drawn from independent variables, and
drawn from a large enough sample.
Chi-square tests are often used in hypothesis testing. The chi-square statistic compares the size any
discrepancies between the expected results and the actual results, given the size of the sample and the number of
variables in the relationship.
Assumptions
When you choose to analyze your data using a chi-square test for independence, you need to make sure that
the data you want to analyze "passes" two assumptions. You need to do this because it is only appropriate to use a chi-
square test for independence if your data passes these two assumptions. If it does not, you cannot use a chi-square test
for independence. These two assumptions are:
1. Your two variables should be measured at an ordinal or nominal level (i.e., categorical data).
2. Your two variables should consist of two or more categorical, independent groups. Example
independent variables that meet this criterion include sex (2 groups: Males and Females), physical
activity level (e.g., 4 groups: sedentary, low, moderate and high), profession (e.g., 5 groups: surgeon,
doctor, nurse, dentist, therapist), and so forth.
Chi-square of Independence
When considering student sex and course choice, a χ2 test for independence could be used. To do this test, the
researcher would collect data on the two chosen variables (sex and courses picked) and then compare the frequencies
at which male and female students select among the offered classes using the formula given above and a χ2 statistical
table.
The Chi-Square test of independence is used to determine if there is a significant relationship between two
nominal (categorical) variables. The frequency of each category for one nominal variable is compared across the
categories of the second nominal variable.
Moreover, the chi-square test for independence, also called Pearson's chi-square test or the chi-square test of
association, is used to discover if there is a relationship between two categorical variables.
Example:
Is sex independent of education level? A random sample of 45 people were surveyed and each person was asked
to report the highest education level they obtained.
Are sex and education level dependent at 5% level of significance? In other words, given the data collected, is
there a relationship between the gender of an individual and the level of education that they have obtained?
Sex Educational Level Sex Educational Level
Female High School Male Bachelor
Female High School Male Bachelor
Female High School Male Bachelor
Female High School Male Bachelor
Female High School Male Bachelor
Female High School Male Bachelor
Female High School Male Bachelor
Female High School Female Masters
Male High School Female Masters
Male High School Female Masters
Male High School Female Masters
Male High School Female Masters
Male High School Female Masters
Female Bachelor Male Masters
Female Bachelor Male Masters
Female Bachelor Male Masters
Female Bachelor Male Masters
Female Bachelor Male Masters
Female Bachelor
Female Bachelor
Female Bachelor
Female Bachelor
Female Bachelor
Female Bachelor
Male Bachelor
Male Bachelor
Male Bachelor
Step 1 Open the data and select Frequencies and click Contingency tables
Based from the results, there was no significant relationship between the gender of an individual and the level
of education [X2 (2) = 0.279, p = 0.870] at 0.05 level of significance. Since the p value was greater than 0.05.
Activity
A study was done on opinions concerning the legalization of marijuana at Paglaum College. One hundred fifty-
seven respondents were randomly selected from a large pool of faculty, students, and parents at the college.
Respondents were given a choice of favoring the legalization of marijuana, opposing he legalization of marijuana, or
favoring making marijuana a legal but controlled substance. The results of the survey were as follows.
Assumptions
When you choose to analyze your data using Spearman’s correlation, part of the process involves checking to
make sure that the data you want to analyze can actually be analyzed using a Spearman’s correlation. You need to do
this because it is only appropriate to use a Spearman’s correlation if your data "passes" three assumptions that are
required for Spearman’s correlation to give you a valid result.
1. Your two variables should be measured on an ordinal, interval or ratio scale.
2. Your two variables represent paired observations.
3. There is a monotonic relationship between the two variables.
Example:
A study was conducted to the male employees of a certain LGU to check the relationship between smoking and
longevity. A sample of 15 men who are selected purposively, ages above 50 years old was taken and the average number
of cigarettes smoked per day and the age at death was recorded, as summarized in the table. Can we conclude from the
sample that longevity has a relationship with smoking?
Longevity Cigarettes Longevity Cigarettes
80 5 75 19
78 23 68 14
60 25 72 35
53 48 58 29
85 17 92 4
85 8 65 23
73 4
79 26
81 11
Step 1. Open the data and Select Regression and Click Correlation Matrix
Based from the result, there was a high negative relationship between longevity and cigarette smoking of the
respondents [rs (13) = -0.680, p = 0.001] at 0.05 level of significance. This implies that the lower the number of cigarette
smoking, the higher the life expectancy of the respondents. Or, the higher the number of cigarette smoking, the lower
the life expectancy of the respondents.
Activity
Ten NBI agents where identified conveniently by the researcher to be included to his study. The purpose of the
study is to determine if the physical activity has any relation to our blood pressure. Utilizing the table of data below, Is
there significant relationship between the physical activity and blood pressure at 0.05 level of significance?
Physical Blood Pressure
Activities (min) (mm Hg)
60 118
55 117
25 120
50 121
40 119
45 122
35 123
10 124
30 125
20 126
References: