Professional Documents
Culture Documents
Contingency Tables
Learning Objectives
Test of association
Contingency Table
You have learn ANOVA for comparing 3 or more
population means .
What if you have 3 or more population proportions to
compare?
You can’t use one or two proportion test anymore .
Here , we can use contingency table using
2 test statistics.
vs vs
P1 P2 P3
Practical Question Are all populations’ proportions
(example) statistically different?
Statistical Question
Ho: 1 = 2 =3
Ha: at least one i is different
s2
E = the expected value = (Xr * Yc) / Ftotal n 1
r = number of rows
c = number of columns
Xr = total frequency for that row
Yc = total frequency for that column
Ftotal = total frequency for that table
Each row or each column has a total count associated with it.
The actual counts are compared to the expected counts for all
cells. The likelihood of the observed variation is evaluated using a
statistical test called the Goodness-of-Fit Test.
Contingency Table
P1 P2 P3 P4
Bad 2 4 3 1 X1= 10
Good 0 3 2 5 X2 = 10
Y1=2 Y2 =7 Y3=5 Y4=6 Ftotal=20
( calc )
i 1 j 1 Eij
Seagate Confidential 8 Supplier Six Sigma Modular Training
Contingency Tables
r x c Contingency Table
Column Factor Yj
1 2 3 c Totals
Row 1 O11 O12 O13 O1c X1
Factor 2 O21 O22 O23 O2c X2
Xi
3 O31 O32 O33 O3c X3
( calc )
i 1 j 1 Eij
Seagate Confidential 9 Supplier Six Sigma Modular Training
Contingency Tables
4. Click OK
( calc )
i 1 j 1 Eij
Seagate Confidential 14 Supplier Six Sigma Modular Training
Contingency Tables
Learning Objectives
Test of association
Roadmap
Ha: P1 P2
p > 0.05 Fail to Reject Ho (null) Proportions Minitab:
p < 0.05 Reject Ho Testing Stat -Bsc Stat -
1 or 2 Proportions
Test of Association
A contingency table is used to analyze data via a two way
classification (involving two factors). The data are usually attribute
in nature (frequency counts), although they need not be.
This tool is used to test the relationship between two sources of
variation. The relationship can be statistically described as follows:
Ho: “ {factor A} is independent of {factor B} ”
Ha: “ {factor A} is NOT independent of {factor B} “
= =
2(calc) = ( )+( )
+( )+( ) = _____
Solution:
A. Practical Problem: Does a particular gage create more or less defects depending
on the test method?
B. Ho: Test method is independent of the gage.
C. Ha: Test method is NOT independent of gage.
D. Determine the test statistic (calc). (We’ll use Minitab.)
Seagate Confidential 22 Supplier Six Sigma Modular Training
Contingency Tables
Chi-Square Test
Expected counts are printed below observed counts
Method A Method B Method C Total
1 37 41 44 122
29.28 45.95 46.77
2 35 72 71 178
42.72 67.05 68.23
Total 72 113 115 300
ChiSq = 2.035 + 0.534 + 0.164 +
Problem:
Ho : Results of surgical procdeure are not hospital dependent Xr * Y c
Ha : Results of surgical procdeure are hospital dependent E
Ftotal
=
Hosp A Hosp B Hosp C Hosp D Hosp E r c
NI( Observed) 13 5 8 21 43
ij -Eij)2 / Eij
(calc) O
i 1 j 1
(Expected )
(chi-sq)
(calc) = 56.705
SI( Observed) 18 10 36 56 29
(Expected )
(chi-sq)
GI( Observed) 16 16 35 51 10
(Expected )
(chi-sq)
End of Topic
What question do you have?
The 2 Distribution
Use discrete, nominal or Chi-square distribution
category data (no for various degrees of freedom ()
0.5
ranking, variable or ratio
0.4 =2
Observations must be
independent. No repeat 0.35
measurements on the 0.3
same part.
0.25
(R-1)(C-1)= df
0.2 =4
generally works best 0.15
with 5 or more
0.1
observations in each
cell. 0.05 =6 = 10
0
2
0.1
1.2
2.3
3.4
4.5
5.6
6.7
7.8
8.9
11.1
12.2
13.3
14.4
15.5
16.6
17.7
18.8
19.9
10