Chap6 ThreeSimpleClassificationMethods
Classification Methods
Common characteristics:
Data-driven, not model-driven
Make no assumptions about the data
Naïve Rule
Predictors:
Prior pending legal charges (yes/no)
Size of firm (small/large)
Charges?  Size   Outcome
y         small  truthful
n         small  truthful
n         large  truthful
n         large  truthful
n         small  truthful
n         small  truthful
y         small  fraud
y         large  fraud
n         large  fraud
y         large  fraud
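As a baseline for the ten records above, the naïve rule can be sketched in a few lines. This assumes the usual definition of the rule (assign every record to the most prevalent class and ignore the predictors); the function name `naive_rule` is illustrative only.

```python
from collections import Counter

# The ten records from the table above: (charges, size, outcome).
records = [
    ("y", "small", "truthful"),
    ("n", "small", "truthful"),
    ("n", "large", "truthful"),
    ("n", "large", "truthful"),
    ("n", "small", "truthful"),
    ("n", "small", "truthful"),
    ("y", "small", "fraud"),
    ("y", "large", "fraud"),
    ("n", "large", "fraud"),
    ("y", "large", "fraud"),
]

def naive_rule(records):
    """Return the majority class and its share, ignoring all predictors.

    Assumption: the naive rule is 'classify everything as the most
    prevalent class'.
    """
    counts = Counter(outcome for _, _, outcome in records)
    majority, n = counts.most_common(1)[0]
    return majority, n / len(records)

majority_class, share = naive_rule(records)
print(majority_class, share)
```

Six of the ten firms are truthful, so this baseline labels every firm "truthful" regardless of charges or size.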
Exact Bayes Calculations
Goal: classify a small firm with charges filed as “fraudulent” or as “truthful”
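A minimal sketch of the exact Bayes calculation for this goal, using the ten records from the table. It assumes exact Bayes here means: keep only the records whose predictor values match the new firm exactly, then take the class proportions among them. The function name `exact_bayes` is hypothetical.

```python
# The ten records from the table: (charges, size, outcome).
records = [
    ("y", "small", "truthful"),
    ("n", "small", "truthful"),
    ("n", "large", "truthful"),
    ("n", "large", "truthful"),
    ("n", "small", "truthful"),
    ("n", "small", "truthful"),
    ("y", "small", "fraud"),
    ("y", "large", "fraud"),
    ("n", "large", "fraud"),
    ("y", "large", "fraud"),
]

def exact_bayes(records, charges, size):
    """Class proportions among records matching both predictor values exactly."""
    matches = [outcome for c, s, outcome in records
               if c == charges and s == size]
    if not matches:
        # Exact Bayes has no estimate when no record shares this profile.
        raise ValueError("no records match this predictor combination")
    return {cls: matches.count(cls) / len(matches)
            for cls in sorted(set(matches))}

# Small firm with charges filed.
probs = exact_bayes(records, charges="y", size="small")
print(probs)
```

Only two records match (one truthful, one fraud), so each class gets probability 1/2. With so few matching records the estimate is unstable, and the final label depends on the cutoff convention chosen.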
k    % Error Training   % Error Validation
1     0.00              33.33
2    16.67              33.33
3    11.11              33.33
4    22.22              33.33
5    11.11              33.33
6    27.78              33.33
7    22.22              33.33
8    22.22              16.67   <--- Best k
9    22.22              16.67
10   22.22              16.67
11   16.67              33.33
12   16.67              16.67
13   11.11              33.33
14   11.11              16.67
15    5.56              33.33
16   16.67              33.33
17   11.11              33.33
18   50.00              50.00
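The "Best k" marker can be reproduced with a short sketch. It rests on two assumptions that the table itself does not state: the last column is the validation error, and ties are broken in favor of the smallest k. The name `best_k` is illustrative.

```python
# (k, first error column, last error column), copied from the table above.
# Assumption: the last column is the validation error used to choose k.
error_table = [
    (1, 0.00, 33.33), (2, 16.67, 33.33), (3, 11.11, 33.33),
    (4, 22.22, 33.33), (5, 11.11, 33.33), (6, 27.78, 33.33),
    (7, 22.22, 33.33), (8, 22.22, 16.67), (9, 22.22, 16.67),
    (10, 22.22, 16.67), (11, 16.67, 33.33), (12, 16.67, 16.67),
    (13, 11.11, 33.33), (14, 11.11, 16.67), (15, 5.56, 33.33),
    (16, 16.67, 33.33), (17, 11.11, 33.33), (18, 50.00, 50.00),
]

def best_k(rows):
    """Return the k with the lowest last-column error; ties go to the smallest k."""
    return min(rows, key=lambda row: (row[2], row[0]))[0]

chosen_k = best_k(error_table)
print(chosen_k)
```

Several values of k (8, 9, 10, 12, 14) share the minimum of 16.67, so the tie-breaking rule determines which one is reported; taking the smallest agrees with the marker at k = 8.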
Using K-NN for Prediction (for Numerical Outcome)
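A minimal sketch of k-NN prediction for a numerical outcome, assuming the common approach of averaging the outcomes of the k nearest neighbors under Euclidean distance. The data and the name `knn_predict` are hypothetical and not taken from the slides.

```python
import math

def knn_predict(train, query, k):
    """Predict a numerical outcome as the mean outcome of the k nearest points.

    train: list of (features, outcome) pairs, features being a tuple of numbers.
    Assumption: unweighted average and Euclidean distance.
    """
    if not 1 <= k <= len(train):
        raise ValueError("k must be between 1 and the number of training points")
    nearest = sorted(train, key=lambda row: math.dist(row[0], query))[:k]
    return sum(outcome for _, outcome in nearest) / k

# Hypothetical toy data: (features, numerical outcome).
train = [
    ((1.0, 1.0), 10.0),
    ((2.0, 2.0), 20.0),
    ((3.0, 3.0), 30.0),
    ((10.0, 10.0), 100.0),
]

prediction = knn_predict(train, (2.0, 2.0), k=3)
print(prediction)
```

The three closest points have outcomes 20, 10 and 30, so the prediction is their mean, 20. In practice the predictors would typically be normalized first so that no single variable dominates the distance.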