Professional Documents
Culture Documents
Bayesian Classifiers
by
Tan, Steinbach, Karpatne, Kumar
• Approach:
– compute posterior probability P(Y | X1, X2, …, Xd) using
the Bayes theorem
P ( X 1 X 2 X d | Y ) P (Y )
P (Y | X 1 X 2 X n )
P( X 1 X 2 X d )
e g e g t in s s
c at c at c o n
cl
a
Tid Refund Marital
Status
Taxable
Income Evade
• Normal distribution:
( X i ij ) 2
1 Yes Single 125K No 1 2 ij2
No
P( X i | Y j ) e
2 No Married 100K
2 2
ij
3 No Single 70K No
4 Yes Married 120K No
5 No Divorced 95K Yes – One for each (Xi,Yi) pair
6 No Married 60K No
7 Yes Divorced 220K No
• For (Income, Class=No):
8 No Single 85K Yes
9 No Married 75K No – If Class=No
10
10 No Single 90K Yes sample mean = 110
sample variance = 2975
1
( 120110 ) 2
Name Give Birth Can Fly Live in Water Have Legs Class
human yes no no yes mammals
A: attributes
python no no no no non-mammals M: mammals
salmon no no yes no non-mammals
whale yes no yes no mammals N: non-mammals
frog no no sometimes yes non-mammals
komodo no no no yes non-mammals
6 6 2 2
bat yes yes no yes mammals P( A | M ) 0.06
pigeon
cat
no
yes
yes
no
no
no
yes
yes
non-mammals
mammals
7 7 7 7
leopard shark yes no yes no non-mammals 1 10 3 4
turtle no no sometimes yes non-mammals P( A | N ) 0.0042
penguin no no sometimes yes non-mammals 13 13 13 13
porcupine yes no no yes mammals
eel no no yes no non-mammals 7
salamander no no sometimes yes non-mammals P( A | M ) P ( M ) 0.06 0.021
gila monster no no no yes non-mammals 20
platypus no no no yes mammals
13
owl
dolphin
no
yes
yes
no
no
yes
yes
no
non-mammals
mammals
P( A | N ) P ( N ) 0.004 0.0027
eagle no yes no yes non-mammals
20
D
D is parent of C
A is child of C
C B is descendant of D
D is ancestor of A
A B
X1 X2 X3 X4 ... Xd
Exercise Diet
Blood
Chest Pain
Pressure