Professional Documents
Culture Documents
Bai 4 Phan L P Classification
Bai 4 Phan L P Classification
Khai ph d liu
Khai ph d liu
Phn lp l g?
Mc ch: d on nhng nhn phn lp cho cc b d liu/mu mi u vo: mt tp cc mu d liu hun luyn, vi mt nhn phn lp cho mi mu d liu u ra: m hnh (b phn lp) da trn tp hun luyn v nhng nhn phn lp
Khai ph d liu 3
Khai ph d liu
D on l g?
Tng t vi phn lp o xy dng mt m hnh o s dng m hnh d on cho nhng gi tr cha bit Phng thc ch o: Git li o hi quy tuyn tnh v nhiu cp o hi quy khng tuyn tnh
Khai ph d liu
Phn lp so vi d bo
Phn lp: o d on cc nhn phn lp o phn lp d liu da trn tp hun luyn v cc gi tr trong mt thuc tnh phn lp v dng n xc nh lp cho d liu mi D bo: o xy dng m hnh cc hm gi tr lin tc o d on nhng gi tr cha bit
Khai ph d liu
Xy dng m hnh
Mi b/mu d liu c phn vo mt lp c xc nh trc
Bc 1 Lp ca mt b/mu d liu c xc nh bi thuc tnh gn nhn lp
Tp cc b/mu d liu hun luyn tp hun luyn - c dng xy dng m hnh M hnh c biu din bi cc lut phn lp, cc cy quyt nh hoc cc cng thc ton hc
Khai ph d liu 8
S dng m hnh
Phn lp cho nhng i tng mi hoc cha c phn lp nh gi chnh xc ca m hnh o lp bit trc ca mt mu/b d liu em kim tra c so snh vi kt qu thu c t m hnh o t l chnh xc = phn trm cc mu/b d liu c phn lp ng bi m hnh trong s cc ln kim tra
Bc 2
Khai ph d liu
V d: xy dng m hnh
D liu hun luyn
Cc thut ton phn lp
NAME RANK YEARS TENURED B phn lp Mary Assistant Prof 3 no (M hnh) James Assistant Prof 7 yes Bill Professor 2 no IF rank = professor John Associate Prof 7 yes OR years > 6 Mark Assistant Prof 6 no THEN tenured = yes Annie Associate Prof 3 no
Khai ph d liu 10
V d: s dng m hnh
B phn lp D liu kim tra D liu cha phn lp (Jeff, Professor, 4)
NAME Tom Lisa Jack Ann RANK YEARS TENURED Assistant Prof 2 no Associate Prof 7 no Professor 5 yes Assistant Prof 7 yes
Khai ph d liu
Tenured?
Yes
11
Chun b d liu
Lm sch d liu o nhiu o cc gi tr trng Phn tch s lin quan (chn c trng) Bin i d liu
Khai ph d liu
12
Khai ph d liu
13
Qui np cy quyt nh
A? B? C?
D?
Yes
Cy quyt nh l mt cy trong nt trong = mt php kim tra trn mt thuc tnh nhnh ca cy = u ra ca mt php kim tra nt l = nhn phn lp hoc s phn chia vo lp
Khai ph d liu 14
To cy quyt nh
Hai giai on to cy quyt nh: xy dng cy o bt u, tt c cc mu hun luyn u gc o phn chia cc mu da trn cc thuc tnh c chn o kim tra cc thuc tnh c chn da trn mt o thng k hoc heuristic thu gn cy o xc nh v loi b nhng nhnh nhiu hoc tch khi nhm
Khai ph d liu 15
Nhit
nng nng nng m p mt mt mt m p mt m p m p m p nng m p
Khai ph d liu
m
cao cao cao cao va va va cao va va va cao va cao
Gi
khng khng khng khng khng c c khng khng khng c c khng c
Lp
N N P P P N P N P P P P P N
16
u m P va c
ma
gi
khng N P
17
m
cao N P va
Mi mt ng dn t gc n l trong cy to thnh mt lut Mi cp gi tr thuc tnh trn mt ng dn to nn mt s lin Nt l gi quyt nh phn lp d on Cc lut to c d hiu hn cc cy
Khai ph d liu 18
Khai ph d liu
20
Khai ph d liu
21
pn
Khai ph d liu
23
Ta c
Do
E (thoitiet)
14
I (2,3)
Tng t
Khai ph d liu
27
Khai ph d liu
30
Khai ph d liu
31
Phn lp Bayes
Bi ton phn lp c th hnh thc ha bng xc sut a-posteriori: P(C|X) = xc sut mu X=<x1,,xk> thuc v lp C V d P(class=N | outlook=sunny,windy=true,) tng: gn cho mu X nhn phn lp l C sao cho P(C|X) l ln nht
Khai ph d liu 32
P(nng | n) = 3/5 P(u m | n) = 0 P(ma | n) = 2/5 P(nng | n) = 2/5 P(m p | n) = 2/5 P(mt | n) = 1/5
Khai ph d liu
35
Mng Neural Phn lp k lng ging gn nht Suy lun da vo trng hp Thut ton di truyn Hng tp th Cc hng tp m
Khai ph d liu
38
Tm tt (1)
Phn lp l mt vn nghin cu bao qut Phn ln c kh nng l mt trong nhng k thut khai ph d liu c dng rng ri nht vi rt nhiu m rng
Khai ph d liu
40
Tm tt (2)
Tnh uyn chuyn vn ang l mt vn quan trng ca tt cc ng dng c s d liu Cc hng nghin cu: phn lp d liu khngquan h, v d nh text, khng gian v a phng tin
Khai ph d liu
41