Professional Documents
Culture Documents
K-Mean v ng dung
NI DUNG CHNH
Phn cm
II.
III.
K-Mean v ng dung
I.
I. PHN CM
1.
K-Mean v ng dung
Phn cm l g?
Qu trnh phn chia 1 tp d liu ban u thnh cc
cm d liu tha mn:
I. PHN CM
K-Mean v ng dung
Nu X : 1 tp cc im d liu
Ci : cm th i
X = C1 Ck
Ci
Cj =
ngoi lai
I. PHN CM
2. Mt s o trong phn cm
Minkowski
p
(||
x
y
||
i i )
1
p
i 1
Euclidean
p=2
K-Mean v ng dung
v.w
cos = || v || . || w ||
5
I. PHN CM
3.
Mc ch ca phn cm
Xc nh c bn cht ca vic nhm cc i tng trong 1 tp
d liu khng c nhn.
K-Mean v ng dung
I. PHN CM
5.
Phn
cm phn cp
Phn
cm da trn mt
Phn
cm da trn li
Phn
cm da trn m hnh
Phn
cm c rng buc
K-Mean v ng dung
Phn
K-Mean v ng dung
c im:
Mi
S cm: K
Output
Cc cm Ci ( i = 1 K) tch ri v hm tiu chun E t
gi tr ti thiu.
xi R
K-Mean v ng dung
Input
Tp cc i tng X = {xi| i = 1, 2, , N},
K-Mean v ng dung
i 1 xi C j
(|| xi c j || )
2
trong cj l trng tm ca cm Cj
K-Mean v ng dung
11
Bc 1 - Khi to
Chn K trng tm {ci} (i = 1K).
( t )= {
i
( tfor
) all
= 1,*, k}
Bc 3 - Cp nht li trng tm
1
c
(t ) x j
Si | x j Si( t )
Bc 4 iu kin |dng
( t 1)
i
12
Khong cch cc
i tng n cc
trng tm
Nhm cc i
tng vo cc cm
K-Mean v ng dung
Trng tm
Khng c
i tng
chuyn
nhm
Kt thc
13
II.3 V D MINH HA
i tng
K-Mean v ng dung
14
II.3 V D MINH HA
Bc 1: Khi to
Chn 2 trng tm ban u:
c1(1,1) A v c2(2,1) B, thuc 2 cm 1 v 2
K-Mean v ng dung
15
II.3 V D MINH HA
1)
(3
1)
d(C, c1) =
(4 2) 2 (3 1) 2
=8
d(C, c1) > d(C, c2)
d(D, c1) =
C thuc cm 2
K-Mean v ng dung
= 13
d(C, c2) =
(5 1) 2 (4 1) 2
= 25
2
2
d(D, c2) = (5 2) (4 1)
= 18
d(D,c1) > d(D, c2)
D thuc cm 2
16
II.3 V D MINH HA
Bc 3: Cp nht li v tr trng tm
Trng tm cm 1 c1 A (1, 1)
2 4 5 1 3 4
,
)
3
3
K-Mean v ng dung
Trng tm cm 2 c2 (x,y) = (
17
II.3 V D MINH HA
A thuc cm 1
d(B, c1 ) = 1 < d(B, c2 ) = 5.56
B thuc cm 1
d(C, c1 ) = 13 > d(C, c2 ) = 0.22
K-Mean v ng dung
C thuc cm 2
d(D, c1 ) = 25 > d(D, c2 ) = 3.56
D thuc cm 2
18
II.3 V D MINH HA
K-Mean v ng dung
19
II.3 V D MINH HA
A thuc cm 1
d(B, c1 ) = 0.25 < d(B, c2 ) = 12.5
B thuc cm 1
d(C, c1 ) = 10.25 < d(C, c2 ) = 0.5
K-Mean v ng dung
Bc 4-3: Lp li bc 2
d(A, c1 ) = 0.25 < d(A, c2 ) = 18.5
C thuc cm 2
d(D, c1 ) = 21.25 > d(D, c2 ) = 0.5
D thuc cm 2
20
II.3 V D MINH HA
K-Mean v ng dung
21
3.
4.
5.
6.
7.
K-Mean v ng dung
2.
phc tp: O( K .N .l ) vi l: s ln lp
C kh nng m rng, c th d dng sa i vi
nhng d liu mi.
Bo m hi t sau 1 s bc lp hu hn.
Lun c K cm d liu
Lun c t nht 1 im d liu trong 1 cm d liu.
Cc cm khng phn cp v khng b chng cho d
liu ln nhau.
Mi thnh vin ca 1 cm l gn vi chnh cm hn
bt c 1 cm no khc.
22
2.
4.
5.
K-Mean v ng dung
3.
23
K-Mean v ng dung
24
K-Mean v ng dung
25
K-Mean v ng dung
26
Ti liu chnh: [WKQ08] Xindong Wu, Vipin Kumar, J. Ross Quinlan, Joydeep
Ghosh, Qiang Yang, Hiroshi Motoda, Geoffrey J. McLachlan, Angus Ng, Bing Liu, Philip
http://en.wikipedia.org/wiki/K-means_clustering
http://en.wikipedia.org/wiki/Segmentation_(image_processing)
http://vi.wikipedia.org/wiki/Hc_khng_c_gim_st
http://people.revoledu.com/kardi/tutorial/kMean/NumericalExample.htm
K-Mean v ng dung
S. Yu , Zhi-Hua Zhou, Michael Steinbach, David J. Hand, Dan Steinberg (2008). Top 10
27
K-Mean v ng dung
28