
THÁI NGUYÊN UNIVERSITY

FACULTY OF INFORMATION TECHNOLOGY



AN HỒNG SƠN



A STUDY OF SOME FUZZY CLUSTERING METHODS
AND THEIR APPLICATIONS

MAJOR: COMPUTER SCIENCE
CODE: 60 48 01



MASTER OF SCIENCE THESIS



SUPERVISOR: Assoc. Prof. Dr. NGÔ QUỐC TẠO



THÁI NGUYÊN - 2008
Digitized by the Learning Resource Center, Thái Nguyên University http://www.lrc-tnu.edu.vn
TABLE OF CONTENTS

LIST OF ABBREVIATIONS
LIST OF FIGURES
Chapter 1 - OVERVIEW OF KNOWLEDGE DISCOVERY AND DATA MINING
1.1. General introduction to knowledge discovery and data mining
1.2. The knowledge discovery process
1.3. The data mining process
1.4. Data mining methods
1.5. Practical application areas of data mining
1.6. Basic approaches and techniques applied in data mining
1.7. Challenges and difficulties in knowledge discovery and data mining
1.8. Conclusion
Chapter 2 - DATA CLUSTERING AND DATA CLUSTERING ALGORITHMS
2.1. Concept and objectives of data clustering
2.2. Applications of data clustering
2.3. Requirements of clustering
2.4. Approaches to data clustering
2.4.1. Partitioning clustering methods
2.4.2. Hierarchical clustering methods
2.4.3. Density-based clustering methods
2.4.4. Grid-based clustering methods
2.4.5. Model-based clustering methods
2.4.6. Constraint-based clustering methods
2.5. Some basic data clustering algorithms
2.5.1. Partitioning clustering algorithms
2.5.2. Hierarchical clustering algorithms
2.5.3. Density-based clustering algorithms
2.5.4. Grid-based clustering algorithms
2.5.5. Model-based clustering algorithms
2.5.6. Constraint-based clustering algorithms
Chapter 3 - FUZZY DATA CLUSTERING TECHNIQUES
3.1. Overview of fuzzy clustering
3.2. Fuzzy clustering algorithms
3.2.1. The FCM (Fuzzy C-means) algorithm
3.2.1.1. Objective function
3.2.1.2. The FCM algorithm
3.2.2. The εFCM (ε-Insensitive Fuzzy C-means) algorithm
3.2.2.1. Objective function
3.2.2.2. The εFCM algorithm
3.2.3. The improved FCM algorithm
3.2.3.1. Algorithm 1: Selecting data points as candidates for the cluster centers
3.2.3.2. Algorithm 2: Pruning the candidates
3.2.3.3. Algorithm 3: Selecting the candidates that minimize the objective function
3.2.3.4. Algorithm 4: Assigning closely linked centers to one cluster
3.2.3.5. Summary of the improved FCM algorithm
Chapter 4 - A MULTI-SYNAPSE NEURAL NETWORK MODEL FOR FUZZY CLUSTERING
4.1. Overview of neural networks
4.2. Neural network structure
4.2.1. Activation function
4.2.2. Network connections
4.2.3. The network training problem
4.3. The HOPFIELD network
4.3.1. Training the network
4.3.2. Using the network
4.4. A multi-synapse neural network for clustering
4.4.1. Building Layer1 to optimize the cluster centers
4.4.2. Building Layer2 to optimize the membership degrees
4.5. Convergence of FBACN
4.5.1. Proof of the convergence of FBACN
4.5.2. Convergence of the continuous FBACN of Layer1
4.6. The FBACN algorithm and FBACN with learning
Chapter 5 - EXPERIMENTAL IMPLEMENTATION AND APPLICATION
5.1. Experimental implementation of the FCM algorithm
5.2. Applying the improved FCM algorithm to image recognition
CONCLUSION
REFERENCES

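LIST OF ABBREVIATIONS

CNTT    Công nghệ thông tin (information technology)
CSDL    Cơ sở dữ liệu (database)
CEF     Computational Energy Function
DL      Dữ liệu (data)
FBACN   Fuzzy Bi-directional Associative Clustering Network
        (multi-synapse neural network for fuzzy clustering)
FCM     Fuzzy C-Means
HMT     Hàm mục tiêu (objective function)
KPDL    Khai phá dữ liệu (data mining)
KPTT    Khám phá tri thức (knowledge discovery)
LKM     Liên kết mạng (network connection)
MH      Mô hình (model)
NDA     Nhận dạng ảnh (image recognition)
NN      Neural Network
PCM     Phân cụm mờ (fuzzy clustering)
PCDL    Phân cụm dữ liệu (data clustering)
TLTK    Tài liệu tham khảo (references)
TT      Thuật toán (algorithm)
XLA     Xử lý ảnh (image processing)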






LIST OF FIGURES

Figure 1.1   The knowledge discovery process
Figure 1.2   The data mining process
Figure 2.1   A loan data set partitioned into 3 clusters
Figure 2.2   Hierarchical clustering strategies
Figure 2.3   Hierarchical structure
Figure 2.4   Ways in which clusters can be formed
Figure 2.5   Initial settings for determining the cluster boundaries
Figure 2.6   Computing the centroids of the new clusters
Figure 2.7   Overview of the CURE algorithm
Figure 2.8   Data clusters discovered by CURE
Figure 2.9   Shapes of the clusters discovered by the DBSCAN algorithm
Figure 3.1   Simulation of a one-dimensional data set
Figure 3.2   Membership function with the centroid of cluster A in k-means
Figure 3.3   Membership function with the centroid of cluster A in FCM
Figure 3.4   Clusters discovered by the FCM algorithm
Figure 4.1   Neural network model
Figure 4.2   Supervised learning model
Figure 4.3   The FBACN model
Figure 4.4   Layer1 of FBACN
Figure 4.5   Layer2 of FBACN
Figure 5.1   Interface of the FCM algorithm at startup
Figure 5.2   Interface of the FCM algorithm while running
Figure 5.3   Program interface at startup
Figure 5.4   Program interface when selecting an image to cluster
Figure 5.5   Program interface while clustering
CHAPTER 1
OVERVIEW OF KNOWLEDGE DISCOVERY
AND DATA MINING
1.1. General introduction to knowledge discovery and data mining
1.2. The knowledge discovery process
1.3. The data mining process
1.4. Data mining methods
1.5. Practical application areas of data mining
1.6. Basic approaches and techniques applied in data mining
1.7. Challenges and difficulties in knowledge discovery and data mining
1.8. Conclusion

1.1. General introduction to knowledge discovery and data mining
If electronics and communication are regarded as the essence of electronic science, then data, information, and knowledge are now the focus of a new field of research and application: knowledge discovery and data mining.
We usually regard data as a string of bits, as numbers and symbols, or as "objects" that carry some meaning when passed to a program in a given form. Bits are used to measure information, which can be viewed as data stripped of redundancy and repetition and reduced to the minimum needed to characterize the data. Knowledge is integrated information, comprising facts and the relationships among them, that has been perceived, discovered, or studied. In other words, knowledge can be regarded as data at a high level of abstraction and generalization.
Knowledge discovery in databases is the process of identifying patterns or models in data that have the following properties: analytical, synthesized, valid, useful, and understandable.
Data mining is one step in the knowledge discovery process. It consists of specialized algorithms that, under acceptable limits on computational efficiency, find patterns or models in data. In other words, the goal of data mining is to find patterns or models that exist in the database but are hidden within large volumes of data.

1.2. The knowledge discovery process

Figure 1.1: The knowledge discovery process
The process consists of the following steps:
- Data cleaning: remove noisy and inconsistent data.
- Data integration: data from multiple sources may be combined.
- Data selection: select the data relevant to the analysis task from the database.
- Data transformation: transform or consolidate the data into a form suitable for mining.
- Data mining: the essential step, in which intelligent methods are applied to extract data patterns.
- Pattern evaluation: use some measure to determine the real benefit and importance of the patterns that represent knowledge.
- Knowledge presentation: at this stage, representation and visualization techniques are used to present the extracted knowledge to the user.

1.3. The data mining process
Data mining is an important stage of the knowledge discovery process. In essence, it is the only stage that uncovers new, latent information in the database, mainly for the purposes of description and prediction.
Description summarizes or expresses the general characteristics of the data attributes in the data store in a form that people can understand.
Prediction uses current data to infer rules from the relationships among data attributes. From the extracted patterns, it predicts unknown values or future values of the variables of interest.
The data mining process consists of the main steps shown in Figure 1.2:

Figure 1.2: The data mining process (diagram labels: define the task; identify relevant data; collect and preprocess data; direct data; summary statistics; data mining algorithm; patterns)
- Define the task: identify exactly which problems need to be solved.
- Identify the relevant data: these are used to build the solution.
- Collect and preprocess the data: collect the relevant data and preprocess them so that the data mining algorithm can work with them. This is a very difficult step that can run into many obstacles: the data may have to be copied several times (if they are extracted into files), the data sets must be managed, and the whole process may have to be repeated many times (if the data model changes), and so on.
- Data mining algorithm: choose a data mining algorithm and run it to find meaningful patterns. These patterns are represented as association rules, decision trees, and similar forms, according to their meaning.

1.4. Data mining methods
For the two purposes of data mining, description and prediction, the following methods are commonly used:
- Association rules
- Classification
- Regression
- Visualization
- Clustering
- Summarization
- Dependency modeling
- Model evaluation
- Evolution and deviation analysis
- Search methods
Of the many data mining methods listed above, the three most widely used by researchers are association rules, data classification, and data clustering.

1.5. Practical application areas of data mining
Data mining is a newly developed field, but it has attracted many researchers thanks to its practical applications. Typical application areas include:
- Data analysis and decision support
- Text classification, text summarization, Web page classification, and color image clustering
- Diagnosis of symptoms and choice of treatment methods in medicine
- Searching and matching genomes and genetic information in biology
- Analysis of financial and market conditions and stock price forecasting in finance, markets, and securities
- Analysis of marketing and customer data
- Control and scheduling
- Insurance
- Education, and others

1.6. Basic approaches and techniques applied in data mining
Data mining problems can be divided into the following main approaches:
- Classification and prediction: the process of assigning an object to one of a set of classes known in advance (for example, classifying patients from their medical records, or classifying geographic regions from weather data). This approach commonly uses machine learning techniques such as decision trees and artificial neural networks. This class of problems is also called supervised learning.
- Clustering (segmentation): arranging objects into natural data clusters, where the number and names of the clusters are not known in advance. Objects are grouped so that the similarity among objects in the same cluster is as large as possible and the similarity among objects in different clusters is as small as possible. This class of problems is also called unsupervised learning.
- Association rules: rules that represent knowledge in a fairly simple form (for example: 80% of students enroll in the Databases course, and as many as 60% of them also enroll in Information Systems Analysis and Design). This approach is widely applied in business, medicine, bioinformatics, education, telecommunications, finance, and the stock market.
- Sequential/temporal pattern analysis: similar to mining with association rules, but with the addition of order and time. A rule describing a sequential pattern typically has the form X -> Y, meaning that the occurrence of event X leads to the occurrence of event Y. This approach is widely applied in finance and the stock market because of its high predictive power.
- Concept description and summarization: this class of problems focuses on describing, synthesizing, and summarizing concepts (for example, text summarization).
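
To make the association-rule example above concrete, the following minimal sketch computes two measures that are standard for such rules but are not named in the text: support (how often an item set occurs) and confidence (how often the consequent occurs when the antecedent does). The function names and the enrollment records are hypothetical and chosen only for illustration.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    items = set(itemset)
    hits = sum(1 for t in transactions if items <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Among transactions containing `antecedent`, the fraction that also contain `consequent`."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0  # the rule never fires, so report zero confidence
    both = support(transactions, set(antecedent) | set(consequent))
    return both / base


# Hypothetical enrollment records: one set of courses per student.
enrollments = [
    {"Databases", "IS Analysis and Design"},
    {"Databases", "IS Analysis and Design"},
    {"Databases", "IS Analysis and Design"},
    {"Databases"},
    {"Networks"},
]

rule_support = support(enrollments, {"Databases"})
rule_confidence = confidence(enrollments, {"Databases"}, {"IS Analysis and Design"})
```

In this toy data set four of five students take Databases, and three of those four also take the second course, so the rule "Databases -> IS Analysis and Design" holds for three quarters of the students it applies to.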

1.7. Challenges and difficulties in knowledge discovery and data mining
Knowledge discovery and data mining touch many disciplines and many practical fields, so the challenges and difficulties keep growing in number and size. Some that deserve attention are:
+ Large databases: the data sets to be processed are extremely large. In practice, data sets are often on the order of terabytes (thousands of gigabytes).
+ High noise levels or missing data
+ High dimensionality
+ Changes in data and knowledge can make previously discovered patterns no longer valid
+ Complex relationships among fields

1.8. Conclusion
Data mining has become, and continues to be, one of the research directions that attract the attention of many information technology experts around the world. In recent years, many new methods and algorithms have been published in quick succession, which shows the advantages, benefits, and great practical applicability of data mining. This chapter has presented an overview of knowledge discovery and the most basic concepts of data mining.




CHAPTER 2
DATA CLUSTERING AND
DATA CLUSTERING ALGORITHMS
2.1. Concept and objectives of data clustering
2.2. Applications of data clustering
2.3. Requirements of clustering
2.4. Approaches to data clustering
2.4.1. Partitioning clustering methods
2.4.2. Hierarchical clustering methods
2.4.3. Density-based clustering methods
2.4.4. Grid-based clustering methods
2.4.5. Model-based clustering methods
2.4.6. Constraint-based clustering methods
2.5. Some basic data clustering algorithms
2.5.1. Partitioning clustering algorithms
2.5.2. Hierarchical clustering algorithms
2.5.3. Density-based clustering algorithms
2.5.4. Grid-based clustering algorithms
2.5.5. Model-based clustering algorithms
2.5.6. Constraint-based clustering algorithms

2.1. Concept and objectives of data clustering
Data clustering is the process of grouping similar objects in a data set into clusters so that objects in the same cluster are similar, while objects in different clusters are dissimilar. Data clustering is an example of unsupervised learning. Unlike data classification, it does not require training samples to be defined in advance. Data clustering can therefore be seen as learning by observation, whereas data classification is learning by example. Data clustering can also be used as a preprocessing step for other data mining algorithms, such as classification and characterization, which then operate on the detected clusters.

Figure 2.1: A loan data set partitioned into 3 clusters.
Clustering plays a very important role in human activity. From early childhood, people learn how to distinguish cats from dogs and animals from plants, and they continually refine these subconscious classification schemes. Clustering is widely used in many applications, including pattern recognition, data analysis, image processing, and market research. As a data mining function, cluster analysis can be used as a stand-alone tool to gain insight into the distribution of the data, to observe the characteristics of each cluster, and to focus on a particular set of clusters for further analysis.
A common problem in clustering is that most data to be clustered contain noise because the collection process was inaccurate or incomplete. A strategy is therefore needed for the data preprocessing step to correct or remove noise before moving on to cluster analysis. Noise here means data objects that are inaccurate or unclear, or data objects that lack information for some attributes. One common noise-handling technique is to replace the attribute values of a noisy object with corresponding attribute values. Outlier detection is also an important research direction in clustering. Its function is to identify a small group of data objects that are abnormal compared with the rest of the database, that is, objects that do not follow the behavior or model of the data, so that their influence on the clustering process and results can be avoided.
The goal of clustering is to determine the intrinsic grouping in a set of unlabeled data. But how can one decide what constitutes a good cluster? It can be shown that there is no absolute "best" criterion that is independent of the purpose of the clustering. The user must therefore supply this criterion, in such a way that the clustering result meets the user's needs.
Studies show that there is currently no general clustering method that can fully handle all kinds of cluster structures. Moreover, clustering methods need a way to represent the structure of the clusters, and each representation calls for a correspondingly suitable clustering algorithm. Data clustering therefore remains a hard and open problem, because many basic issues must be solved thoroughly and in a way that suits many different kinds of data, especially the mixed data that are increasingly common in data management systems. This is also one of the major challenges in data mining.

2.2. Applications of data clustering
Data clustering can be applied in many fields, such as:
- Commerce: finding groups of important customers with similar characteristics and describing them from the purchase records in the database.
- Biology: classifying genes with similar functions and obtaining the structures present in samples.
- Libraries: grouping books with similar content and meaning to serve readers.
- Insurance: identifying groups of policyholders with high claim costs and detecting commercial fraud.
- Urban planning: identifying groups of houses by type and geographic location in order to provide information for urban planning.
- Earth science: clustering to track earthquake epicenters in order to provide information for identifying dangerous zones.
- WWW: discovering groups of important, meaningful documents in the Web environment. These document classes support knowledge discovery from data.

2.3. Requirements of clustering
Clustering is a challenging research field in which the potential applications themselves impose special requirements. The basic requirements of clustering in data mining are:
- Scalability: many clustering algorithms work well on small data sets containing fewer than 200 objects, but a large database may contain millions of objects. Clustering a large data set can affect the quality of the results. How, then, can we develop clustering algorithms that scale well to large databases?
- Ability to handle different attribute types: many algorithms are designed to cluster interval-based (numerical) data. However, many applications require clustering other types of data, such as binary, categorical (nominal, unordered), and ordinal data, or mixtures of these types.
- Discovery of clusters with arbitrary shape: many clustering algorithms determine clusters using Euclidean or Manhattan distance measures. Algorithms based on such measures tend to find spherical clusters of similar density and size. A cluster, however, can have any shape, so it is important to develop algorithms that can discover clusters of arbitrary shape.
- Minimal knowledge required to determine input parameters: many clustering algorithms require the user to supply certain parameters for the cluster analysis (such as the desired number of clusters). The clustering results are often quite sensitive to these input parameters. Many parameters are hard to determine, especially for data sets with a large number of objects. This not only burdens the user but also makes the quality of the clustering hard to control.
- Ability to handle noisy data: most real databases contain outliers, erroneous data, unknown data, or incorrect data. Some clustering algorithms are sensitive to such data, which can lead to poor clustering quality.
- Insensitivity to the order of input data: some clustering algorithms are sensitive to the order of the input. For the same data set presented in different orders, the same algorithm may produce very different clusters. It is therefore important to develop algorithms that are insensitive to input order.
- High dimensionality: a database or data warehouse may contain many dimensions or attributes. Many clustering algorithms work well on low-dimensional data of only two or three dimensions. A clustering method is judged to be of good quality if it can be applied to data with three or more dimensions. Clustering data objects in high-dimensional space is a challenge, especially because such spaces can be very sparse and highly skewed.
- Constraint-based clustering: many practical applications need to perform clustering under various kinds of constraints. The task is to find groups of data that cluster well and satisfy the constraints.
- Interpretability and usability: users expect clustering results that are easy to understand, easy to interpret, and easy to use. That is, the clustering may need to be tied to a clear meaning and application.
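
The requirement on arbitrary cluster shapes refers to the Euclidean and Manhattan distance measures. As a small illustration, the sketch below computes both for points given as equal-length tuples of numbers; the function names are chosen here for the example and do not come from the thesis.

```python
import math


def euclidean(p, q):
    """Straight-line distance: square root of the summed squared coordinate differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def manhattan(p, q):
    """City-block distance: sum of the absolute coordinate differences."""
    return sum(abs(a - b) for a, b in zip(p, q))


origin = (0, 0)
corner = (3, 4)
d_euclid = euclidean(origin, corner)
d_manhattan = manhattan(origin, corner)
```

Both measures depend only on coordinate differences, which is one way to see why algorithms built on them tend to favor compact, roughly spherical clusters.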
With these requirements in mind, our study of cluster analysis proceeds as follows. First, we examine the different data types and how they can affect clustering methods. Second, we give a general classification of clustering methods. We then study each clustering method in detail, including partitioning, hierarchical, and density-based methods. We also examine clustering in high-dimensional space and variants of the other methods.

2.4. Approaches to data clustering
Clustering techniques have many approaches and practical applications, but they all aim at two common goals: the quality of the discovered clusters and the execution speed of the algorithm. Clustering techniques can currently be classified into the following main approaches:


2.4.1. Partitioning clustering methods
This technique partitions a data set of n elements into k groups until the specified number of clusters has been established. The number of clusters is a characteristic chosen in advance. The method works well for finding spherical clusters in Euclidean space. It also relies on the underlying distance between points to decide which data points are close to one another and which are unrelated or far from one another. However, this method cannot handle clusters with irregular shapes or clusters with densely packed points. Partitioning algorithms have very high complexity when they seek the globally optimal solution to the clustering problem, because they would have to search all possible partitions. In practice, therefore, one usually looks for a locally optimal solution by using a criterion function to evaluate cluster quality and to guide the search for a partition. The main idea of locally optimal partitioning algorithms is thus to use a greedy strategy to search for a solution.

2.4.2. Hierarchical clustering methods
This method builds a hierarchy over the data objects under consideration. That is, it arranges the given data set into a tree structure, and this hierarchical tree is built recursively. There are two common approaches:
* Agglomerative (merging groups), usually called the Bottom-Up approach
* Divisive (splitting groups), usually called the Top-Down approach

Figure 2.2: Hierarchical clustering strategies

In practice, partitioning and hierarchical clustering are often combined: the result obtained by a hierarchical method can be improved through a partitioning step. Partitioning and hierarchical clustering are the two classical data clustering methods, and many improved algorithms based on them are now widely applied in data mining.
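
As an illustration of the Bottom-Up approach described above, the following sketch starts with every point in its own cluster and repeatedly merges the two closest clusters until k remain. The text does not specify how the distance between clusters is measured; the single-link rule used here (distance between the closest pair of points), the function names, and the stopping rule are assumptions made for the example. It needs Python 3.8 or later for `math.dist`.

```python
import math


def single_link(a, b):
    """Distance between two clusters: the closest pair of points, one from each (assumed rule)."""
    return min(math.dist(p, q) for p in a for q in b)


def agglomerate(points, k):
    """Bottom-up clustering: merge the two closest clusters until only k remain."""
    clusters = [[p] for p in points]  # every point starts as its own cluster
    while len(clusters) > k:
        best = None  # (distance, i, j) of the closest pair found so far
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = single_link(clusters[i], clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]  # merge cluster j into cluster i
        del clusters[j]
    return clusters


groups = agglomerate([(0, 0), (0, 1), (10, 10), (10, 11)], 2)
```

This brute-force version recomputes all pairwise cluster distances at every merge, so it is only suitable for small examples; a Top-Down method would instead start from one cluster and split it.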

2.4.3. Density-based clustering methods
This technique groups data objects according to a given density function, where density is the number of objects in the neighborhood of a data object in some sense. In this approach, once a cluster has been identified, it keeps growing by adding new data objects as long as the number of neighboring objects exceeds a predefined threshold. Because it uses the density of objects to determine clusters, density-based clustering can discover clusters of arbitrary shape. The technique copes very well with outliers and noisy values. However, determining the density parameters of the algorithm is very difficult, even though these parameters have a large effect on the clustering result.
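
The density notion above (the number of neighboring objects compared against a threshold) can be sketched as follows. The parameter names `radius` and `min_neighbors`, the use of Euclidean distance, and the helper names are assumptions for illustration; the text leaves the neighborhood definition open. It needs Python 3.8 or later for `math.dist`.

```python
import math


def neighbors(points, center, radius):
    """Objects within `radius` of `center`, excluding points equal to `center` itself."""
    return [p for p in points if p != center and math.dist(p, center) <= radius]


def is_dense(points, center, radius, min_neighbors):
    """True when the neighborhood of `center` holds more objects than the threshold."""
    return len(neighbors(points, center, radius)) > min_neighbors


data = [(0, 0), (0, 1), (1, 0), (1, 1), (9, 9)]
inside_cluster = is_dense(data, (0, 0), 1.5, 2)
isolated_point = is_dense(data, (9, 9), 1.5, 2)
```

A cluster would then be grown by repeatedly adding the neighbors of dense objects, and objects such as the isolated point, which are neither dense nor near a dense object, would be treated as noise.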

2.4.4. Grid-based clustering methods
Grid-based clustering is suited to multidimensional data. It clusters using a grid data structure and is mainly applied to spatial data. The aim of the method is to quantize the data into cells that form a grid structure. The clustering operations then only need to work with the objects in each grid cell rather than with the individual data objects. The grid-based approach does not move objects between cells; instead it builds several hierarchical levels of groups of objects within a cell. The method resembles hierarchical clustering, except that the cells are not merged, and it addresses the requirements of multidimensional data that density-based clustering cannot handle. The advantage of grid-based clustering is its fast processing time, which is independent of the number of data objects in the original data set and depends instead on the number of cells in each dimension of the grid space.
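
The quantization step described above can be sketched as follows: each point is mapped to the index of the grid cell that contains it, and later processing works on per-cell counts instead of individual objects. The uniform `cell_size` and the function names are assumptions for this example; actual grid-based algorithms differ in how they build and summarize cells.

```python
from collections import Counter


def cell_of(point, cell_size):
    """Index of the grid cell containing `point`, for square cells of side `cell_size`."""
    return tuple(int(c // cell_size) for c in point)


def grid_counts(points, cell_size):
    """Number of data objects that fall into each occupied grid cell."""
    return Counter(cell_of(p, cell_size) for p in points)


sample = [(0.2, 0.3), (0.8, 0.9), (1.5, 0.1), (5.0, 5.0)]
counts = grid_counts(sample, 1.0)
```

Once the counts are built, the cost of later steps depends on the number of occupied cells rather than on the number of points, which matches the speed advantage claimed above.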



Figure 2.3: Hierarchical structure

2.4.5. Model-based clustering methods
This method tries to find good approximations of the model parameters so that the model fits the data as well as possible. It may use a partitioning or a hierarchical clustering strategy, depending on the structure or model assumed for the data set and on how the models are adjusted to identify the partitions. Model-based clustering tries to fit the data to a mathematical model, on the assumption that the data are generated by a mixture of underlying probability distributions. Model-based clustering algorithms follow two main approaches: statistical models and neural networks. The method resembles density-based clustering in that it grows individual clusters in order to improve the previously determined models, but it sometimes does not start with a fixed number of clusters and does not use the same notion of density for all clusters.

2.4.6. Constraint-based clustering methods
The development of spatial data clustering on large databases has provided many convenient tools for analyzing geographic information. However, most of these algorithms give users very few ways to specify the real-world constraints that must be satisfied during clustering. To make spatial data clustering more effective, further research is needed to let users incorporate constraints into the clustering algorithm.

Figure 2.4: Ways in which clusters can be formed

The clustering methods above continue to be developed and applied in many different fields, and several research branches have grown out of them, such as:
- Statistical clustering: based on concepts from systems analysis, this branch uses similarity measures to partition objects, but it applies only to data with numerical attributes.
- Conceptual clustering: this technique was developed for categorical data. It clusters objects according to the concepts they carry.
- Fuzzy clustering: uses fuzzy techniques to cluster data. Algorithms of this kind produce clustering schemes suited to everyday activities, and they handle real data that are uncertain.
- Kohonen network clustering: this kind of clustering is based on the concept of neural networks. A Kohonen network has an input layer and an output layer of neurons. Each neuron in the input layer corresponds to one attribute of a record, and each input neuron is connected to all neurons in the output layer. Each connection carries a weight that determines the position of the corresponding output neuron.

2.5. Some basic data clustering algorithms

2.5.1. Partitioning clustering algorithms
The k-means algorithm
This algorithm is based on the distance between the data objects in a cluster. In practice, it measures the distance to the mean of the data objects in the cluster, which is regarded as the center of the cluster. It therefore needs an initial set of cluster centers, and then repeats two steps: assign each object to the cluster whose center is nearest, and recompute the center of each cluster from the new assignment. The iteration stops when the centers converge.

Figure 2.5: Initial settings for determining the cluster boundaries

The goal of the k-means algorithm is to generate k data clusters $\{C_1, C_2, \ldots, C_k\}$ from a data set of n objects in d-dimensional space, $X_i = \{x_{i1}, x_{i2}, \ldots, x_{id}\}$, $i = 1, \ldots, n$, such that the criterion function

$$E = \sum_{i=1}^{k} \sum_{x \in C_i} D^2(x - m_i)$$

reaches its minimum value, where $m_i$ is the centroid of cluster $C_i$ and $D$ is the distance between two objects.

Figure 2.6: Computing the centroids of the new clusters
The k-means algorithm consists of the following basic steps:

Input: The number of clusters k and the cluster centroids $\{m_j\}_{j=1}^{k}$.
Output: The clusters C[i] ($1 \le i \le k$) for which the criterion function E attains its minimum.
Begin
(a) Step 1: Initialization
Choose k initial centroids $\{m_j\}_{j=1}^{k}$ in the space $R^d$ (d is the dimensionality of the data). The choice may be random or based on experience.
(b) Step 2: Compute distances
For each point $X_i$ ($1 \le i \le n$), compute its distance to every centroid $m_j$ ($1 \le j \le k$). Then find the nearest centroid for each point.
(c) Step 3: Update the centroids
For each $1 \le j \le k$, update the centroid $m_j$ as the arithmetic mean of the data vectors assigned to cluster j.
(d) Stopping condition:
Repeat Steps 2 and 3 until the cluster centroids no longer change.
End.
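The steps above can be sketched in a few lines of Python. This is only an illustrative sketch, not the thesis's implementation: the function name `kmeans`, the `max_iter` safety cap, the use of Euclidean distance for D, and the choice to leave the centroid of an empty cluster unchanged are all assumptions made for the example.

```python
import math

def kmeans(points, centroids, max_iter=100):
    """Plain k-means; points and centroids are lists of equal-length tuples."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):  # max_iter is an assumed safety cap
        # Step 2: assign every point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            j = min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            clusters[j].append(p)
        # Step 3: move each centroid to the arithmetic mean of its cluster.
        new_centroids = []
        for c, members in zip(centroids, clusters):
            if members:
                new_centroids.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centroids.append(c)  # assumption: an empty cluster keeps its centroid
        # Stopping condition: the centroids no longer change.
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters

points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, centroids=[(0.0, 0.0), (10.0, 10.0)])
```

With these hypothetical points, the two initial centroids each attract one group of three points, and the loop stops as soon as recomputing the means leaves the centroids unchanged.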

The k-means algorithm above has been proved to converge, and its computational complexity is $O((3nkd)\,\tau\,T^{flop})$, where n is the number of data objects, k is the number of clusters, d is the number of dimensions, $\tau$ is the number of iterations, and $T^{flop}$ is the time needed for one basic operation such as a multiplication or division. Because its analysis is this simple, k-means can be applied to large data sets. Its drawbacks are that it applies only to data with numeric attributes, that it discovers only spherical clusters, and that it is very sensitive to noise and outliers in the data. Moreover, the clustering quality of k-means depends heavily on the input parameters: the number of clusters k and the k initial centroids. When the initial centroids lie far from the natural cluster centroids, the result is poor, that is, the discovered clusters deviate strongly from the actual clusters. In practice there is no optimal way to choose the input parameters; the most common approach is to try several values of k and keep the best solution.
Besides k-means, partitioning clustering also includes other algorithms such as PAM, CLARA and CLARANS.

2.5.2. Hierarchical clustering algorithms
The CURE algorithm
Most algorithms produce clusters that are spherical and of similar size, and they perform poorly in the presence of outliers. The CURE algorithm overcomes this problem and copes better with outliers. It defines a fixed number of representative points, scattered over the data space, that are chosen to describe each cluster being formed. These points are obtained by selecting well-scattered objects of the cluster and then shrinking them, that is, moving them toward the cluster center by a shrinking factor. The process is repeated, so the growth of a cluster can be measured along the way. At each step of the algorithm, the two clusters with the closest pair of representative points (one point of the pair in each cluster) are merged.
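The idea of scattered representatives that are shrunk toward the center can be sketched as follows. The text does not say how the scattered points are chosen, so the farthest-point heuristic used here is an assumption, as are the names `scattered_representatives`, `cluster_distance`, `num_reps` and `shrink`; the sketch also assumes the points of a cluster are distinct.

```python
import math

def scattered_representatives(cluster, num_reps, shrink):
    """Pick well-scattered points of a cluster, then shrink them toward its centroid."""
    dim = len(cluster[0])
    centroid = tuple(sum(p[d] for p in cluster) / len(cluster) for d in range(dim))
    reps = []
    for _ in range(min(num_reps, len(cluster))):
        if not reps:
            # Assumed heuristic: start with the point farthest from the centroid.
            best = max(cluster, key=lambda p: math.dist(p, centroid))
        else:
            # Then take the point farthest from the representatives chosen so far.
            best = max((p for p in cluster if p not in reps),
                       key=lambda p: min(math.dist(p, r) for r in reps))
        reps.append(best)
    # Shrinking: move each representative a fraction `shrink` of the way to the centroid.
    return [tuple(r[d] + shrink * (centroid[d] - r[d]) for d in range(dim)) for r in reps]

def cluster_distance(reps_a, reps_b):
    """Distance between two clusters: distance of their closest pair of representatives."""
    return min(math.dist(a, b) for a in reps_a for b in reps_b)

square = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0), (4.0, 4.0), (2.0, 2.0)]
reps = scattered_representatives(square, num_reps=4, shrink=0.5)
```

At each merging step, the pair of clusters with the smallest `cluster_distance` would be the pair to merge.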



Having more than one representative point per cluster allows CURE to discover clusters that are not spherical. Shrinking the representatives reduces the influence of outliers. The algorithm therefore handles outliers well and is effective for non-spherical clusters of varying size. Moreover, it scales well to large databases without degrading clustering quality.

Figure 2.7: Overview of the CURE algorithm

Figure 2.8: Clusters discovered by CURE

To handle large databases, CURE combines random sampling with partitioning: a random sample is drawn first and then partitioned; each partition is clustered separately, so each partition is partially clustered; the resulting clusters are then clustered a second time to obtain the desired clusters. A random sample, however, does not necessarily give a good description of the whole data set.

The CURE algorithm proceeds through the following basic steps:

1. Draw a random sample from the original data set.
2. Partition the sample into groups of equal size: the main idea is to split the sample into p groups of equal size, so each partition has size n/p (n is the size of the sample).
3. Cluster the points of each group: cluster each group until it has been reduced to n/(pq) clusters (with q > 1).
4. Eliminate outliers: first, clusters are grown until the number of clusters has dropped to a fraction of the initial number; then, in case outliers were sampled together with the data during the initial sampling phase, the algorithm automatically removes the small groups.
5. Cluster the partial clusters: the representative objects of each cluster are moved toward the cluster center, that is, they are replaced by objects closer to the center.
6. Label the data with the corresponding cluster labels.

The computational complexity of CURE is $O(n^2 \log n)$. CURE is a reliable algorithm for discovering clusters of arbitrary shape; it works well on data containing outliers and on two-dimensional data sets. However, it is very sensitive to parameters such as the number of representative objects and the shrinking factor of the representatives.
Besides CURE, hierarchical clustering also includes other algorithms such as BIRCH, AGNES, DIANA, ROCK and CHAMELEON.


2.5.3. Density-based clustering algorithms
The DBSCAN algorithm
DBSCAN clusters by looking for dense regions and discovers clusters of arbitrary shape in databases that contain noise.
In practice DBSCAN finds clusters by examining objects whose number of neighbors is not smaller than a minimum threshold, that is, objects with at least MinPts neighbors; in addition, for every object in a cluster there must be another object of the same cluster within distance Eps. The algorithm finds all objects whose neighborhoods satisfy this condition, and a cluster is defined as a set of density-connected objects together with their neighbors. DBSCAN repeatedly collects the objects that are density-reachable from the core objects, which may involve merging several density-reachable clusters. The process ends when no new point can be added to any cluster.
DBSCAN can find clusters of arbitrary shape and, at the same time, is little affected by the order in which the data objects are read. When an object is inserted, only a specific neighborhood is affected. DBSCAN uses the parameters Eps and MinPts to control the density of the clusters. It starts from an arbitrary point and builds the density neighborhood that Eps and MinPts allow. DBSCAN therefore requires the user to specify the neighborhood radius Eps and the minimum number of neighbors MinPts. Optimal values for these parameters are hard to determine; they are usually chosen at random or by experience.
The complexity of DBSCAN is $O(n^2)$, but when a spatial index is used to find the neighbors of a data object, the complexity improves to $O(n \log n)$. DBSCAN can be applied to large multidimensional spatial data sets; Euclidean distance is used to measure the similarity between objects, but it is not effective for high-dimensional data.

Figure 2.9: Shapes of the clusters discovered by DBSCAN

The algorithm: DBSCAN starts with an arbitrary point p and retrieves all points that are density-reachable from p with respect to Eps and MinPts. If p is a core point, this procedure yields a cluster with respect to Eps and MinPts. If p is a border point, no point is density-reachable from p, and DBSCAN visits the next point of the data set.
Because global values of Eps and MinPts are used, DBSCAN may merge two clusters into one when the two clusters lie close to each other. Let the distance between two data sets S1 and S2 be defined as:
$dist(S_1, S_2) = \min\{dist(p, q) \mid p \in S_1 \text{ and } q \in S_2\}$.

The DBSCAN algorithm is described in detail as follows:

......... Main program module ..........
DBSCAN(SetOfPoints, Eps, MinPts)
// SetOfPoints is UNCLASSIFIED
ClusterId := nextId(NOISE);
FOR i FROM 1 TO SetOfPoints.size DO
    Point := SetOfPoints.get(i);
    IF Point.ClId = UNCLASSIFIED THEN
        IF ExpandCluster(SetOfPoints, Point, ClusterId, Eps, MinPts) THEN
            ClusterId := nextId(ClusterId)
        END IF
    END IF
END FOR
END; // DBSCAN

-------- Procedure ExpandCluster --------
ExpandCluster(SetOfPoints, Point, ClId, Eps, MinPts): Boolean;
seeds := SetOfPoints.regionQuery(Point, Eps);
IF seeds.size < MinPts THEN // no core point
    SetOfPoints.changeClId(Point, NOISE);
    RETURN False;
ELSE // all points in seeds are density-reachable from Point
    SetOfPoints.changeClId(seeds, ClId);
    seeds.delete(Point);
    WHILE seeds <> Empty DO
        currentP := seeds.first();
        result := SetOfPoints.regionQuery(currentP, Eps);
        IF result.size >= MinPts THEN
            FOR i FROM 1 TO result.size DO
                resultP := result.get(i);
                IF resultP.ClId IN {UNCLASSIFIED, NOISE} THEN
                    IF resultP.ClId = UNCLASSIFIED THEN
                        seeds.append(resultP);
                    END IF;
                    SetOfPoints.changeClId(resultP, ClId);
                END IF; // UNCLASSIFIED or NOISE
            END FOR;
        END IF; // result.size >= MinPts
        seeds.delete(currentP);
    END WHILE; // seeds <> Empty
    RETURN True;
END IF;
END; // ExpandCluster
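The pseudocode above can be turned into a short runnable sketch. The version below is an illustration only: it folds ExpandCluster into the main loop, uses a linear scan for the region query (so it runs in $O(n^2)$ time, without a spatial index), and the names `dbscan`, `region_query` and the label values `None` and `-1` are assumptions of this example.

```python
import math

UNCLASSIFIED, NOISE = None, -1  # assumed label encoding

def region_query(points, i, eps):
    """Indices of all points within distance eps of points[i], including i itself."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

def dbscan(points, eps, min_pts):
    labels = [UNCLASSIFIED] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not UNCLASSIFIED:
            continue
        seeds = region_query(points, i, eps)
        if len(seeds) < min_pts:          # not a core point
            labels[i] = NOISE
            continue
        for j in seeds:                   # all seeds are density-reachable from i
            labels[j] = cluster_id
        seeds.remove(i)
        while seeds:
            current = seeds.pop(0)
            result = region_query(points, current, eps)
            if len(result) >= min_pts:    # current is a core point: expand through it
                for j in result:
                    if labels[j] is UNCLASSIFIED:
                        seeds.append(j)
                    if labels[j] is UNCLASSIFIED or labels[j] == NOISE:
                        labels[j] = cluster_id
        cluster_id += 1
    return labels

points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11), (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
```

In this hypothetical data set the two squares become clusters 0 and 1, and the isolated point keeps the NOISE label because its neighborhood contains only itself.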

Here SetOfPoints is either the original data set or a cluster discovered in a previous step, and ClId (ClusterId) is the cluster label. The label of a point marked as noise may change later if the point is density-reachable from some other point in the database; this happens only for border points. The function SetOfPoints.get(i) returns the i-th element of SetOfPoints. The procedure SetOfPoints.regionQuery(Point, Eps) returns the list of data points of SetOfPoints that lie within distance Eps of Point. Apart from a few exceptional cases, the result of DBSCAN is independent of the order in which the data objects are visited. Eps and MinPts are two global parameters set manually or by experience. When Eps is small compared with the size of the data space, the average complexity of one region query (using a spatial index) is $O(\log n)$.
Besides DBSCAN, density-based clustering includes two other algorithms: OPTICS and DENCLUE.

2.5.4. Grid-based clustering algorithms
The STING algorithm
STING is a grid-based multi-resolution clustering technique in which the data space is decomposed into a finite number of rectangular cells; in other words, grid cells are built from sub-cells in order to perform clustering. There are several levels of rectangular cells, corresponding to different levels of resolution in the grid structure, and the cells form a hierarchy: each cell at a higher level is partitioned into a number of smaller cells at the next lower level. The data points are loaded from the database; the statistical parameters of the attributes of the data objects in each grid cell are computed from the data at the lowest level and, for higher-level cells, from the statistical parameters stored in the lower-level cells. The statistical parameters include: mean, maximum (max), minimum (min), count, standard deviation (s), ...
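How a higher-level cell can derive its parameters from its children, without rereading the data, can be sketched as follows for a single numeric attribute. The function names `cell_stats` and `merge_stats`, the dictionary layout, and the use of the population standard deviation are assumptions of this sketch.

```python
import math

def cell_stats(values):
    """Statistics stored for one bottom-level cell (one numeric attribute)."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return {"count": n, "mean": mean, "std": std, "min": min(values), "max": max(values)}

def merge_stats(children):
    """Statistics of a parent cell computed only from its child cells' statistics."""
    n = sum(c["count"] for c in children)
    mean = sum(c["count"] * c["mean"] for c in children) / n
    # E[x^2] of a child is std^2 + mean^2; combine them, then subtract the parent mean^2.
    second_moment = sum(c["count"] * (c["std"] ** 2 + c["mean"] ** 2) for c in children) / n
    std = math.sqrt(max(second_moment - mean ** 2, 0.0))
    return {"count": n, "mean": mean, "std": std,
            "min": min(c["min"] for c in children),
            "max": max(c["max"] for c in children)}

left, right = [1.0, 2.0, 3.0], [10.0, 12.0]   # hypothetical contents of two child cells
parent = merge_stats([cell_stats(left), cell_stats(right)])
direct = cell_stats(left + right)             # same statistics computed from the raw data
```

Up to floating-point rounding, `parent` and `direct` agree, which is why a single pass over the database suffices to fill the whole hierarchy.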
The data objects are inserted into the grid one after another, and the statistical parameters above are computed directly from these objects. Spatial queries are answered by examining the appropriate cells at each level of the hierarchy. A spatial query is defined as the retrieval of information about spatial data and their relationships.
STING is highly scalable, but because it uses a multi-resolution approach it depends strongly on the granularity of the lowest level. Multi-resolution is the ability to decompose the data set into different levels of detail. When the cells of the grid structure are merged to form clusters, STING does not consider the spatial relationship between child nodes that are not merged together (they correspond only to their parent cells). As a result, the shapes of the discovered clusters are restricted: all cluster boundaries are horizontal or vertical, following the cell borders, and no diagonal boundary is detected.
One limitation of using the multi-resolution approach for cluster analysis is that the quality of STING clustering depends entirely on the granularity of the lowest level of the grid structure. If the granularity is very fine, the processing time increases and the computation becomes complex; if the bottom level is too coarse, the quality and accuracy of the cluster analysis may suffer.
The grid data structure is convenient for parallel processing and incremental updating. The whole database is scanned once to compute the statistical quantities of every cell, so the method is very efficient: the time complexity of generating the clusters is approximately $O(n)$, where n is the total number of objects. After the hierarchy has been built, the time to process a query is $O(g)$, where g is the total number of grid cells at the lowest level ($g \ll n$).

The STING algorithm consists of the following steps:

1. Determine the layer to begin with.
2. For each cell of this layer, compute the confidence interval (or estimated range) of the probability that the cell is relevant to the query.
3. From the interval computed above, label the cell as relevant or not relevant.
4. If this layer is the bottom layer, go to Step 6; otherwise go to Step 5.
5. Go down the hierarchy by one level. Go to Step 2 for those cells that form the relevant cells of the higher-level layer.
6. If the specification of the query is met, go to Step 8; otherwise go to Step 7.
7. Retrieve the data that fall into the relevant cells and process them further. Return the result that meets the requirement of the query. Go to Step 9.
8. Find the regions of relevant cells. Return the regions that meet the requirement of the query. Go to Step 9.
9. Stop.

Besides STING, grid-based clustering includes one more algorithm: CLIQUE.
2.5.5. Model-based clustering algorithms
The EM algorithm
The EM algorithm is regarded as a model-based algorithm or as an extension of k-means. EM assigns objects to the given clusters according to the probability distribution of each object over the mixture components. The distribution most often used is the Gaussian, and the aim is to find, iteratively, good values for its parameters using the log-likelihood of the data objects as the criterion function; this function is a good way to model the probabilities of the data objects. EM can discover clusters of many different shapes, but because it needs many iterations to determine good parameters, its computational cost is rather high. Several improvements to EM have been proposed that classify data objects as compressible, retainable in memory, or discardable. In these improvements, an object is discarded when its cluster label is known with certainty, compressed when it is not discarded and belongs to a cluster that is too large for memory, and retained in all remaining cases.
The algorithm is divided into two steps, and the process is repeated until the problem is solved:
E: $a = \dfrac{1/2}{1/2 + \mu}\,h, \qquad b = \dfrac{\mu}{1/2 + \mu}\,h$

M: $\mu = \dfrac{b + c}{6\,(b + c + d)}$
Steps of the EM algorithm:
1. Initialize the parameters:
$$\lambda_0 = \{\mu_1^{(0)}, \mu_2^{(0)}, \ldots, \mu_K^{(0)}, p_1^{(0)}, p_2^{(0)}, \ldots, p_K^{(0)}\}$$
2. E step:
$$P(\omega_j \mid x_k, \lambda_t) = \frac{P(x_k \mid \omega_j, \lambda_t)\,P(\omega_j \mid \lambda_t)}{P(x_k \mid \lambda_t)} = \frac{P(x_k \mid \omega_j, \mu_j^{(t)}, \sigma^2)\,p_j^{(t)}}{\sum_{i=1}^{K} P(x_k \mid \omega_i, \mu_i^{(t)}, \sigma^2)\,p_i^{(t)}}$$
3. M step:
$$\mu_i^{(t+1)} = \frac{\sum_k P(\omega_i \mid x_k, \lambda_t)\,x_k}{\sum_k P(\omega_i \mid x_k, \lambda_t)}, \qquad p_i^{(t+1)} = \frac{\sum_k P(\omega_i \mid x_k, \lambda_t)}{R}$$
4. Repeat steps 2 and 3 until the result is obtained.
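The E and M steps can be illustrated with a minimal sketch for one-dimensional data. Several things here are assumptions rather than statements of the text: R is taken to be the number of data points, the variance $\sigma^2$ is fixed and shared by all components, the loop runs for a fixed number of iterations instead of testing convergence, and the names `gaussian` and `em_gmm` are hypothetical.

```python
import math

def gaussian(x, mu, sigma2):
    """Density of a 1-D Gaussian with mean mu and variance sigma2."""
    return math.exp(-(x - mu) ** 2 / (2 * sigma2)) / math.sqrt(2 * math.pi * sigma2)

def em_gmm(xs, mus, priors, sigma2, iterations=50):
    """EM for a 1-D Gaussian mixture with a fixed, shared variance sigma2."""
    R = len(xs)  # assumption: R is the number of data points
    for _ in range(iterations):
        # E step: resp[k][i] = P(w_i | x_k, lambda_t)
        resp = []
        for x in xs:
            joint = [gaussian(x, mu, sigma2) * p for mu, p in zip(mus, priors)]
            total = sum(joint)
            resp.append([j / total for j in joint])
        # M step: re-estimate each mean and prior from the responsibilities.
        weights = [sum(r[i] for r in resp) for i in range(len(mus))]
        mus = [sum(r[i] * x for r, x in zip(resp, xs)) / weights[i]
               for i in range(len(mus))]
        priors = [w / R for w in weights]
    return mus, priors

xs = [0.9, 1.0, 1.1, 4.9, 5.0, 5.1]   # hypothetical data with two groups
mus, priors = em_gmm(xs, mus=[0.0, 6.0], priors=[0.5, 0.5], sigma2=1.0)
```

Unlike k-means, every point contributes to every mean, weighted by its posterior probability; for well-separated groups such as these the estimated means end up close to the group averages.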

Besides EM, model-based clustering includes one more algorithm: COBWEB.

2.5.6. Constraint-based clustering algorithms
Fuzzy clustering algorithms: FCM, εFCM and the improved FCM (these algorithms are discussed in detail in the next chapter).

In summary, the data clustering techniques presented above are widely used in practice, but most of them are designed for data sets whose attributes are all of the same type. Clustering data sets with mixed attribute types is therefore an open problem in data mining today.









CHAPTER 3
FUZZY DATA CLUSTERING TECHNIQUES
3.1. Overview of fuzzy clustering
3.2. Fuzzy clustering algorithms
3.2.1. The FCM (Fuzzy C-means) algorithm
3.2.1.1. Objective function
3.2.1.2. The FCM algorithm
3.2.2. The εFCM (ε-Insensitive Fuzzy C-means) algorithm
3.2.2.1. Objective function
3.2.2.2. The εFCM algorithm
3.2.3. The improved FCM algorithm
3.2.3.1. Algorithm 1: Selecting data points as candidates for the cluster centers
3.2.3.2. Algorithm 2: Pruning the candidates
3.2.3.3. Algorithm 3: Choosing the candidates that minimize the objective function
3.2.3.4. Algorithm 4: Assigning closely linked centers to one cluster
3.2.3.5. Summary of the improved FCM algorithm

3.1. Overview of fuzzy clustering
In everyday life we meet many applications of the clustering problem. In the postal service, for example, letters must be sorted every day by country code, and within a country by province or city code; when the letters reach the provincial post office they are sorted again by district, and at the district post office by commune or ward for delivery. This is an application of the crisp clustering problem. So what is crisp clustering?
Crisp clustering can be defined as follows: given a sample data set X, examine each data point, find the group whose characteristics it resembles most, and assign the point to that group. In practice, however, crisp clustering is not always applicable. Consider the following classification: people who ride expensive motorbikes belong to the rich group, and people who ride ordinary motorbikes belong to the ordinary group. To which group should a poor person riding an expensive motorbike be assigned? This is why the notion of fuzzy clustering is needed.
Each of the clustering methods introduced in the previous chapter partitions the original data set into natural clusters in which every data object belongs to exactly one cluster. Such methods are suitable only for discovering dense, well-separated clusters with well-defined boundaries. In practice, however, the boundaries between clusters may be blurred and clusters may overlap, that is, some data objects belong to several clusters, so this model cannot describe real data. Fuzzy set theory has therefore been applied to data clustering to handle this case. This combination is called fuzzy clustering.
Fuzzy clustering is a clustering method that allows each data point to belong to two or more clusters through degrees of membership. Ruspini (1969) introduced the concept of a fuzzy partition to describe the cluster structure of a data set and proposed an algorithm for computing an optimal fuzzy partition. Dunn (1973) extended the clustering method and developed a fuzzy clustering algorithm, whose idea is to build a fuzzy clustering method based on minimizing an objective function. Bezdek (1981) improved and generalized the fuzzy objective function by introducing a weighting exponent, built a fuzzy clustering algorithm on it, and proved that the algorithm converges to a local minimum.

3.2. Fuzzy clustering algorithms
K-means is a crisp clustering algorithm and C-means is the corresponding fuzzy clustering algorithm; both use the same clustering strategy. The fuzzy C-means algorithm, abbreviated FCM (Fuzzy C-means), has been applied successfully to a large number of clustering problems, for example in pattern recognition (fingerprint and image recognition), image processing (separating color image clusters and color clusters), medicine (classifying diseases and symptoms), and so on. However, the main weaknesses of FCM are its performance on large and high-dimensional data sets and its sensitivity to noise and outliers in the data, which means the computed cluster centers may lie far from the true centers. Many methods have been proposed to overcome these weaknesses, including possibilistic clustering (Keller, 1993), fuzzy noise clustering (Dave, 1991), clustering based on the $L_p$ norm (Kersten, 1999), the ε-Insensitive Fuzzy C-means algorithm (εFCM), and the improved FCM algorithm.

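3.2.1. The FCM (Fuzzy C-means) algorithm
3.2.1.1. Objective function
This technique partitions a set of n data vectors $X = \{x_1, \ldots, x_n\} \subset R^s$ into c fuzzy groups by minimizing an objective function that measures the quality of the partition, and finds a cluster center for each group such that the cost given by the dissimilarity measure is smallest. A fuzzy partition of the data vectors $X = \{x_1, \ldots, x_n\} \subset R^s$ is represented by a matrix $U = [u_{ik}]$ such that a given data point may belong to several groups, each with a degree of membership in [0, 1]. The matrix U thus describes the cluster structure of X, with $u_{ik}$ interpreted as the degree of membership of $x_k$ in cluster i.
Let $u = (u_1, u_2, \ldots, u_c)$ be a fuzzy c-partition, written as the matrix

$$U_{c \times n} = \begin{bmatrix} u_{11} & u_{12} & \cdots & u_{1n} \\ u_{21} & u_{22} & \cdots & u_{2n} \\ \vdots & \vdots & & \vdots \\ u_{c1} & u_{c2} & \cdots & u_{cn} \end{bmatrix}$$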


Dunn defined the fuzzy objective function as follows:
$$J_m(U, V) = \sum_{k=1}^{n} \sum_{i=1}^{c} u_{ik}\, d^2(x_k, v_i)$$
Bezdek generalized the fuzzy objective function by introducing a weighting exponent m, which may be any real number greater than 1:
$$J_m(U, V) = \sum_{k=1}^{n} \sum_{i=1}^{c} (u_{ik})^m\, d^2(x_k, v_i), \quad 1 \le m < \infty \tag{1}$$
where:
$X = [x_1, \ldots, x_n] \subset R^s$ is the set of n data sample vectors (n observations) in the s-dimensional vector space $R^s$,
$m \in [1, +\infty)$ is the weighting exponent, called the fuzziness parameter,
$v_i \in R^s$ is the center of the i-th cluster,
$d(x_k, v_i) = d_{ik}$ is any norm measuring the distance between the data point $x_k$ and the center of the i-th cluster; in the usual case $d^2(x_k, v_i)$ is the squared Euclidean distance,
$u_{ik} \in [0, 1]$ is the degree to which the data element $x_k$ belongs to the i-th cluster,
$V = [v_{ji}] = [v_1, \ldots, v_c] \in R^{s \times c}$ is the matrix of cluster centers,
$U = [u_{ik}]$ is the fuzzy partition matrix of X into c parts.
One of the main factors that determines whether points are clustered sensibly is the choice of the dissimilarity measure. The computation of the membership degree $u_{ik}$ depends on the definition of the distance $d_{ik}$, which is induced by an inner product on $R^s$. The squared distance between the sample vector $x_k$ and the center $v_i$ of the i-th cluster is defined as:
$$d^2(x_k, v_i) = \|x_k - v_i\|_A^2 = (x_k - v_i)^T A\, (x_k - v_i)$$
where:
A is any symmetric positive definite matrix of size $p \times p$ (with p = s, the dimension of the data),
$\|x_k - v_i\|^2$ expresses the deviation of the data point $x_k$ from $v_i$, and $d(x_k, v_i)$ is induced by an inner product on $R^s$.
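The membership degrees satisfy the following constraints:
$$\begin{cases} 0 \le u_{ik} \le 1, & 1 \le i \le c,\ 1 \le k \le n \\ 0 < \sum_{k=1}^{n} u_{ik} < n, & 1 \le i \le c \\ \sum_{i=1}^{c} u_{ik} = 1, & 1 \le k \le n \end{cases} \tag{2}$$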
For convenience, regard the data objects $\{x_1, \ldots, x_n\}$ as the columns of the data matrix $X = [x_{jk}] = [x_1, \ldots, x_n] \in R^{s \times n}$. The partition matrix U is a convenient tool for describing the cluster structure of the data $\{x_1, \ldots, x_n\}$. The set of all real, non-degenerate fuzzy partition matrices of size $c \times n$, which partition n objects into c clusters in the space $R^{c \times n}$, is written compactly as:
$$M_{fcn} = \left\{ U \in R^{c \times n} \;\middle|\; \forall i, k:\ u_{ik} \in [0, 1];\ \sum_{i=1}^{c} u_{ik} = 1;\ 0 < \sum_{k=1}^{n} u_{ik} < n \right\} \tag{3}$$
$R^{c \times n}$ is the space of all real matrices of size $c \times n$.
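The fuzzy clustering problem is usually stated as the problem of finding the membership degrees $u_{ik}$ that minimize the objective function $J_m(U, V)$ above, under the following conditions:
Theorem 1: If m and c are fixed parameters and $I_k$ is the set defined by
$$\forall\, 1 \le k \le n:\quad I_k = \{\, i \mid 1 \le i \le c,\ d_{ik} = 0 \,\} \tag{4}$$
then the objective function $J_m(U, V)$ attains its minimum
$$\min \left\{ J_m(U, V) = \sum_{k=1}^{n} \sum_{i=1}^{c} (u_{ik})^m\, d^2(x_k, v_i) \right\}$$
if and only if:
$$u_{ik} = \begin{cases} \dfrac{1}{\sum_{j=1}^{c} \left( \dfrac{d_{ik}}{d_{jk}} \right)^{2/(m-1)}}, & I_k = \emptyset \\[2ex] 0, & i \notin I_k,\ I_k \ne \emptyset \\[1ex] \text{any values with } \sum_{i \in I_k} u_{ik} = 1, & i \in I_k,\ I_k \ne \emptyset \end{cases} \qquad 1 \le i \le c,\ 1 \le k \le n \tag{5}$$
$$v_i = \frac{\sum_{k=1}^{n} (u_{ik})^m\, x_k}{\sum_{k=1}^{n} (u_{ik})^m}, \quad 1 \le i \le c \tag{6}$$
The theorem was proved by Bezdek (for $m > 1$, $d_{ik}^2 > 0$, $1 \le i \le c$).
An optimal partition, that is, one for which the objective function attains its minimum, rests mainly on the similarity between $x_k$ and the cluster center $v_i$; this is equivalent to requiring that conditions (5) and (6) both hold. The objective function and the constraints under which it attains its minimum are as given above.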

3.2.1.2. The FCM algorithm
The FCM algorithm is an iterative process that alternates between equations (5) and (6) to optimize (approximately minimize) the objective function, based on a weighted similarity measure between $x_k$ and the cluster center $v_i$. After each iteration the algorithm computes and updates the elements $u_{ik}$ of the partition matrix U. The iteration stops when $\max_{ij} \{ |u_{ij}^{(k+1)} - u_{ij}^{(k)}| \} \le \varepsilon$, where $\varepsilon$ is a termination criterion between 0 and 1 and k is the iteration step. This procedure converges to a local minimum or a saddle point of $J_m(U, V)$. The FCM algorithm computes the partition matrix U and the cluster centers, from which the fuzzy models are obtained. The steps of the FCM algorithm are as follows:


THE FCM ALGORITHM
Input: The number of clusters c and the weighting exponent m of the objective function J;
Output: c clusters for which the objective function in (1) attains a minimum;
Begin
1. Enter the number of clusters c ($1 < c < n$) and the exponent m ($1 < m < +\infty$).
   Initialize the matrix $V = [v_{ij}]$, $V^{(0)} \in R^{s \times c}$, j = 0;
2. Repeat
   2.1. j := j + 1;
   2.2. Compute the fuzzy partition matrix $U^{(j)}$ by formula (5);
   2.3. Update the cluster centers $V^{(j)} = [v_1^{(j)}, v_2^{(j)}, \ldots, v_c^{(j)}]$ using formula (6) and $U^{(j)}$;
3. Until $\|U^{(j)} - U^{(j-1)}\|_F \le \varepsilon$;
4. Present the resulting clusters;
End.
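A minimal Python sketch of this loop is shown below. It is an illustration under stated assumptions, not the thesis's code: distances are Euclidean (A is the identity), a point that coincides with one or more centers splits its membership equally among them (any split summing to 1 would satisfy (5)), the stopping test is skipped in the first pass because no previous U exists, and the names `fcm`, `eps` and `max_iter` are hypothetical.

```python
import math

def fcm(points, centers, m=2.0, eps=1e-5, max_iter=100):
    """Fuzzy c-means with Euclidean distance; returns (U, centers), U[i][k] = u_ik."""
    c, n, dim = len(centers), len(points), len(points[0])
    centers = [tuple(v) for v in centers]
    prev_U = None
    for _ in range(max_iter):
        # Formula (5): memberships from the current centers.
        U = [[0.0] * n for _ in range(c)]
        for k, x in enumerate(points):
            d = [math.dist(x, v) for v in centers]
            zero = [i for i in range(c) if d[i] == 0.0]
            if zero:  # I_k is not empty: x coincides with at least one center
                for i in zero:
                    U[i][k] = 1.0 / len(zero)  # assumed equal split
            else:
                for i in range(c):
                    U[i][k] = 1.0 / sum((d[i] / d[j]) ** (2.0 / (m - 1.0))
                                        for j in range(c))
        # Formula (6): centers as membership-weighted means.
        centers = []
        for i in range(c):
            w = [U[i][k] ** m for k in range(n)]
            centers.append(tuple(sum(w[k] * points[k][t] for k in range(n)) / sum(w)
                                 for t in range(dim)))
        # Stop when the Frobenius norm of the change in U is at most eps.
        if prev_U is not None:
            change = math.sqrt(sum((U[i][k] - prev_U[i][k]) ** 2
                                   for i in range(c) for k in range(n)))
            if change <= eps:
                break
        prev_U = U
    return U, centers

points = [(1.0,), (1.2,), (0.8,), (5.0,), (5.2,), (4.8,)]   # hypothetical 1-D data
U, centers = fcm(points, centers=[(0.0,), (6.0,)])
```

Each column of `U` sums to 1, so a point lying between the two groups would receive two intermediate memberships instead of being forced into one cluster.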

Here $\|\cdot\|_F$ is the Frobenius norm, defined by
$$\|U\|_F^2 = \mathrm{Tr}(U U^T) = \sum_i \sum_k u_{ik}^2$$
and the parameter $\varepsilon$ is given in advance.
The choice of the clustering parameters strongly affects the result; they are usually chosen at random or heuristically.
As $m \to 1^+$, the C-means algorithm becomes a crisp algorithm.
As $m \to \infty$, the FCM algorithm becomes maximally fuzzy, with $u_{ik} = \dfrac{1}{c}$. There is no rule for choosing m that guarantees effective clustering, but m = 2 is the usual choice.
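The effect of the exponent m can be checked numerically with formula (5) for a single point. The distances below are made up for illustration, and the helper name `memberships` is an assumption of this sketch.

```python
def memberships(distances, m):
    """Formula (5) for one point, given its nonzero distances to the c centers."""
    exponent = 2.0 / (m - 1.0)
    return [1.0 / sum((di / dj) ** exponent for dj in distances) for di in distances]

distances = [1.0, 2.0]                             # hypothetical distances to two centers
nearly_crisp = memberships(distances, m=1.05)      # m close to 1
usual = memberships(distances, m=2.0)              # the common choice m = 2
nearly_uniform = memberships(distances, m=50.0)    # large m
```

With m close to 1 the nearer center takes almost all of the membership, with m = 2 the memberships are 0.8 and 0.2, and with a large m both approach 1/c = 0.5.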
To make this concrete, consider the following example: a set of one-dimensional data objects as shown in Figure 3.1:

Figure 3.1: A one-dimensional data set

By inspection, the data set contains two clusters, named "A" and "B". With the k-means algorithm, the membership function between a data object and the centroid of its cluster looks like the plot in Figure 3.2:

Figure 3.2: Membership function with respect to the centroid of cluster A in k-means

The figure shows that the objects of cluster A have membership 1 with respect to the centroid of cluster A and 0 with respect to the centroid of cluster B. The opposite holds for the objects of cluster B.
For the FCM algorithm, by contrast, the membership function of the data objects with respect to the cluster centers is illustrated by the plot in Figure 3.3:

Figure 3.3: Membership function with respect to the centroid of cluster A in FCM

The figure shows that the membership values of the data objects with respect to the centroid of cluster A lie in the interval [0, 1], and the membership function is now a smooth curve. The point marked by the arrow is more likely to belong to class B than to class A, because its membership in class A is small (= 0.2). The membership values of both cases can be written as matrices:

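$$U_{n \times c} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \\ 1 & 0 \\ \vdots & \vdots \\ 0 & 1 \end{bmatrix} \qquad U_{n \times c} = \begin{bmatrix} 0.8 & 0.2 \\ 0.3 & 0.7 \\ 0.6 & 0.4 \\ \vdots & \vdots \\ 0.9 & 0.1 \end{bmatrix}$$
The left matrix is the k-means case and the right matrix is the FCM case. The numbers of rows and columns depend on the number of data objects n and the number of clusters c.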
Some simulated examples of the clusters discovered by the FCM fuzzy clustering algorithm are shown in Figure 3.4:

Figure 3.4: Clusters discovered by the FCM algorithm

The complexity of FCM is comparable to that of k-means when the number of objects in the data set to be clustered is very large.
In summary, the FCM fuzzy clustering algorithm is an extension of k-means that aims to discover overlapping clusters. However, FCM still carries the weaknesses of k-means in handling outliers and noise in the data. The εFCM algorithm presented below is an extension of FCM that aims to overcome these weaknesses.

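3.2.2. The εFCM (ε-Insensitive Fuzzy C-means) algorithm
3.2.2.1. Objective function
The FCM algorithm uses a quadratic function to measure the dissimilarity between the data and the cluster centers. The reason for using this measure is that it is simple and cheap to compute. However, this approach is easily affected by noise and outliers. To overcome this weakness, an improved measure using a parameter $\varepsilon$ has been proposed (Vapnik, 1998):
$$|t|_\varepsilon = \begin{cases} 0, & |t| \le \varepsilon \\ |t| - \varepsilon, & |t| > \varepsilon \end{cases} \tag{7}$$
where $\varepsilon$ is the noise-insensitivity parameter.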
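Formula (7), and the coordinate-wise sum over the p dimensions that the εFCM objective builds from it, can be written directly in Python. The function names `eps_insensitive` and `eps_distance` and the sample vectors are assumptions of this sketch.

```python
def eps_insensitive(t, eps):
    """Formula (7): zero inside the band [-eps, eps], linear outside it."""
    return max(abs(t) - eps, 0.0)

def eps_distance(x, v, eps):
    """Sum of the per-coordinate epsilon-insensitive deviations between x and v."""
    return sum(eps_insensitive(xl - vl, eps) for xl, vl in zip(x, v))

d_small = eps_distance((1.0, 2.0), (1.2, 2.1), eps=0.5)   # every coordinate inside the band
d_large = eps_distance((1.0, 2.0), (4.0, 2.0), eps=0.5)   # first coordinate is 3.0 away
```

Deviations smaller than $\varepsilon$ cost nothing, and larger deviations grow only linearly rather than quadratically, which is what limits the pull of an outlier on a cluster center.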
The objective function of the εFCM algorithm is defined as follows:
$$J_{m\varepsilon}(U, V) = \sum_{k=1}^{n} \sum_{i=1}^{c} (u_{ik})^m\, |x_k - v_i|_\varepsilon \tag{8}$$
where:
$$|x_k - v_i|_\varepsilon = \sum_{l=1}^{p} |x_{kl} - v_{il}|_\varepsilon \tag{9}$$
Notation: $(\cdot)$ stands for $(+)$ or $(-)$. The cardinality of a set A is card(A).
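Theorem 2: If m, c and $\varepsilon$ are fixed parameters, then for $(U, V) \in (E_{fc} \times R^{pc})$ the objective function $J_{m\varepsilon}(U, V)$ attains its minimum if and only if:
$$\forall\, 1 \le i \le c,\ 1 \le k \le N:\quad u_{ik} = \begin{cases} \dfrac{\left(|x_k - v_i|_\varepsilon\right)^{1/(1-m)}}{\sum_{j=1}^{c} \left(|x_k - v_j|_\varepsilon\right)^{1/(1-m)}}, & I_k = \emptyset \\[2ex] 0, & i \notin I_k,\ I_k \ne \emptyset \\[1ex] \text{any values with } \sum_{i \in I_k} u_{ik} = 1, & i \in I_k,\ I_k \ne \emptyset \end{cases} \tag{10}$$
and:
$$\forall\, 1 \le i \le c,\ 1 \le l \le p:\quad v_{il} = \begin{cases} x_{kl} + \varepsilon, & k \in A_i^{+} \\ x_{kl} - \varepsilon, & k \in A_i^{-} \end{cases} \tag{11}$$
where $A_i^{(\cdot)} = \{\, k \mid \lambda_k^{(\cdot)} \in (0, (u_{ik})^m) \,\}$ and the multipliers $\{\lambda_k^{\pm}\}$ solve
$$\min \left\{ \sum_{k=1}^{N} (\lambda_k^{+} - \lambda_k^{-})\, x_{kl} + \varepsilon \sum_{k=1}^{N} (\lambda_k^{+} + \lambda_k^{-}) \right\}$$
under the assumptions $\sum_{k=1}^{N} \lambda_k^{+} = \sum_{k=1}^{N} \lambda_k^{-}$ and $\lambda_k^{+}, \lambda_k^{-} \in [0, (u_{ik})^m]$.
The set $I_k$ is defined as $I_k = \{\, i \mid 1 \le i \le c;\ |x_k - v_i|_\varepsilon = 0 \,\}$, $k = 1, 2, \ldots, N$.
$$\forall\, 1 \le i \le c,\ 1 \le l \le p:\quad v_{il} = \frac{1}{\mathrm{card}(A_i^{+} \cup A_i^{-})} \left[ \sum_{k \in A_i^{+}} (x_{kl} + \varepsilon) + \sum_{k \in A_i^{-}} (x_{kl} - \varepsilon) \right] \tag{12}$$
This theorem has been proved in the literature.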




3.2.2.2. The εFCM algorithm
The steps of the εFCM algorithm are as follows:

THE εFCM ALGORITHM
Input: The number of clusters c and the parameters m and $\varepsilon$ of the objective function J;
Output: The clusters for which the objective function in (8) attains a minimum;
Begin
1. Enter the parameters c ($1 < c < n$), $m \in [1, +\infty)$ and $\varepsilon > 0$.
   Initialize the matrix $V = [v_{ij}]$, $V^{(0)} \in R^{s \times c}$, and set j = 0;
2. Repeat
   2.1. j := j + 1;
   2.2. Compute the fuzzy partition matrix $U^{(j)}$ by formula (10);
   2.3. Update the centers $V^{(j)} = [v_1^{(j)}, v_2^{(j)}, \ldots, v_c^{(j)}]$ using (12) and $U^{(j)}$;
3. Until $\|U^{(j)} - U^{(j-1)}\|_F \le \varepsilon$;
4. Present the resulting clusters;
End.

In summary, the εFCM algorithm is an extension of FCM that adapts to noise and outliers in the data. However, its efficiency on large and high-dimensional data sets, as well as the way to determine the parameter $\varepsilon$, are issues that still need further study. The improved FCM algorithm presented below is an improvement aimed at overcoming these weaknesses.



3.2.3. The improved FCM algorithm
FCM and εFCM both have certain limitations. To solve the problem of minimizing the objective function $J(u, v)$ with a number of clusters that is not fixed in advance, and to avoid ending in a merely local minimum, we build a fuzzy clustering algorithm that improves on FCM. It is described through the following component algorithms:

3.2.3.1. Algorithm 1: Selecting data points as candidates for the cluster centers
Let the data set be $X = \{x_1, x_2, \ldots, x_n\}$, with $x_i \in R^p$, $i = 1, \ldots, n$ and $n \in N^*$.
Partition X into m subsets (if n is small, no partitioning is needed):
Suppose $n = m \cdot b + d$ (b, d, m are positive integers, $1 \le d < m$).
Then we split $X = \bigcup_{t=1}^{m} C_x^t$, with $C_x^t = \{x_1^t, x_2^t, \ldots, x_b^t\}$, $t = 1, 2, \ldots, m - 1$, and the last set
$$C_x^m = \{x_1^m, x_2^m, \ldots, x_b^m, x_{b+1}^m, \ldots, x_{b+d}^m\}. \tag{13}$$
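Choosing the neighborhood:
This step has a considerable effect on the computation. The notion of sample variance from mathematical statistics can be used to choose the neighborhood of a data point.
For example, consider the set $Z = \{z_1, z_2, \ldots, z_n\}$, $z_i \in R$, $i = 1, \ldots, n$. The sample mean is $\bar{z} = \dfrac{1}{n} \sum_{i=1}^{n} z_i$ and the standard deviation is $s_z = \sqrt{\dfrac{1}{n} \sum_{i=1}^{n} (z_i - \bar{z})^2}$. We then take $r = \min_{1 \le k \le p} s_{X_k}$, where $s_{X_k}$ is the standard deviation of the set $\{x_{1k}, \ldots, x_{nk}\}$ for $k = 1, 2, \ldots, p$.
The neighborhood of each data point is taken to be a p-dimensional box whose radius r is defined from the standard deviation as above.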
A point $x_j$ is said to lie in the neighborhood of $x_i$ if $x_j$ lies in the p-dimensional box of radius r around $x_i$.
Computing the density of the data points:
The density of $x_i^t \in C_x^t$, denoted $D_i^t(x_i^t)$, is the number of data points that lie in the neighborhood of $x_i^t$:
$$D_i^t(x_i^t) = \sum_{j=1}^{b} u\!\left(r - \|x_j^t - x_i^t\|\right), \quad \text{with } u(z) = \begin{cases} 1, & z \ge 0 \\ 0, & z < 0 \end{cases} \tag{14}$$
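Algorithm 1 then proceeds through the following steps:

ALGORITHM 1
Step 1: Compute $\bar{z}_i = \dfrac{1}{n} \sum_{j=1}^{n} x_{ji}$ and the standard deviation $s_i = \sqrt{\dfrac{1}{n} \sum_{j=1}^{n} (x_{ji} - \bar{z}_i)^2}$, $i = 1, 2, \ldots, p$.
Compute the radius $r = \min_{1 \le i \le p} s_i$.
Step 2: Compute the density $D_i$ of each data point $x_i \in X$, that is, the number of data points that lie in the p-dimensional box of radius r around $x_i$: $D_i = \sum_{j=1}^{n} u\!\left(r - \|x_j - x_i\|\right)$.
Step 3: Find the point $x_i \in X$ with the highest density.
Step 4: Compute CC = {$x_j$ : $x_j$ lies in the p-dimensional box of radius r around $x_i$} and set $X = X \setminus CC$. (* Here CC is the set of all data points that lie in the p-dimensional box of radius r around the data point $x_i$ *)
Step 5: If $X = \emptyset$, stop. Otherwise return to Step 2.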
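The five steps can be sketched as follows. This is an illustrative reading, not the thesis's code: the "p-dimensional box of radius r" is interpreted as "every coordinate differs by at most r", ties in density are broken by taking the first point, and the function name `select_candidates` is hypothetical.

```python
import math

def select_candidates(points):
    """Density-based selection of candidate cluster centers (sketch of Algorithm 1)."""
    n, p = len(points), len(points[0])
    # Step 1: per-coordinate standard deviation; the radius r is the smallest one.
    means = [sum(x[i] for x in points) / n for i in range(p)]
    stds = [math.sqrt(sum((x[i] - means[i]) ** 2 for x in points) / n) for i in range(p)]
    r = min(stds)

    def in_box(a, b):
        # Assumed reading of the box neighborhood: each coordinate within r.
        return all(abs(a[i] - b[i]) <= r for i in range(p))

    remaining = list(points)
    candidates = []
    while remaining:                      # Step 5: stop when X is empty
        # Steps 2 and 3: the densest point has the most remaining points in its box.
        best = max(remaining, key=lambda x: sum(in_box(x, y) for y in remaining))
        candidates.append(best)
        # Step 4: remove the box CC around the chosen point from X.
        remaining = [y for y in remaining if not in_box(best, y)]
    return candidates, r

points = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.2), (0.1, 0.1),
          (5.0, 5.0), (5.1, 5.2), (5.2, 5.1), (5.1, 5.1)]   # hypothetical data
candidates, r = select_candidates(points)
```

For this hypothetical data the loop runs twice and returns one candidate from each group, because every selected point removes its whole neighborhood from further consideration.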
When the data sample is large, we split the data set into smaller data sets and then apply Algorithm 1 to each of the resulting subsets.
After Algorithm 1 has finished, we have a set of data points that are candidates for the cluster centers. If this set is large, Algorithm 1 is applied once more. This is expressed by Algorithm 2.

3.2.3.2. Algorithm 2: Pruning the candidates
After Algorithm 1 has finished, $n_c$ data points have been selected from the original data set X as candidates.
Let $C_x = \{x_j : x_j \text{ is a candidate data point},\ j = 1, \ldots, n_c\}$. When $n_c$ is large, we apply Algorithm 1 to the new data set $C_x$. The result is a new data set $C_x^p = \{x_j,\ j = 1, 2, \ldots, n_p\}$. After Algorithm 2 has finished, we move on to Algorithm 3 to find the data points that serve as cluster centers; these are the data points that minimize the objective function.

3.2.3.3. Algorithm 3: Choosing the candidates that minimize the objective function
After Algorithms 1 and 2 have finished, we have the set of data points that are candidates for the cluster centers, $C_c^p = \{cc_1, cc_2, \ldots, cc_{n_p}\}$.
In the FCM algorithm, the objective function $J(u, v)$ is
$$J(u, v) = \sum_{i=1}^{n} \sum_{j=1}^{n_p} \mu_{ji}^m\, \|x_i - cc_j\|^2 \tag{15}$$
We replace this objective function by a new objective function $J^*_{FCM}$ defined as follows:
$$J^*_{FCM} = \sum_{i=1}^{n} \sum_{j=1}^{q} \mu_{ji}^m\, \|x_i - cc_j^c\|^2 + \omega \cdot J_{cc} \tag{16}$$
with
$$J_{cc} = \sum_{i=1}^{q} \sum_{j=i+1}^{q} \|cc_i^c - cc_j^c\|^2 \tag{17}$$
and
$$\mu_{ji} = \frac{1}{\sum_{k=1}^{q} \left( \dfrac{\|x_i - cc_j^c\|^2}{\|x_i - cc_k^c\|^2} \right)^{1/(m-1)}} \tag{18}$$
Usually m = 2 is chosen. Because clusters may lie very close to each other, the term $J_{cc}$ is included with a weight $\omega > 1$ so that such clusters can still be distinguished when they are quite close.
Algorithm 3 then proceeds through the following steps:

ALGORITHM 3
Step 1: Set q = 1, $J^{**}_{FCM} = +\infty$, $\alpha = 1$.
Step 2: Set $cc_q^c = cc_\alpha$, with $cc_\alpha \in C_c^p$.
Step 3: Compute the memberships $\mu_{ji}$ by formula (18) for $i = 1, 2, \ldots, n$ and $j = 1, 2, \ldots, q$, so that $\sum_{j=1}^{q} \mu_{ji} = 1$.
Compute $J^*_{FCM}$ by formula (16).
Step 4: If $J^*_{FCM} < J^{**}_{FCM}$, set $J^{**}_{FCM} := J^*_{FCM}$ and q := q + 1;
Step 5: If $\alpha > n_p - 1$, stop with q := q - 1. Otherwise return to Step 2 with $\alpha := \alpha + 1$.

3.2.3.4. Algorithm 4: Assigning closely linked centers to one cluster
After Algorithm 3 has finished, a finite number of data points have been chosen as cluster centers: $\{cc_1^c, cc_2^c, \ldots, cc_h^c\}$, $h \le n_p$.
We now check whether clusters can be connected to one another. This is done by Algorithm 4, which works as follows:
Task 1: Find a path that can join $cc_i^c$ and $cc_j^c$ (the path depends on the specific application; for example, for an eyebrow we look for a parabolic curve, and for an eye we can use an ellipse).
Suppose the points on the path joining $cc_i^c$ and $cc_j^c$ are $v_1, v_2, \ldots, v_k$, such that:
$$\|v_1 - cc_i^c\| = \|v_2 - v_1\| = \cdots = \|v_k - v_{k-1}\| = \|cc_j^c - v_k\| = \frac{\|cc_i^c - cc_j^c\|}{k + 1}$$
We then compute the densities of these points:
$$D_{cc_i} = \sum_{l=1}^{n} u\!\left(r - \|x_l - cc_i^c\|\right) \tag{19}$$
$$D_{cc_j} = \sum_{l=1}^{n} u\!\left(r - \|x_l - cc_j^c\|\right) \tag{20}$$
$$D_{v_a} = \sum_{l=1}^{n} u\!\left(r - \|x_l - v_a\|\right), \quad a = 1, \ldots, k \tag{21}$$
If $D_{v_l} > \dfrac{\max(D_{cc_i}, D_{cc_j})}{4}$ holds for all $l = 1, 2, \ldots, k$, the two clusters are considered connectable.
Task 2: Join the two centers and use transitivity to chain connected centers into one cluster.
For example, if ($cc_i^c$ and $cc_j^c$ are connectable) and ($cc_j^c$ and $cc_k^c$ are connectable), then $cc_i^c$, $cc_j^c$ and $cc_k^c$ are connectable, and these centers are assigned to the same cluster.
Algorithm 4 is given in detail in the following steps:


54

S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn

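ALGORITHM 4
Task one:
Step 1: Set $Cluster_l = \{cc_l^c\}$, l = 1, ..., h.
Step 2: Create index1 and index2, both one-dimensional arrays of size h. Initialize every element of these arrays to 0.
Step 3: Set i := 1; j := i + 1;
Step 4: Find the points lying on the curve joining $cc_i^c$ and $cc_j^c$ (the curve usually follows the shape to be recognized; for an eyebrow, for instance, we use a parabolic curve), namely $v_1, v_2, \ldots, v_k$ such that
\[
\lVert v_1 - cc_i^c \rVert = \lVert v_2 - v_1 \rVert = \cdots = \lVert v_k - v_{k-1} \rVert = \lVert cc_j^c - v_k \rVert = \frac{\lVert cc_i^c - cc_j^c \rVert}{k+1} .
\]
Step 5: Compute the densities of the points from step 4:
\[
D_{cc_i} = \sum_{l=1}^{n} u\bigl(r - \lVert x_l - cc_i^c \rVert\bigr), \quad
D_{cc_j} = \sum_{l=1}^{n} u\bigl(r - \lVert x_l - cc_j^c \rVert\bigr),
\]
\[
D_{v_a} = \sum_{l=1}^{n} u\bigl(r - \lVert x_l - v_a \rVert\bigr), \quad a = 1, \ldots, k .
\]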
Bc 6:
IF
1
k
a=

(
max( , )
4
i j
a
cc cc
v
D D
D > )THEN
If (index1[i]=0) and (index2[j]=0) then
Begin
cluster cluster cluster
j j i
= ;
Index1[i]:=j;
Index2[j]:=i;
End
Else if ( | | | | ( )
1 0) 2 0 index i and index j = = then
Begin
| | max( , 1 ) j index i
cluster =
| | max( , 1 ) j index i
cluster
| | max( , 2 ) i index j
cluster ;
| | | |
1 max( , 1 ) index i j index i = ;
| |
2 index j i = ;
End
Else if (( | | | | ( )
2 0) 1 0 index j and index i = = then
Begin
| | 1 2 index index j
cluster
(

=
| | 1 2 index index j
cluster
(

;
i
cluster
| |
1 index i = | |
1 2 index index j (

;
End
Else if (
| | | |
( 1 0) ( 2 0) index i and index j <> <> then
Begin
temp:= | |
1 2 index index j (

;

| | max( 1 , ) index i temp
cluster =
| | max( 1 , ) index i temp
cluster

| | min( 1 , ) index i temp
cluster ;

| | | |
1 : max( 1 , ); index i index i temp =
End
Else (* ca IF u tin *)
begin
{ }
c
i i
cluster cc = v
{ }
c
j j
cluster cc =
End;
Bc 7: IF ( ) ( ) i h and j h s s THEN j:=j+1 v quay tr li bc 4
ELSE if ( ) i h s then quay tr li bc 4
ELSE i:=i+1,j:=j+1 v quay tr li bc 4
Vic th hai:
Bc 1: i:=1;FC:=| ;
Bc 2: IF
| |
1 0 index i <> then
Begin
h:=h-1; xa
i
cluster ;
FC:=FC i ;
End;
Bc 3: i:=i+1;
Bc 4: IF i h s THEN quay li bc 2
ELSE dng li .

3.2.3.5. Summary of the improved FCM algorithm (FCM-Cải tiến)
From the details of the four algorithms above, the improved FCM algorithm can be summarized in general form through the following steps:


IMPROVED FCM ALGORITHM
Given a data set $X = \{x_1, x_2, \ldots, x_n\}$, $x_i \in R^p$, i = 1, ..., n, with $n, p \in N^*$.
The improved FCM algorithm is carried out through the following steps:
Step 1: If n is large, run Algorithm 1 to obtain all points that could serve as cluster centers; otherwise go to step 3.
Step 2: If the number of candidate points after Algorithm 1 is large, run Algorithm 2 to reduce the number of candidate cluster centers.
Step 3: Run Algorithm 3 to find the points that serve as cluster centers.
Step 4: Run Algorithm 4 to join closely related cluster centers into one cluster.
When step 4 finishes, we have the set of cluster centers.
Step 5: (optional; one may jump to step 6)
Run the FCM algorithm with this set of cluster centers, then stop.
Step 6: Compute the membership degrees of the remaining data points.
Let $\mu_{ij}$ be the membership degree of $x_i$ in $cc_j^c$. Then
\[
\mu_{ij} = \frac{1}{\displaystyle\sum_{k=1}^{h} \left( \frac{\lVert x_i - cc_j^c \rVert^2}{\lVert x_i - cc_k^c \rVert^2} \right)^{\frac{1}{m-1}}}
\]
where h is the number of cluster centers obtained after step 4.

In summary, fuzzy clustering extends data clustering by adding the relationship between data elements and clusters through the weights in the matrix U. In this way complex clusters can be discovered flexibly from a given data set. Fuzzy clustering algorithms extend crisp clustering algorithms in order to discover overlapping clusters in data sets that are large, high-dimensional, and noisy.

CHAPTER 4
A MULTI-SYNAPSE NEURAL NETWORK MODEL
FOR FUZZY CLUSTERING
4.1. Overview of neural networks
4.2. Neural network structure
4.2.1. Activation function
4.2.2. Network connections
4.2.3. The network training problem
4.3. The Hopfield network
4.3.1. Training the network
4.3.2. Using the network
4.4. A multi-synapse neural network for clustering
4.4.1. Building Layer1 to optimize the cluster centers
4.4.2. Building Layer2 to optimize the membership degrees
4.5. Convergence of FBACN
4.5.1. Proof of the convergence of FBACN
4.5.2. Convergence of the continuous FBACN of Layer1
4.5.3. The FBACN algorithm and FBACN with learning

The improved FCM algorithm overcomes some limitations of the FCM and εFCM algorithms. Its drawback, however, is that every clustering request restarts the algorithm from scratch: results obtained on earlier samples cannot be reused, so the running time is considerable when the sample is large. In this chapter we therefore study a multi-synapse neural network model for the fuzzy clustering problem (a fuzzy bi-directional associative clustering network, FBACN). This network is based mainly on the work of Chih-Hsiu Wei and Chin-Shyurng Fahn.

4.1. Overview of neural networks
Human knowledge today is extremely rich, deep, and diverse. It ranges from our understanding of the microscopic world (atoms, electrons, nuclei, elementary particles) to our macroscopic understanding of the Earth, the solar system, and galaxies; it covers the natural world and society, the various branches of science and engineering such as mathematics, physics, chemistry, and information technology, and our understanding of human beings themselves. Yet, paradoxically, we know "very little" about our own brain.
Moreover, the demand for solving complex problems keeps growing, and it is human nature not to be satisfied with the present but to reach for something higher and more complete. It is perhaps for these reasons that the terms "neural network" and "artificial neural network" came into being. They refer to a new field of engineering that draws on knowledge from many sciences, such as mathematics, physics, chemistry, biology, psychology, and neurology, all with the aim of building computers that work like our own "brain".
An artificial neural network, usually called simply a neural network, is a mathematical or computational model built on the basis of biological neural networks. It consists of a group of artificial neurons (nodes) connected to one another, and it processes information by passing it along the connections and computing new values at the nodes. In many cases an artificial neural network is an adaptive system that changes its own structure according to the external or internal information flowing through the network during learning.
In practice, many neural networks are tools for nonlinear statistical data modeling. They can be used to model complex relationships between inputs and outputs or to search for patterns in data.

Figure 4.1: Neural network model

An artificial neural network (ANN) is a mathematical model made up of basic information-processing nodes (called processing units or neurons) that are highly interconnected and that process information in a parallel, distributed way with strong computational power (for example, such a network can learn, memorize, and infer from data samples). Each connection between two neurons carries a weight that characterizes the excitatory or inhibitory effect between them. The weights can be seen as the means of storing long-term information in the network, and the task of training (also called learning) is to update the weights as more information about the training samples becomes available. In other words, the weights are adjusted so that the input-output behavior of the network matches the environment under consideration. The structure of a neural network is therefore characterized mainly by the type of its neurons and by the way information flows between them. As a result, neural networks have many applications in fields such as recognition, image classification, data analysis and compression, optimization problems, forecasting, and diagnosis. The current trend is to combine neural networks with fuzzy logic.
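4.2. Neural network structure
4.2.1. Activation function
The activation function of each neuron plays an important role in the connections between neurons: it characterizes the strength of the link between them.
In neural network theory, the aggregation of the input signals is usually written as
\[
net_j = \sum_{i=1}^{n} x_i\,\omega_{ji}
\]
where $x_i$, i = 1, ..., n, are the input signals, $\omega_j = (\omega_{j1}, \ldots, \omega_{jn})$ are the weights, and n is the number of input signals. The output of neuron j is usually denoted $out_j$ or $f_j$ and is given by the activation function:
\[
out_j = f_j = f\Bigl(\sum_{i=1}^{n} \omega_{ji}\,x_i(t) - \theta\Bigr)
\]
where $\theta$ is the activation threshold of the neuron, t is time, and f is the activation function.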
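As a small illustration of the two formulas above, the output of a single neuron can be computed as below. The function name `neuron_output` and the default choice of `math.tanh` as activation are assumptions made only for this sketch.

```python
import math


def neuron_output(x, w, theta, f=math.tanh):
    """Weighted sum of the inputs minus the threshold, passed through f.
    The default activation tanh is an illustrative assumption."""
    net = sum(xi * wi for xi, wi in zip(x, w))
    return f(net - theta)


# with an identity activation the output is simply net - theta
linear_out = neuron_output([1.0, 2.0], [0.5, 0.25], theta=1.0, f=lambda a: a)
squashed_out = neuron_output([1.0, 2.0], [0.5, 0.25], theta=0.0)
```

Passing a different callable for `f` (a step function, a sigmoid) changes the neuron's behavior without touching the aggregation step.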

4.2.2. Network connections
The connections in a neural network depend on how the output of each individual neuron interacts with the other neurons, and they determine the structure of the network. In principle there are many kinds of connections between neurons, but in practice the most common forms are feedforward networks and recurrent networks.

4.2.3. The network training problem
Training a network means solving an optimization problem over the network parameters, mainly the connection weights and the structure of the connections between neurons and between layers, based on the information available in the system.
Training (also called the learning algorithm) is usually carried out by comparing the network output with a target signal.
The supervised learning model is shown in Figure 4.2 below:

Figure 4.2: Supervised learning model

The error e = y - d is the basis for training the network.
Next we look at a widely used neural network model, the Hopfield network.

4.3. The Hopfield network
In 1982 the American physicist J.J. Hopfield proposed a neural network (NN) model that maps input signals to output signals in an auto-associative manner: if the input signal X belongs to some domain D, the result Y must also belong to D. As a consequence, an input vector X that is incomplete or distorted can be restored to its original form.
In applications, the Hopfield network models the auto-associative ability of the human brain. With some modifications it is also used to solve optimization problems and data processing problems in automatic control.

4.3.1. Training the network
The Hopfield network learns according to the supervised learning principle. Suppose there are p training samples corresponding to the input vectors $X_s$, s = 1, 2, ..., p. The Hopfield network determines a weight matrix W such that
$X_s = Tinh(X_s, W)$ for every s = 1, 2, ..., p.
The weight matrix W is defined as
\[
\omega_{ji} =
\begin{cases}
\displaystyle\sum_{s=1}^{p} x_{sj}\,x_{si}, & i \ne j \\
0, & i = j
\end{cases}
\qquad \text{with } X_s = (x_{s1}, \ldots, x_{sm}).
\]

4.3.2. Using the network
Suppose the vector X is presented to the network as input.
Computing the output that corresponds to X is an iterative process with the following steps:
1. Initially set t = 0 and $X^{(0)} = X$. Let $Y^{(t+1)}$ be the output vector obtained by propagating $X^{(t)}$ through the network once:
$Y^{(t+1)} = out^{(t+1)} = Tinh(HF, X^{(t)})$.
2. If $Y^{(t+1)} \ne X^{(t)}$, continue iterating with $X^{(t+1)} = Y^{(t+1)}$ and t = t + 1. Otherwise stop.
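The training rule of 4.3.1 and the recall loop of 4.3.2 can be sketched together. The sketch assumes bipolar patterns (entries +1 or -1) and a sign threshold for the propagation step, which the text leaves unspecified; the names `train_hopfield` and `recall` are hypothetical.

```python
def train_hopfield(patterns):
    """Weights w[j][i] = sum over samples of x_sj * x_si, with a zero diagonal."""
    n = len(patterns[0])
    w = [[0] * n for _ in range(n)]
    for x in patterns:
        for j in range(n):
            for i in range(n):
                if i != j:
                    w[j][i] += x[j] * x[i]
    return w


def recall(w, x, max_iters=100):
    """Propagate the state through the network until it no longer changes.
    Assumption: sign activation, and a neuron keeps its value when net == 0."""
    state = list(x)
    for _ in range(max_iters):
        new_state = []
        for j, row in enumerate(w):
            net = sum(wji * vi for wji, vi in zip(row, state))
            new_state.append(state[j] if net == 0 else (1 if net > 0 else -1))
        if new_state == state:
            break
        state = new_state
    return state


stored = [1, -1, 1, -1, 1, -1]
weights = train_hopfield([stored])
noisy = [1, -1, 1, -1, 1, 1]      # last component flipped
restored = recall(weights, noisy)
```

With a single stored pattern, one distorted component is repaired after one pass, which illustrates the restoration property described in 4.3.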
Next we study a neural network model for fuzzy clustering, the multi-synapse neural network.

4.4. A multi-synapse neural network for clustering
In recent years, dynamic neural systems (sometimes called recurrent neural networks) have been widely used in information processing.

Figure 4.3: FBACN model

The structure of the multi-synapse network FBACN is shown in Figure 4.3. The recurrent layer Layer1 is realized by a Hopfield network that optimizes the cluster centers, while the recurrent layer Layer2 is realized by a multi-synapse neural network that optimizes the membership degrees. Together, Layer1 and Layer2 form a third recurrent loop, which gives the network its dynamic structure.
FBACN operates as follows. First, the cluster centers and the membership degrees are initialized randomly in Layer1 and Layer2 respectively. Second, the initial membership degrees in Layer2 are passed to Layer1. Third, based on the membership degrees received, Layer1 runs its recurrence to obtain new optimal cluster centers. Fourth, the new cluster centers of Layer1 are passed to Layer2. Fifth, based on the new cluster centers received, Layer2 runs its recurrence to obtain new optimal membership degrees. One pass through steps 2 to 5 is called an iteration. The iterations continue until a stopping criterion is reached.
4.4.1. Building Layer1 to optimize the cluster centers
Layer1 of FBACN can use either a Hopfield network or a multi-synapse neural network, depending on the constraints of the FC-partition (FC: fuzzy c). If the constraints give the objective function a higher-order, logarithmic, sinusoidal, or similar form, a multi-synapse network is used instead of a simple Hopfield network.

Figure 4.4: Layer1 model of FBACN

Let $\omega_{ji}$ be the connection weight from neuron i to neuron j, and let $i_j$ denote the external input to neuron j. The total input to neuron j is then
\[
net_j = \sum_{i=1}^{s} \omega_{ji}\,v_i + i_j \tag{1}
\]
where $v_i$ is the output of neuron i and f is a continuous, monotonically increasing function.
The activation function is
\[
f(net_j) = \frac{2}{1 + \exp(-r \cdot net_j)} - 1 \tag{2}
\]
Here the parameter r > 0 increases the adaptability and the computational capability of the network.
Let g be the recursion index; then
\[
v_j^{(g+1)} = f\bigl(net_j^{(g)}\bigr) = f\Bigl(\sum_{i=1}^{s} \omega_{ji}\,v_i^{(g)} + i_j\Bigr) \tag{3}
\]
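The network can then be described through the matrix NET:
\[
NET = WV + I \tag{4}
\]
with
\[
NET = \begin{bmatrix} net_1 \\ net_2 \\ \vdots \\ net_s \end{bmatrix}, \quad
W = \begin{bmatrix} \omega_{11} & \cdots & \omega_{1s} \\ \omega_{21} & \cdots & \omega_{2s} \\ \vdots & & \vdots \\ \omega_{s1} & \cdots & \omega_{ss} \end{bmatrix}, \quad
V = \begin{bmatrix} v_1 \\ v_2 \\ \vdots \\ v_s \end{bmatrix}
\quad \text{and} \quad
I = \begin{bmatrix} i_1 \\ i_2 \\ \vdots \\ i_s \end{bmatrix}.
\]
To assess the stability of the system in Figure 4.4, we use the computational energy function (CEF).
The CEF, denoted E, is
\[
E = -\frac{1}{2} V^T W V - I^T V \tag{5}
\]
or, more explicitly,
\[
E = -\frac{1}{2} \sum_{j=1}^{s} \sum_{i=1}^{s} \omega_{ji}\,v_i\,v_j - \sum_{j=1}^{s} i_j\,v_j \tag{6}
\]
In formula (6), E is a quadratic form. In this dynamic neural network we can therefore use a quadratic objective function to optimize the parameters.
Next we show how the objective function of the FC-partition matches the computational energy function of FBACN.
The objective function of the FC-partition is
\[
z_m(U; v) = \sum_{k=1}^{n} \sum_{i=1}^{c} \bigl[ x_k^T x_k - 2\,v_i^T x_k + v_i^T v_i \bigr] (u_{i,k})^m \tag{7}
\]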
In Layer1 of FBACN the parameters to be optimized are the cluster centers $v_i$. The new membership degrees $u_{i,k}$ passed from Layer2 to Layer1 trigger the recurrence in Layer1 that optimizes $v_i$. During this recurrence the $u_{i,k}$ are temporarily treated as constants. By the definition of the objective function, the cluster centers $v_i = (v_{i1}, v_{i2}, \ldots, v_{ip}) \in R^p$ are p-dimensional vectors. We must therefore expand (7) to eliminate the vector expressions before it can be compared with the scalar computational energy function. Expanding (7) into scalars and dropping its first term, which does not depend on v, gives
\[
z_m(U; v) = \sum_{k=1}^{n} \sum_{i=1}^{c} \sum_{l=1}^{p} \bigl[ -2\,v_{i,l}\,x_{k,l} + v_{i,l}^2 \bigr] (u_{i,k})^m \tag{8}
\]
Comparing (6) and (8), the main difference between the two formulas is the notation. In the neural network, the output of each neuron is identified by a single subscript, for example $v_i$ with i = 1, ..., s. After expanding the objective function of the FC-partition, however, the parameter $v_{i,l}$ carries two subscripts, with l = 1, ..., p and i = 1, ..., c.
To unify the notation we relabel as follows:
\[
v_{i,l} \rightarrow v_{(i-1)p + l} \tag{9}
\]
Expression (8) can then be rewritten as
\[
z_m(U; v) = \sum_{k=1}^{n} \sum_{i=1}^{c} \sum_{l=1}^{p} \bigl[ -2\,(u_{i,k})^m\,v_{(i-1)p+l}\,x_{k,l} + (u_{i,k})^m\,v_{(i-1)p+l}^2 \bigr] \tag{10}
\]
The number of neurons s in Layer1 in (6) is then $c \times p$.
We have
\[
I = \begin{bmatrix} i_1 \\ i_2 \\ \vdots \\ i_{c \times p} \end{bmatrix}, \quad \text{with} \quad
i_{(i-1)p+l} = 2 \sum_{k=1}^{n} (u_{i,k})^m\,x_{k,l}, \quad i = 1, \ldots, c \text{ and } l = 1, \ldots, p \tag{11}
\]
and the elements of the matrix W are given by
\[
\omega_{ji} =
\begin{cases}
-2 \displaystyle\sum_{k=1}^{n} \bigl(u_{\lceil i/p \rceil, k}\bigr)^m, & i = j \\
0, & i \ne j
\end{cases}
\qquad i, j = 1, \ldots, c \times p \tag{12}
\]
where $\lceil x \rceil$ denotes the smallest integer greater than or equal to x.

Optimization process of Layer1:
The goal of the recurrent network is to drive the objective function towards its minimum. Since W is symmetric, $W^T = W$, and taking the gradient of the energy in (5) gives
\[
\nabla E = -\frac{1}{2}(W^T + W)V - I = -(WV + I) = -NET \tag{13}
\]
and
\[
\Delta E = (\nabla E)^T \Delta V = -NET^T \Delta V = -\sum_{j=1}^{s} net_j\,\Delta v_j \tag{14}
\]
Next we construct the activation function f (continuous or discrete) used in Layer1 of FBACN according to
\[
v_j^{(g+1)} = f\bigl(net_j^{(g)}\bigr) =
\begin{cases}
v_j^{(g)} + \delta_j, & net_j^{(g)} \ge 0 \\
v_j^{(g)} - \delta_j, & net_j^{(g)} < 0
\end{cases} \tag{15}
\]
where $\delta_j$ is a small positive value used to adjust $v_j$. When $net_j^{(g)} > 0$ we have $\Delta v_j = v_j^{(g+1)} - v_j^{(g)} = \delta_j > 0$, and symmetrically $\Delta v_j < 0$ when $net_j^{(g)} < 0$, so that
\[
\Delta E = -\sum_{j=1}^{s} net_j\,\Delta v_j \le 0 .
\]
In this architecture the step size $\delta_j$ is not restricted to a fixed value and can be updated during the recurrence. In the method designed here, its value is chosen to suit the evolution of Layer1.
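The Layer1 recurrence can be sketched as below, assuming the diagonal weights of (12), the external inputs of (11), and the discrete update (15), together with a step-halving rule applied whenever a neuron's net input changes sign. The function name `optimize_centers`, the 0-based neuron index `i * p + l`, and the iteration cap are assumptions of this sketch.

```python
def optimize_centers(x, u, v, m=2.0, step=1.0, tol=1e-6, max_iters=10000):
    """x: n data points of dimension p; u: c rows of n memberships;
    v: flat list of c*p initial center coordinates. Returns the updated flat list."""
    c, n, p = len(u), len(x), len(x[0])
    s = c * p
    w = [0.0] * s      # diagonal of W, formula (12)
    ext = [0.0] * s    # external inputs I, formula (11)
    for i in range(c):
        um = [u[i][k] ** m for k in range(n)]
        for l in range(p):
            j = i * p + l
            w[j] = -2.0 * sum(um)
            ext[j] = 2.0 * sum(um[k] * x[k][l] for k in range(n))

    v = list(v)
    delta = [step] * s
    prev_net = [0.0] * s
    for _ in range(max_iters):
        net = [w[j] * v[j] + ext[j] for j in range(s)]   # NET = WV + I, W diagonal
        for j in range(s):
            if net[j] * prev_net[j] < 0:                 # sign change: halve the step
                delta[j] /= 2
            v[j] += delta[j] if net[j] >= 0 else -delta[j]   # update (15)
        prev_net = net
        if all(d <= tol for d in delta):
            break
    return v


data = [[0.0], [1.0], [10.0]]
memb = [[0.9, 0.8, 0.1], [0.1, 0.2, 0.9]]
centers = optimize_centers(data, memb, [5.0, 5.0])
```

Because the energy is quadratic with a negative diagonal W, each coordinate settles at the point where its net input vanishes, which is the usual FCM weighted mean of the data with weights $(u_{i,k})^m$.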

4.4.2. Building Layer2 to optimize the membership degrees
Layer2 of FBACN optimizes the membership degrees. The centers $v_i$ can be treated as temporary constants while Layer2 runs its recurrence. Let $d_{i,k} = \lVert x_k - v_i \rVert^2$. Then the formula
\[
z_m(U; v) = \sum_{i=1}^{c} \sum_{k=1}^{n} (u_{i,k})^m \lVert x_k - v_i \rVert^2
\]
can be rewritten as
\[
\sum_{i=1}^{c} \sum_{k=1}^{n} (u_{i,k})^m\,d_{i,k} .
\]
The optimization problem in Layer2 of FBACN is therefore: minimize
\[
\sum_{i=1}^{c} \sum_{k=1}^{n} (u_{i,k})^m\,d_{i,k}
\quad \text{subject to} \quad
\sum_{i=1}^{c} u_{i,k} = 1, \quad k = 1, \ldots, n .
\]
Using a Lagrange multiplier, the problem can be restated as minimizing
\[
\sum_{k=1}^{n} \Bigl[ \sum_{i=1}^{c} (u_{i,k})^m\,d_{i,k} + \lambda \Bigl( \sum_{i=1}^{c} u_{i,k} - 1 \Bigr)^2 \Bigr] \tag{16}
\]
where $\lambda$ is the Lagrange parameter (usually $10000 \le \lambda \le 100000$).
We relabel
\[
u_{i,k} \rightarrow u_{(k-1)c + i} \quad \text{and} \quad d_{i,k} \rightarrow d_{(k-1)c + i} \tag{17}
\]
The objective function can then be written as
\[
\sum_{k=1}^{n} \Bigl[ \sum_{i=1}^{c} \bigl(u_{(k-1)c+i}\bigr)^m\,d_{(k-1)c+i} + \lambda \Bigl( \sum_{i=1}^{c} u_{(k-1)c+i} - 1 \Bigr)^2 \Bigr] \tag{18}
\]
In formula (18) the highest-order term is $\bigl(u_{(k-1)c+i}\bigr)^m$ (we usually choose m = 2), with k = 1, 2, ..., n and i = 1, 2, ..., c. The Hopfield network, however, is suitable only for quadratic optimization problems. We therefore develop the multi-synapse neural network for general optimization problems: it can handle any constrained higher-order objective function.

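The simple multi-synapse neural network model used for Layer2 is shown below:

Figure 4.5: Layer2 model of FBACN

The dynamics of the multi-synapse network in Figure 4.5 are
\[
u_j^{(g+1)} = f\bigl(net_j^{(g)}\bigr) = f\Bigl( \sum_{i=1}^{s} \bigl( \omega_{ji}\,u_i^{(g)} + z_{ji}\,u_i^{(g)} \bigr) + i_j \Bigr), \quad j = 1, 2, \ldots, s . \tag{19}
\]
The network input matrix is
\[
NET = W U + Z U + I \tag{20}
\]
with
\[
Z = \begin{bmatrix} z_{11} & z_{12} & \cdots & z_{1s} \\ z_{21} & z_{22} & \cdots & z_{2s} \\ \vdots & \vdots & & \vdots \\ z_{s1} & z_{s2} & \cdots & z_{ss} \end{bmatrix}
\quad \text{and} \quad
U = \begin{bmatrix} u_1 \\ u_2 \\ \vdots \\ u_s \end{bmatrix}.
\]
We also define
\[
U^{\langle m-1 \rangle} = \begin{bmatrix} u_1^{m-1} \\ u_2^{m-1} \\ \vdots \\ u_{n \times c}^{m-1} \end{bmatrix}, \quad m > 1, \quad \text{and} \quad U^{\langle 1 \rangle} = U \tag{21}
\]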
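Denote the transpose of $U^{\langle m-1 \rangle}$ by $\bigl(U^{\langle m-1 \rangle}\bigr)^T$. The computational energy function of the network is then
\[
E = -\frac{1}{m} \bigl(U^{\langle m-1 \rangle}\bigr)^T W\,U - \frac{1}{2}\,U^T Z\,U - U^T I \tag{22}
\]
The number of neurons s in Layer2 is $n \times c$. The elements of the matrices W, Z and I are computed as follows:
\[
W = -m \begin{bmatrix} d_1 & 0 & \cdots & 0 \\ 0 & d_2 & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & d_{c \times n} \end{bmatrix}
\]
that is, the elements of W are given by
\[
\omega_{ji} =
\begin{cases}
-m\,d_i, & i = j \\
0, & i \ne j
\end{cases}
\qquad i, j = 1, 2, \ldots, c \times n \tag{23}
\]
Z is a matrix of size $(c \times n) \times (c \times n)$ defined by
\[
z_{ji} =
\begin{cases}
-2\lambda, & \bigl(\lceil i/c \rceil - 1\bigr) c < j \le \lceil i/c \rceil\,c \\
0, & \text{otherwise}
\end{cases}
\qquad i, j = 1, 2, \ldots, c \times n \tag{24}
\]
and I is a column vector of size $c \times n$ defined by
\[
I = \begin{bmatrix} 2\lambda \\ 2\lambda \\ \vdots \\ 2\lambda \end{bmatrix}
\]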
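Because the weight matrix W is symmetric (W is called symmetric if $A^T W B = B^T W A$ for all vectors A and B), the computational energy function (22) can be written as
\[
E = -\frac{1}{m}\,U^T W\,U^{\langle m-1 \rangle} - \frac{1}{2}\,U^T Z\,U - U^T I \tag{25}
\]
The vector $U^{\langle m-1 \rangle}$ lets the network work with the two weight matrices W and Z together. The matrix NET of the multi-synapse network is then
\[
NET = W\,U^{\langle m-1 \rangle} + Z\,U + I .
\]
Let h be the function defined by
\[
h(u_j) = u_j^{m-1} \tag{26}
\]
so that h computes $u_j^{m-1}$.
The dynamics of Layer2 are given by
\[
u_j^{(g+1)} = f\bigl(net_j^{(g)}\bigr) = f\Bigl( \sum_{i=1}^{s} \bigl( \omega_{ji} \bigl(u_i^{(g)}\bigr)^{m-1} + z_{ji}\,u_i^{(g)} \bigr) + i_j \Bigr) \tag{27}
\]
where $net_j$ is the total input of neuron j, computed as
\[
net_j = \sum_{i=1}^{s} \bigl( \omega_{ji}\,u_i^{m-1} + z_{ji}\,u_i \bigr) + i_j \tag{28}
\]
From formula (25), the energy gradient $\nabla E$ is
\[
\frac{\partial E}{\partial u_j} = -\Bigl( \sum_{i=1}^{s} \bigl( \omega_{ji}\,u_i^{m-1} + z_{ji}\,u_i \bigr) + i_j \Bigr), \quad j = 1, \ldots, s, \quad \text{with } s = n \times c \tag{29}
\]
From (28) and (29) we obtain
\[
\frac{\partial E}{\partial u_j} = -net_j, \quad j = 1, \ldots, s \tag{30}
\]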
Optimization process of Layer2
Once the objective function (18) is matched with the computational energy function (25) and the energy gradient is tied to the net input, the optimum is approached gradually as the network evolves. From the energy concept, the discrete activation function f of the multi-synapse network is built in the same way as for the first layer.
The energy gradient with respect to $u_j$ is
\[
\frac{\partial E}{\partial u_j} = -net_j .
\]
The discrete activation function is
\[
u_j^{(g+1)} = f\bigl(net_j^{(g)}\bigr) =
\begin{cases}
u_j^{(g)} + \delta_j, & net_j^{(g)} \ge 0 \\
u_j^{(g)} - \delta_j, & \text{otherwise}
\end{cases} \tag{31}
\]
With update (31) the change in energy is never positive, which ensures that Layer2 moves towards the optimum as it evolves.

4.5. Convergence of FBACN
4.5.1. Proof of the convergence of FBACN
An important property of a recurrent network is its stability. Before establishing the stability of FBACN, we begin with a few definitions concerning metric spaces and a theorem due to Steck, followed by a general convergence theorem.
Definition 1: A metric space consists of a set X and a function $d: X \times X \rightarrow R$ satisfying:
1) $d(x, y) \ge 0, \ \forall x, y \in X$
2) $d(x, y) = 0 \Leftrightarrow x = y$
3) $d(x, y) = d(y, x), \ \forall x, y \in X$
4) $d(x, z) \le d(x, y) + d(y, z), \ \forall x, y, z \in X$
Definition 2: Let X be a metric space with distance d and let $\Phi: X \rightarrow X$. A point x is called a fixed point of $\Phi$ if $\Phi(x) = x$.
Definition 3: The mapping $\Phi$ is a contraction if there exists $c^*$ with $0 < c^* < 1$ such that
\[
d\bigl(\Phi(x), \Phi(y)\bigr) \le c^* \cdot d(x, y), \quad \forall x, y \in X \tag{32}
\]
If $\Phi$ satisfies (32), it has at most one fixed point. Indeed, suppose x and y are two fixed points. Then by (32) we have $d(x, y) \le c^* \cdot d(x, y)$, so d(x, y) = 0 and hence x = y.
This can be stated in general through the following theorem.
Contraction mapping theorem: A contraction mapping on a complete metric space has exactly one fixed point.
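A quick numerical illustration of the contraction mapping theorem: iterating a contraction from any starting value approaches the same fixed point. The map $\Phi(x) = 0.5x + 1$ on the real line (contraction constant 0.5, fixed point 2) and the helper name `fixed_point` are chosen here purely as an example.

```python
def fixed_point(phi, x0, tol=1e-12, max_iters=1000):
    """Iterate x <- phi(x) until successive values differ by at most tol."""
    x = x0
    for _ in range(max_iters):
        nxt = phi(x)
        if abs(nxt - x) <= tol:
            return nxt
        x = nxt
    return x


def phi(x):
    # |phi(x) - phi(y)| = 0.5 * |x - y|, so phi is a contraction with c* = 0.5
    return 0.5 * x + 1.0


from_below = fixed_point(phi, -100.0)
from_above = fixed_point(phi, 50.0)
```

Both runs end near x = 2, the unique solution of $\Phi(x) = x$, regardless of the initial value.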
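Theorem 1: Consider a fully connected recurrent artificial neural network of s neurons with activation dynamics
\[
net_j^{(g+1)} = \sum_{i=1}^{s} \omega_{ji}\,f\bigl(net_i^{(g)}\bigr) + i_j \tag{33}
\]
where f is a bounded, continuous, real-valued function with a bounded derivative. If
\[
\bigl| f'_{\max}\,\omega_{ji} \bigr| < c^{**} < \frac{1}{s}, \quad i, j = 1, \ldots, s \tag{34}
\]
then the network converges to a unique fixed point for any initial value of the network.
Lemma: For every function f satisfying the hypotheses above and for any $x, y \in R$,
\[
\bigl| f(x) - f(y) \bigr| \le f'_{\max}\,\lvert x - y \rvert \tag{35}
\]
Based on the definitions, the theorem, and the lemma above, a general convergence theorem for the multi-synapse neural network is stated as follows.
Theorem 2 (for the multi-synapse neural network): Consider a multi-synapse neural network of s neurons with two sets of weights $\omega_{ji}$ and $z_{ji}$ and the activation dynamics
\[
net_j^{(g+1)} = \sum_{i=1}^{s} \Bigl[ \omega_{ji} \bigl( f(net_i^{(g)}) \bigr)^{m-1} + z_{ji}\,f\bigl(net_i^{(g)}\bigr) \Bigr] + i_j \tag{36}
\]
where f is a bounded, continuous, real-valued function. If f satisfies
\[
\bigl| (f^{m-1})'_{\max}\,\omega_{ji} \bigr| + \bigl| f'_{\max}\,z_{ji} \bigr| < c^{**} < \frac{1}{s}, \quad i, j = 1, 2, \ldots, s \tag{37}
\]
then the network converges to a unique fixed point for every initial value of the network.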
Proof: By the contraction mapping theorem, to prove that FBACN converges to a unique point we must show that there exists a constant $c^* \in (0, 1)$ such that for all $netx = (netx_1, netx_2, \ldots, netx_s) \in \Re^s$ and $nety = (nety_1, nety_2, \ldots, nety_s) \in \Re^s$,
\[
d\bigl(\Phi(netx), \Phi(nety)\bigr) \le c^* \cdot d(netx, nety) \tag{38}
\]
Define the function
\[
\Phi(net) = \Phi(net_1, net_2, \ldots, net_s) = \sum_{j=1}^{s} \Bigl[ \sum_{i=1}^{s} \bigl( \omega_{ji} (f(net_i))^{m-1} + z_{ji}\,f(net_i) \bigr) + i_j \Bigr] \tag{39}
\]
The space $\Re^s$ is complete under the chosen metric
\[
d(netx, nety) = \lVert netx - nety \rVert = \sum_{i=1}^{s} \lvert netx_i - nety_i \rvert, \quad netx, nety \in \Re^s \tag{40}
\]
We have
\[
d\bigl(\Phi(netx), \Phi(nety)\bigr) = \lVert \Phi(netx) - \Phi(nety) \rVert
= \Bigl| \sum_{j=1}^{s} \sum_{i=1}^{s} \Bigl[ \bigl( (f(netx_i))^{m-1} - (f(nety_i))^{m-1} \bigr) \omega_{ji} + \bigl( f(netx_i) - f(nety_i) \bigr) z_{ji} \Bigr] \Bigr|
\]
\[
\le \sum_{j=1}^{s} \sum_{i=1}^{s} \Bigl[ (f^{m-1})'_{\max}\,\lvert netx_i - nety_i \rvert\,\lvert \omega_{ji} \rvert + f'_{\max}\,\lvert netx_i - nety_i \rvert\,\lvert z_{ji} \rvert \Bigr] \tag{41}
\]
\[
\le \sum_{i=1}^{s} \sum_{j=1}^{s} \lvert netx_i - nety_i \rvert\,c^{**} = c^* \cdot d(netx, nety) \tag{42}
\]
with
\[
c^* = \sum_{j=1}^{s} c^{**} = s \cdot c^{**}, \quad \text{and by (37)} \quad c^* \in (0, 1) \tag{43}
\]
By (42) and (43), $\Phi$ is a contraction. Hence, under condition (37), the network under consideration converges to a unique point.

4.5.2. Convergence of the continuous FBACN of Layer1
The activation function of Layer1 is constructed as
\[
f(net_j) = v_j + \frac{2\delta_j}{1 + \exp(-r_j \cdot net_j)} - \delta_j \tag{44}
\]
where $\delta_j$ and $r_j$ are positive real numbers. We need the maximum of the first derivative of (44). The first derivative of f attains an extremum at x where the second derivative of f vanishes. Setting $f''(net_j) = 0$ gives $net_j = 0$. The maximum of $f'$ is therefore attained at $net_j = 0$, with value
\[
f'_{\max} = f'(net_j)\big|_{net_j = 0} = \frac{\delta_j\,r_j}{2} \tag{45}
\]
On the other hand, $-2n \le \omega_{ji} \le 0$, so
\[
0 \le \bigl| f'_{\max}\,\omega_{ji} \bigr| \le 2n \cdot f'_{\max} = n\,\delta_j\,r_j \tag{46}
\]
This bound equals 1/s when
\[
n\,\delta_j\,r_j = \frac{1}{s}, \quad \text{i.e.} \quad r_j = \frac{1}{s\,n\,\delta_j} \tag{47}
\]
where s is the number of neurons in the network, with $s = c \times p$ in Layer1. We therefore choose $r_j < \dfrac{1}{n\,s\,\delta_j}$, and can then find a constant $c^{**} \in (0, 1)$ such that
\[
\bigl| f'_{\max}\,\omega_{ji} \bigr| < c^{**} < \frac{1}{s} \tag{48}
\]
This satisfies the condition of Theorem 1, so the network converges to a fixed point.

Conclusion: The calculations and proofs above give the following results:
For Layer1, the network satisfies the hypotheses of Theorem 1, so it converges.
For Layer2, the network satisfies the hypotheses of Theorem 2, so it converges.

4.6. The FBACN algorithm and FBACN with learning
The FBACN algorithm is carried out through the following steps:

FBACN ALGORITHM
1) Set the values c, m, $\lambda$, $\varepsilon$ and the initial step sizes $\iota_v$, $\iota_u$ for Layer1 and Layer2 respectively.
2) Set the stability thresholds $\Delta_v$ and $\Delta_u$ for Layer1 and Layer2 respectively.
3) Randomly initialize the cluster centers $v_{(i-1)p+l}$, i = 1, 2, ..., c and l = 1, 2, ..., p, in Layer1, and the membership matrix $[u_{i,k}] \in M_{fc}$, k = 1, 2, ..., n and i = 1, 2, ..., c, in Layer2.
4) Set the step sizes $\delta_{v_j} = \iota_v$ and $\delta_{u_j} = \iota_u$ and the initial net value $net_j^{(0)} = 0$ for j = 1, 2, ..., s
   (s = c.p in Layer1 and s = n.c in Layer2).
5) Set the recursion index g = 1 for Layer1.
6) In Layer1, compute the weight matrix W by formula (12), the external input matrix I by formula (11), and the net value NET by formula (4).
7) For j = 1 to s do
      if $net_j^{(g)} \cdot net_j^{(g-1)} < 0$ then $\delta_{v_j} := \delta_{v_j} / 2$;
8) For j = 1 to s do
      if $net_j^{(g)} \ge 0$ then $v_j := v_j + \delta_{v_j}$
      else $v_j := v_j - \delta_{v_j}$
9) if $(\delta_{v_1} \le \Delta_v)$ & $(\delta_{v_2} \le \Delta_v)$ & ... & $(\delta_{v_s} \le \Delta_v)$ then goto 10)
      else {g := g+1; goto 6)}.
10) Set the recursion index g = 1 for Layer2.
11) In Layer2, compute the weight matrix W by formula (23), the weight matrix Z by formula (24), $I = [2\lambda, 2\lambda, \ldots, 2\lambda]^T$, and the matrix $NET = W\,U^{\langle m-1 \rangle} + Z\,U + I$.
12) For j = 1 to s do
      if $net_j^{(g)} \cdot net_j^{(g-1)} < 0$ then $\delta_{u_j} := \delta_{u_j} / 2$;
13) For j = 1 to s do
      if $net_j^{(g)} \ge 0$ then $u_j := u_j + \delta_{u_j}$
      else $u_j := u_j - \delta_{u_j}$
14) if $(\delta_{u_1} \le \Delta_u)$ & $(\delta_{u_2} \le \Delta_u)$ & ... & $(\delta_{u_s} \le \Delta_u)$ then goto 15)
      else {g := g+1; goto 11)}.
15) if $\lVert U^{(g)} - U^{(g-1)} \rVert \le \varepsilon$ then Stop else goto 4)



FBACN with learning follows the same algorithm as FBACN, except that steps 10 to 14 are replaced by steps 10 to 17. We first define some parameters: $p_0$ is a probability scaling constant in [0, 1], used to compute probabilities. EquiCycle is the equilibrium cycle (the number of iterations during which equilibrium is maintained at temperature T). $T_{start}$ is the starting (ambient) temperature, $T_{stop}$ is the stopping temperature (at which learning stops), and $T_{step}$ is the amount by which the temperature is lowered in each loop.

The FBACN algorithm with learning is carried out as follows:

FBACN ALGORITHM WITH LEARNING
Steps 1 to 9: the same as steps 1 to 9 of FBACN.
10) Set T = $T_{start}$.
11) Set the recursion index g = 1 for Layer2.
12) In Layer2, compute the weight matrix W by formula (23), the weight matrix Z by formula (24), the external input matrix I, and the net value matrix NET.
13) if
g EquiCycle s
then
{ For j =1 to s do
{ Tnh xc xut
( )
0
/ 1 .
g
j
j
j
T
p
p
e net
u
=
+ A
;

| |
( )
( 0,1 )
g
j
a Rand =
}}
esle {
For j =1 to s do {
( ) ( )
0; 1
g g
p a
j j
= =
}};
14) for j = 1 to s do
if
( ) ( 1) ( 1) ( 1)
(( . ) 0) &( . 0))
g g g g
net net a p
j j j j

< <

then
: / 2
u u
j j
o o =

15) for j =1 to s do {
if
( ) ( ) g g
a p
j j
s

then { if Rand([0,1])
0.5 s
then
: u u
u
j j
j
o = +

else
: u u
u
j j
j
o =
}
else { if
( )
0
g
net
j
>
then
: u u
u
j j
j
o = +

else
: u u
u
j j
j
o =
}}
16) if $(\delta_{u_1} \le \Delta_u)$ & $(\delta_{u_2} \le \Delta_u)$ & ... & $(\delta_{u_s} \le \Delta_u)$
      then goto 17)
      else {g := g+1; goto 12)}.
17) if T > Tstop then {T := T - Tstep; goto 11)} else goto 18).
18) if $\lVert U^{(g)} - U^{(g-1)} \rVert \le \varepsilon$
      then Stop else goto 4)

CHAPTER 5
EXPERIMENTAL IMPLEMENTATION AND APPLICATION
5.1. Experimental implementation of the FCM algorithm
5.2. Applying the improved FCM algorithm to image recognition

This chapter presents the experimental program built for the FCM algorithm and the application of the improved FCM algorithm (FCM-Cải tiến) to image recognition.

5.1. Experimental implementation of the FCM algorithm
FCM is widely used in data clustering because its performance and practicality are quite good. The algorithm starts by choosing the number of clusters C and either picking C random points as cluster centers or choosing a random partition into C clusters and computing the centroid of each. If the number of data points is smaller than the number of clusters, each data point is assigned as the centroid of a cluster, and each centroid receives a cluster number. If the number of data points is larger than the number of clusters, then for each data point we compute the weighted similarity between the point and each cluster centroid and take the minimum distance; the point belongs to the cluster whose centroid is nearest to it. Since the positions of the centroids are not yet certain, they are adjusted based on the currently updated data, after which all data points are reassigned to the new centroids. This process is repeated until no data point moves to another cluster. Mathematically, this loop can be shown to converge to a local minimum.
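The loop described above can be sketched with the standard FCM updates (memberships from distances, centers as membership-weighted means). This is a minimal sketch under stated assumptions, not the Visual C++ program of this chapter: the function name `fcm`, the explicit initial centers, the small floor on squared distances, and the stopping threshold are all illustrative choices.

```python
def fcm(points, centers, m=2.0, eps=1e-6, max_iters=100):
    """Alternate membership and center updates until the centers stop moving.
    Returns (centers, u) where u[k][j] is the membership of point k in cluster j."""
    centers = [list(c) for c in centers]
    exponent = 1.0 / (m - 1.0)
    dim = len(points[0])
    u = []
    for _ in range(max_iters):
        # membership update; squared distances are floored to avoid division by zero
        u = []
        for x in points:
            d2 = [max(sum((a - b) ** 2 for a, b in zip(x, c)), 1e-12) for c in centers]
            u.append([1.0 / sum((dj / dk) ** exponent for dk in d2) for dj in d2])
        # center update: mean of the points weighted by u^m
        new_centers = []
        for j in range(len(centers)):
            wts = [u[k][j] ** m for k in range(len(points))]
            total = sum(wts)
            new_centers.append(
                [sum(w * x[d] for w, x in zip(wts, points)) / total for d in range(dim)]
            )
        shift = max(abs(a - b) for old, new in zip(centers, new_centers)
                    for a, b in zip(old, new))
        centers = new_centers
        if shift < eps:
            break
    return centers, u


pts = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
final_centers, final_u = fcm(pts, [(0, 0), (10, 10)])
```

On these two well-separated groups the centers settle near (0.5, 0.5) and (10.5, 10.5), and each point has a membership close to 1 in its own group.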
The implementation of the algorithm is illustrated through the program interface shown in Figure 5.1 and Figure 5.2 below:
Programming language: Visual C++ 6.0
Initial parameters: number of clusters = 3, fuzzifier exponent m = 2
Input data: points of different colors

Figure 5.1: Program interface at start-up

The user enters the number of clusters in the "Enter number of clusters" box and clicks inside the program frame to create the points; the position of each point is shown in the "xy coordinates" box. Each time a point is created by a click, the program automatically forms the clusters by minimizing the sum of squared distances between the data and the corresponding cluster centroids. Each point and its coordinates represent an object described by two attributes: the color of the point and a label number indicating its cluster.
Below is the result obtained when running the program with 8 clusters.

Figure 5.2: Program interface while working

The program automatically partitions the points into 8 clusters, shown by the color of the points in each cluster and by the center of each cluster.

5.2. Applying the improved FCM algorithm to image recognition
Recognition is the process of classifying objects represented according to some model and assigning them to a class based on rules and reference patterns. Recognition has many applications in many fields, such as fingerprint recognition, handwriting recognition, and image recognition. Color clustering is a very important step in image recognition.
The number of pixels is very large, usually more than 80,000, and the number of colors in the image data depends on the sharpness of the image. The better the image quality, the larger the number of colors, but whatever the quality, the number of colors remains large. On the other hand, in image recognition we are interested only in certain features, such as the eyes, eyebrows, mouth, and skin, so the number of colors of interest is not very large. Applying the improved FCM algorithm to color clustering is therefore a very useful step in this problem.
The application of the improved FCM algorithm is illustrated through the program interface in Figure 5.3, Figure 5.4, and Figure 5.5 below:
Programming language: Visual C++ 6.0
Initial parameters: an array storing the colors of the image, an array storing the cluster centers, the number of clusters, and the fuzzifier exponent m.
Input data: a color image file (Bitmap)
Output data: a color image recognized with the color clusters produced by the improved FCM algorithm.


Figure 5.3: Program interface at start-up

Once the program has started, we choose a source image by pressing the "Open image file" button and selecting the image to process, as in Figure 5.4 below:

Figure 5.4: Program interface when choosing an image to cluster
After the image is chosen, we press the "Run clustering" button. The program recognizes and clusters the colors according to the improved FCM algorithm and displays the result in the "target image" frame, as in Figure 5.5 below.

Figure 5.5: Program interface when performing clustering

KT LUN

Trong qu trnh tm hiu v hon thnh lun vn tt nghip vi ti
Nghin cu mt s phng php phn cm m v ng dng, d t
c nhng kin thc nht nh, nhng em nhn thy phn cm d liu trong
KPDL ni chung v phn cm d liu m ni ring l mt lnh vc nghin
cu rng ln, nhiu trin vng. ti c gng tp trung tm hiu, nghin
cu v trnh by c mt s k thut v thut ton phn cm d liu ph
bin, mt s k thut phn cm m v m hnh mng nron a khp dng cho
phn cm m trong KPDL hin nay, trnh by mt s ci tin ca thut ton
phn cm m(FCM-Ci tin) da trn cc phng php c, ci t th
nghim thut ton phn cm m(FCM) vi ng dng phn cm cc im mu
v thc hin ci t ng dng ca thut ton FCM-Ci tin i vi vic phn
cm mu trong bi ton nhn dng nh mu.
However, because of limited literature and time, I have so far studied only
some typical clustering techniques, fuzzy clustering in particular, and
implemented and tested some applied algorithms. Several other techniques
remain unexplored, and the experimental implementation cannot yet be applied
to the general clustering problem.
In the coming period I will continue to study further clustering techniques,
especially fuzzy clustering algorithms combined with parallel processing,
and apply them to practical problems in Vietnam today. I hope that the
knowledge gained from this thesis will gradually be put into practice in
service of people's lives.

Student author

An Hng Sn

