Professional Documents
Culture Documents
Doc
Doc
=
e
=
k
i
C x
i
i
m x D E
1
2
) ( t gi tr ti thiu,
trong : m
i
l trng tm ca cm C
i
, D l khong cch gia hai i tng.
Hnh 2.6: Tnh ton trng tm ca cc cm mi
Thut ton k-means bao gm cc bc c bn sau :
Input: S cm k v cc trng tm cm{ }
k
j
j
m
1 =
.
Output: Cc cm C[i] (1 s i s k) v hm tiu chun E t gi tr ti thiu.
Begin
(a) Bc 1 : Khi to
Chn k trng tm { }
k
j
j
m
1 =
ban u trong khng gian Rd (d l s chiu ca
d liu). Vic la chn ny c th l ngu nhin hoc theo kinh nghim.
(b) Bc 2: Tnh ton khong cch
i vi mi im X
i
(1 s i s n), tnh ton khong cch ca n ti mi
trng tm m
j
(1 s j s k). Sau tm trng tm gn nht i vi mi im.
(c) Bc 3: Cp nht li trng tm
i vi mi 1 s j s k, cp nht trng tm cm m
j
bng cch xc nh
trung bnh cng cc vect i tng d liu.
(d) iu kin dng:
26
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Lp li cc bc 2 v 3 cho n khi cc trng tm ca cm khng thay
i.
End.
Thut ton k-means trn c chng minh l hi t v c phc tp
tnh ton l
) ) 3 ((
flop
T nkd O t
. Trong , n l s i tng d liu, k l s cm d
liu, d l s chiu,
t
l s vng lp,
flop
T
l thi gian thc hin mt php
tnh c s nh php tnh nhn, chia,... Nh vy, do k-means phn tch phn
cm n gin nn c th p dng i vi tp d liu ln.Tuy nhin, nhc
im ca k-means l ch p dng vi d liu c thuc tnh s v khm ph ra
cc cm c dng hnh cu, k-means cn rt nhy cm vi nhiu v cc phn
t ngoi lai trong d liu. Hn na, cht lng PCDL ca thut ton k-means
ph thuc nhiu vo cc tham s u vo nh: s cm k v k trng tm khi
to ban u. Trong trng hp cc trng tm khi to ban u m qu lch so
vi cc trng tm cm t nhin th kt qu phn cm ca k-means l rt thp,
ngha l cc cm d liu c khm ph rt lch so vi cc cm trong thc t.
Trn thc t cha c mt gii php ti u no chn cc tham s u vo,
gii php thng c s dng nht l th nghim vi cc gi tr u vo k
khc nhau ri sau chn gii php tt nht.
Ngoi ra thut ton K-means ra, phn cm phn hoch cn bao gm
mt s cc thut ton khac nh: Thut ton PAM; Thut ton CLARA; Thut
ton CLARANS.
2.5.2. Cc thut ton phn cm phn cp
Thut ton CURE
Trong khi hu ht cc thut ton thc hin phn cm vi cc cm hnh
cu v kch thc tng t, nh vy l khng hiu qu khi xut hin cc phn
t ngoi lai. Thut ton CURE khc phc c vn ny v tt hn vi cc
27
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
phn t ngoi lai. Thut ton ny nh ngha mt s c nh cc im i din
nm ri rc trong ton b khng gian d liu v c chn m t cc cm
c hnh thnh. Cc im ny c to ra nh la chn cc i tng nm
ri rc cho cm v sau co li hoc di chuyn chng v trung tm cm
bng nhn t co cm. Qu trnh ny c lp li v nh vy trong qu trnh
ny, c th o t l gia tng ca cm. Ti mi bc ca thut ton, hai cm c
cp cc im i din gn nhau (mi im trong cp thuc v mi cm khc
nhau) c ha nhp.
Nh vy, c nhiu hn mt im i din mi cm cho php CURE
khm ph c cc cm c hnh dng khng phi l hnh cu. Vic co li cc
cm c tc dng lm gim tc ng ca cc phn t ngoi lai. Nh vy, thut
ton ny c kh nng x l tt trong trng hp c cc phn t ngoi lai v
lm cho n hiu qu vi nhng hnh dng khng phi l hnh cu v kch
thc rng bin i. Hn na, n t l tt vi CSDL ln m khng lm
gim cht lng phn cm.
Hnh 2.8: Cc cm d liu c khm ph bi CURE
Hnh 2.7: Khi qut thut ton CURE
28
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
x l c cc CSDL 1n, CURE s dng mu ngu nhin v phn
hoch, mt mu l c xc nh ngu nhin trc khi c phn hoch, v
sau tin hnh phn cm trn mi phn hoch, nh vy mi phn hoch l
tng phn c phn cm, cc cm thu c li c phn cm ln th
hai thu c cc cm con mong mun, nhng mu ngu nhin khng nht
thit a ra mt m t tt cho ton b tp d liu.
Thut ton CURE c thc hin qua cc bc c bn sau:
Chn mt mu ngu nhin t tp d liu ban u.
Phn hoch mu ny thnh nhiu nhm d liu c kch thc bng
nhau: tng chnh y l phn hoch mu thnh p nhm d liu bng
nhau, kch thc ca mi phn hoch l n/p (n l kch thc ca mu).
Phn cm cc im ca mi nhm: Thc hin PCDL cho cc nhm
cho n khi mi nhm c phn thnh n/pq cm (vi q > 1).
Loi b cc phn t ngoi lai: Trc ht, khi cc cm c hnh
thnh cho n khi s cc cm gim xung mt phn so vi s cc cm
ban u. Sau , trong trng hp cc phn t ngoi lai c ly mu
cng vi qu trnh pha khi to mu d liu, thut ton s t ng loi b
cc nhm nh.
Phn cm cc cm khng gian: cc i tng i din cho cc cm
di chuyn v hng trung tm cm, ngha l chng c thay th bi cc
i tng gn trung tm hn.
nh du d liu vi cc nhn tng ng.
phc tp tnh ton ca thut ton CURE l O(n
2
1og(n)). CURE t
thut ton tin cy trong vic khm ph ra cc cm vi hnh th bt k v c
th p dng tt i vi d liu c phn t ngoi lai v trn cc tp d liu hai
chiu. Tuy nhin, n li rt nhy cm vi cc tham s nh s cc i tng
i din, t l co ca cc phn t i in.
29
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Ngoi thut ton CURE ra, phn cm phn cp cn bao gm mt s
thut ton khac nh: Thut ton BIRCH; Thut ton AGNES; Thut ton
DIANA; Thut ton ROCK; Thut ton CHANMELEON.
2.5.3. Cc thut ton phn cm da trn mt
Thut ton DBSCAN
Thut ton DBSCAN thch nghi vi mt dy phn cm v khm
ph ra cc cm c hnh dng bt k trong khng gian CSDL c nhiu.
Trn thc t DBSCAN tm kim cho cc cm bng cch kim tra cc
i tng m c s i tng lng ging nh hn mt ngng ti thiu, tc l
c ti thiu MinPts i tng v mi i tng trong cm tn ti mt i
tng khc trong cm ging nhau vi khong cch nh mt ngng Eps. Tm
tt c cc i tng m cc lng ging ca n thuc v lp cc i tng
xc nh trn, mt cm c xc nh bng mt tp tt c cc i tng lin
thng mt cc lng ging ca n. DBSCAN lp li tm kim ngay khi cc
i tng lin lc mt t cc i tng trung tm, n c th bao gm vic
kt hp mt s cm c mt lin lc. Qu trnh kt thc khi khng tm c
im mi no c th thm vo bt c cm no.
DBSCAN c th tm ra cc cm vi hnh th bt k, trong khi ti
cng mt thi im t b nh hng bi th t ca cc i tng d liu nhp
vo. Khi c mt i tng c chn vo ch tc ng n mt lng ging xc
nh. Mt khc, DBSCAN s dng tham s Eps v MinPts trong thut ton
kim sot mt ca cc cm. DBSCAN bt u vi mt im tu v xy
dng mt lng ging c th c i vi Eps v MinPts. V vy, DBSCAN
yu cu ngi dng xc nh bn knh Eps ca cc lng ging v s cc lng
ging ti thiu MinPts, cc tham s ny kh m xc nh c ti u, thng
thng n c xc nh bng php chn ngu nhin hoc theo kinh nghim.
30
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
phc tp ca DBSCAN l O(n
2
), nhng nu p dng ch s khng gian
gip xc nh cc lng ging ca mt i tng d liu th phc ca
DBSCAN c ci tin l O(nlogn). Thut ton DBSCAN c th p dng
cho cc tp d liu khng gian ln a chiu, khong cch Euclide c s
dng o s tng t gia cc i tng nhng khng hiu qu i vi d
liu a chiu.
Hnh 2.9: Hnh dng cc cm c khm ph bi thut ton DBSCAN
Thut ton: DBSCAN khi to im p ty v ly tt c cc im
lin lc mt t p ti Eps v MinPts. Nu p l im nhn th th tc trn to
ra mt cm theo Eps v MinPts, nu p l mt im bin, khng c im no
lin lc mt t p v DBSCAN s i thm im tip theo ca tp d liu.
Nu s dng gi tr ton cc Eps v Minpts, DBSCAN c th ho nhp
hai cm thnh mt cm nu mt ca hai cm gn bng nhau. Gi s
khong cch gia hai tp d liu S1 v S2 c nh ngha l:
dist(S1, S2) = min{dist(p, q) {p e S1 v q e S2}.
Thut ton DBSCAN c m t chi tit nh sau:
......... Modul chng trnh chnh ..........
DBSCAN(SetOfPoints, Eps, MinOts)
//SetOfPoints is UNCLASSIFIED
Clusterid:= NextId(NOISE);
FOR i FROM 1 TO SetOfPoints.size DO
31
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Point := SetOfPoints.get(i);
IF PointClId = UNCLASSIFIED THEN
IF ExpandCluster(SetOfPoints, Point, ClusterId, Eps, MinPts)
THEN
ClusterId.= nextId(ClusterId)
END IF
END IF
END FOR
END; //DBSCAN
--------Th tc ExpandCluster--------
ExpandClusster(SetOfPoints, Points, C1Id, Eps, MinPts): Boolean;
seeds:= SetOfPoints.regionQuery(Point, Eps)
IF seeds.size < MinPts THEN //no core point
SetOfPoints.changeclId(Point, NOISE),
RETURN False;
ELSE //all points in seeds are density-reachable from Point
SetOfPoints.changeClId(seeds, C1Id);
seeds.delete(Point);
WHILE seeds <> Empty DO
currentP:= seeds.first();
result:= SetOfPoints.regionQuery(CurrentP, Eps);
IF result.size >= MinPts THEN
FOR i FROM 1 to result.size DO
resultpP:= result.get(i);
IF resultp.C1Id IN {UNCLASSIFIED, NOISE} THEN
IF resultp.ClId = UNCLASSIFIED THEN
seeds.append(resultP);
END IF;
32
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
SetOfPoints.changeC1Id(resultP, C1Id),
END IF; //UNCLASSIFIED or NOISE
END FOR;
END IF; //result.size >= Minpts
seeds.delete(currentP);
END WHILE; //seeds <> Empty
RETURN True;
END IF;
END; //ExpandCluster
Trong SetOfPoints hoc l tp d liu ban u hoc l cm c
khm ph t bc trc, C1Id (ClusterId) l nhn nh du phn t d liu
nhiu c th thay i nu chng c th lin lc mt t mt im khc trong
CSDL, iu ny ch xy ra i vi cc im bin ca d liu. Hm
SetOfPoints.get(i) tr v phn t th i ca SetOfPoints. Th tc
SetOfPoints.regionQuery(Point, Eps) tr v mt danh sch cc im d liu
ln cn vi im Point trong ngng Eps t tp d liu SetOfPoints. Tr mt
s trng hp ngoi l, kt qu ca DBSCAN l c lp vi th t duyt cc
i tng d liu. Eps v MinPts l hai tham s ton cc c xc nh bng
th cng hoc theo kinh nghim. Tham s Eps c a vo l nh so vi
kch thc ca khng gian d liu, th phc tp tnh ton trung bnh ca
mi truy vn l O(logn).
Ngoi thut ton DBSCAN ra, phn cm da trn mt cn bao
gm 2 thut ton khc nh: Thut ton OPTICS; Thut ton DENCLUE.
2.5.4. Cc thut ton phn cm da trn li
Thut ton STING
STING l k thut phn cm a phn gii da trn li, trong vng
khng gian d liu c phn r thnh s hu hn cc ch nht, iu ny c
33
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
ngha l cc li c hnh thnh t cc li con thc hin phn cm.
C nhiu mc ca cc ch nht tng ng vi cc mc khc nhau ca phn
gii trong cu trc li, v cc ny hnh thnh cu trc phn cp: mi
mc cao c phn hoch thnh s cc nh mc thp hn tip theo trong
cu trc phn cp. Cc im d liu c np t CSDL, gi tr ca cc tham
s thng k cho cc thuc tnh ca i tng d liu trong mi li c
tnh ton t d liu v lu tr thng qua cc tham s thng k cc mc
thp hn. Cc gi tr ca cc tham s thng k gm: s trung bnh - mean, s
ti a - max, s ti thiu - min, s m - count, lch chun - s, ...
Cc i tng d liu ln lt c chn vo li v cc tham s thng
k trn c tnh trc tip thng qua cc i tng d liu ny. Cc truy
vn khng gian c thc hin bng cch xt cc thch hp ti mi mc ca
phn cp. Mt truy vn khng gian c xc nh nh l mt thng tin khi
phc li ca d liu khng gian v cc quan h ca chng.
STING c kh nng m rng cao, nhng do s dng phng php a
phn gii nn n ph thuc cht ch vo trng tm ca mc thp nht. a
phn gii l kh nng phn r tp d liu thnh cc mc chi tit khc nhau.
Khi ho nhp cc ca cu trc li hnh thnh cc cm, n khng xem
xt quan h khng gian gia cc nt ca mc con khng c ho nhp ph
hp (do chng ch tng ng vi cc cha ca n) v hnh dng ca cc cm
d liu khm ph c, tt c ranh gii ca cc cm c cc bin ngang v
dc, theo bin ca cc v khng c ng bin cho c pht hin ra.
Mt trong nhng hn ch trong khi s dng cch tip cn a phn gii
thc hin phn tch cm cht lng ca phn cm STING hon ton ph
thuc vo tnh cht hp mc thp ca cu trc li. Nu tnh cht hp l
mn, dn n chi ph thi gian x l tng, tnh ton tr nn phc tp v nu
34
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
mc di cng l qu th th n c th lm gim bt cht lng v chnh
xc ca phn tch cm.
Cu trc d liu li thun tin cho qu trnh x l song song v cp
nht lin tc, khi duyt ton b CSDL mt ln tnh ton cc i lng
thng k cho mi , nn n rt hiu qu v do phc tp thi gian to
cc cm xp x O(n), trong n l tng s cc i tng. Sau khi xy dng
cu trc phn cp, thi gian x l cho cc truy vn l O(g), trong g l tng
s li mc thp (g << n).
Thut ton STING gm cc bc sau:
Xc nh tng bt u:
Vi mi ci ca tng ny, tnh ton khong tin cy (hoc c lng
khong) ca xc sut m ny lin quan ti truy vn.
T khong tin cy ca tnh ton trn, gn nhn cho l c lin quan
hoc khng lin quan.
Nu lp ny l lp di cng, chuyn sang Bc 6; nu khc th
chuyn sang Bc 5.
Duyt xung di ca cu trc cy phn cp mt mc. Chuyn sang
Bc 2 cho cc m hnh thnh cc ln quan ca lp c mc cao hn.
Nu c t c cu truy vn, chuyn sang Bc 8; nu khng th
chuyn sang Bc 7.
Truy lc d liu vo trong cc lin quan v thc hin x l. Tr li
kt qu ph hp yu cu ca truy vn. Chuyn sang Bc 9.
Tm thy cc min c cc lin quan. Tr li min m ph hp vi
yu cu ca truy vn . Chuyn sang Bc 9.
9. Dng.
Ngoi thut ton STING ra, phn cm da trn li cn c thm mt
thut ton khc l: Thut ton CLIQUE.
35
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
2.5.5. Cc thut ton phn cm da trn m hnh
Thut ton EM
Thut ton EM c xem nh l thut ton da trn m hnh hoc l
m rng ca thut ton k-means. Tht vy, EM gn cc i tng cho cc
cm cho theo xc sut phn phi thnh phn ca i tng . Phn phi
xc sut thng c s dng l phn phi xc sut Gaussian vi mc ch l
khm ph lp cc gi tr tt cho cc tham s ca n bng hm tiu chun l
hm logarit kh nng ca i tng d liu, y l hm tt m hnh xc
sut cho cc i tng d liu. EM c th khm ph ra nhiu hnh dng cm
khc nhau, tuy nhin do thi gian lp ca thut ton kh nhiu nhm xc nh
cc tham s tt nn chi ph tnh ton ca thut ton kh cao. c mt s ci
tin c xut cho EM da trn cc tnh cht ca d liu: c th nn, c
th sao lu trong b nh v c th hy b. Trong cc ci tin ny, cc i
tng b hy b khi bit chc chn c nhn phn cm ca n, chng c
nn khi khng b loi b v thuc v mt cm qu ln so vi b nh v chng
s c lu li trong cc trng hp cn li.
Thut ton c chia thnh hai bc v qu trnh c lp li cho
n khi vn c gii quyt:
E: h b h a
+
=
+
=
2
1
,
2
1
2
1
M:
) ( 6
,
d c b
b a
b a
+ +
+
=
Cc bc thc hin ca thhut ton EM
Khi to tham s:
} ,..., , , ,..., , {
) 0 ( ) 0 (
2
) 0 (
1
) 0 ( ) 0 (
2
) 0 (
1 K K o
p p p =
Bc E:
36
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
= =
k
t
j
t
j j k
t
i
t
i i k
t k
t j t j k
t k j
P x P
P x P
x P
P x P
x P
) ( 2 ) (
) ( 2 ) (
) , , | (
) , , | (
) , (
) , ( ) , | (
) , | (
o e
o e
e e
e
Bc M:
=
+
k
t k i
k
k
t k i
t
i
x P
x x P
) , | (
) , | (
) 1 (
e
e
R
x P
p
k
t k i
t
i
=
+
) , | (
) 1 (
e
Lp li bc 2 v 3 cho n khi t c kt qu
Ngoi thut ton EM ra, phn cm da trn m hnh cn c thm mt
thut ton khc l: Thut ton COBWEB.
2.5.6. Cc thut ton phn cm c d liu rng buc
Thut ton Phn cm m: FCM, cFCM v FCM-Ci tin (Cc thut
ton ny s c cp chi tit chng k tip).
Tm li, cc k thut PCDL trnh by trn c s dng rng ri
trong thc t, th nhng hu ht chng ch nhm p dng cho tp d liu vi
cng mt kiu thuc tnh. V vy, vic PCDL trn tp d liu c kiu hn hp
l mt vn t ra trong KPDL giai on hin nay.
37
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
CHNG 3
K THUT PHN CM D LIU M
3.1. Tng quan v phn cm m ...................................................................................
3.2. Cc thut ton trong phn cm m ........................................................................
3.2.1. Thut ton FCM(Fuzzy C-means) .....................................................................
3.2.1.1. Hm mc tiu ..............................................................................................
3.2.1.2. Thut ton FCM ..........................................................................................
3.2.2. Thut ton cFCM(- Insensitive Fuzzy C-means) .............................................
3.2.2.1. Hm mc tiu ..............................................................................................
3.2.2.2. Thut ton cFCM ........................................................................................
3.2.3. Thut ton FCM-Ci tin ...................................................................................
3.2.3.1. Thut ton 1: Thut ton la chn cc im d liu lm ng vin cho
vic chn cc trung tm ca cc cm .........................................................
3.2.3.2. Thut ton 2: Thut ton lc bt cc ng vin ........................................
3.2.3.3. Thut ton 3: Thut ton chn cc ng vin lm cc tiu hm mc tiu ..
3.2.3.4. Thut ton 4: Gn cc trung tm c lin kt gn gi vo mt cm .......
3.2.3.5. Tng kt thut ton FCM-Ci tin ..............................................................
37
38
39
39
42
46
46
48
49
49
51
51
52
56
3.1. Tng quan v phn cm m
Trong cuc sng, chng ta gp rt nhiu ng dng ca bi ton phn
cm. Chng hn nh trong ngnh bu in, hng ngy bu in phi phn
loi th theo m nc, trong m nc li phn loi theo m tnh/thnh ph,
sau khi th v n bu in tnh th bu in tnh li phi phn loi th
theo qun/huyn gi i, n bu in qun/huyn li phn loi th theo
x/phng gi th. chnh l mt ng dng ca bi ton phn cm r.
Vy bi ton phn cm r l g?
Ta c th nh ngha bi ton phn cm r nh sau: Cho tp d liu
mu X, ta kim tra cc im d liu xem n ging vi c im ca nhm
no nht th ta gn im d liu vo trong nhm . Nhng trong thc t
khng phi lc no bi ton phn cm r cng p dng c. Chng hn, ta
c php phn loi sau: Nhng ngi i xe my xn th thuc nhm ngi
giu, nhng ngi i xe my thng thuc nhm ngi bnh dn. Vy ngi
38
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
ngho m i xe my xn th chng ta xp ngi vo nhm no? V vy,
chng ta cn a vo khi nim bi ton phn cm m.
Trong cc phng php phn cm gii thiu trong chng trc,
mi phng php phn cm phn hoch mt tp d liu ban u thnh cc
cm d liu c tnh t nhin v mi i tng d liu ch thuc v mt cm
d liu, phng php ny ch ph hp vi vic khm ph ra cc cm c mt
cao v ri nhau, vi ng bin gia cc cm c xc nh tt. Tuy
nhin, trong thc t, ng bin gia cc cm c th m, cc cm c th
chng ln nhau, ngha l mt s cc i tng d liu thuc v nhiu cc cm
khc nhau, do m hnh ny khng m t c d liu thc. V vy ngi
ta p dng l thuyt v tp m trong PCDL gii quyt cho trng hp
ny. Cch thc kt hp ny c gi l Phn cm m.
Phn cm m l phng php phn cm d liu m cho php mi im
d liu thuc v hai hoc nhiu cm thng qua bc thnh vin. Ruspini (1969)
gii thiu khi nim phn hoch m m t cu trc cm ca tp d liu v
xut mt thut ton tnh ton ti u phn hoch m. Dunn (1973) m
rng phng php phn cm v pht trin thut ton phn cm m.
tng ca thut ton l xy ng mt phng php phn cm m da trn ti
thiu ha hm mc tiu. Bezdek (1981) ci tin v tng qut ha hm mc
tiu m bng cch a ra trng s m xy dng thut ton phn cm m
v c chng minh hi t ca cc thut ton l cc tiu cc b.
3.2. Cc thut ton trong phn cm m
K-means l thut ton PCDL r v C-means l thut ton phn cm
m tng ng, hai thut ton ny cng s dng chung mt chin lc phn
cm d liu. Thut ton C-means m hay cn gi tt l thut ton FCM
(Fuzzy C-means) c p dng thnh cng trong gii quyt mt s ln
39
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
cc bi ton PCDL nh trong nhn dng mu(nhn dng vn tay, nh), x
l nh(phn tch cc cm nh mu, cm mu), y hc(phn loi bnh, phn
loi triu chng), Tuy nhin, nhc im ln nht ca thut ton FCM l
tp d liu ln, tp d liu nhiu chiu, nhy cm vi cc nhiu v phn t
ngoi lai trong d liu, ngha l cc trung tm cm c th nm xa so vi
trung tm thc ca cm. c nhiu cc phng php xut ci tin
cho nhc im trn ca thut ton FCM bao gm: Phn cm da trn xc
sut (keller, 1993), phn cm nhiu m (Dave, 1991), phn cm da trn
ton t LP Norm (Kerten, 1999) v thut ton - Insensitive Fuzzy C-means
(FCM) v thut ton FCM ci tin.
3.2.1. Thut ton FCM(Fuzzy C-means)
3.2.1.1. Hm mc tiu
K thut ny phn hoch mt tp n vect i tng d liu X = {x
1
,,
x
n
} c R
s
thnh c cc nhm m da trn tnh ton ti thiu ha hm mc tiu
o cht lng ca phn hoch v tm trung tm cm trong mi nhm, sao
cho chi ph hm o phi tng t l nh nht. Mt phn hoch m vect
im d liu X = {x
1
,, x
n
} c R
s
l c trng u vo c biu din bi ma
trn U = [u
ik
] sao cho im d liu cho ch c th thuc v mt s nhm
vi bc c xc nh bi mc thuc gia [0, 1]. Nh vy, ma trn U c
s dng m t cu trc cm ca X bng cch gii thch u
ik
nh bc thnh
vin x
k
vi cm i.
Cho u = (u
1
, u
2
, .., u
c1
) l phn hoch m C
(
(
(
(
=
cn c
n
n
cxn
u u
u u
u u u
U
1
2 21
1 12 11
40
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Dunn nh ngha hm mc tiu m nh sau:
= =
=
n
k
c
i
i k ik m
v x d u V U J
1 1
2
) , ( ) , (
Bezdek khi qut ha hm mc tiu m bng cch a ra trng s m
m > 1 l bt k s thc no nh sau:
= =
s s =
n
k
c
i
i k
m
ik m
m v x d u V U J
1 1
2
1 ), , ( ) ( ) , ( (1)
trong :
X = [x
1
,, x
n
] c R
s
l n vect mu d liu tp con thc s chiu trong
khng gian vect R
s
gm c n quan st,
m e [1, +] l trng s m c gi l tham s m,
v
i
e R
s
l trung tm cm th i,
d(x
k
, v
i
) = d
ik
l khun mu bt k o khong cch gia d liu x
k
vi trung tm cm th i, => d
2
(x
k
, v
i
) l khong cch Euclidean,
u
ik
e [0, 1] l bc ca phn t d liu x
k
thuc v cm th i,
V = [v
ji
] = [v1,,v
c
] e R
sxc
l ma trn biu din cc gi tr i tng
tm ca cm,
U = [u
ik
] l ma trn phn hoch m ngu nhin ca X trong C phn.
Mt trong cc nhn t chnh nh hng ti quyt nh phn cm hp l
cc im l vn chn php o phi tng t. Thc vy, tnh ton bc
thnh vin u
ik
ph thuc vo nh ngha ca php o khong cch d
ik
m l
tch v hng trn R
s
. Bnh phng khong cch gia vect mu x
k
v trung
tm v tr ca cm th i c nh ngha nh sau:
) ( ) ( ) , (
i k
T
i k i k i k
v x A v x v x v x d = =
) ( ) ( ) , (
2
2
i k
T
i k i k i k
v x A v x v x v x d = =
trong :
A l ma trn hu hn dng i xng (p p) bt k,
41
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
2
i k
v x biu din lch ca d liu x
k
vi v
i
, d(x
k
,v
i
) l tch v
hng trn R
s
.
Bc ca thnh vin tha mn rng buc sau:
s s =
s s < <
s s s s s s
=
n k u
c i n u
n k c i u
c
i
ik
n
k
ik
ik
1 1
1 0
1 , 1 , 1 0
1
1
(2)
thun tin, coi mng i tng d liu {x
1
,....,x
n
} l cc ct trong
ma trn i tng d liu X = [x
jk
] = [x
1
, , x
n
] e R
sxc
. Ma trn phn hoch
U l mt cng c tin li m t cu trc cm trong d liu { x
1
, , x
n
};
nh ngha tp tt c cc ma trn thc phn hoch m khng suy thoi cp cn
cho phn hoch n i tng thnh c cm d liu trong khng gian R
cxn
c
vit gn nh sau:
| |
)
`
< = s = e e =
= =
n u u u k i R U M
c
i
ik
c
i
ik ik
cxn
fcn
1 0 ; 1 ; 1 , 0 : , |
1 1
(3)
R
cxn
l khng gian ca tt c cc ma trn thc cp cn.
Thng thng ngi ta gi bi ton phn cm m l bi ton tm cc
thuc u
ij
nhm ti thiu hm mc tiu trn J
m
(U,V) vi cc iu kin sau:
nh l 1: Nu m v c l cc tham s c nh, v i
k
l mt tp c
nh ngha nh sau:
{ } 0 , 1 |
1
= s s =
s s
ik k
n k
d c i i I
(4)
th hm mc tiu J
m
(U,V) t gi tr ti thiu:
( ) ( ) ( )
)
`
= =
n
k
c
i
i k
m
ik m
v x d u V U J
1 1
2
, , min
khi v ch khi:
42
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
= e =
e
=
|
|
.
|
\
|
=
e
=
k
I i
k k ik
k
k
c
j
m
jk
ik
ik
I I i u
I i
I
d
d
u
|
|
, , 1
, 0
,
1
1
1
2
; n k c i s s s s 1 , 1 (5)
c i
u
x u
v
n
k
m
ik
n
k
k
m
ik
i
s s =
=
=
1 ,
) (
) (
1
1
(6)
nh l c Bezdek chng minh (nu m > 1 , d
2
ik
> 0, 1s i s c) l
ng n.
Mt phn hoch ti u, ngha l hm mc tiu t gi tr ti thiu, m
ch yu da trn tng t gia x
k
v trung tm cm v
i
, iu ny tng
ng vi hai iu kin (5) v (6) phi tha mn. Vi hm mc tiu v cc
rng buc hm mc tiu t gi tr ti thiu trn y.
3.2.1.2. Thut ton FCM
Thut ton FCM cung cp mt qu trnh lp qua li gia phng trnh
(5) v (6) ti u(xp x cc tiu) hm mc tiu da trn o c tng t
c trng s gia x
k
v trung tm cm v
i
, sau mi vng lp, thut ton tnh ton
v cp nht cc phn t u
jk
trong ma trn phn hoch U. Php lp s dng khi
{ } c s
+ ) ( ) 1 (
max
k
ij
k
ij ij
u u , trong c l chun kt thc gia 0 v 1, trong khi k l
cc bc lp. Th tc ny hi t ti cc tiu cc b hay im yn nga ca
J
m
(u, V). Thut ton FCM tnh ton ma trn phn hoch U v kch thc ca
cc cm thu c cc m hnh m t ma trn ny. Cc bc thc hin ca
thut ton FCM nh sau:
43
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
THUT TON FCM
Input : S cm c v tham s m m cho hm mc tiu J;
Output: c cm d liu sao cho hm mc tiu trong (1) t gi tr cc tiu;
Begin
1. Nhp tham s cm c (1<c<n), m (1<m<+].
Khi to ma trn V=[v
ij
], V
(0)
eR
sxc
, j=0;
2. Repeat
2.1. j:=j+1;
2.2. Tnh ma trn phn hoch m U
(j)
theo cng thc (5);
2.3. Cp nht cc trung tm cm V
(j)
=[v
1
(j)
, v
2
(j)
,..., v
c
(j)
]
da vo cng thc (6) v U
(j)
3. Until ( ) c s
+
F
j j
U U
) ( ) 1 (
;
4. Trnh din cc cm kt qu;
End.
Trong ,
F
* l tiu chun Frobenious c nh ngha nh sau:
= =
i k
ik
T
F
u UU Tr U
2
2
) (
v tham s c c cho trc .
Vic chn cc tham s cm rt nh hng n kt qu phn cm, tham
s ny thng c chn theo php ngu nhin hoc theo Heuristic.
i vi m 1
+
th thut ton C-means tr thnh thut ton r.
i vi m th thut ton FCM tr thnh thut ton phn cm m
vi:
c
u
ik
1
= . Cha c quy tc no nhm la chn tham s m m bo cho vic
phn cm hiu qu, nhng thng thng chn m = 2.
d hiu c th xt v d sau: Cho mt tp cc i tng d liu mt
chiu c biu th nh Hnh 3.1 sau:
44
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 3.1: M phng v tp d liu n chiu
Bng quan st d nhn thy c hai cm trong tp d liu trn t tn
tng ng l "A" v "B". Vi thut ton k-means th hm tnh ph thuc
gia i tng d liu v trng tm cm ca n c th hin nh trong
th Hnh 3.2 di y:
Hnh 3.2: Hm thuc vi trng tm ca cm A trong k-means
Da vo hnh rt ra nhn xt rng, cc i tng trong cm A c gi tr
hm thuc vi trng tm ca cm A l bng 1 v bng 0 vi trng tm cm B.
iu ny ngc li vi cc i tng trong cm B.
Th nhng, i vi thut ton FCM th hm thuc ca cc i tng d
liu vi cc trung tm cm d liu c minh ha nh trong th Hnh 3.3
di y:
45
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 3.3: Hm thuc vi trng tm ca cm A trong FCM
Da vo hnh c th nhn xt rng, cc i tng d liu c gi tr hm
thuc vi cc trng tm ca cm A nm trong khong [0, l], hm thuc lc
ny l mt ng cong trn. im c mi tn ch n c nhiu kh nng
thuc v lp B hn l lp A do gi tr hm thuc ca n vo lp A l nh
(=0.2). C th biu din cc gi tr hm thuc trn bng ma trn cho c hai
trng hp nh sau:
(
(
(
(
(
(
=
(
(
(
(
(
(
=
1 . 0 9 . 0
4 . 0 6 . 0
7 . 0 3 . 0
2 . 0 8 . 0
1 0
0 1
1 0
0 1
nxc nxc
U U
S dng v s ct ph thuc vo s cc i tng d liu n v s cc
cm k.
Mt s v d m phng v kt qu cc cm khm ph c ca thut
ton phn cm m FCM nh Hnh 3.4 di y:
46
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 3.4: Cc cm khm ph c bi thut ton FCM
phc tp ca thut ton FCM tng ng vi phc tp ca
thut ton k-means trong trng hp s i tng ca tp d liu cn phn
cm l rt ln.
Tm li, thut ton phn cm m FCM l mt m rng ca thut ton
k-means nhm khm ph ra cc cm chng ln nhau, tuy nhin, FCM vn
cha ng cc nhc im ca thut ton k-means trong vic x l i vi cc
phn t ngoi lai v nhiu trong d liu. Thut ton cFCM c trnh by
di y l mt m rng ca thut ton FCM nhm khc phc cc nhc
im ny.
3.2.2. Thut ton cFCM(- Insensitive Fuzzy C-means)
3.2.2.1. Hm mc tiu
Thut ton phn cm FCM s dng hm bc hai do phi tng t
gia d liu v cc trung tm cm. Suy lun s dng do ny l tnh ton
thp v n gin. Tuy nhin, cch tip cn ny d b nh hng bi nhiu v
cc phn t ngoi lai. khc phc nhc im trn, mt o ci tin
c xut(Vapnik, 1998) s dng tham s c nh sau :
>
s
=
c c
c
c
t t
t
t
,
, 0
, c l tham s phi nhy cm vi nhiu (7)
Hm mc tiu ca thut ton cFCM c nh ngha nh sau:
47
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
( )
= =
=
n
k
c
i
i k
m
ik m
v x u V U J
1 1
, ) , (
c
c
(8)
trong :
=
=
p
l
kl i k
v x v x
1
,
c c
(9)
K hiu:
) (
l
) (+
hoc
) (
.
Lc lng ca tp A l card(A).
nh l 2: Nu m, c v c l cc tham s c nh, vi (U,V) e(E
fc
* R
pc
),
hm mc tiu J
m
(U,V) t gi tr ti thiu khi v ch khi:
( ) ( )
=
e =
e
=
(
e
=
s s . s s
0 ;
, 1
, 0
0 ,
1
1
1
1
1
1 1
k
k
I i
ik
k
k
c
j
m
i k
m
i k
ik
N k c i
I
I i u
I i
I v x v x
u
k
c c
(10)
v:
+
=
+ +
e
e
s s s s
i k
i k
A k
kl
A k
kl
il
p l c i
x
x
v
c
c
|
|
1 1
,
,
(11)
trong : ( ) ( )
{ }{ }
( ) ( )
)
`
+ + e =
=
+
=
+
+
N
k
k k
N
k
kl k k
m
ik i i
x u A
k k
1 1 ,
min
| , 0 c
vi gi thit
=
=
+
=
N
k
k
N
k
k
1 1
v
+
k
,
k
e
m
ik
u ) ( , 0
tp I
k
c nh ngha l I
k
= {i | 1 s i s c; |x
k
- v
i
|
c
= 0; k=1, 2, , N}
( )
{ }
( )
{ } (
(
+ +
=
+ +
e e
+
s s s s
i i i i
A k
kl
A k
kl
i i
il
p l c i
x x
A A card
v
c c
| |
1 1
) (
1
(12)
nh l ny c cc nh khoa hc chng minh.
48
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
3.2.2.2. Thut ton cFCM
Cc bc thc hin ca thut ton cFCM nh sau:
THUT TON cFCM
Input: S cm c v cc tham s m, c cho hm mc tiu J;
Output: Cc cm d liu sao cho hm mc tiu trong (2) t
gi tr cc tiu;
Begin
1. Nhp tham s c (1<c<n), me[1, +], v c>0.
Khi to ma trn V=[v
], V
(0)
e R
sxc
, thit lp j = 0;
2. Repeat
2.1. j:=j+1;
2.2. Tnh ma trn phn hoch m U
(j)
theo cng thc (10);
2.3. cp nht cc trung tm V
(j)
= [v
1
(j)
, v
2
(j)
,., v
c
(j)
]
da vo (12) v U
(j)
;
3. Until ( c s
+
F
j j
U U
) ( ) 1 (
);
4. Trnh din cc cm kt qu;
End.
Tm li, thut ton cFCM l mt m rng ca thut ton FCM trong
vic thch nghi vi nhiu v phn t ngoi lai trong d liu. Tuy vy, hiu qu
ca thut ton cFCM i vi tp d liu ln, tp d liu nhiu chiu cng nh
cch xc nh tham s c l nhng vn tip tc cn phi nghin cu v hon
thin. Thut ton FCM-Ci tin c trnh by di y l mt ci tin nhm
khc phc cc nhc im ny.
49
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
3.2.3. Thut ton FCM-Ci tin
Do thut ton FCM v cFCM c mt s hn ch nht nh. Nn gii
bi ton lm min hm mc tiu ( , ) J u v vi s cm khng c nh v trnh
c trng hp ch t min a phng ta tin hnh xy dng thut ton
phn cm m da trn vic ci tin thut ton FCM thng qua vic din gii
mt s thut ton nh sau:
3.2.3.1. Thut ton 1: Thut ton la chn cc im d liu lm ng
vin cho vic chn cc trung tm ca cc cm
Cho tp d liu
{ }
1 2
, ,..,
n
X x x x = , , 1,..,
p
i
x R i n e = v
*
n N e
Phn chia X thnh m tp con (Nu n b th khng cn chia na):
Gi s . n mb d = + (b, d, m l cc s nguyn dng, 1 d m s < ).
Khi , ta chia
1
m
t
x
t
X C
=
=
, vi
{ }
1 2
, ,..., , 1, 2,.., 1
t t t t
x b
C x x x t m = = v tp
cui cng
{ }
1 2 1
, ,.., , ,..,
m m m m m m
x b b b d
C x x x x x
+ +
= . (13)
Vn chn ln cn :
Bc ny nh hng kh ln ti qu trnh tnh ton. Chng ta c th
dng khi nim phng sai mu trong thng k ton hc gii quyt cho
vn chn ln cn ca mt im d liu.
Chng hn, xt tp { }
n
z z z Z ,.., ,
2 1
= , R z
i
e , n i .. 1 = . Khi , k vng mu
n
z
z
n
i
i
=
=
1
, lch tiu chun l
=
=
n
i
i z
z z
n
s
1
2
) (
1
. khi ta c
1
min( )
s s
=
Xk
k p
r s , vi
Xk
s l lch tiu chun ca tp{ }
1
,..,
k nk
x x vi k = 1, 2, , p.
Ta xt ln cn ca mi im d liu l hnh hp p chiu vi bn knh c
th nh ngha theo lch tiu chun l r.
50
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
im
j
x c gi l nm trong ln cn ca x
i
nu
j
x nm trong hnh
hp p chiu bn knh r ca x
i
.
Tnh m c ca cc im d liu.
m c ca
t t
i x
x C e , k hiu l ( )
t t
i i
D x , l s im d liu nm
trong ln cn ca
t
i
x .
( )
1
( )
b
t t t t
i i j i
j
D x u r x x
=
=
, vi
1, 0
( )
0, 0
z
u z
z
>
=
<
. (14)
Khi , thut ton 1 c trin khai theo cc bc sau:
THUT TON 1
Bc 1: Tnh
1
1
n
z x
i ji
n
j
=
=
v lch tiu chun
( )
2
1
1
n
s x z
i ji i
n
j
=
=
, i=1,2,..p.
Tnh bn knh
1
min
i
i p
r s
s s
= .
Bc 2: Tnh m c ca cc im d liu D
i
l s im d
liu nm trong hnh hp p chiu bn knh r ca
i
x X e :
1
( )
n
j i
j
u r x x
=
.
Bc 3: Tm im
i
x X e sao cho n c c ln nht.
Bc 4: Tnh CC = {x
j
: x
j
nm trong hnh hp p chiu bn knh r
ca x
i
} v \ X X CC = . (* y CC l tp tt c cc im d liu nm
trong hnh hp p chiu bn knh r ca im d liu x
i
*)
Bc 5: Nu X | = th dng. Ngc li, th quay ln bc 2.
51
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Trong trng hp mu d liu ln th chng ta chia tp d liu ra thnh
cc tp d liu nh hn. Sau mi p dng thut ton 1 cho tng mu d
liu mi thu c sau khi phn chia.
Sau khi chy xong thut ton 1, th ta s c c mt tp cc im d
liu lm ng vin cho vic chn cc trung tm ca cc cm. Nu tp ny ln
th chng ta li p dng li thut ton 1 mt ln na. iu ny c th hin
thng qua thut ton 2.
3.2.3.2. Thut ton 2: Thut ton lc bt cc ng vin
Sau khi chy xong thut ton 1. Th t tp d liu X ban u, chng ta
chn ra c n
c
cc im d liu lm ng vin.
Gi s
x
C = {x
j
: x
j
l im d liu ng vin, 1..
c
i n = }. Khi n
c
m ln
th ta s dng thut ton 1 cho tp d liu mi l C
x
. Kt qu ta thu c tp
d liu mi l
{ }
, 1, 2,..,
p
x j p
C x j n = = . Sau khi chy xong thut ton 2, ta c
c tp d liu mi. Khi , ta chuyn sang thut ton 3 tm cc im d
liu lm trung tm ca cc cm, y l nhng im d liu m lm min hm
mc tiu.
3.2.3.3. Thut ton 3: Thut ton chn cc ng vin lm cc tiu HMT
Sau khi kt thc thut ton 1 v thut ton 2 th ta thu c tp cc
im d liu lm ng vin cho trung tm cc cm l
{ }
1 2
, ,...,
p
p
c n
C cc cc cc = .
Trong thut ton FCM, ta dng hm mc tiu ( , ) J u v
2
1 1
( , )
p
n
n
m
ji i j
i j
J u v x cc
= =
=
(15)
Ta thay th hm mc tiu ny bng hm mc tiu mi
*
FCM
J c xc
nh nh sau:
2
*
1 1
.
q n
m c
FCM ji i j cc
i j
J J x cc e
= =
= +
(16)
52
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
vi
2
1 1
q q
c
cc i j
i j i
J cc cc
= = +
=
(17)
v
( )
2
1
1
2
1
1
ji
c
q
i j
m
c
k
i k
x cc
x cc
=
=
(18)
Thng thng ta hay chn m = 2. Do cc cm c th rt gn nhau nn
ta dng J
CC
c thm trng s 1 e > c th phn bit c cc cm ny
trong trng hp chng kh gn nhau.
Khi , thut ton 3 c trin khai theo cc bc sau:
THUT TON 3
Bc 1: q = 1, gn
**
FCM
J = +, 1 o = .
Bc 2: gn
c
q
cc cc
o
= ,
p
c
cc C
o
e .
Bc 3: Tnh thuc
( )
2
1
1
2
1
1
ji
c
q
i j
m
c
k
i k
x cc
x cc
=
=
=1
ji
,
i=1, 2, .., n v j = 1, 2, .., q.
Tnh
*
FCM
J theo cng thc (17)
Bc 4: Nu
* **
FCM FCM
J J < th gn
** *
:
FCM FCM
J J = v q:=q+1;
Bc 5: Nu 1
p
n o > th dng vi q:=q-1. Ngc li th quay
ln bc 2 vi : 1 o o = + .
3.2.3.4. Thut ton 4:Gn cc trung tm c lin kt gn gi vo 1 cm
Sau khi kt thc thut ton 3, ta c mt s hu hn cc im d liu
c chn lm trung tm ca cc cm. l:
{ }
1 2
, ,..., ,
c c c
h p
cc cc cc h n s .
53
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
By gi ta kim tra xem cc cm c th kt ni li c vi nhau hay
khng. iu ny c thc hin nh thut ton 4. Thut ton ny c trin
khai nh sau:
Vic 1: Tm ng dy c th ni
c
i
cc v
c
j
cc (ng dy ny ph thuc
vo trng hp c th ta chn, chng hn nu l lng my, ta s tm mt ng
cong parabol, con mt ta c th dng hnh elip)
Gi s c nhng im nm trn ng dy ni
c
i
cc v
c
j
cc l:
1 2
, ,..,
k
v v v
sao cho:
1 2 1 1
...
c c
i k k j k
v cc v v v v cc v
= = = =
1
c c
i j
cc cc
k
=
+
Sau , ta tin hnh tnh m c ca cc im ny:
1
( )
i
n
c
cc l i
l
D u r x cc
=
=
(19)
1
( )
j
n
c
cc l j
l
D u r x cc
=
=
(20)
1
( ), 1,...,
a
n
v l i
l
D u r x v a k
=
= =
(21)
Nu cc im d liu ny tha mn vi mi l = 1, 2, , k i vi tt c
max( , )
4
i j
l
cc cc
v
D D
D > th ta xc nh hai cm ny c th kt ni c.
Vic 2: Ta ni 2 trung tm v dng lut bc cu ni tip chng
vo mt cm.
Chng hn, nu (
c
i
cc v
c
j
cc l kt ni c) v (
c
j
cc v
c
k
cc l kt ni c)
th
c
i
cc ,
c
j
cc v
c
k
cc l kt ni c v cc trung tm ny c gn vo cng
mt cm.
Khi , thut ton 4 c th hin chi tit qua cc bc sau:
54
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
THUT TON 4
Vic th nht:
Bc 1: gn
{ }
, 1,...,
c
l l
Cluster cc l h = = .
Bc 2: Ta to ra index1 v index2 u l mng mt chiu c h.
Khi to tt c cc phn t ca mng ny vi gi tr 0.
Bc 3: t i:=1; j:=i+1;
Bc 4: Tm cc im nm trn ng dy ni
c
i
cc v
c
j
cc
(thng ng dy ny da vo c im ca ng cong cn nhn dng,
chng hn lng my th ta dng mt ng cong parabol) l
1 2
, ,..,
k
v v v
sao cho:
1 2 1 1
...
c c
i k k j k
v cc v v v v cc v
= = = =
1
c c
i j
cc cc
k
=
+
Bc 5: Tnh m c ca cc im d liu bc 4:
1
( )
i
n
c
cc l i
l
D u r x cc
=
=
1
( )
j
n
c
cc l j
l
D u r x cc
=
=
1
( ), 1,...,
a
n
v l a
l
D u r x v a k
=
= =
Bc 6:
IF
1
k
a=
(
max( , )
4
i j
a
cc cc
v
D D
D > )THEN
If (index1[i]=0) and (index2[j]=0) then
Begin
cluster cluster cluster
j j i
= ;
Index1[i]:=j;
Index2[j]:=i;
55
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
End
Else if ( | | | | ( )
1 0) 2 0 index i and index j = = then
Begin
| | max( , 1 ) j index i
cluster =
| | max( , 1 ) j index i
cluster
| | max( , 2 ) i index j
cluster ;
| | | |
1 max( , 1 ) index i j index i = ;
| |
2 index j i = ;
End
Else if (( | | | | ( )
2 0) 1 0 index j and index i = = then
Begin
| | 1 2 index index j
cluster
(
=
| | 1 2 index index j
cluster
(
;
i
cluster
| |
1 index i = | |
1 2 index index j (
;
End
Else if (
| | | |
( 1 0) ( 2 0) index i and index j <> <> then
Begin
temp:= | |
1 2 index index j (
;
| | max( 1 , ) index i temp
cluster =
| | max( 1 , ) index i temp
cluster
| | min( 1 , ) index i temp
cluster ;
| | | |
1 : max( 1 , ); index i index i temp =
End
Else (* ca IF u tin *)
begin
{ }
c
i i
cluster cc = v
{ }
c
j j
cluster cc =
End;
Bc 7: IF ( ) ( ) i h and j h s s THEN j:=j+1 v quay tr li bc 4
ELSE if ( ) i h s then quay tr li bc 4
ELSE i:=i+1,j:=j+1 v quay tr li bc 4
56
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Vic th hai:
Bc 1: i:=1;FC:=| ;
Bc 2: IF
| |
1 0 index i <> then
Begin
h:=h-1; xa
i
cluster ;
FC:=FC i ;
End;
Bc 3: i:=i+1;
Bc 4: IF i h s THEN quay li bc 2
ELSE dng li .
3.2.3.5. Tng kt thut ton FCM-Ci tin
T chi tit ca 4 thut ton nh trn, ta c th tng hp thut ton
FCM-Ci tin mt cch tng qut thng qua cc bc c th nh sau:
THUT TON FCM-CI TIN
Cho tp d liu
{ }
1 2
, ,..,
n
X x x x = , , .. 1 , n i R x
p
i
= e v
*
, N p n e
Thut ton FCM-Ci tin c thc hin thng qua cc bc sau:
Bc 1: Nu n l ln th chy thut ton 1 c c tt c cc
im c kh nng lm trung tm cm, ngc li th chy bc 3.
Bc 2: Nu sau khi chy thut ton 1 m s im lm ng vin
ln th ta chy thut ton 2 rt gn bt s cc im ng vin lm cc
trung tm ca cc cm.
Bc 3: Chy thut ton 3 tm cc im lm trung tm cm.
Bc 4: Chy thut ton 4 ni cc trung tm cm c lin h
gn gi vo mt cm.
57
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Sau khi bc 4 kt thc, ta s c tp cc trung tm ca cc cm.
Bc 5: (c th nhy ti bc 6)
Chy thut ton FCM vi tp cc trung tm ca cc cm, sau
dng li.
Bc 6: Tnh cc thuc ca cc im d liu cn li:
Gi
ij
l thuc ca x
i
vo
c
j
cc .
Ta c:
ij
=
n
k
m
c
k i
c
j i
cc x
cc x
1
1
1
2
2
) (
1
Tm li, phn cm m l mt s m rng ca PCDL bng cch thm
vo yu t quan h gia cc phn t v cc cm d liu thng qua cc trng
s trong ma trn U. Bng cch ny, c th khm ph ra cc cm d liu phc
tp theo cch mm do t mt tp d liu cho. Thut ton phn cm m l
mt cch thc m rng cho cc thut ton phn cm r nhm khm ph ra cc
cm d liu chng ln nhau trn cc tp d liu c kch thc ln, nhiu
chiu v nhiu nhiu...
58
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
CHNG 4
M HNH MNG NRON A KHP
DNG CHO PHN CM M
4.1. Tng quan v mng Nron ....................................................................................
4.2. Cu trc mng Nron .............................................................................................
4.2.1. Hm kch hot .................................................................................................
4.2.2. Lin kt mng ..................................................................................................
4.2.3. Bi ton hun luyn mng ...............................................................................
4.3. Mng HOPFIELD ..................................................................................................
4.3.1. Hun luyn mng ............................................................................................
4.3.2. S dng mng ..................................................................................................
4.4. Mng Nron a khp dng cho phn cm .............................................................
4.4.1. Xy dng lp mng Layer1 cho ti u cc trung tm cm .............................
4.4.2. Xy dng lp mng Layer2 cho ti u cc thuc ......................................
4.5. S hi t ca FBACN ............................................................................................
4.5.1. Chng minh s hi t ca FBACN .................................................................
4.5.2. S hi t FBACN lin tc ca Layer1 ............................................................
4.5.3. Gii thut ca FBACN v FBACN vi vic hc ............................................
58
61
61
61
61
62
62
63
63
65
68
72
72
74
75
Thut ton FCM-Ci tin khc phc c mt s hn ch ca thut
ton FCM v FCM. Tuy nhin n li c nhc im l mi khi c yu cu
phn cm th thut ton s chy t u, cc kt qu ca cc mu trc l
khng s dng c cho ln sau nn thi gian chy kh ln nu nh kch
thc mu ln. V vy, trong chng ny, chng ta nghin cu mt m hnh
mng Nron a khp dng cho bi ton phn cm m (a fuzzy bi-directional
associative clustering network FBACN). Mng Nron ny ch yu da vo
ti liu ca hai tc gi Chih-Hsiu Wei, Chin - Shyurng Fahn.
4.1. Tng quan v mng Nron
Trc ht chng ta ai cng bit rng tri thc ca loi ngi cho n nay
ht sc phong ph, su rng v a dng. N bao gm nhng hiu bit ca
chng ta t th gii vi m nh nguyn t, in t, ht nhn, cc ht c bn, ...
59
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
n nhng hiu bit v m v tri t, v h mt tri, h thin h, ... . hiu bit
v th gii t nhin v x hi, v cc nghnh khoa hc, k thut khc nhau
nh: ton, l, ha, cng ngh thng tin v c nhng hiu bit v bn thn con
ngi. Th nhng c mt iu m c v nh l mt nghch l l chng ta
bit "rt t" v chnh b no ca chng ta.
Hn na do nhu cu ngy cng cao trong vic gii quyt cc vn phc tp
v do bn cht ca con ngi l khng mun bng lng vi hin ti m lun
mun vn ti nhng g cao hn, hon thin hn. C l chnh v nhng iu
trn m thut ng "mng Nron" hoc "mng Nron nhn to" ra i. Cc
thut ng ni n mt nghnh k thut mi m n i hi kin thc t
nhiu nghnh khoa hc khc nhau nh ton hc, vt l hc, ha hc, sinh vt
hc, tm l hc, thn kinh hc, ... v tt c ch nhm lm sao to ra nhng
chic my tnh hot ng ging nh " b no " ca chnh chng ta.
Mng Nron nhn to hay thng c gi ngn gn l mng Nron l
mt m hnh ton hc hay m hnh tnh ton c xy dng da trn cc
mng Nron sinh hc. N gm c mt nhm cc Nron nhn to(nt) ni vi
nhau, v x l thng tin bng cch truyn theo cc kt ni v tnh gi tr mi
ti cc nt. Trong nhiu trng hp, mng Nron nhn to l mt h thng
thch ng, t thay i cu trc ca mnh da trn cc thng tin bn ngoi hay
bn trong chy qua mng trong qu trnh hc.
Trong thc t s dng, nhiu mng Nron l cc cng c m hnh ha
d liu thng k phi tuyn. Chng c th c dng m hnh ha cc mi
quan h phc tp gia d liu vo v kt qu hoc tm kim cc dng mu
trong d liu.
60
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 4.1: M hnh mng Nron
Mng Nron nhn to (Artificial Neural Network) l mt m hnh ton
hc bao gm cc nt x l thng tin c s (gi l n v x l hoc Nron) c
mi lin h tng h cao, tin hnh x l thng tin song song v phn tn c
nng lc tnh ton mnh (v d hin nay n c th hc, nh v suy din t mu
d liu...). Mi lin kt gia hai Nron km theo mt trng s no , c
trng cho c tnh kch hot/c ch gia cc Nron. C th xem trng s l
phng tin lu gi thng tin di hn trong mng Nron v nhim v ca
qu trnh hun luyn (hay cn gi l qu trnh hc) mng l cp nht cc trng
s khi c thm thng tin v cc mu hc, hay ni cch khc, cc trng s
c iu chnh sao cho dng iu vo ra ca n m phng hon ton ph
hp vi mi trng ang xem xt. V vy, cu trc ca mng Nron ch yu
c c trng bi loi ca cc Nron v mi lin h x l thng tin gia
chng v do , mng Nron c rt nhiu ng dng trong nhiu lnh vc nh
nhn dng, phn lp nh, phn tch - nn d liu, cc bi ton ti u, d bo,
chun on, V xu th hin i l s kt hp mng Nron vi logic m.
61
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
4.2. Cu trc mng Nron
4.2.1. Hm kch hot
Hm kch hot ca tng Nron trong mng Nron ng vai tr quan
trng trong s lin kt gia cc Nron. Hm ny c trng cho mc lin
kt gia cc Nron.
Trong l thuyt mng Nron, php tng hp cc tn hiu u vo
thng c k hiu di dng:
1
n
j i ji
i
net x e
=
=
vi 1.. ,
j
j n x = l cc tn hiu
vo.
1
( ,..., )
ji j jn
e e e = l trng s, n l s tn hiu u vo. u ra ca Nron j
thng c k hiu l outj hoc fj, c gi l hm kch hot.
1
( ( ) )
n
j j i i
i
out f f x t u e
=
= =
= vi X
s
=(x
s1
,..,x
sm
)
4.3.2. S dng mng
Gi s ta a vo mng tn hiu vo l vct X.
S dng mng tnh u ra tng ng vi tn hiu vo X l qu trnh
lp gm cc bc:
1. Ban u, t X
(0)
=X. Gi Y
(1)
l vct tn hiu ra tng ng vi mt
ln cho X
(0)
lan truyn trong mng.
Y
(1)
= out
(1)
= Tnh (HF, X
(0)
).
2. Nu
(0) (0)
Y X = th tip tc lp vi bc t = t+1 v X
(t+1)
=Y
(1)
. Ngc
li th dng.
Tip theo chng ta nghin cu mt m hnh mng Nron dng cho
phn cm m, l mng Nron a khp.
4.4. Mng Nron a khp dng cho phn cm
Mt vi nm trc, cc h thng Nron ng(i khi gi l mng Nron
hi quy) c s dng nhiu trong cc qu trnh x l thng tin.
64
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 4.3: M hnh FBACN
Cu trc ca mng Nron a khp-FBACN c a ra nh hnh 4.3.
Lp hi quy Layer1 c thc hin bi mt mng Hopfield ti u ho cc
trung tm cm. Trong khi lp hi quy Layer2 c thc hin bi mt
mng Nron a khp ni ti u cc thuc. Kt hp Layer1 v Layer2
to thnh lp hi quy 3, lp ny lm nn cu trc ng ca mng.
Hot ng ca FBACN c m t nh sau: Th 1 l khi to ngu
nhin cc trung tm cm v thuc thnh vin ca Layer1 v Layer2 tng
ng. Th 2, khi to cc thuc thnh vin trong Layer2 s c truyn
sang Layer1. Th 3, da trn vic nhn c cc thuc thnh vin, Layer1
thc hin qu trnh hi quy thu c cc trung tm cm ti u mi. Th 4,
cc trung tm cm mi ca Layer1 truyn sang Layer2. Th 5, da trn vic
nhn c cc trung tm cm mi, thc hin qu trnh hi quy thu c
thuc thnh vin ti u mi. Vic hon tt qu trnh trn t bc 2 n bc 5
c gi l qu trnh lp. Qu trnh lp din ra cho ti khi no t ti mt tiu
chun ti hn.
65
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
4.4.1. Xy dng lp mng Layer1 cho ti u cc trung tm cm
Lp Layer1 ca FBACN c th s dng mng Hopfield hoc mng
Nron a khp tu thuc vo cc rng buc ca FC-partition (FC- fuzzy c).
Nu rng buc lm cho hm mc tiu c dng bc cao, hoc dng logarithm,
hoc dng sin, v.v.. th ta s dng mng Nron a khp thay v dng mng
Hopfield n gin.
Hnh 4.4: M hnh Lp Layer1 ca FBACN
Gie
ji
l trng s kt ni hot ng ca Nron j vi Nron vo i. Tt c
cc u vo n Nron th j c k hiu l i
j
. Khi , tng hp cc tn hiu
u vo i vi Nron j l:
1
e
=
= +
n
j i ji j
i
net v i (1). Vi v
i
l u ra ca Nron
i, f l hm n iu tng v lin tc.
Ta c hm kch hot
2
( ) 1
1 exp( . )
j
j
f net
r net
=
+
(2). y, ngng
r > 0 lm tng tnh thch nghi v kh nng tnh ton ca mng Nron.
Gi s g l ch s quy v khi
66
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
( 1) ( ) ( )
1
( ) ( . )
s
g g g
j j ji i j
i
v f net f v i e
+
=
= = +
(3)
Khi , ta c th din t mng Nron thng qua ma trn NET sau:
NET = WV + I (4)
vi
1
2
s
net
net
NET
net
(
(
(
=
(
(
(
,
11 1
21 2
1
s
s
s ss
W
e e
e e
e e
(
(
(
=
(
(
(
,
1
2
s
v
v
V
v
(
(
(
=
(
(
(
v
1
2
s
i
i
I
i
(
(
(
=
(
(
(
nh gi tnh n nh ca h thng trong hnh 4.4, chng ta dng
hm tnh ton nng lng CEF(computational energy function-CEF).
Ta c CEF(E) l:
1
. . .
2
T T
E V W V I V = (5)
Hay c th hn l:
1 1 1 2
1
. .
s s
ji i j j j
j i j
s
E v v i v e
= = =
=
(6)
Trong cng thc (6) th E l dng ton phng. V vy, trong mng Nron
ng ny ta c th dng hm mc tiu dng ton phng ti u ha cc
tham s.
Tip theo, ta s l gii s ph hp gia hm mc tiu ca FC-partion v
hm tnh ton nng lng ca FBACN.
Ta c hm mc tiu ca FC-partion l:
,
1 1
( ; ) 2 ( )
n c
T T T m
m k k i k i i i k
k i
z U v x x v x v v u
= =
(
= +
(7)
Trong Layer1 ca FBACN, cc tham s ti u l cc trung tm cm v
i
.
Cc thuc mi u
i,k
c thc hin t lp 2 ti Layer1 s kch hot hi quy
trong Layer1 ti u v
i
. Ngoi ra, u
i,k
tm thi coi l cc hng s trong khi
hi quy trong Layer1. T nh ngha ca hm mc tiu, cc vct trung tm
ca cc cm
p
ip i i i
R v v v v e = ) ,.., , (
2 1
l vct p chiu. V vy, ta phi khai trin
67
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
(7) kh biu thc vct trc khi n c em so snh vi hm tnh ton
nng lng thng gi tr. Thng gi tr c khai trin trong cng thc (7) vi
ln kh u tin l:
2
, , , ,
1 1 1
( ; ) 2 . ( )
p n c
m
m i l k l i l i k
k i l
z U v v x v u
= = =
(
= +
(8)
Quan st (6) v (8) ta thy s khc nhau chnh gia hai cng thc ny
l cch k hiu khc nhau. Trong mng Nron, th cc hot ng ra ca cc
Nron c k hiu mt cch duy nht bng mt k hiu di dng. Chng
hn, hot ng ra v
i
vi s i ... 1 = . Tuy nhin, sau khi khai trin hm mc tiu
ca FC-pariton, c 2 t vit di dng trong tham s v
i,l
, vi 1..1, 1.. l i c = = .
thng nht trong cch th hin, ta k hiu li nh sau:
, ( 1). i l i p l
v v
+
(9)
Khi , ta vit li biu thc (8) nh sau:
, ( 1) ,
1 1 1
( ; ) [ 2( ) . .
p n c
m
m i k i p l k l
k i l
z U v u v x
+
= = =
=
2
, ( 1)
( ) . ]
m
j k i p l
u v
+
+ (10)
S Nron s trong Layer1 (6) lc p .
Ta c
1
2
c p
i
i
I
i
(
(
(
=
(
(
(
,vi
( 1) , ,
1
2 .
n
m
i p l i k k l
k
i u x
+
=
=
, 1.. 1.. i c v l p = = (11)
v cc phn t ca ma trn W c xc nh bi:
/ ,
1
2 ,
0,
n
m
i p k
k
ji
u i j
i j
e
(
(
=
, , 1.. i j c n = (12)
vi k hiu | x | l cch ly s nguyn cao hn v gn nht so vi x.
68
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Qu trnh ti u ca Layer1:
Mc tiu ca mng hi quy l lm cho hm mc tiu xp x ti gi tr
nh nht. Do W l ma trn i xng nn W
T
=W, ta ly gradient ca vct
nng lng trong (5) ta c:
1
( ). .
2
T
E W W V I W V I NET V = + = = (13)
v ( ) . ( )
T T
E E V NET V A = V A = A
1
.
s
j j
j
net v
=
= A
(14)
Tip theo, chng ta xy dng hm kch hot (lin tc hoc ri rc) f
c s dng trong Layer1 ca FBACN theo cng thc sau:
( ) ( )
( 1) ( )
( )
, 0
( )
, 0
g g
j j j
g g
j
g g
j j j
v if net
v f net
v if net
o
o
+
+ >
= =
<
(15)
vi
j
o l gi tr dng nh nht iu chnh v
j
. Ta thy khi
( )
0
g
j
net > th
( 1) ( ) g g
j j j j
v v v o
+
A = = >0,
1
. 0
s
j j
j
E net v
=
A = A s
.
Trong kiu kin trc ny, th s cp nht h s
j
o vi gi tr khng hn
ch. Theo phng php m ta thit k th cch la chn gi tr ca n l
ph hp vi tin trin ca Layer1.
4.4.2. Xy dng lp mng Layer2 cho ti u cc thuc
Lp mng Layer2 ca FBACN c chc nng l ti u ha lp cc
thuc. Ta c th coi cc v
i
l cc hng s tm thi trong khi lp Layer2 hi
quy. Ta c tp
2
, k i i k
x v d = . Khi , cng thc:
( )
2
,
1 1
( ; )
c n
m
m i k k i
i k
z U v u x v
= =
=
c vit li l
, ,
1 1
( )
c n
m
i k i k
i k
u d
= =
.
Khi , bi ton ti u trong lp Layer2 ca FBACN l: lm min
, ,
1 1
( ) .
c n
m
i k i k
i k
u d
= =
vi rng buc
,
1
1, 1..
c
i k
i
u k n
=
= =
.
69
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Theo cng thc Lagrange th ta c th pht biu li bi ton nh sau:
2
, , ,
1 1 1
( ) 1
n c c
m
i k i k i k
k i i
u d u
= = =
(
| |
( |
\ .
(
+
(16)
vi l tham s Lagrange (thng 10000 100000 s s ).
Ta k hiu li
, ( 1). i k k c i
u u
+
v
, ( 1). i k k c i
d d
+
(17)
Khi , ta c th biu din li hm mc tiu l:
2
( 1). ( 1). ( 1).
1 1 1
( ) . ( 1)
n c c
m
k c i k c i k c i
k i i
u d u
+ + +
= = =
(
+
(
(18)
Trong cng thc (18) th s hng c bc cao nht l
( 1).
( )
m
k c i
u
+
(thng thng
ta chn m=2) vi k=1, 2, ..,n v i = 1, 2, ..., c. Nhng mng Hopfield ch ph
hp vi bi ton ti u bc hai. V vy, ta pht trin mng Nron a khp cho
cc bi ton ti u chung. Tc mng Nron a khp c th gii quyt c
vi bt k hm mc tiu bc cao c rng buc no.
M hnh mng Nron a khp n gin c s dng cho Layer2 nh sau:
Hnh 4.5: M hnh Lp Layer2 ca FBACN
Tnh ng ca mng Nron a khp trong hnh 4.5 l:
70
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
( 1) ( ) ( ) ( )
1
( ) ( ( . . ) )
S
g g g g
j j ji i ji i j
i
u f net f u z u i e
+
=
+ + = =
, vi j=1, 2, ..., s. (19)
Ta c ma trn u vo mng: . . NET WU Z U I = + + (20)
Vi
11 12 1
21 22 2
1 2
s
s
s s ss
z z z
z z z
Z
z z z
(
(
(
(
(
(
=
v
1
2
S
u
u
U
u
(
(
(
(
(
(
=
(
(
(
(
(
1
1
2
1
1
) 1 (
m
nxc
m
m
m
u
u
u
U
, m > 1 v U
(1)
= U (21)
Ta k hiu chuyn v ca U
(m-1)
l:
( 1)
T
m
U
. Khi , hm tnh ton nng
lng ca mng Nron s c tnh bi cng thc:
I U U Z U U
m
E
T T T
m
. . .
2
1
- W.U .
1
) 1 (
|
.
|
\
|
|
.
|
\
|
=
(22)
Vi s lng Nron s trong lp Layer2 l n c . Ta c th tnh ton cc phn
t ca ma trn W, Z v I nh sau:
1
2
0 0
0 0
0 0
c n
d
d
W m
d
(
(
(
=
(
(
(
tc l cc phn t
ca W c tnh theo cng thc sau:
,
0 ,
md i j
i
ji
i j
e
=
=
=
, , 1,2,.. i j c n = (23)
Z l ma trn c ( ) ( ) c n c n v c xc nh nh sau:
2 ( 1). .
0
,
,
ji
i i
c j c
c c
if
z
( (
< s
( (
( (
(
(
(
=
(
(
Do ma trn trng s W l i xng (W c gi l i xng nu A. W.B =
B.W.A), v vy hm tnh ton nng lng (22) c biu din l:
I U U Z U U
m
E
T T T
. . .
2
1
W.U .
1
1) - (m
|
.
|
\
|
|
.
|
\
|
= (25)
U
<m-1>
c th iu chnh hai ma trn trng s W v Z. Khi , ma trn
NET ca mng Nron a khp c xc nh nh sau:
1
. .
m
NET WU ZU I
< >
= + +
Gi h l hm s xc nh bi:
1
) (
=
m
j j
u u h (26). Khi , h c gi l hm
tnh
1 m
j
u
V tnh ng ca lp Layer2 c th hin cng thc:
( 1) ( )
( )
g g
u f net
j j
+
=
( ) 1 ( )
1
( ( ( ) ) )
s
g m g
ji j ji j j
i
f u z u i e
=
= + +
(27)
Vi net
j
l tng u vo ca Nron th j v c tnh bi cng thc:
j i ji
m
i ji
s
i
j
i u z u net + + =
=
) . . (
1
1
e (28)
Theo cng thc (25), ta c gradient nng lng E V :
s j i u z u
u
E
s
i
j i ji
m
i ji
j
.. 1 ), ) . . ( (
1
1
= + + =
c
c
e vi s = n c (29)
T (28) v (29) ta c , 1..
E
net j s
j
u
j
c
= =
c
(30)
Qu trnh ti u ca Layer2
Khi hm mc tiu (18) c cn bng vi hm tnh ton nng lng
(25) v gradient tnh ton nng lng c lin kt vi gi tr net vo, kt
qu ti u dn t c khi mng tin trin. T khi nim nng lng, hm
72
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
ri rc f l hm kch hot ca mng Nron a khp c xy dng ging
nh ca lp th nht:
Vect nng lng gradient c tnh ton lin quan n u
j
:
j
j
net
u
=
o
o
Hm kch hot ri rc c a ra :
( ) ( )
( 1) ( )
( )
0 ,
( )
,
g g
j j j
g g
j j
g
j j
if net
otherwise
u
u f net
u
o
o
+
>
+
= =
(31)
Vi vect nng lng gradient lun m v cng thc (31), m bo
mng Layer2 s ti u trong qu trnh tin ho.
4.5. S hi t ca FBACN
4.5.1. Chng minh s hi t ca FBACN
Mt yu t quan trng cho mng hi quy l kh nng n nh ca
mng. Trc khi a ra tnh n nh ca mng FBACN, chng ta s bt u
vi mt vi nh ngha v khng gian Metric v nh l a ra bi Steck v
sau l mt nh l hi t ph qut.
nh ngha 1: Trong khng gian Metric cho tp X v hm d: X X R
tha mn cc iu kin:
1) ( , ) 0, , d x y x y X > e
2) ( , ) 0 d x y x y = =
3) ( , ) ( , ), , d x y d y x x y X = e
4) ( , ) ( , ) ( , ) d x z d x y d y z s + , , x y z X e
nh ngha 2: Cho X l khng gian Metric vi khong cch d v
: X X u . im x c gi l im c nh ca u nu ( ) x x u = .
nh ngha 3: nh x u l co nu tn ti c*, vi
*
0 1 c < < sao cho:
( ) ) , ( . ) ( ), (
*
y x d c y x d s u u , X y x e , (32)
73
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Nu u tha mn (32) th u ch c 1 im c nh. Tht vy, gi s c 2
im c nh x v y. Khi theo (32) ta c:
*
( , ) . ( , ) d x y c d x y s . V vy,
d(x,y)=0, nn x = y.
Vy ta c th khi qut iu trn thng qua nh l sau:
nh l v nh x co(nh x thu gn-AXC): nh x co ca khng gian
Metric y c duy nht mt im c nh.
nh l 1: Cho mng Nron nhn to hi quy kt ni y gm cc
Nron s vi kch hot ng
=
+
+ =
s
i
j
g
i ji
g
j
i net f net
1
) ( ) 1 (
) ( . e (33), vi f l mt
hm c gii hn, lin tc v c gi tr thc, hm c o hm c gii hn v
nu tha mn:
s
c f
ji
1
* *
max
'
< < e , s j i .. 1 & = (34) th mng hi t n mt
im c nh duy nht i vi bt k mt gi tr khi to no ca mng.
B : vi mi hm f tha mn iu kin gi thuyt, th vi bt k
, x y R e ta c
'
( ) ( )
max
f x f y f x y s (35)
Da vo cc nh ngha, nh l v b trn, mt nh l hi t ph
bin cho mng Nron a khp ni c a ra nh sau:
nh l 2(i vi mng Nron a khp ni): ng vi mi mng Nron a
khp ni gm s Nron c hai trng s
ji
e v
ji
z vi tnh ng kch hot sau:
( 1) ( ) 1 ( )
1
( ( )) ( )
S
g g m g
j ji j ji j j
i
net f net z f net i e
+
=
= + +
(36)
y f l hm c gii hn, lin tc v c gi tr thc. Nu f tha mn iu
kin:
1 ' **
max
1
( )
m
ji ji
f z c
s
e
+ + = u = u
s
j
s
i
j i ji
m
i ji s
i net f z net f net net net net
1 1
1
2 1
)) ( . )) ( ( ( ) ,..., , ( ) ( e (39)
Vi khng gian
s
9 l y vi Metric la chn, th ta c:
( , )
1
s
d netx nety netx nety netx nety
i i
i
= =
=
, ,
s
netx nety e9 (40)
Ta c : ( ( ), ( )) ( ) ( ) u u = u u d netx nety netx nety =
1 1
1 1
( ( )) ( ( )) ( ) ( )
S S
m m
i i ji i i ji
j i
f netx f nety f netx f nety z e
= =
(
+
' 1 ' '
max max
1 1
( )
S S
m
i i ji i i ji
j i
f netx nety f netx nety z e
= =
(
s +
(41)
**
1 1
S S
i i
i j
netx nety c
= =
s
=
*
. ( , ) c d netx nety (42)
vi
* ** **
1
.
S
j
c c s c
=
= =
. Theo (37) th
**
(0,1) c e (43)
Theo (42) v (43) th nh x l nh x co. Do , theo iu kin ca
(37) th mng Nron ang xt l hi t v mt im duy nht.
4.5.2. S hi t FBACN lin tc ca Layer1
Hm kch hot ca Layer1 c xy dng nh sau:
2
( ) ( )
1 exp( . )
j
f net v
j j j
r net
j j
o
o = +
+
(44)
Vi
j
o v r
j
l cc s thc dng. Ta cn tm gi tr max ca o hm
cp 1 ca (44). Ta bit rng o hm cp 1 ca hm f l cc i ti x nu o
75
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
hm cp 2 ca hm f ti x l bng 0. Gi s
''
( ) 0 f net
j
= th ta c net
j
=0. V
vy, gi tr max ca
'
f l t cc i a phng ti net
j
= 0 v gi tr l
.
' '
( )
max
2
r
j j
f f net
j
o
= = (45)
Mt khc 2 0 n
ji
e s s nn ta c
' '
0 2 . 2 . .
max max
f n f n r
ji j j
e o s s = (46)
Ta c
1 1
. .
. .
n r r
j j j
s s n
j
o
o
= = (47) vi s l s lng Nron c trong mng
v s c p = trong lp Layer1. V th ta chn
1
. .
r
j
n s
j
o
< , v v vy ta c th
tm c mt hng s
**
(0,1) c e sao cho
1
' **
max
f c
ji
s
e < < (48) iu
ny tha mn iu kin ca nh l 1 nn ta c mng l hi t n mt im.
Kt lun: Qu trnh tnh ton v chng minh, ta c c kt qu sau:
Vi Layer1, mng tho mn gi thuyt ca nh l 1, nn mng hi t
Vi Layer2, mng tho mn gi thuyt ca nh l 2, nn mng hi t
4.6. Gii thut ca FBACN v FBACN vi vic hc
Gii thut ca FBACN c thc hin qua cc bc sau:
GII THUT CA FBACN
1) Thit lp cc gi tr c, m, , v cc h s i
v
, i
u
trong lp Layer1
v Layer2 tng ng.
2) t h s n nh
v
A v A
u
cho Layer1 v Layer2 tng ng.
3) Khi to ngu nhin cc trung tm cm
( 1).
,
i p l
v
+
i=1, 2, ..., c v
l = 1, 2.., n trong Layer1 v lp thnh vin
, i k fc
u M (
e vi k=1, 2,
76
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
...,n v i = 1, 2, ...,c trong Layer2.
4) Cp nht cc h s i
v v
j
o o = v i
u u
j
o o = v gi tr mng ban u
(0) 0 net
j
= vi j =1, 2, ..., s
(s=c.p trong Layer1 v s=n.c trong Layer2).
5) Thit lp ch s hi quy g =1 cho Layer1.
6) Trong Layer1, tnh ma trn trng s W theo cng thc (12), ma trn
tn hiu vo bn ngoi I theo cng thc (11), v gi tr mng NET
theo cng thc (4).
7) For j = 1 to s do
if
( ) ( 1)
. 0
g g
j j
net net
< then : / 2
j j
v v
o o = ;
8) For j = 1 to s do
if
( )
0
g
j
net > then :
j j
j
v v
v
o = +
else :
j j
j
v v
v
o =
9) if (( ) &( ) &... &( )
1 2
v v v v v v
s
o o o s A s A s A then goto 10)
else {g:= g+1; goto 6)}.
10) t ch s hi quy g=1 cho Layer2.
11) Trong Layer2, tnh ma trn trng s W theo cng thc (23), ma
trn trng s Z theo cng thc (24) v | |
T
I 2 ,..., 2 , 2 = v ma trn
I U Z NET
m
+ + =
> <
. W.U
1
.
12) For j = 1 to s do
if
( ) ( 1)
. 0
g g
net net
j j
< then : / 2
u u
j j
o o = ;
13) For j = 1 to s do
if
( )
0
g
net
j
s then : u u
u
j j
j
o = +
77
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
else u : u
u
j j
j
o =
14) if (( ) &( ) &... &( )
1 2
u u u u u u
s
o o o s A s A s A then goto 15)
else {g:= g+1; goto 11)}.
15) if
( ) ( 1) g g
U U c
s
then Stop else goto 4)
79
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
CHNG 5
CI T TH NGHIM V NG DNG
5.1. Ci t th nghim thut ton FCM ........................................................................
5.2. ng dng thut ton FCM-Ci tin vo nhn dng nh ..........................................
79
82
Chng ny trnh by kt qu xy dng chng trnh th nghim ca
thut ton FCM v ng dng thut ton FCM-Ci tin vo qu trnh nhn
dng nh.
5.1. Ci t th nghim thut ton FCM
FCM l mt thut ton c p dng kh nhiu trong phn cm d liu
v hiu nng v tnh hin thc ca n kh tt. Thut ton FCM c bt u
bng cch chn C cm v chn ngu nhin c im lm trung tm cm hoc
chn phn hoch ngu nhin C cm v tnh trng tm ca tng cm ny. Nu
s lng d liu nh hn s cm th ta gn mi d liu l mt trng tm ca
cm, mi trng tm s c 1 s cm. Nu s lng d liu ln hn s cm, vi
mi d liu, ta tnh ton tng t c trng s gia im v trng tm
cm v ly khong cch ti thiu. D liu ny thuc v cm c khong cch
ti thiu ti d liu . Khi chng ta khng chc chn v v tr ca trng tm,
ta cn iu chnh v tr trng tm da vo d liu cp nht hin ti. Sau ,
ta gn tt c d liu ti trng tm mi ny. Qu trnh ny c lp li cho ti
khi khng cn d liu di chuyn sang cm khc. V mt ton hc, vng lp
ny c th chng minh l hi t cc tiu cc b.
Qu trnh ci t ca thut ton c m phng thng qua giao din
ca chng trnh nh Hnh 5.1 v Hnh 5.2 di y:
Ngn ng s dng l Visual C++ 6.0
Tham s ban u: S cm = 3, tham s m m = 2
D liu u vo l cc im mu khc nhau
80
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 5.1: Giao din ca chng trnh khi khi ng
Khi ngi s dng nhp s cm vo khung Nhp s cm, kch chut
vo khung chng trnh to ra cc im ca cm, v tr ca cc im c
th hin khung To xy. Chng trnh s t ng to ra cc cm d
liu bng cch ti gin tng bnh phng cc khong cch gia d liu v
trng tm cm tng ng khi ta kch chut vo khung chng trnh to ra
mi im. Mi im v ta ca n biu th cho mt i tng vi m t
hai thuc tnh ca i tng l mu sc ca im v s nhn biu th cho
cm.
81
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Di y l hnh nh thu c khi chy chng trnh vi s cm nhp vo l
8 cm..
Hnh 5.2: Giao din ca chng trnh khi lm vic
Chng trnh t ng phn thnh 8 cm thng qua s mu hin trong
tng cm v tm ca mi cm.
82
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
5.2. ng dng thut ton FCM-Ci tin vo nhn dng nh
Bi ton nhn dng chnh l qu trnh phn loi cc i tng c
biu din theo mt m hnh no v gn cho chng vo mt lp da theo
cc quy lut v cc mu chun. Nhn dng c rt nhiu ng dng, c p
dng vo rt nhiu lnh vc, chng hn nh nhn dng vn tay, nhn dng ch
vit, nhn dng nh V phn cm mu l mt bc rt quan trng trong qu
trnh nhn dng nh.
Do s lng im nh l rt ln, thng trn 80.000 im nh v s
lng mu ca mu d liu nh l ph thuc vo sc nt ca nh. Nu nh
c cht lng cng tt th s lng mu cng ln, nhng d nh c cht lng
nh th no i na th s lng mu vn ln. Mt khc, trong nhn dng nh,
chng ta ch quan tm ti mt s yu t nht nh, chng hn nh mt, lng
my, ming v da,... nn s lng mu m ta quan tm cng khng ln lm,
v vy p dng thut ton FCM-Ci tin vo vic phn cm mu trong nhn
dng nh l mt ng dng rt cn thit trong bi ton ny.
Qu trnh ng dng ca thut ton FCM-Ci tin c m phng thng
qua giao din ca chng trnh vi Hnh 5.3, Hnh 5.4 v Hnh 5.5 di y:
Ngn ng s dng l Visual C++ 6.0
Tham s ban u: Khai bo mng lu tr s lng mu ca nh, mng
lu tr s trung tm ca cm, s lng cm, tham s m.
D liu u vo l mt File nh mu(Bitmap)
D liu u ra l mt nh mu c nhn dng vi s cm mu
c thut ton FCM-Ci tin thc hin phn cm.
83
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 5.3: Giao din ca chng trnh khi khi ng
Khi chng trnh khi ng xong, ta chn mt nh ngun thc hin
bng cch n vo nt M File nh v chn mt nh cn thc hin nh Hnh
5.4 di y:
84
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 5.4: Giao din ca chng trnh khi chn nh phn cm
Sau khi chn xong, ta n vo nt Thc hin phn cm. Chng trnh s
thc hin qu trnh nhn dng v phn cm mu theo thut ton FCM-Ci tin
v hin th kt qu khung nh ch nh Hnh 5.5 di y.
85
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
Hnh 5.5: Giao din ca chng trnh khi thc hin phn cm
86
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
KT LUN
Trong qu trnh tm hiu v hon thnh lun vn tt nghip vi ti
Nghin cu mt s phng php phn cm m v ng dng, d t
c nhng kin thc nht nh, nhng em nhn thy phn cm d liu trong
KPDL ni chung v phn cm d liu m ni ring l mt lnh vc nghin
cu rng ln, nhiu trin vng. ti c gng tp trung tm hiu, nghin
cu v trnh by c mt s k thut v thut ton phn cm d liu ph
bin, mt s k thut phn cm m v m hnh mng nron a khp dng cho
phn cm m trong KPDL hin nay, trnh by mt s ci tin ca thut ton
phn cm m(FCM-Ci tin) da trn cc phng php c, ci t th
nghim thut ton phn cm m(FCM) vi ng dng phn cm cc im mu
v thc hin ci t ng dng ca thut ton FCM-Ci tin i vi vic phn
cm mu trong bi ton nhn dng nh mu.
Tuy nhin, do nhng hn ch v ti liu v thi gian nn em mi ch
tm hiu c mt s k thut in hnh trong phn cm v c bit l phn
cm m, ci t v th nghim mt s thut ton ng dng .... nhng cn
mt s k thut khc vn cha c tm hiu v khai thc, ci t th nghim
cha p dng c cho bi ton phn cm tng qut....
Trong thi gian ti em s tip tc nghin cu thm mt s k thut
phn cm v c bit l cc thut ton phn cm m kt hp song song ng
dng vo mt s bi ton thc t Vit Nam hin nay v hy vng s dn a
nhng kin thc c t ti ny sm tr thnh thc t, phc v cho cuc
sng con ngi chng ta.
Hc vin thc hin
An Hng Sn
87
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
TI LIU THAM KHO
Ti liu Ting Vit:
1. Phan nh Diu (1999), L Gch trong Cc H Tri Thc, NXB
i hc Quc gia H Ni, H Ni.
2. Nguyn Trng Thun, iu khin Logic v ng dng, Nh xut
bn Khoa hc v K thut, 2004.
3. Bi Cng Cng v Nguyn Don Phc, H m, mng nron v
ng dng , NXB Khoa hc v k thut, 2006.
4. V Thanh Nguyn, ng dng logic m, mng nron m, h cc
lut m phn tch d bo cc mt hng chin lc, Hi tho khoa
hc H m, mng nron v ng dng, ln 1, H ni 8-9/11/2006.
5. Ng Quc To, Gio trnh X L nh, Lp CHCLC-H Cng
Ngh-HQG H Ni 2001-2002.
6. Ng Quc To, Bi ging mn Data Mining, Lp CHK5-H Thi
Nguyn 2006-2008.
7. Ng Quc To, Bi ging mn X L nh, Lp CHK5-H Thi
Nguyn 2006-2008.
Ti liu Ting Anh:
8. Daniel T. Larose, Discovering Knowledge in Data: An
Introduction toData Mining, ISBN 0-471-66657-2 CopyrightC
2005 John Wiley & Sons, Inc.
9. A. Arning, R. Agrawal, and P. Raghavan. Alinear method for
deviation detection in larger databases, In Proc. 1996 Int. Conf.
Data Mining and Knowledge Discovery (KDD-96), Portland,
Oregon, August 1996.
88
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
10. P.S. Bradley, U. Fayyad, C. Reina, Scaling Clustering Algorithms
to Large Databases, In Proc of 4th International conference on
Knowledge Discovery and Dala Mining (Kdd-98), New York. 1998.
11. D. Fisher, Knowledge acquisition via incremental conceptual
clustering, In Machine Learning, 2 pp. 139-/72, 1987.
12. D. Gibson, J. Kleinberg, P. Raghavan, Clustering Categorical
Data: An Approach Based on Dynamical Systems, VLDB Journal 8
(3-4) pp. 222-236, 2000.
13. J. Han, M. Kamber, Data Mining Concepts and Techniques,
Morgan Kaufmann Publishers, 2001.
14. A.K. Jain, R.C. Dubes, Algorithms for clustering data, Ptentice
Hall, Englewood Cliffs, NJ, 1988.
15. R.A. Jarvis, E.A. Patrick, Clustering using a similarity measure
based on shared near neighbors, IEEE Transactions on Computers
C22, pp. 1025-1034, 1973.
16. M. Manago, Y. Kodratoff, Inuction of Decision Trees from
Complex Structuted Data, In Knowledge Discovery in Databases,
AAAI/Th MIT press, pp. 289-306, 1991.
17. J.C.Bezdek, Pattern Recognition with fuzzy Objective Function
Algorithms, New York, Plenum, 1981.
18. W.Pedrycz, Algorithms of fuzzy clustering with partial
supervision, Pattern Recognition, vol. 23, pp.121-146, 1990.
19. M.P.Windham, Cluster validity for fuzzy clustering algorithms,
Fuzzy Sets and System, vol. 3, pp. 177-183, 1981.
20. W.Pedrycz, Algorithms of fuzzy clustering with partial
supervision, Pattern Recognition, vol. 23, pp.121-146, 1990.
89
S ha bi Trung tm Hc liu i hc Thi Nguyn http://www.lrc-tnu.edu.vn
21. G.Bueno, R.Gonzalez, J.Gonzalez, and M.Garcia-Rojo, Fuzzy
colour C-means clustering for pattern segmentation in histological
images, The 3rd European Medical and Biological Engineering
Conference, 2005.
22. Chih-Hsiu Wei, Chin - Shyurng Fahn, The multisynapse neural
network and its application to fuzzy clustering.
23. J.H.Wang and C.Y.Peng, Optimal clustering using neural
network, in Proc. IEEE Int. Conf. Syst., Man, Cybern., vol.2, 1998,
pp.1625-1630.
24. Y.Guo, X.Yin, and W.Gong, ART2 neural network clustering for
hierarchical simulation, in Proc. SPIE Int. Soc.Opt.Eng., vol.
2.1998, pp.35-48.
25. M.F.Augusteijn and U.J.Steck, Supervised adaptive clustering: A
hybrid neural network clustering algorithm, neural
Comput.Applicat., vol.7,no. 1, pp.78-89, 1998.
26. E. C. Tsao, J. C. Bezdek, and N. R. Pal, Fuzzy Kohonen
clustering network, Patterm recognition, vol.27, no.5, pp.757-764,
1994.
27. J. Lin, K. Cheng, and C.Mao, A fuzzy Hopfield neural
network for medical image segmentation, IEEE Trans. Nuclear
Sci., vol.43, 1996.
28. Hathaway R.J and Bezdek J.CNTT (2000), Generalized
fuzzy c-means clustering Strategies using LP Norm Distances,
IEEE Trans.Fuzzy Syst, No 5, pp.576-582.
29. J.E.Steck and S.N.Balakrishnan, Use of Hopfield newral networks
in optimal guidance, IEEE Trans. Aerosp.Electron. Syst., vol.30,
no.1, pp 287-293, Jan.1994.