You are on page 1of 8

Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand

Conference on Information Technology Application in Education and Training



140



S DNG MNG KOHONEN PHN CM
MT S C S D LIU TRONG TRNG HC

CN. HUNH TH KIM NGN
Trng CSP Qung Tr

Ngy nay, vi kho thng tin khng l trn Internet th vic khai ph d
liu v ng dng khai ph d liu vo cuc sng l mt trong nhng kha
cnh ang c quan tm ca ngnh CNTT. Trong gii hn bi bo ti xin
trnh by v mt trong s nhng phng php khai ph d liu l s dng
mng Kohonen phn cm d liu (bao gm: c tnh, cu trc, cch thc
hun luyn, v dch mng). ng thi bi vit cng trnh by hai ng dng c
th ca mng Kohonen trong trng hc l: phn cm mn hc v sinh
vin theo im hc tp.

1. GII THIU
Ngy nay, mi ngi thng ni chng ta ngp chm trong d liu nhng li thiu tri
thc. Qu tht, s pht trin ca cc thit b lu tr, cc h qun tr c s d liu to iu
kin cho vic lu tr d liu ngy cng d dng, dn n vic hnh thnh mt khi lng d liu
khng l, i hi chng ta - nhng ngi s dng - phi bit khai thc, chn lc d liu c ch
cho mnh gia mt bin d liu y. cng l l do Data Mining (DM) ra i.
Thng tin gio dc cng c lu tr trong my tnh v chng ta cn phi tm nng
thng tin c ch t c s d liu phc v cho vic nng cao cht lng gio dc.
Trong gii hn ca bi bo ny, ti xin trnh by vic: S dng mng Kohonen
phn cm mt s c s d liu trong trng hc, t nhm gp phn khng nh th
mnh ca khai ph d liu (KPDL) v thc y ng dng KPDL vo gio dc - o to.
Trong xu th nn gio dc i hc nc ta ang chuyn sang o to theo hc ch tn ch
th cc h thng o to trc tuyn xut hin ngy cng nhiu. V vy, vai tr ca KPDL
ngy cng tr nn quan trng.
2. NI DUNG
2.1. Gii thiu mng Kohonen
a) Gom cm l g? [1]
Cm d liu l tp hp cc i tng c nhng tnh cht no tng t nhau mt
mc no trong tp d liu.
Gom cm d liu l tin trnh nhn din cc cm tim n trong tp cc i tng cha
c xp lp. Tin trnh khai thc cm da trn mc tng t gia cc i tng. Cc
i tng c gom cm sao cho mc tng t gia cc i tng trong cng mt cm
l cc i v mc tng t gia cc i tng nm trong cc cm khc nhau l cc tiu.
Cc cm c c trng bng cc tnh cht chung ca tt c cc i tng trong cm. Do
vy, kho st cc cm s gip khi qut, nhanh chng rt ra cc c im ca khi d liu
ln.
Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

141
b) Mng Kohonen l g?
Mng Kohonen (hay cn gi l Self-Organizing Maps vit tt l: SOMs) l mt k thut
trc quan d liu c pht minh bi Gio s Teuvo Kohonen, n gip gim s chiu (kch
thc) ca d liu, qua gip chng ta hiu c d liu c kch thc ln . [9]
im ni bt ca thut gii SOM l n s chuyn cc dng quan h phc tp, thng k ri
rc gia cc i tng d liu a chiu (n chiu) v cc quan h hnh hc trc quan n gin vi
s chiu c rt gn i nhiu ln thng l mt hoc hai chiu. V bn cht thut gii c
bit n nh l k thut nn d liu da trn vector trng s (vector quantisation). [3]
Mt trong nhng kha cnh th v nht l: SOM l mt dng ca thut ton hc
phn loi d liu hun luyn m khng cn bt c s gim st bn ngoi no hc khng
gim st. [3]
c) To v hun luyn mng Kohonen
Cu trc ca mng Kohonen [4[5]
Mng Kohonen hai chiu gm c 2 lp l lp vo v lp ra Kohonen.
Lp vo (Input layer): dng a d liu hun luyn
vo mng Kohonen. Kch thc ca lp vo tng ng vi
kch thc ca mi mu hc.
Trong mng Kohonen hai chiu, cc neural ca
lp ra c sp xp trn mt mng 2 chiu. Mng ny
c gi l lp ra Kohonen.
Tt c cc neural ca lp vo u c ni vi cc
neural trn lp ra Kohonen. Mi lin kt gia u vo v
u ra ca mng Kohonen tng ng vi mt trng s.
Kch thc ca mi vector trng s bng kch thc ca lp
vo. [2] Ta c th hnh dung cu trc ca mng Kohonen hnh bn :

Gii thut hun luyn mng:
Khi to tham s thi gian t: t = 0.
Bc 1: Khi to vector trng s W
ij
cho mi neural i trong mng Kohonen.
Bc 2: Ly mt mu hun luyn x t tp hun luyn.
Bc 3: Tm mu khp tt nht (BMU) - phn t neural chin thng:
Duyt tt c cc nt v tnh khong cch Euclide gia vector trng s ca
mi nt v vector nhp hin hnh.
Cng thc tnh khong cch Euclide c cho nh sau:
( )
n
2
i i
i = 0
i s t = V - W D


V: vector nhp hin hnh
W: vector trng s ca phn t c chn
Nt c vector trng s gn nht vi gi tr ca vector nhp s c chn l
BMU
Bc 4: Xy dng cc phn t ln cn.
Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

142
c tnh duy nht ca thut ton hc Kohonen l vng ln cn ca BMU c xy
dng trn vector khong cch s c co li sau mt s ln lp nht nh. iu ny c
thc hin bng cch co li bn knh ca vng ln cn theo s ln lp.
Php co s c thc hin theo hm m ni suy sau:
0
( ) exp
t
t


=



(t = 1, 2, 3)

: bn knh ln cn ca BMU ti thi im t

0
: bn knh ln cn ca BMU ti thi im t
0
.
: hng s thi gian
t: l bc lp hin ti
Gi tr ca hng s ph thuc vo v s ln lp chy gii thut. N c tnh
theo cng thc sau: = N/log(
0
)
N: s ln lp chy gii thut (s ln hc ca mng).
v s c dng tnh bn knh ln cn trong mi ln lp ca gii thut.
Khi bn knh ln cn c xc nh, vic xc nh cc phn t ln cn ca BMU s
c thc hin n gin bng cch duyt tt c cc phn t trong mng xem n c nm
trong bn knh ln cn hay khng.
Bc 5: Hiu chnh trng s ca cc phn t ln cn qu trnh hc ca gii thut SOM.
Trng s ca cc phn t ln cn c xc nh bc trn bao gm c BMU s
c iu chnh chng c gi tr gn ging vi gi tr ca vector nhp hn. Phn t cng
gn vi MBU, th trng s ca n s cng d b thay i nhiu hn. Cc vector trng s s
c hiu chnh theo cng thc sau:
( 1) ( ) ( ) ( )( ( ) ( )) W t W t t L t V t W t + = +

: Hm ni suy theo thi gian hc. N th hin s tc ng ca khong cch i vi


qu trnh hc v c tnh theo cng thc sau:
2
2
( ) exp
2 ( )
dist
t
t

=



(t = 1, 2, 3)
dist: l khong cch t mt neural n neural chin thng.
t: bc lp hin ti
L: tc hc (s gim dn theo s ln lp)
Biu thc trn cho thy trng s ca mt nt sau khi hiu chnh chnh l gi tr trng
s c W ca n cng thm phn gi tr khc bit gia trng s W v vector nhp V theo h
s tc hc.
Hm ni suy tc hc L(t) cho mi bc lp c tnh theo cng thc sau:
0
( ) exp
t
L t L


=



(t = 1, 2, 3)
L
0
: Gi tr khi to ban u ca tc hc.
: hng s thi gian
Cng tin dn v im gia th tc hc s cng ging vi hm m ni suy ca php co.
Tc hc s c ni suy dn theo tc hc v gi tr ca hm s tin dn v khng khi
s ln lp t n nhng bc cui cng.
Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

143
Bc 6: Vng lp
Tng t, ly mu hc tip theo. Lp li bc 2 cho n khi gii thut ti u hoc t
n s ln lp xc nh N cho trc.
Ch : S lng vector ca tp hun luyn l c gii hn. cho vic hun luyn
mng a li kt qu tt nht i hi s bc hun luyn phi ln, v nu kch thc ca
tp hun luyn l nh th nhng mu hun luyn c th c s dng lp li. Nhng mu
c th c p dng u n theo chu k hoc theo mt trt t hon v ngu nhin. [9]
Trc quang mng.[8]
The Umatrix (The Unified distance matrix ma trn ng nht khong cch) l
phng thc c s dng ph bin nht th hin khong cch gia cc neural mt
cch trc quan. Phng thc ny s dng khong cch gia cc neural nh l mt tiu
chun xc nh ranh gii gia cc cm d liu. N cha khong cch Euclid t mi
neural trung tm n cc lng ging ca n.
Sau khi U-Matrix c tnh xong th s c hin th di dng li cc hnh lc gic
nh mt cng c trc quan mng. Khong cch gia cc neural cng c hin th vi
nhng mu sc khc nhau.
Dch mng. [5,6]
Xc nh ranh gii gia cc cm.
Cch dch U-matrix nh sau: nu gi tr khong cch gia cc neural ln biu th cho
nhng d liu khng ging nhau (khng cng cm); v ngc li gi tr b biu th cho
nhng d liu ging nhau. Ta c th tng tng trong khng gian 3 chiu gi tr cng ln
th cao cng ln v do n to thnh nhng bc tng (Walls) phn ranh gii trong
khi gi tr b to thnh nhng thung lng (valley) b bao quanh bi nhng bc tng.
Do d liu trong nhng thung lng l c gp nhm cng nhau v hin th mt cm.
Khong cch gia cc neural sau khi c tnh, c th c hin th vi nhng mu khc
nhau. Mu ti gia cc neural biu th cho khong cch ln. Mu sng gia cc neural biu th
cc gi tr nhp l gn nhau. V do vng mu sng biu th d liu cng thuc mt cm v
vng mu ti chnh l ranh gii phn cch gia cc cm. Vic hin th ny gip ta c mt ci
nhn trc quan, y , r rng, chnh xc v d liu.

Xc nh tnh cht ca tng cm: [2]
Vic xc nh tnh cht ca tng cm c thc hin bng vector trng s ca cc
neural thuc cm . S dng phng php ca Sestilo & Dillon,1997. Thnh phn th k
ca vector trng phn nh mc ng gp ca thnh phn th k ca vector nhp. Gi
(w
ij1
,w
ij2
,. . . , w
ijn
) l vector trng ti nt i,j trn lp ra Kohonen. Tm
w
ijmax
=max((w
ij1
,w
ij2
,. . . , w
ijn
).
Cc u vo k c | w
ijmax
- w
ijk
| < T vi T l ngng nm trong khong t 0 n 1. Gi
tr ca ngng T phn nh mc bin thin quanh tr W
ijmax
.
Mt s ng dng ca mng Kohonen trong trng hc
D liu dng thc hin trong phn ny c ly t kt qu hc tp ca sinh vin
lp SPTinK11 khoa CNTT, trng CSP Qung Tr nin kha 2006 2009.
a) Phn cm mn hc
Vic phn cum mn hc c th cho ta thy ton cnh v cc mn hc trong chng trnh;
c c s so snh gia cc mn hc; c s nh gi ng n v tnh cht ca mn hc v
Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

144
kh nng nhn thc ca ngi hc t c bin php tc ng ph hp ci thin cht lng
dy v hc.
D liu:



Hnh nh cc cm ca mng:

Kt qu
Cm Cc mn hc
I
MON23, MON26, MON29, MON30, MON 06, MON09, MON12, MON15,
MON16
II MON01, MON02, MON03, MON04, MON05, MON07, MON17
III
MON17, MON08, MON24, MON28, MON13, MON25, MON31, MON14,
MON18, MON19, MON20, MON21, MON27
IV
MON10,MON22,MON11,MON32,MON33,MON34,MON35,MON36,MON37,
MON38
Cm I: (bao gm cc mn hc c neural so khp tt nht (BMU) l neural 11 v
neural 12) cc mn hc thuc cm ny c t l im di 5; im 5, 6; im 7; im 8;
im 9, 10 tng ng l: 0,3; 0,3; 0,120,2; 0,06; 0,10,05. y l cm cc mn hc c s
lng SV thi li nhiu nht.
Cm II: (BMU: neural 13) cc mn hc thuc cm ny c t l im di 5; im
5, 6; im 7; im 8; im 9, 10 tng ng l: 0,25; 0,58; 0,1; 0,04; 0,006. y l cm cc
mn hc c s lng SV thi li t hn cm 1 nhng im s ch yu li l 5, 6.
Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

145
Cm III: (BMU: neural 21; 22; 23; 31) cc mn hc thuc cm ny c t l im
di 5; im 5, 6; im 7; im 8; im 9, 10 tng ng l: 0,040,17; 0,260,37;
0,150,41; 0,150,24; 0,060,1. Cm ny c s lng sinh vin thi li v t im 9, 10 l
t v tp trung vo cc im s cn li.
Cm IV: (BMU: neural 32; 33) cc mn hc thuc cm ny c t l im di 5;
im 5, 6; im 7; im 8; im 9, 10 tng ng l: 0,007; 0,2; 0,160,26; 0,260,39;
0,20,1. y l cm cc mn hc c t l thi li t nht v t l im 8, 9, 10 l cao nht
trong tt c cc cm.
Qua phn tch nh trn ta thy:
Cm IV c th xem l cm cc mn hc d nht iu ny c th do: [1]
Ging vin truyn t mn hc tt
kim tra d.
Sinh vin hng th hc tp.
Cc cm III c th xem l cm c tnh cht mn hc ph hp vi trnh ca SV v
phn loi c SV
Cm I v II l cm bao gm cc mn hc kh. iu ny c th do: [1]
Ging vin truyn t mn hc cha tt
kim tra kh.
Sinh vin khng hng th hc tp.
Cc nhn nh trn s gip rt nhiu cho vic t chc ging dy, b tr ging vin
nhm nng cao cht lng ging dy, hc tp [1].
b) Phn cm sinh vin theo kt qu hc tp tng mn trong nhiu hc k.
Mi sinh vin khc nhau c mt kh nng khc nhau trong vic hc cc mn hc, mt
ngi c th hc gii mn ny nhng li yu mn khc. Vic thy c kh nng rt c
li cho vic p dng phng php dy hc c bit ha (mt phng php dy hc c trng
ca hc ch tn ch); vic nh hng ngi hc trong nhng bc i sau ny.
D liu:


Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

146

Kt qu:
S dng mng Kohonen ta c th phn tch d liu SV thnh 7 cm sau:
Cm M sinh vin Tnh cht cm
I HVBin, HMDng, NTH, TMT (BMU:11)
Nhm SV c im cc mn hu ht t
TB
II DADng,MTMPhng, LHTon (BMU: 12) Nhm cc SV c im cao u cc mn
III NTH, TTHa, VMLi, TTMNhn (BMU: 13) Nhm SV c im nhiu mn di TB
IV
HQBo,LTMHu,LTHa,TTANgc,NTQNh
(BMU: 21)
Nhm SV c im hu ht cc mn u
cao ch c 2 mn TL v CTDL &GT thp
hn.
V
NTVAnh,PTt,LTH,HTTHin,HTHHa,
NTHuyn,VTLoan, HTNhung,NTOanh,
TTPTho,HTDTrang (BMU: 22)
Nhm SV ny c im thp u tt c cc
mn.
VI
NTKAnh,VTHu,NTPhng,TTTm,
TTTChu,NVCng,NHDng,NTTHin,
NTKHong,VTHng,LQKhnh,HNNguyn,
HTHNhi,HTHNh,HTThy,HNVinh,MV
(BMU: 23, 32, 33)
Nhm SV ny c im cc mn LT,
CTDL&GT, OOP hu ht di 5 cn cc
mn khc im cao hn.
VI
TPHng, NVMng, TVNam, NANht, TPh,
THPhng, NTThu, NLTng (BMU: 31)
y l nhm SV c im rt thp hu ht
cc mn u c im <5.

Du trn nhng phn tch trn ta c th phn nhm cc hc sinh theo kh nng hc t
c bin php ph hp nng cao cht lng gio dc.
3. KT LUN
Trn y ti trnh by v mng Kohonen v ng dng ca n trong vic phn cm
d liu trng hc m c th l gom cm d liu im mn hc v d liu hc sinh theo
kt qu hc tp; ng thi a ra mt s nhn xt.
Tuy nhin trong thi gian hn ch bi bo trnh by cn c nhiu thiu st trong vic
ci t cng nh ng dng mng Kohonen trong thc t.
Mng Kohonen l mt phng php khai ph d liu mnh tuy nhin c th t
c kt qu khai ph tt, hon ho hn cn kt hp cc phng php KPDL khc v cn
nhng kin ca chuyn gia trong lnh vc gio dc c th a ra nhng kt lun chnh
xc nhm nng cao cht lng gio dc.




TI LIU THAM KHO

[1] Phc, Hong Kim, Trin khai ng dng khai ph d liu vo mt s c s d
liu trong trng hc.
[2] Ch nhim ti: TS. Phc TTPht trin CNTT, HQG-HCM ng ch
nhim ti: Th S Nguyn Minh Hip Th vin cao hc, HKHTN, Xy dng mt s
cng c h tr tra cu v tng hp thng tin trong th vin s.
Quang Tri Teacher Training College, Vietnam Ubon Ratchathani Rajabhat University, Thailand
Conference on Information Technology Application in Education and Training

147
[3] Hunh Ph Trung, Kha lun mn hc: Khai ph d liu, Ging vin hng dn:
TS KH Phc.
[4] Ultsch, Self Organizing Neural Networks perform different from statistical k-means
clustering
[5] Jason Ong, Syed Sibte Raza Abidi, Data Mining Using Self-Organizing Kohonen
maps: A Technique for Effective Data Clustering & Visualisation
[6] Jonas Karlsson, Barbro Back, Hannu Vanharanta, Analysing Financial
Performance with Quarterly Data Using Self-Organising Maps.
[7] Juha Vesanto, SOM algorithm implementation in SOM Toolbox.
[8] Tom Germano (March 23, 1999), Self Organizing Maps.
[9] Teuvo Kohonen, Useful instructions when applying the SOM

You might also like