You are on page 1of 32

45

Chng 1:
Chng 2:

Tng quan v k thut khm ph tri thc v khai ph d liu..........................46


Logic m v iu khin m ............................................................................46

Chng 3: Lut kt hp m ....................................................................................... 47


3.1. Lut kt hp ...........................................................................................................47
3.1.1.
Cc khi nim ................................................................................................47
3.1.1.1.
Lut kt hp ...........................................................................................47
3.1.1.2.
h tr v tin cy ...........................................................................48
3.1.2.
Mt s hng tip cn trong khai ph lut kt hp..........................................52
3.2. Lut kt hp m .....................................................................................................53
3.2.1.
M ha d liu ...............................................................................................54
3.2.1.1.
M ha cc thuc tnh im ....................................................................54
3.2.1.2.
M ha thuc tnh hnh kim .................................................................56
3.2.1.3.
M ha thuc tnh mc u tt nghip ...............................................57
3.2.2.
Nhng u im ca vic p dng tp m ri rc ha d liu ......................59
3.2.3.
Khai ph lut m ............................................................................................60
3.2.4.
Thut ton khai ph lut kt hp m ...............................................................63

46

Chng 1: Tng quan v k thut khm ph tri thc


v khai ph d liu
Chng 2: Logic m v iu khin m

47

Chng 3: Lut kt hp m
3.1.

Lut kt hp
Lut kt hp c gii thiu t nm 1993, bi ton khai ph lut kt hp nhn

c rt nhiu s quan tm ca nhiu nh khoa hc. Ngy nay vic khai ph cc lut
nh th vn l mt lnh vc quan trng trong khai ph d liu. Lut kt hp gip
chng ta tm c cc mi lin quan gia cc mc d liu (items) ca c s d liu
(CSDL) [11]. Lut kt hp l dng kh n gin nhng mang li nhiu hiu qu.
Thng tin v nhng dng lut ny rt quan trng v h tr khng nh trong qu trnh
ra quyt nh.
3.1.1. Cc khi nim
3.1.1.1.

Lut kt hp

Cho mt tp I = {I1, I2, ..., Im} gm m mc (Item). Tp X I c gi l tp mc


(itemset)
T={t1, t2,,tn}l tp gm n bn ghi (record cn gi l giao tc transaction), mi
bn ghi t l mt tp mc, c nh danh bi TID (Transaction Identification).
Tng t nh khi nim tp hp, cc bn ghi khng c trng lp, nhng c th
ni rng tnh cht ny ca tp hp v trong cc thut ton sau ny, ngi ta u gi
thit rng cc khon mc trong mt bn ghi v trong tt c cc tp mc khc, c th coi
chng c sp xp theo th t t in ca cc mc.
Gi D l CSDL ca n bn ghi v mi bn ghi c nh nhn vi mt nh danh
duy nht.
V d 3.1: cho c s d liu (dng giao dch):
I={A, B, C, D, E}
T={1, 2, 3 ,4 ,5 ,6}

48

Bng 3.1. CSDL D dng giao tc.


TID

Tp mc

AB DE

BC E

AB

ABC E

AB DE

B D

Mt lut kt hp R c dng X Y . Trong X, Y l tp cc mc. X, Y I v


X Y=. X c gi l tin v Y c gi l h qu ca lut.
C hai o quan trng i vi lut kt hp: h tr (support) v tin cy
(confidence).
3.1.1.2.

h tr v tin cy

nh ngha 3.1: h tr ca mt tp hp X trong c s d liu D l t l gia cc


bn ghi T D c cha tp X v tng s bn ghi trong D (hay l phn trm ca cc bn
ghi trong D c cha tp hp X), k hiu l Support(X) hay Supp(X).
Supp ( X )

{T D : T X }

(3.1)

Ta c: 0 Supp ( X ) 1 vi mi tp hp X.
nh ngha 3.2: h tr ca mt lut kt hp X Y l t l gia s lng cc
bn ghi cha tp hp X Y vi tng s cc bn ghi trong D - K hiu Supp ( X Y ) .
Supp ( X Y )

T D : T X Y
D

(3.2)

Khi chng ta ni rng h tr ca mt lut l 50%, c ngha l c 50% tng s


bn ghi cha X Y . Nh vy, h tr mang ngha thng k ca lut.

49

Trong mt s trng hp, chng ta ch quan tm n nhng lut c h tr cao


(V d nh lut kt hp xt trong ca hng tp phm). Nhng cng c trng hp, mc
d h tr ca lut thp, ta vn cn quan tm (v d lut kt hp lin quan n
nguyn nhn gy ra s t lin lc cc tng i in thoi)
nh ngha 3.3: tin cy ca mt lut kt hp X Y l t l gia s lng cc
bn ghi trong D cha X Y vi tng s bn ghi trong D c cha X. K hiu tin cy
ca mt lut l Conf(r). Ta c 0 Conf (r ) 1
Conf ( X Y ) Supp ( X Y ) / Supp ( X )

(3.3)

C th nh ngha tin cy nh sau:


nh ngha 3.4: tin cy ca mt lut kt hp X Y l t l gia s lng cc
bn ghi ca tp hp cha X Y , so vi tng s cc bn ghi cha X.
Ni rng tin cy ca mt lut l 90%, c ngha l c ti 90% s bn ghi cha X
cha lun c Y.
Nh vy, tin cy ca lut th hin s tng quan (correlation) ga X v Y.
tin cy o tnh ng ca lut, v ngi ta hu nh ch quan tm n nhng lut c
tin cy cao.
Vic khai thc cc lut kt hp t c s d liu chnh l vic tm tt c cc lut c
h tr v tin cy do ngi s dng xc nh trc. Cc ngng ca h tr v
tin cy c k hiu l minsup v minconf. Tri thc em li bi lut kt hp c s
khc bit rt nhiu so vi nhng thng tin thu c t cc cu lnh truy vn d liu.
l nhng tri thc, nhng mi lin h cha bit trc v mang tnh d bo ang tim
n trong d liu. Nhng tri thc ny khng n gin l kt qu ca php nhm, tnh
tng hay sp xp m l ca mt qu trnh tnh ton kh phc tp.
nh ngha 3.5: Cho mt tp mc X I v ngng ph bin (ngng h tr)
ti thiu minsup (0,1] (c xc nh bi ngi s dng). Mt tp mc X c gi l
mt tp ph bin (Frenquent itemset) theo ngng minsup nu v ch nu h tr ca
n ln hn hoc bng minsup: Supp(X) minsup. K hiu cc tp ny l FI

50

Tnh cht 3.1: Gi s A, B I l hai tp hp vi A B th Supp ( A) Supp( B) .


Nh vy, nhng bn ghi no cha tp hp B th cng cha tp hp A.
Tnh cht 3.2: Gi s A, B I l hai tp hp, nu B l tp ph bin v A B th A
cng l tp ph bin.
Tht vy, nu B l tp ph bin th Supp(B) minsup, mi tp hp A l con ca tp
hp B u l tp ph bin trong c s d liu D v Supp(A) Supp(B) (Tnh cht 2.1).
Tnh cht 3.3: Gi s A, B l hai tp hp, A B v A l tp khng ph bin th B
cng l tp khng ph bin.
V d 3.2: trong CSDL cho bng 3.1, v gi tr ngng minsup = 50% s lit k
tt c cc tp ph bin nh sau:
Bng 3.2. Cc tp ph bin ng vi h tr ti thiu 50% trong bng 1.
Cc tp ph bin (FI)

h tr

100% (6/6)

E, BE

83% (5/6)

A, AB, AE, ABE

67% (4/6)

D, BD

50% (3/6)

nh ngha 3.6: Mt tp mc X c gi l ng (closed) nu khng c tp cha


no ca X c cng h tr vi n, tc l khng tn ti mt tp mc X no m
X ' X v t(X) = t(X) (vi t(X) v t(X) tng ng l tp cc bn ghi cha tp mc X v

X). K hiu tp ph bin ng l FCI.


nh ngha 3.7: Nu X l tp ph bin v khng c tp no cha X, ta ni rng X
l mt tp ph bin ln nht (maximally frequent itemset). K hiu tp tt c cc tp
ph bin ln nht l MFI. D thy MFI FCI FI .
Khng phi bt c lut kt hp no c mt trong tp cc lut c th c sinh ra
cng u c ngha trn thc t. M cc lut u phi tho mn mt ngng h tr v
tin cy cho trc. Thc vy, cho mt tp cc giao dch D, bi ton pht hin lut kt

51

hp l sinh ra tt c cc lut kt hp m c tin cy Conf ln hn tin cy ti thiu


minconf v h tr Supp ln hn h tr ti thiu minsup tng ng do ngi dng
xc nh. Hu ht cc bi ton khai ph lut kt hp c chia thnh hai bi ton con
hay tri qua hai pha:
Pha 1: Tm tt c cc tp mc ph bin (FI) t c s d liu, tc l tm tt c cc
tp mc X tha mn Supp ( X ) minsup.
Pha 2: Sinh cc lut tin cy t cc tp ph bin FI tm thy pha 1.
Nu X l mt tp ph bin th lut kt hp c sinh t X c dng:
C
X '
X \ X'

X ' l tp con khc rng ca X.


X / X ' l hiu ca hai tp hp X v X ' .

c l tin cy ca lut tha mn: c minconf.


V d 3.3: Trong bng 3.2 ta c tp ph bin ABE vi h tr 67%, chng ta c
cc lut kt hp sau y:
Bng 3.3. Cc lut kt hp t tp ph bin ABE.
Lut kt hp

tin cy (c)

c
A

BE

Supp(ABE)/Sup(A)=100%

c
B

AE

Supp(ABE)/Sup(B)=67%

c
E

AB

Supp(ABE)/Sup(E)=80%

c
AB

Supp(ABE)/Sup(AB)=100%

c
AE

Supp(ABE)/Sup(AE)=100%

c
BE

Supp(ABE)/Sup(BE)=80%

Nu chng ta cho minconf=90% th ch nhn cc lut kt hp:


A BE; AB E; AE B v chng c tin cy c minconf.

52

3.1.2. Mt s hng tip cn trong khai ph lut kt hp


Lnh vc khai ph lut kt hp cho n nay c nghin cu v pht trin theo
nhiu hng khc nhau. C nhng xut nhm ci tin tc thut ton, c nhng
xut nhm tm kim lut c ngha hn,, c th:
Lut kt hp nh phn (binary association rule): l hng nghin cu u tin ca
lut kt hp. Hu ht cc nghin cu thi k u v lut kt hp u lin quan n
lut kt hp nh phn. Trong dng lut kt hp ny, cc mc, thuc tnh, ch c quan
tm l c hay khng xut hin trong giao tc ca CSDL ch khng quan tm v mc
xut hin. Thut ton tiu biu nht khai ph dng lut ny l thut ton Apriori v
cc bin th ca n. y l dng lut n gin v cc lut khc cng c th chuyn v
dng lut ny nh mt s phng php nh ri rc ho, m ho,
Lut kt hp c thuc tnh s v thuc tnh hng mc (quantitative and
categorial association rule): Cc thuc tnh ca cc CSDL thc t c kiu rt a dng,
nh s nh phn, gi tr nh tnh, nh lng,... pht hin lut kt hp vi cc thuc
tnh ny, cc nh nghin cu xut mt s phng php ri rc ho nhm chuyn
dng lut ny v dng nh phn c th p dng cc thut ton c.
Lut kt hp tip cn theo hng tp th (mining association rules base on
rough set): Tm kim lut kt hp da trn l thuyt tp th.
Lut kt hp nhiu mc (multi-level association rules): Cch tip cn theo lut
ny s tm kim thm nhng lut c dng mua my tnh PC => mua h iu hnh
AND mua phn mm tin ch vn phng, thay v ch nhng lut qu c th nh
mua my tnh IBM PC => mua h iu hnh Microsoft Windows AND mua phn
mm tin ch vn phng Microsoft Office, . Nh vy dng lut u l dng lut
tng qut ho ca dng lut sau v tng qut theo nhiu mc khc nhau.
Lut kt hp m (fuzzy association rules): Vi nhng hn ch cn gp phi trong
qu trnh ri rc ho cc thuc tnh s (quantitave attributes), cc nh nghin cu

53

xut lut kt hp m nhm khc phc cc hn ch trn v chuyn lut kt hp v mt


dng t nhin hn, gn gi hn vi ngi s dng.
Lut kt hp vi thuc tnh c nh trng s (association rule with weighted
items): Trong thc t, cc thuc tnh trong CSDL khng phi lc no cng c vai tr
nh nhau. C mt s thuc tnh c ch trng hn v c mc quan trng cao hn
cc thuc tnh khc. y l hng nghin cu rt th v v c mt s nh nghin
cu xut cch gii quyt bi ton ny. Vi lut kt hp c thuc tnh c nh
trng s, chng ta s khai thc c nhng lut him (tc l c h tr thp, nhng
c ngha c bit hoc mang rt nhiu ngha).
Lut kt hp song song (parallel mining of association rule): Bn cnh khai thc
lut kt hp tun t, cc nh lm tin hc cng tp trung vo nghin cu cc thut gii
song song cho qu trnh pht hin lut kt hp. Nhu cu song song ho v x l phn
tn l cn thit bi kch thc d liu ngy cng ln hn nn i hi tc x l cng
nh dung lng b nh ca h thng phi c m bo. C rt nhiu thut ton song
song khc nhau xut c th khng ph thuc vo phn cng.
Bn cnh nhng nghin cu v cc bin th ca lut kt hp, cc nh nghin cu
cn ch trng xut nhng thut ton nhm tng tc qu trnh tm kim tp ph bin
t CSDL.
Ngoi ra, cn c mt s hng nghin cu khc v khai thc lut kt hp nh: khai
thc lut kt hp trc tuyn, khai thc lut kt hp c kt ni trc tuyn n cc kho
d liu a chiu thng qua cng ngh OLAP, MOLAP, ROLAP, ADO.
3.2.

Lut kt hp m

Khai ph lut kt hp c thuc tnh s hoc thuc tnh hng mc (Quantitative or


categorical association rule) l mt trong nhng hng tip cn quan trng trong lnh
vc khai ph lut kt hp.

54

3.2.1. M ha d liu
Cc thut ton khai ph lut kt hp nh phn ch c th p dng trn nhng c
s d liu quan h c thuc tnh nh phn hoc c s d liu dng giao dch, ch khng
th p dng trc tip cho cc c s d liu c thuc tnh s v thuc tnh hng mc. Do
, chng ta phi tin hnh m ha d liu cho cc thuc tnh s v thuc tnh hng
mc chuyn chng v dng thuc tnh nh phn.
m ha d liu chng ta phi xc nh cc hm thnh vin biu din gi tr
ngn ng cho cc bin m iu ny li khng thuc hn v nhim v nghin cu ca l
thuyt tp m. Chuyn gia th nhiu nht cng ch c th cung cp c kin v min
xc nh cho cc tp m. V vy, nn vic thit k cc hm thuc cho tp m hon ton
do ngi thit k m nhn.
Sau y l minh ha v hai phng php ri rc ha im cc mn hc ton,
l, ha, sinh, vn, s, a, anh vn, hnh kim v mc u tt nghip ca hc sinh.
3.2.1.1.

M ha cc thuc tnh im

Thuc tnh im ton (DT) ly c t im cui nm lp 10 ca cc hc sinh


tt nghip, tin hnh m ha vi cc hm thnh vin tng ng.
M ha thuc tnh im ton theo hnh tam gic:

K hiu
1
2
3
4
5

Tp m
DT_Gioi
DT_Kha
DT_TB
DT_Yeu
DT_Kem

Hnh 3.1. M hnh m ha thuc tnh im theo hnh tam gic.


Cc hm thnh vin ca cc tp m: Kem, Yeu, TB, Kha, Gii c tnh nh sau:
3 x

Kem ( x) 3
0

,0 x 3
, otherwise

(3.4)

x / 3
5 x
Yeu ( x )
2
0

,0 x 3
,3 x 5
, otherwise

(3.5)

55

x 3
2
7 x
TB ( x)
2
0

x 5
2
10 x
Kha ( x)
3
0

,3 x 5
(3.6)

,5 x 7
, otherwise

x 7

Gioi ( x ) 3
0

,7 x 10

,5 x 7
,7 x 10

(3.7)

, otherwise

(3.8)

, otherwise

V d 3.4: M ha thuc tnh im ton bng cc cng thc (3.4), (3.5), (3.6), (3.7),
(3.8), chng ta c cc gi tr m tng ng:
Bng 3.4. V d m ha thuc tnh im ton theo hnh tam gic.
TID

DT

3.3

4.3

3.8

6.2

TID DT_Gioi DT_Kha DT_TB DT_Yeu DT_Kem


1
0.00
0.00
0.15
0.85
0.00
2
0.00
0.00
0.65
0.35
0.00
3
0.00
0.00
0.40
0.60
0.00
4
0.00
0.60
0.40
0.00
0.00

M ha thuc tnh im ton theo hnh thang:


K hiu
1
2
3
4
5

Tp m
DT_Gioi
DT_Kha
DT_TB
DT_Yeu
DT_Kem

Hnh 3.2. M hnh m ha thuc tnh im theo hnh thang.


Cc hm thnh vin ca cc tp m: Kem, Yeu, TB, Kha, Gii c tnh nh sau:

Kem ( x) 1
3 x

,x 0 x 3
,0 x 2
,2 x 3

(3.9)

0
x 2

Yeu ( x)
1
5 x

,x 2 x 5
,2 x 3
,3 x 4
,4 x 5

(3.10)

56

,x 4 x 7
,4 x 5
(3.11)
,5 x 6
,6 x 7

0
x 4

TB ( x)
1
7 x

Gioi ( x) 1
x 8

, x 8 x 10
,9 x 10
,8 x 9

0
x 6

Kha ( x)
1
9 x

,x 6 x 9
,6 x 7
,7 x 8
,8 x 9

(3.12)

(3.13)

V d 3.5: M ha thuc tnh im ton bng cc cng thc (3.9), (3.10), (3.11), (3.12),
(3.13), chng ta c cc gi tr m tng ng:
Bng 3.5. V d m ha thuc tnh im ton theo hnh thang.
TID

DT

3.3

4.3

3.8

6.2

TID DT_Gioi DT_Kha DT_TB DT_Yeu DT_Kem


1
0.00
0.00
0.00
1.00
0.00
2
0.00
0.00
0.30
0.70
0.00
3
0.00
0.00
0.00
1.00
0.00
4
0.00
0.20
0.80
0.00
0.00

Cc thuc tnh im l (DL), im ha (DH), im sinh (DS), im vn (DV),


im s (DSU), im a (DD), im anh vn (DAV) c m ha tng t im ton.
3.2.1.2.

M ha thuc tnh hnh kim

Thuc tnh hnh kim (HK) c ly t hnh kim ca cui nm lp 10, l


thuc tnh hng mc c tp gi tr l {Tot, Kha, TB, Yeu}, nn gn mi gi tr mt tp
m tng ng l HK_Tot, HK_Kha, HK_TB, HK_Yeu, xc nh qua cc hm thnh
vin sau:

57

1
Tot ( x )
0

, if x ' Tot '


, if x ' Tot '

1
Kha ( x)
0

, if x ' Kha '


, if x ' Kha '

(3.14)

(3.15)

1
TB ( x)
0
1
Yeu ( x)
0

, if x ' TB '
, if x ' TB '

(3.16)

, if x ' Yeu '


, if x ' Yeu '

(3.17)

V d 3.6: M ha thuc tnh hnh kim, c cc gi tr m tng ng:


Bng 3.6. V d m ha thuc tnh HK.
TID

HK

TB

4
3.2.1.3.

TID HK_Tot HK_Kha HK_TB HK_Yeu


1
0.00
0.00
1.00
0.00
2
1.00
0.00
0.00
0.00
3
0.00
1.00
0.00
0.00
4
1.00
0.00
0.00
0.00

T
M ha thuc tnh mc u tt nghip

Thuc tnh mc u tt nghip (MDD), l kh nng hc sinh t c im


tt nghip so vi mc im ti a (60 im), c tnh theo cng thc sau:
MDD

iemTN *100
60

(%)

(3.18)

Trong , iemTN c ly t im tt nghip trung hc ph thng ca hc


sinh.
M ha thuc tnh MDD theo hnh tam gic nh sau:
K hiu
45
46
47
48
49

Tp m
MDD_Ratcao
MDD_Cao
MDD_TB
MDD_Thap
MDD_Ratthap

Hnh 3.3. Hm thnh vin ca cc tp m Ratcao, Cao, TB, Thap, Ratthap theo hnh
tam gic.

58

Cc tp m c dng hnh tam gic c tnh nh sau:


30 x

Ratthap ( x) 30
0
x 30
20
70 x
TB ( x )
20
0

x 70

Ratcao ( x) 30
0

,0 x 30

(3.19)

, otherwise

,30 x 50
,50 x 70

(3.21)

, otherwise

,70 x 100

,0 x 30

x / 30
50 x
Thap ( x)
20
0

,30 x 50

(3.20)

, otherwise

x 50
20
10 x
Cao ( x)
30
0

,50 x 70
,70 x 100

(3.22)

, otherwise

(3.23)

, otherwise

V d 3.7: M ha thuc tnh mc u tt nghip bng cc cng thc (3.19), (3.20),


(3.21), (3.22), (3.23), chng ta c cc gi tr m tng ng:
Bng 3.7. V d m ha thuc tnh MDD theo hnh tam gic.
TID
1
2
3
4

MDD
52.5
56.7
46.7
47.5

TID MDD_Ratcao MDD_Cao MDD_TB MDD_Thap MDD_Ratthap


1
0.00
0.13
0.88
0.00
0.00
2
0.00
0.34
0.67
0.00
0.00
3
0.00
0.00
0.84
0.17
0.00
4
0.00
0.00
0.88
0.13
0.00

M ha thuc tnh MDD theo hnh thang nh sau:


K hiu
45
46
47
48
49

Tp m
MDD_Ratcao
MDD_Cao
MDD_TB
MDD_Thap
DT_Ratthap

Hnh 3.4. Hm thnh vin ca cc tp m Ratcao, Cao, TB, Thap, Ratthap theo
hnh thang

59

Cc tp m c dng hnh thang c tnh theo cng thc:


0
x 80
Ratcao ( x)
10
1

, if x 80 x 100
, if 80 x 90

(3.24)

, if x 90

, if x 40 x 70
0
x 40
, if 40 x 50

10
TB ( x )
1
, if 50 x 60

70

, if 60 x 70
10

1
30 x
Ratthap ( x )
10
0

(3.26)

, if 0 x 20

, if x 60 x 90
0
x 60
, if 60 x 70
(3.25)

10
Cao ( x )
1
, if 70 x 80

90 x , if 80 x 90
10

, if x 20 x 50
0
x 20
, if 20 x 30

(3.27)
Thap ( x ) 10
1
, if 30 x 40

50 x , if 40 x 50
10

(3.28)

, if 20 x 30
, if x 30 x 0

V d 3.8: M ha thuc tnh mc u tt nghip bng cc cng thc (3.24), (3.25),


(3.26), (3.27), (3.28), chng ta c cc gi tr m tng ng:
Bng 3.8. V d m ha thuc tnh MDD theo hnh thang.
TID
1
2
3
4

MDD
52.5
56.7
46.7
47.5

TID MDD_Ratcao MDD_Cao MDD_TB MDD_Thap MDD_Ratthap


1
0.00
0.00
1.00
0.00
0.00
2
0.00
0.00
1.00
0.00
0.00
3
0.00
0.00
0.67
0.33
0.00
4
0.00
0.00
0.75
0.25
0.00

3.2.2. Nhng u im ca vic p dng tp m ri rc ha d liu


Gii quyt c vn im bin gy nh tp m c th phn khong mn hn
nh vo trn ca hm thuc.
Ri rc ho bng cch s dng tp m th s lng tp m gn vi mi thuc tnh
l khng ng k.

60

Tp m cho php chng ta biu din lut kt hp di dng t nhin hn v gn gi


hn vi ngi s dng.
Gi tr thuc tnh sau khi ri rc ho bin thin trong khong [0, 1] cho bit mc
thuc t hay nhiu trong khi cc thuc tnh nh phn trc y ch c mt
trong hai gi tr 0, 1. iu ny cho chng ta kh nng c lng chnh xc hn
ng gp ca cc bn ghi trong c s d liu vo mt tp ph bin no .
Cc thuc tnh mc du c m ho, nhng vn gi nguyn c mt s tnh
cht ca thuc tnh nh phn, do vn c th p dng cc thut ton khai ph lut
kt hp nh phn vo khai ph lut kt hp m vi mt vi thay i. V d tnh cht
mi tp con khc rng ca tp ph bin cng l tp ph bin v mi tp cha tp
khng ph bin u l tp khng ph bin vn cn ng nu chng ta chon c
php ton T-norm ph hp.
3.2.3. Khai ph lut m
Cho I = { i1, i2, , in } l tp n thuc tnh, iu l thuc tnh th u trong I.
T = { t1, t2, , tm } l tp m bn ghi, tv l bn ghi th v trong T.
tv[iu] cho bit gi tr ca thuc tnh iu ti bn ghi tv .
p dng phng php m ha thuc tnh phn trn, chng ta gn thuc
tnh iu vi mt tp cc tp m nh sau :
Fiu { f iu1 , f iu2 ,..., f iuk }

V d 3.9:
FDT ={f1DT_Gioi, f2DT_Kha , f3DT_TB , f4DT_Yeu , f5DT_Kem } (vi k = 5)
FDL ={f1DL_Gioi, f2DL_Kha , f3DL_TB , f4DL_Yeu , f5DL_Kem } (vi k = 5)
FDH ={f1DH_Gioi, f2DH_Kha , f3DH_TB , f4DH_Yeu , f5DH_Kem } (vi k = 5)
FDS ={f1DS_Gioi, f2DS_Kha , f3DS_TB , f4DS_Yeu , f5DS_Kem } (vi k = 5)
FDV ={f1DV_Gioi, f2DV_Kha , f3DV_TB , f4DV_Yeu , f5DV_Kem } (vi k = 5)
FDSU ={f1DSU_Gioi, f2DSU _Kha , f3DSU _TB , f4DSU _Yeu , f5DSU _Kem } (vi k = 5)
FDD ={f1DD_Gioi, f2DD_Kha , f3DD_TB , f4DD_Yeu , f5DD_Kem } (vi k = 5)

61

FDAV ={f1DAV_Gioi, f2DAV_Kha , f3DAV_TB , f4DAV_Yeu , f5DAV_Kem } (vi k = 5)


FHK ={f1HK_Tot, f2HK_Kha , f3HK_TB, f3HK_YEU } (vi k = 4)
FMDD ={f1MDD_Ratcao, f2MDD_Cao , f3MDD_TB , f4MDD_Thap , f5MDD_Ratthap} (vi k = 5)
Lut kt hp m c dng : X is A Y is B
Trong :
X, Y I l cc tp mc (tp thuc tnh).
X = {x1, x2,. . . , xp} xi xj (nu i j )
Y = {y1, y2,. . . , yq} yi yj (nu i j )
A { f x , f x ,..., f x } , B { f y , f y ,..., f y } l tp cc tp m tng ng vi cc
1

thuc tnh trong X v Y. f x Fx v f y Fy


i

Chng ta c th vit li lut kt hp m mt trong hai dng sau :


X {x1 ,...., x p } is A { f x1 ,..., f x p } Y { y1 ,..., y p } is B { f y1 ,..., f yq }
( x1 is f x1 ) and ....and ( x p is f x p ) ( y1 is f y1 ) and ...and ( yq is f yq )

Mt tp thuc tnh m trong lut kt hp m l mt cp <X, A> vi A l


tp cc tp m tng ng vi cc thuc tnh trong X v X I .
h tr m (Fuzzy support) ca tp mc <X, A> k hiu l fs(<X, A>) c
xc nh theo cng thc :
m

{
fs( X , A )

x1

(tv [ x1 ]) x2 (tv [ x2 ]) ... x p (tv [ x p ])}

v 1

(3.29)

Trong :
X = {x1, x2,. . . , xp}: tp cc thuc tnh, xi l thuc tnh th i trong X.
T = {t1, t2,. . . , tm}: tp cc bn ghi (giao tc), tv l bn ghi th v trong T

xi (tv [ xi ]) c xc nh theo cng thc:

62

(t [ x ]) if xi (tv [ xi ]) wxi
xi (t v [ xi ]) xi v i
else
0

(3.30)

Vi: x l hm thuc tp m f x gn vi thuc tnh xi.


i

wxi l ngng (xc nh bi ngi dng) ca hm thuc xi .

T (lc lng ca tp T) l s bn ghi trong T v bng m.

ton t T-norm (T-chun). Trong l thuyt logic m, n c vai tr ging

nh php ton logic AND trong logic c in. C nhiu cch chn php Tnorm nh:
Php ly min: a b min( a, b)
Tch i s: a b a.b
Tch b chn: a b max( 0, a b 1)
a if (b 1)

Tch Drastic: a b b if (a 1)
0 if (a, b 1)

1
w w

Php giao: a b 1 min[1, ((1 a) (1 b) ) ]

(w > 0)

Thc t, php ly min v php tnh tch i s l hai php ton ph hp nht v n
thun tin cho vic tnh ton v th hin c mi lin h cht ch gia cc thuc tnh
trong cc tp ph bin.
Khi chn php ly min cho ton t T-norm, cng thc (3.29) tr thnh cng thc:
m

min{
fs( X , A )

x1

(tv [ x1 ]), x2 (tv [ x2 ]),..., x p (tv [ x p ])}

v 1

(3.31)

Khi chn php tnh tch i s cho ton t T-norm, cng thc (3.29) tr thnh:
m

(
fs( X , A )

v 1

{ xu (tv [ xu ])}

xuX

(3.32)

63

Tp mc ph bin: tp thuc tnh m <X, A> gi l ph bin nu h tr ca n


ln hn h tr ti thiu do ngi dng nhp vo fminsup:

fs( X , A ) fminsup

h tr ca lut m c tnh theo cng thc sau:


fs( X is A Y is B ) fs( X Y , A B )

(3.33)

Mt lut c gi l ph bin nu h tr ca n ln hn hoc bng h tr


ti thiu do ngi dng nh ngha:
fs( X is A Y is B ) fminsup

(3.34)

tin cy ca mt lut m c tnh theo cng thc:


fc( X is A Y is B ) fs( X is A Y is B ) / fs( X , A )

(3.35)

Mt lut c coi l tin cy nu tin cy ca n khng nh hn tin cy ti


thiu cho trc.
fc( X is A Y is B ) fminconf

(3.36)

3.2.4. Thut ton khai ph lut kt hp m


Thut ton ny c xy dng da trn thut ton Apriori, Apriori nh phn v
mt s thay i trong ci t thc t nhm ci thin thi gian tm lut. Thut ton khai
thc lut kt hp m chia lm ba pha chnh nh sau :
Pha 1 : Chuyn cc thuc tnh ca c s d liu dng quan h sang cc thuc tnh
m, thng qua hm thuc ca cc tp m tng ng vi tng thuc tnh.
Pha 2 : Tm tt c cc tp thuc tnh m ph bin dng <X, A> c h tr ln
hn h tr cc tiu ca ngi dng nhp vo: fs(<X, A>) fminsup.
Pha 3 : Sinh cc lut kt hp m tin cy t cc tp ph bin tm thy pha th
hai. Nu <X, A> l mt tp thuc tnh m ph bin th lut kt hp m c
sinh t X c dng:
fc
X ' is A'
X \ X ' is A \ A'

Trong :
X l tp con khc rng ca X.

64

X \ X l hiu ca hai tp hp X v X.
fc l tin cy ca lut tho mn fc fminconf (do ngi dng xc nh).
A l tp con khc rng ca A v l tp cc tp m tng ng vi cc thuc
tnh trong X.
A \ A l hiu ca hai tp hp A v A.
u vo ca thut ton (inputs): c s d liu D vi tp cc thuc tnh I v cc bn
ghi T, ngng hm thuc w f , h tr ti thiu fminsup , tin cy ti thiu fminconf
, v Ton t T-norm ().
u ra ca thut ton (outputs) : Tp tt c cc lut kt hp m tin cy.

65

Bng 3.9. Cc k hiu s dng trong thut ton khai ph lut kt hp m.


K hiu

ngha

C s d liu (dng quan h hoc giao tc).

Tp cc mc (thuc tnh) trong D.

Tp cc bn ghi (giao tc) trong D.


C s d liu c cc thuc tnh c m ha, c tnh ton t c s

DF

d liu ban u thng qua hm thuc ca cc tp m tng ng vi


tng thuc tnh.

IF

Tp cc thuc tnh trong DF, mi thuc tnh u c gn vi mt tp


m. Mi tp m f u c mt ngng wf .
Tp cc bn ghi trong DF , cc thuc tnh trong mi bn ghi c

TF

chuyn sang mt gi tr thuc khong [0, 1] nh hm thuc ca cc tp


m tng ng vi tng thuc tnh.

fminsup

h tr ti thiu.

fminconf

tin cy ti thiu.

Ck

Tp cc thuc tnh c kch thc k.

Lk

Tp cc thuc tnh ph bin c kch thc k.

Tp tt c cc thuc tnh ph bin.

FR

Tp tt c cc lut m sinh ra t Lk.

Thut ton khai ph lut kt hp m c m t c th nh sau:


Begin
(DF, IF, TF) = M_ha_d_liu(D, I, T);
L1 = To_L1(DF, IF, TF, fminsup, wf); //to tp ph bin 1 thuc tnh
L = ; FR = ;
k = 2;
While ( Lk 1 )

66

{
Ck = To_L_k(Lk-1);
Lk = Tnh_Support_K(Ck, DF, fminsup, wf);
FRk = Tm_lut(L, Lk, fminconf);
L L Lk ;
FR FR FRk ;

k = k +1;
}
End
Thut ton s dng cc chng trnh con sau:
Chng trnh con M_ha_d_liu (D, I, T): hm ny thc hin nhim v
chuyn i cc thuc tnh t c s d liu D ban u sang cc thuc tnh m
cho c s d liu DF , DF gm cc thuc tnh ca D c gn thm cc tp m
v gi tr ca cc thuc tnh cc bn ghi T trong D c nh x thnh mt gi
tr thuc khong [0, 1] thng qua cc hm thuc ca cc tp m tng ng vi
cc thuc tnh.
V d 3.10: cho CSDL D nh sau:
Bng 3.10. Thng tin 10 hc sinh.
TID
1
2
3
4
5
6
7
8
9
10

DT DL DH
3.3 4.7 4.7
4.3 6.0 6.0
3.8 3.9 4.0
6.2 5.6 5.5
5.9 7.8 7.6
3.4 5.6 5.3
4.1 4.9 5.0
4.7 5.1 5.5
3.6 6.4 6.2
5.2 4.6 6.2

DS DV DSU DD DAV
6.4 5.5
5.1 6.1
3.5
7.9 5.8
6.3 7.2
4.6
3.8 3.6
3.5 4.9
4.1
5.5 4.7
5.2 5.8
4.3
8.9 6.5
7.7 8.5
4.4
7.1 5.2
5.1 5.6
2.7
6.7 5.0
5.1 6.2
4.0
6.3 5.2
5.4 6.4
4.5
8.4 5.3
6.0 7.9
2.6
7.2 6.8
5.8 7.0
5.5

HK
TB
T
K
T
T
K
K
K
TB
T

MDD
52.5
56.7
46.7
47.5
60.8
40.0
51.7
60.0
67.5
45.0

67

Cc ch ci trong dng u tin c ngha: DT (im ton), DL (im l), DH


(im ha), DS (im sinh), DV (im vn), DSU (im s), DD (im a), DAV
(im anh vn), HK (hnh kim), MDD (mc u).
Sau khi thc hin hm M_ha_d_liu (D, I, T), ta c IF bao gm 49 thuc
tnh c m ha so vi 10 thuc tnh ban u trong c s d liu D. Mi thuc
tnh mi l mt cp nm trong du ngoc vung bao gm tn thuc tnh ban u v tn
ca tp m gn vi thuc tnh y nh sau:

68

Bng 3.11. Cc thuc tnh m.


IF

K hiu

IF

K hiu

[DT, DT_Gioi]

[DSU, DSU_Gioi]

26

[DT, DT_Kha]

[DSU, DSU_Kha]

27

[DT, DT_TB]

[DSU, DSU_TB]

28

[DT, DT_Yeu]

[DSU, DSU_Yeu]

29

[DT, DT_Kem]

[DSU, DSU_Kem]

30

[DL, DL_Gioi]

[DD, DD_Gioi]

31

[DL, DL_Kha]

[DD, DD_Kha]

32

[DL, DL_TB]

[DD, DD_TB]

33

[DL, DL_Yeu]

[DD, DD_Yeu]

34

[DL, DL_Kem]

10

[DD, DD_Kem]

35

[DH, DH_Gioi]

11

[DAV, DAV_Gioi]

36

[DH, DH_Kha]

12

[DAV, DAV_Kha]

37

[DH, DH_TB]

13

[DAV, DAV_TB]

38

[DH, DH_Yeu]

14

[DAV, DAV_Yeu]

39

[DH, DH_Kem]

15

[DAV, DAV_Kem]

40

[DS, DS_Gioi]

16

[HK, HK_Tot]

41

[DS, DS_Kha]

17

[HK, HK_Kha]

42

[DS, DS_TB]

18

[HK, HK_TB]

43

[DS, DS_Yeu]

19

[HK, HK_Yeu]

44

[DS, DS_Kem]

20

[MDD, MDD_Ratcao]

45

[DV, DV_Gioi]

21

[MDD, MDD_Cao]

46

[DV, DV_Kha]

22

[MDD, MDD_TB]

47

[DV, DV_TB]

23

[MDD, MDD_Thap]

48

[DV, DV_Yeu]

24

[MDD, MDD_Ratthap]

49

[DV, DV_Kem]

25

69

S dng phng php m ha theo hnh tam gic, chng ta thu c kt qu sau
khi m ha t d liu bng 3.10 nh sau :
Bng 3.12. Gi tr cc thuc tnh c m ha.
DT
3.3
4.3
3.8
6.2
5.9
3.4
4.1
4.7
3.6
5.2
DH
4.7
6.0
4.0
5.5
7.6
5.3
5.0
5.5
6.2
6.2

1
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
11
0.00
0.00
0.00
0.00
0.20
0.00
0.00
0.00
0.00
0.00

2
0.00
0.00
0.00
0.60
0.45
0.00
0.00
0.00
0.00
0.10
12
0.00
0.50
0.00
0.25
0.80
0.15
0.00
0.25
0.60
0.60

3
0.15
0.65
0.40
0.40
0.55
0.20
0.55
0.85
0.30
0.90

4
0.85
0.35
0.60
0.00
0.00
0.8
0.45
0.15
0.70
0.00

5
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00

DL
4.7
6.0
3.9
5.6
7.8
5.6
4.9
5.1
6.4
4.6

6
0.00
0.00
0.00
0.00
0.27
0.00
0.00
0.00
0.00
0.00

7
0.00
0.50
0.00
0.30
0.73
0.30
0.00
0.05
0.70
0.00

8
0.85
0.50
0.45
0.70
0.00
0.70
0.95
0.95
0.30
0.80

9 10
0.15 0.00
0.00 0.00
0.55 0.00
0.00 0.00
0.00 0.00
0.00 0.00
0.05 0.00
0.00 0.00
0.00 0.00
0.20 0.00

13
0.85
0.50
0.50
0.75
0.00
0.85
1.00
0.75
0.40
0.40

14
0.15
0.00
0.5
0.00
0.00
0.00
0.00
0.00
0.00
0.00

15
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00

DS
6.4
7.9
3.8
5.5
8.9
7.1
6.7
6.3
8.4
7.2

16
0.00
0.30
0.00
0.00
0.63
0.03
0.00
0.00
0.47
0.07

17
0.70
0.70
0.00
0.25
0.37
0.97
0.85
0.65
0.53
0.93

18
0.30
0.00
0.40
0.75
0.00
0.00
0.15
0.35
0.00
0.00

19 20
0.00 0.00
0.00 0.00
0.60 0.00
0.00 0.00
0.00 0.00
0.00 0.00
0.00 0.00
0.00 0.00
0.00 0.00
0.00 0.00

70

DV
5.5
5.8
3.6
4.7
6.5
5.2
5.0
5.2
5.3
6.8
DD
6.1
7.2
4.9
5.8
8.5
5.6
6.2
6.4
7.9
7.0

21
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
31
0.00
0.07
0.00
0.00
0.50
0.00
0.00
0.00
0.30
0.00

22
0.25
0.40
0.00
0.00
0.75
0.10
0.00
0.10
0.15
0.90
32
0.55
0.93
0.00
0.40
0.50
0.30
0.60
0.70
0.70
1.00

23
0.75
0.60
0.30
0.85
0.25
0.90
1.00
0.90
0.85
0.10
33
0.45
0.00
0.95
0.60
0.00
0.70
0.40
0.30
0.00
0.00

24
0.00
0.00
0.70
0.15
0.00
0.00
0.00
0.00
0.00
0.00
34
0.00
0.00
0.05
0.00
0.00
0.00
0.00
0.00
0.00
0.00

25 DSU 26
27
0.00
5.1 0.00 0.05
0.00
6.3 0.00 0.65
0.00
3.5 0.00 0.00
0.00
5.2 0.00 0.10
0.00
7.7 0.23 0.77
0.00
5.1 0.00 0.05
0.00
5.1 0.00 0.05
0.00
5.4 0.00 0.20
0.00
6.0 0.00 0.50
0.00
5.8 0.00 0.40

35 DAV 36
0.00 3.5 0.00
0.00 4.6 0.00
0.00 4.1 0.00
0.00 4.3 0.00
0.00 4.4 0.00
0.00 2.7 0.00
0.00 4.0 0.00
0.00 4.5 0.00
0.00 2.6 0.00
0.00 5.5 0.00

37
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.25

28
0.95
0.35
0.25
0.90
0.00
0.95
0.95
0.80
0.50
0.60
38
0.25
0.80
0.55
0.65
0.70
0.00
0.50
0.75
0.00
0.75

29
0.00
0.00
0.75
0.00
0.00
0.00
0.00
0.00
0.00
0.00

30
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00

39
0.75
0.20
0.45
0.35
0.30
0.90
0.50
0.25
0.87
0.00

40
0.00
0.00
0.00
0.00
0.00
0.10
0.00
0.00
0.13
0.00

71

HK
TB
T
K
T
T
K
K
K
TB
T

41
0.00
1.00
0.00
1.00
1.00
0.00
0.00
0.00
0.00
1.00

42
0.00
0.00
1.00
0.00
0.00
1.00
1.00
1.00
0.00
0.00

43
1.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
1.00
0.00

44
MDD
0.00 52.5
0.00 56.7
0.00 46.7
0.00 47.5
0.00 60.8
0.00 40.0
0.00 51.7
0.00 60.0
0.00 67.5
0.00 45.0

45
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00

46
0.13
0.34
0.00
0.00
0.54
0.00
0.09
0.50
0.88
0.00

47
0.88
0.67
0.84
0.88
0.46
0.50
0.92
0.50
0.13
0.75

48
0.00
0.00
0.17
0.13
0.00
0.50
0.00
0.00
0.00
0.25

49
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00

Do hm thuc ca mi tp m f c mt ngng wf nn ch nhng gi tr no


vt ngng wf mi c tnh n, ngc li nhng gi tr no khng vt ngng
wf c xem nh bng khng (theo cng thc 3.30). Trong lun vn ny chn wf =
0.5, do nhng c t mu trong bng 3.12 cho bit gi tr ca nhng
vt ngng wf.
Chng trnh con L1 = To_L1(DF, IF, TF, fminsup, wf): hm ny sinh ra L1 l
tp tt c cc tp ph bin c mt phn t (lc lng bng 1). Cc tp thuc tnh
ph bin ny phi c h tr ln hn hoc bng fminsup.
Thut gii to L1 : L1 = To_L1(DF, IF, TF, fminsup, wf)
L1 = ;
For each iIF
If (fs({i},wf) fminsup) then
L1 = L1 {i};
EndIf
EndFor
Return L1

72

V d 3.11: CSDL nh bng 3.10 v cho fminsup = 44% ta c:


Bng 3.13. Tp cc thuc tnh m ph bin c mt phn t - L1.
Thuc tnh m ph bin
Tn
8
[DL, DL_TB]
13
[DH, DH_TB]
17
[DS, DS_Kha]
23
[DV, DV_TB]
28
[DSU, DSU_TB]
32
[DD, DD_Kha]
38
[DAV, DAV_TB]
47
[MDD, MDD_TB]
Nh vy L1 = {{8},{13},{17},{23},{28},{32},{38},{47}}

h tr
54.5%
52.0%
53.3%
58.5%
56.5%
49.8%
47.0%
59.2%

Chng trnh con Ck = To_L_k(Lk-1): hm ny thc hin vic sinh ra tp cc


tp thuc tnh m ng c vin c k phn t t tp thuc tnh m ph bin Lk-1 c
k-1 phn t.
Thut gii to Ck = To_L_k(Lk-1):
Begin
For each i, j Lk 1 and i j do
If (i.item1 = j.item1 and i.item2 = j.item2 andand i.itemk-2 =
j.itemk-2 and i.itemk-1<j.itemk-1 and i.itemk-1.fieldroot j.itemk-1.fieldroot)
itemset = {i.item1, i.item2,, i.itemk-2, i.itemk-1, j.itemk-1};
Ck Ck {itemset};

EndIf
EndFor
Return Ck
End
Thut gii trn thuc tnh i.itemk-1.fieldroot, j.itemk-1.fieldroot l nhng thuc tnh
gc; i.itemk, j.itemk l s hiu ca thuc tnh m th k.
V d 3.12. vi L1 nh sau:

73

Bng 3.14. Tp ph bin L1.


Item1
Thuc tnh gc h tr
8
DL
54.5%
13
DH
52.0%
17
DS
53.3%
23
DV
58.5%
28
DSU
56.5%
32
DD
49.8%
38
DAV
47.0%
47
MDD
59.2%
Vi k = 2, C2 = To_L_k(L1) cho bng tp ng c vin 2 thuc tnh C2 l:
Bng 3.15. Tp ng c vin C2.
Item1
8
8
8
8
8
8
8
13
13
13
13
13
13
17
17
17
17
17
23
23
23
23
28
28
28
32
32
38

Item2
13
17
23
28
32
38
47
17
23
28
32
38
47
23
28
32
38
47
28
32
38
47
32
38
47
38
47
47

74

C2 = {{8,13}, {8,17}, {8,23}, {8,28}, {8,32},

{8,38}, {8,47}, {13,17},

{13,23}, {13,28}, {13,32}, {13,38}, {13,47}, {17,23}, {17,28}, {17,32}, {17,38},


{17,47}, {23,28}, {23,32}, {23,38}, {23,47}, {28,32}, {28,38}, {28,47}, {32,38},
{32,47}, {38,47}}
Tt c cc tp thuc tnh u l hp l v khng c cng thuc tnh gc.
Chng trnh con Lk = Tnh_Support_K(Ck, DF, fminsup, wf): chng trnh con
ny duyt qua csdl DF, vi ngng wf v ton t T-norm tnh h tr cho
cc tp thuc tnh trong Ck . Sau khi duyt xong, Tnh_Support_K s ch chn
nhng tp ph bin (c h tr ln hn hoc bng fminsup) a vo Lk.
V d 3.13: vi k = 2 v C2 nh trn, fminsup = 44%, ton t T-norm l cc tiu v
wf=0.5, sau khi thc hin Tnh_Support_K ta c bng L2 nh sau:
Bng 3.16. h tr ca tp 2 thuc tnh.
Item1
8
8
8
8
8
8
8
13
13
13
13
13
13
17
17
17
17
17
23
23

Item2
13
17
23
28
32
38
47
17
23
28
32
38
47
23
28
32
38
47
28
32

h tr
44.5%
42.0%
45.0%
46.0%
31.5%
31.5%
47.2%
35.5%
46.0%
41.5%
23.5%
29.0%
45.2%
42.3%
42.5%
39.7%
26.0%
39.7%
47.5%
31.5%

75

23
38
25.0%
23
47
41.2%
28
32
29.5%
28
38
25.0%
28
47
42.7%
32
38
32.5%
32
47
30.7%
38
47
28.7%
Bng 3.17. L2 - Cc tp ph bin 2 thuc tnh c h tr >= fminsup.
Item1 Item2
h tr
8
13
44.5%
8
23
45.0%
8
28
46.0%
8
47
47.2%
13
23
46.0%
13
47
45.2%
23
28
47.5%
Nh vy, L2 = {{8,13},{8,23},{8,28},{8,47},{13,23},{13,47},{23,28}}
k = 3, tp ng c vin 3 thuc tnh C3 c to t L2 v h tr tng ng:
Bng 3.18. Cc tp 3 thuc tnh v h tr tng ng.
item1
8
8
8
8
8
8
13
13
23

item2
13
13
13
23
23
28
23
23
28

item3
23
28
47
28
47
47
28
47
47

h tr
43.5%
39.5%
39.7%
39.0%
38.7%
40.7%
40.5%
39.2%
35.2%

V cc tp 3 thuc tnh c h tr <fminsup nn L3 = v thut ton dng.


Vi v d trn, sau khi thc hin thut ton, ta c tp cc tp ph bin:
L L1 L2 ={{8}, {13}, {17}, {23}, {28}, {32}, {38}, {47}, {8,13}, {8,23},

{8,28}, {8,47}, {13,23},{13, 47}, {23,28}}

76

Chng trnh con FRk= Tm_lut(L, Lk, fminconf): chng trnh ny sinh ra lut
kt hp m tin cy t cc tp ph bin Lk.
Thut gii to FRk= Tm_lut(L, Lk, fminconf):
Begin
FRk = ;
For each X Lk do
For each Y X and X Y and Y do
r X \Y Y

If( fc(r ) fminconf) then


FRk FRk {r};

EndIf
EndFor
EndFor
End
V d 3.14: thi hnh chng trnh con FRk= Tm_lut(L, Lk, fminconf), vi k=2 v
tin cy cc tiu l fminconf = 80% , ta c cc lut m FRk sinh ra t Lk:
Bng 3.19. Cc lut m c sinh ra t csdl trong bng 3.12.
STT
1
2
3
4
5
6
7
8
9
10
11

Lut
DL_TB DH_TB
DH_TB DL_TB
DL_TB DV_TB
DL_TB DSU_TB
DSU_TB DL_TB
DL_TB MDD_TB
MDD_TB DL_TB
DH_TB DV_TB
DH_TB MDD_TB
DV_TB DSU_TB
DSU_TB DV_TB

h tr
44.5%
44.5%
45.0%
46.0%
46.0%
47.2%
47.2%
46.0%
45.2%
47.5%
47.5%

tin cy
81.7%
85.6%
82.6%
84.4%
81.4%
86.5%
80.0%
88.5%
86.8%
81.2%
84.1%

You might also like