Professional Documents
Culture Documents
Hanh An Lol
Hanh An Lol
Ni dung mn hoc:
I M I y
Tin x l d liu
Pht hin cc lut kt hp
Cc k thut phn lp v d on
Cc k thut phn nhm
Khai Ph D Liu
C th ti v t a ch:
http://www.cs.waikato.ac.nz/ml/weka/
Simple CLI
Experimenter
Mi trng cho php tin hnh cc th nghim v thc hin cc
kim tra thng k (statistical tests) gia cc m hnh hc my
KnowledgeFlow
Mi trng cho php bn tng tc ha kiu ko/th thit
k cc bc (cc thnh phn) ca mt th nghim
Preprocess
chn v thay i (x l) d liu lm vic
Classify
hun luyn v kim tra cc m hnh hc my (phn loi, hoc
hi quy/d on)
Cluster
hc cc nhm t d liu (phn cm)
Associate
khm ph cc lut kt hp t d liu
Select attributes
xc nh v la chn cc thuc tnh lin quan (quan trng)
nht ca d liu
Visualize
xem (hin th) biu tng tc 2 chiu i vi d liu
ca tp d liu
V d ca mt tp d liu
.......Tn ca tp
relation .weather/
d liu
@data
sunny,8 5,8 5,FALSE,no
overcast,83,86,FALSE,yes |
I
Cc vi d (instances)
Ri rc ha (Discretization)
Chun ha (Normalization)
Ly mu (Re-sampling)
Khai Ph D Liu
10
More options...
11
12
Khai Ph D Liu
13
<
>
_ > _
/ V
-h
A < A
I A
k-Means
Cc b phn cm c th c hin th kt qu v so
snh vi cc cm (lp) thc t
^Hy xem giao din ca WEKA Explorer ...
La chn mt b phn cm (cluster builder)
La chn ch phn cm (cluster mode)
Use training set. Cc cm hc c s c kim tra i vi tp hc
Supplied test set. S dng mt tp d liu khc kim tra cc cm hc
c
Percentage split. Ch nh t l phn chia tp d liu ban u cho vic xy
dng tp kim tra
Classes to clusters evaluation. So snh chnh xc ca cc cm
hc c i vi cc lp c ch nh
Ignore attributes
^ La chn cc thuc tnh s khng tham gia vo qu trnh hc cc cm
Khai Ph D Liu
15
WEKA c th hin th
Mi thuc tnh ring l (1-D visualization)
Mt cp thuc tnh (2-D visualization)
I A
^ A
Khai Ph D Liu
17