Professional Documents
Culture Documents
HNG DN S DNG
WEKA EXPLORER 3.6.3
________________________________________________
________________________________________________
Thng 8/2011
Weka Explorer 3.6.3 CTT305 Khai thc d liu & ng dng
MC LC
1. Gii thiu
2. Tin x l d liu
V d: Unsupervised.Attribute.Discretize
o Hnh bn di l mn hnh iu chnh tham s cho phng php chia gi, trong
c cc tham s nh s lng gi (bins), chia gi theo rng/ su
(useEqualFrequency),
S dng th Asscociate
Th hin kt qu:
o Tp ph bin: Danh sch cc hng mc v ph bin
4. Phn loi
S dng th Classify.
(1): Classifier: La chn b phn loi v cc tham s.
(2): Test Options: Cc ty chn kim th m hnh:
o Use training set: S dng chnh tp d liu hun luyn kim nghim.
o Supplied test set: S dng mt tp d liu khc.
o Cross-validation: Chia d liu thnh nhiu phn (Folds) thc hin nhiu ln
nh gi kt qu.
o Percentage split: Chia d liu thnh 2 phn theo t l %, mt phn dng xy
dng m hnh, phn cn li dnh cho kim th.
o More Options: iu chnh mt s tham s khc:
- Output predictions:
Tr ra kt qu phn loi chi tit cho tng mu
trong d liu kim nghim.
(3): Result list: Danh sch kt qu cc ln chy thut ton, c th tng tc trn danh
sch ny thc hin mt cc chc nng ph.
- Load model, Save model: M/Lu m hnh
phn loi ra tp tin.
- Visualize tree: Mt s b phn loi s dng cy
quyt nh c th cho hnh nh cy.
5. Gom cm
S dng th Cluster.
(1): Clusterer: La chn m hnh gom cm v cc tham s.
(2): Cluster mode: Cc ty chn kim th m hnh:
o Use training set: S dng chnh tp d liu hun luyn kim nghim.
o Supplied test set: S dng mt tp d liu khc.
o Percentage split: Chia d liu thnh 2 phn theo t l %, mt phn dng xy
dng m hnh, phn cn li dnh cho kim th.
o Classes to clusters evaluation: Gom cm trn ton b d liu v nh gi vi
tiu ch li l thp nht. Vi phng php ny ta c th p dng cc phng
php nh ngoi kho st cht lng gom cm.
Ignore attributes: B qua cc thuc tnh ch nh khi tin hnh gom cm.
6. Mt s nh dng tp tin
Attribute-Relation File Format (*.arff)
o L tp tin vn bn, gm 2 phn:
V d:
Mt tp tin csv c ni dung nh sau: