Professional Documents
Culture Documents
Weka Tutorial
Weka Tutorial
Zdravko Markov
Central Connecticut State University
markovz@ccsu.edu
Ingrid Russell
University of Hartford
irussell@hartford.edu
Data Mining
Data Mining: "The non trivial extraction of implicit, previously unknown, and potentially
useful information from data"
William J Frawley, Gregory Piatetsky-Shapiro and Christopher J Matheus
Data mining is the analysis of data and the use of software techniques for finding
patterns and regularities in sets of data.
Approved loan: s1, s2, s4, s5, s6, s7, s8, s9, s14, s15, s17, s18, s19
Rejected loan: s3, s10, s11, s12, s13, s16, s20
Clients data:
http://www.cs.ccsu.edu/~markov/
Download Ch1, DMW Book
Download datasets
Choose filters/unsupervised/attribute/StringToWordVector
Set the parameters as needed (see More)
Click on Apply
Attribute Selection
Finding a minimal set of attributes that preserve the class distribution
Attribute relevance with respect to the class not relevant attribute (accounting)
Attribute Selection
Attribute relevance with respect to the class relevant attribute (science)
Attribute
outlook
Rules
Errors
2/5
sunny -> no
overcast -> yes 0/4
2/5
rainy -> yes
Total
error
4/14
2/4
2/6
1/4
5/14
humidity
3/7
1/7
4/14
2/8
5/14
true -> no
3/5
high -> no
normal -> yes
windy
Classification OneR
11
12
10
Distance(test,X)
play
no
no
yes
yes
yes
yes
Departments-binary-training
predicted
a
Clustering k-means
Click on Ignore attributes
Association Rules
Thank you!