Professional Documents
Culture Documents
Assignment 2 Weka
Assignment 2 Weka
Group Members:
Yogesh Katore(CI15M06)
Kiran Gavhane(CI15M07)
Problem:
Selecting
wheather.arff
file
and
apply
different
Learning
Answer:
A. Become familiar with the use of the WEKA workbench to invoke several
different machine learning schemes. Following are the some snapshot which
shows analysis or training set and splitting set at 66%.
Learning Scheme:ZeroR
Use the following learning schemes, with the default settings to analyze
the weather data (in weather.arff). For test options, first choose "Use
training set", then choose "Percentage Split" using default 66%
OneR
Model:
sunny
rainy -> yes
-> no
NaiveBayes
Evaluate using training set, error rate: 1/14 =7%
Evaluate using split, error rate: 2/5 = 40%
outlook = rainy
| windy = TRUE: no (2.0)
| windy = FALSE: yes (3.0)
Answer: The one with the lower error on the separate test set, which
is NaiveBayes.
What can you say about accuracy when using training set data
and when using a separate percentage to train?
Answer:
When using only training data, the classifier that can build a
more complex model, like J4.8 decision tree, can fit the data.
Accuracy on the train set is not a good predictor of the accuracy on
the separate test set.