0% found this document useful (0 votes)

366 views21 pages

Weka Sample

Weka classifier performance analysis

Uploaded by

LoveAstro

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

366 views21 pages

Weka Sample

Weka classifier performance analysis

Uploaded by

LoveAstro

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Assignment: 1

Artificial Neural Network

Group members and Data Sets

CSC/14/51 Nursery Data set (page 02-09)

CSC/14/05 - Thyroid Disease Dataset (page 10-16)
CSC/14/22- Wine Data set (page 17-20)

Performance Analysis of Different classifiers on WEKA

1|Page

Assignment: 1

Artificial Neural Network

Introduction
Gathered data sets are include valuable information and knowledge which is often hidden. Processing the huge data
and retrieving meaningful information from it is a difficult task. The aim of our work is to investigate the performance
of different classification methods using WEKA for different three dataset obtained from UCI data archive.
WEKA is an open source software which consists of a collection of machine learning algorithms for data mining tasks.
This assignment is to investigate the performance of different classification or clustering methods for a set of large
data set.

Materials and methods

We have used the popular, open-source data mining tool Weka (version 3.6.6) for this analysis. Three different data
sets have been used and the performance of a comprehensive set of classification algorithms (classifiers) has been
analyzed. The analysis has been performed on a Mac book pro with Intel i5 CPU, 2.24 GHz Processor, OSX
Yosemite and 4.00 GB of RAM. The data sets have been chosen such that they differ in size, mainly in terms of the
number of attributes.
For this study the following
Data sets were used:
a) Nursery Database, which is developed to rank applications for nursery schools for providing certain facilities,
based on three factors.
Occupation of parents and child's nursery
Family structure and financial standing
social and health picture of the family
Under this study there was 12960 samples (instances) were analyzed against eight attributes which are,
parents
:
usual, pretentious,
great_pret
has_nurs :
proper, less_proper,
improper,
critical, very_crit
form
:
complete,
completed,
incomplete,
foster
children
:
1,
2,
3,
more
housing
:
convenient,
less_conv,
critical
finance
:
convenient,
inconv
social
:
non-prob,
slightly_prob, problematic and
health
:
recommended, priority,
not_recom.
Classifiers were used:
A total of five classification procedures have been used for this performance comparative study. The
classifiers in Weka have been categorized into different groups such as Bayes, Functions, Lazy, Rules, Tree
based classifiers etc. The following sections explain a brief about each of these procedures/algorithms.
i.

ii.

iii.

Multilayer Perceptron: Multilayer Perceptron is a nonlinear classifier based on the Perceptron. A Multilayer
Perceptron (MLP) is a back propagation neural network with one or more layers between input and
output layer.
A Support Vector Machine (SVM): SVM is a discriminative classifier formally defined by a separating hyper
plane. In other words, given labeled training data (supervised learning), the algorithm outputs an optimal
hyper plane which categorizes new examples.
J48: The J48 algorithm is WEKAs implementation of the C4.5 decision tree learner. The algorithm uses a
greedy technique to induce decision trees for classification and uses reduced-error pruning.

2|Page

Assignment: 1

Artificial Neural Network

iv.

IBk: IBk is a k-nearest-neighbor classifier that uses the same distance metric. k-NN is a type of instance
based learning or lazy learning where the function is only approximated locally and all computation is
deferred until classification. In this algorithm an object is classified by a majority vote of its neighbors.

Naive Bayesian: Naive Bayesian classifier is developed on bayes conditional probability rule used for
performing classification tasks, assuming attributes as statistically independent; the word Naive means
strong. All attributes of the data set are considered as independent and strong of each other.

Steps to apply classification techniques on data set and get result in Weka:

Step 1: Take the input dataset.

Step 2: Apply the classifier algorithm on the whole data set.
Step 3: Note the accuracy given by it and time required for execution.
Step 4: Repeat step 2 and 3 for different classification algorithms on different datasets.
Step 5: Compare the different accuracy provided by the dataset with different classification algorithms and
Identify the significant classification algorithm for particular dataset

Results and Discussion

The data sets have been submitted to a set of classification algorithms of Weka. We have used the 'Explorer' option
of the Weka tool. Certain comparative studies were conducted and following factors were derived. Under this
study I have used two types of test mode which are 10-fold cross-validation and percentage split 66%.

Classification
Multilayer
Perceptron
Support Vector
Machine
J48
k-nearest
neighbor
Naive
Bayesian

Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Kappa
statistic

Mean
absolute
error

Root
mean
squared
error

Relative
absolute
error

Root
relative
squared
error

69.56

99.7299

0.2701

0.996

0.0014

0.0186

0.5218

5.0233

14.23

97.5617

2.4383

0.9641

0.0098

0.0988

3.5721

26.7298

0.03

97.0525

2.9475

0.9568

0.0153

0.0951

5.6151

25.7324

98.3796

1.6204

0.9761

0.0859

0.1466

31.474

39.6775

90.3241

9.6759

0.8567

0.0765

0.1767

28.0234

47.8152

Table 1 Results summary of 10 fold cross validation

Classification
Multilayer
Perceptron
Support Vector
Machine
J48
k-nearest
neighbor
Naive
Bayesian

Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Kappa
statistic

Mean
absolute
error

Root
mean
squared
error

Relative
absolute
error

Root
relative
squared
error

69.28

97.4353

2.5647

0.962

0.006

0.0514

2.1843

13.9063

8.62

97.4353

2.5647

0.962

0.006

0.0514

2.1843

13.9063

0.14

96.4821

3.5179

0.9483

0.0186

0.1055

6.7947

28.5491

97.5261

2.4739

0.9636

0.0854

0.1512

31.2706

40.9314

0.03

90.6718

9.3282

0.8618

0.077

0.1766

28.185

47.7877

Table 2Results summary of 66% split

3|Page

Assignment: 1

Artificial Neural Network

When considering time consuming for five classifiers
under two testing sample methods, the 66% split
take short time for SVM method of classification ,but
comparatively the cross validation method take
more time than percentage split.

Time Taken in seconds

80
60
40
20

Correctly Classified Instances

0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Cross validation 10

Naive
Bayesian

66% split

Correctly identified instances are showing better

results under cross validation test mode. All together
all classifier shows relatively similar results except the
naive Bayesian classifier. It gives better results under
the 66% split test mode.
Incorrectly Classified Instances
12
10
8
6
4
2
0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Cross validation 10

Naive
Bayesian

102
100
98
96
94
92
90
88
86
84

Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Cross validation 10

Naive
Bayesian

66% split

If we consider the incorrectly identified instances,

again the split validation shows poor performance
than cross validation test mode. Under multilayer
perception, the graph shows greater deviation from
each test mode and cross validation gives better
results under, the multilayer perception classifier.
The nave Bayesian shows low performance in both
test mode.

66% split

Capa Statistics

Capa statistics coefficient is a statistical measure of

inter-rater agreement or inter-annotator agreement
for qualitative (categorical) items. From capa we can
come to this conclusion.

< 0 Less than chance agreement

0.010.20 Slight agreement
0.21 0.40 Fair agreement
0.410.60 Moderate agreement
0.610.80 Substantial agreement
0.810.99 Almost perfect agreement
Mean absolute error

0.1
0.08
0.06
0.04
0.02
0
Multilayer Support
Perceptron Vector
Machine

J48

Cross validation 10

4|Page

k-nearest
neighbor

Naive
Bayesian

66% split

1.05
1
0.95
0.9
0.85
0.8
0.75
Multilayer Support
Perceptron Vector
Machine

J48

Cross validation 10

k-nearest
neighbor

Naive
Bayesian

66% split

The MAE measures the average magnitude of the

errors in a set of five classes. If we consider the
following graph the k-nearest neighbor classifier
shows high mean absolute error than other classifier.
The multilayer perception shows relatively low
absolute error from others and J48 shows average
error rate. When considering the two training modes
there are no big deviation from each other except
the multilayer perception. Multilayer perception
shows low absolute error under cross validation
training mode.

Assignment: 1

Artificial Neural Network

Root mean squared error /Relative

absolute error /Root relative squared
error for Cross validation

Root mean squared error /Relative absolute

error /Root relative squared error for 66%
split
60
50
40
30
20
10
0

60
50
40
30
20
10
0
Multilayer Support
Perceptron Vector
Machine
RMs

J48

RAE

k-nearest
neighbor

Multilayer Support
Perceptron Vector
Machine

Naive
Bayesian

RMs

RRSE

J48

RAE

k-nearest
neighbor

Naive
Bayesian

RRSE

Above two graphs are showing comparison of different error parameters, considerably the multilayer
perception classifier showing good results, it means lower error rate. Except others, but k-nearest and nave
Bayesian are showing high amount of error in determining the five classes.
10-fold cross-validation
100%
90%
80%
70%
60%
50%
40%
30%
20%
10%
0%
Time
taken
seconds

Correctly
Classified
Instances
Multilayer
Perceptron

Incorrectly
Classified
Instances

Kappa
statistic

Support
Vector
Machine

Mean
absolute
error
J48

Root mean
squared
error

k-nearest
neighbor

Relative
absolute
error

Root relative
squared
error

Naive
Bayesian

split 66.0% train, remainder test

100%
80%
60%
40%
20%
0%
Time
taken
seconds

Correctly
Classified
Instances
Multilayer
Perceptron

Incorrectly
Classified
Instances

Kappa
statistic

Support
Vector
Machine

Mean
absolute
error
J48

Root mean
squared
error

k-nearest
neighbor

Relative
absolute
error

Root relative
squared
error

Naive
Bayesian

The above two graphs are showing the compared performance matrices of classifiers in percentage. The close look
of these graphs are showing no significant changes between the parameters. The lower level showing good
performance and higher percentage showing lower performance. Also if we consider training mode the 10 fold cross
validation is showing significant performance than 66% of split. This results proved that multilayer perception is the
best classifier for the nursery dataset and nave Bayesian is the lowest.
5|Page

Assignment: 1

Artificial Neural Network

Precision of classifiers (66%

split)

0.012

0.01

HUNDREDS

Precision of classifiers (Cross

Validation)

0.008
0.006
0.004
0.002

0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

not_recom

recommend

priority

spec_prior

Naive
Bayesian

0.012
0.01
0.008
0.006
0.004
0.002
0
Multilayer Support
Perceptron Vector
Machine

very_recom

J48

k-nearest
neighbor

not_recom

recommend

priority

spec_prior

Naive
Bayesian

very_recom

The above graphs are showing the precision comparison of five classifier against five classes that we have identified.
Under cross validation training mode the class recommended shows zero precision among all classifier we have used.
But class not_recommeded shows high precision in both training mode. But ver_recommeded class show significant
precision in 66% split. Considering above fact the cross validation again lead in performance.
TP Rate for % split

TP Rate Cross Validation

not_recom

recommend

priority

spec_prior

very_recom

1.2
1
0.8
0.6
0.4
0.2
0

not_recom

recommend

priority

spec_prior

Multilayer Support
Perceptron Vector
Machine

J48

very_recom

1.2
1
0.8
0.6
0.4
0.2
0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

k-nearest
neighbor

Naive
Bayesian

Considering the True positive rate the not recommended class is showing similar results under both testing mode
and all types of classifier used same like us the priority and specific priority class. But the class very recommended is
showing significant different between classifiers and testing mode. However we got good results in multilayer
perception and J48 under cross validation testing mode.
FP Rate for % split

FP Rate Cross Validation

not_recom

recommend

priority

spec_prior

very_recom

0.12

0.1

0.08

0.06

not_recom

recommend

priority

spec_prior

very_recom

0.04

0.02

0
Multilayer Support
Perceptron Vector
Machine

6|Page

J48

k-nearest
neighbor

Naive
Bayesian

Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Assignment: 1

Artificial Neural Network

False positive rate is high in 66% split rather than cross validation for priority and specific priority class when using
nave Bayesian classifier and this is less in SVM classifier.
Recall for % split

Recall Cross Validation

not_recom

recommend

priority

spec_prior

very_recom

1.2
1
0.8
0.6
0.4
0.2
0

not_recom

recommend

priority

spec_prior

Multilayer Support
Perceptron Vector
Machine

J48

very_recom

1.2
1
0.8
0.6
0.4
0.2
0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

k-nearest
neighbor

Naive
Bayesian

Recall measurement shows better performance under cross validation. However very recommended class shows
greater difference between the classifiers. But under 66% percentage split we can see lots of differences in recall.
F-Measure for % split

F- Measure Cross Validation

not_recom

recommend

priority

spec_prior

very_recom

1.2
1
0.8
0.6
0.4
0.2
0

not_recom

recommend

priority

spec_prior

Multilayer Support
Perceptron Vector
Machine

J48

very_recom

1.2
1
0.8
0.6
0.4
0.2
0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

k-nearest
neighbor

Naive
Bayesian

Study of F-measure does not affect significantly in both testing mode and different classifiers except the one class
which is very-recommended.
ROC Area for % split

ROC Area Cross Validation

not_recom

recommend

priority

spec_prior

very_recom

1.2
1
0.8
0.6
0.4
0.2
0

not_recom

recommend

priority

spec_prior

very_recom

1.2
1
0.8
0.6
0.4
0.2
0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

When considering the ROC values the not recommended, specific priority and priority classes are showing high
performance than very recommended and recommended class. In overall view the multilayer perception shows good
performance in classification.
7|Page

Assignment: 1

Artificial Neural Network

ROC curve Analysis

To analyze the ROC performance the above model was [Link] running the model ROC curves were obtained
for different classifiers of particular class.

Class: - not recommended

Class :- recommended

Class: - Very recommended

Class: - priority

When seeing the above ROC curves of classes the recommended class shows poor performance for most of the
classifiers. May due to less amount of instances in that class. (Depend on the data set). Most of the time multilayer
perception gives the good performance. The analysis of ROC time consuming process therefore I did only for cross
validation mode.
8|Page

Assignment: 1

Artificial Neural Network

Class: - specific priority

Under ROC analysis provides the good performance comparison of different classifiers.
Comparison of confusion matrix (cross validation vs 66% split for five classification)
Multilayer perception

svm

J48

K-nearest neighbor

Nave Bayesian

Conclusion
As a conclusion, we have met our objective which is to evaluate and investigate five selected classification algorithms
based on Weka. The best algorithm based on the nursery data is multilayer perception classifier with an accuracy of
99.7299% and the total time taken to build the model is at 69.56 seconds. When considering the time factor
multilayer perception is more time consuming. According to the time factor k-nearest neighbor and nave Bayesian
classifiers took less time but their accuracy is relatively lower than the multilayer perception. By considering all
aspects of performance parameter under two types of training method the multilayer perception significantly
provide the more accurate results. Also the performance of other classification methods are in decreasing order such
as SVM,J48,k-nearest neighbor and nave Bayesian.

9|Page

Assignment: 1

Artificial Neural Network

b) Data sets used:

Thyroid disease dataset supplied by the Garavan Institute and J. Ross % ; Quinlan, New South
Wales Institute, Syndney, [Link] date set used to identify those who has thyroids weather
getting sick or negative.
Under this study there was 3772 samples (instances) were analyzed against thirty attributes
age:
continuous.
sex:
M, F.
on thyroxine:
f, t.
query on thyroxine:
f, t.
on antithyroid medication:
f, t.
sick:
f, t.
pregnant:
f, t.
thyroid surgery:
f, t.
I131 treatment:
f, t.
query hypothyroid:
f, t.
query hyperthyroid:
f, t.
lithium:
f, t.
goitre:
f, t.
tumor:
f, t.
hypopituitary:
f, t.
psych:
f, t.
TSH measured:
f, t.
TSH:
continuous.
T3 measured:
f, t.
T3:
continuous.
TT4 measured:
f, t.
TT4:
continuous.
T4U measured:
f, t.
T4U:
continuous.
FTI measured:
f, t.
FTI:
continuous.
TBG measured:
f, t.
TBG:
continuous.
referral source:
WEST, STMW, SVHC, SVI, SVHD, other.
The data sets have been submitted to a set of classification algorithms of Weka. We have used the 'Explorer' option
of the Weka tool. Certain comparative studies were conducted and following factors are derived. Under this study I
have used two types of test mode which are 10-fold cross-validation and percentage split 66%.

Classification
Multilayer
Perceptron
Support Vector
Machine
J48
k-nearest
neighbor
Naive
Bayesian

Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Kappa
statistic

Mean
absolute
error

Root
mean
squared
error

Relative
absolute
error

Root
relative
squared
error

16.25

97.2699

2.7301

0.7265

0.0319

0.1488

27.9566

63.3737

0.62

94.1498

5.8502

0.0585

0.2419

51.2583

103.0411

0.15

98.0499

1.9501

0.8149

0.0234

0.1336

20.4604

56.914

95.4758

4.5242

0.5306

0.0456

0.2126

39.9548

90.5775

0.01

93.1357

6.8643

0.5385

0.088

0.2271

77.1257

96.7281

Table 3 Results summary of 66% split percentage

10 | P a g e

Assignment: 1

Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Kappa
statistic

Mean
absolute
error

Root
mean
squared
error

Relative
absolute
error

Root
relative
squared
error

16.21

97.2428

2.7572

0.7522

0.0336

0.1553

29.124

64.7703

0.27

93.8494

6.1506

-0.0005

0.0615

0.248

53.3871

103.4332

0.07

98.807

1.193

0.8943

0.0146

0.1054

12.685

43.9447

96.1824

3.8176

0.6465

0.0384

0.1953

33.3689

81.4648

0.01

92.6034

7.3966

0.5249

0.0888

0.2294

77.0863

95.6866

Table 4 Results summary of 10 fold validation

When considering time consuming for five classifier

under two test sample methods, in the both test Knearest neighbor classifier is faster since only taking
0 seconds. Naive Bayes taking same time (0.01sec) on
both test methods.

Time Taken in seconds

20
15
10
5

Correctly Classified Instances

0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

100
98
96

Cross validation 10

66% split

94
92

Correctly identified instances shows averagely better

results in cross validation test, though overall J48 classifier
giving better results in both test methods, but when we use
cross validation J48 giving better results. So j48 will be
better classifier for cross validation test.

90
88
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Cross validation 10

Incorrectly Classified Instances

Naive
Bayesian

66% split

If we consider the incorrectly identified instances,

again the cross validation show poor performance
than split validation test mode. Under J48 the graph
shows greater deviation from each test mode.

8
6
4
2
0
Multilayer Support
Perceptron Vector
Machine

J48

Cross validation 10

k-nearest
neighbor

Naive
Bayesian

66% split

Capa Statistics
1
0.8
0.6
0.4
0.2
0
-0.2

Multilayer Support
Perceptron Vector
Machine

J48

Cross validation 10

11 | P a g e

k-nearest
neighbor

Naive
Bayesian

66% split

Assignment: 1

Artificial Neural Network

The MAE measures the average magnitude of the

errors in a set of five classes. If we consider the
following graph the J48 classifier has lower mean
absolute error than other classifier. The Naise bayes
shows relatively higher absolute error from others
[Link] when use cross validation test
method giving less error comparatively.

Mean absolute error

0.1
0.08
0.06
0.04
0.02
0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Cross validation 10

Naive
Bayesian

Root mean squared error /Relative absolute

error /Root relative squared error for 66%
split

66% split
120
100
80
60
40
20
0

Root mean squared error /Relative absolute

error /Root relative squared error for Cross
validation
120
100
80
60
40
20
0

Multilayer Support
Perceptron Vector
Machine
RMs
Multilayer Support
Perceptron Vector
Machine
RMs

J48

RAE

k-nearest
neighbor

J48

RAE

k-nearest
neighbor

Naive
Bayesian

RRSE

Naive
Bayesian

RRSE

Above two graphs is showing comparison of different error parameters, considerably the J48 classifier showing
good results since gives lower error rate. Except others, but LIBSVM and Naise bayes show high amount of error
in determining the five classes.
10-fold cross-validation
100%
90%
80%
70%
60%
50%
40%
30%
20%

10%
0%
Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Multilayer
Perceptron

12 | P a g e

Kappa
statistic

Support
Vector
Machine

Mean
absolute
error

J48

Root mean
squared
error

k-nearest
neighbor

Relative
absolute
error

Naive
Bayesian

Root relative
squared
error

Assignment: 1

Artificial Neural Network

split 66.0% train, remainder test

100%
80%
60%
40%
20%
0%
Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Multilayer
Perceptron

Kappa
statistic

Support
Vector
Machine

Mean
absolute
error
J48

Root mean
squared
error

k-nearest
neighbor

Relative
absolute
error

Root relative
squared
error

Naive
Bayesian

The above two graphs are showing the compared performance matrices in percentage. The close look of this graph
showing no significant changes between the parameters. The lower level showing good performance and higher
percentage showing lower performance. Also if we consider training mode the 10 fold cross validation showing
significant performance than 66% of split. This results proved that multilayer perception is the best classifier for the
nursery dataset and nave Bayesian is the lowest.

TP Rate

FP Rate

Precision

Recall

FMeasure

ROC Area

Confusion Matrix

0.992

0.333

0.98

0.992

0.986

0.95

0.667

0.008

0.833

0.667

0.741

0.95

a b <-- classified as
3497 44 | a = negative
60 171 | b = sick

0.941

0.97

0.5

J48(negative)

0.993

0.213

0.987

0.993

0.99

0.878

J48(Sick)

0.787

0.007

0.868

0.787

0.825

0.878

0.984

0.52

0.968

0.984

0.976

0.739

0.48

0.016

0.655

0.48

0.554

0.739

0.94

0.213

0.986

0.94

0.963

0.92

0.787

0.06

0.45

0.787

0.573

0.92

Classification
Multilayer
Perceptron(negative)
Multilayer
Perceptron(Sick)
Support
Vector(negative)
Machine
Support Vector(Sick)
Machine

k-nearest
neighbor(Negative)
k-nearest
neighbor(Sick)
Naive
Bayesian(Negative)
Naive
Bayesian(Sick)

Table 3 Results summary of 66% split percentage

According to the above results we can conclude J48 has the good classification since it has both TP Rate and FP Rate
higher when used percentage split test.

13 | P a g e

Assignment: 1

Artificial Neural Network

TP Rate

FP Rate

Precision

Recall

FMeasure

ROC Area

Confusion Matrix

0.988

0.26

0.983

0.988

0.985

0.951

0.74

0.012

0.795

0.74

0.767

0.951

a b <-- classified as
1197 10 | a = negative
25 50 | b = sick

0.939

0.968

0.5

J48(negative)

0.995

0.117

0.992

0.995

0.994

0.951

J48(Sick)

0.883

0.005

0.919

0.883

0.901

0.951

0.984

0.377

0.976

0.984

0.98

0.806

0.623

0.016

0.716

0.623

0.667

0.806

0.936

0.225

0.985

0.936

0.96

0.925

0.775

0.064

0.441

0.775

0.562

0.925

Classification
Multilayer
Perceptron(negative)
Multilayer
Perceptron(Sick)
Support
Vector(negative)
Machine
Support Vector(Sick)
Machine

k-nearest
neighbor(Negative)
k-nearest
neighbor(Sick)
Naive
Bayesian(Negative)
Naive
Bayesian(Sick)

Table 4 Results summary of 10 fold split

According to the above results we can conclude J48 has the good classification since it has higher TP Rate higher
when used percentage cross validation 10 fold test.
So finally according to all above classifiers J48 is the good classifier for the sick dataset. Since it has provided better
performance on both cross validation and split percentage.

ROC Curve

14 | P a g e

Assignment: 1

Artificial Neural Network

Fig1 ROC curve for J48 (cross validation-fold 10)

Fig2 ROC curve for J48 (percentage split)

15 | P a g e

Assignment: 1

Artificial Neural Network

Fig3 ROC curve for Nave bayes (cross validation-fold 10)

Fig4 ROC curve for Nave based (percentage split)

The above four ROC curve ,we can identify when we use J48 classifier with cross validation(fold 10) testing method for the
above sick datasets the giving better smooth curve it shows the better classifier is J48 out of all the above five classifier.
If we order the classifier according to the all above result it will be like following order(the lowest numer giving higher
performance).

1.
2.
3.
4.
5.

J48
Nave bayes
Multi layer perception
K-nearest neigbour
LibSVM.

Conclusion
Out of all above results in order to analyze the performance of a classifier though J48 classifier gave the better
performance for the sick dataset, it understood different classifier may give better performance for the different
datasets, which means the performance of a classifier depend on number of instances, number of attributes. But
anyhow in order to classify certain data we have to consider higher number of instances and higher number of
attributes. But finally to take the proper decision we have to run the same datasets through using different
classifier and different testing mode such as different values of cross validation and appropriate percentage split
(but 66% is the standard value).

16 | P a g e

Assignment: 1

Artificial Neural Network

c) Data set used:

Relation:

wine

Instances: 178
Attributes: 14

Class
Alcohol
Malic_acid
Ash
Alcalinity_of_ash
Magnesium
Total_phenols
Flavanoids
Nonflavanoid_phenols
Proanthocyanins
Color_intensity
Hue
OD280/OD315_of_diluted_wines
Proline

Results and Discussion

Classification

Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Kappa
statistic

Mean
absolute
error

Root
mean
squared
error

Relative
absolute
error

Root
relative
squared
error

Multilayer
Perceptron

0.77

97.191

2.809

0.9574

0.0247

0.1172

5.6355

25.0058

Support Vector
Machine

0.11

98.3146

1.6854

0.9745

0.226

0.279

51.4678

59.5404

J48

0.04

93.8202

6.1798

0.9058

0.0486

0.2019

11.0723

43.0865

94.9438

5.0562

0.9238

0.0413

0.1821

9.3973

38.8682

0.01

96.6292

3.3708

0.9489

0.0217

0.1294

4.9371

27.6176

k-nearest
neighbor
Naive
Bayesian

Table 5 Results summary of 10 fold cross validation

Classification

Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Kappa
statistic

Mean
absolute
error

Root
mean
squared
error

Relative
absolute
error

Root
relative
squared
error

Multilayer
Perceptron

0.74

96.7213

3.2787

0.9506

0.0252

0.128

5.6297

26.5694

Support Vector
Machine

0.06

98.3607

1.6393

0.9753

0.2259

0.2788

50.54

57.8844

J48

86.8852

13.1148

0.8027

0.0874

0.2957

19.5639

61.3956

95.082

4.918

0.926

0.0431

0.1792

9.6393

37.2046

0.01

98.3607

1.6393

0.9753

0.0124

0.0713

2.7794

14.8027

k-nearest
neighbor
Naive
Bayesian

Table 6 Results summary of 66% split

17 | P a g e

Assignment: 1

Artificial Neural Network

Time Taken in seconds

When considering time consuming for five

classifier under two test sample method the 66%
split take short time for SVM method of
classification ,but comparatively the cross
validation method take more time than percentage
split.

0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0

Correctly Classified Instances

Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Cross validation 10

66% split

100

Cross validation 10

66% split

95
90

Correctly identified instances show better results

under cross validation test mode. All together all
classifier shows same conclusion except the naive
Bayesian classifier. It gives better results under the
66% split test mode.

85
80
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Incorrectly Classified Instances

Cross validation 10

If we consider the incorrectly identified instances,

again the split validation show poor performance
than cross validation test mode. Under multilayer
perception the graph show greater deviation from
each test mode and cross validation gives better
results under multilayer perception classifier. The
nave Bayesian show low performance in both test
mode.

66% split

14
12
10
8
6
4
2
0
Multilayer
Perceptron

Support
Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Capa statistics coefficient is a statistical measure of

inter-rater agreement or inter-annotator agreement for
qualitative (categorical) items. From capa we can come
to this conclusion.

Capa Statistics
Cross validation 10
1.2
1
0.8

< 0 Less than chance agreement

0.6

0.010.20 Slight agreement

0.2

0.21 0.40 Fair agreement

0.410.60 Moderate agreement

0.610.80 Substantial agreement

0.810.99 Almost perfect agreement

18 | P a g e

66% split

0.4

0
Multilayer Support
Perceptron Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Assignment: 1

Artificial Neural Network

Mean absolute error

0.25

Cross validation 10

The MAE measures the average magnitude of

the errors in a set of five classes. If we consider
the following graph the Support vector
machine classifier shows high mean absolute
error than other classifier. The multilayer
perception shows relatively low absolute error
from others and J48 shows average error rate.
When considering the two train modes there is
no big deviation from each other except the
multilayer perception. Multilayer perception
shows low absolute error under cross
validation training mode.

66% split

0.2
0.15
0.1
0.05
0
Multilayer
Perceptron

Support
Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Root mean squared error /Relative absolute error /Root relative

squared error for Cross validation
RMs
70

RAE

RRSE

59.5404
51.4678

60
50

43.0865

38.8682

27.6176

25.0058

30
20
10

5.6355
0.1172

11.0723
0.279

9.3973

0.2019

0.1821

4.9371
0.1294

Multilayer
Perceptron

Support
Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Root mean squared error /Relative absolute error /Root relative

squared error for 66% split
RMs
70

RAE

61.3956

57.8844
50.54

RRSE

50
37.2046

40
25.5694

19.5639

20
10

5.6297
0.128

9.6393
0.2788

0.2957

0.1792

14.8027
2.7794
0.0713

0
Multilayer
Perceptron

Support
Vector
Machine

J48

k-nearest
neighbor

Naive
Bayesian

Above two graphs are showing comparison of different error parameters, considerably the multilayer
perception classifier showing good results that means lower error rate. Except others, but Support vector
Machine and J48 show high amount of error in determining the five classes.
19 | P a g e

Assignment: 1

Artificial Neural Network

10-fold Cross-Validation
100%
80%
60%
40%
20%

0%
Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Multilayer
Perceptron

Kappa
statistic

Support
Vector
Machine

Mean
absolute
error
J48

Root mean
squared
error

k-nearest
neighbor

Relative
absolute
error

Root relative
squared
error

Naive
Bayesian

Split 66.0% train, remainder test

100%

80%
60%
40%
20%
0%
Time
taken
seconds

Correctly
Classified
Instances

Incorrectly
Classified
Instances

Multilayer
Perceptron

Kappa
statistic

Support
Vector
Machine

Mean
absolute
error
J48

Root mean
squared
error

k-nearest
neighbor

Relative
absolute
error

Root relative
squared
error

Naive
Bayesian

The above two graphs are showing the compared performance matrices in percentage. The close look of this
graph showing no significant changes between the parameters. The lower level showing good performance and
higher percentage showing lower performance. Also if we consider training mode the 10 fold cross validation
showing significant performance than 66% of split. This results proved that multilayer perception is the best
classifier for the Wine dataset and nave Bayesian is the lowest.

Final Conclusion
Finally, This study focuses on finding the right algorithm for classification of data that works better on diverse
data sets. However, it is observed that the accuracies of the tools vary depending on the data set used. It
should also be noted that classifiers of a particular group also did not perform with similar accuracies. Overall,
the results indicate that the performance of a classifier depends on the data set, then number of instances
especially on the number of attributes used in the data set and one should not rely completely on a particular
algorithm for their study. So, we recommend that users should try their data set on a set of classifiers and choose
the best one.

20 | P a g e

Assignment: 1

Artificial Neural Network

References
1. Gopala Krishna, Bharath Kumar and Nagaraju Orsu Performance Analysis and Evaluation of Different Data
Mining Algorithms used for Cancer Classification, (IJARAI) International Journal of Advanced Research in
Artificial Intelligence, Vol. 2, No.5, 2013.
2. Mohd Fauzi bin Othman and Thomas Moh Shan Yau Comparison of Different Classification Techniques
Using WEKA for Breast Cancer IFMBE Proceedings Vol. 15.2007.
3. Rohit Arora and Suman Comparative Analysis of Classification Algorithms onDifferent Datasets using
WEKA, International Journal of Computer Applications (0975 8887),Volume 54 No.13, September 2012
4. Samrat Singh and Vikesh Kumar Performance Analysis of Engineering Students for Recruitment Using
Classification Data Mining Techniques Samrat Singh et al , IJCSET , Vol 3, Issue 2, 31-37 ,February 2013 .

21 | P a g e

Analysis of Machine Learning Algorithms Using WEKA: Aaditya Desai Dr. Sunil Rai
No ratings yet
Analysis of Machine Learning Algorithms Using WEKA: Aaditya Desai Dr. Sunil Rai
6 pages
Classifier Performance in Energy Design
No ratings yet
Classifier Performance in Energy Design
7 pages
German Credit Score Classification Analysis
No ratings yet
German Credit Score Classification Analysis
9 pages
Comparing CART, KNN, and PCA Methods
No ratings yet
Comparing CART, KNN, and PCA Methods
7 pages
Decision Tree Algorithms Comparison in WEKA
No ratings yet
Decision Tree Algorithms Comparison in WEKA
7 pages
Machine Learning Classifier Assignment
No ratings yet
Machine Learning Classifier Assignment
6 pages
Decision Tree Algorithms Comparison in WEKA
No ratings yet
Decision Tree Algorithms Comparison in WEKA
7 pages
41 j48 Naive Bayes Weka
No ratings yet
41 j48 Naive Bayes Weka
5 pages
Performance Assessment of Different Classification Techniques For Intrusion Detection
No ratings yet
Performance Assessment of Different Classification Techniques For Intrusion Detection
5 pages
Minimalist Business Slides XL by Slidesgo
No ratings yet
Minimalist Business Slides XL by Slidesgo
27 pages
Evaluating Classifiers with Mc Nemar's Test
No ratings yet
Evaluating Classifiers with Mc Nemar's Test
13 pages
Comparative Analysis of Classification Algorithms On Diferrent Dataset Using Weka SW PDF
No ratings yet
Comparative Analysis of Classification Algorithms On Diferrent Dataset Using Weka SW PDF
5 pages
Understanding Pattern Recognition Techniques
No ratings yet
Understanding Pattern Recognition Techniques
46 pages
Machine Learning Project Report: Regression & Classification
No ratings yet
Machine Learning Project Report: Regression & Classification
10 pages
Supervised Learning Classification Algorithms Comparison
No ratings yet
Supervised Learning Classification Algorithms Comparison
6 pages
Performance of Three Classifiers Analysis
No ratings yet
Performance of Three Classifiers Analysis
4 pages
Comparative Analysis of Classification Algorithms Using Weka
No ratings yet
Comparative Analysis of Classification Algorithms Using Weka
12 pages
Comparison of Airline Classification Algorithms
No ratings yet
Comparison of Airline Classification Algorithms
4 pages
Classification and Prediction Techniques
No ratings yet
Classification and Prediction Techniques
19 pages
ANN Tools and WEKA Experiments
No ratings yet
ANN Tools and WEKA Experiments
14 pages
Divorce Prediction Using ML
No ratings yet
Divorce Prediction Using ML
12 pages
DWDM Lab Manual for B.Tech Students
No ratings yet
DWDM Lab Manual for B.Tech Students
46 pages
Classification FoundationalMathofAI S24
No ratings yet
Classification FoundationalMathofAI S24
6 pages
Hybrid Model for Enhanced Classifier Accuracy
No ratings yet
Hybrid Model for Enhanced Classifier Accuracy
6 pages
Experiment 1 Aim:: Introduction To ML Lab With Tools (Hands On WEKA On Data Set (Iris - Arff) ) - (A) Start Weka
No ratings yet
Experiment 1 Aim:: Introduction To ML Lab With Tools (Hands On WEKA On Data Set (Iris - Arff) ) - (A) Start Weka
55 pages
Scalable SVM and Classification Techniques
No ratings yet
Scalable SVM and Classification Techniques
6 pages
Pattern Recognition
No ratings yet
Pattern Recognition
33 pages
A Study of Some Data Mining Classification Techniques
No ratings yet
A Study of Some Data Mining Classification Techniques
4 pages
Feature Selection in IDS Using KDD-99
No ratings yet
Feature Selection in IDS Using KDD-99
44 pages
Classification Algorithms Overview
No ratings yet
Classification Algorithms Overview
20 pages
Survey of Data Mining Classification Techniques
No ratings yet
Survey of Data Mining Classification Techniques
6 pages
Software Reusability Prediction Techniques
No ratings yet
Software Reusability Prediction Techniques
6 pages
Classification Notes
No ratings yet
Classification Notes
14 pages
Predicting Campus Placement with Data Mining
No ratings yet
Predicting Campus Placement with Data Mining
10 pages
CSCI946 W5-Classification
No ratings yet
CSCI946 W5-Classification
72 pages
Weka Data Mining Project Report 2019-20
No ratings yet
Weka Data Mining Project Report 2019-20
33 pages
Course Work AI - Foundation
No ratings yet
Course Work AI - Foundation
12 pages
Comparative Analysis of ML Classification
No ratings yet
Comparative Analysis of ML Classification
10 pages
Post Op Weka Data Set Sample PDF
No ratings yet
Post Op Weka Data Set Sample PDF
8 pages
Data Mining End 23 24
No ratings yet
Data Mining End 23 24
2 pages
Classification and Prediction in Data Mining
No ratings yet
Classification and Prediction in Data Mining
30 pages
Survey of Classification Techniques in Data Mining: Open Access
No ratings yet
Survey of Classification Techniques in Data Mining: Open Access
10 pages
Data Mining: Classification & Prediction Techniques
No ratings yet
Data Mining: Classification & Prediction Techniques
58 pages
Classification Algorithms in Data Mining
No ratings yet
Classification Algorithms in Data Mining
3 pages
Assignment 3 Masters of Engineering Project Report Template Copy
No ratings yet
Assignment 3 Masters of Engineering Project Report Template Copy
11 pages
Data Mining: Classification Techniques
No ratings yet
Data Mining: Classification Techniques
35 pages
Latest Data Mining Lab Manual
No ratings yet
Latest Data Mining Lab Manual
74 pages
V6i5 0268
No ratings yet
V6i5 0268
7 pages
Review of Data Mining Classification Techniques
No ratings yet
Review of Data Mining Classification Techniques
4 pages
2013 Selection of The Best Classifier From Different Datasets Using WEKA PDF
No ratings yet
2013 Selection of The Best Classifier From Different Datasets Using WEKA PDF
8 pages
Naive Bayes vs J48: Classification Performance
No ratings yet
Naive Bayes vs J48: Classification Performance
6 pages
Weka Data Mining Techniques Overview
No ratings yet
Weka Data Mining Techniques Overview
14 pages
Classification Techniques Overview
No ratings yet
Classification Techniques Overview
141 pages
DM Assignment 2
No ratings yet
DM Assignment 2
23 pages
Income Prediction
No ratings yet
Income Prediction
19 pages
Amlt Bca Unit-1
No ratings yet
Amlt Bca Unit-1
24 pages
Leaf Disease Detection with ML in Python
No ratings yet
Leaf Disease Detection with ML in Python
9 pages
Dar es Salaam Peri-Urban Land Use Changes
No ratings yet
Dar es Salaam Peri-Urban Land Use Changes
15 pages
Sequence Classification Mechanisms
No ratings yet
Sequence Classification Mechanisms
4 pages
Circular 20240429201652 Cho Ai and ML 22cs015 v2
No ratings yet
Circular 20240429201652 Cho Ai and ML 22cs015 v2
8 pages
Mobile Phone Price Prediction A Comparative Study
No ratings yet
Mobile Phone Price Prediction A Comparative Study
7 pages
Data Mining Techniques Unit 2
No ratings yet
Data Mining Techniques Unit 2
48 pages
Python IEEE Projects 2023-2024
No ratings yet
Python IEEE Projects 2023-2024
8 pages
Learning in Artificial Intelligence
67% (6)
Learning in Artificial Intelligence
8 pages
Flower Classification With Deep CNN and Machine Learning Algorithms
No ratings yet
Flower Classification With Deep CNN and Machine Learning Algorithms
5 pages
Naïve Bayes Classifier for Image Classification
No ratings yet
Naïve Bayes Classifier for Image Classification
5 pages
ML Theory Questions Final
No ratings yet
ML Theory Questions Final
3 pages
LectureNote NeuralNetwork II
No ratings yet
LectureNote NeuralNetwork II
29 pages
Understanding Machine Learning Types and Techniques
No ratings yet
Understanding Machine Learning Types and Techniques
6 pages
Mini Project Report
No ratings yet
Mini Project Report
42 pages
AI Chatbot for University Students
No ratings yet
AI Chatbot for University Students
141 pages
Analytics Interview Preparation Guide
No ratings yet
Analytics Interview Preparation Guide
19 pages
Machine Learning for Tweet Segregation
No ratings yet
Machine Learning for Tweet Segregation
4 pages
Understanding Autoencoders Explained
No ratings yet
Understanding Autoencoders Explained
32 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
31 pages
Downloading YelpCHI Dataset for Analysis
No ratings yet
Downloading YelpCHI Dataset for Analysis
20 pages
Gaussian Naive Bayes & K-Means Analysis
No ratings yet
Gaussian Naive Bayes & K-Means Analysis
14 pages
Phishing Website Detection Tool Using ML
No ratings yet
Phishing Website Detection Tool Using ML
17 pages
Spot Welding
No ratings yet
Spot Welding
15 pages
Theory Building in Management
No ratings yet
Theory Building in Management
18 pages
Understanding Deep Learning Concepts
No ratings yet
Understanding Deep Learning Concepts
74 pages
Development and Validation of A Learning Analytics Framework - Two Case Studies Using Support Vector Machines
No ratings yet
Development and Validation of A Learning Analytics Framework - Two Case Studies Using Support Vector Machines
20 pages
Ieee Conference Paper Template
No ratings yet
Ieee Conference Paper Template
5 pages
Machine Learning Classification Techniques
No ratings yet
Machine Learning Classification Techniques
22 pages
Intelligent Assistant for Linux CLI
No ratings yet
Intelligent Assistant for Linux CLI
7 pages
Letter Recognition Using HOlland-Style Adaptive Classifiers PDF
No ratings yet
Letter Recognition Using HOlland-Style Adaptive Classifiers PDF
22 pages