You are on page 1of 1

Abstract

The aim of this paper is to study on different data sets of data mining field for prediction more often to
generate exact results for future purpose. This paper focus on identifying the best classifier algorithm and
displaying it by a predictive data mining model. Five Real World dataset is taken and feature extraction of
desired potential variables is done using WEKA an Open Source Tool. The five dataset record is tested
and applied on various classification algorithms such as Nave Bayes, Decision tree, Support Vector
Machine, Neural Network, K-Nearest Neighbor using weka an Open source tool. As a result, a table is
generated based on all classification algorithms and comparison of all five classifiers is also done in order
to predict the accuracy and to find the best performing classification algorithm among all. Finally, the
algorithm that produces the highest accuracy was chosen as the most successful algorithm for modeling a
specific dataset. In this paper, Area Under The Curve (AUC) is also shown which means that, if the curve
lies between 0 and 1, this increasingly recognized as a better measure for evaluating algorithm
performance than accuracy. A bigger AUC value implies a better ranking performance for a classifier.
This paper showcases the importance of Prediction and Classification based data mining algorithms in
five different field and also presents some promising future lines.

You might also like