Professional Documents
Culture Documents
Ensemble Classifier
a,b
Department of Computer Science and Engineering, Faculty of Engineering and
Technology, SRM Institute of Science and Technology (Vadapalani Campus)
Chennai, India
raina.arya@rediffmail.com
With the proliferation of internet into daily lives of people and the growing
dependence on e-commerce and e-services, the vulnerability of networks to
attacks is also becoming a major issue. The need of the hour is robust Intrusion
detection systems capable of effectively detecting network attack behaviour,
which is pivotal to the network security. Many systems proposed in the past using
single classifiers are not able to achieve higher performance metrics like
accuracy, precision and recall as the nature and behaviour of Malware is
unpredictable. In this paper, a hybrid network intrusion detection model based on
neural network and ensemble classifiers is proposed. The proposed method learns
the effective features using optimized neural networks and machine learning
algorithms, and the test results are produced in conjunction with the voting
ensemble classifier. The KDD-CUP 99 standard network intrusion detection
dataset is used for evaluation and experimentation. The experimental results show
that the multiclassification network intrusion detection model proposed in this
paper improves the performance evaluation metrics and provides stable results.
Introduction
With businesses growing across lengths and breadths, the probability of the security
breach of private data available online has also increased significantly. The reliance of
network and securing private assets is increasing day-by-day [1]. These systems are also
being used for continual monitoring and reporting regarding any abnormal activity or
includes normal network traffic patterns [2] and then detect abnormalities from those.
Anomaly intrusion detection offers the intriguing ability in detecting odd attacks even
before they have been categorized by the security analysts, and being capable of
detecting differences on the existing attack methods. In our proposed system, we aim to
In order to create data for the Intrusion Detection System [3], it’s needed to configure
the real working environment to analyze all the probabilities of attacks, which is not
cost efficient. Data validation, data pre-processing, feature engineering are the phases
involved. The data analysis phase (data validation, pre-processing, feature engineering)
of data and deciding on how to organize, classify, interrelate, compare and display the
applications, remote sensing, etc. and have standard and legitimate ground truth
databases for analysis. Likewise, most of the computer Network IDS uses the KDD
The paper proposes an approach which uses combines a voting ensemble classifier with
a neural network model to give accurate and consistent results for the type of attack that
a network has been subjected to. The algorithm tests the proposed model on other
Classifiers with the neural network model to prove that the voting ensemble classifier in
The paper is organised as follows. Section 2 throws some light on the work
Proposed Methodology
The network intrusion detection system is a hybrid system because it uses an ensemble
classifier with ML algorithms and a neural network model, which is translated into a
classifier. Here we use the whole 10% of the KDDCup-99 dataset for accurate results.
The traditional ML algorithms work better for organized data and uniform data. The
neural network model performs better when we keep adding data. So, if we keep adding
The dataset is divided into 2, training and testing, dummy variable(Y) is created
with the outcome column for the neural network model. Then the neural network model
is built and the model is converted into a classifier, so, that it can be sent to the
The ML classifiers and the keras classifier (neural network model classifier) is
given in the voting classifier as estimators. The voting classifier is then trained and
tested.
In general, this method predicts and gives the best possible and consistent
Results:
Discussion of Results:
The results show that the voting ensemble classifier gives an accuracy of 99.96%. The
random forest classifier also gives a better accuracy but when more data is added in the
future, the accuracy and consistency of voting ensemble classifier will be better because
of the neural network model (keras classifier), the same cannot be said for random
forest classifier or any other ML classifiers that’s been used.
Conclusion:
The voting ensemble classifier produces consistent and accurate results, than that of
other ML algorithms and the keras classifier for diverse datasets.