
SRIDEVI WOMEN’S ENGINEERING COLLEGE

V.N. PALLY, GANDIPET - 500075
Accredited by NBA and NAAC
Affiliated to JNTUH, Approved by AICTE, New Delhi

Department of Electronics & Communication Engineering

Mini Project Seminar On


Classification Of IRIS Dataset Using Classification Based KNN
Algorithm in Supervised Learning

Under the Guidance of:
Dr. S. Jagadeesh

Presented By:
Allam Sowmya : 17D21AO463
Maduri Nandini : 17D21A0498
Moddu Bhargavi : 17D21A0499
CONTENTS
• ABSTRACT

• INTRODUCTION

• METHODOLOGY

• EXISTING SYSTEM

• PROPOSED SYSTEM

• SOFTWARE & HARDWARE REQUIREMENTS

• DATASET

• UML DIAGRAMS

• IMPLEMENTATION

• RESULT

• CONCLUSION

• FUTURE SCOPE

• REFERENCES
ABSTRACT
Datasets are hard to handle, predict from and classify by hand, so we use machine learning
algorithms for these tasks. We use the K-Nearest Neighbour (KNN) algorithm for classification
and linear regression for prediction. Using KNN, we classify a test data point into one of
several classes, and we visualize all the data points using various plots. We use the Iris
flower dataset, which covers three species of Iris flower (Setosa, Versicolor and Virginica)
described by four features, and we predict the species of a test flower using linear regression
and the KNN algorithm. We apply and evaluate both the KNN algorithm and linear regression on
the Iris dataset.
INTRODUCTION
• Machine learning is about making predictions on unseen (testing) data. A
computer first learns to perform a task from a training dataset, and then
performs the same task on the testing data. In supervised learning we pass
both input and output data, so the expected result is already known.
Supervised learning is of two types: classification based and regression
based. We use classification-based supervised learning. KNN is a simple
algorithm that stores all available cases and classifies new cases based on
a similarity measure.
• The implementation of the model includes six basic steps of machine
learning:
1. Collect and prepare the data.
2. Choose the algorithm.
3. Create an object of the model.
4. Train the model on the training dataset.
5. Make predictions on unseen (testing) data.
6. Evaluate the model.
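The six steps can be sketched end to end as follows. This is a minimal sketch, assuming scikit-learn is used and the Iris data is loaded from its bundled copy; the value k = 5, the 70/30 split and the random seed are illustrative assumptions, not settings stated in the slides.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# 1. Collect and prepare the data: 150 samples, 4 features, 3 classes.
iris = load_iris()
X, y = iris.data, iris.target
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, shuffle=True, stratify=y, random_state=0
)

# 2. Choose the algorithm, and 3. create an object of the model.
#    n_neighbors=5 is an assumed value for illustration.
model = KNeighborsClassifier(n_neighbors=5)

# 4. Train the model on the training dataset.
model.fit(X_train, y_train)

# 5. Make predictions on the unseen (testing) data.
y_pred = model.predict(X_test)

# 6. Evaluate the model: fraction of test flowers classified correctly.
accuracy = accuracy_score(y_test, y_pred)
print(f"Test accuracy: {accuracy:.2f}")
```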
• The dataset contains 150 samples in 3 classes (setosa, versicolor,
virginica), each with 50 samples. To train the machine we split the dataset
into two parts, a training dataset and a testing dataset. The machine is
trained on the training dataset and then tested on the testing dataset. We
then evaluate whether the model recognizes the iris species accurately.
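The class balance and the effect of the split can be checked directly. The sketch below assumes scikit-learn's bundled copy of the dataset and an 80/20 stratified split; the split ratio and seed are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

iris = load_iris()

# Count how many samples belong to each species.
class_counts = {
    str(name): int(count)
    for name, count in zip(iris.target_names, np.bincount(iris.target))
}
print(class_counts)  # {'setosa': 50, 'versicolor': 50, 'virginica': 50}

# Split the sample indices into training and testing parts.
# stratify keeps the three species equally represented in both parts.
train_idx, test_idx = train_test_split(
    np.arange(len(iris.target)),
    test_size=0.2,
    shuffle=True,
    stratify=iris.target,
    random_state=42,
)
train_counts = np.bincount(iris.target[train_idx])
test_counts = np.bincount(iris.target[test_idx])
print("train per class:", train_counts, "test per class:", test_counts)
```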

Fig 1: Setosa    Fig 2: Versicolor    Fig 3: Virginica


K-NEAREST NEIGHBOUR (KNN) ALGORITHM FOR
MACHINE LEARNING

K-Nearest Neighbour is one of the simplest machine learning
algorithms based on the supervised learning technique. The K-NN
algorithm assumes similarity between the new case and the
available cases and puts the new case into the category it is most
similar to. K-NN stores all the available data and classifies a new
data point based on similarity. This means that when new data
appears, it can easily be classified into a well-suited category
using the K-NN algorithm.
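The idea of "store everything, then vote among the most similar cases" can be written in a few lines. This is a from-scratch sketch, not the project's implementation: it assumes Euclidean distance as the similarity measure and k = 3, and the six stored points are illustrative (petal length, petal width) values.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Straight-line distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train_points, train_labels, query, k=3):
    """Return the majority label among the k stored points nearest to query."""
    distances = sorted(
        (euclidean(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Stored cases: (petal length, petal width) in cm, illustrative values.
train_points = [(1.4, 0.2), (1.5, 0.2), (1.3, 0.3),
                (4.5, 1.5), (4.7, 1.4), (4.4, 1.3)]
train_labels = ["setosa", "setosa", "setosa",
                "versicolor", "versicolor", "versicolor"]

print(knn_predict(train_points, train_labels, (1.6, 0.25)))  # setosa
print(knn_predict(train_points, train_labels, (4.6, 1.45)))  # versicolor
```

Note that there is no training step beyond storing the data; all the work happens at prediction time.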
METHODOLOGY
• Our intention is to design a model that can automatically and accurately recognize the iris
species.
• For that we collected and prepared the data, which involves data preprocessing and splitting
of the data. Data preprocessing involves handling missing data, handling categorical data and
feature scaling.
• Categorical data comprises nominal data and ordinal data, which can be handled with pandas as
well as machine learning libraries. For handling missing data and feature scaling we use pandas
and machine learning libraries respectively.
• Splitting the dataset produces training data and testing data. We shuffle the data so that
there is no particular sequence in either the training or the testing dataset.
• K-Nearest Neighbors is a simple supervised machine learning algorithm that classifies a
data point based on how its neighbors are classified.
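The preprocessing steps named above can be sketched on a tiny table. This is an illustration under assumptions: the six rows are made-up values with one deliberately missing measurement, mean imputation is one possible way to handle missing data, and scikit-learn's LabelEncoder and StandardScaler stand in for the "machine learning" side of the tooling.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler

# Tiny illustrative table (made-up values) with one missing measurement.
df = pd.DataFrame({
    "sepal_length": [5.1, 4.9, np.nan, 6.4, 6.3, 5.8],
    "petal_length": [1.4, 1.4, 4.7, 4.5, 6.0, 5.1],
    "species": ["setosa", "setosa", "versicolor",
                "versicolor", "virginica", "virginica"],
})

# Missing data: fill the gap with the column mean, using pandas.
df["sepal_length"] = df["sepal_length"].fillna(df["sepal_length"].mean())

# Categorical data: map the nominal species names to integer codes.
encoder = LabelEncoder()
y = encoder.fit_transform(df["species"])
X = df[["sepal_length", "petal_length"]].to_numpy()

# Splitting: shuffle so neither part keeps the original row order.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, shuffle=True, random_state=0
)

# Feature scaling: learn mean and spread on the training part only,
# then apply the same transformation to the testing part.
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)
```

Scaling matters for KNN in particular because distances are dominated by whichever feature has the largest numeric range.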
EXISTING SYSTEM
The existing system classifies Iris flowers using a neural network. The
Iris flower dataset comes preloaded in MATLAB and is clustered into
three different species using the neural network clustering tool in
MATLAB. This tool is mainly used for clustering large datasets
without any supervision.
PROPOSED SYSTEM
• The Iris flower data is collected. There are 150 data samples in
total, belonging to three species of Iris flower: Setosa,
Versicolor and Virginica. The collected Iris dataset is loaded
into the machine learning model.
• Kaggle provides several standard datasets, for example the Iris
dataset for classification. The IRIS dataset is imported from Kaggle.
MODULES IN PROPOSED SYSTEM
• Upload Dataset

• Preprocess Dataset

• Split Train Test Dataset

• Build KNN model

• Make Prediction

• Iris Graph
SOFTWARE REQUIREMENTS
OPERATING SYSTEM : Windows 7, Windows 8 (or higher versions)

LANGUAGE : Python 3.7

BROWSER : Mozilla Firefox (or any browser)

HARDWARE REQUIREMENTS
PROCESSOR : Pentium 3 (or higher versions)
RAM : 1 GB or higher

HARD DISK : At least 40 GB


DATASET
• Petal length

• Petal width

• Sepal length

• Sepal width
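The four features can be inspected as a table. This sketch assumes scikit-learn's bundled copy of the Iris data rather than the Kaggle file; the column names are the ones that copy provides, and all measurements are in centimetres.

```python
import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris()

# One row per flower, one column per feature, plus the species name.
df = pd.DataFrame(iris.data, columns=iris.feature_names)
df["species"] = iris.target_names[iris.target]

print(df.head())

# Average of each feature per species.
print(df.groupby("species").mean())
```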
UML DIAGRAMS

A UML diagram is a diagram based on the Unified Modeling Language with the purpose of visually
representing a system along with its main actors, roles, actions, artifacts or classes, in order to better
understand, alter, maintain, or document information about the system.

• CLASS DIAGRAM

• SEQUENCE DIAGRAM

• ACTIVITY DIAGRAM

• COMPONENT DIAGRAM
CLASS DIAGRAM

Class diagrams are the backbone of
almost every object-oriented
method, including UML. They
describe the static structure of a
system.
SEQUENCE DIAGRAM
Sequence diagrams describe
interactions among classes in
terms of an exchange of
messages over time.
ACTIVITY DIAGRAM

Activity diagrams illustrate the
dynamic nature of a system by
modeling the flow of control from
activity to activity.
COMPONENT DIAGRAM

Component diagrams describe the
organization of physical software
components, including source code,
run-time (binary) code, and executables.
IMPLEMENTATION

• Data pre-processing step.
• Fitting the K-NN algorithm to the training set.
• Predicting the test result.
• Testing the accuracy of the result.
• Visualizing the test set result.
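The accuracy-testing step can be broken down per species with a confusion matrix. The sketch below is one possible way to do it with scikit-learn; the feature scaling, k = 5, the 75/25 split and the seed are assumptions for illustration, not settings taken from the project.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.25, stratify=iris.target, random_state=1
)

# Pre-processing: scale features using statistics from the training set only.
scaler = StandardScaler().fit(X_train)

# Fit K-NN to the training set (k=5 is an assumed value).
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(scaler.transform(X_train), y_train)

# Predict the test result.
y_pred = knn.predict(scaler.transform(X_test))

# Test the accuracy: rows are true species, columns are predicted species,
# so correct predictions lie on the diagonal.
cm = confusion_matrix(y_test, y_pred)
accuracy = accuracy_score(y_test, y_pred)
print(cm)
print(f"Accuracy: {accuracy:.2f}")
```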
RESULT

In the plot, the x-axis represents sepal length and the
y-axis represents petal length. Each dot is one record,
coloured by species: blue for setosa, orange for
versicolor and green for virginica.

From the Iris Graph we can see that the three
species, i.e. Setosa, Versicolor and Virginica, are separated.
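A plot of this kind can be reproduced with a short script. This is a sketch assuming matplotlib and scikit-learn's bundled Iris data; the output file name `iris_graph.png` is hypothetical. Plotting the species in order with matplotlib's default colour cycle gives blue, orange and green, matching the description above.

```python
import matplotlib
matplotlib.use("Agg")  # draw off-screen so the script also runs without a display
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris

iris = load_iris()
sepal_length = iris.data[:, 0]  # column 0: sepal length (cm)
petal_length = iris.data[:, 2]  # column 2: petal length (cm)

fig, ax = plt.subplots()
# One scatter call per species; default colours are blue, orange, green.
for class_index, species in enumerate(iris.target_names):
    mask = iris.target == class_index
    ax.scatter(sepal_length[mask], petal_length[mask], label=species)

ax.set_xlabel("Sepal length (cm)")
ax.set_ylabel("Petal length (cm)")
ax.set_title("Iris Graph")
ax.legend()
fig.savefig("iris_graph.png")  # hypothetical output file name
```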
CONCLUSION

In this analysis we tried to build a model that can accurately
recognize the iris species, classifying flowers into 3 classes
using the IRIS dataset and the KNN algorithm.
FUTURE SCOPE

We built our model using machine learning, but deep learning models could give
better results. This is a basic project, and more features can be added for
better results. The number of records in our dataset is small, so we can also
add more records to the dataset to obtain more accurate results.
REFERENCES

1. https://www.geeksforgeeks.org/supervised-unsupervised-learning/
2. https://medium.com/@adi.bronshtein/a-quick-introduction-to-k-nearest-neighbors-algorithm-62214cea29c7
3. https://www.geeksforgeeks.org/k-nearest-neighbours/
THANK YOU
