External PPT - Animesh Singh-200301120038
AND MANAGEMENT
GUIDED BY
DR. SANGRAM KESHARI SWAIN
Team 1 Animesh Singh-200301120038
2 Avinash Kumar-200301120002
3 Abhishek Raj-200301120025
4 Aditya Raj-200301120020
5 Vishal Mandal-200301120055
Experiment 1
Aim - Demonstration of
preprocessing on dataset student.arff
LET'S BEGIN!
What is data preprocessing?
Data preprocessing is the conversion of raw data into a cleaner, more usable form so that useful
information can be derived from it.
Data discretization refers to a method of converting a large number of continuous data values into a
smaller number of intervals (bins) so that the evaluation and management of the data become easier.
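As a minimal sketch of the idea, the snippet below performs equal-width binning in plain Python. It is not Weka's Discretize filter; the function name `equal_width_bins`, the sample values, and the choice of three bins are assumptions made for illustration.

```python
def equal_width_bins(values, n_bins):
    """Map each numeric value to a bin index in the range 0 .. n_bins - 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so everything falls into one bin.
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would compute to index n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


ages = [25, 27, 28, 29, 30, 35, 48]  # illustrative sample values
bins = equal_width_bins(ages, 3)
print(bins)  # [0, 0, 0, 0, 0, 1, 2]
```

Seven distinct values collapse into three interval labels, which is the reduction the definition above describes.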
Association means that one variable provides information about another, while correlation means
that two variables show an increasing or decreasing trend together.
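A small sketch of how such a trend can be measured: the Pearson correlation coefficient is close to +1 when two variables rise together and close to -1 when one falls as the other rises. The variable names and sample numbers here are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


hours = [1, 2, 3, 4, 5]        # hypothetical study hours
marks = [52, 55, 61, 64, 70]   # hypothetical exam marks
r = pearson(hours, marks)
print(round(r, 3))  # close to +1: an increasing trend
```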
First we have to load the dataset
Discretization
Demonstration of preprocessing on
dataset labor.arff
LET'S BEGIN!
First we have to load the dataset
Discretization
LET'S BEGIN!
Association rule
LET'S BEGIN!
First we have to load the dataset
Click the Start button; the result of applying the Apriori algorithm to the loaded
data then appears in the output panel on the right.
Experiment 5
Aim - Demonstration of
classification rule process on
dataset student.arff using J48
algorithm
LET'S BEGIN!
First we have to load the dataset
LET'S BEGIN!
7: Demonstration of association rule process on dataset contactlenses.arff
using Apriori algorithm.
• The Apriori algorithm is used to discover association rules between objects.
• It is an influential algorithm that is widely used in the field of
data mining and association rule learning.
• It identifies the frequent itemsets in a dataset and generates association
rules from those itemsets.
• An association rule describes how two or more objects are related to one another.
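The points above can be sketched as a level-wise search for frequent itemsets. This is a simplified illustration in plain Python, not Weka's implementation; the basket data, the function name `apriori`, and the 0.5 support threshold are assumptions.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset meeting min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {c for c in current if support(c) >= min_support}

    frequent = {}
    k = 1
    while current:
        for c in current:
            frequent[c] = support(c)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in frequent for s in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return frequent


baskets = [  # hypothetical transactions
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
result = apriori(baskets, min_support=0.5)
for itemset, sup in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), sup)
```

The pair {milk, butter} appears in only one of four baskets, so it is dropped, and the prune step then rules out the three-item candidate without counting it.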
First we have to load the dataset
• Clicking on the Associate tab brings up the interface for the association rule
algorithms.
• We will use the Apriori algorithm. This is the default algorithm.
• To change the parameters for the run (for example support, confidence, etc.),
we click on the text box immediately to the right of the Choose button.
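To make the two parameters concrete, the sketch below computes the support and confidence of a single rule and checks it against a threshold. The basket data and the 0.9 threshold are illustrative assumptions, not values taken from the experiment.

```python
def support(transactions, itemset):
    """Fraction of transactions containing every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= set(t) for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent.

    Assumes the antecedent occurs in at least one transaction.
    """
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


baskets = [  # hypothetical transactions
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
min_confidence = 0.9  # illustrative threshold

conf = confidence(baskets, {"butter"}, {"bread"})
keep_rule = conf >= min_confidence
print(conf, keep_rule)  # 1.0 True
```

Every basket with butter also contains bread, so butter -> bread has confidence 1.0 and passes the threshold; the reverse rule bread -> butter reaches only 2/3 and would be rejected.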
Experiment 8
Aim - Demonstration of
classification rule process on dataset
employee.arff using J48 algorithm
LET'S BEGIN!
8: Demonstration of classification rule process on dataset employee.arff using J48
algorithm
• The J48 algorithm is a classification algorithm that produces decision trees based
on information theory.
• J48 is Weka's implementation of C4.5, an extension of Ross Quinlan’s earlier ID3
algorithm; the J stands for Java.
• The decision trees generated by C4.5 are used for classification, and for this
reason C4.5 is often referred to as a statistical classifier.
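The information-theoretic idea behind these trees can be sketched with entropy and information gain, the criterion ID3 uses to pick the attribute to split on. C4.5 refines it into the gain ratio, which is not shown here. The rows and function names below are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def information_gain(rows, attr_index, label_index=-1):
    """Entropy reduction obtained by splitting rows on one attribute."""
    labels = [r[label_index] for r in rows]
    groups = {}
    for r in rows:
        groups.setdefault(r[attr_index], []).append(r[label_index])
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


rows = [  # hypothetical (age_group, salary_band, performance) records
    ("young", "low", "poor"),
    ("young", "high", "poor"),
    ("old", "low", "good"),
    ("old", "high", "good"),
]
print(information_gain(rows, 0))  # age_group separates the classes perfectly
print(information_gain(rows, 1))  # salary_band tells us nothing here
```

A tree learner would place the attribute with the higher gain, here the first one, at the root.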
First we have to load the dataset
In Notepad
@relation employee
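@attribute age {25, 27, 28, 29, 30, 35, 48}
% The source does not declare the second data column; the name "salary" is assumed here.
@attribute salary {10k, 15k, 17k, 20k, 25k, 32k, 35k}
@attribute performance {good, avg, poor}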
@data
25, 10k, poor
27, 15k, poor
27, 17k, poor
28, 17k, poor
29, 20k, avg
30, 25k, avg
29, 25k, avg
30, 20k, avg
35, 32k, good
48, 35k, good
48, 32k, good
STEP 4:
Under “Test options” in the main panel, we
select 10-fold cross-validation as our
evaluation approach.
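As a rough sketch of what 10-fold cross-validation does, the code below splits the sample indices into ten folds and uses each fold once as the test set while training on the other nine. It shows plain, unstratified splitting; the function name, the seed, and the sample count of 20 are assumptions, and the tool's own procedure may differ in detail.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) for each of the k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


splits = list(k_fold_indices(20, k=10))
for train, test in splits[:2]:
    print(len(train), "training rows,", len(test), "test rows")
```

Each row is tested exactly once, and the reported accuracy is the average over the ten runs.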
STEP 5:
We now click “Start” to generate the model. The ASCII
version of the tree, as well as the evaluation statistics, will
appear in the right panel when the model construction
is complete.
Experiment 9
Aim - Demonstration of clustering rule
process on dataset iris.arff using simple
k-means
LET'S BEGIN!
Step 1
Step 2
Step 4
Before running the clustering algorithm, we must ensure that one
of the options in the ‘Cluster mode’ panel is selected. Here the
option to use the training set is selected, and then
the ‘Start’ button is pressed. The screenshots below display the
process and the resulting window.
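What the clustering run computes can be sketched in a few lines of plain Python: k-means alternates between assigning each point to its nearest centroid and moving each centroid to the mean of its points. This is an illustration under assumptions, not Weka's SimpleKMeans; the 2-D points are hypothetical, and the initial centroids are passed in explicitly so the run is reproducible.

```python
def k_means(points, initial_centroids, max_iter=100):
    """Return (centroids, assignment) after running k-means to convergence."""
    centroids = [tuple(c) for c in initial_centroids]
    k = len(centroids)
    assignment = []
    for _ in range(max_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        new_assignment = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        if new_assignment == assignment:
            break  # no point changed cluster, so the algorithm has converged
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, assignment


points = [  # hypothetical 2-D measurements forming two groups
    (1.0, 1.0), (5.0, 5.0), (1.2, 0.8), (5.2, 4.9), (0.9, 1.1), (4.8, 5.1),
]
centroids, assignment = k_means(points, initial_centroids=points[:2])
print(assignment)  # [0, 1, 0, 1, 0, 1]
```

Because the training set itself is clustered, every point receives a cluster label, which corresponds to the option selected in the step above.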
Step 5
Step 6
LET'S BEGIN!
First we have to load the dataset
Discretization