
Topic: CART (Classification and Regression Trees) and ID3

In [22]:

import pandas as pd
import numpy as np

# Tennis dataset (all categorical) for the classification trees
df1 = pd.read_csv('/home/c0nqu3r0r/Desktop/_Second sem/Data Mining/Dataset/Te
df1.drop('Day', axis=1, inplace=True)  # Day is only a row label, not a feature

# Position/Level/Salary dataset for the regression tree
df2 = pd.read_csv('/home/c0nqu3r0r/Desktop/_Second sem/Data Mining/Dataset/ar

In [23]:

df1.head()

Out[23]:

    Outlook Temperature Humidity    Wind PlayTennis
0     Sunny         Hot     High    Weak         No
1     Sunny         Hot     High  Strong         No
2  Overcast         Hot     High    Weak        Yes
3      Rain        Mild     High    Weak        Yes
4      Rain        Cool   Normal    Weak        Yes

In [24]:

from sklearn.preprocessing import LabelEncoder

Le = LabelEncoder()

# encode each categorical column as integer codes (labels sorted alphabetically)
df1['Outlook'] = Le.fit_transform(df1['Outlook'])
df1['Temperature'] = Le.fit_transform(df1['Temperature'])
df1['Humidity'] = Le.fit_transform(df1['Humidity'])
df1['Wind'] = Le.fit_transform(df1['Wind'])
df1['PlayTennis'] = Le.fit_transform(df1['PlayTennis'])
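
Reusing a single LabelEncoder works because fit_transform refits on every call, but it means only the last column's mapping survives in Le.classes_. If the per-column mappings are needed later (for example to decode predictions back to labels), one encoder can be kept per column. A minimal sketch of that variant of the cell above, not part of the original notebook:

encoders = {col: LabelEncoder() for col in df1.columns}  # hypothetical: one encoder per column
for col, enc in encoders.items():
    df1[col] = enc.fit_transform(df1[col])

print(encoders['Outlook'].classes_)  # e.g. ['Overcast' 'Rain' 'Sunny'] -> codes 0, 1, 2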

In [25]:

y1 = df1['PlayTennis']                 # target
x1 = df1.drop(['PlayTennis'], axis=1)  # features: Outlook, Temperature, Humidity, Wind


In [26]:

# CART classification tree: binary splits chosen by Gini impurity
from sklearn import tree

clf1 = tree.DecisionTreeClassifier(criterion='gini')
clf1 = clf1.fit(x1, y1)
tree.plot_tree(clf1)

Out[26]:

[Text(0.4444444444444444, 0.9, 'X[0] <= 0.5\ngini = 0.459\nsamples = 14\nvalue = [5, 9]'),
 Text(0.3333333333333333, 0.7, 'gini = 0.0\nsamples = 4\nvalue = [0, 4]'),
 Text(0.5555555555555556, 0.7, 'X[2] <= 0.5\ngini = 0.5\nsamples = 10\nvalue = [5, 5]'),
 Text(0.3333333333333333, 0.5, 'X[0] <= 1.5\ngini = 0.32\nsamples = 5\nvalue = [4, 1]'),
 Text(0.2222222222222222, 0.3, 'X[3] <= 0.5\ngini = 0.5\nsamples = 2\nvalue = [1, 1]'),
 Text(0.1111111111111111, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.3333333333333333, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.4444444444444444, 0.3, 'gini = 0.0\nsamples = 3\nvalue = [3, 0]'),
 Text(0.7777777777777778, 0.5, 'X[3] <= 0.5\ngini = 0.32\nsamples = 5\nvalue = [1, 4]'),
 Text(0.6666666666666666, 0.3, 'X[0] <= 1.5\ngini = 0.5\nsamples = 2\nvalue = [1, 1]'),
 Text(0.5555555555555556, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.7777777777777778, 0.1, 'gini = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.8888888888888888, 0.3, 'gini = 0.0\nsamples = 3\nvalue = [0, 3]')]
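
plot_tree labels nodes X[0]…X[3] by column position (Outlook, Temperature, Humidity, Wind). A hedged sketch of the same plot with readable labels, plus a prediction for one hand-encoded sample; the class names and codes assume LabelEncoder's alphabetical ordering (No -> 0, Yes -> 1):

tree.plot_tree(clf1,
               feature_names=list(x1.columns),  # Outlook, Temperature, Humidity, Wind
               class_names=['No', 'Yes'],       # assumes No -> 0, Yes -> 1
               filled=True)

# Sunny/Hot/High/Weak encoded as [2, 1, 0, 1] (codes assumed from alphabetical encoding)
sample = pd.DataFrame([[2, 1, 0, 1]], columns=x1.columns)
print(clf1.predict(sample))  # expected [0], i.e. 'No', matching row 0 of df1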


In [27]:

df2.head()

Out[27]:

            Position  Level  Salary
0   Business Analyst      1   45000
1  Junior Consultant      2   50000
2  Senior Consultant      3   60000
3            Manager      4   80000
4    Country Manager      5  110000

In [28]:

x2 = df2.iloc[:, 1:2].values  # Level, sliced as a 2-D (n, 1) feature array
y2 = df2.iloc[:, 2].values    # Salary, the regression target
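
iloc[:, 1:2] uses a slice so x2 stays two-dimensional, which is the (n_samples, n_features) shape sklearn estimators expect, while iloc[:, 2] returns a flat vector, which is fine for the target. A quick check, not in the original notebook:

print(x2.shape, y2.shape)  # expected: (10, 1) (10,)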


In [29]:

# CART regression tree: splits chosen to minimise squared error in each node
from sklearn import tree

clf2 = tree.DecisionTreeRegressor()
clf2 = clf2.fit(x2, y2)
tree.plot_tree(clf2)

Out[29]:

[Text(0.703125, 0.9285714285714286, 'X[0] <= 8.5\nsquared_error = 80662250000.0\nsamples = 10\nvalue = 249500.0'),
 Text(0.53125, 0.7857142857142857, 'X[0] <= 6.5\nsquared_error = 6921484375.0\nsamples = 8\nvalue = 124375.0'),
 Text(0.375, 0.6428571428571429, 'X[0] <= 4.5\nsquared_error = 1381250000.0\nsamples = 6\nvalue = 82500.0'),
 Text(0.25, 0.5, 'X[0] <= 3.5\nsquared_error = 179687500.0\nsamples = 4\nvalue = 58750.0'),
 Text(0.1875, 0.35714285714285715, 'X[0] <= 2.5\nsquared_error = 38888888.889\nsamples = 3\nvalue = 51666.667'),
 Text(0.125, 0.21428571428571427, 'X[0] <= 1.5\nsquared_error = 6250000.0\nsamples = 2\nvalue = 47500.0'),
 Text(0.0625, 0.07142857142857142, 'squared_error = 0.0\nsamples = 1\nvalue = 45000.0'),
 Text(0.1875, 0.07142857142857142, 'squared_error = 0.0\nsamples = 1\nvalue = 50000.0'),
 Text(0.25, 0.21428571428571427, 'squared_error = 0.0\nsamples = 1\nvalue = 60000.0'),
 Text(0.3125, 0.35714285714285715, 'squared_error = 0.0\nsamples = 1\nvalue = 80000.0'),
 Text(0.5, 0.5, 'X[0] <= 5.5\nsquared_error = 400000000.0\nsamples = 2\nvalue = 130000.0'),
 Text(0.4375, 0.35714285714285715, 'squared_error = 0.0\nsamples = 1\nvalue = 110000.0'),
 Text(0.5625, 0.35714285714285715, 'squared_error = 0.0\nsamples = 1\nvalue = 150000.0'),
 Text(0.6875, 0.6428571428571429, 'X[0] <= 7.5\nsquared_error = 2500000000.0\nsamples = 2\nvalue = 250000.0'),
 Text(0.625, 0.5, 'squared_error = 0.0\nsamples = 1\nvalue = 200000.0'),
 Text(0.75, 0.5, 'squared_error = 0.0\nsamples = 1\nvalue = 300000.0'),
 Text(0.875, 0.7857142857142857, 'X[0] <= 9.5\nsquared_error = 62500000000.0\nsamples = 2\nvalue = 750000.0'),
 Text(0.8125, 0.6428571428571429, 'squared_error = 0.0\nsamples = 1\nvalue = 500000.0'),
 Text(0.9375, 0.6428571428571429, 'squared_error = 0.0\nsamples = 1\nvalue = 1000000.0')]
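
Every leaf above holds a single sample, so the regressor simply returns the salary of whichever Level bucket a query falls into. A quick hedged check: a level of 6.0 satisfies X[0] <= 8.5 and X[0] <= 6.5 but fails X[0] <= 4.5 and X[0] <= 5.5, so it should land in the 150000.0 leaf:

print(clf2.predict([[6.0]]))  # expected: [150000.]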


In [30]:

# ID3-style tree: entropy / information gain as the split criterion
from sklearn import tree

clf3 = tree.DecisionTreeClassifier(criterion='entropy')
clf3 = clf3.fit(x1, y1)
tree.plot_tree(clf3)

Out[30]:

[Text(0.4444444444444444, 0.9, 'X[0] <= 0.5\nentropy = 0.94\nsamples = 14\nvalue = [5, 9]'),
 Text(0.3333333333333333, 0.7, 'entropy = 0.0\nsamples = 4\nvalue = [0, 4]'),
 Text(0.5555555555555556, 0.7, 'X[2] <= 0.5\nentropy = 1.0\nsamples = 10\nvalue = [5, 5]'),
 Text(0.3333333333333333, 0.5, 'X[0] <= 1.5\nentropy = 0.722\nsamples = 5\nvalue = [4, 1]'),
 Text(0.2222222222222222, 0.3, 'X[3] <= 0.5\nentropy = 1.0\nsamples = 2\nvalue = [1, 1]'),
 Text(0.1111111111111111, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.3333333333333333, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.4444444444444444, 0.3, 'entropy = 0.0\nsamples = 3\nvalue = [3, 0]'),
 Text(0.7777777777777778, 0.5, 'X[3] <= 0.5\nentropy = 0.722\nsamples = 5\nvalue = [1, 4]'),
 Text(0.6666666666666666, 0.3, 'X[0] <= 1.5\nentropy = 1.0\nsamples = 2\nvalue = [1, 1]'),
 Text(0.5555555555555556, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [1, 0]'),
 Text(0.7777777777777778, 0.1, 'entropy = 0.0\nsamples = 1\nvalue = [0, 1]'),
 Text(0.8888888888888888, 0.3, 'entropy = 0.0\nsamples = 3\nvalue = [0, 3]')]
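
Strictly, criterion='entropy' only swaps the impurity measure: scikit-learn still grows binary CART-style trees, whereas classical ID3 makes one multiway split per categorical attribute, chosen by information gain. A small sketch of those two quantities using the standard definitions (not from the notebook):

import numpy as np

def entropy(labels):
    # H(S) = -sum_i p_i * log2(p_i) over the class proportions
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -(p * np.log2(p)).sum()

def info_gain(feature, labels):
    # gain = H(S) - sum_v |S_v|/|S| * H(S_v), one term per feature value
    gain = entropy(labels)
    for v in np.unique(feature):
        mask = feature == v
        gain -= mask.mean() * entropy(labels[mask])
    return gain

print(round(entropy(y1), 3))                   # 0.94, the root entropy in the plot above
print(round(info_gain(x1['Outlook'], y1), 3))  # ~0.247, why Outlook is the first split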
