0% found this document useful (0 votes)

13 views8 pages

Content Beyond Syllabus and Case Based Program

The document outlines a program demonstrating the ID3 decision tree algorithm using a dataset related to used car price prediction. It includes code for importing data, splitting datasets, training models using Gini index and entropy, making predictions, and calculating accuracy. Additionally, it discusses the growth of India's used car market and the objectives of creating a predictive model for determining used car prices.

Uploaded by

rohanjavarker0305

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Topics covered

Used Car Market,
Model Training,
Revenue Optimization,
Data Insights,
Data Dictionary,
Model Optimization,
Car Features,
Pandas Library,
Model Evaluation,
Classification Report

0% found this document useful (0 votes)

13 views8 pages

Content Beyond Syllabus and Case Based Program

Uploaded by

rohanjavarker0305

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Topics covered

Used Car Market,
Model Training,
Revenue Optimization,
Data Insights,
Data Dictionary,
Model Optimization,
Car Features,
Pandas Library,
Model Evaluation,
Classification Report

PSAGAR INSTITUTE OF RESEARCH & TECHNOLOGY

DEPARTMENT OF COMPUTERSCIENCE &ENGINEERING

Content Beyond Syllabus

Program l: Write a programto demonstrate the working of the decision tree-based ID3
algorithm.

Importing the required packages

import numpy as np
import pandas as pd
from [Link] import confusion_matrix

#rom sklearn.cross_validation import train_test_ split

from [Link] import DecisionTreeClassifier

from [Link] import accuracy_sCore

from [Link] import classification _report
#Function importing Dataset
def importdata():
balance_data = [Link] csv('[Link]

'databases/balance-scale/[Link],sep="", header =None)

# Printing the dataswet shape
print ("Dataset Length: ", len(balance_data))

print ("Dataset Shape:", balance_data.shape)

# Printing the dataset obseravtions

print ("Dataset: ",balance_data.head()

return balance data

# Function tosplit the dataset

def splitdataset(balance_data):
# Separating the target variable
X= balance [Link][:, 1:5]
Y= balance [Link][:, 0]
#Splitting the dataset into train and test
A_rain, X_test, y train, y_test =train_test_split(
OF RESEARCH& TECHNOLOGY
SAGAR INSTITUTE COMPUTER SCIENCE & ENGINEERING
DEPARTMENT OF

X,Y, test_size = 0.3, random_state = 100)

return X, Y, X_train, X_test, y_train, y_test

#Function to perform training with ginilndex.

def train_using_gini(X_train, X_test, y_train);

# Creating the classifier object

clf gini =DecisionTreeClassifier(criterion = "gini",

_samples_leaf-5)
random_state = 100, max_depth=3, min

# Performing training
cIf [Link](X_train, y_train)
return clf gini
#Function to perform training with entropy.

def tarin_using_entropy(X_train,X_test, y_train):

#Decision tree with entropy

clf entropy= DecisionTreeClassifier

criterion ="entropy", random_state =100,

max_depth =3, min_samples_leaf =5)

# Performing training
clf_entropy.fit(X_train, y_train)

return clf entropy

# Function to make predictions

def prediction(X_test, clf_object):

# Predicton on test with ginilndex

Y_pred = clf_object.predict(X_test)

print("Predicted values: ")

print(ypred)
return y pred
# Function to calculate accuracy

def cal_accuracy(y_test, y_pred):

print("Confusion Matrix: ",

confusion_ matrix(y_test, y_pred))

TECHNOLOGY
OF RESEARCH &
PSAGAR INSTITUTE COMPUTER SCIENCE & ENGINEERING
DEPARTMENT OF

accuracy_score(y_test,y_pred)°100)
print ("Accuracy : ",
print("Report: ",classification_report(y_test, y_pred))

def main():

# Building Phase

data = importdata()

X, Y, X_train, X_test, y_train, y_ test = splitdataset(data)

clf gini= train_using_gini(X_train, X_test, y_train)
clf_entropy = tarin_using_entropy(X_train, X_test, y_train)
#Operational Phase
print("Results Using Gini Index:")

# Prediction using gini

Y_pred gini= prediction(X_test, clf gini)

cal_accuracy(y_test, y_pred gini)
print("Results Using Entropy:")

#Prediction using entropy

Y_pred_entropy = prediction(X_test,cIf_entropy)

cal_accuracy(y_test, y_pred_entropy)
# Calling main function

if_name ==" main_":

main()
SAGAR INSTITUTE OF RESEARCH & TECHNOLOGY
DEPARTMENT OF COMPUTERSCIENCE & ENGINEERING

OUTPUT:

Dataset Length: 625

Dataset Shape: (625, 5)

Dataset:01234

0B11 11

1R111 2
2R1113

3R1114

4R1115
SAGAR INSTITUTE OFRESEARCH AND TECHNOLOGY,
BHOPAL
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CASE STUDY BASEDQUESTION

TITLE:USED CARS PRICE PREDICTION

Riding the digital wave, India's used car market is set to grow at a compounded
[Link] of 11% and is likely to touch sales of up to 8.3 million units by FY26
as more people have been opting for pre - owned cars for personal mobility in the
pandemic amid the
ongoing supply shorages for manufacturing new cars.
The used car market in the country is expected to reach over 70 lakh vehicles by
2025- 26,up from 38 lakh in 2020 - 21 as the
Covid -19 pandemic, digitalization , changing demographics and aspirations, first -
time buyers and availability of financing options are acting as growth drivers,
according to a report by OLX Autos and rating agency Crisil.
"MyCars" is a new - age startup laying foundations in the settin
gup a car resell domain and they are setting up a team of ML experts to make
predictive models determine the price of
second - hand cars to optimize their revenue, you have joined as a new Data
Scientist and your role is to create a model to determine the selling price of a used
car.

Objective:
"Provide the best-performing model to determine the price of the used car.
"Providing the most important features which determine the price
Data Description
The data provided consists of the following Data Dictionary
"ld: Unique ID assigned to a specific car.
"year: Manufacture year of the car.
"brand: Brand of the car.
ofull model name: Model name includes other details such as engine
capacity, transmission,etc., basically a detailed model name.
emodel name: Just the model name of the car.
"price: Sellprice of the 2nd ownership car.
sdistance travelled(km): Distance traveled by car.
ueltype: Fuel engine type.
city: City where the car is registered.
car age: Age of
the ca
Td year brand full_modelnane odel nune price distonce_travelled(kas) fuel_type cAty brandantk ca y
0 2010 Hondn Monda Brlo 0 MT BO80.0 Petrct Mumtel

1 2012 Nissan Nissan Sunny XV Diesel

Press Fse toeot fhull sCreen
unny 119120.0 Dlesel Munbal 9.0

2 2017 Toyota Tbyota Fortuner 2.D 4x MT (2010-2020| Foatuner 2050000.o 4503.0 Dleses Thane

3 2017 Mercedes Benz Meredes-Denz [Link] E220d Expression (2019-.. EClass 419000.0 2U000.0 Dleset Mumbal 49

4 2012 Hyundal Hyundal Vema Fluldic 16 CRDI SX Verna 475000.0 23800,0 Dlescl Mumbal 9.0

1720 1720 2015 Hyundal Eon Era Eon 290000.0 28000.0 Pettol PUne 60
Hyundal
1721 1721 2011 Benticy Continental Flyng Spur W12 Contnental 7600000.0 30000.0 Petral Pune 10.9
Bentley
1722 1722 2008 Mahindra-Renaut Mahindra-Renault Logan DLE 1.5dc Logan 185000.0 142522.0 De Pune 24 130

1723 1723 1080 Mahindra Mahindra Jeep CJ G00D Jeep 326000.0 18681.0 Dlesel Pune 31.0

1724 1724 2017 Hyundal Hyundal Creta SX Pus 1.6 AT CRDI Creta 1395000.0 31028.0 Pune

1725 rOwax 11 columns

SAGARINSTITUTE OF RESEARCH AND TECHNOLOGY,
BHOPAL
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
<class [Link]'>
RangeIndex: 1725 entries, e to 1724
Data columns (total 11 columns):
Column Non-Null Count Dtype

Id 1725 non-null int64

1 year 1725 non-null int64
2 brand 1725 non-null object
3 full model_name 1725 non-null object
model name 1725 non-null object
5 price 1725 non-null float64
6 distance travelled(kms) 1725 non-null float64
fuel_type 1725 non-null object
city 1725 non - null object
brand rank 1725 non-null int64
10 car age 1725 non-null float64
dtypes: float64(3), int64(3), object(5)
memory usage: 148.4+ KB

[Link]()
Id year price distance travelled (Kms) brand rank car age
cOunt 1725.000000 1725.0000001.725000e+03 1725.0000001725.0000001725.000000

mean 862.000000 2015.390725 1.494837e+06 53848.256232 15.731014 5.609275

std 498.108924 3.207504 1,671658e+06 4725.54196312951122 3207504

min 0.000000 1990.000000 6.250000e+04 350.000000 1.000000 0.000000

25% 431.000000 2013.000000 5.450000e+05 29000.000000 5.000000 3.000000

50% 862,000000 2016.000000 8.750000e+05 49000.000000 14.000000 5.000000

75% 1293.000000 2018.000000 1.825000er06 7O500.000000 24.000000 a.000000

max 1724.000000 2021.000000 1.470000e+07 780000.000000 81.000000 31.000000

SAGAR INSTITUTEOF RESEARCH AND TECHNOLOGY,
BHOPAL
ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE AND
year travelled(kms)
price distance brand rank car_age
Id
0.100282 .022191 0.054391
ld 1,000000 0.054391 -0.105696
-0.386107 0.134275 1.000000
year -0.054391 1.000000 0.288483
-0.137351 -0.164591-0.288483
-0.105696 0.288483 1.000000
price
0.111406 0.386107
1.000000
distance travelled(kms) 0.100282 -0.386107 -0.137351
-0.111406 1.000000 -0.134275
0.022191 0.134275 -0.164591
brand_rank 0.134275 1.000000
0.386107
0.054391-1.000000 0.288483
car_ age

[Link]().sum()

Id
year
brand
full model name
model name
price
distance travelled(kms)
fuel_type
city
brand rank
car_age
dtype: int64

[Link]()

Car Price Prediction Using Distance Data
No ratings yet
Car Price Prediction Using Distance Data
15 pages
Car Price Prediction Using Python AI
No ratings yet
Car Price Prediction Using Python AI
32 pages
Car Price Prediction Project
No ratings yet
Car Price Prediction Project
34 pages
Car Price Prediction Using ML Techniques
33% (3)
Car Price Prediction Using ML Techniques
15 pages
Used Car Price Prediction Model
No ratings yet
Used Car Price Prediction Model
4 pages
Project Documentation
No ratings yet
Project Documentation
1 page
SVM Guide for Data Science Enthusiasts
100% (1)
SVM Guide for Data Science Enthusiasts
28 pages
Class Participation
No ratings yet
Class Participation
9 pages
Used Car Price Prediction Analysis
No ratings yet
Used Car Price Prediction Analysis
23 pages
Web App Code
No ratings yet
Web App Code
5 pages
Finalll - Ipynb - Colab
No ratings yet
Finalll - Ipynb - Colab
11 pages
Car Price Prediction
No ratings yet
Car Price Prediction
18 pages
Used Car Price Prediction Models Guide
No ratings yet
Used Car Price Prediction Models Guide
6 pages
Used Car Price Prediction Model Report
No ratings yet
Used Car Price Prediction Model Report
10 pages
AI Lab 8
No ratings yet
AI Lab 8
12 pages
Car Price Model Prediction
No ratings yet
Car Price Model Prediction
9 pages
Data Mining with Classification Methods
No ratings yet
Data Mining with Classification Methods
20 pages
Ajay and Saurabh
No ratings yet
Ajay and Saurabh
16 pages
Mini Project New
No ratings yet
Mini Project New
25 pages
Data Analytics Research Paper
No ratings yet
Data Analytics Research Paper
3 pages
Linear Regression
No ratings yet
Linear Regression
4 pages
Updated Used Cars Price Prediction Using Machine Learning
No ratings yet
Updated Used Cars Price Prediction Using Machine Learning
24 pages
Linear Regression
100% (1)
Linear Regression
16 pages
Task 3 Car Price Prediction Using Machine Learning
No ratings yet
Task 3 Car Price Prediction Using Machine Learning
30 pages
Used Car Price Prediction with KNN Model
100% (1)
Used Car Price Prediction with KNN Model
4 pages
Pre-owned Car Price Prediction Model
No ratings yet
Pre-owned Car Price Prediction Model
26 pages
Used Car Price Prediction Model
No ratings yet
Used Car Price Prediction Model
6 pages
Machine Learning-Based Models For Accurate Car Pri
No ratings yet
Machine Learning-Based Models For Accurate Car Pri
6 pages
Car Price Prediction Analysis Report
No ratings yet
Car Price Prediction Analysis Report
10 pages
Prediction of The Price of Used Cars Based On Mach
No ratings yet
Prediction of The Price of Used Cars Based On Mach
7 pages
Car Price Prediction with Machine Learning
No ratings yet
Car Price Prediction with Machine Learning
10 pages
Electric Vehicle Range Analysis Report
No ratings yet
Electric Vehicle Range Analysis Report
37 pages
Predicting Used Car Prices in India
No ratings yet
Predicting Used Car Prices in India
15 pages
JETIR2204201
No ratings yet
JETIR2204201
7 pages
PPSD 1743674861
No ratings yet
PPSD 1743674861
3 pages
Used Car Price Prediction Model
No ratings yet
Used Car Price Prediction Model
26 pages
Car Price Prediction with Machine Learning
No ratings yet
Car Price Prediction with Machine Learning
6 pages
Car Data Analysis with Pandas
No ratings yet
Car Data Analysis with Pandas
15 pages
Car Mileage Prediction with AI Model
No ratings yet
Car Mileage Prediction with AI Model
17 pages
Machine Learning Project 1690186790
No ratings yet
Machine Learning Project 1690186790
18 pages
Import As Import As
No ratings yet
Import As Import As
18 pages
Car Price Prediction Model Analysis
No ratings yet
Car Price Prediction Model Analysis
8 pages
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
No ratings yet
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
5 pages
ML Lab 2024-26 Final
No ratings yet
ML Lab 2024-26 Final
46 pages
Report
No ratings yet
Report
4 pages
Python Programs for Data Manipulation
No ratings yet
Python Programs for Data Manipulation
22 pages
Used Car Price Prediction Model
No ratings yet
Used Car Price Prediction Model
3 pages
Car Price Prediction
No ratings yet
Car Price Prediction
21 pages
Linear Regression Price Prediction Guide
No ratings yet
Linear Regression Price Prediction Guide
5 pages
Car Price
No ratings yet
Car Price
6 pages
Data Science & ML Internship Project
No ratings yet
Data Science & ML Internship Project
14 pages
Linear Regression on Used Car Prices
No ratings yet
Linear Regression on Used Car Prices
3 pages
Used Car Price Prediction Project Report
No ratings yet
Used Car Price Prediction Project Report
10 pages
DS On MTCARS Solutions
No ratings yet
DS On MTCARS Solutions
3 pages
City-Cycle MPG Prediction Data Analysis
No ratings yet
City-Cycle MPG Prediction Data Analysis
23 pages
Car Data Analysis Project Overview
No ratings yet
Car Data Analysis Project Overview
19 pages
Answers CIS
No ratings yet
Answers CIS
19 pages
PM Protected
No ratings yet
PM Protected
2 pages
CN Assignment
No ratings yet
CN Assignment
13 pages
SBI Clerk Eligibility Criteria 2019 PDF
No ratings yet
SBI Clerk Eligibility Criteria 2019 PDF
2 pages
Understanding-Manipulative-Information - PDF 20250923 214918 0000
No ratings yet
Understanding-Manipulative-Information - PDF 20250923 214918 0000
13 pages
Paul Franco - Becoming Who You Are - Nietzsche On Self-Creation
No ratings yet
Paul Franco - Becoming Who You Are - Nietzsche On Self-Creation
27 pages
11 Essential KPIs for Innovation Success
No ratings yet
11 Essential KPIs for Innovation Success
7 pages
Halloween Thesis Statement Writing Guide
100% (3)
Halloween Thesis Statement Writing Guide
7 pages
North East Centre For Agricultural Biotechnology Office of The DBT - Necab, Aau:: Jorhat-13
No ratings yet
North East Centre For Agricultural Biotechnology Office of The DBT - Necab, Aau:: Jorhat-13
2 pages
FS1 Le4 Johnlloyd Delarosa
No ratings yet
FS1 Le4 Johnlloyd Delarosa
13 pages
Bioanalysis Lab Transformation Guide
No ratings yet
Bioanalysis Lab Transformation Guide
12 pages
Engineering Mathematics Model Paper 21MAT31
No ratings yet
Engineering Mathematics Model Paper 21MAT31
6 pages
AQA English Language Paper 2 2026 Section a Scheme of Work Document Edit
No ratings yet
AQA English Language Paper 2 2026 Section a Scheme of Work Document Edit
36 pages
A Protocol For Systematic Review OF CGIAR'S RESEARCH (2012-2023) ON Climate-Induced Drought and Heat Stress
No ratings yet
A Protocol For Systematic Review OF CGIAR'S RESEARCH (2012-2023) ON Climate-Induced Drought and Heat Stress
20 pages
Science: Quarter 1 - Module 4: Earth's Mechanism
100% (1)
Science: Quarter 1 - Module 4: Earth's Mechanism
24 pages
E20 Calculations Chapter3
No ratings yet
E20 Calculations Chapter3
29 pages
The Speed of A Projectile When It Is at Its Greatest Height Is
No ratings yet
The Speed of A Projectile When It Is at Its Greatest Height Is
2 pages
Lab Reports Mechanics, Heat & Vibrations
No ratings yet
Lab Reports Mechanics, Heat & Vibrations
79 pages
Abdolmaleki Et Al. - 2018 - Maximum A Posteriori Policy Optimisation
No ratings yet
Abdolmaleki Et Al. - 2018 - Maximum A Posteriori Policy Optimisation
23 pages
Academic Integrity in Classes at MDC
No ratings yet
Academic Integrity in Classes at MDC
9 pages
Instituicaoirena Renewable Energy and Jobs 2024 748
No ratings yet
Instituicaoirena Renewable Energy and Jobs 2024 748
88 pages
OB Models for BBA Students
No ratings yet
OB Models for BBA Students
20 pages
Unit Ii - QB
No ratings yet
Unit Ii - QB
2 pages
Dream Interpretation: Acrylic Nails
No ratings yet
Dream Interpretation: Acrylic Nails
1 page
Nikola Tesla: Genius and Innovator
No ratings yet
Nikola Tesla: Genius and Innovator
4 pages
SSC Special 2020 Results
No ratings yet
SSC Special 2020 Results
60 pages
PSC1501 Assignment 4
No ratings yet
PSC1501 Assignment 4
5 pages
Mastering Non-Verbal Communication
No ratings yet
Mastering Non-Verbal Communication
4 pages
Diploma Pharmacy Exam June 2009
No ratings yet
Diploma Pharmacy Exam June 2009
1 page
English Half-Yearly Exam Class XI 2023-24
No ratings yet
English Half-Yearly Exam Class XI 2023-24
8 pages
What Is Installation Art?
No ratings yet
What Is Installation Art?
17 pages
C6+ Hydrocarbons and Hydrocarbon Dewpoint
No ratings yet
C6+ Hydrocarbons and Hydrocarbon Dewpoint
6 pages
Class XII Painting Theory Marking Scheme
No ratings yet
Class XII Painting Theory Marking Scheme
4 pages
Oil Industry Dissertation Writing Help
100% (2)
Oil Industry Dissertation Writing Help
5 pages

Content Beyond Syllabus and Case Based Program

Uploaded by

Content Beyond Syllabus and Case Based Program

Uploaded by

PSAGAR INSTITUTE OF RESEARCH & TECHNOLOGY

DEPARTMENT OF COMPUTERSCIENCE &ENGINEERING

Content Beyond Syllabus

Importing the required packages

#rom sklearn.cross_validation import train_test_ split

from [Link] import accuracy_sCore

'databases/balance-scale/[Link],sep="", header =None)

print ("Dataset Shape:", balance_data.shape)

# Printing the dataset obseravtions

print ("Dataset: ",balance_data.head()

return balance data

# Function tosplit the dataset

X,Y, test_size = 0.3, random_state = 100)

return X, Y, X_train, X_test, y_train, y_test

def train_using_gini(X_train, X_test, y_train);

clf gini =DecisionTreeClassifier(criterion = "gini",

def tarin_using_entropy(X_train,X_test, y_train):

#Decision tree with entropy

clf entropy= DecisionTreeClassifier

max_depth =3, min_samples_leaf =5)

return clf entropy

# Function to make predictions

# Predicton on test with ginilndex

print("Predicted values: ")

def cal_accuracy(y_test, y_pred):

confusion_ matrix(y_test, y_pred))

X, Y, X_train, X_test, y_train, y_ test = splitdataset(data)

# Prediction using gini

Y_pred gini= prediction(X_test, clf gini)

#Prediction using entropy

if_name ==" main_":

Dataset Length: 625

Dataset Shape: (625, 5)

CASE STUDY BASEDQUESTION

TITLE:USED CARS PRICE PREDICTION

1 2012 Nissan Nissan Sunny XV Diesel

1725 rOwax 11 columns

Id 1725 non-null int64

mean 862.000000 2015.390725 1.494837e+06 53848.256232 15.731014 5.609275

std 498.108924 3.207504 1,671658e+06 4725.54196312951122 3207504

25% 431.000000 2013.000000 5.450000e+05 29000.000000 5.000000 3.000000

50% 862,000000 2016.000000 8.750000e+05 49000.000000 14.000000 5.000000

max 1724.000000 2021.000000 1.470000e+07 780000.000000 81.000000 31.000000

You might also like