E4 DS203 2023 Sem2

Uploaded by

sparee1256

0% found this document useful (0 votes)

3 views2 pages

Original Title

E4-DS203-2023-Sem2

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

3 views2 pages

E4 DS203 2023 Sem2

Uploaded by

sparee1256

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Exercise – 4: DS203-2023-Sem2

Submissions due by: Feb 19, 2024, 11:55pm

This exercise is aimed at:

• Getting introduced to and running various Classification algorithms on a given data set and understanding
their relative characteristics, performance, and advantages.
• Calculating, effectively documenting, and understanding various Classification metrics and developing an
approach towards effectively using them.
• Creating and consolidating multiple plots with the aim to compare and contrast the results of the algorithms
• Get introduced to the relevant functions of the Python library: sklearn
• Using LLM tools like ChatGPT to generate code

In this Exercise you will be processing datasets:

• Clusters-4-v0.csv
• Clusters-4-v1.csv
• Clusters-4-v2.csv

Processing steps:

1. Divide the data set into ‘train’ and ‘test’ datasets, once, and use them for all subsequent processing.
2. Review the data using appropriate plots and understand the overall structure of the data.
3. Use the following algorithms / variants to process the datasets:
a. Logistic Regression
b. SVC – with ‘linear’ kernel (what is ‘linear’?)
c. SVC – with ‘rbf’ kernel (what is ‘rbf’?)
d. Random Forest Classifier – with min_samples_leaf=1
e. Random Forest Classifier – with min_samples_leaf=3
f. Random Forest Classifier – with min_samples_leaf=5
g. Neural Network Classifier – with hidden_layer_sizes=(5)
h. Neural Network Classifier – with hidden_layer_sizes=(5,5)
i. Neural Network Classifier – with hidden_layer_sizes=(5,5,5)
j. Neural Network Classifier – with hidden_layer_sizes=(10)
4. Generate, capture, and save into a csv file, the following metrics in (for train and test data, for all the
algorithms): For example, see the following image:
• Accuracy, Precision (per class), Precision (average), Recall (per class), Recall (average), F1-score (per
class), F1-score (average), AUC (per class), AUC (average).
• (Hint: The following functions may be used: accuracy_score, precision_score, recall_score, f1_score,
roc_auc_score, roc_curve)
5. Generate plots like the ones below - to understand the classification boundaries and the overall performance
of the classifiers:

6. As in the case of E3, compare the metrics within and across the datasets (train and test) and algorithms. In
addition to the variations in metrics, include in your report aspects related to classification boundaries,
overfitting, etc.
7. List your major learnings from this exercise.
8. Your submissions should be as follows:
a. A PDF document with all the above analyses and comments. Ensure that you include the required
figures and Tables (ie. metrics data) in your report, along with the explanations and analysis.
b. A functional Python source file (.py file) as part of your submission. Please DO NOT upload Jupyter
Notebooks – they get bulky!
c. Name of the PDF should be E4-your-roll-number.pdf and the name of the Python source file should
be E4-your-roll-number.py
d. Upload the PDF and the source file to the assignment submission point E4.

oooOOOooo

Ian Talks Python A-Z
From Everand
Ian Talks Python A-Z
Ian Eress
No ratings yet
Machine Learning with Python: A Comprehensive Guide with a Practical Example
From Everand
Machine Learning with Python: A Comprehensive Guide with a Practical Example
MARTIN NEEL
No ratings yet
Smai A1 PDF
Document3 pages
Smai A1 PDF
Zubair Ahmed
No ratings yet
ML Lab Manual
Document38 pages
ML Lab Manual
Rahul
No ratings yet
SNJB’s Computer Lab VII Instructor Manual
Document19 pages
SNJB’s Computer Lab VII Instructor Manual
viswanath kani
No ratings yet
ML Lab 04 Manual - Pandas and MatplotLib
Document7 pages
ML Lab 04 Manual - Pandas and MatplotLib
dodela6303
No ratings yet
PDS Exp 4 To 6
Document9 pages
PDS Exp 4 To 6
X
No ratings yet
Experiment 1
Document17 pages
Experiment 1
Aksauce Succed
No ratings yet
Individual Assignment 2
Document4 pages
Individual Assignment 2
jemal yahyaa
No ratings yet
Important Questions
Document4 pages
Important Questions
Adilrabia rsl
No ratings yet
Data Science & Big Data - Practical
Document7 pages
Data Science & Big Data - Practical
RAKESH G
No ratings yet
CS 475/675 Machine Learning: Homework 3 Visualizations
Document8 pages
CS 475/675 Machine Learning: Homework 3 Visualizations
Ali Zain
No ratings yet
Data Mining and Warehousing Lab Syllabus
Document4 pages
Data Mining and Warehousing Lab Syllabus
PhamThi Thiet
No ratings yet
Data Science course outcomes and units
Document4 pages
Data Science course outcomes and units
Chaitanya 2003
No ratings yet
Ass3 v1
Document4 pages
Ass3 v1
Reeya Prakash
No ratings yet
Data Mining Assignment No 2
Document4 pages
Data Mining Assignment No 2
Nouman Rasheed
No ratings yet
ML Lab 09 Manual - Introduction To Scikit Learn
Document6 pages
ML Lab 09 Manual - Introduction To Scikit Learn
ALI HAIDER
No ratings yet
SCS16L
Document3 pages
SCS16L
sanjay mehra
No ratings yet
DS MachineLearningEngineerTechnicalChallenge3.1
Document1 page
DS MachineLearningEngineerTechnicalChallenge3.1
fercho120
No ratings yet
2021 ITS665 - ISP565 - GROUP PROJECT-revMac21
Document6 pages
2021 ITS665 - ISP565 - GROUP PROJECT-revMac21
Umairah Ibrahim
No ratings yet
Python For Data Science
Document22 pages
Python For Data Science
Mohit Malghade
No ratings yet
Project 1
Document4 pages
Project 1
aqsa yousaf
No ratings yet
External Lab and Skill 17-05-2022
Document10 pages
External Lab and Skill 17-05-2022
Vignesh Challa
No ratings yet
VTU Machine Learning Lab Manual - Implement 10 Algos
Document43 pages
VTU Machine Learning Lab Manual - Implement 10 Algos
vijay1985jan09
No ratings yet
18CSL76 Artificial Intelligence and Machine Learning Laboratory syllabus for CS
Document1 page
18CSL76 Artificial Intelligence and Machine Learning Laboratory syllabus for CS
Sanjay Kumar
No ratings yet
Lab-11 Random Forest
Document2 pages
Lab-11 Random Forest
KamranKhan
No ratings yet
Machine Learning Assignment: Iris Dataset, Parameter Settings, Experiments
Document1 page
Machine Learning Assignment: Iris Dataset, Parameter Settings, Experiments
creatively_1
No ratings yet
Develop employability skills through data science labs
Document7 pages
Develop employability skills through data science labs
anbhute3484
No ratings yet
Intro To Scikit Learning
Document18 pages
Intro To Scikit Learning
THIRUNEELAKANDAN
No ratings yet
Quantified Movement Analysis
Document5 pages
Quantified Movement Analysis
Bhoj Raj Bhatta
No ratings yet
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
Document3 pages
CSE 5311: Design and Analysis of Algorithms Programming Project Topics
Fgv
No ratings yet
CS5785 Homework 4: .PDF .Py .Ipynb
Document5 pages
CS5785 Homework 4: .PDF .Py .Ipynb
Al Tarino
No ratings yet
ELL784/AIP701: Assignment 3: Instructions
Document3 pages
ELL784/AIP701: Assignment 3: Instructions
lovlesh roy
No ratings yet
DWDM Final Lab Syllabus
Document2 pages
DWDM Final Lab Syllabus
saisimba99
No ratings yet
Solution Methodology
Document5 pages
Solution Methodology
Arnab Dey
No ratings yet
PDS Exp 1 To 3
Document17 pages
PDS Exp 1 To 3
X
No ratings yet
Infotec Ai 1000 Program-hcia-Ai Lab Guide
Document82 pages
Infotec Ai 1000 Program-hcia-Ai Lab Guide
micke juarez
No ratings yet
Lab1 BoW ImageClassification
Document3 pages
Lab1 BoW ImageClassification
Vikramaditya Tarai
No ratings yet
Assignment 3
Document2 pages
Assignment 3
sonu23144
No ratings yet
Chatgpt Advices
Document15 pages
Chatgpt Advices
fotini81729
No ratings yet
Viva Q-Ans
Document3 pages
Viva Q-Ans
Shivanshu Kumar Upadhyay
No ratings yet
Statistics CA 2023-24 Sem 2
Document4 pages
Statistics CA 2023-24 Sem 2
Priyanka V Galagali
No ratings yet
2023 Its665 - Isp565 - Group Project
Document6 pages
2023 Its665 - Isp565 - Group Project
2021826386
No ratings yet
ML Lab Manual 18csl76 1
Document54 pages
ML Lab Manual 18csl76 1
Kollipara Sai Sandeep
No ratings yet
Assignment 1.1: First 10 Rows Looks Like Below in Notepad++
Document6 pages
Assignment 1.1: First 10 Rows Looks Like Below in Notepad++
priyam
100% (1)
Assignment 2
Document3 pages
Assignment 2
vedantsimp
No ratings yet
Task WrittenAssignment DLMDSPWP01
Document5 pages
Task WrittenAssignment DLMDSPWP01
Rahul M S
No ratings yet
Data Structures and Algorithms
Document6 pages
Data Structures and Algorithms
suryaniraj79
No ratings yet
MMSegmentation
Document2 pages
MMSegmentation
Æhmëd Djëllãb
No ratings yet
1152CS239-Intro. To Data Science-Syllabus
Document6 pages
1152CS239-Intro. To Data Science-Syllabus
Shashitha Reddy
No ratings yet
6CS4 22 Machine Learning Lab Manual
Document46 pages
6CS4 22 Machine Learning Lab Manual
Pragati Bagul
50% (2)
ECE3502 IoT Time Series Analysis
Document11 pages
ECE3502 IoT Time Series Analysis
Karthik Reddy
No ratings yet
Object Oriented Programming Using C++
Document16 pages
Object Oriented Programming Using C++
prassu
No ratings yet
Ee382M - Vlsi I: Spring 2009 (Prof. David Pan) Final Project
Document13 pages
Ee382M - Vlsi I: Spring 2009 (Prof. David Pan) Final Project
sepritahara
No ratings yet
HY425 ProgAssignment2
Document3 pages
HY425 ProgAssignment2
Jimis Sougias
No ratings yet
Fnal+Report Advance+Statistics
Document44 pages
Fnal+Report Advance+Statistics
Pranav Viswanathan
100% (1)
Lab Manual: Department of Computer Engineering
Document65 pages
Lab Manual: Department of Computer Engineering
Rohit
No ratings yet
RANDOM FOREST (Binary Classification)
Document5 pages
RANDOM FOREST (Binary Classification)
Noor Ul Haq
No ratings yet
ENV11114 Modelling Wildlife Populations Assessment - Instruction Sheet Paul Ward (P.ward@napier - Ac.uk)
Document1 page
ENV11114 Modelling Wildlife Populations Assessment - Instruction Sheet Paul Ward (P.ward@napier - Ac.uk)
ΣΤΩΙΚΟΣ
No ratings yet
Question Bank
Document3 pages
Question Bank
Vivekanand Ks
No ratings yet
Subjective Visual Vertical in Vestibular Disorders Measured With The Bucket Test
Document9 pages
Subjective Visual Vertical in Vestibular Disorders Measured With The Bucket Test
camila hernandez
No ratings yet
Student Campus Placement Prediction Analysis Using ChiSquared Test On Machine Learning Algorithms-IJRASET
Document10 pages
Student Campus Placement Prediction Analysis Using ChiSquared Test On Machine Learning Algorithms-IJRASET
IJRASETPublications
No ratings yet
Machine Learning Methods for Credit Card Fraud Detection Using SMOTE and AdaBoost
Document9 pages
Machine Learning Methods for Credit Card Fraud Detection Using SMOTE and AdaBoost
Akhil Jabbar Meerja
No ratings yet
Data Mining
Document28 pages
Data Mining
GURUPADA PATI
No ratings yet
Parkinsons
Document22 pages
Parkinsons
Hrudhay Lucky
No ratings yet
Cpot PDF
Document5 pages
Cpot PDF
Ayub Esperanza
No ratings yet
Diagnosis of Malaria Using Double Hidden Layer Extreme Learning Machine Algorithm With CNN Feature Extraction and Parasite Inflator
Document14 pages
Diagnosis of Malaria Using Double Hidden Layer Extreme Learning Machine Algorithm With CNN Feature Extraction and Parasite Inflator
Md Nahiduzzaman
No ratings yet
Student Performance Prediction by Using Data Mining Classification Algorithms
Document6 pages
Student Performance Prediction by Using Data Mining Classification Algorithms
Mussie Tekle
No ratings yet
Credit Card Default Prediction: Final Project Report
Document28 pages
Credit Card Default Prediction: Final Project Report
aditghanekar10
No ratings yet
DDoS Attack Detection in SDNs Using Improved KNN and Degree of Attack
Document10 pages
DDoS Attack Detection in SDNs Using Improved KNN and Degree of Attack
Ahmed
No ratings yet
RAJIV RANJAN - 19!02!2023 - Predictive Modelling Project Report - Final
Document34 pages
RAJIV RANJAN - 19!02!2023 - Predictive Modelling Project Report - Final
Rajiv Ranjan
100% (1)
Employee Attrition Prediction
Document25 pages
Employee Attrition Prediction
user user
100% (1)
CSDS 440: Machine Learning Performance Evaluation Metrics
Document20 pages
CSDS 440: Machine Learning Performance Evaluation Metrics
Zaid Sulaiman
No ratings yet
Using Adaptive Alert Classification To Reduce False Positives in Intrusion Detection
Document24 pages
Using Adaptive Alert Classification To Reduce False Positives in Intrusion Detection
Dicki Setiawan
No ratings yet
2D Barcode Detection Using Ai
Document5 pages
2D Barcode Detection Using Ai
Leo RS
No ratings yet
Feature Analysis of Lymphomas Disease Prediction
Document13 pages
Feature Analysis of Lymphomas Disease Prediction
Kishore G
No ratings yet
AnalyticsEdge Rmanual PDF
Document44 pages
AnalyticsEdge Rmanual PDF
jtex2
No ratings yet
Apples-to-Apples in Cross-Validation Studies: Pitfalls in Classifier Performance Measurement
Document9 pages
Apples-to-Apples in Cross-Validation Studies: Pitfalls in Classifier Performance Measurement
Luis Martínez Ramírez
No ratings yet
Object Detection and Ship Classification Using YOLOv5
Document10 pages
Object Detection and Ship Classification Using YOLOv5
BOHR International Journal of Computer Science (BIJCS)
No ratings yet
Acerta White Paper Increasing Efficiency by Eliminating EOL
Document10 pages
Acerta White Paper Increasing Efficiency by Eliminating EOL
Mohammed Zuber
No ratings yet
Fingerprint Recognition System Using Artificial Neural Network As Feature Extractor: Design and Performance Evaluation
Document18 pages
Fingerprint Recognition System Using Artificial Neural Network As Feature Extractor: Design and Performance Evaluation
Desi Irfan Tan
No ratings yet
1904 08067
Document68 pages
1904 08067
Hitesh
No ratings yet
Telecom Customer Churn RV PDF
Document29 pages
Telecom Customer Churn RV PDF
Ramachandran Venkataraman
No ratings yet
Machine learning model predicts T2DM risk using HRV features
Document10 pages
Machine learning model predicts T2DM risk using HRV features
SuzanaPetrovic
No ratings yet
Accuracy of M2BPGi, Compared With Fibro Scan®, in Analysis of Liver Fibrosis in Patients With Hepatitis C
Document7 pages
Accuracy of M2BPGi, Compared With Fibro Scan®, in Analysis of Liver Fibrosis in Patients With Hepatitis C
alan lowus
No ratings yet
Cross-Domain Similarity Learning Improves Face Recognition
Document10 pages
Cross-Domain Similarity Learning Improves Face Recognition
Data LOG NGUYEN
No ratings yet
Isolation Forest
Document11 pages
Isolation Forest
Varun Nakra
No ratings yet
Classification Ppts 2021
Document80 pages
Classification Ppts 2021
PRIYA RATHORE
No ratings yet
Mini Project - Machine Learning - Tejas Nayak
Document65 pages
Mini Project - Machine Learning - Tejas Nayak
tejas_nayak
No ratings yet
FCSRTP - Preliminaryanalysis - 2021
Document7 pages
FCSRTP - Preliminaryanalysis - 2021
Priscila Selingardi
No ratings yet