Computer Science 3202/6915

Assignment 2 – Explore Regression Methods

Due date: Sunday February 18th by 11:30pm. (Closing date: March 10th at
midnight)

Learning goals:
1. Explore pre-processing techniques on a dataset.
2. Get familiar with the regression approaches available in scikit-learn.
3. Practice applying regression approaches.
4. Practice using cross-validation to select the best performing approach.

Instructions:
In the Brightspace folder for this assignment, there is a dataset available
(A2data.tsv). This dataset consists of 99 numeric inputs and one numeric label for
48 instances. The dataset is given as a tab-delimited text file with one instance
per line and a column header. The first 99 columns are the features and the last
column is the output/label.
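
As a minimal sketch of reading a file laid out this way, the snippet below uses pandas on a tiny in-memory stand-in for A2data.tsv. The column names, the sample values, and the helper name `load_features_and_label` are invented for illustration. Only the layout (tab-delimited, header row, label in the last column) comes from the description above.

```python
import io

import pandas as pd

# Tiny stand-in for A2data.tsv: 3 features instead of 99, 4 rows instead of 48.
# Column names and values are made up for illustration only.
SAMPLE_TSV = (
    "f1\tf2\tf3\tlabel\n"
    "0.1\t1.0\t5.0\t10.0\n"
    "0.2\t0.9\t4.0\t11.5\n"
    "0.3\t0.8\t3.0\t13.0\n"
    "0.4\t0.7\t2.0\t14.5\n"
)


def load_features_and_label(source):
    """Read a tab-delimited table with a header row.

    All columns except the last are treated as features; the last
    column is treated as the output/label.
    """
    df = pd.read_csv(source, sep="\t")
    X = df.iloc[:, :-1].to_numpy(dtype=float)
    y = df.iloc[:, -1].to_numpy(dtype=float)
    return X, y


X, y = load_features_and_label(io.StringIO(SAMPLE_TSV))
print(X.shape, y.shape)
```

For the real file, passing the path `"A2data.tsv"` instead of the `StringIO` object should give shapes of (48, 99) and (48,) if the file matches the description. Checking those shapes first is a cheap way to catch a wrong delimiter or a missing header.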

Your job is to work with this data to generate a regression model. You are allowed
to use any pre-processing technique, feature selection and regression method
available in scikit-learn. You are required to assess model performance using
cross-validation. These are the steps to complete this assignment:

1. Select, implement and assess the performance of a baseline regression
   approach. That is, use a simple regression method directly on the data as
   given (i.e., don’t do any pre-processing or feature selection) and obtain the
   cross-validation root mean square error (RMSE) of this baseline model.
   1. This is a small dataset (48 instances) so carefully consider which
      cross-validation scheme would be appropriate (10-fold CV, 5-fold CV,
      LOO-CV).
2. Generate at least two alternative regression models. You are allowed to
   choose how to create those alternative regression models. For example,
   applying a pre-processing technique and the same simple regression
   method you use in step 1 counts as an alternative regression model; or
   using a different regression method with the original data also counts as an
   alternative regression model, or any combination of pre-processing, feature
   selection and regression method counts as an alternative regression model.
3. Evaluate all the generated regression models using cross-validation and
   create a plot to visualize and compare the performance of the models
   (some suitable visualizations are box plots of the RMSE, interval plots of the
   RMSE, or scatterplots showing actual output vs predicted output).
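
The steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not a recommended solution: the data is a synthetic stand-in generated with `make_regression` (same 48 x 99 shape as the assignment data), and the three models, the 5-fold split, `alpha=1.0` and `k=10` are arbitrary choices made for the example. The `"neg_root_mean_squared_error"` scorer requires scikit-learn 0.22 or later.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import KFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in with the same shape as the assignment data (48 x 99).
# Replace with the real features and label once loaded.
X, y = make_regression(n_samples=48, n_features=99, n_informative=10,
                       noise=5.0, random_state=0)

# Candidate models (illustrative choices). Pre-processing and feature
# selection sit inside a pipeline so they are re-fitted on the training
# part of every fold and never see the held-out instances.
models = {
    "baseline: LinearRegression": LinearRegression(),
    "StandardScaler + Ridge": make_pipeline(StandardScaler(), Ridge(alpha=1.0)),
    "SelectKBest(10) + LinearRegression": make_pipeline(
        SelectKBest(score_func=f_regression, k=10), LinearRegression()
    ),
}

# The same shuffled 5-fold split is used for every model so the
# per-fold RMSE values are directly comparable.
cv = KFold(n_splits=5, shuffle=True, random_state=0)


def cv_rmse(model, X, y, cv):
    """Return the per-fold RMSE as an array of positive values."""
    scores = cross_val_score(model, X, y, cv=cv,
                             scoring="neg_root_mean_squared_error")
    return -scores  # scikit-learn reports the negated RMSE


fold_rmse = {name: cv_rmse(model, X, y, cv) for name, model in models.items()}

for name, rmse in fold_rmse.items():
    print(f"{name}: {rmse.mean():.2f} +/- {rmse.std():.2f}")
```

One point to keep in mind when choosing the scheme: with LOO-CV each test fold holds a single instance, so the per-fold RMSE is simply the absolute error on that instance, and the spread across folds reads differently than it does for 5-fold or 10-fold CV.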

Submission:
1. Python code used to complete this assignment. Include instructions on how
   to run your code.

2. A report in a single PDF file containing:
   1. Brief description and justification of your choice of methods.
      1. Explain which method you chose as the baseline and why.
      2. Explain which methods you chose to generate at least two
         alternative models and why.
   2. A screenshot of a run of your program showing its output.
   3. The data visualization(s) generated in step 3.
   4. A table with the average RMSE ± standard deviation per method.
   5. A brief concluding paragraph summarizing and interpreting your results.
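
One possible way to produce the table and a box plot from per-fold RMSE values is sketched below. The numbers are placeholders invented to show the formatting; they are not results from the assignment dataset. The method names and the helper `rmse_table` are likewise hypothetical. The sketch reports the sample standard deviation (`ddof=1`), which is one of two common conventions, so the report should state which one is used.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen so the script runs without a display
import matplotlib.pyplot as plt
import numpy as np

# Placeholder per-fold RMSE values, invented purely to show the formatting.
# They are NOT results from the assignment dataset.
fold_rmse = {
    "Method A": np.array([4.0, 5.0, 6.0, 5.0, 5.0]),
    "Method B": np.array([3.0, 3.5, 4.0, 3.5, 3.5]),
    "Method C": np.array([2.0, 4.0, 6.0, 4.0, 4.0]),
}


def rmse_table(results):
    """Format mean RMSE ± sample standard deviation, one row per method."""
    lines = [f"{'Method':<12}{'RMSE (mean ± std)':>20}"]
    for name, values in results.items():
        mean = values.mean()
        std = values.std(ddof=1)  # sample standard deviation across folds
        lines.append(f"{name:<12}{mean:>12.2f} ± {std:.2f}")
    return "\n".join(lines)


print(rmse_table(fold_rmse))

# Box plot of the per-fold RMSE, one box per method.
fig, ax = plt.subplots(figsize=(6, 4))
boxes = ax.boxplot(list(fold_rmse.values()))
ax.set_xticklabels(list(fold_rmse.keys()))
ax.set_ylabel("Cross-validation RMSE")
ax.set_title("RMSE per fold by method")
```

Calling `fig.savefig("rmse_boxplot.png")` afterwards writes the figure to disk for inclusion in the PDF report; the file name is only an example.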

Resources which might be useful:

1. Available methods in scikit-learn
   https://scikit-learn.org/stable/supervised_learning.html#supervised-learning
2. Cross-validation with linear regression
   https://www.kaggle.com/code/jnikhilsai/cross-validation-with-linear-regression

Winter 2024
