Welcome to Scribd!

Capstone Project-Naan Mudlvan

Uploaded by

0% found this document useful (0 votes)

2 views2 pages

The document provides instructions for a capstone project to build a predictive model using a CPU performance dataset. The dataset contains 209 rows and 9 columns with details on CPU vendors, models, specifications and performance scores. The tasks are to preprocess the data by encoding categorical variables, dropping rare vendors and the model column, split the data into train and test sets, build a linear regression model on the training set, calculate performance metrics on both sets, check for multicollinearity using VIF, and predict the performance score for a new CPU instance.

Original Description:

Upload

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

2 views2 pages

Capstone Project-Naan Mudlvan

Uploaded by

mohanraj28174

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Capstone Project

General Instructions:
1. Provide appropriate comments in your code.
2. Perform all task programmatically using Python libraries.

‘XYZ’ hardware service center is specialized in servicing the CPUs. The center has maintained the details
about the CPUs they have serviced which is available in “machine_data.csv”. The dataset has 209 rows
and 9 columns. [Source of raw dataset: UCI repository] The details of the columns are as follows:

 vendor: represents the manufacturer of the CPU.

 model: represents the model number of the CPU.
 cycle_time: represents the time taken for internal data transfer in nanoseconds of the CPU.
 min_memory: represents the minimum main memory required by the CPU.
 max_memory: represents the maximum main memory supported by the CPU.
 cache: represents the size of cache memory required by the CPU.
 min_threads: represents the number of threads that run in the CPU when it is just switched on.
 max_threads: represents the maximum number of threads that can be run on the CPU.
 score: represents the performance score of the CPU.

Based on this data, ‘XYZ’ hardware service center would like to build a predictive model that predicts the
performance score for the new CPUs.
As a data science expert, you are expected to build the best model for the given scenario.

Problem statement:

Perform the following activities to build the model:

1. Import the data set “machine_data.csv”.

2. As part of data preprocessing, perform the following activities:
a. Encode the categorical column – ‘vendor’ using label encoder.
b. Identify the vendors who have manufactured less than 5 CPUs and drop those rows from the
given dataset, corresponding to the identified vendors.
c. Drop the column ‘model’.

[Note: The preprocessed dataset should be used further.]

3. Select ‘score’ as the target variable to be predicted and remaining features as predictors.
4. Split the data into training and testing data set in the ratio 80:20.
5. As part of model building, perform the following activities:
a. Based on the training data, build a Linear Regression model.
b. Find the train and the test score for the built model.
c. Calculate the adjusted R-Squared values on both the train and the test data.
6. Calculate the VIF values for all the features considered while building the model using the train data.
7. Based on the model built, predict the performance score of a new test sample/ new CPU instance
which is given below:

(Note: In the above hardware instance, the vendor value '14' is the label encoded value for vendor
'harris'.)

SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
OCI 2023 Architect Associate 1Z0 1072 23
Document89 pages
OCI 2023 Architect Associate 1Z0 1072 23
saw andrew
No ratings yet
SAS Interview Questions You'll Most Likely Be Asked
From Everand
SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
C Programming for the Pc the Mac and the Arduino Microcontroller System
From Everand
C Programming for the Pc the Mac and the Arduino Microcontroller System
Peter D Minns
No ratings yet
System Refresh
Document6 pages
System Refresh
Subhash Reddy
No ratings yet
OCI 2023 Architect Associate 1Z0-1072-23
Document89 pages
OCI 2023 Architect Associate 1Z0-1072-23
RAJA SEKHAR REDDY REDDEM
0% (2)
Programming AVR Micro Controllers in C
Document50 pages
Programming AVR Micro Controllers in C
Dhishan Amaranath
No ratings yet
Strato2000 Service Manual - Rev3 PDF
Document325 pages
Strato2000 Service Manual - Rev3 PDF
Maximus Decimus Meridius
No ratings yet
1996 Seadoo GSX GTX Sup Manual
Document39 pages
1996 Seadoo GSX GTX Sup Manual
tonyeld
No ratings yet
Azure Cand c2 Project Starter Template
Document45 pages
Azure Cand c2 Project Starter Template
Utkarsh
100% (1)
Introduction To Variant Configuration With An Example Model
Document23 pages
Introduction To Variant Configuration With An Example Model
yash maurya
No ratings yet
AC40 1060 Service Manual
Document68 pages
AC40 1060 Service Manual
macrufo
No ratings yet
Network Simulator-3 (N S-3) practical Lab Manual
From Everand
Network Simulator-3 (N S-3) practical Lab Manual
Vikram Patalbansi
No ratings yet
Ubiquiti Home Network
Document132 pages
Ubiquiti Home Network
Christopher Matthew Pieper
100% (2)
M2S Ac391 An
Document16 pages
M2S Ac391 An
Farhan Bin Khalid
No ratings yet
EE204 - Computer Architecture Course Project
Document7 pages
EE204 - Computer Architecture Course Project
SuneelKumarChauhan
No ratings yet
cs239 Ejer1
Document2 pages
cs239 Ejer1
Donn Gamboa
No ratings yet
Cache-Assignment Handout 12
Document9 pages
Cache-Assignment Handout 12
sch123321
No ratings yet
Benchmarking Warehouse Workloads On The Data Lake Using Presto
Document13 pages
Benchmarking Warehouse Workloads On The Data Lake Using Presto
wilhelmjung
No ratings yet
Introduction To The SysOperation Framework
Document59 pages
Introduction To The SysOperation Framework
Koiti Takahashi
No ratings yet
SystemC Introduction
Document21 pages
SystemC Introduction
Muhammad Ismail
No ratings yet
Readme File For Code Example
Document12 pages
Readme File For Code Example
julionba
No ratings yet
Practice Final Exam
Document3 pages
Practice Final Exam
Thành Thảo
No ratings yet
VCenter Chargeback-Costing Calculator
Document8 pages
VCenter Chargeback-Costing Calculator
William DA
No ratings yet
Compass Tuning Parameters
Document10 pages
Compass Tuning Parameters
Yanal Kazan
No ratings yet
Project - Cache Organization and Performance Evaluation
Document9 pages
Project - Cache Organization and Performance Evaluation
adviful
No ratings yet
File 482621234 482621234 - Assignment 2 - 7378831553794248
Document5 pages
File 482621234 482621234 - Assignment 2 - 7378831553794248
Bob Philip
No ratings yet
Project 2012fall
Document5 pages
Project 2012fall
ATHIRA V R
No ratings yet
4.Cn Lab Manual 2020
Document41 pages
4.Cn Lab Manual 2020
shettyayush139
No ratings yet
CE018 Readme
Document7 pages
CE018 Readme
julionba
No ratings yet
Microsoft NET User Guide For IBM SPSS Statistics
Document46 pages
Microsoft NET User Guide For IBM SPSS Statistics
srini durvesh
No ratings yet
App-Note Asset Utilization
Document5 pages
App-Note Asset Utilization
Nick Edwards
No ratings yet
Sangfor Hci Sizing - Quotation Technical Training 20150919 For Emea
Document15 pages
Sangfor Hci Sizing - Quotation Technical Training 20150919 For Emea
Muhammad Sabir
No ratings yet
Thinking Outside The Box
Document9 pages
Thinking Outside The Box
sanketdange2007
No ratings yet
CNS Lab Manual
Document40 pages
CNS Lab Manual
Akshu Sushi
No ratings yet
Assignment 2
Document45 pages
Assignment 2
Utkarsh
No ratings yet
Term Project Advance Computer Architecture Spring 2017: Submission
Document3 pages
Term Project Advance Computer Architecture Spring 2017: Submission
Tofeeq Ur Rehman FASTNU
No ratings yet
SP (Simple Processor) Specification
Document4 pages
SP (Simple Processor) Specification
risirarocks
No ratings yet
2324 BigData Lab3
Document6 pages
2324 BigData Lab3
Elie Al Howayek
No ratings yet
PracticeProblems COA8e
Document40 pages
PracticeProblems COA8e
Anousith Phompida
No ratings yet
CPE 325: Embedded Systems Laboratory Laboratory Assignment #4
Document2 pages
CPE 325: Embedded Systems Laboratory Laboratory Assignment #4
Sheeraz Ali
No ratings yet
3 Cuda
Document5 pages
3 Cuda
manvitha thottempudi
No ratings yet
XPC Getting Started en
Document31 pages
XPC Getting Started en
Wessel Jacobs
No ratings yet
Steps For SAP Client
Document2 pages
Steps For SAP Client
Pradeep Kumar
No ratings yet
Readme
Document5 pages
Readme
Pande Upadana
No ratings yet
program Files/UGS/NX 7.5/MACH/resource/library/machine/ascii/machine - Database - Dat
Document5 pages
program Files/UGS/NX 7.5/MACH/resource/library/machine/ascii/machine - Database - Dat
JarekCholewa
No ratings yet
Assinmet&Case Study
Document19 pages
Assinmet&Case Study
santosh vighneshwar hegde
No ratings yet
Project 12spring
Document5 pages
Project 12spring
Farzin Ghotbi
No ratings yet
PRR - Vxworks
Document250 pages
PRR - Vxworks
TapasKumarDash
No ratings yet
Build Reliable Machine Learning Pipelines With Continuous Integration
Document22 pages
Build Reliable Machine Learning Pipelines With Continuous Integration
Pratyush Khare
No ratings yet
Kernrate
Document22 pages
Kernrate
Sandro Reimão
No ratings yet
Lecture Notes-Computer Architecture-Module 1
Document20 pages
Lecture Notes-Computer Architecture-Module 1
mokshagnanare26
No ratings yet
DS 8.5 Installation Guide PDF
Document519 pages
DS 8.5 Installation Guide PDF
Anil Karunakaran
No ratings yet
Programming For Problem Solving
Document95 pages
Programming For Problem Solving
seemakujur3377
No ratings yet
Lab 2
Document8 pages
Lab 2
SAURABH DEGDAWALA
No ratings yet
LoopBack Application
Document12 pages
LoopBack Application
Gytis Bernotas
No ratings yet
Optimizing Multiplayer Game Server Performance On Aws
Document46 pages
Optimizing Multiplayer Game Server Performance On Aws
Xristos Beretis
No ratings yet
Mini-Project Ele654
Document10 pages
Mini-Project Ele654
Mazlan
No ratings yet
Informatica Building Parameter Files Dynamically
Document2 pages
Informatica Building Parameter Files Dynamically
Narayana Ankireddypalli
No ratings yet
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
Document6 pages
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
IAEME Publication
No ratings yet
Design of An 8-Bit Bus-Oriented Computer
Document13 pages
Design of An 8-Bit Bus-Oriented Computer
Suresh Kothala
No ratings yet
IBM SVC Instructions - MiTrend
Document2 pages
IBM SVC Instructions - MiTrend
Chalks Dreaner Alvino Vela
No ratings yet
Cross Database Comparison: Enhancement Guide
Document35 pages
Cross Database Comparison: Enhancement Guide
Paratchana Jan
No ratings yet
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-4: AZ 104 EXAM STUDY GUIDE
From Everand
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-4: AZ 104 EXAM STUDY GUIDE
Devi Prasad
No ratings yet
Principales of Ac One Mark 5 Units NEW-1
Document11 pages
Principales of Ac One Mark 5 Units NEW-1
mohanraj28174
No ratings yet
Mobile Computing Full Notes
Document195 pages
Mobile Computing Full Notes
mohanraj28174
No ratings yet
A New Automobile Sales Marketing Model For Innovat
Document24 pages
A New Automobile Sales Marketing Model For Innovat
mohanraj28174
No ratings yet
Volume7 Issue2 Paper6 2023
Document17 pages
Volume7 Issue2 Paper6 2023
mohanraj28174
No ratings yet
Bca 2021 - 22
Document123 pages
Bca 2021 - 22
mohanraj28174
No ratings yet
Null 1
Document27 pages
Null 1
mohanraj28174
No ratings yet
PWM Freq Arduino Due
Document9 pages
PWM Freq Arduino Due
Edgar Eduardo Medina Castañeda
No ratings yet
Bads VendorCoProductCode 9750261061448617998
Document8 pages
Bads VendorCoProductCode 9750261061448617998
Intel Core I3
No ratings yet
S&Ais3000 PDF
Document150 pages
S&Ais3000 PDF
Narcis Patrascu
No ratings yet
9 - 28 - 0 - 0 - 40 - 5th Electrical DE&MP
Document165 pages
9 - 28 - 0 - 0 - 40 - 5th Electrical DE&MP
vijay kumar Gupta
No ratings yet
Avita Magus Bill From Flipkart
Document1 page
Avita Magus Bill From Flipkart
Anonymous zwCV8Z
No ratings yet
Memory Modul Part Number D2X533BW-X256
Document8 pages
Memory Modul Part Number D2X533BW-X256
Rendy Adam Farhan
No ratings yet
VXVM Commands
Document57 pages
VXVM Commands
rtirmazi
No ratings yet
As 34885 002 001
Document53 pages
As 34885 002 001
Yan Chen
No ratings yet
Cardinal 708
Document30 pages
Cardinal 708
Micky Boza
0% (1)
New Technology: Gadgets & Trends
Document31 pages
New Technology: Gadgets & Trends
dunscotus
No ratings yet
Cable ID Test Limit Length Headroom Date / Time: 06/04/2019 09:11:58 AM Testes Coop - FLW
Document6 pages
Cable ID Test Limit Length Headroom Date / Time: 06/04/2019 09:11:58 AM Testes Coop - FLW
Lucas Severo
No ratings yet
Sfra Megger
Document6 pages
Sfra Megger
Mehtab Ahmed
No ratings yet
Nanosynth: Reference Manual
Document48 pages
Nanosynth: Reference Manual
camlog
No ratings yet
EPM Manual Model 42i
Document347 pages
EPM Manual Model 42i
chaling3
No ratings yet
Ax 3000
Document2 pages
Ax 3000
Nadolu Marian
No ratings yet
Licensing Windows Desktop OS For Virtual Machines
Document6 pages
Licensing Windows Desktop OS For Virtual Machines
Yasin Sakarya
No ratings yet
14 Commands
Document18 pages
14 Commands
api-308151707
No ratings yet
Ccent (Icnd1)
Document44 pages
Ccent (Icnd1)
GuzganShobolan
No ratings yet
Jksimmet: James Didovich 2014
Document42 pages
Jksimmet: James Didovich 2014
InformationEmissary
100% (1)
History of Computer
Document53 pages
History of Computer
Xivaughn Sebastian
100% (1)
Role of IT in Library Automation and Networking
Document8 pages
Role of IT in Library Automation and Networking
Somvir
100% (1)
Computer Graphics 2
Document9 pages
Computer Graphics 2
Waguma Leticia
No ratings yet
Magnastart Brochure
Document4 pages
Magnastart Brochure
Dan Alexandru Neagu
No ratings yet
Travelmate 630: Service Guide
Document134 pages
Travelmate 630: Service Guide
scribdermaniac
No ratings yet
Avision AVA6 Plus Service Manual
Document46 pages
Avision AVA6 Plus Service Manual
Ramon Johnson
0% (1)
LG - 42pq10r Plasma TV
Document25 pages
LG - 42pq10r Plasma TV
Cristian Chanampa
No ratings yet