Welcome to Scribd!

Assignment 2

Uploaded by

0% found this document useful (0 votes)

24 views2 pages

This document outlines the assignments for Machine Learning in Fall 2020. Part A involves using a dataset on congressional voting records to train a decision tree to predict political party. Students are asked to split the data, train on 25% and test on the rest, repeating with different splits, and report the tree sizes and accuracies. They are also asked to experiment with different training set sizes from 30-70% and report the mean, max, and min accuracies and tree sizes. Part B involves using a heart disease dataset to build an SVM model to predict whether patients have heart disease, experimenting with features, learning rates, and calculating accuracy on the test set. Students work in teams of 3-4 and submit by the given deadline

Original Description:

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

24 views2 pages

Assignment 2

Uploaded by

Rana Sayd

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Machine Learning – [Fall 2020]

Assignment 2

Part A – Decision Tree:

The data set available for this assignment is based on the U.S. congress voting record
from 1984. The data set consists of the votes (yes or no) on sixteen issues for each of
the 435 members of congress. from the voting record. You will use this data to learn a
decision tree that predicts the political party of the representative based on his /her vote.

1. Use the voting data to train a decision tree to predict political party (Democrat or
Republican) based on the voting record. Use 25% of the members of congress for
training and the rest for testing. Rerun this experiment five times and notice the
impact of different random splits of the data into training and test sets. Report the
sizes and accuracies of these trees in each experiment.

2. Measure the impact of training set size on the accuracy and the size of the learned
tree. Consider training set sizes in the range (30-70%). Because of the high variance
due to random splits repeat the experiment with five different random seeds for each
training set size then report the mean, maximum and minimum accuracies at each
training set size. Also measure the mean, max and min tree size.
 Start with training data size 30%, 40% … Until you reach 70%.
 The data set contained many missing values , i.e., votes in which a member of
congress failed to participate. To solve those issue insert—for each absent
vote—the voting decision of the majority.
Machine Learning – [Fall 2020]

Part B - SVM:
The attached dataset "heart.csv" contain 303 records of patients have heart disease or
not according to features in it. You are required to build SVM model to predict whether
patient have heart disease or not (target).

Cost Function:

Update Weights Equations:

1- If Point is correctly classified.

2- If Point is not correctly classified.

a) Split dataset into training and testing sets.

b) Try different set of features and choose the best features.
c) Try different values of learning rate and see how this changes the accuracy of the
model.
d) Implement accuracy function (correctly predicted values / test set size).
Note: The final grade is proportional to the accuracy of your results .

Important Notes:
 You can only use “pandas”, “numpy” and “matplotlib” libraries. (Don’t use “sklearn”)
 The maximum number of students in a team is 4 and minimum 3.
 No late submission is allowed.
 Cheating students will take negative grades and no excuses will be accepted.
 Deadline is on Sunday 10/1/2021 at 11:59 PM.

Project Submission Machine Learning - Ankit Bhagat - 8th Jan
Document36 pages
Project Submission Machine Learning - Ankit Bhagat - 8th Jan
ankitbhagat
100% (6)
Data Science Intervieew Questions
Document16 pages
Data Science Intervieew Questions
Satyam Anand
100% (1)
Data Science Interview Questions
Document68 pages
Data Science Interview Questions
Ava White
100% (1)
Lecture 1 Inferential Statistics
Document32 pages
Lecture 1 Inferential Statistics
Juliet
No ratings yet
I Am Sharing 'Interview' With You
Document65 pages
I Am Sharing 'Interview' With You
Branch Reed
100% (3)
Real Time Clock
Document44 pages
Real Time Clock
subucud
100% (1)
TQHQ GetBetter Proposal 102022
Document3 pages
TQHQ GetBetter Proposal 102022
Chandria Calderon
No ratings yet
Breast Cancer Classification
Document16 pages
Breast Cancer Classification
Tester
100% (2)
Bagging and Random Forest Presentation1
Document23 pages
Bagging and Random Forest Presentation1
endale
100% (2)
Sampling and Sample Size Determination
Document42 pages
Sampling and Sample Size Determination
Abhay Gupta
No ratings yet
Ats Company Profile
Document40 pages
Ats Company Profile
farhanfiksi
No ratings yet
Evaluation Method, Random Forest
Document6 pages
Evaluation Method, Random Forest
om gorantiwar
No ratings yet
The NTD Sampling Template FINAL
Document16 pages
The NTD Sampling Template FINAL
tatran0195
No ratings yet
II.1 Additional Exercises Sampling - With Responses
Document6 pages
II.1 Additional Exercises Sampling - With Responses
Parisa Rezaei
No ratings yet
1
Document4 pages
1
someoneelses
0% (1)
QM Statistic Notes
Document24 pages
QM Statistic Notes
Salim Abdulrahim Bafadhil
No ratings yet
Usl 70 Marks Set 1
Document2 pages
Usl 70 Marks Set 1
Roshan Kumar
No ratings yet
Laboratory Exercise No 3B
Document6 pages
Laboratory Exercise No 3B
Michael Abeleda
No ratings yet
Data Analytics Lab
Document46 pages
Data Analytics Lab
Anupriya Jain
No ratings yet
ML Exam
Document5 pages
ML Exam
Aravind Kumar Reddy
No ratings yet
12 Unit8
Document48 pages
12 Unit8
mesfin
No ratings yet
Learning Activity 1 Jigsaw
Document15 pages
Learning Activity 1 Jigsaw
api-321582699
No ratings yet
DP-Designing and Implementing
Document10 pages
DP-Designing and Implementing
Steven Doh
No ratings yet
Chapter IIIa
Document51 pages
Chapter IIIa
solomon asefa
No ratings yet
Farhad Salih Ahmed (Esam)
Document8 pages
Farhad Salih Ahmed (Esam)
bawanlava
No ratings yet
Decision Tree
Document26 pages
Decision Tree
Arpan Kumar
No ratings yet
Untitled
Document19 pages
Untitled
Aqsa Rani
No ratings yet
ML Mid Question Solve
Document19 pages
ML Mid Question Solve
md.anis molla
No ratings yet
Question Bank Research Methodology and Biostatistics BPT 402 1. Calculate Appropriate Measure of Skewness From The Following Data
Document17 pages
Question Bank Research Methodology and Biostatistics BPT 402 1. Calculate Appropriate Measure of Skewness From The Following Data
Atul Dahiya
No ratings yet
Machine Learning
Document6 pages
Machine Learning
Pravin Sakpal
No ratings yet
1.1 - Xgboost, GBboost, Adaboost - Boosting - Medium
Document6 pages
1.1 - Xgboost, GBboost, Adaboost - Boosting - Medium
pradeep dhote
No ratings yet
ML PPTS Merged
Document514 pages
ML PPTS Merged
PG
No ratings yet
Unit 2
Document63 pages
Unit 2
Mdv Prasad
No ratings yet
TR Rain Error
Document6 pages
TR Rain Error
VaggelarasB
No ratings yet
T-Test Formula For Thesis
Document4 pages
T-Test Formula For Thesis
oxylhkxff
100% (2)
RV College of Engineering, Bengaluru-59: Self-Study Component-2
Document20 pages
RV College of Engineering, Bengaluru-59: Self-Study Component-2
Alishare Muhammed Akram
No ratings yet
Unit 1 Machine Learning
Document10 pages
Unit 1 Machine Learning
sahugungun76
No ratings yet
Interview Questions For DS & DA (ML)
Document66 pages
Interview Questions For DS & DA (ML)
pratikmovie999
100% (1)
XSTK Câu hỏi
Document19 pages
XSTK Câu hỏi
imaboneofmysword
No ratings yet
8614 (1) - 1
Document17 pages
8614 (1) - 1
Saqib Khalid
No ratings yet
Ranking Features Based On Predictive Power - Importance of The Class Labels
Document11 pages
Ranking Features Based On Predictive Power - Importance of The Class Labels
Juan
No ratings yet
AI Capstone Project - Notes-Part2
Document8 pages
AI Capstone Project - Notes-Part2
minha.fathima737373
No ratings yet
Data Science
Document17 pages
Data Science
Nabajit
No ratings yet
Workflow of A Machine Learning Project
Document12 pages
Workflow of A Machine Learning Project
ashish
No ratings yet
Buslytc Reviewer Terms For Quiz One
Document6 pages
Buslytc Reviewer Terms For Quiz One
Jeff Clinton Lim
No ratings yet
Only Quat
Document8 pages
Only Quat
BI11-286 Nguyễn Xuân Vinh
No ratings yet
STAT 2300 General Syllabus
Document6 pages
STAT 2300 General Syllabus
jbkj
No ratings yet
8614 - Assignment 2 Solved (AG)
Document19 pages
8614 - Assignment 2 Solved (AG)
Maria Gillani
No ratings yet
Mba - Iii Sem Research Methodology - MB0034 Set - 2
Document24 pages
Mba - Iii Sem Research Methodology - MB0034 Set - 2
Dilipk86
No ratings yet
Interview Questions
Document67 pages
Interview Questions
vaishnav Jyothi
100% (1)
AI & ML Notes
Document22 pages
AI & ML Notes
karthik singarao
No ratings yet
Data Analysis
Document12 pages
Data Analysis
api-484250989
No ratings yet
Unit 4
Document24 pages
Unit 4
Abhirami K
No ratings yet
Week 7 - Tree-Based Model
Document8 pages
Week 7 - Tree-Based Model
Nguyễn Trường Sơn
100% (1)
How To Conduct A Measurement Systems Analysis
Document5 pages
How To Conduct A Measurement Systems Analysis
Navnath Tamhane
No ratings yet
BA Notes From Lecture
Document9 pages
BA Notes From Lecture
akashsharma9011328268
No ratings yet
Definition: Random Sampling Is A Part of The Sampling Technique in Which Each Sample
Document5 pages
Definition: Random Sampling Is A Part of The Sampling Technique in Which Each Sample
May Joy Calcaben
No ratings yet
Assignment 1
Document3 pages
Assignment 1
amanuel29831
No ratings yet
Report Ainque...... GUMAPI
Document85 pages
Report Ainque...... GUMAPI
Angelica Talastasin
No ratings yet
Ljmba Sem-I Subject - Business Statistics Text Book: Business Statistics by Ken Black (5 Edition)
Document29 pages
Ljmba Sem-I Subject - Business Statistics Text Book: Business Statistics by Ken Black (5 Edition)
anon_892666860
No ratings yet
Random Forest Algorithms - Comprehensive Guide With Examples
Document13 pages
Random Forest Algorithms - Comprehensive Guide With Examples
faria shahzadi
No ratings yet
FROM DR Neerja Nigam
Document75 pages
FROM DR Neerja Nigam
amankhore86
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
5 - 7 M A R C H 2 0 1 9: E-Catalogue
Document423 pages
5 - 7 M A R C H 2 0 1 9: E-Catalogue
jm
No ratings yet
Introduction To Measurement and Instrumentation System
Document38 pages
Introduction To Measurement and Instrumentation System
Faizal Abdullah
No ratings yet
Autopilot NP2035 - Operator Manual
Document159 pages
Autopilot NP2035 - Operator Manual
Vipin Krishnan
No ratings yet
KD318-MUI ADSL Router
Document15 pages
KD318-MUI ADSL Router
sebastian valero
No ratings yet
Lecture Notes ELE353 Chap2-4 Spr14
Document6 pages
Lecture Notes ELE353 Chap2-4 Spr14
George Nicholas Rahwan
No ratings yet
TCS BPS Intro & Non Voice JD
Document2 pages
TCS BPS Intro & Non Voice JD
Śhűbhãm Mhąšķë
No ratings yet
Jasir - Draftsman Cun Doc Controller
Document2 pages
Jasir - Draftsman Cun Doc Controller
Kadayikkal Abdul Majid
No ratings yet
Sony HDW-F900R CineAlta 24P HDCAM Package HDWF900RPAC1D
Document179 pages
Sony HDW-F900R CineAlta 24P HDCAM Package HDWF900RPAC1D
Komlósi Zsolt
No ratings yet
Social Media and Legal Problems
Document14 pages
Social Media and Legal Problems
Abhinandan Ray
No ratings yet
Fuzzy Twin Support Vector Machines With Distribution Inputs
Document15 pages
Fuzzy Twin Support Vector Machines With Distribution Inputs
jacasil293
No ratings yet
CV YUNITA KURNIAWATI - 2020-Digabungkan PDF
Document10 pages
CV YUNITA KURNIAWATI - 2020-Digabungkan PDF
Yunita Kurniawati
No ratings yet
K-Means Clustering: CMPUT 615 Applications of Machine Learning in Image Analysis
Document13 pages
K-Means Clustering: CMPUT 615 Applications of Machine Learning in Image Analysis
Thilaga Mohan
No ratings yet
Icf Grade 9 4TH Quarter Exam
Document2 pages
Icf Grade 9 4TH Quarter Exam
Chariss Joy Lacaya
100% (3)
Nokia Bluetooth Headset BH-803 User Guide: Issue 1 EN
Document17 pages
Nokia Bluetooth Headset BH-803 User Guide: Issue 1 EN
ahmetSaracoglu
No ratings yet
Motilal Nehru National Institute of Technology, Allahabad: Assignment On Network Layer
Document4 pages
Motilal Nehru National Institute of Technology, Allahabad: Assignment On Network Layer
Swati Agarwal
No ratings yet
Ecc PDF
Document68 pages
Ecc PDF
Umesh Chandra Parasar
No ratings yet
Cadence Datasheet
Document1,327 pages
Cadence Datasheet
Sujit Kumar
No ratings yet
Capitulo 1
Document10 pages
Capitulo 1
maria
No ratings yet
L3 (Concavity& Inflection Points) Filled
Document7 pages
L3 (Concavity& Inflection Points) Filled
Wilson Zhang
No ratings yet
HD-VP410 Spec. V3.1 and User Manual
Document7 pages
HD-VP410 Spec. V3.1 and User Manual
Company BIM design
No ratings yet
Pure Sine Wave Inverter Design: Proteus ISIS
Document5 pages
Pure Sine Wave Inverter Design: Proteus ISIS
Tatenda Bizure
No ratings yet
An Online Headphone Screening Test Based On Dichotic Pitch
Document13 pages
An Online Headphone Screening Test Based On Dichotic Pitch
Anthi092
No ratings yet
New Text Document
Document6 pages
New Text Document
RATON
No ratings yet
POTRAZ 3rd - Quarter - 2017 PDF
Document24 pages
POTRAZ 3rd - Quarter - 2017 PDF
Jotham Nyamande
No ratings yet
Risk Management in Emerging Payments
Document23 pages
Risk Management in Emerging Payments
Anonymous cNEs3g0
100% (1)
Specialist Switch Requirement BoQ
Document6 pages
Specialist Switch Requirement BoQ
Lohit Yadav
No ratings yet
Book 1
Document3 pages
Book 1
Duncan Wekesa
No ratings yet