Dr. Peter Arndt, Dr. Konrad Völkel
Heinrich-Heine-Universität Düsseldorf
Winter Term 2022/23
Machine Learning
Exercise Sheet 10
(3 Exercises, 100 Points)
Due: 20.12.2022, 10:00

Exercise 1: Gradient boosting linear regression (25 Points)

We can use the gradient boosting approach with base models other than regression trees.
Explain under which circumstances gradient-boosted linear regression (with a second linear
regression predicting the residuals of the first) will work better or worse than a single
linear regression on the data, and explain why.
For the linear regressions considered here, do not use any kernels or basis function
expansions (so the model is purely linear).
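
The following is a minimal sketch of the two-stage setup on synthetic data, which you can
use to check your explanation empirically (the data and variable names are illustrative,
not part of the exercise statement):

    import numpy as np
    from sklearn.linear_model import LinearRegression

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 3))
    y = X @ np.array([1.5, -2.0, 0.5]) + rng.normal(scale=0.1, size=200)

    # Stage 1: an ordinary linear regression.
    first = LinearRegression().fit(X, y)
    residuals = y - first.predict(X)

    # Stage 2: a second linear regression fitted to the residuals.
    second = LinearRegression().fit(X, residuals)

    boosted_pred = first.predict(X) + second.predict(X)
    single_pred = LinearRegression().fit(X, y).predict(X)

    # Compare the two predictors, e.g. via mean squared error.
    print(np.mean((boosted_pred - y) ** 2))
    print(np.mean((single_pred - y) ** 2))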

Exercise 2: The number of trees hyperparameter (15 Points)

Explain what happens to a random forest classification model if, with all other
hyperparameters kept fixed, the number of trees in the ensemble grows (in the limit, to ∞).
Will the classification accuracy on a test set not used for training go up or down? What
about the variance?
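
A minimal sketch for observing this empirically (the synthetic data and the tree counts are
illustrative, not prescribed by the exercise):

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import train_test_split

    X, y = make_classification(n_samples=500, random_state=0)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # Grow the ensemble while keeping all other hyperparameters fixed.
    for n_trees in (1, 10, 100, 1000):
        clf = RandomForestClassifier(n_estimators=n_trees, random_state=0)
        clf.fit(X_train, y_train)
        print(n_trees, clf.score(X_test, y_test))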

Exercise 3: Tree ensemble methods on penguins (programming task, 60 Points)

(see the notebook ml-forests-companion.ipynb for clues on preparing the dataset)


The goal is to implement both Random Forest and AdaBoost for classification and to evaluate
your implementation by comparison with Scikit-Learn's on the Palmer Penguins dataset, after
finding suitable hyperparameters.
Proceed in these steps:
1. (5 points) Load and prepare the Palmer Penguins dataset for the classification task;
exclude the species 'Chinstrap' from the data to make it a binary classification problem.
2. (10 points) Find good hyperparameters for high accuracy using
sklearn.ensemble.RandomForestClassifier and
sklearn.ensemble.AdaBoostClassifier on the dataset prepared before (a sketch of
steps 1 and 2 follows at the end of the sheet).
3. (20+20 points) Using sklearn.tree.DecisionTreeClassifier, but none of the
sklearn.ensemble methods and classes, implement both the Random Forest and
AdaBoost algorithms (a rough AdaBoost skeleton also follows at the end of the sheet).
4. (5 points) Compare the performance of your own implementations against Scikit-Learn's,
using the hyperparameters found previously. If you see a notable difference, try to
explain it.
If you decide to implement only one of the two algorithms, you can still get up to 40 points
in this exercise.
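
As a starting point for steps 1 and 2, here is a minimal sketch. It assumes the dataset is
loaded via seaborn's load_dataset and that the four numeric measurements are used as
features; the companion notebook may prepare the data differently, and the parameter grid
is illustrative:

    import seaborn as sns
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import GridSearchCV, train_test_split

    # Step 1: load, drop missing values, exclude 'Chinstrap'.
    penguins = sns.load_dataset("penguins").dropna()
    penguins = penguins[penguins["species"] != "Chinstrap"]

    features = ["bill_length_mm", "bill_depth_mm",
                "flipper_length_mm", "body_mass_g"]
    X = penguins[features].to_numpy()
    y = (penguins["species"] == "Adelie").astype(int).to_numpy()

    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=0, stratify=y)

    # Step 2: grid search over an illustrative parameter grid;
    # sklearn.ensemble.AdaBoostClassifier can be tuned analogously.
    grid = GridSearchCV(
        RandomForestClassifier(random_state=0),
        {"n_estimators": [50, 100, 200], "max_depth": [2, 4, None]},
        cv=5)
    grid.fit(X_train, y_train)
    print(grid.best_params_, grid.best_score_)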
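
For the AdaBoost part of step 3, a rough skeleton of the classical binary algorithm; this
is a sketch assuming labels encoded as -1/+1, not a complete solution:

    import numpy as np
    from sklearn.tree import DecisionTreeClassifier

    def adaboost_fit(X, y, n_rounds=50):
        """Binary AdaBoost with decision stumps; y must contain -1/+1 labels."""
        n = len(y)
        w = np.full(n, 1.0 / n)              # uniform initial sample weights
        stumps, alphas = [], []
        for _ in range(n_rounds):
            stump = DecisionTreeClassifier(max_depth=1)
            stump.fit(X, y, sample_weight=w)
            pred = stump.predict(X)
            err = np.sum(w * (pred != y))    # weighted training error
            if err >= 0.5:                   # no better than chance: stop
                break
            err = max(err, 1e-10)            # guard against division by zero
            alpha = 0.5 * np.log((1 - err) / err)
            w *= np.exp(-alpha * y * pred)   # up-weight misclassified samples
            w /= w.sum()
            stumps.append(stump)
            alphas.append(alpha)
        return stumps, alphas

    def adaboost_predict(X, stumps, alphas):
        scores = sum(a * s.predict(X) for a, s in zip(alphas, stumps))
        return np.sign(scores)

With the labels from the previous sketch re-encoded as y = 2 * y - 1, this plugs directly
into the same train/test split.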
