
Introduction to

Machine Learning
Dr. Muhammad Amjad Iqbal
Associate Professor
University of Central Punjab, Lahore.
amjad.iqbal@ucp.edu.pk

https://sites.google.com/a/ucp.edu.pk/mai/iml/
Slides adapted from Prof. Dr. Andrew Ng (Stanford) and Dr. Humayoun
Regularization

The problem of overfitting
• So far we have seen a few learning algorithms (e.g. linear regression and
  logistic regression).
• They work well for many applications, but can suffer from the problem of
  overfitting.

Overfitting with linear regression
Example: Linear regression (housing prices)

[Figure: three Price vs. Size plots, showing a straight-line fit that underfits, a quadratic fit that works well, and a high-order polynomial fit that overfits]

Overfitting: If we have too many features, the learned hypothesis may fit the
training set very well (training cost J(θ) ≈ 0), but fail to generalize to new
examples (e.g. fail to predict prices for houses it has not seen).
The hypothesis has too many free parameters and is too variable, and we do not
have enough data to constrain it, so it does not give us a good hypothesis.
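
As an illustration (not part of the original slides), the short Octave sketch below fits a low-degree and a high-degree polynomial to a handful of made-up (size, price) points; the data values and degree choices are assumptions chosen only to show the overfitting behaviour:

% Sketch: low- vs. high-degree polynomial fits on a tiny, made-up dataset.
x = [1.0 1.5 2.0 2.5 3.0 3.5]';     % house sizes (arbitrary units)
y = [2.1 2.9 3.2 4.4 4.9 5.1]';     % prices (arbitrary units)

p1 = polyfit(x, y, 1);              % degree-1 fit: simple, may slightly underfit
p5 = polyfit(x, y, 5);              % degree-5 fit: passes through every point, overfits

xs = linspace(min(x), max(x), 100);
plot(x, y, 'o', xs, polyval(p1, xs), '-', xs, polyval(p5, xs), '--');
legend('training data', 'degree 1', 'degree 5');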
Example: Logistic regression

[Figure: three decision boundaries in the (x1, x2) plane, a straight line that underfits, a smooth curve that fits well, and a highly contorted boundary that overfits]

(g = sigmoid function)
Addressing overfitting:

Example: predicting house price from many features, e.g.
― size of house
― no. of bedrooms
― no. of floors
― age of house
― average income in neighborhood
― kitchen size
― ...

[Figure: Price vs. Size, showing an overfit high-order polynomial]

• Plotting the hypothesis is one way to decide whether overfitting occurs or not.
• But with lots of features and little data we cannot visualize it, and therefore it is:
  ― Hard to select the degree of the polynomial.
  ― Hard to decide which features to keep and which to drop.
Addressing overfitting:

Options:
1. Reduce the number of features (but this means losing information).
   ― Manually select which features to keep.
   ― Model selection algorithm (later in the course).
2. Regularization.
   ― Keep all the features, but reduce the magnitude/values of the parameters θj.
   ― Works well when we have a lot of features, each of which contributes a bit
     to predicting y.
Cost function

Intuition

[Figure: two Price vs. Size-of-house plots, a quadratic hypothesis θ0 + θ1x + θ2x² that fits well, and a quartic hypothesis θ0 + θ1x + θ2x² + θ3x³ + θ4x⁴ that overfits]

Suppose we penalize θ3 and θ4 and make them really small, e.g. by minimizing

  \min_{\theta} \; \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2 + 1000\,\theta_3^2 + 1000\,\theta_4^2

The only way to keep this cost small is to have θ3 ≈ 0 and θ4 ≈ 0, so the
quartic hypothesis effectively collapses to a quadratic, and we end up with a
simpler fit that generalizes better.
Regularization.
Small values for the parameters θ0, θ1, ..., θn
― “Simpler” hypothesis
― Less prone to overfitting

Housing example:
― Features: x1, x2, ..., xn
― Parameters: θ0, θ1, ..., θn

Unlike the polynomial example, we don't know in advance which are the high-order
(or otherwise unimportant) terms, so how do we pick the ones that need to be
shrunk? With regularization, we take the cost function and modify it to shrink
all the parameters.

By convention we don't penalize θ0: the regularization sum runs from θ1 onwards.
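
For reference, the regularized linear regression cost function being described here is the standard one:

  J(\theta) = \frac{1}{2m} \left[ \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2 + \lambda \sum_{j=1}^{n} \theta_j^2 \right]

Here λ is the regularization parameter (discussed below), and the second sum starts at j = 1, so θ0 is not penalized.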


Regularization.

• Using the regularized objective (i.e. the cost function with the
  regularization term), we get a much smoother curve that still fits the data
  and gives a much better hypothesis.

[Figure: Price vs. Size of house, with the regularized fit shown as a smooth curve instead of a wiggly overfit polynomial]

λ is the regularization parameter.
It controls a trade-off between our two goals:
1) We want to fit the training set well.
2) We want to keep the parameters small.
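
As a small sketch (not from the original slides), this is one way such a regularized cost could be computed in Octave; the function name and the assumption that X already contains a leading column of ones are mine:

% Sketch: regularized linear regression cost.
% X is m x (n+1) with a leading column of ones, y is m x 1, theta is (n+1) x 1.
function J = regularizedCost(X, y, theta, lambda)
  m = length(y);
  errors  = X * theta - y;                       % h_theta(x) - y for every example
  penalty = lambda * sum(theta(2:end) .^ 2);     % theta(1) (i.e. theta_0) is not penalized
  J = (sum(errors .^ 2) + penalty) / (2 * m);
end

With lambda = 0 this reduces to the unregularized cost from the earlier lectures; a larger lambda trades training-set fit for smaller parameters.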
In regularized linear regression, we choose θ to minimize the regularized cost
J(θ) given above.

What if λ is set to an extremely large value (perhaps too large for our problem,
say λ = 10^10)?
- Algorithm works fine; setting λ to be very large can't hurt it.
- Algorithm fails to eliminate overfitting.
- Algorithm results in underfitting (fails to fit even the training data well).
- Gradient descent will fail to converge.
Answer: with such a large λ, the penalty term dominates the cost, so minimizing
J(θ) drives θ1, θ2, ..., θn all towards 0. The hypothesis becomes

  h_\theta(x) \approx \theta_0

i.e. a flat, constant line: the algorithm underfits and fails to fit even the
training data well.

[Figure: Price vs. Size of house, where the hypothesis for a very large λ is essentially a horizontal line through the data]
Regularized linear regression

Regularized linear regression
Gradient descent

Repeat {
  \theta_0 := \theta_0 - \alpha \, \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_0^{(i)}
  \theta_j := \theta_j - \alpha \left[ \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)} + \frac{\lambda}{m} \theta_j \right]    (j = 1, 2, ..., n)   (regularized)
}

The θ0 update is the same as before, since θ0 is not penalized; for j ≥ 1 the
bracketed quantity is \frac{\partial}{\partial \theta_j} J(\theta) for the
regularized cost, which adds the extra term \frac{\lambda}{m} \theta_j.

The regularized update can also be written as

  \theta_j := \theta_j \left( 1 - \alpha \frac{\lambda}{m} \right) - \alpha \, \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}

Interesting term: (1 - αλ/m). Usually the learning rate α is small and m is
large, so this factor is just slightly less than 1 (e.g. 0.99); each iteration
first shrinks θj a little and then performs the usual gradient descent update.
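
A minimal Octave sketch of this update rule (not from the slides; the function name, fixed iteration count, and variable names are assumptions):

% Sketch: batch gradient descent for regularized linear regression.
% X is m x (n+1) with a leading column of ones, y is m x 1, theta is (n+1) x 1.
function theta = gradientDescentReg(X, y, theta, alpha, lambda, num_iters)
  m = length(y);
  for iter = 1:num_iters
    grad = (X' * (X * theta - y)) / m;      % unregularized gradient, (n+1) x 1
    reg  = (lambda / m) * theta;            % regularization term for every theta_j ...
    reg(1) = 0;                             % ... except theta_0 (stored in theta(1))
    theta = theta - alpha * (grad + reg);   % simultaneous update of all parameters
  end
end

Setting lambda = 0 recovers ordinary gradient descent for linear regression.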
Normal equation

Without regularization:

  \theta = \left( X^T X \right)^{-1} X^T y

With regularization, minimizing J(θ) in closed form gives

  \theta = \left( X^T X + \lambda M \right)^{-1} X^T y

where M is the (n+1) x (n+1) identity matrix with its top-left entry set to 0,
so that θ0 is not penalized.

Non-invertibility (optional/advanced).
Suppose m ≤ n (#examples ≤ #features).
Then X^T X is non-invertible / singular.
If λ > 0, the matrix X^T X + λM is invertible, so regularization also takes care
of the non-invertibility issue.
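
A hedged Octave sketch of this closed-form solution (the function name is an assumption):

% Sketch: regularized normal equation.
% X is m x (n+1) with a leading column of ones, y is m x 1.
function theta = normalEqnReg(X, y, lambda)
  p = size(X, 2);                             % p = n + 1 parameters
  M = eye(p);
  M(1, 1) = 0;                                % do not regularize the intercept theta_0
  theta = (X' * X + lambda * M) \ (X' * y);   % solve the regularized linear system
end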
Regularized logistic regression

Regularized logistic regression.

[Figure: a very wiggly decision boundary in the (x1, x2) plane, produced by a high-order polynomial hypothesis that overfits]

Cost function:

  J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log h_\theta(x^{(i)}) + \left( 1 - y^{(i)} \right) \log\left( 1 - h_\theta(x^{(i)}) \right) \right] + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2
Gradient descent

Repeat {
  \theta_0 := \theta_0 - \alpha \, \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_0^{(i)}
  \theta_j := \theta_j - \alpha \left[ \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)} + \frac{\lambda}{m} \theta_j \right]    (j = 1, 2, ..., n)   (regularized)
}

This looks identical to the update for regularized linear regression, but here
h_\theta(x) = \frac{1}{1 + e^{-\theta^T x}} (the sigmoid applied to θᵀx), so it
is a different algorithm.
Advanced optimization

function [jVal, gradient] = costFunction(theta)
  jVal = [ code to compute J(theta) ];
  gradient(1) = [ code to compute the partial derivative of J w.r.t. theta_0 ];
  gradient(2) = [ code to compute the partial derivative of J w.r.t. theta_1 ];
  gradient(3) = [ code to compute the partial derivative of J w.r.t. theta_2 ];
  ...
  gradient(n+1) = [ code to compute the partial derivative of J w.r.t. theta_n ];
end

(Octave indexes from 1, so gradient(1) corresponds to θ0 and gradient(n+1) to θn.)
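
As a concrete (hedged) sketch, this is how the template might be filled in for regularized logistic regression and passed to Octave's fminunc; the function name and variable names are assumptions:

% Sketch: regularized logistic regression cost and gradient for fminunc.
% X is m x (n+1) with a leading column of ones, y is m x 1 with 0/1 labels.
function [jVal, gradient] = costFunctionReg(theta, X, y, lambda)
  m = length(y);
  h = 1 ./ (1 + exp(-X * theta));                 % sigmoid hypothesis h_theta(x)
  reg_theta = [0; theta(2:end)];                  % exclude theta_0 from the penalty
  jVal = (-y' * log(h) - (1 - y)' * log(1 - h)) / m ...
         + (lambda / (2 * m)) * sum(reg_theta .^ 2);
  gradient = (X' * (h - y)) / m + (lambda / m) * reg_theta;
end

% Example usage (X, y, lambda assumed to be defined elsewhere):
% options = optimset('GradObj', 'on', 'MaxIter', 400);
% initialTheta = zeros(size(X, 2), 1);
% [optTheta, finalCost, exitFlag] = ...
%     fminunc(@(t) costFunctionReg(t, X, y, lambda), initialTheta, options);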

