
Overfitting in ML: How to prevent it

What is Overfitting?
“A model overfits when it fits the training data too well, achieving a minimal sum
of squared errors, but fails to perform well on the test dataset.” Overfitting
happens when a model learns the details and noise in the training data to the
extent that it negatively impacts the model’s performance on unseen data.
Let’s look at it visually.
In Linear Regression, we would like our model to follow a line like the following:

Even though the overall cost is not minimal, the line above fits the trend very
well, making the model reliable. Say we want to infer an output for an input value
that is not present in the data set (i.e. generalize). The line above could give a
very plausible prediction for the new input because, in machine-learning terms,
the outputs are expected to follow the trend seen in the training set.
If our model produces a trend line like the one below, it is overfitting:

If the model above achieves a very small sum of squared errors by fitting a line
through every single point, it has captured all the noise in the data, and it is
surely not going to fit the test data. We call this a model with high variance.
The small sketch below makes the train/test gap concrete.

How to Prevent It:


1) Regularization: In machine learning, regularization is the process of
constraining or shrinking the coefficient estimates towards zero so the model
cannot contort itself to fit noise.
Looking at the green curve, we can clearly say it is overfit. Below are the
equations for both curves:

Curve 1 = -x^4 + 7x^3 - 5x^2 + 31x + 30
Curve 2 = (1/5)x^4 + (7/5)x^3 - x^2 + (31/5)x + 30
Larger coefficients let the curve bend through every point, which leads to
overfitting, so we regularize (shrink) the coefficients to avoid the problem,
as the sketch below shows.
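To make the shrinking visible, here is a sketch using scikit-learn's Ridge (an L2 penalty); the synthetic data and the alpha value are illustrative assumptions, not from the article. The penalized fit keeps the same degree-15 form but with far smaller coefficients:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = np.sort(rng.uniform(0, 1, 20)).reshape(-1, 1)
y = np.sin(2 * np.pi * X).ravel() + rng.normal(0, 0.2, 20)

# The same degree-15 fit, unpenalized versus with an L2 (Ridge)
# penalty; alpha controls how strongly coefficients shrink to zero.
plain = make_pipeline(PolynomialFeatures(15), LinearRegression()).fit(X, y)
ridge = make_pipeline(PolynomialFeatures(15), Ridge(alpha=1.0)).fit(X, y)

print("largest |coef|, unregularized:", np.abs(plain[-1].coef_).max())
print("largest |coef|, ridge:        ", np.abs(ridge[-1].coef_).max())
```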

2) Cross Validation
In this technique we split the training data into multiple mini train-test splits.
These splits are then used to tune the model, so every tuning decision is scored
on data the model did not fit; see the sketch below.
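Here is a minimal sketch of k-fold cross validation with scikit-learn (the diabetes dataset and the alpha grid are stand-ins chosen for illustration). Each fold serves once as the mini test split while the model trains on the rest, so every score reflects unseen data:

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

X, y = load_diabetes(return_X_y=True)

# 5-fold CV: pick the regularization strength that generalizes best
# across all five held-out folds, not the one that fits best once.
for alpha in (0.01, 0.1, 1.0):
    scores = cross_val_score(Ridge(alpha=alpha), X, y, cv=5, scoring="r2")
    print(f"alpha={alpha}: mean R^2 = {scores.mean():.3f}")
```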

3) Remove features
Removing irrelevant features from the model is a direct way to curb overfitting.
Multicollinearity should also be checked thoroughly as part of this step; one way
to do that is sketched below.
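One common way to check for multicollinearity is the variance inflation factor (VIF); here is a sketch assuming the statsmodels package is available (the diabetes dataset is again just a stand-in):

```python
import pandas as pd
import statsmodels.api as sm
from sklearn.datasets import load_diabetes
from statsmodels.stats.outliers_influence import variance_inflation_factor

# Add an intercept column so each feature's VIF is computed from a
# properly specified regression on the other features.
X = sm.add_constant(load_diabetes(as_frame=True).data)

# VIF measures how well each feature is predicted by the others;
# values well above ~5-10 flag near-redundant features that are
# candidates for removal.
vif = pd.Series(
    [variance_inflation_factor(X.values, i) for i in range(X.shape[1])],
    index=X.columns,
)
print(vif.drop("const").sort_values(ascending=False))
```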
