0% found this document useful (0 votes)

38 views6 pages

Regularization Notes

Uploaded by

louisblair938

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views6 pages

Regularization Notes

Uploaded by

louisblair938

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks

Search... Sign In

Python for Machine Learning Machine Learning with R Machine Learning Algorithms EDA Math for M

Regularization in Machine Learning

Last Updated : 18 Sep, 2025

Regularization is a technique used in machine learning to prevent

overfitting and performs poorly on unseen data. By adding a penalty for
complexity, regularization encourages simpler, more generalizable
models.

Prevents overfitting: Adds constraints to the model to reduce the risk

of memorizing noise in the training data.
Improves generalization: Encourages simpler models that perform
better on new, unseen data.

Types of Regularization
There are mainly 3 types of regularization techniques, each applying
penalties in different ways to control model complexity and improve
generalization.

1. Lasso Regression

A regression model which uses the L1 Regularization technique is called

LASSO (Least Absolute Shrinkage and Selection Operator) regression.
It adds the absolute value of magnitude of the coefficient as a penalty
term to the loss function(L). This penalty can shrink some coefficients to
zero which helps in selecting only the important features and ignoring
the less important ones.

1 n m
Cost = n
∑i=1 (yi − y^i )2 + λ ∑i=1 ∣wi ∣

[Link] 1/7
9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks

Where

m - Number of Features
n- Number of Examples
yi- Actual Target Value
y^i - Predicted Target Value

Lets see how to implement this using python:

X, y = make_regression(n_samples=100, n_features=5, noise=0.1,

random_state=42) : Generates a regression dataset with 100
samples, 5 features and some noise.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42) : Splits the data into 80% training and 20% testing
sets.
lasso = Lasso(alpha=0.1) : Creates a Lasso regression model with
regularization strength alpha set to 0.1.

from sklearn.linear_model import Lasso

from sklearn.model_selection import train_test_split
from [Link] import make_regression
from [Link] import mean_squared_error

X, y = make_regression(n_samples=100, n_features=5, noise=0.1,

random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

lasso = Lasso(alpha=0.1)
[Link](X_train, y_train)

y_pred = [Link](X_test)

mse = mean_squared_error(y_test, y_pred)

print(f"Mean Squared Error: {mse}")

print("Coefficients:", lasso.coef_)

Output:

Lasso Regression

[Link] 2/7
9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks

The output shows the model's prediction error and the importance of
features with some coefficients reduced to zero due to L1 regularization.

2. Ridge Regression

A regression model that uses the L2 regularization technique is called

Ridge regression. It adds the squared magnitude of the coefficient as a
penalty term to the loss function(L). It handles multicollinearity by
shrinking the coefficients of correlated features instead of eliminating
them.

Cost = 1
n
∑ni=1 (yi − y^i )2 + λ ∑m

2
i=1 wi

Where,

n= Number of examples or data points

m = Number of features i.e predictor variables
yi = Actual target value for the ith example

y^i = Predicted target value for the ith example

wi = Coefficients of the features

λ= Regularization parameter that controls the strength of

regularization

Lets see how to implement this using python:

ridge = Ridge(alpha=1.0) : Creates a Ridge regression model with

regularization strength alpha set to 1.0.

from sklearn.linear_model import Ridge

from [Link] import make_regression
from sklearn.model_selection import train_test_split
from [Link] import mean_squared_error

X, y = make_regression(n_samples=100, n_features=5, noise=0.1,

random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

ridge = Ridge(alpha=1.0)
[Link] 3/7
9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks
[Link](X_train, y_train)
y_pred = [Link](X_test)

mse = mean_squared_error(y_test, y_pred)

print("Mean Squared Error:", mse)
print("Coefficients:", ridge.coef_)

Output:

Ridge Regression

The output shows the MSE showing model performance. Lower MSE
means better accuracy. The coefficients reflect the regularized feature
weights.

3. Elastic Net Regression

Elastic Net Regression is a combination of both L1 as well as L2

regularization. That shows that we add the absolute norm of the
weights as well as the squared measure of the weights. With the help
of an extra hyperparameter that controls the ratio of the L1 and L2
regularization.

Cost = 1
n
∑ni=1 (yi − y^i )2 + λ ((1 − α) ∑m

m 2
i=1 ∣wi ∣ + α ∑i=1 wi

Where

n = Number of examples (data points)

m = Number of features (predictor variables)
yi = Actual target value for the ith example

y^i = Predicted target value for the ith example

wi= Coefficients of the features

λ= Regularization parameter that controls the strength of
regularization
α= Mixing parameter where 0 ≤ α ≤ 1 and α= 1 corresponds to
Lasso (L1 ) regularization, α= 0 corresponds to Ridge (L2 )

[Link] 4/7
9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks

regularization and Values between 0 and 1 provide a balance of both

L1 and L2 regularization

Lets see how to implement this using python:

model = ElasticNet(alpha=1.0, l1_ratio=0.5) : Creates an Elastic Net

model with regularization strength alpha=1.0 and L1/L2 mixing ratio
0.5.

from sklearn.linear_model import ElasticNet

from [Link] import make_regression
from sklearn.model_selection import train_test_split
from [Link] import mean_squared_error

X, y = make_regression(n_samples=100, n_features=10, noise=0.1,

random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

model = ElasticNet(alpha=1.0, l1_ratio=0.5)

[Link](X_train, y_train)

y_pred = [Link](X_test)
mse = mean_squared_error(y_test, y_pred)

print("Mean Squared Error:", mse)

print("Coefficients:", model.coef_)

Output:

Elastic Net Regression

The output shows MSE which measures how far off predictions are from
actual values (lower is better) and coefficients show feature importance.

Benefits of Regularization
Now, let’s see various benefits of regularization which are as follows:

1. Prevents Overfitting: Regularization helps models focus on

underlying patterns instead of memorizing noise in the training data.
2. Improves Interpretability: L1 (Lasso) regularization simplifies models
by reducing less important feature coefficients to zero.
[Link] 5/7
9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks

3. Enhances Performance: Prevents excessive weighting of outliers or

irrelevant features helps in improving overall model accuracy.
4. Stabilizes Models: Reduces sensitivity to minor data changes which
ensures consistency across different data subsets.
5. Prevents Complexity: Keeps model from becoming too complex which
is important for limited or noisy data.
6. Handles Multicollinearity: Reduces the magnitudes of correlated
coefficients helps in improving model stability.
7. Allows Fine-Tuning: Hyperparameters like alpha and lambda control
regularization strength helps in balancing bias and variance.
8. Promotes Consistency: Ensures reliable performance across different
datasets which reduces the risk of large performance shifts.

Learn more about the difference between the regularization

techniques here: Lasso vs Ridge vs Elastic Net

Comment More info

[Link] 6/7

ML - Perplexity
No ratings yet
ML - Perplexity
71 pages
Regularization - Ridge and Lasso
No ratings yet
Regularization - Ridge and Lasso
7 pages
Machine Learning Notes ?
No ratings yet
Machine Learning Notes ?
64 pages
Machine Learning Regularization Techniques
No ratings yet
Machine Learning Regularization Techniques
13 pages
Types of Machine Learning
No ratings yet
Types of Machine Learning
63 pages
Linear Regression in Machine Learning
No ratings yet
Linear Regression in Machine Learning
38 pages
Feature Selection
No ratings yet
Feature Selection
19 pages
Machine Learning Optimization Techniques
No ratings yet
Machine Learning Optimization Techniques
49 pages
2.1 Supervised Regression
No ratings yet
2.1 Supervised Regression
26 pages
Lecture 1.5-1.6
No ratings yet
Lecture 1.5-1.6
23 pages
Linear Models: Regression & SVM Techniques
No ratings yet
Linear Models: Regression & SVM Techniques
92 pages
Regularization Techniques for Linear Models
No ratings yet
Regularization Techniques for Linear Models
23 pages
04 Linear
No ratings yet
04 Linear
31 pages
Mastering The Basics of Machine Learning
No ratings yet
Mastering The Basics of Machine Learning
65 pages
Machine Learning: Bias-Variance Trade-off
No ratings yet
Machine Learning: Bias-Variance Trade-off
62 pages
Locally Weighted Regression Guide
No ratings yet
Locally Weighted Regression Guide
6 pages
ML PYQs
No ratings yet
ML PYQs
32 pages
Linear and Non-linear Regression Methods
No ratings yet
Linear and Non-linear Regression Methods
13 pages
Overfitting and Underfitting in ML
No ratings yet
Overfitting and Underfitting in ML
10 pages
Understanding Linear Regression & Regularization
No ratings yet
Understanding Linear Regression & Regularization
4 pages
Understanding Machine Learning Concepts
No ratings yet
Understanding Machine Learning Concepts
25 pages
L11+ Regularization
No ratings yet
L11+ Regularization
24 pages
Overfitting Underfitting: UNIT 2: Optimization and Regularization in Neural Networks
No ratings yet
Overfitting Underfitting: UNIT 2: Optimization and Regularization in Neural Networks
18 pages
CS2011 2
No ratings yet
CS2011 2
14 pages
Regularization
No ratings yet
Regularization
19 pages
Lecture+Notes+-+Advanced+Regression
No ratings yet
Lecture+Notes+-+Advanced+Regression
12 pages
B Ridge - and - Lasso - Regression
No ratings yet
B Ridge - and - Lasso - Regression
5 pages
UC Berkeley Machine Learning Guide
No ratings yet
UC Berkeley Machine Learning Guide
185 pages
ML Labs
No ratings yet
ML Labs
46 pages
Machine Learning Optimization Techniques
No ratings yet
Machine Learning Optimization Techniques
70 pages
9 - Linear Regression-Problems and Solutions
No ratings yet
9 - Linear Regression-Problems and Solutions
23 pages
Regression
No ratings yet
Regression
56 pages
Parameter Norm Penalties
No ratings yet
Parameter Norm Penalties
6 pages
UC Berkeley ML Course Guide
100% (1)
UC Berkeley ML Course Guide
185 pages
Linear Regression Lab: Methods & Examples
100% (1)
Linear Regression Lab: Methods & Examples
18 pages
NNDL Notes
No ratings yet
NNDL Notes
73 pages
Perceptron and Linear Regression Exercises
No ratings yet
Perceptron and Linear Regression Exercises
3 pages
Overfitting in Linear Regression
No ratings yet
Overfitting in Linear Regression
8 pages
Optimizing Model Complexity in ML
No ratings yet
Optimizing Model Complexity in ML
32 pages
CH - En.u4cse19101 Cheduri Linearregression
No ratings yet
CH - En.u4cse19101 Cheduri Linearregression
8 pages
Regularization in Machine Learning
No ratings yet
Regularization in Machine Learning
5 pages
Machine Learning Algorithms Overview
No ratings yet
Machine Learning Algorithms Overview
59 pages
Ridge and Lasso Regresssion
No ratings yet
Ridge and Lasso Regresssion
22 pages
Linear Regression and Regularization Techniques
No ratings yet
Linear Regression and Regularization Techniques
61 pages
Data Fitting and Uncertainty (A Practical Introduction To Weighted Least Squares and Beyond)
No ratings yet
Data Fitting and Uncertainty (A Practical Introduction To Weighted Least Squares and Beyond)
6 pages
Berkeley Machine Learning
No ratings yet
Berkeley Machine Learning
185 pages
M1 L4 RR1 (Section 3.7)
No ratings yet
M1 L4 RR1 (Section 3.7)
7 pages
Lecture 7
No ratings yet
Lecture 7
29 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
Metrics for Machine Learning Models
No ratings yet
Metrics for Machine Learning Models
20 pages
Optimizers for Parameterized Models
No ratings yet
Optimizers for Parameterized Models
3 pages
Linear Regression Analysis and MSE Evaluation
No ratings yet
Linear Regression Analysis and MSE Evaluation
5 pages
Understanding the Least Squares Method
No ratings yet
Understanding the Least Squares Method
36 pages
Linear Regression with Python OLS
No ratings yet
Linear Regression with Python OLS
23 pages
Machine Learning Insem-01 QP
No ratings yet
Machine Learning Insem-01 QP
6 pages
CH 4
No ratings yet
CH 4
39 pages
Regularisation in Machine Learning Models
No ratings yet
Regularisation in Machine Learning Models
79 pages
Updating Weight
No ratings yet
Updating Weight
9 pages
TensorFlow Convolutional Neural Networks
No ratings yet
TensorFlow Convolutional Neural Networks
2 pages
SQL Server Encryption Guide
No ratings yet
SQL Server Encryption Guide
16 pages
Course Plan - Lab - FDS
No ratings yet
Course Plan - Lab - FDS
3 pages
4cspl2041 - Introduction To Machine Learning
No ratings yet
4cspl2041 - Introduction To Machine Learning
6 pages
NLP Assignment-2 Solution
100% (3)
NLP Assignment-2 Solution
5 pages
Binary to BCD Conversion Guide
No ratings yet
Binary to BCD Conversion Guide
5 pages
Alan Turing Code Breaker Challenge - Ver - 1
No ratings yet
Alan Turing Code Breaker Challenge - Ver - 1
2 pages
Thesis 307 PDF
No ratings yet
Thesis 307 PDF
198 pages
06 3 Sigma Limit
No ratings yet
06 3 Sigma Limit
28 pages
Transforming Linear Equations Guide
No ratings yet
Transforming Linear Equations Guide
2 pages
AutoML Seminar Report for B-Tech
No ratings yet
AutoML Seminar Report for B-Tech
34 pages
Boosting Feature Selection for Classification
No ratings yet
Boosting Feature Selection for Classification
9 pages
Solving Linear Programming Graphically
No ratings yet
Solving Linear Programming Graphically
30 pages
Introduction to Fuzzy Logic Concepts
No ratings yet
Introduction to Fuzzy Logic Concepts
68 pages
Extreme Risk Analytics Course Overview
No ratings yet
Extreme Risk Analytics Course Overview
2 pages
NM Assignment Questions (AutoRecovered)
No ratings yet
NM Assignment Questions (AutoRecovered)
7 pages
BA Abridged
No ratings yet
BA Abridged
328 pages
Magnetic Susceptibility Inversion Guide
No ratings yet
Magnetic Susceptibility Inversion Guide
5 pages
Unit 5 Review: Adding & Subtracting Fractions
No ratings yet
Unit 5 Review: Adding & Subtracting Fractions
5 pages
Digital Daily Management System DDMS Archive
No ratings yet
Digital Daily Management System DDMS Archive
80 pages
Indian Dance Classification with CNN-RNN
No ratings yet
Indian Dance Classification with CNN-RNN
7 pages
Understanding Linear Block Codes
No ratings yet
Understanding Linear Block Codes
38 pages
Instrumentation & Control Basics
No ratings yet
Instrumentation & Control Basics
2 pages
Unit 1 Revision Sheet
No ratings yet
Unit 1 Revision Sheet
14 pages
A Predictive Model For Steady-State Multiphase Pipe Flow: Machine Learning On Lab Data
No ratings yet
A Predictive Model For Steady-State Multiphase Pipe Flow: Machine Learning On Lab Data
23 pages
B.Tech Programming Exam Paper 2024
No ratings yet
B.Tech Programming Exam Paper 2024
2 pages
FIR Filter Design with Transition Bands
No ratings yet
FIR Filter Design with Transition Bands
14 pages
Lecture 1
No ratings yet
Lecture 1
57 pages

Regularization Notes

Uploaded by

Regularization Notes

Uploaded by

9/26/25, 2:06 PM Regularization in Machine Learning - GeeksforGeeks

Regularization in Machine Learning

Regularization is a technique used in machine learning to prevent

Prevents overfitting: Adds constraints to the model to reduce the risk

A regression model which uses the L1 Regularization technique is called

Lets see how to implement this using python:

X, y = make_regression(n_samples=100, n_features=5, noise=0.1,

from sklearn.linear_model import Lasso

X, y = make_regression(n_samples=100, n_features=5, noise=0.1,

mse = mean_squared_error(y_test, y_pred)

A regression model that uses the L2 regularization technique is called

n= Number of examples or data points

y^i ​= Predicted target value for the ith example

wi = Coefficients of the features

λ= Regularization parameter that controls the strength of

Lets see how to implement this using python:

ridge = Ridge(alpha=1.0) : Creates a Ridge regression model with

from sklearn.linear_model import Ridge

X, y = make_regression(n_samples=100, n_features=5, noise=0.1,

mse = mean_squared_error(y_test, y_pred)

3. Elastic Net Regression

Elastic Net Regression is a combination of both L1 as well as L2

n = Number of examples (data points)

y^i ​= Predicted target value for the ith example

wi= Coefficients of the features

regularization and Values between 0 and 1 provide a balance of both

Lets see how to implement this using python:

model = ElasticNet(alpha=1.0, l1_ratio=0.5) : Creates an Elastic Net

from sklearn.linear_model import ElasticNet

X, y = make_regression(n_samples=100, n_features=10, noise=0.1,

model = ElasticNet(alpha=1.0, l1_ratio=0.5)

print("Mean Squared Error:", mse)

Elastic Net Regression

1. Prevents Overfitting: Regularization helps models focus on

3. Enhances Performance: Prevents excessive weighting of outliers or

Learn more about the difference between the regularization

Comment More info

You might also like

y^i = Predicted target value for the ith example

y^i = Predicted target value for the ith example