
Assignment 3 - LASSO Regression

In this study, we consider a dataset of claim sizes (severities) from Allstate. The dataset has 130 features,
and we are uncertain about what the features represent; hence, the regression models in this study are based
purely on shrinkage and selection techniques.
Since the data is large (188,318 records and 130 features), we chose a training set consisting of 20% of
the data to keep runtimes manageable. We want to develop models to predict loss. We first fit the most basic
candidate for predictive modeling, an OLS linear regression, on all 130 features; this model is compared
to the LASSO regression model later in this report.
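
As a rough sketch of this setup, the R code below draws the 20% training sample and fits the OLS baseline. The data frame name (allstate) and the response column (loss) are illustrative assumptions, not names taken from the original analysis.

```r
# Sketch of the data split and OLS baseline ('allstate' and 'loss'
# are assumed names for illustration).
set.seed(1)

# Keep 20% of the 188,318 records for training; hold out the rest for testing.
train_idx <- sample(nrow(allstate), size = floor(0.2 * nrow(allstate)))
train <- allstate[train_idx, ]
test  <- allstate[-train_idx, ]

# OLS regressed on all 130 features.
ols_fit <- lm(loss ~ ., data = train)
```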
We use LASSO regression, fit with glmnet, for our predictive modeling. Given the size of
our model, LASSO regression helps with selection as well as shrinkage. The LASSO model includes a
penalty term weighted by a tuning parameter, λ. Figure 1 visualizes the relationship between the
feature coefficients and λ. It should be noted that at large values of λ, the feature coefficients are
essentially set to zero. We must reach a compromise where the feature coefficients and the value of λ make
the most sense.

Figure 1. Relationship between feature coefficients and the tuning parameter, λ.
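
A minimal sketch of the LASSO fit behind Figure 1, assuming the train data frame from the split above. glmnet requires a numeric design matrix, so the features are expanded with model.matrix() before fitting.

```r
library(glmnet)

# Expand the 130 features (including any categorical ones) into a
# numeric design matrix; drop the intercept column glmnet adds itself.
x_train <- model.matrix(loss ~ ., data = train)[, -1]
y_train <- train$loss

# alpha = 1 selects the LASSO penalty.
lasso_fit <- glmnet(x_train, y_train, alpha = 1)

# Coefficient paths against log(lambda), as in Figure 1.
plot(lasso_fit, xvar = "lambda", label = TRUE)
```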


Figure 2 illustrates how the mean-squared error of our in-sample (blue) and out-of-sample data decreases
as model complexity, i.e. the number of features in the model, increases. It is worth mentioning that as the
model becomes more complex, the in-sample mean-squared error becomes smaller and our LASSO
model starts resembling our OLS regression model.

Figure 2. Mean-squared error of the test and train sets as a function of model complexity.
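
The comparison in Figure 2 can be sketched along the fitted path as follows, using the number of nonzero coefficients at each λ as the complexity measure. This assumes the factor levels in the test set match those seen in training, so the two design matrices line up.

```r
# Design matrix and response for the held-out data.
x_test <- model.matrix(loss ~ ., data = test)[, -1]
y_test <- test$loss

# MSE at every lambda on the path (predict() returns one column per lambda).
mse_train <- colMeans((predict(lasso_fit, x_train) - y_train)^2)
mse_test  <- colMeans((predict(lasso_fit, x_test) - y_test)^2)

# Complexity = number of nonzero coefficients at each lambda.
plot(lasso_fit$df, mse_train, type = "l", col = "blue",
     ylim = range(mse_train, mse_test),
     xlab = "Number of nonzero coefficients", ylab = "Mean-squared error")
lines(lasso_fit$df, mse_test, col = "red")
```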
Furthermore, Figure 3 shows the relationship between mean-squared error and λ. It is clear that the
mean-squared error increases sharply once log(λ) goes beyond 4. To tune the parameter λ, we
use 5-fold cross-validation, which yields a minimizing value of λ = 1.285.

Figure 3. Mean-squared error as a function of the tuning parameter, λ.
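
A sketch of the 5-fold cross-validation used to tune λ; the seed is an arbitrary choice for reproducible folds.

```r
# 5-fold cross-validation over the lambda path.
set.seed(1)
cv_fit <- cv.glmnet(x_train, y_train, alpha = 1, nfolds = 5)

plot(cv_fit)       # CV mean-squared error against log(lambda), as in Figure 3
cv_fit$lambda.min  # the run reported in the text yields lambda = 1.285
```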


Now that our LASSO regression model is tuned, we must evaluate its performance. Comparing our
model predictions to the test set, we obtain R² = 0.474. This value is quite low on its own and suggests
that our model may not be completely reliable. We also compare the LASSO regression model to the
OLS regression model in Table 1. With the LASSO regression model, the in-sample RMSE increases
slightly, while the out-of-sample RMSE decreases by roughly 40%. This shows that the LASSO
regression model generalizes better out of sample than the OLS regression model. However, we should
keep in mind that even the LASSO model is not perfect and should not serve as the sole basis for
decision making in this case.
Table 1. In-sample and out-of-sample RMSEs for the linear and LASSO regression models.
                          In-sample RMSE    Out-of-sample RMSE
Linear Regression model          1929.04               3584.16
LASSO Regression model           1994.70               2122.16
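
The quantities in Table 1 and the R² above can be reproduced along these lines, reusing the objects defined earlier; the rmse() helper is our own, not part of glmnet.

```r
# Helper for root-mean-squared error.
rmse <- function(actual, predicted) sqrt(mean((actual - predicted)^2))

# Out-of-sample R^2 for the tuned LASSO model.
pred_lasso <- predict(cv_fit, x_test, s = "lambda.min")
r2 <- 1 - sum((y_test - pred_lasso)^2) / sum((y_test - mean(y_test))^2)

# The four RMSEs compared in Table 1.
rmse(y_train, predict(cv_fit, x_train, s = "lambda.min"))  # LASSO, in-sample
rmse(y_test,  pred_lasso)                                  # LASSO, out-of-sample
rmse(y_train, predict(ols_fit, newdata = train))           # OLS, in-sample
rmse(y_test,  predict(ols_fit, newdata = test))            # OLS, out-of-sample
```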
