
SIF 910: Deep Learning

Lecture 6: Modifications and Extensions to a Feed-Forward Neural Network (1)

Chandra Kusuma Dewa


2021/2022
Outline

● Overfitting in deep neural network training


● Solving overfitting problem with regularization
● Regularization techniques
○ L1 Regularization
○ L2 Regularization
○ Dropout
What is Overfitting?

Image source: https://arxiv.org/pdf/1901.06566.pdf


Underfit vs. Normal vs. Overfit

Image source: http://mlwiki.org/index.php/Overfitting


The main causes of the overfitting problem

According to Salman and Liu (2019), deep neural networks are prone to overfitting because of:

● the large number of parameters to be learned,
● continuous gradient updating, and
● the scale sensitiveness of the cross-entropy loss.

Image source: https://ai.googleblog.com/2019/08/exploring-weight-agnostic-neural.html


How to prevent the problem? - Regularization

We can make the neural networks less complex by reducing the values of the weights for the uninformative features of the dataset. In other words, we regularize the model using the following methods:

● L1 Regularization
● L2 Regularization
● Dropout
● etc.

Image source: https://www.analyticsvidhya.com/blog/2018/04/fundamentals-deep-learning-regularization-techniques/
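As a rough sketch (the notation here is illustrative, not taken from the slides), regularization adds a penalty term Ω(w) on the weights w to the data loss, weighted by a hyperparameter λ:

L_total(w) = L_data(w) + λ · Ω(w)

Larger values of λ penalize large weights more strongly and therefore give a simpler, less flexible model.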


L1 Regularization

Image source: https://androidkt.com/how-to-add-l1-l2-regularization-in-pytorch-loss-function/
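As an illustrative sketch (notation assumed, not from the slide), L1 regularization adds the sum of the absolute values of the weights to the loss:

L(w) = L_data(w) + λ · Σ_i |w_i|

Because the penalty grows linearly in each |w_i|, it tends to push the weights of uninformative features exactly to zero, producing sparse models.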


L2 Regularization

Image source: https://androidkt.com/how-to-add-l1-l2-regularization-in-pytorch-loss-function/
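As an illustrative sketch (notation assumed, not from the slide), L2 regularization adds the sum of the squared weights to the loss:

L(w) = L_data(w) + λ · Σ_i w_i²

This shrinks all weights toward zero without forcing them to be exactly zero; in most optimizers the same effect is exposed as the weight_decay option.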


L1 and L2 Regularization in PyTorch

Image source: https://androidkt.com/how-to-add-l1-l2-regularization-in-pytorch-loss-function/
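Below is a minimal sketch of one common way to add L1 and L2 penalties to a PyTorch loss by hand; the model, data, and lambda values are illustrative assumptions, not the code from the linked post.

import torch
import torch.nn as nn

# Illustrative model and data (assumptions, not from the slides)
model = nn.Linear(10, 1)
criterion = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

x = torch.randn(32, 10)   # dummy input batch
y = torch.randn(32, 1)    # dummy targets

l1_lambda = 1e-4
l2_lambda = 1e-4

optimizer.zero_grad()
loss = criterion(model(x), y)

# L1 penalty: sum of absolute values of all parameters
l1_penalty = sum(p.abs().sum() for p in model.parameters())
# L2 penalty: sum of squared values of all parameters
l2_penalty = sum(p.pow(2).sum() for p in model.parameters())

loss = loss + l1_lambda * l1_penalty + l2_lambda * l2_penalty
loss.backward()
optimizer.step()

For the L2 case only, the same effect is usually obtained by passing weight_decay to the optimizer, e.g. torch.optim.SGD(model.parameters(), lr=0.01, weight_decay=1e-4).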


Dropout

Image source: https://jmlr.org/papers/volume15/srivastava14a/srivastava14a.pdf
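Dropout (Srivastava et al., 2014) randomly zeroes each hidden unit's activation with some probability during every training forward pass, so the network cannot rely on particular co-adapted units; at test time all units are kept, with activations rescaled accordingly.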


Dropout in PyTorch
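As a minimal sketch (the layer sizes and dropout probability are illustrative assumptions), dropout is added in PyTorch by placing nn.Dropout between layers; it is active in train() mode and disabled in eval() mode.

import torch
import torch.nn as nn

# Illustrative two-layer network with dropout between the layers
model = nn.Sequential(
    nn.Linear(784, 256),
    nn.ReLU(),
    nn.Dropout(p=0.5),   # zero each activation with probability 0.5 during training
    nn.Linear(256, 10),
)

x = torch.randn(4, 784)  # dummy input batch

model.train()            # dropout active: a random subset of activations is zeroed
out_train = model(x)

model.eval()             # dropout disabled: all activations pass through unchanged
out_eval = model(x)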
