
Practical aspects of Deep Learning Part III

Arles Rodríguez
aerodriguezp@unal.edu.co

Facultad de Ciencias
Departamento de Matemáticas
Universidad Nacional de Colombia
Motivation
• Backpropagation is the most difficult part of a neural network to implement.
• There is a mathematical way to debug the gradient computation, called gradient checking.
• It is based on a numerical approximation of the gradients.
Numerical approximation of gradients
Let's say f(θ) = θ³ and ε = 0.01.

The two-sided difference approximation is

    f′(θ) ≈ (f(θ + ε) − f(θ − ε)) / (2ε)

Given θ = 1:

    f′(θ) ≈ (f(1.01) − f(0.99)) / (2 · 0.01) = ((1.01)³ − (0.99)³) / (2 · 0.01) ≈ 3.0001

The true derivative is f′(θ) = 3θ² = 3, so the approximation error is about 0.0001.

(Figure: the curve of f around θ = 1, with the points θ − ε and θ + ε marked.)
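This calculation can be reproduced in a few lines of Python (a minimal sketch of the example above, using the example's f(θ) = θ³):

```python
# Two-sided numerical approximation of f'(theta) for f(theta) = theta**3 at theta = 1.
def f(theta):
    return theta ** 3

theta, eps = 1.0, 0.01
approx = (f(theta + eps) - f(theta - eps)) / (2 * eps)  # two-sided difference
exact = 3 * theta ** 2                                  # analytic derivative 3*theta^2

print(approx)               # ~3.0001
print(abs(approx - exact))  # ~0.0001, the approximation error
```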
Some math details
The derivative is defined as

    f′(θ) = lim_{ε→0} (f(θ + ε) − f(θ − ε)) / (2ε)

This follows from the second-order Taylor expansion of f (Bengio, 2012).

The error of this approximation is on the order of ε² (i.e., O(ε²)).

If ε = 0.01, the error is on the order of 0.0001.

This two-sided difference formula is therefore more accurate than the one-sided formula (f(θ + ε) − f(θ)) / ε, whose error is only on the order of ε.

(Figure: f evaluated at θ − ε and θ + ε around θ = 1.)
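A quick numerical comparison (a sketch, not part of the original slides) shows this in practice: the two-sided error shrinks like ε², while the one-sided error only shrinks like ε.

```python
# Compare one-sided and two-sided difference errors for f(theta) = theta**3 at theta = 1.
def f(theta):
    return theta ** 3

theta, exact = 1.0, 3.0  # f'(1) = 3
for eps in (0.1, 0.01, 0.001):
    one_sided = (f(theta + eps) - f(theta)) / eps              # error ~ O(eps)
    two_sided = (f(theta + eps) - f(theta - eps)) / (2 * eps)  # error ~ O(eps^2)
    print(eps, abs(one_sided - exact), abs(two_sided - exact))
```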
Gradient checking
• Take the parameters W[1], b[1], …, W[L], b[L], reshape them, and concatenate them into a big vector θ.
• Take the gradients dW[1], db[1], …, dW[L], db[L] and concatenate them into a big vector dθ (a sketch of this flattening follows below).
• How can we validate that dθ is the gradient of the cost J(θ)?
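A minimal NumPy sketch of this flattening step (the layer names and shapes below are illustrative, not from the slides):

```python
import numpy as np

def to_vector(d, keys):
    # Flatten each array and concatenate them in a fixed order.
    return np.concatenate([d[k].reshape(-1) for k in keys])

rng = np.random.default_rng(0)
parameters = {"W1": rng.standard_normal((4, 3)), "b1": np.zeros((4, 1)),
              "W2": rng.standard_normal((1, 4)), "b2": np.zeros((1, 1))}
gradients = {k: np.zeros_like(v) for k, v in parameters.items()}  # placeholders for dW[l], db[l]

keys = ["W1", "b1", "W2", "b2"]
theta = to_vector(parameters, keys)   # big vector with all W[l], b[l]
dtheta = to_vector(gradients, keys)   # big vector with all dW[l], db[l], in the same order
```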
Gradient checking implementation

For each i:

    dθ_approx[i] = (J(θ₁, …, θᵢ + ε, …) − J(θ₁, …, θᵢ − ε, …)) / (2ε)

• To check whether dθ_approx ≈ dθ, compute the relative distance between the two vectors:

    ‖dθ_approx − dθ‖₂ / (‖dθ_approx‖₂ + ‖dθ‖₂)
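A minimal sketch of this procedure, assuming a cost function J that takes the flattened parameter vector (the names J, theta and dtheta are illustrative):

```python
import numpy as np

def gradient_check(J, theta, dtheta, eps=1e-7):
    # Approximate each partial derivative with the two-sided difference.
    dtheta_approx = np.zeros_like(theta)
    for i in range(theta.size):
        theta_plus, theta_minus = theta.copy(), theta.copy()
        theta_plus[i] += eps    # J(theta_1, ..., theta_i + eps, ...)
        theta_minus[i] -= eps   # J(theta_1, ..., theta_i - eps, ...)
        dtheta_approx[i] = (J(theta_plus) - J(theta_minus)) / (2 * eps)
    # Relative distance between the approximated and the backprop gradients.
    return np.linalg.norm(dtheta_approx - dtheta) / (
        np.linalg.norm(dtheta_approx) + np.linalg.norm(dtheta))

# Toy usage: J(theta) = sum(theta^2), whose gradient is 2*theta.
theta = np.array([1.0, -2.0, 0.5])
print(gradient_check(lambda t: np.sum(t ** 2), theta, 2 * theta))  # on the order of 1e-10
```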
Interpretation of the relative distance
• If it is around 10⁻⁷, the derivative approximation is OK.
• If it is around 10⁻⁵, take a careful look.
• If it is around 10⁻³, there might be a bug. Review the components of dθ_approx vs. dθ and make sure that no individual difference is too large.
Tips to implement gradient check
• Don't use gradient checking in training; it must be used only for debugging.
• When doing gradient checking, remember the regularization term: the cost J must include the regularization term.
• Gradient checking does not work with dropout, because dropout randomly eliminates subsets of hidden units.
• Turn off dropout to check gradients (keep_prob = 1) and then turn dropout back on (a sketch follows below).
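A small self-contained sketch of the regularization tip, using a toy linear model with a squared-error cost (not the slides' network): backprop produces dW = dW_data + (λ/m)·W, so the cost evaluated during gradient checking must include the matching (λ/(2m))·‖W‖² term.

```python
import numpy as np

rng = np.random.default_rng(0)
X, Y = rng.standard_normal((3, 10)), rng.standard_normal((1, 10))
W, lambd, m = rng.standard_normal((1, 3)), 0.7, X.shape[1]

def cost(W, include_reg=True):
    data_term = np.sum((W @ X - Y) ** 2) / (2 * m)                     # squared-error part
    reg_term = (lambd / (2 * m)) * np.sum(W ** 2) if include_reg else 0.0
    return data_term + reg_term

# Analytic gradient, including the regularization term that backprop would add.
dW = ((W @ X - Y) @ X.T) / m + (lambd / m) * W

eps = 1e-7
dW_approx = np.zeros_like(W)
for i in range(W.size):
    Wp, Wm = W.copy(), W.copy()
    Wp.flat[i] += eps
    Wm.flat[i] -= eps
    dW_approx.flat[i] = (cost(Wp) - cost(Wm)) / (2 * eps)  # cost WITH the regularization term

diff = np.linalg.norm(dW_approx - dW) / (np.linalg.norm(dW_approx) + np.linalg.norm(dW))
print(diff)  # tiny (~1e-10); it becomes much larger if the reg term is left out of cost
```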
Tips to implement gradient check

• Run gradient checking at random initialization (when W and b are still approximately zero) and again after some training iterations.
• This is useful because the derivatives can look fine for small values of W and b, but as these values grow during training, a bug can produce large differences in the gradients.
References
• Ng, A. (2022). Deep Learning Specialization.
  https://www.deeplearning.ai/courses/deep-learning-specialization/
• Bengio, Y. (2012). Practical Recommendations for Gradient-Based Training of Deep Architectures. In Lecture Notes in Computer Science (Vol. 7700, pp. 437–478).
  https://doi.org/10.1007/978-3-642-35289-8_26
Thank you!
