
Gradient Descent and Cost Function

Optimization:
• In our day-to-day lives, we optimize variables through our personal decisions without even consciously recognizing the process.
• We are constantly using optimization techniques all day long.
• For example, while going to work we choose a shorter route to minimize traffic woes, or we schedule a cab in advance to reach the airport on time.
• Optimization is the ultimate goal, whether you are dealing with actual events in real life or creating a technology-based product.
• Optimization may be defined as the process by which an optimum is achieved: designing the best possible output for your problem with the resources available.
• One of the most popular optimization techniques is Gradient Descent.
What is gradient descent?
• It is an optimization algorithm mainly used to find the minimum of a function by adjusting its parameters.
• Here, "parameters" means the coefficients in linear regression and the weights in neural networks.
Contd..
• The main goal of gradient descent is to minimize the cost function.
• When plotted, the cost function is typically an inclined and/or irregular surface; the role of gradient descent is to provide the direction and the velocity (learning rate) of the movement needed to attain the minimum of the function, i.e. the point where the cost is lowest, as in the sketch below.
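As a minimal sketch of this idea (the starting point, learning rate, and iteration count are illustrative assumptions, not values from the slides), gradient descent on the simple cost function y = x² repeatedly steps against the slope, and the learning rate controls how large each step is:

    # Minimal sketch: gradient descent on the cost function y = x**2.
    x = 5.0                  # illustrative starting point
    learning_rate = 0.1      # the "velocity" of each step
    for step in range(100):
        gradient = 2 * x     # derivative of x**2 with respect to x
        x = x - learning_rate * gradient   # move against the slope
    print(x)                 # ends up close to 0.0, where the cost is lowest

With a learning rate that is too large the steps overshoot the minimum, and with one that is too small convergence becomes very slow, which is why the learning rate is described as the velocity of the movement.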
What is a cost function?
• A Cost Function/Loss Function tells us “how good” our model is at
making predictions for a given set of parameters.
• Generally, the cost function takes a simple form such as Y = X². In a Cartesian coordinate system, this is the equation of a parabola that opens upward.
• Now, in order to minimize this function, we first need to find the value of X that produces the lowest value of Y (for Y = X², this is X = 0, the vertex of the parabola).
• Next, a cost function is required that measures the error over an entire dataset, so that the parameters can be adjusted to minimize it.
• The most commonly used cost function is the mean squared error (MSE): MSE = (1/n) Σ (y_i - ŷ_i)², where ŷ_i is the model's prediction for example i.
• For a linear model ŷ = m*x + b, gradient descent updates the parameters iteratively, as illustrated in the sketch below:
•  m_curr = m_curr - learning_rate * d(cost)/dm
•  b_curr = b_curr - learning_rate * d(cost)/db
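To make these update rules concrete, here is a minimal sketch in Python (the helper name gradient_step, the sample data, and the hyperparameter values are illustrative assumptions) of gradient descent for the line ŷ = m*x + b under the mean squared error:

    import numpy as np

    def gradient_step(m_curr, b_curr, x, y, learning_rate):
        """Apply one gradient descent update to m and b under the MSE cost."""
        n = len(x)
        y_pred = m_curr * x + b_curr
        # Partial derivatives of MSE = (1/n) * sum((y - y_pred)**2)
        dm = -(2 / n) * np.sum(x * (y - y_pred))   # d(cost)/dm
        db = -(2 / n) * np.sum(y - y_pred)         # d(cost)/db
        m_curr = m_curr - learning_rate * dm
        b_curr = b_curr - learning_rate * db
        return m_curr, b_curr

    # Illustrative data generated from y = 2x + 3; parameters start at zero.
    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
    y = np.array([5.0, 7.0, 9.0, 11.0, 13.0])
    m, b = 0.0, 0.0
    for _ in range(10000):
        m, b = gradient_step(m, b, x, y, learning_rate=0.01)
    print(m, b)   # approximately 2.0 and 3.0

Repeating the step many times moves m and b toward the values that minimize the mean squared error over the dataset.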
Types of gradient descent
• BATCH GRADIENT DESCENT
• STOCHASTIC GRADIENT DESCENT
• MINI-BATCH GRADIENT DESCENT
BATCH GRADIENT DESCENT

• Batch gradient descent calculates the error for each example in the training dataset, but the model is updated only after all training examples have been evaluated (see the sketch after this list).
• This whole cycle over the training data is called a training epoch.
• The advantages of batch gradient descent are that it is computationally efficient and that it produces a stable error gradient and stable convergence.
• The disadvantages are that the stable error gradient can sometimes result in a state of convergence that isn't the best the model can achieve, and that the entire training dataset must be in memory and available to the algorithm.
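Under these assumptions (the same linear model with MSE cost and illustrative hyperparameters), a rough sketch of batch gradient descent looks like this; note that the parameters are updated only once per epoch, after the gradient has been computed over the full training set:

    import numpy as np

    def batch_gradient_descent(x, y, learning_rate=0.01, epochs=1000):
        """One parameter update per epoch, using the whole dataset (NumPy arrays x, y)."""
        m_curr, b_curr = 0.0, 0.0
        n = len(x)
        for epoch in range(epochs):
            y_pred = m_curr * x + b_curr                  # evaluate every training example
            dm = -(2 / n) * np.sum(x * (y - y_pred))      # gradient accumulated over all examples
            db = -(2 / n) * np.sum(y - y_pred)
            m_curr -= learning_rate * dm                  # single, stable update per epoch
            b_curr -= learning_rate * db
        return m_curr, b_curr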
STOCHASTIC GRADIENT DESCENT
• By contrast, stochastic gradient descent (SGD) computes the error and updates the parameters for each training example in the dataset, one by one (see the sketch after this list).
• One advantage is that the frequent updates give us a detailed picture of the rate of improvement.
• The frequent updates, however, are more computationally expensive
than the batch gradient descent approach. Additionally, the frequency
of those updates can result in noisy gradients, which may cause the
error rate to jump around instead of slowly decreasing.
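A corresponding sketch of stochastic gradient descent for the same linear model (hyperparameters again illustrative) updates the parameters after every single example, which is what makes the updates frequent but noisy:

    import numpy as np

    def stochastic_gradient_descent(x, y, learning_rate=0.01, epochs=100):
        """One parameter update per individual training example (NumPy arrays x, y)."""
        m_curr, b_curr = 0.0, 0.0
        for epoch in range(epochs):
            for i in np.random.permutation(len(x)):   # visit the examples in random order
                y_pred = m_curr * x[i] + b_curr
                dm = -2 * x[i] * (y[i] - y_pred)      # gradient from one example only
                db = -2 * (y[i] - y_pred)
                m_curr -= learning_rate * dm          # update immediately after each example
                b_curr -= learning_rate * db
        return m_curr, b_curr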
MINI-BATCH GRADIENT DESCENT
• Mini-batch gradient descent is the go-to method since it’s a
combination of the concepts of SGD and batch gradient descent.
• It simply splits the training dataset into small batches and performs an update for each of those batches (see the sketch after this list).
• This creates a balance between the robustness of stochastic gradient
descent and the efficiency of batch gradient descent.
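A sketch of mini-batch gradient descent for the same linear model (the batch size and other hyperparameters are illustrative) shuffles the data, splits it into small batches, and performs one update per batch:

    import numpy as np

    def mini_batch_gradient_descent(x, y, learning_rate=0.01, epochs=100, batch_size=32):
        """One parameter update per mini-batch (NumPy arrays x, y)."""
        m_curr, b_curr = 0.0, 0.0
        n = len(x)
        for epoch in range(epochs):
            order = np.random.permutation(n)                      # shuffle once per epoch
            for start in range(0, n, batch_size):
                idx = order[start:start + batch_size]
                xb, yb = x[idx], y[idx]                           # one small batch
                y_pred = m_curr * xb + b_curr
                dm = -(2 / len(xb)) * np.sum(xb * (yb - y_pred))  # gradient over the batch
                db = -(2 / len(xb)) * np.sum(yb - y_pred)
                m_curr -= learning_rate * dm                      # update after every batch
                b_curr -= learning_rate * db
        return m_curr, b_curr

The batch size trades off between the two extremes: batch_size = 1 recovers stochastic gradient descent, while batch_size = n recovers batch gradient descent.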
