

The way we do this is by taking the derivative (the slope of the tangent line to the function) of our cost function. The slope of
the tangent is the derivative at that point, and it gives us a direction to move in. We make steps down the
cost function in the direction of steepest descent, and the size of each step is determined by the parameter
α, which is called the learning rate.

The gradient descent algorithm is:

repeat until convergence:

θj := θj − α · (∂/∂θj) J(θ0, θ1)

where

j=0,1 represents the feature index number.

Intuitively, this could be thought of as:

repeat until convergence:

θj := θj − α · [slope of the tangent, i.e. the derivative of J in the j dimension]
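
As a minimal sketch of this simultaneous update in Python (the names gradient_descent_step and grad_J are illustrative assumptions, not from the course):

import numpy as np

def gradient_descent_step(theta, grad_J, alpha):
    # theta  : current parameter vector (theta_0, theta_1, ...)
    # grad_J : function returning the vector of partial derivatives
    #          ∂J/∂theta_j of the cost function, evaluated at theta
    # alpha  : learning rate
    #
    # Every theta_j is updated from the same gradient evaluation,
    # so the update is simultaneous rather than sequential.
    return theta - alpha * grad_J(theta)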

Gradient Descent for Linear Regression

When specifically applied to the case of linear regression, a new form of the gradient descent equation can be
derived. We can substitute our actual cost function and our actual hypothesis function and modify the equation to
(the derivation of the formulas is outside the scope of this course, but a really great one can be found here):

repeat until convergence: {


θ0 := θ0 − α · (1/m) · Σ_{i=1}^{m} (hθ(xi) − yi)

θ1 := θ1 − α · (1/m) · Σ_{i=1}^{m} ((hθ(xi) − yi) · xi)
}

where m is the size of the training set, θ0 is a constant that will be changing simultaneously with θ1, and xi, yi are
values of the given training set (data).

Note that we have separated out the two cases for θj into separate equations for θ0 and θ1, and that for θ1 we
multiply by xi at the end; the extra xi comes from the chain rule, since ∂hθ(xi)/∂θ1 = xi.
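
A minimal sketch of these two update rules in Python (the function name, default learning rate, and iteration count are illustrative assumptions, not values from the course):

import numpy as np

def linear_regression_gradient_descent(x, y, alpha=0.01, num_iters=1500):
    # Batch gradient descent for h_theta(x) = theta_0 + theta_1 * x,
    # where x and y are NumPy arrays of the training data.
    m = len(y)                    # size of the training set
    theta0, theta1 = 0.0, 0.0     # initial guess for the hypothesis

    for _ in range(num_iters):
        error = theta0 + theta1 * x - y          # hθ(xi) − yi for all i
        grad0 = (1.0 / m) * np.sum(error)        # partial derivative w.r.t. theta_0
        grad1 = (1.0 / m) * np.sum(error * x)    # w.r.t. theta_1 (note the extra xi)
        # Simultaneous update: both gradients are computed before either parameter changes.
        theta0 -= alpha * grad0
        theta1 -= alpha * grad1

    return theta0, theta1

For example, fitting points sampled near y = 1 + 2x should drive θ0 toward 1 and θ1 toward 2, provided α is small enough for the iterations to converge.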

The point of all this is that if we start with a guess for our hypothesis and then repeatedly apply these gradient
descent equations, our hypothesis will become more and more accurate.

