Professional Documents
Culture Documents
Stochastic Gradient Descent: 10701 Recitations 3
Stochastic Gradient Descent: 10701 Recitations 3
10701 Recitations 3
Mu Li
February 5, 2013
The problem
until stop
I ηt is the learning rate, and
1X
∇F (w (t) ) = ∇w f (w (t) ; yi , xi )
n
i
f (w ) = Eyi ,xi f (w ; yi , xi )