Assignment #2
2. (Linear Regression) Please explain, from a probabilistic point of view, why the
least-squares cost function 𝐽(𝜃⃑) is a reasonable choice for choosing 𝜃⃑. (Hint:
consider the probabilistic assumptions and maximum likelihood estimation.)
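
One way to approach the hint is sketched below. It is an outline, not a model answer, and it rests on assumptions the question does not state: a training set of m examples (x^{(i)}, y^{(i)}), a linear model with additive noise, and noise terms that are i.i.d. Gaussian with zero mean and variance \sigma^2.

```latex
\begin{align*}
y^{(i)} &= \theta^{\top} x^{(i)} + \epsilon^{(i)},
  \qquad \epsilon^{(i)} \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0, \sigma^{2}) \\[4pt]
p\bigl(y^{(i)} \mid x^{(i)}; \theta\bigr)
  &= \frac{1}{\sqrt{2\pi}\,\sigma}
     \exp\!\left( -\frac{\bigl(y^{(i)} - \theta^{\top} x^{(i)}\bigr)^{2}}{2\sigma^{2}} \right) \\[4pt]
\ell(\theta)
  &= \log \prod_{i=1}^{m} p\bigl(y^{(i)} \mid x^{(i)}; \theta\bigr)
   = m \log \frac{1}{\sqrt{2\pi}\,\sigma}
     - \frac{1}{\sigma^{2}} \cdot \frac{1}{2} \sum_{i=1}^{m}
       \bigl(y^{(i)} - \theta^{\top} x^{(i)}\bigr)^{2} \\[4pt]
\arg\max_{\theta} \ell(\theta)
  &= \arg\min_{\theta} \; \frac{1}{2} \sum_{i=1}^{m}
     \bigl(y^{(i)} - \theta^{\top} x^{(i)}\bigr)^{2}
   = \arg\min_{\theta} J(\theta)
\end{align*}
```

Under these assumptions, the first term of the log-likelihood does not depend on \theta and the second is a negative multiple of J(\theta), so the maximum likelihood estimate coincides with the least-squares minimizer regardless of the value of \sigma^2.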
predict the output 𝑦 ∈ {0, 1} given an input vector 𝑥⃑. Please derive the stochastic
gradient ascent rule for logistic regression learning problems.
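
For reference, the rule this question usually leads to can be sketched in code. With the sigmoid hypothesis h(x) = 1 / (1 + exp(-θᵀx)), the log-likelihood of one example is y log h(x) + (1 - y) log(1 - h(x)), and its gradient with respect to θ_j is (y - h(x)) x_j, which gives the per-example update θ_j := θ_j + α (y - h(x)) x_j. The snippet below is a minimal sketch of that update. The function names, the learning rate, the epoch count, and the toy data set (logical OR) are illustrative assumptions and are not part of the assignment.

```python
import math
import random


def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + math.exp(-z))


def sga_logistic(data, alpha=0.5, epochs=1000, seed=0):
    """Stochastic gradient ascent on the logistic log-likelihood.

    data is a list of (x, y) pairs with x a tuple of features and y in {0, 1}.
    Returns theta, where theta[0] is the intercept term.
    """
    rng = random.Random(seed)
    n = len(data[0][0])
    theta = [0.0] * (n + 1)
    samples = list(data)
    for _ in range(epochs):
        rng.shuffle(samples)  # visit the examples in random order each epoch
        for x, y in samples:
            xb = [1.0] + list(x)  # prepend the constant feature x_0 = 1
            h = sigmoid(sum(t * xi for t, xi in zip(theta, xb)))
            # Per-example update: theta_j := theta_j + alpha * (y - h) * x_j
            theta = [t + alpha * (y - h) * xi for t, xi in zip(theta, xb)]
    return theta


def predict(theta, x):
    """Predict 1 when the estimated P(y = 1 | x) is at least 0.5."""
    xb = [1.0] + list(x)
    return 1 if sigmoid(sum(t * xi for t, xi in zip(theta, xb))) >= 0.5 else 0


# Hypothetical toy data (logical OR), used only to exercise the update rule.
toy = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
theta = sga_logistic(toy)
predictions = [predict(theta, x) for x, _ in toy]
```

The update has the same form as the least-mean-squares rule for linear regression; the difference is that h(x) is now a nonlinear function of θᵀx.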
𝑥₁  𝑥₂  𝑦
0   0   2
0   1   3
1   0   3
1   1   4
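
The question that this table belongs to is not present in the text, so the following is only a guess at how the data might be used. Assuming the first two columns are inputs x1 and x2, the last column is the target y, and the task is a linear regression fit with an intercept, a minimal batch gradient descent sketch on the least-squares cost looks like this. The function name, learning rate, and iteration count are illustrative assumptions.

```python
# Rows of the table, read as ((x1, x2), y). This reading is an assumption.
data = [((0, 0), 2), ((0, 1), 3), ((1, 0), 3), ((1, 1), 4)]


def fit_least_squares(data, alpha=0.1, iters=2000):
    """Batch gradient descent on J(theta) = 1/2 * sum_i (y_i - theta^T x_i)^2.

    Returns theta, where theta[0] is the intercept term.
    """
    n = len(data[0][0])
    theta = [0.0] * (n + 1)
    for _ in range(iters):
        # descent[j] accumulates sum_i (y_i - theta^T x_i) * x_ij,
        # which is the negative gradient of J with respect to theta_j.
        descent = [0.0] * (n + 1)
        for x, y in data:
            xb = [1.0] + list(x)  # prepend the constant feature x_0 = 1
            err = y - sum(t * xi for t, xi in zip(theta, xb))
            for j, xi in enumerate(xb):
                descent[j] += err * xi
        theta = [t + alpha * d for t, d in zip(theta, descent)]
    return theta


theta = fit_least_squares(data)
```

On these four rows the fit approaches theta = [2, 1, 1], that is y = 2 + x1 + x2, which reproduces every row of the table exactly.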