
APPLIED MACHINE LEARNING IN PYTHON

Applied Machine Learning

Linear Regression: Least-Squares



Least-Squares Linear Regression


("Ordinary least-squares")
• The most widely used method for estimating w and b for linear regression problems is
called least-squares linear regression, also known as ordinary least-squares.

• Least-squares linear regression finds the line through the cloud of training points that
minimizes what is called the mean squared error of the model.

• The mean squared error of the model is the average of the squared differences
between the predicted target value and the actual target value over all the points
in the training set.
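The mean squared error described above can be computed directly; a minimal sketch with made-up target values:

```python
# Mean squared error: average of the squared differences between
# predicted and actual target values (values here are made up).
y_actual = [3.0, 5.0, 7.0]
y_predicted = [2.5, 5.5, 6.0]

squared_errors = [(yp - ya) ** 2 for yp, ya in zip(y_predicted, y_actual)]
mse = sum(squared_errors) / len(squared_errors)
print(mse)  # 0.5 for these numbers
```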

Least-Squares Linear Regression


("Ordinary least-squares")
• Finds w and b that minimize the mean squared error of the linear model:
the average of squared differences between predicted and actual target values.
• No parameters to control model complexity.

Least-Squares Linear Regression


("Ordinary least-squares")
• The technique of least-squares is designed to find the slope (the w value) and the
y-intercept (the b value) that minimize this mean squared error.

• One thing to note about this linear regression model is that there are no
parameters to control the model complexity.

• No matter what the values of w and b are, the result is always going to be a straight
line.

• This is both a strength and a weakness of the model as we'll see later.

Least-Squares Linear Regression


("Ordinary least-squares")
• When you have a moment, compare this simple linear model to the more complex
regression model learned with K nearest neighbors regression on the same dataset.

• You can see that linear models make a strong prior assumption about the relationship
between the input x and output y.

• Linear models may seem simplistic, but for data with many features linear models can be
very effective and generalize well to new data beyond the training set.

• Now the question is, how exactly do we estimate the linear models w and b parameters
so the model is a good fit?
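The strong prior assumption above is that the output is a weighted sum of the inputs plus an intercept; a minimal sketch of one prediction, with made-up weights and feature values:

```python
# Linear model prediction: y_hat = b + w[0]*x[0] + ... + w[n-1]*x[n-1]
# (weights and feature values here are made up for illustration)
w = [0.5, -1.2, 3.0]   # one weight per input feature
b = 4.0                # intercept
x = [2.0, 1.0, 0.5]    # feature values for one example

y_hat = b + sum(wi * xi for wi, xi in zip(w, x))
print(y_hat)  # 4.0 + 1.0 - 1.2 + 1.5 = 5.3
```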

How are Linear Regression Parameters w, b Estimated?

• Parameters are estimated from training data.
• There are many different ways to estimate w and b:
– Different methods correspond to different "fit" criteria and goals and
ways to control model complexity.

• For linear models, model complexity is based on the nature of the weights w
on the input features.

• Simpler linear models have a weight vector w that's closer to zero, i.e., where more
features are either
• not used at all (they have zero weight), or
• have less influence on the outcome (a very small weight).
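As an illustration with two made-up weight vectors for the same features, the "simpler" model in this sense ignores most features entirely and weights the rest lightly:

```python
# Two hypothetical weight vectors over the same five features.
w_complex = [12.0, -7.5, 3.3, 9.1, -4.2]   # all features used, large weights
w_simple = [1.1, 0.0, 0.0, 0.3, 0.0]       # most features unused or barely used

# Count features the simpler model effectively ignores (zero weight).
unused = sum(1 for wi in w_simple if wi == 0.0)
print(unused)  # 3 of the 5 features have zero weight
```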

How are Linear Regression Parameters w, b Estimated?

• Typically, given possible settings for the model parameters, the learning
algorithm predicts the target value for each training example, and then
computes what is called a loss function for each training example.

• That's a penalty value for incorrect predictions.

• The prediction is incorrect when the predicted target value is different than
the actual target value in the training set.

• The learning algorithm finds the parameters that optimize an objective function,
typically to minimize some kind of loss function of the predicted target values
vs. actual target values.
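This predict-then-penalize setup can be sketched for a one-feature linear model, using the squared loss as the penalty (the data values are made up):

```python
# Squared loss for one training example: the penalty grows with the
# square of the prediction error.
def squared_loss(y_predicted, y_actual):
    return (y_predicted - y_actual) ** 2

# Total loss of one candidate setting of (w, b) over a tiny training set.
def total_loss(w, b, X, y):
    return sum(squared_loss(w * xi + b, yi) for xi, yi in zip(X, y))

X = [1.0, 2.0, 3.0]
y = [2.0, 4.0, 6.0]
print(total_loss(2.0, 0.0, X, y))  # perfect fit for these points: 0.0
print(total_loss(1.0, 0.0, X, y))  # worse fit: positive penalty
```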

How are Linear Regression Parameters w, b Estimated?

• For example, a squared loss function would return the squared
difference between the predicted target value and the actual value as the penalty.

• The learning algorithm then computes or searches for the set of w, b
parameters that minimize the total of this loss function over all training
points.
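As a sketch of that search: for a single input feature, the least-squares w and b can be found in closed form rather than by exhaustive search. Here numpy's degree-1 polynomial fit does it on a tiny made-up dataset:

```python
import numpy as np

# Tiny made-up training set: y is roughly 3*x + 1.
X = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.1, 3.9, 7.2, 9.8])

# Closed-form least-squares fit of a line: returns (slope w, intercept b).
w, b = np.polyfit(X, y, deg=1)
print(w, b)  # roughly w = 2.94, b = 1.09 for these points
```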

Least-Squares Linear Regression

• The most popular way to estimate w and b parameters is using what's
called least-squares linear regression or ordinary least-squares.

• Least-squares finds the values of w and b that minimize the total sum of
squared differences between the predicted y value and the actual y value in
the training set.

• Or equivalently it minimizes the mean squared error of the model.

• Least-squares is based on the squared loss function mentioned before.





Least-Squares Linear Regression in Scikit-Learn


from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X_R1, y_R1,
                                                    random_state=0)

linreg = LinearRegression().fit(X_train, y_train)

print("linear model coeff (w): {}".format(linreg.coef_))
print("linear model intercept (b): {}".format(linreg.intercept_))
print("R-squared score (training): {:.3f}".format(linreg.score(X_train, y_train)))
print("R-squared score (test): {:.3f}".format(linreg.score(X_test, y_test)))

Output:
linear model coeff (w): [ 45.70870465]
linear model intercept (b): 148.44575345658873
R-squared score (training): 0.679
R-squared score (test): 0.492

The trailing underscore in linreg.coef_ and linreg.intercept_ denotes a
quantity derived from training data, as opposed to a user setting.
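X_R1 and y_R1 come from the course's dataset, so the snippet above is not runnable on its own. A self-contained variant can use a synthetic stand-in (make_regression replaces the course data here, so the printed numbers will differ):

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic one-feature regression dataset, a stand-in for X_R1, y_R1.
X, y = make_regression(n_samples=100, n_features=1, noise=30, random_state=0)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

linreg = LinearRegression().fit(X_train, y_train)

print("linear model coeff (w): {}".format(linreg.coef_))
print("linear model intercept (b): {}".format(linreg.intercept_))
print("R-squared score (training): {:.3f}".format(linreg.score(X_train, y_train)))
print("R-squared score (test): {:.3f}".format(linreg.score(X_test, y_test)))
```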
APPLIED MACHINE
LEARNING IN PYTHON

K-NN Regression vs Least-Squares Linear Regression

k-NN regression (k=7):               Least-squares linear:
R-squared score (training): 0.720    R-squared score (training): 0.679
R-squared score (test): 0.471        R-squared score (test): 0.492
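The comparison above can be reproduced in spirit on a synthetic dataset (make_regression is a stand-in for the course data, so the exact scores will differ):

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor

# Synthetic stand-in dataset (not the course's X_R1, y_R1).
X, y = make_regression(n_samples=100, n_features=1, noise=30, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Fit both models on the same training split.
knn = KNeighborsRegressor(n_neighbors=7).fit(X_train, y_train)
linreg = LinearRegression().fit(X_train, y_train)

print("k-NN test R-squared:   {:.3f}".format(knn.score(X_test, y_test)))
print("linear test R-squared: {:.3f}".format(linreg.score(X_test, y_test)))
```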

Applied Machine Learning

Bias and Variance
