Professional Documents
Culture Documents
Introduction
• A model relating the variables based on observed
data in which two or more variables of interest are
related, and the mechanistic model relating these
variables is unknown is called an empirical model.
Introduction y is the salt concentration (milligrams/liter)
found in surface streams in a particular
watershed and x is the percentage of
the watershed area consisting of paved roads.
There is no obvious physical mechanism that
relates the salt concentration to the roadway
area, but the scatter diagram indicates that
some relationship, possibly linear, does exist.
Introduction
A linear relationship will not pass exactly
through all of the points, but there is an
indication that the points are scattered
randomly about a straight line. Therefore, it is
probably reasonable to assume that the mean
of the random variable Y (the salt
concentration) is related to roadway area x by
the following straight-line relationship:
• where Y represents the tool life, x1 represents the cutting speed, x2 represents
the tool angle, and ϵ is a random error term. This is a multiple linear regression
model with two regressors.
• The term linear is used because the equation is a linear function of the
unknown parameters 𝛽0 , 𝛽1 and 𝛽2 .
Introduction
The parameter 𝛽0 is the intercept of
the plane. We sometimes call 𝛽1 and
𝛽2 partial regression coefficients
because 𝛽1 measures the expected
change in Y per unit change in x1
when x2 is held constant, and 𝛽2
measures the expected change in Y
per unit change in x2 when x1 is held
constant.
• where the intercept 𝛽0 and the slope 𝛽1 are unknown regression coefficients.
• Assumed that each observation, Y, can be described by the model
• where 𝜖 is a random error with mean zero and variance 𝜎 2 . The random errors
corresponding to different observations are also assumed to be uncorrelated random
variables.
Least Squares Estimation
• Suppose that we have n pairs
of observations (x1, y1), (x2,
y2), . . . (xn, yn).
• and the sum of the squares of the deviations of the observations from
the true regression line is
Least Squares Estimation
Least Squares Estimation
• The fitted or estimated regression line is therefore
𝑥10 = 0.6
Error Sum of Squares
• The residuals from the fitted regression model are used to estimate the
variance of the model errors 𝜎 2 . Recall that 𝜎 2 determines the amount of
variability in the observations on the response 𝑦 at a given value of the
regressor variable 𝑥.
• The sum of the squared residuals is used to obtain the estimate of 𝜎 2 .
Coefficient Estimators, Simple Linear Regression
Standard Error of the Slope and Intercept, Simple Linear Regression
Regression and Analysis of Variance