0% found this document useful (0 votes)

211 views15 pages

Me310 5 Regression PDF

This document provides an overview of least squares regression for fitting data to a linear model. It discusses minimizing the sum of squared errors to determine the slope and y-intercept of the best-fit line. An example is provided to demonstrate calculating the linear regression line and correlation coefficient for a data set. The correlation coefficient indicates how well the linear model fits the data compared to just using the mean y-value. Nonlinear data can sometimes be transformed to apply linear regression.

Uploaded by

iqbal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

211 views15 pages

Me310 5 Regression PDF

Uploaded by

iqbal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

ME 310

Numerical Methods

Least Squares Regression

These presentations are prepared by

Dr. Cuneyt Sert
Mechanical Engineering Department
Middle East Technical University
Ankara, Turkey
csert@[Link]

They can not be used without the permission of the author

1
About Curve Fitting
Curve fitting is expressing a discrete set of data points as a continuous function.
It is frequently used in engineering. For example the emprical relations that we use in heat transfer
and fluid mechanics are functions fitted to experimental data.

Regression: Mainly used with experimental data, which might have significant amount of error
(noise). No need to find a function that passes through all discrete points.

f(x) Linear f(x)

Regression

Polynomial
Regression
x x

Interpolation: Used if the data is known to be very precise. Find a function (or a series of
functions) that passes through all discrete points.
Four different
A single functions
f(x) f(x)
function

x x
Polynomial Interpolation Spline Interpolation
2
Least Squares Regression
(Read the statistics review from the book.)
Fitting a straight line to a set of data set (paired data points).
(x1, y1), (x2, y2), (x3, y3), ..., (xn, yn)

y
a0 : y-intercept (unknown)
a0 + a 1 x a1 : slope (unknown)
yi
ei a0 + a 1 xi ei = yi - a0 - a1 xi
Error (deviation) for the
ith data point
xi x

Minimize the error (deviation) to get a best-fit line (to find a0 and a1). Several posibilities are:
Minimize the sum of individual errors.
Minimize the sum of absolute values of individual errors.
Minimize the maximum error.
Minimize the sum of squares of individual errors. This is the preferred strategy (Check the
bookto see why others fail).
3
Minimizing the Square of Individual errors
n n
Sr e
i1
2
i (yi1
i a 0 a1x i )2 Sum of squares of the residuals

Determine the unknowns a0 and a1 by minimizing Sr.

To do this set the derivatives of Sr wrt a0 and a1 to zero.

Sr
x a y
n
-2 (yi a 0 a1x i) 0 n a0
a 0
i 1 i
i 1

Sr
x a x a x y
n
-2 [(yi a 0 a1x i)x i ] 2

a 1
i 0 i 1 i i
i1

or
n

x a y
i 0 i
These are called the normal equations.
xi x a x y
2
i 1 i i

Solve these for a0 and a1. The results are

n (xi yi ) xi yi where y-bar and x-bar are the means of

a1 a 0 y - a1x
n xi2 ( xi )2 y and x, respectively.
4
Example 24:

Use least-squares regression to fit a straight line to

x 1 3 5 7 10 12 13 16 18 20

y 4 5 6 5 8 7 6 9 12 11

n 10
x i 105 n (x i yi ) x i yi 10*906 105 * 73
a1 0.3725
y i 73 n x i2 ( x i )2 10 * 1477 105 2

x 10.5
y 7 .3 a 0 7.3 - 0.3725 *10.5 3.3888
x 1477
2
i

x y 906
i i

Exercise 24: It is always a good idea to plot the data points and the regression line to see
how well the line represents the points. You can do this with Excel. Excel will calculate a0
and a1 for you.
5
Error of Linear Regression (How good is the best line?)
Spread of data around the mean Spread of data around the regression line

y y

a 0 + a1 x

x x
n n n
St (y
i1
i y) 2
Sr e 2
i (y i a0 a1x i )2
i1 i1

St Sr
sy std. deviation sy / x std. error of estimate
n 1 n2

The improvement obtained by using a regression line instead of the mean gives a maesure of
how good the regression fit is.
S t Sr
coefficient of determination r2
St
n (xi yi ) xi yi
correlation coefficient r
n xi2 ( xi )2 n yi2 ( yi )2 6
How to interpret the correlation coefficient?
Two extreme cases are
Sr = 0 r=1 describes a perfect fit (straight line passing through all points).
Sr = St r=0 describes a case with no improvement.
Usually an r value close to 1 represents a good fit. But be careful and always plot the data points
and the regression line together to see what is going on.

Example 24 (contd): Calculate the correlation coefficient.

n 10 n
S t ( yi y)2 64.1
x i 105 i 1

y i 73
n
Sr ( yi a 0 a1x i )2 12.14
x 10.5 i 1

y 7 .3 S t Sr
r2 0.8107
St
x 1477
2
i
r 0 .9
x y 906
i i

7
Example 24 (contd): Reverse x and y. Find the linear regression line and calculate r.
x = -5.3869 + 2.1763 y
St = 374.5, Sr = 70.91 (different than before).
r2 = 0.8107, r = 0.9 (same as before).

Exercise 25: When working with experimental data we usually take the variable that is
controlled by us in a precise way as x. The measured or calculated quantities are y. See
Midterm II of Fall 2003 for an example.

Linearization of Nonlinear Behavior

Linear regression is useful to represent a linear relationship.
If the relation is nonlinear either another technique can be used or the data can be transformed so
that linear regression can still be used. The latter technique is frequently used to fit the the following
nonlinear equations to a set of data.
Exponential equation ( y=A1 e B1x )
Power equation ( y=A2 x B2 )
Saturation-growth rate equation ( y=A3 x / (B3+x) )
8
Linearization of Nonlinear Behavior (contd)
(1) Exponential Equation (y = A1 e B1x)
ln y = ln A1 + B1x
y
y = A1 e B1x
ln y or
Linearization ln y = a0 + a1x

x x

Example 25: Fit an exponential model to the following data set.

x 0.4 0.8 1.2 1.6 2.0 2.3

y 750 1000 1400 2000 2700 3750

Create the following table. x 0.4 0.8 1.2 1.6 2.0 2.3
ln y 6.62 6.91 7.24 7.60 7.90 8.23

Fit a straight line to this new data set. Be careful with the notation. You can define z = ln y
Calculate a0 = 6.25 and a1 = 0.841. Straight line is ln y = 6.25 + 0.841 x
Switch back to the original equation. A1 = ea0 = 518, B1 = a1 = 0.841.
Therefore the exponential equation is y = 518 e0.841x. Check this solution with couple of data
9
points. For example y(1.2) = 518 e0.841 (1.2) = 1421 or y(2.3) = 518 e0.841 (2.3) = 3584. OK.
Linearization of Nonlinear Behavior (contd)
(2) Power Equation (y = A2 x B2)
log y = log A2 + B2 log x
y
y = A2 x B2
log y or
Linearization log y = a0 + a1 log x

log x
x

Example 26: Fit a power equation to the following data set.

x 2.5 3.5 5 6 7.5 10 12.5 15 17.5 20

y 7 5.5 3.9 3.6 3.1 2.8 2.6 2.4 2.3 2.3

log x 0.398 0.544 0.699 0.778 0.875 1.000 1.097 1.176 1.243 1.301
log y 0.845 0.740 0.591 0.556 0.491 0.447 0.415 0.380 0.362 0.362

Fit a straight line to this new data set. Be careful with the notation.
Calculate a0 = 1.002 and a1 = -0.53. Straight line is log y = 1.002 0.53 log x
Switch back to the original equation. A2 = 10a0 = 10.05, B2 = a1 = -0.53.
Therefore the power equation is y = 10.05 x-0.53. Check this solution with couple of data points. For
10
example y(5) = 10.05 * 5-0.53 = 4.28 or y(15) = 10.05 * 15-0.53 = 2.39. OK.
Linearization of Nonlinear Behavior (contd)
(3) Saturation-growth rate Equation (y = A3 x / (B3+x) )
1/y = 1/A3 + B3/A3 x
y
y = y=A3 x / (B3+x)
1/y or
Linearization 1/y = a0 + a1x

x 1/x

Example 27: Fit a saturation-growth-rate equation to the following data set.

x 0.75 2 2.5 4 6 8 8.5

y 0.8 1.3 1.2 1.6 1.7 1.8 1.7

1/x 1.333 0.5 0.4 0.25 0.1667 0.125 0.118

1/y 1.25 0.769 0.833 0.625 0.588 0.556 0.588

Fit a straight line to this new data set. Be careful with the notation.
Calculate a0 = 0.512 and a1 = 0.562. Straight line is 1/y = 0.512 + 0.562 (1/x)
Switch back to the original equation. A3 = 1/a0 = 1.953, B3 = a1A3 = 1.097.
Therefore the saturation-growth rate equation is 1/y = 1.953 x/(1.097+x). Check this solution with
11
couple of data points. For example y(2) = 1.953*2/(1.097+2) = 1.26 OK.
Polynomial Regression (Extension of Linear Least Sqaures)
Used to find a best-fit line for a nonlinear behavior.
This is not nonlinear regression described at page 468 of the book. That section is omitted.

y a0 + a 1 x + a 2 x2
yi Example for a second order
polynomial regression
a 0 + a 1 x i+ a2 x i2
ei = yi - a0 - a1 xi - a2 xi2
Error (deviation) for the
ith data point
xi x

n n
Sr e
i1
2
i (y
i1
i a 0 a1x i a 2 x i2 )2 Sum of squares of the residuals

Sr Sr Sr
Minimize this sum to get the normal equations 0, 0, 0
a 0 a 1 a 2
Solve these equations with one of the techniques that we learned to get a0, a1 and a2.

12
Polynomial Regression Example
Find the least-squares parabola that fits to the following data set.

x 0 1 2 3 4 5
y 2.1 7.7 13.6 27.2 40.9 61.1

Normal equations to find a least-squares parabola are

n

x i x 2i a0 yi

xi x 2i x i3 a1 xiyi
x2
i x i3 x i4 a2 x2 y
i i

n6
a 0 2.479 , a1 2.359 , a2 1.861
x i 15 y i 152 .6
y 2.479 2.359 x 1.861 x 2
x 2i 55 x i y i 585.6 S t Sr 2573 .4 3.75
r2 0.999
x i3 225 x 2i y i 2488 .6 St 2573 .4

x i4 979 r 0.999

13
Multiple Linear Regression
y = y(x1, x2)

Individual errors are ei = yi - a0 - a1 x1i - a2 x2i

n n
Sum of squares of the residuals is Sr e2i (y i a0 a1 x 1i a2 x 2i )2
i1 i1

S r S r S r
Minimize this sum to get the normal equations 0, 0, 0
a0 a1 a2

n

x 1i x 2i a0 yi

x 1i x 12i x 1i x 2i a1 x 1i y i
x
2i x 1i x 2i x 22i a2 x y
2i i

Solve these equations to get a0, a1 and a2.

14
Example 28:

Use multiple linear regression to fit

x 0 1 1 2 2 3 3 4 4
y 0 1 2 1 2 1 2 1 2
z 15 18 12.8 25.7 20.6 35 29.8 45.5 40.3

n9
x i 20 x i y i 30 9 20 12 a0 242 .7

x 2i 60 z i 242 .7 20 60 30 a1 661
y i 12 x i z i 661 12 30 20 a2 331 .2

y 2i 20 y iz i 331 .2

a0 14.40 , a1 9.03 , a2 5.62

z 14.4 9.03 x 5.62 y

Exercise 26: Calculate the standard error of the estimate (sy/x) and the correlation coefficient (r).

Engineering Curve Fitting Guide
No ratings yet
Engineering Curve Fitting Guide
44 pages
Linear Least-Squares Regression Guide
No ratings yet
Linear Least-Squares Regression Guide
22 pages
Curve Fitting Techniques Overview
No ratings yet
Curve Fitting Techniques Overview
44 pages
Curve Fitting
100% (1)
Curve Fitting
43 pages
Linear Least-Squares Regression Methods
No ratings yet
Linear Least-Squares Regression Methods
24 pages
Curve Fitting: There Are Two General Approaches For Curve Fitting
No ratings yet
Curve Fitting: There Are Two General Approaches For Curve Fitting
63 pages
Clase 11 Calculo Numerico I
No ratings yet
Clase 11 Calculo Numerico I
37 pages
Curve Fitting Techniques Explained
No ratings yet
Curve Fitting Techniques Explained
44 pages
ANUM 2012 Curve-Fitting
No ratings yet
ANUM 2012 Curve-Fitting
44 pages
Curve Fitting Techniques Overview
No ratings yet
Curve Fitting Techniques Overview
48 pages
Least Squares Regression Techniques
No ratings yet
Least Squares Regression Techniques
44 pages
Chapter 5 Curve Fitting
100% (2)
Chapter 5 Curve Fitting
93 pages
L7 CurveFitting (LeastSquaresRegression)
No ratings yet
L7 CurveFitting (LeastSquaresRegression)
45 pages
Curve Fitting-Linear Regression
No ratings yet
Curve Fitting-Linear Regression
20 pages
Dr. Siti Mariam Binti Abdul Rahman Faculty of Mechanical Engineering Office: T1-A14-01C E-Mail: Mariam4528@salam - Uitm.edu - My
No ratings yet
Dr. Siti Mariam Binti Abdul Rahman Faculty of Mechanical Engineering Office: T1-A14-01C E-Mail: Mariam4528@salam - Uitm.edu - My
30 pages
Curve Fitting & Interpolation Guide
No ratings yet
Curve Fitting & Interpolation Guide
48 pages
Numerical Methods With Applications
No ratings yet
Numerical Methods With Applications
34 pages
Least Squares Curve Fitting: Numerical Methods
No ratings yet
Least Squares Curve Fitting: Numerical Methods
39 pages
Curve Fitting and Regression Methods
No ratings yet
Curve Fitting and Regression Methods
171 pages
3.2 Least Square and Polynomial Regression
No ratings yet
3.2 Least Square and Polynomial Regression
39 pages
L4 Emt 2101 Engineering Mathematics Iii
No ratings yet
L4 Emt 2101 Engineering Mathematics Iii
25 pages
08 Curvefitting W Interpolation
No ratings yet
08 Curvefitting W Interpolation
64 pages
CISE301-Topic 3 Curve Fitting
No ratings yet
CISE301-Topic 3 Curve Fitting
38 pages
Linear & Polynomial Regression Guide
No ratings yet
Linear & Polynomial Regression Guide
7 pages
Curve Fitting Techniques Explained
No ratings yet
Curve Fitting Techniques Explained
59 pages
Lecture25 Ps
No ratings yet
Lecture25 Ps
10 pages
Curve Fitting
No ratings yet
Curve Fitting
17 pages
Part 5
No ratings yet
Part 5
27 pages
Curve Fitting, B-Splines & Approximations
No ratings yet
Curve Fitting, B-Splines & Approximations
14 pages
Curve Fitting Techniques Overview
No ratings yet
Curve Fitting Techniques Overview
119 pages
Curve Fitting and Interpolation
No ratings yet
Curve Fitting and Interpolation
14 pages
Simple Linear Regression Guide
No ratings yet
Simple Linear Regression Guide
19 pages
Module 5
No ratings yet
Module 5
26 pages
ECE 3040 Lecture 18: Curve Fitting by Least-Squares-Error Regression
No ratings yet
ECE 3040 Lecture 18: Curve Fitting by Least-Squares-Error Regression
38 pages
Regression
No ratings yet
Regression
12 pages
Curve Fitting for Engineers
No ratings yet
Curve Fitting for Engineers
51 pages
CH 17
No ratings yet
CH 17
36 pages
Data Fitting and Regression Techniques
No ratings yet
Data Fitting and Regression Techniques
14 pages
Advanced Mathematics Curve Fitting: Prepared by
No ratings yet
Advanced Mathematics Curve Fitting: Prepared by
17 pages
GEM 803 Chapter 3 (1st Term SY2019-2020)
No ratings yet
GEM 803 Chapter 3 (1st Term SY2019-2020)
20 pages
Lecture 5 CAE Curve Fitting
No ratings yet
Lecture 5 CAE Curve Fitting
46 pages
Regression & Correlation
No ratings yet
Regression & Correlation
44 pages
Curve Fitting & Interpolation Guide
No ratings yet
Curve Fitting & Interpolation Guide
64 pages
Curve Fitting and Least Squares Methods
No ratings yet
Curve Fitting and Least Squares Methods
17 pages
Output Input Linear Correlation Coefficient Regression Analysis
No ratings yet
Output Input Linear Correlation Coefficient Regression Analysis
6 pages
Chapter 5 Curve Fitting PART 1
No ratings yet
Chapter 5 Curve Fitting PART 1
49 pages
5) 305 - (Full Note)
No ratings yet
5) 305 - (Full Note)
446 pages
Lab 05-1
No ratings yet
Lab 05-1
6 pages
Regression
No ratings yet
Regression
6 pages
Chapter 3
No ratings yet
Chapter 3
65 pages
Nonlinear Regression Model Analysis
No ratings yet
Nonlinear Regression Model Analysis
7 pages
Curve Fitting - Regression
No ratings yet
Curve Fitting - Regression
10 pages
Chapter 7 Curve Fitting V2.
No ratings yet
Chapter 7 Curve Fitting V2.
45 pages
Numerical Methods: Curve Fitting
No ratings yet
Numerical Methods: Curve Fitting
19 pages
Chapter 5 Fitting Other Models
No ratings yet
Chapter 5 Fitting Other Models
22 pages
Lec4 Numerical Model
No ratings yet
Lec4 Numerical Model
26 pages
Exponential Model Curve Fitting
No ratings yet
Exponential Model Curve Fitting
47 pages
EEPC102 Module - 6 Lesson 2
No ratings yet
EEPC102 Module - 6 Lesson 2
12 pages
Machine Learning Unit 1
No ratings yet
Machine Learning Unit 1
26 pages
Panel Data Ayele
No ratings yet
Panel Data Ayele
27 pages
Effect of Age Length of Service and Educ 721c6f4a
No ratings yet
Effect of Age Length of Service and Educ 721c6f4a
8 pages
Jurnal Nur Aeni Salsabila 119020059 Word
No ratings yet
Jurnal Nur Aeni Salsabila 119020059 Word
11 pages
Practice No.06 PDF
No ratings yet
Practice No.06 PDF
2 pages
CH03 Wooldridge 7e PPT 2pp
No ratings yet
CH03 Wooldridge 7e PPT 2pp
38 pages
Econometrics Formulas Updated
No ratings yet
Econometrics Formulas Updated
4 pages
Prac 12 Model Selection Solution
No ratings yet
Prac 12 Model Selection Solution
10 pages
Test Bank Questions Chapter 5
No ratings yet
Test Bank Questions Chapter 5
3 pages
CH07 Wooldridge 7e PPT 2pp
No ratings yet
CH07 Wooldridge 7e PPT 2pp
25 pages
Vehicle Data Regression Analysis
No ratings yet
Vehicle Data Regression Analysis
2 pages
Understanding Serial Correlation in Regression
No ratings yet
Understanding Serial Correlation in Regression
50 pages
Chapter 3 Multiple Regression
No ratings yet
Chapter 3 Multiple Regression
49 pages
CIA Understanding
No ratings yet
CIA Understanding
5 pages
AP StatisticsChapter 3 Review Multiple Choice
No ratings yet
AP StatisticsChapter 3 Review Multiple Choice
2 pages
Prac 1 Data Analysis: Tip: Use Excel Average Formula
No ratings yet
Prac 1 Data Analysis: Tip: Use Excel Average Formula
8 pages
Chapter 4 - Equation Fitting
No ratings yet
Chapter 4 - Equation Fitting
24 pages
Unit 4 A Major 1
No ratings yet
Unit 4 A Major 1
9 pages
Math-130 Laboratory 3
No ratings yet
Math-130 Laboratory 3
10 pages
CFA 2024 L1 Simple Linear Regression
No ratings yet
CFA 2024 L1 Simple Linear Regression
13 pages
Simultaneous Equations and Trigonometry
No ratings yet
Simultaneous Equations and Trigonometry
12 pages
L021171013 Skripsi Dapus-Lamp
No ratings yet
L021171013 Skripsi Dapus-Lamp
15 pages
Regression Analysis Techniques
No ratings yet
Regression Analysis Techniques
34 pages
(Activity 1) - RAMOS, Mark Jerald B. - BSMA 2-9
No ratings yet
(Activity 1) - RAMOS, Mark Jerald B. - BSMA 2-9
2 pages
TOPIC Regression Analysis Panel and Cross Sectional Data
No ratings yet
TOPIC Regression Analysis Panel and Cross Sectional Data
37 pages
Regression Analysis - Classical Assumptions Additional Notes
No ratings yet
Regression Analysis - Classical Assumptions Additional Notes
7 pages
Web Sites: Xy XX Xy XX
No ratings yet
Web Sites: Xy XX Xy XX
4 pages
Curve Fitting Techniques Explained
No ratings yet
Curve Fitting Techniques Explained
157 pages
Projectile Motion Simulation Lab AP Physics 1 Fillable Form
No ratings yet
Projectile Motion Simulation Lab AP Physics 1 Fillable Form
6 pages
Biostatistics Individual Assignment
No ratings yet
Biostatistics Individual Assignment
18 pages

Me310 5 Regression PDF

Uploaded by

Me310 5 Regression PDF

Uploaded by

ME 310

Least Squares Regression

These presentations are prepared by

They can not be used without the permission of the author

f(x) Linear f(x)

Determine the unknowns a0 and a1 by minimizing Sr.

Solve these for a0 and a1. The results are

n (xi yi ) xi yi where y-bar and x-bar are the means of

Use least-squares regression to fit a straight line to

Example 24 (contd): Calculate the correlation coefficient.

Linearization of Nonlinear Behavior

Example 25: Fit an exponential model to the following data set.

x 0.4 0.8 1.2 1.6 2.0 2.3

Example 26: Fit a power equation to the following data set.

x 2.5 3.5 5 6 7.5 10 12.5 15 17.5 20

Example 27: Fit a saturation-growth-rate equation to the following data set.

x 0.75 2 2.5 4 6 8 8.5

1/x 1.333 0.5 0.4 0.25 0.1667 0.125 0.118

Normal equations to find a least-squares parabola are

Individual errors are ei = yi - a0 - a1 x1i - a2 x2i

Solve these equations to get a0, a1 and a2.

Use multiple linear regression to fit

a0 14.40 , a1 9.03 , a2 5.62

You might also like