ECON1203 Statistics
Chapter 16 Simple Linear
Regression & Correlation
Contents
1. Model
2. Estimating the Coefficients
3. Error Variable: Required Conditions
4. Assessing the Model
5. Using the Regression Equation
6. Regression Diagnostics (Part 1)
Introduction
Regression analysis predicts one variable based on other variables.
The dependent variable ($Y$) is the variable to be forecast.
The statistics practitioner believes that it is related to the independent
variables ($X_1, X_2, \ldots, X_k$).
Correlation analysis determines whether a relationship exists, using:
o Scatter diagram
o Coefficient of correlation
o Covariance
16.1 Model
Deterministic models determine the dependent variable exactly from the
independent variables. They are unrealistic because other variables may
also have an influence.
Probabilistic models include the randomness of real life through an error
variable.
The error variable ($\varepsilon$) is the difference between the actual and
estimated values of the dependent variable.
The first-order linear model (or simple linear regression model) is a
straight-line model with one independent variable:
o $y = \beta_0 + \beta_1 x + \varepsilon$
  where:
  $y$ = dependent variable
  $x$ = independent variable
  $\beta_0$ = y-intercept
  $\beta_1$ = slope of the line
  $\varepsilon$ = error variable
16.2 Estimating the coefficients
The least squares line ($\hat{y} = b_0 + b_1 x$) uses the least squares method
to minimise the sum of squared deviations $\sum_{i=1}^{n} (y_i - \hat{y}_i)^2$.
The sum of squares for error (SSE) is the minimised sum of squared
deviations.
Residuals are the deviations between the actual data points and the line:
o $e_i = y_i - \hat{y}_i$
Least squares line coefficients:
o $b_1 = \dfrac{s_{xy}}{s_x^2}$
o $b_0 = \bar{y} - b_1 \bar{x}$
o $s_{xy} = \dfrac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{n-1}$
o $s_x^2 = \dfrac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}$
o $\bar{x} = \dfrac{\sum_{i=1}^{n} x_i}{n}$, $\bar{y} = \dfrac{\sum_{i=1}^{n} y_i}{n}$
Shortcuts:
o $s_{xy} = \dfrac{1}{n-1}\left[\sum_{i=1}^{n} x_i y_i - \dfrac{\left(\sum_{i=1}^{n} x_i\right)\left(\sum_{i=1}^{n} y_i\right)}{n}\right]$
o $s_x^2 = \dfrac{1}{n-1}\left[\sum_{i=1}^{n} x_i^2 - \dfrac{\left(\sum_{i=1}^{n} x_i\right)^2}{n}\right]$
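As a concrete check on these formulas, here is a minimal Python sketch; the sample data and variable names are illustrative assumptions, not from the course:

```python
import numpy as np

# Illustrative data (made up for this sketch, not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

n = len(x)
x_bar, y_bar = x.mean(), y.mean()

# Sample covariance and sample variance of x (denominator n - 1)
s_xy = ((x - x_bar) * (y - y_bar)).sum() / (n - 1)
s_x2 = ((x - x_bar) ** 2).sum() / (n - 1)

# Least squares coefficients
b1 = s_xy / s_x2          # slope
b0 = y_bar - b1 * x_bar   # y-intercept

print(f"y-hat = {b0:.3f} + {b1:.3f}x")
```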
Excel:
o Have two columns of data: one for the dependent variable; the
other for the independent variable.
o Click Data, Data Analysis, and Regression.
o Specify the Input Y Range and the Input X Range.
16.3 Error Variable: Required Conditions
Assumptions of the Classical Linear Regression Model:
1. The model is linear in the coefficients ($\beta_0$ and $\beta_1$).
2. The observed pairs ($x_i$, $y_i$) are randomly sampled.
3. There is sample variation in $x_i$; the values are not all equal.
4. The mean of $\varepsilon$ is 0, regardless of $x$: $E(\varepsilon_i \mid x_i) = 0$.
a. Therefore, $\varepsilon$ and $x$ are uncorrelated.
5. The variance of $\varepsilon_i$ is a constant: $\operatorname{Var}(\varepsilon_i) = \sigma_\varepsilon^2$.
a. In reality this is not necessarily true (e.g. higher incomes may
increase the variance of expenditure because higher earners have a
greater range of choices).
6. The error variables are uncorrelated: $\operatorname{Cov}(\varepsilon_i, \varepsilon_j) = 0$ for $i \neq j$.
7. The error variables are normally distributed: $\varepsilon_i \sim N(0, \sigma_\varepsilon^2)$.
16.4 Assessing the Model
There are three ways to assess how well the linear model fits the data:
1. The standard error of estimate
2. The $t$-test of the slope
3. The coefficient of determination
Sum of Squares for Error (SSE)
$\text{SSE} = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 = (n-1)\left(s_y^2 - \dfrac{s_{xy}^2}{s_x^2}\right)$
Standard Error of Estimate ($s_\varepsilon$)
$s_\varepsilon = \sqrt{\dfrac{\text{SSE}}{n-2}}$; it is usually compared with $\bar{y}$ (a small ratio indicates a good fit).
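Using the same illustrative data as the sketch in 16.2, SSE and the standard error of estimate can be computed directly:

```python
import numpy as np

# Same illustrative data as the sketch in 16.2 (not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])
n = len(x)

# Least squares fit (as before)
b1 = np.cov(x, y, ddof=1)[0, 1] / x.var(ddof=1)
b0 = y.mean() - b1 * x.mean()

# Residuals, SSE, and the standard error of estimate
e = y - (b0 + b1 * x)
SSE = (e ** 2).sum()
s_eps = np.sqrt(SSE / (n - 2))

# Judge the fit by comparing s_eps with the mean of y
print(f"SSE = {SSE:.4f}, s_eps = {s_eps:.4f}, y-bar = {y.mean():.4f}")
```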
Testing the Slope
We can use hypothesis testing to infer the population slope ($\beta_1$) from
the sample slope ($b_1$).
If $\beta_1 = 0$, there is no linear relationship (but there may be, e.g., a
quadratic relationship).
The sample slope ($b_1$) is an unbiased estimator of the population
slope ($\beta_1$): $E(b_1) = \beta_1$. Its estimated standard error,
$s_{b_1} = \dfrac{s_\varepsilon}{\sqrt{(n-1)s_x^2}}$, decreases as $n$ increases.
Test statistic for $\beta_1$: $t = \dfrac{b_1 - \beta_1}{s_{b_1}}$ [where $\nu = n-2$]
Confidence interval estimator of $\beta_1$: $b_1 \pm t_{\alpha/2}\, s_{b_1}$ [where $\nu = n-2$]
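A sketch of the slope test and confidence interval using scipy; the data and the 5% significance level are illustrative assumptions:

```python
import numpy as np
from scipy import stats

# Illustrative data (not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])
n = len(x)

res = stats.linregress(x, y)   # least squares fit; res.stderr is s_b1

# Test H0: beta1 = 0 against H1: beta1 != 0, with nu = n - 2
t_stat = res.slope / res.stderr
p_value = 2 * stats.t.sf(abs(t_stat), df=n - 2)

# 95% confidence interval estimator of beta1
t_crit = stats.t.ppf(0.975, df=n - 2)
lo, hi = res.slope - t_crit * res.stderr, res.slope + t_crit * res.stderr

print(f"t = {t_stat:.3f}, p = {p_value:.4f}, 95% CI: ({lo:.3f}, {hi:.3f})")
```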
Coefficient of Determination
Coefficient of determination:
$R^2 = \dfrac{s_{xy}^2}{s_x^2 s_y^2} = 1 - \dfrac{\text{SSE}}{\sum (y_i - \bar{y})^2} = \dfrac{\text{Explained variation}}{\text{Variation in } y}$
Decomposition of the variation in $y$:
$(y_i - \bar{y}) = \underbrace{(y_i - \hat{y}_i)}_{\text{unexplained residual}} + \underbrace{(\hat{y}_i - \bar{y})}_{\text{explained variation}}$
$\sum (y_i - \bar{y})^2 = \sum (y_i - \hat{y}_i)^2 + \sum (\hat{y}_i - \bar{y})^2$
Variation in $y$ = SSE + SSR
Coefficient of Correlation
We can use hypothesis testing to infer the population coefficient of
correlation ($\rho$) from the sample coefficient of correlation ($r$).
Sample coefficient of correlation: $r = \dfrac{s_{xy}}{s_x s_y}$
Test statistic for $\rho$: $t = r\sqrt{\dfrac{n-2}{1-r^2}}$ [where $\nu = n-2$ and the variables are
bivariate normally distributed]
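The same t statistic can be reached through the correlation coefficient; a sketch using scipy's pearsonr (illustrative data again):

```python
import numpy as np
from scipy import stats

# Illustrative data (not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])
n = len(x)

r, p = stats.pearsonr(x, y)                   # r and its two-sided p-value
t_stat = r * np.sqrt((n - 2) / (1 - r ** 2))  # the formula above, by hand

print(f"r = {r:.4f}, t = {t_stat:.3f}, p = {p:.4f}")
```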
16.5 Using the Regression Equation
$\hat{y}_i = b_0 + b_1 x_i$ is a point estimator.
There are two interval estimators for a given value $x_g$:
1. Prediction interval (for a particular value of $y$):
$\hat{y} \pm t_{\alpha/2,\,n-2}\, s_\varepsilon \sqrt{1 + \dfrac{1}{n} + \dfrac{(x_g - \bar{x})^2}{(n-1)s_x^2}}$
2. Confidence interval estimator of the expected value of $y$:
$\hat{y} \pm t_{\alpha/2,\,n-2}\, s_\varepsilon \sqrt{\dfrac{1}{n} + \dfrac{(x_g - \bar{x})^2}{(n-1)s_x^2}}$
The farther the given value of $x_g$ is from $\bar{x}$, the greater the estimated
error, because the term $\dfrac{(x_g - \bar{x})^2}{(n-1)s_x^2}$ grows.
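A sketch of both interval estimators at a given value $x_g$; the data, $x_g = 3.5$, and the 95% level are all illustrative assumptions:

```python
import numpy as np
from scipy import stats

# Illustrative data and a given value x_g (not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])
n, x_g = len(x), 3.5

res = stats.linregress(x, y)
y_hat = res.intercept + res.slope * x_g

e = y - (res.intercept + res.slope * x)
s_eps = np.sqrt((e ** 2).sum() / (n - 2))
t_crit = stats.t.ppf(0.975, df=n - 2)

# Shared term: grows as x_g moves away from x-bar
d = 1 / n + (x_g - x.mean()) ** 2 / ((n - 1) * x.var(ddof=1))

pred_half = t_crit * s_eps * np.sqrt(1 + d)  # prediction interval (one y)
conf_half = t_crit * s_eps * np.sqrt(d)      # CI for the expected value of y

print(f"prediction interval: {y_hat:.3f} +/- {pred_half:.3f}")
print(f"CI for E(y):         {y_hat:.3f} +/- {conf_half:.3f}")
```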
16.6 Regression Diagnostics (Part 1)
Residual analysis
Standard deviation of the $i$th residual: $s_{e_i} = s_\varepsilon \sqrt{1 - h_i}$,
where $h_i = \dfrac{1}{n} + \dfrac{(x_i - \bar{x})^2}{(n-1)s_x^2}$.
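A sketch of standardised residuals built from $h_i$ (illustrative data; the $\pm 2$ cut-off is a common rule of thumb, not a course rule):

```python
import numpy as np
from scipy import stats

# Illustrative data (not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])
n = len(x)

res = stats.linregress(x, y)
e = y - (res.intercept + res.slope * x)
s_eps = np.sqrt((e ** 2).sum() / (n - 2))

# Leverage h_i and the standard deviation of each residual
h = 1 / n + (x - x.mean()) ** 2 / ((n - 1) * x.var(ddof=1))
s_e = s_eps * np.sqrt(1 - h)

# Standardised residuals; values beyond about +/-2 flag potential outliers
print(np.round(e / s_e, 3))
```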
Normality
The residuals should be normally distributed.
Homoscedasticity
The variance of the error variable should be constant.
Independence of the error variable
The error variables should be independent of one another.
Outliers
Outliers may be:
1. Recording errors
2. Points that should not have been included in the sample
3. Valid observations that belong in the sample
Influential observations
Some points are influential in determining the least squares line: without
such a point, the fitted line may change dramatically.
Procedure
1. Develop a model that has a theoretical basis; find an independent variable
that you believe is linearly related to the dependent variable.
2. Gather data for the two variables, preferably from a controlled
experiment; otherwise use observational data.
3. Draw a scatter diagram. Determine whether a linear model is appropriate.
Identify outliers and influential observations.
4. Determine the regression equation.
5. Calculate the residuals and check the required conditions:
a. Is the error variable normal?
b. Is the variance constant?
c. Are the errors independent?
6. Assess the model's fit:
a. Compute the standard error of estimate.
b. Test $\beta_1$ or $\rho$ to determine whether there is a linear
relationship.
c. Compute the coefficient of determination.
7. If the model fits the data, use the regression equation to:
a. Predict a particular value of the dependent variable
b. Estimate its mean
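As a capstone, the whole procedure can be run in a few lines with statsmodels (my choice of library; the course itself uses Excel). The data and the prediction point are illustrative:

```python
import numpy as np
import statsmodels.api as sm

# Step 2: illustrative data (not from the course)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Step 4: determine the regression equation
model = sm.OLS(y, sm.add_constant(x)).fit()

# Step 5: residuals for checking the required conditions
residuals = model.resid

# Step 6: assess the fit (standard error, t-test of the slope, R^2)
print(model.summary())

# Step 7: predict at a new value of x (x_g = 3.5 is illustrative;
# the leading 1.0 corresponds to the constant term)
print(model.predict([[1.0, 3.5]]))
```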