Ch6. Multiple Regression: Estimation
1 The model
yi = β0 + β1 xi1 + β2 xi2 + · · · + βk xik + εi ,  i = 1, 2, · · · , n.   (1)
The assumptions for εi and yi are analogous to those for simple linear regression,
namely
1. E(εi) = 0 for all i = 1, 2, · · · , n, or, equivalently, E(yi) = β0 + β1 xi1 + β2 xi2 + · · · + βk xik.
2. var(εi) = σ² for all i = 1, 2, · · · , n, or, equivalently, var(yi) = σ².
3. cov(εi, εj) = 0 for all i ≠ j, or, equivalently, cov(yi, yj) = 0.
In matrix form, the model can be written as
\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}
=
\begin{pmatrix}
1 & x_{11} & x_{12} & \cdots & x_{1k} \\
1 & x_{21} & x_{22} & \cdots & x_{2k} \\
\vdots & \vdots & \vdots & & \vdots \\
1 & x_{n1} & x_{n2} & \cdots & x_{nk}
\end{pmatrix}
\begin{pmatrix} \beta_0 \\ \beta_1 \\ \vdots \\ \beta_k \end{pmatrix}
+
\begin{pmatrix} \epsilon_1 \\ \epsilon_2 \\ \vdots \\ \epsilon_n \end{pmatrix},
or
y = Xβ + ε.
The assumptions on εi or yi can be expressed as
1. E(ε) = 0 or E(y) = Xβ.
2. cov(ε) = σ²I or cov(y) = σ²I.
The matrix X is n × (k + 1) and is called the design matrix. In this chapter, we assume that n > k + 1 and rank(X) = k + 1.
2 Estimation of β and σ²
2.1 Least squares estimator for β
The least squares approach is to seek the estimator of β that minimizes the sum of squared deviations of the n observed y's from their predicted values ŷ, i.e., to minimize
\sum_{i=1}^{n} \hat{\epsilon}_i^2 = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2.
Theorem 2.1 If y = Xβ + ǫ, where X is n × (k + 1) of rank k + 1 < n, then
the least squares estimator of β is
β̂ = (X ′ X)−1 X ′ y.
Proof: Exercise.
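Numerically, Theorem 2.1 is a single linear solve. The following is a minimal numpy sketch on simulated data (the data, seed, and variable names are illustrative, not from the text); solving the normal equations with np.linalg.solve, or calling np.linalg.lstsq, is preferable to forming (X′X)⁻¹ explicitly.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 50, 3                              # n observations, k regressors (illustrative)

X1 = rng.normal(size=(n, k))              # regressor values
X = np.column_stack([np.ones(n), X1])     # design matrix with an intercept column
beta_true = np.array([2.0, 1.0, -0.5, 0.3])
y = X @ beta_true + rng.normal(size=n)

# Least squares estimator: beta_hat solves (X'X) beta = X'y
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Numerically safer equivalent via a least squares solver
beta_hat_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)

print(beta_hat)
print(np.allclose(beta_hat, beta_hat_lstsq))   # True
```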
2.2 Properties of the least squares estimator β̂
Theorem 2.2 If E(y) = Xβ , then β̂ is an unbiased estimator for β .
Proof:
E(β̂) = E[(X ′ X)−1 X ′ y]
= (X ′ X)−1 X ′ E(y)
= (X ′ X)−1 X ′ Xβ
= β.
Theorem 2.3 If cov(y) = σ²I, the covariance matrix for β̂ is given by σ²(X′X)⁻¹.
Proof: Exercise.
Theorem 2.4 (Gauss-Markov Theorem) If E(y) = Xβ and cov(y) = σ 2 I , the
least squares estimators β̂j , j = 0, 1, · · · , k , have minimum variance among all
linear unbiased estimators, i.e., the least squares estimators β̂j , j = 0, 1, · · · , k are
best linear unbiased estimators (BLUE).
Proof: We consider a linear estimator Ay of β and seek the matrix A for which Ay
is a minimum variance unbiased estimator of β . Since Ay is to be unbiased for β , we
have
E(Ay) = AE(y) = AXβ = β,
which gives the unbiasedness condition
AX = I
since the relationship AXβ = β must hold for any value of β.
The covariance matrix for Ay is
cov(Ay) = A(σ 2 I)A′ = σ 2 AA′ .
The variances of the estimators of the βj's are the diagonal elements of σ²AA′, and therefore we need to choose A (subject to AX = I) so that the diagonal elements of AA′ are minimized. Now
AA′ = [A − (X ′ X)−1 X ′ + (X ′ X)−1 X ′ ][A − (X ′ X)−1 X ′ + (X ′ X)−1 X ′ ]′
= [A − (X ′ X)−1 X ′ ][A − (X ′ X)−1 X ′ ]′ + (X ′ X)−1 .
Note that the last equality uses AX = I. Since [A − (X′X)⁻¹X′][A − (X′X)⁻¹X′]′ is positive semidefinite, its diagonal elements are greater than or equal to zero. These diagonal elements can be made equal to zero by choosing A = (X′X)⁻¹X′. (This value of A also satisfies the unbiasedness condition AX = I.)
The resulting minimum variance estimator of β is
Ay = (X ′ X)−1 X ′ y,
which is equal to the least squares estimator β̂.
Remark: The remarkable feature of the Gauss-Markov theorem is its distributional generality. The result holds for any distribution of y; normality is not required. The only assumptions used in the proof are E(y) = Xβ and cov(y) = σ²I. If these assumptions do not hold, β̂ may be biased or each β̂j may have a larger variance than that of some other estimator.
Corollary 2.1 If E(y) = Xβ and cov(y) = σ²I, the best linear unbiased estimator of a′β is a′β̂, where β̂ = (X′X)⁻¹X′y.
A fourth property of β̂ is the following: the predicted value ŷ = β̂0 + β̂1 x1 + · · · + β̂k xk = β̂′x is invariant to simple linear changes of scale on the x's, where x = (1, x1, x2, · · · , xk)′.
Theorem 2.5 If x = (1, x1, · · · , xk)′ and z = (1, c1 x1, · · · , ck xk)′, then ŷ = β̂′x = β̂z′z, where β̂z is the least squares estimator from the regression of y on z.
Proof: We can write Z = XD, where D = diag(1, c1, c2, · · · , ck). Substituting Z = XD into β̂z, we have
β̂ z = [(XD)′ (XD)]−1 (XD)′ y
= D −1 (X ′ X)−1 X ′ y
= D −1 β̂.
Hence, since z = Dx,
β̂z′z = (D⁻¹β̂)′Dx = β̂′D⁻¹Dx = β̂′x.
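A quick numerical illustration of Theorem 2.5 (a sketch on simulated data; the scale factors and all names are illustrative): rescaling the x's rescales the coefficients by D⁻¹ but leaves the fitted values unchanged.

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 40, 2
X = np.column_stack([np.ones(n), rng.normal(size=(n, k))])
y = X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=n)

c = np.array([10.0, 0.25])                  # scale factors c1, c2 (illustrative)
D = np.diag(np.concatenate(([1.0], c)))     # D = diag(1, c1, ..., ck)
Z = X @ D                                   # rescaled design matrix

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
beta_hat_z = np.linalg.solve(Z.T @ Z, Z.T @ y)

print(np.allclose(beta_hat_z, np.linalg.inv(D) @ beta_hat))  # beta_hat_z = D^{-1} beta_hat
print(np.allclose(X @ beta_hat, Z @ beta_hat_z))             # identical fitted values
```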
2.3 An estimator for σ²
By assumption 1, E(yi) = xi′β, and by assumption 2, σ² = E[yi − E(yi)]², so we have
σ² = E(yi − xi′β)².
Hence, σ² can be estimated by
s^2 = \frac{1}{n-k-1}\sum_{i=1}^{n}(y_i - x_i'\hat{\beta})^2 = \frac{1}{n-k-1}(y - X\hat{\beta})'(y - X\hat{\beta}) = \frac{SSE}{n-k-1}.
With the denominator n − k − 1, s2 is an unbiased estimator of σ 2 .
Theorem 2.6 If E(y) = Xβ and cov(y) = σ 2 I , then
E(s2 ) = σ 2 .
Proof: Exercise.
Corollary 2.2 An unbiased estimator of cov(β̂) is given by
\widehat{\text{cov}}(\hat{\beta}) = s^2 (X'X)^{-1}.
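As a numerical companion to Corollary 2.2 (a minimal numpy sketch on simulated data; the names and simulation settings are illustrative), s² and the estimated covariance matrix s²(X′X)⁻¹ follow directly from the residuals:

```python
import numpy as np

rng = np.random.default_rng(2)
n, k = 60, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, k))])
y = X @ np.array([1.0, 0.5, -0.2, 0.8]) + rng.normal(scale=2.0, size=n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
resid = y - X @ beta_hat

SSE = resid @ resid
s2 = SSE / (n - k - 1)                       # unbiased estimator of sigma^2
cov_beta_hat = s2 * np.linalg.inv(X.T @ X)   # estimated cov(beta_hat)

print(s2)
print(np.sqrt(np.diag(cov_beta_hat)))        # standard errors of the beta_hat_j
```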
Theorem 2.7 If E(y) = Xβ, cov(y) = σ²I, and E(εi⁴) = 3σ⁴ for the linear model y = Xβ + ε, then s² is the best (minimum variance) quadratic unbiased estimator of σ².
Proof: See Graybill (1954) or Wang and Chow (1994, pp. 161-163).
3 The model in centered form
In matrix form, the centered model for multiple linear regression becomes
y = (j, X_c) \begin{pmatrix} \alpha \\ \beta_1 \end{pmatrix} + \epsilon,
where j is a vector of 1's, α = β0 + β1 x̄1 + · · · + βk x̄k, β1 = (β1, β2, · · · , βk)′, X1 consists of the last k columns of X (so that X = (j, X1)), and
X_c = \left(I - \frac{1}{n}J\right) X_1 =
\begin{pmatrix}
x_{11}-\bar{x}_1 & x_{12}-\bar{x}_2 & \cdots & x_{1k}-\bar{x}_k \\
x_{21}-\bar{x}_1 & x_{22}-\bar{x}_2 & \cdots & x_{2k}-\bar{x}_k \\
\vdots & \vdots & & \vdots \\
x_{n1}-\bar{x}_1 & x_{n2}-\bar{x}_2 & \cdots & x_{nk}-\bar{x}_k
\end{pmatrix}.
The matrix I − (1/n)J, where J = jj′, is sometimes called the centering matrix.
The corresponding least squares estimator becomes
\begin{pmatrix} \hat{\alpha} \\ \hat{\beta}_1 \end{pmatrix}
= [(j, X_c)'(j, X_c)]^{-1}(j, X_c)'y
= \begin{pmatrix} n & 0' \\ 0 & X_c'X_c \end{pmatrix}^{-1} \begin{pmatrix} n\bar{y} \\ X_c'y \end{pmatrix}
= \begin{pmatrix} \bar{y} \\ (X_c'X_c)^{-1}X_c'y \end{pmatrix}.
4 Normal model
4.1 Assumptions
Normality assumption:
y is Nn(Xβ, σ²I) or ε is Nn(0, σ²I).
Under normality, cov(y) = cov(ε) = σ²I implies that the y's are independent as well as uncorrelated.
4.2 Maximum likelihood estimators for β and σ²
Theorem 4.1 If y is Nn (Xβ, σ 2 I), where X is n × (k + 1) of rank k + 1 < n,
the maximum likelihood estimators of β and σ 2 are
β̂ = (X′X)⁻¹X′y,
σ̂² = (1/n)(y − Xβ̂)′(y − Xβ̂).
Proof: Exercise.
The maximum likelihood estimator β̂ is the same as the least squares estimator. The estimator σ̂² is biased since the denominator is n rather than n − k − 1. We often use the unbiased estimator s² to estimate σ².
4.3 Properties of β̂ and σ̂²
Theorem 4.2 Suppose y is Nn (Xβ, σ 2 I), where X is n×(k +1) of rank k +1 <
n and β = (β0 , · · · , βk )′ . Then the maximum likelihood estimators β̂ and σ̂ 2 have
the following distributional properties:
(i) β̂ is Nk+1(β, σ²(X′X)⁻¹).
(ii) nσ̂ 2 /σ 2 is χ2 (n − k − 1), or equivalently, (n − k − 1)s2 /σ 2 is χ2 (n − k − 1).
(iii) β̂ and σ̂ 2 (or s2 ) are independent.
Proof: (i) Since y is normal and β̂ = (X′X)⁻¹X′y is a linear function of y, with E(β̂) = β and cov(β̂) = σ²(X′X)⁻¹, it follows that β̂ ∼ Nk+1(β, σ²(X′X)⁻¹).
(ii) We can write
nσ̂²/σ² = (y/σ)′(I − X(X′X)⁻¹X′)(y/σ),
and I − X(X′X)⁻¹X′ is idempotent of rank n − k − 1 with (I − X(X′X)⁻¹X′)Xβ = 0; hence nσ̂²/σ² is χ²(n − k − 1).
(iii) Since β̂ = (X′X)⁻¹X′y and
nσ̂² = y′(I − X(X′X)⁻¹X′)y,
and (X′X)⁻¹X′(I − X(X′X)⁻¹X′) = O, it follows that β̂ and σ̂² (or s²) are independent.
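Property (ii) can be checked by simulation (a minimal sketch under the normal model; the design, sample size, and σ are illustrative choices): across repeated samples, (n − k − 1)s²/σ² should have mean about n − k − 1 and variance about 2(n − k − 1), as a χ²(n − k − 1) variable does.

```python
import numpy as np

rng = np.random.default_rng(4)
n, k, sigma = 25, 2, 1.5
X = np.column_stack([np.ones(n), rng.normal(size=(n, k))])  # fixed design
beta = np.array([1.0, 2.0, -1.0])
df = n - k - 1

stats = []
for _ in range(20000):
    y = X @ beta + rng.normal(scale=sigma, size=n)
    beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
    resid = y - X @ beta_hat
    s2 = resid @ resid / df
    stats.append(df * s2 / sigma**2)          # (n - k - 1) s^2 / sigma^2

stats = np.array(stats)
print(stats.mean(), df)                       # approx. n - k - 1
print(stats.var(), 2 * df)                    # approx. 2(n - k - 1)
```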
Theorem 4.3 If y is Nn (Xβ, σ 2 I), then β̂ and σ̂ 2 are jointly sufficient for β and
σ2 .
Proof: Use the Neyman factorization theorem. For details, see Rencher and Schaalje
(2008, pp.159-160).
Since β̂ and σ̂ 2 are jointly sufficient for β and σ 2 , no other estimators can improve
on the information they extract from the sample to estimate β and σ 2 . Thus, it is not
surprising that β̂ and s2 are minimum variance unbiased estimators.
Theorem 4.4 If y is Nn (Xβ, σ 2 I), then β̂ and s2 have minimum variance among
all unbiased estimators.
Proof: See Graybill (1976, p. 176) or Christensen (1996, pp. 25-27).
5 R² in fixed-x regression
The proportion of the total sum of squares due to regression is measured by
R^2 = \frac{SSR}{SST} = 1 - \frac{SSE}{SST},
where SST = \sum_{i=1}^{n}(y_i - \bar{y})^2, SSR = \sum_{i=1}^{n}(\hat{y}_i - \bar{y})^2 = \hat{\beta}'X'y - n\bar{y}^2, SSE = \sum_{i=1}^{n}(y_i - \hat{y}_i)^2, and
SST = SSR + SSE.
The R2 is called the coefficient of determination or the squared multiple correla-
tion. The positive square root R is called the multiple correlation coefficient. If the x’s
were random, R would estimate a population multiple correlation.
We list some properties of R2 and R.
1. The range of R2 is 0 ≤ R2 ≤ 1. If all the β̂j ’s were zero, except for β̂0 , R2
would be zero. (This event has probability zero for continuous data.) If all the
y -values fell on the fitted surface, that is, if yi = ŷi , i = 1, 2, · · · , n, then R2
would be 1.
2. R = ryŷ ; that is, the multiple correlation is equal to the simple correlation
between the observed yi ’s and the fitted ŷi ’s.
3. Adding a variable x to the model increases (cannot decrease) the value of R².
4. If β1 = β2 = · · · = βk = 0, then
E(R²) = k/(n − 1).
Note that the β̂j's will not be zero when the βj's are zero.
5. R² cannot be partitioned into k components, each of which is uniquely attributable to an xj, unless the x's are mutually orthogonal, that is, unless \sum_{i=1}^{n}(x_{ij} - \bar{x}_j)(x_{im} - \bar{x}_m) = 0 for j ≠ m.
6. R2 is invariant to full-rank linear transformations on the x’s and to a scale
change on y (but not invariant to a joint linear transformation including y and
the x’s).
Adjusted R² (R_a²):
R_a^2 = \text{Adj}\,R^2 = \frac{(R^2 - k/(n-1))(n-1)}{n-k-1} = \frac{(n-1)R^2 - k}{n-k-1} = 1 - \frac{SSE/(n-k-1)}{SST/(n-1)}.
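These quantities are straightforward to compute (a numpy sketch on simulated data; names and settings are illustrative). The last line also checks property 2 above, R = r_{yŷ}:

```python
import numpy as np

rng = np.random.default_rng(5)
n, k = 50, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, k))])
y = X @ np.array([1.0, 0.8, -0.4, 0.0]) + rng.normal(size=n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
y_hat = X @ beta_hat

SST = np.sum((y - y.mean())**2)
SSE = np.sum((y - y_hat)**2)
SSR = np.sum((y_hat - y.mean())**2)          # SST = SSR + SSE

R2 = 1.0 - SSE / SST
R2_adj = 1.0 - (SSE / (n - k - 1)) / (SST / (n - 1))

print(R2, R2_adj)
print(np.isclose(np.sqrt(R2), np.corrcoef(y, y_hat)[0, 1]))   # R = r_{y, y_hat}
```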
R² can also be expressed in terms of sample variances and covariances:
R^2 = \frac{\hat{\beta}_1' X_c'X_c \hat{\beta}_1}{\sum_{i=1}^{n}(y_i - \bar{y})^2}
= \frac{s_{yx}' S_{xx}^{-1}(n-1)S_{xx}S_{xx}^{-1}s_{yx}}{\sum_{i=1}^{n}(y_i - \bar{y})^2}
= \frac{s_{yx}' S_{xx}^{-1} s_{yx}}{s_y^2}.
Note that \hat{\beta}_1 = (n-1)(X_c'X_c)^{-1}\frac{X_c'y}{n-1} = \left(\frac{X_c'X_c}{n-1}\right)^{-1}\frac{X_c'y}{n-1} = S_{xx}^{-1}s_{yx}. This form of R² will facilitate a comparison with R² for the random-x case.
Geometrically, R is the cosine of the angle θ between y and ŷ, each corrected for its mean. The mean of ŷ is ȳ, the same as the mean of y. Thus, the centered forms of y and ŷ are y − ȳj and ŷ − ȳj, and
\cos\theta = \frac{(y - \bar{y}j)'(\hat{y} - \bar{y}j)}{\sqrt{[(y - \bar{y}j)'(y - \bar{y}j)][(\hat{y} - \bar{y}j)'(\hat{y} - \bar{y}j)]}}.
Note that
(y - \bar{y}j)'(\hat{y} - \bar{y}j) = [(\hat{y} - \bar{y}j) + (y - \hat{y})]'(\hat{y} - \bar{y}j) = (\hat{y} - \bar{y}j)'(\hat{y} - \bar{y}j) + (y - \hat{y})'(\hat{y} - \bar{y}j) = (\hat{y} - \bar{y}j)'(\hat{y} - \bar{y}j) + 0.
Hence,
\cos\theta = \sqrt{\frac{(\hat{y} - \bar{y}j)'(\hat{y} - \bar{y}j)}{(y - \bar{y}j)'(y - \bar{y}j)}} = \sqrt{\frac{SSR}{SST}} = R.
6 Generalized least squares: cov(y) = σ²V
The model is
y = Xβ + ε, E(y) = Xβ, cov(y) = Σ = σ²V.
Theorem 6.1 Let y = Xβ + ε, E(y) = Xβ, and cov(y) = cov(ε) = σ²V,
where X is a full-rank matrix and V is a known positive definite matrix. For this model,
we obtain the following results:
(i) The best linear unbiased estimator (BLUE) of β is
β̂ = (X ′ V −1 X)−1 X ′ V −1 y.
(ii) The covariance matrix for β̂ is
cov(β̂) = σ²(X′V⁻¹X)⁻¹.
(iii) An unbiased estimator of σ² is
s² = (y − Xβ̂)′V⁻¹(y − Xβ̂)/(n − k − 1).
Proof: (i) Since V is positive definite, there exists an n × n nonsingular matrix P such that V = PP′. Multiplying y = Xβ + ε by P⁻¹, we obtain
P⁻¹y = P⁻¹Xβ + P⁻¹ε,
for which cov(P⁻¹ε) = P⁻¹(σ²V)(P⁻¹)′ = σ²P⁻¹PP′(P′)⁻¹ = σ²I.
Applying the least squares approach to this transformed model, we get
β̂ = [X ′ (P −1 )′ P −1 X]−1 X ′ (P −1 )′ P −1 y
= (X ′ V −1 X)−1 X ′ V −1 y.
Note that since X is full rank, X′V⁻¹X is positive definite. The estimator β̂ = (X′V⁻¹X)⁻¹X′V⁻¹y is usually called the generalized least squares estimator.
(ii) and (iii) are left as exercises.
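Below is a numpy sketch of generalized least squares for a known V (simulated data; the AR(1)-type V and all names are illustrative choices, not from the text). It follows the whitening idea of the proof: with V = PP′ from a Cholesky factorization, ordinary least squares on P⁻¹y and P⁻¹X reproduces (X′V⁻¹X)⁻¹X′V⁻¹y, and part (iii) gives s².

```python
import numpy as np

rng = np.random.default_rng(6)
n, k = 40, 2
X = np.column_stack([np.ones(n), rng.normal(size=(n, k))])
beta = np.array([1.0, -0.5, 2.0])

# A known positive definite V (AR(1)-type correlation; illustrative choice)
rho = 0.6
V = rho ** np.abs(np.subtract.outer(np.arange(n), np.arange(n)))

# Generate y with cov(eps) = sigma^2 V
sigma = 1.0
P = np.linalg.cholesky(V)                    # V = P P'
y = X @ beta + sigma * (P @ rng.normal(size=n))

# GLS via whitening: regress P^{-1} y on P^{-1} X
Pinv_X = np.linalg.solve(P, X)
Pinv_y = np.linalg.solve(P, y)
beta_gls = np.linalg.solve(Pinv_X.T @ Pinv_X, Pinv_X.T @ Pinv_y)

# Direct formula (X'V^{-1}X)^{-1} X'V^{-1} y gives the same answer
Vinv = np.linalg.inv(V)
beta_gls2 = np.linalg.solve(X.T @ Vinv @ X, X.T @ Vinv @ y)

# Unbiased estimator of sigma^2 from Theorem 6.1 (iii)
resid = y - X @ beta_gls
s2 = resid @ Vinv @ resid / (n - k - 1)

print(np.allclose(beta_gls, beta_gls2), s2)
```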
Theorem 6.2 If y is Nn (Xβ, σ 2 V ), where X is n × (k + 1) of rank k + 1 and V
is a known positive definite matrix, then the maximum likelihood estimators for β and
σ 2 are
β̂ = (X′V⁻¹X)⁻¹X′V⁻¹y,
σ̂² = (1/n)(y − Xβ̂)′V⁻¹(y − Xβ̂).
Proof: Exercise.