Chap-2 - Panel Data Pooled Regression Model

MODELS FOR PANEL DATA:
POOLED REGRESSION MODEL

NACEUR KHRAIEF
TUNIS BUSINESS SCHOOL - 2024
Pooled Regression
 We begin the analysis by assuming the simplest version of the model, i.e., there is no
heterogeneity among the groups, the pooled model
and
 In this form, if the remaining assumptions of the classical model are met (zero
conditional mean of , homoscedasticity, uncorrelatedness across observations and
strict exogeneity of ), then OLS is the efficient estimator,
Pooled Regression
 Stack the T observations for individual i in a single equation
 Where and now includes the constant term.

Or
 Where
Pooled Regression
 If it is assumed that is well behaved
if
 Then the most efficient estimator of is
 With estimated variance in which
 Where
Pooled Regression
 ROBUST COVARIANCE MATRIX ESTIMATION AND BOOTSTRAPPING
 However, It is quite likely that there is correlation across observations such that , then
we can estimate the appropriate asymptotic covariance matrix as follows
Pooled Regression
 ROBUST COVARIANCE MATRIX ESTIMATION AND BOOTSTRAPPING
 In this setting, there may be heteroscedasticity across individuals. Suppose, then, we

assume that the disturbance vector consists of plus these omitted components. Then,
 The ordinary least squares estimator of is

Pooled Regression
The true asymptotic covariance matrix would take the form

Pooled Regression
We can estimate this matrix
Where is the vector of T residual for individual i.

Bootstrapping offers another approach to estimating an appropriate covariance matrix
for the estimator.
Pooled Regression
 ROBUST ESTIMATION USING GROUP MEANS

The pooled regression model can also be estimated using the sample means of the data.
The implied regression model is obtained by pre-multiplying each group by where is
row vector of ones
or
and with unspecified.

Pooled Regression
 This is a heteroscedastic regression for which we would use the White estimator for
appropriate inference.
 White estimator : We use OLS (inefficient) but consistent estimators, and calculate an
alternative (“robust” or “consistent) standard error that allows for the possibility of
heteroskedasticity.
 Then: with
 The square root of the previous formulae is the heteroscedasticity-robust standard
error.
Pooled Regression
 ESTIMATION WITH FIRST DIFFERENCES

 First differencing is another approach to estimation.
 The intent would explicitly be to transform latent heterogeneity out of the model.
 which implies the first differences equation

Pooled Regression
 ESTIMATION WITH FIRST DIFFERENCES

 Advantage: it removes the latent heterogeneity from the model whether the fixed or
random effects model is appropriate.
 Disadvantage: the differencing also removes any time-invariant variables from the
model. Of course, this is not helpful for the application in the example like the impact
of Education on the Wage because it is the primary object of the analysis.
 Technical problems:
1. Trade-off between the cross-observation correlation and moving average (MA)
disturbance
2. The new disturbance is autocorrelated, then it is conceivable and could be
correlated (the two-step feasible GLS for an MA disturbance).
Pooled Regression
 THE WITHIN- AND BETWEEN-GROUPS ESTIMATORS

 We could formulate a pooled regression model in three ways.
(a). First, the original formulation is
(b). In term of deviations from the group means

,
(c). and in terms of the group means

Pooled Regression
 To estimate by pooled OLS, we would use the total sum of squares and cross products
Where
and
Pooled Regression
 In (b), the moments matrices we use are within-group’s (i.e., deviations from the
group means) sums of squares and cross products,
 Finally, for c, the means of group mean are the overall mean (i.e., ). Therefore the
moment matrices are the between-groups sums of squares and cross products
Pooled Regression
It is easy to verify that
And
Pooled Regression
 Therefore, there are three possible least squares estimator of β corresponding to

theses decompositions.
 (a). The least squares estimator in the pooling regression is
 (b). The within-groups estimator from is
This is the least squares dummy variable (LSDV) estimator.

Pooled Regression
 (c). An alternative estimator would be the between-groups estimator
This is the least squares estimator based on the N sets of group means.

Chap-2 - Panel Data Pooled Regression Model

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Chap-2 - Panel Data Pooled Regression Model

Uploaded by

Copyright:

Available Formats

MODELS FOR PANEL DATA:

POOLED REGRESSION MODEL

 Stack the T observations for individual i in a single equation

 Where and now includes the constant term.

 If it is assumed that is well behaved

 With estimated variance in which

 ROBUST COVARIANCE MATRIX ESTIMATION AND BOOTSTRAPPING

 ROBUST COVARIANCE MATRIX ESTIMATION AND BOOTSTRAPPING

 In this setting, there may be heteroscedasticity across individuals. Suppose, then, we

 The ordinary least squares estimator of is

The true asymptotic covariance matrix would take the form

We can estimate this matrix

Where is the vector of T residual for individual i.

 ROBUST ESTIMATION USING GROUP MEANS

and with unspecified.

 ESTIMATION WITH FIRST DIFFERENCES

 which implies the first differences equation

 ESTIMATION WITH FIRST DIFFERENCES

 THE WITHIN- AND BETWEEN-GROUPS ESTIMATORS

(b). In term of deviations from the group means

(c). and in terms of the group means

It is easy to verify that

 Therefore, there are three possible least squares estimator of β corresponding to

 (b). The within-groups estimator from is

This is the least squares dummy variable (LSDV) estimator.

 (c). An alternative estimator would be the between-groups estimator

You might also like