Professional Documents
Culture Documents
Cheng Hsiao
To cite this article: Cheng Hsiao (1985) Benefits and limitations of panel data, Econometric
Reviews, 4:1, 121-174, DOI: 10.1080/07474938508800078
Cheng Hsiao
University of Southern California, Los Angeles 90089-0152
and
University of Toronto, Toronto M5S 1Al
Key Words and Phrases: panel data; longitudinal data; fixed
e f f e c t s ; random e f f e c t s ; linear mode 2s; nonlinear models;
dynamic models; l a t e n t variable mode 2s.
ABSTRACT
Observations for a number of cross-sectional units over time
have become increasingly available. The new data sources enable
econometricians to construct and test more complicated behavioral
models than a single cross sectional or time series data set
would allow. The availability of new data sources, however, also
raises new issues. In this paper we review some basic econo-
metric methods that have been used to analyze such data sets. We
also indicate areas of research where panel data may be useful.
1. INTRODUCTION
- N - N - N
where yt -- IiZlyit/N, x =
-t git/N, and ut = Iizl
tion is not that each technician comes and goes, randomly sampled
from all employees, but that all are available, and if we want to
assess differences between these specific technicians, then the
fixed effect model is more appropriate. Similarly, if an experi-
ment involves hundreds of individuals that are considered a
random sample from some larger population, random effects are
more appropriate. But if the situation is one of analyzing just
a few individuals, say five or six, where the sole interest lay
in just these individuals, then individual effects would more
appropriately be fixed and not random. The situation to which a
model applies and the inferences based on it are the deciding
factors in determining whether we should treat effects as random
or fixed. When individual units in the sample are of interest,
the effects are more appropriately considered fixed. When infer-
ences will be made about the characteristics of a population from
which those in the data are considered to be a random sample,
then the effects should be considered r a n d ~ m . ~
Closely related to the question of fixed effect or random
effect inference is the impact of nonorthogonality between the
effects a . and the included explanatory variables, x on the
1 -it'
appropriateness of either approach. There are arguments which
suggest that the individual effects and the explanatory variables
are correlated (Mundlak (1961, 1978)). For example, in a panel
of farms observed over several years, suppose that y is a
it
measure of the output of the i-th farm in the t-th season, x.
-1t
are measured inputs, a represents the input reflecting soil
i
quality and other characteristics of the farm's location which
are known to the farmer but unknown to the investigating econo-
2 ~ this
n sense, if N becomes large, one would not be interested
in the specific effects of each individual but rather in the
characteristics of the population. A random effect framework
would be more appropriate.
BENEFITS AND LIMITATIONS O F PANEL DATA
i = 1,..., N,
8, = yi - Gi,
T
where ii = 2 T and ji = It=l
T
where ; = ItZ1uit/T. The transformation of the data attained
i
by subtracting from each observation the time series mean for the
138 HSIAO
where
BENEFITS AND LIMITATIONS OF PANEL DATA
sponding y
it
and x
-it
values, then regress [yit-(1-$ +) y- i ] On
i c a l l y e q u i v a l e n t t o t h e GLS e s t i m a t o r .
When a . a r e c o r r e l a t e d w i t h
1
zit, t h e random e f f e c t i n f e r e n c e
i n v o k e s a j o i n t d i s t r i b u t i o n of ( a . ,511. For l i n e a r r e g r e s s i o n
1
models i t i s s u f f i c i e n t t o assume t h a t
- i = 1, ..., N, (3.14)
E*(ailxi) = 1-1 + w
a*' Z i 3
f ( a i I s i ) i s l i n e a r , t h e n E * ( a i ( x-1. ) = E ( a i l z i ) .
3 ~ E
BENEFITS AND LIMITATIONS OF PANEL DATA 141
E9; (x
HS I A O
where
where . =
-1
xi - E 1 ) and Ef = vec(n') ' = [zi,.. . , $1
is a IXKT' n ' denoting the t-th row of
vector with *t n. We can
and
where
.+;.:
If the conditional variance-covariance matrix is homoscedastic,
namely, Var(xi(5x) = V does not depend on x , (3.20) will con-
verge to v
Equation (3.17) implies that l is subject to restrictions.
Let g = (&' , s t ) . We specify the restrictions on fl ((3.17)) by
the conditions that
One is the correlation between the time persistent errors and the
lagged dependent variables. The other is the problem of initial
observations. The issue of correlation between the residuals and
lagged dependent variables is not affected by the size of the
time series observations, T, while the initial value problem
arises only when T is small. When T is large, the weight of
the initial observation in the likelihood function becomes negli-
gible and it is appropriate to ignore this issue.
If ai are treated as fixed constants, the LSDV estimators
for y and & are biased when T is fixed.5 The bias is caused by
having to eliminate the unknown individual constant (or effects)
a i from each observation which creates the correlation of order
1/T between (yi,t-l-Gi and the residuals in the transformed
7
model,
- - T
where /T, assuming yio a r e observable.
Yi,-l 't=l Y i , t - 1
When T i s v e r y l a r g e , t h e r i g h t - h a n d s i d e v a r i a b l e s become
a s y m p t o t i c a l l y u n c o r r e l a t e d and t h e LSDV e s t i m a t o r s a r e c o n s i s -
tent. For s m a l l T, t h e b i a s f o r y i s n e g a t i v e and t h e b i a s f o r
$ i s positive if y > 0. The b i a s does n o t go t o z e r o a s Y
approaches t o z e r o (Anderson and Hsiao ( 1 9 8 1 ) , N i c k e l 1 ( 1 9 8 1 ) ) .
When a . a r e t r e a t e d a s random, t h e amalgamated e r r o r terms
1
v. = a.
~t 1
+ u i t a r e s e r i a l l y c o r r e l a t e d and a r e c o r r e l a t e d w i t h
Hence t h e LS e s t i m a t o r i s b i a s e d . The LS e s t i m a t e s of y
y i
and $ a r e b i a s e d upward and b i a s e d toward z e r o , r e s p e c t i v e l y
(Nerlove ( 1 9 7 1 ) , Trognon ( 1 9 7 8 ) ) .
To implement t h e maximum l i k e l i h o o d p r o c e d u r e , we need t o
i n i t i a l i z e a t i m e dependent. s t o c h a s t i c p r o c e s s . For fixed T
t h e assumption a b o u t t h e i n i t i a l o b s e r v a t i o n s d i r e c t l y a f f e c t s
t h e i n t e r p r e t a t i o n of a random e f f e c t model and t h e c o n s i s t e n c y
of t h e MLE (Anderson and Hsiao (1981, 1 9 8 2 ) ) .
I f yio a r e assumed t o be f i x e d c o n s t a n t s , t h e MLE i s c o n s i s -
t e n t when N tends t o i n f i n i t y . However, f i x e d yio and (3.26)
t o g e t h e r imply t h a t Cov(yio,ai) = 0 and Cov(yit,ai) # 0 for t =
1 , ..., . I n o t h e r words, t h e model p e r m i t s a c r o s s - s e c t i o n a l
unit to s t a r t from some a r b i t r a r y p o s i t i o n and t h e i n d i v i d u a l
effects, a a r e n o t b r o u g h t i n t o t h e model i n time 0 b u t a f f e c t
i'
t h e p r o c e s s i n time 1 and l a t e r . I f t h e p r o c e s s h a s been i n
o p e r a t i o n p r i o r t o t h e t i m e it i s sampled, it does n o t seem
r e a s o n a b l e t o view y a s d i f f e r e n t from any o t h e r y t =
io it'
1,. .. . Indeed, if t h e t r u e Cov(yio,ai) # 0, maxhizing the
likelihood function L under t h e assumption of fixed initial
o b s e r v a t i o n s w i l l n o t y i e l d c o n s i s t e n t e s t i m a t o r s f o r y and &
if T i s fixed, even when N is l a r g e because E(BL/%) # 0.
I n g e n e r a l , i t would seem more r e a s o n a b l e t o a l l o w an i n d i -
v i d u a l t o s t a r t a t any a r b i t r a r y p o s i t i o n , and a l s o t o a l l o w f o r
BENEFITS AND LIMITATIONS OF PANEL DATA
m
Let w
io - xi,-jYJ. Since -it
stand for ,@ttj-o x are different for
different i and t, each y io actually has different mean w.
10
for i = 1, . . . , N.
If we wish to estimate wio together with Y and &, there are
incidental parameters problems. Moreover, the MLE of wio are not
independent of the MLE of y and &. Hence the MLE is inconsistent
if T is fixed, if it exists at all.
To circumscribe the incidental parameter problem, Bhargava
and Sargan (1983) suggest that we predict wio by all the observed
148 HSIAO
-x.
1
(also see Chamberlain (1984), Heckman (1981b)). Conditional
on x . we have
"1'
1 if y . > O ,
lt
0 otherwise.
The log likelihood function for the random effect model is given
by
pared with the linear case. In the linear case it was not re-
strictive to decompose a into its linear projection on x and an
i -i
orthogonal residual. Now we are assuming that E(a.(x.) is indeed
1 "1
linear, that w. is independent of L , and that wi has a specific
1
probability distribution. These are restrictive assumptions and
there is a payoff to relaxing them.
When ai are treated as fixed constants, the distribution of
ai plays no role and the computation of the MLE can be simpli-
fied. However, the ML estimation of ai and & are not separable.
For typical panels, T is small and N is large. The inconsis-
tency of cr is transmitted into the MLE of & (e.g., see Heckman
i
(1981a) for some Monte Carlo evidence). Moreover, in general,
simple functions for Y (see above (3.5)) are not always easy to
Nj
find. Andersen (1970, 1973) has suggested a general approach to
find Y by conditioning the likelihood function of the observ-
Nj
ables on the minimum sufficient statistics Ti for the incidental
parameters ai. Provided that t , exists and is not dependent on
1
the structural parameters 2 , the conditional likelihood function
of observables conditioning on ti, i = 1,. . . , N, will no longer
depend on ai. Hence under mild regularity conditions, maximizing
the conditional likelihood function will yield consistent estima-
tors of f?.6 Unfortunately, for nonlinear models, except for
special cases such as the fixed effect logit models (Andersen
(1973), Chamberlain (1980)) or Poisson models (Hausman, Hall and
Griliches (1984)), it is not generally possible to find the
minimum sufficient statistic ti for the incidental parameters ai
which are independent of the structural parameters (e.g., the
'When uit are normally distributed, the LSDV estimator of & for
the linear model is the conditional MLE (Cornwell and Schmidt
(1984)).
HSIAO
4. NON-STANDARD SITUATIONS
are correlated with each other, then unless there are sufficient
numbers of exogenous variables in the first equation which do not
appear in the other equations, the system (4.1) is not identified
(e.g., Hsiao (1983)).
If the correlation of (v v v ) are due to a common omitted
1' 2' 3
variable, say,
2 2 2 2
Ev, v! =
-1t-lt (
0: t oU)$cl4 t diag(ol ,02,03) ,
where 2'
= (d 1 ,d2 ,d3) . Note that the within group variance-
covariance matrix (4.6) is only of rank one.
For simplicity, we assume that there is no restriction on
the coefficients of exogenous variables. Under this assumption,
we may further ignore the existence of exogenous variables with-
out loss of generality because there are no excluded exogenous
variables which may be legitimately used as instruments for the
exogenous variables appearing in the equation. We also assume
> 2.
y32 = 0 and T - If the variance-covariance matrix of the
error term is unrestricted, the model remains unidentified.
However, if v satisfy (4.5)-(4.7), we may use y purged of the
-it
common omitted variable h as instruments to identify the model
(Chamberlain (1977a,b)).
To see how we may construct instruments by utilizing (4.6),
we rewrite (4.1) under the assumption that yg2 = 0, and &g = 2, g
= 1, 2, 3 , in terms of the following reduced form,
u
'dl lit
d2+Y21dl U2itt'21ulit
Ld3+Y31dl (U3itt'3~ulit,
BENEFITS AND LIMITATIONS OF PANEL DATA
I J
E
lit
E '
2it
E
3it,
Substituting y for h in the reduced form equation for yg, we
1
have
Equation ( 4 . 9 ) s # t, as an
can be estimated by using y
lis'
instrument for y provided that d a2 $ 0 . Purge y of its
lit' 1 cu 2
dependence on h and form the residual
- - - - -
Zit -
2
W
- '2it al 'lit - '2it al 'lit *
L
Then, provided d2ul # 0, w2 can be used as an instrument for y1
in the structural equation y
3:
v = -2 + U - -dl2 U
Zit dl 'lit Zit lit '
lit
+ U
3it
- -d3 U
lit
dl
t = 1, . . . , T ,
where
lt ) are determined
The values of the truncation remainders (b.
by the lag coefficients and the presample x's. Identification
requires constraints on either the lag coefficients or on the
stochastic process generating these x's. Because there are
usually many more degrees of freedom available in panel data, we
can use a different kind of prior restrictions than the usual
approach of constraining lag coefficients to identify the trun-
cation remainders (Pakes and Griliches (1984)).
For instance, if x is generated by a p-th order autore-
gressive process, then for q 2 0, the projection of the unob-
served presample x's on x is given by
-i
03
(9)
where $t,j = Zqzl Bt+q~j , J = I,..., Pa
,
.,
t-j > p
where $t
,j = $t-p+j-l + @t,j if and = $t, otherwise
Subtracting yit by yil, we have
m
restrictions on the sequence {f3T)T=o. Of course, if T is small
relative to p, we will not be able to build up much information
on the tail of the lag distribution. This simply reflects that
short panels, by their very nature do not contain unconstrained
information on that tail. However, the early coefficients are
often of significant interest in themselves. Moreover, they may
provide a basis for restricting the lag structure to be a func-
tion of a small number of parameters in further work.
plim f3 -
N*
d - B(l - Var(zit -&zi,t-I) )
162 HSIAO
5. CONCLUSION
ACKNOWLEDGEMENTS
This work was supported in part by the Social Sciences and
Humanities Research Council of Canada grant 410-84-1032 to the
University of Toronto. I would also like to thank D. Young for
editorial assistance and M. Denny, C. Manski, A. Melino, D.
Mountain, S. Nickell, D. Poirier, an associate editor and a
referee for helpful comments.
REFERENCES
Aigner, D.J., Hsiao, C., Kapteyn, A , , & Wansbeek, T., (1984).
Latent variable models in econometrics. Handbook of Econo-
metrics, Vol. 11, ( Z . Griliches & M. Intriligator, Eds.)
Amsterdam: North-Holland, 1322-1393.
Ashenfelter, O., Deaton, A., & Solon, G., (1984). Does it make
sense to collect panel data in developing countries? Mimeo.
Dielman, T., Nantell, T., & Wright, R., (1980). Price effects of
stock repurchasing: a random coefficient regression approach.
Journal of Financial and Quantitative Analysis, 15, 175-189.