Professional Documents
Culture Documents
2
2
Today we will explore the descriptive statistics of the panel data. The panel data structure is more
complex than cross-sectional or time series, so the descriptive statistics may be challenging to
understand. Let's use the long-form panel dataset we saved last time to illustrate panel data's basic
concepts and features. You can also download the dataset from the link below. The dataset contains
multiple observations of US workers from 2010 to 2018.
https://drive.google.com/file/d/1BfUQ...
First, let's set the panel data by typing "xtset," followed by the panel variable "ID" and the time variable
"year."
The command "xtdescribe" tells us about the patterns of the panel data. It is a balanced panel dataset.
There are 4,586 workers in the data over five time periods. The distribution of "T_i" tells us all workers
are observed for five time periods.
Next, let's examine the summary statistics of the panel data. Panel data have two dimensions, across
individuals and over time. An essential feature of the panel data is the between-variation and the
within-variation.
*************************************
* Statistics *
*************************************
xtdescribe
xtdescribe if lwage!=.
xtsum wagerate
xtsum lwage
xtsum schooling
xttab union
log close