Professional Documents
Culture Documents
1
• Recall that a model is correctly
SPECIFIED when
Specification
– No important variables are
omitted.
TSS /(n 1)
2 n 1
R 1 (1 R )
2
n k 1
RSS n 1
R 1
2
TSS n k 1 5
Adjusted R2
• Note that the “severity” of the penalty term for each additional variable
will be greater when n is small.
6
Adjusted R2 Example
n 100 100 100 100 30 30 30 30
k 2 3 4 5 2 3 4 5
10
Level-Level Form
11
12
– Underlying theory
Functional suggests non-linear DGP.
– Alternative coefficient
interpretations are desired.
Typical Non-linear Transformations
• 1. Double Log Form (Log-Log): Elasticities
14
• 2. Semi-log form: partial elasticities
dY / Y %Y
B1
dX X
15
Log-Level Form
16
2. Semi-log form: partial elasticities
dY Y
B1
dX / X %X
18
• 3.) Polynomial Functions: Squared terms
Yi 1
B1 .5 * B2
X i Xi
20
• 5.) Inverse transformation
Yi 1
B1 B2
X i Xi
Again, the marginal effect of X on Y decreases as
the value of X increases. In this case, it
approaches zero (or B1) as X approaches infinity.
a3
s3
a3 3
0
s
25
Zero (approx.) Skew
1
n
i
(Y Y ) 3
a3 3
.07
s
60
Series: X1
Sample 1 526
50
Observations 526
40 Mean 0.052784
Median 0.099718
30 Maximum 2.743626
Minimum -3.371083
Std. Dev. 1.025389
20 Skewness -0.074157
Kurtosis 2.930136
10
Jarque-Bera 0.589082
Probability 0.744873
0
-3 -2 -1 0 1 2 26
1
n
i
(Y Y ) 3
a3 3
0
s
• a3 > 0 :positive skew, or skewed to the
right.
a3 3
0
s
• a3 < 0: negative skew, skewed to the left.
29
Negative Skew
1
n
i
(Y Y ) 3
a3 3
.62
s
STATA: hist educ
sum educ, detail
200
Series: EDUC
Sample 1 526
160 Observations 526
Mean 12.56274
120 Median 12.00000
Maximum 18.00000
Minimum 0.000000
80 Std. Dev. 2.769022
Skewness -0.619574
Kurtosis 4.884245
40
Jarque-Bera 111.4653
Probability 0.000000
0
0.0 2.5 5.0 7.5 10.0 12.5 15.0 17.5
30
Problem: Relationship between
Skewed variables will be non-linear
WAGE vs. EDUC
30
WAGE is positively
25
skewed – larger
20 dispersion above mean.
15 EDUC is negatively
WAGE
10
skewed – larger
dispersion below mean.
5
-5
0 4 8 12 16 20
31
EDUC
In order to make the relationship linear in
Order parameters, data must be transformed
such that skew is reduced.
Correcting
for Skew
Square Square-root, natural log will reduce
positive skew.
4.5
4
3.5
3
LN(Y)
2.5
2
1.5
1
0.5
0
Y 33
1
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
•Square transformation increases dispersion of
larger values, reduces relative dispersion of smaller
values.
•Thus, negative skew is reduced. Distribution
becomes more symmetric.
Square Transformation to Reduce Negative Skew
700
600
500
Y2 400
300
200
100
0
1 2 3 4 5 6 7 8 9 10 11 12Y13 14 15 16 17 18 19 20 21 22 23 2434
Variable Skew Coefficient
EDUC -0.62
Transformations (EDUC)2 0.39
WAGE 2.00
Ln(WAGE) 0.39
35
Transformed Scatter Plots
3 3
2 2
LWAGE
LWAGE
1 1
0 0
-1 -1
0 4 8 12 16 20 0 50 100 150 200 250 300 350
EDUC EDUC_SQ
36
Transformed Regressions
37
• (1) A one unit increase in
EDUC increases WAGE by
0.54 units (dollars)
38
General 39
Interaction Terms
• A general interaction term captures the change in the
marginal effect of Xi on Yi due to a one unit change in a
second independent variable Zi.
Yi = + 1Xi + 2XiZi + ei
Yi
ˆ1 ˆ2 Z
X i
Yi ˆ
2
Z i X i 40
MPC Example
• Is the marginal propensity to consume of households
dependent on the wealth level of the household?
Number of obs = 30
F( 2, 27) = 65.91
Prob > F = 0.0000
R-squared = 0.8300
Adj R-squared = 0.8174
----------------------------------------------------------
cons | Coef. Std. Err. t P>|t|
-------------+--------------------------------------------
inc | .7079231 .1754693 4.03 0.000
inc_wlth | 1.14e-06 3.84e-07 2.96 0.006
_cons | 17986.08 6367.999 2.82 0.009
----------------------------------------------------------
42
Interpretation:
• 1. Interaction Coefficient 2 is positive and
significant: An increase in WEALTH leads to an
increase in MPC.
CONSi
ˆ1 ˆ2WEALTH 0.71 0.0000011*WEALTH
INCi
CONS ˆ
2 0.0000011
WEALTH i INCi
CONS CONS
INC
Wealth = 100,000
.81
.71
Wealth = 0
INC WEALTH=100,000 44
Other Examples
N N N N
Y( 0 ) B0 B1 X
For D i 1 : Y NB N B X u
i 0 0 1 i i
Y NB N B X u
i 0 0 i i
1
N N N N N
Y(1) B0 0 B1 X
Therefore :
Y(1) Y( 0 ) 0 50
Case 1: 0 > 0, B0 > 0, B1 > 0. Dummy causes
intercept to shift upwards.
Yi
Y= (B0+0)+B1X
Y= B0+B1X
B0 +0
B0
Xi
51
Case 2: 0 < 0, B0 > 0, B1 > 0. Dummy causes
intercept to shift Downwards.
Yi
Y= B0+B1X
Y= (B0+0)+B1X
B0
B0 +0
Xi
52
Wage2.dta Example
REG WAGE EDUC EXPER FEMALE
WAGE
Males
EDUCi
Females
-4.86
-6.89
Note: holding EXPER
constant.
55
• Dummy Variable Trap: Use one fewer dummy variables
than possible conditions or states. 56
– For example, gender traditionally has two possible
states, so a single variable allows for comparison of
one relative to the other.
– Other example:
Yi
Di = 1 Yi = B0+(B1+ 1)Xi
Yi = B0 + B1Xi
Di = 0
B0
Xi
62
Slope-Dummy Case 2: 1 < 0, B0>0, B1>0. Xi has a
SMALLER impact on Yi when Di = 1.
Yi
Yi = B0 + B1Xi
D=0
Yi = B0+(B1+ 1)Xi
D=1
B0
Xi
63
64
Wage1 Slope-Dummy Example
• Suppose that we do want to examine the impact of
gender on the return to education. . .
• First, we must construct a slope-dummy interaction
term between the dummy FEMALE and EDUC.
• In STATA, type:
“gen EDUC_FEM = EDUC * FEMALE”
65
• Note, you may include both an intercept and slope dummy term
simultaneously, but this can lead to MULTICOLLINEARITY issues.
• We get:
WAGE = -5.66 + 0.87*EDUC + 0.15*EXP - 0.15*EDUC_FEM
(.01) (0.00) (0.00) (0.01)
(p-values in parentheses)
1 = -0.15 < 0 . . .
• …This means that, on average, one additional year
of education will yield $0.15 per hour less in
additional wages for a female vs. a male.
WAGEi ˆ ˆ
B1 1 0.87 0.15 0.72
EDUCi
66
WAGE = -5.66 + 0.87*EDUC + 0.15*EXP - 0.15*EDUC_FEM
Slope=.87
WAGE
Di=0
Males
Slope=.72
EDUCi
Di=1
-5.66 Females