Professional Documents
Culture Documents
Model Specifica-On: Motivation
Model Specifica-On: Motivation
Specifica-on: Mo-va-on
2
Introduc-on
• Ques-on 1: Do we include all explanatory variables or only a
few?
3
Introduc-on
• Suppose that all explanatory variables in a dataset are
relevant for the dependent variable. Should we include all?
4
Example: Stock market index
5
Explanatory variables
• Stock characteris-cs: Dividends, earnings, vola-lity, book
value,issuing ac-vity
6
Stock index and book-to-market ra-o
7
Log stock index and book-to-market ra-o
8
Change in log stock index, and book-to-
market ra-o
9
Regression output
10
Bias-efficiency trade-off
11
Bias-efficiency trade-off
12
Decision metrics
– Informa-on criteria
– Out-of-sample predic-on
13
Informa-on criteria
• Commonly used informa-on criteria:
– Answer: Penalty is 2/n for AIC and log(n)/n for BIC; BIC imposes
stronger penalty if log(n) > 2, n≥ 8.
14
Out-of-sample predic-on
• Commonly used out-of-sample predic-on metrics:
15
Itera-ve selec-on methods
• Commonly used methods to select explanatory variables:
– t-test and F-test
– Informa-on criteria
– Out-of-sample predic-ons
16
Data transforma-on
• Seang:
17
Taking logarithms
• Use for:
– Exponen-al growth..
18
Taking differences
• Use for:
• Trending paderns
– Sta-s-cal assump-ons may not hold.
First difference:
19
Non-linear effects
• Advantages:
– Get non-linear func-onal form.
– May provide economically meaningful speciffica-on.
20
Non-linear effects
21
Endogeneity
• OLS requires some assump-ons:
– explanatory variables should be exogenous
– viola-on of this: endogeneity.
• We want to explain
22
Endogeneity
• Given the es-mates (y: flights, x: insurances)
y = 10000 + 0.25x + e
• Correct: 4000 insurances sold è expected number of flights
= 10000 + 0.25*4000= 11000
– High x tends to go together with high y.
– The iden-fied correla-on yields adequate predic-ons.
• Implica-ons:
– Obtain new data: X stays constant (and y changes)
– Need "controlled experiment"
– OLS es-mator b converges to true coefficient β for n è 1 (OLS is
consistent)
24
Economic models
• In economics:
– Controlled (or natural) experiments are rare
– New data with same X cannot be obtained
– Explanatory variables are stochas-c!
• If X stochas-c:
– new data set è new X values
– X can be correlated with other variables
– If X correlated with e
• X is endogenous
• There is another variable that affects y and X
– OLS does not properly es-mate β (inconsistent)
– If X uncorrelated with e
• X is exogenous
• OLS consistent
25
Omided variable - Example
• Model student's grade using adendance at lectures.
26
Other examples - Strategic behavior
• Consider a model explaining demand using price.
27
Other examples - Measurement errors
28
Endogeneity
• Common problem in economics
1 Omided variables
2 Strategic behavior
3 Measurement errors
è X is correlated with ε
29
Simulated example, y = 1 + 2x* + u
•
30
Measurement error example
31
Direc-on of bias in the measurement error
case
• OLS is "biased towards zero"
• Intui-vely:
– x-values on the les likely have nega-ve measurement errors
– x-values on the right likely have posi-ve measurement errors
32
OLS in presence of endogeneity
• If X endogenous
– X correlated with ε
– Even with in infinite amount of data: OLS does not give useful
es-mates
33
"Solving endogeneity": Graphical
representa-on
•
34
Instrumental variable es-ma-on
• Z variables are instruments if
– Z and X are correlated
– Z does not correlate with ε
35
Solving "endogeneity: Graphical
representa-on
1 Use Z to decompose X in explained and unexplained part
2 Eect size of explained part on y equals β
3 Unexplained part is added to error term
Endogeneity is solved as
• X unexplained not correlated with X explained
• X unexplained is exogenous
36
Finding instruments
What are good instruments?
37
Examples of instruments
• Explain obtained grade using adendance:
– Poten-al instruments:
• Travel -me home to university
• Policy change to obligatory adendance
38
Summary
If X is in fact exogenous
– OLS and 2SLS both consistent
– Variance OLS smaller than variance 2SLS!
èUse OLS
If X is endogenous
– 2SLS is consistent
– OLS inconsistent
è Use 2SLS
39
Examples of instruments
• Explain obtained grade using adendance:
– Poten-al instruments:
• Travel -me home to university
• Policy change to obligatory adendance
40
Tes-ng the validity of instruments
• Valid instruments sa-sfy three condi-ons
1 There are enough instruments
– Easy! Just count.
2 Instruments are correlated (enough) with X
– Check significance of instruments in first stage regression
3 Instruments are not correlated with ”
– Perform Sargan test
41
Test correla-on Z vs. X
• X1 poten-ally endogenous variables
• X2 exogenous variables
• Z = (Z*; X2) instruments
42
Sargan test
•
43
Sargan test
•
44
Notes on the Sargan test
• Test only works when there are too "many" instruments
(m > k)
45
Tes-ng for exogeneity of variables-
Hausman test
• Intui-on:
– Use the instruments to split poten-ally endogenous variables into
• 1 a guaranteed exogenous part
• 2 a poten-ally endogenous part
46
Hausman test - procedure
•
47