Professional Documents
Culture Documents
Ec221 ST2021 Exam
Ec221 ST2021 Exam
Ec221 ST2021 Exam
EC221
Principles of Econometrics
Instructions to candidates
This paper contains FOUR questions, divided into two sections. Section A contains ONE question
related to Michaelmas Term and Section B contains THREE questions related to Lent Term. You
should answer ALL questions from Section A and ALL questions from Section B.
If at any point in this exam you feel that anything is unclear, please make additional assumptions that
you feel are necessary and state them clearly.
For Section A: Please type your answer in a Word-processing software on a computer (e.g. Word).
You could combine the typed document with scanned or photographed hand-drawn diagrams and
computations. The maximum word count is 1500 words, beyond which nothing will be marked. There
is no minimum word count and concise answers will be rewarded.
For Section B: Please use pen and paper and scan (or photograph) your answers. You could also use
an iPad or a tablet. There is no maximum word count for Section B. Please annotate your answers
clearly.
The answers must then be converted to pdf and uploaded to Moodle as ONE individual file together
with the Coversheet. Please make sure every single scanned page is legible and properly ordered.
The file will be run through Turnitin to ensure academic integrity.
Time Allowed Submit PDF with answers within 24 hours after official start of the exam
You are supplied with: Lindley & Scott Cambridge Statistical Tables
Table A5 Durbin-Watson d-statistic
You may also use: Open book examination
Question 1
[33.34 marks]
A blowout of the BP Deepwater Horizon oil-well in April 2010 led to the largest marine oil spill in history,
lasting until July of that year. Researchers would like to analyse whether consumers reacted to the
disaster by reducing their consumption of BP branded petrol during the oil spill. They collected data
on the prices and quantities sold at BP-branded and non-BP petrol stations across zip codes (postal
codes; small local areas) in the US. Either a zip code contains BP stations, in which case the average
price and average number of gallons sold for each of these BP stations is recorded (and the indicator
variable BP = 1), or a zip code contains no BP station, in which case the price and quantity at these
non-BP stations is recorded (and BP = 0). An observation is a particular petrol station. Non-BP
stations in BP zip codes are not used in the sample.
Prices and quantities are the averages either for the period January 2009 to March 2010 (before the
oil spill) in columns (1) and (2) or for April 2010 to July 2010 (during the oil spill) in columns (3) to (6)
of the table below. Prices are in US Dollars per gallon and coded as P rice. Quantities are in logarithm
and coded as ln(sales). The researchers also constructed a variable called Green Index, which is
supposed to measure the environmental orientation of consumers in the zip code. The Green Index
is constructed by combining the share of hybrid vehicle registrations, per capita membership in the
Sierra Club, an environmental organisation, and per capita contributions to Green Party election funds
in the zip code, all measured prior to 2010. The Green Index is then standardised to have mean 0
and standard deviation 1. Using either P rice or ln(sales) as the dependent variable, the researchers
obtain the following results.
(a) Define the treatment, the outcome, and the counterfactuals implicit in the regression in column
(3)? What do the researchers use as the control group in this regression?
[3.34 marks]
(b) Why do the researchers run the regressions in columns (1) and (2) for the period before the oil
spill? What do you conclude from this exercise?
[6 marks]
(c) What is the average effect of the oil spill on BP prices in column (3)? Discuss whether this is
likely a causal effect.
[6 marks]
Question 2
[22.33 marks]
Consider the linear regression model under the Gauss Markov assumptions
y = Xβ + ε,
H0 : Rβ = c against H1 : Rβ 6= c,
where R is a J × k matrix of known constants with full row rank (no redundant restrictions) and c is
a J × 1 vector of known constants.
Lagrange multipliers play an important role in constrained optimisation. In lectures we showed how,
with the help of the Lagrangean
where λ is the J × 1 vector of Lagrange multipliers associated with our J restrictions, we could obtain
the constrained OLS estimator for β. Specifically, we showed
where β̂ is the usual OLS estimator β̂ = (X 0 X)−1 X 0 y . The estimator for the Lagrange multipliers
was shown to equal
Hint: You are allowed to make use of the well-known sampling distribution of β̂|X (no need to
derive).
[4 marks]
(c) Suppose σ 2 is known. Based on the sampling distribution of λ∗ in (b), propose a test statistic
to test for H0 : λ = 0 against H1 : λ 6= 0. Clearly state the sampling distribution of your test
statistic and discuss how to carry out the test.
Hint: You are allowed to make use of the well-known sampling distribution of the suitably nor-
malised estimator s2 (no need to derive).
[5 marks]
(e) Show that the LM test statistic in (d) is equivalent to the more familiar Wald test statistic.
[4 marks]
y = Xβ + ε, where E(ε|X) 6= 0,
where y is an (n×1) vector of the dependent variable and X is a full column rank (n×k) dimensional
matrix that contains exogenous and endogenous regressors. Let Z be an (n × L) dimensional matrix
that contains instruments, L ≥ k.
n
We assume that {(yi , xi , zi )}i=1 forms a random sample.
(a) Provide and discuss the requirements on our instruments that are needed for identification and
will permit us to obtain a consistent estimator for β. Distinguish in your discussion the concepts
of underidentification, just identified and overidentification.
[3 marks]
(b) Suppose our equation is just identified. Derive the IV estimator for β, β̂IV .
[3 marks]
(c) Suppose our equation is overidentified.
(i) For this setting, we considered the 2SLS estimator for β, β̂2SLS . Consider the following
estimation method: 1) regress y on Z and collect the fitted values ỹ; 2) regress each column
of X on Z and collect fitted values X̃; 3) regress ỹ on X̃. Show that the resulting estimator
from 3) is the same as the 2SLS estimator.
[3 marks]
We are advised that there is evidence of heteroskedasticity such that V ar(εi |zi ) = σ 2 ωi2 , where
ωi2 = h(zi ) for some known function h(·). In matrix form, V ar (ε|Z) = σ 2 Ω, where Ωii = ωi2
and Ωij = 0 for i 6= j.
Give an intuitive explanation for the form of the criterion using (c.ii).
[3 marks]
(iv) Derive the optimal IV estimator for β.
[3.33 marks]
(v) Show that the optimal IV estimator for β equals β̂2SLS , when we do not have heteroskedas-
ticity.
[2 marks]
(d) Suppose our equation is just identified, show that the optimal IV estimator for β that recognizes
the presence of heteroskedasticity equals the β̂IV estimator derived in (b). Interpret this finding.
[3 marks]
y = x1 α1 + X2 α2 + ε,
where y is a binary variable. In particular, let y = 1 if the application for a mortgage was denied, and
y = 0 if an application for a mortgage was successful.
The matrix X = [x1 : X2 ] is full column rank and E(ε|X) = 0. We obtain a random sample
{(yi , xi1 , x0i2 ) , i = 1, .., n}.
The parameter of interest is α1 , where in our example x1 is an indicator of the individual belongs
to an ethnic minority group and X2 are other characteristics that a loan officer might legally con-
sider when deciding on a mortgage application. We will restrict our attention to the variables: pirat
(ratio of total monthly debt payment to total monthly income), lvrat_med and lvrat_high (dummy
variables indicating whether the loan-to-value ratio is intermediate or high, with the excluded dummy
being low), and a consumer credit score, chist (which ranges from 1 to 5, where 5 is the worst rating),
and hschool (dummy indicating completion of high school). We have a random sample of 2,380 ob-
servations. There are 285 denied applications, the average of pirat equals 0.33, and the average of
chist equals 2.12.
(a) Discuss how we can obtain the OLS estimator for α1 using a residual based regression. Clearly
define any notation you introduce and provide a clear interpretation of α1 in this model.
[5 marks]
(b) The linear probability is known to exhibit heteroskedasticity. Prove that the estimator derived in
(a) for α1 remains consistent in the presence of heteroskedasticity.
Hint: you are expected to use the formula for α1 provided in (a) in your answer to this question.
[5 marks]
(c) You want to test the hypothesis H0 : α1 = 0 against H1 : α1 > 0. Discuss how you can test
this hypothesis. In your answer indicate how you can obtain the standard error for α1 .
[5 marks]
The loan officer considers a behavioural model underlying the riskiness associated with each mort-
gage application. In particular, he considers the model
y ∗ = x1 δ0 + X2 δ1 + u,
where y ∗ represents the risk associated with the mortgage application. The loan officer will decide to
deny the application (y = 1) if
y ∗ > 0.
The loan officer is concerned about the presence of heteroskedasticity in the errors, and proceeds
to estimate a probit model that permits heteroskedasticity, such that V ar(ui |xi ) = exp(x0i2 γ)2 . The
log-likelihood is given by
n
( ! " !#)
X xi1 δ0 + x0i2 δ1 xi1 δ0 + x0i2 δ1
log L (δ0 , δ1 , γ) = yi log Φ + (1 − yi ) log 1 − Φ .
exp x0i2 γ exp x0i2 γ
i=1
Ignoring the presence of heteroskedasticity is serious here, as the usual probit parameter estimator for
(δ0 , δ1 ) will be inconsistent in the presence of heteroskedasticity. In Stata we can use the command
hetprobit to estimate the above model.
END OF PAPER