Professional Documents
Culture Documents
2. You work for a policy institution and your boss has asked you to analyze market structure in the
US. Given your vast knowledge of economics and the data that you have available, you have come
up with the following system of three simultaneous equations that describe the market structure of
US industries:
Analyze whether you can identify the system as it is or not. If you cannot identify the whole system,
propose a solution that will solve the issue.
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
3. You work for the Minister of the Economy, and he has asked you to do some study on the
exporting activity of your country’s firms. He would like to know whether export subsidies that were
given to firms last year were effective. You have data on 200 firms (assume it’s a representative
sample of the population). The data consist of the following variables:
Exporta: A 1 if the firm exported during the last year, a 0 if not.
Crecim: Average growth rate (of assets) of the firm during the last three years.
Tecnologia: A 1 if the firm is in a technology intensive sector, a 0 if not.
Subvencion: A 1 if the firm received an export subsidy last year, a 0 if not.
Robust
exporta Coef. Std. Err. t P>|t| [95% Conf. Interval]
1) Interpret the value of the β1, β2 and β4 parameters and comment on their statistical significance.
2) Use the model to estimate the probability that the following firms will export:
- Firm A: A firm from a non-technology sector that received a subsidy and grew 3% on average
during the last three years.
- Firm B: A firm from a non-technology sector that did not receive a subsidy and grew 14% on
average during the last three years.
- Firm C: A firm from a technology sector that received a subsidy and grew -6% during the last three
years.
There are two different commands for probit estimation, so we show the results of both:
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
Robust
exporta Coef. Std. Err. z P>|z| [95% Conf. Interval]
Robust
exporta dF/dx Std. Err. z P>|z| x-bar [ 95% C.I. ]
obs. P .33
pred. P .1105265 (at x-bar)
3) Use the above results (however you think appropriate) to interpret the effect of crecim, tecnologia
and inter.
You ask Stata for the classification table (cutoff 0.5) and get:
Probit model for exporta
True
Classified D ~D Total
+ 52 14 66
- 14 120 134
4) Find the ratio of correctly classified observations. Are you improving over the “naive” (no-model)
predictions? Which are you identifying better, the exporting companies or the non-exporting
companies?
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
4. You have to estimate the effects of education on wages, and you are interested in estimating an
equation that looks like:
b) You now have collected data for a sample of people and run the above model by using simple
OLS regression (with traditional standard errors). Then your boss tells you that your data have six
possible “characteristics” which may affect the validity of your analysis. For each of the
characteristics 1-6 below (taken independently):
- Give a technical name to the problem they may cause.
- Describe how the problem will affect the results of the estimation (i.e. whether it affects the
properties of the parameter estimates, the standard errors, etc.).
- Identify and briefly describe what test you could use (if any exists!) to detect whether indeed that
problem is present.
- Describe how you can solve the problem.
1) “It is well known that people with more education are more intelligent to begin with, and
intelligence affects wages, but you do not have a measure of ‘intelligence’ in your model”.
2) “The dispersion of wages is much bigger for people in executive positions than for people in non-
executive positions”.
3) “The return of education (impact of education on wages) is likely to be different for males and for
females”.
4) “The people in your sample come from the customer database of a company that sells software
products”.
5) “Total working experience and tenure in a company are variables that are very much related”
6) “Education and wages are two ‘characteristics’ of a person that are the joint result of common
factors such as the education of parents, etc…”.
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
5. You work for an investment bank and your boss has asked you to analyze investment decisions by
investors. Given your knowledge of economics and finance you have come up with a system of four
simultaneous equations that describe the way investors decide on their investments (assume that the
equations are "believable", that is, do not try to make sense out of the equation):
Analyze whether you can identify the system as it is or not. If you cannot identify the whole system,
propose a solution that will solve the issue.
6. You have to estimate the determinants of firm profitability, and you are interested in estimating an
equation that looks like:
You have data for a sample of companies and run the above model by using simple OLS regression
(with traditional standard errors). Then your boss tells you that your data have five possible
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
“characteristics” which may affect the validity of your analysis. For each of the characteristics 1-5
below (taken independently):
a) Give a technical name to the problem they may cause.
b) Describe how the problem will affect the results of the estimation (i.e. whether it affects the
properties of the parameter estimates, the standard errors, etc.).
c) Identify and briefly describe what test you could use (if any exists!) to detect whether indeed that
problem is present.
d) Describe how you can solve the problem.
1) “It is well known that for companies that are larger in size the dispersion of ROA is bigger”.
2) “Volatility of sector output and growth in the company’s assets tend to be very much related”.
3) “The bigger that investment is, the less it seems to affect ROA”.
4) “You have used a database that has information only about companies that are currently
operational”.
5) “The determinants of asset growth and of return on assets (ROA) are quite similar”.
7. The government is planning to subsidize language courses for immigrants since not knowing the
language limits the access to good jobs. In order to understand the potential impact of this policy,
you are required to estimate the effect of speaking Spanish on the wages that immigrants (from non-
Spanish-speaking countries) earn in Spain. The National Survey on Immigrants contains data
collected in 2007on wages and language proficiency (and of other characteristics, such as age,
education, gender, etc) for a random sample of immigrants.
a) You first estimate a linear regression where the regressor (Span) is a binary indicator of language
proficiency (1 if the person speaks fluent Spanish, 0 otherwise) and the dependent variable (wage)
is the net monthly wage (see the results below). What is the estimated effect of speaking fluent
Spanish on wages? Comment on the coefficient and interpret the test on significance of the effect.
. reg wage Span
------------------------------------------------------------------------------
wage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
Span | 177.757 24.90259 7.14 0.000 128.9358 226.5782
_cons | 988.6795 20.69451 47.77 0.000 948.1082 1029.251
------------------------------------------------------------------------------
b) You then extend the model to include additional regressors: age (in years), woman (a dummy
which takes value 1 for female immigrants), and a binary variable (univ) which is 1 if the
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
immigrant has a university degree. Use the results (below) to predict the salary of a 20-year old
female immigrant who speaks fluent Spanish but has no university degree.
------------------------------------------------------------------------------
wage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
Span | 144.7333 23.26845 6.22 0.000 99.11577 190.3509
age | 10.92333 1.081844 10.10 0.000 8.802389 13.04428
woman | -445.0152 21.61999 -20.58 0.000 -487.401 -402.6295
univ | 468.8543 26.5885 17.63 0.000 416.7278 520.9808
_cons | 704.486 43.78279 16.09 0.000 618.6503 790.3217
------------------------------------------------------------------------------
c) Someone tells you that Span is probably endogenous. Explain why it is reasonable to expect that
Span (speaking fluent Spanish) is endogenous to wages.
d) You believe that the age at which the immigrant arrived in Spain could be a valid instrument for
Span (speaking fluent Spanish). Explain why the age of arrival may be a valid instrument.
e) The following table shows the results of estimating the above regression by 2SLS (the instrument
arrive is a dummy which takes value 1 if the person was less than 14 years old when he/she arrived
in Spain and a 0 otherwise). Is the instrument weak? Explain.
First-stage regressions
-----------------------
------------------------------------------------------------------------------
Span | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
age | -.0016024 .0006855 -2.34 0.019 -.0029464 -.0002585
woman | .0405692 .0136925 2.96 0.003 .0137252 .0674132
univ | .1723813 .0166719 10.34 0.000 .1396963 .2050663
arrive | .2672933 .0288442 9.27 0.000 .2107444 .3238421
_cons | .6790657 .025861 26.26 0.000 .6283654 .729766
------------------------------------------------------------------------------
------------------------------------------------------------------------------
wage | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
ECONOMETRICS / QUANTITATIVE AND STATISTICAL METHODS I
g) Compare the estimated effect of Span on wages in the OLS and 2SLS regressions. Why are they
so different?