Professional Documents
Culture Documents
Instructions: Answer the following questions as clearly as you can. Circle your answers. Partial credit
is available, but we cannot give partial credit unless you show your work. You do not need to write out
every detail, but give a clear indication of how you are making your calculations. For problems
with many parts where you use information from earlier in the problem to answer the question, you
can receive full credit if you do the correct work but use the wrong numbers. The clearer you are in
your explanation the more straightforward this part of the grading will be.
You may use any type of hand-held calculator. All other electronic devices are prohibited.
You have collected data on two variables. The x-variable is the number of hours spent studying for an
exam, and the y-variable is the score on the exam. You have 24 observations.
You calculate the following values using Excel in order to calculate the OLS regression by hand.
X 180
i Y 30
( X X ) 37000
i
2
(Y Y ) i
2
42000
( X X )(Y Y ) 27000
i i
Also, in case this didnt make it onto your formula sheet (it was on a practice problem), note that it
can be shown that the Sum of Squares Estimated: SSE 1
2
(X i X )2 .
1) What is X ?
X
X i
180
6
n 30
SY2
(Y Y )
i
2
42000
1448.28
n 1 29
SY2 1448.28
SY2 49.94
n 30
cov( X , Y )
(X i X )(Yi Y )
27000
931.03
n 1 29
1 i
( X X )(Yi Y ) 27000
.7297
( X i X )2 37000
6) Interpret, using words, the meaning of 1 in this particular regression.
As the numbers of hours of study increases by 1, the exam score is predicted to decrease by .
7297 points. I know, this is a strange result to have the exam score go down with studying.
Maybe the class is some funky graduate philosophy course where the more you study the
worse you do. Its not as though I fabricated the example out of thin air. Oh, yeah, I did.
0 Y 1 X 30 (.7297)*6 34.378
9) Suppose one observation (call it observation 0) in the smple is X0=20, Y0=14. What is the value
of 0 ?
2
10) What is the value of S ? That is, the variance of the regression.
SSR=SST-SSE=42000-19702.7=22297.3
SSR 22297.3
Finally, S 796.332
2
n2 28
12) What is the test statistic for a null hypothesis that 1 = -.55?
1 1 .7941 .55
t0 1.23
S .0215
1
13) Will you reject the above null hypothesis at a level of =.10? Why or why not?
No. The critical value is larger than this test statistic (in absolute value). (1.697 is the
critical value, to be precise.)
SSE 19702.7
r2 .469
SST 42000
Part 2 (30 points total)
1) (10 points) Claim: All else equal, as the variance of the independent variable increases, the OLS
estimator 1 becomes more precise.
True
S2i
One version of the formula for the variance of the estimator betahat is S 2
1
As
(n 1) S x2
the variance of X rises, the denominator rises, making the variance of the estimator
beta1hat smaller or equivalently the estimator becomes more precise.
c. Use English to explain the intuition for your answer for a. (4 points)
We are seeking to explain the dependent variable Y with X. The wider the range of
values we have in our dataset for X (and the variance is the measure of this variation of
X), the more information we will have about how X and Y are connected, which
improves our precision). If the support is very narrow, like a pencil tip, the beam of
wood will be very unstable and the beam will tilt one way of another. However if the
support is broad, like a table top, the beam will be stable and not likely to tip. Same
with 1 .
2) (12 points total (3 points each part)) Compare the following two OLS regressions:
Yi 0 1 X i i
i.
1
Yi 0 1 2 X i i
ii. 6
(The + sign distinguishes the fact that the values youd calculate for equation ii may differ from
i.) That is, the values of the Y variable have been multiplied by 1/6 and the value of the X
variables have been multiplied by 2. How do the values of the following terms derived from
equation ii differ from the equivalent terms for equation 1?
(You need to give us some indication of how you arrive at your answers. I recommend answering
these in the order given.)
If a person correctly identifies the direction, but was unable to quantify the amount of
change, that is worth 1.5 points each. A decent but failed attempt to mathematically
determine the change is worth 2 points, assuming they are going in the right direction. If
they do the math right but dont say at the end slope decreases etc that should not be
penalized.
a) 1
S2
1 XY2
( X i X )(Yi Y )
SX ( X i X )2
1 1 1
S 2 ( X i X )(Yi Y ) 1
(2 X i 2 X )( Yi Y )
1 XY 6 6 3 1
S X2 (2 X i 2 X )2 4 ( X i X ) 2 12
b)
0
1 1 1 1 1
0 Y 2 1 X ( Y 2 1 X ) (Y 1 X ) 0
6 6 12 6 6
c) SSR+
1 1 1 1
SSR ( Yi Y ) 2 (Yi Y ) 2 SSR
6 6 36 36
d) R2+
3) (8 points) In the OLS regression of Y on X where the assumptions of the CLRM are met, what is
the expected value of 0 1 X i i ? (You may use the unbiasedness results from class, and
not rederive them. Show your reasoning and be precise.)
E[ 0 1 X i i ] E[ 0 ] E[ 1 X i ] E[i ]
Xi is assumed to be nonrandom in CLRM, and we proved that 0 and 1 are unbiased in
class. So we can rewrite as:
However, we have never actually established in class that E[i ] 0 . It may seem obvious,
but we should be able to rigorously justify the obvious, which is what I want here for the
last two points. It isnt hard, but to get there a student needs to go do roughly the
following. (A student doesnt have to have every single step, but it needs to be clear they
have this idea.)