Professional Documents
Culture Documents
Khoirunurrofik
Topics
Week Topic
14 Presentation
(Des 9 : 8.00)
Grading Policy
• QUIZ = 10
• Paper +Presentation =40
Agenda
• Linear Probability Model
• Probit Model
• Logit Model
Introduction
• In particular, researchers analyzing consumer choice
often must cope with dummy dependent variables
(also called qualitative dependent variables).
• For example, how do high school students decide
whether to go to college? What distinguishes Pepsi
drinkers from Coke drinkers? How can we convince
people to use public transportation instead of
driving?
• For an econometric study of these topics, or of any
topic that involves a discrete choice of some sort, the
dependent variable is typically a dummy variable.
Models with Binary Dependent Variables
• Many of the choices that individuals and firms make
are ‘‘either–or’’ in nature.
• A high school graduate decides either to attend college or not.
• Aworker decides either to drive to work or to get there using a
different means of transportation.
• A household decides either to purchase a house or to rent.
• A firm decides either to advertise its product in a local newspaper
or it decides not to.
public transportation is P y = 0 = 1 − .p
• There are other factors that affect the decision, but let us focus on this
single explanatory variable.
• A priori we expect that as x increases, and commuting time by bus
increases relative to commuting time by car, an individual would be more
inclined to drive.
• That is, we expect a positive relationship between x and p, the probability
that an individual will drive to work.
Linear Probability Model
Break the dependent variable into
y = E ( y) + e = p + e fixed and random parts
• One problem with the linear probability model is that the error
term is heteroskedastic; the variance of the error term e varies
from one observation to another
y value e value Probability
1 1 − ( 1 + 2 x ) p = 1 + 2 x
0 − ( 1 + 2 x ) 1 − p = 1 − ( 1 + 2 x )
Linear Probability Model
var ( e ) = ( 1 + 2 x )(1 − 1 − 2 x )
p̂ = b1 + b2 x
dp
= 2
dx
Problems:
• It implies marginal effects of changes in continuous explanatory variables
are constant, which cannot be the case for a probability model.
• This feature also can result in predicted probabilities outside the [0, 1]
interval.
• The linear probability model error term is hetero- skedastic, so that a
better estimator is generalized least squares
• R2 is usually very poor and a questionable guide for goodness of fit
Problem of LPM
1. R2 is not an accurate measure of overall fit.
For models with a dummy dependent
variable, R2 tells us very little about how well
the model explains the choices of the
decision makers.
p̂ = b1 + b2 x
auto Coef. Std.
dtime 0.0703099
_cons 0.4847951
d p
5 0.8363446
10 1.1878941
The average time travel differential is DTIME = 0:1224 (1.2 minutes), and
for this value the marginal effect of a 10-minute increase in the time travel
differential is 0.1191.
When the mean difference in travel time is near zero, the effect of a
change in travel time difference is greater.
Average Marginal Effect
Average Marginal Effect
Rather than evaluate the marginal effect at a specific value, or the mean
value, the average marginal effect (AME) is often considered
Logit Model
• Probit model estimation is numerically complicated
because it is based on the normal distribution.
• A frequently used alternative to the probit model for
binary choice situations is the logit model.
• These models differ only in the particular S-shaped
curve used to constrain probabilities to the [0, 1]
interval.
• If L is a logistic random variable, then its probability
density function is
Differences between logit and probit
probabilities
Logit Model
• The cumulative distribution function for a logistic
random variable is
• After estimating a logit model, the regular hypothesis testing analysis can
be undertaken using the z-statistics obtained.
• However, the interpretation of the coefficients is totally different from
that of regular OLS.
• Given this, the coefficient β2 obtained from a logit model estimation
shows the change in Li = ln(Pi/(1 − Pi)) for a unit change in X, which has
no particular meaning.
Interpretation of the estimates in logit models :
Calculate the change in average Dˆi:
• To do this, first insert the mean values of all the explanatory variables
into the estimated logit equation and calculate the average Dˆ i.
• Then recalculate, but now increasing the value of the explanatory variable
under examination by one unit to obtain the new average Dˆi.
• The difference between the two Dˆis obtained shows the impact of a one-
unit increase in that explanatory variable on the probability that Di = 1
(keeping all other explanatory variables constant).
• This approach should be used cautiously when one or more of the
explanatory variables is also a dummy variable (for example how can
anyone define the average of gender?). When dummies of this kind (for
example gender) exist in the equation, the methodology used is to
calculate first the impact for an ‘average male’ and then the impact for an
‘average female’ (by setting the dummy explanatory variable first equal to
one and then equal to zero) and comparing the two results.
Interpretation of the estimates in logit models : Take the
partial derivative
Interpretation of the estimates in logit models :
Multiply the obtained βj coefficients by 0.25