Logit Models:
Nonstop fun for the whole family
L9
“Gee dad, after dinner will you show us how to do logit models?”
“Hey I want to know about ordered Logit Models”
“Dad, I want to learn about Multinomial Logit models!!!”
Logistic regression
• A version of the multiple regression in which the outcome (i.e., the DV) is a categorical variable.
– If the # of categories = 2, then it is called binary logistic regression (or simply a logit model)
– If the # of categories > 2, then it is called multinomial logistic regression (or multinomial logit model)
• LR: Predict the probability of Y occurring given X…
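As a sketch of what "predict the probability of Y occurring given X" means in the binary case, the standard logistic form can be written as below. The symbols \beta_0 and \beta_1 are generic placeholders for an intercept and a single slope. They are assumptions for illustration, not notation taken from the slides.

```latex
P(Y = 1 \mid X) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 X)}}
\qquad\Longleftrightarrow\qquad
\ln\!\left(\frac{P(Y = 1 \mid X)}{1 - P(Y = 1 \mid X)}\right) = \beta_0 + \beta_1 X
```

The right-hand form is why logit coefficients are read on the log-odds scale.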
3/31/2023
Gee dad, how do I interpret the sign?
• If reporting coefficients
– A positive coefficient increases the probability or likelihood of Y = 1.
– A negative coefficient decreases the likelihood of Y = 1.
Odds Ratios versus Coefficients
Odds Ratio      Coefficient, Ln(Odds Ratio)
1.002267        0.00226
2.234545        0.80404
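The link between the two columns is just exp() and ln(). A minimal Python sketch, using the two coefficients from the table (the variable names are illustrative only):

```python
import math

# Coefficients (log-odds scale) copied from the table above.
coefficients = [0.00226, 0.80404]

# Odds ratio = exp(coefficient); equivalently, coefficient = ln(odds ratio).
odds_ratios = [math.exp(b) for b in coefficients]

for b, o in zip(coefficients, odds_ratios):
    print(f"coefficient {b:.5f} -> odds ratio {o:.4f}")
```

Taking math.log() of each odds ratio recovers the coefficient, so the two ways of reporting carry the same information.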
Dichotomous DV (Two choices/categories)
• DV 0-1 dummy variable
– Can’t have values other than 0 or 1 (e.g., it is a buy or no-buy decision)
• Logit model: uses the logistic distribution
• Probit model: uses the standard normal distribution
– They tend to produce similar results
– Heteroskedasticity causes major problems for Logit and Probit models
– Use probit models for sample selection (Heckman models)!!!
Bowen & Wiersema (Modeling Limited DVs)
• The interpretation of the directional impact (+ or -) of a change in an explanatory variable in the binary LM or PM is identical to that for OLS, except, …the direction of the effect refers to the change in the probability of the choice for which y = 1.
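To see why logit and probit "tend to produce similar results", the two distributions named on the Dichotomous DV slide can be compared directly. The sketch below is only an illustration. The rescaling factor of 1.7 is an assumed rule-of-thumb value for putting the two curves on a comparable scale, not something taken from the slides.

```python
import math

def logistic_cdf(z):
    """CDF of the standard logistic distribution (used by the logit model)."""
    return 1.0 / (1.0 + math.exp(-z))

def normal_cdf(z):
    """CDF of the standard normal distribution (used by the probit model)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Assumed rescaling factor so the two curves are on a comparable scale.
SCALE = 1.7

grid = [i / 10.0 for i in range(-40, 41)]  # z from -4.0 to 4.0
max_gap = max(abs(logistic_cdf(SCALE * z) - normal_cdf(z)) for z in grid)

print(f"largest gap between the two curves on the grid: {max_gap:.4f}")
```

Both curves are S-shaped and pass through 0.5 at z = 0. After rescaling they stay close across the grid, which is why the choice between the two models rarely changes the substantive conclusions.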
. logit admit gre gpa i.rank, robust

Iteration 0:   log pseudolikelihood = -249.98826
Iteration 1:   log pseudolikelihood = -229.66446
Iteration 2:   log pseudolikelihood = -229.25955
Iteration 3:   log pseudolikelihood = -229.25875
Iteration 4:   log pseudolikelihood = -229.25875

Logistic regression                             Number of obs     =        400
                                                Wald chi2(5)      =      36.66
                                                Prob > chi2       =     0.0000
Log pseudolikelihood = -229.25875               Pseudo R2         =     0.0829

------------------------------------------------------------------------------
             |               Robust
       admit |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         gre |   .0022644   .0011027     2.05   0.040     .0001032    .0044257
         gpa |   .8040377   .3451359     2.33   0.020     .1275838    1.480492
             |
        rank |
          2  |  -.6754429   .3144686    -2.15   0.032     -1.29179   -.0590958
          3  |  -1.340204   .3445257    -3.89   0.000    -2.015462   -.6649459
          4  |  -1.551464   .4160544    -3.73   0.000    -2.366915   -.7360121
             |
       _cons |  -3.989979   1.138089    -3.51   0.000    -6.220593   -1.759366
------------------------------------------------------------------------------

– For every one unit change in gre, the log odds of admission (versus non-admission) increases by 0.002.
– For a one unit increase in gpa, the log odds of being admitted to graduate school increases by 0.804.
– The indicator variables for rank have a slightly different interpretation. For example, having attended an undergraduate institution with rank of 2, versus an institution with a rank of 1, decreases the log odds of admission by 0.675.
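Log odds are hard to picture, so it can help to turn the coefficients into a predicted probability. The sketch below plugs the coefficients from the output above into the logistic function. The applicant (gre = 600, gpa = 3.5) is hypothetical and chosen only for illustration, and the function name is not part of Stata.

```python
import math

# Coefficients copied from the Stata output above (log-odds scale).
coef = {
    "_cons": -3.989979,
    "gre": 0.0022644,
    "gpa": 0.8040377,
    "rank2": -0.6754429,
    "rank3": -1.340204,
    "rank4": -1.551464,
}

def predicted_probability(gre, gpa, rank):
    """Predicted P(admit = 1) for one applicant; rank 1 is the base category."""
    if rank not in (1, 2, 3, 4):
        raise ValueError("rank must be 1, 2, 3, or 4")
    xb = coef["_cons"] + coef["gre"] * gre + coef["gpa"] * gpa
    if rank != 1:
        xb += coef[f"rank{rank}"]  # shift relative to the rank 1 baseline
    return 1.0 / (1.0 + math.exp(-xb))

# Hypothetical applicant, chosen only for illustration.
p_rank1 = predicted_probability(gre=600, gpa=3.5, rank=1)
p_rank2 = predicted_probability(gre=600, gpa=3.5, rank=2)
print(f"rank 1: {p_rank1:.3f}, rank 2: {p_rank2:.3f}")
```

The gap between the two log odds is exactly the rank 2 coefficient (-0.675). The gap in probabilities depends on where the applicant starts, which is why coefficients are described on the log-odds scale.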
Hoetker, 2007
• Since y* is unobserved, we do not know the distribution of the errors, ε
• In order to use maximum likelihood estimation (ML), we need to make some assumption about the distribution of the errors.
• A good (but a bit technical) summary of MLE:
– https://online.stat.psu.edu/stat415/lesson/1/1.2
• For logit models that report the odds ratio
– 1:1 an event is equally likely to occur or not occur (50% prob)
– 2:1 an event is twice as likely to occur as not (66.7% prob)
– The effect of a one unit change in variable X is to change the odds by a factor of exp(Bx)
– Values > 1 increase the odds of the event occurring
– Values < 1 decrease the odds of the event occurring
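The odds figures quoted for odds-ratio reporting (1:1 is 50%, 2:1 is 66.7%) follow from probability = odds / (1 + odds). A small Python sketch of that conversion and of the exp(B) multiplier is given below. The function names and the example coefficient are hypothetical, used only to illustrate the arithmetic.

```python
import math

def odds_to_probability(odds):
    """Convert odds (e.g., 2.0 for 2:1) to a probability."""
    return odds / (1.0 + odds)

def apply_odds_ratio(odds, b, delta_x=1.0):
    """Odds after X changes by delta_x, given a logit coefficient b."""
    return odds * math.exp(b * delta_x)

print(odds_to_probability(1.0))  # 1:1 odds
print(odds_to_probability(2.0))  # 2:1 odds

# Hypothetical coefficient b = ln(2), so a one unit change doubles the odds.
new_odds = apply_odds_ratio(1.0, math.log(2.0))
print(odds_to_probability(new_odds))
```

Doubling the odds from 1:1 moves the probability from one half to two thirds, not to 100%. An odds ratio multiplies the odds, not the probability.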
Dad, “What if you have more than 2 categories?”
• Multinomial logit models
– Use when there are > 2 categories
– Categories
• Ordinal: consisting of ordered categories
– Socio-economic status (e.g., lower, middle, upper class)
– Use ordered logit model (in Stata, ologit command)
• or
• Nominal: consisting of unordered categories
– Favorite ice cream flavor (e.g., vanilla, chocolate, strawberry, manatee poop (FL))
CEO’s preferred flavor of ice cream:
Chocolate, Vanilla, or Strawberry
Manatee Poop… Served at Lickety Split in Englewood FL!!!
Dad, one more question, “What if I have panel data?”
• Logit and Probit
– xtlogit DV IV1 IV2…
– xtprobit DV IV1 IV2…
• Multinomial Logit & Multinomial Probit
– xtmlogit DV IV1 IV2…
– xtmprobit DV IV1 IV2…
Dad, will you talk about Conditional Logit Models?
• “Not today son…We have to save some of the fun for next time”
• “Ok dad”