(IIA)
1
The independence assumption follows from the
initial assumption that the disturbances are
independent and homoscedastic.
Example:
The odds of PhD students choosing mango juice
over orange juice at breakfast are independent
of the availability or attributes of other
alternatives such as pineapple or apple juice.
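The juice example can be checked numerically. The sketch below assumes a multinomial logit model with made-up deterministic utilities (the numbers and the helper name `mlogit_probs` are illustrative only, not taken from the slides) and compares the mango/orange odds with and without the extra alternatives.

```python
import math

def mlogit_probs(utilities):
    """Multinomial logit probabilities from a dict of deterministic utilities."""
    denom = sum(math.exp(v) for v in utilities.values())
    return {name: math.exp(v) / denom for name, v in utilities.items()}

# Hypothetical utilities, chosen only for illustration.
two = mlogit_probs({"mango": 1.0, "orange": 0.4})
four = mlogit_probs({"mango": 1.0, "orange": 0.4,
                     "pineapple": 0.9, "apple": -0.2})

# Odds of mango over orange in each choice set.
odds_two = two["mango"] / two["orange"]
odds_four = four["mango"] / four["orange"]
print(odds_two, odds_four)
```

Under these assumptions both odds equal exp(1.0 - 0.4), so adding pineapple and apple juice lowers each probability but leaves their ratio unchanged, which is the IIA property.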
1
McFadden (1987) shows how this hypothesis can also be tested using a Lagrange multiplier test.
Conditional logit (clogit) vs Mlogit (Wooldridge,
2002)
The Multinomial Probit (Mprobit) Model
The mprobit model is frequently motivated using
a latent-variable framework. The latent variable
for the jth alternative, j = 1, …, J, is
$$\eta_{ij} = z_i \alpha_j + \xi_{ij} \qquad (37)$$
where the 1×q row vector $z_i$ contains the observed
independent variables (regressors) for the ith
decision maker. Associated with $z_i$ are the J
vectors of regression coefficients $\alpha_j$. The
$\xi_{i,1}, \ldots, \xi_{i,J}$ are distributed independently and
identically standard normal. The decision maker
chooses the alternative k such that
$\eta_{ik} \ge \eta_{im}$ for $m \ne k$.
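The latent-variable rule can be illustrated by simulation. The following is a minimal sketch, assuming each latent variable is the regressor vector times an alternative-specific coefficient vector plus an independent standard normal draw, with the decision maker picking the largest. The function names, coefficient values, and draw count are hypothetical.

```python
import random

def simulate_choice(z, alphas, rng):
    """One draw: index of the alternative with the largest latent variable."""
    etas = [sum(zq * aq for zq, aq in zip(z, alpha)) + rng.gauss(0.0, 1.0)
            for alpha in alphas]
    return max(range(len(etas)), key=etas.__getitem__)

def choice_frequencies(z, alphas, draws=20000, seed=0):
    """Monte Carlo estimate of the choice probabilities for one decision maker."""
    rng = random.Random(seed)
    counts = [0] * len(alphas)
    for _ in range(draws):
        counts[simulate_choice(z, alphas, rng)] += 1
    return [c / draws for c in counts]

# Hypothetical example: one regressor, three alternatives.
equal = choice_frequencies([1.0], [[0.0], [0.0], [0.0]])
ranked = choice_frequencies([1.0], [[0.0], [1.0], [2.0]])
print(equal, ranked)
```

With identical coefficient vectors the simulated shares should be close to one third each. With increasing coefficients the last alternative should be chosen most often.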
The probability that alternative k is chosen is;
Denote the deterministic part of the model as
$\lambda_{ij'} = z_i \gamma_{j'}$; the probability that individual i
chooses outcome k is;
2
Dunnett, C. W. (1989) Algorithm AS 251: Multivariate normal probability integrals with product correlation
structure. Journal of the Royal Statistical Society, Series C 38: 564-579.
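quadrature is used to approximate the above
single dimension integral.
This results in the following K-point quadrature
formula:
$$\Pr(y_i = k) \approx \frac{1}{2} \sum_{k=1}^{K} w_k \left\{ \prod_{j=1}^{J-1} \Phi\left(\sqrt{2 x_k} - \lambda_{ij}\right) + \prod_{j=1}^{J-1} \Phi\left(-\sqrt{2 x_k} - \lambda_{ij}\right) \right\} \qquad (43)$$
where $w_k$ and $x_k$ are the weights and roots of
the Laguerre polynomial of order K.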
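To see what the single-dimension integral evaluates to, the sketch below integrates it directly. It is not the Laguerre rule described above: it swaps in a plain composite Simpson rule on a truncated range. It assumes that the chosen alternative's disturbance is integrated out, that the other J-1 differenced latent variables have deterministic parts `lams`, and that all disturbances are independent standard normal. The function names and integration limits are hypothetical.

```python
import math

def std_normal_cdf(t):
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def std_normal_pdf(t):
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)

def mprobit_choice_prob(lams, lo=-8.0, hi=8.0, panels=400):
    """Integral of phi(t) * prod_j Phi(t - lam_j) dt by composite Simpson.

    lams holds the deterministic parts of the J-1 differenced latent
    variables; panels must be even.
    """
    def integrand(t):
        prod = 1.0
        for lam in lams:
            prod *= std_normal_cdf(t - lam)
        return std_normal_pdf(t) * prod

    h = (hi - lo) / panels
    total = integrand(lo) + integrand(hi)
    for i in range(1, panels):
        total += (4 if i % 2 else 2) * integrand(lo + i * h)
    return total * h / 3.0

p_equal = mprobit_choice_prob([0.0, 0.0])   # three equally attractive outcomes
p_binary = mprobit_choice_prob([0.5])       # two outcomes
print(p_equal, p_binary)
```

Two sanity checks follow from the model itself: with three equally attractive outcomes the probability is 1/3, and with two outcomes it reduces to the standard normal CDF evaluated at -0.5/√2, because the difference of two independent standard normals has variance 2.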
Identification
In eq. (38), not all J of the $\alpha_j$ are identifiable.
To remove the indeterminacy, $\alpha_l$ is set to the
zero vector, where l is the base outcome. That
fixes the lth latent variable to zero so that the
remaining variables measure the attractiveness
of the other alternatives relative to the base.
choice not only of labour force participation,
but also of the choice of whether a participant
chooses to work part-time or full-time. Another
example is the responses to a questionnaire
that elicit a series of satisfaction rankings
from taste surveys.
$$\Pr(y_i = m \mid x_i) = \Pr(\mu_{m-1} < y_i^* \le \mu_m) = \Pr(\mu_{m-1} < x_i'\beta + u_i \le \mu_m)$$
One can evaluate this probability for any
combination of parameters $\{\beta, \mu\}$.
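As an illustration of evaluating this probability, the sketch below assumes the disturbance is standard normal (the ordered probit case) and uses the usual convention that the lowest and highest thresholds are minus and plus infinity. The function names and the numbers are hypothetical.

```python
import math

def std_normal_cdf(t):
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def ordered_probs(xb, cutpoints):
    """Pr(y = m), m = 1..M, given the index xb = x'beta and the M-1
    interior thresholds in increasing order."""
    mus = [-math.inf] + list(cutpoints) + [math.inf]
    return [std_normal_cdf(mus[m] - xb) - std_normal_cdf(mus[m - 1] - xb)
            for m in range(1, len(mus))]

# Hypothetical values: index 0.3 and thresholds -0.5 and 1.0 (M = 3).
probs = ordered_probs(0.3, [-0.5, 1.0])
even_split = ordered_probs(0.0, [0.0])
print(probs, even_split)
```

The category probabilities always sum to one, and a single threshold at zero with a zero index splits the mass evenly between the two categories.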
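$$L = \prod_{i=1}^{n} \prod_{m=1}^{M} \left[\Phi(\mu_m - x_i'\beta) - \Phi(\mu_{m-1} - x_i'\beta)\right]^{z_{im}}$$
Finally, taking logarithms, we come to the
log-likelihood function which can be used to
estimate the ML coefficients:
$$\ln L = \sum_{i=1}^{n} \sum_{m=1}^{M} z_{im} \ln\left[\Phi(\mu_m - x_i'\beta) - \Phi(\mu_{m-1} - x_i'\beta)\right].$$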
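Any gradient-based ML solution routine will be
able to calculate those parameters $\{\tilde{\beta}, \tilde{\mu}\}$
which maximise the sample likelihood, satisfying
the following FOCs:
$$\frac{\partial \ln L}{\partial \beta} = 0 \quad \text{and} \quad \frac{\partial \ln L}{\partial \mu} = 0.$$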
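The log-likelihood and its first-order conditions can be sketched in a few lines. The code below assumes standard normal disturbances, categories coded 1..M, and an indicator equal to one only for the observed category, so that each observation contributes a single log term. The toy sample, the function names, and the central-difference gradient are all hypothetical and stand in for a real optimiser.

```python
import math

def std_normal_cdf(t):
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def ordered_loglik(beta, cutpoints, X, y):
    """Sum over observations of ln[Phi(mu_y - x'beta) - Phi(mu_{y-1} - x'beta)]."""
    mus = [-math.inf] + list(cutpoints) + [math.inf]
    total = 0.0
    for x_i, y_i in zip(X, y):
        xb = sum(b * v for b, v in zip(beta, x_i))
        total += math.log(std_normal_cdf(mus[y_i] - xb)
                          - std_normal_cdf(mus[y_i - 1] - xb))
    return total

def numeric_gradient(f, params, h=1e-6):
    """Central-difference approximation to the gradient of f at params."""
    grad = []
    for idx in range(len(params)):
        up, down = list(params), list(params)
        up[idx] += h
        down[idx] -= h
        grad.append((f(up) - f(down)) / (2.0 * h))
    return grad

# Hypothetical toy sample: one regressor, M = 2, constructed so that the
# maximum sits at beta = 0 and threshold = 0.
X = [[-1.0], [1.0], [-1.0], [1.0]]
y = [1, 2, 2, 1]
objective = lambda p: ordered_loglik(p[:1], p[1:], X, y)

ll_at_zero = objective([0.0, 0.0])
grad_at_zero = numeric_gradient(objective, [0.0, 0.0])
print(ll_at_zero, grad_at_zero)
```

For this symmetric sample the gradient vanishes at the origin, so both first-order conditions hold there, and moving either parameter away from zero lowers the log-likelihood.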
model is based on generalised extreme value
(GEV) distribution and it generalises the mlogit
model to a nested multinomial logit model.
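$$F(\varepsilon) = \exp\left[-G\left(e^{-\varepsilon_{11}}, \ldots, e^{-\varepsilon_{1J_1}}; \ldots; e^{-\varepsilon_{I1}}, \ldots, e^{-\varepsilon_{IJ_I}}\right)\right]$$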
for the following particular specification of the
function G(.);
4
McFadden, D. (1978) Modelling the choice of residential location. In Spatial Interaction Theory and Planning Models,
pp. 75-96, A. Karlqvist, L. Lundqvist, F. Snickars and J. W. Weibull (Eds.), Amsterdam, New York: North-Holland.
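$$G(Y) = G(Y_{11}, \ldots, Y_{1J_1}, \ldots, Y_{I1}, \ldots, Y_{IJ_I}) = \sum_{i=1}^{I} \left( \sum_{j=1}^{J_i} Y_{ij}^{1/\tau_i} \right)^{\tau_i}$$
In fact
$$\tau_i = \sqrt{1 - \mathrm{Cor}[\varepsilon_{ij}, \varepsilon_{ik}]}$$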
Closed form solutions for this probability can be
derived as a function of $V_{ij}$ and $\tau_i$ (see Cameron
and Trivedi, 2006; p. 526).
The level or state at which a decision is made.
First-level decisions are made first, followed by
second-level decisions, and so on.
Bottom level
Alternative set
Alternative
The model
5
McFadden, D. (1977) Quantitative methods for analysing behaviour of individuals: some recent developments.
Cowles Foundation Discussion Paper no. 474.
McFadden, D. (1981) Econometric models of probabilistic choice. In
Structural Analysis of Discrete Data with Econometric Applications, pp. 198-272. Cambridge, MA: MIT Press.
6
Amemiya (1985) Advanced Econometrics, Cambridge, MA: Harvard University Press.
Let $x_{ij}$ and $y_i$ refer to the vectors of
explanatory variables specific to categories
(i, j) and (i), respectively.
$$\Pr_{j|i} = \frac{\exp(x_{ij}\beta)}{\sum_k \exp(x_{ik}\beta)}$$
7
The nlogit command in STATA 9 allows you to apply linear constraints on the inclusive-value parameters. You can
constrain inclusive-value parameters to, say, be equal to each other; or specify fixed values rather than allowing
these parameters to be freely estimated. To estimate the model, you need to specify and display the tree structure of
the nested logit model using nlogitgen and nlogittree. The former helps you to generate a categorical variable that
identifies the first-level set of alternatives.
$$I_i = \ln\left\{ \sum_k \exp(x_{ik}\beta) \right\}$$
then
$$\Pr_i = \frac{\exp(y_i\alpha + \tau_i I_i)}{\sum_m \exp(y_m\alpha + \tau_m I_m)}$$
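The two-level structure can be sketched as follows: conditional probabilities within each first-level category, an inclusive value per category, and first-level probabilities that weight each inclusive value by its inclusive-value parameter. The code works on precomputed linear indices (`xb[i][j]` for the bottom level, `ya[i]` for the first level) and on inclusive-value parameters `tau[i]`. These names and all numbers are hypothetical.

```python
import math

def nested_logit_probs(xb, ya, tau):
    """Joint probabilities Pr(i, j) = Pr(i) * Pr(j | i) for a two-level tree.

    xb[i][j]: bottom-level index of alternative j inside category i
    ya[i]:    first-level index of category i
    tau[i]:   inclusive-value parameter of category i
    """
    sums = [sum(math.exp(v) for v in row) for row in xb]
    incl = [math.log(s) for s in sums]                    # inclusive values
    top = [math.exp(ya[i] + tau[i] * incl[i]) for i in range(len(xb))]
    top_total = sum(top)
    return [[(top[i] / top_total) * (math.exp(v) / sums[i]) for v in row]
            for i, row in enumerate(xb)]

# Hypothetical tree: category 0 has two alternatives, category 1 has three.
xb = [[0.2, -0.1], [0.5, 0.0, -0.4]]
ya = [0.3, -0.2]

nested = nested_logit_probs(xb, ya, tau=[0.6, 0.8])
flat = nested_logit_probs(xb, ya, tau=[1.0, 1.0])

# Conditional logit on the summed indices, for comparison with tau = 1.
denom = sum(math.exp(ya[i] + v) for i, row in enumerate(xb) for v in row)
clogit = [[math.exp(ya[i] + v) / denom for v in row] for i, row in enumerate(xb)]
print(nested, flat, clogit)
```

Setting every inclusive-value parameter to one makes the nested probabilities coincide with a conditional logit on the summed indices, which gives a quick check that the tree has been coded consistently.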
Three-level nlogit model
Following Greene (2003), we index the first-level
alternatives as i, the second-level alternatives as
j, and the bottom-level alternatives as k.
Let $x_{ijk}$, $y_{ij}$, and $z_i$ refer to row vectors of
explanatory variables specific to categories
(i, j, k), (i, j), and (i), respectively.
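$$\Pr_{k|ij} = \frac{\exp(x_{ijk}\beta)}{\sum_n \exp(x_{ijn}\beta)}$$
With the inclusive value $I_{ij} = \ln\{\sum_n \exp(x_{ijn}\beta)\}$,
$$\Pr_{j|i} = \frac{\exp(y_{ij}\alpha + \tau_{ij} I_{ij})}{\sum_m \exp(y_{im}\alpha + \tau_{im} I_{im})}$$
With $J_i = \ln\{\sum_m \exp(y_{im}\alpha + \tau_{im} I_{im})\}$,
then
$$\Pr_i = \frac{\exp(z_i\gamma + \tau_i J_i)}{\sum_l \exp(z_l\gamma + \tau_l J_l)}$$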
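If we restrict all the $\tau_{ij}$ and $\tau_i$ to be 1, we
recover the clogit model of the following form:
$$\Pr_{ijk} = \frac{\exp(V_{ijk})}{\sum_l \sum_m \sum_n \exp(V_{lmn})}$$
where $V_{ijk} = x_{ijk}\beta + y_{ij}\alpha + z_i\gamma$.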
8
Note that STATA 9 uses FIML to fit the model.
positive outcome in group g, the log likelihood
of the nlogit model is;
$$\ln L = \sum_g \ln(\Pr_{ijkg})$$