Professional Documents
Culture Documents
Lec2 Demand Berry1994 Sp15
Lec2 Demand Berry1994 Sp15
Cornell
Spring 2015
Uij ≥ Uik , ∀k 6= j
Like all discrete choice models, only differences matter: if every Vij is
shifted up or down by a constant, the choice does not change.
Typically we normalize the observed utility of one choice to 0
(Vi0 = 0).
Scales do not matter either. Dividing both sides of the inequality by a
positive constant does not change the inequality.
The standard solution is to normalize the variance of the error terms.
Note that the choice probability is a J dimensional integral in the
most unrestricted form, which indicates the level of computational
complexity.
Panle Jia Barwick (Cornell) Discrete-Choice Berry94 Spring 2015 6 / 50
Logit models
Let sj denote the market share of product j (omitting the subscript i):
Z
sj = I (εk − εj ≤ Vj − Vk , ∀k 6= j )f (ε)d ε
Z
−(Vj −Vk +εj ) − εj
= ∏ e −e ∗ e − εj ∗ e −e d εj
k 6 =j
e Vk
− ∑ k 6 =j ∗ e − εj
Z
− εj
∗ e − εj ∗ e −e
V
= e e j d εj
e Vk
− ∑ k 6 =j +1 ∗ e − εj
Z
− εj V
= e ∗e e j d εj
e Vk
Z
= exp(−εj − e −εj ∗ (∑ ))d εj
k e Vj
V
Let e λj = ∑k e Vkj . Using the change of variables (replacing d εj with
e
d (εj − λj )), we have:
Z
sj = exp(−εj − e −εj ∗ e λj )d εj
Z
−(εj −λj )
= e − λj ∗ e −(εj −λj ) e −e d ( εj − λj )
e Vj
= e − λj = (1)
∑ k e Vk
where ‘Xj ’ is the vector of product attributes, and ‘pj ’ is product j’s
price.
This kind of indirect utility can be derived from a quasi-linear utility
function, where consumers assign a fixed budget to consumption of
product j = 0, ..., J, which is free of income effects.
For many products, the assumption of no income effects seems
reasonable.
For large household items (cars, houses, durable goods), this is less
ideal. Later we will study papers that incorporate income effects.
The price coefficient ‘α’ is the marginal utility of a dollar, and the
β
ratio α measures consumers’ willingness to pay for one additional unit
of some product attribute (for example, 1GB of computer RAM).
The expected consumer surplus with J choices is:
h i J
E (CS ) = E max {Uij }K
j =0 = ln ( ∑ e Vj ) + euler constant
j =0
Logit models are easy to estimate, and the endogeneity problem can
be easily addressed (see Berry 1994). However, it has a couple of
undesirable features.
What are the undesirable features?
Logit models are easy to estimate, and the endogeneity problem can
be easily addressed (see Berry 1994). However, it has a couple of
undesirable features.
IIA property: independence from irrelevant alternatives. From the
share equation (1), we know that:
sj e Vj
= V , ∀j, k
sk e k
Consider the case of cars. Suppose the price of BMW increases. For
some consumers who used to buy BMW, their utility from this
product decreases enough such that they switch to their second best
choice.
In models with iid εij and Vj that is identical across i, the proportion
of consumers who rank each brand as their second choice is simply
determined by the order statistics.
These models would predict that when BMW becomes more
expensive, a high fraction of its consumers would switch to either
Honda accord or Toyota camry, the most popular products.
This switching pattern is unlikely to hold in reality, because a typical
BMW consumer values the engine power (not the fuel efficiency).
Uij = Wk + Vj + εij , j ∈ Bk
Then:
sj = sj |Bk ∗ sBk
e Wk + λ k I k
sBk = , Ik = ln ∑j ∈Bk e Vj /λk
∑l e Wl +λl Il
(V /λ )
e j k
sj | B k =
∑i ∈Bk e (Vi /λk )
The most general solution is the generalized nested logit, where the
same product can belong to multiple nests.
For example, suppose consumers can choose between driving, car
pool, bus, or train:
driving and car pool are similar choices, while bus or train are similar
choices;
like bus and train, car pool is also inflexible to some extent;
as a result, car pool can belong to both nests.
Let αjk denote the portion of εj that belongs to nest k, and let
∑k αjk = 1. The choice probability of choosing product j can be
decomposed to:
sj = ∑ sj | k ∗ sk
k
{∑j ∈k [αjk exp(Vj )]1/λk }λk
sk =
∑K
l =1 { ∑j ∈l [ αjl exp(Vj )]
1/λl }λl
Pj Qj
Shares: we need physical shares (sj ), not revenue shares ( ∑ Pk Qk ).
k
Sometimes this requires converting observed sales to physical shares.
Prices: what if we need to aggregate over different styles, packages,
SKUs, etc.?
Use shares weighted prices. Constant shares (e.g. sample average
shares) are preferred.
Suppose we are interested in studying the car demand. For now, let
us simply ignore ξ j .
There are 100 million households in US.
If each household’s purchase probability is described by sj (X , P ), with
such a big “sample size”, the model predicted aggregate share should
1
differ from observed market shares by approximately √100,000,000 .
There is no demand model (with pure consumer taste shocks) that is
up to the test.
As a result, we need unobserved quality ξ j to help us fit data, even if
we are bold enough to claim prices are exogenous!
Theorem
If the density function F (ε) is positive and continuous for all ε, and we
normalize mean utility for the outside option to 0, then the inverse
function g −1 exists: ξ = g −1 (s ). In other words, there is one and only one
mean utility vector that can rationalize the observed market share.
We show that r (δ, s ) is a continuous function that maps [δ, δ̄]J into
itself.
The Brouwer’s fixed point theorem shows that a fixed point exists.
The dominant diagonal argument says the fixed point is unique iff
∂r
∑k ∂δkj < 1, ∀j.
∂rj
What is implied by condition ∑k ∂δk < 1, ∀j?
Intuition: since δ0 is fixed at 0, when attributes of all rival products
increases ε, δj (or rj (δ, sj ) only needs to increase by less than ε to
rationalize sj . Hence the ‘slope’ of rj (δ, sj ) is less than 1.
In logit models,
σξd =1 σξd =3
(1) (2) (3) (4)
Parameter True Value OLS IV OLS IV
βo 5 3.460 4.980 0.378 4.890
(0.158) (0.226) (0.415) (0.738)
βx 2 1.410 1.990 0.325 1.950
(0.058) (0.091) (0.127) (0.272)
α 1 0.726 0.995 0.181 0.979
(0.029) (0.039) (0.076) (0.128)
Table: Berry, Table 1 – Monte Carlo Parameter Estimates 100 Random Samples
of 500 Duopoly Markets Logit Utility Notes: The values given in the table are
empirical means and (standard errors). The utility function is
uij = βo + βx xj + σξd ξ j − αpj + eij . Marginal cost is
cj = e γo +γx xj +σξc ξ j +γw wj +σω ωj .