Professional Documents
Culture Documents
Abstract
* Erasmus University Rotterdam, bellet@ese.eur.nl. An early version of this paper circulated under
the title “The Paradox of the Joneses: Superstar Houses and Mortgage Frenzy in Suburban America” and
was released as a CEP Discussion Paper. I am extremely grateful to Samuel Bowles, Andrew Clark, Nico-
las Coeurdacier, Stefano Dellavigna, Paul Frijters, Pedro Gardete, Steve Gibbons, Sergei Guriev, Thomas
Piketty, and Alexandra Roulet for very helpful discussions and comments. I also thank the seminar and
conference participants at Sciences Po Department of Economics, INSEE-CREST, London School of Eco-
nomics, Paris School of Economics, INSEAD, Erasmus School of Economics, 2017 ECINEQ conference,
and 2018 Berkeley Haas SICS conference for useful comments. I acknowledge financial support from
CREST, Centre for Economic Performance (LSE) and INSEAD.
1.5
Impact of log house size on house satisfaction
0 .5 -.5 1
5
3
201
198
198
198
199
199
199
199
199
200
200
200
200
200
201
Notes: House satisfaction is regressed on the log house size of new-movers, defined as homeowners who
bought their house within the last two years (N = 22,772). Coefficients on log house size are plotted
by year. Each dot corresponds to a separate regression. 90% confidence intervals drawn using robust
standard errors. Sampling weights are included. Source: AHS national surveys (1985-2013).
an Easterlin paradox in housing consumption. Here, I plot the average house size
of new movers in suburban areas between 1985 and 2013 against their average house
satisfaction. Despite an increase in average house size of more than 600 square feet be-
tween 1985 and 2007, there is no long-term rise in house satisfaction over this period.
Since average household size fell from 3.0 to 2.8, this paradox cannot be explained by a
fall in average space per person. In Appendix A.2, I show that this puzzle is robust for
both new and old movers to the inclusion of controls related to house quality (house
price per square foot and age of house) and household characteristics (e.g., household
size, length of tenure or income).
Can positional externalities explain this paradox? To answer this question, I com-
bine 18 AHS waves of metropolitan files from 1984 to 2009. The metropolitan surveys
were conducted every year from 1984 to 1996 and around every two years thereafter.
They are representative at the level of metropolitan statistical area (MSA), which com-
prises an average of five metropolitan counties. Metropolitan statistical areas (MSA)
are delineated by the U.S. Office of Management and Budget (OMB). They consists in
a core urbanized area containing a substantial population together with adjacent com-
10
2700 10
Average house size
2300 8.5
2200
8
2100
7.5
2000
1900 7
85
87
89
91
93
95
97
99
01
03
05
07
09
11
13
20
19
19
19
19
19
19
19
19
20
20
20
20
20
20
Notes: The dashed-line shows the average house satisfaction of new suburban movers on a 1 to 10 scale
(vertical-left axis). The solid line shows the average size of their house in square feet (vertical-right axis).
New movers are defined as homeowners who bought their house within the last 2 years before being
surveyed (N = 22,772). Sampling weights included. Source: AHS national surveys (1985-2013).
munities having a high degree of economic and social integration with that core. A
different set of MSAs are surveyed each year, so that on average, each MSA is surveyed
four times over the period from 1984 to 2009. Each county is further divided between
its central city area and its suburban area. A “suburb” therefore corresponds to the
suburban area of a metropolitan county. These suburbs represent around 54% of the
total U.S. population and are fully representative of the suburban population. After
dropping observations with missing values, I end up with a pooled sample of 182,583
observations of homeowners surveyed between 1984 and 2009 within 152 suburbs (see
Table S2 for detailed summary statistics). A map showing the spatial distribution of
these suburbs can be found in Figure S7.
Following Deaton and Stone (2013), a first test for positional externalities consists
of comparing the coefficient of own house size on individual satisfaction at the house-
hold level with the same coefficient at the suburban level. Conditional on suburb-year
fixed effects, and whether a household’s reference house is located within the same
suburb, a household-level regression of house satisfaction on own house size will re-
veal the individual-level coefficient. However, the aggregate suburb-level regression
will reveal the difference between the individual and the suburb coefficient. In the
11
House size is positively related to both house and neighborhood satisfaction within
suburbs at the household level. However, once aggregated, the effect of size on house
satisfaction becomes negative, suggesting that the positive effect of own house size is
more than offset by the negative externality of others’ house size. Conversely, average
house size affects neighborhood satisfaction positively: the suburb-year coefficient is
higher than the microdata coefficient.13
12
Notes: The Figure maps all single-family houses web-scraped from Zillow that are located within one of
the seven counties of Baltimore–Columbia–Towson MSA.
The geolocalized data set provides highly detailed information about the evolution
and distribution of house size across suburbs. Figure S2 plots the median and 90th
percentile size of all houses by year of construction. Over the last 50 years, the median
newly built house doubled in size. Considering that average household size has de-
creased by around 20% since 1960, the amount of private space per person has risen
at an even higher rate. A strong rise in size also occurred at the top of the distribution.
Between 1980 and 2009, the 90th percentile house size grew 1.4 times faster than the
median house size.
This McMansion effect becomes even more salient when looking at how the con-
struction of new houses has altered the distribution of the housing stock since 1980.
Whether I look at the size of the 90th percentile house relative to the median house for
the stock of existing units each year, or the share of houses larger than 3,000 square
feet, a clear U-shape pattern emerges (Figure S3). This rise in top house size inequality
13
1.9
1.8
1.7
1.6
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010
Notes: The Figure plots the 90th percentile house size relative to the median house size for the stock
of existing houses each year within four suburbs of the Baltimore–Columbia–Towson metropolitan area
(Maryland). A value of 1.5 means the 90th percentile house is 50% bigger than the median house in the
suburb. Source: Author’s own calculation from Zillow.
echoes the evolution of top wealth inequality over the last century (Saez and Zucman
2016).14 From 1940 to 1960, the 90th percentile size of the housing stock declined
by 10 percentage points compared with the median house. The ratio remained stable
until 1980, when the reverse trend occurred. The 90th-percentile-size house then in-
creased by almost 15 percentage points relative to the median. On the eve of the 2008
financial crisis, the ratio had recovered to the level of the 1930s. Over the same pe-
riod, the share of houses larger than 3,000 square feet increased from 5% to 15% of all
houses built. Looking at the entire distribution of houses by construction decade, it is
clear that this increase in size inequality was driven by the top of the size distribution,
which became increasingly fat-tailed after 1980 (see Figure S4).
Importantly, the distribution of house size varies greatly between and within sub-
urbs, even for houses located within the same MSA. To illustrate this, I show the evolu-
tion of the ratio of 90th percentile size to median size in four suburbs of the Baltimore
MSA for the stock of houses since 1920 (Figure 4). The latter varies between 1.6 and
2.25 times the median house size. In 1960, the 90th percentile house was 70% larger
14 Using decennial Census data, Albouy and Zabek (2016) find a similar result for home values and rent
by looking at the variance of logs, the Gini coefficient, and the Theil entropy index.
14
15
16
Large Houses (> 3000 sqft) Large Houses (> 3000 sqft)
1
.3
Density
Density
.2
.5
.1
0 0
0 5 10 15 20 25 0 .5 1 1.5 2 2.5 3 3.5 4 4.5 5
Number of roads around the house (1.27 miles radius) Distance to nearest road (miles)
.2 1
Density
Density
.1 .5
0 0
0 5 10 15 20 25 0 .5 1 1.5 2 2.5 3 3.5 4 4.5 5
Number of roads around the house (1.27 miles radius) Distance to nearest road (miles)
Notes: The Figure shows the distribution of the number or roads around a house within a 1.27 miles (2
kilometers) radius, and the distance to the nearest road (in miles) separately for all houses bigger than
3,000 square feet (upper panels) and for all houses smaller than 3,000 square feet (lower panels). I use
geolocalized housing data collected on Zillow together with data from the National Highway Planning
Network (NHPN). Roads include national highway routes, but also rural and urban arterials, and rural
and urban collectors.
me a rough approximation of the area within the surburb where an AHS house is most
likely located (hence its distance to other houses), it is subjected to various forms of
measurement errors related to the limited number of predictors used in the prediction
task itself, but also the assumptions needed to convert a distance into a driving time.
For instance, the geodetic distance measure does not account for the available net-
work of streets, one-way streets, traffic congestion, speed limits, turning time, or stop
signs and traffic lights. Proximity to road networks therefore remains my preferred
measures of visual salience within a suburb.
17
N
H
Y βj
Uist = (hist /Hstσ )α zjist
j=1
18
9.25
8.5
8.25
7.75
750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250
Own house size (sqft)
Notes: Nonparametric house satisfaction curves (measured on 1 to 10 scale) as a function of own house
size (in square feet) for households living in bottom quartile (A) and top quartile (B) suburbs in terms
of the 90th percentile size of the housing stock. A household from suburb B living in a 2000 square feet
house must increase the size of his house by 500 square feet to report the same level of house satisfaction
than a household from suburb A living in a house of similar size. Sampling weights are included. Sources:
AHS metropolitan files and author’s own calculation from Zillow.
houses; households with less than 1,250 square feet seem much less affected by the Mc-
Mansion externality. Formally, this implies that decreasing marginal returns to size are
more pronounced in the presence of positional exernalities (i.e., σ > 0). This further
supports a ratio specification of relative size, as the difference specification sometimes
˜ = hist − σ Hst ) would lead to the opposite prediction.19
used in the literature (hist
H
Knowing Uist , sensitivity to the size of other houses can be directly estimated from
a log linear regression:
N
X
H
ln(Uist ) = constant + αln(hist ) − ασ ln(Hst90 ) + βj ln(zjist ) + uist (1)
j
19 Since σ > 0 and since house satisfaction can be shown to be a concave function of relative size (0 <
∂U H /∂hist
α ≤ 1), sign( ist
∂Hst
= sign − α 2 σ < 0. This specification of relative size implies that the interaction
term between own and reference house size should be negative, and diminishing marginal utility to size
should be more pronounced under higher levels of σ .
19
Visual Salience. The decision to build large houses may be driven by unobserved
suburban characteristics that in turn correlate with house satisfaction. Some of those
characteristics likely affect all houses, i.e. regardless of where a house ranks in the local
size distribution. This provides a first placebo test of positional externalities: I expect
the estimated σ coefficient will be insignificant when house size growth is driven by
lower percentile of the distribution (e.g. the 10th house size percentile), rather than
the top of the distribution (i.e. the 90th size percentile).
However, this provides only a weak test as unobserved shocks affecting rich house-
holds more than poor households may still right skew the house size distribution, and
be endogenous to satisfaction. I therefore propose a stronger placebo test. Unlike most
local drivers of house construction, positional externalities are conditional on existing
homeowners being visually exposed to larger houses. To identify positional externali-
ties in house size, I rely on the fact that, on average within a suburb, the visual salience
of other houses is higher when these houses are built in close proximity to (versus in
isolation from) the suburban road network. Importantly, this should be true “on av-
erage”, i.e. regardless of where exactly within the suburb an existing homeowner is
located. Indeed, as long as those houses can be observed while driving, new construc-
tions do not need to occur within a household’s “local” neighborhood for positional
externalities to be identified.21 I define visual salience as the average number of roads
around a reference house in a radius of 5 minutes driving time (i.e. at 36 mph, a radius
of 2 kilometers—or 1.27 miles—using Manhattan distance). I also show robustness us-
ing a measure of the average distance to the nearest road (in miles).
20 See in particular Luttmer (2005), Charles et al. (2009), or Bertrand and Morse (2016). Notably, the
well-being literature usually reports level-log or ordered logit estimates. Results from these alternative
specifications are shown in Appendix C.5.
21 The main concern here relates to more general types of housing externalities that would also affect the
value of the house, for instance. However, unlike positional externalities (Frank et al. 2014), those type of
externalities are hyper-local: Rossi-Hansberg et al. (2010) show they decline by half approximately every
1,000 feet.
20
21
Empirical Specification. Given that Cobb–Douglas utility seems to match the rela-
tionship between house satisfaction and (relative) house size well (section 3.1), I esti-
mate log-log regressions of the following form:
H 90
log(Uist ) = α+γ1 log(hist )+γ2 log(Hist )+βXist +SuburbY earst +T enureP eriodit +uist (2)
H
where Uist is the house satisfaction evaluated from 1 to 10 at t by i in s, hist captures
90
homeowners’ own house size, and Hist the reference house size. Reference size is de-
fined as the 90th percentile size of visually salient houses built after purchase. I use the
number of roads within a 5 minutes driving time around the house (2 kilometers ra-
dius) as my main measure of visual salience and, for each individual household, define
visually salient houses as houses built over the household’s tenure period that belong
to the top quartile of houses in terms of visual salience. I also construct a series of
placebo measures of reference house size by varying both the reference size threshold
(e.g. lower size percentile) and visibility of houses (e.g. lower visual salience quartile).
In this specification, σ can be interpreted as the ratio between two elasticities
(σ = −γ2 /γ1 ), where coefficients γ1 and γ2 correspond to the elasticity of house sat-
isfaction with respect to own and others’ house size, respectively. In other words, a σ
coefficient of 1 implies that a household visually exposed to house size growth at the
90th percentile must increase the size of their own house at the same rate to maintain
a constant house satisfaction level.
Xist is an extensive list of controls for household (and house) characteristics, be-
sides house size.22 Besides addressing potential sources of omitted variable bias, those
controls make sure major indirect effects of positional externalities on house market
outcomes (e.g. home prices, or home size improvements) that may also affect house
satisfaction are held constant. On top of this large set of controls, I add a set of
508 dummies for suburb-year effects (SuburbY earst ), which allows me to compare
households who share similar average suburban characteristics. This controls for any
suburb-specific characteristics affecting households’ current house satisfaction regard-
less of when they moved, such as the current characteristics of the housing stock, the
22 Household controls include age, race, sex, household size, education, annual income, number of cars,
tenure length, mortgage conditions (monthly mortgage payments, term and interest rate) and overall
neighborhood satisfaction evaluated by the household at t on a scale from 1 to 10. House controls include
house market value per square foot, local real estate taxes, age of house, quality features (balcony, porch,
extra bathrooms, heating equipment, recent remodeling), and defects (holes, water leak).
22
23
(0.002) (0.002) (0.002) (0.002) (0.003) (0.003) (0.003) (0.003) (0.003) (0.003)
Ln(reference size) -0.041∗∗∗ -0.033∗∗∗ -0.040∗∗∗ -0.045∗∗∗ -0.039∗∗∗ -0.046∗∗∗ -0.013 0.012 -0.003 -0.007
(0.006) (0.006) (0.009) (0.009) (0.009) (0.011) (0.009) (0.011) (0.008) (0.011)
σ 0.52∗∗∗ 0.62∗∗∗ 0.74∗∗∗ 0.82∗∗∗ 0.77∗∗∗ 0.90∗∗∗ 0.25 -0.24 0.06 0.13
Observations 182,570 182,570 182,570 182,570 182,570 182,009 182,570 182,570 182,425 182,180
R2 0.030 0.184 0.194 0.203 0.223 0.223 0.223 0.223 0.223 0.223
Panel B: Ln(Neighborhood Satisfaction)
Ln(reference size) -0.005 0.008 0.031∗∗ 0.030∗∗ 0.031∗∗ 0.006 0.047∗∗∗ 0.022 0.013∗ 0.012
(0.007) (0.007) (0.013) (0.013) (0.012) (0.015) (0.012) (0.016) (0.008) (0.017)
Observations 182,570 182,570 182,570 182,570 182,570 182,009 182,570 182,570 182,425 182,180
R2 0.019 0.172 0.183 0.192 0.196 0.196 0.196 0.196 0.196 0.196
Controls:
24
Household Controls — Yes Yes Yes Yes Yes Yes Yes Yes Yes
Suburb × Year FEs — — Yes Yes Yes Yes Yes Yes Yes Yes
Tenure Period FEs — — — Yes Yes Yes Yes Yes Yes Yes
Full House Controls — — — — Yes Yes Yes Yes Yes Yes
Placebo Reference Size Controls — — — — — Yes — — — —
Reference House Specification:
Size Percentile 90th 90th 90th 90th 90th 90th 50th 10th 90th 90th
Visual Salience High High High High High High High High Low High
Post-Purchase Yes Yes Yes Yes Yes Yes Yes Yes Yes No
Notes: OLS coefficients reported. Panel A: House satisfaction (logged) as the dependent variable. Coefficients on own house size and reference size (logged)
reported. Panel B: Neighorhood satisfaction (logged) as the dependent variable. All regressions include own house size (logged) and reference size. Household
controls include age, race, sex, household size, education, annual income (logged), number of cars, monthly mortgage payments, mortgage conditions (term
and interest rate) and overall neighborhood satisfaction. House controls include house market value per square foot (logged), local real estate taxes (logged),
age of house, quality features (balcony, porch, extra bathrooms, heating equipment, recent remodeling), and defects (holes, water leak). High (low) visual
salience corresponds to the top (bottom) quartile of houses within suburbs in terms of proximity to roads (i.e. number of roads within 5 minutes driving
time). Robust standard errors clustered at the suburb-year level are reported in parentheses. ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01.
purchase (i.e. the bottom quartile of houses in terms of visual salience within sub-
urb), and the 90th percentile size of visually salient houses built before purchase. In
this more restrictive specification, the estimated σ is now close to 0.9, which means
the average homeowner must upscale their house at almost the same rate as top house
size if they want to maintain their level of house satisfaction over time (the difference
between both coefficients is not statistically significant). This effect is similar in mag-
nitude to prior estimates on the relationship between self-reported happiness, own
income and neighbors’ income (see Perez-Truglia 2020, for a review).
In columns (7) to (10) of Table 2 Panel A, I run a series of placebo tests where I
replace my main measure of reference size by each one of the alternative measures de-
scribed above. This allows me to more directly test three key features of my identifica-
tion strategy. First, past literature defines positional externalities as upward-looking,
which means they can be traced back to house size growth driven by the top of the size
distribution (Bertrand and Morse 2016; Frank et al. 2014).26 Using the 50th or 10th
percentile of visually salient houses built after purchase leads to a non-significant σ
coefficient, as shown in columns (7)-(8). Figure S11 (left panel, high visual salience)
plots the estimated σ coefficients, replacing the 90th size percentile of visually salient
houses by its 70th, 50th, 30th and 10th size percentile. The results show a clear and
consistent decrease in the estimated σ coefficient when house size growth after pur-
chase is driven by lower percentiles of the size distribution. This suggests the effect
is not driven by general changes in the housing stock unrelated to house size, like the
number of new constructions. In Table S4, I look at the total number of new houses
built in the household’s suburb over tenure (logged) as an additional placebo test. I
find no significant effects on house satisfaction, but a negative and significant effect on
neighborhood satisfaction. Controlling for this measure does not affect my estimates
on reference house size.
Second, positional externalities in house size are conditional on existing homeown-
ers being visually exposed to larger houses. I therefore expect the effect of reference
house size to be smaller (or non-significant) for houses built in isolation from the sub-
urban road network. This is because those houses are less visually salient, on average.
Column (9) uses as reference size the 90th percentile size of non-visually salient houses
(bottom quartile) built after purchase. In stark contrast to column (5), the estimated
σ coefficient is close to 0. Figure 7 plots the estimated σ by quartile of visual salience.
It shows a clear linear decline of the effect as the visual salience of larger houses goes
26 See Frank et al. (2014) or Bertrand and Morse (2016) for further discussion and evidence on trickle-
down consumption and expenditure cascades.
25
.5 1
(σ coefficient)
(σ coefficient)
.25
.5
0
0
-.25
-.5
-.5
-.75 -1
4th (High) 3rd 2nd 1st (Low) Above median Below median
Visual Salience (Quartile) Own House Size
Notes: Figure (a) plots the sensitivity coefficients (σ = −γ2 /γ1 ) to the 90th and 10th percentile house size
of houses built after purchase by quartile of visual salience. Visual salience is measured by the proximity
to roads of reference houses (i.e. number of roads within 2 kilometers radius). Figure (b) plots the
sensitivity coefficients to the 90th and 10th percentile house size of visually salient houses (top quartile)
for homeowners living in above (vs. below) median house size within suburbs. Sampling weights are
included in all regressions. The 95% confidence intervals are drawn using standard errors clustered at
suburb-year level.
26
3.4 Robustness
Neighborhood Satisfaction. Rather than positional externalities, coefficient σ could
capture a more general form of neighborhood externality. For instance, the construc-
tion of McMansions may be associated with disutility from gentrification, or a signal
residential segregation, as it alters the composition of neighbors.28 The opposite effect
could also arise if gentrification or residential segregation lead to positive neighbor-
hood externalities. The inclusion of neighborhood satisfaction as a control variable al-
ready captured these effects. However, a more direct test can be performed through re-
placing house satisfaction by neighborhood satisfaction on the left-hand side of equa-
tion (2). Panel B of Table 2 reports the resulting γ2 coefficient on reference house size.
If anything, exposure to very large houses generate positive externalities on neighbor-
hood satisfaction. Moreover, contrary to house satisfaction, neighborhood satisfaction
is driven by size increases coming from the middle of the size distribution (50th per-
centile house size). Overall, these results confirm the evidence discussed in Table 1.
Positional externalities in size are distinct from general neighborhood satisfaction ef-
fects, as both measures behave in opposite ways with respect to others’ consumption
levels.29
28 The construction of McMansions could also be considered unaesthetic, which should mostly lower
neighborhood satisfaction.
29 Those results suggest households may face a tradeoff between the social benefits associated with
living in neighborhoods with larger houses and the social costs of maintaining their relative status, espe-
cially when growth in size is driven by the top of the distribution.
27
28
Attrition Bias. The use of Zillow to recover historical variation in reference house
size may be problematic if attrition or home-remodeling effects alter the measurement
of past house size in a systematic way. For instance, assuming that large mansions
are more likely to be demolished than smaller houses, the average historical size of
older houses will be underestimated. Conversely, assuming that improvements are
made to the size of old houses, their average historical size will be overestimated. I
address this concern in Appendix C.4 by comparing the mean to median house size
ratio in Zillow with the equivalent measure from the U.S. Census Bureau’s Survey of
Construction (SOC) across regions. Looking at a period of nearly 40 years, there is a
perfect correlation between Zillow and SOC data, and no evidence of a diverging gap
31 As long as it is visually salient (i.e. can be observed while driving), a reference house does not have
to be built in the homeowner’s specific neighborhood to generate positional externalities. And unlike
positional externalities (Frank et al. 2014), these supply-side effects are hyper-local (Campbell et al. 2011;
Rossi-Hansberg et al. 2010).
32 Market values in the AHS are higher in levels than transaction prices, but have similar time-series
patterns (DiPasquale and Somerville 1995; Kiel and Zabel 1999).
33 Local real estate taxes is the only predictor that remains weakly significant, but the effect size is very
small.
29
Functional Forms. The preferred log-log regressions in Table 2 directly result from
the Cobb–Douglas specification of house satisfaction in Section 3.1. They echo the
relative consumption literature and lead to a straightforward interpretation of σ . Al-
though the use of OLS remains widespread, the literature typically also estimates or-
dered probit regressions. However, recent work on the cardinality of subjective scales
show that when respondents use response scales in an approximately linear manner,
OLS estimates are valid (see Kaiser and Vendrik 2020, for a review). I follow a similar
approach as Kristoffersen (2017) and regress the log of house market values on dum-
mies for each of the response categories of the house satisfaction question.34 Figure
S14 confirms the validity of the linearity of scale use assumption. Table S11 also re-
produces the main estimates from Table 2 using level-log and ordered probit models.
The results still hold: the coefficients on own and reference house size, along with their
interactions, remain significant and of the expected signs and relative magnitude.
30
31
where regression (3) takes (logged) house size as the dependent variable. Each regres-
sion includes household/house fixed effects µi and year fixed effects Γt . I use Hst90 as the
32
33
34
35
5 Conclusion
This paper provided direct field-level evidence of positional externalities in the Amer-
ican housing market. I identify positional externalities from cross-sectional variation
in house satisfaction and the size of visually-salient houses built within a household’s
suburb over tenure. I show that a rise in size of visually-salient McMansions—defined
as the 90th percentile size of houses built in close proximity to the suburban road
network—nearly offsets the satisfaction gains from living in a larger house. The ev-
idence is consistent with the puzzling fact that despite a major upscaling in size of
single-family houses in U.S. suburbs since 1980, households did not experience a sig-
nificant increase in house satisfaction. I complement the cross-sectional analysis with
indirect evidence of positional externalities on house market outcomes. I show that
within-households, changes in visual exposure to McMansions also affects homeown-
ers’ decision to upscale the size of their own house.
These results have major implications for our understanding of consumer markets
in unequal economies. They imply consumption inequality driven by the top of the
distribution can lower the satisfaction consumers derive from their own purchases.
Competing for relative size becomes a zero sum game (Frank 2013). For those who de-
cide to upscale, once the satisfaction gains of living in a relatively large house vanish,
households may be left with the negative long-run impact of their decisions in terms
of extra costs (from extra energy costs to re-furnishing costs). While I cannot directly
test whether relative visible consumption (or consumption inequality) better explains
life satisfaction than relative income (or income inequality), this paper also calls for
a reassessment of the relative income hypothesis in terms of positional consumption
choices.
The documented magnitude of positional externalities also suggest regulations
aimed at restricting the size of houses at the top of the distribution, from zoning
laws to housing permits, may be welfare improving (Frank 2005). Such regulations
already exist in a number of U.S. counties, but their overall welfare impact is yet to be
evaluated. In most cases, they take the form of maximum lot size ratios that restrict
the size of houses. However, suburban communities tend to favor the extensive use
of minimum lot size requirements, which may have amplified positional externalities
and increased financial distress, with no long-term improvements in house satisfaction
36
References
Abadie, A., S. Athey, G. W. Imbens, and J. M. Wooldridge (2023). When should you adjust
standard errors for clustering? The Quarterly Journal of Economics 138(1), 1–35.
Abbiasov, T., C. Heine, E. L. Glaeser, C. Ratti, S. Sabouri, A. S. Miranda, and P. Santi (2022).
The 15-minute city quantified using mobility data. Technical report, National Bureau of
Economic Research.
Abel, A. B. (1990). Asset prices under habit formation and catching up with the joneses. The
American Economic Review, 38–42.
Albouy, D. and M. Zabek (2016). Housing inequality. Technical report, National Bureau of
Economic Research.
Bailey, M., R. Cao, T. Kuchler, and J. Stroebel (2018). The economic effects of social networks:
Evidence from the housing market. Journal of Political Economy 126(6), 2224–2276.
Bao, H. X. and C. C. Meng (2023). Housing wealth distribution, inequality and residential
satisfaction. Regional Studies, 1–14.
Benjamin, D. J., O. Heffetz, M. S. Kimball, A. Rees-Jones, et al. (2014). Can marginal rates of
substitution be inferred from happiness data? evidence from residency choices. American
Economic Review 104(11), 3498–3528.
Bertrand, M. and A. Morse (2016). Trickle-down consumption. Review of Economics and Statis-
tics 98(5), 863–879.
Bordalo, P., N. Gennaioli, and A. Shleifer (2019). Memory and reference prices: an application
to rental choice. In AEA Papers and Proceedings, Volume 109, pp. 572–76.
Bottan, N. L. and R. Perez-Truglia (2022). Choosing your pond: location choices and relative
income. Review of Economics and Statistics 104(5), 1010–1027.
Boustan, L. P. (2010). Was postwar suburbanization “white flight”? evidence from the black
migration. The Quarterly Journal of Economics 125(1), 417–443.
Boustan, L. P. and R. A. Margo (2013). A silver lining to white flight? white suburbanization
and african–american homeownership, 1940–1980. Journal of Urban Economics 78, 71–80.
Breza, E., S. Kaur, and Y. Shamdasani (2018). The morale effects of pay inequality. The Quar-
terly Journal of Economics 133(2), 611–663.
Brodeur, A. and S. Flèche (2019). Neighbors’ income, public goods, and well-being. Review of
Income and Wealth 65(2), 217–238.
Bursztyn, L., B. Ferman, S. Fiorin, M. Kanz, G. Rao, et al. (2018). Status goods: Experimental
evidence from platinum credit cards. The Quarterly Journal of Economics 133(3), 1561–1595.
Campbell, J. Y., S. Giglio, and P. Pathak (2011). Forced sales and house prices. American
Economic Review 101(5), 2108–31.
Card, D., A. Mas, E. Moretti, and E. Saez (2012, October). Inequality at Work: The Effect of
Peer Salaries on Job Satisfaction. American Economic Review 102(6), 2981–3003.
37
38
39
40
ii
25 40
35
30
Percent
20 15
10
5
0
1 2 3 4 5 6 7 8 9 10
House satisfaction (1-10)
Note: Histogram of house satisfaction reported on a 1 to 10 scale by AHS suburban homeowners between
1985 and 2013 (pooled sample). Statistics are weighted using the AHS-provided weights. Source: AHS
National Surveys (1985-2013)
5000
90th percentile size 50th percentile size
4000
Size of new houses (sqft)
3000
2000
1000
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010
Notes: The Figure plots the median house size (solid line) and 90th percentile size (dashed line) of new-
build houses by year of construction from 1920 to 2009. Source: Author’s own calculation from Zillow.
iii
1.9
15
% Houses Bigger than 3000 sqft
1.85
P90/P50 house size ratio
10
1.8
5
1.75 1.7
0
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010
Notes: Figure S3 takes the stock of houses each year and plots the 90th percentile house size relative to
the median house size between 1920 and 2009 (left axis), along with the share of houses bigger than 3000
square feet (right axis). Source: Author’s own calculation from Zillow.
Notes: Kernel distribution of house size by decade (1960-2010). Source: Author’s own calculation from
Zillow.
iv
1.5
Impact of log house size on house satisfaction
0 .5 -.5 1
5
3
201
198
198
198
199
199
199
199
199
200
200
200
200
200
201
House satisfaction regressed on log house size of homeowners each year. Controls: price per square
foot, age of the house, household income, monthly mortgage payments, household size, householder’s
age, race, sex, education, length of tenure and number of cars. Coefficients on log house size are plotted
by year. Each dot corresponds to a separate regression. 90% confidence intervals drawn using robust
standard errors. Sampling weights included. Source: AHS national surveys (1985-2013).
2500 .2
Average house size
Residual house satisfaction
2300
0
2200
-.1
2100
2000
-.2
85
87
89
91
93
95
97
99
01
03
05
07
09
11
13
20
19
19
19
19
19
19
19
19
20
20
20
20
20
20
Notes: Dashed-line: average residual house satisfaction of homeowners each year (vertical-left axis) after
controlling for the price per square foot, age of the house and household characteristics (income, monthly
mortgage payments, household size, householder’s age, race, sex, education, length of tenure and number
of cars). Vertical-right axis: average house size. Sampling weights included. Source: AHS national
surveys (1985-2013).
vi
Notes: The figure maps the 152 counties and pseudo-counties located within the 45 Metropolitan Statis-
tics Areas (MSA) surveyed in the American Housing Survey. These locations cover 55% of the total US
population and almost the entire suburban population.
vii
Large Houses (> 3000 sqft) Large Houses (> 3000 sqft)
.3 1
Density
Density
.2
.5
.1
0 0
0 5 10 15 20 25 0 .5 1 1.5 2 2.5 3 3.5 4 4.5 5
Number of roads around the house (1.5 miles radius) Distance to nearest road (miles)
.2 1
Density
Density
.1 .5
0 0
0 5 10 15 20 25 0 .5 1 1.5 2 2.5 3 3.5 4 4.5 5
Number of roads around the house (1.5 miles radius) Distance to nearest road (miles)
1992 2005
Notes: Distribution of the proximity to the suburban road network measures for the full sample of geolo-
calized houses. We compare the visual salience measures obtained using NHPN data from either 2005 or
1992 (i.e. 13 years gap). Roads include all national highway routes, rural and urban arterials, and rural
and urban collector roads.
viii
Step 1. First, I impute missing longitude and latitude information for each house in the AHS
sample based on overlapping variables with the geolocalized sample.
To impute longitude and latitude to each AHS house, I follow Stekhoven and Bühlmann
(2011) and train a random forest algorithm within each suburb separately. Multiple imputa-
tion methods based on linear regressions do not fully consider the dependence between each
of the imputed variables, which in the case of spatial data is particularly important. Random
forest allows for interactions between continuous and discrete variables within suburbs and
simultaneously predicts both latitude and longitude. I use longitude and latitude as target
variables and unit size, lot size, year of construction and any possible interactions between
these variables as predictors.
0 .1 .2 .3 .4 .5 .6 .7 .8 .9 1
Correlation coefficient between true and simulated values (within suburbs)
Longitude Latitude
To test the robustness of the imputation method, I simulate random missing values for
each suburbs in Zillow in the same proportions as the corresponding missing values in the
AHS. I then look at the correlation coefficient between imputed and true values within suburbs.
Figure S9 plots the distribution of the correlation coefficients for all suburbs. More than half of
the suburbs have a correlation higher than 0.3, and 25% show a correlation higher than 0.5.38
38 Due to the much larger size of Californian suburbs, removing the state of California greatly improves
the average suburb prediction.
ix
Figure S10: Distribution of Predicted Distance Between AHS House and New-Build
Houses (Sources: author’s own computation from AHS and Zillow)
10
8
6
Percent
4
2
0
0 5 10 15 20 25 30 35 40 45 50
Average estimated distance from new-build houses (miles)
Step 3. Finally, I convert the predicted distance measure into an average predicted driving
time (in minutes) using the following formula:
(Distance × 60)
DrivingT ime = (4)
36
I assume an average speed of 36 miles per hour, which corresponds to the average speed
reported by respondents in my sample. Indeed, since 2002, the American Housing Surveys ask
homeowners about their daily commuting distance and commuting time to work. Respondents
in my sample report a daily commuting distance of 15.75 miles and a commuting time of 26
xi
Ln(house satisfaction)
Coeff. S.E.
Ln(own size) 0.051∗∗∗ (0.003)
Ln(reference size) -0.039∗∗∗ (0.009)
Household Controls
Ln(income) 0.004∗∗∗ (0.001)
Total mortgage monthly payment 0.000 (0.000)
Mortgage’s interest rate -0.001∗∗∗ (0.000)
Mortgage term (years) 0.000 (0.000)
Number of cars 0.001 (0.001)
Household size -0.005∗∗∗ (0.001)
Respondent’s age 0.001∗∗∗ (0.000)
Respondent’s education -0.005∗∗∗ (0.000)
Black respondent 0.017∗∗∗ (0.003)
Hispanic respondent 0.021∗∗∗ (0.004)
Male respondent -0.004∗∗∗ (0.001)
Ln(neighborhood satisfaction) 0.278∗∗∗ (0.006)
Other House Controls
Ln(house value per sqft) 0.020∗∗∗ (0.002)
Ln(real estate tax payment) 0.001 (0.001)
Size change last 12 months 0.017∗∗∗ (0.005)
Number of bathrooms 0.014∗∗∗ (0.001)
Age of house -0.001∗∗∗ (0.000)
Inside water leaks -0.031∗∗∗ (0.002)
Heating equipment broken -0.033∗∗∗ (0.006)
Unit has porch/balcony 0.020∗∗∗ (0.003)
Observations 182570
R2 0.223
Adjusted R2 0.216
Tenure Period FE Yes
Suburb × Year FE Yes
Notes: OLS coefficients reported. Reference size is defined as the 90th percentile size of visually
salient houses built over households’ tenure period. Sampling weights are included in all regressions.
Robust standard errors clustered at the suburb-year level reported in parentheses. ∗ p < 0.10, ∗∗ p <
0.05, ∗∗∗ p < 0.01
xii
xiii
.75 .75
Homeowners' Sensitivity to Size of Reference
Houses (σ coefficient)
.25 .25
0 0
-.25 -.25
-.5 -.5
-.75 -.75
-1 -1
10th 30th 50th 70th 90th 10th 30th 50th 70th 90th
pth percentile size of reference houses after purchase pth percentile size of reference houses after purchase
Notes: The Figure plots the sensitivity coefficient (σ = −γ2 /γ1 ) to the size of reference houses built over
households’ tenure in their suburbs estimated from regression (2) for visually salient houses (top quartile
in terms of proximity to roads) and non-visually salient houses (bottom quartile). Each dot represents a
separate regression where the pth percentile size of newly built houses is used as a measure of reference
house size. Sampling weights included. 90% confidence intervals drawn using standard errors clustered
at suburb-year level.
xiv
xv
Notes: OLS regressions reported. Reference size is defined as the 90th percentile size of houses built
over households’ tenure in their suburbs. Each regression varies the degree of visual salience of ref-
erence houses. Columns (1) to (4) use the total number of roads around a reference house within a
2 kilometers (1.27 miles) radius as a measure of visual salience (higher quartile means higher visual
salience). Columns (5) to (8) use the distance to the nearest road around a reference house (in miles)
as a measure of visual salience (lower quartile means higher visual salience). Controls include house-
holder’s age, race, sex, education, household size, annual income (logged), number of cars, monthly
mortgage payments, mortgage conditions (term and interest rate), neighborhood satisfaction, house
market value per square foot (logged), local real estate taxes (logged), age of house, quality features
(balcony, porch, extra bathrooms, heating equipment, recent remodeling), and defects (holes, water
leak). Sampling weights included. Robust standard errors clustered at the suburb-year level are
reported in parentheses ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
xvi
Ln(house satisfaction)
(1) (2) (3)
All Counties All Counties Excl. California
xvii
4
Sensitivity to 90th Percentile Size
-1
-2
0
5
10
15
20
25
30
35
40
45
50
55
60
Notes: The Figure plots the estimated σ as a function of predicted driving time from new-build houses
(measured in minutes at 36 miles per hour), excluding the State of California, separately for high visual
salience and low visual salience houses. Coefficients are estimated using specification (2), where own
house size and the 90th percentile size of new houses built in proximity (vs. isolation) to road networks
(top vs. bottom quartile) are interacted with 30 equally-spaced bins of two minutes driving time. Each
dummy captures a given average driving time bracket (shown in minutes) separating each house from
each other new-build houses in their suburbs since they moved in. The dashed line represents the non-
parametric fit using a locally weighted regression.
xviii
Table S8: Reference House Size and Local Housing Market Conditions
xix
xx
3000
Mean size of new-build houses (sqft)
2750
2500
2250
2000
1750
1500
1970 1975 1980 1985 1990 1995 2000 2005 2010
year
Notes: Figure S13 plots the mean size of new-build single family houses from 1971 to 2009 using data
from Zillow (dark lines) and from the Survey of Construction (grey lines).
Sources: Survey of Construction (SOC) and author’s own calculations from Zillow.
However, Zillow captures on average bigger houses than the SOC. Two important reasons
can explain this gap in levels. First, the SOC historical data regroups both MSA and non-MSA
single-family houses, while the Zillow sample is restricted to metropolitan suburbs, where
houses are on average bigger. A better comparison is to restrict the SOC to houses built within
MSA. This only partly addresses the problem as suburban and central city houses cannot be
distinguished in the SOC. However, the Figure shows it reduces half of the gap.39 Second, the
SOC is top-coded for the biggest 1% houses, which means Zillow does a better job at measuring
the true size of the biggest houses built. If I truncate the Zillow sample to exclude the top
percentile, the remaining gap is negligible.
Lastly, attrition bias may affect the distribution of houses over time if big houses are more
39 The Census Bureau does not compute averages at the MSA level for the period 1971-2009, and access
to the micro data of the SOC is restricted to the 1999-2009 period.
xxi
Attrition Index
(1) (2)
Coeff. S.E. Coeff. S.E.
Time since the house was built (years) -0.007 (0.020) -0.027 (0.027)
Northeast × Time since the house was built (years) -0.080 (0.062)
South × Time since the house was built (years) 0.095∗∗ (0.038)
West × Time since the house was built (years) 0.067 (0.045)
Observations 156 156
R2 0.311 0.377
Adjusted R2 0.293 0.348
Notes: In column (1), I regress the measure of attrition defined in equation (5) against the number
of years since the house was built, controlling for Census region dummies. In column (2), I interact
the number of years since the house was built with the Census region dummies. Sources: Survey of
Construction (SOC) and author’s own calculations from Zillow.
likely to be demolished than small houses. House remodeling and upscaling should also bias
the Zillow measures. To check whether this is the case, I take the ratio of mean to median size
in each census region for each year t as a first approximation of the size distribution for both
datasets. I then compute the difference between the two measures and construct the following
index:
h Mean Mean i
Attrition measuret = 100 × ( )Zillow,t − ( )SOC,t (5)
Median Median
To test whether the distribution of house size between the two datasets varies over time,
I regress this index on the number of years since houses were built and region fixed effects.
Table S10 shows the results of the regression. The coefficient on time is not significant. I can
also interact time with region fixed effects, as attrition and remodeling could play differently
across regions. Except in the South, I find no significant differences across regions. This further
reduces the attrition concern.
xxii
xxiii
.6
Impact on Ln(house market value)
.4
.2
0
5 6 7 8 9 10
House Satisfaction (vs. Below 5)
Notes: The Figure plots the estimates of an OLS regression where the log of house market value is re-
gressed on dummies for each of the response categories of the house satisfaction question conditional on
suburb-year and period fixed effects. Due to low sample size, I regroup answers below 5 as a single cate-
gory (0.9% of answers) and use it as the excluded category. Controls include householder’s age, race, sex,
education, household size, annual income (logged), number of cars, monthly mortgage payments, mort-
gage conditions (term and interest rate), neighborhood satisfaction, house market value per square foot
(logged), local real estate taxes (logged), age of house, quality features (balcony, porch, extra bathrooms,
heating equipment, recent remodeling), and defects (holes, water leak). Sampling weights included. 95%
confidence intervals drawn using robust standard errors clustered at the suburb-year level.
xxiv
xxv
(0.003) (0.004) (0.004) (0.003) (0.003) (0.003) (0.005) (0.006) (0.004) (0.003) (0.004) (0.004)
Ln(reference size) -0.033∗∗∗ -0.041∗∗∗ -0.040∗∗∗ -0.036∗∗∗ -0.037∗∗∗ -0.039∗∗∗ -0.046∗∗ -0.056∗∗ -0.037∗∗∗ -0.041∗∗∗ -0.031∗∗∗ -0.047∗∗∗
(0.011) (0.011) (0.012) (0.012) (0.010) (0.013) (0.020) (0.025) (0.012) (0.014) (0.010) (0.014)
σ 0.70 0.76 0.80 0.68 0.68 0.87 1.03 1.48 0.81 0.70 0.57 0.97
Observations 84,545 97,884 91,879 90,579 118,112 64,340 23,693 19,337 91,311 91,051 92,472 89,507
R2 0.225 0.237 0.224 0.240 0.215 0.278 0.255 0.278 0.230 0.227 0.231 0.228
Controls:
Household Controls Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes
Suburb × Year FEs Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes
Tenure Period FEs Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes
Full House Controls Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes
xxvi
Notes: OLS coefficients reported. House satisfaction (logged) as the dependent variable. Coefficients on own house size and reference size (logged) reported.
Each column reports heterogeneous effects splitting the full sample between low (less than 2 persons) and high (more than 2) household size (columns
1-2), low (below median) and high (above median) household income (columns 3-4), lower (no college) or higher (undergraduate or graduate) education
(columns 5-6), low (below median) and high (above median) suburb size in square miles (columns 7-8), population density (columns 9-10) and commuting
time (columns 11-12). Data on commuting time is only available since 2002. Robust standard errors clustered at the suburb-year level are reported in
parentheses. Household controls include age, race, sex, household size, education, annual income (logged), number of cars, monthly mortgage payments,
mortgage conditions (term and interest rate) and overall neighborhood satisfaction. House controls include house market value per square foot (logged),
local real estate taxes (logged), age of house, quality features (balcony, porch, extra bathrooms, heating equipment, recent remodeling), and defects (holes,
water leak). ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01.
Appendix C.7 Moving Costs
Decision to Move = 1
(1) (2)
Ln(own size) -0.032∗∗∗ -0.014
(0.007) (0.011)
Ln(reference size) -0.038∗ -0.023
(0.023) (0.025)
Ln(own size) × Above median size 0.003
(0.015)
Ln(reference size) × Above median size -0.031∗
(0.019)
Above median size 0.206
(0.177)
Observations 73666 73666
R2 0.136 0.136
Adjusted R2 0.124 0.124
Controls Yes Yes
Tenure Period FEs Yes Yes
Suburb × Year FEs Yes Yes
Notes: OLS coefficients reported. Reference size is defined as the 90th percentile size of visually
salient houses built over households’ tenure period. In any survey year, I identify as “movers” house-
holds whose house becomes occupied by new owners in the next survey year (versus those who re-
main in the same housing unit). I use this indicator as a dependent variable in regression (2), instead
of house satisfaction. Sampling weights included in all regressions. Robust standard errors clustered
at the suburb-year level reported in parentheses. ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
xxvii
with x a composite commodity, h the size of the house, H s the McMansion externality in
suburb s and p the housing price per square foot. The marginal utility is positive in own
house size Uh > 0 and negative in reference house size UH s < 0. In a perfectly competitive
economy, the housing market internalizes the externality so p and y adjust to variations in H s .
In equilibrium, utility is equalized across the two suburbs so that the household is equally
happy in both places, with no incentive to move. The problem can be rephrased from the
indirect utility function V as
V y(H s ), p(H s ), H s = k ∀ s (6)
where k is a constant. This market equilibrium condition is the starting point of the hedo-
nic pricing (HP) approach introduced by Rosen (1974) or Roback (1982). The indirect utility
of housing is an increasing function of income (Vy > 0) and a decreasing function of house
prices for new movers (Vp < 0)40 . The marginal impact of a change in the housing size stock
depends on whether the externality is positive (VH s > 0) or negative (VH s < 0). The implicit
cost of positional externalities C experienced by an existing home owner can be defined as the
increase in income required to make new movers indifferent net of the variation in the market
value of houses:
And combining equation (8) and (7), the implicit hedonic cost of the McMansion external-
ity equals
C = dy/dH s − Vp /Vy (dp/dH s ) = −VH s /Vy > 0 (9)
When the labor and housing markets are in equilibrium, the implicit cost of positional
externalities exactly equals the marginal willingness to pay (MWTP) to avoid relative down-
40 The fact that higher income allows for better house quality logically leads to a positive marginal
utility of income. The estimation of the later is therefore very sensitive to the inclusion of dwelling
specific controls for quality.
xxviii
The hedonic cost of positional externalities computed from the wage and price gradients
would therefore give a downward biased estimate of the true cost, as it neglects the residual
effect (dV /dH s )/Vy not capitalized in private markets.
41 For a discussion of the LS approach, see Luechinger and Raschky (2009); Van Praag and Baarsma
(2005).
42 Evidence on “remembered utility” and on the fact that subjective well-being may differ from flow
utility is reviewed by Kahneman and Thaler (2006); Kimball and Willis (2006).
xxix