Professional Documents
Culture Documents
This online appendix gives complements on the model (e.g. on variants with continuous time, fully
flexible prices, with nominal illusion, another specification for the debt process). It gives also additional
proofs.
XI Complements
XI.A The model in continuous time
XI.A.1 Formulation of the model in continuous time
We can write M = 1 − ξ∆t, β = 1 − r∆t and M f = 1 − ξ f ∆t. In the small time limit (∆t → 0),
ξ, ξ f ≥ 0 are the macro parameters of inattention. The model (28)-(29) becomes, in continuous time:
ρf ρf ξ
φπ + φx + > 1. (110)
κ κσ
Section XI.A.2 contains a derivation of the Phillips curve in continuous time.
δ := r + λ, α = δ + ξ. (111)
The discrete-time m̄t becomes e−ξt , where ξ ≥ 0 is the amount of cognitive discounting (rationality
corresponds to ξ = 0).
If a firm can reset its price at time 0, it sets it to the value in (27):
#$ ∞ % $ t & '
p∗0 − p0 = E −(δ+ξ)t
δe −mfx µt + mfπ πs ds dt ,
0 0
1
and using $ %$ & $ %$ & $
∞ t ∞ ∞ ∞
−αt −αt e−αs
e πs ds dt = πs ds e dt = πs ds
t=0 s=0 s=0 t=s s=0 α
we have #$ ∞ '
! ′ "
p∗0 − p0 = E −αt
e ′
mf π πt − µt dt , (112)
0
with
δ f
m ,m′f π :=
µ′t := δmfx µt .
α π
Inflation at time 0 is π0 = ṗ0 = λ (p∗0 − p0 ). Hence, we have:
#$ ∞ '
−αt
! ′ ′
"
π0 = λE e mf π πt − µt dt . (113)
0
To solve this, it is useful to use the differentiation operator, D = dtd . With this notation, for a
function f (sufficiently regular), the Taylor expansion formula can be written as:
∞
( ∞ % &
(k) τk ( kτ
k
f (t + τ ) = f (t) = D f = eτ D f,
k=0
k! k=0
k!
i.e.
f (t + τ ) = eτ D f (t) . (114)
Hence (113) can be rewritten (dropping the expectations for ease of notation):
λ ! ′ "
πt = mf π πt − µ′t , (116)
α−D
and multiplying by α − D,
! "
(α − D) πt = λ m′f π πt − µ′t .
We recall that µt = − (γ + φ) xt (see (107)), so that
δ
(r + λ + ξ − D) π = λ mfπ πt + κxt , (117)
α
κ = κ̄mfx , (118)
κ̄ = λ (r + λ) (γ + φ) . (119)
2
This gives the continuous-time version of the Phillips curve in the basic model, equation (29).
! "
r + ξ f πt − π̇t = κxt , (120)
The first one is the linearization of the net profits, ytf = ct − ωt Nt . The second one is the labor
supply condition.
c1−γ − 1 N 1+φ
max − s.t. c = wN + k,
c,N 1−γ 1+φ
φ
ĉ (k) = k, (125)
φ+γ
γ
and N̂ (k) = − φ+γ k. If wealth goes up by $1, labor supply goes down, so that consumption goes up by
φ
less than $1 (indeed, it goes up by φ+γ < 1). This fact will show up in the more complex intertemporal
versions to which we now turn.
3
Here I state a variant of Proposition 29. That proposition was written as a function of planned
future labor supply. Here I explicitly solve for planned future labor supply.
Proposition 17 (Consumption given beliefs, solving out for labor supply) Suppose that we have
! "
ŷ BR (Xt , Nt ) = Ŷ BR (Xt ) + w (Xt ) Nt − N̄ , (126)
where Ŷ BR (Xt ) is some function that is independent of the agent’s own labor supply satisfying Ŷ BR (0) =
0; and the optimization of utility is over cτ , Nτ . Then optimal consumption is, up to second order terms:
+ &&,
( 1 % %
ŵ (X τ )
ct = ȳ + bk kt + EBR t τ −t
br r̂ BR (Xτ ) + bk Ŷ BR (Xτ ) + , (127)
τ ≥t
R φ
−1 r φ
br := , bk := χ, χ := . (128)
γR2 R φ+γ
Note that the bk has been multiplied by χ. Also, income has been increased by ŵ(X φ
τ)
, because of the
labor supply response. The marginal propensity to consume (MPC) out of capital is now Rr χ, rather
than Rr . This is analogous to the static MPC in (125): higher wealth kt is spent on higher consumption
and reduced labor supply.
Proof of Proposition 17 We start from (210), the consumption given planned future labor
supply:
+ ,
r ( 1 ) r ) ! " **
ct = kt + ȳ + EBR
t τ −t
br r̂ BR (Xτ ) + Ŷ BR (Xτ ) + w (Xτ ) Nτ − N̄ . (129)
R τ ≥t
R R
ω̂ (Xτ ) γ
Nτ − N̄ = − ĉτ , (130)
φ φ
where ĉτ is consumption that the consumer plans, at time t, to enjoy at time τ ≥ t. Because under the
agent’s subjective model the Euler equation holds, we have
1 BR ! BR "
ĉτ = ĉt + Et r̂ (Xt ) + · · · + r̂ BR (Xτ −1 ) . (131)
γR
So,
+ &&,
r ( 1 % r
%
ω̂ (X τ ) γ
BR
ĉt − kt = Et τ −t
br r̂ BR (Xτ ) + Ŷ BR (Xτ ) + − ĉτ
R τ ≥t
R R φ φ
+ &&,
( 1 % r
%
ω̂ (X τ ) γ
BR
= Et τ −t
br r̂ BR (Xτ ) + Ŷ BR (Xτ ) + − (A + B) ,
τ ≥t
R R φ φ
4
with
( 1 r
A = ĉt = ĉt ,
τ ≥t
Rτ −t R
- .
r 1 BR ( 1 ! BR BR
" r BR ( 1 ( 1
B = Et r̂ (X t ) + · · · + r̂ (X τ −1 ) = E r̂ BR (Xτ )
R γR τ ≥t+1
Rτ −t γR2 t τ ≥t Rτ −t k≥1
Rk
r BR ( 1 1 BR BR
( 1
= E r̂ (Xτ ) = −b r E r̂ BR (Xτ ) .
γR2 t τ ≥t Rτ −t r t
τ ≥t
R τ −t
Hence,
+ &&,
r ( 1 %% γ
&
r
%
ω̂ (X τ ) γ
ĉt − kt = EBR
t τ −t
1+ br r̂ BR (Xτ ) + Ŷ BR (Xτ ) + − cˆt ,
R τ ≥t
R φ R φ φ
i.e.
% & + &&,
γ r ( 1 %% γ
&
r
%
ω̂ (X τ )
1+ ĉt = kt + EBR
t τ −t
1+ br r̂ BR (Xτ ) + Ŷ BR (Xτ ) + .
φ R τ ≥t
R φ R φ
Proposition 18 (Consumption with active fiscal policy) Consider an agent maximizing over (cτ , Nτ )
/∞ τ −t
utility U = EBRt τ =t β u (cτ , Nτ ) subject to the law of motion for wealth (49), and with extra transfers
BR
T (Xτ ) that do not depend on the agent’s own actions. Up to second order terms (and for small wealth
kt ), consumption is:
+ ,
( 1 ! "
ĉt = bk kt + EBRt τ −t
br r̂ BR (Xτ ) + bY ĉ (Xτ ) + bk T BR (Xτ ) , (133)
τ ≥t
R
100
The 2018 NBER WP version of this paper uses
! " ! " ! " ! "
T BR Zτ , Ztd = (1 − my ) T Ztd + my T (Zτ ) = T Ztd + my T Zτ − Ztd
which leads to + ,
τ
( −1
0 1 r
EBR
t T BR (Xτ ) = − Bt + my m̄τ −t Et dτ − r du .
R u=t
5
−1 r φ r φmy +γ
with br = γR 2 , bk = R χ with χ = γ+φ , and bY = m
R Y
with mY = φ+γ
. As usual, the chosen labor
supply is given by Ntφ = ω (Xt ) c−γ
t .
So, the situation is a little subtle, as the coefficient on future aggregate consumption is Rr mY (so it
is equal to Rr if my = 1), while the coefficient on future transfers and wealth is Rr χ. This is because
ĉ (Xτ ) implicitly incorporates the effects of future labor supply.
r
Using (127), we have, with bk := R
χ,
+ % % & &,
( 1 ω̂ (X τ )
ĉt − bk kt = EBR
t br r̂ BR (Xτ ) + bk my ĉ (Xτ ) − N̂ (Xτ ) + + bk T BR (Xτ ) ,
τ ≥t
Rτ −t φ
+ % % & &,
( 1 γ
= EBR
t br r̂ BR (Xτ ) + bk my + ĉ (Xτ ) + bk T BR (Xτ ) , using (123)
τ ≥t
Rτ −t φ
+ % % & &,
( 1 r φm y + γ
= EBR
t br r̂ BR (Xτ ) + ĉ (Xτ ) + bk T BR (Xτ ) .
τ ≥t
Rτ −t R φ+γ
Proposition 19 (IS curve, anchoring on the present) Suppose that agents anchor their projection
of future income on current income, with a weight my0 . Then, the IS curve of the basic behavioral
setup (Proposition 9) still holds, but replacing in M and σ the parameters (mY , mr ) by (m′Y , m′r ) =
1 φ
1−χmy0
(mY , mr ), where χ = γ+φ . The values of M and σ increase with the anchoring my0 .
101
Formally, at time t, the agent maximizes utility U subject to the perceived law of motion of wealth as in the main
text in Section I.A, but using (134) for perceived income.
6
At the same time, the marginal propensity to consume out of a 0-persistence shock to current income
is
r
MP C = χmy0 + χ my . (135)
R
Proof of Proposition 19 The general equilibrium result comes straight from Proposition 18. We
just set the perceived extra transfer to T BR (Xτ ) := my0 ŷ (Xt ). The consumption policy becomes (we
−1 r φ
take the case with zero wealth), with br = γR 2 , bk = R χ and χ = γ+φ :
+ ,
( 1 ) r *
ĉt = EBR
t τ −t
br r̂ BR (Xτ ) + mY ĉ (Xτ ) + bk my0 ŷ (Xt ) ,
τ ≥t
R R
+ ,
( 1 ) r *
= χmy0 ŷ (Xt ) + EBRt τ −t
br r̂ BR (Xτ ) + mY ŷ (Xτ ) .
τ ≥t
R R
This gives the MPC. Taking into account that income equals consumption, so that ĉt = ŷ (Xt ), we
have:
+ ,
1 ( 1 ) r *
ĉt = EBR br r̂ BR (Xτ ) + mY ŷ (Xτ )
1 − χmy0 t τ ≥t
R τ −t R
+ ,
1 ( m̄τ −t ) r *
= Et τ −t
br mr r̂ (Xτ ) + mY ŷ (Xτ ) .
1 − χmy0 τ ≥t
R R
This is the expression (52) of the main text, but replacing (mY , mr ) by (m′Y , m′r ) =
1
1−χmy0
(mY , mr ) .!
There is a minor surprise: when agents anchor more in the present (with a higher my0 ), then the IS
curve becomes more forward looking. The reason is a GE effect. Because present consumption reacts
more to current income (higher my0 ), GE effects are stronger, and in particular, the impact of future
disturbances are amplified.
The macro model has the same form as in the baseline version, with a slightly modified values for M
and σ. However, the micro behavior is very different. Indeed, the MPC out of a 0-persistence innovation
to labor income is (135), so quantitatively it is close to χmy0 . Estimates from the tax rebate and other
literatures point to my0 χ ≃ 0.3 (for example, see Johnson et al. (2006)). Quantitatively, that makes
fairly little difference to the value of M. Still, now the GE channel for the transmission of monetary
policy is strong, much like in Kaplan et al. (2018).
Discussion Through the mechanism highlighted in this section, we capture features as in Kaplan
et al. (2018), where a high MPC is important for the transmission of monetary policy. The impact
of interest rates comes in part via intertemporal substitution, and in good part via the GE effect on
aggregate income – but for that we need a high MPC.
Here, behavioral economics allows us quite easily to capture a high MPC out of current income.
What to make of that ease?
The favorable interpretation is that part of the art of macroeconomics is to find useful metaphors,
and this behavioral metaphor is quite useful as it is tractable, intuitive (one understands clearly the
worldview of the agent), and quite possibly true to the first order. Certainly, it is much simpler than
7
tracking the heterogeneity among credit-constrained agents as in Kaplan et al. (2018). The unfavorable
interpretation is that behavioral economics is too free. My own sense is that we are in a period of
explorations of theoretical possibilities, and that this is a good thing. One can at least discipline behav-
ioral models with microeconomic data (see e.g. Taubinsky and Rees-Jones (2017) for the measurement
of attention to taxes), and exploring those behavioral models is a fruitful enterprise. Those simple-to-
use behavioral models might even become the models of choice to a variety of issues whenever there
is non-standard behavior, with the understanding that they might be a metaphor for more complex
mechanisms, such as those coming from credit constraints.
This means that to simulate his future labor supply, the agent just imagine he’ll do like the rest
of the agents. However, we allow the agent to actively think of the optimization as on today’s labor
supply. This is a very close variant of Woodford (2013), who makes the assumption, verbatim: “I further
assume that households have no choice but to supply the hours of work that are demanded by firms,
at a wage that is fixed by a union that bargains on behalf of the households. A household then has a
single decision each period, which is the amount to spend on consumption.”
Proposition 20 (Consumption with active fiscal policy) Consider an agent maximizing utility U =
/∞ τ −t
EBR
t τ =t β u (cτ , Nτ ) subject to the law of motion for wealth (6), and the perception (136), and with
active fiscal policy, as in the setup of Sections IV.A and X.A. Up to second order terms (and for small
wealth kt ), consumption is:
+ ,
( 1 ! "
ĉt = bk kt + EBR
t τ −t
br r̂ BR (Xτ ) + bk ĉ (Xτ ) + bk T BR (Xτ ) , (137)
τ ≥t
R
where br = −1
,b
γR2 k
= r
R
. As usual, the chosen labor supply is given by Ntφ = ω (Xt ) c−γ
t .
This model is rather easier to handle, as it obviates the problem of choosing labor supply at all
φ
future dates.102 Then, the proof is much simpler. The result is the same, but we replace χ = φ+γ by 1,
r
so that bk = R .
8
where mcπ ∈ [0, 1] is the consumer’s attention to inflation. This makes it so that the perceived interest
rate is:
r̂ BR (Xt ) = mr (it − mcπ Et [π (Xt+1 )] − r̄) . (139)
it = jt + φπ πt ,
with φπ ≥ 0.
In a model with flexible prices and no capital, the output gap is always xt = 0. The behavioral IS
curve still imposes:
r n (Xt ) = r̂ BR (Xt ) .
Take for simplicity an economy with constant rtn = r n = jt . Then, we have:
φπ πt = mπ Et [πt+1 ] . (141)
9
Proposition 21 Determinacy in the flexible price economy) Take the flexible price economy, when the
consumer has pays only an attention mπ to inflation (with mπ = 1 in the rational case). We have
determinacy if and only if:
φπ > mπ . (142)
That is, the agent only partially sees the deviations of the economy from its trend.
In Step 1, the simulation handles the basic non-stationarity of the variables. In Step 2, the simulation
anchors in the trend value S∗,t , and enriches it partially to handle the dynamics.
To build intuition, let us take some examples. First, if there is no macro-capital variable, we just
generalized the baselines procedure. Indeed, take the case without any capital variable, so St = Xt .
Then, step 1 just generates S∗t = X̄, and step 2 generates what we had above (see (8)), Xt+1 =
(1 − m̄) X̄ + m̄GX (Xt , ϵt+1 ) .
Next, enrich the case with Kt+1 = Kt + g + bXt , where g is some trend growth rate. For instance,
Kt could be the permanent part of log productivity. Then, step 1 gives K∗t = K0 + gt and X∗t = X̄.
That is, the simulation sees the2baseline.
3 Next, Step
2 23gives deviations from that benchmark. Then,
BR k BR
under the BR simulation, Et Ŝt+k = m̄ Et Ŝt+k , where Ŝτ := Sτ − (0, Kt + g (τ − t)) is the
deviation from the baseline. This phenomenon is general, as the next Proposition records.
10
Proposition 22 Suppose that we have a system:
Kt+1 = bK K K K
K Kt + bX Xt + b + εt+1 , (145)
Xt+1 = bX
X Xt
X
+b + εX
t+1 , (146)
where εK , εX are mean-zero variables independent across periods. The mean of Xt satisfies:
X∗ = bX X
X X∗ + b . (147)
K∗,t+1 = bK K K
K K∗,t + bX X∗ + b , (148)
Proof We note that bX K = 0, meaning that the long-run trend doesn’t affect the short-run variables.
We can rewrite the system (145)-(146) as
St+1 = bSS St + bS .
so that 2 3 ! "t
EBR Ŝt = m̄t bSS Ŝ0 .
2 3 ! "
t
As the rational case corresponds to the special case where m̄ = 1, we have E Ŝt = bSS Ŝ0 . Hence,
2 3 2 3
we have EBR Ŝt = m̄t E Ŝt .!
11
around gt = 0. The following generalizes the basic behavioral IS curve. It extends in a behavioral
context previous analyzes of government spending (Eggertsson (2011); Woodford (2011)).
Proposition 23 (Model with government consumption). Given government consumption and deficits,
the basic two-equation behavioral New Keynesian model of Proposition 2 still holds, except that in the
IS curve the natural rate of interest given by
bg bd
rtn = rtn0 + (gt − MEt [gt+1 ]) + dt , (153)
σ σ
φ
where rtn0 is the “pure” natural rate of interest that prevails without fiscal policy, bg = φ+γ , and bd is
given in Proposition 7. The corresponding natural rate of consumption (i.e., the consumption level that
would prevail if prices were flexible) is:
1+φ
ĉnt = −bg gt + ζt . (154)
γ+φ
Proof of Proposition 23. In the proof, we consider the case with zero deficit – as deficits enter
linearly and are treated in Proposition 7. The aggregate resource condition is Yt = ct + Gt = eζt Nt (up
to second order terms due to price dispersion), i.e.
so
ω̂t = (φ + γ) ĉt + φ (gt − ζt ) . (156)
In the “natural” (i.e., flexible-price) economy we have ω̂t = ζt , so the natural rate of consumption
ĉnt satisfies
ζt = (φ + γ) ĉnt + φ (gt − ζt ) , (157)
i.e. (154). In that same natural natural economy, we still have (22):
0 1
ĉnt = MEt ĉnt+1 − σr̂tn ,
12
like in the basic model without fiscal policy. Hence, the Phillips curve does not change. !
The next proposition calculates the corresponding increase in GDP.
Proposition 24 (Impact of government spending with passive monetary policy). Suppose that at time
0, the government spends g0 , financed by a deficit d0 at time 0, and the central bank does not adjust the
interest rate i0 . Then, consumption changes by
ĉ0 = bd d0 ,
Ŷ0 = (1 + bd ) g0 ,
Proof of Proposition 24. By linearity, we can suppose that we start from the steady state (so
i0 = r0n0 = r̄). At time t ≥ 1, the economy will be fully at the steady state, with no deficit (i.e., no
deficit after payment of the interest rate on the debt, so dt = 0 for t ≥ 1). Hence, xt = πt = 0 for t ≥ 1.
At time t = 0, we have (using i0 = r0n0, and (153))
As ĉn0 = −bg g0 ,
ĉ0 = ĉn0 + x0 = bd d0 ,
and the GDP change is
Ŷ0 = g0 + ĉ0 = g0 + bd d0 .
!
13
Xt , so is seen only) with
* cognitive
) * discounting, and for simplicity we assume perceptions are otherwise
BR
correct, i.e. T̂ B̂t = T̂ B̂t .
Proposition 25 (Discounted Euler equation with sensitivity to budget deficits, in alternative formula-
tion) In the alternative formulation above, we have the following variant for Proposition 7 on the impact
of public debt. Because agents are not Ricardian, a temporary increase B̂t of public debt increases
economic activity. The IS curve (24) becomes:
) ) r* * ! "
xt = MEt [xt+1 ] + b̃d dt + 1 − B̂t − σ it − Et [πt+1 ] − rtn0 , (159)
R
where rtn0 is the “pure” natural rate with zero deficits (derived in (23)), dt is the budget deficit and
r φ
b̃d = R−rm Y φ+γ
(1 − m̄) is the sensitivity to temporary debt increases. When agents are rational, b̃d = 0,
but with behavioral agents, bd > 0. We can equivalently write this equation by saying that the behavioral
IS curve (25) holds, but with the following modified natural rate, which captures the stimulative action
of deficits:
b̃d ) ) r* *
rtn = rtn0 + dt + 1 − B̂t . (160)
σ R
Hence, this formulation is close in spirit to that of the main text. The main difference is the now
the “default” perception of debt (when simulating the future at time t) is B∗ (the steady state level of
debt), rather than Bt . Hence, the stimulative impact now also includes the temporary deviations from
steady state debt, Bt − B∗ . The “impact of the debt” can also be written:
) r* ) *
dt + 1 − B̂t = T̂ B̂t + B̂t , (161)
R
as dt = T̂t + Rr B̂t . ) *
A simple application is the following. When we have T̂ B̂t = −ψB B̂t , the impact on the debt is:
) ) r* *
b̃d dt + 1 − B̂t = b̃d (1 − ψB ) B̂t . (162)
R
For instance, the natural rate becomes:
b̃d
rtn = rtn0 + (1 − ψB ) B̂t . (163)
σ
So, temporarily high debt has a stimulative effects on the natural interest rate, as it makes agents feel
richer.
14
r
with bY = m
R Y
. Now, as kt = Bt = B∗ + B̂ (Xt ), the terms in B∗ cancel out in (164), as
( 1 ) r *
B∗ + − B∗ = 0.
τ ≥t
Rτ −t R
This is a form of Ricardian equivalence: agents do not react to the “permanent” part of debt. Only
the deviations of debt from the baseline B∗ matter. Hence, the outcome will be the same as if B∗ = 0,
and we will assume that B∗ = 0 in what follows.
Using the language of forward operators (F yt := yt+1 ), we can rewrite (164) as:
) *
ĉt = bk B̂t + (1 − β m̄F )−1 bY ĉt + bk T̂t ,
i.e.,
(1 − β m̄F ) ĉt = bY ĉt + bk Dt
with
Dt := (1 − β m̄F ) B̂t + T̂t = T̂t + B̂t − β m̄B̂t+1 ,
) *
and given B̂t+1 = R B̂t + T̂t , we have, using
r r
dt := Tt + Bt = T̂t + B̂t (165)
R R
) * ) ) r* *
Dt = (1 − m̄) Tˆt + B̂t = (1 − m̄) dt + 1 − B̂t .
R
We have derived ) ) r* *
ĉt = MEt [ĉt+1 ] + b̃d dt + 1 − B̂t ,
R
bk (1−m̄)
with b̃d = 1−bY
, i.e.
r φ
b̃d = (1 − m̄) . (166)
R − rmY φ + γ
Reintegrating interest rates,
) ) r* * ! "
ĉt = MEt [ĉt+1 ] + b̃d dt + 1 − B̂t − σ it − Et [πt+1 ] − rtn0 .
R
The rest of the proof is as in the proof of Proposition 7. !
Proposition 26 (Optimal mix of fiscal and monetary policy in a ZLB environment). The following
monetary and fiscal policies yield the first best (xt = πt = 0) at all dates: During the crisis (t ∈ (T1 , T2 )),
use fiscal policy
σr n0
dt = − t ,
bd
15
i.e. run a deficit with low interest rates, it = 0. After the crisis (t ≥ T2 ), pay back the accumulated
debt by running a government fiscal surplus and keeping the economy afloat with low rates, e.g. dt =
R−1 (BT2 − B0 ) (1 − ρd ) ρt−T
d
2
< 0 for some ρd ∈ (0, 1), and adjust it = rtn ≡ rtn0 + bdσdt to ensure full
macro stabilization, xt = πt = 0. Before the crisis (t < T1 ), set it = dt = 0.
Proof. The proof is simply by examination of the basic equations of the NK model, (28)-(29). We
adjust the instruments so that xt = πt = 0 at all dates. Note that there are multiple ways to soak up
the debt after the crisis, so that dt = R−1 (BT2 − B0 ) (1 − ρd ) ρt−T
d
2
is simply indicative. !
The ex-ante preventive benefits of potential ex-post fiscal policy. Proposition 26 shows
that “the possibility of fiscal policy as ex-post cure produces ex-ante benefits”. Imagine that fiscal policy
is not available. Then, the economy is depressed at the ZLB during (T1 , T2 ). However, it is also depressed
before: because the IS curve is forward looking, output threatens to be depressed before T1 , and that
can put the economy to the ZLB at a time T0 before T1 .104 Hence, the threat of a ZLB-depression
in (T1 , T2 ) creates an earlier recession at (T0 , T2 ) with T0 < T1 . Intuitively, agents feel “if something
happens, monetary policy will be impotent, so large dangers loom”. However, if the government has
fiscal policy in its arsenal, the agents feel “worse case, the government will use fiscal policy, so there is
no real threat”, and there is no recession in (T0 , T1 ). Hence, there is a possibility of fiscal policy as an
ex-post cure to produce ex-ante benefits.
In general, monetary and fiscal policies are substitutes (dt and it enter symmetrically in (28)), so a
great number of policies achieve the first best. However, fiscal policy dt helps monetary policy if there is
a constraint (e.g. at the ZLB), so the possibility of future fiscal policy is a complement to the monetary
policy (as it relieves the ZLB).105
16
where the last equality comes from the fact that
! ! " "
vaa (a (1, St ) , St , 1) = vaa a md , 0 , 0, md + O (ε) .
Let us first take the case where inattention enters linearly, i.e. when a (m, St ) = ar (mSt ) + O (ε2 )
where ar (St ) = a (1, St ) is the rational response. This is the simpler case, and covers the situation with
mr , my , mfπ , mfx (and it was the case in Gabaix (2014)). We have:
! " ! "
a (m, St ) = ar (mSt ) + O ε2 = ar (0) + ∂ar · mSt + O ε2
and # '
1 ! d " ! ! d " " ! d " ! "
L (St , m) = St am,S m , 0 vaa a m , 0 , 0, m am,S m , 0 St (1 − m)2 + o ε2 .
′ d
2
which gives 71, as the agent takes this leading quadratic approximation of her utility losses when
choosing optimal attention m.
Let us next take the more complex nonlinear case where a (m, St ) = ar (H (m) St ) + O (ε2 ) for a
non-linear function H(m), such that H (0) = 0 and H (1) = 1. This is for instance the case when
considering m̄, where we have a non-linear response (see (225)). The same algebra shows:
! " ! "
a (m, St ) = ar (H (m) St ) + O ε2 = ar (0) + ∂ar · H (m) St + O ε2
so
! "
ǎ (m, St ) = ∂ar · (H (m) − 1) St + O ε2
As, evaluating derivatives at St = 0,
we also have
! " H (m) − 1 ! " ! " ! "
ǎ (m, St ) = am,St md , 0 ′ d
St + O ε2 = am,St md , 0 J (m) (m − 1) St + O ε2
H (m )
where
H (m) − 1
J (m) := (168)
(m − 1) H ′ (md )
and
# '
1 ′ ! d " ! ! d " " ! d " ! "
L (St , m) = Sam,S m , 0 vaa a m , 0 , 0, m am,S m , 0 S J (m)2 (1 − m)2 + o ε2 .
d
(169)
2
17
When H (m) is linear, J (m) = 1, but otherwise it’s a bit different from 1. Hence, to be formal, we
assume that the agent also does a Taylor expansion of the losses in m, which implies that the agent
replaces J (m) by 1.
Proof of Proposition 27 Let us do some calculations first, using the notations of Section I.D.
At the steady state (which has q = µ = 0, c = 1), differentiating (14), and under the optimal subsidy
τ = 1ε , we have:
BR
vqq = vqq = 1 − ε. (171)
Next, as
∞
(
V BR
(qit , Xτ ) := EBR
t (βθ)τ −t v BR (qit , Xτ )
τ =t
In total, the expected losses from inattention are: 12 Λf (1 − m)2 = 21 Vqq E [qm̄
2
](1 − m)2 , i.e.
1−ε 0 21
Λf = E qm̄ .
1 − βθ
Now, let us go to the case of a one-factor model driven by ζt , the productivity shock, which follows
an AR(1) with coefficient ρ, and the central bank follows a Taylor rule. Then, we have πt = bπζ ζt and
µt = bµζ ζt in equilibrium. This implies that
18
for a coefficient
∞
1 ( ! ! " "
qm̄,ζ = (1 − βθ) (βθm̄)k k mfπ ρ + · · · + ρk bπζ − mfx bµζ
m̄ k=0
∞ % &
1 ( k f1−ρ
k
π f µ
= (1 − βθ) (βθm̄) k mπ ρb − mx bζ ,
m̄ k=0 1−ρ ζ
evaluated at m̄ = m̄d . ) λf σ2 *
ε−1 2
So using Proposition 13, we have m̄ = A Kf , m̄ with λf =
f ζ d
q .
1−βθ m̄,ζ
Endogenization mfπ and mfx The same reasoning holds for other attention factors. Equation
(27) gives:
∞
(
qmfx = (1 − βθ) (βθm̄)k Et [−µt+k ] , (176)
k=0
1 − βθ µ
qmfx ,ζ = − b . (177)
1 − βθm̄ρ ζ
hence
(1 − βθ) βθm̄ρ
qmfπ ,ζ = bπζ . (178)
(1 − βθm̄) (1 − βθm̄ρ)
where again expressions are evaluated at m̄ = m̄d .
19
Proposition 28 In the one-factor economy where all shocks come from TFP and the central banks
follow a Taylor rule, we have (with rˆt = rt − r̄, cˆt = ct − c̄):
! n "
(r̂tn , xt , πt , rˆt , cˆt , ŷt , µt ) = brζ , bxζ , bπζ , brζ , bcζ , byζ , bµζ ζt , (179)
with
n 1+φ
brζ = − (1 − ρM) , (180)
σ (γ + φ)
σ n
bxζ = κσ brζ , (181)
1 − Mρ + (φπ − ρ) 1−ρβ f + σφx
κ
bπζ = bx , (182)
1 − ρβ f ζ
brζ = (φπ − ρ) bπζ + φx bxζ , (183)
1+φ
bcζ = byζ = bxζ + . (184)
γ+φ
bµζ = − (φ + γ) bxζ (185)
−1 1 + φ
r̂tn = (1 − ρM) ζt ,
σ γ+φ
n 1+φ
hence the value of brζ . Next, by definition of xt , ĉt = xt + ĉnt and using (21), ĉnt = ζ
γ+φ t
, which gives
the value of bcζ . Next, as ŷt = ĉt , byζ = bcζ . Also, brζ comes from the fact that:
Next, in our AR(1) world, the 2-equation model of Proposition 2 reads (with β f := βM f ):
Discounted Euler equation in the 2-period model We will see that the consumer satisfies a
discounted Euler equation. Call R = 1/β the steady state interest rate, so that R0 = R + r̂0 and the
20
perceived interest rate is: R0 = R + mr r̂0 . Rewrite (77) as
% &
cd + m̄ĉ1
c0 = b c0 + 1 ,
R + mr r̂0
) *
cd1
where cd0 = b cd0 + R
. Then, we have:
% &
m̄ĉ1 − mRr r̂0
ĉ0 = b ĉ0 + ,
R
i.e.
b 1) mr *
ĉ0 = m̄ĉ1 − r̂0 .
1−bR R
b 1 b 1
In the rational model, we have c0 = c
1−b R 1
and c0 = c1 = 1. Hence, 1−b R
= 1. We obtain:
mr
ĉ0 = m̄E0 [ĉ1 ] − r̂0 . (186)
R
This is a “discounted Euler equation” (with discount factor m̄), i.e. instead of the rational Euler
equation, ĉ0 = E [ĉ1 ] − r̂0 . The same factor m gives power to fiscal policy, and yields a discounted Euler
equation.
Derivation of (80). Call k1 the wealth at the beginning of period 1 (before receiving labor
income and profit), and T1 the transfer received from the government, and I1 the profit income from
the oligopolistic firms (so that ω1 N1 + I1 = c1 when aggregating). The rational value function at time
1 is, given the labor supply fixed at 1 at t = 1:
where m̄ is optimized upon in the sparse max. Taking here the m̄ as given, then the decision is simply:
so that the intra-period labor supply condition ω0 uc0 + uN0 = 0 holds. Given that Vk1 = uc1 , we obtain
21
Now, we have Vkr1 = u′ (c1 ) = u′ (k1 + y1 ) with y1 = ω1 N1 + I1 + m̄T1 , so
1 βR0
= ,
c0 c1
c1 y1
with c1 = y1 + R (y0 − c0 ) i.e. c0 + R
= y0 + R
, and with the Euler equation c1 = βR0 c0 :
% & % &
1 ys y1 + m̄ŷ1
c0 = y0 + 1 = b y0 + .
1+β R R
V (k, X) = max u (c, N) + βEV (R (X) (k + ȳ + my ŷ (X) + w (X) (N − N (X)) − c) , m̄ (ΓX + ε)) ,
c,N
(187)
and optimal consumption satisfies uc (c (k, X) , N) = Vk (k, X) (independently of N because utility is
separable), so that cX = VukX
cc
and (using the fact that we linearize around c̄ = N̄ = 1):
VkX
cX = − , (188)
γ
which gives ĉt = cX Xt . Hence, to derive consumption, we simply need to calculate VkX .
To calculate VkX , I use the general procedure outlined in Gabaix (2016a), Section 10.1 — but the
present derivation is self-contained. Call a = (c, N) the action, and define:
Behavior at the default, steady state model. I call the default model the model at the steady state
(X = 0), with steady state values for income, wage and interest rate, and only private wealth kt
potentially variable (but close to the steady state value, which is 0). At the default (with constant
φ
interest and income), the optimal policy is c (k) = ȳ+bk k, bk = χ Rr , χ = φ+γ , and N (k) = 1− Rr (1 − χ) k
(linearizing for small k). This is the permanent-income analog of Section XI.B.1, when the agent
consumes a fraction Rr of his wealth every period on higher consumption and leisure. So, using c−γ = Vk ,
we have:Vk (k) = (1 + bk k)−γ so at k = 0
γφ r
Vk = 1, Vkk = − ,
φ+γR
106
Here I use the notation R (X) = 1 + r̄ + mr r̂ (X) for the perceived gross interest rate. To lighten up the notation, I
use R (X), rather than the slightly more precise RBR (X).
22
r γ φ r
Nk = − , ck = .
Rφ+γ φ+γR
We use the notation Dk for the total derivative with respect to k: for a function f (k, a),
so that
Dk K (k, 0, a (k, 0)) = β. (193)
Also (189) gives:
r γ
Dk KX = wX Nk = − wX . (194)
Rφ+γ
We also calculate, using the first order condition NX = wX
φ
− φγ cX and c = y in equilibrium:
% &
γ wX γ wX
KX = my ŷX − NX = my ŷX + ĉX − = my + ŷX − .
φ φ φ φ
We next proceed the to main derivation. We first differentiate (190) w.r.t. X, using the envelope
theorem:
VX (k, X) = βVk · (R (X) KX (k, X, a (k, X)) + RX K (k, X, a (k, X)) , m̄ΓX) + βVX m̄Γ.
Next, we totally differentiate w.r.t k, and evaluate all derivatives at (k, X) = (0, 0), using RDk K = 1,
This gives:
23
We are now almost done. Let us observe that
( (
(1 − β m̄Γ)−1 Xt = (β m̄Γ)τ −t Xt = (β m̄)τ −t Et Xτ .
τ ≥t τ ≥t
Given that
ĉt = cX Xt , ŷ (Xτ ) = yX Xτ , r̂ (Xτ ) = rX Xτ ,
this equivalently expresses:
( # '
τ −t r β2
ĉt = Et (β m̄) mY ŷX − RX Xτ (197)
τ ≥t
R γ
+ % &,
( τ −t r β2
= Et (β m̄) mY ŷ (Xτ ) − mr r̂ (Xτ ) . (198)
τ ≥t
R γ
24
Momentum in revision and cognitive discounting Here I detail how cognitive discounting might
explain (a), when viewed as a theory of the aggregate forecasting agent. Coibion and Gorodnichenko
(2015a) run the regression:
and find that forecast revision predicts ex-post forecast errors (when averaging across agents), i.e. β̂ > 0.
I consider the univariate case, and show how cognitive discounting framework maps to this result.
The true dynamics is xt+1 = Γxt + εt+1 , but agent perceives instead xt+1 = m̄(Γxt + εt+1 ). The
subjectively expected value at time t of the future variable xt+h is
Ft xt+h = EBR h
t xt+h = (m̄Γ) xt . (199)
Hence forecast revision for xt+h between date t − 1 and date t are:
We note that when m̄ = 1, then forecast revision is exactly in the rational case: Γh εt , i.e. one
iterates the shock forward to revise forecast.
Now we turn to the ex-post forecast error:
1 − m̄h
βh ≥ β h ≡ ,
m̄h
λ
which is quite similar to the βCG = 1−λ in their theory framework, where λ is the level of information
h
rigidity. If we set λ = 1 − m̄ , i.e. my inattention to future variable is the source of rigidity, then we
map our theory to their empirical finding. Empirically, they find λ ≃ 0.23 at the one-period horizon107 ,
which would give m̄ ≃ 0.73 – much in line with the calibration. For other horizons, the standard errors
become big, but they find λ ∈ [0.3, 1] .
In conclusion, applying the cognitive discounting model to the professional forecasters of Coibion
and Gorodnichenko (2015a), one estimates m̄ ≃ 0.73. Of course, the model is not about professional
forecasters, but about the average consumer, which might have a lower m̄. I conclude that use the
Coibion and Gorodnichenko (2015a), coupled the cognitive discounting parameterization, would be a
107 λ
See their Figure 1, β = 1−λ = 0.3, which gives λ = 0.23.
25
0 0.8 0.9
-0.5 0.7
0.85
-1 0.6
-1.5 0.5
0.8
-2 0.4
0.75
-2.5 0.3
-3 0.2
0.7
-3.5 Rational 0.1
Behavioral
-4 0 0.65
0 5 10 15 20 0 5 10 15 20 0 5 10 15 20
Output Gap Inflation Log price Level
16 1.2
14
1
12
0.8
10
8 0.6
6
0.4
4
0.2
2
0 0
0 5 10 15 20 0 5 10 15 20
Nominal Interest Rate Cost-push Shock
Figure 6: This figure shows the optimal interest rate policy in response to a cost-push shock (νt ),
when the central bank follows the optimal discretionary strategy. The behavior is very similar in
the two cases, as the central bank does not rely on future commitments for its optimal policy. This
illustrates Proposition 6. Units are percentage points. The cost-push shock follows an AR(1) process
with autocorrelation ρν = 0.2.
fruitful path of future research – potentially estimating the whole “term structure of attention” with
some mr and m̄.
When condition (35) fails, one of the eigenvalues of AZLB is greater than 1 in modulus. Then,
limT →∞ ∥ATZLB b∥ = ∞ (it is easy to verify that b is not exactly the eigenvector corresponding to
the root less than 1 in modulus). Hence, limT →∞ ∥z0 (T ) ∥ = ∞. Furthermore, this explosion is a
recession: given that the entries of AZLB are positive, and those of b are negative, each of the terms
108
Note that Werning (2015) parameter’s is κ̄ = 0.5 year−2 in continuous time, it is κ = κ̄ 412 in discrete time with
quarterly units.
26
Traditional case Behavioral case
0 0
-500 -5
-1000 -10
Output gap
Output gap
-1500 -15
-2000 -20
-2500 -25
This paper This paper
Eggertsson & Woodford (2003) Eggertsson & Woodford (2003)
Werning (2012) Werning (2012)
-3000 -30
0 5 10 15 20 25 30 35 40 45 0 5 10 15 20 25 30 35 40 45
Horizon Horizon
Figure 7: Cost of ZLB under various calibrations. This paper (connected line) uses κ = 0.11, σ = 0.20,
β = 0.99 for quarterly time units. Eggertsson and Woodford (2003a) (dashed line) uses κ = 0.02,
σ = 0.5, β = 0.99. Werning (2015) (dashed and dotted) uses κ = 0.0312, σ = 1, β = 0.99. Units are
percentage points.
! "
in I + AZLB + ... + ATZLB
−1
b is negative, hence z0 (T ) has unboundedly negative inflation and output
gap.
When condition (35) holds, all roots of AZLB are less than 1 in modulus. Hence, limT →∞ z0 (T ) =
−(AZLB − I)−1 b, a finite value.
Proof of Lemma 3 The proof mimics the ones in Woodford (2003c) and Galı́ (2015). We have
( 0 ∞
1 1
W = − uc cE0 β t (γ + φ) x2t + εvari (pt (i)) ,
2 t=0
where vari (pt (i)) is the dispersion of prices at time t. As in Woodford (2003, Chapter 6),
∞
( ( ∞
θ θ
β t vari (pt (i)) = β t πt2 + v−1
t=0
(1 − θ) (1 − βθ) t=0 1 − βθ
∞
γ+φ( t 2 θ
= β πt + v−1 ,
κ̄ t=0 1 − βθ
using the value of κ̄ in Proposition 2, and calling v−1 := vari (p−1 (i)).
Hence,
(∞ # '
1 t 2 γ+φ 2 1 θ
W = − uc cE0 β (γ + φ) xt + ε πt − uc cε v−1
2 t=0
κ̄ 2 1 − βθ
ε ( t ) 2 κ̄ 2 *
∞
1
= − uc c (γ + φ) E0 β πt + xt + W−
2 κ̄ t=0 ε
( 0 ∞
1 1
= − KE0 β t πt2 + ϑx2t + W− ,
2 t=0
27
with K := uc c (γ + φ) κ̄ε ,ϑ := κ̄ε ,and
1 θ
W− := − uc cε vari (p−1 (i)) . (200)
2 1 − βθ
where Ξt are Lagrange multipliers. The first order conditions are: Lxt = 0 and Lπt = 0, which give re-
! "
spectively −ϑxt +κΞt = 0 and −πt −Ξt +M f Ξt−1 1t>0 = 0, i.e. Ξt = ϑκ xt and πt = −ϑ
κ
xt − M f xt−1 1t>0 .
Let us now explicitly solve for the dynamics of xt . We define α := ϑκ . For t > 0, using the Phillips
! "
curve and πt = −α xt − M f xt−1 , we get,
! " ! " κ νt
− xt − M f xt−1 = −βM f xt+1 − M f xt + xt + ,
α α
i.e.
νt+1
G (F ) xt = , (201)
α
where F is the forward operator (F k xt := xt+k ), and
) ! "2 κ*
G (X) := βM f X 2 − β M f + 1 + X + Mf. (202)
α
Given that
G (0) > 0 > G (1) , (203)
one of the roots of G (X) = 0, which we call Λ, is between 0 and 1, while the other root is greater than
1. Given that νt = ν0 ρtν , the solution of (201) has the form
xt = aΛt+1 + bρt+1
ν
ν0 ρt+1
Plugging this in (201) gives bG (ρν ) ρt+1
ν = α
ν
, i.e.
ν0
b= . (205)
αG (ρν )
This also gives an explicit path for πt and pt , via (40) and (41).
To study the sign of xt , suppose without loss of generality a positive cost-push shock, i.e. ν0 > 0.
We recall (203). If ρν ∈ (Λ, 1), then G (ρν ) < 0 so xt < 0 for all t. If ρν ∈ [0, Λ), then G (ρν ) > 0 and
again xt < 0 for all t.110 We conclude that at all dates, sign (xt ) = −sign (ν0 ). Economically, a cost
109
For simplicity we don’t consider separately the singular case ρν = Λ. Then, xt = (−at + b) ρt+1
ν .
110
In the singular case ρν = Λ has the same sign by continuity.
28
push shock always entails a negative output gap (to “fight inflation”) under the optimal commitment
policy. As by (41), - .
t−1
(
−ϑ ! "
pt = xt + 1 − M f xτ ,
κ τ =0
and xt < 0 at all dates, we see that pt is strictly positive at all date, and so is its long run value, under
the optimal commitment policy following a cost-push shock.
Proof of Proposition 6 The central bank today takes its future actions as given, and chooses xt ,
πt , it to minimize today’s loss − 21 (πt2 + ϑx2t ) subject to the behavioral IS equation and behavioral NK
Phillips curve. This is equivalent to
1! 2 "
max − πt + ϑx2t subject to πt = βM f Eπt+1 + κxt + νt ,
πt ,xt 2
1! 2 " ! "
L=− πt + ϑx2t + Ξ βM f Eπt+1 + κxt + νt − πt .
2
The first order conditions are: Lxt = 0 and Lπt = 0, i.e. −ϑxt + κΞ = 0 and −πt − Ξ = 0, which
together yields πt = − ϑκ xt . The explicit value of it is in Section XII.
Proof of Proposition 6: Complements Here is the derivation of it . Substitute (42) into the
Phillips curve:
) κ* βM f ϑ ϑ
πt = βM f Et πt+1 + κ − πt + νt ⇒ πt = 2
Et πt+1 + νt .
ϑ ϑ+κ ϑ + κ2
Iterating forward:
∞ %
( &τ −t (∞ % &τ −t
βM f ϑ ϑ βM f ϑρν ϑ ϑ 1
πt = Et ντ = νt = f ϑρ νt
τ =t
ϑ+ κ2 ϑ + κ2 τ =t
ϑ + κ2 ϑ+κ2 2
ϑ+κ 1− βM
2
ν
ϑ+κ
ϑ
= νt = ϑΦνt
ϑ + κ − βM f ϑρν
2
29
Proof of Proposition 7 To lighten up the proof, we take the case with no deviation of the interest
rate. The general case is the same, as all things enter additively. I give the proof in the case my = 1
(so that mY = 1) – the 2018 NBER Working paper version of this paper has the general case.
We start from Proposition 18, which gives optimal consumption with fiscal policy.112 We have, with
bk = Rr χ, bY = Rr ,
+ ,
( 1
ĉt = bk kt + EBR
t (bk T (Xτ ) + bY ĉ (Xτ ))
τ ≥t
Rτ −t
( EBR [T (Xτ )]
t
= bk Bt + bk + Ft ,
τ ≥t
Rτ −t
with
( 1 ( m̄τ −t
Ft := bY EBR
t τ −t
ĉ (Xτ ) = bY Et ĉ (Xτ ) . (206)
τ ≥t
R τ ≥t
Rτ −t
so we have:
- + - τ −1
.,.
( 1 r (
ĉt = bk Bt + bk τ −t
− Bt + Et m̄τ −t dτ − r du + Ft (207)
τ ≥t
R R u=t
+ - τ −1
.,
( m̄τ −t (
= Et bk d (Xτ ) − r d (Xu ) + Ft . (208)
τ ≥t
Rτ −t u=t
We see that the impact of Bt cancels out, a form of partial Ricardian equivalence. Old debt (Bt ) does
not make the agent feel richer. But a new deficit today (d (Xt )) does. Let us calculate the cumulative
impact of a deficit in (208)
( m̄k m̄
r m̄ R (1 − m̄)
R
J := 1 − r =1−r m̄ =1− = .
k≥1
Rk 1− R
R − m̄ R − m̄
So,
+ ,
( m̄τ −t
ĉt = Et bk Jd (Xτ ) + Ft
τ ≥t
Rτ −t
+ ,
( m̄τ −t ) *
= Et bk Jd (Xτ ) + b̃Y ĉ (Xτ )
τ ≥t
Rτ −t
30
So, multiplying by R and gathering the cˆt terms:
Proposition 29 (Consumption given beliefs) Consider an agent maximizing over (cτ , Nτ ) utility Ut =
/∞ τ −t
EBR
t τ =t β u (cτ , Nτ ) subject to the law of motion for wealth (49). Up to second order terms (and
for small wealth k0 ), consumption is:
+ ,
r ( 1 ! "
ct = kt + ȳ + EBR t τ −t
br r̂ BR (Xτ ) + by ŷ BR (Nτ , Xτ ) , (210)
R τ ≥t
R
−1 r̄
where expectations are taken under the agent’s subjective model of the world, br = γR2 , by = R , and Nτ
labor supply at the perceived optimum. The chosen labor supply is given by Ntφ = ŷN
BR
(Nt , Xt ) c−γ
t .
It is stated as a function of an endogenous labor supply, because this is the form that is most useful
in some derivations (Section XI.B develops the case that explicitly solves for labor supply). Versions of
this proposition were proven a number of times with minor variants (e.g. Eusepi and Preston (2011);
Woodford (2013); Gabaix (2016a); Auclert (2019)), but for completeness let us derive it.113 !
Proof of Proposition 29. For simplicity, we take the deterministic case (as we consider first order
Taylor expansions, the general case is the deterministic case where the path of variables are their
expected values). The agent wants to maximize, over ct and Nt :
- .
( ( ! "
L= β t u (ct , Nt ) + λ k0 + qt ȳ + ŷ BR (Nt , Xt ) − ct , (211)
t≥0 t≥0
8t−1 ! BR
"
where qt := 1/ τ =0 1 + r̄ + r̂ (Xτ ) . Here we consider the decision at time 0, which is just a
113
See also the proof of Lemma 4.2 in Gabaix (2016a) for another style of derivation, using dynamic programming.
31
normalization. Consider first the problem of optimizing L over ct (taking the value of yt := ȳ +
) t *ψ
ŷ BR (Nt , Xt ) as given), the FOC is β t c−γ
t = λqt , so c t = c β
0 qt (with ψ := γ1 ), for some c0 = λ−ψ .
/ /
With Ω := k0 + t≥0 qt yt the total perceived wealth, the perceived budget constraint is Ω = t≥0 qt ct =
/
c0 t≥0 β tψ qt1−ψ . This gives consumption at decision time:
/
k0 + t≥0 qt yt
c0 = / . (212)
β tψ q 1−ψ
t≥0 t
From there, let us see the impact of a small change in income. We have, at the default value of interest
/ ! / "
rates, qt = β t , so t≥0 β tψ qt1−ψ = 1−β
1
, so c0 = (1 − β) k0 + t≥0 β t yt . This yields by = 1 − β = Rr in
(210). The impact of the interest rate on current consumptions is similar, though a little tedious (it is
done in Section XII).
Next, for labor supply at time 0, the behavioral agent optimizes (211) given his perceived model of
the world. This gives LN0 = 0, i.e. −N0φ + λŷN BR
(N0 , X0 ) = 0. As we saw that c−γ
0 = λ, we obtain
N0φ = c−γ
0 ŷ BR
N (N0 , X0 ). !
Application to this paper’s behavioral agent When my = 1, and no initial wealth. This is the
simplest case. The behavioral agent perceives his dynamic budget constraint is (49) and (50). Hence,
we apply Proposition 29 to (50). At the optimum policy, we have Nt = N (Xt ) under the perceived
motion for Xt , so the planned labor supply also verifies Nτ = N (Xτ ), so ŷ BR (Nt , Xt ) = ŷ (Xt ). Now,
using cognitive discounting (10), we have (51). This gives the consumption:
+ , + ,
( 1 ) r * ( m̄τ −t ) r *
ĉt = EBR
t τ −t
br r̂ BR (Xt ) + ŷ (Xt ) = Et τ −t
br r̂ (Xt ) + ŷ (Xt ) .
τ ≥t
R R τ ≥t
R R
With a general my or non-zero initial wealth. Here things are more complex, because the aggregate
wealth is 0, and the agent plans to have non-zero wealth next period (as he misperceives income),
so the agent doesn’t plan that her future labor supply will be equal to the aggregate labor supply,
Nτ = N (Xτ ). Section XI.B gives the proof.
Another method: Dynamic programming with Taylor expansion. The following method is a bit less
intuitive, but may be handy to automatize when considering medium-scale extensions of this model.
The subjective value function of the agent satisfies:
1 (
µ= / , Ω = k0 + qt ytBR ,
tψ 1−ψ
t≥0 β qt t≥0
32
8 ! "
with qt = 1/ t−1 BR
τ =0 1 + rτ . To lighten up the notation, I drop here the BR superscripts—still remem-
bering that we reason in the space of perceived interest rate and incomes.
I linearize around the steady state, which has qt = β t and 1 = βR = β (1 + r̄). This gives, at the
ȳ
steady state, µ = 1 − β = Rr̄ = by , and Ω = k0 + 1−β , so c0 = µΩ = Rr k + ȳ.
The impact of a change dyτ is easy to derive:
dyτ dyτ
dc0 = µdΩ = µ = by .
Rτ Rτ
This implies:
∂c0 1
= by τ .
∂yτ R
The impact of an interest rate is more delicate. Consider a change change drτ , for just one date τ .
It creates a bond price change dqt = R−1
t+1 drτ 1t>τ , so that
( ( −1 ( −1 −1
dqt = t+1
dr 1
τ t>τ = t+1
drτ = τ +1
drτ .
t≥0 t≥0
R t≥τ +1
R rR
This gives
dµ ( r (
= −µ (1 − ψ) β tψ qt−ψ dqt = − (1 − ψ) dqt
µ t≥0
R t≥0
r 1 drτ
= (1 − ψ) dr τ = (1 − ψ) .
R rRτ +1 Rτ +2
Also,
( −ȳ
dΩ = ȳ dqt = drτ .
t≥0
rRτ +1
dµ drτ r −ȳ
dc0 = µΩ + µdΩ = c0 (1 − ψ) τ +2 + drτ
µ R R rRτ +1
% &
drτ rk0 drτ
= (−ψc0 + c0 − ȳ) τ +2 = −ψc0 +
R R Rτ +2
br
= τ drτ ,
R
r̄
k0 −ψc0 −1
with br = R R2
. In the main text, we linearize around c0 = c̄ = 1, k0 = 0, so br = γR2
. We just
showed that:
∂c0 1
= br τ .
∂rτ R
Now, since there is no capital in the NK model, we have ŷτ = ĉτ : income is equal to aggregate
33
r mr
demand. Hence, using b̃y := m
R Y
and b̃r := br mr = − γR2 , (213) becomes:
+ ,
( m̄τ −t ) *
ĉt = Et τ −t
b̃y ĉτ + b̃r r̂τ . (214)
τ ≥t
R
m̄ r m̄
ĉt = b̃y ĉt + b̃r r̂t + Et [ĉt+1 ] = mY ĉt + b̃r r̂t + Et [ĉt+1 ] .
R R R
m̄Et [ĉt+1 ]+Rb̃r r̂t
Multiplying by R and gathering the ĉt terms, we have ĉt = R−rmY
. This suggests defining
m̄ −Rb̃r mr
M := R−rmY
and σ := R−rmY
= γR(R−rmY )
, and we get (19). This then translates into (25).
Proof of Equation (57) If the firm were free to choose its real (log) price qit freely, it would
∗ 1−τ
choose price qit∗ maximizing (14), i.e. eqit = 1− 1f MCt . The subsidy τf = 1ε was chosen to eliminate the
ε
monopoly distortion on average.
The FOC for the (subjectively) optimal flexible price is qi∗,BR (Xτ ) := argmaxqi v BR (qi , Xτ ). For
firms facing the Calvo pricing friction, we have, much as in the traditional model, that the price is the
weighted average of future optimal prices:114
( 2 3
qit = (1 − βθ) (βθ)τ −t EBR
t qi
∗,BR
(Xτ ) , (215)
τ ≥t
EBR
t [Π (Xτ )] = m̄τ −t Et [Π (Xτ )] , EBR
t [µ (Xτ )] = m̄τ −t Et [µ (Xτ )] .
114
/ τ −t BR
The proof is as in the traditional model: the FOC of problem (17) is EBR τ ≥t (βθ) vqi (qit , Xτ ) = 0 and
/ ) * ) *
∗,BR BR τ −t BR ∗,BR ∗,BR
linearizing around qi (Xτ ), the FOC is E τ ≥t (βθ) vq q
i i
qi (X τ ) , X τ · q it − q i (X τ ) = 0. Tak-
) *
∗,BR ∗,BR
ing the Taylor expansion around 0 disturbances so qi (Xτ ) close to 0, the terms vqBR i qi
qi (Xτ ) , Xτ are ap-
proximately constant and equal to vqBR (0, 0) up to first order terms, and the FOC is (up to second order terms)
/ ) * i qi
BR τ −t ∗,BR
E τ ≥t (βθ) qit − qi (Xτ ) = 0, which gives (215).
34
So, we have the following counterpart to the equation right before (G16):
( 0 f 1
qit = (1 − βθ) (βθ)τ −t EBR
t mπ Π (Xτ ) − mfx µ (Xτ )
τ ≥t
( 0 1
= (1 − βθ) (βθ)τ −t m̄τ −t Et mfπ Πτ − mfx µτ .
τ ≥t
! "′
Proof of Proposition 12 The state vector is zt = xt , πt , πtd . We can write the system of
! "′
Proposition 11, together with the Taylor rule, as Et zt+1 = Bzt + b̃ at , π̄tCB , for a matrix B and
coefficient b̃, where as before at = jt − rtn . Calculations show that:”
⎛ 1+σφ ⎞
M
x
+ Mσκβ f Mσ
(φπ − β1f − (1 − ζ)η) − M
σ
(1 − η − β1f )
⎜ ⎟
B=⎝ − βκf 1
βf
+ (1 − ζ)η 1 − η − β1f ⎠. (217)
0 (1 − ζ)η 1−η
! "′
To study equilibrium multiplicity, we dispense with the forcing term b̃ at , π̄tCB : indeed, the difference
between two candidate equilibria will satisfy Et zt+1 = Bzt . Hence, we study the system Et zt+1 = Bzt ,
Consider also the characteristic polynomial of B, Φ (Λ) := det (ΛI − B) (with I the identity matrix),
8
which factorizes as Φ (Λ) = 3i=1 (Λ − Λi ), where the Λi ’s are the eigenvalues of B.
Inflation πtd is a predetermined variable, not a jump variable. Hence, for the Blanchard and Kahn
(1980) determinacy, B needs to have 1 eigenvalue less than 1 in modulus (corresponding to the prede-
termined variable πtd ), and 2 greater than 1 (corresponding to the free variables xt , πt ). This implies
that a necessary condition is Φ(1) > 0. We can calculate this term:
Mβ f (1 − βM f )(1 + σφx − M)
Φ(1) = φπ − 1 + ζ > 0, (218)
κση κσ
which is equivalent to the behavioral Taylor criterion, (62). This, however, is not sufficient. The
sufficient conditions are the “auxiliary Routh-Hurwitz” conditions, to which I now turn.
There is a one-to-one mapping from any (non-unitary) root of Ψ(·) to a root of Φ(·) by construction:
λ +→ Λ (λ) = λ+1
λ−1
. It is easy to show that Re (λ) < 0 if and only if |Λ (λ) | < 1. Thus, the conditions
for B to have exactly two eigenvalues Λ outside the unit circle is the same as the conditions for Ψ(·) to
have exactly two roots λ with positive real parts. We next use the Routh-Hurwitz theory, which has
been developed to handle that case.
We write Φ (Λ) as
(3
Φ(Λ) = ai Λi
i=0
35
with
a3 = 1,
% &
1
a2 = − 1 − ηζ + f − C1 − C2 < 0,
β
% &
1 1
a1 = f (1 − ηζ) + + 1 − ηζ C1 + (φπ + 1 − η) C2 > 0,
β βf
1 − ηζ
a0 = − C1 − (1 − η)φπ C2 < 0,
βf
with C1 ≡ 1+σφ
M
x
and C2 ≡ Mκσβ f are defined for convenience.
We can then rewrite Ψ(λ) as
(3
Ψ(λ) = bi λi ,
i=0
where
b3 = a3 + a2 + a1 + a0 ,
b2 = 3a3 + a2 − a1 − 3a0 ,
b1 = 3a3 − a2 − a1 + 3a0 ,
b0 = a3 − a2 + a1 − a0 .
The criterion b3 > 0 is exactly the Taylor criterion in the text. Also, by inspection, b0 > 0 (since
a3 , a1 > 0 while a2 , a0 < 0). We assume that φπ , φx are nonnegative.
Applying the Routh-Hurwitz determinacy criterion for polynomial, Ψ(λ) has exactly two roots with
positive real parts if and only if when going through the sequence
b2 b1 − b3 b0
b3 → b2 → b′1 := → b0
b2
signs change exactly twice (see for example Meinsma (1995)). Given b3 and b0 are positive, this is
possible if and only if (b2 , b′1 ) are not both positive, i.e. if and only if b2 and b′1 := b2 b1 − b3 b0 are not
both positive (i.e., Not(b2 > 0 and b′1 > 0)). Thus we have proven the following.
Proposition 30 (Equilibrium determinacy with behavioral agents – with backward looking terms)
Assume that φπ , φx are nonnegative. A necessary and sufficient condition for equilibrium determinacy is
that the Taylor criterion (62) in the text holds, and that the following “auxiliary Routh-Hurwiz condition”
holds:
b2 and b′1 := b2 b1 − b3 b0 are not both positive. (220)
I conducted some numerical explorations, making sure that the main Taylor criterion was verified.
Then, the auxiliary Routh-Hurwitz condition (220) was always verified. Without claiming that it is
actually always verified, it seems that the “hard” economic essence is in the Taylor criterion of the main
text, while auxiliary Routh-Hurwitz condition (220) is a much less demanding condition.
Proof of Proposition 14 Call δct = ĉ (m) − ĉ (1), the difference between the actual consumption
ĉ (m) given inattention m, and the ideal consumption with full attention (which would have m = 1).
36
As the agent pays full attention to the wage, we can write the first order condition for labor supply at
decision time as N̂t (m) = −γ ĉ (m) + φ1 ω̂t , which gives δNt = −γ
φ t φ
δct . So, with at = (ct , Nt ), we have the
following expression:
- % &2 .
−γ γ (γ + φ)
(δa)′ uaa δa = ucc (δc)2 + uN N (δN)2 = − γ + φ (δc)2 = − (δc)2 . (221)
φ φ
This represents the leading Taylor expansion term of the utility losses from a suboptimal action driven
by inattention, times two.
r φmdy +γ r
with b̃y = mY R
= φ+γ R
. This gives the following marginal impact of attention m̄ on consumption:
+ ,
∂ĉt (m̄) (
cm̄ := = Et τ β τ m̄τ −1 ft+τ . (222)
∂ m̄ τ ≥0
I consider the limit of small time intervals. The reason is in (68), we obtain vaa = uaa in the limit
of small time intervals (this is quite well-known; see e.g. Gabaix (2016a), footnote 38). Hence, the
prefactor for the losses from inattention is:
γ (γ + φ) 0 2 1
Λ= E cm̄ . (223)
φ
I now consider the case where all fluctuations are driven by productivity ζt , as in the setup of Section
XI.H.3. So, ft is an AR(1) with autocorrelation ρ, and
1 ( τ τ τ 1 βρm̄
cm̄ = τ β m̄ ρ ft = ft
m̄ τ ≥0 m̄ (1 − βρm̄)2
where I used: ( α
ατ τ = for |α| < 1. (224)
τ ≥0
(1 − α)2
Hence,
βρ
cm̄ = ft , (225)
(1 − βρm̄)2
As all is by productivity, then ŷt = byζ ζt and r̂t = brζ ζt , so
) *
ft = b̃y byζ + mr br brζ ζt ,
37
So, the endogenization of m̄ follows from Proposition 13, with
γ (γ + φ) 0 2 1 γ (γ + φ) 2 2
λσS2 = Λ = E cm̄ = cm̄,ζ σζ . (227)
φ φ
Endogenizing attention to interest rate and income. Let us now endogenize attention to the
interest rate. The marginal impact of attention to the interest rate on consumption is: (e.g. starting
from (52) or (127)) + ,
(
cmr = Et β τ m̄τ br r̂t+τ ,
τ ≥0
r φ
with by = R φ+γ
, and in the AR(1) case,
+ , + ,
( (
cmy = Et β τ m̄τ by byζ ζt+τ = Et β τ m̄τ by byζ ρτ ζt ,
τ ≥0 τ ≥0
38
References for the Online Appendix
Angeletos, George-Marios and Zhen Huo, “Myopia and Anchoring,” Working Paper, 2019.
Auclert, Adrien, “Monetary policy and the redistribution channel,” American Economic Review,
2019, 109 (6), 2333–67.
Blanchard, Olivier Jean and Charles M Kahn, “The Solution of Linear Difference Models Under
Rational Expectations,” Econometrica, 1980, 48 (5), 1305–1311.
Bordalo, Pedro, Nicola Gennaioli, Yueran Ma, and Andrei Shleifer, “Over-reaction in macroe-
conomic expectations,” NBER Working Paper No. 24932, 2018.
Coibion, Olivier and Yuriy Gorodnichenko, “What can survey forecasts tell us about information
rigidities?,” Journal of Political Economy, 2012, 120 (1), 116–159.
Coibion, Olivier and Yuriy Gorodnichenko, “Information Rigidity and the Expectations Forma-
tion Process: A Simple Framework and New Facts,” The American Economic Review, 2015, 105 (8),
2644–2678.
Eggertsson, Gauti B, “What fiscal policy is effective at zero interest rates?,” NBER Macroeconomics
Annual, 2011, 25 (1), 59–112.
Eggertsson, Gauti B and Michael Woodford, “The Zero Bound on Interest Rates and Optimal
Monetary Policy,” Brookings Papers on Economic Activity, 2003, 34 (1), 139–235.
Eusepi, Stefano and Bruce Preston, “Expectations, learning, and business cycle fluctuations,” The
American Economic Review, 2011, 101 (6), 2844–2872.
Gabaix, Xavier, “A Sparsity-Based Model of Bounded Rationality,” Quarterly Journal of Economics,
2014, 129 (4), 1661–1710.
Gabaix, Xavier, “Behavioral Macroeconomics via Sparse Dynamic Programming,” NBER Working
Paper No. 21848, 2016.
Gabaix, Xavier and David Laibson, “The 6D Bias and the Equity-Premium Puzzle,” NBER
Macroeconomics Annual, 2002, 16, 257–312.
Galı́, Jordi, Monetary Policy, Inflation, and the Business Cycle: An Introduction to the New Keynesian
Framework and its Applications, Princeton University Press, 2015.
Johnson, David S., Jonathan A. Parker, and Nicholas S. Souleles, “Household Expenditure
and the Income Tax Rebates of 2001,” The American Economic Review, December 2006, 96 (5),
1589–1610.
Kaplan, Greg, Benjamin Moll, and Giovanni L Violante, “Monetary policy according to HANK,”
American Economic Review, 2018, 108 (3), 697–743.
Mankiw, N. Gregory and Ricardo Reis, “Sticky Information versus Sticky Prices: A Proposal to
Replace the New Keynesian Phillips curve,” Quarterly Journal of Economics, 2002, 117 (4), 1295–
1328.
Meinsma, Gjerrit, “Elementary proof of the Routh-Hurwitz test,” Systems & Control Letters, 1995,
25 (4), 237–242.
Taubinsky, Dmitry and Alex Rees-Jones, “Attention variation and welfare: theory and evidence
from a tax salience experiment,” Review of Economic Studies, 2017, 85 (4), 2462–2496.
Werning, Iván, “Managing a Liquidity Trap: Monetary and Fiscal Policy,” NBER Working Paper
17344, 2012.
Werning, Iván, “Incomplete Markets and Aggregate Demand,” NBER Working Paper No. 21448,
August 2015.
Woodford, Michael, Interest and Prices: Foundations of a Theory of Monetary Policy, Princeton
39
University Press, 2003.
Woodford, Michael, “Simple Analytics of the Government Expenditure Multiplier,” American Eco-
nomic Journal: Macroeconomics, 2011, 3 (1), 1–35.
Woodford, Michael, “Macroeconomic Analysis without the Rational Expectations Hypothesis,” An-
nual Reviews of Economics, 2013, 5, 303–346.
40