Professional Documents
Culture Documents
Autocracies
Abstract
a governance performance perspective, the central ruler relies on the local agent to
implement policies but is also afraid of the agent’s moral-hazard problem. Central-
ization efforts, such as harsher punishment for policy failure, improve accountability
but inhibit local innovation because innovation is risky. I define this trade-off as
mit a harsh punishment when an opportunity for innovation is less likely to occur,
which helps to explain why autocracies tend to recentralize after the economy and
governance become more established. This indicates that beyond political survival,
the performance concern can also shape important autocratic institutions. More-
over, the demanding condition of the first-best outcome in the solution suggests
governance.
icy implementation.
The Accountability-Innovation Dilemma in Autocracies 1
1 Introduction
survival perspective, autocrats naturally fear that local agents are powerful enough to
overthrow them. In ancient China, it was a common aspiration of the emperors of the
Han, Tang, Ming, and Qing dynasties to take power back from vassal kings or local
warlords. However, no ruler can govern independently. Rulers also have many reasons to
delegate to their local agents. Autocrats may need local leaders to repress, as Putin has
may itself be a necessary concession for rulers to retain their throne since the rulers’ can
make political promises more credible by formally constraining their power Acemoglu and
Robinson (2001); Myerson (2008). For instance, Kazakhstan’s president Mikhail Tokayev
consolidated his power after the political instability in January 2020 but still chose to
general context from the perspective of political survival. For personalist regimes, Egorov
and Sonin (2011) argued that dictators fill their ruling coalition with loyal but incompe-
tent associates to avoid being betrayed by competent agents. In contrast, McMahon and
Slantchev (2015) argued that autocrats would rather appoint competent generals and
use the material incentive to control them. Dragu and Przeworski (2019) also discussed
how autocrats curb the tendency of their repressive agents to raise political threats by
controlling monetary transfer. A dictator will also try to maintain a small winning coali-
tion and seize control of money (De Mesquita et al., 2005). It is also feasible to analyze
the central-local tension under the political survival context. For example, Tolstrup and
modeled the power tension between local leaders and the central ruler, taking Kadyrov
However, previous literature could be enriched from two aspects. First, it focuses
solely on political survival, the life-and-death power struggle, but ignores other vital
incentives of autocrats. Second, it does not capture the difference between the central-
Autocrats are motivated not only by power struggles in the palace but also by eco-
nomic performance and local governance. It has always been a fundamental goal of
public goods. Wintrobe (1990, 2000) pointed out that authoritarian regimes will pro-
vide public goods to the masses, the so-called ”pigsty project,” to buy loyalty. Output
legitimacy and developmentalism ideology are considered important ruling pillars of au-
Voth, 2014) and the Assad regime’s emphasis on its ability to bring peace and order to
people (Wedeen, 2015). In addition, after entering the 21st century, the primary ruling
repression, more emphasis has been placed on cooptation and information manipulation
for relatively institutionalized authoritarian regimes such as China (Holbig and Gilley,
2010; Zhu, 2011) and Singapore (Wong and Huang, 2010), economic performance and
From the perspective of economy and governance, as discussed above, autocrats also
problems, so they may use their autonomy to perform acts harmful to their principals.
For example, the ideal points of an expert may be far away from that of a politician,
causing the politician to withdraw the delegation Holmstrom (1980). Agents may also
use their power in initiation and approval to execute projects that are in their interests,
but harmful to the principal Baker et al. (1999). The last emperor of the Ming Dynasty,
Chongzhen, ended up hanging himself from the Old-Crooked-Neck tree on Mount Mei
after his army had been defeated by the Qing Dynasty because the local nobles refused to
provide enough tax revenue to feed the army. Moreover, just a few years after Chongzhen’s
death, the Qing Dynasty also faced the principal-agent dilemma of local leaders. However,
the central government also strongly needs to grant administrative autonomy from a
governance perspective, as many classics pointed out. In ancient China, the tendency to
strengthen top-down control was paralleled by the emperor’s high reliance on local leaders
at the county and even provincial levels to provide public goods, such as infrastructure
and disaster relief. The fiscal decentralization reforms initiated by the Communist Party
of China in the 1980s were also a corrective to the rigid and inefficient Leninist vertical
Meanwhile, the relationships between the central and local governments are also dif-
ferent from the principal-agent problem in the general sense. The power arrangement of
the central and local governments is relatively stable over time. We cannot imagine the
central ruler kicking the local governments aside and governing on his own like a legislator
takes power back from the experts. Therefore, the central-local power dynamics are not
only reflected in the formal and large-scale redistribution of decision-making power, but
in contrast, some seemingly technical issues, such as changing the criteria and intensity of
bureaucratic rewards and penalties, actually contain information about the restructuring
4 Hongding Zhu
of central-local power. For example, when the central government imposes more severe
penalties on local governments for governance failures, it limits local governments’ dis-
cretion, forcing them to adopt the logic of avoiding responsibility and making decisions
more prudently.
I, therefore, represent the degree of centralization by starting with the intensity with
which the central government punishes local bureaucrats. I set up an agency model to
discuss how an autocrat concerned with governance performance would balance central-
ization and decentralization. In my model, the central government as the principal needs
to rely on the productivity of the local government to implement a policy with a broad
meaning, such as economic development or governance, and the local agent can choose
either the safe or the innovative way of implementation. Safe policy implementation does
not fail but consumes more resources; Successful policy innovation will improve gover-
nance efficiency, but the agent will be punished once he fails. For the principal, the
local innovation, but it also leads to more governance failures due to more reckless risk-
taking by local agents, worsening local accountability, while severer punishment has the
opposite effect.
Scholars agree that the pursuit of political accountability may face some trade-offs. If
the re-election threshold is too harsh, it may cause the agent to adopt a myopic strategy
that exerts little effort (Gehlbach, 2021), tightening restrictions on election rigging in
increasing the probability of conflict (Luo and Rozenas, 2018), and constraining the
backsliding (Luo and Przeworski, 2019). In formal literature, no model yet discuss the
In the solution, there exists a medium optimal intensity of punishment that avoids
ronment in the extension model, since the local agent has an adverse selection problem,
the principal must also offer a medium optimal promotion incentive to encourage the
agent to report truthfully when innovative opportunities arise. Thus, the principal can
only ensure her first-best payoff by offering the optimal punishment and reward. Such
bluntness of top-down bureaucracy by showing its tendency to distort the agent’s incen-
tive. This complements other research Myerson (2021) on the necessity to establish a
decentralized public goods provision method for which local people can directly hold the
agent accountable.
When the optimal punishment is not feasible, the choice between ”being harsh or
vironment with abundant innovation opportunities, the principal tends to bear the ac-
countability cost in order to enjoy the efficiency advantage brought by local innovation;
otherwise, the principal will choose to punish the agent more severely to reduce gover-
that, at least for bureaucratic control, performance concerns can shape important au-
leniently autocrats treat their local agents. This may explain why some autocracies give
greater autonomy to local governments during the early period of rapid economic de-
6 Hongding Zhu
velopment and recentralize power when economic and governance systems become more
established. I use China before and after 2012 as a supportive case of such an explanation.
2 The Model
I begin by showing the technical linkage of my model to other agency models. The
decision process in/between organizations involves four steps: initiation, ratification, im-
plementation, and monitoring(Fama and Jensen, 1983). The divergent allocation of the
four decision rights between the principal and the agent describes a variety of institutional
arrangements. It induces rich theoretical interests, but all involve the pursuit of an ideal
principal-agent relationship in which the principal can effectively induce the productiv-
ity of the agent to achieve a better payoff. For example, Baker et al. (1999); Aghion
and Tirole (1997) discuss the motivation and feasibility of delegating the initiation and
ratification of projects; classic agency models (Eisenhardt, 1989) in light of the latter
two steps focus on moral hazard (Pauly, 1968; Holmstrom, 1982) or adverse selection
(Mirrlees, 1986; Myerson, 1981) of the agent and the qualities of different hierarchical
structures when there are multiple agents (Melumad et al., 1995; Mookherjee, 2006). I
mon that the separation of policy-making and implementation makes the principal rely
on result-oriented assessment to monitor and give incentives to the agent. At the same
time, the latter has discretion in the implementation process (Gailmard, 2002), which
There are two main differences between my model and classical agency models. First,
I assume the principal can only use crude instruments, such as punishment or promotion,
instead of monetary transfer. This setup ensures that the ideal type in our model is a
The Accountability-Innovation Dilemma in Autocracies 7
bureaucratic government, not a firm, although, under fiscal federalism, the government
exhibits certain attributes of a firm (Oi, 1992). Therefore, I am able to focus my analysis
Second, I assume that the principal also internalizes the spending of the agent. Such a
setting is also due to the substantive difference between the relationship of contractors in
the market and the relationship between different tiers of government. For the latter, even
in a full-scale fiscal federalism setting, the central governments still have the incentive to
save the local fiscal budget. One reason is that the central governments need the local
governments to cope with many tasks simultaneously, therefore, do not want to exhaust
2.1 Setup
govern independently but relies on a local agent to implement a policy. The policy
outcome is G. She chooses a production target T > 0 and a binary action Z ∈ {0, 1}
where Z = 1 means she punishes the agent by exerting a cost r to his payoff, and r > 0
means the agent cannot meet the target. The agent has the freedom to choose the method
of implementation E ∈ {0, 1} where E = 0 means the safe method with zero probability
to fail and E = 1 means innovation, which is risky. After choosing the implementation
In many autocracies, the central ruler does not care about the process and manner
through which her agents carry out the policy but cares more about accomplishing the
into quantifiable and assessable indicators; political principals in each tier of the govern-
ment then iterate this process and subcontract the policy to lower-level agents. In other
autocracies, similar outsourcing systems also exist, sometimes even purer, such as tax
farming.
After the decision of implementation method E and expenditure x, the agent deliver
where P (x) is a production function strictly concave and P (0) = 0, P ′ (0) = +∞. h ∈
(0, 1) is a probability that the policy fails due to innovation risk. Clearly, G = 0 when the
policy fails. The “failure” here refers to any bad outcome the principal dislikes, which
varies in different regimes. For instance, it could be that the policy turns out to be a
with the safe method. I assume k > 1 to focus on the non-trivial situation where inno-
vation is not obviously inferior to the safe method. k as a random variable complies to a
U = −Zr − x (2)
The Accountability-Innovation Dilemma in Autocracies 9
V = αG − x (3)
where α > 0 means how important the policy is compared with money.
I first discuss the setup with complete information as the baseline to show the main
implications of this article and then discuss the situation in which the principal cannot
observe k as the extension. The sequence of the game in the baseline is as follows:
F.
2. the principal observes k, h, chooses a production target T ≥ 0 for the agent, and
3. the agent observes k, h, chooses E ∈ {0, 1}, decides to spend x ≥ 0 on the policy;
4. the policy outcome G is realized and the agent is punished Z = 1 if and only if
G < T;
The equilibrium concept used in the baseline is Subgame Perfect Nash Equilibrium
R+ , E ∗ : R+ → {0, 1}, and x∗ : R+ → [0, +∞] such that given E ∗ and x∗ , T ∗ maxi-
mize the payoff of the principal; for each T , E ∗ (T, Z) and x∗ (T, Z) constitute a Subgame
Using backward induction, given T , the agent chooses between three implementation
the agent does not exert any effort, E is trivial, and he is sure to be punished. Now his
payoff is UN = −r.
Second, he can choose the safe method of implementation and exert a non-trivial
of money that makes G = P (x) = T . Any more investment is a waste of money and any
less investment cannot help him meet the target and he would rather choose aN to save
P −1 ( Tk )}1 . Similarly, he will spend the exact amount of money that makes kP (x) = T .
For convenience, I assume the agent prefers aS to aR to aN whenever the former weakly
dominates the latter. The condition that the agent does not choose aN is either US ≥ UN
So we have the first threshold T so that the agent chooses x = 0 if and only if T > T :
The condition that the agent is willing to innovate is UR > US , which solves T ≥ T̃ ,
1
P ( 1−h )
solves k > 1−h
. The agent’s calculation of payoff among the three implementation
1
P ( 1−h )
Lemma 1 (Agent’s Choice When He Makes Budget) When k > 1−h
, the agent
Intuitively, when the policy target is harder to meet due to the concave property of the
−1 −1 T
xS (T ) − xR (T ) = P (T ) − P (5)
k
is increasing. The incentive of efficiency (saving money) gradually outweighs the concern
of risks. So the agent tends to choose aR when T is larger. The threshold only exists
when the efficiency k of innovation is large enough compared with its risk h, which is
1
P ( 1−h )
equal to the condition T̃ < T that solves k > 1−h
. And when T > T , both aS and aR
By picking different T > 0, the principal chooses between three payoff functions. When
the agent shirks and chooses aN , the principal’s payoff is denoted as VN = 0; when the
agent plays safe and chooses aS , denoted as VS = αP (xS ) − xS ; and when the agent takes
VS = αP (xS ) − xS T ∈ (0, T̃ ]
V (T ) = VR = α(1 − h)kP (xR ) − xR T ∈ (T̃ , T ] (6)
VN = 0
T >T
We always have interior solutions to ensure VS > 0 and VR > 0, so VN is never optimal
Though xS (T ) and xR (T ) are different functions of T , one easier way for the principal
1
can compare the function α(1 − h)kP (x) − x and αP (x) − x. Clearly, when k ≥ 1−h
, VR
is large enough compared with its risk h, it is favorable for the principal to innovate. We
first discuss the global property of VS and VR , without considering the constraint of T .
Denote the first-order derivative of P (x) as p′ , and we solve the optimal T for VS and VR :
′−1 1
T̂S = P p (7)
α
′−1 1
T̂R = kP p (8)
α(1 − h)k
I further assume α is smaller than some α̃ so that both T̂S , T̂R ≤ T . This assumption helps
to focus on the interesting situation where T̂i are interior, otherwise it degenerates to T
The Accountability-Innovation Dilemma in Autocracies 13
1
1 P ( 1−h )
when either k ≥ 1−h
or k ≤ 1−h
. In this degenerated case, the principal completely
exhausts the agent’s productivity. Substantively, such an extreme case implies that one
single policy is so important that the cost is negligible, and the principal pushes the
agent to the edge of shirking (any more workload will make the agent strike), which is
To briefly sort out our progress, we have derived the two critical thresholds of k.
1
P ( 1−h )
First, the agent only chooses aR for some large T when k > 1−h
. Second, the principal
1
1 1 P ( 1−h )
prefers VR (T̂R ) to VS (T̂S ) when k ≥ 1−h
. When 1−h
> 1−h
, the discussion will be
1
more interesting, and it requires the condition P 1−h
< 1, which means that the risk is
1
relatively small. Otherwise, if P 1−h
≥ 1, the main result of the following analysis still
remains, but for some k, the situation degenerates. So I only discuss the situation when
1
P 1−h
> 1 and leave the discussion when otherwise in the appendix.
k, h, under some r small (large) enough, VS (T̂S ) (VR (T̂R )) is not feasible because T̂S > T̃
(T̂R < T̃ ). I define the first-best outcome for the principal as when max{VR (T̂R ), VS (T̂S )}
If any optimal r that ensures the first-best outcome of the principal exists, it must
1
be medium so as assure when k < 1−h
, r is not too small so that T̂S ∈ (0, T̃ ]; when
1
k ≥ 1−h
, r is not too large so that T̂R ∈ (T̃ , T ]. Under optimal r, the agent’s choice
of implementation method is always aligned with the preference of the principal. The
principal observes k, the first-best outcome for the principal is feasible if and only if
r̂S = T̃ −1 P p′−1 1
≤ r ≤ T̃ −1 1
p′−1 1
α 1−h
P α
= r̂R .
When the parameter r is too large, r > r̂R , the first-best outcome is not feasible
1
for the principal when k ≥ 1−h
. The agent is unwilling to innovate under T = T̂R (k)
because the punishment for failure is too harsh. The principal now has two options. First,
she can pick the nearest T larger than T̂R , which is T̃ , and get the payoff VR (T̃ ). By
setting a heavier workload, she can force the agent to innovate (to save money), but this
also deviates from the optimal production of innovation. Second, she can abandon the
innovation opportunity and instead choose T = T̂S to get VS (T̂S ). Her choice between
the two “plan-Bs” depends on how large T̃ is, given VR (T ) a decreasing function on
T > T̂R . And T̃ is increasing in r. So when r is larger than some threshold r̃R so that
VR (T̃ ) < VS (T̂S ) the principal abandon the innovation opportunity and would rather
Similarly, when r is too small, r > r̂R , the first-best outcome is not feasible for
1
P ( 1−h ) 1 2
the principal when 1−h
≤ k < 1−h
. The principal chooses T = T̃ if and only if
1
r̃S ≤ r < r̂S , otherwise she chooses T = T̂R , even though now k < 1−h
. I now formally
present the equilibrium of the model and leave the proof in the appendix:
r ≥ T̃ −1 P p′−1 1
α
1
3. When k ≥ 1−h
,
1
Figure 2: r̂R < r̃R < r, k > 1−h . The first-best outcome is not feasible for the principal
when the optimal innovation is ex-ante better, but the punishment intensity is too large.
1
Figure 3: r̃S < r < r̂S , k < 1−h . The first-best outcome is not feasible for the principal
when the optimal safe implementation is ex-ante better, but the punishment intensity is
too lenient. The optimal policy target and payoff of playing safe are not feasible, but the
largest feasible payoff of playing safe, VS (T̃ ), is still better than innovation.
16 Hongding Zhu
The equilibrium tells a simple story, with the key role played by the intensity of punish-
between the accountability cost and potential loss of innovation. However, when r is too
large or small, the principal suffers a loss of payoff by the agent’s tendency to “choose the
wrong option” which is inevitable—the agent’s willingness to innovate when the principal
In a previous analysis, I take k as given and discuss the equilibrium outcome of the
suppose the principal can choose r from some parameter space and discuss the principal’s
When r is optimal, for any k, the principal is able to get max{VS (T̂S ), VR (T̂R (k))}.
1
Z
1−h
Z +∞
Vr∈[r̂S ,r̂R ] = VS (T̂S (k))dF + VR (T̂R (k))dF (9)
1
1 1−h
I define the inefficiency caused by the too-harsh punishment as the “innovation insuffi-
1
ciency.” When r > r̂R , for some k ≥ 1−h
the principal cannot get VR (T̂R (k)), but instead
get max{VR (T̃ (k; r)), VS (T̂S )}. The agent’s willingness to innovate under T = T̂R (k) re-
quires T̃ (k; r) < T̂R (k), which solves k > k̂R (r), where k̂R (r) is a threshold of k that
increases in r. Intuitively, when punishment is too harsh, the agent is only willing to
innovate when it brings him a huge efficiency advantage and thus saves much money,
1
“accountability cost.” Similarly, when r < r̂S , for some k < 1−h
the principal cannot get
VS (T̂S ), but instead get max{VS (T̃ (k; r)), VR (T̂R )}. The agent is only willing to play safe
under T = T̂S when T̃ (k; r) ≥ T̂S , which solves k < k̂S (rL ) where k̂S (rL ) is a threshold
of k that increases in r. Intuitively, when punishment is too lenient, the agent’s reckless
innovation causes unnecessary risks of policy failure, and he only plays safe when the
1
cipal’s payoff is not first-best when k ∈ [ 1−h , k̂R (r)] (cost of innovation insufficiency);
1
when r < r̂S , the principal’s payoff is not first-best when k ∈ [k̂S (rL ), 1−h ] (cost of ac-
countability).
The proof of Proposition 3 is in the appendix. I then quantify the two costs by writing
Z Z 1
k̂S (rL ) 1−h
Z +∞
VS (T̂S (k))dF + V1 dF + VR (T̂R (k))dF r ∈ (0, r̂S )
1
1 k̂ S (r L ) 1−h
Z 1 Z +∞
1−h
V = VS (T̂S (k))dF + VR (T̂R (k))dF r ∈ [r̂S , r̂R ] (10)
1
1 1−h
Z 1 Z k̂R (r) Z +∞
1−h
VS (T̂S (k))dF + V2 dF + VR (T̂R (k))dF r ∈ (r̂R , +∞)
1
1 1−h
k̂R (r)
in which V1 refers to non-first-best payoff with accountability cost and V2 refers to non-
first-best payoff with innovation insufficiency. The cost of accountability C1 and innova-
18 Hongding Zhu
Z 1
1−h
C1 = VS (T̂S (k)) − V1 dF (11)
k̂S (rL )
Z k̂R (r)
C2 = VR (T̂R (k)) − V2 dF (12)
1
1−h
Clearly, given that k̂i (r) increases in r, C1 decreases in r and C2 increases in r. Any
planner who can design r faces this trade-off between the two costs when no optimal r
is available in the parameter space. Suppose the principal can choose a discrete set of
r from a coarse parameter space. Denote {rL , rH } as a pair of available r closest to the
range of optimal r, such that rL < r̂S < r̂R < rH . When the principal chooses rL (rH ),
she suffers the cost C1 (C2 ), so her calculation is based on comparing the two costs.
1 1
The probability that k appears in [k̂S (rL ), 1−h ] and [ 1−h , k̂R (rH )] is the critical fac-
tor that determines which cost is higher. An extreme case is when the probability
1
F (k̂R (rH )) − F ( 1−h ) arbitrarily close to 0, the cost of innovation insufficiency is close
to 0. Hence, the principal only worries about reducing the accountability cost, and she
chooses the harsh punishment rH . The intuition is that when the opportunity for inno-
vation is rarer, the inefficiency caused by inhibited innovation under a harsh punishment
is smaller than the accountability cost caused by reckless innovation under a lenient pun-
ishment. Similarly, when the opportunity for innovation is rich, the accountability cost
she chooses among the optimal set of r whenever available. If not available, the principal
1
chooses the harsh punishment rH when F (k̂R (rH ))−F ( 1−h ) is sufficiently small compared
1
with F (k̂S (rL )) − F ( 1−h ), and she chooses the lenient punishment rL if otherwise.
The Accountability-Innovation Dilemma in Autocracies 19
Consider an alternative setup where the principal cannot observe the realization of k.
1
kL < < kH (13)
1−h
and Pr(k = kH ) = q ∈ (0, 1). The principal replies to a message from the agent, m,
{L, H}. The feasibility of the first-best outcome requires the agent to stick to strategies
with truthful report m = k, which is an extra requirement compared with the complete
When the principal ignores m, she plans based on the prior q, and the two players
get respectively V ∗ and U ∗ . Clearly, V ∗ is not the first-best outcome for the principal.
Instead, only when the agent report m = k and the principal plans based on m can she
The intuition is that reporting any m = H makes the principal think there is a good
opportunity for innovation and thus choosing a high production target T̂R due to the
efficiency advantage of innovation. To save some effort, the agent has the incentive
always to report a lower type, deviating from m = k to m = L, given that the principals
believe the message. To make it incentive compatible for the agent to report m = k,
naturally, the principal must reward any m = H with an extra political incentive w > 0
to compensate the agent. w could be any political reward, but it is the most intuitive if
we consider it a promotion. The payoffs with the political reward w for innovation are:
w must be large enough so that Utrue + qw ≥ UfLake so the agent is willing to report
m = H; also not too large to ensure Utrue + qw ≥ UfHake + w so the agent will not report
principal does not observe k, the first-best outcome for the principal is feasible if and only
3
We do not need to worry about the strategy m = ¬k because it is inferior to either m = L or m = H
The Accountability-Innovation Dilemma in Autocracies 21
if both
1) r̂S = T̃ −1 P p′−1 1
≤ r ≤ T̃ −1 1
p′−1 1
α 1−h
P α
= r̂R .
UfLake −Utrue Utrue −UfHake
2) q
≤w≤ 1−q
5 Discussion
policy implementation. Conditions to ensure the first-best payoff for the principal are
very demanding in the real world, constrained by the institutional environment, which is
The principal must be able to commit to a punishment intensity r, and the optimal set
[r̂S , r̂R ] must be available in the parameter space. However, first, the principal may not
be able to set r. In a multi-level bureaucratic system, r is possibly the choice made by the
and the agent, r is constrained by other contracts and institutions and is thus stable
in a specific period, making the principal hard to adjust. Second, the parameter space
of available r could be coarse. In specific policy fields, ideological fences undermine the
credibility of small r, while nepotism (He, 2000) and collisions (Zhou, 2016) undermine
The principal must also be able to reward innovative agents with political incentives,
such as promotion. However, first, the principal may not be able to grant such a reward.
22 Hongding Zhu
Department (zuzhibu) of the Party Committee one level higher, which limits the situations
where promotion incentive is an available instrument. Second, even if the principal can
commit to the reward, the parameter space could also be coarse. In China, the criteria
for promotion in CCP are intentionally kept vague to prevent the opportunistic behavior
of the agent, so the principal cannot commit to promoting an agent with certainty, while
nepotism and favoritism further undermine the credibility of promotion reward. Also,
promotion indicates that the principal not only wants to induce the productivity of agents
for particular policies but also to promote more capable agents—there are only limited
seats for promotion. Though such an incentive is not discussed in the article, it is not
The discussion above supplements existing literature on the limitation of the central-
ized, top-down system in public goods provision by showing its inevitability to distort the
agent’s incentive. Myerson (2021) argued that an autocratic bureaucracy frustrates local
governance due to the central leader’s inability to commit to punishing the local agent for
governance failure when the agent delivers her benefits. However, my model implies that
even when the central leader’s preference is perfectly aligned with local people (suppose
they share the same utility function), the policy outcome can still be distorted by the
demanding requirements for perfect bureaucratic control. Fortmann et al. (1986) and
Ostrom (1990) emphasized the importance of locally accountable usage of the budget
and organization leadership. In my model, if such institutions exist so that the people
can set the upper bound of expenditure after observing the realization of k, then the
agent cannot increase his payoff by wasting money to implement the safe method when
The Accountability-Innovation Dilemma in Autocracies 23
Proposition 4 shows the critical factor in the model, the probability that an innovation
opportunity arises, according to which the principal adjusts her choice of punishment
intensity. When there are more innovation opportunities, the principal tends to internalize
the accountability cost in order to enjoy the efficiency edge of more local innovation. In
contrast, when there are scarce opportunities, the accountability cost resulting from the
reckless risk-taking of the agent outweighs the efficiency edge, and the principal hence
This provides an alternative way to explain the central-local power dynamics in autoc-
racies. Previous literature proposed reasons based on political survival or political con-
are widely leveraged (Siegle and O’Mahony, 2010; Brancati, 2006). However, it does not
explain the central-local dynamics in full-scale autocracies like China, where no substan-
tial and formal decentralization of state power has been issued. Fiscal federalism, from
an economic point of view, could explain the delegation of financial and administrative
power to local governments in China. However, it cannot account for the recentralization
at the expense of the willingness of local agents to initiate policy experiments. After
Xi Jinping became the leader of CCP in 2012, China initiated a recentralization process,
including strengthening digital screening (Gao and Tan, 2020), anti-corruption, personnel
24 Hongding Zhu
ishment for local policy failure and disobedience. One critical goal of this effort is to
curb “localism” (Bulman and Jaros, 2021), which points directly to the accountability
problem of local governance. Unorthodox, sometimes illegal, and risky, local policy in-
novation has played a significant role in China’s economic miracle but also caused severe
local accountability problems (Heilmann, 2018). In the famous piece, O’Brien and Li
in rural areas, discussing how the difficulty of policy outcome evaluation, fiscal incen-
tives, and the transition of the cadre assessment system distort the implementation–it
was common that harmful policies were executed with great efforts while policies wel-
comed by the people were ignored. The official rhetoric of the CCP on policy innovation
after 2012 claimed that “policy innovation should be constrained by the rule of law”, and
”disorderly action,” a charge against local cadres for risky and abusive policy innovation,
also emerged after the recentralization process in 2012 4 . Thus, the recentralization ef-
fort is partly based on the “urgent need” of China to reform, which requires a strong
central government (Gao and Teets, 2020) to overcome the fragmented political system
(Lieberthal and Lampton, 2018; Mertha, 2009). However, as predicted by the intuition
of the Accountability-Innovation dilemma, the local state agents are halting policy ex-
6 Conclusion
ization based on the concern of local governance performance, using the intensity of bu-
4
http://www.xinhuanet.com/politics/2016-01/15/c 128632676.htm
The Accountability-Innovation Dilemma in Autocracies 25
intensity contains information on the central ruler’s effort to reduce local discretion in
executing risky policies. By making policy failures more costly, harsher top-down punish-
ment helps to reduce unnecessary risky policies and improve the accountability of local
governance. However, it can also inhibit necessary local innovation when opportunities
arise.
The conditions are strict for the principal to avoid such a dilemma and ensure the
first-best policy outcome, which requires both the punishment and reward for the agent
is hindered by multiple political factors, pointing out the limitations of the top-down
bureaucratic method in local public good provision. Institutions that empower people in
budget-making after observing the policy innovation opportunity facilitate the prevention
of the moral-hazard situation in which the agent holds back on necessary innovation and
When optimal punishment is not feasible for the principal, she trades off between
the two costs: the insufficient innovation caused by a too-harsh punishment or the ac-
countability deterioration caused by reckless local risk-taking. When the probability that
an innovation opportunity occurs is small, the principal prefers to exert harsher punish-
ment for policy failure. This helps explain why autocracies adopt recentralization reforms
aimed at improving local governance accountability after the rapid growth phase and indi-
cates that the incentive of policy output can shape important institutional arrangements
References
Daron Acemoglu and James A Robinson. A theory of political transitions. American
Economic Review, 91(4):938–963, 2001. 1
Philippe Aghion and Jean Tirole. Formal and real authority in organizations. Journal of
political economy, 105(1):1–29, 1997. 6
George Baker, Robert Gibbons, and Kevin J Murphy. Informal authority in organizations.
Journal of Law, Economics, and organization, 15(1):56–73, 1999. 3, 6
Dawn Brancati. Decentralization: Fueling the fire or dampening the flames of ethnic
conflict and secessionism? International organization, 60(3):651–685, 2006. 23
Bruce Bueno De Mesquita, Alastair Smith, Randolph M Siverson, and James D Morrow.
The logic of political survival. MIT press, 2005. 1
Tiberiu Dragu and Adam Przeworski. Preventive repression: Two types of moral hazard.
American Political Science Review, 113(1):77–87, 2019. 1
Georgy Egorov and Konstantin Sonin. Dictators and their viziers: Endogenizing the
loyalty–competence trade-off. Journal of the European Economic Association, 9(5):
903–930, 2011. 1
Eugene F Fama and Michael C Jensen. Separation of ownership and control. The journal
of law and Economics, 26(2):301–325, 1983. 6
Louise Fortmann et al. The role of local institutions in communal area development in
botswana. 1986. 22
Xiang Gao and Jie Tan. From web to weber. China Review, 20(3):71–98, 2020. 23
The Accountability-Innovation Dilemma in Autocracies 27
Xiang Gao and Jessica Teets. Beyond fragmented authoritarianism: China’s new gover-
nance model. Working Paper, 2020. 24
Scott Gehlbach. Formal models of domestic politics. Cambridge University Press, 2021.
4
Zengke He. Corruption and anti-corruption in reform china. Communist and Post-
Communist Studies, 33(2):243–270, 2000. 21
Sebastian Heilmann. Red swan: how unorthodox policy-making facilitated China’s rise.
The Chinese University of Hong Kong Press, 2018. 24
Heike Holbig and Bruce Gilley. Reclaiming legitimacy in china. Politics & policy, 38(3):
395–422, 2010. 2
Bengt Holmstrom. On the theory of delegation. Technical report, Discussion Paper, 1980.
3
Bengt Holmstrom. Moral hazard in teams. The Bell journal of economics, pages 324–340,
1982. 6
Kenneth G Lieberthal and David M Lampton. Bureaucracy, politics, and decision making
in post-Mao China, volume 14. University of California Press, 2018. 24
Zhaotian Luo and Adam Przeworski. Democracy and its vulnerabilities: Dynamics of
democratic backsliding. forthcoming at QJPS, 2019. 5
Zhaotian Luo and Arturas Rozenas. Strategies of election rigging: trade-offs, determi-
nants, and consequences. Quarterly Journal of Political Science, 13(1):1–28, 2018.
4
Roger B Myerson. The autocrat’s credibility problem and foundations of the constitu-
tional state. American Political Science Review, 102(1):125–139, 2008. 1
Jean C Oi. Fiscal reform and the economic foundations of local state corporatism in
china. World politics, 45(1):99–126, 1992. 3, 7
Elinor Ostrom. Governing the commons: The evolution of institutions for collective
action. Cambridge university press, 1990. 22
Mark V Pauly. The economics of moral hazard: comment. The american economic
review, 58(3):531–537, 1968. 6
Joseph Siegle and Patrick O’Mahony. Decentralization and internal conflict. Making
decentralization work: Democracy, development, and security, pages 135–166, 2010. 23
Milan W Svolik. The politics of authoritarian rule. Cambridge University Press, 2012. 2
Nico Voigtländer and Hans-Joachim Voth. Highway to hitler. Technical report, National
Bureau of Economic Research, 2014. 2
Ronald Wintrobe. The tinpot and the totalitarian: An economic theory of dictatorship.
American political science review, 84(3):849–872, 1990. 2
Benjamin Wong and Xunming Huang. Political legitimacy in singapore. Politics & Policy,
38(3):523–543, 2010. 2
Li-An Zhou. The administrative subcontract: Significance, relevance and implications for
intergovernmental relations in china. Chinese Journal of Sociology, 2(1):34–74, 2016.
7, 21
Yuchao Zhu. “performance legitimacy” and china’s political adaptation strategy. Journal
of Chinese Political Science, 16(2):123–140, 2011. 2
30 Hongding Zhu
Appendix
Proof of Lemma 1
T
Q(T ) = US (T ) − UR (T ) = (1 − h)r + P −1 ( ) − P −1 (T ) (17)
k
and extend the domain of T to {0}. US (0) = −P −1 (0) = 0 and UR (0) = −(1 − h)r −
P −1 (0) < 0, so Q(0) = US (0) − UR (0) > 0. And clearly US (+∞) < UR (+∞), so
Q(+∞) = US (+∞) − UR (+∞) < 0. By the existence theorem of zero point of continuous
function, there must exist some T̃ so that Q(T̃ ) = 0 and accordingly, US (T̃ ) = UR (T̃ ).
theorem, we have:
1
x′S (T ) = (18)
p′ (x S)
1
x′R (T ) = (19)
kp′ (x R)
1 1 1 1 1−k
Q′ (T ) = − < − = <0 (20)
kp′ (x R) p′ (x S) kp′ (x R) p′ (x R) p′ (xR )
xS (T ) − xR (T ) + (h − 1)r = 0 (21)
dT̃ P (xR )
= ′ <0 (22)
dk 1 − k pp′(x R)
(xS )
Proof of Proposition 1
1
P ( 1−h )
Case 1. When k ≤ 1−h
. r does not affect the payoff of the principal because the
1
T̃ ≥ T̂S = P (p′−1 ( )) (25)
α
1
Case 3. When k ≥ 1−h
. The principal only worries if r is too large. To ensure the
1
feasibility of the highest possible payoff VR (T̂R ) when k ≥ 1−h
, r must be smaller than a
32 Hongding Zhu
unique r̃R (k) such that T̃ ≤ T̂R . Given T̂R an increasing function of k and k is a random
1
variable, so T̃ ≤ T̂R must be satisfied for the smallest k = 1−h
, which requires:
1 1
T̃ ≤ P (p′−1 ( )) (26)
1−h α
1 1 1
T̃ −1 [P (p′−1 ( ))] ≤ r ≤ T̃ −1 [ P (p′−1 ( ))] (27)
α 1−h α
Proof of Proposition 2
1
P ( 1−h )
Case 1. When k ≤ 1−h
. T ∗ = T̂S and a∗ = aS (T̂S ). Because now k is also smaller
1
than 1−h
, VS is favored by the principal and for any non-trivial T ≤ T , the agent only
that r < r̂S the first-best payoff VS (T̂S ) is not feasible for the principal, and she receives
Now let us compare min{VS (T̃ ) and VR (T̂R ). Due to the single-peak property, VS (T )
Clearly, there must exists a unique r̃S such that VS (T̃ ) < VR (T̂R ) when r < r̃S . Hence,
when r < r̃S , the principal has to receives the second-best payoff VR (T̂R ); when r̃S < r <
So we have when r ≥ R̂S , T ∗ = T̂S , a∗ = aS (T̂S ), the principal receives the first-best
payoff VS (T̂S ). When r̃S ≤ r < r̂S , T ∗ = T̃ , a∗ = aS (T̃ ), the principal receives the
non-first-best payoff VS (T̃ ). When r < r̃S , T ∗ = T̂R , a∗ = aR , the principal receives the
1
Case 3. When k ≥ 1−h
. Case 3 is a mirroring of Case 2. The principal prefers aR since
1
1 P ( 1−h )
k≥ 1−h
, however, because k > 1−h
, for T ≤ T̃ the agent only chooses aS . So when
T̂R ≤ T̃ , which means that r > r̂R the first-best payoff is not feasible for the principal
Similarly, the compare between the two non-first-best payoff depends on a threshold
r whenever T̃ ≥ T̂S :
There must exists a unique r̃R such that VR (T̃ ) < VS (T̂S ) when r ≥ r̃R . Hence, when
r ≥ r̃R , the principal has to accept the non-first-best payoff VS (T̂S ); when r̂R ≥< rr̃R ,
So we have when r < r̃R , T ∗ = T̂R , a∗ = aR (T̂R ), the principal receives VR (T̂R ), which
is first-best for the principal; when r̂R ≥ r ≤ r̃R , T ∗ = T̃ , a∗ = aR (T̃ ) (notice that the
payoff VR (T̃ ); when r > r̃R , T ∗ = T̂S , a∗ = aS (T̂S , the principal receives the non-first
Proof of Proposition 3
1
I first prove the cost of innovation insufficiency. When k ≥ 1−h
, the first-best payoff of
the principal is VR (T̂R ). According to Lemma 1, when T = T̂R (k), the agent chooses aR
and deliver the first-best payoff for the principal if and only if T̂R (k) > T̃ (k, r). Denote
1
According to Proposition 1, the condition is satisfied for k > 1−h
if and only if r < r̂R ,
which is to say there must be some ΠR (k) ≤ 0 when r ≥ r̂R . We know that T̂ (k) is
function. Accordingly, when r ≥ r̂R , there must be a unique k̂R (r) such that ΠR (k) ≤ 0
1
on [ 1−h , k̂R (r)] and ΠR (k) > 0 if otherwise.
Next, I prove the cost of accountability part of the proposition, which complies to the
1
P ( 1−h ) 1
mirroring logic of the first part. When 1−h
<k < 1−h
, the first-best payoff for the
principal is VS (T̂S ), which requires the agent chooses aS when T = T̂S . Then, according
1
P ( 1−h ) 1
According to Proposition, the condition is satisfied for any k ∈ 1−h
, 1−h if and only
if r ≥ r̂S , which is to say there must be some ΠS (k) ≥ 0 when r < r̂S . Given T̃ (k, r) a
decreasing function of k, ΠS (k) is increasing in k. Accordingly, when r < r̂S , there must
1
be a unique k̂S (rL ) such that ΠS (k) ≥ 0 on [k̂S (rL ), 1−h ] and ΠR (k) < 0 if otherwise.
The Accountability-Innovation Dilemma in Autocracies 35
Proof of Proposition 4
Without any lose of generality, I provide a more intuitive proof of this proposition assum-
1
large enough such that k > 1−h
. The existence of such upperbound is natural because
we cannot imagine a policy innovation that yields infinite payoff for the principal.
Z 1
1−h
C1 = VS (T̂S (k)) − V1 dF (32)
k̂S (rL )
Z k̂R (r)
C2 = VR (T̂R (k)) − V2 dF (33)
1
1−h
1 1
We want to prove that C1 > C2 when F k̂R (rH ) −F 1−h = Pr 1−h < k < k̂R (rH )
1
is small enough, which is equal to when Pr k̂S (rL ) < k < 1−h is large enough. Denote
1 1
Pr k̂S (rL ) < k < 1−h as Γ1 (h) and Pr 1−h < k < k̂R (rH ) as Γ2 (h). Given F a uniform
1
1−h
− k̂S (rL )
Γ1 (h) = (34)
k−1
and when r = rL :
1
k̂R (rH ) − 1−h
Γ2 (h) = (35)
k−1
where Γ1 (h) is weakly increasing in h and Γ2 (h) is the opposite. Therefore, the proof
∈ (0, 1).
36 Hongding Zhu
1
Z limh→0 1−h
Z 1
lim C1 = VS (T̂S (k)) − V1 dF = VS (T̂S (k)) − V1 dF = 0 (36)
h→0 limh→0 k̂S (rL ) 1
Z limh→0 k̂R (r)
lim C2 = VR (T̂R (k)) − V2 dF > 0 (37)
h→0 1
and when h → 1:
Z k
lim C1 = VS (T̂S (k)) − V1 dF > 0 (38)
h→1 k̂S (rL )
Z limh→0 k̂R (r) Z k
lim C2 = VR (T̂R (k)) − V2 dF = VR (T̂R (k)) − V2 dF = 0 (39)
h→1 limh→0 1
k
1−h
Denote C1 − C2 as a new function β(h). Due to the monotonicity of C1 (h) and C2 (h),