You are on page 1of 5

Why Parrondo’s Paradox Is

Irrelevant for Utility Theory,


Stock Buying, and the
Emergence of Life
Can Losses Be Combined to Give Gains?

INTRODUCTION

S
ince the initial publication of Parrondo’s paradox [1, 2], a number of articles on
the subject have appeared both in the academic literature and in the media.
The paradox describes a reversal of fortune for a gambler who loses money by
playing either of two games of chance, but who makes money by randomly playing
the two games. The academic literature suggests that Parrondo’s paradox advances
von Neumann and Morgenstern’s work on utility theory, that it offers money for free,
RAGHURAM IYENGAR AND
that it has implications for investing strategies, and that it may hold the key for
RAJEEV KOHLI
understanding certain biological processes, possibly even the emergence of life [3, 4].
Harmer and Abbott [2] suggest that the paradox might operate in economics or social
dynamics to extract benefits from ostensibly detrimental situations. For example, if
a society or an ecosystem suffers from declines in either the birth rate or the death Raghuram Iyengar and Rajeev Kohli
rate, declines in both together might combine with favorable consequences. are at the Graduate School of Business,
Media reports compare Parrondo’s paradox with the sacrifice of chess pieces to Columbia University, New York, NY;
win a game. There are suggestions that Parrondo’s paradox offers the possibility of e-mail: rk35@columbia.edu.
making money by investing in losing stocks. Paulos [5] observes that “standard stock
market investments cannot be modeled by games of this type, but variations of these
games might conceivably give rise to counterintuitive investment strategies. Al-
though a much more complex phenomenon, the ever-increasing valuations of some
dot-coms with continuous losses may not be as absurd as they seem. Perhaps they’ll
one day be referred to as Parrondo profits.” The same article quotes Derek Abbott as
saying that “President Clinton, who at first denied having a sexual affair with Monica
S. Lewinsky saw his popularity rise when he admitted that he had lied. The added
scandal created more good for Mr. Clinton.” All these examples seem to suggest
miraculous possibilities for turning unfavorable situations into favorable ones, either
by purposeful actors or by acts of nature. Parrondo’s paradox appears to be a
remarkable finding with far-reaching consequences in wide ranging situations.
Or is it? That’s the question we ask and hope to answer. To do so, we need to look
more closely at the rationale behind Parrondo’s paradox, stripping it of all extrane-
ous baggage and descriptive analogies as ratchets and pawls. We need to understand

© 2004 Wiley Periodicals, Inc., Vol. 9, No. 1 C O M P L E X I T Y 23


the structures of the associated games
If the two preceding outcomes are then toss coin
of chance and uncover what exactly We need to understand the
drives the reversal of fortune in Parron- structures of the associated Both losses (LL) B1
do’s games. We will then be in a better games of chance and uncover A loss followed by a win (LW) B2
position to assess if one or another phe- what exactly drives the reversal A win followed by a loss (WL) B3
nomenon can be explained by it and of fortune in Parrondo’s games. Both wins (WW) B4
what opportunities it offers to investors
and gamblers, biologists and econo- Note that Game B has the structure of
mists. bility in all states of the history-depen- the following second-order Markov
dent game. Then it is possible that, even Chain:
THE LOGIC OF PARRONDO’S PARADOX if the new coin offers unfavorable odds,
Parrondo’s paradox is typically de- the modified game has winning odds. LL LW WL WW

冢 冣
scribed using two games. One game is a The demonstration of this feasibility is LL q1 p1 0 0
simple coin toss—a zero-order (memo- Parrondo’s paradox. LW 0 0 q2 p2
A⫽
ryless) game. The other game has the A reasonable question to ask is if WL q3 p3 0 0 ,
WW 0 0 q4 p4
structure of a Markov chain—it uses there is any need to randomize at all.
several coins, one of which is chosen for Before each play, why not just compare
play depending on what has happened the odds offered by the zero-order game where qi ⫽ 1 ⫺ pi ,1 ⱕ i ⱕ 4. The row
on previous coin tosses. The expected with the odds offered by the Markov represents the four possible outcomes,
return from the zero-order game de- chain game? You could simply play the LL, LW, WL, WW, on the two preceding
pends only on the probability of landing game that offered better odds. The an- plays, say plays (j ⫺ 1) and j; the column
heads. In contrast, the expected return swer is that this is indeed a dominant also represents the same four outcomes
from the history-dependent game de- strategy. It is always better to do this on plays j and (j ⫹ 1). Thus, play j is
pends not only on the probabilities with than to randomize over games. common to the row and column states.
which the various coins land heads but We formalize this argument below. The cell entries are the probabilities of
also on how often each coin is tossed, For illustrative purposes, we consider transition from a row state to a column
which is determined by the steady-state the more recent capital-independent state. The nonzero entries are the prob-
probabilities of the Markov chain. games [6], which use a second-order abilities associated with the outcomes
The games are set up so that if Markov chain to describe the state-de- on play (j ⫹ 1) given the outcomes on
played separately, each game has a neg- pendant game. The same line of argu- plays (j ⫺ 1) and j.
ative expected return. Consider ran- ment can be adapted in an obvious way Let ␲1, ␲2, ␲3, ␲4 denote the steady-
domly switching between the games. to games in which the payoffs from one state probabilities associated with the
What happens? The answer is that ran- game depend on the capital. We then states LL, LW, WL, WW. Then [6]:
domization changes the transition describe a very simple game based on a
probabilities and, therefore, the steady- first-order Markov chain for which stra- q 3q 4
state probabilities for the Markov-chain tegic play, but not randomization, pro- ␲1 ⫽ ,
N
game. This in a nutshell is the reasoning duces winning odds. We conclude by
behind the paradox: an unconditional reassessing the relevance of Parrondo’s p 1q 4 p 1p 2
␲2 ⫽ ␲3 ⫽ , ␲4 ⫽ ,
probability (the probability of winning paradox for economic theory, for strat- N N
the memoryless game) enters as a con- egies of investments in stocks, and for N ⫽ q 3 q 4 ⫹ p 1 p 2 ⫹ 2p 1 q 4 .
ditional (transition) probability in the economics or social dynamics that “ex-
Markov-chain game, changing the tract benefits from ostensibly detrimen-
Thus, for example, ␲1 is the long-run
steady-state probabilities of the latter tal situations” [1].
probability of two successive losses, LL.
and reversing a player’s fortune.
It follows that the unconditional prob-
Another interpretation of the para- CAPITAL INDEPENDENT GAMES
ability of winning (i.e., landing heads)
dox is as follows. Consider playing only Consider two coin-toss games, A and B.
on any play is
the history-dependent game; it leads to All bets are a dollar per play. A player
losses. Now add an option to each state loses a dollar if a tossed coin lands tails
of the history-dependent game: allow and wins a dollar if it lands heads. Game p win ⫽ ␲1p1 ⫹ ␲2p2 ⫹ ␲3p3 ⫹ ␲4p4,
the player the option of tossing another A is played by tossing coin A1 that lands
coin. Surely the option is worth using heads with probability p ⬍ 1/2; the long which can be written as
only if the new coin gives better odds in run returns from game A are negative.
at least some states of the history-de- Game B, which also has negative long-
1
pendent game. Now suppose we im- run returns, is played with 4 coins; coin p win ⫽ , c ⫽ q3q4 ⫺ p1p2,
2 ⫹ 共c/s兲
pose the constraint that a player must Bi lands heads with probabilities pi , 1 ⱕ
use the new coin with the same proba- i ⱕ 4. Game B is played as follows: s ⫽ p1共p2 ⫹ q4兲.

24 C O M P L E X I T Y © 2004 Wiley Periodicals, Inc.


Thus, game B gives losing returns in the order Markov chain. It is for this specific
long run if pwin ⬍ 1/2 or equivalently if A reasonable question to ask is if structure, and for higher-order Markov
c ⬎ 0. To change a losing game B into a there is any need to randomize at chains, that one attains Parrondo’s par-
winning game, one requires c ⬍ 0. Sup- all. Before each play, why not adox. But randomization cannot pro-
pose we play A and B at random, select- just compare the odds offered by duce winning combinations of two los-
ing game A with probability ␭ and game the zero-order game with the ing games if game B is described by a
B with probability (1 ⫺ ␭). Game B re- odds offered by the Markov chain first-order Markov chain or if it is also a
quires tossing: game? zero-order game (i.e., just a simple coin
toss). We will examine these cases be-
1. coin B1 if the two preceding out- low. But it is worth noting that any sit-
ues ␭i for each i in the expression ␭i p ⫹ uation in which Parrondo paradox ap-
comes are both losses (LL);
(1 ⫺ ␭i) pi, 1 ⱕ i ⱕ 4; as “c” decreases pears should have the appropriate
2. coin B2 if the last two outcomes
with p1, p2, p3, p4, its lower bound is structure—the transitions between out-
were a loss followed by a win (LW);
obtained by setting ␭i ⫽ 1 when pi ⬍ p comes should be representable in the
3. coin B3 after a win followed by a
and ␭i ⫽ 0 when pi ⬎ p (for pi ⫽ p, ␭i can form of at least a second-order Markov
loss (WL); and
take any value between zero and one). chain. It is difficult to see how public
4. coin B4 after a pair of wins (WW).
The corresponding deterministic strat- attitude towards a past President, stock
egy gives a winning probability pwin* ⱖ
We can write the corresponding investments and the many other cases
p⬘win, where pi ⫽ max{p, pi} and
*
in which Parrondo’s paradox is evoked
Markov chain by replacing pi by p⬘ ⫽
␭p ⫹ (1 ⫺ ␭)pi ,1 ⱕ i ⱕ 4, in A. The [3, 4] can be represented as a second-
1 order Markov chain. One cannot take
unconditional probability of winning p *win ⫽ , c* ⫽ q3*q4* ⫺ p1*p2*,
2 ⫹ 共c*/s*兲 just any two uncertain, losing opportu-
(landing heads) is therefore
s* ⫽ p1*共p2* ⫹ q4*兲. nities and hope that randomizing over
them will produce winning odds. We
1
p⬘win ⫽ , c⬘ ⫽ q⬘3q⬘4 ⫺ p⬘1p⬘2, As ␭p ⫹ (1 ⫺ ␭) pi ⱕ max{p, pi}, we have show this below, where we consider
2 ⫹ 共c⬘/s⬘兲 combining a simple coin toss with a
c* ⱕ c⬘, the equality occurring only when
s⬘ ⫽ p⬘1共p⬘2 ⫹ q⬘4兲. pi ⫽ p, 1 ⱕ i ⱕ 4. Otherwise, the lower game represented by a first-order
bound c* is achieved by playing game A if Markov chain. We show that a random-
The revised odds of winning exceed one pi ⬍ p, and playing game B otherwise. ization strategy over two such games
half if c⬘ ⬍ 0, which is possible even if This obvious and smart way of combin- cannot produce a winning combina-
c ⬎ 0. This is the reversal of fortune in ing the two games produces winning tion. In contrast, the simpler determin-
Parrondo’s paradox. odds whenever randomization does and istic switching noted above can produce
Is this the best way for a gambler to also in instances when randomization a winning combination in this situation.
combine the two games? The answer is does not yield a winning game.
“no.” Randomizing over the two games To summarize, the best possible way FIRST-ORDER MARKOV GAMES
forces the same convex combination of of combining games A and B in Par- Consider two games A and B. All bets
p and pi ,1 ⱕ i ⱕ 4. This is a natural rondo et al. [6] is to play game A when- are a dollar per play. A player loses a
constraint for Brownian ratchets, the ever it offers higher winning odds than dollar if a tossed coin lands tails and
original motivation for the paradox, be- does the relevant coin in Game B; oth- wins a dollar if it lands heads. Game A is
cause one cannot separate particles by erwise play Game B. Unless game A al- unchanged—it is played with a single
their states; the only way to affect the ways dominates game B, the winning coin and is winning with probability
transition probabilities is to randomly odds from this strategy always increase p ⬍ 1/2. Game B is played with two
change the potential applied to the en- if pi ⬍ p for some 1 ⱕ i ⱕ 4; and they coins, B1 and B2. Coin B1 is tossed after
tire system of particles. But there is no exceed one half if c* ⬍ 0, which is equiv- a win; it leads to a win (loss) with prob-
such natural constraint for a gambler alent to the condition p1*p2* ⬎ q3*q4*. This ability p1 (q1 ⫽ 1 ⫺ p1). Coin B2 is tossed
who will switch from one game to the trivial “optimal” strategy suggests that after a loss; it leads to a win (loss) with
other whenever it is profitable to do so. randomization should not be used to probability p2 (q2 ⫽ 1 ⫺ p2). These
This trivial solution is the best for pre- play the games of Parrondo’s paradox probabilities can be represented by the
cisely the same reason that randomiza- unless a player is uninformed about the following first-order Markov chain:
tion works, except that it forces a player states (the sequence of outcomes of
to choose a fixed probability for playing plays) and therefore does not know W L
the two games regardless of the state of
game B. In both cases, the change in
when to switch to the single-coin game.
But it is not clear to us when and if such
W
L 冉 p1
p2
q1
q2 冊.
fortune is brought about by a change in games arise in games of chance.
the transition probabilities of a Markov It is also important to note that game The corresponding steady-state proba-
chain. Suppose we allow different val- B above has the structure of a second- bility of a win is pwin ⫽ p2/(q1 ⫹ p2),

© 2004 Wiley Periodicals, Inc. C O M P L E X I T Y 25


which is increasing in p1 and p2. Game tossed coin lands tails and wins a dollar straint on the modified transition prob-
B is losing if pwin ⬍ 1/2, or equivalently if it lands heads. In game A, the proba- abilities of the history-dependant game.
if p2 ⬍ q1. Let p ⬎ p2. A randomization bility of winning (losing) after a win is Removing the constraint improves win-
strategy in which game A is played with p1(q1) and of winning (losing) after a ning odds and leads to a strategy in
probability ␭ and game B is played with loss is p2(q2). In game B, the probability which the single-coin game is played
probability 1 ⫺ ␭ leads to the transition of winning (losing) after a win is p3 (q3) whenever it offers better odds than the
matrix, and of winning (losing) after a loss is coin one is forced to toss in the history-
p4(q4). Game A is losing if p2 ⬍ q1, and dependent game. One should note that
game B is losing if p4 ⬍ q3. Suppose there is a difference between Brownian
W L
冉 冊
p1 ⬍ p3 and p4 ⬍ p2. The transition
W p⬘1 q⬘1 ratchets and gambles of the sort consid-
L p⬘2 q⬘2 , matrix if game A is played with proba-
ered here and by Harmer and Abbott [1,
bility ␭ and game B is played with prob-
2] and Parrondo et al. [6]. In a Brownian
ability 1 ⫺ ␭ is
where p⬘i ⫽ ␭p ⫹ (1 ⫺ ␭)p⬘i , i ⫽ 1, 2. The ratchet particles can be subjected to ei-
probability of winning is p⬘win ⫽ p⬘2/ ther a flat potential or an asymmetric
W L
(q⬘1 ⫹ p⬘2). Thus, p⬘win ⬎ 1/2 implies p⬘2 ⬎
q⬘1. Substituting for p⬘j, j ⫽ 1, 2, gives the
W
L 冉␭␭pp ⫹⫹ 共1共1 ⫺⫺ ␭␭兲p兲p
1
2
3
4

1 ⫺ 共␭p1 ⫹ 共1 ⫺ ␭兲p3兲
1 ⫺ 共␭p2 ⫹ 共1 ⫺ ␭兲p4兲 .
potential. One cannot separate particles
by their states. Thus, one cannot deter-
condition ␭(2p) ⫹ (1 ⫺ ␭)(p1 ⫹ p2) ⬎ 1. ministically change the transition prob-
Probabilistic play (randomization) is a
As the lefthand side is a convex combi- abilities for some particles but not for
winning strategy if ␭(p1 ⫹ p2) ⫹ (1 ⫺
nation of 2p and p1 ⫹ p2, it obtains its others. Such a constraint does not nat-
␭)(p3 ⫹ p4) ⬎ 1, which is impossible
maximum value for either ␭ ⫽ 0 or ␭ ⫽ urally occur in games of chance. A
because p2 ⬍ q1 and p4 ⬍ q3. Now sup-
1. If the maxima is attained for ␭ ⫽ 1, player offered the choice of two games
pose game A is played after a loss and
then the condition reduces to 2p ⱖ can switch from one to the other when-
game B is played after a win. Then the
␭(2p) ⫹ (1 ⫺ ␭)(p1 ⫹ p2) ⬎ 1, which is ever it is profitable to do so. This strat-
impossible as p ⬍ 1/2. If the maxima egy dominates randomization.
occurs for ␭ ⫽ 0, then the condition It is incorrect to suggest that Parron-
reduces to p1 ⫹ p2 ⱖ ␭(2p) ⫹ (1 ⫺ One should note that there is a
do’s paradox represents an advance in
␭)(p1 ⫹ p2) ⬎ 1, which is again impos- difference between Brownian
utility theory [6]. Utility theory specifies
sible if pwin ⬎ 1/2. Thus, randomization ratchets and gambles of the sort
a set of preference axioms for which a
cannot be used to combine the two los- considered here and by Harmer
decision maker maximizes expected
ing games into a winning game. But and Abbott [1, 2] and
Parrondo et al. [6]. utility over risky outcomes. Neither ran-
strategic play can increase the winning
domization nor a deterministic betting
odds to more than one half if p2 ⬍ p ⬍
strategy is an extension of utility theory
p1. In this case, one plays game B when
coin B1 is to be used, and switches to corresponding transition matrix is because strategies have associated pay-
game A when game B can only be offs with which a player attaches utility.
played with coin B2. The Markov tran- W L So long as utility is monotonically in-
sition probability becomes W
L 冉 p3
p2
q3

q2 .
creasing in payoffs, a utility-maximizing
player will prefer higher payoffs to
lower ones. The fact that randomization
W L The steady-state probability of winning
冉 冊
gives positive returns means that a risk-
W p1 q1 is p2/(q3 ⫹ p2), which exceeds one half if
L p q , q3 ⬍ p2 . Thus, playing game A after a
neutral player will prefer to play a ran-
domized game than to not play it (and
loss and game B after a win gives better
prefer to play the trivial deterministic
where, p(q) now plays the role of p2(q2). than even odds of winning if p4 ⬍ q3 ⬍
strategy over a randomized game), and
The steady state probability of winning, p2 ⬍ q1. For example, p1 ⫽ 0.2, p2 ⫽
this is perfectly consistent with utility
p/(q1 ⫹ p), exceeds one half if p ⬎ q1, 0.75, p3 ⫽ 0.4, p4 ⫽ 0.5, gives the win-
theory, not an extension or departure
which is feasible. For example, p ⫽ 0.45, ning odds 15/31 for game A, 5/11 for
from it. Parrondo’s games have also
p1 ⫽ 0.6, p2 ⫽ 0.3 gives pwin ⫽ 3/7 for game B, and 15/27 for strategic play.
game B, so both A and B are losing been incorrectly said to be paradoxes of
games. However, playing game A when- CONCLUSION game theory [1, 2], despite the fact that
ever game B allows tossing only B2 gives There are deterministic ways of playing these are games against nature; game
the winning odds of 9/17. the games of Parrondo’s paradox that theory, in contrast, is concerned with
Finally, consider two games that are never do worse than randomization and the actions and reactions of strategic
both described by first-order Markov can provide better winning odds under players. Games of chance, of which Par-
chains. Once again, all bets are a dollar more general conditions. This happens rondo’s games are examples, are quite
per play; a player loses a dollar if a- because randomization forces a con- different because there is no active ad-

26 C O M P L E X I T Y © 2004 Wiley Periodicals, Inc.


versary whose behavior changes with Parrondo’s paradox is suggested to how randomize between births and
the opponents’ play. possibly operate in economics or social deaths, to create a realization of Parron-
One view that has caught the imag- dynamics to extract benefits from os- do’s paradox. In a similar vein, Parron-
ination of many people is that Parron- tensibly detrimental situations. Harmer do’s paradox has been compared with
do’s paradox suggests the possibility of and Abbott [1] suggest the example of the sacrifice of chess pieces to win a
making money by investing in losing declining birth and death rates. Each by game, to the decline in the fitness of a
stocks. The above analysis makes clear itself has a “negative” effect on a society species before it evolves a higher level
why such a conclusion is unwarranted. or an ecosystem, but declines in both of survival fitness, to the stabilization of
Even if stock markets were exactly mod- together might combine with “favorable unstable systems when combined in the
eled by the types of games considered consequences.” It is unclear what right way, and to public attitudes to-
by Parrondo, the strategy to use would Harmer and Abbot have in mind. First, ward politicians. As best as we know,
be the obvious one suggested by the it is not clear what negative and favor- none of these is related to the fact that
above analysis: sell stock that has par- able consequences mean; and second, changing the transition probabilities of
ticularly poor prospects and buy stock there is no reason, as far as we know, to a Markov chain affects the steady-state
with better prospects; this way you will believe that one or both of the rates of probabilities associated with the states,
always do better than if you randomize births and deaths have the structure of a which in the end is the entire reason for
over losing stocks. Markov chain or that nature can some- Parrondo’s paradox.

REFERENCES
1. Harmer, G.P.; Abbott, D. Game theory: Losing strategies that can win by Parrondo’s Paradox. Nature 1999a, 402, 864.
2. Harmer, G.P.; Abbott, D. Parrondo’s Paradox. Statistical Sci 1999b,14(2), 206 –213.
3. Blakeslee, S. Paradox in game theory: Losing strategy that wins. The New York Times 2000, p 5, col 2, sect F, Jan 25.
4. Casti, J.L. Losing games for your winning plays. Complexity 2001, 6(6), 11–14.
5. Paulos, J.A. Winning with losing games: A new paradox in the world of probability. ABC News (2000), http://abcnews.go.com/sections/science/WhosCounting/
whoscounting000601.html.
6. Parrondo, J.M.R.; Harmer, G.P.; Abbott, D. New paradoxical games based on Brownian ratchets. Phys Rev Lett 2000, 85(24), 5226 –5229.

© 2004 Wiley Periodicals, Inc. C O M P L E X I T Y 27

You might also like