Professional Documents
Culture Documents
83 – 95
10.1515/jaiscr-2015-0021
Abstract
Meanwhile, complex networks science is in- To test the effect of network topology, we use
creasingly used to model and analyse human in- two well known network classes, known as the
teractions [6, 7]. Many societies where humans or scale-free networks [7] and well-mixed networks.
other sentient agents interact display heterogeneous Also, we use two evolutionary processes to evolve
network topologies, in which some agents / people the populations. First, we test the effect of topology
are very well connected while others are sparsely using the death-birth Moran process [22], which
connected [7, 8]. In such societies or multi-agent is an evolutionary process used to model the evo-
networks [9], the topology of the network of in- lution of players over time, particularly in biolog-
teractions can be quantified by a range of metrics, ical systems. Then we use a stochastic strategy
including the clustering coefficient, average path adoption process that would update the strategy of
length, global and local assortativity [10, 11, 12], a randomly selected node, by comparing it with a
topological mutual information [13, 14], various selected neighbour’s strategy. The same process
static and dynamic centrality measures [15] and so has been used as an evolutionary process by San-
forth. Moreover, many of these social systems dis- tos et al. [23], with the pure-strategy PD game. We
play the so-called scale-free property, which is char- will refer to this evolutionary process as the ‘strat-
acterised by maximum heterogeneity in terms of egy adoption process’ in the rest of the paper. We
number of connections, and the small-world prop- compare and contrast the evolutionary outcomes
erty, where the characteristic path length is compar- of these two processes when players are placed in
atively small and the clustering comparatively high. both well-mixed networks and scale-free networks.
Therefore it is clear that the topology of any society Based on the results of these observations, we argue
or agent network must be considered in determining that the evolutionarily unstable strategies in a well-
the dynamics of a system, including how individual mixed population may survive and even dominate
agents make decisions in such systems. in a heterogeneous network of players. We show
In this work, we study how network topology that the topology of the interactions of the players,
affects the evolutionary stability of strategies. We the evolutionary update process as well as the ini-
use a class of strategies known as ‘memory-one tial topological distribution of players are signifi-
strategies’ in prisoner’s dilemma (PD) game to eval- cant determining the overall evolutionary stability
uate the effect of network topology on evolution- of a strategy. When the players are distributed in
ary stability. Iterated prisoner’s dilemma game has a homogeneous network however, the evolutionary
widely been used to model -the strategic decision process used would not have a significant effect the
making of self-interested opponents [16, 17, 18]. evolutionary stability of a strategy. We further test
In memory-one strategies, each player would base the effect of topology on the evolutionary stability
his action on a probability derived based on the by varying network assortativity [10], which is a
previous interaction with the same opponent [19, measure of the similarity of mixing of nodes in a
20]. In this work, we particularly focus on a sub- network. We argue that when the network becomes
class of memory-one strategies known as ‘Zero- more heterogeneous, network topology would have
Determinant(ZD) strategies’, along with other well- a more significant effect on the evolutionary stabil-
known memory-one strategies. Zero-determinant ity of strategies. We call this topologically influ-
strategies have been demonstrated to be extortion- enced evolutionary stability of strategies as ‘topo-
ate strategies, meaning that they have the ability to logical stability’.
unilaterally set the payoff of the opponent [21]. In- Understanding the topological effect on the evo-
tuitively, this would suggest that Zero-Determinant lutionary stability of a strategy would help us to
strategies have the potential to be evolutionarily sta- make better predictions about the evolutionary sta-
ble against any competing strategy. However, Zero- bility of a strategy in a real-world environment.
determinant strategies have later been shown to be Even though a strategy may be theoretically sta-
evolutionarily unstable in a well-mixed population ble or not, its actual evolutionary behaviour may
of players [18]. In this work, we test whether net- depend on the topology of the interconnections in
work topology affects the evolutionary stability of the population and the evolutionary update mech-
Zero-determinant strategies in a non-homogeneous anism used. By studying these effects extensively,
network of players. the modelling of evolutionary games may be im-
proved, by increasing the accuracy of the predic- for xεS. Suppose xi is th strategy profile of player i
tions of the evolutionary stability of strategies. and x−i be the strategy profile of all players except
The rest of the paper is organised as follows. player i. Thus, when each player iε1, ....., n chooses
We begin by providing a background on the the- strategy xi that would result in the strategy profile
oretical aspects of evolutionary game theory and x = (x1 , ...., xn ), giving a payoff of f (i) to that par-
complex network science, within the scope of this ticular player. A strategy profile x∗ εS is in Nash
work. Therein we introduce the memory-one strate- equilibrium if no unilateral deviation in strategy by
gies in the iterated prisoner’s dilemma (IPD) game, any single player would return a higher utility for
as well as a sub-class of memory-one strategies that particular player. Formally put,
known as Zero-determinant strategies. Further, we
explain death-birth Moran process and its signifi- ∀i, xi εSi : fi (xi∗ , x−i
∗ ∗
) ≥ fi (xi , x−i ) (1)
cance in determining the evolutionary stability of
a strategy, comparing it with the strategy adoption 2.2 Evolutionary Game Theory
process suggested by Santos et al. [23]. Next, we
present the results obtained by simulating both the Evolutionary game theory is an outcome of the
death-birth Moran process and the strategy adoption adaptation of game theory into the field of evo-
process in well-mixed and scale-free networks of lutionary biology [3, 4]. Evolutionary biology is
players. Further, we present the results on how the based on the idea that an organism’s genes largely
variation of network heterogeneity, measured using contribute to its observable characteristics. Thus,
the network assortativity, affects the evolutionary genes determine the fitness of an organism in a
stability of strategies. Finally, we discuss the re- given environment. Evolutionary game theory ar-
sults, presenting our conclusions. gues that the success of an organism depends on
its interaction with other organisms within a pop-
ulation. Thus, the fitness of an organism would be
2 Background dependent on its interaction with other organisms
in that population. In game theoretic terminology,
2.1 Game Theory the interactions of players would be the strategies
Game theory is the study of strategic decision and the fitness of each strategy would be the payoff.
making [1]. Game theory was first developed as a As the population evolves over time, certain strate-
branch of micro-economics [1, 2]. However, later it gies would be dominant while some other strategies
has been adopted in diverse fields of study, such as would be extinct. Thus, evolutionary game theory
evolutionary biology, sociology, psychology, politi- provides a mathematical basis to model the evolu-
cal science and computer science [2]. Game theory tionary process using concepts of game theory. On
has gained such wide applicability due to the preva- the other hand, it opens a new dimension for game
lence of strategic decision making across different theorists to observe how the evolution of strategies
fields of study. One of the key concepts of Game in games would happen over time in populations
theory is that of Nash equilibrium [24]. Nash equi- of players. Some of the critical questions asked in
librium suggests that there is an equilibrium state in EGT include; which populations/strategies are sta-
a strategic game, which neither player would bene- ble? When to individuals adopt other strategies?
fit deviating from. Nash equilibrium can be defined and would it be possible for mutants to invade a
for both pure strategy and mixed strategy scenar- given population?
ios. Also, a strategy game could have multiple Nash The equivalent concept to Nash Equilibrium in
equilibria. evolutionary game theory, is evolutionary stability
A formal definition for Nash equilibrium can be [25]. If Nash Equilibrium can be considered as a
given as follows. Let (S, f ) be a game with n play- static equilibrium, evolutionary stability represents
ers, where Si is the strategy set of a given player i. a dynamic equilibrium of a strategy, over time. A
Thus, the strategy profile S consisting of the strategy strategy is called evolutionarily stable if it has the
sets of all players would be, S = S1 ×S2 ×S3 ....×Sn . potential to dominate over any mutant strategy [3].
f = ( f1 (x), ....., fn (x)) would be the payoff function Evolutionary games are often modelled as iterative
games where a population of players play the same
game iteratively in a well-mixed or a spatially dis- dilemma game are based on the assumption that
tributed environment [26]. each player chooses a strategy based on a proba-
bility distribution. In fact, pure strategies can be re-
strategy Iterated
2.2.1 is called evolutionarily
prisoner’s dilemma stable if it has the potential garded strategies
as a called
special finite-memory
case of mixed strategies
strategies[18], [28],
where where the
to dominate over any mutant strategy [3]. Evolutionary games current mixed strategy would be dependent on n number of
each strategy is chosen with the probability of one.
Prisoner’s
are often modelled dilemma is a games
as iterative game where
that isa found in of
population historical states between the two players.
players play the theory
same game Memory-one strategies [18, 28] are a special sub-
classical game [27].iteratively
Given theinpayoffa well-mixed
matrix or a When considering the previous state between two players
spatially distributed environment [26]. class of mixed strategies, where the current mixed
in Fig. 1, the inequality T>R>P>S should be satis- in a PD game, there could be four possible states. Namely
1) Iterated prisoner’s dilemma: Prisoner’s dilemma is a strategy CC, CD, of aDC gameand would
DD, where depend on the immedi-
C represents cooperation and
fied
gameinthata prisoner’s
is found indilemma
classical game. In other
game theory [27].words,
Given the ate D previous
represents interaction
defection,between
respectively.the two players in
Memory-one strategies
in the prisoner’s
payoff matrix in Fig. dilemma game, theT>R>P>S
1, the inequality highest mutual should be are represented by calculating the probabilities of cooperation
satisfied are
in aobtained
prisoner’s concern. On the other hand, Memory-one strategies
payoffs bydilemma
the players game.whenIn other words, in
both play- by a player in the next move, given the type of the previous
the prisoner’s dilemma game, the highest mutual payoffs are are interaction a specialisation of a more
of the player with thegeneral
same classopponent. of strate-
For example,
ers cooperate. However, if one player cooperates
obtained by the players when both players cooperate. However, giesa called strategyfinite-memory
(1,1,1,1) would strategies
imply that [18,the 28], where
Player A would
while
if one the other
player defects,while
cooperates the defector
the other would
defects,obtain a
the defector cooperate with player B, irrespective of the previous encounter
the current mixed strategy would be dependent on n
higher payoff.
would obtain The dilemma
a higher payoff. The is that the Nash
dilemma is thatEqui-
the Nash between Player A and B. Similarly, (0,0,0,0) would represent a
Equilibrium of this game, which occurs when both players number
playerofthat historical states between
always defects. Thus, the the puretwo players.
strategy cooperation
librium of this game, which occurs when both play-
defect, does not provide the optimum payoff for both the and defection can be thought of as a special case of memory-
ers defect, does not provide the optimum payoff for
players.
When considering the previous state between
one or finite memory strategies. By varying the probabilities
both the players. twoofplayers
cooperationin a PD under game,
eachthere
of thecould previousbe four pos- it is
encounters,
sible states.
possible to Namely
define any CC,
number CD, of DC
mixed and DD,
strategies. where
Some of the
C represents cooperation and D represents defec-strategy
well-known memory-one strategies include the Pavlov
(1,0,0,1) and the general cooperator (0.935, 0.229, 0.266, 0.42)
tion, respectively. Memory-one strategies are rep-
strategy. General cooperator is the evolutionarily dominating
resented
strategybythat calculating
evolved atthe lowprobabilities
mutation rates of ascooper-
demonstrated
by Iliopoulos et al. [29]. Tit-for-Tat,
ation by a player in the next move, given the type is another wellofknowns
strategy where a player would only
the previous interaction of the player with the same cooperate if the opponent
used in Santos et al. [23] to demonstrate the evolu- spective in analysing complex systems. Different
tion of cooperation in IPD games with pure strate- classes of complex networks have been defined to
gies. We extend that method for mixed strategies model real-world complex systems such as social
in this work. When employing this particular strat- and biological systems. In this work, we mainly
egy adoption process, a node going through evolu- focus on two such network classes known as well-
tion is not directly replaced. Instead, its strategy mixed networks and scale-free networks that are
could be updated by comparing its cumulative pay- widely discussed in the network analysis literature.
off with that of a stochastically selected neighbour.
As with the Moran process, in each time-step, a 2.4.1 Well-mixed Networks
node is marked for update. Then, a potential node
Well-mixed networks has been one of the most
to compare it with is selected from its neighbours,
widely used network models in network analysis
based on a probability proportional to the fitness
since the inception of complex network science.
(accumulated payoff) of the neighbours. Then, the
Also, they are widely used in modelling evolution-
probability of the marked node adopting the strat-
ary games [29]. These networks are modelled, as-
egy of its selected neighbour is calculated using the
suming that each node is connected to every other
following equation.
node in the network. Due to this reason, well-mixed
networks can be regarded as a class of homoge-
neous networks. Therefore, it is possible to approx-
p = max{0, (Py − Px )/[k> (E (ZD, Pav) − E (Pav, ZD)]} imate well-mixed networks to lattice networks with
(5)
each node having the identical number of neigh-
Where, bours. However, most of the real-world networks
p - Probability that node X would adopt Y’s strategy such as social networks and biological networks
Px - Cumulative pay off of node X do not demonstrate well-mixed behaviour, instead
Py - Cumulative pay off of node Y being spatially distributed with non-homogeneous
k> - Maximum degree of X’s degree (kx ) and Y’s topological features [7]. Still, well-mixed networks
degree (ky ) provide a good reference model to compare with the
E (ZD, Pav) − The expected payoff of a ZD node non-homogeneous networks [16].
against a Pavlov node
E (Pav, ZD)– The expected payoff of a Pavlov node 2.4.2 Scale-free Networks
against a ZD node
Scale-free networks demonstrate a network
topology that shows a power-law degree distribu-
As shown above, the population update prob- tion [6]. In other words, the degree distribution of
ability depends on the payoff difference of the the network would fit in a equation of the form y
marked node and the selected neighbour node. = αx−γ . Here, γ is called the scale-free exponent.
Also, the degree of those two nodes are used to nor- The scale-free exponent is obtained by fitting a par-
malise the effect of degree differences. Still, the ticular degree distribution into a power-law curve.
cumulative payoff of the node with the higher de- As the scale-free exponent increases, so does the
gree would be higher due to the fact that it would power-law nature of the degree distribution. Also,
have more interactions with other players. Thus, if the correlation of the fitting is higher, such a de-
this equation implicitly captures the network topol- gree distribution would indicate that the respective
ogy in calculating the adoption probability. Due to network closely resembles a scale-free network.
this reason, this particular strategy adoption process
Scale-free nature is abundant in real-world net-
can be used to study the topological effect on the
works, such as in social, biological and collabora-
evolutionary stability of strategies.
tion networks [7]. Scale-free networks make per-
fect candidates to study the topological effect of a
2.4 Complex Networks population of players due to this reason. For ex-
Complex Networks are self-organising net- ample, it has been shown that cooperation becomes
works that show non-trivial topological features [7]. the dominant strategy in a scale-free topology due
Complex Network analysis provides a network per- to the heterogeneity of the network [16, 23]. More-
over, scale-free networks can be conveniently gen- of 1000 nodes that are connected via a lattice where
erated using the Preferential attachment growth each node is connected to 8 other random nodes.
model proposed by Barabasi and Albert [7]. Pref- Initially, the two strategies were distributed in a ran-
erential attachment model suggests that when a net- dom manner so that the ZD strategy would occupy
work grows in size, nodes with higher degree have a different fractions of the population (0.6 and 0.4)
higher probability of attracting new nodes. In other in each simulation run. Then, using the death-birth
words, there exists a degree-dependent ‘preference’ Moran process, the population was updated over
in creating links with the existing nodes, when a 150,000 time-steps to observe the evolution of the
new node joins the network. In this work, we use strategies.
scale-free networks generated using the preferential Afterwards, we changed the evolutionary pro-
attachment model for all our simulations. cess to the stochastic strategy adoption process used
by Santos et al. [23]. This enabled us to determine,
2.4.3 Network
2) Scale-free Assortativity
Networks: Scale-free networks demonstrate a random
whether nodes.
it isInitially, the two strategies
the population update were distributed
process or in
thea
network topology that shows a power-law degree distribution random manner so that the ZD strategy would occupy different
[6]. InAssortativity
other words, isthe the tendency
degree observed
distribution of the in net-
network network
fractions of topology
the population that(0.6
affects the
and 0.4) in evolutionary
each simulation sta-
run.
worksfit in
would where nodes
a equation mostly
of the form y connect
= αx . Here,
−γ
withγ similar
is called bilityusing
Then, of the
the strategies in concern.
death-birth Moran process, the population was
the scale-free exponent. The scale-free exponent is obtained by updated over 150,000 time-steps to observe the evolution of
nodes. Typically, this similarity is interpreted
fitting a particular degree distribution into a power-law curve.
in Then we replaced the well-mixed population
the strategies.
terms
As of the degrees
the scale-free exponent of nodes.so Assortativity
increases, does the power-lawhas with a non-homogeneous
nature Afterwards, we changed the scale-free
evolutionarynetwork
process towiththe
been offormally
the degreedefined
distribution.
as aAlso, if the correlation
correlation function of the
of 1000 nodes.
stochastic strategyAs with process
adoption the caseused ofbythe previous
Santos ex-
et al. [23].
fitting is higher, such a degree distribution would indicate that
excess degree distributions and link distribution
the respective network closely resembles a scale-free network. of This enabled the
periment, us to
ZDdetermine,
strategywhether
and theit Pavlov
is the population
strategy
update process or the network topology that affects the evolu-
a network [10, 13]. The formal definition of Assor- were stability
tionary assigned randomly
of the strategiesamong the nodes. With
in concern.
Scale-free nature is abundant in real-world networks, such
astativity makes
in social, use of
biological andnetwork topological
collaboration concepts
networks [7]. Scale- theThen
players spatially distributed in a scale-free net-
we replaced the well-mixed population with a non-
such as degree distribution p(k) and excess de-
free networks make perfect candidates to study the topological
work, both scale-free
homogeneous the death-birth
network Moran
with 1000 process
nodes. and the
As with
effect of a population of players due to this reason. For exam-
gree distribution q(k) for undirected networks [13].
ple, it has been shown that cooperation becomes the dominant
the case of the previous experiment, the ZD
strategy adoption process were applied separately strategy and the
Given in q(k), one cantopology
introduce thethequantity e j,k as Pavlov strategy were assigned randomly among the nodes.
strategy a scale-free due to heterogeneity of to observe thespatially
effect that they have on the evolution
With the players distributed in a scale-free network,
the network [16], [23]. Moreover, scale-free
the joint probability distribution of the remaining networks can
be conveniently generated using the Preferential attachment of strategies. The Fig. 3 depicts thestrategy
both the death-birth Moran process and the degreeadoption
distri-
degrees
growth of the
model two nodes
proposed at either
by Barabasi andend of [7].
Albert a randomly
Preferen-
process were applied separately to observe the effect that they
bution of the scale-free network used. As the figure
have on the evolution of strategies. The Fig. 3 depicts the
chosen link. Given these distributions, the assorta-
tial attachment model suggests that when a network grows in shows, the degree distribution shows a power-law
degree distribution of the scale-free network used. As the figure
size, nodes with higher degree have a higher
tivity of an undirected network is defined as: probability of
shows,
curvethethatdegree distribution with
is reminiscent showsthe a power-law curve that
typical scale-free
attracting new nodes. In other words, there exists a degree-
is reminiscent with the typical scale-free networks.
dependent ‘preference’[in creating links with the existing
] work, networks.
nodes, when a new node joins the( network. In this we use
1 )
scale-free networks
ρ= 2
σq ∑
generated using the preferential
jk e j,k − q j qk
model for all our simulations.
attachment
(6)
jk 400
3) Network Assortativity: Assortativity is the tendency
Number of nodes
observed in networks
where σq is thewhere nodes mostly
standard connectofwith
deviation similar
q(k). In
nodes. Typically, this similarity is interpreted in terms of the 300
this work,
degrees we Assortativity
of nodes. use networks has with varying defined
been formally assorta-
as
ativity values
correlation to observe
function of excesshow thedistributions
degree heterogeneity of
and link 200
distribution of a network
mixing patterns affect[10],
the[13]. The formal stability
evolutionary definition of
of
Assortativity makes use of network topological concepts such
asstrategies.
degree distribution p(k) and excess degree distribution q(k) 100
for undirected networks [13]. Given q(k), one can introduce
the quantity ej,k as the joint probability distribution of the
0
3 Methodology
remaining degrees of the two nodes at either end of a randomly
chosen link. Given these distributions, the assortativity of an 1 5 15 20
undirected network is defined as: Degree
Initially, we re-created the experimental results
obtained in the work
by Adami and Hintze [18], Figure
Fig. 3: The3.degree
The distribution
degree distribution of thenetwork
of the scale-free scale-free
used.
1 ∑
by mixing the ρ = Zero-determinant qk )
jk (ej,k − qjstrategy (6)
with the network used.
σq2
Pavlov strategy in a well-mixed population. This
jk
Next, we changed the initial distribution of the strategies in
further such aNext,
mannerwe thatchanged
the ZD strategy woulddistribution
occupy the majority
where σenabled us to confirm the theoretical re-
q is the standard deviation of q(k). In this work, we
the initial of the
of hubs. This was done by sorting the nodes according to their
sults
use presented
networks in the assortativity
with varying same workvalues usingtothe replica-
observe how strategies in such a manner that the ZD
degree and assigning the top 60% of the nodes with the ZD strategy
the
torheterogeneity
dynamics model, of mixing patterns that
suggesting affectthe
theZD
evolutionary
strategy would Inoccupy
strategy. the majority
this configuration, of hubs.
the initial averageThis
degreewas
of
stability of strategies. ZD nodes was measured to be 3.4 while the average degreede-
of
would be evolutionarily unstable against the Pavlov done by sorting the nodes according to their
Pavlov nodes was 1.8. The evolution of strategies was observed
strategy. To doIII. this,Mwe started with a population
ETHODOLOGY gree the
under anddeath-birth
assigningMoranthe top 60%as of
process welltheasnodes with
the strategy
adoption process. The experiment was repeated with the Pavlov
Initially, we re-created the experimental results obtained strategy occupying the majority of hubs.
in the work by Adami and Hintze [18], by mixing the Zero- Brought to you by | University of Sydney Library
determinant strategy with the Pavlov strategy in a well-mixed As the next step, we mixed the PavlovAuthenticated
strategy with the
population. This further enabled us to confirm the theoreti- general cooperator strategy and the cooperator strategy in
Download Date | 4/19/17 6:19 AM
cal results presented in the same work using the replicator separate scale-free networks of players. We tested the evolution
90 Dharshana Kasthurirathna, Mahendra Piraveenan, Shahadat Uddin
1 1
ZD population fraction
ZD population fraction
0.8 0.4 0.8 0.4
0.6 0.6
0.6 0.6
0.4 0.4
0.2 0.2
0 0
0 20000 40000 60000 80000 0 20000 40000 60000 80000
Time-step Time-step
(a) W ell − mixed (b) Scale − f ree
Fig. 2:Figure
The evolution of ZD population
2. The evolution fraction against
of ZD population fractiontheagainst
Pavlovthe
strategy,
Pavlovinstrategy,
well-mixed and scale-free
in well-mixed populations. The
and scale-free
population is allowed to evolve under the death-birth Moran process with 0.1% replacement rate. The strategies are initially
populations. The population is allowed to evolve under the death-birth Moran process with 0.1%
distributed randomly, with the fraction of ZD nodes being 0.4 and 0.6, respectively.
replacement rate. The strategies are initially distributed randomly, with the fraction of ZD nodes being 0.4
and 0.6, respectively.
ZD strategy is a unique and inherent property of the stratety 1
itself or whether it is a more general behaviour that could occur
the ZD strategy. In this configuration, the initial lation fractions of ZD players were recorded. The
ZD population fraction
with other strategies as well. 0.8
average degree of ZD nodes was measured to be results were averaged over Random
40 independent runs.
Hubs
Finally, we observed
3.4 while the degree
the average evolution of the Pavlov
of Pavlov nodes was and 0.6
ZD strategies in non-homogeneous networks while the net-
1.8. The evolution
work heterogeneity of strategies
was gradually was observed
varying. To perform under
this
the death-birth Moran process as well as the
test, we generated a set of scale-free networks with varyingstrat- 4 Results
0.4
egy adoption
assortativity values process. Theaexperiment
by rewiring scale-free was repeated
network in a 0.2
probabilistic manner. Then, both strategies were distributed In certain figures in this section, we have limited
with the Pavlov strategy occupying the majority of
randomly in each scale-free population in such a manner that the number of 0 time-steps shown, when the strat-
hubs.
the ZD strategy would occupy 60% of the nodes. Afterwards, egy in concern 0becomes extinct 50000 100000
reasonably quickly.
the populations
As the were
nextallowed to evolve
step, we mixed over 150,000 strat-
the Pavlov time- Time-step
Fig.2 shows the evolution of the fraction of ZD
steps under the strategy adoption process and the remaining
egy with the general cooperator strategy and the co-
population fractions of ZD players were recorded. The results nodes
Fig. 4: when the ZDofstrategy
The evolution is mixed
ZD population with against
fraction the the
operatorover
were averaged strategy in separateruns.
40 independent scale-free networks of Pavlov
Pavlov strategy,
strategy in in well-mixed
scale-free populations of varying
and scale-free popu- initial
players. We tested the evolution of strategies with a configurations. The population
lations. The evolutionary is allowed
process used istothe
evolve under the
death-
death-birth Moran process with 0.1% replacement rate. In
random initial distribution of strategies and a strat- birthtwoMoran
IV. R ESULTS the initial process. As expected,
configurations, ZD strategy ZDis strategy
either assigned
egy distribution where the opponent strategy (GC gradually becomes extinct in a well-mixed
randomly or assigned more on hubs (in hubs initialisation,popu- the
In or
certain figures is
cooperator) in initially
this section,
assignedwe mostly
have limited the
on hubs. initial
lation. This confirms that in a homogeneous are
average degrees of ZD and Pavlov nodes net-3.4 and
number of time-steps shown, when the strategy in concern 1.8, respectively).
This enabled us to determine whether any observed
becomes extinct reasonably quickly. Fig.2 shows the evolution work, ZD strategy is evolutionarily unstable against
of the topological stability
fraction of ZD nodesofwhen
ZD strategy is a unique
the ZD strategy and
is mixed the Pavlov strategy that operates as a strong evo-
inherent
with the Pavlov property
strategy inofwell-mixed
the stratetyanditself or whether
scale-free it
popula- lutionarily stable strategy, as suggested by Adami
tions. The evolutionary process used is the death-birth
is a more general behaviour that could occur with Moran and Hintze
process. As expected, ZD strategy gradually becomes extinct in cannot survive[18].
whenMoreover, ZDisdoes
the population not tosurvive
allowed evolve under
other strategies as well. evendeath-birth
the in a non-homogeneous
Moran process. population distributed
a well-mixed population. This confirms that in a homogeneous
network, ZD strategy
Finally, is evolutionarily
we observed unstable
the evolution against
of the Pavlovthe in a scale-free network, when the same evolutionary
Pavlovand
strategy that operates as a strong evolutionarily stable Fig.5 depicts the scenario where a well-mixed population
ZD strategies in non-homogeneous networks process is applied.
strategy, as suggested by Adami and Hintze [18]. Moreover, of ZD and Pavlov strategies are allowed to interact with each
while the network
ZD does not survive even in heterogeneity
a non-homogeneouswas gradually
population otherNext,
over Fig.4
time, depicts
according
thetoevolution
the strategy adoption
of the ZD andprocess
varying.
distributed To perform
in a scale-free this test,
network, whenwethegenerated a set of
same evolutionary instead of the Moran
Pavlov strategies process. Here
in a scale-free too, ZD is gradually
non-homogeneous
processscale-free
is applied.networks with varying assortativity val- eradicated from the population. However, when the same
network of players,
evolutionary process isunder
applieddifferent initial config-
in a scale-free population of
uesFig.4
Next, by rewiring
depicts athe
scale-free
evolutionnetwork
of the in ZDa probabilis-
and Pavlov urations. The figure shows the evolution
players, ZD strategy manages to survive, as shown of in theFig6[a].
tic in
strategies manner. Then,
a scale-free both strategiesnetwork
non-homogeneous were distributed
of players, According to the figure, Pavlov strategy shows
ZD fraction when the strategies are initially dis- weak evolution-
under randomly
different initial configurations.
in each The figure in
scale-free population shows
suchthe a ary stability, failing to eradicate the ZD
tributed randomly as well as when more hubs are strategy completely,
evolution of the ZD fraction when the strategies
manner that the ZD strategy would occupy 60% are initially when the two strategies are initially assigned randomly. On the
distributed randomly as well as when more hubs are assigned assigned
other hand,with
whenthethe
ZDZDstrategy initially.
is initially As to
assigned thethefig-
majority
of ZD
with the the nodes.
strategy Afterwards,
initially. As the
the populations
figure depicts, were al-
Pavlov urehubs
of depicts, Pavlov clearly
as depicted dominates
in Fig.6[b], and eradicates
ZD manages to become the
clearlylowed to evolve
dominates over 150,000
and eradicates the ZD time-steps
strategy, under the
suggesting the ZD
weak strategy, suggesting
evolutionarily that irrespective
stable strategy over the Pavlovof the strategy,
that irrespective of the initial
strategy adoption processdistribution of strategies,
and the remaining popu-ZD becoming the dominant strategy in the network. However, as
ZD
0
000 0 20000 40000 60000 80000
Time-step
(b) Scale
EVOLUTIONARY STABLE − f ree
STRATEGIES IN . . . 91
gainst the Pavlov strategy, in well-mixed and scale-free populations. The
h Moran process with 0.1% replacement rate. The strategies are initially
being 0.4 and 0.6,initial
respectively.
distribution of strategies, ZD cannot survive players, ZD strategy manages to survive, as shown
when the population is allowed to evolve under the in Fig6[a]. According to the figure, Pavlov strat-
death-birth Moran process. egy shows weak evolutionary stability, failing to
e stratety 1 eradicate the ZD strategy completely, when the two
uld occur strategies are initially assigned randomly. On the
ZD population fraction
Moreover, of ZD and Pavlov0.8 strategies are allowed to interact with each 0.8
on the initial distribution of the strategies, when the
other over time, according to the 0.4strategy adoption process
opulation 0.6 population is updated using the strategy adoption
lutionary instead of the0.6Moran process. Here too, ZD is gradually 0.6
eradicated from the population. However, when the same evolutionary process. This suggests that topological
evolutionary process
0.4 is applied in a scale-free population of influence on the0.4evolutionary stability is not limited
d Pavlov players, ZD strategy manages to survive, as shown in Fig6[a]. to the ZD strategy, but may apply to other strategies
f players, According to the0.2 figure, Pavlov strategy shows weak evolution- 0.2
hows the ary stability, failing to eradicate the ZD strategy completely, as well.
initially 0
when the two strategies are initially assigned randomly. On the Finally, we 0tested the evolution of the ZD pop-
assigned other hand, when0the ZD5000 10000assigned
is initially 15000to the
20000majority -0.5 -0.3 -0.1 0.1 0.3 0.5 0.7 0.9
Time-step ulation against the Pavlov strategy while the het-
Assortativity
, Pavlov of hubs as depicted in Fig.6[b], ZD manages to become the
uggesting weak evolutionarily stable strategy over the Pavlovagainst
strategy, erogeneity of the networks are gradually changed.
Fig. 5: The evolution of ZD population fraction the Fig. 9: The evolution of ZD population fraction against the
gies, ZD becoming
Figure
Pavlov the5.dominant
strategy,The strategyofinZD
in aevolution
well-mixed thepopulation
network.The
population. However,
fraction as Fig.9
population
shows the variation of the remaining ZD frac-
Pavlov strategy, in scale-free populations with varying network
is allowed to evolve
against under the
the Pavlov strategy
strategy, inadoption process. The
a well-mixed tion after 150,000
assortativity values.time-steps under
The population the strategy
is allowed to evolve under
strategies are initially distributed randomly,
population. The population is allowed to evolve with the fraction adoption
the process.
strategy As
adoption the figure
process. shows,
The there
results are exits
averaged over
of ZD nodes being 0.4 and 0.6, respectively. 40 independent
a negative runs. between the network assor-
correlation
under the strategy adoption process. The strategies
are initially distributed randomly, with the fraction tativity and the remaining ZD fraction. The actual
of ZD nodes being 0.4 and 0.6, respectively. Pearson correlation value of the two series is -0.85,
the same figure shows, when the Pavlov strategy is initially suggesting a strong negative correlation. Network
would influence the decisions made by individual entities, and
assigned to the majority of hubs, it behaves as a strong assortativity is a remain
many questions measure of the similarity
answered or how
in regards to the topology
Fig.5 depicts
evolutionarily the scenario
stable strategy, wipingwhere
out thea ZD
well-mixed
population. homogeneity
can influenceofthethe mixingindividual
strategies patterns players
of the nodes.
will choose over
Thispopulation of ZD
suggests that andthe
under Pavlov strategies
strategy adoptionareevolutionary
allowed time.
Therefore, this result suggests that the effect of net-
process, the evolutionarily
to interact with each unstable
other overZDtime,
strategy may nottoonly
according
survive, work Intopology on we
this work, theattempted
evolutionary stability
to evaluate howofthe
a network
the strategy adoption process instead of the strategy
but may even become the more prominent Moran in strategy
topologyincreases, as the network
of a population of playersbecomes more
affect the evolutionary
a non-homogeneous population of players.
process. Here too, ZD is gradually eradicated from stability of a strategy. In particular, we focused on a class
heterogeneous.
Then, we mixed However,
the population. the Pavlovwhen
strategy
thewith
samethe GC and the
evolution- of strategies known as Zero-determinant strategies, which has
cooperator strategies respectively, in a Scale-free population. been demonstrated to be evolutionarily unstable against the
ary process is applied in a scale-free population of
Fig.7 and Fig. 8 show the evolution of the GC, cooperator, de- Pavlov strategy.
fector and Tit-for-Tat strategy fractions over time, when those
strategies are initially placed randomly or mostly on hubs. As Based on the results gathered from this research, we
Brought to you by | University of Sydney Library
the figures depict, the opposing strategies too may survive or suggest that network topology hasAuthenticated
an effect on whether a
dominate the population based on the initial distribution of the particular strategyDownload
is evolutionarily stable
Date | 4/19/17 6:19 or
AMnot. However,
strategies, when the population is updated using the strategy the topologically influenced evolutionary stability is a weak
92 Dharshana Kasthurirathna, Mahendra Piraveenan, Shahadat Uddin
1
1 1
1
1 1
fraction
fraction
fraction
fraction
0.8
0.8 0.8
0.8
fraction
fraction
0.8 0.8
0.6
0.6 0.6
0.6 ZD
ZD at
at Hubs
population
population
Hubs
population
population
0.6 0.6 Pavlov
Pavlov at Hubs
ZD at Hubs
population
population
0.4
0.4 0.4
0.4 Pavlov at Hubs
0.4 0.4
0.2
0.2 0.2
0.2
ZD
ZD
ZDZD
ZDZD
0.2 0.2
0
0 0
0
0 0
0 50000
50000 100000
100000 150000
150000 0 0
0 50000
50000 100000
100000 150000
150000
0 50000 100000
Time-step
Time-step 150000 0 50000 100000
Time-step
Time-step 150000
(a) Time-step (b) Time-step
Randomly −
(a) Randomly − initialised
initialised Hubs −
(b) Hubs − initialised
initialised
(a) Randomly − initialised (b) Hubs − initialised
Fig.
Fig. 6:
6: The
The evolution
evolution ofof ZD
ZD population
population fraction
fraction against
against the
the Pavlov
Pavlov strategy,
strategy, in
in scale-free
scale-free populations
populations of
of varying
varying initial
initial
Fig. 6: Figure
configurations.
The 6.
The The
evolution evolution
population
of ZD is of ZD
allowed
population population
to evolve
fraction fraction
under
againstthe
theagainst
strategy
Pavlov the Pavlov
adoption
strategy, strategy,
process.
in (a) in
scale-freeZDscale-free
and populations
Pavlov
populations
configurations. The population is allowed to evolve under the strategy adoption process. (a) ZD and Pavlov strategies strategies
of ofinitial
varyinginitially
initially
distributed randomly
configurations.
varying
distributed The (b)
initial
randomly Majority
population is of hubs
allowed
(b) configurations.
Majority of are
hubsThe
are either
to evolve assigned
underisthe
population
either assigned with
allowed
with ZD
strategy
ZDto or Pavlov
oradoption
evolve (the
Pavlov under
(the initial
process.
the(a)
initial average
ZD and
strategy
average degrees
Pavlovof
adoption
degrees hub
hub and
strategies
ofprocess. non-hub
initially
and (a)
non-hub
strategies
distributedare 3.4 and
randomly 1.8,
and(b) respectively).
1.8,Majority ofinitially
hubs aredistributed
either assigned with ZD
strategies are
ZD 3.4 Pavlov
and respectively).
strategies randomly (b)orMajority
Pavlov (the initialare
of hubs average
eitherdegrees of with
assigned hub and
ZDnon-hub
or
strategies are 3.4 and 1.8, respectively).
Pavlov (the initial average degrees of hub and non-hub strategies are 3.4 and 1.8, respectively).
1 1
fraction
1 1
fraction
1 1
fraction
fraction
fraction
0.8
0.8 0.8
0.8
fraction
population
0.8 0.8
population
0.6 0.6
population
Random Random
population
0.6 Hubs
Hubs
Random 0.6 Hubs
Hubs
Random
population
0.4
0.4 Hubs 0.4
0.4 Hubs
Coorperator
0.4 0.4
Coorperator
0.2 0.2
Coorperator
0.2 0.2
GC
GCGC
0.2 0.2
0
0 0
0
0 0
0 50000
50000 100000
100000 150000
150000 0 0
0 50000
50000 100000
100000 150000
150000
0 50000 100000
Time-step
Time-step 150000 0 50000 100000
Time-step
Time-step 150000
(a)
(a) GC − Time-step
GC − vs −P
vs − P avlov
avlov (b) Time-step
(b) Cooperator − vs
Cooperator − vs −
−PP avlov
avlov
(a) GC − vs − P avlov (b) Cooperator − vs − P avlov
Fig.
Fig. 7:
7: The
The evolution
Figure 7.
evolution of GC
GC and
of The cooperator
evolution
and of GC
cooperator strategies competing
and cooperator
strategies against
against the
competingstrategies Pavlov
Pavlov strategy,
thecompeting againstin
strategy, scale-free
inthe Pavlovpopulations
scale-free strategy, inof
populations of varying
varying
initial
Fig. 7: configurations.
The evolution The
of
scale-freeThe
initial configurations. GCpopulation
and
populations
population is allowed
cooperator
ofisvarying to
allowedinitialevolve
strategies under
competing
to evolve the strategy
against
configurations. the adoption
Pavlov process.
strategy,
The population
under the strategy in In the two
scale-free
is allowed
adoption process. to two
In the initial configurations,
populations
evolve under
initial of varying
the
configurations,
the
the competing
initial strategy
configurations.
competing is
is either
The
strategy assigned
population
either randomly
is allowed
assigned or assigned
to evolve undermore on
on hubs
the strategy (in hubs
adoption initialisation,
hubsprocess. In the the
twoinitial
initialaverage degrees
configurations,
of
thecompeting
strategy
competingnodes
adoption
and process. Inrandomly or
the two1.8, assigned
initial more
configurations, hubs
the(in
competinginitialisation,
strategy istheeither
initial average
assigned degrees
of competing nodes andisPavlov
strategy either nodes
Pavlov nodes are
assigned 3.4
3.4 and
arerandomly respectively).
or assigned
and 1.8, more on hubs (in hubs initialisation, the initial average degrees
respectively).
of competingrandomly
nodes or
andassigned more are
Pavlov nodes on hubs (in1.8,
3.4 and hubs initialisation, the initial average degrees of competing nodes
respectively).
and Pavlov nodes are 3.4 and 1.8, respectively).
1
1 1
1
fraction
fraction
fraction
fraction
1 1
fraction
fraction
0.8
0.8 0.8
0.8
0.8 0.8
population
population
population
population
0.6
0.6 Random 0.6
0.6 Random
population
population
Random Random
0.6 Hubs
Hubs
Random 0.6 Hubs
Hubs
Random
0.4
0.4 Hubs 0.4
0.4 Hubs
0.4 0.4
Tit-for-Tat
Defector
Tit-for-Tat
Defector
0.2
0.2 0.2
0.2
Tit-for-Tat
Defector
0.2 0.2
0
0 0
0
0 0
0 50000
50000 100000
100000 150000
150000 0 0
0 50000
50000 100000
100000 150000
150000
0 50000 100000
Timestep
Timestep 150000 0 50000 100000
Timestep
Timestep 150000
(a) Timestep (b) TTimestep
(a) Def ector −
Def ector vs −
− vs −PP avlov
avlov (b) T
T it −f
it − f or −T
or − − vs
at −
at −P
vs − P avlov
avlov
(a) Def ector − vs − P avlov (b) T it − f or − T at − vs − P avlov
Fig.
Fig. 8:
8: The evolution
TheFigure
evolution of
of Defector
Defector and Tit-for-tat strategies competing against the
the Pavlov strategy, in
in scale-free populations
of 8. The evolutionand Tit-for-tatand
of Defector strategies competing
Tit-for-tat against
strategies competing Pavlov strategy,
against the Pavlov scale-free
strategy, populations
in
Fig.varying
of 8: Theinitial
varying configurations.
evolution
initial of DefectorThe
configurations. andpopulation
The Tit-for-tatis
population allowed
allowed to
isstrategies evolve
evolve under
competing
to the
against
under strategy
thethe Pavlovadoption
strategy strategy,process.
adoption In
In the
in scale-free
process. two
two initial
the populations
initial
of scale-free
configurations, the
varying initial
configurations,
populations
competing
theconfigurations.
of varying
competing strategy
The
strategy is
is either
populationinitial
either assigned
configurations.
is allowed
assigned randomly or
to evolve
randomly
The population
or assigned
under
assigned more
the onishubs
strategy
more
allowed
on hubs (in toprocess.
(in hubs
adoptionhubs
evolve under
In the the
initialisation,
initialisation, the
the initial
two initial
average
average degrees
configurations,
degrees of
strategy
the adoptionnodes
of competing
competing and
process.
strategy
nodes and Pavlov
isIn
either
Pavlov nodes
the two
assigned
nodes are
initial 3.4
3.4 and
and 1.8,
arerandomly respectively).
configurations, the competing
or assigned
1.8, more on hubs
respectively). strategy is either
(in hubs assignedthe initial
initialisation,
average degrees
randomlyof competing nodes
or assigned and
more onPavlov nodes
hubs (in hubsareinitialisation,
3.4 and 1.8, respectively).
the initial average degrees of competing nodes
and Pavlov nodes are 3.4 and 1.8, respectively).
Further, it is important to note that not only Arend Hintze of Michigan State University for their
the topology, but also the initial distribution of the valuable comments and suggestions.
strategies within the network too plays a significant
role in shaping the evolution of the strategies. For
instance, when the evolutionarily unstable strategy References
occupies hubs as opposed to the leaf nodes at the [1] J. Von Neumann and O. Morgenstern, “Game the-
initiation of the evolution, it even manages to be- ory and economic behavior,” 1944.
come the more prominent strategy within the net- [2] E. Rasmusen and B. Blackwell, Games and infor-
work over time, resembling a weak evolutionarily mation. Cambridge, 1994, vol. 2.
stable strategy.
[3] J. M. Smith and G. Price, “lhe logic of animal con-
Even though we mainly focused on the ZD and flict,” Nature, vol. 246, p. 15, 1973.
Pavlov strategies in this work, we could replicate [4] J. M. Smith, Evolution and the Theory of Games.
similar observations with other well-known strate- Springer, 1993.
gies such as the general cooperator and cooperator [5] J. Bendor and P. Swistak, “Types of evolutionary
strategies, competing against the Pavlov strategy. stability and the problem of cooperation.” Proceed-
This could mean that the variation of evolutionary ings of the National Academy of Sciences, vol. 92,
stability due to topological stability of strategies is no. 8, pp. 3596–3600, 1995.
a more general phenomena that may be applica- [6] A.-L. Barabási and R. Albert, “Emergence of scal-
ble to most strategies that are competing with each ing in random networks,” science, vol. 286, no.
other. In addition, the topological effect on the evo- 5439, pp. 509–512, 1999.
lutionary stability may be observed for other games [7] R. Albert and A.-L. Barabási, “Statistical mechan-
other than the Prisoner’s dilemma game. For in- ics of complex networks,” Reviews of Modern
stance, it would be interesting to observe how the Physics, vol. 74, pp. 47–97, 2002.
extensive form games, where the temporal dimen- [8] G. Thedchanamoorthy, M. Piraveenan, S. Ud-
sion is built-in to the strategic decisions, are af- din, and U. Senanayake, “Influence of vaccination
fected by the topological placement of the players. strategies and topology on the herd immunity of
Such studies may be useful in accurately predicting complex networks,” Social Network Analysis and
the evolutionary stability of strategies in scenarios Mining, vol. 4, no. 1, pp. 1–16, 2014.
such as auctions, which can be modelled with ex- [9] M. Prokopenko, M. Piraveenan, and P. Wang, “On
tensive form games. convergence of dynamic cluster formation in multi-
agent networks,” Advances in Artificial Life, pp.
In conclusion, we could identify the following 884–894, 2005.
three factors that determine the topological stability
[10] M. E. J. Newman, “Mixing patterns in networks,”
of strategies in a non-homogeneous network. Net- Physical Review E, vol. 67, no. 2, p. 026126, 2003.
work topology, evolutionary process and the initial
[11] M. Piraveenan, M. Prokopenko, and A. Y. Zomaya,
distribution of the strategies. By varying these three
“Local assortativeness in scale-free networks,” Eu-
factors, an evolutionarily unstable strategy may be rophysics Letters, vol. 84, no. 2, p. 28002, 2008.
able to survive and may even operate as a weak
[12] ——, “Assortative mixing in directed biological
evolutionarily stable strategy, in a population of
networks,” IEEE/ACM Transactions on computa-
players connected in a non-homogeneous topology. tional biology and bioinformatics, vol. 9(1), pp.
Based on our observations, the topological stabil- 66–78, 2012.
ity of strategies may be more prevalent in the social
[13] R. V. Solé and S. Valverde, “Information theory
context of the evolution of strategies, in comparison of complex networks: on evolution and architec-
to the biological context. tural constraints,” in Complex Networks, ser. Lec-
ture Notes in Physics, E. Ben-Naim, H. Frauen-
felder, and Z. Toroczkai, Eds. Springer, 2004,
Acknowledgment vol. 650.
[14] M. Piraveenan, D. Polani, and M. Prokopenko,
The authors would like to thank Michael Harrè
“Emergence of genetic coding: an information-
of the Centre for Complex Systems Research at theoretic model,” in Advances in Artificial Life.
the University of Sydney and Chistoph Adami and Springer Berlin Heidelberg, 2007, pp. 42–52.
[15] M. Piraveenan, M. Prokopenko, and L. Hos- [23] F. Santos, J. Rodrigues, and J. Pacheco, “Graph
sain, “Percolation centrality: Quantifying graph- topology plays a determinant role in the evolution
theoretic impact of nodes during percolation in net- of cooperation,” Proceedings of the Royal Society
works,” PloS one, vol. 8, no. 1, p. e53095, 2013. B: Biological Sciences, vol. 273, no. 1582, pp. 51–
[16] F. C. Santos and J. M. Pacheco, “Scale-free net- 55, 2006.
works provide a unifying framework for the emer-
[24] J. F. Nash et al., “Equilibrium points in n-person
gence of cooperation,” Physical Review Letters,
games,” Proceedings of the national academy of
vol. 95, no. 9, p. 098104, 2005.
sciences, vol. 36, no. 1, pp. 48–49, 1950.
[17] D. B. Fogel, “Evolving behaviors in the iterated
prisoner’s dilemma,” Evolutionary Computation, [25] J. Maynard Smith, “The theory of games and the
vol. 1, no. 1, pp. 77–97, 1993. evolution of animal conflicts,” Journal of theoreti-
[18] C. Adami and A. Hintze, “Evolutionary instabil- cal biology, vol. 47, no. 1, pp. 209–221, 1974.
ity of zero-determinant strategies demonstrates that
winning is not everything,” Nature communica- [26] S. Le and R. Boyd, “Evolutionary dynamics of the
tions, vol. 4, 2013. continuous iterated prisoner’s dilemma,” Journal
of theoretical biology, vol. 245, no. 2, pp. 258–267,
[19] D. Kasthurirathna, M. Piraveenan, and M. Harre, 2007.
“Influence of topology in the evolution of coordi-
nation in complex networks under information dif- [27] A. Rapoport, Prisoner’s dilemma: A study in con-
fusion constraints,” The European Physical Jour- flict and cooperation. University of Michigan
nal B, vol. 87, no. 1, pp. 1–15, 2014. Press, 1965, vol. 165.
[20] D. Kasthurirathna, H. Nguyen, M. Piraveenan,
S. Uddin, and U. Senanayake, “Optimisation [28] D. P. Kraines and V. Y. Kraines, “Natural selec-
of strategy placements for public good in com- tion of memory-one strategies for the iterated pris-
plex networks,” in Social Computing (SocialCom), oner’s dilemma,” Journal of Theoretical Biology,
2014 International Conference on. IEEE, 2014, vol. 203, no. 4, pp. 335–355, 2000.
p. in print.
[29] D. Iliopoulos, A. Hintze, and C. Adami, “Critical
[21] W. H. Press and F. J. Dyson, “Iterated prisoners
dynamics in the evolution of stochastic strategies
dilemma contains strategies that dominate any evo-
for the iterated prisoner’s dilemma,” PLoS compu-
lutionary opponent,” Proceedings of the National
tational biology, vol. 6, no. 10, p. e1000948, 2010.
Academy of Sciences, vol. 109, no. 26, pp. 10 409–
10 413, 2012. [30] A. J. Stewart and J. B. Plotkin, “Extortion and co-
[22] P. A. P. Moran et al., “The statistical processes of operation in the prisoners dilemma,” Proceedings
evolutionary theory.” The statistical processes of of the National Academy of Sciences, vol. 109,
evolutionary theory., 1962. no. 26, pp. 10 134–10 135, 2012.