The Impact of Meta-Strategy On Attendance Dynamics

The Impact of Meta-Strategy on Attendance Dynamics
in the El Farol Bar Problem

Rebecca E. Cohen∗ and Juan G. Restrepo†
Department of Applied Mathematics, University of Colorado at Boulder, Colorado 80309, USA
The El Farol Bar Problem is a classic computational economics problem in which agents attempt
to attend a weekly event at a bar only if it is not too crowded. Each agent has access to multiple
competing strategies that may be used to predict whether attendance will be above the tolerance
threshold. Many different variations of the El Farol Bar Problem have been published, with a
particularly broad variation in the choice of meta-strategy, the algorithm by which each agent selects
the best-performing strategy. This paper discusses the varying mechanisms of self-organization
within the system when different meta-strategies are used, including the existence and location of
fixed points in the limit of an infinite number of agents. Informed by these mechanisms, we present
an evolutionary paradigm that reduces the likelihood of degenerate cases where attendance is always
arXiv:2306.07885v1 [nlin.AO] 13 Jun 2023
too high or too low, while reducing the amplitude of fluctuations in attendance. This evolutionary
paradigm sheds light on the critical role of heterogeneity of strategies selected in the emergence of
a stable steady state in the El Farol Bar Problem.
I. INTRODUCTION agent has access to a pool of strategies, out of which

they select the one that would have performed best dur-
The El Farol Bar Problem was introduced by ing some prior period [1, 2, 4–7, 10–12]. Alternatively,
economist Brian Arthur in 1994 to explore the role of agents may update their strategies based on past behav-
bounded rationality in multi-agent systems [1]. In the ior of the system [3–5, 13, 14]. These two approaches
model, each agent competes to utilize a limited resource. may be used separately or in tandem. The literature
The scenario presented is a weekly event at a bar with contains many different approaches to updating strate-
live music that is only enjoyable when not too crowded, gies. Depending on the implementation, attendance may
but the same concept has been applied to scenarios rang- coalesce to a stable level close to the threshold [3, 5], or
ing from traffic flow to stock trading [2]. Each agent must fluctuate wildly [13]. Some implementations that involve
make decisions based on imperfect models of the system, agent adaptations do so by allowing agents to either col-
and these decisions determine the future dynamics of the laborate or learn from each other’s choices. Intriguingly,
system. allowing agents to collaborate or copy each other’s pre-
In Arthur’s original setup, each agent has a fixed collec- viously effective strategies may lead to greater volatility
tion of strategies available to them. At each iteration, the and lower overall resource utilization [5, 15].
agents select whichever strategy would have performed Models that allow agents to select between multiple
best in prior weeks and use that strategy to make a deci- strategies must employ a meta-strategy, that is, an algo-
sion about whether to attend the event. In simulations, rithm by which agents select their preferred strategy out
attendance at the bar often appears to oscillate around of those options available. There is considerable varia-
the threshold of crowding that agents are willing to toler- tion in the literature with regard to how strategies are
ate. Many different variations of the problem have been evaluated. For example, the accuracy of a strategy may
described, most of which converge to a similar state in be computed as a binary function of whether the strategy
which attendance fluctuates close to the threshold [2– would lead to an agent making the correct choice at each
6]. However, at least in some formulations of the prob- time step [6, 7, 11, 12] or as a numerical-valued function
lem, the mean attendance is observed to differ from the of the prediction error [2, 7, 9].
threshold of crowding that agents are willing to toler-
ate [7, 8], and volatility in attendance dynamics has also In this paper, we explicitly consider the impact of
been shown to be sensitive to parameters governing the meta-strategy, the algorithm by which an agent selects
system’s behavior [7–10]. their strategy, on the dynamics of the system. We
In the El Farol Bar Problem, each agent must make a present one meta-strategy that robustly drives oscilla-
choice about whether to attend the bar based on models tions around the threshold but is prone to large fluctu-
accessible to the agent that do not accurately represent ations in many regimes, and another one in which at-
the dynamics of the system. Previous works are based on tendance may reach a fixed point either above or below
two main approaches. In the first approach, used in the the threshold in certain regimes. By analyzing the behav-
original formulation of the El Farol Bar Problem, each ior of agents probabilitistically, we find analytical expres-
sions and necessary conditions for the existence of fixed
points of the dynamics of attendance. Furthermore, we
present an evolutionary paradigm that improves the ro-
∗ rebecca.cohen@colorado.edu bustness of both meta-strategies, leading to smaller fluc-
† juanga@colorado.edu tuations close to the threshold.
2
Our paper is organized as follows. In Section II, we

present our model for the El Farol Bar Problem and in-
Ctbin (a) =

troduce two different meta-strategies. In Section III, we t − m ≤ j < t (N̂t−j < T ≤ Nt−j )
(2)
analyze the system under each meta-strategy. In Section or (N̂t−j ≥ T > Nt−j ) .
IV, we present an evolutionary paradigm and show how it
modifies the dynamics of the system. Finally, we present The binary decision meta-strategy is used, for example
our conclusions and discuss our results in Section V. in Refs. [4, 5, 7, 9, 11], while Refs. [6, 14] use a similar
meta-strategy in which the cost is computed as an expo-
nentially weighted mean over all time instead of equally
II. MODEL weighting the costs of the m previous weeks.
In contrast to the binary decision meta-strategy, agents
In this section we specify our version of the El Farol could also select the strategy whose attendance predic-
Bar Problem. We begin with a population of A agents, tions are closest to the actual recent history, regardless
all of whom prefer to attend the bar only if the number of whether they are above or below the threshold. Thus,
of agents attending will be less than a threshold T . we consider a protocol in which agents select the strategy
Each agent has a choice of s different strategies, out with the lowest squared error over the previous m weeks,
of which they pick the one they expect to perform best, henceforth referred to as the error minimization meta-
according to a fixed meta-strategy. At time t, we denote strategy. In this case, the cost for strategy a at week t is
the previous m weeks’ attendance as [Nt−m , Nt−m+1 , ..., given by
Nt−1 ], and we call m the memory window. Each strategy
estimates the attendance at week t as a linear combina-
tion of the previous m weeks’ attendance numbers, with Ctem (a) = (Nt−1 − N̂t−1 )2 + (Nt−2 − N̂t−2 )2 +
the weights assigned to each week’s attendance varying (3)
... + (Nt−m − N̂t−m )2 .
between strategies. Thus, the ith strategy will estimate
that attendance on week t will be The error minimization meta-strategy is less frequent
in the literature, although various meta-strategies that
m
X rely on penalizing larger prediction errors have been used
N̂i,t = ai,j Nt−j . (1) [2, 9, 13].
j=1
The weights assigned to each strategy are sampled

from a distribution f (a) where a = (a1 , a2 , · · · , am ). In III. STRATEGY DISTRIBUTION AND STEADY
this paper, we take the ai,j to be independent and uni- STATE BEHAVIOR
formly distributed in [−1, 1]. The agents use a fixed cost
function to evaluate the performance of each strategy. In this section we will study the dynamics of atten-
The strategy with the lowest cost is selected, and the dance under the two different meta-strategies we pre-
agent attends the bar at week t if their chosen strategy sented above. An often-mentioned feature of the El Farol
predicts that attendance will be strictly less than the Bar Problem is that attendance fluctuates closely around
threshold. the threshold T . This is often taken as evidence of emer-
Each agent must test their strategies on the outcomes gent self-organization. However, as we shall see, situa-
of the m prior weeks, and each predictor requires m pre- tions where attendance oscillates wildly or settles above
vious weeks of history. Thus an initial seed history of or below the threshold are also common depending on
length 2m is needed to run the model. This seed history parameter choices. We will illustrate these with numeri-
is generated by sampling each of the initial 2m atten- cal simulations and analytical calculations for both meta-
dance values independently from a uniform distribution strategies.
over {0, 1, ..., A}.
A. Binary Decision Meta-Strategy

A. Meta-Strategies
Here we consider the binary decision meta-strategy in-
One way to evaluate the cost of each strategy is to troduced in Section II A. Fig. 1 shows examples of atten-
treat performance as a binary outcome. Each strategy is dance Nt as a function of time for various combinations
satisfactory in week t if the choice implied by its predic- of memory window m and strategy count s, and thresh-
tion (e.g., above or below the threshold) was the correct old T (indicated with the dashed line), using A = 100.
choice to make in that week. The agent selects whichever For sufficiently low values of m and s, attendance set-
strategy has the most correct choices in the previous m tles into a fixed point above the threshold, as shown in
weeks, and ties are broken at random. We shall hence- Fig. 1(a). Considering that agents enjoy attending only
forth refer to this as the binary decision meta-strategy. when attendance is below the threshold, this is an unfa-
Under this strategy, the cost at week t for strategy a is vorable outcome for the system. Figs. 1(b)-(f) show that
3
FIG. 1: Attendance Nt (solid blue curve) versus t for the binary decision meta-strategy, with A = 100, and various
values of T (orange dashed line), m, and s.
for larger values of m and s, attendance does fluctuate For a given strategy a, if Nt−j = N ∗ for all j ∈
around the threshold, but fluctuations may be very large. {1, 2, ..., m}, the prediction for each time point in the
Fig. 2 shows broader trends in the relationship be- memory window is simply
tween m, s, and the size of fluctuations. The color of
each cell shows the standard deviation of attendance at
m
steady state for the given values of m and s. Fixed points X
(i.e., Nt remains constant for t greater than some t0 ) are N̂i,t = N ∗ aij . (4)
i=1
observed for some very small values of m and s, visible
as red cells in the lower left corner, while neighboring
regimes have much larger fluctuations. Large fluctua- According to the binary decision meta-strategy, if there
tions are also an undesirable outcome since low atten- is any strategy available to the agent such that N̂i,t ≥ T ,
dance means resources (enjoyable bar seats) go to waste it will be selected. So the agent will attend the bar only
while high attendance means a large number of unsatis- if all strategies have N̂i,t < T . Thus the probability Pgo
fied patrons. As m and s grow, the size of fluctuations that an agent attends is
appears to decrease, as illustrated in Fig. 1(f). This pat-
tern is strikingly different from what is observed in some
other variations of the problem, in which volatility in- Pgo = P (N̂i,t < T ∀ i ∈ {1, 2, ..., s})
creases as s grows [7, 15]. = P (N̂1,t < T )s
First, we address the situation observed in Fig. 1 (a),  s (5)
m
i.e., we determine for which parameters can attendance X T
settle at a fixed point Nt = N ∗ . We start by assuming =P a1j < ∗  .
N
that there is a fixed point N ∗ ≥ T . The case where N ∗ < j=1
T is similar and will be discussed later. Our strategy is to

determine the probability, Pgo , that an agent will attend We rewrite Eq. (5) in terms of an Irwin-Hall(m) ran-
the bar the next week. Considering the limit A → ∞, dom variable, the sum of m independent random vari-
we then identify Pgo with N ∗ /A to obtain an implicit ables, each uniformly distributed
Pm in (0, 1). If X has an
equation for N ∗ . Irwin-Hall(m) distribution, j=1 a1j ∼ 2X − m. Hence,
4
FIG. 2: Standard deviation of attendance at steady state

under the binary decision meta-strategy. All simulations
have N = 10, 000, T = 6, 000.
s
T
Pgo = P 2X − m < ∗
N
s
T m
=P X< +
2N ∗ 2
m s
 T m

⌊ 2N ∗ + 2 ⌋ FIG. 3: Solutions to Eq. (6) (blue circles) as a function
1 X m T m
= (−1)k + −k  . of m for s = 2 (above) and as a function of s for m = 2
m! k 2N ∗ 2 (below) with A = 1, 000 and T = 600. Error bars show
k=0
the locations of fixed points in simulation when a fixed
Setting Pgo = N ∗ /A as discussed above, we obtain point is theoretically predicted (e.g. predicted fixed point
is above T ). When fixed points are not predicted, error
⌊ T∗ +m⌋
bars show the range of observed values in simulation.
N∗ 1 2NX 2

k m
= (−1)
A m! k
k=0 (6)

T m
m s in simulation. For these values, the error bars show the
+ −k , N∗ ≥ T . full spread of the observed oscillations over the last 50
2N ∗ 2 trials.
In Fig. 3, the blue circles show the solutions to Eq. (6) The analysis for N ∗ < T is similar (see Appendix A),
as m and s increase. The dashed line shows the thresh- and it predicts there are no fixed points below the thresh-
old T . For sufficiently low m and s values, solutions to old, matching what we observe in simulation.
Eq. (6) are above T , a necessary assumption to justify Our analysis of the Binary Decision meta-strategy
Eq. (5). Error bars on the plots in Fig. 3 show the min- shows that, unless m and s are extremely large, atten-
imum and maximum values over the last 50 time steps dance can either converge to a fixed point away from the
over 5 simulations of duration 100. In the case of fixed threshold or undergo strong oscillations about it. As far
points very close to T , such as m = 5, s = 2, some tri- as we know, fixed points for the Binary Decision meta-
als failed to converge and were discarded. The error bar strategy have not been studied previously. For large m
represents the spread of 5 trials for which attendance did and s, our numerical results suggest that attendance does
converge above T . As s or m grow sufficiently, the solu- indeed converge to the threshold T . Overall, our results
tion drops below T , and we no longer observe fixed points for the binary decision meta-strategy highlight the im-
5
FIG. 4: Attendance Nt (solid blue curve) versus t for the error minimization meta-strategy, with A = 100, and various
values of T (orange dashed line), m, and s.
portant effects of the number of strategies s and the mem- more strategies for each agent (larger s) lead to larger
ory window m on the collective dynamics of the agents. oscillations in attendance?
To address question (i), we solve for fixed points in
weekly attendance by assuming a fixed point, N ∗ , ex-
B. Error Minimization Meta-Strategy ists and determining the probability, Pgo that each agent
would attend the bar. As we did with the binary decision
Now we study the error minimization meta-strategy. meta-strategy, we consider the limit A → ∞ and identify
In Fig. 4 we show attendance Nt versus time t for different Pgo with N ∗ /A to derive an implicit equation for N ∗ .
values of m, s, and T . As in the binary decision meta- Supposing attendance every week is N ∗ , that is Nt = N ∗
strategy, there are some cases where attendance settles for all t, the cost function for each week simplifies to
at an approximately constant value. When it does so
fluctuations are, in general, smaller than in the binary
decision case. On the other hand, when the number of  2
strategies s sufficiently exceeds the memory window m, Xm
as it does in Fig. 4(f), attendance can have very large Ct (a) = m(N ∗ )2  ai,j − 1 . (7)
oscillations, in some cases alternating between 0 and A. j=1
This is illustrated in Fig. 5, which shows the standard
deviation of steady state attendance as a function of m
and s. Volatility appears to increase as s grows relative to Hence, the optimal strategy Pismsimply the one that min-
a fixed m, consistent with patterns observed by Johnson imizes the quadratic form ( j=1 ai,j − 1)2 . In the fol-
et. al and Collins [7, 15]. In the remainder of this section, lowing, we denote by a∗ the strategy weights selected by
we investigate the following two questions: (i) why does an arbitrary agent among their s available choices, and
attendance sometimes settle around values other than by f ∗ their probability density function. In Appendix B,
the threshold T ?, and (ii) why does the availability of we show that this probability density is
6
FIG. 6: Fixed points predicted by numerical solution of

Eqs. (13), (C1), (C2), (C3), as a function of T for A =
100, m = 2 and s = 2, 3, 5, 10. The identity is shown as
a dashed line.
FIG. 5: Standard deviation of attendance at steady state Hence, we compute Pgo by integrating Eq. (10) over
under the minimize squared errors meta-strategy. All the region satisfying Eq. (12). With s = 2 and m = 2, as
simulations have N = 10, 000, T = 6, 000. assumed in Eq. (10), this evaluates to
 h 2 3
i
1
 24 6 + 12 NT∗ + 3 NT∗ − 2 NT∗ 0< T
N∗ ≤1
Z !s−1 Pgo =
s 1
h i
 1 −4 + 36 T − 15 T 2 + 2 T 3
f ∗ (a∗ ) = 1− da , (8) 24 N ∗ N ∗ N ∗ otherwise.
2m E(a∗ ) 2m
(13)
∗
where E(a ) is the region of strategy space that would We are now prepared to construct an implicit equation
have a lower cost than a∗ , i.e., for N ∗ by setting Pgo = N ∗ /A:
 h 2 3
i
E(a∗ ) = {a : C(a) < C(a∗ )}. (9) N∗
1
 24 6 + 12 NT∗ + 3 NT∗ − 2 NT∗ 0< T
N∗ ≤1
= h i
In order to simplify the calculations while still gaining A  1 −4 + 36 T − 15 T 2 + 2 T 3 otherwise.
24 ∗N ∗N N∗
insight into the dynamics of the system, we study the
(14)
case where m = 2 in detail. In this case, the PDF of the
Eq. (14) is an implicit equation for the fixed point N ∗
weights, given by Eq. (8), evaluates to
as a function of the threshold T , for m = 2, s = 2.
While the calculations become more cumbersome as s
s−1
is increased, we have obtained analogous expressions for

∗ ∗ s 1 ∗
f (a ) = 1 − g(a ) , (10) m = 2 and s = 3, 4, 5 (see Appendix C). Plotting the
4 4
fixed point N ∗ as a function of T in Fig. 6, we note
where that the fixed point can be either above or below the
threshold, depending on other parameters. Therefore, for

∗ ∗ this particular meta-strategy, attendance does not self-
2|a1 + a2 − 1|
 if |a∗1 + a∗2 − 1| < 1,
∗ 1 ∗ ∗
organize around the threshold, as is often claimed for
g(a ) = 2 [6|a1 + a2 − 1|− El Farol-type problems. In Refs. [7, 8] it was observed
(a∗ + a∗ − 1)2 − 1] otherwise.

1 2 numerically that in certain versions of El Farol Bar Prob-
(11) lem the mean attendance can differ from the attendance
At the fixed point, an agent will attend the bar if their threshold. Even though our implementation of the El
chosen strategy, a∗ , satisfies Farol Bar Problem is different, our model and results
provide an example where this issue can be explored the-
m
X oretically.
N∗ a∗j < T . (12) While we obtain implicit expressions for fixed points,
j=1 N ∗ , our analysis does not give any information on their
7
FIG. 7: Solutions to Eq. (14) (blue circles) as a function FIG. 8: The effect of evolution in different parameter
of s for m = 2, A = 1, 000, and T = 300 (above) and ranges. Evolution begins at t = 300. All simulations
T = 800 (below). Error bars show the observed range of were run with λ = 0.2, ϵ = 0.7, A = 1, 000.
values Nt in simulation over time steps t = 50 through
t = 100 in five independent simulations.
the strategy a∗ minimizes the cost function. This mini-
mizer is unique since the cost function is quadratic. For
stability. In some cases, instead of the fixed points pre- all other choices of a∗ , this integral will be strictly be-
dicted from our analysis, we observe large fluctuations in tween 0 and 1. Hence, as s approaches ∞, f ∗ approaches
attendance, and we hypothesize that this is because the the Dirac delta function centered at the optimal strat-
fixed points are unstable. This is illustrated in Fig. 7, egy. Thus, we expect that as s approaches infinity all
which shows our theoretical predictions for the fixed agents will make the same choice and every iteration will
points (blue circles), the threshold T (dashed line), and have either all agents or no agents attending. Interest-
the range over which attendance varies over the last 50 ingly, improved individual predictive accuracy leads to
time steps of 5 different simulations of length 100 (error decreased global utility.
bars). When this range is small, suggesting a fixed point,
the numerical values of Nt agree well with our theoretical
result. For larger values of s, the fixed point appears to IV. EVOLUTION
lose stability and large fluctuations are observed.
We now address question (ii), namely, why do large Using the binary decision meta-strategy, attendance
oscillations occur as s becomes large. Recall that the fluctuates around the threshold in all but a few anoma-
probability density for strategies selected by an agent is lous regimes with very few strategies, but the size of fluc-
given by Eq. (10). We R observe that the area of E(a∗ ) tuations remains large unless both m and s are extremely
1
and thus the value of E(a∗ ) 2m da will equal 0 only when large. Meanwhile, the error minimization meta-strategy
8
FIG. 9: Fraction of time above T at steady state with FIG. 10: Average distance from the threshold at steady
the error minimization meta-strategy, with T = 6, 000, state with the binary decision meta-strategy, without
A = 10, 000, ϵ = 0.5, λ = 0.2, without (top) and with (top) and with (bottom) evolution.
(bottom) evolution.
bility of attendance, the system can converge to a Nash

only oscillates close to the threshold when s is in an op- equilibrium in which all agents attend with probability
timal range relative to m; too low, and the system gets T /A [3]. However, when agents are able to adjust their
caught in a fixed point away from the threshold, or too own parameters in a deterministic strategy space, they
high, and fluctuations become excessively large, leading may end up in a deadlock where some agents always at-
to poor utilization of the resource. tend and other agents almost never attend [14], or, al-
Different evolutionary paradigms have been proposed, ternatively, it may drive increased volatility [13], just as
leading to dramatically varied results. When agents use we observe when agents are given access to an increased
stochastic strategies and are able to adjust their proba- number of strategies and agents’ decisions thus synchro-
9
nize.
Instead, we propose a different approach to evolution
in the agent-based system; rather than refining individual
strategies, agents may re-draw their own m and s values
if they are not getting sufficient utility in the present
regime. Every time they change m and s, the weights
for each strategy are drawn once again from a uniform
distribution in [−1, 1]. The strategies then remain fixed
until the next time the agent redraws m and s.
The agent gives each strategy an initial trial period
of length 2m, whatever their current m value is. After
that, on each step, the agent computes their performance
over a window of length m. If their fraction of correct
choices is less than ϵ, the agents redraw m and s with
probability λ. In the simulations that follow, we give
each agent the same starting m0 and s0 values, and allow
the agents to draw uniformly at random from the range
s ∈ [s0 /2, 3s0 /2] and m ∈ [m0 /2, 3m0 /2]. Fig. 8 shows
the effect of evolution in different parameter regimes, for
λ = 0.2, ϵ = 0.7 (as we will discuss below, choices of λ
and ϵ do not significantly affect the results). In all panels,
the simulation is run without evolution for the first 300
steps. Then evolution is turned on a t = 300 (vertical
dashed line). The solid blue line shows the attendance,
Nt , and the horizontal dashed line shows the threshold,
T.
Evolution appears to both reduce the amplitude of
large fluctuations and force attendance to cross the
threshold when it otherwise would not. If attendance
were to remain above T for a sufficiently long period
of time relative to the agents’ m values, more than T
agents would be dissatisfied with their current regime,
and thus they would be likely redraw their parameters
until the behavior of the systems switches. Likewise, if
attendance remains below T for sufficiently long, then
A − T agents would be dissatisfied at each time step,
thus redrawing their parameters. Evolution also appears
to limit the size of fluctuations. In a scenario with large
fluctuations, many agents will be consistently dissatisfied
with their outcomes, as they will be attending in over-
crowded weeks and absent in weeks with capacity. Thus,
scenarios with high fluctuations will be unstable.
Fig. 9 shows the fraction of time attendance is above T
at steady state with the error minimization meta-strategy FIG. 11: Average distance from the threshold at steady
for a range of parameters without (above) and with (be- state with the error minimization meta-strategy, without
low) evolution. When an evolutionary paradigm is used, (top) and with (bottom) evolution.
a larger region of strategy space has attendance close
above T around half the time. Even in regions where
attendance rarely crosses T , the fraction never reaches 0,
while without evolution there are large regions of strategy fluctuations. Note that while these plots look similar to
space for which attendance never rises above the thresh- Figs. 2 and 5, distance from threshold is a different met-
old. With the binary decision meta-strategy, the fraction ric than standard deviation of attendance.
of time above the threshold remains close to 1/2, with or Our evolutionary model does introduce two new pa-
without evolution (not shown). rameters, λ and ϵ, but neither appears to have a dramatic
Figs. 10 and 11 show the effect of evolution on the av- effect on the behavior of the system. Fig. 12 summarizes
erage distance from the threshold, T , at steady state, cal- the deviation from the threshold at different values of
culated as the time average of |Nt − T |. Evolution brings ϵ for a range of m and s values. The effects are not
attendance closer to T , particularly in regions with larger dramatic for ϵ ≥ 0.2. The effect of λ is even less pro-
10
From a more philosophical perspective, we have an

interesting case study on the choice of metrics to opti-
mize. In Arthur’s original paper, the El Farol Bar Prob-
lem is presented as an example of how a collection of
agents making rational decisions based on incomplete in-
formation and flawed models leads to an adaptive self-
organization emerging in the system [1]. The mechanism
underlying this self-organization is that any strategy that
consistently performs poorly will not be selected. How-
ever, our and other papers have shown that giving agents
access to strategies that appear to perform better based
on past performance does not lead to a better outcome
for the system as a whole [6, 13, 15]. When agents are
able to all adjust their strategies in almost identical ways,
the individual strategies selected change, but volatility in
the system remains consistent. Thus, the adaptive self-
organization observed by Arthur is dependent on con-
straints in the agents’ access to predictive models that
are, by their own standards, desirable.
Other works that have included evolutionary algo-
rithms have used that evolution to improve prediction
accuracy, leading to sub-optimal outcomes for the com-
mon sharing of the resource [13, 14]. However, in our
approach to evolution, we attempt to optimize directly
for agents’ ability to utilize the resource.
The effect of the evolutionary paradigm is not to drive
agents toward more accurate predictions, but to drive
the system toward more heterogeneity. This heterogene-
ity, rather than agent-level prediction accuracy, creates a
system in which agents experience higher rates of utility
in a more stable and equitable way.
Appendix A: Fixed Points Below T with the Binary

Decision Meta-Strategy
FIG. 12: Mean, minimum, and maximum distance from This analysis follows similar logic to the case where
threshold, T , at steady state for different values of ϵ. Five the fixed point is above T , discussed in Section III A. If
trials each were run on m and s values of 5, 10, 20, and attendance is below the threshold, an agent will attend
50. if they have at least one strategy that predicts N̂i,t < T .
So in this case, we have
nounced for λ > 0, but including this parameter rather Pgo = P (∃ i ∈ {1, 2, ..., s} , N̂i,t < T )
than setting it to 1 allows for a smoother beginning to = 1 − P (N̂i,t ≥ T ∀ i ∈ {1, 2, ..., s})
the simulation as all agents do not simultaneously change  s (A1)
parameters. m
X T
=1−P  a1j ≥ ∗  .
j=1
N
V. DISCUSSION
which, after evaluating the probability and using the
self-consistent condition Pgo = N ∗ /A, yields the implicit
One of the most intriguing features that we observe in equation
both non-evolving regimes of the El Farol Bar Problem is
that systems in which individual agents’ models appear
⌊ T∗ +m⌋
to make better predictions based on past history end up N∗ 1 2NX 2

k m
having worse utilization of the resource in question. We =1− 1− (−1)
A m! k
have seen mechanistically how increasing the number of k=0 (A2)
m s
strategies available to an agent, given a fixed memory T m ∗
+ −k , N <T.
length, drives volatility. 2N ∗ 2
11
Fig. 13 shows solutions to Eq. (A2) as m and s increase. Appendix B: Distribution of Strategies Chosen in
Solutions to Eq. (A2) were above T with all parameters Error Minimization Meta-Strategy
tested, and hence inconsistent with the setup. No fixed
points were observed below T in simulation either. While We consider the distribution of strategies selected by
it is difficult to interpret this necessity from equation A2, each agent under the minimize squared prediction error.
we observe that with the uniform [−1, 1] distribution of The cost function for each strategy is given by Eq. (3).
weights, strategies are biased toward underestimating at- The cost function is quadratic and will always have a
tendance. In order to have a fixed point above below T , unique minimizer unless all N ’s are equal to 0. Depend-
−T
at least NN agents would need to overestimate atten- ing on the attendance history, this minimum could be
dance. inside or outside of the base of support of the a values.
However, there must also be a unique minimizer of
Eq. (3) within the base of support of the a’s. If the
optimal strategy is within the range of possible a values,
this will be the unique minimizer. We now consider the
case where the the minimizer of (3) is outside of the base
of support. Taking cmin to be the lowest cost observed
within the base of support of the a’s, consider ℓmin the
level curve of the cost function at cmin . Level curves of
equation (3) are all elliptical, while the base of support
of the a values is rectangular. If ℓmin intersected the base
of support of the a’s at multiple points, there would be
some region between ℓmin and the true minimizer of the
cost function within the base of support. Since the cost
surface is convex, this region would have cost below Cmin ,
contradicting our construction of Cmin .
Consider the cumulative distribution of strategies se-
lected. We will use F (a) to denote the cumulative dis-
tribution of strategies randomly assigned to an agent,
F ∗ (a) to denote the cumulative distribution function of
the strategy selected, and ak∗ to denote the kth weight
of the selected strategy. Since the strategies are i.i.d.,
F ∗ (a) = P (a1∗ ≤ a1 , · · · , am∗ ≤ am )

= sP a1,1 < a1 , · · · , am,1 < am , (B1)

FIG. 13: Solutions to equation (A2) as a function of m C(a1 ) < min{C(aj )|j ∈ 2, ..., s} .
(above) and s (below).
Conditioning on the values of the optimal strategy cho-
sen, we find that
Z a1 Z am
∗
F (a) = s ··· P (C(a1 ) < min{C(aj )|j ∈ 2, · · · , s} | a = b)f (b)db
−∞ −∞
Z a1 Z am Z !s−1 (B2)
=s ··· 1− f (c)dc f (b)db .
−∞ −∞ E(b)
Thus, when strategies are sampled from the Uniform(- Appendix C: Implicit Equations for Fixed Points
1,1) distribution, the PDF of the chosen strategies, (a∗1 , with the Error Minimization Meta-Strategy
a∗2 , · · · , a∗m ) is
Here, we show the implicit equations for additional
fixed points shown in Fig. 6. For each choice of s, we
Z !s−1 compute Pgo by integrating Eq. (10) over the region sat-
∗ s 1 isfying Eq. (12), and derive the implicit equation by set-
f (a) = m 1− da . (B3)
2 E(a) 2m ting Pgo = N ∗ /A. With s = 3 and m = 2, we have
12
1 4 2

 64 − 3 NT∗ + 18 NT∗ + 0 < NT∗ ≤ 1 ,
24 NT∗ + 8

N∗


= 1 T 4
T 3

A 
 64 − 3 N ∗ + 32 N ∗ − otherwise .
2

126 NT∗ + 216 NT∗ − 72

(C1)
With s = 5, m = 2, we have

1 T 6 T 5
0 < NT∗ ≤ 1 ,

 384 − 5 N ∗ 4 − 12 N ∗ 3 +


T T




 15 N ∗ + 80 N ∗
∗
2
+105 NT∗ + 60 NT∗ + 12

N 
= 1
T 6
T 5

A 
 384 − 5 N ∗ + 84 N ∗ −
T 4 T 3




 585 N ∗ + 2160 N ∗ −
 2
4455 NT∗ + 4860 NT∗ − 1804 otherwise .


(C2)
Finally, with s = 2, m = 10, we have
T 11

1
0 < NT∗ ≤ 1

22528 − 10 N ∗ −



T 10 T
9
77 N ∗ − 220 N ∗ −




T 8
 7
165 N ∗ + 660 NT∗ +




 6 5
2, 310 NT∗ + 3, 696 NT∗ +




 4 3
3, 630 NT∗ + 2, 310 NT∗ +




 2
935 NT∗ + 220 NT∗ + 22




T 11
 1

 22528 10 N ∗ − otherwise .



N∗ T 10 T 9

319 N ∗ + 4, 620 N ∗ −

= T 8
A



 40, 095 N∗ +
7
231, 660 NT∗ −




 6
935, 550 NT∗ +




 5
2, 694, 384 NT∗ −




 4
5, 533, 110 NT∗ +




 3
7, 938, 810 NT∗ −




 2
7, 577, 955 NT∗ +





4, 330, 260 NT∗ − 1, 099, 404

(C3)
[1] W. Brian Arthur. Inductive reasoning and bounded ratio- ADS ’16, San Diego, CA, USA, 2016. Society for Com-
nality. The American Economic Review, 84(2):406–411, puter Simulation International.
1994. [5] Shu-Heng Chen and Umberto Gostoli. Coordination in
[2] Eduardo Zambrano. The interplay between analytics and the el farol bar problem: The role of social preferences
computation in the study of congestion externalities: The and social networks. Journal of Economic Interaction
case of the el farol problem. Journal of Public Economic and Coordination, 12:59–93, 03 2017.
Theory, 6(2):375–395, 2004. [6] Damien Challet, M Marsili, and Gabriele Ottino. Shed-
[3] Miklos Szilagyi. The el farol bar problem as an iterated ding light on el farol. Physica A: Statistical Mechanics
n-person game. Complex Systems, 21, 06 2012. and its Applications, 332:469–482, 2004.
[4] Shu-Heng Chen and Umberto Gostoli. On the complexity [7] N.F. Johnson, S. Jarvis, R. Jonson, P. Cheung, Y.R.
of the el farol bar game: A sensitivity analysis. In Pro- Kwong, and P.M. Hui. Volatility and agent adaptabil-
ceedings of the Agent-Directed Simulation Symposium, ity in a self-organizing market. Physica A: Statistical
13
Mechanics and its Applications, 258(1):230–236, 1998. and organization in an evolutionary game. Physica A:
[8] Hilmi Luş, Cevat Onur Aydın, Sinan Keten, Hakan Statistical Mechanics and its Applications, 246(3):407–
İsmail Ünsal, and Ali Rana Atılgan. El farol revisited. 418, 1997.
Physica A: Statistical Mechanics and its Applications, [13] D.B. Fogel, K. Chellapilla, and P.J. Angeline. Inductive
346(3):651–656, 2005. reasoning and bounded rationality reconsidered. IEEE
[9] M.A.R. Cara, O. Pla, and Francisco Guinea. Competi- Transactions on Evolutionary Computation, 3(2):142–
tion, efficiency and collective behavior in the “el farol” 146, 1999.
bar model. Physics of Condensed Matter, 10:187–191, 07 [14] Duncan Whitehead. The El Farol Bar Problem Revisited:
1999. Reinforcement Learning in a Potential Game. Edinburgh
[10] Neil F Johnson, Michael Hart, and P.M Hui. Crowd ef- School of Economics Discussion Paper Series 186, Ed-
fects and volatility in markets with competing agents. inburgh School of Economics, University of Edinburgh,
Physica A: Statistical Mechanics and its Applications, September 2008.
269(1):1–8, 1999. [15] Andrew J. Collins. Strategically forming groups in the
[11] Phase Spaces of the Strategy Evolution in the El Farol el farol bar problem. In Proceedings of the 2017 Inter-
Bar Problem, volume ALIFE 2020: The 2020 Conference national Conference of The Computational Social Science
on Artificial Life of ALIFE 2021: The 2021 Conference Society of the Americas, CSS 2017, New York, NY, USA,
on Artificial Life, 07 2020. 2017. Association for Computing Machinery.
[12] D. Challet and Y.-C. Zhang. Emergence of cooperation

The Impact of Meta-Strategy On Attendance Dynamics

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

The Impact of Meta-Strategy On Attendance Dynamics

Uploaded by

Copyright:

Available Formats

The Impact of Meta-Strategy on Attendance Dynamics

in the El Farol Bar Problem

I. INTRODUCTION agent has access to a pool of strategies, out of which

Our paper is organized as follows. In Section II, we

The weights assigned to each strategy are sampled

A. Binary Decision Meta-Strategy

T is similar and will be discussed later. Our strategy is to

FIG. 2: Standard deviation of attendance at steady state

FIG. 6: Fixed points predicted by numerical solution of

bility of attendance, the system can converge to a Nash

From a more philosophical perspective, we have an

Appendix A: Fixed Points Below T with the Binary

F ∗ (a) = P (a1∗ ≤ a1 , · · · , am∗ ≤ am )

You might also like