Professional Documents
Culture Documents
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
is collaborating with JSTOR to digitize, preserve and extend access to The American Economic
Review
By ROSEMARIE NAGEL*
Consider the following game: a large num- one another. In the simplest case, a player se-
ber of players have to state simultaneously a lects a strategy at random without forming be-
number in the closed interval [0, 100]. The liefs or picks a number that is salient to him
winner is the person whose chosen number is (zero-order belief). A somewhat more so-
closest to the mean of all chosen numbers mul- phisticated player forms first-order beliefs on
tiplied by a parameter p, where p is a prede- the behavior of the other players. He thinks
termined positive parameter of the game; p is that others select a number at random, and he
common knowledge. The payoff to the winner chooses his best response to this belief. Or he
is a fixed amount, which is independent of the forms second-order beliefs on the first-order
stated number and p. If there is a tie, the prize beliefs of the others and maybe nth order be-
is divided equally among the winners. The liefs about the (n - I )th order beliefs of the
other players whose chosen numbers are fur- others, but only up to a finite n, called the n-
ther away receive nothing.' depth of reasoning.
The game is played for four rounds by the The idea that players employ finite depths
same group of players. After each round, all of reasoning has been studied by various
chosen numbers, the mean, p times the mean, theorists (see e.g., Kenneth Binmore, 1987,
the winning numbers, and the payoffs are pre- 1988; Reinhard Selten, 1991; Robert Aumann,
sented to the subjects. For 0 c p < 1, there 1992; Michael Bacharach, 1992; Cristina
exists only one Nash equilibrium: all players Bicchieri, 1993; Dale 0. Stahl, 1993). There
announce zero. Also for the repeated super- is also the famous discussion of newspaper
game, all Nash equilibria induce the same an- competitions by John M. Keynes (1936 p.
nouncements and payoffs as in the one-shot 156) who describes the mental process of
game. Thus, game theory predicts an unam- competitors confronted with picking the face
biguous outcome. that is closest to the mean preference of all
The structure of the game is favorable for competitors.2 Keynes's game, which he con-
investigating whether and how a player's men- sidered a Gedankenexperiment, has p = 1.
tal process incorporates the behavior of the However, with p = 1, one cannot distinguish
other players in conscious reasoning. An ex- between different steps of reasoning by actual
planation proposed, for out-of-equilibrium be- subjects in an experiment.
havior involves subjects engaging in a finite There are some experimental studies in
depth of reasoning on players' beliefs about which reasoning processes have been analyzed
in ways similar to the analysis in this paper.
Judith Mehta et al. (1994), who studied be-
havior in two-person coordination games, sug-
* Department of Economics, Universitat Pompeu Fa-
gest that players coordinate by either applying
bra, Balmes 132, Barcelona 08008, Spain. Financial sup-
port from Deutsche Forschungsgemeinschaft (DFG) depth of reasoning of order I or by picking a
through Sonderforschungsbereich 303 and a postdoctoral focal point (Thomas C. Schelling, 1964),
fellowship from the University of Pittsburgh are grate- which they call "Schelling salience." Stahl
fully acknowledged. I thank Reinhard Selten, Dieter
and Paul W. Wilson (1994) analyzed behavior
Balkenborg, Ken Binmore, John Duffy, Michael Mitzkewitz,
Alvin Roth, Karim Sadrieh, Chris Starmer, and two anon-
in symmetric 3 x 3 games and concluded that
ymous referees for helpful discussions and comments. I subjects were using depths of reasoning of or-
learned about the guessing game in a game-theory class ders 1 or 2 or a Nash-equilibrium strategy.
given by Roger Guesnerie, who used the game as a dem-
onstration experiment.
'The game is mentioned, for example, by Herve
Moulin (1986), as an example to explain rationalizability, 2 This metaphor is frequently mentioned in the mac-
and by Mario H. Simonsen (1988). roeconomic literature (see e.g., Roman Frydman, 1982).
1313
Both of these papers concentrated on several announcing 0 is also the only strategy com-
one-shot games. In my experiments, the deci- bination that survives the procedure of infi-
sions in first period indicate that depths of nitely repeated simultaneous elimination of
reasoning of order 1 and 2 may be playing a weakly dominated strategies.5 For p = 1 and
significant role. In periods 2-4, for p < 1, I more than two players, the game is a coordi-
find that the modal depth of reasoning does notnation game, and there are infinitely many
increase, although the median choice de- equilibrium points in which all players choose
creases over time.3 A simple qualitative learn- the same number (see Jack Ochs [1995 ] for a
ing theory based on individual experience is survey). For p > 1 and 2p < M (M is the
proposed as a better explanation of behavior number of players), all choosing 0 and all
over time than a model of increasing depth of choosing 100 are the only equilibrium points.
reasoning. This is the kind of theory that Note that for p > 1 there are no dominated
Selten and Joachim Buchta (1994) call a strategies.6 The subgame-perfect equilibrium
"learning direction theory," which has been play (Selten, 1975) does not change for the
successfully applied in several other studies. finitely repeated game.
Other games with unique subgame-perfect
equilibria that have been explored in the ex- II. A Model of Boundedly Rational Behavior
perimental literature include Robert Rosen-
thal's (1981) "centipede game," a market In the first period a player has no informa-
game with ten buyers and one seller studied tion about the behavior of the other players.
experimentally by Roth et al. (1991), a public- He has to form expectations about choices of
goods-provision game studied by Vesna the other players on a different basis than in
Prasnikar and Roth (1992), and the finitely subsequent periods. In the subsequent periods
repeated prisoner's dilemma studied experi- he gains information about the actual behavior
mentally by Selten and Rolf Stoecker (1986). of the others and about his success in earlier
In the experimental work on the centipede periods. Therefore, in the analysis of the data
game by Richard McKelvey and Thomas I make a distinction between the first period
Palfrey (1992) and on the prisoner's dilemma and the remaining periods.
supergame, the outcomes are quite different
from the Nash equilibrium point in the open-
k with positive probability. Obviously, in this equilibrium
ing rounds, as well as over time. While the
p times the mean of the numbers chosen is smaller than k.
outcomes in Roth et al. (1991), Prasnikar and
Therefore, player m can improve his chances of winning
Roth (1992), and my experiments are also far by replacing k by a smaller number with the same prob-
from the equilibrium in the opening round, ability. Therefore no equilibrium exists in which a positive
they approach the equilibrium in subsequent number is chosen with positive probability.
'Numbers in (lOOp, 100] are weakly dominated by
rounds. Learning models have been proposed
lOOp; in the two-player game, 0 is a weakly dominant
to explain such phenomena (see e.g., Roth and strategy. The interpretation of the infinite iteration process
Ido Erev, 1995). might be: it does not harm a rational player to exclude
numbers in the interval (lOOp, 100]. If this player also
I. The Game-Theoretic Solutions believes that all other players are rational, he consequently
believes that nobody will choose from (lOOp, 100], and
therefore he excludes (lOOp2, 100]; if he thinks that the
For 0 c p < 1, there exists only one Nash others believe the same, (lOOp3, 100] is excluded, and
equilibrium at which all players choose 0.4 All so on. Thus, 0 remains the only nonexcluded strategy
based on common knowledge of rationality. If choices
were restricted to integers, all choosing I is also an
equilibrium.
- This kind of unraveling is similar to the naturally oc- 6 It is straightforward to show that all choosing 0 and
curring phenomena observed by Alvin E. Roth and all choosing 100 are equilibria: it does not pay to deviate
Xiaolin Xing (1994) in many markets in which it is important
from 0 (100) if all other players choose 0 (100) and the
to act just a little earlier in time than the competition.number of players is sufficiently large. There is no other
4 Assume that there is an equilibrium at which at least symmetric equilibrium since with a unilateral small in-
one player chooses a positive number with positive prob- crease a player improves his payoff. Also, other asym-
ability. Let k be the highest number chosen with positive metric equilibria or equilibria in mixed strategies cannot
probability, and let m be one of the players who chooses exist for analogous reasons, as in the case p < 1.
The model of first-period behavior is as fol- pected of the other players. Specifically, one
lows: a player is strategic of degree 0 if he can ask whether, with increasing experience,
chooses the number 50. (This can be inter- higher and higher iteration steps will be ob-
preted as the expected choice of a player who served. I will show, however, that the modal
chooses randomly from a symmetric distribu- frequency, polled over all sessions, remains at
tion or as a salient number a la Schelling iteration step 2 in all periods. In Section V-C
[1960]). A person is strategic of degree n if a quite different adjustment behavior is ex-
he chooses the number 50p", which I will call amined, which does not involve anything
iteration step n. A person whose behavior is similar to the computation of a best reply
described by n = 1 just makes a naive best to expected behavior. Instead of this, a behav-
reply to random behavior.7 However, if he be- ioral parameter- the adjustment factor- is
lieves that the others also employ this reason- changed in the direction indicated by the in-
ing process, he will choose a number smaller dividual experience in the previous period.
than 50p, say 50p2, the best reply to all other
players using degree-i behavior. A higher III. The Experimental Design
value of n indicates more strategic behavior
paired with the belief that the other players are I conducted three sessions with the param-
also more strategic; the choice converges to eter p = '/2 (sessions 1 -3), four sessions with
the equilibrium play in the limit as n increases. p = 2/3 (4-7), and three sessions with p =4/3
For periods 2-4, the reasoning process of (8-10).9 I will refer to these as 1/2, 2/3, or 4/3
period 1 can be modified by replacing the ini- sessions, respectively. A subject could partic-
tial reference point r = 50 by a reference point ipate in only one session.
based on the information from the preceding The design was the same for all sessions:
period. A natural candidate for such a refer- 15-18 subjects were seated far apart in a large
ence point is the mean of the numbers named classroom so that communication was not pos-
in the previous period. With this initial refer- sible. The same group played for four periods;
ence point, iteration step 1, which is the prod- this design was made known in the written in-
uct of p and the mean of the previous period, structions. At each individual's place were an
is similar to Cournot behavior (Antoine A. instruction sheet, one response card for each
Cournot, 1838) in the sense of giving a best period, and an explanation sheet on which the
reply to the strategy choices made by the oth- subjects were invited to give written explana-
ers in the previous period (assuming that the tions or comments on their choices after each
behavior of the others does not change from round. The instructions were read aloud, and
one period to the next).8 questions concerning the rules of the game
I can also consider "anticipatory learning," were answered.")
in which an increase in iteration steps is ex- After each round the response cards were
collected. All chosen numbers, the mean, and
0.15
the product of p and the mean A.
were written on
the blackboard (the anonymity of the players median 17
mean 27.05
was maintained). The number closest to the
optimal number and the resulting payoffs were ao
* 0.10-
announced. The prize to the winner of each c
IL
A. First-Period Choices
0 10 20 30 40 50 60 70 80 90 100
Chosen Numbers
" This all-or-nothing payoff structure might trigger un-
reasonable behavior by some subjects which in turn im- FIGURE 1. CH()ICES IN THE FIRST PERIO)D: A) SEssIo)Ns
pedes quicker convergence. In John Duffy and Nagel 1-3 (P = /2); B) SEssio)Ns 4-7 (P = /3);
(1995), the behavior in p-times-the-median game was C) SEssIo)Ns 8-10 (P = 4/3)
studied, in an effort to weaken the influence of outliers.
While first round behavior in both the mean- and median-
treatments was not significantly different, fourth round
choices in p-times-the-median game were slightly lower
subjects chose numbers greater than 50 and 8
than those in p-times-the-mean game. Changes in the pay-
off structure, for example, negative payoffs to losers,
percent chose 50; in the 2/ sessions, 10 percent
might affect the evolution of behavior on the guessing of the chosen numbers were greater than 67,
game in a different way. and 6 percent were 66 or 67. From these results
'/2 and 2,3 sessions are drawn from the same 0.00
<5 8-10 16-20 31-41 51-100 interim intervals
distribution, in favor of the alternative hypoth- 5-7 11-15 21-30 42-50 neighborhood
B.
level of statistical significance, according to a
0.30'
Mann-Whitney U test. The same holds for a step 1
test of the data from the 2A- sessions against <0.255 step 2
sessions, almost 50 percent of the choices are belongs to iteration step 2, which is also where
in the neighborhood interval of either iteration we observe the modal choice, with nearly 30
step 1 or 2, and there are few observations in percent of all observations. Over all 2/' ses-
the interim interval between them. In all ses- sions, the optimal choice is about 25, which
sions only 6-10 percent are at step 3 and also belongs to iteration step 2 with about 25
higher steps (the aggregation of the two left- percent of all observations, the second-highest
hand columns in Figure 2A and Figure 2B frequency of a neighborhood interval. Thus,
[p = '/2 and p = 2/4], respectively, and the many players are observed to be playing ap-
right-hand column of Figure 2C [p 4/])* proximately optimally, given the behavior of
(Choices above 50 in the '/2 and 2, sessions the others.
and choices below 50 in the 4/3 sessions are
graphed only in aggregate.) The choices are B. The Behavior in Periods 2, 3, and 4
mostly below 50 in the '/2 and 2A sessions and
mostly above 50 in the 4/3 sessions; this To provide an impression of the behavior
difference is statistically significant at the over time, Figures 3-5 show plots of pooled
1-percent level, based on the binomial test. data from sessions with the same p for each
To test whether there are significantly more period; the plots show the choices of each sub-
observations within the neighborhood inter- ject from round t to t + 1. In the 1/2 and 2A,
vals than in the interim intervals I consider sessions, 135 out of 144 (3 transition periods x
only observations between step 0 and step 3. 48 subjects) and 163 out of 201 observations
Hence, the expected proportion within the (3 x 67 subjects), respectively, are below the
neighborhood interval under the null hypoth- diagonal, which indicates that most subjects
esis (that choices are randomly distributed be- decrease their choices over time. In all ses-
tween interim and neighborhood intervals) is sions with p < 1, the medians decrease over
then the sum of the neighborhood intervals di- time (see Table 1); this is also true for the
vided by the interval between step 0 and step means except in the last period of the I/2 ses-
3. Note that this is a stronger test than taking sions. In the 4/3 sessions, the reverse is true:
the entire interval 0-100. The one-sided bi- 133 out of 153 observations (3 x 51 subjects)
nomial test, taking into consideration the pro- are above the diagonal and the medians in-
portion of observations in the neighborhood crease and are 100 in the third and fourth pe-
intervals, rejects the null hypothesis in favor riods. Thus from round to round, the observed
of the hypothesis that the pooled observations behavior moves in a consistent direction, to-
are more concentrated in the neighborhood in- ward an equilibrium. (It is this movement that
tervals (the null hypothesis is rejected at the is reminiscent of the unraveling in time ob-
1-percent level, both for the '/2 sessions and served in many markets by Roth and Xing
for the 2A sessions; it is rejected at the 5- [1994].)
percent level for the 4/, sessions).'3"'4 In the 1/2 sessions, more than half of the ob-
Note that over all '/2 sessions, the optimal servations were less than 1 in the fourth round.
choice (given the data) is about 13.5, which However, only three out of 48 chose 0. In the
2A sessions, only one player chose a number
less than 1. On the other hand, in the 4/3 ses-
sions, 100 was already the optimal choice in
'3 Since the iterated elimination of dominated strategies the second period, being chosen by 16 percent
starts the reasoning process at 100, I also tested whether of the subjects; and in the third and fourth pe-
that initial reference point would structure the data in a
riod, it was chosen by 59 percent and 68 per-
coherent way for the different parameters. For p = 4/3 all
iteration steps collapse into 100; thus spikes cannot be cent, respectively. Thus, for the 4, sessions I
explained. For the 2/3 sessions the data are not only con- conclude that the behavior of the majority of
centrated around 100 x (2/3)". On the other hand, the pat- the subjects can be simply described as the
tern of the '/A-session data is similar to the pattern in Figure
best reply (100) to the behavior observed in
3A, except that step n becomes step n + 1. Hence, 100 is
not a plausible initial reference point for most subjects.
the previous period. (Some of the subjects
14 The written comments of the subjects also seem to who deviated from this behavior argued that
support our model. For details see Nagel (1993). they tried to influence the mean [to bring it
100- 100-
A. A.
90- 90-
-~80 . 80- +
o ~~ ~~~~~~~~~~~~~~~~~~~0 +
ii 70 i 70-
c 60- c 60-
o 0
o 0
a) 50- a) 50-
U) U~~~~~~~~~~~~~~~~~C) ++
40- + -40 ++
Si Si + ~~~~~~~~~~~~~~~~~~~~~~~~~~~+ +
530 ~~~~~~~ ~~~~~~~~~~~~~~~~~~~+ 30 ++
300++ + ++ +
+ + +~~~~~~~~~~~~~~
+~~~~~~~~~~~~~~~~ +
0 10 20 3 40 do d io ~0 40 100 0 10 20 3 40 5 60 70 80 90 100
Choices in First Period Choices in First Period
100- 100-
B. B.
90- 90-
80- 80-
.070 c 70-
o 60- *2 60-
H 50 H 50-
40- 40-
Oo
30- 0+
0 30-
o) 0+ +
20- 20 + *. ++
++ +
10 .- + 10: +4
100-i 100-
C. C.
90- + 90-
80- 80-
o0 0
070 c 70Q
.c60 . 60-
LO 50- 0 50 +
0 0L
40- C 40-
5 30 s~~~~~~~~~~~~~~~~~~~ 30-
20- ~~~ ~~~~~~~~~~~~~20- +++
10- 10- + ++
0 10 20 30 40 50 60 70 80 90 100 0 10o 20 30 40 50 d0 70 80 90 10
Choices in Third Period Choices in Third Period
FIGURE 3. OBSERVATIONS OVER TIME FOR SESSIONS 1-3 FIGURE 4. OBSERVATIONS OVER TIME FOR SESSIONs 4-7
(p = A/): A) TRANSITION FRom FIRST TO SECOND (p 7 4): A) TRANSITION FROm FIRST TO SECOND
PERIOD; B) TRANSITION FROM SECOND To THIRD PERIOD; PERIOD; B) TRANSITION FROM SECOND To THIRD PERIOD;
C) TRANSITION FROm THIRD To FoURTH PERIOD C) TRANSITION FROm THIRD To FoURTH PERIOD
down again] or wrote that the split prize was best reply in the 1/2 and 2A sessions, given the
too small to state the obvious right answer.) actual strategies. Instead the best reply is a
The adjustment process toward the equilibri- moving target that approaches 0. The adjust-
um in the 1/2 and 2A3 sessions is quite different ment process is thus more complicated. Com-
from that in the 4/3 sessions. Zero is never the paring Figures 3 and 4, one can see that the
1 00 - H - !
n 80 A. + + + + for p = 2/, I define a rate of decrease of the
90- +
means and medians from period 1 to period 4
+ 4, 4 / within a session by
7 0-
c 8 0- + ,
60- 6
40-
(mean),=,
o 30- /
20-
(1b ) wmcd (median), - (median),=4
(median), ,I
0 10 20 30 40 50 60 70 80 90 100
Choices in First Period The rates of decrease of the single sessions are
shown in Table 1, in the last lines of panels A
and B. The rates of decrease of the session
8 observations
100- H + H medians in the '/2 sessions are higher than
B.
90- + those in the 2/ sessions, and the difference is
80- statistically significant at the 5-percent level
a) 70 + (one-tailed), based on a Mann-Whitney U
- 60- test. There is no significant difference in the
H 50- + rates of decrease of the means. The median
U) 40- seems more informative than the mean, since
*5 30+ the mean may be strongly influenced by a sin-
20- /
gle deviation to a high number. Thus, I con-
10-
clude that the rate of decrease depends on the
parameter p.
0 1 0 2 3 40 50 60 70 80 90 1i00 Analyzing the behavior in the first period, I
Choices in Second Period
found some evidence that r = 50 was a plau-
sible initial reference point. Below, I classify
100-
27 observations the data of each of the subsequent periods ac-
C. + t cording to the reference point r (mean of the
90a c
previous period) and iteration steps n: rp".
-o80-
0
70- 70 Numbers above the mean are aggregated to
-C 60- +
"above mean,_" (see Table 2).'5
As was the case for the first-period behav-
LL050-
ior, one cannot expect that exactly these steps
*) 40-
are chosen. Grouping the data of the subse-
0 30-
020-
quent periods and sessions in the same way as
in the first period, namely, in neighborhood
intervals of the iteration steps and interim in-
0 10 20 3 4 0 60 7 0 90 100
Choices in Third Period
TABLE 1-MEANS AND MEDIANS OF PERIODS 1-4, AND RATE OF DECREASE FROM PERIOD 1 TO PERIOD 4
Rate of decrease:' 0.58 0.7 0.92 0.91 0.75 0.71 0.76 0.76
tervals between two steps, I find no significant least 80 percent of the observations remain
difference between the frequencies of obser- within the bounds of iteration step 0 and iter-
vations in the neighborhood intervals and the ation step 3, with the modal frequency (30 per-
frequencies of observations in the interim in- cent or more) at iteration step 2 when the
tervals. Note also that as the mean decreases, previous period's mean is the reference point.'8
the interval between two steps becomes rather In fact, in periods 1-3, the best reply is within
small.'6 However, I would like to know within step 2 in at least five of the seven sessions.
which iteration steps the numbers are located Within the neighborhood of the mean of the
in the different periods; therefore, I divide the previous period (step 0) there are only a few
interval between steps i and i + 1 geometri- observations, and those frequencies decrease
cally into two intervals.'7 in the 2A3 sessions. The frequency of choices
Parts A and B of Table 2 present the fre- around iteration step 1, corresponding to the
quencies of observations for each iteration Cournot process, also declines to less than 15
step, pooled over the '/2 and 2A, sessions,percent re- in the third and fourth periods. The
spectively. I also state the mean area of each frequencies with more than three steps are be-
iteration step over all sessions, separately for low 10 percent, except in period 4 of the '/2
eacth period. In most sessions and periods, at sessions. I interpret these results to mean that
there is no support for the hypothesis of in-
creasing depth of reasoning, since there is no
tendency for the majority of the subjects to
16 Most of the subjects just mentioned in their com-
increase the depth of reasoning beyond step
ments that the mean will decrease. There were less precise
calculations than in the first period.
'7 If one normalizes the mean of the previous period to
1, the boundaries of step n are (pj + 112, pn - 112]. As in period
1, step 0 has its right-hand boundary at 1. Table 2 reports 1' This corresponds to what we called the anticipatory
the unnormalized length (called "area") of an iteration learning process in Section II. Hence, one might infer that
step. For example, for p = '/2, in period 2, the area of a substantial proportion of subjects believe that the aver-
numbers above the mean is 73, since on average, over all age behavior in period t will be around p times the mean
'A sessions the mean of the previous period (r) is 27. of period t - 1.
TABLE 2-RELATIVE FREQUENCIES AND AREAS OF PERIODS 2-4 ACCORDING TO THE STEP-MODEL
FOR AGGREGATED DATA
Classification Relative frequency Area Relative frequency Area Relative frequency Area
B. Sessions 4- 7 (p = 2/3):
2.19 In the next section I describe the observed (ii) he earned nothing in the previous period,
behavior from period 2 to period 4 in a differ- because his chosen number was either be-
ent way-by a qualitative learning-direction low or above p times the mean (and not
theory. This theory might explain why the mo- the closest to it).
dal frequency of depth of reasoning does not
increase. Since there are only a few winners in each
period, data on having chosen the winning
C. Adjustment Process Due to Individual number are scarce. Therefore, I exclude those
Experience (for p < 1) choices that led to a positive payoff ( 19 out of
144 [ 13 percent] in the '/2 sessions and 23 out
So far, I have categorized behavior into of 201 [9 percent] in the 2A sessions) and pro-
classes based on the deviation from the mean pose a simple qualitative learning theory for
of the previous period. I now analyze individ- the change of behavior after having faced zero
ual adjustments due to individual experience. payoffs.20
There are two possible experiences due to pay- For this purpose, I introduce a parameter
offs a player obtained: called the adjustment factor:
" In periods 2 and 3, step 2 is the modal choice in six 20 Stahl (1994) compares several learning models. I ap-
out of seven sessions; in period 4, this holds in four ses- ply only one learning model, a kind of model that has been
sions, and in three sessions, the modal choice is step 3, successfully used in different experimental settings (see
tied with step 2 or 4. e.g., Selten and Buchta, 1994).
0.5-" , 0.5-
05 ~~~~~~~~~~~~~~~~~increase /......
0.
X 0.1 0.1 M 0.
a: a
@ ~~~~~~~~~~~~~~~~~~~~~~~~a)
0 . . higher than optimal . ............. lower than optimal 0 . ........... ... higher than o
Adjustment factor in period 1 was ............... Adjustmant factor in period 1 was
0U-/
0.6-.B 0.6- E.
a: ~~~~~~~~~~~~~~~~~~~~~~~a:
0.5- 0.5-
a ) .
... .0~~~~~~~~~~~~~~~~~~~~.................
u higher than optimal lower than optimal ... higher than optimal lower than optimal
' ............
.f 04 ft 0.4?
Adjustment factor in period 2 was ... Adjustment factor in period 2 was
0.6F U B. 0.6R EX
-~0.2- 0.2-
0. 1 0. 1 ./
a) EI0.4 P 0.4
..higher than optimal lower than optimal 0 ... higher than optimal ... lower than optimal
Adjustment factor in period 3 was ..Adjustment factor in period 3 was
F 0.3F / T 0.3
U- PE
prisoner's-dilemma supergames. Thus for zero. I apply the theory of first-round behav-
different games, similar kinds of adjustment ior also to the subsequent periods 2-4, using
processes have been used to explain behavior. as the initial reference point the mean of the
However, the dynamics of the behavior can be previous period. I find that the modal choices
quite different: in some games there is a con- are around iteration step 2, and the majority
vergence toward an equilibrium, whereas in of observations remain below step 3. In most
others, the adjustment process may not lead to sessions, the best reply is within step 2 in
an (efficient) equilibrium. periods 1-3. I cannot accept the hypothesis
of increasing iteration steps, and I suggest
V. Summary that another explanation of the observed be-
havior may be more adequate for periods
My analysis of behavior in an abstract game 2-4. I propose a qualitative learning-
leads me to believe that the structure of the direction theory which predicts that a subject
game is favorable for the study of thought pro-tends to increase his adjustment factor in the
cesses of actual players. In the first period direction of the optimal adjustment factor if
the behavior deviates strongly from game- it was below the optimal one and tends to
theoretic solutions. Furthermore, the distribu- decrease the adjustment factor if it was
tion of the chosen numbers over the [0, 100] above the optimal one. A similar kind of
interval in sessions with different parameters simple learning theory has been applied suc-
were significantly different. I have proposed a cessfully in other experiments.
theory of boundedly rational behavior in
which the "depths of reasoning" are of im- REFERENCES
portance. The results indicate that, starting
from initial reference point 50, iteration stepsAumann, Robert. "Irrationality in Game The-
1 and 2 play a significant role, that is, most of ory," in Partha Dasgupta, Douglas Gale,
the observations are in the neighborhood of Oliver Hart, and Eric Maskin, eds., Eco-
50p or 50p2, independent of the parameter p. nomic analysis of markets and games.
This result accounts for the difference of the Cambridge, MA: MIT Press, 1992, pp.
distributions of the chosen numbers for differ- 214-27.
ent parameter values p. Bacharach, Michael. "Backward Induction and
Thus, the theory of boundedly rational be- Beliefs about Oneself." Synthese, June
havior for the first period deviates in several 1992, 91(3), pp. 247-84.
ways from the game-theoretic reasoning: Bicchieri, Cristina. Rationality and coordina-
tion. Cambridge: Cambridge University
(i) I suggested that the "reference point" or Press, 1993.
starting point for the reasoning process is Binmore, Kenneth. "Modeling Rational Play-
50 and not 100. The process is driven by ers, Part I." Economics and Philosophy,
iterative, naive best replies rather than by October 1987, 3, pp. 179-2 14.
an elimination of dominated strategies. . "Modeling Rational Players, Part
(ii) The process of iteration is finite and not II." Economics and Philosophy, April
infinite. 1988, 4(3), pp. 9-55.
(iii) I apply the same theory for p > 1 and Cachon, Gerard P. and Camerer, Colin. "The
p < 1, whereas game-theoretic reasoning Sunk Cost Fallacy, Forward Induction, and
is different for those parameter sets. Behavior in Coordination Games." Work-
ing paper, University of Pennsylvania,
Over time the chosen numbers approach an 1991.
equilibrium or converge to it. In the '4/ ses-Cournot, Antoine A. "Recherches sur les Prin-
sions, the choice 100 is the best reply in all cipes Mathematiques de la Theorie de Ri-
periods but the first. In the third and fourth chesses." 1838. Translated by N. T. Bacon,
period more than 50 percent of the subjects Researches into the mathematical principles
choose this strategy. In the sessions with p < of the theory of wealth. London: Hafner,
1, there is a moving target, which approaches 1960.
Duffy, John and Nagel, Rosemarie. "On the Ro- Roth, Alvin E.; Prasnikar, Vesna; Okuno-
bustness of Behavior on Experimental Fujiwara, Masahiro and Zamir, Shmuel. "Bar-
Guessing Games." Mimeo, University of gaining and Market Behavior in Jerusalem,
Pittsburgh, 1995. Ljubljana, Pittsburgh, and Tokyo: An Exper-
Frydman, Roman. "Towards an Understanding imental Study." American Economic Review,
of Market Processes: Individual Expec- December 1991, 81(5), pp. 1068-95.
tations, Learning, and Convergence to Roth, Alvin E. and Xing, Xiaolin. "Jumping the
Rational Expectations Equilibrium." Amer- Gun: Imperfections and Institutions Related
ican Economic Review, September 1982, to the Timing of Market Transactions."
72(4), pp. 652-68. American Economic Review, September
Keynes, John M. The general theory of interest, 1994, 84(4), pp. 992-1044.
employment and money. London: Macmil- Schelling, Thomas C. The strategy of conflict. Cam-
lan, 1936. bridge, MA: Harvard University Press, 1960.
McKelvey, Richard and Palfrey, Thomas. "An Selten, Reinhard. "Reexamination of the Perfect-
Experimental Study of the Centipede ness Concept for Equilibrium Points in Exten-
Game." Econometrica, July 1992, 60(4), sive Games." International Journal of Game
pp. 803-36. Theory, 1975, 4(1), pp. 25-55.
Mehta, Judith; Starmer, Chris and Sudgen, . "Anticipatory Learning in Two-
Robert. "The Nature of Salience: An Ex- Person Games," in R. Selten, ed., Game
perimental Investigation of Pure Coordina- equilibrium models I. Berlin: Springer Ver-
tion Games." American Economic Review, lag, 1991, pp. 98-154.
June 1994, 84(3), pp. 658-73. Selten, Reinhard and Buchta, Joachim. "Experimen-
Mitzkewitz, Michael and Nagel, Rosemarie. "Ex- tal Sealed Bid First Price Auction with Directly
perimental Results on Ultimatum Games Observed Bid Functions." Discussion Paper
with Incomplete Information." Interna- No. B-270, University of Bonn, 1994.
tional Journal of Game Theory, 1993, Selten, Reinhard and Stoecker, Roif. "End Behav-
22(2), pp. 171-98. ior in Sequences of Finite Prisoner's Dilemma
Moulin, Herve. Game theory for social sci- Supergames: A Learning Theory Approach."
ences. New York: New York Press, 1986. Journal of Economic Behavior and Organi-
Nagel, Rosemarie. "Experimental Results on zation, March 1986, 7(1), pp. 47-70.
Interactive, Competitive Guessing." Dis- Simonsen, Mario H. "Rational Expectations,
cussion Paper No. B-236, University of Game Theory, and Inflationary Inertia," in
Bonn, 1993. P. W. Anderson, Kenneth J. Arrow, and
Ochs, Jack. "Coordination Problems," in John David Pines, eds., The economy as an evolv-
Kagel and Alvin E. Roth, eds., Handbook ing complex system, Vol. 5. Redwood City,
of experimental economics. Princeton, NJ: CA: Addison-Wesley, 1988, pp. 205-41.
Princeton University Press, 1995, pp. 195- Stahl, Dale 0. "The Evolution of Smart,, Play-
252. ers." Games and Economic Behavior, Oc-
Prasnikar, Vesna and Roth, Alvin E. "Consid- tober 1993, 5(4), pp. 604- 17.
erations of Fairness and Strategy: Experi- . "Rule Learning in a Guessing
mental Data from Sequential Games." Game." Working paper, University of
Quarterly Journal of Economics, August Texas, 1994.
1992, 107, pp. 865-88. Stahl, Dale 0. and Wilson, Paul W. "Experimen-
Rosenthal, Robert. "Games of Perfect Infor- tal Evidence on Players' Models of Other
mation, Predatory Pricing, and the Chain Players." Journal of Economic Behavior
Store Paradox." Journal of Economic The- and Organization, December 1994, 25(3),
ory, August 1981, 25(1), pp. 92-100. pp. 309-27.
Roth, Alvin E. and Erev, Ido. "Learning in Van Huyck, John B.; Battalio, Raymond C. and
Extensive-Form Games: Experimental Data Beil, Richard 0. "Tacit Coordination Games,
and Simple Dynamic Models in the Inter- Strategic Uncertainty, and Coordination
mediate Term." Games and Economic Be- Failure." American Economic Review,
havior, January 1995, 8(1), pp. 164-212. March 1990, 80(1), pp. 234-48.