Professional Documents
Culture Documents
1, MARCH 2010 1
Abstract—It is easy to create new combinatorial games but more Pell [1, p. 10] states:
difficult to predict those that will interest human players. We ex-
amine the concept of game quality, its automated measurement If we could develop a program which, upon considera-
through self-play simulations, and its use in the evolutionary search tion of a particular game, declared the game to be uninter-
for new high-quality games. A general game system called Ludi is esting, this would seem to be a true sign of intelligence!
described and experiments conducted to test its ability to synthe-
size and evaluate new games. Results demonstrate the validity of So when this becomes an issue, we will know that the field
the approach through the automated creation of novel, interesting, has certainly matured.
and publishable games.
With this in mind, we describe a framework called Ludi for
Index Terms—Aesthetics, artificial intelligence (AI), combinato- the measurement and synthesis of combinatorial games, and
rial game, evolutionary search, game design. three experiments designed to test the system and the validity
of the following hypotheses.
I. There exist fundamental (and measurable) indicators of
I. INTRODUCTION
game quality.
II. These fundamental indicators may be harnessed for the
HILE games research has led to many important arti-
W ficial intelligence (AI) breakthroughs over the last few
decades, these have generally come through the study of classics
directed search for new high-quality games.
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
2 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
C. Recombination Games
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 3
and are therefore generally more important. If the total dif- The most widely used GDL is probably the commercially
ference between the two rule sets exceeds a certain threshold available Zillions of Games ZRF rule language [12]. ZRF au-
value then the two games are considered to be distinct. thors define games in a Lisp-like syntax using predefined key-
words, and may programmatically create complex rule struc-
E. General Game Players tures through macros. More recently, the Stanford GDL used
for the AAAI GGP competitions [13] is a lower level language
Given the possibility of creating a large number of rule sets, it
that defines games using first-order logic.
would be desirable to test them automatically through self-play.
General game players (GGPs)—software systems for playing III. THE LUDI SYSTEM
a range of games well rather than any one particular game ex-
Ludi is a system for playing, measuring, and synthesizing
pertly—are ideal for this purpose.
games within the scope of its GDL (Fig. 2). The main compo-
GGPs were first proposed several decades ago [10] but have
nents of the system are:
recently enjoyed a resurgence of interest as researchers come
• GDL: defines the scope of games;
to realize their potential value to the gaming and broader AI
• GGP: interprets games and coordinates play;
communities. This includes GGP competitions run over recent
• strategy module: informs move planning;
years in conjunction with international AI conferences [11].
• criticism module: measures game quality;
• synthesis module: generates new games.
F. Game Description Languages
Central to any GGP is the game description language (GDL) A. Ludi GDL
that defines the scope of games understood by the system. There The Ludi GDL is a high-level game description language
is a delicate balance between defining a GDL that is powerful based on the ludemic understanding of games outlined in
and extensible enough to encompass a wide range of known and Section II. It is structured to follow the basic means-play-ends
not-yet-known games, yet also efficient, elegant, and compre- model of games, extended to include the relationship between
hensible to human authors. the game and its players (Fig. 3).
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
4 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
The Ludi GDL was devised with Kernighan and Pike’s prin- legal destination cells for movable pieces are similarly marked
ciples of good software design [14] in mind: “ ” when those pieces are clicked on.
• simplicity; The play manager coordinates play for one to eight human
• clarity; and artificial players, although only two are used for this study.
• generality; This includes move scheduling, input handling, ko repetition
• automation. testing to avoid infinite loops, all players passing, and so on.
It is a higher level language than the Stanford GDL and Zil- Moves for artificial players are planned using standard alpha
lions ZRF, and although concise and conducive to human au- beta adversarial search with move ordering, beam width reduc-
thoring and machine manipulation, it lacks the universal gen- tion, and iterative deepening [16]. Estimated values for nonter-
erality of the Stanford GDL in particular. However, its hierar- minal board positions are provided by the Strategy module, as
chical and well-defined nature makes it ideal for the intended follows.
experiments, as it is much more likely that a structured tree-
based language will evolve sensible rule sets than an unstruc- C. Strategy Module
tured logic-based one. The Ludi GDL proved sufficiently rich The strategy module provides value estimates for given board
for this intended purpose that its somewhat limited scope was positions relative to each player. This is achieved through a set
not an issue. of advisors working within certain policies.
The following example conveys the essence of the language: 1) Advisors: Advisors are evaluation functions that represent
(game Tic-Tac-Toe some narrow but rational view of the board position [17], and
(players White Black) express whether a player’s position is favorable or unfavorable
(board within this perspective [18].
(tiling square i-nbors) Ludi defines 20 such advisors for aspects such as mobility,
(size 3 3) proximity to goal, attacking potential, connective potential, and
) so on. Each advisor takes as input a board state, end condition,
(end (All win (in-a-row 3))) and player color (Fig. 5) and has the functionality to return both
) an estimate of that player’s positional value and whether the end
This game (Tic-Tac-Toe) is played between White and Black condition has been achieved, as required.
on a 3 3 square grid with orthogonal and diagonal adjacency, 2) Policies: The contributions from each advisor are com-
and is won by the player to make a line of three pieces of their bined using a weighted linear function [15] to provide an overall
color (if any). Unless otherwise stated, it is assumed that players value for board state
take turns placing a piece of their color on an empty board cell
each move.
Ludi GDL definitions closely correspond to a game’s ludemic
description, which is how a human designer would typically (1)
conceptualize the game. A more detailed description of the lan- where is a vector of weights and the set of
guage is given in Appendix I and further examples of games advisor functions. The weight vector constitutes a policy that
defined in the GDL can be found in Appendix II. describes the relative importance of each advisor for that game.
3) Policy Optimization: For efficiency, it is important that
only those advisors relevant to the game have nonzero weight.
B. Ludi GGP
The system is able to derive a default policy for each game based
The core of the Ludi system is its general game player, as upon its rules and to optimize that policy through self-play using
shown in Fig. 2. The Ludi GGP is implemented in C++ and two-membered evolutionary search -ES [19].
provides the following functionality: Policy optimization threw up some surprising emergent
• rules parser; strategies, such as a small negative weighting on stack height
• game object; to disincline repetitive cycles in stacking games, which was
• user interface; later incorporated into default policies where appropriate. A
• play manager. null “fight or flight” policy is used for any player with no
The rules parser loads and parses games defined in the Ludi specified end conditions, in which pieces are moved towards
GDL. If a definition is valid according to the grammar, then the enemy pieces to encourage engagement while maximizing their
corresponding ludeme tree is constructed and the single game movement potential to allow escape if necessary.
object initialized. The game object maintains a record of the Although a high level of play cannot be claimed for all
current board state and handles tasks such as the generation of games, this advisor/policy approach proved sufficient for exer-
legal moves and testing for terminal conditions. cising rule sets defined in the GDL and providing meaningful
The user interface (Fig. 4) presents games uniformly and self-play simulations. All players, human and artificial, are
anonymously so that quality judgments are made on the merits beginners at any newly created game.
of the games themselves rather than their visual attractiveness.
The interface provides a plain English translation of the current IV. GAME EVALUATION
rule set and a tutorial mode to help players understand new The criticism module measures games for quality through
games. In tutorial mode, legal placements are marked “ ” and self-play simulations, based on certain aesthetic criteria.
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 5
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
6 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
plans that we would ideally like to measure as an indication of human opponents taking 30 s per move on average will com-
the players’ engagement with the game, but this is of course plete such a game in 30 min on average, a figure that experience
impossible. However, we can measure the resulting moves that suggests is reasonable (note that the GGP plays much faster than
represent tangible realizations of those plans. As Perlis [29, p. this for most games).
11] states: “We measure our understanding (and control) by the
extent to which we can arithmetize an activity.” C. Aesthetic Measurement
1) Aesthetic Criteria: The model is divided into intrinsic
and extrinsic aspects, with the latter further divided into quality Aesthetic measurements are made for each game during a
and viability. These define the three main categories of aesthetic number of self-play trials . The first few moves of each game
criteria: are made randomly (but legally) to encourage a more thorough
• intrinsic: based on rules and equipment; exploration of the move search space; random moves are not
• viability: based on game outcomes; included in the aesthetic measurements. Self-play trials that ex-
• quality: based on trends in play. ceed twice the preferred game length are abandoned as draws.
Intrinsic criteria are the easiest to measure but are less useful Intrinsic criteria are measured before the trials start and vi-
for indicating game quality than they are for specifying personal ability criteria measured by the outcome of each trial. Quality
feature preferences. Viability criteria, such as those proposed by criteria measure trends in play during the trials through the use
Althöfer [21], are robust and useful for quickly detecting flaws of lead histories, as shown in Figs. 7 and 8. The white and black
in games. Quality criteria are the most subtle and difficult to dots indicate the relative positional strengths of both players fol-
measure, but are the key to estimating the potential worth of lowing each move, while the thick lines represent the current
games that have proven viable. difference between the eventual winner and loser.
2) Preferred Game Length: It is useful to define a preferred A total of 57 aesthetic criteria were implemented for Ludi,
game length which is the move number at which the cur- consisting of the following:
rent game will ideally reach a natural conclusion. It might seem • 16 intrinsic criteria;
sensible to adjust this value on a game-by-game basis according • 11 viability criteria;
to the estimated complexity of the game based on board size, • 30 quality criteria.
piece count, branching factor, and so on, but this has the unde- We do not claim that this is a universal or canonical set of fea-
sirable effect of rewarding very short games on smaller boards tures with which all games may be measured. Rather, we define
and excessively long games on larger boards. Instead, we wish to a large number of measurements based on observation, experi-
reward deep, involved passages of play emerging from simpler ence, and prior work, and experimentally determine which of
rule sets and piece configurations; this follows more closely the these are most relevant for the task at hand.
spirit of combinatorial game design and will reduce the running As it is not possible to describe all 57 measurements within
time for the experiments while still allowing complex, elegant, the space of this paper, we present a selection below by way of
and (hopefully) interesting games to emerge. example. For complete descriptions of all criteria, see [15].
We therefore standardized the preferred game length over all 1) Completion (Viability): Games should produce more vic-
games to 60 moves, based on the observation that two tories than draws. The completion criterion simply mea-
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 7
(4)
Fig. 8. Lead histories of (a) an uncertain game and (b) a certain game.
where represents the board evaluation for the eventual
winner at move and represents the board evaluation
sures the total sum of games won by either player as a ratio of for the eventual loser at move . This equation therefore mea-
all games played sures the average number of moves that the eventual winner of
each game spends with a negative lead, and the severity of each
such position.
(2) 4) Uncertainty (Quality): The outcome of each game should
remain as uncertain for as long as possible if all players are to
A low completion rate indicates a flawed game that is either maintain a vested interest in it. For example, Fig. 8(a) shows
drawish or tends to exceed the maximum move limit. an uncertain game in which neither player develops a strong
2) Duration (Viability): Games should neither be too short advantage until its conclusion, while Fig. 8(b) shows a certain
nor too long. The duration criterion measures the average game in which the eventual winner takes a strong early lead and
discrepancy between the total moves made per game and the keeps it; this will probably be a very unsatisfying game for the
preferred game length , which is set to 60 losing player.
The uncertainty criterion measures the degree to which
the estimated lead value for each move falls above or below the
expected average
(3)
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
8 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
the viability criteria; however, they are useful for estimating the parameters and games are created by making statistical choices
quality of viable games. at each decision point in the grammar according to these param-
eters [18]. Similarly, Rolle’s Morphling allows the user to inter-
D. Criteria Efficiency actively experiment with rule variations of a game according to
Some of the criteria not discussed here are significantly certain rules [30].
slower to measure than others and require additional trial runs Games created by these systems fall within the constructive
in different modes. These “slow” criteria include measurements mode of understanding described by Stiny and Gips, as they are
related to puzzle detection, depth estimation, and robustness to directly constructed according to certain rules [28]. On the other
random, overly defensive, and obstructive players. Full details hand, we wish to create games using the evocative mode of un-
of these modes are given in [15]. derstanding and search for those games that stimulate interest
in human players without having to make assumptions about
E. Aesthetic Coefficients any rules necessary for their construction. For this purpose, we
chose an evolutionary approach to search the game design space,
The weighted sum of all 57 criteria gives an estimated aes- which balances the exploration of the design space with the ex-
thetic score , which is the output of the aesthetic measure- ploitation of existing knowledge [31].
ment process for each game
A. Mating Games
(6) Ludi uses a standard evolutionary approach [32] to create
games, except that all surviving offspring are returned to the
The aesthetic weightings provide an aesthetic profile for population and never culled from generation to generation. Fit-
each game, and were determined empirically in Experiment II ness determines order rather than survival, to encourage the
(described shortly) which involved correlating aesthetic mea- emergence of novel rule combinations.
surements for a number of games with human player rankings Evolution continues until a given time limit is reached or the
of those games. The bias term improves the accuracy of the desired number of new games created. The basic process is sum-
correlation. Criteria with a 0 weighting for a given game are not marized in Fig. 9 and explained below. It is worth noting that the
invoked for that game, for efficiency. aim of this process is to produce a number of interesting indi-
viduals rather than a single optimal individual.
1) Population: The population is initialized with a number
V. GAME SYNTHESIS
of known games in order to encourage well-formed offspring,
The synthesis module creates new games through the evolu- as strictly random rule combinations are most unlikely to pro-
tion of rule sets using genetic programming methods. duce viable games. Members of the population all have a unique
Pell has previously demonstrated the automated generation name and remain sorted by estimated aesthetic value at all times.
of Chess-like games using a method of constrained stochastic 2) Parent Selection: For each iteration, two parents are se-
context-free generation, in which the user specifies a number of lected from the population using stochastic universal sampling
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 9
[33]. This method draws samples from the entire range but se- C. Repopulation
lects fitter individuals more often, encouraging genetic diversity
while maintaining a reasonable standard of fitness. All children that survive to this point are measured for quality
3) Recombination: The ludeme trees of the two parents are and inserted back into the population according to their pre-
then crossed over to produce a child game using standard ge- dicted aesthetic score. These scores are only preliminary at this
netic programming techniques [34]. One of the parents is chosen stage but sufficient for ordering purposes.
at random to act as a template [35] and elements crossed over A by-product of the aesthetic measurement is a viability test
from the second parent with 10% likelihood. Elements are only that identifies games as nonviable if they:
crossed over to elements with which they are compatible, giving • result in draws more often than not;
a weak form of strong typing [36] that encourages the creation • are very unbalanced towards either player;
of well-formed children. The Ludi GDL description of games • have a serious first or second move advantage; or
as high-level, hierarchical ludeme trees is ideal for this purpose. • do not end within a reasonable number of moves.
4) Mutation: Elements of the child game are then visited Viable offspring have their policies optimized and are remea-
and mutated with 10% likelihood. This may involve adding, sured for quality as a postprocessing step. They are the suc-
changing, or removing elements and/or attributes within the cessful products of the evolutionary process and typically rep-
ludeme tree. Context-dependent constraints are again employed resent around 1% of all nonculled children.
to ensure that changes are compatible with element type, 1) Emergence: Since no rule optimization is performed and
and that deleted items are replaced with default values where all surviving offspring are returned to the gene pool, the popula-
needed. Some repair functions [37] are used to correct obvious tion will be rife with flawed rules and broken games. This seems
errors. at odds with the general wisdom that decries the presence of in-
5) Baptism: Each child is given a short name unique to the trons and bloat in the population [40] but proved necessary for
population, using a Markov chain algorithm based upon letter the success of this project.
combination frequencies found within a list of source words Sanitizing the population had the effect of producing off-
[14]. Ludi uses as its input a list of Tolkien-style names from spring that were mostly just slight variations of their original
a public domain computer game [38] to produce names such as: ancestors, as it is highly unlikely that two given rule sets will
Oroth, Galdal, Etherond, Kemeneth, Valindor, Bered, Mor, and recombine and mutate to produce a viable, distinct, and supe-
so on. rior child, due to rule fragility. Instead, it proved more effective
to flood the population with flawed genetic material that would
act as recessive genes and recombine in unexpected and hope-
B. Validity Checks fully serendipitous ways over many generations; this was where
true innovation emerged during the experiments.
Each child game then undergoes a series of validity checks.
1) Rule Safety: The child is tested for rule safety [35] by
invoking the GGP to instantiate the corresponding game object, VI. EXPERIMENTS
in order to exploit its rigorous error checking systems.
Some rule optimization would typically be performed at this Three experiments involving game ranking, measurement,
stage, such as trimming impossible or irrelevant vestigial rules and synthesis were then conducted to test the hypotheses.
[39], but Ludi forgoes this step for reasons given shortly. As
Perlis [29, p. 8] observes: “Optimization hinders evolution.” A. Experiment I: Game Ranking
2) Speed Test: The child is timed at search plies of 1, 2, 3,
and 4 and discarded if move planning exceeds 15 s per move. In order to distinguish good games from bad, it is first neces-
This somewhat draconian measure ensures that slower games sary to determine which games interest human players. The aim
do not unduly hold up the evolutionary process. of the first experiment was to set a yardstick of human player
One concern is that the imposed speed limit will filter out rankings for a database of 79 predefined sample games.
more complex games, typically those on larger boards with 1) Method: Experimental subjects were presented with
more pieces and greater branching factors, and risk producing survey software that randomly selected two games from the
simple children only. However, it was our specific intention to database of 79, then required them to play both games against
produce elegant games in which simple rules combine harmo- the computer and nominate which of the two they found more
niously to produce complex move decisions rather than seeking interesting. Paired comparisons were automatically e-mailed to
complexity in sheer volume of numbers; the validity of this the authors.
approach was born out in the experimental results. 2) Subjects: Fifty seven subjects participated in the survey,
3) Policy Choice: Perfunctory policy choice is achieved by recruited from online board gaming groups. All were at least 18
comparing all combinations of the child’s default policy with years of age.
both parents’ policies in round-robin matches and choosing the 3) Results: Rankings were induced from 628 paired compar-
most successful hybrid. The child is discarded if match games isons by a cross-entropy (CE) method due to the second author,
take too long or more than half fail to produce a winner. F. Maire [15]. The 79 sample games were ranked from most
4) Inbreeding: The child is measured for distance from each preferred to least preferred with a classification rate of 0.8997
member of the population and culled if an overly close relative (variance ), indicating that the derived game rank-
is found. ings were consistent with almost 90% reliability.
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
10 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
TABLE I
CORRELATION BY CRITERIA TYPE
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 11
VIII. CONCLUSION
We demonstrate both the Ludi system’s ability to automat-
ically measure games for their potential to interest human
players, and its ability to create new high-quality games. This is
the first known demonstration of automated combinatorial game
design at a successful (publishable) level. However, we do not
claim to have defined a canonical set of aesthetic features with
which all games can be measured; rather, our results pertain to
a selection of individuals for which a particular set of aesthetic
measurements appear to resonate for a subset of combinatorial
Fig. 12. Yavalath puzzle: White to play and force a win. games definable in the Ludi GDL. Our main contribution is to
demonstrate a practical approach to the problems of automated
game measurement and content (rule) generation; no doubt
ments made by the system and player rankings for the 19 new many more such aesthetic features will be discovered in the
games. future over a much wider range of games.
These results support hypothesis II: these fundamental Future work might also include expanding the GDL to
indicators may be harnessed for the directed search for new increase its scope and to seed the generative process with larger
high-quality games, at least in the search for new combinatorial data sets of games, such as the thousands described online at
games that this group of subjects find interesting. [42]. Monte Carlo tree search methods such as UCT [43] could
overcome some shortcomings of the advisor/policy approach
VII. DISCUSSION
during move planning to further increase the generality of the
The first thing to note is the general success of the approach; system. It would also be interesting to extend the system to
the system was able to correlate aesthetic measurements perform not only game measurement and synthesis, but the
of games with human player rankings and hence identify detection of flaws such as symmetry strategies [23] and their
those evolved games of most interest. Several of the final 19
correction.
games exhibit novel and interesting rule combinations, and
Finally, we hope that game designers do not see in systems
those ranked #1 and #2 by human players—Ndengrod and
Yavalath—have proven to be of exceptional quality and are such as Ludi a threat to this very human endeavor. Designers
now commercially published [41]. can benefit tremendously from such tools and will, for the fore-
Ndengrod combines Go-like surround capture with a seeable future, hold the edge in unconstrained creativity.
five-in-a-row goal. This combination works well, but is a redis-
covery of an existing game (Irensei) translated to the hexagonal APPENDIX I
grid. THE LUDI GDL
Yavalath, however, features an innovative rule that has not This Appendix provides a functional description of the key
previously been published: win by making four-in-a-row but elements of the Ludi GDL. A complete formal definition of the
lose by making three-in-a-row before doing so. Bearing in mind
grammar is beyond the scope of this paper, but can be found in
the assertion that good games should yield interesting puzzles
[15].
[20], Fig. 12 shows a Yavalath puzzle that demonstrates its
depth. Hint: Black can force a win with either move “ ,” so Games are defined as recursive trees of elements of the fol-
White must make a counter forcing move to avoid this. lowing form:
Analysis of Yavalath’s ancestry reveals that this innovative
(element [attributes] [(element …)s])
winning condition came about from the serendipitous mating
of rules that were impossible in isolation. If such flawed rules The first item of each element is its name which must be
had been optimized out during the evolutionary process then
unique within its current scope. Each element name must be ei-
Yavalath would probably never have emerged.
ther a predefined keyword or a user-defined variable, such as
Teiglith (#4), Elrostir (#5), Gorodrui (#7), and Valion (#16)
demonstrate the approach’s usefulness even with games of a named piece type, that has been previously described within
average or below average appeal. While not overly successful the game’s rule set. Italicized symbols indicate keywords, upper
as games, each involves interesting rule mechanisms that case symbols indicate data types, capitalized symbols indicate
game authors might use as inspiration for future designs. See record types, and square brackets denote optional items. Ap-
Appendix II for GDL descriptions of the games mentioned in proximately 200 keywords and 20 record types are defined and
this section. implemented for the language.
Curiously, 63% of the final games (12 out of 19) featured The root of each rule tree is the main game ludeme
-in-a-row goals, as opposed to 29% (23 out of 79) in the ini-
tial data set. This may be a reflection of the prevalence of such game ! (game NAME [params]
players
games in the initial data set, the superiority of the -in-a-row board
advisor over other advisors, or simply indicate that this is a rules
robust rule that thrives in more contexts than others. In any [support]
event, it shows that interesting games can indeed be derived )
from Tic-Tac-Toe. rules ! { [pieces] [start] [play] end }
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
12 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 13
The following example shows how complex move descrip- 5) Play Ludeme: Specifies other constraints on play
tions may be constructed from these simple predicates. (optional).
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
14 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
BROWNE AND MAIRE: EVOLUTIONARY GAME DESIGN 15
(board (tiling square i-nbors) (size 4 4)) [16] S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach,
(pieces 2nd ed. Englewood Cliffs, NJ: Prentice Hall, 2003.
(Stone All [17] S. Epstein, “Deep forks in strategic maps: Playing to win,” in Heuristic
(moves Programming in Artificial Intelligence 2, D. Levy and D. Beal, Eds.
(move
(pre (>= (num-nbors to enemy) 1))
Chichester, U.K.: Ellis Horwood, 1991, pp. 189–203.
[18] B. Pell, “Strategy generation and evaluation for meta-game playing,”
(action (push)) Ph.D. dissertation, Trinity College, Univ. Cambridge, Cambridge,
) U.K., 1993.
(move [19] T. Bäck, F. Hoffmeister, and H. Schwefel, “Basic aspect of evolution
(pre (and (empty to) (connected))) strategies,” Stat. Comput., vol. 4, no. 2, pp. 51–63, 1994.
(action (push)) [20] M. Thompson, “Defining the abstract,” The Games J., 2000 [Online].
) Available: http://www.thegamesjournal.com/articles/DefiningtheAb-
)
stract.shtml
)
) [21] I. I. Althöfer, “Computer-aided game inventing,” Friedrich Schiller
(start Universität, Jena, Germany, Tech. Rep., 2003 [Online]. Available:
(place (Stone White) A1) http://www.minet…uni-jena.de.preprints/althoefer_03/CAGI.pdf
(place (Stone Black) D4) [22] H. Iida, K. Takahara, J. Nagashima, Y. Kajihara, and T. Hashimoto,
) “An application of game-refinement theory to Mah Jong,” in Proceed-
(end ings of the International Conference on Entertainment Computing, ser.
(All win (in-a-row 3)) Lecture Notes in Computer Science, M. Rauterberg, Ed. Berlin, Ger-
(All lose (or (in-a-row 4) (group))) many: Springer-Verlag, 2004, vol. 3116, pp. 445–450.
) [23] C. Browne, Connection Games: Variations on a Theme. Natick, MA:
) AK Peters, 2005.
[24] W. Kramer, “What makes a game good?,” The Games J., 2000
[Online]. Available: http://www.thegamesjournal.com/articles/What-
ACKNOWLEDGMENT MakesaGame.shtml
[25] M. Gardner, “Theory of everything,” The New Criterion, vol. 23, no. 2,
The authors would like to thank B. Pham (Queensland 2004 [Online]. Available: http://www.newcriterion.com/articles.cfm/
theory-of-everything-1136
University of Technology, Brisbane, Qld., Australia) for useful [26] H. Ellis, Impressions and Comments. Boston, MA: Houghton Mif-
guidance, I. Althöfer (Friedrich Schiller Universität, Jena, Ger- flin, 1914.
many) for his help and enthusiasm towards the topic, S. Colton [27] G. Birkhoff, Aesthetic Measure. Cambridge, MA: Harvard Univ.
Press, 1933.
(Imperial College London, London, U.K.) for support, the
[28] G. Stiny and J. Gips, Algorithmic Aesthetics: Computer Models for
reviewers, and the proofreaders. Criticism and Design in the Arts. Berkeley, CA: Univ. California
Press, 1978.
[29] A. Perlis, “Epigrams on programming,” SIGPLAN Notices, vol. 17, no.
9, pp. 7–13, 1982.
REFERENCES [30] T. Rolle, “Development of a multi-game engine,” Diploma thesis, Fac-
ulty Math. Comput. Sci., Friedrich-Schiller-University, Jena, Germany,
[1] B. Pell, “METAGAME: A new challenge for games and learning,” in 2003.
Heuristic Programming in Artificial Intelligence 3, H. Van den Kerik [31] L. Hansheng and K. Lishan, “Balance between exploration and ex-
and L. Allis, Eds. Chichester, U.K.: Ellis Horwood, 1992. ploitation in genetic search,” Wuhan Univ. J. Natural Sci., vol. 4, no. 1,
[2] K. Salen and E. Zimmerman, Rules of Play. Cambridge, MA: MIT pp. 28–32, 1999.
Press, 2004. [32] T. Bäck, F. Hoffmeister, and H. Schwefel, “A survey of evolution
[3] E. Berlekamp, J. Conway, and R. Guy, Winning Ways for Your Mathe- strategies,” in Genetic Algorithms, R. Belew and L. Booker, Eds. San
matical Plays. London, U.K.: Academic, 1982, vol. 1 and 2. Mateo, CA: Morgan Kaufman, 1991, pp. 2–9.
[4] R. Dawkins, The Selfish Gene. Oxford, U.K.: Oxford Univ. Press, [33] J. Baker, “Reducing bias and inefficiency in the selection algorithm,”
1976. in Proc. 2nd Int. Conf. Genetic Algorithms Appl., Mahwah, NJ, 1987,
[5] A. Borvo, Anatomie D’un Jeu de Cartes: L’Aluette ou le Jeu de pp. 14–21.
Vache. Nantes, France: Librarie Nantaise Yves Vachon, 1977. [34] J. Koza, “Genetic programming as a means for programming com-
[6] D. Parlett, “What is a ludeme?,” Incompleat Gamester, 2007 [Online]. puters by natural selection,” Stat. Comput., vol. 4, pp. 87–112, 1994.
Available: http://www.davidparlett.co.uk/gamester/ludemes.html [35] W. Langdon, “Data structures and genetic programming,” in Advances
[7] C. Browne, Hex Strategy: Making the Right Connections. Natick, in Genetic Programming 2. Cambridge, MA: MIT Press, 1996, pp.
MA: AK Peters, 2000. 395–414.
[8] J. Holland, Emergence: From Chaos to Order. Redwood City, CA: [36] D. Montana, “Strongly typed genetic programming,” J. Evol. Comput.,
Addison-Wesley, 1998. vol. 3, pp. 199–230, 1995.
[9] L. Horsten, Philosophy of Mathematics, 2007 [Online]. Available: [37] Z. Michalewicz and G. Nazhiyath, “Genocop III: A co-evolutionary al-
http://plato.stanford.edu/entries/philosophy-mathematics gorithm for optimisation problems with nonlinear constraints,” in Proc.
[10] J. Pitrat, “Realization of a general game-playing program,” in Proc. Int. 2nd IEEE Conf. Evol. Comput., 1995, pp. 647–651.
Fed. Inf. Process. Congr., 1968, vol. 2, pp. 1570–1574. [38] J. Greene, Angband, 2007 [Online]. Available: http://members.cox.net/
[11] M. Genesereth, N. Love, and B. Pell, “General game playing: Overview nppangband/download.htm
of the AAAI competition,” AI Mag., vol. 26, no. 2, pp. 62–72, 2005. [39] G. Costikyan, “Don’t be a vidiot: What computer game designers
[12] N. Love, T. Hinrichs, D. Haley, E. Schkufza, and M. Genesereth, “Gen- can learn from non-electronic games,” 2005 [Online]. Available:
eral game playing: Game description language specification,” Stanford http://www.costik.com/vidiot.html
Logic Group, Stanford, CA, Tech. Rep. LG-2006-01, 2006. [40] T. Soule and J. Foster, “Code size and depth flows in genetic program-
[13] J. Mallett and M. Lefler, Zillions of Games, 1998 [Online]. Available: ming,” in Proc. 2nd Conf. Genetic Programm., 1997, pp. 313–320.
http://www.zillions-of-games.com [41] N. R. Andrés, Nestorgames, 2009 [Online]. Available: http://www.
[14] B. Kernighan and R. Pike, The Practice of Programming. Reading, nestorgames.com
MA: Addison-Wesley, 1999. [42] S. Alden, BoardGameGeek, 2000 [Online]. Available: http://www.
[15] C. Browne, “Automatic generation and evaluation of recombination boardgamegeek.com
games,” Ph.D. dissertation, FIT, Queensland Univ. Technol., Brisbane, [43] L. Kocsis and C. Szepesvári, “Bandit based Monte-Carlo planning,” in
Australia, 2008. Proc. 15th Eur. Conf. Mach. Learn., 2006, pp. 282–293.
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.
16 IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, VOL. 2, NO. 1, MARCH 2010
Cameron Browne received the Ph.D. degree in Frederic Maire (M’97) receiving the M.Sc. degree
computer science from the Faculty of Information in pure mathematics from the Universite Pierre et
Technology, Queensland University of Technology, Marie Curie, Paris 6, France, in 1989, the M.Sc. de-
Brisbane, Qld., Australia, in 2008. gree in computer science engineering from the Ecole
Currently, he is a Visiting Researcher at the Nationale Superieure d’Ingenieurs of Bordeaux,
Computational Creativity Group, Imperial College France, in 1989, and the Ph.D. degree in discrete
London, London, U.K. His research interests include mathematics from the Universite Pierre et Marie
AI techniques for computer artwork and game Curie, Paris 6, France, in 1993.
design. Currently, he is a Senior Lecturer at the Faculty of
Dr. Browne received an Australian Postgraduate Science and Technology, Queensland University of
Award for his Ph.D. studies and the Dean’s Award Technology, Brisbane, Qld., Australia. His research
for Outstanding Thesis of 2008. interests include computer vision and robotics.
Authorized licensd use limted to: IE Xplore. Downlade on May 13,20 at 1:536 UTC from IE Xplore. Restricon aply.