Professional Documents
Culture Documents
Reference:
• Chapter 1 and 2, Prisner, E. Game theory through examples. Mathematical
Assocation of America, 2014.
• Chapter 14, Hillier and Lieberman, Introduction to Operations Research ,
McGraw Hill, 2010, 9th Edition. ISBN 978-0-07-337629-5.
Contents
1. Introduction to game theory
a. What is a game?
b. Game classification
2. Simultaneous games
a. Bimatrix description
b. Which option to choose
c. Dominated move
d. Best response
e. Nash equilibria
3. Solving Nash Games
1. Introduction
1.1. What’s a game?
• Game in game theory refers mathematical game, which has been
applied to economics, politics, business, and other various areas.
• Ingredients:
- Rules
- Outcomes and payoffs: Mathematical game has many possible
outcomes, each producing payoffs for players;
- Uncertainty of the outcome: Since a game contains some random
elements, or has more than one player, the outcome of the game
cannot be predicted;
- No cheating
¨ Rational behavior is usually assumed for all players in a game. That is,
players have preferences and beliefs about the world, and try to
optimize their individual payoffs.
1.2. Game Classification (1 of 2)
Games can be categorized according to several criteria:
¨ How many players are there in the game? Usually there should be
more than one player.
¨ Is play simultaneous or sequential?
- Simultaneous game: Each player has only one move (action), and all moves
are made simultaneously.
- Sequential game: Each player moves at a time, and players may have to
move several times.
¨ Do players have perfect information? Every player knows other players’
moves when about to move
¨ Is the game zero-sum? A player can have a positive payoff only if
another has a negative payoff.
Game classification (2 of 2)
Games can be categorized according to several criteria:
¨ Is the game cooperative or non-cooperative? Even if players negotiate,
the question is whether the results of the negotiation can be enforced
- Cooperative game: The results of the negotiations can be put in a contract
and be enforced.
- Non-cooperative game: A player can move differently from what was
promised in the negotiation. Then the communication is called “cheap
talk.”
¨ Within the scope of this chapter extension, we’ll focus on several main
types of simultaneous games
2. SIMULTANEOUS GAMES
2.1. Normal Form – Bi-matrix Description
¨ Assumption:
- Each player only has finite options.
- Players simultaneously make their moves, determine the outcome of the
game, and receive their payoffs
How many outcomes are possible?
players
Player has possible moves, for
There are possible outcomes.
¨ Symmetric game: All players have the same options, and if the two
players interchange their moves, the payoffs also interchange.
2.2. Which Option to Choose ?
¨ Some people always expect the worst, they want to
maximize their payoffs. So, for any of their options,
they would concentrate on their smallest possible
payoff, and choose the option with highest value. This
value is called the maximin value or security level
¨ This move is called a maximin move (strategy), since it
maximizes the minimum possible payoff.
¨ To choose the maximin move, the player doesn’t have to
know the payoffs of other players
Example 1 – two players (1 of 3)
Two companies share a market, If both don’t advertise, they get $5,000,000
each. If both advertise, both lower their gain to $3,000,000. If A advertises, but B
doesn’t, A gets $6,000,000 and B only $2,000,000, and conversely if B
advertises and A doesn’t. What is the payoff pattern of two players?
Two-person Zero-Sum Games
¨ A game with two players, where a gain of one player equals
a loss to the other
¨ Example: Consider a coin-matching game in which each of
two players A and B selects a head (H) or a tail (T). If the
outcomes match (i.e. H and H, or T and T) player A wins
$1.00 from player B. Otherwise, A loses $1 to B. In this
game each player has two strategies (H or T), which yeild
the following 2x2 game matrix expressed in terms of the
payoff to A
Player A
H T
Player B H 1 -1
T -1 1
Example 2 – two players zero-sum
Assume we are play Rock – Scissors – Paper for one dollar. Develop the
payoff matrix
Does the result change if we consider the risk cost of not protect employees from the
padamic? Assuming that it would cost each company $100,000 if they let their employees
expose to the disease. Wearing mask would reduce the exposure rate by 35%.
2.3. Dominated moves (1 of 2)
¨ A move, , for A strictly dominates another move , if always results in a
higher payoff for A than
- A rational player would never play a move that is strictly dominated by
another one (i.e. weak move).
- A player’s maximin move is never strictly dominated by any of her other
moves The advice given by the maximin mechanism and dominated
move mechanism are typically the same.
¨ If all players are assumed to be rational, and experienced enough, they
reduce the game by eliminating all strictly dominated moves.
- The process of repeatedly reducing the game, as well as its result, a
game that can’t be reduced any further is denoted by IESD – iterated
elimination of strictly dominated moves.
Dominated moves (2 of 2)
¨ A’s move weakly dominates another move, if it yields at least the same
payoff for A in all cases generated by combinations of moves of the
other players, and in at least one case an even better payoff
- Iterated elimination of weakly dominated move (IEWD) is not something
that should be performed automatically.
Example 4 – two bars (1 of 2)
Each of two bars charges its own price of a beer, either $2, $4, or $5. The
cost of obtaining and serving the beer can be neglected. It is expected that
6,000 beers per month are drunk in a bar by tourists, who choose one of
two bars randomly, and 4,000 beers per month by natives who go to the
bar with lowest price,and split evenly in case both bars offer the same
price. What prices would the bars select?
Solution
¨ Consider an instance: If bar A charges $2 and bar B charges $4, then all natives
will choose bar A. Therefore, bar A will serve 4000 beers to the natives, and
3000 beers to tourists, serving 7000 beers in total, making 7000*2 = 14000
dollars. Bar B will only serve 3000 beers to tourists, making 3000 *4 = 12000
dollars.
¨ The payoff matrix, with values in thousands of dollars is in the first matrix
¨ For each bar, move “4” strictly dominates move “2”, therefore we could eliminate
both moves “2” to get the reduced game to the second matrix.
¨ Now, but not before the elimination, move “4” strictly dominates move “5”.
Therefore, we eliminate these moves for both players as well and arrive at a
game with only one option, “4”, for each player, and a payoff of $ 20000 for each.
Therefore both players will choose $4 as the price of the beer.
2.4. Best response
¨ A’s best response to B’s move M is the move that yields the highest
payoff for A, given B’s move M
- There may be several best responses to a given move
- To find A’s best response to B’s move M, we don’t even have to know B’s
payoffs.
¨ You’ll find the best responses for A’s move by looking at the rows of the
bimatrix one by one and selecting in each row the cell where the
second entry is maximum. The label of the corresponding column is the
best response to the move corresponding to that row.
- The same rule applies to find B’s best response with the columns.
B advertises B doesn’t
A advertises 3, 3 6, 2
A doesn’t 2, 6 5, 5
• The best response for A’s move when A advertises is that B advertises.
- The best response for A’s move when A doesn’t advertise is that B
advertises.
• Likewise, the best response for B’s move when B advertises is that A
advertises
- The best response for B’s move when B doesn’t advertise is that A
advertises.
Example 4 revisit – best response
$2 $4 $5
$2 10, 10 14, 12 14, 15
$4 12, 14 20, 20 28, 15
$5 15, 14 15, 28 25, 25
• Adam and Bob have robbed the bank and been arrested. They
are interrogated separately. Adam & Bob have an option to
confess (move C) or to remain silent (move S). The police has
little evidence, but if both remain silent, they’ll be sentenced to
one year on a minor charge. Therefore, the police proposes a
deal: if one confess while the other remains silent, the one
confessing goes free, while the other is sentenced for three
years. However, if both talk, both will be sentenced to two years.
Find the Nash equilibria, If each player’s payoff is 3 minus the
number of years served in jail.
Solution
To simplify the game, we assume that the total payoff for each
player is the sum of the payoffs (in terms of satisfaction) of being at
the preferred place, which gives a satisfaction of c satisfaction units,
and being together with the partner, giving d satisfaction units. We
have two variants, depending on whether
c or d is larger, the low or high love variants. The payoff here is
satisfaction instead of money
Example 7 – battle of sexes (2 of 2)
• Here are the payoff bimatrices for both variants, where Adam chooses the rows
and Beth chooses the columns.
• Using best response, there’re 2 Nash equilibria in this version: (soccer, soccer)
and (movie, movie)
• Games with more than one pure Nash equilibrium are sometimes called
coordination games, since if pre-game negotiations are allowed, the players have
to agree on one of them.
• The question is: which Nash equilibrium is the best?
- By concentrating on Pareto-optimal Nash equilibria.
- A Nash equilibrium is Pareto-optimal if it is not strictly dominated by any other
Nash equilibrium.
3. Solving Nash Games
3.1. Zero-Sum Game Solution
¨ Solution: minimax solution (there is a unique minimax solution)
¨ Example: Payoffs to player A
Player B
Strategy 1 Strategy 2 Minrow
Player A Strategy 1 10 14 10
Strategy 2 7 12 7
MaxCol 10 14
Player A selects strategy 1, Player B selects strategy 1
¨ Note: The optimal solution to such a game may require each player to play a
pure strategy (i.e. a choice of only one strategy/option/action by each player)
strictly determined game when the maximum of the of a row is equal
to the minimum of the of the columns. The value 10 is called a
saddle point.
¨ Minimax may not lead to the best possible outcome if the opponent does not use
a minimax solution
3.2. Mixed Strategies
¨When the minimax equilibrium is .
Player B’s Player A’s strategies Minrow
strategies
C1 C2 C3 C4
x
i 1
i y j 1; x i , y j 0; i , j
j 1
min𝑧=𝑤
n n
min max a1 j y j ,..., amj y j
yj
j 1 j 1
s.t. y1 y2 ... yn 1
y j 0, j 1,2,..., n
Example
B1 B2 B3 Minrow
A1 0 -2 2 -2
A2 5 4 -3 -3
A3 2 3 -4 -4
Maxcol 5 4 2
Solution
B1 B2 B3 Minrow
A1 0 -2 2 -2
A2 5 4 -3 -3
A3 2 3 -4 -4
Maxcol 5 4 2
Solution
max z v
m
s.t. v aij xi 0, j 1,2,.., n
i 1
x1 x2 ... xm 1
xi 0, i 1,2,..., m
Solution
min𝑧=𝑤
Optimal Solution