06-Handout-2x2 Game Theory University of Kent

Outline for today
Stat155
Game Theory Zero sum games
Lecture 6: Solving two player zero-sum games Recall: payoff matrices, mixed strategies, safety strategies, Von
Neumann’s minimax theorem
Solving two player zero-sum games
Peter Bartlett Saddle points
Dominated pure strategies
Solving 2 × 2 games
Equalizing strategies
September 13, 2016
1 / 25 2 / 25
Recall: Two-player zero-sum games Recall: Two-player zero-sum games
Definitions Definitions
Player I has m actions, 1, 2, . . . , m. A mixed strategy is a probability distribution over actions.
Player II has n actions, 1, 2, . . . , n. A mixed strategy for Player I is a vector
The payoff matrix A ∈ Rm×n
represents the payoff to Player I:  
  x1 ( )
a11 a12 · · · a1n  x2  Xm
 
 a21 a22 · · · a2n  x =  .  ∈ ∆m := x ∈ Rm : xi ≥ 0, xi = 1 .
   .. 
A= . . ..  i=1
 .. . . .  xm
am1 am2 · · · amn
If Player I chooses i and Player II chooses j, the payoff to Player I is A mixed strategy for Player II is a vector y ∈ ∆n .
aij and the payoff to Player II is −aij . A pure strategy is a mixed strategy where one entry is 1 and the
The sum of the payoff to Player I and the payoff to Player II is 0. others 0. (This is a canonical basis vector ei .)
3 / 25 4 / 25
Recall: Two-player zero-sum games Recall: Two-player zero-sum games
A safety strategy for Player I is an x ∗ ∈ ∆m that satisfies
min x ∗ > Ay = max min x > Ay .

The expected payoff to Player I when Player I plays mixed strategy y ∈∆n x∈∆m y ∈∆n
x ∈ ∆m and Player II plays mixed strategy y ∈ ∆n is
m X
X n This mixed strategy maximizes the worst case expected gain for
EI ∼x EJ∼y aIJ = xi aij yj Player I.
i=1 j=1
Similarly, a safety strategy for Player II is a y ∗ ∈ ∆n that satisfies
>
= x Ay .
max x > Ay ∗ = min max x > Ay .
x∈∆m y ∈∆n x∈∆m
This mixed strategy minimizes the worst case expected loss for
Player II.
5 / 25 6 / 25
Recall: Two-player zero-sum games Outline
Von Neumann’s Minimax Theorem

For any two-person zero-sum game with payoff matrix A ∈ Rm×n ,
max min x > Ay = min max x > Ay . Zero sum games

x∈∆m y ∈∆n y ∈∆n x∈∆m
Recall: payoff matrices, mixed strategies, safety strategies, Von
We call the optimal expected payoff the value of the game, Saddle points
V := max min x > Ay = min max x > Ay . Solving 2 × 2 games
x∈∆m y ∈∆n y ∈∆n x∈∆m
LHS: Player I plays a safety strategy.

RHS: Player II plays a safety strategy.
Safety strategies are optimal strategies.
7 / 25 8 / 25
Saddle points Saddle points
Example Definition
  A pair (i ∗ , j ∗ ) ∈ {1, . . . , m} × {1, . . . , n} is a saddle point for a payoff
−1 1 5 matrix A ∈ Rm×n if
A =  5 3 4
6 2 1 max aij ∗ = ai ∗ j ∗ = min ai ∗ j .
i j
Suppose Player I plays 2 and Player II plays 2.

If Player I plays i ∗ and Player II plays j ∗ , neither player has an
The payoff is a22 = 3.
incentive to change.
Should either player change their strategy?
Think of these as locally optimal strategies for the players.
This pair of pure strategies, (e2 , e2 ), is called a saddle point (or pure
We’ll see that they are also globally optimal strategies.
Nash equilibrium).
9 / 25 10 / 25
Saddle points: Example Saddle points
1 2 3 4 5 6 7 row min
1 -1 1 2 4 8 -2 7 -1
2 5 3 -4 7 3 1 3 1
3 6 2 -5 2 8 -1 2 1
4 3 1 3 -5 3 4 7 1 Theorem
5 1 2 3 5 4 -6 7 1 If (i ∗ , j ∗ ) is a saddle point for a payoff matrix A ∈ Rm×n , then
6 4 8 3 7 5 8 4 3 ei ∗ is an optimal strategy for Player I,
7 3 1 3 -5 3 4 7 1
ej ∗ is an optimal strategy for Player II, and
column max 6 8 3 7 8 8 7 the value of the game is ai ∗ j ∗ .
Is there a saddle point?

Suppose Player I plays 6 and Player II plays 3.
Neither player should change their strategy: (e6 , e3 ), is a saddle point.
Saddle point is a row/column pair where the row min is equal to the
column max. (Need max of row mins = min of column maxes.)
11 / 25 12 / 25
Saddle points Saddle points
Theorem
Saddle point (i ∗ , j ∗ ) (maxi aij ∗ = ai ∗ j ∗ = minj ai ∗ j ) implies ei ∗ , ej ∗ optimal.
Proof
We have seen that we should always prefer to play last but with a
saddle point, the opposite inequality is also true: So if we find a saddle point, the game is easy to solve.
min max x > Ay ≥ max min x > Ay Another way to simplify a two-player zero-sum game is by removing
y ∈∆n x∈∆m x∈∆m y ∈∆n
dominated rows or columns.
≥ min ei>∗ Ay (∗)
y ∈∆n
= ei>∗ Aej ∗ = max x > Aej ∗ (∗)

x∈∆m
>
≥ min max x Ay .
y ∈∆n x∈∆m
The inequalities are all equalities: min max x > Ay = max min x > Ay .
y x x y
This proves von Neumann’s minimax theorem for this case.
And (∗) shows ei ∗ , ej ∗ optimal.
13 / 25 14 / 25
Outline Removing dominated pure strategies
Example: Plus One (a Mendelsohn game)

Each player picks a number in {1, 2, . . . , n}.
If i = j, payoff is $0.
Zero sum games
Recall: payoff matrices, mixed strategies, safety strategies, Von If |i − j| = 1, the higher number wins $1.
Neumann’s minimax theorem If |i − j| ≥ 2, the higher number loses $2.
Saddle points
15 / 25 16 / 25
Removing dominated pure strategies Removing dominated pure strategies
Example: Plus One Example: Miss-by-one (a Mendelsohn game

Players I and II choose numbers i, j ∈ {1, 2, . . . , 7}.
Player I wins 1 if |i − j| = 1, otherwise the payoff is 0.
Try it!
What is the payoff matrix?
What do you notice about row 5 and row 7?
For Player I, row 5 is always at least as good as row 7.
Player I never needs to play 7:
There is an optimal strategy x ∗ for Player I with x7∗ = 0.
Similarly for columns 5 and 7:
Reduced payoff matrix
Player II never needs to play 5 (can choose y7∗ = 0).
Similarly for row 1 and row 3, and for column 1 and column 3.
Without row 1, columns 3 and 5, we can omit row 4 and column 4.
There is an optimal strategy for each player mixing only 4 numbers.
17 / 25 18 / 25
Outline 2 × 2 games
Finding Player I’s Finding Player II’s

Zero sum games optimal strategy optimal strategy
Recall: payoff matrices, mixed strategies, safety strategies, Von Recall: Pick-a-hand
Saddle points
(Karlin and Peres, 2016) (Karlin and Peres, 2016)
19 / 25 20 / 25
2 × 2 games 2 × 2 games
L R Consider the last two cases.

T c d The optimal strategy for Player I is at the intersection of the lines.
B a b This is an equalizing strategy:
Whatever Player II plays, the expected payoff is the same.
Finding Player I’s optimal strategy Writing x1 = Pr(T ), the equations for the lines are:
a>b>c>d a>b>d>c a>c>b>d a>c>d>b a>d>b>c a>d>c>b
a a a a a a V = b + x1 (d − b),
L L L L L d L
b
cb c c d V = a + x1 (c − a).
R b R R d c
R d b R R
d c db cb
Equating gives
B T B T B T B T B T B T a−b
x1 = .
These represent all games (swap L ↔ R, T ↔ B, or both). a−b+d −c
Saddle points? First four games.
What happens if c → d?
Dominated rows? First, second, third.
Then x1 → 1; the game approaches the fourth case.
Dominated columns? First, third, fourth.
21 / 25 22 / 25
2 × 2 games Solving 2 × 2 games

How to solve a 2 × 2 game
1 Check for a saddle point.
(Is the max of row mins = min of column maxes?) Examples
2 If there are no saddle points, find equalizing strategies.
0 2
1 3
Equalizing strategies satisfy (in standard notation):

3 2
x1 a11 + (1 − x1 )a21 = x1 a12 + (1 − x1 )a22 , 0 1
y1 a11 + (1 − y1 )a12 = y1 a21 + (1 − y1 )a22 .
3 1
Solving gives 0 2
a21 − a22
x1 = ,
a21 − a22 + a12 − a11
a12 − a22
y1 = .
a12 − a22 + a21 − a11
23 / 25 24 / 25
Outline
Zero sum games

Recall: payoff matrices, mixed strategies, safety strategies, Von
Saddle points
25 / 25

06-Handout-2x2 Game Theory University of Kent

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

06-Handout-2x2 Game Theory University of Kent

Uploaded by

Copyright:

Available Formats

Outline for today

Recall: Two-player zero-sum games Recall: Two-player zero-sum games

A safety strategy for Player I is an x ∗ ∈ ∆m that satisfies

min x ∗ > Ay = max min x > Ay .

Recall: Two-player zero-sum games Outline

Von Neumann’s Minimax Theorem

max min x > Ay = min max x > Ay . Zero sum games

LHS: Player I plays a safety strategy.

Suppose Player I plays 2 and Player II plays 2.

Saddle points: Example Saddle points

Is there a saddle point?

= ei>∗ Aej ∗ = max x > Aej ∗ (∗)

Outline Removing dominated pure strategies

Example: Plus One (a Mendelsohn game)

Example: Plus One Example: Miss-by-one (a Mendelsohn game

Finding Player I’s Finding Player II’s

(Karlin and Peres, 2016) (Karlin and Peres, 2016)

L R Consider the last two cases.

2 × 2 games Solving 2 × 2 games

Zero sum games

You might also like