Simultaneous Games-Mixed Strategies: Abdul Quadir Xlri

Simultaneous Games- Mixed Strategies
Abdul Quadir
XLRI
19 December, 2019
Reading: Dixit and Skeath, 7
Rationalizability
I Calculation of Nash equilibrium depends on the following two

factors:
I Compute your best choice given your belief about others’
choices.
I The belief of each player must be correct. (convergence of
expectation)
I Can we justify strategic choices made by player on the basis of
rationality alone?
I The answer is no. See the following example:
L C R
T 0,7 2,5 7,0
M 5,2 3,3 5,2
B 7,0 2,5 0,7
I All nine outcomes are logically conceivable.
Rationalizability
I Calculation of Nash equilibrium depends on the following two

factors:
I Compute your best choice given your belief about others’
choices.
I The belief of each player must be correct. (convergence of
expectation)
I Can we justify strategic choices made by player on the basis of
rationality alone?
I The answer is no. See the following example:
L C R
T 0,7 2,5 7,0
M 5,2 3,3 5,2
B 7,0 2,5 0,7
I All nine outcomes are logically conceivable.
Rationalizability
I Recall that a rational player will not play dominated strategies.

I We can get a similar notion known as never a best response.
I A rational player will not play never a best response.
I Therefore, we can narrow down the set of strategies using this
concept.
I The set of strategies that survive elimination on this ground
are called rationalizable.
I The concept is known as rationalizability.
Example: Rationalizability
I Consider the following game:

L C R
T (3,3) (0,0) (0,2)
M (0,0) (3,3) (0,2)
B (2,2) (2,2) (2,0)
I Note that R is never a best response for column Player and B
is never a best response for row Player.
I The reduces game will look like
L C
T (3,3) (0,0)
M (0,0) (3,3)
Example: Rationalizability

L C R
T (3,3) (0,0) (0,2)
M (0,0) (3,3) (0,2)
B (2,2) (2,2) (2,0)
I Note that R is never a best response for column Player and B
is never a best response for row Player.
I The reduces game will look like
L C
T (3,3) (0,0)
M (0,0) (3,3)
Quantity Competition Between Two Firms
I Consider two firms which produce identical products.

I The price depends on what they produce p = 60 − (q1 + q2 )
I Per unit cost of production is 30 and 36 for firm 1 and 2,
respectively.
I Both the firms are profit maximizer.
I Compute the Nash equilibrium for this game.
I Their profit functions are given by
π1 (q1 , q2 ) = (60 − q1 − q2 )q1 − 30q1

u2 (q1 , q2 ) = (60 − q1 − q2 )q2 − 36q2
I Consider two firms which produce identical products.

I The price depends on what they produce p = 60 − (q1 + q2 )
I Per unit cost of production is 30 and 36 for firm 1 and 2,
respectively.
I Both the firms are profit maximizer.
I Compute the Nash equilibrium for this game.
I Their profit functions are given by
π1 (q1 , q2 ) = (60 − q1 − q2 )q1 − 30q1

u2 (q1 , q2 ) = (60 − q1 − q2 )q2 − 36q2
I Given q2 , firm 1 can maximize its profit by maximizing π1
over q1 .
I F.O.C. implies that
60 − 2q1 − q2 − 30 = 0.
I The best response function for firm 1

q2
q1 = 15 − .
2
I Similarly, the best response function for firm 2
q1
q2 = 12 −
2
I Solving these two equations, we have
q1∗ = 12, q2∗ = 6, p ∗ = 42 π1 = 144, π2 = 36

Best Response Curves
q2
30
q2
q1 = 15 − 2
12
q1
q2 = 12 − 2
15 24 q1
Rationalizability: Quantity Competition Between Two
Firms
I First round: what are the rational beliefs of firm 1 and 2 for
their production ranges?
I From the graph, we see that q1 ∈ [0, 15].
I Thus, firm 2 believes that any negative number or higher than
15 is not possible for firm 1.
I Similarly, q2 ∈ [0, 12] and firm 1 believes that firm 2 will not
produce any negative or higher quantity than 12.
I Second round: Since firm 1 has concluded that firm 2 will not
produce more than 12, then firm 1’s production is
q1 = 15 − 12
2 = 9.
I The best response for q2 = 0 is 15.
I Thus, firm 1’s choice is restricted to the range of [9, 15].
I Similarly, firm 2’s production is restricted to [12, 4.5] to the
best response of q1 = 0 and q1 = 15.
Firms
I First round: what are the rational beliefs of firm 1 and 2 for
their production ranges?
I From the graph, we see that q1 ∈ [0, 15].
I Thus, firm 2 believes that any negative number or higher than
15 is not possible for firm 1.
I Similarly, q2 ∈ [0, 12] and firm 1 believes that firm 2 will not
produce any negative or higher quantity than 12.
I Second round: Since firm 1 has concluded that firm 2 will not
produce more than 12, then firm 1’s production is
q1 = 15 − 12
2 = 9.
I The best response for q2 = 0 is 15.
I Thus, firm 1’s choice is restricted to the range of [9, 15].
I Similarly, firm 2’s production is restricted to [12, 4.5] to the
best response of q1 = 0 and q1 = 15.
Firms
I Third round: If firm 1’s production is q1 = 9,

q2 = 12 − 29 = 7.5.
4.5
I If q2 = 4.5, then q1 = 15 − 2 = 12.75
I Thus, firm 1’ restricted production range now is [9, 12.75]
I In this way the successive round will eventually give you
q1 = 12 and q2 = 6.
Rationalizability
q2
30
q2
q1 = 15 − 2
12
7.5
4.5 q1
q2 = 12 − 2
9 12.7515 24 q1
Rationalizability
q2
30
q2
q1 = 15 − 2
12
7.5
4.5 q1
q2 = 12 − 2
9 12.7515 24 q1
Matching Pennies
I Matching Pennies:
I Consider two players whose payoffs add to zero.
I Consider two players toss two coins. If it matches H and H or
T and T , then Player 1 is paid Rs. 1 by Player 2.
I Otherwise, Player 2 is paid Rs. 1 by player 1.
h t
H (1,-1) (-1,1)
T (-1,1) (1,-1)
I Is there any Nash equilibrium for this game?
I There is no pure strategy Nash equilibrium.
Matching Pennies
I Matching Pennies:
I Consider two players whose payoffs add to zero.
I Consider two players toss two coins. If it matches H and H or
T and T , then Player 1 is paid Rs. 1 by Player 2.
I Otherwise, Player 2 is paid Rs. 1 by player 1.
h t
H (1,-1) (-1,1)
T (-1,1) (1,-1)
I Is there any Nash equilibrium for this game?
I There is no pure strategy Nash equilibrium.
Mixed Strategies
I Suppose players are choosing their pure strategies

unsystematically.
I This means in a sense that they are choosing their pure
strategies randomly.
I Randomization over pure strategies is known as mixed
strategies.
I In other words, mixed strategies are probabilities associated
with each strategies.
I Note that since mixed strategies are probabilities, they must
lie in [0, 1].
I Therefore, there are like the continuous strategies.
I Pure strategies are special case of mixed strategies.
Expected Payoff
I We assume independent randomization among players.

I How one player randomizes over his set of strategies will not
depend on how others randomize.
C D
A (50,50) (80,20)
B (90,10) (20,80)
3
I Suppose Player 1 plays strategy A with probability 4 and
strategy B with probability 14 .
1
I Player 2 plays C with probability 4 and D with probability 34 .
I What are the expected payoffs of players playing their
strategies?
Expected Payoffs
I Row player (Player 1) is playing A with with probability 34 and
strategy B with probability 14 .
I Given this strategy, the expected payoff of Player 2 is given:
3 1
Playing C : × 50 + × 10 = 40
4 4
3 1
Playing D : × 20 + × 80 = 35
4 4

I Against the mixed strategy 43 , 41 for player 1, the best for
Player 2 is to play C .
I Therefore, if Player 2 plays D for the above strategy, the
expected payoff of Player 1:
3 1
× 50 + × 90 = 60 if player 2 plays C
4 4
3 1
× 80 + × 20 = 65 if player 2 plays D
4 4
Indifference Principle
I Note that Player 2 can exploit player 1 by playing C for this

mixed strategy.
I Thus, this is not optimal mix for Player 1.
I The ideal mixed strategy will be where there is no exploitation
of player 1 by player 2 and vice versa.
I This will arise if Player 1 choose a mixed strategy that leave
player 2 indifferent in choosing strategy C and D.
I In other words, player 1 will choose the mixed strategy that
makes the expected payoff of player 2 same for C and D.
I This principle is known as indifference property.
I This is the key for mixed strategy equilibria in non-zero sum
game.
Computation of Mixed Strategy
I To compute exploitation proof mixed strategy, we assign
probability p to the strategy A of player 1.
I Thus, the expected payoff of Player 2 is:
Playing C : p × 50 + (1 − p) × 10
Playing D : p × 20 + (1 − p) × 80
I Player 2 will be indifferent in playing C and D if the expected
payoffs are same for either playing C or D.
I Therefore, we have
p × 50 + (1 − p) × 10 = p × 20 + (1 − p) × 80
I Solving this we get p = 0.7.
I Thus, the exploitation proof mixed strategy for player 1 is
(0.7, 0.3).
I Compute the expected payoff of the mixed strategy (0.7, 0.3)
and compare it with the expected payoffs of pure strategy and
the strategy (0.75,0.25).
Best Response
I Suppose player 2 plays C with probability q and D with

probability (1 − q).
I What is the best response of Player 1 p for the choices of q of
player 2?
I First compute the expected payoff of player 1 if Player 2 plays
C or D.
p × 50 + (1 − p) × 90 if Player 2 plays C
p × 80 + (1 − p) × 20 if Player 2 plays D
I What is the expected payoff of Player 1 against the mixed

strategy q of player 2?
[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

Best Response

player 2?
C or D.

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

Best Response

player 2?
C or D.

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

Best Response

player 2?
C or D.

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

Best Response
Rearranging the above equation
[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q) = 50pq + 90q − 90pq+

80p + 20 − 20p − 80pq − 20q + 20pq
This equates to
70q − 100pq + 60p + 20 = [20 + 70q] + [60 − 100q]p
I Note that we are trying to find p to maximize the Player 1’s

expected payoff given q.
I See the coefficient of p, that is 60 − 100q.
Best Response
I If 60 − 100q > 0, player 1 can increase his expected payoff by

increasing p.
I Thus, if q < 0.6, p = 1.
I If 60 − 100q < 0, player 1 can increase his expected payoff by
decreasing p.
I Thus, if q > 0.6, p = 0.
I For q = 0.6, any p-mix is the best response.
I We can summarize it as:
If q < 0.6, the best response is p = 1.
If q = 0.6, any p-mix is a best response. If q > 0.6, best
response is p = 0.
I This is the best response of Player 1 for Player 2’s strategies.
Best Response Graph
0.6 1 q
Best Response
I Similarly we can compute the best response for player 2.

I The expected payoff of player 2 is:
[50q + (1 − q)20]p + [10q + 80(1 − q)](1 − p)
I Rearranging we get
(80 − 60p) + (100p − 70)q
I Note that if the coefficient of q is positive, then player 2 will

increase q.
I Thus,we have
If p < 0.7, best response is q = 0
If p = 0.7, any q-mix is a best response.
If p > 0.7, best response is q = 1.
Best Response
I Similarly we can compute the best response for player 2.

I The expected payoff of player 2 is:
[50q + (1 − q)20]p + [10q + 80(1 − q)](1 − p)
I Rearranging we get
(80 − 60p) + (100p − 70)q
I Note that if the coefficient of q is positive, then player 2 will

increase q.
I Thus,we have
If p < 0.7, best response is q = 0
If p = 0.7, any q-mix is a best response.
If p > 0.7, best response is q = 1.
Best Response Graph
0.7 1 p
Nash Equilibrium
q
0.6
0.7 1 p

Mixed strategy Nash equilibrium is (0.7, 0.3), (0.6, 0.3) .
Nash Equilibrium
q
0.6
0.7 1 p

Mixed strategy Nash equilibrium is (0.7, 0.3), (0.6, 0.3) .
Mixed Strategy
I We can also use the indifference property and calculate the
optimal mixed strategy q.
I Thus, the expected payoff of Player 1 is:
Playing A : q × 50 + (1 − q) × 80
Playing B : q × 90 + (1 − p) × 20
I Player 1 will be indifferent in playing A and B if the expected

payoffs are same for either playing A or B.
I Therefore, we have
q × 50 + (1 − q) × 80 = q × 90 + (1 − q) × 20
I Solving this we get q = 0.6.

I If probabilities are coming out to be negative or greater than
1, then there is no mixed strategy Nash equilibrium.
Mixed Strategy NE in Coordination Games
I Consider the coordination

X Y
X (1,1) (0,0)
Y (0,0) (3,3)
I Suppose Player 1 and Player 2 are playing X with probability
p and q, respectively.
I What is the mixed strategy Nash equilibria?
I Player 1:
X : 1 × q + (1 − q) × 0 = q
Y : 0 × q + 3 × (1 − q) = 3(1 − q)
I Thus, q = 3(1 − q) implies q ∗ = 43 .

I Similar calculation will yield p ∗ = 34 .
Mixed Strategy NE in Coordination Games
I Consider the coordination

X Y
X (1,1) (0,0)
Y (0,0) (3,3)
I Suppose Player 1 and Player 2 are playing X with probability
p and q, respectively.
I What is the mixed strategy Nash equilibria?
I Player 1:
X : 1 × q + (1 − q) × 0 = q
Y : 0 × q + 3 × (1 − q) = 3(1 − q)
I Thus, q = 3(1 − q) implies q ∗ = 34 .

I Similar calculation will yield p ∗ = 34 .
Best Response
I The expected payoff of Player 1 if player 2 plays X is p and if

he plays Y , then it is 3(1 − p)
I Thus, the expected payoff of Player 1 is
pq + 3(1 − p)(1 − q) = 3(1 − q) + (4q − 3)p
I This implies that If q < 34 , best response is p = 0

If q = 34 , any p-mix is a best response.
If q > 34 , best response is p = 1.
I Since Player 2 is symmetric, his best response is similar to
player 1.
If p < 34 , best response is q = 0
If p = 43 , any q-mix is a best response.
If p > 34 , best response is q = 1.
Best Response
I The expected payoff of Player 1 if player 2 plays X is p and if

he plays Y , then it is 3(1 − p)
I Thus, the expected payoff of Player 1 is
pq + 3(1 − p)(1 − q) = 3(1 − q) + (4q − 3)p
I This implies that If q < 34 , best response is p = 0

If q = 34 , any p-mix is a best response.
If q > 34 , best response is p = 1.
I Since Player 2 is symmetric, his best response is similar to
player 1.
If p < 34 , best response is q = 0
If p = 43 , any q-mix is a best response.
If p > 34 , best response is q = 1.
Best Response Graph for Player 1
0.75 1 q
Best Response Graph for Player 2
0.75
1 q
Nash Equilibria
p
0.75
0.75 1 q
Three Nash equilibria (0, 0), (1, 1), (0.75, 0.75).

Nash Equilibria
p
0.75
0.75 1 q
Three Nash equilibria (0, 0), (1, 1), (0.75, 0.75).

Exercise
1. Compute the Nash equilibria of matching pennies game and

draw the best response functions of both the players.
2. Compute the mixed strategy Nash equilibia of the following
matrix game:
X Y
X (2,1) (0,0)
Y (0,0) (1,2)
Also draw the best response functions of both the players on a
graph.

Simultaneous Games-Mixed Strategies: Abdul Quadir Xlri

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Simultaneous Games-Mixed Strategies: Abdul Quadir Xlri

Uploaded by

Copyright:

Available Formats

Simultaneous Games- Mixed Strategies

I Calculation of Nash equilibrium depends on the following two

I Calculation of Nash equilibrium depends on the following two

I Recall that a rational player will not play dominated strategies.

I Consider the following game:

I Consider the following game:

I Consider two firms which produce identical products.

π1 (q1 , q2 ) = (60 − q1 − q2 )q1 − 30q1

I Consider two firms which produce identical products.

π1 (q1 , q2 ) = (60 − q1 − q2 )q1 − 30q1

I The best response function for firm 1

q1∗ = 12, q2∗ = 6, p ∗ = 42 π1 = 144, π2 = 36

I Third round: If firm 1’s production is q1 = 9,

I Suppose players are choosing their pure strategies

I We assume independent randomization among players.

I Note that Player 2 can exploit player 1 by playing C for this

I Suppose player 2 plays C with probability q and D with

I What is the expected payoff of Player 1 against the mixed

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

I Suppose player 2 plays C with probability q and D with

I What is the expected payoff of Player 1 against the mixed

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

I Suppose player 2 plays C with probability q and D with

I What is the expected payoff of Player 1 against the mixed

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

I Suppose player 2 plays C with probability q and D with

I What is the expected payoff of Player 1 against the mixed

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q)

Rearranging the above equation

[50p + 90(1 − p)]q + [80p + 20(1 − p)](1 − q) = 50pq + 90q − 90pq+

70q − 100pq + 60p + 20 = [20 + 70q] + [60 − 100q]p

I Note that we are trying to find p to maximize the Player 1’s

I If 60 − 100q > 0, player 1 can increase his expected payoff by

I Similarly we can compute the best response for player 2.

[50q + (1 − q)20]p + [10q + 80(1 − q)](1 − p)

(80 − 60p) + (100p − 70)q

I Note that if the coefficient of q is positive, then player 2 will

I Similarly we can compute the best response for player 2.

[50q + (1 − q)20]p + [10q + 80(1 − q)](1 − p)

(80 − 60p) + (100p − 70)q

I Note that if the coefficient of q is positive, then player 2 will

I Player 1 will be indifferent in playing A and B if the expected

I Solving this we get q = 0.6.

I Consider the coordination

I Thus, q = 3(1 − q) implies q ∗ = 43 .

I Consider the coordination

I Thus, q = 3(1 − q) implies q ∗ = 34 .

I The expected payoff of Player 1 if player 2 plays X is p and if

pq + 3(1 − p)(1 − q) = 3(1 − q) + (4q − 3)p

I This implies that If q < 34 , best response is p = 0

I The expected payoff of Player 1 if player 2 plays X is p and if

pq + 3(1 − p)(1 − q) = 3(1 − q) + (4q − 3)p

I This implies that If q < 34 , best response is p = 0

Three Nash equilibria (0, 0), (1, 1), (0.75, 0.75).

Three Nash equilibria (0, 0), (1, 1), (0.75, 0.75).

1. Compute the Nash equilibria of matching pennies game and

You might also like