Professional Documents
Culture Documents
M 4, 1 8, 4 3, 6
D 4, 0 9, 6 6, 8
So, U and D are both best responses to player B’s
strategy to play R. While for strategy L?
Best response
Claim: a rational player who believes that the
opponents are playing some s-i ∈ S-i , will
always choose a best response to s-i
Theorem: if si ∈ Si is a strictly dominated
strategy, it is no best response to any s-i ∈ S-i
Proof: there must be si' ∈ Si dominating it
It is immediate to see that the definition of best
response applied to si is violated by si'
Beliefs
A belief of player i is a possible profile of
opponents’ strategies, ie., an element of set S-i
Beliefs are connected to best responses!
We define a best-response-correspondence
BR: S-i → 𝓅(Si ) that associates to s-i ∈ S-i a
subset of Si such that each si ∈ BR(s-i ) is a best
response to s-i
This is not a function: but BR(s-i ) can be a
singleton (if the best response is unique)
Nash equilibrium
the key tool of game theory
Nash equilibrium
We want to strengthen the dominated
strategy concept with this idea in mind:
Game theory should make predictions about the
outcome of games played by a rational players
A prediction is correct if the players are willing
to play their predicted strategy
That is, players choose their best response
to the predicted strategy of the others
If this happens, the prediction is said to be
self-enforcing (or also strategically stable)
Formal definition
In a n-player game G = {S 1,…,Sn ; u 1,…,un },
strategies (s 1*,…,sn *) are a Nash equilibrium
if, for any i , si * is the best response of player i
to (s 1*,…,si -1*,si +1*,…,sn *)
That is, ∀si ∈ Si :
ui (s 1*,…,si -1*, si *,si +1*,…sn *)
ui (s 1*,…,si -1*, si ,si +1*,…sn *)
M 1, 0 4, 3
D 0, 7 2, 0
M 4, 0 0, 5 7, 3
D 3, 7 3, 7 9, 9
Even
0 1
0 -4, 4 4, -4
Odd
1 4, -4 -4, 4
back to Example 5
(R,R) and (S,S) are both Nash equilibria
This reflects our previous intuition
Brian
R S
R 2, 1 0, 0
Ann
S 0, 0 1, 2
then (s 1*,…,sn *) is a NE
1 5, -5 -5, 5 -5, -5
P -5, -5 -5, -5 -5, -5
NE vs. Pareto efficiency
Pareto efficiency is different from NE:
Pareto efficiency: no way (in the whole game) a user
can improve without somebody else being worse
Nash equilibrium: no way a user can improve with a
unilateral change
M Bob F
The outcome of the
Prisoner’s Dilemma M -1, -1 -21, 0
is not “efficient!” Al
F 0, -21 -20, -20
These strategies are
Pareto efficient