Professional Documents
Culture Documents
Meeta Malviya
180070034
Electrical
Engineering June
2019
3 Nash Equilibrium 5
3.1 Best Response . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2 Pure Strategy Nash Equilibrium . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.3 Maxmin and Minmax Values . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
5 Utility Theory 9
5.1 Axioms of von Neumann - Morgenstern Utility Theory . . . . . . . . . . . . 9
5.2 The von Neumann - Morgenstern Theorem.................................................................10
We all must have played the game of Rock, Paper, Scissor. If we wish to show it in a form
of game it would look like this:
2
2 Dominant Strategy Equilibrium
2.1 Strongly Dominant Strategy
Given a strategic form game T = (N , (Si)i∈N , (ui)i∈N ), a strategy si ∈ Si of player i is
said to be strongly dominated by another strategy st ∈ Si if,
i
ui(st , s−i) > ui(si, s−i) ∀s−i ∈ S−i
i
This implies that player i will always prefer to play strategy st over strategy si.
i
3
2.4 Very Weakly Dominant Strategy Equilibrium
A strategy profile (s∗, s∗, ..., s∗ ) is called a Very weakly dominant strategy equilibrium of the
1 2 n
game T = (N , ∗
i∈N , (ui) i∈N ) if, ∀i = 1, 2, .., n the strategy s is a very weakly dominant
i
(Si)
strategy for player i.
Consider the example of prisoner’s dilemma,
1/ 2 NC C
NC -2,-2 -5,-2
C -2,-10 -5,-10
The strategy C and NC both are very weakly strategies. Hence each profiles therefore
constitute a very weakly dominant equilibrium.
4
3 Nash Equilibrium
It is a stable state of a system involving the interaction of different participants, in which
no participant can gain by a unilateral change of strategy if the strategies of the others
remain unchanged.
Let us consider working in a joint problem-
1/ 2 Work hard Goof off
Work hard 2,2 0,3
Goof off 3,0 1,1
You and your friend both prefers either to work or goof off together. We see that both
goofing off is an equilibrium because if one of the player decides to play goof off then other
can do nothing better than goofing off, which is not in the case of both working together.
Note: This shows that equilibrium may not always gives the highest payoffs to the players.
Maxmin Value
In a given strategic game form, the strategy that maximizes its worst case payoffs in the
situation where all the other players −i happens to play strategies which cause greatest
harm to i. It is also called security value of a player.Its value is given by:
vi = max min
ui(si, s−i)
si∈Si s−i ∈S−i
5
the action that guarantee this payoff is maxmin strategy.
Minmax Value
In a given strategic game form, player its minmax strategy against −i in a 2 player game is
a strategy that minimizes −its best- case payoff, and the minmax value for i against −i is
payoff. Its value is given by:
vi = min max ui(si, s−i)
s−i ∈S−i si∈Si
whereas if player 2 plays Y then he receives minimum payoff if player 1 plays A. And out
of both player gets maxmin payoff if when strategy profile (A, Y ) is played. In simple
words it is maximum of all minimum that he can get.
Similarly he gets minmax payoff if (B, Y ) is played.
1/2 X Y
A 1,2 2,3
B 4,1 3,5
Minmax Theorem
In any finite, two player zero sum game, in any Nash equilibrium each player receives a
payoff that is equal to both his maxmin and minmax value.
It is quite intuitive if we see matching pennies game. For player 1 and 2 minmax payoff is
-1 and maxmin payoff is 1. Whenever a game is played, someone receives minmax and
other receives maxmin value.
1/ 2 heads tails
heads 1,-1 -1,1
tails -1,1 1,-1
6
4 Mixed Strategies and Mixed Strategy Nash
Equilibrium
Let us see the example of matching pennies:
1/ 2 heads tails
heads 1,-1 -1,1
tails -1,1 1,-1
if we observe that we see that there is no pure strategy Nash equilibrium. So what should
our strategy be? A mixed strategy says, I’m just going to play more than one action with
positive probability rather than fixing on any strategy i.e. playing that action with full
probability.
It is the set of all pure strategies which have non zero probability under mixed strategy
profile σi. It is denoted by δ(σi):
δ(σi) = {si ∈ Si : σi(si)} > 0
7
4.4 Theorem- Nash, 1950
Every finite game has a Nash Equilibrium.
1/ 2 A B
A 2,1 0,0
B 0,0 1,2
A Mixed strategy profile is a Nash equilibrium iff utility of the profile is same for both the
players and is greater that all other utility for different actions.
let us look at the candidate for equilibrium, ((2 , 1 ), (1 , 2 . We see,
3 3 )) 3 3
∗
u (A, σ
1 2 2
1 2) = (2) + (0) =
3 3 3
∗
u (B, σ 1 2 2
1 2) = (0) + (1) =
3 3 3
from the statement stated above we conclude that the suggested candidate is in-fact a
Nash equilibrium.
8
5 Utility Theory
Utilities capture the preferences that the player have for different outcomes in terms of real
numbers thus enabling real valued functions to be used in game theoretic analysis. It
assigns payoffs to action s.t. higher value means more preferable.
where
I:
m
pj ≥ 0 for j = 1, 2...m pj = 1,
and j=1
Axiom 1: Completeness
A set of preferences is complete if, for all pairs of outcomes A and B, the individual prefers
A to B, prefers B to A, or is indifferent between A and B or in other words it means that,
every pair of outcomes is related by the preference relation.
Axiom 2:Transtivity
If a player prefers outcome A to B and outcome B to C, then he prefers A to C.
x1 c:: x2 and x2 c:: x3 =⇒ x1 c:: x3 ∀x1 , x2, x3 ∈ X
Axiom 3:Substitutability
This axiom is often called independence. If x1 ∼ x2, then player is indifferent between
lotteries σ1 and σ2 if probabilities of x1, x2 and all other probabilities of outcomes are same.
Axiom 4:Decomposibility
Also called simplification of lotteries. It states that, if probability of each qutcome is same
for σ1 and σ2 then
Pσ1 (xi) = Pσ2 (xi)∀xi ∈ X =⇒ σ1 ∼ σ2
9
Axiom 5:Monotonicity
Consider a player who strictly prefers outcome x1 to outcome x2.Suppose σ1 and σ2 are two
lotteries over {x1; x2}. Monotonicity implies that the player would prefer the lottery that
assigns higher probability to x1.
Axiom 6:Continuity
The axiom states that if, outcome x1 is strictly preferred to x2 and outcome x2 is strictly
preferred to another outcome x3, then player is indifferent to x2 or [p : x1; 1 − p : x3] for
some probability p ≥ 0
It tells that under certain axioms of rational behavior, a decision-maker faced with risky
(probabilistic) outcomes of different choices will behave as if he or she is maximizing the
expected value.
10
6 Repeated Prisoner’s Dilemma
When we think of most interactions in the world, there’s a lot of them that occurs more
than once. Like in a market place firm we interact, negotiate with our competitors
repeatedly. So, there is a long history and a long future ahead of them. Now when you are
asked to make a strategy, your decision will depend upon how your competitors have
played in past.
6.3 Strategies
6.3.1 Tit for Tat
In this strategy we start from co-operating; if the opponent defects, then we defect in the
next round, then again co-operate.
11
6.3.2 Trigger
we start from co-operating, if opponent defects, then we defect forever, hence the name
Trigger.
given the initial belief player 1 assumes that player two has higher probability to play tails
than heads, similarly player two assumes that player 1 has higher probability to play tails.
Hence, player 1 plays tails and player two plays heads. Now they both update their belief.
And it goes on.
Enforceable
Consider a strategic game, T = (N , (Si)i∈N , (ui)i∈N ) with payoff vector r = (r1, r2, ..., rn)
and vi is minmax strategy that player i−1 plays against player i.
A payoff profile ris enforceable if ri ≥ v¯i
12
Feasible
A payoff profile r is feasible if there exist rational, non-negative values αa such that for all
�
i, we can express ri as α∈A αui(a) with σα∈Aαa = 1
Folk Theorem
Consider any n-player game G and any payoff vector (r1, r2, ..., rn).
1. If r is the payoff in any Nash equilibrium of the infinitely repeated G with average
rewards, then for each player i, ri is enforceable.
2. If r is both feasible and enforceable, then r is the payoff in some Nash equilibrium of
the infinitely repeated G with average rewards.
13
7 Bayesian Probability Theorem and Bayesian Games
Everyone uses sentences like, it might rain today, I may not be able to make-up. We all use
probability everyday in our life, to make a guess about our actions for which we are not
sure of.
• if both events are independent, eg. raining today and it being Sunday day, then:
P (A ∩ B) = P (A)P (B)
• DeMorgan’s Laws
P (A ∩ B) = P (A ∪ B)
P (A ∪ B) = P (A ∩ B)
14
7.3 Bayes’ Rule
it gives us a way to express one conditional probability in terms of other.
P (B|A).P (A)
P (A|B) = P (B|A)P (A) + P (B|A)P (A)
In this Bayesian game, the player 1 knows whether player 2 is a-type or b-type but player 2
does not know which type he has to play. Here player 1 has extra an information that
player 2 does not have, this is an example of incomplete information game.
15
probability by which each type is going to occur. So, we calculate the utility of playing left
and right. Which equals to-
When it is a-type game then player 1 will play down, and when it is b-type he’ll play up.
when player 2 makes a strategy of playing left-
3p + 2(1 − p)
0 + 4(1 − p)
if we compare both-
3p + 2(1 − p) = 0 + 4(1 −
p)
2
p=
5
player 1 will play left if the probability of a-type is more than 2
and play right if the
5
probability of a-type is less than 2
5
16
8 Coalitional Games
Cooperative games are often analysed through the framework of cooperative game theory,
which focuses on predicting which coalitions will form, the joint actions that groups take
and the resulting collective payoffs.
A cooperative game with transferable utility is defined as the pair (N, v) where
N = {1, ..., n} is a set of players and v : 2N → R is a characteristic function, with
v(∅)41 = 0. We call such a game also as a game in coalition form, or game in
characteristic form, or coalitional game, or TU game.
Feasible Allocation
An allocation x = (xi, .., xn)is said to be feasible for the a coalition C if
I:
xi ≤ c(C)
i∈C
this means players in C can achieve their respective allocations in x by dividing among
themselves the worth v(C) that they would earn by cooperating.
Rational Allocations
A payoff allocation x = (x1, .., xn) is said to be coalitionally rational if
I:
xi ≥ v(C) ∀C ⊆ N
i∈C
it is also individually rational, as an individual will cooperate only when he gets a payoff
that is equal or more of that he would get if he plays without cooperation.
17
Example 1:Divide the Dollar Game
For the Divide the Dollar game(Version 1), C(N, v) is given by.
{(x1, x2, x3) ∈ R3 : x1 + x2 + x3 = 300; x1 ≥ 0; x2 ≥ 0; x3 ≥ 0}
The core treats the three players symmetrically and includes all individually rational.
In Version 2 of the game (where players 1 and 2 by themselves can earn 300 without the
involvement of player 3), the core will be
{(x1, x2, x3) ∈ R3 : x1 + x2 = 300; x1 ≥ 0; x2 ≥ 0; x3 = 0}
If the players get a nonzero allocation whenever any two players suggest the same
allocation (Version 3 of the game - three person majority game), it can be seen that the
core is empty. The following two insights provide a rationale for the emptiness of the core
in this case. First, when any player i obtains a positive payoff in a feasible allocation, the
other two players must get less than 300 which they would be able to gain by themselves.
Second, regardless of what the final allocation is, there is always a coalition that could gain
if that coalition gets one final opportunity to negotiate effectively against this allocation.
Example 2: Gloves
Mr A and Mr B are knitting gloves. The gloves are one-size-fits-all, and two gloves make a
pair that they sell for Ru 5. They have each made three gloves. How to share the proceeds
from the sale? Each man has three gloves, that is one pair with a market value of Ru5.
Together, they have 6 gloves or 3 pair, having a market value of Ru 15. Since the singleton
coalitions (consisting of a single man) are the only non-trivial coalitions of the game all
possible distributions of this sum belong to the core, provided both men get at least Ru 5,
the amount they can achieve on their own. For instance (7.5, 7.5) belongs to the core, but
so does (5, 10) or (9, 6).
18
8.3 The Shapley Value
The Shapley value is a popular solution concept in cooperative game theory that provides a
unique allocation to a set of players in a coalitional game.The Shapley value faithfully
captures the marginal contributions of the players in deciding the allocations.
Given a coalitional game, the core may be empty or may be very large or even uncountably
infinite. These certainly cause difficulties in getting sharp predictions for the game. The
Shapley value is a solution concept which is motivated by the need to have a solution
concept that would predict a unique expected payoffs allocation for every given coalitional
game.
It is denoted by φ(N, v) = (φ1(N, v), .., φn(N, v)), where φi(N, v) is the expected payoff of
player i. We can write it simply ass φi(v).
Axiom 2: Linearity
Given any two coalitional games (N ; v) and (N ; w), any number p ∈ [0; 1], and any player
i∈ N,
φi(pv + (1 − p)w) = pφi(v) + (1 − p)φi(w)
In other words, the Shapley value of a player for a convex combination of coalitional games
is the convex combination of Shapley values of the player in the individual games. This
axiom asserts that the expected payoff to each player is the same before resolution of
uncertainty and after resolution of uncertainty.
Axiom 3: Carrier
This axiom asserts that the players in a carrier set should divide their joint worth (which is
equal to the worth of the grand coalition) among themselves. This means the dummies are
allocated nothing. The above expression also illustrates a key fact that the Shapley value
always divides the worth of the grand coalition among the players of the game.
19
Calculation of Shapley Value
There is exactly one mapping φ : R2n−1 → Rn that satisfies Axiom 1, Axiom 2, and Axiom
3. This mapping satisfies: ∀i ∈ N ; ∀v ∈ R2n−1
I: |C|!(n − |C| − 1)!
φi(v) = {v(C ∪ {i}) − v(C)}
C⊆N−i n!
The term |C|!(n−|C|−1)! can be interpreted as the probability that in any permutation, the
n
members of C are ahead of a distinguished player i. The term v(C ∪ {i}) − v(C) gives the
marginal contribution of player i to the worth of the coalition C. Thus the above formula
for φi(v) gives the expected contribution of player i to the worth of any coalition.
20
9 Game theory in real life problems
We all have our personal reasons for our choices. We choose according to the situation or
what we think is right for us and sometimes for others too. The aim of doing is what is
best for yourself. In economic terms, you are “best-responding” to other people’s actions in
a purely individual and self-interested way.
Evolution
Humans usually imitate other people in living and survival. Evolution is a popular
application of game theory; for example, people follow the trends and strategies for
survival. Survival not only depends upon fitness but instead also depends upon evaluating
how others in the same community are faring based on their actions. Because of this, it is
important to figure out how certain survival strategies come to be adopted
Bargaining Problems
We buy commodities, negotiate with people every day, we have our belief of how much a
particular thing cost and so does others. We negotiate to come to a point where we both
agree upon. We try to maximize our payoffs. Here payoffs does not depend upon money,
we also consider the perks and conditions with it.
21
chess is an example of game theory as both players know the possible moves and the effects
of those moves.
Same is the case with any other game, all players know the rules of games, and they make
strategies that will help then win the game.
War Strategies
These war strategies and military decisions are examples of game theory. In general, the
military head or commander select the course of action which offers the most significant
promise of success in view of the enemy’s capabilities of opposing him.
22