1 Lecture 1

1
Lecture 1
1.1
Mixed Strategies, Dominance and Never a Best
Response
1. Defn: A normal-from (or strategic-form) game consists of three components:
a set of players N ,
for each player i 2 N , a set of strategies Ai ,
for each player i 2 N , a utility function ui : A ! < that maps
each outcome a (a1 ; :::; an ) to a real number.
2. In a simultaneous-move game like the battle of the sexes the set of
strategies is the same as the set of feasible actions. But the concept
of strategy is applicable to games that involve multiple stages as well.
Example: Sequential Battle of the Sexes. Here, the second mover has
two feasible actions but four strategies. The point is: a strategy is what
one can do, and what one can do depends on what one knows as well
as the set of actions one can choose from.
3. Defn: A mixed strategy for player i, i , is a probability dist. over Ai ,
and i (x) is the probability that player i chooses strategy x 2 Ai . We
use i or Ai to denote the set of mixed strategies.
4. Eg. Suppose Ai = fa1i ; a2i ; a3i ; a4i g. Then a mixed strategy can be denoted by a vector, like, (0:1; 0:2; 0:3; 0:4). The pure strategy of choosing
a1i is denoted by (1; 0; 0; 0).
5. Defn: Each mixed strategy pro…le
( 1 ; :::; n ) induces a prob dist.
over A. The expected utility of player i is
X
( j2N j (aj )) ui (a) .
ui ( ) =
a2A
6. Defn: A strategy ai is dominant for player i if for all a
a0i 2 Ai =ai
ui (ai ; a i ) > ui (a0i ; a i ) :
i
2A
i
and
If a strategy dominates all a0i 2 Ai =ai , it dominates all mixed strategies
i 2 i =ai .
1
7. Defn: A strategy is strictly (weakly) dominated if there exists
such that for all i 2 i
ui ( i0 ;
i)
> ( ) ui ( i ;
0
i
6=
i
i) :
Note that 1) a mixed strategy is strictly (weakly) dominated if any
pure strategy in its support is strictly (weakly) dominated. 2) A pure
strategy may be strictly dominated by a mixed strategy and not by any
pure strategy.
8. For example:
a 1
0
b 0
1
c 0:3 0:3
c is dominated by mixing 50-50 a and b.
9. Defn: a mixed strategy
i 2
i such that
i
is never a best response if there exists no
ui ( i ;
for all
0
i
2
i)
ui ( i0 ;
i)
i.
10. It is obvious that a strictly dominated strategy is never a best response.
The question is whether there are strategies that are never a best response but yet not dominated.
11. Proposition 1. In two-person games, a strategy is strictly dominated if
and only if it is never a best response.
12. The proposition is true only when the number of players is two. When
there are more than two, the in the above proof need not be a mixedstrategy pro…le. In general a undominated strategy need not be a best
response.
13. The proof is provided at the end of this section. Here, I provide an
outline of the argument. Consider the following game:
l
r
a 1
0
b 0
1
c 0:3 0:3
d 0:6 0:5
2
(I only include the payo¤ to the row player.) Since the column player
has only two strategies, we can plot the payo¤ in a diagram. In Figure
1 each dot represents the payo¤ for one of the pure strategies of the row
player. A mixed strategy is a convex combination of pure strategies.
Hence, the entire payo¤ set, denoted by A, is the convex hull of the
four points. Each point in the set correspond to the payo¤ of a mixed
strategy.
14. A mixed strategy is not dominated if it is on the north-east frontier
of the payo¤ set. For example d is not dominated— the set of payo¤s
that strictly dominates d, denoted by B, and the feasible payo¤ set is
disjoint. Since both sets are convex, there exists a hyperplane h passing
through d that separates the two sets. From college algebra, we know
that each hyperplane is characterized by a normal vector = ( 1 ; 2 )
and a constant c such that for all x 2 <2
8
< > c if x 2 A
:x = c if x 2 h :
:
< c if x 2 B
Finally, notes that has positive slope, so that each component of
is non-negative, and we can normalize so that 1 + 2 = 1. It then
follows that d is a best response against .
15. Separating Hyperplane Theorem (Mas-Colell, Whinston, and Green
pp.948): Suppose that the convex sets A, B <N are disjoint. Then
there is p 2 <N with p 6= 0, and a value c 2 <, such that p:x c for
every x 2 A and p:y c for every y 2 B: That is, there is a hyperplane
that separates A and B, leaving A and B on di¤erent sides of it.
16. Proof of Proposition 1 (never best response=strictly dominated): We
will show that if ai is not strictly dominated, it must be a best response
against some j . Suppose #Sj = m. For each ai 2 Ai , de…ne
v(ai )
(ui (ai ; a1j ); ui (ai ; a2j ); :::; ui (ai ; am
j )):
The k component of vector v(ai ) is the payo¤ for player i when he
cofv(x) : x 2
chooses ai and his opponent chooses akj . De…ne V ai
m
Ai =ai g and W
fx 2 R jx > v(ai )g: If ai is not strictly dominated,
then V ai \ W = ;: By the supporting hyperplane theorem, there
3
exists 2 Rm with k k = 1 such that 8x 2 V ai and 8y 2 W; :y
:v (ai )
:x. All components of must be non-negative. If i < 0
then :y < c for any c if i-th component of y is su¢ ciently large and
all other components of y is bounded from above. Hence, ai is a best
response to the mixed strategy j
.
1.2
Nash Equilibrium
1. Defn: A mixed strategy pro…le
i 2 N and for all i0 2 i
ui ( i ;
is a Nash Equilibrium (NE) if for all
i)
ui ( i0 ;
i) .
2. Some …nite games like Matching Pennies have no pure strategy equilibrium, but every …nite game has a mixed strategy NE.
3. Issue 1: The Nash equilibrium implicitly assumes that players knows
that each player is to play the equilibrium. Given this knowledge,
no player wants to deviate. So, there is a sort of circularity in this
concept— the players behave in this way because they are supposed to
behave in this way.
4. The mutual understanding required by Nash equilibrium can be justi…ed in several ways: the players may reach a self-enforcing agreement
to play this way through pregame communication. For examples: you
may agree with a friend to meet at a particular restaurant for dinner. 2. A steady-state convention may be a result of some dynamic
learningnevolutionary process. Examples: we usually takes nodding
your head to mean yes and shaking your head means no. 3. In coordination games, certain equilibrium just “stands out”as a focal point.
5. Issue 2: Note that a player is willing to mix between two strategies if
she is indi¤erent between them. (We use this condition to …nd equilibrium strategies.) One may wonder why should a player mix speci…cally
according to the equilibrium probabilities?
6. There are several responses to this questions. See OR for details.
7. Issue 3: Nash equilibrium requires each strategy only be a weak best
response, not strict. It only requires that no player has any incentive
4
to deviate, it doesn’t require that the players are strictly better o¤
following the equilibrium. In the following game:
a
b
c 0; 1 1; 1
d 100; 0 1; 0
there are two equilibria: (c; b) and (d; a). One could argue that in the
…rst equilibrium, the row player should play d rather than c, because
d is as good as c against b and strictly better against a. But this
argument requires an argument that goes beyond Nash equilibrium—
in the equilibrium (c; b), the row player is certain that the column player
is going to play b, it would not matter that d would do better against
a.
8. We may argue that there is always some very small probability that
the row player may make a “mistake” and choose a even when he
intends to choose b. One way to formalize this idea is to require that
an equilibrium be “robust,” meaning that it remains an equilibrium
even when the game is “slightly”changed.
9. If we replace the weak inequality sign in the de…nition with the strict
inequality sign, it is called a strong Nash equilibrium. The problem
is that strong Nash equilibrium may not exist; e.g.; there is none in
Matching Pennies.
1.3
Common Knowledge of Rationality
1. N.E. requires that each player has the correct expectation of his opponents’ strategies. This is unrealistic in one-shot games without prior
communication.
2. But even when we are not sure about players’expectations, we know
that a rational player will always choose a best response against some
belief. Hence, if a player is con…dent that his opponent is rational, he
should choose a strategy that is a best response against strategies of
his opponent that are themselves best responses (to his strategies).
3. If it is common knowledge that both players are rational, we can continue this process inde…nitely. The set of strategies that survive this
process is called rationalizable.
5
4. Example 1: Consider the following game (Gibbons pp. 6) :
Player 2
L
M R
Player 1 U 1,0 1,2 0,1
D 0,3 0,1 2,0
5. Note that R is strictly dominated by M for Player 2. Now, if Player 1
knows that Player 2 is rational, then Player 1 knows that Player 2 will
never choose M. If M is eliminated, then D becomes dominated by U.
Now, if Player 2 knows Player 1 knows that Player 2 is rational, then
Player 2 knows that Player 1 will not choose D. In that case, Player 2
should choose M.
6. Formally, if player j thinks that player i is rational, player j expects
player i to choose a strategy from the set
n
o
Y
0
0
;
)
8
2
A=
2
j9
2
s:t:
u
(
;
)
u
(
i
i :
i
i
i
j6=i j
i i
i
i i
i
7. But player j does not know exactly which strategy player i may choose.
It is possible, for example, that player j thinks that there is 10% chance
that player i’s strategy is a and 90% chance that player i’s strategy is
b. In a …nite game, this is equivalent (in terms of payo¤) to player j
expecting player i to play a mixed strategy of choosing a with probability 0.1 and b with probability 0.9. Hence, if player j is rational and
he knows that player i is rational, then he may choose a strategy that
is a best response against a player i’s strategy in the convex hull of A.
8. Defn. Let e 0i
en
i
f
i
i
2 e ni 1 j9
and, for each i, de…ne recursively
i
2
j6=i co(
e n 1 ) s:t: Ui ( i ;
j
i)
Ui ( i0 ;
i)
8
e n is the set of strategies that we think player i may choose if every
i
player knows that every player knows that.....every player is rational
for n times. The set of rationalizable strategies for player i is de…ned
en
as: Ri \1
n=0 i :
9. R = (R1 ; :::; Rn ) captures the restriction imposed by common knowledge of rationality. See OR ch.5 for a formal defn. of common knowledge.
6
0
i
2 e ni 1 g:
10. Note that the de…nition we use here is di¤erent from that of OR in that
we require that i be a best response to some mixed strategies. This
rules out any correlation between strategies of the other players. This
is the way Bernheim and Pearce originally de…ne the concept, and it is
consistent with the (implicit) assumption that it is common knowledge
that players choose strategies independently. If players have access to
random devices, then correlated rationalizaibility (as de…ned in OR) is
the proper solution conception.
11. In two person games a strategy is never a best response i¤ it is strictly
dominated. Hence, in two person games, iterated deletion of neverbest-responses is the same as the convex hull of pure strategies that
survive iterated deletion of strictly dominated strategies. In n person
games, n > 2, a never-best-response may be undominated. In that
case, iterated deletion of strictly dominated strategies is equivalent to
correlated rationalizability and is a weaker than rationalizability (in the
sense that the set of rationalizable strategies is a subset of the strategies
that survives iterated deletion of strictly dominated strategies).
12. In our example (U,M) is also the unique Nash equilibrium. In general,
any strategy is part of a Nash equilibrium is rationalizable. But the
converse is not true. Iterated deletion of strictly dominated strategies
is strictly weaker than Nash.
13. Thm 1. Non-emptiness. Ri is non-empty and contains at least one pure
strategy.
14. Outline of the proof: e ni are closed for all n, and countable intersections
of closed sets in non-empty. (Another way to see that is to recognize
that N.E are always contained in R. Thus, R is non-empty whenever
equilibrium exists.) If i 2 Ri , then all pure strategies that i put
positive probability on must also belong to Ri .
15. Find the set of pure strategies that survive iterated deletion of strictly
dominated strategies.
b1 b2 b3 b4
a1 1,4 4,1 3,3 4,2
a2 2,3 6,0 2,2 2,2
a3 3,1 2,7 2,2 2,4
a4 6,6 3,5 1,3 1,4
7
1.3.1
Example: Cournot Oligopoly
1. There are two identical …rms in a market. Let q1 and q2 denote the
outputs of …rms 1 and 2, respectively. The market demand is: 1 q1 q2 .
The marginal cost of production is c.
2. Firm 1’s pro…t function is given by
1
Di¤erentiate
for …rm 1:
1
= (1
q1
q2
c)q1 :
with respect to q1 , we obtain the best response function
c q2
:
2
Similarly, the best response function for …rm 2 is
q1 (q2 ) =
q2 (q1 ) =
1
1
c
2
q1
:
3. Solving the equations, we obtain the Nash equilibrium:
q1 = q2 =
1
c
3
:
4. Let’s consider iterated deletion of strictly dominated strategies. Since
q1 is strictly decreasing in q2 , a rational …rm will never produce more
than q1 (0) = 1 2 c . Thus, the set of strategies that remain after one
round of deletion, Q1 , is given by 0; 1 2 c : Now if a …rm expects its
competitor is rational and will not produce more than 1 2 c , it will not
produce less than q ( 1 2 c ) = 1 4 c . You can see it from the diagram that
the process will converge to the unique Nash equilibrium.
i
h
5. More formally: If Qn is equal to q n ; q n , then Qn+1 will be equal to
h
i
h
i
qn 1 c
1 c qn 1 c qn
qn
1 c
n+2
;
and
Q
will
be
equal
to
+
;
+
. Thus,
2
2
4
4
4
4
we have the following recursive relations:
q n+2 =
and
q n+2 =
8
1
c
4
1
c
4
+
+
qn
4
qn
:
4
Working recursively backward, we obtain :
qn =
1
and
qn =
Set q 0 = 0 and q 0 =
q1 = 1 3 c :
1 c
2
1
n
1
n
n
1
n
2
cX
1i
12
+ q0
4 i=0 4
4
2
cX
1i
12
+ q0 :
4 i=0 4
4
and let n goes to in…nity. We obtain q 1 =
6. The argument will not work when there are three …rms because …rm
1 may expect …rms 2 and 3 each produces an output of 1 2 c and, in
that case, …rm 1 best response is q (1 c) = 0. Thus, no strategy will
be deleted after the …rst round and any output between 0 and 1 2 c is
rationalizable.
1.3.2
Order Independence
1. An important property of rationalizability is that it is order independence—
it does not matter which order we take to do the deletion. If some strategy should be eliminated in round k, then were it put back in the set
would also be eliminated in each subsequent round j, for all j k:The
solution concept would not make sense if this does not hold. Why?
2. Thm 4. Let An denote the set of strategies that survive n round of
iterated deletion of dominated strategies. If ai is strictly dominated
with respect to An , then it is strictly dominated with respect to Am ,
for all m > n:
3. Proof. A pure strategy ai is strictly dominated with respect to An
means that there exists some in 2 co(Ani ) such that
ui (
n
i ; a i)
> ui (ai ; a i ) 8a
i
2 An i :
n
Now, for all m n, if in 2 co (Am
i ), then i strictly dominates ai in
round m as co(Am
co(Ani ). If in 2
= co (Am
i )
i ) ; then it must be that
in some round k between n and m; some pure strategies in the support
9
of in are strictly dominated by some mixed strategies in co(Aki 1 ).
Construct a new strategy ik by replacing each of the pure strategies
eliminated by the mixed strategies that dominates it. Since each of
these mixed strategies is in co(Aki 1 ), ik is also in co(Aki 1 ). Moreover,
since strict dominance is transitive, ik strictly dominates ai . If ik is
in co (Am
i ), the proof is done. If not, repeat the same procedure until
round m is reached.
1.4
Rationality and Coherent Theories
1. (I have made up the terminologies in this part. Gul uses di¤erent
terminologies but his are less convenient.)
2. Nash equilibrium is a theory or prediction about the outcome of a
strategic situation. Here we consider some general properties of any
re…nements of common knowledge of rationality.
3. Defn. A collection of mixed strategies X
(X1 ; :::; Xn )
are called self-justi…able if 8 i and 8 i 2 Xi ; 9 i 2 X i s.t.
:::: n
2 br( i ):
1
i
4. The de…nition says that if X is self-justi…able then every strategy in it
can be justi…ed as a best response to some other strategy in X:
5. Any Nash equilibrium is self-justi…able. More generally, any solution concepts that is consistent with rational behavior should be selfjusti…able.
6. Defn. A collection of mixed strategies X
(X1 ; :::; Xn )
are called self-contained if 8 i and 8 i 2 X i s.t. i 2 br(
1
i)
::::
Xi .
n
7. A set of strategies are self-contained if it includes all best responses
against itself. A solution concept that yields self-contained predictions exhausts all implications of rational behavior— in the sense that
if everyone behave according to the way the solution concept predicts,
then no rational person would want to behave di¤erently.
8. If a set of strategies is not self-contained, then it imposes additional
restrictions beyond common knowledge of rationality.
9. A Nash equilibrium may not be self-contained (e.g the one in Matching
Pennies); a strong Nash equilibrium is.
10
10. Thm 2. R is the largest set of strategy pro…les that are self-justi…able.
11. The procedure introduced in de…nition 1 eliminates all strategies that
are not best response iteratively. The theorem says that a strategy that
is not eliminated by this procedure must be justi…able by some other
strategies that are also not eliminated by this procedure.
12. Outline of the proof: Any X that is justi…able cannot be eliminated by
the procedure, and thus must belong to R: To see that R is justi…able,
note that the procedure must stop after a …nite number of iterations.
13. Thm 3. R is self-contained.
14. Proof: If X is self-justi…able and i is a best response to some strategy
in X i , then f i g [ X must be self-justi…able: If R is not self-contained,
then it could not the largest set of self-justi…able strategies.
15. This simple theorem means that R exhausts all implications of common
knowledge of rationality.
1.5
Dominance and Iterated Deletion of Weakly Dominated Strategies
1. We have already seen that Nash equilibrium does not rule out weakly
dominated strategies. For the same reason, they are also not ruled out
by rationalizability. For example:
l
r
a 0; 1 1; 0 :
b 2; 0 1; 0
2. In this game, l and r are both best response against b, and a and b are
both best response against r. But r weakly dominates l, and b weakly
dominates a.
3. There seems to be a strong intuition ruling out weakly dominated
strategies. We would think that the likely outcome is r; b.
4. The intuition is formalized by assuming that each player believes that
there is some " > 0 probability (for some arbitrarily small ") that his
11
opponent may be irrational and would play any strategy with strictly
positive probability. We then carry out the iteration process for the
strategies of the rational players. The outcome is the set obtained by
deleting all weakly dominated strategies and then deleting all strictly
dominated strategies iteratively. (Gul, 1996 and slightly di¤erently
Dekel and Fudenberg, 1990)
5. It is important to note that only strictly dominated strategies are eliminated after the …rst round, and so the procedure does not justify iterated deletion of weakly dominated strategies.
6. For example
U
D
L
C
R
10000; 2 1; 2 1; 3 :
0; 5
1; 5 0; 1
7. In this example: there are three pure strategy equilibria: (D; L), (D; C),
(U; R). All strategies are rationalizable.
8. L is weakly dominated by C. After deleting L, D is weakly dominated
by U . So the outcome surviving iterated deletion of weakly dominated
strategies is (U; R). But I will argue that (D; C) is not unreasonable.
The standard reason for deleting D is that even though it is a weak
response against C is that it is worse for the row player against R. But
if the row player thinks there is always a chance that the column will
play R, then he might also think there is a small chance that he may
play L in which case U will be a terrible choice.
9. Furthermore, deletion of weakly dominated strategies is based on the
idea that there is always some small probability that any strategy may
be chosen (with " probability). On the other hand, the idea of iterated
deletion is based on the idea that strategies deleted in earlier rounds
will never be chosen for sure. In this sense, iterated deletion of weakly
dominated strategies combine two con‡icting ideas.
10. Another problem is that the iteration of weakly dominated strategies
does not satisfy order-independence. For example, in the …rst game
if we …rst eliminates l, then a and b will both survive. On the other
hand, if we …rst eliminates a, then both l and r survive.
12
1.6
Level-k Models
1. Nash and IDSDS are based on di¤erent logic. IDSDS does not require
that the players know that the equilibrium is going to be played, so it
requires less coordination. However, common knowledge of rationality
is itself a very strong assumption. Nash equilibrium, on the other hand,
only requires that the players have mutual knowledge of rationality. See
OR Ch. 5. We will return to the relation between rationalizability and
Nash equilibrium next week.
2. Consider the p-beauty contest game: There are n players; each is to
choose a number between 0 and 100. The person whose number is
closest to p times the group mean wins a prize. If more than one
person chooses the closest number, then every one who does share the
prize with equal probability.
3. If p < 1, then the unique rationalizable strategy pro…le (hence, the
unique NE) is for everyone to choose 0. For example, suppose p = 2=3.
Then, any number greater than 200=3 will never be a best response.
If a person is rational and knows that every one is rational, then it is
not a best response for this person to choose any number greater than
400/9. And so on.
4. There is a great number of paper that conduct this game experimentally, and the results consistently show that 1) a greater majority of
players do not choose zero, and 2) zero is actually a poor choice empirically.
5. Instead, the results points to the prevalence of the iterated-best-response
rule— there is a spike in the number of people choosing 50p and 50p2 .
Empirically, one tends to do well by thinking two steps ahead or slightly
more in this game.
6. Two observations: 1. Common knowledge of rationality is too strong.
Most likely players do not think in terms of higher order beliefs. 2.
Not all best responses are equally likely. 50p is much more likely than
0 even though both are possible best response.
7. Vince Crawford and others have developed a level-k model and applies
to a variety of settings.
13
8. Level 0 player plays non-strategically. Depending on situations, they
may be choosing randomly (in the p-beauty contest) or non-weaklydominated strategies (like bidding a randomly chosen amount that is
less than the value of the good in an auction). Level one player chooses a
best response against level zero players. More generally, level n players
best respond against level n-1 players.
9. Unlike NE, a level-k model is a non-equilibrium theory.
10. Crawford and his collaborators argue that this model …ts the experimental data of many non-repeated one-shot games better than other
alternatives. My view is mixed (though I haven’t read the literature
carefully. The reason is that the level zero behavior is arbitrary, so
there is scope to rationalize any data by choosing the level zero behavior judiciously.
11. Colin Camerer has a related model in which a higher level chooses a
best respond against a mixture of lower level players.
12. References: Bosch-Domenech et. al., 2002, “One, Two, (Three), In…nity,...: Newspaper and Lab Beauty-Contest Experiments”AER Vol.92.
13. Camerer et. al., 2004, “A Cognitive Hierarchy Model of Games,”QJE
August.
14. Crawford’s paper can be found on his personal webpage.
14