Sapienza University of Rome. Ph.D. Program in Economics a.y. 2012-2014 Microeconomics 2 – Game Theory Lecture notes 1 . Simultaneous-move games: Solution in (iterated) strict dominance and rationalizable strategies 1.1 Simultaneous-move games in normal form: definition and notation 1.1.1 Games in pure strategies 1.1.2 Games in mixed strategies 1.2 Alternative approaches to the determination of the equilibrium 1.3 Strict dominance 1.3.1 Strict and weak dominance: definitions 1.3.2 The prisoners’ dilemma: solution in strict dominance 1.4 Iterated strict dominance 1.4.1 Iterated elimination of strictly dominated strategies 1.4.2 Iterated weak dominance 1.5 Rationalizable strategies 1.5.1 Approach and definition 1.5.2 Iterated elimination of non rationalizable strategies 1.5.3 Relation between the set of strictly undominated and rationalizable strategies The solution, or equilibrium of a game is a prediction that we make as to the choices that the agents, involved in the strategic situation described by the game, will make. Two keyu assumptions enter in the formulation of this prediction. We first is that players behave rationally; we generally attach to this assumption the specific meaning that players choose an action that maximizes their payoff. But in a context of strategic interaction the best action of any player would depend on the actions of the other players. We further assume, therefore, that each player formulates a conjecture of the other players’ choices and evaluates, on this basis, the expected payoff of his alternative strategies. Players’ rationality and beliefs of other players’ choices have a decisive role in the solution concepts of simultaneous-move games that we are going to examine in these Lecture Notes. In increasing order of the deductive and introspective capacities asked of the players, these are basically the following: - strict dominance; - iterated strict dominance; - rationalizable strategies surviving the iterated elimination of non rationalizable strategies; - Nash equilibrium. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.1 This Lecture Note is dedicated to the presentation and the critical implications of the solutions based on the application of the notions of dominance and rationalizability (sections 1.3-1.5). Nash equilibrium is analyzed in Lecture Note n. 2; extensions and refinements of Nash equilibrium in static games are presented in separate Lecture Notes. Sections 1.1 and 1.2 contain some preliminary notions: in the former the definition of simultaneous-move games, both in pure and mixed strategies, is presented and notation established; in the latter alternative conceptual approaches to the determination of the equilibrium of the game are briefly illustrated. 1.1 Simultaneous-move games in normal form: definition and notation A simultaneous-move or static game describes a situation of strategic interaction in which the agents involved – henceforth, to be named the players – are supposed to take decisions not knowing the actions simultaneously chosen by the other players. The interpretation to be given of the term “simultaneous” should not be reserved to situations in which the players literally take decisions at the same time, but more generally to situations of ignorance of the choices made by the other players, possibly at different moments in time. The lack of knowledge of the other players’ moves characterizes the game as one of imperfect information, as opposed to situations of perfect information, which obtain if each player knows, when called to play, the decisions taken by all the other players in the preceding rounds of the game. 1.1.1 Games in pure strategies The assumption that players move simultaneously leads to the description of the game in terms of the normal or strategic form, which consists in the indication of: (1) the number of players; (2) the strategies available to each player; (3) the payoff received by each player for each possible combination of strategies chosen by the players. In standard notation: - I is the number of players; players are specified by the subscript i 1,..., I ; a standard compact notation distinguishes player i from all other players, formally indicated as i ; - S i is the space of strategies available to player i. The strategy space may consists of a finite number Ki of elements, si si1 ,..., si ki ,..., si Ki Si , representing Ki distinct pure strategies, or of an infinite number of elements representing a continuum of possible pure strategies, i.e. of possible values of player i ’s decision variable. In this latter case the strategy space is represented by the interval Si si , si . 1 In static games strategies coincide with actions; the two terms will therefore be used interchangeably. In dynamic games a strategy is 1 The strategic interaction problem may call for the presence of several variables capable of assuming a continuum of values. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.2 a rule of action indicating the specific action of the player at each round of the game in which he must take a decision; a strategy is therefore an ordered sequence of actions; - S Si Si Si , with Si S j , is the strategy space of the game and j i iI s s1 ,...si ,..., sI si , si S is a profile of strategies, where the compact notation i for all players different from player i has been used; - ui si , si : S is player i’s payoff resulting from all possible profiles of actions of all players, formally a mapping from the strategy space of the game into the real line. Strategic interaction among players shows up in this definition of the payoff function. The payoff function is assumed to be, in general, an ordinal utility function; unless otherwise indicated, this is the interpretation to be given to the players’ payoffs in the matrix games used to illustrate different game situations. When the players are firms, involved for instance in an oligopoly game, payoff functions are better considered to represent money profits. A game is finite if the strategy space S is finite; this requires that both the number of players and the strategy space of each player be finite. In several game situations of specific interest to economic theory, the strategy space of the players is naturally modeled with continuous variables, such as prices and quantities, for instance, in the classic oligopoly models of Cournot and Bertrand. The presentation in this Note basically refers to finite games; separate consideration will be given, when appropriate, to continuous games. Definition 1.1. The normal form representation of a game G consists in the specification of the number of players, the pure strategy space and the payoff function of each player: (1.1) G I , Si , ui si , si i 1 I We say that the game is of complete information if all players know the structure of the game, namely all its constituent elements: the number of players, the strategy space of every player and everybody’s payoff functions. This last element is particularly relevant: incomplete information typically occurs when the payoff function of one or several players is not known by all the other players. A Bayesian approach to the description of the game is in that case necessary; the notion of equilibrium must be correspondingly extended. 1.1.2 Games in mixed strategies Not all games in normal form have solution in pure strategies. Take the following game in bimatrix form, known as Matching Pennies (see Fig. 1.1). Each player’s strategy space contains two actions Si Heads, Tails , for short Si H , T , i 1, 2 . The payoffs reflect a game in which each player has a penny and must choose how to display it. If the two pennies match with heads or tails facing up, player 1, the row player, wins player 2’s penny; if the D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.3 pennies do not match, player 2 wins player 1’s penny.2 Since each player wins what the other player looses, this is a competitive or zero-sum game. It is immediately clear that the game has no solution in pure strategy; each player would try to outguess the other, but with no element on the basis of which to make one’s choice of strategy. 2 Heads Tails Heads +1, -1 -1, +1 Tails -1, +1 +1, -1 1 Figure 1.1 – Payoff matrix of the Matching Pennies game The uncertainty each player faces in the choice of strategy has led to the idea that players may randomize between their pure strategies S i and thus adopt a mixed strategy, which represents a probability distribution over the set of pure strategies. Although the mixed extension of the game has a crucial role in the proof of existence of a equilibrium solution, the traditional view of mixtures as conscious randomization carried out by each player has, nonetheless, come under heavy criticism. To suppose, in fact, that rational economic agents, firms in particular, should delegate the determination of their behavior to the outcome of a pure chance mechanism is disturbing and utterly unrealistic. Harsanyi (1973) and Aumann (1987) have, however, convincingly argued for a different view of mixing, namely for an approach which reflects the uncertainty that each player has on the strategy choice of the other player - more generally, of the other players.3 “According to this view, players do not randomize; each player chooses a definite strategy. But other players need not know which one, and the mixture represents their uncertainty, their conjecture about this choice” (Aumann and Brandenburger, 1995, p. 1162, emphasis added). This uncertainty can be analytically expressed in terms of a probabilistic conjecture, or belief, concerning the opponent’s behavior. Let us then indicate with 2 the probability that player 1 assigns to player 2 playing strategy s12 Heads and with 1 2 the probability the he assigns to player 2 playing strategy s22 Tails . Let us similarly indicate with 1 and 1 1 the conjecture of player 2 as to the strategy choices s11 Heads and s12 Tails of player 1. The probability distributions 1 ,1 1 and 2 ,1 2 define a new set of possible strategies that each player attributes, by way of conjecture, to the opponent. We will refer to these strategies as mixed strategies to distinguish them from the pure strategies Heads and Tails, of which they are a probability combination. For short, as in most textbooks, we will 2 In the Matching Pennies game payoffs are naturally taken to be a quantity of money, rather than the utility of winning or losing a penny. 3 Actually Harsanyi’s so called purification theorem has the still further implication that players play pure strategies and mixed strategies represent the probabilities with which the pure strategies are effectively played. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.4 write that i is the mixed strategy of player i, omitting to say that this is a conjecture made by player j. Definition 1.2. The mixed strategy i i1 ,..., i ki ,..., i Ki of player i is a probability distribution over the set of pure strategies S i . The set of mixed strategies of player i (1.2) Ki Si i Si i k 0, i k 1 k 1 is therefore a K i 1 dimensional simplex of probability distributions over the set of pure strategies S i ; i si k ik is the probability assigned by the mixed strategy i to the pure strategy si k ; Si is a non empty, closed, bounded and convex subset of a K i dimensional vector space. The mixed strategy i is then a vector with as many component as there are pure strategies. When the strategy space contains only two pure strategies, as in the case of the Matching Pennies game, it is sufficient to indicate the first component of the vector i1 ; we will directly set i i1 . The set of mixed strategies Si of the game Matching Pennies is represented in Fig. 1.2 by the segment AB. The points A and B correspond to the mixed strategies i 1, 0 and 1 B A 1 Fig. 1.2 – The set of mixed strategies of the Matching Pennies i 0,1 , that is respectively to the case of playing with certainty strategy Heads and to the case of playing Tails for certain. Pure strategies thus correspond to degenerate mixed strategies, i.e. to mixed strategies that assign probability one to just one pure strategy and probability zero to all the others. With the same notation as used above with regard to pure strategy games, let S i S j e S Si Si be the sets of mixed strategies of the players j i D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.5 different from player i and the space of mixed strategies of the game.4 A profile of mixed strategies is therefore a vector i , i S . To complete the description of a mixed strategy game we must define players’ payoffs. Returning to Matching Pennies, let us determine the expected payoffs of player 1, first separately for his two pure strategies Heads and Tails and then for the mixed strategy over Heads and Tails that represents player 2’s conjecture. Each pure strategy is now a lottery over the consequences 1, 1 with probabilities 2 ,1 2 . The expected payoffs are u1 Heads, 2 u1 Heads, Heads 2 u1 Heads, Tails 1 2 (1.3) 1 2 11 2 u1 Tails, 2 u1 Tails, Heads 2 u1 Tails, Tails 1 2 = 1 2 11 2 These payoffs correspond to von Neumann and Morgenstern’s expected utilities. The expected payoff to player 1’s mixed strategy 1 ,1 1 is therefore the weighted average of the expected payoffs of the two pure strategies Heads and Tails: (1.4) u1 1 , 2 u1 Heads, 2 1 u1 Tails, 2 1 1 1 1 2 1 1 1 2 11 1 2 11 1 1 2 The expected payoff of player 2 can, obviously, be determined in a wholly similar way. The payoff function of player i is then a mapping from the mixed strategy space of the game to the real line. 2 Heads Tails Heads 1 2 1 1 2 Tails 1 1 2 1 1 1 2 1 Fig 1.3 – Probability distribution induced by the mixed strategies 1 and 2 The definition (1.4) of the expected payoff of player 1 – and similarly of player 2 – shows that the players’ mixed strategies induce a probability distribution over the cells of the payoff matrix of the game, as indicated in Fig. 1.3. If, as apparently obvious, each player attributes The definition of the mixed strategy space of players j i is based on the assumption that each player’s randomization be independent of that of all other players – or better, that each player believes that the mixed strategy choices of the other players are independent, i. e. uncorrelated. This possibility and the consequences thereof are, for an important aspect, analyzed at the end of this Lecture Note. 4 D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.6 probability 1 2 playing Heads, the probability on each of the cells of the payoff matrix would be equal to 1 4 . We are now in a position to give a formal definition of a mixed strategy game, which parallels Definition 1.1. Definition 1.3 The normal form representation of a mixed strategy game consists in the specification of the number of players, of each player’s mixed strategy space and expected payoff function I , Si , ui i , i i 1 I (1.5) To sum up the notation used in the mixed strategy extension of the game, we have: - I 1, 2,..., I is the finite number of payers; - Si si1 ,..., si k ,..., si Ki is the finite set of pure strategies or actions of player i ; Ki - Si i Si i 0, i k 1 is a K i 1 dimensional simplex of probability k 1 distribution over the set of pure strategies S i ; i k si k is therefore the probability assigned by the mixed strategy i to the pure strategy si k ; Si is a non empty, closed, bounded and convex subset of a K i dimensional vector space; - Si j i S j and S Si Si ; as the Cartesian product of non empty, closed, bounded and convex sets, S Si Si is a non empty, closed, bounded and convex subset of a Ki dimensional vector space; i - i , i S is a profile of mixed strategies of all players; - ui i , i is the payoff to player i from the strategy profile i , i . 1.2 Alternative approaches to the determination of the equilibrium The equilibrium of a game is a profile of strategies such that no player has the incentive to unilaterally deviate from his choice. How is this choice determined in a situation of strategic interaction, of which all players are fully aware? It is commonly assumed, in a context of individual decision-making, that agents behave rationally, namely that the action chosen by each agent is at least as good, according to the agent’s preferences,5 as any other available action.6 We will assume this principle to hold also 5 This notion is sometimes referred to as “instrumental rationality”. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.7 in game theoretic situations.7 But in a strategic context the best action of any player generally depends on the actions that the other players will choose. Each player will then be compelled to anticipate these actions and base his decision on a conjecture, or belief, of the other players’ choice of action. In his PhD dissertation John Nash (1950) suggests two approaches to the determination of an equilibrium profile of strategies and thus to the formation of conjectures. The first is by deduction and introspection, that is through a process of deductive reasoning based on the assumption of rationality and common knowledge of rationality and of the game structure. This is the approach followed in this Lecture Note and throughout the course. It will be amply illustrated in the study of Nash Equilibrium. The second approach is based on the “mass-action” interpretation of the equilibrium points of the game. Nash presents this approach envisaging a large population of potential players for each position of the game; with reference to a two-person game, we thus assume a large population of participants in the position of each of the two players. A participant is selected at random from each population and plays the game with no possibility of communication with the other player. The repeated play by randomly selected players, so that the probability that the same players meet again can be ignored, determines an “average playing” of the game. Hence, “the probability that a particular [set] of strategies will be employed in a playing of the game should be the product of the probabilities indicating the chance of each of n pure strategies to be employed in a random playing” (Nash, PhD dissertation, 1950, p. 22). One natural specification of each player’s conjecture about the play of his opponent is that it converges to the probability distribution corresponding to the average playing of the game. In such idealized setting of play, the equilibrium of the game is a steady state of the process of repeated playing in a random matching of players (Osborne, 2004, p. 22). The idea underlying Nash’s mass-action interpretation of equilibrium points of a game is that players learn each other’s strategies from their experience playing the game. An alternative approach to Nash’s mass-action model is to assume that players start from some initial, possible unexplained prior conjecture, which they subsequently revise in response to information received. This model envisages a personal learning process which, if convergent, leads to a steady state solution of the game. Cournot (1838, Ch. VII) relies on such a process for the attainment of the duopoly equilibrium in his quantity strategy game; he assumes, in particular, that each player adjusts his strategy assuming that the other firm will maintain the level of output chosen in the previous period. 6 Bounded rationality departs from this principle. As remarked by Simon (1955), economic agents do not have the knowledge and the means to make optimizing choices and settle, therefore, for satisfycing decisions, on the basis of routines that have proved in the past to deliver satisfactory results. 7 We will shortly see how the principle of rational choice is adapted in the theory of evolutionary games aiming to portray the process of natural selection among animals and plants. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.8 Convergence fundamentally depends on the model of learning. Opposite results can easily emerge with reasonable changes of assumptions. Cournot’s duopoly model offers examples of convergent a well as of divergent results. A final mention is in order to evolutionary game theory. Evolutionary game theory originated as an application of the mathematical theory of games to biological contexts. In line with Nash’s mass-action interpretation, it studies the behavior of a large population of agents who repeatedly engage in strategic interaction. In evolutionary game theory strategies are generically inherited traits that control an individual’s action – something akin to a computer program. Payoffs are specified in units of fitness, which represent the capacity of a specie to reproduce itself and thus survive in the Darwinian process of natural selection. The mathematical biologist John Maynard Smith (1972) introduced the notion of evolutionary stable strategy (ESS), as a tool of explaining the existence of an equilibrium outcome of animal conflict, in the sense that, if every member of the population follows an ESS, no mutant can successfully invade and radically change the equilibrium of the specie. An explicit dynamic foundation, in terms of differential equations, of this equilibrium concept was later provided by the model of the replicator dynamics. Replicator dynamics reflects the rule that the fitter players will generate more replicas of themselves than the less fit, which will be thus culled out of the population. The approach and the tools elaborated by evolutionary game theory have been of interest to the development of game theory in general, in particular in the applications to the problem of equilibrium selection in the presence of multiple equilibria. 1.3 Strict dominance We start the search of equilibrium solutions of a game on the basis of a negative criterion: no rational player would ever choose an action that is payoff dominated by another action available to him. This criterion requires each player to consider only his own payoffs; no knowledge is required of the structure of the game; no conjecture need be formulated as to choices of the other players. 1.3.1 Strict and weak dominance: definitions Definition 1.4 The pure strategy si Si is a strictly dominant strategy for player i in the game G I , Si , ui si , si if for all si si we have i 1 I (1.6) ui si , si ui si, si for si Si A double condition identifies a strictly dominant strategy for player i: i) it offers the highest payoff among all strategies available to the player and ii) this highest payoff is for all possible D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.9 strategies of the other players. A weaker definition of dominance results if the strict inequality sign in (1.6) is changed into a greater than or equal sign, as in the following definition Definition 1.5 The pure strategy si Si is a weakly dominant strategy for player i in the game G I , Si , ui si , si if for all si si we have i 1 I ui si , si ui si, si for si Si (1.7) Weak dominance requires therefore that strategy si have a payoff no less than that of all other available strategies si and at least one payoff strictly greater. The converse of the notion of dominant strategy is the notion of a dominated strategy. Definition 1.6 The pure strategy si Si is strictly dominated for player i in the game G I , Si , ui si , si i 1 if there exists a strategy si Si such that I (1.8) ui si, si ui si , si for si Si Definition (1.6) obviously implies that strategy si strictly dominates strategy si . We can at this point restate the definition of a strictly dominant strategy as a strategy that strictly dominates all other strategies si Si . The extension of Definition 1.6 to the case of weakly dominated strategy parallels that of a weakly dominant strategy. We will examine in the next section an example of application of the notion of dominance to determine the solution of a most renown game, the Prisoners’ Dilemma. These definitions of strictly (weakly) dominant strategy and strictly (weakly) dominated strategy apply also to the mixed extension of a game. Definition 1.7 The mixed strategy i Si is strictly dominated for player i in the game I , Si , ui i , i I i 1 (1.9) if there exists a strategy i Si such that ui i, i ui i , i for i Si We then have: Definition 1.8 The mixed strategy i Si is strictly dominant for player i in the game I , Si , ui i , i I i 1 if it strictly dominates all other strategies in Si . An implication of Definition (1.7) is that a pure strategy may not be strictly dominated by any other pure strategy, but by a mixed strategy of other pure strategies. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.10 Consider, for example, the following game (Mas-Colell 1995, p.241) in which player 1 has three pure strategies Top, Middle, Bottom , for short T , M , B and player 2 has two pure strategies Left , Right , for short L, R . The payoffs of the two players are indicated in the normal form representation of the game in Fig. 1.4. Left Right Top 10, 1 0, 4 Middle 4, 2 4, 3 Bottom 0, 5 10, 2 5, 3 5, 3 1 Top 12 2 Bottom 1 1 Fig. 1.4 – Game with pure strategy M dominated by the mixed strategy 1 , 0, 2 2 Note that strategy M is not dominated by either pure strategy T or B: it has a greater payoff than strategy T (B) if player 2 plays strategy R (L). M is, however, strictly dominated by the 1 1 mixed strategy 1 , 0, with a payoff of 5 whatever the strategy played by player 2. 2 2 This is not the only mixed strategy of player 1 which dominates the pure strategy M . It can be easily checked that all mixed strategies with 1 Top 4,6 strictly dominate M. 1.3.2 The prisoners’ dilemma: solution in strict dominance The Prisoners’ Dilemma game is depicted in Fig. 1.5. The story behind the game is the following. Two suspects of having committed a crime are taken into jail by the police and held in separate cells, so that communication among the prisoners is made impossible. The District Attorney has only indirect evidence, which will lead the jury to a sentence of only 3 month jail unless the prisoners admit of having committed the crime they are accused of. To obtain a confession, the Attorney meets each prisoner separately and informs him of the consequences of his possible actions. If he is the only one to confess, he will be rewarded with a light sentence of, say, 1 month jail; but if he negates participating to the crime while the other prisoner confesses, he will be heavily punished with a 12 month sentence to jail. If they both confess, the jury will have mercy on them and sentence both to 6 month jail. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.11 The prisoners’’ Dilemma lends itself to a presentation in the terms of a game in normal form. The number I of players is equal to 2; the strategy space of each players contains two actions Confess, Don ' t Confess , for short C, NC ;8 the payoffs reflect to story behind the game. Confess Don’t Confess Confess -6, -6 -1, -12 Don’t Confess -12, -1 -3, -3 2 1 Fig. 1.5 – Payoff matrix of the Prisoners’ Dilemma The first step in search of a solution is to analyze the payoff matrix of the game to see if one of the two strategies is strictly dominant say for player 1. From definition 1.1, we must, therefore, consider only the payoffs of this player. At the risk of being overfastidious, these payoffs are reproduced in Fig. 1.6. It is immediately evident that the strategy Confess strictly dominates the alternative strategy Don’t Confess. This means that the rational choice of player 1, based only on the knowledge of his own payoffs (and not of the entire game) and with no need to make any conjecture as to the decision of the other prisoner, is to play Confess, which offers him a higher payoff than Don’t Confess whichever the choice of the other player. Since player’s 2 rational choice is, for the same reasons, Confess, the equilibrium strategy profile of the game is Confess, Confess . Confess Don’t Confess Confess -6 -1 Don’t Confess -12 -3 2 1 Fig. 1.6 – Payoff of player 1 in the Prisoners’ Dilemma Needless to say, the solution in strict dominance has a paradoxical flavor: the strategy profile Don ' t Confess, Don ' t Confess is preferable for both prisoners. The conflict between the game theoretic solution, rigorously based on the application of the exclusive principle of rational behavior, and the efficient solution, which Pareto dominates the game theoretic solution, is indicative of a situation that may arise in very different and very real contexts, typically in problems of industrial organization. This circumstance paves the way to heavy criticism of game theory solutions concepts and to proposals aimed at finding ways to reconcile equilibrium with efficient solutions. We will dedicate attention to some of these problems further on in the course. Fink for Confess and Quiet or Mum for Don’t Confess are also used to indicate the two strategies of the Prisoners’ Dilemma. 8 D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.12 1.4 Iterated strict dominance The simple criterion that rational players would never use a strictly dominated strategy has led us to the determination of a unique solution of the Prisoners’ Dilemma. Our purpose is now to understand if substantially the same idea – that a rational player will never play a strictly dominated strategy – opens the way to the determination of a solution in games in which the direct elimination of strictly dominated strategies is, by itself, not sufficient to do it. The technique consists in considering the possibility that the iterated elimination of strictly dominate strategies may reduce the payoff matrix to a single surviving cell as in the Prisoners’ Dilemma. As we will see, this is indeed possible in some instances, but the process of iterated elimination requires a substantial change in the deductive capacities of the players: it is not enough for each player to be individually rational; we have to ask players to know that all players are rational – a property known as common knowledge of rationality (CKR). We will separately examine the problems that may arise when the process of iterated elimination is extended to weakly dominated strategies. To avoid repeating a long sentence, we will distinguish between the solution approach based on the existence of a strictly dominant strategy and the approach based on the iterated elimination of strictly dominated strategies in terms of direct dominance and iterated dominance. 1.4.1 Iterated elimination of strictly dominated strategies Consider the game depicted in Fig. 1.6, in which both players have three strategies, S1 Top, Middle, Bottom , for short S1 T , M , B , and S2 Left , Center , Right , for short S2 L, C , R . 2 Left Center Right Top 0, 1 3, 0 1, 2 Middle 2, 2 1, 0 2, 1 Bottom 1, 0 0, 1 1, 0 1 Fig. 1.6 While no pure strategy of player 2 is dominated by any other of his pure strategies, strategy Bottom of player 1 is strictly dominated by strategy Middle. The elimination of strategy Bottom of player 1 renders, in turn, strategy Center of player 2 strictly dominated by both his other two strategies. The further elimination of this strategy induces player 1 to discard strategy Top, which has now become strictly dominated by strategy Middle. The final step of the process is now for player 2 to eliminate strategy Right that gives him a payoff of 1, as opposed to the payoff of 2 of strategy Left: By successive elimination of strictly dominated D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.13 strategies of the two players only the strategy profile Middle, Left remains. This profile of strategies is the solution of the game. We have carried the process of iterated elimination of strictly dominated strategies in an utterly mechanical way. However, what is the reasoning that justifies the players to successively discard some strategies without directly knowing the previous decisions of the other player? Here come the important logical underpinnings of iterated dominance, which make the distinction with regard to the direct dominance approach of the Prisoners’ Dilemma. First, the direct dominance approach demands the players to know only their own payoffs: indirect dominance assumes, on the contrary, that players know the entire structure of the game, in particular the complete payoff matrix. This assumption implies that the structure of the game is common knowledge: everybody knows that everybody knows that everybody knows … and so on. Second, it is necessary to explain why player 2 decides to eliminate his pure strategy Center, which is not strictly dominated by either pure strategy Left or Right. To this end, we must assume not only, as we have already done, that the structure of the game is common knowledge, but also that player 2 knows that player 1 is rational and that, as a consequence, he will not play his strictly dominated strategy Middle. The further step in the process of iterated elimination requires that player 1 knows that player 2 knows that he is rational. The final step requires, in turn, that player 2 knows that player 1 knows that he (player 2) knows that player 1 is rational. The term for this assumption is common knowledge of rationality: everybody knows that everybody knows that everybody knows (… and so on) that players are rational. If the strategy space of every player is finite the process of iterated elimination of strictly dominated strategies necessarily ends in a finite, though perhaps very long, number of steps. If the strategy space is infinite, iterated dominance leads to a regress to infinity, as will become apparent in the subsequent application to Cournot’s duopoly game. The process of iterated elimination of strictly dominated strategies can be formalized following the description given above, with player 1 beginning the process and players taking turns in the successive steps.9 Let S10 S1 and S10 S1 be player 1’s initial pure and mixed strategy spaces. Define the first found of elimination of strictly dominated strategies in terms of the surviving undominated strategies (1.10) S11 s1 S10 there does not 1 S1 s.t. u1 1, s2 u1 s1, s2 , s2 S2 Moving to the following round, define (1.11) S22 s2 S20 there does not 2 S2 s.t. u2 s1, 2 , u2 s1, s2 , s1 S11 Proceeding in this way we can define the subset of pure strategies surviving iterated elimination respectively of player 1 and player 2 as S1n 1 and S 2n . At each step of the process 9 A more elegant formal description, independent of the player who starts the process, has the players to move simultaneous at each step. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.14 the surviving subset of uneliminated pure strategies is strictly contained in the preceding one; we then have S1n1 S1n3 S10 and S2n S2n2 S20 . Indicating with S1 and S2 , obviously non empty, the final stages of the process, we can conclude that the game admits a solution in iterated dominance if and only if both S1 and S2 contain only one element. Nothing, of course, guarantees that the sets S1 and S2 contain only one element, as the matrix game depicted in Fig. 1.7 of the following Section 1.5.1 shows, since none of the pure strategies of the players is strictly dominated. In this case, we have Si Si , i 1, 2 . 1.4.2 Iterated weak dominance The process of iterated dominance applies to the elimination of strictly dominated and not to weakly dominated strategies. The basic motivation for this exclusion is that, when there are more than one pure strategy with is weakly dominated by another pure strategy, the outcome of the process of iterated elimination may be path dependent, that is it may depend on the order of elimination to the weakly dominated strategies. Consider the game of Fig. 1.7 (Mas-Colell, p. 238). 2 Left Right Top 5, 1 4, 0 Middle 6, 0 3, 1 Bottom 6, 4 4, 4 1 Figura 1.7 While neither strategy of player 2 is dominated, the strategies Top and Middle of player 1 are weakly dominated by strategy Bottom. We can start the process of iterated elimination by either one. Suppose we start by eliminating strategy Top – suppose, more correctly, that player 2, knowing that player 1 is rational, assumes that he will not play Top. The sequence of iterated elimination proceeds at this point by player 1 assuming that the rational player 2 will in turn eliminate the now weakly dominated strategy Right. The remaining profile of strategies is then Bottom, Left , which would accordingly be the solution of the game. It can easily be checked that if the process of iterated elimination of weakly dominated strategies starts with the elimination of strategy Middle, the resulting equilibrium profile of strategies would be Bottom, Right . 1.5 Rationalizable strategies D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.15 The notion of rationalizable strategies was independently proposed by Bernheim (1984) and Pearce (1984) in a context of diffused criticism of Nash equilibrium and its various refinements. The nature of the criticism implied by the rationalizable strategies approach will be considered further on in the course. Our concern here is with the definition and the relationship with dominance. 1.5.1 Approach and definition As above underlined, the rationale for the elimination – direct and iterated - of strictly dominated strategies is the criterion that no rational player would choose a strategy that would deliver a lower payoff than another, pure or mixed, strategy available to him, regardless of the strategies chosen by the other players. To this negative approach of eliminating “poor” strategies, rationalizable strategies oppose a positive approach to the solution of a game, namely of searching for “good” strategies. These are the strategies that a player would be justified playing, by an appropriate chain of reasoning, when the structure of the game and the players’ rationality are common knowledge. By implication, since a player would never choose a strategy which is not “good”, non rationalizable strategies can be eliminated, in a way similar to iterated dominance, when looking for a solution of the game. Definition 1.9 In the game I , Si , ui i , i I i 1 the mixed strategy i Si is rationalizable if it is a best response to some conjectures (beliefs) that player i as to the strategies i Si of the other players. Formally: i is rationalizable if there exists i Si such that (1.12) ui i , i ui i, i for i i Si By implication, the mixed strategy i Si is never a best response if there is no conjecture of player i as to the strategies i Si of the other players for which i is a best response. There are aspects concerning the role of conjectures that deserve being stressed: i) player i is supposed to formulate a conjecture as to the possible choices of the other players and determine his best response on that basis; these conjectures correspond to the definition we have adopted for mixed strategies i - an expression of player i’s uncertainty as to the strategy choices of his rivals; ii) i is rationalizable if there is some conjecture that makes it a best response. As regards the practical approach to follow to test for razionalizable strategies in typical two-player bi-matrix games, it is convenient to proceed as follows. Start from the pure strategies of D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.16 iii) player 1 and suppose (conjecture) that also the other player uses a pure strategy (in other terms, that he plays a degenerate mixed strategy). If a pure strategy of player i is not a best response to any pure strategy of another player, it will not be best response to any non degenerate mixed strategy of the other player; The assumption of common knowledge of the structure of the game and of the rationality of the players has the important implication that each player’s conjecture about the other players’ strategies should not be arbitrary. He should expect the other players to use only strategies that are best response to some of their conjectures. The argument of common knowledge of rationality leads now to a potential regress to infinity about the reciprocal conjectures of the players and supplies the basis for the process of iterated elimination of strategies that are never a best response (Fudenberg and Tirole, 1998, p. 49). Definition 1.10 In the game I , Si , ui i , i I i 1 the strategies Si that survive the iterated elimination of never best response strategies represent the set Si R of rationalizable strategies. A convenient analytical definition of a best response strategy is by means of the correspondence BRi i : Si Si , that is the mapping from some conjecture i Si to i Si .10 Definition 1.11 A best response strategy i is an element of the best response correspondence BRi i (1.13) i BRi i i Si u i , i u i, i i i and some i Si We will analyze the properties of best response correspondences in Lecture Nore n. 2 when we will examine the problem of existence of a Nash equilibrium. It is time to work out an example of the determination of best response strategies and of the set of rationalizable strategies. Consider Bernheim’s game depicted in Fig 1.7 Both players have 4 pure strategies: S1 a1 , a2 , a3 , a4 and S2 b1 , b2 , b3 , b4 . In order to determine which strategies of player 1 are best response to some strategies of player 2 we proceed as indicate in point ii) above. Suppose that player 1 conjectures that player 2 is going to play for certain strategy b1 , i.e. that he adopts the degenerate mixed strategy 2 1, 0, 0, 0 ; comparing the payoffs of the different strategies of player 1 in the column corresponding to player 2 playing b1 (respectively 0,5, 7, 0 ) we immediately see that 10 The reason for the definition of the mapping as a correspondence, i.e. a multivalued function, will become apparent in the construction of the best response function in mixed strategies. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.17 strategy a3 is player 1’s best response. We signal this in the payoff matrix by a bar over the highest payoff. As we continue in this way, modifying the conjecture of player 1 as to the pure strategy played for certain by player 2, we realize that all his pure strategies are best response to some pure strategy of his rival. We do the same for player 2, indicate his best response pure strategies by a bar under the highest payoff and notice that strategy b4 is never a best response. In fact, as indicated in the last column of the payoff matrix of Fig. 1.7, strategy b4 is also strictly dominated by an equal probability mix of strategies b1 and b3 . We proceed, therefore, to the elimination of this strategy. When this is done, we realize that strategy a4 is no longer a best response, which was previously justified by the presence of b4 . This is a short and imprecise wording for the longer, but correct statement: player 1 correctly conjectures that strategy b4 is never best response; player 2, whose rationality is by assumption common knowledge, will therefore never use it; at this point player 2 correctly conjectures, on the same basis, that player 1 will never choose strategy a4 , which ceases to be a best response. The set of rationalizable strategies of the two players are, therefore, S1 a1 , a2 , a3 and S2 b1 , b2 , b3 ; the set of rationalizable strategies of the game is the Cartesian product S R S1 S 2 . 1 b 2 1 12 b3 b1 b2 b3 b4 a1 0, 7 2, 5 7, 0 0,3 3,5;3,5 a2 5, 2 3, 3 5, 2 0,1 2,5; 2 a3 7, 0 2, 5 0, 7 0,1 3,5;3,5 a4 0, 0 0, -2 0, 0 10, 1 0; 0 1 Fig. 1.7 – Bernheim’s game As stated at the beginning of this Section, rationalizable strategies are strategies that a player would be justified playing, by an appropriate chain of reasoning, when the structure of the game and the players’ rationality are common knowledge. The chain of reasoning that leads to the conclusion that a strategy is rationalizable deserves to be spelled out in detail: important differences emerge among rationalizable strategies from this point of view. Consider strategy a1 of player 1. This strategy is rationalizable by the conjecture that player 2 will play strategy b3 , which is in turn rationalizable for player 2 by the conjecture that player 1 will play strategy a3 . Proceeding further in our chain of reasoning, a3 is razionalizable by player 1’s conjecture that player 2 will play b1 , which is rationalizable by player 2’s D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.18 conjecture that player 1 plays a1 . At this point, we are back where we started and the loop leading to rationalization continues to infinity. In extreme synthesis: a1 is rationalizable by b3 ; b3 is rationalizable by a3 ; a3 is rationalizable by b1 ; b1 is rationalizable by a1 ; a1 is rationalizable by b3 ; and so on in an infinite regress with players’ conjectures always disproved. Consider, on the contrary, strategy a2 , which is justifiable by player 1’s conjecture that player 2 will play strategy b2 , which is turn razionalizable for player 2 by the conjecture that player 1 will play strategy a2 . In this case, the loop is immediately closed; both players’ conjectures are verified. This property of conjectures to be verified constitutes the equilibrium assumption that leads to the definition of Nash equilibrium, which is a subset of rationalizable strategies. In Definition 1.9 we have associated the set of rationalizable strategies to the set of strategies surviving the elimination of never best response strategies. From Definition 1.10 we have further drawn the conclusion that the set of rationalizable strategies coincides with the set of best response strategies. In games with a finite set of pure strategies, as in Bergheim’s game, there is no analytical way to identify razionalizable strategies: we have to check which strategies are actually best response to some best responses of the other players. In the mixed extension of a game, best response correspondences can be formally determined as will be shown in Lecture Note n. 2, where examples of derivation will be given and, in connection with the proof of existence of Nash equilibrium, properties of mixed best response strategies are defined. 1.6 Relation between the set of strictly undominated and rationalizable strategies It can be readily verified that in the game depicted in Fig. 1.7 the set of razionalizable strategies coincides with the set of iterated undominated strategies. While this is a result of general validity in 2-person games, it need not hold in games with more than 2 players. Let us start by proving the following result Proposition 1.1 A strictly dominated strategy can never be a rationalizable strategy.11 11 The proof of the converse proposition, namely that a rationalizable strategy cannot be strictly dominated, can be found in Fudenberg and Tirole (1991, pp. 50-52). D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.19 Proof. By direct comparison of definitions we have: from definition 1.9, the pure strategy si is not razionalizable if there exists i Si such that ui si, i ui i , i for all i si Si . But this condition is identical to the condition, given in Definition 1.7, of a strictly dominated strategy. We can draw a first conclusion, namely that the set of razionalizable strategies S R is weakly contained in the set of undominated strategies SUD : S R SUD . In games with 3 or more players, on the contrary, the relation between rationalizable and strictly undominated strategies may depend on the existence of a possible correlation among the choices of the opponents of a given player i . It is a standard assumption in game theory that the mixed strategies i , with i 2 , are statistically independent, in the sense that each player chooses his mixed strategy independently of the simultaneous choices of the other players. The following example, taken from Brandenburger (1992), illustrates the pointConsider the 3-player game in which the strategy spaces of the players are: S1 U , D , S2 L, R and S3 A, B, C . We can represent the game in normal form by means of 3 payoff matrices, in which player 1 is the raw player, player 2 the column player and player 3 chooses the matrix. In each cell of the matrices (see Fig. 1.8), the payoffs of the three players are indicated in the proper order: first that of player 1, then that of player 2 and finally that of player 3. 2 Left Right Up 1,1,1 1,0,1 Down 0,1,0 0,0,0 1 2 Left Right Up 2,2,.7 0,0,0 Down 0,0,0 2,2,.7 1 Player 3: plays A 2 Left Right Up 1,1,0 1,0,0 Down 0,1,1 0,0,1 1 Player 3: plays B Player 3: plays C Fig. 1.8 – Payoff matrices of the 3-players game We first check for strictly dominated strategies. To this end we consider the payoffs of player 1 from all the possible combinations of strategies of player 2 and 3. The result is presented in Fig. 1.9. 2+3 1 L , A L , B L , C R , A R , B R, C Up 1 2 1 1 0 1 Down 0 0 0 0 2 0 Fig. 1.9 – Payoffs of player 1 to the possible combinations of strategies of players 2 and 3 D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.20 The payoff matrix depicted in Fig. 1.9 shows that neither strategy of player is strictly dominated. The same is true for player 2, since the payoffs to his strategies Left and Right to all the possible combinations of strategies of players 1 and 3 are identical to the payoff matrix of Fig. 1.9. 3 1+2 1 2 U , L U , R D, L D, R A 1 1 0 0 B .7 0 0 .7 C 0 0 1 1 .5 .5 .5 .5 A 12 C Fig. 1.10 – Payoffs of player 3 to the possible combinations of strategies of players 1 and 2 The payoff matrix of Fig. 1.10 is constructed in the same way as that of Fig. 1.9; it shows the payoffs of player 3 to all possible combinations of strategies of players 1 and 2. Inspection reveals that no pure strategy is no strictly dominated either by another pure strategy or by non degenerate mixed strategy. Note, in particular, that strategy B is not dominated, for instance, by an equal chance mixture of strategies A and C. However, strategy B is not a best response to any possible conjecture of independent choices of players 1 and 2, as the bar over the payoffs show. Actually, only the pure strategies A and C can be best response to the possible combinations of strategies of players 1 and 2. Take, for instance, the independent mixed strategies of players 1 and 2 1 2 12 which induces a probability distribution of 1 4 over the columns of the matrix 1.10. the resulting payoffs to strategies A , B and C of player 3 are respectively .5,.3.5,.5 . We conclude that the set of razionalizable strategies is strictly contained in the set of undominated strategies. Assume, on the contrary, that the mixed strategies of players 1 and 2 are correlated, either on account of preplay communications between the players or by the observation of a common signal and are, e.g., U , L D, R 12 and U , R D, L 0 . The expected payoffs to strategies A , B and C of player 3 are now respectively .5,.7,.5 . So that we can conclude that, if correlated randomization is admitted, the set of razionalizable strategies does coincide with the set of undominated strategies. D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.21 References Aumann, R. (1987), “Correlated Equilibrium as an Expression of Bayesian Rationality”, Econometrica, vol 55, pp. 1-18 Aumann, R. and A. Brandenburger (1995), “Epistemic Conditions for Nash Equilibrium”, Econometrica, vol. 63, n. 5, pp. 1161-1180 Brandenburger, A. (1992), “Knowledge and Equilibrium in Games”. Journal of Economic Perspectives, vol. 6, no. 4, pp. 83-101 Fudenberg, D. and J. Tirole (1991), Game Theory, the MIT Press, Cambridge, Mass, USA, pp. Harsanyi, J. (1973), “Games with Randomly Disturbed Payoffs: A New Rationale for Mixed Strategy Equilibrium Points”, International Journal of Game Theory, vol. 2, pp.1-23, Maynard Smith, J. (1972), On Evolution, Edinburgh University Press, Edinburgh D. Tosato – Game Theory – Lecture Notes – a.y. 2013.2014 1.22
© Copyright 2026 Paperzz