Organizational Behavior and Human Decision Processes 120 (2013) 142–153 Contents lists available at SciVerse ScienceDirect Organizational Behavior and Human Decision Processes journal homepage: www.elsevier.com/locate/obhdp The price of anarchy in social dilemmas: Traditional research paradigms and new network applications Vincent Mak a,1, Amnon Rapoport b,⇑ a b Cambridge Judge Business School, University of Cambridge, Trumpington Street, Cambridge CB2 1AG, United Kingdom Department of Management and Marketing, A. Gary Anderson Graduate School of Management, University of California at Riverside, Riverside, CA 92521, United States a r t i c l e i n f o Article history: Received 7 November 2011 Accepted 16 June 2012 Keywords: Social dilemmas Price of anarchy Transportation and communication networks a b s t r a c t Research on social dilemmas has largely been concerned with whether, and under what conditions, selfish decisions by autonomous individuals jointly result in socially inefficient outcomes. By contrast, considerably less emphasis has been placed on the extent of the inefficiency in those outcomes relative to the social optimum, and how the extent of inefficiency in theory compares with what is observed in experiments or practice. In this expository article, we introduce and subsequently extend the price of anarchy (PoA), an index that originated in studies on communication in computer science, and illustrate how it can be used to characterize the extent of inefficiency in social dilemmas. A second purpose of our article is to introduce a class of social dilemmas that occur when individuals selfishly choose routes in networks, and illustrate how the concept of PoA can be helpful in studying them. Ó 2012 Elsevier Inc. All rights reserved. Introduction In this paper, we introduce an index of inefficiency in social dilemmas called the price of anarchy (PoA). Our primary aim is to place the extent of inefficiency on the agenda of experimental research on social dilemmas, as this area of research has largely been preoccupied with whether socially inefficient outcomes occur and what factors influence the probability of their occurrence. To achieve our aim, we illustrate the application of PoA in a number of examples; some of these (like the classic public goods game) will be familiar to researchers on social dilemmas, while others involve route choices in networks that might be less familiar. We hope that our introductory exposition will engender new research on social dilemmas in networks. But what are social dilemmas? In his path-breaking review of social dilemmas – a cornerstone in the development of research in this field – Dawes (1980) defined a social dilemma as an interactive decision making situation that satisfies two properties. First, the individual payoff for each agent who chooses to defect is strictly higher than the payoff for choosing to cooperate, no matter what choices are made by the other members of his group. The second property mandates that members of the population gain a higher payoff if all cooperate than if all defect. In the language of game theory, Dawes has opted to define social dilemmas as non- ⇑ Corresponding author. Fax: +1 951 827 3970. E-mail addresses: [email protected] (V. Mak), [email protected] (A. Rapoport). 1 Fax: +44 (0) 1223 339701. 0749-5978/$ - see front matter Ó 2012 Elsevier Inc. All rights reserved. http://dx.doi.org/10.1016/j.obhdp.2012.06.006 cooperative one-shot n-person games in which (i) all the n players have strictly dominant strategies that (ii) collectively result in a socially inefficient (Nash) equilibrium.2 Tailored to the paradigmatic n-person Prisoner’s Dilemma (PD) game (e.g., Rapoport, 1970), the definition has several drawbacks. First, it does not allow for games with mixed-strategy equilibria, which are progressively more often included in the ever expanding scope of social dilemma research. Second, it unnecessarily restricts the scope of social dilemma research by excluding games with multiple equilibria. Even more restrictive is the requirement that all the n players have strictly dominant strategies. Historically, this restriction has been relaxed in subsequent reviews of social dilemma research by Messick and Brewer (1983), Kollock (1998), and several experimental studies. In particular, Kollock offers a more general definition of social dilemmas as (p. 183): ‘‘. . .situations in which individual rationality leads to collective irrationality. That is, individually reasonable behavior leads to a situation in which everyone is worse off than they might have been otherwise.’’ Kollock argues that, for example, the 2 2 Assurance game,3 in which neither player has a dominant strategy, ‘‘is a more accurate model than the Prisoner’s Dilemmas Game of many social dilemmas situations’’ (p. 187). And when discussing social dilemmas that satisfy the two properties listed by Dawes, he wrote (p. 185): ‘‘However, not all social dilemmas involve dominating strategies.’’ 2 See the section ‘‘The price of anarchy’’ for a formal exposition of the idea of equilibrium. 3 The 2 2 Assurance game is a coordination game with two equilibria: mutual cooperation, which is socially optimal, and mutual defection, which is socially inefficient. V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 To add another important example, and illustrate how the field of social dilemmas has been expanded and consequently enriched in the last 30 years by relaxing the definition of Dawes, consider an experiment that, ironically, was conducted by van de Kragt, Orbell, and Dawes (1983) only 3 years after Dawes had published his Annual Review of Psychology paper. The experiment concerns a one-shot step-level public goods game. The game is played by a group of n symmetric players with no pre-play communication. Each player is endowed with e units (e > 0) and must decide independently and simultaneously either to contribute all of them to the benefit of his group (i.e., the public good), or keep all of them for himself. If m or more players (1 < m 6 n) contribute their endowments,4 then each player i receives a reward of r units (e < r); if m 1 or fewer players contribute, then the contributors lose their endowments, whereas the non-contributors keep theirs. It is easy to verify that if exactly m players contribute their endowments, so that everyone receives the reward, while the remaining n m players (in cases when m < n) keep their endowments, the total group payoff is maximized. This outcome satisfies the second property listed by Dawes. However, in violation of the first property, universal contribution is no longer a dominant strategy. In fact, this ‘‘minimal contribution set’’ game has n!=½m!ðn mÞ! asymmetric equilibria in pure strategies in which exactly m players contribute. Additionally, the game has another symmetric equilibrium in which no player contributes. Therefore, while the socially optimal outcome in a step-level public goods game is an equilibrium, it is possible that a socially inefficient outcome occurs, which is also an equilibrium. Indeed, this has often been reported by experimental studies of this game or its variants (see, e.g., Chen, Au, & Komorita, 1996; Mak & Zwick, 2010; McCarter, Budescu, & Scheffran, 2011, among many others). Our reading of the literature on social dilemmas in psychology and economics suggests that cases like the step-level public goods game are quite common. In fact, most research on social dilemmas is motivated by the observation that self-interested behavior by autonomous decision makers generally leads to inefficient outcomes; the presence of dominant strategies is not specifically required. Therefore, in the present paper, we take the broader view of defining social dilemma situations as non-cooperative games with socially inefficient outcomes. As such, social dilemmas encompass vastly different ranges of situations, from the classic context of exploitation of commons resource pools to overpopulation, deforestation, and congestion of traffic networks (as shall be described later). Common to all these social dilemmas is one single theme, namely, that there exists a ‘‘worst-case equilibrium’’ which is socially inefficient. Eliciting cooperation If individual rationality is, in general, not a sufficient condition for achieving collective rationality (e.g., Sandler, 1992), then what proposals may be advanced for eliciting behavior that increases social welfare? This question is of immense importance because of the critical role of social dilemmas in modern society. It has far reaching policy and educational implications that have been studied in much detail (see e.g., Komorita & Parks, 1994; Ostrom, 1990; Ostrom, Gardner, & Walker, 1994). Alternative solutions to social dilemmas have been reviewed and critically discussed by Dawes (1980), Messick and Brewer (1983), Van Lange, Liebrand, Messick, and Wilke (1992), Kollock (1998), and others. They include motivational solutions in which some or all of the decision makers have other-regarding (e.g., altruistic) preferences, and strategic solutions that may or may not assume changes in the fundamental structure of the game. 4 If m = 1, this game is known as the Volunteer’s Dilemma; it has different properties from what is discussed here. 143 For example, Dawes, McTavish, and Shaklee (1977) reported that pre-play discussion of the dilemma significantly reduced the frequency of socially inefficient outcomes in n-person PD games. van Dijk, de Kwaadsteniet, and De Cremer (2009) pointed out the need for common understanding among players in facilitating coordination to achieve socially optimal outcomes. Brewer (1979) and Edney (1980) suggested that cooperative solutions to social dilemmas may be facilitated by exploiting social ties arising from social group identity. Numerous studies (e.g., Isaac & Walker, 1988; Isaac, Walker, & Thomas, 1984; Rapoport & Chammah, 1965) have demonstrated experimentally that changes in the payoff structure may affect the frequency of socially efficient outcomes, such that the greater the personal return from cooperation or the lower the personal return from defection, the higher the level of cooperation. Recent research highlights the impact of uncertainty about the payoff structure, rather than just its (expected) values, on cooperation (e.g., McCarter, Rockmann, & Northcraft, 2010; van Dijk, Wit, Wilke, & Budescu, 2004). In general, these solutions shy away from calling for social designers to recommend which courses of action the players should take or, in more controversial cases, for central authorities to enforce such courses of actions. If norms of social behavior are formed over time (e.g., in small communities), then they are supposed to be established voluntarily. Other solutions that originated in biology and computer science have taken a distinctly more authoritarian perspective. In an influential article on the ‘‘tragedy of the commons’’, the biologist Hardin (1968) concluded that ‘‘freedom in a commons brings ruin to all’’ and advocated ‘‘mutual coercion mutually agreed upon.’’ In computer science, where it is generally not the case that agents are completely unrestricted, Roughgarden (2009) suggested that efficient joint outcomes ‘‘could be improved upon given dictatorial control over everyone’s actions.’’ Others have been looking for a middle ground between centrally enforced solutions and completely unregulated anarchy. For example, Anshelevich et al. (2008) pointed out that agents using communication networks interact with an underlying protocol that proposes a collective solution to all the users who may individually either accept or reject it; as such, the protocol designers may at least seek to promote the best possible equilibrium strategies in terms of total welfare (see the Section ‘‘Extensions and generalizations’’). The price of anarchy Imposing changes in the payoff structure, conducting pre-play communication, or establishing superordinate authority to control everyone’s action, are almost always costly in terms of time and money, often infeasible, and may frequently trigger socially undesirable reactions (e.g., a negative reaction among the group members to the infringement of their individual freedom). Therefore, a key question is which proposal or combination of proposals to implement (if any), and under what conditions in order to achieve near-optimal outcomes. This question cannot be answered in practice without measuring the potential extent of inefficiency caused by the behavior of independent, self-interested individuals. If the extent of inefficiency, even in the worst scenario, is relatively small, then the cost of implementing procedures to elicit cooperative behavior may exceed whatever gain in efficiency that could result. But if it is relatively large, then it might be worthwhile to bring about conditions under which decentralized optimization by selfish individuals is guaranteed to produce outcomes that are near-optimal. Three steps ought to be taken in order to answer this question. The first is to choose a formal model that defines ‘‘the outcome of selfish behavior.’’ The second is to define a measure of the efficiency of each outcome, often referred to as a welfare function. 144 V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 The third is to define an index that quantifies the loss of efficiency resulting from selfish and uncoordinated behavior. Regarding the first step, the formal model most often chosen to define ‘‘the outcome of selfish behavior’’ is the Nash equilibrium, of which the Nash equilibrium in pure strategies is the simplest to illustrate. A pure-strategy (Nash) equilibrium is defined as follows: each of n players in a non-cooperative game selects a (pure) strategy si from a set of strategies Si (i = 1, 2,. . ., n), with the resulting utility or payoff ui(s) of player i having as its argument the strategy profile s ¼ fs1 ; s2 ; . . . sn g. That is, the payoff of a player, in principle, depends on the strategies chosen by all the players. A strategy profile is called a pure-strategy equilibrium if no player can be better off by unilateral deviation, i.e., ui ðsÞ P ui ðs0i ; si Þ for every i and s0i 2 Si , where si denotes the strategies chosen by all the n players other than player i. It then follows that in equilibrium each strategy is a best response to the equilibrium strategies chosen by the other players. More generally, an equilibrium is defined over mixed strategies, which is a broader notion than pure strategies. A mixed strategy for player i is one in which i chooses over the strategies in Si with a specific probability assigned to each strategy; if one of these probabilities is 1, then the strategy becomes a pure strategy. A mixed-strategy equilibrium is one in which a player cannot be better off by deviating from his mixed strategy, given the mixed strategies of all other players. Regarding the second step, natural candidates for measuring efficiency include the sum of the utilities of all the players (utilitarian function), the minimum utility across all the players (an egalitarian function), or any other function that is deemed meaningful for the particular social dilemma situation under consideration. The most commonly chosen function – the one that we use throughout this paper – is the utilitarian function that is the sum of all the players’ utilities. The third step can be fulfilled by the notion of price of anarchy (PoA), which is the focus of this paper. The notion was first developed by the computer scientists Koutsoupias and Papadimitriou (1999), who proposed the quantification of the loss of efficiency resulting from selfish behavior by the ratio between the social welfare of the worst-case equilibrium (i.e., the equilibrium with the lowest social welfare among all equilibria) and that of the socially optimal outcome (i.e., the joint outcome that maximizes social welfare regardless of whether it is an equilibrium or not). Papadimitriou (2001) then named this index the price of anarchy. We formally illustrate this concept as follows, but for simplicity limit ourselves to pure-strategy outcomes; it is straightforward to generalize the formulation to include all mixed-strategy outcomes. Consider a game played by a set of n players with strategy set Si for each player i and well-defined utilities or payoffs ui(s) (i = 1, 2, . . ., n) for each strategy profile s as discussed before; as such, any joint outcome of the game can completely be defined as a strategy profile. Denote by S the set of all possible strategy profiles. One can then define, as suggested earlier in this section, a measure of efficiency W(s) for each strategy profile (i.e., joint outcome) s, which is the total utility summed over the individual utilities of all the players in the outcome. We next identify a subset E # S to be the set of strategy profiles that are pure-strategy equilibria. If the payoffs in all equilibria are positive, then the price of anarchy is defined by: PoA ¼ maxWðsÞ=minWðsÞ: s2S s2E If the payoffs are framed as costs, then in measuring efficiency, instead of social welfare to be maximized we should consider social cost (the total cost summed over the individual costs of all the players) to be minimized. The price of anarchy is then defined by: PoA ¼ maxCðsÞ=minCðsÞ; s2E s2S where C(s) is the total cost summed over the costs of all players in an outcome with strategy profile s. The PoA can similarly be defined when mixed-strategy outcomes are also considered. Note that the PoA is always defined so that it is larger than or equal to unity. As can be intuited from the definitions, the higher the PoA, the more ‘‘severe’’ the social dilemma is.5 A social dilemma game has a well-defined PoA whenever players’ payoffs in all equilibria can be framed as all positive gains or all costs relative to a natural reference point, so that the social welfare measures in the worst-case equilibrium as well as the socially optimal outcome would have the same sign. Therefore, for example, the application of the PoA is problematic in the 2 2 PD game in which mutual cooperation yields 15 units while mutual defection yields 15 units to each player. However, in many social dilemma experiments, the condition of applicability is fulfilled as players typically have positive payoffs even in the worst-case equilibrium (such as keeping all their endowments in a public goods game).6 In the social dilemmas in networks that are introduced in this article, this condition is also fulfilled as payoffs can always be framed as costs of travel. The PoA has the desirable empirical property of being invariant to re-scaling of the payoff units – measuring the payoffs in cents rather than dollars would not change the PoA of a social dilemma situation. Thus, while it is not necessarily the only possible index of inefficiency in social dilemmas (for example, one might suggest that the difference, rather than ratio, between welfare in the socially optimal outcome and the worse-case equilibrium could also serve as a valid index), it is nevertheless a convenient and meaningful index; the PoA has, indeed, found many applications in algorithmic game theory (see e.g. Nisan, Roughgarden, Tardos, & Vazirani, 2007) in problems as diverse as load balancing, traffic routing, fair cost-sharing allocation, and network design. To sum up, the PoA is an index that is applicable across many social dilemma games and is useful in succinctly highlighting the games’ welfare properties. As such, it is instructive to compare the PoA with another example of constructing indices for social dilemma games, namely, the two indices of cooperation proposed by Rapoport and Chammah (1965, chap. 1). Those indices are tailored specifically to the 2 2 PD game and make use of all four payoffs in the symmetric PD payoff matrix (see Fig. 1A). One of the indices is arguably a satisfactory predictor of cooperation in Rapoport and Chammah’s experimental data. However, both indices are restricted to the 2 2 PD game; they are not generalizable to other n n PD games (n > 2) and certainly not to other social dilemmas. On the other hand, the PoA allows comparison of different games to one another, and not only variants of the same game that differ from one another in their payoffs. Its generalizability is an advantage that Rapoport and Chammah’s indices do not have. Moreover, Rapoport and Chammah’s indices were constructed purely as candidates for predicting experimental cooperation with no other substantive theoretical motivations behind them. By contrast, the PoA has clear and general welfare motivations behind its construction, so that the question of whether/how the PoA influences subjects’ perception of a social dilemma is intrinsically important, as we shall state below. A major purpose of the present paper is to extend the idea of 5 Note that the extent of inefficiency is not the only dimension of the ‘‘severity’’ of a social dilemma. Kollock (1998, p. 185) mentions another dimension of severity as whether the strategies in the socially inefficient equilibrium are strictly dominant. The classic public goods game is among the most severe social dilemmas along this dimension. 6 On the other hand, in game-theoretic analysis final outcomes are stated as utilities (and not numerical values) that are unique up to an affine transformation. Therefore, the payoffs in a social dilemma may be transformed to be all positive in an experimental implementation with no substantive impact on the dilemma’s gametheoretic characteristics as derived from the expected utility approach. V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 145 Fig. 1. Payoff matrices for two 2 2 games exhibiting social dilemmas. In both cases, (C, C) is the socially optimal joint outcome. PoA to the field of experimental social dilemma research. We argue that the idea, which captures the extent of inefficiency in social dilemmas, has much potential in inspiring new experimental research, which has largely been preoccupied with whether socially inefficient outcomes occur and what factors influence the probability of their occurrence. To begin with, the PoA can serve a descriptive purpose in objectively characterizing the ‘‘severity’’ of a social dilemma in terms of the welfare change from the worst-case equilibrium to the socially optimal outcome. It may then inspire enquiries in experimental research such as: (1) Is subjects’ psychological perception of a social dilemma influenced by the dilemma’s PoA? Which aspects of that perception might be influenced? How might this relationship depend on the specific payoff structure of the experimental game, contextual factors in the experiment, or individual difference variables? (2) If the PoA influences subjects’ perception of a social dilemma, then how might that influence be translated to subjects’ motivation to cooperate? Would an increase in the PoA lead to a higher or lower motivation to cooperate that is mediated by a change in the perception of the social dilemma (see also the Discussion after Example 4 in the next section)? Lastly, as with (1), how might this influence be moderated by the payoff structure of the game, contextual factors, or individual difference variables? We do not venture to provide definite answers or formulate specific hypotheses for these questions, but simply propose them as major potential avenues for researchers to explore. We note that research into these questions may have practical policy implications when deciding what steps to take (if at all) to decrease inefficiency in real-life social dilemmas. Despite the apparent importance of these questions, experimental social dilemmas research in psychology and economics has paid relatively little attention to the extent of inefficiency in social dilemmas, be that the theoretical extent (as derived from game theory analysis) or the empirical extent (as observed in experiments). The predominant dependent variable in experimental social dilemma research is the subject’s choice of strategy, such as the amount of contribution in a public goods game; in step-level public goods game, the frequency of the optimal social outcome being attained (i.e. provision of the public good) is also a much reported variable. Quantification of welfare loss, meanwhile, has seldom been on the agenda. While efficiency is usually mentioned and its measurement occasionally reported (in the form of subject earnings), the quantified efficiency loss of theoretical or observed outcome relative to the social optimum has rarely been discussed. Even more importantly, there is scarcely any attempt to investigate how the extent of inefficiency influences subjects’ perception of the dilemma and their psychological motivation in behaving cooperatively.7 As the PoA was first discussed in the context of social dilemmas in networks, a second objective of this article is to introduce this class of social dilemmas to experimental researchers. In the rest of this article, we first illustrate the PoA with a number of examples, starting with the classical paradigms in social dilemma research, and then proceeding to social dilemmas in networks. We next introduce a number of extensions, such as the notion of the price of stability, replacing equilibrium play with play intended to minimize regret, and the idea of price of empirical anarchy. We conclude the paper with a discussion on future research directions. Examples from well-known social dilemmas In this section, we illustrate the use of PoA with a number of well-known social dilemmas, beginning with the PD game. Example 1: The Prisoner’s Dilemma (PD) game The simplest and most well-known example of social dilemma is the 2 2 PD game. Fig. 1A depicts the payoff matrix of the symmetric version of this game. In the figure, si = C denotes the decision to cooperate, si = D the decision to defect, R is the reward 7 The observations in this paragraph can find support in a look at the social psychology studies on social dilemmas cited in the previous section. Regarding experimental economics research on social dilemmas, similar support can be obtained from examining review texts such as Ledyard (1995) on public goods games, Croson and Marks (2000) on step-level public goods games, and relevant sections in Plott and Smith (2008), e.g. chaps. 51 and 52 and Part 6.1. 146 V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 to each player for mutual cooperation, T the temptation to defect, S the ‘‘sucker’s payoff’’, and P the punishment for mutual defection (e.g., Axelrod, 1984; Rapoport & Chammah, 1965). Define the social welfare function as the sum of the two players’ payoffs. The defining inequalities for this dilemma are T > R > P > S and R > (T + S)/2; the unique equilibrium is then (D, D) (mutual defection) with total payoff 2R while the socially optimal outcome is (C, C) (mutual cooperation) with total payoff 2P. Consider the case when P > 0, so that the equilibrium payoffs are positive and the concept of PoA is applicable. As such, PoA = R/P, a quantity that (as should be expected) increases in R and decreases in P. For example, if T = 5, R = 4, P = 2, S = 1, then PoA = 4/2 = 2. Rapoport and Chammah (1965) found that the index (R P)/ (T S) had a positive correlation with degree of cooperation in their experimental data across various PD games. This suggests that a higher PoA (i.e., higher R and lower P) empirically increases cooperation in a PD game, all else being equal. Example 2: The game of chicken The Prisoner’s Dilemma game has a unique equilibrium that is socially inefficient. In this sense, it is different from the game of Chicken, which has played an important role in, e.g., the analysis of problems of strategic interaction between nations (e.g., Schelling, 1963). This game is obtained from the PD game by swapping the two least-preferred outcomes in the individual preference ordering of the four outcomes. Fig. 1(B) exhibits the symmetric form of the game. The inequalities that define the game are T > R > P > S and R > ðT þ PÞ=2. The socially optimal outcome is (C, C) with payoff (R, R), which is not an equilibrium. There are two pure-strategy equilibria (C, D) and (D, C) with associated payoffs (P, T) and (T, P), respectively, and the total group payoff is T + P in either equilibrium. The game also has a mixed-strategy equilibrium where each player chooses strategy C (‘‘cooperate’’) and D (‘‘defect’’) with respective probabilities q and 1 q, where q is computed from R q þ P ð1 qÞ ¼ T q þ S ð1 qÞ i.e., q ¼ ðP SÞ=½ðP SÞ þ ðT RÞ: In the mixed-strategy equilibrium, each player earns the same expected payoff of Rq þ Pð1 qÞ ¼ ½RðP SÞ þ SðT RÞ=½ðP SÞþ ðT RÞ. Consider the case where all payoffs are positive, so that the concept of PoA is applicable. The PoA is then the total payoff 2R in the socially optimal outcome divided by that of the worstcase equilibrium, which will be the lowest total payoff among all equilibria. The general formula for the PoA is, therefore, maxf½2R=ðP þ TÞ; R½ðP SÞ þ ðT RÞ=½RðP SÞ þ PðT RÞg: For example, if T = 5, R = 4, P = 2, and S = 1 (the same set of numbers as used in our numerical example for the PD game), then, in the mixed-strategy equilibrium, strategies C and D are played with equal probabilities by each player. Denote the mixed strategy by (1/2C, 1/2D). The total group payoffs associated with the three pairs of equilibrium strategies (C, D), (D, C), and (1/2C, 1/2D) are 7, 7, and 6, respectively, so that the worst-case equilibrium is in mixed strategies. The total group payoff in the socially optimal outcome is 2R = 8. Thus, PoA = 8/6 = 4/3. Compared with the PD game example with the same set of numbers arranged differently in the payoff matrix (compare Fig. 1A and B), the PoA is smaller. Intuitively, this is due to the fact that, in the PD, the socially inefficient outcome involves mutual defection and therefore mutual destruction of payoffs, so that both players receive the lowest possible payoffs in the game. However, in the game of Chicken, all the three equilibria involve some probability of occurrence of an outcome (C, D) or (D, C), in which one player obtains the highest possible payoff in the game while the other obtains a low payoff. This suggests that the social dilemma in the game of Chicken is not as ‘‘severe’’ as that in the PD game in terms of the welfare properties of the two dilemmas, and the difference in PoA values in our numerical example captures this intuition. That said, we do not infer that cooperation should then be more likely with the game of Chicken than with the PD game upon a rearrangement of payoff parameters as described. Systematic experimental comparisons are needed to investigate such a possibility. Example 3: The classic public goods game Next, consider a very simple version of the classic public goods game played by n players, where every player is endowed with one unit and must decide independently and simultaneously whether to keep it for himself or contribute it for the provision of the public good. If the total contribution turns out to be k units, then every player receives a reward of rk units, where (1/n) < r < 1. This game is a social dilemma because a player always finds it a strictly dominant strategy to keep his one unit instead of contributing it, since he will be better off by 1 r as a result. Nevertheless, the socially optimal outcome is for all players to contribute their endowments so that each ends up with a payoff of rn > 1. The socially optimal total group payoff is, therefore, rn2 , while the equilibrium payoff (when every player keeps his endowment) is n. The PoA is thus ðrn2 Þ=n ¼ rn. Note that the PoA increases both with the marginal benefit of the public good, measured by r, as well as the group size n. This captures the intuition that the public goods dilemma is effectively more severe when the population becomes larger and/ or when the public good can more highly improve welfare. There is evidence that these two changes in parameter values could lead to more cooperative behavior in public goods experiments (see, e.g., Ledyard, 1995).8 Thus, it might be suggested that there is a positive correlation between PoA and cooperative behavior in public goods games. Example 4: The step-level public goods game Consider next the following ‘‘minimal contribution set’’ game studied by van de Kragt et al. (1983). Recall that each of n players is endowed with one unit and must decide independently and simultaneously whether to keep it for himself or contribute it for the public good. If m or more players (1 < m 6 n) contribute their endowments, then every player receives a reward of r > 1 units; if m 1 or fewer players contribute, then all contributors lose their endowments, whereas the non-contributors keep theirs. The socially optimal outcome, which is also an equilibrium outcome (see Introduction), has exactly m players contributing and n m players keeping their endowment, so that the total group payoff is m r þ ðn mÞ ð1 þ rÞ ¼ nð1 þ rÞ m: A worst-case equilibrium occurs when no player contributes,9 so that the total group payoff is n. Therefore, PoA ¼ ½nð1 þ rÞ m=n ¼ 1 þ r ðm=nÞ: Controlling for m, the PoA increases with the benefit of the 8 Isaac et al. (1984) suggest that contribution is positively related to the marginal per capita return (MPCR), which would be r in the present example. Ledyard (1995) and Holt and Laury (2008) discuss theoretical and empirical evidence in support of the possibility that contribution increases with group size in a classic public goods game controlling for MPCR (at least when the values of both group size and MPCR are not too large). These are consistent with a positive relationship between contribution and the PoA. 9 Clearly, in any pure- or mixed-strategy equilibrium, a player’s expected utility cannot be less than that of definitely contributing nothing (otherwise the player could be better off by switching from his equilibrium strategy to definitely contributing nothing). This implies that a player’s expected utility in any equilibrium must be at least one. Thus the equilibrium in which no player contributes must be a worst-case equilibrium. V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 public good, i.e. r, and also increases with the group size n. This observation can be interpreted with a similar intuition as proposed in our earlier discussion of a similar dependence in the classic public goods game. However, an additional feature is that the PoA decreases with the provision threshold m, controlling for the other two parameters. This suggests that a higher provision threshold leads to a ‘‘less severe’’ social dilemma. It is typically observed in experiments on step-level public goods games that a higher provision threshold renders cooperation more difficult and successful provision of the public good less likely (see e.g. Croson & Marks, 2000).10 The PoA perspective highlights the other side of coin, namely, that a higher threshold also makes cooperation less ‘‘important’’, as the improvement in welfare from the worst-case equilibrium to the socially optimal outcome is less than when the threshold is lower. Overall, this points to a positive correlation between the PoA and cooperation in step-level public goods games. Discussion Across the four classic examples we have surveyed, there seems to be much experimental evidence suggesting a positive correlation between the PoA and cooperation. It might be surmised from these streams of research that the correlation is generalizable. In other words, an increase (a reduction) in the potential welfare improvement of cooperation might affect subjects’ perception of the dilemma in a way that increases (decreases) subjects’ psychological motivation to cooperate. However, it may be premature to suggest that the positive correlation between PoA and cooperation is always true in any social dilemma. More psychological theorizing needs to be carried out to explain why a social dilemma with more dramatic welfare change from worst-case equilibrium to socially optimal outcome necessarily increases the subject’s motivation to cooperate, and more experimentation needs to be carried out to demonstrate it. So far the evidence only shows that parameter changes that lead to an increase (a decrease) in the PoA also lead to a higher (lower) likelihood of cooperation, without showing that the change in the extent of inefficiency is perceived by subjects and influences their motivation and behavior. Even if the overall thesis comes to be well supported, much remains unanswered, such as how this influence might be moderated by the specific payoff structure of the game as well as other contextual factors and individual difference variables. Social dilemmas in directed networks Transportation and communication networks provide the infrastructure to conduct much of our social activities. And yet social dilemmas in networks, which have been studied by transportation researchers interested in route choice in traffic networks and computer scientists interested in communication networks, have received relatively little attention among experimenters. In this section, we introduce a few of these dilemmas that we hope would trigger experimental research. The following examples are noncooperative games in which players need to choose over different routes in a network, and the payoffs are always framed as costs. Equilibrium outcomes are often socially inefficient, and the PoA is a convenient index to characterize the extent of inefficiency in each case. 147 In fair cost-sharing games (e.g., Anshelevich et al., 2008; Balcan, Blum, & Mansour, 2010; Monderer & Shapley, 1996), n players choose routes in a network and split the cost of the edges on their route with other users choosing the same edges. A simple case is where each player is a commuter who has to choose one of two alternatives. The first is to drive his car to a common destination at a cost of 1. The second is to share public transportation with others to the same destination, and split the exogenously determined cost of travel k (1 < k < n) evenly among the users of public transportation. Fig. 2 depicts the game. Commuters arrive one at a time in a given sequence and are not informed about the decisions of the commuters who preceded them in the sequence. It is easy to see that a worst-case equilibrium occurs when each commuter drives his own car, so that the total group cost is n units.11 Social cost is minimized if all the n users share public transportation and each pays k/n, in which case the total group cost is k. The PoA is, therefore, n/k, a quantity that increases with the group size n and decreases with the overall cost of public transportation k. In a variant of this game, k is an integer while the first k 1 commuters make their decisions without being informed about the decisions of those who preceded them in the sequence. However, every commuter who makes a decision after those k 1 is fully informed about the decisions of the m commuters who immediately preceded him in the sequence, where m < k. The socially optimal outcome (i.e., all choose public transportation) as well as the worst-case equilibrium (i.e., all drive their own cars) remain the same, and, therefore, so is the PoA. Nevertheless, we may expect that, depending on the values of k and m, an experiment on this variant may produce different empirical observations from the standard game in which players make decisions without being informed of any other player’s decisions. Example 6: Selfish routing in a network – the Pigou–Knight–Downs Paradox The next example describes a model that has been around for a long time (Downs, 1962) and was first discussed qualitatively by the economist Pigou more than 90 years ago. For a non-technical exposition, see Arnott and Small (1994), and for experimental work see Morgan, Orzen, and Sefton (2009). In its simplest form, the model involves a network with a common origin and a common destination (Fig. 3). The two nodes are connected by two parallel edges. Assume that a finite number n of network users have to travel at the same time from the origin to the destination. The top route in Fig. 3 is a wide highway which can accommodate all the n users, and as such is not susceptible to congestion. Assume that, regardless of the number of users of this road, the cost per user is fixed at 1. The bottom route is a considerably shorter but narrower road which is susceptible to congestion; the more users simultaneously drive on this road, the slower the going and, consequently, the higher the cost of travel. Assume that the cost per user for this narrow road is CðmÞ ¼ a þ bðm=nÞc , where c > 0, 1 > a P 0, b P 1 a, and m is the number of users choosing the narrow road. If c = 1, then the cost function is linear. If, in addition, a ¼ 0, then the cost of traveling on the narrow road is simply proportional to the amount of traffic on it. Consider the case when each user selfishly chooses one of the two routes in an attempt to minimize his cost of travel. Then, the concept of Wardrop equilibrium, which is commonly employed in these problems (see Beckmann, McGuire, & Winsten, 1956; Morgan et al., 2009; Wardrop, 1952), prescribes that all users incur the Example 5: Fair cost-sharing games 10 Croson and Marks (2000) suggest that contribution in a step-level public goods game is positively related to the step return (SR), which would be rn/m in our example. The dependence of SR with every game parameter is in the same direction as the dependence of the PoA with that parameter. 11 In any pure- or mixed-strategy equilibrium, a player’s expected utility must be no less than that of definitely driving his own car; otherwise, he could be better off by switching from his equilibrium strategy to definitely driving his own car. Thus, the equilibrium in which everyone drives his own car must be a worst-case equilibrium. 148 V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 d1 0 Public Transport 1 d2 0 d3 0 1 1 Destination 1 0 dn-1 1 0 dn k Fig. 2. Graphical representation of the fair cost-sharing game. Each di (i = 1, 2, 3, . . ., n) represents a commuter who has to make a decision on whether to use public transportation to get to the destination and split the cost k evenly with other users (1 < k < n), or drive his own car with cost 1 to the same destination (denoted by the direct route from commuter to destination). The number beside each route (i.e. line with arrow) indicates the total cost of using that route. PoA = n/k.. Fig. 3. An example of the Pigou–Knight–Downs with two routes, where m is the number of users of the bottom route, n is the total number of players, c > 0, 1 > a P 0, and b P 1 a. PoA = f1 ðc=b1=c Þ ½ð1 aÞ=ð1 þ cÞð1þcÞ=c g1 (see also Table 1). same traffic cost in equilibrium.12 Because a þ b P 1, the equilibrium cost must be that of the wider road i.e., 1.13 Hence, if there is an improvement in the condition of the narrow road that can be captured as a change in the parameters (e.g., a decrease in a or b or an increase in c), it will be counterbalanced by a corresponding endogenous adjustment of equilibrium demands along the two routes; as a result, the users will incur the same costs as before and will not benefit from the changes at all. This is the ‘‘paradox’’ that has been studied by Morgan et al. (2009). Next, suppose that some central authority has been designated to assign the n users to specific routes in an attempt to minimize the social welfare as measured by the total group cost of travel. Denote the number of users assigned to the narrow and wide roads by m and n m, respectively. The total group cost summed across the n users is: ðn mÞ 1 þ m ½a þ bðm=nÞc ¼ n ð1 aÞm þ b mcþ1 =nc : 12 A Wardrop equilibrium is a Nash equilibrium when its number of users are all integers. In general, the Nash equilibria converge to the Wardrop equilibria when the group size is large. See Morgan et al. (2009) for more discussion on this issue in an experimental context. 13 This must be true in an equilibrium in which some users choose the wide road. If the equilibrium is not as such, it must have all users choosing the narrow road and incur a per user cost a + b P 1; if a + b = 1, then the same conclusion is obtained as in the text. If a + b > 1, then an individual user has an incentive to unilaterally deviate and choose the wide road, so that it cannot be an equilibrium that all users choose the narrow road. The solution of the first-order condition of this expression is m ¼ n fð1 aÞ=½bðc þ 1Þg1=c ; that is, the total group cost is minimized at this value of m.14 The resulting socially optimal total group cost is n f1 ðc=b1=c Þ ½ð1 aÞ=ð1 þ cÞð1þcÞ=c g. The price of anarchy is n 1 = n (the total group cost in equilibrium) divided by the socially optimal total group cost. For example, if c = 1, a ¼ 0, and b ¼ 1, so that CðmÞ ¼ m=n on the narrow road, then the socially optimal choice of m is n/2, which results in a socially optimal total group cost of travel of ðn=2Þ 1 þ ðn=2Þ ðn=2nÞ ¼ 3n=4. The price of anarchy is then PoA = 4/3. Table 1 presents the values of socially optimal n m, m, C(m), and then the PoA, for selected values of c, while the other parameters are controlled at a ¼ 0, b ¼ 1 and n = 100. In this case, the worst equilibrium is attained when all users choose driving on the narrow road and incur a cost of 1 per user. It is easy to see that as c increases, m approaches n, and the PoA increases. However, note that an increase in c corresponds to an improvement in the condition of the narrow road, as the cost of traveling on that road decreases controlling for m. Nevertheless, the PoA increases at the same time. This is because the equilibrium total group cost remains unchanged (i.e., n) while the socially optimal total group cost decreases with an increase in the number of users assigned to the narrow road. That is, upon an increase in c, the socially optimal outcome is actually ‘‘closer’’ to the equilibrium state in which all choose driving on the narrow road, and yet the social dilemma is more severe in the sense of an increase in PoA. Example 7: Choice of routes in a network – the Braess Paradox It would seem natural to believe that increasing the capacity of an existing transportation or communication network by widening its links or adding one or more segments to the network would not worsen and, most likely, would improve efficiency. Braess (1968), an applied mathematician, shattered this belief by demonstrating that, paradoxically, adding a link that connects two alternative routes joining a common origin to a common destination may raise the equilibrium travel cost of every network user. His work on what is commonly known as the Braess Paradox (BP) has stimulated considerable theoretical research in transportation science and computer science (e.g., Roughgarden, 2005). It also has given rise to a few experimental studies in recent years (e.g., Gisches & Rapoport, 2012; Morgan et al., 2009; Rapoport, Kugler, Dugar, & Gisches, 2009; Rapoport, Mak, & Zwick, 2006). The purpose of this 14 Note that, since c > 0, the total group cost expression is convex in m for all positive values of m. Hence, if the first-order condition yields a unique positive solution (as is indeed the case here), that must correspond to a minimum over all positive values of m. 149 V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 Table 1 Price of anarchy (PoA) values for selected values of c in the Pigou–Knight–Downs Paradox in Fig. 3 with a ¼ 0, b ¼ 1, and n = 100. The ‘‘wide road’’ (‘‘narrow road’’) corresponds to the top (bottom) route in Fig. 3. The worst-case equilibrium in this case always has all users traversing the narrow road and thereby incurring a cost of 1 per user. c 1/5 1/2 1 2 3 4 5 6 7 8 Socially optimal outcome PoA No. of users of wide road, n m No. of users of narrow road, M Cost of using wide road per user Cost of using narrow road per user 60 56 50 42 37 33 30 28 26 24 40 44 50 58 63 67 70 72 74 76 1 1 1 1 1 1 1 1 1 1 0.933 0.851 0.750 0.615 0.528 0.465 0.418 0.380 0.350 0.325 example is to illustrate the games’ pure-strategy PoA in a very simple network exhibiting the BP, and then examine how it changes as the group size increases. Similarly to Example 6, we consider networks with a common origin O and common destination D that are modeled as directed graphs (cf. Fig. 4). Any edge of the graph linking node i to node j is denoted by (i, j), and the traffic on the edge is denoted by fij, which is interpreted as the number of users traversing the edge from i to j. The cost of each user who traverses the edge (i, j) is denoted by cij(fij), which, in principle, is a function of fij. The travel cost of a user is the sum of the edge costs across all the edges in the path he chooses from O to D. For equilibrium consideration, we assume that, as in previous examples, each of the n users independently seeks to choose a path that minimizes his cost of travel. Consider now the simple example in Fig. 4(A), which we call the basic network. The network includes two parallel routes, namely, O ? A ? D and O ? B ? D. Each route has two segments. One segment (A ? D or O ? B) has a fixed cost (210) and as such is not subject to congestion. A second segment (O ? A or B ? D) has a variable cost of the form cij ðfij Þ ¼ 10f ij , and is consequently subject to congestion. The socially optimal outcome, which is also an equilibrium outcome, is for half of the users to choose route O ? A ? D and the other half to choose route O ? B ? D. Next consider the three-route network in Fig. 4B, which we call the augmented network. This network is the same as the basic network with the only difference that a new link A ? B has been added to connect nodes A and B. Travel on this new link (e.g., ‘‘bridge’’) is assumed to be costless: cAB(fAB) = 0 for all 0 6 fAB 6 n. In the augmented network, each user has to choose independently one of three routes when he departs from O, namely, routes O ? A ? D, O ? B ? D, and O ? A ? B ? D. Assume that the group size (total number of players) is n = 18. The augmented network (Fig. 4B) then has a unique equilibrium in pure strategies where all users independently choose to travel on route O ? A ? B ? D. This can be verified by noticing that each unilateral deviation by a single player from route O ? A ? B ? D to either route O ? A ? D or route O ? B ? D increases the individual cost of travel. The associated equilibrium cost of travel (A) Basic 10fOA summed across all the n users is cOA(18) + cAB(18) + cBD(18) = 360. It is easy to verify that this equilibrium is socially inefficient; the total group cost of travel is minimized when half of the users (n/ 2 = 9) choose route O ? A ? D and half choose route O ? B ? D, essentially playing the basic game’s equilibrium in Fig. 4A, in which case the total group cost of travel will be reduced from 360 to 300. This is an illustration of the BP showing that, paradoxically, degradation of the network in Fig. 4B by removing the link A ? B improves performance. The PoA is 360/300 = 1.2. In the more general case when cij ðfij Þ ¼ aij þ bij fij in any edge (i, j), where aij and bij are non-negative constants, the following conditions are necessary and sufficient for the BP to appear across the two networks in Fig. 4 (see Penchina, 1997): 1. The network game must have both fixed user costs and variable user costs. 2. The two paths in the basic network must have an opposite order of appearance of the edges associated with fixed vs. variable costs. 3. The fixed cost of traversing the edge A ? B in the augmented network must be less than the difference in fixed costs between the edges dominated by fixed costs and those dominated by variable costs. Other networks that exhibit the BP may be constructed with more paths (see, e.g., Gisches & Rapoport, 2012; Rapoport et al., 2009; Roughgarden, 2005). The realization of the BP depends jointly on the cost structure and the group size n. We next examine the augmented network in Fig. 4B keeping the cost structure unchanged but letting n increase from 2 to 42; only even values of n are considered. Table 2, which is based on the analysis conducted by Rapoport et al. (2006), presents the essential results. For each value of n, the table shows the worst-case pure-strategy equilibrium number of users choosing each of the three routes (columns 2–4), the associated total group cost (column 5), the socially efficient minimum total group cost (column 6), and the associated PoA values (right-most column). For example, if we set n = 18 as before, then under (B) Augmented A D 210 10fBD B A 10fOA 210 O 1.072 1.174 1.333 1.626 1.896 2.151 2.394 2.630 2.858 3.081 O 210 D 0 210 10fBD B Fig. 4. The networks used in Example 7 to illustrate the Braess Paradox. 150 V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 Table 2 Pure-strategy price of anarchy (PoA) for selected values of n (all even numbers) in the Braess Paradox in Fig. 4. In the table, when n 6 14, the socially optimal outcome has all users using route O ? A ? B ? D; otherwise, the socially optimal outcome has half of the group using route O ? A ? D and half using route O ? B ? D. n Worst-case equilibrium 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42 No. of users of O ? A ? D No. of users of O ? B ? D No. of users of O ? A ? B ? D Cost per user 0 0 0 0 0 0 0 0 0 0 1 3 5 7 9 11 13 15 17 19 21 0 0 0 0 0 0 0 0 0 0 1 3 5 7 9 11 13 15 17 19 21 2 4 6 8 10 12 14 16 18 20 20 18 16 14 12 10 8 6 4 2 0 40 80 120 160 200 240 280 320 360 400 420 420 420 420 420 420 420 420 420 420 420 equilibrium all the players choose route O ? A ? B ? D for a total cost of 360. Meanwhile, the total cost of play is minimized at 300 if routes O ? A ? D and O ? B ? D are each chosen by nine users. The PoA, as noted before and stated in Table 2, is 1.2. For another example, if we set n = 26, then under equilibrium 5, 5, and 16 users choose routes O ? A ? D, O ? B ? D, and O ? A ? B ? D, respectively, for a total cost of travel of 420. Total cost of travel is minimized to the value of 340 if each of the routes O ? A ? D and O ? B ? D is chosen by n/2 = 13 users. The PoA is then 1.235, which is higher than that when n = 18. But when n is further increased to 34, the PoA drops again to 1.105. Rapoport et al. (2006) summarized the findings of their purestrategy equilibrium analysis as follows: If n 6 42, the individual cost of travel in any pure-strategy equilibrium increases in n but never exceeds 420. If 2 6 n 6 14, then in equilibrium all the users choose route O ? A ? B ? D and the equilibrium solution is efficient. If 16 6 n 6 20, then once again in equilibrium all the n players choose route O ? A ? B ? D. However, the equilibrium solution is no longer efficient.15 If 22 6 n 6 40, then in equilibrium only a fraction of the n users choose the route O ? A ? B ? D. The size of this fraction decreases as n increases. The equilibrium solution is still inefficient. If 42 6 n, then the route O ? A ? B ? D is never chosen. The equilibrium solution is efficient.16 15 Note that choosing the route O ? A ? B ? D is a strictly dominant strategy for every player whenever n 6 20. This is because, for any player, the maximum expected number of other players traversing O ? A is n 1, so that the cost of traversing O ? A is no more than 10n, which is less than 210 whenever n 6 20; and likewise with B ? D. This means that the equilibrium in which all players choose O ? A ? B ? D is unique whenever n 6 20. 16 When 42 < n there also exists a symmetric mixed-strategy equilibrium in which every player randomizes with 0.5 probability between choosing O ? A ? D and choosing O ? B ? D. In this equilibrium, for any player, the expected number of other players traversing O ? A is (n-1)/2, and similarly for the expected number of other players traversing B ? D. The expected travel cost of O ? A ? D and O ? B ? D are the same and equal to 210 + 10{1+[(n-1)/2]}=215 + 5n, which is higher than the purestrategy equilibrium travel cost of 210 + 5n, and, as such, is inefficient. In this case, if we consider not only pure-strategy but also mixed-strategy equilibria, the PoA is larger than one. Cost per user in socially optimal outcome PoA 40 80 120 160 200 240 280 290 300 310 320 330 340 350 360 370 380 390 400 410 420 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.103 1.200 1.290 1.313 1.273 1.235 1.200 1.167 1.135 1.105 1.077 1.050 1.024 1.000 Table 2 further shows that the change in the pure-strategy PoA value is non-monotonic. The PoA first increases in n reaching a maximum when n = 22, and then decreases. The BP is only manifested for 16 6 n 6 40. In fact, Roughgarden and Tardos (2002, 2004) proved that if the cost functions are all linear, then for any generalized routing problem the pure-strategy PoA cannot exceed 4/3.17 Extensions and generalizations The ideas and procedures presented above may be extended in several different directions. First, we introduce and consider another index, called the price of stability, which focuses on the best – rather than the worst – equilibrium. Secondly, arguing that it is not always realistic or reasonable to assume that all the decision makers in a social dilemma situation necessarily play strategies that form Nash equilibrium, we consider other developments that invoke weaker and more realistic assumptions about the behavior of selfish individuals. In particular, we discuss the case where individuals are allowed to play any sequence of actions in an attempt to minimize their regret with respect to the best outcome that they could possibly achieve. Finally, we further generalize the notion of PoA to empirical arenas by proposing a similar index, called the price of empirical anarchy, which is based on observed and replicable patterns of behavior rather than theoretical concepts like worst-case equilibrium. The price of stability Strategies that jointly result in optimal outcomes are socially desirable but often inherently unstable; when decision makers are autonomous, these strategies are unenforceable as individuals may defect from them with no impunity. On the other hand, equilibrium strategies are stable, as no decision maker may benefit 17 The PoA value may be enhanced experimentally by subtracting the cost of travel from a fixed endowment, G, and thereby focusing on gains rather than losses. This is powerful manipulation to increase the effect of structural changes in the network on the payoffs associated with the choice of alternative routes. For example, consider the cost structures of the two games displayed in Fig. 4, and set the endowment G = 400. Consequently, the equilibrium payoffs in the basic and augmented network games are 400–300 = 100 and 400–360 = 40, respectively. The resulting PoA is equal to 2.5 (see, e.g., Rapoport et al., 2006, 2009). V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 from unilateral deviation; however, when the equilibria are inefficient, they are socially undesirable. The PoA is proposed to characterize how much worse the solution quality of the Nash equilibrium can be relative to the quality of the socially optimal outcome. But why focus on the worst equilibrium? Anshelevich et al. (2008) point out that in most network applications agents are not completely unrestricted; rather, they interact with a protocol that proposes a collective solution that each agent may either accept or reject. Consequently, protocol designers in communication networks may wish to pursue the best Nash equilibrium in terms of social welfare. This argument extends to social science applications where instead of a protocol of play determined by the network designer, unenforceable agreements are reached endogenously by some sort of pre-play communication, or suggested to the group of decision makers by a leader specifically nominated for this purpose. The best Nash equilibrium is naturally viewed as the optimal solution subject to the constraint that no decision maker has an incentive to unilaterally deviate from it once it is offered. The price of stability (PoS) is then defined as the ratio between the quality of the optimal ‘‘centralized’’ solution and the solution quality of the best Nash equilibrium. When considering only pure-strategy outcomes and using the notations defined in the section ‘‘The Price of Anarchy’’, the PoS can formally be expressed as: PoS ¼ maxWðsÞ=maxWðsÞ: s2S s2E If the payoffs are framed as costs, then the expression takes the form: PoS ¼ minCðsÞ=minCðsÞ: s2E s2S The PoS may similarly be defined when mixed-strategy outcomes are also considered. By definition, 1 6 PoS 6 PoA when both indices are defined on the same set of outcomes (i.e., only purestrategy outcomes or all pure- and mixed-strategy outcomes), with the equality PoS = PoA holding if all equilibria are equally efficient or if there is only one, unique, equilibrium. 151 with the PoA and PoS. They consider four classes of games that include Hotelling games, market sharing games, traffic routing games, and multiple-item auctions, and prove that in the first three cases PoA and PoTA are the same even where play is not converging to equilibrium. The price of empirical anarchy The inefficiency indices introduced above are all based on theoretical notions such as equilibrium or regret minimization. However, it is also obviously important to quantify the extent of empirical inefficiency in any specific instance of social dilemma. This, in fact, is a simple progression from the ideas presented above. We propose an index, called the price of empirical anarchy (PoEA), which can be applied to experimental research. Given a specific empirical setting of social dilemma, the PoEA is defined as the total payoff in the socially optimal outcome divided by the observed average total payoff in cases where players have achieved some form of stable convergence of behavior. For example, consider again the PD game notated in Fig. 1A with T = 5, R = 4, P = 2, S = 1. The socially optimal total payoff is 2R = 8, while the equilibrium total payoff is 2P = 4. Now suppose that, in an experiment on this game, participants play in fixed groups of two, and each group plays the same game repeatedly across many trials. Suppose that it turns out that 60% of the groups end up mutually cooperating in all later trials, while 40% end up mutually defecting in all later trials. Then, the average total payoff among all those groups is 2R 60%þ 2P 40% ¼ 6:4. The PoEA is then 8/6.4 = 1.25. This result may be compared with the theoretical PoA for this set of game parameters, which is R/P = 4/2 = 2 (see earlier discussion). That is, in this example of empirical play of the PD game, the social dilemma turns out to be less severe than theoretically predicted. This is an observation that has been commonly obtained in research reported by, e.g., Rapoport and Chammah (1965), but here, by comparing the PoA and PoEA, we may quantify the difference in the degree of severity. Minimization of regret Conclusions Blum, Hajiagahayi, Ligett, and Roth (2008) weakened the notions of PoA and PoS by arguing that it does not seem realistic to assume that all decision makers will play strategies that form a Nash equilibrium. The argument falls in three parts. First, even under a centralized authority that coordinates all the individual strategies, Nash equilibria may be difficult for the decision makers to compute. Second, even if all the Nash equilibria are easy to compute, there is no particular reason to believe that independent self-interested decision makers will converge to them. Other studies (Goemans, Mirrokni, & Vetta, 2006; Mirrokni & Vetta, 2004) also have questioned the plausibility of selfish decision makers converging to Nash equilibria, and so have many experiments in behavioral economics (see, e.g., Camerer, 2003). Thirdly, for games with only mixed-strategy equilibria, the assumption that decision makers not only play so as to maximize their individual utilities, but also so as to preserve the stability of the entire system, seems untenable. After all, there is no immediate incentive for decision makers to play specified mixed strategies as opposed to any of the pure strategies in the support of the mixed strategy, as they yield the same expected utility in equilibrium. Instead, Blum et al. proposed a weaker assumption that every decision maker attempts to minimize the utility difference (i.e. ‘‘regret’’) between his temporal sequence of actions with respect to what could be his optimal actions in hindsight (see Parks, Sanna, & Posey, 2003, for an experimental study based on a similar idea). They then define the price of total anarchy, PoTA, in a similar way as In introducing the notion of price of anarchy and presenting dilemmas in networks, our work answers to the objective of the OBHDP Special Issue on Social Dilemmas regarding ‘‘what methodological improvements and innovations would enhance social dilemma research,’’ (see the Call for Papers) and, as we hope, may help in advancing the social dilemma literature. Indeed, a cursory look at the experimental literature in this field tells us that the extent of inefficiency has received relatively little attention. Rather, experimental social dilemma research is typically concerned with whether a socially inefficient outcome occurs in settings like the public goods game, and what factors influence the probability of their occurrence. We argue that if the potential loss of welfare is not large enough to justify the extra measures such as sanctions to elicit cooperation or the cost incurred in establishing and maintaining some central authority, it is not worth implementing those extra measures. We suggest that it is an important question to understand how the potential extent of inefficiency in a social dilemma situation, such as indexed by the PoA, is related to the empirical probability of occurrence of socially inefficient outcomes, as well as the empirical efficiency that is observed when human players are engaged in the situation. It is no less important to investigate how any such relationship is caused by the influence of the extent of inefficiency on subjects’ psychological motivation in behaving cooperatively. An example that we have used to drive home our point is Rapoport and Chammah (1965), who reported experimental results on a 152 V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 series of PD games with varying parameters. While their objective was to study how the percentage of mutual cooperation changed with the parameters, their data could alternatively be examined and analyzed from the point of view of how the theoretical and empirical extents of social inefficiency, as indexed by the PoA and the price of empirical anarchy, respectively, change with each other and the other parameters. These relationships could be different from one setting to another, and moderating contextual and individual difference factors may act on them through interesting psychological processes. Another example is the traffic network studied by Rapoport et al. (2006), the PoA of which changes non-monotonically with group size (Table 2). Rapoport et al. (2006) experimented on this network with group size of 10, 20, and 40 respectively, and found that subject behavior largely conformed to equilibrium, so that the price of empirical anarchy was consistently close to the PoA. If their results can be interpolated for other values of group size between 10 and 40, then the setting is one in which both theoretical and empirical extents of inefficiency in a social dilemma change non-monotonically with the size of the population. Such possibilities invite further experimental study in social dilemmas in networks. Experimental research on social dilemmas in networks has been relatively rare, despite its obvious relevance to group behavior in social situations like traffic congestion. While the Pigou–Knight– Downs Paradox and Braess Paradox have begun to receive some, but still scant, attention in recent years, the fair cost-sharing game or its variants (see earlier discussion on this game) awaits systematic experimentation. In particular, in all of these situations, it is important to consider whether the motivational and strategic solutions that have been examined in other social dilemmas (see e.g. Dawes, 1980; Kollock, 1998; Messick & Brewer, 1983) remain applicable to elicit socially optimal cooperation, or whether they should be modified significantly. Also, payoff uncertainty in these situations (e.g., choosing a route with uncertainty over the weather conditions) must have significant impact on the outcome and the extent of social inefficiency; future research along this direction is warranted. Acknowledgement We gratefully acknowledge financial support from the National Science Foundation grant SES-0752662 awarded to the University of Arizona. References Anshelevich, E., Dasgupta, A., Kleinberg, J., Tardos, É., Wexler, T., & Roughgarden, T. (2008). The price of stability for network design with fair cost allocation. SIAM Journal of Computing, 38, 1602–1623. Arnott, R., & Small, K. A. (1994). The economics of traffic congestion. American Scientist, 82, 446–455. Axelrod, R. (1984). The evolution of cooperation. New York: Basic Books. Balcan, M.-F., Blum, A., & Mansour, Y. (2010). Circumventing the price of anarchy: Leading dynamics to good behavior. Proceedings of Innovations in Computer Science (ICS), 201–213. Beckmann, M. J., McGuire, C. G., & Winsten, C. B. (1956). Studies in the economics of transportation. New Haven, CN: Yale University Press. Blum, A., Hajiagahayi, M. T., Ligett, K., & Roth, A. (2008). Regret minimization and the price of total anarchy. In Proceedings of the 40th annual ACM symposium on theory of computing (pp. 373–382). Braess, D. (1968). Über ein paradoxon der verkehrsplanung. Unternehmensforschung, 12, 258–268. Brewer, M. B. (1979). In-group bias in the minimal group situation: A cognitive– motivational analysis. Psychological Bulletin, 86, 307–324. Camerer, C. F. (2003). Behavioral game theory: Experiments in strategic interaction. Princeton, NJ: Princeton University Press. Chen, X., Au, W. T., & Komorita, S. S. (1996). Sequential choice in a step-level public goods dilemma: The effects of criticality and uncertainty. Organizational Behavior and Human Decision Processes, 65, 37–47. Croson, R. T. A., & Marks, M. B. (2000). Step returns in threshold public goods: A meta- and experimental analysis. Experimental Economics, 2, 239–259. Dawes, R. M. (1980). Social dilemmas. Annual Review of Psychology, 31, 169–193. Dawes, R. M., McTavish, J., & Shaklee, H. (1977). Behavior, communication, and assumptions about other people’s behavior in a commons dilemma situation. Journal of Personality and Social Psychology, 35, 1–11. Downs, A. (1962). The law of peak-hour expressway congestion. Traffic Quarterly, 16, 393–409. Edney, J. J. (1980). The commons problem: Alternative perspectives. American Psychologist, 35, 131–150. Gisches, E. J., & Rapoport, A. (2012). Degrading network capacity may improve performance: Private versus public monitoring in the Braess Paradox. Theory and Decision, 73, 267–293. Goemans, M., Mirrokni, V. S., & Vetta, V. (2006). Sink equilibria and convergence. In Proceedings of the 46th annual IEEE symposium on foundations of computer science (pp. 142–154). Hardin, G. (1968). The tragedy of the commons. Science, 162, 1243–1248. Holt, C. A., & Laury, S. K. (2008). Theoretical explanations of treatment effects in voluntary contributions experiments. In C. R. Plott & V. L. Smith (Eds.), Handbook of experimental economics results. Amsterdam: North-Holland. Isaac, R. M., & Walker, J. (1988). Group size effects in public goods provision: The voluntary contribution mechanism. Quarterly Journal of Economics, 103, 179–199. Isaac, R. M., Walker, J., & Thomas, S. H. (1984). Divergent evidence on free riding: An experimental examination of possible explanations. Public Choice, 43, 113–149. Kollock, P. (1998). Social dilemmas: The anatomy of cooperation. Annual Review of Sociology, 24, 183–214. Komorita, S. S., & Parks, C. D. (1994). Social dilemmas. Madison, WI: Brown & Benchmark. Koutsoupias, E., & Papadimitriou, C. H. (1999). Worst-case equilibria. In 16th Annual symposium on theoretical aspects of computer science (pp. 403–414). Berlin: Springer. Ledyard, J. O. (1995). Public goods: A survey of experimental research. In J. Kagel & A. Roth (Eds.), Handbook of experimental economics. Princeton: Princeton University Press. Mak, V., & Zwick, R. (2010). Investment decisions and coordination problems in a market with network externalities: An experimental study. Journal of Economic Behavior and Organization, 76, 759–773. McCarter, M. W., Budescu, D. V., & Scheffran, J. (2011). The give-or-take-some dilemma: An empirical investigation of a hybrid social dilemma. Organizational Behavior and Human Decision Processes, 116, 83–95. McCarter, M. W., Rockmann, K. W., & Northcraft, G. B. (2010). Is it even worth it? The effect of loss prospects in the outcome distribution of a public goods dilemma. Organizational Behavior and Human Decision Processes, 111, 1–11. Messick, D. M., & Brewer, M. B. (1983). Solving social dilemmas: A review. In L. Wheeler & P. Shiver (Eds.). Review of personality and social psychology (Vol. 4, pp. 11–44). Beverly Hills, CA: Sage. Mirrokni, V. S., & Vetta, A. (2004). Convergence issues in competitive games. APPROX-RANDOM, 3122, 183–194. Monderer, D., & Shapley, L. S. (1996). Potential games. Games and Economic Behavior, 14, 124–143. Morgan, J., Orzen, H., & Sefton, M. (2009). Network architecture and traffic flows: Experiments on the Pigou–Knight–Downs and Braess paradoxes. Games and Economic Behavior, 66, 348–372. Nisan, N., Roughgarden, T., Tardos, É., & Vazirani, V. V. (2007). Algorithmic game theory. Cambridge: Cambridge University Press. Ostrom, E. (1990). Governing the commons: The evolution of institutions for collective action. Cambridge: Cambridge University Press. Ostrom, E., Gardner, R., & Walker, J. (1994). Rules, games, and common-pool resources. Ann Arbor: University of Michigan Press. Papadimitriou, C. H. (2001). Algorithms, games, and the Internet. In Proceedings of the 33rd annual ACM symposium on the theory of, computing (pp. 749–753). Parks, C. D., Sanna, L. J., & Posey, D. C. (2003). Retrospection in social dilemmas: How thinking about the past affects future cooperation. Journal of Personality and Social Psychology, 84, 988–996. Penchina, C. M. (1997). Braess Paradox: Maximum penalty in a minimal critical network. Transportation Research A, 31, 379–388. Plott, C. R., & Smith, V. L. (2008). Handbook of experimental economics results. Amsterdam: North-Holland. Rapoport, A. (1970). N-person game theory: Concepts and applications. Ann Arbor, MI: University of Michigan Press. Rapoport, A., & Chammah, A. M. (1965). Prisoner’s dilemma: A study in conflict and cooperation. Ann Arbor: University of Michigan Press. Rapoport, A., Kugler, T., Dugar, S., & Gisches, E. J. (2009). Choice of routes in congested traffic networks: Experimental tests of the Braes Paradox. Games and Economic Behavior, 65, 538–571. Rapoport, A., Mak, V., & Zwick, R. (2006). Navigating congested networks with variable demands: Experimental evidence. Journal of Economic Psychology, 27, 648–666. Roughgarden, T. (2005). Self routing and the price of anarchy. Cambridge: Cambridge University Press. Roughgarden, T. (2009). Intrinsic robustness of the price of anarchy. In Proceedings of the 41st annual ACM symposium on theory of, computing (pp. 513–522). Roughgarden, T., & Tardos, É. (2002). How bad is selfish routing? Journal of the ACM, 49, 236–259. Roughgarden, T., & Tardos, É. (2004). Bounding the inefficiency in nonatomic congestion games. Games and Economic Behavior, 47, 369–403. Sandler, T. (1992). Collective action: Theory and applications. Ann Arbor: University of Michigan Press. V. Mak, A. Rapoport / Organizational Behavior and Human Decision Processes 120 (2013) 142–153 Schelling, T. C. (1963). The strategy of conflict. Oxford: Oxford University Press. van de Kragt, A., Orbell, J., & Dawes, R. M. (1983). The minimal contribution set as a solution to public goods problems. American Political Science Review, 77, 112–122. van Dijk, E., de Kwaadsteniet, E., & De Cremer, D. (2009). Tacit coordination in social dilemmas: The importance of having a common understanding. Journal of Personality and Social Psychology, 96, 665–678. van Dijk, E., Wit, A., Wilke, H., & Budescu, D. (2004). What we know (and do not know) about the effects of uncertainty on behaviour in social dilemmas. In R. 153 Suleiman, D. Budescu, I. Fischer, & D. Messick (Eds.), Contemporary psychological research on social dilemmas (pp. 315–331). Cambridge: Cambridge University Press. Van Lange, P., Liebrand, W., Messick, D., & Wilke, H. (1992). Social dilemmas: An introduction. In W. Liebrand, D. Messick, & H. Wilke (Eds.), Social dilemmas: Theoretical issues and research findings (pp. 1–40). London: Pergamon Press. Wardrop, J. G. (1952). Some theoretical aspects of road traffic research. Proceedings of the Institution of Civil Engineers – Part II, 1, 325–378.
© Copyright 2026 Paperzz