Equilibrium Vengeance Daniel Friedman and Nirvikar Singh September 2001 Abstract This paper introduces two ideas, emotional state dependent utility components (ESDUCs), and evolutionary perfect Bayesian equilibrium (EPBE). Using a simple extensive form game, we illustrate the e¢ciency-enhancing role of a powerful ESDUC, the vengeance motive. Incorporating behavioral noise and observational noise leads to a range of (short run) Perfect Bayesian equilibria (PBE) involving both vengeful and non-vengeful types. We then derive a (long run) non-trivial EPBE where both types survive and reap mutual gains, as well as a trivial EPBE with only the non-vengeful type and no mutual gains. This is an incomplete …rst draft. Please ask permission before circulating further. Comments of all sorts are very welcome; email [email protected]. 1. Introduction This paper makes two contributions. First, it spotlights a puzzle regarding human behavior, and o¤ers a possible solution. The behavior in question is vengeance: when some culprit harms you or your loved ones, you may choose incur a substantial personal cost to harm him in return. A taste for vengeance, the desire to ”get even,” is so much a part of daily life (and the evening news) that it is easy to miss the evolutionary puzzle. We shall argue that indulging your taste for vengeance in general reduces your material payo¤ or …tness. Absent countervailing forces, the meek (less vengeful people) should have inherited the earth long ago, because they had higher …tness. Why then does vengeance persist? The present paper answers the puzzle by showing how in equilibrium vengeful individuals can achieve equal …tness with the less vengeful in a world with behavioral and observational noise. The second contribution is an apparently new equilibrium concept, evolutionary perfect Bayes equilibrium (EPBE). We believe that EPBE is germane in a wide variety of applications. It de…nes long run equilibrium (generalizing the equal pro…t condition of competitive Equilibrium Vengeance 2 markets) for games of incomplete information with possible entry, exit and/or switching among multiple player types. In this paper we use EPBE to show how vengeance can persist despite its apparent …tness handicap. Vengeance is closely tied to several vexing issues, methodological and substantive. To clear the underbrush, we begin with preliminary discussions on the nature of social dilemmas, the meaning of positive and negative reciprocity, evidence on why both are important to economists, and various modeling approaches employed so far. Section 2 presents a simple extensive form model of the basic dilemma, and shows how the equilibrium changes once vengeance is included. Section 3 introduces the viability problem that arises when an individual’s vengefulness can’t be perfectly known in advance and when behavioral errors are possible. Section 4 derives three short-run perfect Bayes equilibria, two pooling equilibria and one separating equilibrium. Section 5 looks at the long-run in which the nature and prevalence of types can evolve, and derives the resulting EPBE. Following a concluding discussion, Appendix A collects the mathematical details. 2. Preliminaries Figure 1 elucidates the fundamental social dilemma in terms of net material bene…t (x > 0) or cost (x < 0) to ”Self” and bene…t or cost (y >0 or <0) to counterparties, denoted ”Other”. Economists think most often about the mutual gains quadrant I, where actions simultaneously bene…t Self and Other. Such symbiotic actions increase social e¢ciency. Quadrant IV is the well-studied opportunistic region, where Self bene…ts at Other’s expense; the biological terms are parasitism and predation. The ‡ip side is the altruism quadrant II, where Self bears a personal cost in order to bene…t Other. Cipolla (1976) refers to behavior producing outcomes in quadrant III as stupidity, but we shall explain it later as vengeance. Social dilemmas arise from the fact that evolution directly sup- Equilibrium Vengeance 3 ports behavior that bene…ts Self, i.e., outcomes x > 0 in quadrants IV (or I) but not x < 0 in II (or III), while in contrast, e¢ciency requires outcomes above the diagonal [x + y = 0]. Social creatures (such as humans) thrive on cooperation, by which we mean devices that support outcomes in II+ and discourage outcomes in IV-. Such devices somehow internalize Other’s costs and bene…ts. Other’s benefit II. Altruism e.g., favors, parental investment y r = 1/8 I. Mutual Gains e.g., market exchange, symbiosis D II+ C Social gain II- x Self’s cost O Social loss IV+ IV- IV. Opportunism III. Vengeance e.g., shirking, predation and parasitism e.g., strikes, feuds Other’s cost Figure 1: Payo¤s to Self and Other Self’s benefit Equilibrium Vengeance 4 2.1. How Can Cooperation Persist? Biologists emphasize the device of genetic relatedness. If Other is related to Self to degree r>0, then a positive fraction other’s payo¤s are internalized via ”inclusive …tness” (Hamilton, 1964) and evolution favors outcomes above the line [x + ry = 0]. For example, the unusual genetics of insect order hymenoptera lead to r = 3=4 between sisters, so it is no surprise that most social insects (including ants and bees) belong to this order and that the workers are sisters. For humans and most other species, r is only 21 for full siblings and for parent and child, is 1/8 for …rst cousins, and goes to zero exponentially for more distant relations. On average r is rather small in human interactions, as in the steep dashed line in Figure 1, since we typically have only a few children but work and live in groups with dozens of individuals. Clearly non-genetic devices are needed to support human social behavior. Economists emphasize devices based on repeated interaction, as in the ”folk theorem” (cites). Suppose that Other returns the bene…t (”positive reciprocity”) with probability and delay summarized in discount factor ± 2 [0, 1). Then that fraction of other’s payo¤s are internalized (Trivers, 1971) and evolution favors behavior producing outcomes above the line [x + ±y = 0]. This device can support a large portion of socially e¢cient behavior when ± is close to 1, i.e., when interactions between two individuals are symmetric, predictable and frequent. But humans specialize in exploiting once-o¤ opportunities with a variety of di¤erent partners, and here ± is small, as in the same steep dashed line. Other devices are needed to explain such behavior. Here we will emphasize devices based on other-regarding preferences. For example, suppose Self gets a utility increment of ry. Hence Self partially internalizes the material externality, and undertakes behavior that is above the line [x + ry = 0]. Friendly preferences, r 2 [0, 1], thus can explain the same range of behavior as genetic relatedness and repeated interaction. However, by itself the friendly preference device is evolutionarily unstable: those with lower positive r will tend Equilibrium Vengeance 5 to make more personally advantageous choices, gain higher material payo¤ (or …tness), and displace the more friendly types. Friendly preferences therefore require the support of other devices. Vengeful preferences rescue friendly preferences. Self’s material incentive to reduce r disappears when others base their values of r on Self’s previous behavior and employ r < 0 if Self is insu¢ciently friendly. Such visits to quadrant III will reduce the …tness of less friendly behavior and thus boost friendly behavior. But visits to quadrant III are also costly to the avenger, so less vengeful preferences seem …tter. What then supports vengeful preferences: who guards the guardians? This is the central question in the present paper. 2.2. How to model other regarding preferences? Two main approaches can be distinguished in the recent literature. The distributional preferences approach is exempli…ed in the Fehr and Schmidt (1999) inequality aversion model, the Bolton and Ockenfels (1999) mean preferring model, and the Charness and Rabin (1999) social maximin model. These models begin with a standard sel…sh utility function and add additional terms capturing self’s response to how own payo¤ compares to other’s payo¤s. In Fehr-Schmidt, for example, my utility decreases (increases) linearly in your payo¤ when it is above (below) my own payo¤. The other main approach is to model reciprocal preferences directly. Building on the Geanakoplos, Pearce and Stacchetti (1989) model of psychological games, Rabin (1993) constructs a model of reciprocation for two player normal form games, extended by Dufwenberg and Kirchsteiger (1998) and Falk and Fischbacher (1998) to somewhat more general settings. The basic idea is that my preferences regarding your payo¤ depends on my beliefs about your intentions, e.g., if I believe you tried to increase my payo¤ then I want to increase yours. Such models are intractable except in the simplest settings. Levine (1998) improves tractability by replacing beliefs about others’ intentions by estimates of others’ type. Equilibrium Vengeance 6 We favor a further simpli…cation. Model reciprocal preferences as state dependent: my attitude towards your payo¤s depends on my state of mind, e.g., friendly or vengeful, and your behavior systematically alters my state of mind. This emotional state-dependent other-regarding (ESDUC) approach is consistent with Sobel (2000) and is hinted at in some other papers including Charness and Rabin. The approach is quite ‡exible and tractable, but in general requires a psychological theory of how states of mind change. Fortunately a very simple rule will su¢ce for present purposes: you become vengeful towards those who betray your trust, and otherwise have standard sel…sh preferences. Empirical evidence is now accumulating that compares the various approaches (Kagel and Wolfe, 1999; about 10 other cites). Cox (1999) and some authors of the distributional models …nd evidence favoring their models, but all other authors …nd evidence mainly favoring state dependent or reciprocal models. Our own reading of the evidence convinces us to focus on state dependent preferences, while noting that distributional preferences may also play some role. To formalize these ideas, the …rst step is to model explicitly the underlying social dilemma. 3. The Underlying Game We use a simple extensive form version of the prisoner’s dilemma, sometimes known as the Trust game (cite), shown in Panel A of Figure 2 (with payo¤s graphed in Figure 1). Player 1 (Self) can opt out (N) and ensure zero payo¤s to both players. Alternatively Self can trust (T) player 2 (Other) to cooperate (C), giving both a unit payo¤ and a social gain of 2. However, Other’s payo¤ is maximized by defecting (D), increasing his payo¤ to 2 but reducing Self’s payo¤ to -1 and the social gain to 1. The basic game has a unique Nash equilibrium found by backward induction (or iterated dominance): Self chooses N because Other would choose D if given the opportunity, and social gains are zero. Equilibrium Vengeance 7 To this underlying game we add a punishment technology and a punishment motive as shown in Panel B. Self now has the last move and can in‡ict harm (payo¤ loss) h on Other at personal cost ch. The marginal cost parameter c captures the technological opportunities for punishing others. F igure 2: Fitness Payoffs A . Basic Trust G a m e Self O ther T D (- 1, 2) N C (0, 0) (1, 1) B . Extended Trust G a m e Self O ther T D Self* [ 0 ≤ h ≤ 8 ] C N (0, 0) (- 1 - ch, 2 - h) (1, 1) * U tility p a y o f f t o S e l f i s ln h - 1 - c h C . Reduced Trust with a vengeance Self O ther T D (- 1 - v, 2 - v/c) C N (0, 0) (1, 1) Figure 2: Fitness Payo¤s Self’s punishment motive is given by state dependent preferences. If Other chooses D then Self receives a utility bonus of v ln h (but no …tness bonus) from Other’s harm h. In other states utility is equal to Equilibrium Vengeance 8 own payo¤. The motivational parameter v is subject to evolutionary forces and is intended to capture an individual’s temperament, e.g., his susceptibility to anger. See R. Frank (1988) for an extended discussion of such traits. The functional forms for punishment technology and motivation are convenient (we will see shortly that v parameterizes the incurred cost) but are not necessary for the main results. The results require only that the chosen harm and incurred cost are increasing in v and have adequate range. Using the notation ID to indicate the event ”Other chooses D,” we write Self’s utility function as U = x + vID ln h; own material payo¤ x plus the relevant ESDUC. When facing a ”culprit” (ID =1), Self chooses the reduction h in Other’s payo¤ so as to maximize U = ¡1¡ch+v ln h. The unique solution of the …rst order condition is h¤ = v=c and the incurred cost is indeed ch¤ = v. For the moment assume that Other correctly anticipates this choice. Then we obtain the reduced game in Panel C. For sel…sh preferences (v = 0) it coincides with the original version in Panel A with unique Nash equilibrium (N, D) yielding the ine¢cient outcome (0, 0). For v > c, however, the transformed game has a unique Nash equilibrium (T, C) yielding the e¢cient outcome (1, 1). The threat of vengeance rationalizes Other’s cooperation and Self’s trust. 3.1. Can vengeful preferences evolve? Consider evolution of the vengeance parameter v in an unstructured population. Assume for simplicity that the marginal punishment cost c is constant. Again for simplicity (and perhaps realism) assume that, given the current distribution of v within the population, behavior adjusts rapidly towards Nash equilibrium but that there is at least a little bit of behavioral noise. Noise is present because equilibrium is not quite reached or just because the world is uncertain. For example, Self may intend to choose N but may twist an ankle and …nd himself depending on Other’s cooperative behavior, and Other may intend to choose C but oversleeps or gets tied up in tra¢c. Such con- Equilibrium Vengeance 9 siderations can be summarized in a behavioral noise amplitude e ¸ 0. Also, Other may imperfectly observe Self’s true vengeance level v; say that the observational noise amplitude is a ¸ 0. The task now is to compute Self’s (expected) …tness W (v; a; e) for each value of v at the relevant short run equilibrium, given the observational and behavioral noise. The function W de…nes a …tness landscape in which evolution pushes the evolving trait v uphill (Wright, 1949; Eshel, 198x; Kaufman, 1993). First consider the case a = e = 0, where v is perfectly observed and behavior is noiseless. Recall that in this case the short run equilibrium (N, D) with payo¤ W = 0 prevails for v < c, and (T, C) with W = 1 prevails for v > c. Thus W (v; 0; 0) is the unit step function at v = c. In the Appendix it is shown that with a little behavioral noise (small e > 0) the step function slopes down, and with a little observational noise (small a > 0) the sharp corners are rounded o¤, as in Figure 3. As indicated, evolution pushes v downward towards 0 in the subpopulation initially below a level near c ¡ a, and pushes v in the rest of the subpopulation to a level near c + a. Equilibrium Vengeance 10 Figure 3: Fitness Was a Function of Vengefulness v a=e=0 1 a, e > 0 0 a=0<e _ v c Figure 3 4. PBE Figure 3 shows two local …tness maxima for Self, one at v=0 and the other for v = vH > c;when a and e are both small and positive. The Figure suggests that evolutionary forces will tend to move Selfs along the …tness gradient, resulting in a fraction x of the Self population with vengeance near vH and fraction (1 ¡ x) with vengeance near x = 0: The fractions are respectively the fractions of the population initially above and below the local minimum (near c ¡ a), and hence are arbitrary. This con…guration does not yet represent an equilibrium. One must check that Other should attend to the signal; if the error amplitude a is su¢ciently large relative to the fraction (1¡x) of non-vengeful types Other would be better o¤ ignoring the signal and always playing C (see Wittman, 1989, for an analogous situation in arms control.) Likewise, Equilibrium Vengeance 11 if vengeful types are su¢ciently rare relative to the error amplitude, Other might be better o¤ playing D irrespective of the signal. These considerations are incorporated in perfect Bayes equilibrium (PBE), suitably modifed to deal with large populations and trembles. In PBE one arbitrarily and exogenously speci…es for each population (Self and Other) the types and their relative prevalence and then requires that each type in each population picks the payo¤ maximizing strategy (or …tness maximizing in the reduced game) in each subgame given correct beliefs about the strategies chosen in the other population. Nature x(1-a) xa v ,1 (0,0) (1,1) (1-x)a SH (1-x)(1-a) vH,0 (0,0) S O (1,1) O (-1-v, 2-vH/c) 0,0 (0,0) O (1,1) S (-1-v, 2-vH/c) (-1, 2) 0,1 S O (-1, 2) Figure 4: Game Tree The game of incomplete information is shown in Figure 4. Nature chooses among 4 possible Self types: true preferences with v = 0 or v = vH with probabilities x and 1 ¡ x; and realized signal s = 0 or s = 1 with symmetric error rate a = Pr[s = 0jv = vH ] = Pr[s = 1jv = 0]: Self knows own preferences but not the realized signal, and Other knows the realized signal but not the true type. Self’s (0,0) (1,1) Equilibrium Vengeance 12 strategy (given his realized preference parameter v) is just the mixture probability for chosing T; the unconstrained stategy set is [0; 1] but with behavioral noise Self’s strategy set is [e; 1 ¡ e] for given 0 < e < 1=2: Other’s strategy is pair of mixture probabilities for choosing C after observing respectively s = 0 and s = 1. The behavioral noise constrained strategy set is [e; 1 ¡ e] £ [e; 1 ¡ e]: The payo¤s are as in the reduced Trust game of Figure 2C. We focus on the separating PBE in which the vH type of Self tries to Trust (i.e., plays T with probability 1 ¡ e) and the v = 0 type tries to Not Trust (i.e., plays T with probability e), while all Others try to play C (and do so with probability 1 ¡ e) when the signal is s = 1 and tries to play D when the signal is s = 0. Table 1 writes out the resulting …tness outcomes and probabilities. Other’s behavior is optimal as long as the expectation of the D payo¤ 2 ¡ v=c is greater than (less than) the C payo¤ 1when he receives the signal s = 0 (resp. s = 1). The condition simpli…es to E(vjs) · c (resp. ¸ c). The probability that a s = 0 signal will arise in this separating PBE from a vH type is x(1 ¡ e)a versus probability (1 ¡ x)e(1 ¡ a) from a v = 0 type. Routine algebra (or Bayes theorem) reveals that the critical posterior expectation E(vjs = 0) = c corresponds to prior probability a (or population fraction) xs = 1=(1+( 1¡a )( 1¡e )( vHc¡c )). This condition e y can be reexpressed in terms of log odds L(y) = ln( 1¡y ) as in the proposition below. Note that L(y) is negative for 0 < y < 1=2, e.g., for reasonable values of y = a and e, so the inequalities may look a bit strange. Equilibrium Vengeance 13 PBE Probabilities Fitness Payoff Probability Choice Self, Other v = vH v=0 (N, .) 0, 0 (T, C) 1, 1 (T, D) -(1+v), 2-v/c (N, .) (T, C) (T, D) 0, 0 1, 1 -1, 2 Separating Good Pooling Bad Pooling e (1 – e)(1 – a) (1 – e)a e (1 – e)(1 – e) (1 – e)e 1-e e.e e(1 – e) 1- e ea e(1 – a) e (1 – e)(1 – e) (1 – e)e 1-e e.e e(1 – e) Notes: In separating equilibrium, Other tries to play C if s = 1 and D if s = 0. Other always tries to play C in good Pool, and always tries to play D in Bad Pooling. The signal s = 1 with probability a 0 (0, ½) when v = 0 and is s = 0 with probability a when v = vH. The expression a = a(1 – e) + e(1 – a) = e + a – 2ae represents the probability that Other chooses his less preferred action. Figure 5: PBE Probabilities Proposition 1 Given signalling technology with error amplitude a and choice technology with tremble rate e, and given types v = 0 and v = vH > c constituting respectively Self population fractions (1 ¡ x) and x 2 (0; 1); assume that 0 < a; e < 1=2 and ® = a + e ¡ 2ae · 1=(2 + vH ). Then the separating PBE given in Figure 5 exists i¤ L(c=vH ) + L(e) + L(a) · L(x) · L(c=vH ) + L(e) ¡ L(a); the Good Pool equilibrium exists i¤ L(x) ¸ L(c=vH ) ¡ L(a), and the Bad Pool equilibrium exists i¤ L(x) · L(c=vH ) + L(a): proof. We’ve already shown that L(x) · L(c=vH ) + L(e) ¡ L(a) motivates Other to play D when he sees s = 0; the demonstration is similar that L(c=vH ) + L(e) + L(a) · L(x) motivates him to play C Equilibrium Vengeance 14 when s = 1: This veri…es the DC part of the separating PBE in Figure 5. Given this behavior by Other, Self with v = vH will face D with probability ® = (1 ¡ e)a + e(1 ¡ a) = a + e ¡ 2ae when playing T; a simple calculation shows that the expected payo¤ is nonnegative (and therefore she will try to play T) as long as ® · 1=(2 + v): Self with v = 0 will face D with probability 1 ¡ ® when playing T; and she will avoid doing so as long as ® · 1=(2 + v) = 1=2; a redundant condition. This completes the separating PBE veri…cation. If Other …nds it worthwhile to ignore the s = 0 signal because unvengeful types are quite rare, then those types will also try to play T. In the Bayes theorem calculation the expression (1 ¡ x)e(1 ¡ a) thus is replaced by (1 ¡ x)(1 ¡ e)(1 ¡ a) and the factors involving e cancel. Thus the condition L(x) ¸ L(c=vH ) ¡ L(a) ensures that Other wants to play C even when s = 0: The condition ensuring that Self indeed wants to play T is the same as before taking a = 0; so it holds a fortiori. This completes the Good Pool PBE veri…cation. Finally, if Other …nds it worthwhile to ignore the s = 1 signal because vengeful types are quite rare, then those types will also try to play N. In the Bayes theorem calculation the factors involving e again drop out, and the condition L(x) · L(c=vH )+L(a) ensures that Other prefers to play D even when s = 1: The condition e < 1=2 ensures that both types of Self prefer to play N when Other always tries to play D. This completes the Bad Pool PBE veri…cation and the proof./// Equilibrium Vengeance 15 PBE Parameter Values: a = 0.1, e = 0.05, c = 0.5, vH = 2 No Separating or Pooling PBE Separating Good Pooling Bad Pooling x = 0.136 L = - 1.85 x=0 x = 0.002 L = - 6.24 x = 0.036 L = - 3.30 x = 0.75 L = 1.10 x=1 Separating Equilibrium Vengeful type, v = vH Non-vengeful type, v = 0 Fitness function (1 – e)(1 – (2 + vH)a) e(2a – 1) Value in example 0.418 - 0.036 Good Pooling Equilibrium Vengeful type, v = vH Non-vengeful type, v = 0 Fitness function (1 – e)(1 – (2 + vH)e) (1 – e)(1 – 2e) Value in example 0.760 0.855 Figure 6: PBE A numerical example may help …x ideas. Figure 6 takes the marginal punishment cost to be 0:5 and the vengeful type to have preferred cost vH = 2:0: It assumes behavioral noise rate e = 0:05 and observational noise rate a = 0:10: Then for su¢ciently small proportions of the vengeful type (L(x) · ¡3:30 or x · 0:036) we have a Bad Pooling equilibrium in which both types of Self try to opt out and Other tries to defect regardless of signal. For an overlapping range of venge- Equilibrium Vengeance 16 ful type proportions (L(x) 2 (¡6:24; ¡1:85) or x 2 (0:002; 0:136)) we have the Separating equilibrium. No separating or pooling PBE exists for higher values of x until we reach x = 3=4 after which point we have the Good Pooling equilibrium. 5. EPBE We do not yet have an evolutionary equilibrium because the types v = 0 and vH and their population fractions x and 1¡x are exogenous. Types that achieve higher …tness should displace the less …t; in evolutionary equilibrium all existing types within any population should achieve equal and maximal …tness. In the numerical example, the vengeful type in the Separating PBE has …tness (1 ¡e)(1¡(2+v)®) = .95(1-4*.14) = 0:418, while the unvengeful type has …tness e(2®¡1) = .05(.28-1) = -0.036. Thus we should expect the fraction x of the more successful vengeful types to increase (assuming that x somehow is positive to begin with, an issue to be discussed later). A su¢cient increase in x will hit the upper bound of the separating PBE and motivate Other to play C irrespective of the signal. However, in the Good Pool equilibrium the unvengeful type is …tter; from Table 1 we see that W S;GP (vH ) = (1 ¡ e)(1 ¡ (2 + vH )e) while the unvengeful type achieves strictly greater …tness W S;GP (0) = (1 ¡ e)(1 ¡ 2e): In the numerical example the …tnesses are .95*.8 = 0.760 for the vengeful vs .95*.9 = 0.855 for the unvengeful. Thus we should expect that x will decrease in the Good Pool PBE. In neither case do we have an evolutionary equilibrium. A few minutes re‡ection suggests that there may be a hybrid equilibrium in which some fraction q of the Other population attends the signal (i.e., tries to play DC) while the rest don’t (i.e., (1 ¡ q) of Others try to play CC). This fraction q must be such that unvengeful and vengeful Selfs achieve equal payo¤, and the fraction x of vengeful Selfs must be such that Others achieve the same …tness whether or not they attend to the signal. The intuition can be seen in Figure 7: by mixing the …tness landscape in panel A with that in panel B, one can hope to Equilibrium Vengeance 17 get the landscape in panel C with equal local (and global!) maxima at v = 0 and vH : F itn e s s W a s a F u n c tio n o f V e n g e fu ln e s s v S e p a ratin g E q u ilib r iu m 1 (q ) 0 c vH G o o d P o o lin g E q u ilib r iu m 1 (1 - q ) 0 vH H y b rid E P B E 1 (q ,1 - q ) 0 vH Figure 7: Fitness W as a Function of Vengefulness v Working out such an equilibrium one encounters several complications. ² The unvengeful type Selfs must behave coherently. In the simple mix of equilibria some try to play T expecting C and a positive payo¤ while others try to play N expecting D and a negative payo¤. This is incoherent because those unvengeful Selfs who Equilibrium Vengeance 18 expect C are no more likely than those who expect D to match up with the Others who don’t attend the signal. ² In EPBE all unvengeful Selfs must either all try to play T, or all try to play N, or all must be indi¤erent. It can be shown that indi¤erence is possible only if Self’s …tness is 0, and in EPBE the vengeful type would also have …tness zero. Likewise, all unvengeful Selfs try to play N only when their …tness is negative. Since we are interested in mutual social gains, we seek an EPBE with positive …tness and therefore where all unvengeful as well as vengeful Selfs try to play T. ² The separating PBE calculation for the threshold x also needs modi…cation, since it assumes di¤erent initial choices for vengeful and nonvengeful Selfs. ² More fundamentally, the value of vH is no longer exogenous. It should maximize the …tness of the vengeful type (again, in the hybrid equilibrium, not necessarily in the separating PBE). ² The error amplitude of the signal should decrease in vH ; extremely vengeful types should be easier to distinguish from v = 0 types than slightly vengeful types. Thus we also need to take a deeper look at the signal technology when vH is endogenous. To sort this out, we begin with a general de…nition of evolutionary equilibrium. It is written in English to avoid heavy notation that won’t be needed later. De…nition 1 An evolutionary perfect Bayesian equilibrium (EPBE) is a PBE distribution over (extensive form game) strategy pro…les such that in each population the strategies in the support of the distribution achieve equal and maximal expected …tness. The de…nition appears to be new. The novelty is that the distribution over types is endogenous. Most literature so far on evolution in Equilibrium Vengeance 19 games of incomplete information allows di¤erent types to have di¤erent …tnesses and arbitrary distribution. (See Jacobsen et al, 2001, for a recent example.) Although the justi…cation is seldom spelled out, the idea seems to be that the types are determined by last minute circumstance, and evolutionary selection applies to complete type-contingent strategies rather than to the types themselves. For some applications this might be appropriate: when an animal …nds itself at di¤erent points of its life to be the large or small type, or the owner or intruder type, for example. For other cases, including the signalling game in Jacobsen et al, 2001 it seems hard to escape the interpretation that they are studying temporary or short run equilibria and in the longer run the high types (who obtain higher equilibrium payo¤) will become relatively more prevalent. Returning now to the speci…c game shown in Figure 4, we need to augment Self’s strategy set to include the entire range of vengeance types, so Self maximizes …tness in fv 2 [0; v max ]g£fPr[T ] 2 [e; 1¡e]g: Likewise, in EPBE Other uses a strategy that maximizes …tness in fPr[Cjs = 0] 2 [e; 1 ¡ e]g £ fPr[Cjs = 1] 2 [e; 1 ¡ e]g. The other modi…cation is that the observational error rate a now depends on the distance from v = 0: We assume a smooth function a(v) that is positive and decreasing, with a(0) = 1=2 (i.e., in the limit as the vengeful type becomes completely unvengeful, the types can’t be distinguished) and a(v) ! 0 as v ! 1: There is a trivial EPBE related to Bad Pool PBE in which vengeful types are so rare that Other always tries to play D and all types of Self try to play N. In such PBE the less vengeful types are more …t, and the vengeful types become extinct. Evolution then produces a Self population that all has v = 0 and tries to play N, and the Other population always tries to play D regardless of signal. It is straightforward to check that this is indeed an EPBE. We now seek a positive …tness hybrid EPBE where the Self population distribution has support consisting of two points f0; vH g with mass (1 ¡ x) and x, and Pr[T ] = 1 ¡ e for both types; and the Other population distribution has support consisting of the two points Equilibrium Vengeance 20 (1 ¡ e; 1 ¡ e) and (e; 1 ¡ e) with mass (1 ¡ q) and q. Our intuition on dynamics (mentioned above) suggests that such an equilibrium would have a large basin of attraction including the uniform distributions over the feasible strategy pro…les and all nearby distributions. For such an EPBE we have the following necessary conditions. vH = arg max fEq W S (vja(v); e)g; max (1) Eq W S (0ja(0); e) = Eq W S (vH ja(vH ); e), and (2) Ex W O ((1 ¡ e; 1 ¡ e)ja(v); e) = Ex W O ((e; 1 ¡ e)ja(v); e): (3) v2[c;v ] The …rst condition says that the vengeful type achieves maximal …tness given the exogenous tremble rate e, the endogenous signal error amplitude a(v), and the equilibrium attention probability q. The restriction v 2 [c; v max ] re‡ects the ine¢cacy of positive v less than c, and some biological limit v max >> c discussed in our earlier papers. From Table 1 (temporarily called Figure 6) one sees that vengeful types trying to play T attain …tness Eq W S (v) = (1 ¡ e)[q(1 ¡ ® ¡ (1 + v)®) + (1 ¡ q)((1 ¡ e) ¡ (1 + v)e)] = (1 ¡ e)[1 ¡ (2 + v)e ¡ qa(2 + v)(1 ¡ 2e)]: The …rst order condition 0 = dEq W S =dv simpli…es slightly to 0 = ¡e ¡ qa0 (2 + v)(1 ¡ 2e) ¡ qa(1 ¡ 2e) or [ e ]q ¡1 = ¡(2 + v)a0 ¡ a 1 ¡ 2e (4) We consider a simple Gaussian parametric form for the signal error amplitude, with precision parameter k > 0: a(v) = 0:5 exp(¡kv 2 ); so a0 = ¡2kva: (5) Substituting the a0 expressions in the …rst order condition (4) we obtain [ 1 e ]q ¡1 = a[2kv(2 + v) ¡ 1] = (kv(2 + v) ¡ ) exp(¡kv2 ):(6) 1 ¡ 2e 2 Equilibrium Vengeance 21 00 a The maximization problem (1) has second order condition ¡a 0 ¸ 2 0 00 , where ¡a > 0 and a ¸ 0: In the case of (5), the condition can 2+v be written k ¸ 2v3v+2 2 (2+v) : The second condition (2) says that vengeful and nonvengeful type Selfs coexist in the EPBE because they have equal …tness. Recall that Eq W S (v) = (1 ¡ e)[1 ¡ (2 + v)e ¡ qa(2 + v)(1 ¡ 2e)]: Recall also that we are looking for an EPBE in which even the unvengeful play T, so Eq W S (0) = (1 ¡ e)[q(® ¡ (1 ¡ ®)) + (1 ¡ q)((1 ¡ e) ¡ e)] = (1 ¡ e)[1 ¡ 2e ¡ q(2 ¡ 2a ¡ 4e + 4ae)]: Thus (2) reduces to ve = q[2(1 ¡ 2®) ¡ av(1 ¡ 2e)] = q(1 ¡ 2e)[2 ¡ a(4 + v)]: We obtain the necessary condition for Other’s mixture [ e ]q ¡1 = (2 ¡ a(4 + v))=v: 1 ¡ 2e (7) The third condition (7) says that Others who attend the signal coexist in EPBE with those that don’t. Since all unvergeful types in the candidate equilibrium try to play T, the error rate drops out for Self. The relevant error rate for Other is ® = a + e ¡ 2ae: Calculations similar to those in the Good Pool PBE lead to v¡c ® )( )): c 1¡® (8) e + qa(1 ¡ 2e) · 1=(2 + vH ): (9) x = 1=(1 + ( One algorithm for calculating EPBE (and verifying existence and uniqueness) is to note that (4) and (7) have the same left hand sides, so one can solve for v by equating their right hand sides. One then obtains q from either equation, a from its de…nition, and x from (8). To con…rm that vengeful Selfs (and a fortiori unvengeful Selfs) prefer T, one checks that Eq W S (v) is positive, i.e., 1 ¸ (2+v)e+qa(2+v)(1¡2e) or Note that (9) is the same as the corresponding condition in PBE except that ® = e + a(1 ¡ 2e) is replaced by an expression using qa instead of a: Hence (9) is easier to satisfy than the PBE condition. Equilibrium Vengeance 22 Equating the RHS of (6) and (7) we get 2kv(2 + v)a ¡ a = (2 ¡ 4a)=v ¡ a or kv3 + 2kv 2 + 2 = 1=a = 2 exp(kv 2 ): At v = 0 both sides of this last equation are equal to 2, and have equal slopes of 0. The LHS has slope 4kv(1 + 43 v) and the RHS has slope 4kv exp(kv2 ) = 4kv(1 + kv 2 + :::):For for small positive v (up to approximately v = 3 ) the LHS has steeper slope but the reverse is true for larger v 4k (indeed, the slope ratio becomes in…nite), so the equation has a unique solution for positive v. Substituting this solution vH into (6) one obtains a positive value of q for each choice of positive e as long as 1 k isn’t too small (viz, we must have v(2 + v) > 2k ). The value of q must also be less than 1.0 in EPBE, which is ensured by a tremble 1 rate e < 2+1=c ;where c is the positive value of the RHS of (6). One completes the calculation and checks of the EPBE as described above. For example, take c = 0:5, e = 0:05; and k = 0:28: Then v = 2:178; a = 0:132; q = 0:1024 and x = 0:66: The second order condition holds since 0.28>8.534/39.64=.215. The condition (9) ensuring that T is preferred holds since .061<1/4.178=.24. The range and behavior of the EPBE equilibrium can be explored by varying the precision parameter k, while keeping c and e …xed, as shown in Figure 8. Condition (9) and the other conditions are easily seen to hold. In fact, for k slightly above 0.51, the second order condition that accompanies (5) is violated for the given parameter values. Also, it appears that k cannot approach 0 without violating the constraint that q must be between 0 and 1. In each case, the hybrid EPBE (i.e., excluding the trivial case where v is always 0 and Self always chooses not to trust) is unique. Equilibrium Vengeance 23 Table: EPBE (c=0.5, e=0.05) k 0.1 0.2 0.28 0.5 0.51 v 4.502 2.782 2.178 1.402 1.380 a 0.066 0.106 0.132 0.187 0.189 q 0.347 0.121 0.102 0.078 0.078 x 0.505 0.562 0.594 0.665 0.668 Figure 8: EPBE (c=0.5, e=0.05) The example suggests that, as the precision of Other’s signal, measured by k, increases, the equilibrium value of the vengeance ’parameter’ declines monotonically. This tends to increase the equilibrium signal error rate, a, but not too rapidly. The equilibrium rate at which Other attends to the signal, q, is quite small, and is also monotonically decreasing as the precision of the signal, k, increases in this example. Finally, the equilibrium fraction of vengeful Selfs, x, increases monotonically in the precision parameter. 6. Discussion At least three important issues remain. First, is the interesting EPBE stable? Preliminary work suggests that it is not unstable, but has the same issues of neutral stability as do unique mixed Nash equilibria in simple 2x2 bimatrix games. More speci…cally, our current conjecture is that most reasonable evolutionary dynamics for our model will, for a wide range of the basic parameters k (the signal precision parameter), e (the tremble or behavioral noise amplitude), and c (the marginal punishment cost) have two EPBE. Viz, ² The trivial (and ine¢cient) EPBE will have an open basin of attraction and so will be the long-run fate of situations where Equilibrium Vengeance 24 most Selfs initially are not very vengeful and most Others try to play D, at least when they get the s = 0 signal. ² The hybrid EPBE (which reaps a major portion of the potential mutual gains) will have a one dimensional stable manifold and a two dimensional center manifold with a large basin of attraction. A second issue springs from the trivial EPBE: how can one get a critical mass to escape from the trivial EPBE? Or more simply, in the context of the simple game in Figure 2, how can one get v > c starting from v = 0? In another paper we suggest a possible answer to this threshold problem. Subthreshold v < c is not adaptive in a large population, but in small groups one can show that it works together with the discount factor ± to increase …tness. Thus positive values of v could get started in smaller groups and eventually become advantageous in larger groups. Third, one should consider endogenizing the other parameters. Keeping punishment technology c constant (or doing comparative exercises) seems to make sense. The tremble rate parameter e trades o¤ trivially against the endogenous probability q that Other attends to the signal, as can be seen from equations (7) and (4). However, there is every reason to take seriously the evolution of the signal technology. A mutant Self with true vengeance parameter v = 0 who could sometimes mimic the vengeful type would receive a major …tness boost. We refer to this in another paper as the Viceroy problem, a reference to Monarch butter‡ies who correspond to vengeful types and their mimics known as Viceroys. The other paper o¤ers an elaborate solution to this problem involving interactions within and across small groups. In the present paper we focused on two ideas, each of which we believe has widespread applicability independent of the other. Emotional state dependent utility components (ESDUCs) o¤er a tractable and ‡exible modelling approach to other-regarding preferences that is capable of dealing with many of the leading issues in behavioral economics. In particular, the vengeful components emphasised in the present paper may be the key ingredient in models giving new insights Equilibrium Vengeance 25 into ”irrational” con‡icts ranging from employment relations to international struggles. Friendly components may enter models providing insight into behavior within the family, teams, charitable giving, etc. Our emphasis has been a discipline on such other regarding components: they must directly or indirectly bring evolutionary …tness. This theoretical discipline, together with the empirical discipline already favored by behavioral economists, should help to sharpen behavioral models. The second idea is evolutionary perfect Bayesian equilibrium (EPBE). We wrote a general verbal de…nition and worked it out explicitly for a particular (and not especially simple) game of incomplete information. We believe that EPBE is an appropriate characterization of long run behavior when there are multiple ”types” and some opportunity for entry, exit and/or switching among types. Many games of incomplete information could be reconsidered in this light. References JJS01 Jacobsen HJ, Jensen M and Sloth B, ”Evolutionary Learning in Signalling Games,” Games and Economic Behavior 34:1 (January 2001) pp34-63.
© Copyright 2026 Paperzz