Disarmament Games Yuan Deng Vincent Conitzer Duke University Disarmament Game • The disarmament game is • not equivalent to the security game • not an extension of the security game • The disarmament game • captures a different domain of security issues • is highly related to commitments Security Game Leader-follower game • The protector commits to a strategy first, • then the attacker attacks Security Game Leader-follower game? • One way communication Security Game • Two way communication? Hostage Game • Action of Police: •R • PF • PD : : : Call the special team and Rescue directly Pay the ransom and Follow Pay the ransom and Don’t Follow • Action of Terrorist: • GR • KR • ST : : : Get the ransom and Run Kill the hostage and Run Stay Hostage Game R PF PD GR KR ST Hostage rescued Terrorist caught Hostage killed Terrorist caught Hostage killed Terrorist caught Hostage rescued Hostage killed Terrorist caught Terrorist got away Ransom payed Hostage rescued Hostage killed Terrorist got away Terrorist got away Ransom payed Nothing happened Nothing happened Hostage Game • Police: • • • • Hostage killed Terrorist caught Ransom payed Nothing happened : : : : -10 10 -6 -0.5 : : : : : 1 -5 2 6 -0.5 • Terrorist: • • • • • Hostage killed Terrorist caught Terrorist ran away Ransom payed Nothing happened (additive valuation; unspecified valuations are simply 0) Hostage Game GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Hostage Game GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Nash equilibria Stackelberg equilibria (no matter who takes the lead) Hostage Game GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Desired Outcome Pareto better than the equilibrium outcome Multiple-round (pure) commitments Requirement: • Once committing not to play certain strategies, the player cannot play those strategies anymore. • In each round • commit to a subset of available strategies to play • commit not to play certain strategies Multiple-round (pure) commitments GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Multiple-round (pure) commitments GR ST R 10, -5 0, -4 PF 4, 1 -0.5, -0.5 PD -6, 8 -0.5, -0.5 Multiple-round (pure) commitments GR ST R 10, -5 0, -4 PF 4, 1 -0.5, -0.5 PD -6, 8 -0.5, -0.5 Incentivize Police to commit in the next round Multiple-round (pure) commitments GR ST R 10, -5 0, -4 PF 4, 1 -0.5, -0.5 PD -6, 8 -0.5, -0.5 Multiple-round (pure) commitments GR ST PF 4, 1 -0.5, -0.5 PD -6, 8 -0.5, -0.5 Multiple-round (pure) commitments GR ST PF 4, 1 -0.5, -0.5 PD -6, 8 -0.5, -0.5 Multiple-round (pure) commitments GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Multiple-round (pure) commitments GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Fact: The desired outcome can also be achieved if Police commits not to play PD in the first round… Zooming out • Consider two players who can • communicate to each other • make credible commitments • Given a target outcome, can this outcome be obtained by a multipleround commitments? Why Disarmament? Commit not to play Remove (Disarm) some strategies • Consider two countries • communicate to each other • make credible commitments Relation to Commitment • One-round commitment to play (Stackelberg equilibrium) • One-round commitment not to play • less powerful than one round commitment to play • (if assuming players can commit to mixed strategies) Our model: • Multiple-round commitments not to play • in some cases, more powerful than one round commitment to play • e.g. Hostage Game Perspective from extensive-form game The disarmament games can be formulated as an extensive-form game • [Disarmament Stage]: sequential & deterministic • At each decision node, one player can choose some strategies to remove If one player is not willing to remove any strategy • [Game Stage]: simultaneous normal-form game • play the remaining game to get their utility players can play mixed strategies Computational Problem • Disarmament game is an extensive-form game • Compute SPNE? By backward induction? • In Game Stage: • Complexity to compute NE (PPAD-complete) • Equilibrium Selection • Change of the structure of NE by slightly modifications Computational Problem • Decision problem: • Can a specific outcome be obtained as an NE of the disarmament game? Computational Problem • Decision problem: • Can a specific outcome be obtained as an NE of the disarmament game? • If we can construct such an equilibrium, we obtain an automated way to suggest a disarmament scheme: • Given a disarmament objective • Compute the Nash equilibrium fully characterized by disarmament steps R PF PD R PF PD PF PD GR 4, 1 -6, 8 GR 10, -5 ST 0, -4 4, 1 -6, 8 -0.5, -0.5 -0.5, -0.5 ST -0.5, -0.5 -0.5, -0.5 GR 10, -5 4, 1 -6, 8 KR 0, -4 -10, 3 -10, 3 ST 0, -4 -0.5, -0.5 -0.5, -0.5 … … Computational Problem • Decision problem: • Can a specific outcome be obtained as an NE of the disarmament game? • If we can construct such an equilibrium, we obtain an automated way to suggest a disarmament scheme: • • • • Given a disarmament objective Compute the Nash equilibrium fully characterized by disarmament steps Inform both players all disarmament steps Follow the steps without deviation (No coordination is needed!) Computational Problem • Decision problem: • Can a specific outcome be obtained as an NE of the disarmament game? # Rounds 𝑵 × 𝑲 (constant) 𝑵 × 𝑵 Constant >= 3 P NP-complete Unlimited P NP-complete Hostage Game Review GR KR ST R 10, -5 0, -4 0, -4 PF 4, 1 -10, 3 -0.5, -0.5 PD -6, 8 -10, 3 -0.5, -0.5 Hostage Game Review GR KR R 10, -5 0, -4 PF 4, 1 -10, 3 Prisoner’s dilemma Cooperate Defect Cooperate (3,3) (4,0) Defect (0,4) (1,1) Defect Dominates Cooperate Prisoner’s dilemma Cooperate Defect Cooperate (3,3) (4,0) Defect (0,4) (1,1) Defect Dominates Cooperate Mixed Disarmament Example : Prisoner’s Dilemma 1−𝑥 𝑥 Cooperate Defect 1−𝑦 𝑦 Cooperate (3,3) (4,0) Defect (0,4) (1,1) • 𝑥, 𝑦 : maximum probability that can be placed on strategy Defect • Defect is still a dominant strategy => place 𝑥 𝑜𝑟 𝑦 probability on Defect • Utility : row player u𝑟 𝑥, 𝑦 = 3 1 − 𝑥 1 − 𝑦 + 4𝑥 1 − 𝑦 + 𝑥𝑦 column player u𝑐 𝑥, 𝑦 = 3 1 − 𝑥 1 − 𝑦 + 4 1 − 𝑥 𝑦 + 𝑥𝑦 Example : Prisoner’s Dilemma 1−𝑥 𝑥 Cooperate Defect 1−𝑦 𝑦 Cooperate Defect (3,3) (4,0) (0,4) (1,1) • u𝑟 𝑥, 𝑦 = 3 1 − 𝑥 1 − 𝑦 + 4𝑥 1 − 𝑦 + 𝑥𝑦 ≤ 3 • u𝑐 𝑥, 𝑦 = 3 1 − 𝑥 1 − 𝑦 + 4 1 − 𝑥 𝑦 + 𝑥𝑦 ≤ 3 u𝑟 𝑥, 𝑦 ≤ 3 u𝑐 𝑥, 𝑦 ≤ 3 𝑦 𝑥 Cooperate Defect (x) Cooperate (3,3) (12,0) Defect (y) (0,12) (1,1) u𝑟 𝑥, 𝑦 ≤ 4 u𝑐 𝑥, 𝑦 ≤ 4 𝑦 𝑥 u𝑟 = 𝑢𝑐 = 4 Folk Theorem • In repeated games with infinite rounds, any desired utilities can be obtained as an equilibrium. • Desired: (1) feasible: can be obtained by a mixed strategy profile (2) enforceable: larger than the utilities they can guarantee themselves no matter how the other player plays Cooperate Defect Cooperate Defect (3,3) (4,0) (0,4) (1,1) Folk Theorem • In repeated games with infinite rounds, any desired utilities can be obtained as an equilibrium. • Desired: (1) feasible: can be obtained by a mixed strategy profile (2) enforceable: larger than the utilities they can guarantee themselves no matter how the other player plays • In mixed disarmament games, for any desired utilities, there exists an equilibrium achieving at least these utilities. Folk Theorem without repetition! Folk Theorem (# of steps) • In mixed disarmament games, for any desired utilities, there exists an equilibrium achieving at least these utilities • For any desired utilities, we can construct an 𝑛𝜖-(additive) approximate Nash equilibrium with almost the same outcome (differed by at most 𝑛𝜖) and 𝑂(1/𝜖) 𝑦disarmament steps. 𝑛 is the number of strategies 𝑥 Back to Disarmament • Based on Folk Theorem • Two countries only need to negotiate about outcome utilities, that are, • feasible • enforceable • Fair? Social optimal?... • No worry about how to achieve it Conclusion & Future works • Summary • Formulate disarmament games • Investigate computational complexity for the pure case • Show a folk theorem for the mixed case • Future work • Game representation other than normal-form games (e.g. Bayesian game) • Restricted class of games • Applications?
© Copyright 2025 Paperzz