Continuous Mixed Strategy Equilibria in Static and Dynamic Tournaments Vladimir Petkov VUW October 2013 Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 1 / 25 Introduction In this project I study several static and dynamic games that do not have Nash equilibria in pure strategies. I characterize the symmetric equilibria in which the players use continuous mixed strategies. The main novelty in this paper is the analysis of a setting with state dynamics. I derive a closed-form solution for the Markovian mixed equilibrium strategies. It enables us to make predictions about players’behavior in the short run and in the long run. This analysis can be used to study the bonus structure of salesmen within a company. Another application: to construct prizes for sport events. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 2 / 25 Literature Review Probably the most famous game with a continuous mixed strategy equilibrium is Varian’s model of sales. Two …rms play a Bertrand game. Firms sell to both captive customers (who do not migrate) and opportunistic customers (who buy at the lowest price). There are no pure strategy equilibria: each producer wants to undercut the opponent’s price in order to take advantage of the opportunistic buyers; however, charging low prices will cause …rms to forfeit pro…ts from the captive customers. There is little work on mixed strategies in dynamic games. I am aware of two exceptions (neither exhibits state dynamics). War of attrition games: waiting is costly, but the player who waits the longest gets the prize. Preemption games: if one player chooses to act he gets the prize, but if multiple players act the prize gets destroyed. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 3 / 25 Setup of the Stage Game First we look at a one-shot game of e¤ort choice. There are two players: 1 and 2. They choose their e¤ort levels, x1 and x2 , simultaneously. Each of them is required to exert e¤ort at least a > 0. If xi > x j , player i wins. Both players get positive prizes! i gets the …rst prize, αP. j gets the second prize, (1 Assume that 1 > α > 0.5. α) P. The prize fund is increasing in the joint e¤ort of the players: P = β ( x1 + x2 ), where 2 > β > 0. To recap, if xi > x j , then the payo¤s of i and j are αβ( x1 + x2 ) Vladimir Petkov (VUW) xi , (1 α ) β ( x1 + x2 ) Continuous Mixed Strategy Equilibria xj. October 2013 4 / 25 No Nash Equilibrium in Pure Strategies This game does not have an equilibrium in pure strategies. If j exerts the minimum e¤ort x j = a, player i will want to work slightly harder than j (i.e. choose xi = a + ε). Obviously, j will then trump i’s e¤ort by choosing x j = xi + ε, etc. But if i’s e¤ort is above some critical y, j may settle for the 2nd prize. if xi > y, j would be better o¤ free-riding on i’s e¤ort. he will choose the minimum e¤ort a, and get (1 α) β( xi + a) The critical value y solves αβ(2y) y= If 1 + β y = (1 α) β(y + a) a. a, or [1 β + αβ] a . 1 + β 3αβ 3αβ > 0, then we get a < y, so we will have a “cycle”. If this condition is violated, both players will exert in…nite e¤ort. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 5 / 25 No NE in Pure Strategies (Continued) For example, suppose that a = 5, α = 0.6, β = 1.1. Note that 1 + β 3αβ = 0.12 > 0. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 6 / 25 Symmetric Mixed Strategy Equilibrium of the Stage Game Assume that 1 + β 3αβ > 0 and focus on the symmetric mixed strategy Nash equilibrium of this game. Both players randomize according to a cdf F ( x ) with support [l, h]. If j uses this strategy and i chooses xi , i’s expected payo¤ will be ui = Zxi βα( xi + x j )dF ( x j ) + Zh β (1 α)( xi + x j )dF ( x j ) xi . xi l To randomize, i must be indi¤erent between any xi in [l, h]. Thus, F ( x ) must be such that ∂ui /∂xi = 0 for any xi ! Di¤erentiating w.r.t. xi and using the Leibnitz rule yields F ( xi ) + 2xi F 0 ( xi ) where R= Vladimir Petkov (VUW) 1 R = 0, β (1 α ) > 1. β(2α 1) Continuous Mixed Strategy Equilibria October 2013 7 / 25 Solving the Di¤erential Equation p The solution to this di¤erential equation is F ( xi ) = R + C/ xi , where C is the constant of integration. We also need to determine the support of F. p The constant C must satisfy F (l ) = R + C/ l = 0. p Solving for C yields C = R l. The upper bound of the support satis…es F (h) = R(1 Solving for h gives us h = l R 2 R 1 . So the cdf F of each player’s strategy is F ( x ) = R 1 The corresponding pdf is f ( x ) = Vladimir Petkov (VUW) F0 (x) = p Continuous Mixed Strategy Equilibria lRx 2 p p l/ h) = 1. p ! l p . x 1.5 . October 2013 8 / 25 Solving the Di¤erential Equation (Continued) l ( RR 1 ) The expected e¤ort of player i is x̄i = Z 2 xi p lRxi 2 1.5 dxi = l R R 1 . l If j sticks to this strategy, i will be indi¤erent between any xi in [l, h]. To get i’s expected payo¤, we can set xi = l and substitute this in ui : Zh β (1 α)(l + x j )dF ( x j ) l = β (1 α)l l R R 1 l [1 β (1 α)]. But if i chooses a and a < l, he will get a higher expected payo¤: Zh β (1 α)( a + x j )dF ( x j ) a = β (1 α)l l R R 1 a [1 β (1 α)]. October 2013 9 / 25 Therefore, it must be the case that l = a! Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria Equilibrium Mixed Strategy and Payo¤s To recap, the equilibrium mixed strategy cdf is p a F(x) = R 1 p , x where R = 1 β (1 α ) . β(2α 1) The strategy support is [ a, a Each player’s expected e¤ort R 2 R 1 ]. is a RR 1 Each gets an expected payo¤ β(1 . α) a R R 1 a [1 β (1 α)]. Note that if β = 1, we have a zero-sum game. In a symmetric equilibrium, each player must expect a zero payo¤. Indeed, when β = 1 we get ui = u j = 0. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 10 / 25 Illustration Let a = 5, α = 0.6, β = 1.1. Expected e¤ort is x̄ = 8.24. The pdf is shown below. x Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 11 / 25 Illustration (Continued) Game value for β = 1.1. Game value for α = 0.6. The …gures illustrate how the game value changes with α and β. If β < 1, the game value is negative and decreasing in α. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 12 / 25 Dynamic Game Setting Next I consider a dynamic version of this game. In each period, the players play the stage game described above. Their period-t objective is to maximize the discounted sum of expected instantaneous payo¤s U1t = ∞ ∑ δ(τ t) Et (u1τ ), U2t = ∞ ∑ δ(τ t) Et (u2τ ). τ =t τ =t The state variable is the minimum e¤ort level at (“reputation”). Assume that it evolves according to at+1 = µ + η ( xit + x tj ). Again, using arguments similar to those in the one-shot game, we can show that a pure strategy equilibrium does not exist. Focus on the symmetric equilibrium in Markovian mixed strategies the cdf is conditional on the state: F t = F ( j a t ). Note that such an equilibrium will be subgame-perfect. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 13 / 25 Analysis of the Dynamic Game Now players have two types of incentive to exert e¤ort: static (to win the current stage game) and dynamic (to a¤ect future e¤ort). Let Ui ( at+1 ) be the continuation value of the game for player i. That is, Ui ( at+1 ) is sum of i’s expected discounted payo¤s from t + 1 onward, where the expectation is taken at the beginning of period t + 1 (i.e., once at+1 has been realized). We conjecture that Ui ( at+1 ) is linear in at+1 : Ui ( at+1 ) = r + sat+1 . Therefore, i’s expected lifetime payo¤ at time t is t Zxi lt βα( xit + x tj )dF ( x tj j at ) + Zh xit t β (1 α)( xit + x tj )dF ( x tj j at ) xit +δEt (Ui ( at+1 )) where Et (Ui ( at+1 )) = Vladimir Petkov (VUW) Zh t [r + s(µ + η ( xit + x tj )]dF ( x tj j at ). lt Continuous Mixed Strategy Equilibria October 2013 14 / 25 Analysis of the Dynamic Game (Continued) Again, i must be indi¤erent between any xit in [l t , ht ]. Therefore, F ( j at ) must be such that the derivative of the expression for the expected lifetime payo¤ is 0 for any xit . Di¤erentiation w.r.t. xit yields the following di¤erential equation: F ( x t j at ) + 2xit F 0 ( x t j at ) R̃ = 0, where R̃ is now given by R̃ = 1 β(1 α) δsη . β(2α 1) Using similar arguments to those in the static game, it can be shown 2 that the period-t strategy support is [ at , at R̃R̃ 1 ]. As before, solving the di¤erential equation yields p ! at F ( x t j at ) = R̃ 1 p . xt Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 15 / 25 Analysis of the Dynamic Game (Continued) Now we pin down the coe¢ cients r, s of the continuation payo¤. We can de…ne Ui ( a) recursively. Let ht = at R̃ R̃ 1 r + sat = Zxi 2 . For any xit in [ at , ht ], we have t at βα( xit + x tj )dF ( x tj j at ) + Zh Zh t β (1 xit α)( xit + x tj )dF ( x tj j at ) t xit + δ [r + s(µ + η ( xit + x tj )]dF ( x tj j at ). at Setting xit = at gives us r + sat = β(1 α) at + at R̃/( R̃ 1) +δ r + s µ + η at + at R̃/( R̃ Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria at 1) . October 2013 16 / 25 Analysis of the Dynamic Game (Continued) The method of undetermined coe¢ cients yields equations for r and s. The parameter s must satisfy s = β (1 α) 1 + R̃ R̃ 1 + δsη 1 + 1 R̃ R̃ . 1 De…ne A = 1 β(1 α), B = β(2α 1), D = β(1 α). Also, let Z be the discriminant of the quadratic equation for s: Z = [A 4δη (2δη B + δη (2D 2A + B 1)]2 1)[ A D (2A B)]. B There are 2 solutions for s, but we focus on this one: s= Vladimir Petkov (VUW) [A B + δη (2D 2A + B 2δη ( 1 + 2δη ) Continuous Mixed Strategy Equilibria 1)] + p Z . October 2013 17 / 25 Analysis of the Dynamic Game (Continued) Also, the coe¢ cient r must solve r = δr + sµ. Therefore, r= δsµ . 1 δ Again, this is a zero-sum game when β = 1, so we must get Ui ( at ) = 0. With the above expression for s, we do obtain r = s = 0. The other solution of the equation for s does not satisfy this property. A necessary condition for the existence a mixed strategy equilibrium is Z > 0. This condition is satis…ed if δ and β are not too big. Otherwise, players will want to exert in…nite e¤ort. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 18 / 25 The Markovian Mixed Strategy Equilibrium To recap, the Markovian symmetric mixed strategy is characterized by p ! 1 β(1 α) δsη at t t F(x ja ) = 1 p , β(2α 1) xt The support of the strategy is [ at , at 1 β(1 α) δsη 1 β(1 α) δsη β(2α 1) 2 ]. Each player’s expected e¤ort is x̄ t = at 1 1 β (1 β (1 α ) α) δsη δsη β(2α . 1) Given at , the expected value of the game is t U (a ) = [A B + δη (2D 2A + B 2δη ( 1 + 2δη ) 1)] + p Z δµ 1 δ + at . The variables r, s, A, B, D, Z are as de…ned above. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 19 / 25 Long-Run Forecasting Suppose that we want to make a forecast about the e¤ort levels and the value of the game many periods from now. Such a forecast should be independent of the current state, and so can be used as a criterion to compare di¤erent games. The period-t expected value of the state in period t + τ, at+τ , is Et a t+τ = 2η R̃ R̃ 1 Suppose that 2η R̃/( R̃ τ 1 t a + ∑µ ζ =0 2η R̃ R̃ 1 ζ . 1) < 1. Then we have lim τ !∞ 2η R̃ R̃ 1 Thus, for τ ! ∞, we get Et at+τ = ā Vladimir Petkov (VUW) τ τ at = 0. µ( R̃ 1) . R̃(1 2η ) 1 Continuous Mixed Strategy Equilibria October 2013 20 / 25 Long-Run Forecasting (Continued) In the long run, a player’s e¤ort will be somewhere in the interval µ 1 µ( R̃ 1)2 2η ( R̃ 1)2 2η R̃2 , The long-run forecast for each player’s e¤ort is µ( R̃ 1) R̃(1 2η ) 1 1 1 β (1 β (1 α ) α) δsη δsη β(2α 1) . The long-term forecast for the value of this game to a player is p Z [ A B + δη (2D 2A + B 1)] δµ µ( R̃ 1) + 2δη ( 1 + 2δη ) 1 δ R̃(1 2η ) 1 . Note that when δ ! 0, our long-term forecasts converge to the predictions for a static game in which a = ā. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 21 / 25 Numerical Example: Dynamic Incentives We set parameter values η = 0.2, µ = 5, δ = 0.9, β = 1.1. The long-term forecast for the value of the game is increasing in α. If β < 1, the game value is negative and decreasing in α. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 22 / 25 Extension: The Winner Determines the State Suppose that the winner of the stage game determines the state: at+1 = µ + η maxf x1t , x2t g. Conjecture that Uit+1 = r + sat+1 . Mr. i’s expected lifetime payo¤ is t Zxi lt βα( xit + x tj )dF ( x tj j at ) + Zh t α)( xit + x tj )dF ( x tj j at ) β (1 xit xit Z + δ [r + s ( µ lt + ηxit )]dF ( x tj j at ) + δ Zh xit t [r + s(µ + ηx tj )]dF ( x tj j at ). Di¤erentiation yields F ( x t j at ) + Gx t F 0 ( x t j at ) G= Vladimir Petkov (VUW) 2β(2α 1) , β(2α 1) + δsη R̂ = xit R̂ = 0, where 1 β (1 α ) . β(2α 1) + δsη Continuous Mixed Strategy Equilibria October 2013 23 / 25 Extension (Continued) The solution to this di¤erential equation is t t F ( x j a ) = R̂ 1 Now the strategy support is [ at , at 1 G at xt ! . G R ]. R 1 The expected e¤ort is x̄it = R̂at ( G 1) R̂G 1 ( R̂ 1)G 1 1 . Next we determine the coe¢ cient’s r, s of the continuation payo¤. Again, we can apply the method of undetermined coe¢ cients. Vladimir Petkov (VUW) Continuous Mixed Strategy Equilibria October 2013 24 / 25 Extension (Continued) Setting x t = at allows us to write the following recursive equation: r + sat = R̂G 1 ( R̂ 1)G β(1 α) R̂at ( G 1) 1 1 a t [1 R̂at ( G 1) R̂G 1 ( R̂ 1)G 1 [ β (1 α) + δsη ] +δ r + s µ + η 1 β (1 α)] 1 . Thus, s must satisfy s= β (1 (G α) R̂ 1) R̂G 1 ( R̂ 1)G 1 [1 β (1 α)]. The equation for s can be solved numerically. Note that s appears in R̂ and G. Also, r is given by r= Vladimir Petkov (VUW) δsµ . 1 δ Continuous Mixed Strategy Equilibria October 2013 25 / 25
© Copyright 2026 Paperzz