Quantal-Response Equilibrium Models
of the Ultimatum Bargaining Game*
Kang-Oh Yi
November 16, 2001
Department of Economics
Hong Kong University of Science and Technology
Clear Water Bay, Kowloon, Hong Kong
Tel: (852) 2358-7619, Fax: (852) 2358-2084
e-mail: [email protected]
Abstract
This paper investigates the implications of normal- and extensive-form quantal
response equilibrium (QRE) models (McKelvey and Palfrey, 1995, Games and
Economic Behavior; 1998, Experimental Economics) in the ultimatum bargaining
game, assuming that players maximize expected monetary payoffs. It is shown that
normal-form QRE can select a non-sequential equilibrium, and that the selection
depends crucially on the noise structure. The normal-form QRE describes the
main qualitative features of experimental subjects' behavior better than
extensive-form QRE even in experiments with extensive-form games.
Journal of Economic Literature Classification Number: C79, C92
Key Words: Quantal Response Equilibrium, Ultimatum Bargaining Game, Weakly
Dominated Equilibrium.
* I am deeply grateful to Vincent Crawford and Joel Sobel for their advice and encouragement.
PDF created with FinePrint pdfFactory trial version http://www.fineprint.com
1 Introduction
McKelvey and Palfrey's (1995) notion of quantal response equilibrium or "QRE"
has recently attracted a great deal of attention. In a QRE, players do not always
choose their best responses. Instead their strategy choices are noisy, and strategies
with higher expected payoffs are chosen with higher probabilities, with players
taking the noise in each other's strategies rationally into account in equilibrium.
In applications of QRE, the noise in players' strategy choices follows a specific
distribution, which allows the degree of noisiness to be represented by as few as
one parameter. The distribution most often used is the logit, and a QRE with
a logit response function is called a logit equilibrium. In some applications the
noise parameter is estimated and the resulting logit equilibrium is compared with
subjects' observed choices period by period. In others, a limiting logit equilibrium,
the limit of logit equilibrium as the noise approaches zero, which is usually an
equilibrium in the game without noise, is compared with limiting behavior in the
experiment.
McKelvey and Palfrey's original notion of QRE is a normal-form concept, and
McKelvey and Palfrey (1995), Anderson et al. (1998, 2001), and others have shown
that the normal-form QRE is surprisingly successful in describing the quantitative
as well as qualitative patterns of deviation from equilibrium observed in a variety
of normal-form game experiments.
The normal-form QRE gives an identical prediction for an extensive- and
normal-form representation of the same game. However, experimental subjects
are often sensitive to how a game is presented, and the normal-form QRE has had
less success describing experimental results for some games presented in extensive
form, as in Schotter et al.'s (1994) experiment where the subjects' choice behavior
is systematically different in different representations of a game.
In response to these difficulties, McKelvey and Palfrey (1998) recently extended
their notion of QRE to extensive-form games, proposing a notion called
agent QRE or "AQRE." An AQRE is defined like a QRE, but for the agent normal
form of an extensive-form game, in which different information sets of a given
player are assumed to be played independently by different agents, but all of a
given player's agents share the same payoff function. Because each agent's noise
is assumed to be independent, for any game with a non-trivial extensive form, an
AQRE differs from a normal-form QRE, where the noise terms for the agents of a
given player are in effect assumed to be perfectly correlated.1
In agent normal form, as far as the agents of a subgame are concerned, the
solution of the subgame agrees with the solution of the whole game. This property
brings AQRE much closer to sequential equilibrium in general, and McKelvey and
Palfrey (1998) showed that a limiting AQRE is in fact a sequential equilibrium.
They then used logit-AQRE, AQRE with a logit response function, to analyze
Schotter et al.'s (1994) experimental results, where subjects played a game with
two Nash equilibria, one trembling-hand perfect and one weakly dominated
equilibrium, in different representations that differ only with respect to inessential
transformations of the extensive form. In one representation, every information
set is a singleton and the weakly dominated equilibrium is not a sequential
equilibrium. In the others, subjects made choices simultaneously and both equilibria
are sequential equilibria. In Schotter et al.'s experiment, the subjects played
mostly the trembling-hand perfect equilibrium strategies but played their dominated
equilibrium strategies with significant probabilities in the simultaneous-move
game. McKelvey and Palfrey (1998) showed that logit-AQRE describes subjects'
choice behavior better in the game where every information set is a singleton,
while normal-form logit equilibrium yields a better prediction in the
simultaneous-move game. They compared QRE with a "noisy Nash model," in which each
player's strategy choice is a convex combination of his Nash equilibrium strategy
and random play, and showed that QRE outperforms the noisy Nash model by a large
margin.

1 In imperfect-information extensive-form games where every player on the move has
a single information set, AQRE and normal-form QRE are identical.
McKelvey and Palfrey (1998) also applied logit-AQRE to various extensive-form
game experiments on signaling games and centipede games, and AQRE has
had considerable success describing subjects' choice behavior. These results suggest
that AQRE is a promising way to describe subjects' responses to extensive-form
games. But, to my knowledge, no one has yet considered the implications of
AQRE or normal-form QRE in ultimatum bargaining games, even though they
are perhaps the extensive-form games most often studied in experiments.2
This paper considers whether the notions of QRE can help to explain behavior in
ultimatum bargaining games by characterizing logit-AQRE and normal-form logit
equilibrium in discrete and continuous ultimatum bargaining games whose players
maximize expected monetary payoffs.
In an ultimatum bargaining game, one player, called the Proposer, makes an
all-or-nothing offer, which the other, the Responder, can either accept or reject.
When players maximize expected monetary payoffs, in any Nash equilibrium, all
offers made in equilibrium must be accepted. In any sequential equilibrium, the
Proposer offers 0 to the Responder (or, in the discrete case, the Proposer offers
either 0 or the minimum positive proposal) and the Responder accepts.
This prediction is chronically violated in experiments. Most offers are
concentrated between 30% and 50%, and smaller positive offers are often rejected
(see Roth, 1995, Chapter 4). The reasons for these violations have been a source
of controversy for more than a decade. Some studies have sought to explain the
experimental results for ultimatum games by assuming that subjects' preferences
("social utilities") depend not only on their own monetary payoffs but also on
others' in various ways, as in the general models proposed by Rabin (1990), Fehr and
Schmidt (1999), and Bolton and Ockenfels (2000), and the econometric models of
subjects' behavior in ultimatum experiments of Costa-Gomes and Zauner (2001).
Other analyses have sought to explain the results without social utility, assuming
expected monetary payoff maximization, but studying adaptive learning dynamics
as in Prasnikar and Roth (1992), evolutionary dynamics as in Gale et al. (1995),
or "limited cognition" as in Johnson et al. (2002).3

2 The game analyzed in McKelvey and Palfrey (1998) can be viewed as a discrete (binary)
version of an ultimatum bargaining game that is studied in Gale et al. (1995). However, it is
not trivial to generalize their results for versions of the ultimatum bargaining game with larger
discrete strategy spaces or with continuous strategy spaces.
This paper takes a different approach, assuming expected monetary payoff
maximization as in the papers just mentioned, but ignoring dynamics, instead using
the normal-form QRE and AQRE as static models of boundedly rational strategic
behavior. Extending results of McKelvey and Palfrey (1995, 1998), the present
analysis gives a complete characterization of both notions of QRE in discrete and
continuous versions of the ultimatum bargaining game. The main result is that
in the discrete versions any offer between 0 and equal split can be supported as
a strict best response in a limiting normal-form logit equilibrium. The limiting
logit-AQRE gives a unique selection of a trembling-hand perfect equilibrium with
the minimum positive offer. In the continuous versions, the limiting normal-form
logit equilibrium and the limiting logit-AQRE coincide with the game's unique
trembling-hand perfect equilibrium.
The key difference between the two notions of QRE, which allows their
implications to differ in the discrete version, is in the noise structure. In an AQRE,
at any information set, the difference in the expected payoffs between accepting
or rejecting the offer is the size of the offer. Therefore, for each agent of the
Responder, it is better to accept a positive offer and the limiting AQRE, as the noise
disappears, is a sequential equilibrium. By contrast, in a normal-form QRE, all
agents of the Responder have the same expected payoffs. If an information set
is reached with a sufficiently small probability, then the agent's strategy choice
barely affects the expected payoff and the agent should be almost indifferent
between accepting and rejecting such offers. As the Proposer concentrates
the choice probability on a certain offer, the Responder often rejects positive offers
that are rarely played, and the positive offer can be the Proposer's best response
even without incredible threats. However, in the continuous case, the Proposer's
choice is noisy and the Proposer plays offers close to the optimal offer as often as
the optimal offer. As the noise vanishes, the Responder accepts those offers with
higher probability, and the Proposer has no incentive to make a positive offer in a
limiting QRE.

3 In QRE, rational players are subject to mistakes in making choices. Prasnikar and Roth
(1992) and Johnson et al. (2002) studied different aspects of bounded rationality: players'
game-theoretic reasoning skills are limited, or they do not know how the others behave. Although
their analyses emphasize learning, they acknowledged that social preferences play a significant
role in their bargaining experiments.
The rest of the paper is organized as follows. Section 2 introduces the
ultimatum bargaining game and the notions of QRE, AQRE, and their limiting
logit counterparts. Sections 3 and 4 characterize the QREs of extensive-form and
normal-form ultimatum bargaining games. Section 5 concludes. Proofs omitted
from the text are in the appendix.
2 QRE of the Ultimatum Bargaining Game
Since the ultimatum bargaining game is a two-person perfect-information game whose
players move only once, both game forms can be described using the same notation
without explicitly considering information sets. Let $i$ denote a player, $i \in \{p, r\}$,
where $p$ and $r$ identify the Proposer and the Responder, respectively. The Proposer
chooses a strategy $s_p \in S_p$. When $S_p$ is discrete, $S_p = \{0, \frac{1}{n}, \ldots, 1\}$ for a finite
integer $n \geq 1$, and $S_p = [0, 1]$ when $S_p$ is continuous.4 In the following, I use $(s_p, s_p']$
to denote $\{s_p + \frac{1}{n}, \ldots, s_p'\}$ in discrete games. This slight abuse of notation allows
me to write the set of offers less than $\frac{1}{2}$ with $[0, \frac{1}{2})$ no matter whether $n$ is even or
odd. The Responder's strategy, $s_r \in S_r$, is a function that maps each possible offer
to $\{\mathrm{Accept}, \mathrm{Reject}\}$ with $s_r(s_p) \in \{\mathrm{Accept}, \mathrm{Reject}\}$. Let $\sigma_i$ be the probability
distribution over $S_i$ and $\sigma_i(s_i)$ denote the probability of $s_i$ being played. Each player
is assumed to have risk-neutral preferences which depend only on one's own
pecuniary payoff. When the offer is accepted, the payoffs are $u_p(s_p, s_r) = 1 - s_p$ and
$u_r(s_p, s_r) = s_p$. If rejected, $u_i(s_p, s_r) = 0$ for all $i$. Given a strategy profile, player
$i$'s expected payoff is
$$\pi_i(s_i, \sigma_{-i}) = \int_{s \in S_{-i}} \sigma_{-i}(s)\, u_i(s_i, s)\, ds.$$
When no confusion will arise, $\pi_i(s_i, \sigma_{-i})$ is denoted by $\pi_i(s_i)$.

4 Throughout this paper, the size of the pie is normalized to 1. This normalization allows me
to interpret an outcome as percentage shares of the pie without substantively affecting the results.
For a given player $i$, the logit response function maps expected payoffs for each
possible pure strategy into a mixed strategy for $i$, and they are denoted by $p_i$ and
$f_i$ when the strategy spaces are discrete and continuous, respectively. $P_i$ and $F_i$
are the cumulative probability distributions associated with $p_i$ and $f_i$, respectively.
Letting $0 \leq \lambda < \infty$ be the measure of the amount of noise, or equivalently, the
degree of rationality, the logit responses are determined by
$$p_i(s_i) = \frac{\exp(\lambda \pi_i(s_i))}{\sum_{s \in S_i} \exp(\lambda \pi_i(s))}
\quad\text{and}\quad
f_i(s_i) = \frac{\exp(\lambda \pi_i(s_i))}{\int_{s \in S_i} \exp(\lambda \pi_i(s))\, ds}. \tag{1}$$
This functional form is called a logit function where the odds are determined
by the exponential transformation of the utility times a given non-negative constant
$\lambda$. The ratio of probabilities of two different strategies, $s_i$ and $s_i'$, is given by
$\exp[\lambda(\pi_i(s_i) - \pi_i(s_i'))]$. As $\lambda \to \infty$, only the choices having the highest expected
payoff can be played with positive probabilities so that the choice behavior becomes
best response; when $\lambda = 0$ all choices have equal probability. One of the features
that distinguishes QRE from some other noise-based equilibrium notions is that
in a limiting logit equilibrium, a weakly dominated strategy can be played with
positive probability. When $\pi_i(s_i, \sigma_{-i}) \geq \pi_i(s_i', \sigma_{-i})$ for all $\lambda$'s, if the strict inequality
holds only for $s_{-i}$ and $\lambda \sigma_{-i}(s_{-i}) \to 0$ as $\lambda$ increases, then $s_i$ and $s_i'$ should be played
with the same probability in a limiting logit equilibrium. Throughout this paper
I assume that $\lambda$ is the same for all players and it is common knowledge.5 A logit
equilibrium for $\lambda$ is defined by a fixed point in these probability distributions with
a given $\lambda$.
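As a rough illustration of the discrete part of Eq.(1), the following Python sketch maps a vector of expected payoffs into logit choice probabilities. The function name `logit_response` and the payoff vector are hypothetical, chosen only for illustration; the sketch is not part of the original analysis.

```python
import math

def logit_response(payoffs, lam):
    """Map expected payoffs to logit choice probabilities (discrete case of Eq. (1))."""
    # Subtracting the largest payoff before exponentiating avoids overflow
    # and leaves the resulting probabilities unchanged.
    top = max(payoffs)
    weights = [math.exp(lam * (u - top)) for u in payoffs]
    total = sum(weights)
    return [w / total for w in weights]

payoffs = [0.5, 0.9, 0.8]                  # hypothetical expected payoffs
uniform = logit_response(payoffs, 0.0)     # lam = 0: all choices equally likely
sharp = logit_response(payoffs, 200.0)     # large lam: close to best response
p_mid = logit_response(payoffs, 5.0)
odds = p_mid[1] / p_mid[2]                 # ratio of two choice probabilities
```

The value of `odds` can be compared with $\exp[\lambda(\pi_i(s_i) - \pi_i(s_i'))]$, the probability ratio stated in the text.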
In games with discrete, finite strategy spaces, McKelvey and Palfrey (1995,
1998) showed the existence of logit equilibrium and its convergence to a Nash equilibrium. Although they did not consider games with continuous strategy spaces,
in the ultimatum bargaining game, these properties are preserved in games with
continuous strategies.
Proposition 1. In any version of the ultimatum bargaining game considered, there
exists a logit equilibrium for every $\lambda \geq 0$ and the limiting QRE is a Nash
equilibrium.
3 Agent Normal-Form Ultimatum Bargaining Game
In an agent normal-form representation of the ultimatum bargaining game, each
information set is played by a different agent, and the agent on the move at an
information set has the same payoffs over terminal nodes as the Responder at the same
information set in the original game. Therefore, A (for Accept) strictly dominates
R (for Reject) at any information set reached by a positive offer, and every agent
of the Responder accepts any positive offer with probability one in a limiting logit
equilibrium. The results for both games with discrete and continuous strategy
spaces are presented for completeness and to discuss some of QRE's interesting
properties.
In an extensive-form game, each agent of the Responder can be identified by
an offer and each agent's choices can be denoted by $A|s_p$ and $R|s_p$. In both discrete
and continuous games, the expected payoffs can be calculated in the same way,
$$\pi_p(s_p) = (1 - s_p)\, p_r(A|s_p), \quad \pi_r(A|s_p) = s_p \quad\text{and}\quad \pi_r(R|s_p) = 0. \tag{2}$$

5 In principle, QRE permits different $\lambda$'s across players, but the common knowledge assumption
is indispensable. In the present analysis, assuming the same $\lambda$ simplifies notation and the proofs
can be directly extended to the case of heterogeneous $\lambda$'s by choosing $\lambda = \max \lambda$'s.
Proposition 2. In ultimatum bargaining games, the limiting logit-AQREs are:
1) when $n = 1$, $p_p(0) = 1$, $p_r(A|0) = \frac{1}{2}$, and $p_r(A|1) = 1$.
2) when $n = 2$, $p_p(0) = p_p(\frac{1}{2}) = \frac{1}{2}$, $p_r(A|0) = \frac{1}{2}$, and $p_r(A|\frac{1}{2}) = p_r(A|1) = 1$.
3) when $n \geq 3$, $p_p(\frac{1}{n}) = 1$, $p_r(A|0) = \frac{1}{2}$, and $p_r(A|s_p) = 1$ for all $s_p > 0$.
4) when the strategy set is continuous, $f_p$ converges to a point-mass at 0,
$p_r(A|0) = p_r(R|0) = \frac{1}{2}$, and $p_r(A|s_p) = 1$ for all $s_p > 0$.
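The discrete cases of Proposition 2 can be explored numerically. Because each agent of the Responder compares a payoff of $s_p$ with 0, the acceptance probability does not depend on the Proposer's strategy, so the logit-AQRE can be computed directly from Eqs.(1) and (2) without a fixed-point search. The sketch below is only an illustration under that reading; the function name `logit_aqre` and the chosen values of $n$ and $\lambda$ are hypothetical.

```python
import math

def logit_aqre(n, lam):
    """Logit-AQRE of the discrete game with offers 0, 1/n, ..., 1."""
    offers = [k / n for k in range(n + 1)]
    # Each Responder agent compares s (accept) with 0 (reject),
    # so acceptance is logistic in lam * s.
    accept = [1.0 / (1.0 + math.exp(-lam * s)) for s in offers]
    # Proposer's expected payoff from Eq. (2).
    payoff = [(1.0 - s) * a for s, a in zip(offers, accept)]
    # Proposer's logit response from Eq. (1), shifted for numerical stability.
    top = max(payoff)
    weights = [math.exp(lam * (u - top)) for u in payoff]
    total = sum(weights)
    propose = [w / total for w in weights]
    return offers, propose, accept

offers, propose, accept = logit_aqre(10, 200.0)   # n >= 3: mass moves to offer 1/n
_, propose_n2, _ = logit_aqre(2, 100.0)           # n = 2: offers 0 and 1/2 share the mass
```

For large $\lambda$ the output for $n = 10$ places almost all of the Proposer's probability on $\frac{1}{10}$, while for $n = 2$ the offers 0 and $\frac{1}{2}$ are played about equally often, in line with cases 3) and 2).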
In the case of discrete strategy spaces, Proposition 2 can be generalized to a game with
an arbitrary set of offers, $S_p = \{s_p^0, s_p^1, \ldots, s_p^k\} \subset [0, 1]$, where $k + 1$ is the number
of possible offers.
Corollary 1. In a limiting logit-AQRE, $p_r(A|s_p) = 1$ for all $s_p > 0$ and
$p_r(A|0) = p_r(R|0) = \frac{1}{2}$ if $0 \in S_p$. Letting $\min_{s_p > 0} S_p = \underline{s}_p$,
1) when $0 \in S_p$ and $\underline{s}_p < \frac{1}{2}$, or when $0 \notin S_p$, $p_p(\underline{s}_p) = 1$.
2) when $0 \in S_p$ and $\underline{s}_p = \frac{1}{2}$, $p_p(0) = p_p(\frac{1}{2}) = \frac{1}{2}$.
3) when $0 \in S_p$ and $\underline{s}_p > \frac{1}{2}$, $p_p(0) = 1$.
In discrete games, a limiting logit-AQRE depends only on the size of the
minimum positive offer because, in the limit, $\pi_p(0) = \frac{1}{2}$ and $\pi_p(s_p) = 1 - s_p$
for $s_p > 0$. In the continuous
case with $0 \in S_p$, for the same reason, the limiting logit equilibrium is not a
logit equilibrium. In the limit, the Responder's logit equilibrium strategy is to
accept any positive offer with probability one and to reject offer 0 with a positive
probability. Even though such a strategy is well defined, the openness of the
Responder's acceptable set of offers implies that the Proposer does not have a best
response to it.
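The continuous case can be illustrated by approximating the Proposer's cumulative distribution $F_p$ on a grid. The sketch below uses a simple midpoint rule, which is a numerical convenience and not the method used in the paper; the function name `proposer_cdf` and the grid size are hypothetical.

```python
import math

def proposer_cdf(lam, s, grid=20000):
    """Approximate F_p(s) in the continuous logit-AQRE by a midpoint rule."""
    def payoff(y):
        # Eq. (2): (1 - y) times the agent's logistic acceptance probability.
        return (1.0 - y) / (1.0 + math.exp(-lam * y))
    h = 1.0 / grid
    mids = [(k + 0.5) * h for k in range(grid)]
    values = [payoff(y) for y in mids]
    # Logit weights from Eq. (1), shifted by the largest payoff for stability.
    top = max(values)
    weights = [math.exp(lam * (v - top)) for v in values]
    below = sum(w for y, w in zip(mids, weights) if y <= s)
    return below / sum(weights)

low_noise = proposer_cdf(200.0, 0.1)    # most mass already lies below 0.1
high_noise = proposer_cdf(20.0, 0.1)    # noisier play spreads mass over larger offers
```

As $\lambda$ grows, the approximated $F_p(s_p)$ approaches 1 for any fixed $s_p > 0$, which is the sense in which $f_p$ converges to a point-mass at 0.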
4 Normal-Form Ultimatum Bargaining Game
In a normal-form representation of the ultimatum bargaining game, the
Responder's strategy is represented by a complete contingent plan stating whether to
accept or reject each possible offer. Unlike Nash equilibrium, QRE restricts
players' behavior in all information sets as in Eq.(1). In the ultimatum bargaining
game, however, even in a limiting logit equilibrium such a restriction is not enough
to rule out the possibility that with positive probability the Responder plays
a strategy that accepts an offer but rejects bigger offers, so that the Responder
accepts all offers with highest probability but not with probability one. For instance,
the chance that the Proposer offers the whole pie vanishes so quickly that the
Responder is almost indifferent between accepting and rejecting for large $\lambda$'s. Thus,
the Responder rejects the offer of one with probability one half in any limiting
QRE. To rule out such behavior, the present analysis assumes that if the Responder
accepts an offer $s_p$ then he should accept any offers larger than $s_p$. That is, if
$s_r(s_p) = A$, then $s_r(s_p') = A$ for all $s_p' \geq s_p$. To distinguish these two normal-form
games, I shall call the game without any restriction on the Responder's choice set
the "unabridged normal-form game." After the analysis of the simplified normal-form
game, the limiting logit equilibrium of the unabridged normal-form game is also
presented, which shows that the simplifying assumption does not affect the set of
limiting logit equilibrium offers.
Under this simplifying assumption, one can identify the Responder's strategy
with his minimum acceptable offer and can analyze the normal-form
representation of a continuous version of the ultimatum bargaining game. This facilitates
the analysis, and allows a direct comparison of the logit equilibrium with limiting
behavior in the evolutionary model of the ultimatum bargaining game in Gale et
al. (1995), where the same restriction is imposed on the Responder's choice set.
Letting $s_r \in S_r$ denote the minimum acceptable offer, the modified rule is that
the Proposer states an offer, $s_p$, and the Responder writes down the minimum
acceptable offer, $s_r$, simultaneously. Note that $S_r \neq S_p$ because the Responder
could reject the offer of 1 and $S_r = S_p \cup \{\bar{s}_r\}$ with $\bar{s}_r > 1$. If $s_p \geq s_r$, then
the Proposer receives $1 - s_p$ and the Responder receives $s_p$. Otherwise, both get
nothing. The same rule is applied to both games with discrete and continuous
strategy spaces.
In a game with discrete strategies, the corresponding expected payoffs are
given by
$$\pi_p(s_p) = (1 - s_p)\, P_r(s_p) \quad\text{and}\quad \pi_r(s_r) = \sum_{s = s_r}^{\bar{s}_r} s\, p_p(s). \tag{3}$$
The simplification makes $s_r$ dominate all $s_r' > s_r$ so that
$p_r(0) = p_r(\frac{1}{n}) \geq p_r(\frac{2}{n}) \geq \cdots \geq p_r(1) \geq p_r(\bar{s}_r)$. Because the equilibrium selection of logit equilibrium
is based on strategy perturbations, one might expect that $p_p(\frac{1}{n}) = 1$ and
$p_r(0) = p_r(\frac{1}{n}) = \frac{1}{2}$ in a limiting logit equilibrium. However, this is not necessarily true as
discussed in Section 2. Given a best response, $\bar{s}_p$, as $\lambda$ grows, the probabilities of
all the offers other than $\bar{s}_p$ being played become so small that the Responder is
almost indifferent among choices of $s_r \leq \bar{s}_p$. Thus the Responder rejects positive
offers less than $\bar{s}_p$ so often that the Proposer does not have an incentive to make
an offer less than $\bar{s}_p$.
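The Responder's indifference can be checked directly from Eq.(3) in the limiting case where the Proposer makes one offer for sure. The following sketch uses exact fractions; the function name `responder_payoffs`, the grid size, and the offer $\frac{4}{10}$ are hypothetical choices for illustration only.

```python
from fractions import Fraction

def responder_payoffs(n, p_p):
    """Expected payoff of each threshold s_r = k/n from Eq. (3).

    The last entry (index n + 1) is the extra strategy above 1 that rejects all offers.
    """
    offers = [Fraction(k, n) for k in range(n + 1)]
    # A threshold s_r earns the offer s whenever s >= s_r.
    payoffs = [sum(s * q for s, q in zip(offers[k:], p_p[k:])) for k in range(n + 1)]
    payoffs.append(Fraction(0))
    return payoffs

n, k_bar = 10, 4                        # hypothetical: the Proposer offers 4/10 for sure
p_p = [Fraction(int(k == k_bar)) for k in range(n + 1)]
pi_r = responder_payoffs(n, p_p)        # equal for every threshold up to 4/10, zero above
```

Every minimum acceptable offer up to $\frac{4}{10}$ earns the same payoff, so a logit response to this limiting strategy spreads probability evenly over those thresholds.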
Lemma 1. When $n \geq 3$, in any limiting logit equilibrium, the Proposer has a
unique best response, $\bar{s}_p \in (0, \frac{1}{2})$, and $p_r$ is uniformly distributed on $[0, \bar{s}_p]$.
The Proposer's limiting logit equilibrium strategy should be between 0 and $\frac{1}{2}$,
but Lemma 1 does not identify the set of limiting logit equilibrium offers. Eq.(3)
implies that a sufficient condition for an $\bar{s}_p$ to be a unique best response is
$p_r(\bar{s}_p) \geq \frac{1}{n(1 - \bar{s}_p) + 1}$. Since the Responder's strategy is $p_r(\bar{s}_p) = \frac{1}{n \bar{s}_p + 1}$ in a limiting logit
equilibrium with a best response of $\bar{s}_p$, any $\bar{s}_p \in (0, \frac{1}{2})$ can be supported as a
unique best response because it can survive sufficiently small noise. Therefore, the
conditions in Lemma 1 are not only necessary but also sufficient.
Proposition 3. In the simplified normal-form representation of the ultimatum
bargaining game, limiting logit equilibria are:
1) when $n = 1$, $p_p(0) = 1$ and $p_r(0) = p_r(1) = p_r(\bar{s}_r) = \frac{1}{3}$.
2) when $n = 2$, $p_p(0) = p_p(\frac{1}{2}) = \frac{1}{2}$ and $p_r(0) = p_r(\frac{1}{2}) = \frac{1}{2}$.
3) when $n \geq 3$, the set of limiting logit equilibrium offers is $(0, \frac{1}{2})$. The Proposer
plays only a pure strategy $\bar{s}_p \in (0, \frac{1}{2})$, and $p_r(s_p) = \frac{1}{n \bar{s}_p + 1}$ for all $s_p \leq \bar{s}_p$ and
$p_r(s_p) = 0$ otherwise.
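The Proposer's side of case 3) can be verified with exact arithmetic: if the Responder's minimum acceptable offer is uniform on $[0, \bar{s}_p]$, Eq.(3) gives the Proposer's payoff from every offer, and one can check whether $\bar{s}_p$ is the unique maximizer. The sketch below does only this one-sided check and does not compute a logit equilibrium; the function names and the values of $n$ are hypothetical.

```python
from fractions import Fraction

def proposer_payoffs(n, k_bar):
    """Proposer's payoffs from Eq. (3) when thresholds 0, 1/n, ..., k_bar/n are equally likely."""
    mass = Fraction(1, k_bar + 1)              # equals 1 / (n * s_bar + 1)
    payoffs = []
    for k in range(n + 1):
        accept = mass * (min(k, k_bar) + 1)    # P_r(k/n): share of thresholds <= k/n
        payoffs.append((1 - Fraction(k, n)) * accept)
    return payoffs

def unique_best_offer(n, k_bar):
    """True if offering k_bar/n is the Proposer's unique best response."""
    payoffs = proposer_payoffs(n, k_bar)
    best = max(payoffs)
    winners = [k for k, u in enumerate(payoffs) if u == best]
    return winners == [k_bar]

below_half = [unique_best_offer(10, k) for k in range(1, 5)]   # offers 1/10, ..., 4/10
at_half = unique_best_offer(10, 5)                             # offer 1/2 ties with 4/10
```

With $n = 10$, each offer from $\frac{1}{10}$ to $\frac{4}{10}$ is a strict best response to the matching uniform strategy, whereas $\frac{1}{2}$ is not, consistent with the open interval $(0, \frac{1}{2})$.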
Unlike Nash equilibrium, a limiting logit equilibrium with a positive offer
relies on "credible" threats rather than incredible threats. That is, given the
consistent beliefs for a sufficiently large $\lambda$, since the imperfectly optimizing Responder
puts enough weight on the dominated strategies, $(0, \bar{s}_p]$, the Proposer has a strict
incentive to offer $\bar{s}_p$. The Responder's imperfectly optimizing behavior effectively
"threatens" the Proposer. However, this result requires consistent beliefs, and
QRE does not provide any explanation of how players can coordinate on a particular
equilibrium.
In the ultimatum bargaining game, the limiting logit equilibrium depends on
redundant strategies. If the Responder's strategies are duplicated, for instance,
$S_r = \{0, 1, 2, \ldots, k\}$, then the limiting logit equilibrium outcome is $(\frac{1}{k+1}, 0)$ while
that is $(\frac{1}{3}, 0)$ when $S_r = \{0, 1, \bar{s}_r\}$. This property is of particular interest because
duplicating strategies usually does not affect the equilibrium outcome. For the
same reason, if the strategy choice of $s_r = 0$ is duplicated enough, zero could be a
unique limiting logit equilibrium offer. On the other hand, the set of logit equilibria
is sensitive to the details of $S_p$ and that makes it hard to generalize Proposition 3 to
arbitrary $S_p$. For instance, when $S_p = \{0, \frac{1}{10}, \frac{4}{10}, \frac{4.5}{10}, \ldots, 1\}$, $\frac{4}{10}$ cannot be a limiting
logit equilibrium offer while $\frac{4.5}{10}$ can be. When $S_p = \{0, \frac{4.5}{10}, \frac{6}{10}, \ldots, 1\}$, $\frac{6}{10}$ can be
a limiting logit equilibrium offer. Nonetheless, if $S_p$ is fine enough, Proposition 3
could serve as a general description of limiting logit equilibrium.
By contrast, when the strategy spaces are continuous, the unique limiting
logit equilibrium is the trembling-hand perfect equilibrium. To get some intuition
for this result, consider a positive optimal offer in a logit equilibrium for a finite
$\lambda$. Since the Proposer's choice probability depends on the expected payoffs, in a
continuous case, some sub-optimal strategies adjacent to the optimal strategy are
played almost as often as the optimal offer no matter how large $\lambda$ is. Therefore, as
$\lambda$ grows, the Responder accepts those "sub-optimal" offers with higher probabilities,
and the size of the optimal offer gets smaller and becomes zero in the limit.
Proposition 4. When an offer can be made continuously, the unique normal-form
limiting logit equilibrium is the trembling-hand perfect equilibrium.
Now consider the discrete unabridged normal-form game where the Responder
has $2^{n+1}$ strategies. In this case, there is no well-defined cumulative distribution
function for the Responder's strategy mixture, which makes the analysis a bit more
complicated. Nonetheless, the limiting logit equilibrium can be easily characterized
using the property that if an information set is reached with probability of $o(\lambda^{-1})$,
then at the information set the player on the move puts equal probability on every
available strategy. If the Proposer's limiting logit equilibrium offer, $\bar{s}_p$, is strictly
best as in Lemma 1, the Responder plays all choices with $s_r(\bar{s}_p) = A$ with the
same probability and puts zero weight on strategies with $s_r(\bar{s}_p) = R$. To make the
comparison easier, in Corollary 2 below, I also present the Responder's strategy
using Prob(Accept $s_p$), the chance that the Responder accepts an offer of $s_p$.
Corollary 2. In the unabridged normal-form representation of the ultimatum
bargaining game, limiting logit equilibria are:
1) when $n = 1$, $p_p(0) = 1$ and $p_r(s_r) = \frac{1}{4}$ for all $s_r \in \{AA, AR, RA, RR\}$.
Prob(Accept 0) $= \frac{1}{2}$ and Prob(Accept 1) $= \frac{1}{2}$.
2) when $n = 2$, $p_p(0) = p_p(\frac{1}{2}) = \frac{1}{2}$, and $p_r(s_r) = \frac{1}{4}$ if $s_r(\frac{1}{2}) = A$ and $p_r(s_r) = 0$
otherwise. Prob(Accept $\frac{1}{2}$) $= 1$ and Prob(Accept 0) $=$ Prob(Accept 1) $= \frac{1}{2}$.
3) when $n \geq 3$, the set of limiting logit equilibrium offers is $(0, \frac{1}{2})$. The Proposer
plays only a pure strategy $\bar{s}_p \in (0, \frac{1}{2})$, and $p_r(s_r) = \frac{1}{2^n}$ if $s_r(\bar{s}_p) = A$ and $p_r(s_r) = 0$
otherwise. Prob(Accept $\bar{s}_p$) $= 1$ and Prob(Accept $s_p$) $= \frac{1}{2}$ for every $s_p \neq \bar{s}_p$.
Although the Responder's equilibrium strategies of normal-form games appear
quite different from those of the simplified games, the implications of logit
equilibrium are exactly the same in the sense that the equilibrium strategies in
Proposition 3 can be directly generated from those in Corollary 2.
Proposition 5. Given the strategy profiles in Corollary 2, if the Responder's
strategies such that $s_r(s_p) = A$ but $s_r(s_p') = R$ for some $s_p < s_p'$ are deleted and the
remaining probabilities are rescaled, the resulting strategy profiles are identical to
those in Proposition 3.
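Proposition 5 can be checked by brute force for small $n$. The sketch below enumerates the $2^{n+1}$ acceptance rules of the unabridged game, starts from the mixture in cases 2) and 3) of Corollary 2 (equal weight on every rule that accepts $\bar{s}_p$), deletes the non-monotone rules, and rescales. The function name `prune_and_rescale` and the test values are hypothetical; case 1) of Corollary 2 has a different starting mixture and is not covered.

```python
from fractions import Fraction
from itertools import product

def prune_and_rescale(n, k_bar):
    """Delete non-monotone acceptance rules from Corollary 2's mixture and rescale."""
    rules = list(product("AR", repeat=n + 1))            # one A/R choice per offer k/n
    accepting = [r for r in rules if r[k_bar] == "A"]    # equal weight each, zero elsewhere
    weight = Fraction(1, len(accepting))
    # Monotone rule: once an offer is accepted, every larger offer is accepted too,
    # so an "A" is never immediately followed by an "R".
    monotone = [r for r in accepting if "AR" not in "".join(r)]
    total = weight * len(monotone)
    # Identify each surviving rule with its minimum acceptable offer (index of first "A").
    return {r.index("A"): weight / total for r in monotone}

thresholds = prune_and_rescale(10, 4)    # hypothetical case: n = 10, offer 4/10
```

In this example the surviving rules are the thresholds $0, \frac{1}{10}, \ldots, \frac{4}{10}$, each with probability $\frac{1}{5} = \frac{1}{n \bar{s}_p + 1}$, which matches case 3) of Proposition 3.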
5 Concluding Remarks
This paper provides a general characterization of logit equilibrium in the
ultimatum bargaining game assuming monetary payoff maximization. In the discrete
versions, the Responder could reject positive offers with positive probabilities and
thus the Proposer has a strict incentive to offer a positive share of the pie in a
limiting normal-form logit equilibrium, whereas the Responder receives the
minimum positive offer in the limiting logit-AQRE. In the continuous versions of the
game, both notions of limiting QRE coincide with its unique trembling-hand
perfect equilibrium.
In the ultimatum bargaining game experiments, most offers were concentrated
between 30% and 50%, and positive but smaller offers were often rejected. Clearly,
the predictions of all but the discrete normal-form logit equilibrium do not agree
with the subjects' choice behavior. The normal-form logit equilibrium in the
discrete case is compatible with experimental results, but its interpretation is not
compelling. In a normal-form logit equilibrium, the Responder is not willing to
reject any positive offers but he "does" either because he "knows" that the Proposer
should never make those offers or by mistake. However, neither seems to be the
case in experiments such as Prasnikar and Roth (1992), Binmore et al. (2001), and
Johnson et al. (2002). In particular, Binmore et al. (2001) examined variations of
the ultimatum bargaining game where the size of the pie is 100 and the disagreement
outcome is not (0,0). In the experiment, the subjects were extremely sensitive to
the disagreement outcomes and the result shows clearly that most rejected
positive offers were rejected not by mistake but intentionally, whereas QRE predicts
no difference in choice behavior.6 These experimental results question the use of
normal-form logit equilibrium with monetary payoff maximization as a
description of subjects' choice behavior in ultimatum bargaining game experiments. As
a matter of fact, the Responder having a strict incentive to reject positive offers is
not compatible with monetary payoff maximization in any sensible static model,
and it seems inevitable to incorporate more elaborate preferences such as social
utilities into the analysis.
Appendix
Proof of Proposition 1. This proof deals only with continuous strategy cases.
In the agent normal-form representation, each agent at each information set, $s_p$,
makes an independent decision. Since accepting an offer $s_p$ gives the Responder
$s_p$, the agent of the Responder accepts the offer with probability
$\frac{\exp(\lambda s_p)}{1 + \exp(\lambda s_p)}$, which
is independent of $p_p$. Given $p_r$, the expected payoff from offering $s_p$ is a function
of $p_r$ and the existence follows. For a normal-form game, see Anderson, Goeree
and Holt (1998, Appendix A) for the existence proof. The only difference in the
present analysis is the payoff function, but it is still continuous, which is all their
proof requires. Finally, the proofs of convergence in McKelvey and Palfrey (1995,
1998) require only the existence. Q.E.D.
Footnote 6. When the disagreement outcome is (10,10), 40% (103 of 257) of offers in the
interval [20,30] were rejected. Only 0.75% (1 of 133) were rejected when the disagreement
outcome is (70,10). These two treatments differ only in the Proposer's disagreement outcome,
and the difference in rejection rates is too large to attribute those rejections to mistakes.
Instead, for some reason the Responder seems to have a strict incentive to reject positive offers.

Proof of Proposition 2. In any case, since
$p_r(A|s_p) = \frac{\exp(\lambda s_p)}{1+\exp(\lambda s_p)}$, we have
$p_r(A|0) = \frac{1}{2}$, $p_r(R|0) = \frac{1}{2}$, $\pi_p(0) = \frac{1}{2}$,
and $\pi_p(1) = 0$ for all $\lambda \geq 0$, and for every $s_p > 0$,
$p_r(A|s_p) \to 1$ as $\lambda \to \infty$. Given these, consider the discrete
case first. The cases of $n = 1$ and $n \geq 3$ are trivial. When $n = 2$, the
result follows from $\pi_p(1) = 0$ and $p_p(0) = p_p(\frac{1}{2})$ in the limit:
$$
\frac{p_p(0)}{p_p(\frac{1}{2})}
= \exp\left[\lambda\left(\frac{1}{2} - \frac{1}{2}\cdot\frac{\exp(\lambda/2)}{1+\exp(\lambda/2)}\right)\right]
= \exp\left[\frac{\lambda}{2\,(1+\exp(\lambda/2))}\right] \to 1 \quad \text{as } \lambda \to \infty.
$$
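As a numerical sanity check of the $n = 2$ ratio, the following Python sketch compares the ratio computed directly from the payoffs with the simplified closed form. It assumes the agent-form acceptance probability $\exp(\lambda s_p)/(1+\exp(\lambda s_p))$ and a pie of size 1; the function names are hypothetical.

```python
import math


def ratio_zero_to_half(lam: float) -> float:
    """p_p(0) / p_p(1/2), computed from the Proposer's expected payoffs."""
    accept_half = 1.0 / (1.0 + math.exp(-lam / 2.0))  # p_r(A | 1/2)
    pi_zero = 0.5                 # (1 - 0) * p_r(A | 0), with p_r(A | 0) = 1/2
    pi_half = 0.5 * accept_half   # (1 - 1/2) * p_r(A | 1/2)
    return math.exp(lam * (pi_zero - pi_half))


def closed_form(lam: float) -> float:
    """The simplified expression exp(lam / (2 * (1 + exp(lam / 2))))."""
    return math.exp(lam / (2.0 * (1.0 + math.exp(lam / 2.0))))


if __name__ == "__main__":
    for lam in (0.5, 2.0, 10.0, 50.0, 200.0):
        print(lam, ratio_zero_to_half(lam), closed_form(lam))
```

For moderate `lam` the ratio exceeds 1, so the zero offer is more likely than the half offer, and the gap closes as `lam` grows.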
With a continuous strategy space, from Eq.(1), we have
$$
F_p(s_p) = \frac{\int_0^{s_p} \exp\left(\lambda(\pi_p(y) - \pi_p(s_p))\right) dy}
{\int_0^{s_p} \exp\left(\lambda(\pi_p(y) - \pi_p(s_p))\right) dy + \int_{s_p}^{1} \exp\left(\lambda(\pi_p(y) - \pi_p(s_p))\right) dy}.
$$
Since $(\pi_p(s_p) - \pi_p(s_p')) \to (s_p' - s_p)$ as $\lambda \to \infty$,
$F_p(s_p) \to 1$ for all $s_p > 0$. Q.E.D.

Proof of Lemma 1.
After I characterize the limiting logit equilibrium, I identify
necessary conditions for a limiting logit equilibrium strategy profile.
Let $\bar{s}_p$ be the largest offer played with positive probability in a limiting
logit equilibrium. Since in a limiting logit equilibrium the offer should be accepted
with probability one, if $p_p(\bar{s}_p) > 0$, then $P_r(\bar{s}_p) = 1$ and
$\pi_p(\bar{s}_p) \geq \pi_p(s_p) + \frac{1}{n}$ for all $s_p > \bar{s}_p$. On the
other hand, from Eq.(3), $\pi_p(s_p) - \pi_p(s_p - \frac{1}{n}) \geq 0$ if and only
if $p_r(s_p) - \frac{P_r(s_p)}{n(1-s_p)+1} \geq 0$. Since
$p_r(s_p) - \frac{P_r(s_p)}{n(1-s_p)+1}$ is strictly decreasing in $s_p$,
$p_r(\bar{s}_p) \geq \frac{P_r(\bar{s}_p)}{n(1-\bar{s}_p)+1}$ implies
$\pi_p(\bar{s}_p) \geq \pi_p(\bar{s}_p - \frac{1}{n}) > \pi_p(s_p)$ for all
$s_p < \bar{s}_p - \frac{1}{n}$, and there exists an $\varepsilon > 0$ such that
$\pi_p(\bar{s}_p) - \pi_p(s_p) > \varepsilon > 0$ for all
$s_p \in S_p \setminus \{\bar{s}_p, \bar{s}_p - \frac{1}{n}\}$.
Therefore, the only possible multiple best responses are $\bar{s}_p$ and
$\bar{s}_p - \frac{1}{n}$, but with $p_p(\bar{s}_p - \frac{1}{n}) = 0$.
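Since the best response in a limiting logit equilibrium is best along the associated
converging sequence of logit equilibria, for all
$s_p \neq \bar{s}_p, \bar{s}_p - \frac{1}{n}$, there exists an $\varepsilon > 0$
such that $\pi_p(\bar{s}_p - \frac{1}{n}) - \pi_p(s_p) > \varepsilon$, and
$\lambda^h p_p(s_p) \to 0$ for any finite $h$ because
$$
p_p(s_p) = \frac{\exp(\lambda\pi_p(s_p))/\exp(\lambda\pi_p(\bar{s}_p))}
{\sum_{j \in S_p} \exp(\lambda\pi_p(j))/\exp(\lambda\pi_p(\bar{s}_p))}
= \frac{\exp[\lambda(\pi_p(s_p) - \pi_p(\bar{s}_p))]}
{1 + \sum_{j \neq \bar{s}_p} \exp[\lambda(\pi_p(j) - \pi_p(\bar{s}_p))]}. \qquad (*)
$$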
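This implies that $p_r(s_p) = p_r(s_p') + o(\lambda^{-h})$ for all
$s_p, s_p' \leq \bar{s}_p - \frac{1}{n}$, and $p_r(s_p) = o(\lambda^{-h})$
for all $s_p > \bar{s}_p$. For a sufficiently large $\lambda$, that allows us to
write $p_p(\bar{s}_p)$ and $p_r(\bar{s}_p)$ as
$$
p_p(\bar{s}_p) = \frac{\exp[\lambda(1-\bar{s}_p)P_r(\bar{s}_p)]}{\sum_{s_p \in S_p} \exp[\lambda(1-s_p)P_r(s_p)]}
= \frac{1}{1 + \exp\left[\lambda\left(\frac{1}{n} - (1-\bar{s}_p+\frac{1}{n})\,p_r(\bar{s}_p)\right)\right]} + o(\lambda^{-h}),
$$
$$
p_r(\bar{s}_p) = \frac{\exp\left[\lambda\sum_{s \geq \bar{s}_p} s\,p_p(s)\right]}{\sum_{s_r \in S_r} \exp\left[\lambda\sum_{s \geq s_r} s\,p_p(s)\right]}
= \frac{1}{1 + n\bar{s}_p \exp\left[\lambda(\bar{s}_p - \frac{1}{n})\,p_p(\bar{s}_p - \frac{1}{n})\right]} + o(\lambda^{-h}).
$$
Since $p_p(\bar{s}_p - \frac{1}{n}) = 1 - p_p(\bar{s}_p) + o(\lambda^{-h})$,
substituting $p_p(\bar{s}_p)$ into $p_r(\bar{s}_p)$ yields
$$
p_r(\bar{s}_p) = \frac{1}{1 + n\bar{s}_p \exp\left[\lambda(\bar{s}_p - \frac{1}{n})\,
\frac{\exp\left[\lambda\left(\frac{1}{n} - (1-\bar{s}_p+\frac{1}{n})\,p_r(\bar{s}_p)\right)\right]}
{1 + \exp\left[\lambda\left(\frac{1}{n} - (1-\bar{s}_p+\frac{1}{n})\,p_r(\bar{s}_p)\right)\right]}\right]} + o(\lambda^{-h}).
$$
Since $p_p(\bar{s}_p - \frac{1}{n}) \to 0$ and
$\frac{p_r(\bar{s}_p - \frac{1}{n})}{p_r(\bar{s}_p)} = \exp[\lambda(\bar{s}_p - \frac{1}{n})\,p_p(\bar{s}_p - \frac{1}{n})]$
converges to a positive constant, the above equation could hold only when
$p_r(\bar{s}_p) > \frac{1/n}{(1-\bar{s}_p)+\frac{1}{n}} + \varepsilon$ for some
$\varepsilon > 0$. Therefore, $p_r(\bar{s}_p)$ converges to $\frac{1}{n\bar{s}_p+1}$
and $\bar{s}_p$ is a unique best response in
a limiting logit equilibrium. This also characterizes the limiting logit equilibrium
strategy profile given $\bar{s}_p$.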
Next, let's find boundaries. Since Pr ( n1 ) = 2Pr (0) ¡
Pr (0) ∙ 2 n¤n 1 Pr (0) >
2
3n
2
n
for all , ¸p ( n1 )∙ ¸p (0) =
and a logit equilibrium offer is positive. For the upper
bound, from
$$
\pi_p(s_p) - \pi_p\left(s_p - \frac{1}{n}\right)
= (1-s_p)\,p_r(s_p) - \frac{1}{n}P_r\left(s_p - \frac{1}{n}\right)
\leq (1-s_p)\,p_r(s_p) - \frac{n s_p}{n}\,p_r(s_p) = (1-2s_p)\,p_r(s_p),
$$
$\pi_p(s_p) - \pi_p(s_p - \frac{1}{n}) \leq 0$ for any $s_p \geq \frac{1}{2}$.
Thus, $s_p \geq \frac{1}{2}$ cannot be played with
probability one and the result follows. Q.E.D.
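The upper bound can be checked numerically in the limiting configuration described above, where the Responder's acceptance threshold is uniform on $\{0, \frac{1}{n}, \ldots, \bar{s}_p\}$. The Python sketch below uses exact rational arithmetic and tests only whether the offer $\bar{s}_p = k/n$ is a strict best response for the Proposer; it does not reproduce the lower-bound argument, and the function names and the integer index `k_bar` are hypothetical.

```python
from fractions import Fraction


def proposer_payoffs(n: int, k_bar: int) -> list:
    """Expected payoffs of offers 0, 1/n, ..., 1 when the Responder's
    acceptance threshold is uniform on {0, 1/n, ..., k_bar/n}."""
    payoffs = []
    for k in range(n + 1):
        # Offer k/n is accepted by every threshold j/n with j <= k.
        accept = Fraction(min(k, k_bar) + 1, k_bar + 1)
        payoffs.append((1 - Fraction(k, n)) * accept)
    return payoffs


def is_strict_best_response(n: int, k_bar: int) -> bool:
    """True if offering k_bar/n is strictly better than every other offer."""
    payoffs = proposer_payoffs(n, k_bar)
    return all(payoffs[k_bar] > p for k, p in enumerate(payoffs) if k != k_bar)


if __name__ == "__main__":
    n = 10
    for k_bar in range(n + 1):
        print(Fraction(k_bar, n), is_strict_best_response(n, k_bar))
```

With `n = 10`, the offer is a strict best response only for offers strictly below one half, in line with the bound derived in the proof.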
Proof of Proposition 3.
This proof considers only the games with n = 1; 2.
When n = 1, pr (0) = pr (1) = pr (sr ) = 13 , and ¸p (0) ¡
1
2
> ¸p (1) for all . When
n = 2, since pr (0) = pr ( 12 ) ¡ pr (1) ¡ pr (sr ), ¸p (0) = ¸p ( 21 ) > ¸p (1) or pp (0) =
pp ( 21 ) > pp (1). Then the result follows from ¸r (0) = ¸r ( 21 ) ¡
1
2
>
1
3
¡ ¸r (1).
Proof of Proposition 4. When the strategy space is continuous, the expected
payoffs are
$$
\pi_p(s_p; f_r) = (1 - s_p)F_r(s_p) \quad \text{and} \quad \pi_r(s_r; f_p) = \int_{s_r}^{1} s f_p(s)\,ds.
$$
Since for any finite $\lambda$ the logit equilibrium densities, $f_i(s_i)$'s, are
differentiable with respect to $s_i$ [footnote 7], we have
$f_i'(s_i) = \lambda f_i(s_i) D_x \pi_i(s_i)$, or
$$
f_p'(s_p) = \lambda\left[(1 - s_p)f_r(s_p) - F_r(s_p)\right] f_p(s_p),
$$
$$
f_r'(s_r) = -\lambda s_r f_p(s_r) f_r(s_r).
$$
Integrating these from 0 to $s_p$ and from 0 to $s_r$, respectively,
$$
f_p(s_p) = f_p(0) + \lambda\int_0^{s_p} \left[(1 - s)f_r(s) - F_r(s)\right] f_p(s)\,ds,
$$
$$
f_r(s_r) = f_r(0) - \lambda\int_0^{s_r} s f_r(s) f_p(s)\,ds.
$$
In a logit equilibrium for a finite $\lambda$, since $f_r'(s_r) \leq 0$ and
$D_x^2 \pi_p(s_p) = (1 - s_p)f_r'(s_p) - 2f_r(s_p) < 0$, the Proposer's best
choice is unique for every $\lambda \geq 0$.
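The step from the logit densities to the two differential equations can be spelled out as follows. This is a sketch that assumes Eq.(1) has the standard continuous logit form, with $\lambda$ the precision parameter and $\pi_i$ the expected payoff; the exact statement of Eq.(1) is in the main text.

```latex
f_i(s_i) = \frac{\exp(\lambda \pi_i(s_i))}{\int_0^1 \exp(\lambda \pi_i(y))\,dy}
\;\Longrightarrow\;
f_i'(s_i) = \lambda\, \pi_i'(s_i)\, f_i(s_i),
\\[4pt]
\pi_p'(s_p) = \frac{d}{ds_p}\Big[(1-s_p)F_r(s_p)\Big] = (1-s_p) f_r(s_p) - F_r(s_p),
\\[4pt]
\pi_r'(s_r) = \frac{d}{ds_r}\int_{s_r}^{1} s f_p(s)\,ds = -\,s_r f_p(s_r).
```

Differentiating $\pi_p'$ once more gives $(1-s_p)f_r'(s_p) - 2f_r(s_p)$, which is the second derivative used in the uniqueness argument.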
Let $s_p^* = \arg\max_{s_p} \pi_p(s_p; f_r)$. To obtain a contradiction, suppose that
there exists a sufficiently large $\bar{\lambda}$ such that
$s_p^* \geq \varepsilon > 0$ for all $\lambda > \bar{\lambda}$. Then from the
equation for $f_p'$, $(1 - s_p^*) f_r(s_p^*) = F_r(s_p^*)$, or
$$
\begin{aligned}
1 - s_p^* = \frac{F_r(s_p^*)}{f_r(s_p^*)}
&= \int_0^{s_p^*} \exp\left[\lambda(\pi_r(s) - \pi_r(s_p^*))\right] ds
\geq \int_0^{s_p^*/2} \exp\left[\lambda\int_{s_p^*/2}^{s_p^*} s f_p(s)\,ds\right] dy \\
&= \frac{s_p^*}{2} \exp\left[\lambda\left(s_p^* F_p(s_p^*) - \frac{s_p^*}{2} F_p\left(\frac{s_p^*}{2}\right) - \int_{s_p^*/2}^{s_p^*} F_p(s)\,ds\right)\right] \\
&\geq \frac{s_p^*}{2} \exp\left[\lambda\,\frac{s_p^*}{2}\left(F_p(s_p^*) - F_p\left(\frac{s_p^*}{2}\right)\right)\right]
\geq \frac{s_p^*}{2} \exp\left[\lambda\,\frac{s_p^*}{4} F_p(s_p^*)\right].
\end{aligned}
$$
The last inequality holds because $f_p(s_p)$ is increasing on $[0, s_p^*]$. Since
$s_p^* \geq \varepsilon$, $\lambda s_p^* F_p(s_p^*)$ should be bounded, and this
implies that $f_r(0)$ is bounded because
$$
\frac{f_r(0)}{f_r(s_p^*)} = \exp\left[\lambda\int_0^{s_p^*} s f_p(s)\,ds\right] \leq \exp\left[\lambda s_p^* F_p(s_p^*)\right].
$$
Finally,
$$
f_p(s_p^*) = f_p(0) + \lambda\int_0^{s_p^*} \left[(1 - s)f_r(s) - F_r(s)\right] f_p(s)\,ds \leq f_p(0) + \lambda f_r(0) F_p(s_p^*),
$$
where the inequality follows from the fact that $f_r(s_r)$ is non-increasing in $s_r$.
This implies that $f_p(s_p^*)$ is bounded, which is a contradiction. Q.E.D.

Footnote 7. The $\pi_i(s_i)$'s are differentiable with respect to $s_i$, and in Eq.(1), given
the strictly positive denominator, the $f_i(s_i)$'s are ratios of differentiable functions of
$\pi_i(s_i)$.
Proof of Corollary 2. When $n = 1$, $S_p = \{0, 1\}$ and $p_r(AA) \geq \frac{1}{4}$
because $AA$ is dominant. Thus $\pi_p(0) > \pi_p(1)$ for all $\lambda$, and the result
follows from $p_r(AA)/p_r(RR) = \exp[\lambda p_p(1)]$ and
$\pi_r(AA) \geq \pi_r(RA) \geq \pi_r(AR) = \pi_r(RR)$.
When $n = 2$, since $AAA$ is dominant, $p_r(AAA) \geq \frac{1}{8}$ and
$\pi_p(0) > \pi_p(1)$ for all $\lambda$. Since $s_r(0)$ does not affect the
Responder's payoff, the Responder puts positive weight only on $s_r$'s with
$s_r(\frac{1}{2}) = A$. The Proposer's equilibrium strategy is determined by the
fact that the $p_r(s_r)$'s for $s_r$'s with $s_r(\frac{1}{2}) = R$ vanish at the
rate of $o(\lambda^{-1})$.
When $n \geq 3$, one can show that the Proposer's limiting logit equilibrium offer
is unique and strictly positive using an argument similar to that in the proof of
Lemma 1. The only difference is that the probability of $s_p$ being accepted is the
sum of $p_r(s_r)$ over all $s_r \in \{s_r \mid s_r(s_p) = A\}$. Given this, it is
straightforward to find the Responder's equilibrium strategy using the fact that
$p_p(s_p) \to 0$ for all $s_p \neq \bar{s}_p$ and Eq.(*).
Finally, since $s_r(0)$ does not affect the Responder's expected payoff, the set of
equilibrium offers is determined by the fact that the Responder accepts any offer
other than the equilibrium offer with probability one half, so that
$\pi_p(0) = \frac{1}{2}$ for all $\lambda$. Q.E.D.
Proof of Proposition 5. When $n = 1$, after deleting $AR$, rescaling the $p_r$'s
gives $p_r(AA) = p_r(RA) = p_r(RR) = \frac{1}{3}$, which gives
Prob(Accept 0) $= \frac{1}{3}$ and Prob(Accept 1) $= \frac{2}{3}$, as in Proposition 3.
When $n = 2$, $\{AAA, AAR, RAA, RAR\}$ are each played with positive probability
$\frac{1}{4}$. Deleting $AAR$ and $RAR$ and rescaling yields
$p_r(AAA) = p_r(RAA) = \frac{1}{2}$, with which Prob(Accept 0) $= \frac{1}{2}$,
Prob(Accept $\frac{1}{2}$) $= 1$, and Prob(Accept 1) $= 1$.
When $n \geq 3$, let $\bar{s}_p$ be the limiting logit equilibrium offer. First,
delete the Responder's strategies such that $s_r(s_p) = A$ but $s_r(s_p') = R$ for
$s_p < s_p'$. Then there remain $n\bar{s}_p + 1$ strategies that are played with
equal probabilities of $\frac{1}{n\bar{s}_p + 1}$ after rescaling. Then
Prob(Accept $s_p$) $=$ Prob(Accept $s_p - \frac{1}{n}$) $+ \frac{1}{n\bar{s}_p + 1}$
for all $s_p \leq \bar{s}_p$ and Prob(Accept $\bar{s}_p$) $= 1$, and the result
follows. Q.E.D.
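The deletion-and-rescaling step can be made concrete with a short Python sketch. It assumes a Responder strategy is written as a string of replies (A or R) to the offers $0, \frac{1}{n}, \ldots, 1$ in that order, as in the proof; the helper names `accept_probs`, `rescale_uniform`, and `threshold_strategies` are hypothetical.

```python
from fractions import Fraction


def accept_probs(strategy_probs: dict) -> list:
    """Map {strategy string: probability} to Prob(Accept offer i) per offer.

    A strategy such as "RAA" lists the Responder's reply (A or R) to the
    offers 0, 1/n, ..., 1 in that order.
    """
    n_offers = len(next(iter(strategy_probs)))
    return [
        sum(p for s, p in strategy_probs.items() if s[i] == "A")
        for i in range(n_offers)
    ]


def rescale_uniform(strategies: list) -> dict:
    """Equal weights on the strategies that survive deletion."""
    return {s: Fraction(1, len(strategies)) for s in strategies}


def threshold_strategies(n: int, k_bar: int) -> list:
    """Strategies that reject offers below j/n and accept the rest, j = 0..k_bar."""
    return ["R" * j + "A" * (n + 1 - j) for j in range(k_bar + 1)]


if __name__ == "__main__":
    print(accept_probs(rescale_uniform(["AA", "RA", "RR"])))       # n = 1
    print(accept_probs(rescale_uniform(["AAA", "RAA"])))           # n = 2
    print(accept_probs(rescale_uniform(threshold_strategies(4, 2))))
```

In the last case each step up in the offer, up to the equilibrium offer, raises the acceptance probability by the same increment, which is the recursion stated in the proof.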
References
1. Anderson, Simon, Jacob Goeree, and Charles Holt (1998), "Rent Seeking
with Bounded Rationality: An Analysis of the All-Pay Auction," Journal of
Political Economy, 106, 828-853.
2. Anderson, Simon, Jacob Goeree, and Charles Holt (2001), "Minimum Effort
Coordination Games: Stochastic Potential and Logit Equilibrium," Games
and Economic Behavior, 34, 177-199.
3. Binmore, Ken, John McCarthy, Giovanni Ponti, Larry Samuelson, and Avner
Shaked (2001), "A Backward Induction Experiment," SSRI working paper
9934R, University of Wisconsin.
4. Bolton, Gary and Axel Ockenfels (2000), "ERC: A Theory of Equity, Reciprocity, and Competition," American Economic Review, 90, 166-193.
5. Costa-Gomes, Miguel and Klaus Zauner (2001), "Ultimatum Bargaining Behavior in Israel, Japan, Slovenia, and the United States: A Social Utility
Analysis," Games and Economic Behavior, 34, 238-269.
6. Fehr, Ernst and Klaus Schmidt (1999), "A Theory of Fairness, Competition,
and Cooperation," Quarterly Journal of Economics, 114, 817-868.
7. Gale, John, Kenneth Binmore, and Larry Samuelson (1995), "Learning to be
Imperfect: The Ultimatum Game," Games and Economic Behavior, 8, 56-90.
8. Johnson, Eric, Colin Camerer, Sankar Sen, and Talia Rymon (2002), "Detecting Failures of Backward Induction: Monitoring Information Search in
Sequential Bargaining," Journal of Economic Theory, in press.
9. McKelvey, Richard and Thomas Palfrey (1995), "Quantal Response Equilibria For Normal-Form Games," Games and Economic Behavior, 10, 6-38.
10. McKelvey, Richard and Thomas Palfrey (1998), "Quantal Response Equilibria For Extensive-Form Games," Experimental Economics, 1, 9-41.
11. Roth, Alvin and Vesna Prasnikar (1992), "Considerations of Fairness and
Strategy: Experimental Data From Sequential Games," Quarterly Journal
of Economics, 107, 865-888.
12. Rabin, Matthew (1993), "Incorporating Fairness into Game Theory and Economics," American Economic Review, 83, 1281-1302.
13. Roth, Alvin (1995), "Bargaining Experiments," in John Kagel and Alvin
Roth, editors, Handbook of Experimental Economics, 253-348, Princeton:
Princeton University Press.
14. Schotter, A., K. Weigelt, and C. Wilson (1994), "A Laboratory Investigation
of Multiperson Rationality and Presentation Effects," Games and Economic
Behavior, 6, 445-468.