Ambiguity Aversion, Robustness, and the Variational

Ambiguity Aversion, Robustness, and the Variational Representation of Preferences
Author(s): Fabio Maccheroni, Massimo Marinacci, Aldo Rustichini
Source: Econometrica, Vol. 74, No. 6 (Nov., 2006), pp. 1447-1498
Published by: The Econometric Society
Stable URL: http://www.jstor.org/stable/4123081
Accessed: 18/03/2009 06:50
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=econosoc.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit organization founded in 1995 to build trusted digital archives for scholarship. We work with the
scholarly community to preserve their work and the materials they rely upon, and to build a common research platform that
promotes the discovery and use of these resources. For more information about JSTOR, please contact [email protected].
The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.
http://www.jstor.org
Econometrica,Vol. 74, No. 6 (November, 2006), 1447-1498
AMBIGUITY AVERSION, ROBUSTNESS, AND THE VARIATIONAL
REPRESENTATION OF PREFERENCES
BY FABIO MACCHERONI, MASSIMO MARINACCI, AND ALDO RUSTICHINI1
We characterize,in the Anscombe-Aumann framework,the preferences for which
there are a utilityfunction u on outcomes and an ambiguityindex c on the set of probabilities on the states of the world such that, for all acts f and g,
ftg
+c(p)
min(Ju(f)dp+
c(p) >min u(g)dp
The function u representsthe decision maker'sriskattitudes,while the index c captures
his ambiguityattitudes. These preferences include the multiple priors preferences of
Gilboa and Schmeidler and the multiplier preferences of Hansen and Sargent. This
provides a rigorousdecision-theoreticfoundation for the latter model, which has been
widely used in macroeconomicsand finance.
KEYWORDS:
Ambiguityaversion,model uncertainty,robustness.
1. INTRODUCTION
AMBIGUITY HAS BEENA CLASSICISSUEin decision theory since the seminal
work of Ellsberg (1961). The fundamental feature of ambiguity pointed out
by Ellsberg is that there may be no belief on the states of the world that the
decision maker holds and that rationalizes his choices.
A widely used class of preferences that model ambiguity are the multiple
priorspreferencesaxiomatized by Gilboa and Schmeidler (1989) (also known
as maxmin expected utility preferences). Agents with these preferences rank
payoff profiles f accordingto the criterion
(1)
u(f)dp,
V(f) =min
peCf
where C is a given convex subset of the set A of all probabilitieson states. The
set C is interpreted as a set of priors held by agents, and ambiguityis reflected
by the multiplicityof the priors.
1An extended version of this paper was previouslycirculatedwith the title "VariationalRepresentation of Preferencesunder Ambiguity"ICER WorkingPaper5/2004, March 2004. We thank
Erio Castagnoli,Rose-Anne Dana, LarryEpstein, Peter Klibanoff,Bart Lipman,MarkMachina,
JianjunMiao, SujoyMukerji,Emre Ozdenoren,Ben Polak, and, especially,Andy Postlewaiteand
four anonymousreferees for helpful discussions and suggestions.We also thank several seminar
audiences. Part of this research was done while the first two authors were visiting the Department of Economics of Boston University,which is thanked for its hospitality. They also gratefully acknowledgethe financial support of the Ministero dell'Istruzione,dell'Universithe della
Ricerca. Rustichinigratefullyacknowledgesthe financialsupport of the National Science Foundation (Grant SES-04-52477).
1447
1448
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
In the past few years the possibility that agents may not hold a single belief on the states of the world has widely informed the research in macroeconomics and finance. In particular,there has been a growing dissatisfactionin
macroeconomicstowardthe stronguniformityon the agents' beliefs that is imposed by the rationalexpectationshypothesis.Under this hypothesis,all agents
share the same probabilitydistributionon some relevant economic phenomenon and each agent has to be firmlyconvinced that the model he has adopted
is the correct one. This is a strong requirement because agents can have different models, each of them being only an approximationof the underlying
true model, and they may be aware of the possibilitythat their model is misspecified. A weakening of this requirementallows agents to entertain different
priors on the economy.
A large part of the research has modeled ambiguitywith multiple priors
models (see, for instance, Epstein and Wang (1994) and Chen and Epstein
(2002)). More recently,a different alternativehas been explored, startingwith
the work of Hansen and Sargent (2000, 2001) which in turn builds on earlier
work in the field of robustcontrol in the engineeringand optimal controlliterature. In this robustpreferences approach,the objectivefunctions of the agents
take into account the possibilitythat their model q may not be the correctone,
but only an approximation.Specifically,agents rankpayoff profiles f according
to the choice criterion
(2)
V(f)=-min
fu(f)dp
+ OR(p
Iq)},
where R(. IIq):A -* [0, oo] is the relative entropy with respect to q (see Section 4.2 for the definition). Preferences represented by criterion (2) are called
multiplierpreferences.2
Agents who behave according to this choice criterion are considering the
possibility that q may not be the appropriate law that governs the phenomenon in which they are interested and for this reason they take into account
other possible models p. The relative likelihood of these alternative models
is measured by their relative entropy, while the positive parameter 0 reflects
the weight that agents are giving to the possibilitythat q might not be the correct model. As the parameter 0 becomes larger,agents focus more on q as the
correct model, giving less importanceto possible alternativemodels p (Proposition 22).
Hansen and Sargent (2000) have pointed out that this model uncertainty
can be viewed as the outcome of ambiguity,possibly resulting from the poor
quality of the informationon which agents base the choice of their model. For
this reason, the motivation behind multiplier preferences is closely connected
2Hansen and Sargent (2001) also considered a class of multiple priors preferenceswith C =
{pe A: R(p IIq) < r7},which they called constraintpreferences.
VARIATIONALREPRESENTATIONOF PREFERENCES
1449
to the motivation that underlies multiple priors preferences. So far, however,
this connection has been stated simply as intuitively appealing rather than established on formal grounds. In particular,no behavioral foundation of the
preferences in (2) has been provided.
Here we establish this connection by presenting a general class of preferences with common behavioralfeatures that includes both multiplierand multiple prior preferences as special cases. The nature of the connection between
the two main models that we have been discussingso far can be firstestablished
formally.The multiple priors criterion (1) can be written as
(3)
V(f) = min
u(f)dp+8
c(p)},
where Sc: A -- [0, oc] is the indicator function of C (in the sense of convex
analysis;see Rockafellar (1970)) given by
c(P)
pEj
O,
oc,
if
C,
otherwise.
Like the relative entropy, the indicator function also is a convex function defined on the simplex A.
This reformulationclarifies the formal connection with the multiplier preferences in (2) and suggests the general representation
(4)
V(f) = min
u(f) dp + c(p) ,
where c: A -- [0, oo] is a convex function on the simplex. In this paper, we
show that the connection is substantiveand not formal by establishingthat the
two models have a common behavioralfoundation.
We first axiomatize (Theorem 3) the representation (4) by showing how it
rests on a simple set of axioms that generalizes the multiple priors axiomatization of Gilboa and Schmeidler (1989). We then show (Proposition 8) how
to interpret in a rigorous way the function c as an index of ambiguity aversion: the lower is c, the higher is the ambiguityaversion exhibitedby the agent.
The relative entropy OR(p 11q) and the indicator function 8c(p) can thus be
viewed as special instances of ambiguityindices. The assumptionson behavior
for this general representation are surprisinglyclose to those given by Gilboa
and Schmeidler (1989), and result from a simple weakening of their certainty
independence axiom, so the similaritybetween the different representations
has a sound behavioralfoundation.
Once we have established this common structure, we can analyze the relationship between ambiguity aversion and probabilistic sophistication. For
example, the multiplier preferences used by Hansen and Sargent are probabilisticallysophisticated (and the same is true for their constraintpreferences).
1450
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
Is this a fundamental property of robust preferences that makes them profoundly different from multiple priors? In the common structure provided
by the representation (4), the answer is simple and clear. The preferences
in (4) are probabilisticallysophisticated if and only if the ambiguityindex c is
symmetric.For multiple priorspreferences, this propertytranslatesinto a symmetry property of the set C. For multiplier preferences (and for constraint
preferences), the condition is reflected by the symmetryof the relative entropy.
The propertyof being probabilisticallysophisticated is therefore a property
of the specific ambiguityindex, not of the model. Being a symmetryproperty,
this property is fragile: small perturbationsdestroy it and produce Ellsbergtype behavior. Even if one adopts the view that such behavior is necessaryfor
ambiguityaversion,the preferences in (4) are typicallyambiguityaverse (in the
precise sense of Theorem 14).
Although our original motivation came from multiple priors and multiplier
preferences, the class of preferenceswe axiomatizegoes well beyond these two
classes of preferences. In particular,multiplier preferences are a very special
example of a new class of preferences, called divergencepreferences,that we
introduce and study in this paper. These preferences are able to accommodate
Ellsberg-typebehavior and, unlike multiple priorspreferences,they are in general smooth (Proposition 23), an importantfeature for applications.This new
class of preferences can provide a tractablealternativeto multiple priorspreferences in economic applications that deal with ambiguity,and the works of
Hansen and Sargent can be viewed as an instance of this (see Theorem 16).
This claim is further substantiated by the observation that divergence
preferences include as a special case a third classic class of preferences-the
mean-variancepreferencesof Markowitz(1952) and Tobin (1958). Recall that
mean-variance preferences are represented by the preference functional
(5)
V(f)
=
f dq -
Var(f).
In Theorem 24, we show that, on the domain of monotonicityof V, the equality
f dq-
= min
Var(f)
20
~
~p(-AI
f dp+ OG(pIIq)
holds, where G(- I q): A -- [0, oo] is the relative Gini concentrationindex (see
Section 4.2.2 for the definition). As a result, the mean-variance preference
functional (5) is a special case of our representation(4). Interestingly,the associated index of ambiguityaversion is the relative version of the classic Gini index. After Shannon'sentropy,a second classic concentrationindex thus comes
up in our analysis.
Summingup, in this paper we generalize a popular class of preferences that
deal with ambiguity-the multiple priors preferences-and in this way we are
VARIATIONALREPRESENTATIONOF PREFERENCES
1451
able to introduce divergence preferences, a large new class of preferences under ambiguitythat are in general smooth and that include, as special cases,
two widely used classes of preferences, the multiplier preferences of Hansen
and Sargent and the mean-variance preferences of Markowitzand Tobin.We
are thus able to provide both a rigorous decision-theoretic foundation on two
widely used classes of preferences and a setting in which the two most classic concentration indices, Shannon's entropy and Gini's index, have a natural
decision-theoretic interpretation. We finally characterize the preferences in
this class that are probabilisticallysophisticated and show that this property
coincides with a symmetrypropertyof the cost function. This property is fragile and so the preferences we characterize are typically not probabilistically
sophisticated.
1.1. AmbiguityAversion
In addition to Schmeidler's (1989) original notion based on preference for
randomization (see Axiom A.5 in Section 3), there are two main notions of
ambiguityaversion in the literature:those proposed by Epstein (1999) and by
Ghirardatoand Marinacci (2002). The key difference in the two approaches
lies in the different notion of ambiguityneutralitythey use: while Ghirardato
and Marinacci (2002) identify ambiguity neutrality with subjective expected
utility,Epstein (1999) more generallyidentifies ambiguityneutralitywith probabilistic sophistication. In a nutshell, Ghirardato and Marinacci (2002) claim
that, unless the setting is rich enough, probabilisticallysophisticated preferences may be compatible with behavior that intuitivelycan be viewed as generated by ambiguity.For this reason, they consider only subjective expected
utility preferences as ambiguityneutral preferences.3
For the general class of preferences we axiomatize, the relative merits of
these two notions of ambiguityaversion are the same as for the special case
represented by multiple priors preferences. Although in the paper we adopt
the view and terminology of Ghirardato and Marinacci (2002), the appeal of
our analysis does not depend on this choice; in particular,we expect that the
ambiguityfeatures of our preferences can be studied along the lines of Epstein
(1999), in the same way as it has been done in Epstein (1999) for multiple
priorspreferences.
To clarifythis issue further,in Section 3.5 we study the form that probabilistic sophisticationtakes in our setting. As we alreadymentioned, we show that
our preferences are probabilisticallysophisticatedonce their ambiguityindices
satisfy a symmetryproperty.As a result, probabilisticsophistication is not peculiar to some particularspecificationof our preferences, but, to the contrary,
andMarinacci(2002)for a detailedpresentation
3Wereferto Epstein(1999)andGhirardato
and motivationof their approaches.Notice that the notion of ambiguityaversion in Ghirardato
staticsexercises
andMarinacci
(2002)iswhatprovidesa foundationforthe standardcomparative
in ambiguity
formultiplepriorspreferencesthatarebasedon the sizeof the set of priors.
1452
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
all examples of our preferences will include special cases of probabilistically
sophisticated preferences. For instance, both multiplier and mean-variance
preferences are easily seen to be examples of probabilisticallysophisticateddivergence preferences, although Example 17 shows that this is not the case for
general divergencepreferences, even for those that are very close to multiplier
and mean-variance preferences.
We close by briefly mentioning a possible alternative interpretationof our
preferences. Consider an agent who has to make choices with only limited
information and without a full understandingof what is going on. Some recent psychologicalliterature (e.g., Keren and Gerritsen (1999), Kiihbergerand
Perner (2003)) suggests that in this case the agent may behave as if he were
playingagainst an informed opponent who might take advantageof this uncertainty and turn it against him.
This is a psychological attitude that can be relevant in many choice situations, including Ellsberg-type choice situations, and our preference functional (4) can be viewed as modeling such psychological treat. In fact, agents
who rank payoff profiles accordingto (4) can be viewed as believing they are
playing a zero-sum game against (a malevolent) Nature. This view, however,
has not been firmly established in the psychological and neuroscience literatures, and it is the object of currentresearch (see Hsu, Bhatt, Adolphs, Tranel,
and Camerer (2005) and Rustichini(2005)); for this reason, we do not expatiate further on this interpretation.
The paper is organized as follows. After introducingthe setup in Section 2,
we present the main representationresult in Section 3. In the same section, we
discuss the ambiguityattitudes featured by the preferences we axiomatizeand
we give conditions that make them probabilisticallysophisticated.In Section 4,
we studytwo importantexamplesof our preferences, that is, the multiplepriors
preferences of Gilboa and Schmeidler(1989) and divergencepreference, a new
class of preferences that includes, as special cases, multiplierpreferences and
mean-variance preferences. Proofs and related material are collected in the
Appendixes.
2. SETUP
Consider a set S of states of the world, an algebra X of subsets of S called
events,and a set X of consequences.We denote by .T the set of all the (simple)
acts: finite-valuedfunctions f :S -- X, which are X-measurable.Moreover,we
denote by Bo(2) the set of all real-valued X-measurablesimple functions, so
that u(f) E Bo(V) whenever u: X -- R, and we denote by B(s) the supnorm
closure of BO(X).
Given any x E X, define x E T to be the constant act such that x(s) = x for
all s E S. With the usual slight abuse of notation, we thus identify X with the
subset of the constant acts in F. If f E F, x E X, and A E I, we denote by
xAf E FTthe act that yields x if s E A and f(s) if s A.
VARIATIONALREPRESENTATIONOF PREFERENCES
1453
We assume additionally that X is a convex subset of a vector space. For
instance, this is the case if X is the set of all the lotteries on a set of prizes,
as happens in the classic setting of Anscombe and Aumann (1963). Using the
linear structureof X, we can define as usual for every f, g E T and a E [0, 1]
the act af + (1 - a)g E F, whichyields oaf(s)+ (1 - a)g(s) E X for every s E S.
We model the decision maker'spreferenceson Tf by a binary relation >. As
usual, >-and - denote, respectively,the asymmetricand symmetricparts of >-.
If f E F, an element xf E X is a certaintyequivalentfor f if f - xf.
3. REPRESENTATION
3.1. Axioms
In the sequel we make use of the following properties of >:
AXIOMA.1-Weak Order: If f, g, h E ?, (a) either f > g or g >- f, and
(b) f > g and g> h implyf > h.
A.2-Weak CertaintyIndependence:If f, g E ?, x, y EX, and a E
AXIOM
(0, 1),
af + (1 - a)x > Lag+ (1 - a)x
+ (1 - aa)y ag + (1 - a)y.
c
= of
AXIOMA.3 -Continuity: If f, g, h E Y, the sets {a E [0, 1]: af + (1 a)gg> h} and {a e [0, 1]: h > af + (1 - a)g} are closed.
AXIOMA.4-Monotonicity: If f, g E ? and f(s)
f rg.
-
g(s) for all s E S, then
AXIOMA.5-Uncertainty Aversion: If f, g E ? and a E (0, 1),
f~g
=
af +(1-a)g
rjf.
AXIOMA.6-Nondegeneracy: f >-g for some f, g E F.
Axioms A.1, A.3, A.4, and A.6 are standardassumptions.Axioms A.3 and A.6
are technical assumptions,while Axioms A.1 and A.4 require preferences to
be complete, transitive, and monotone. The latter requirement is basically a
state-independencecondition,which saysthat decision makers always(weakly)
prefer acts that deliver statewise (weakly)better payoffs, regardlessof the state
where the better payoffs occur. If a preference relation >- satisfiesAxioms A.1,
A.3, and A.4, then each act f E F admits a certainty equivalent xf E X. (See
the proof of Lemma 28 in Appendix B.)
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
1454
Axiom A.5, due to Schmeidler (1989), is a smoothing axiom that can be interpreted as an ambiguityaversion axiom, as discussed at length in Gilboa and
Schmeidler (1989), Schmeidler (1989), Epstein (1999), and Ghirardato and
Marinacci(2002).
Axiom A.2 is a weak independence axiom-a variationof an axiomof Gilboa
and Schmeidler (1989). It requires independence only with respect to mixing
with constant acts, providedthe mixingweights are kept constant.Axiom A.2 is
weaker than the original axiom of Gilboa and Schmeidler (1989), and it is this
weakening that makes it possible to go beyond the multiple priors model. Because of its importancefor our derivation,we devote the rest of this subsection
to Axiom A.2.
Consider the following strongerversion of Axiom A.2:
AXIOMA.2'-Certainty Independence: If f, g e T, x e X, and a E (0, 1),
then
f
g
'
af + (1 - a)x
ag + (1 - a)x.
Axiom A.2' is the original axiom of Gilboa and Schmeidler (1989). The next
lemma shows how it strengthensAxiom A.2.
LEMMA1: A binaryrelation - on F satisfiesAxiom A.2' if and only if,for all
, x, y E X, anda, E (0, 1],
f,g
af + (1 - a)x > ag + (1 - a)x
=
pf + (1 - P)y
3pg+ (1 3)y.
Axiom A.2 is therefore the special case of Axiom A.2' in which the mixing
coefficients a and /3 are required to be equal. At a conceptual level, Lemma 1
shows that Gilboa and Schmeidler's(1989) certaintyindependence axiomactually involves two types of independence: independence relative to mixingwith
constants and independence relative to the weights used in such mixing. Our
Axiom A.2 retains the firstform of independence, but not the second. In other
words, we allow for preference reversals in mixing with constants unless the
weights themselves are kept constant.
This is a significant weakening of the certainty independence axiom, and
its motivation is best seen when the weights a and /3 are very different, say
a is close to 1 and p is close to 0. Intuitively, acts af + (1 - a)x and ag +
(1 - a)x can then involve far more uncertaintythan acts Pf + (1 - P)y and
p/g + (1 - P/3)y,which are almost constant acts. As a result, we expect that,
at least in some situations, the rankingbetween the genuinely uncertain acts
af + (1 - a)x and ag + (1 - a)x can well differ from that between the almost
constant acts Pf + (1 - P)y and pg + (1 - P)y.
VARIATIONALREPRESENTATIONOF PREFERENCES
1455
Needless to say, even though we believe that such reversals can well occur
(from both a positive and normativestandpoint), the only way to test them, and
so test the plausibilityof Axioms A.2 and A.2', is by runningexperiments.This
is possible because both Axioms A.2 and A.2' have clear behavioral implications. For instance, the following (thought) experiment gives a simple testable
way to compare Axioms A.2 and A.2', and runningthis type of experimentswill
be the subject of future research.
2: Consider an urn that contains 90 black and white balls in unEXAMPLE
known proportion, and the bets (payoffs are in dollars)
t> 0
Black
White
ft
t
t
gt
3t
0.01t
that is, f, pays t dollars whatever happens, while g, pays 3t dollars if a black
ball is drawn and t cents otherwise. For example,
10
fio
g10
Black
10
30
104
White
10
and f10o4
0.1
g104
Black
White
10,000
30,000
10,000
100
Assume the decision maker'spreferences satisfyAxioms A.1 and A.3-A.6, and
she displaysconstant relative risk aversion y E (0, 1).4 If her preferences satisfy
Axiom A.2', then
either ft~, g, for all t or gt - f, for all t.
If, in contrast, her preferences only satisfy Axiom A.2 and not Axiom A.2',
there might exist a thresholdt such that
ft g, forallt>t
and gt f,
for all t < t.
This reversal is compatible with Axiom A.2, but it would reveal a violation of
Axiom A.2'.
We close by observingthat, in terms of preference functionals,by Theorem 3
and Proposition 19 all preference functionals (4) satisfyAxiom A.2 and violate
Axiom A.2' unless they reduce to the multiple priors form (1).
4Noticethat,forall t > - > 0 andy, thereexistf, g, x, y, a, p as discussedin the textsuchthat
- g,.
af + (1 - a)x - ft, ag + (1 - a)x - gt, Pf + (1 - P)y - f,, and pg + (1 - P)y
1456
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
3.2. Main Result
We can now state our main result, which characterizespreferences that satisfy Axioms A.1-A.6. Here A = A(2) denotes the set of all finitely additive
probabilities on I endowed with the weak* topology5 and c: A -- [0, 00] is
said to be groundedif its infimumvalue is zero.
3: Let >. be a binaryrelationon F. Thefollowing conditionsare
THEOREM
equivalent:
(i) Therelation>- satisfiesAxioms A.1-A.6.
(ii) Thereexistsa nonconstantaffinefunction u :X - IRand a grounded,convex, and lower semicontinuousfunction c: A -+ [0, oo] such that,for all
f, g e F,
(6)
min(f u(f)dp + c(p) > min(f u(g)dp+ c(p)).
f jg
g
For each u thereis a (unique) minimal c*:A - [0, oo] that satisfies (6) and is
given by
(7)
c*(p)= sup(u(xf)
fGT(
u(f)dp).
The representation(6) involves the minimizationof a convex lower semicontinuous function, which is the most classic variationalproblem. This motivates
the following definition:
4: A preference - on F is called variationalif it satisfies AxDEFINITION
ioms A.1-A.6.
By Theorem 3, variationalpreferences can be represented by a pair (u, c*).
From now on, when we consider a variationalpreference, we will write u and c*
to denote the elements of such a pair. Next we give the uniqueness properties
of this representation.
5: Twopairs (uo, c*) and (u, c*) representthe same variational
COROLLARY
- as in Theorem
3 if and only if thereexist a > 0 and P/ IRsuch that
preference
u = auo +
p and c* =
aco.
In Theorem 3 we saw that c* is the minimal nonnegative function on A for
which the representation (6) holds. More is true when u(X) = {u(x): x e X}
is unbounded (either below or above):
5That is, the
pd(A)
- p(A) for
o(A(X,),
all A
Bo(2)) topology where a net
•
.
{PdideD
converges to p if and only if
VARIATIONALREPRESENTATIONOF PREFERENCES
1457
6: Let > be a variationalpreferencewith u(X) unbounded.
PROPOSITION
Then thefunction c* definedin (7) is the unique nonnegative,grounded,convex,
and lowersemicontinuousfunction on Afor which (6) holds.
As shown in Lemma 29 in Appendix B, the assumption that u(X) is unbounded (above or below) is equivalent to the following axiom (see Kopylov
(2001)).
AXIOMA.7 -Unboundedness: Thereexist x >-y in X such that,for all a E
(0, 1), there exists z E X that satisfies either y >- az + (1 - a)x or az + (1 a)y >-x.
We call the variationalpreferences that satisfyAxiom A.7 unbounded.
3.3. AmbiguityAttitudes
We now study the ambiguityattitudes featured by variational preferences.
We follow the approach proposed in Ghirardato and Marinacci (2002), to
which we refer for a detailed discussion of the notions we use.
Begin with a comparativenotion: given two preferences >1 and >2, say that
(1 is more ambiguityaversethan (2 if, for all f E F and x E X,
(8)
f >- x
=
f2
f x.
To introduce an absolute notion of ambiguity aversion, as in Ghirardato and
Marinacci(2002) we consider subjectiveexpected utility (SEU) preferences as
benchmarks for ambiguityneutrality.We then say that a preference relation
b is ambiguityaverseif it is more ambiguityaverse than some SEU preference.
We now apply these notions to our setting. The first thing to observe is that
variationalpreferences are alwaysambiguityaverse.
7: Each variationalpreferenceis ambiguityaverse.
PROPOSITION
Since variationalpreferences satisfy Axiom A.5 and the choice rule that results from (6) is a maxmin rule, intuitively it is not surprisingthat variational
preferences alwaysdisplaya negative attitude toward ambiguity.Proposition 7
makes this intuition precise.
Next we show that comparative ambiguity attitudes for variational preferences are determined by the function c*. Here u1 m u2 means that there exist
a > 0 and /3 E R such that ul = au2 + / .
8: Given two variationalpreferences>- and -2, the following
PROPOSITION
conditionsare equivalent:
(i) Therelation -1 is moreambiguityaversethan j2.
1458
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
u2and c
(ii) ul
c* (providedul
<_
= u2).
Given that u1a U2, the assumption ul = u2 is just a common normalization
of the two utility indices. Therefore, Proposition 8 says that more ambiguity
aversepreference relations are characterized,up to a normalization,by smaller
functions c*. Therefore, the function c* can be interpretedas an indexof ambiguityaversion.
We now give few simple examples that illustrate this interpretation of the
function c*.
EXAMPLE
9: By Proposition 8, maximalambiguityaversion is characterized
by c*(p) = 0 for each p e A. In this case, (6) becomes
f
min u(f) dp > minf u(g)dp,
g
that is,
f -g
min
u(fS(s))
sES
> min
u(g(s)),
seS
a form that clearly reflects extreme ambiguityaversion.
EXAMPLE
10: Minimal ambiguity aversion corresponds here to ambiguity
neutralitybecause, by Proposition 7, all variationalpreferences are ambiguity
averse. Therefore, the least ambiguityaverse functions c* are those associated
with SEU preferences. As will be shown in Corollary20, when a SEU preference is unbounded, c* takes the stark form
c(p =,
=c0,
if p = q,
otherwise,
where q is the subjectiveprobabilityassociated with the preference.
11: Denote by c* the ambiguityneutral index of Example 10 and
EXAMPLE
denote by
the maximal ambiguity index of Example 9. Given a e (0, 1),
suppose thecm
right-handside of (6) is
(1- a)u(f) dq+amins
u(f(s))
> (1 -a)
sSE
u(g)dq + aminu(g(s)),
f
which is the well known e-contaminated model. In this case,
c*(p) = min (1 P1,P2P2EA
a)cq(P2)+
= 5(1-a)q+aA(P),
Cm(P):(1
- a)p2 + ap = p}
VARIATIONALREPRESENTATIONOF PREFERENCES
1459
and this is a simple example of an index c* that does not display extreme ambiguity attitudes.
According to Proposition 8, variationalpreferences become more and more
(less and less, resp.) ambiguity averse as their ambiguity indices become
smaller and smaller (larger and larger, resp.). It is then natural to wonder
what happens at the limit, when they go to 0 or to oc. The next result answers
this question. In reading it, recall that
< c, implies dom c, dom c,+1 and
c1+•
argmin c, C argmin cn+1.6
12: (i) If c, 4 and limncn~(p)= 0 wheneverthis limit is finite,
PROPOSITION
then
(9)
limminf u(f)dp + cn(p)
=
n-+0o pEAj
min
JpEUn dom-cCJ
u(f) dp
for all f E 'F.
(ii) If c, and limn cn (p) = 0c wheneverthis limit is not 0, then
(10)
lim
n-+ c minf
pEAI
u(f)dp+cn(p)
=
min
PEn argmincnJ
u(f ) dp
for all f E F.
Proposition 12 shows that the limit behavior of variational preferences is
describedby multiple priors preferences, but the size of the sets of priors they
feature is very different. In fact, in (9) the relevant set of priors is given by
U, dom c,, whereas in (10) it is given by the much smaller set n,, argminc,.
For example, Proposition22 will show that for an importantclass of variational
preferences, the set ", argmin c, is just a singleton, so that the limit preference
in (10) is actuallya SEU preference, while the set U, dom c, is very large.
We close with few remarks. First, observe that Lemma 32 in Appendix B
shows that the set that Ghirardato and Marinacci (2002) call benchmark
measures-those probabilities that correspond to SEU preferences less ambiguity averse than --is given here by argmin c* = {p E A: c*(p) = 0}.
Second, notice that by standard convex analysis results (see Rockafellar
(1970)), Example 11 can be immediately generalized as follows: the ambiguity index of a convex combinationof preference functionals that represent unbounded variational preferences is given by the infimal convolution of their
ambiguityindices.
6Thetermdomc denotesthe effectivedomain{c < oc} of c, whereasargminc,= {p E A:
c,(p) = 0} becausec, is grounded.Observethat domcnrepresentsthe set of all probabilities
that the decision maker considersto be relevantwhen rankingacts using the ambiguityindex cn,
whereas the smaller set argmincn contains only the probabilities that are getting the highest
weight by this decision maker.
1460
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
Finally, Proposition 12 is an example of a result about the limit behaviorof
sequences of variationalpreferences
V.(f)
min
u,(f)
dp + c,(p)),
a type of result that can be importantin some applications to enable dealing
with stability issues. These limit results involve the convergence of minima,
a classic problem in variational analysis. A noteworthy feature of variational
preferences is that the lower semicontinuityof the functions f u,(f) dp+c,(p)
on A makes it possible to use the powerfulde Giorgi-Wijsmantheory of F convergence (often called epiconvergence; see, e.g., Dal Maso (1993)) to study
the behavior of the sequence {V,(f)}, and, for example, to determine the
conditions under which it converges to some preference functional V(f) =
minpEa(f u(f) dp + c(p)) for suitable limit functions u and c.
3.4. An Extension:CountableAdditivity
In Theorem 3 we considered the set A of all finitely additive probabilities.
In applications, however, it is often important to consider countably additive
probabilities,which have very convenient analytical properties. For example,
we will see momentarilythat this is the case for divergence preferences, and
therefore for the multiplierpreferences of Hansen and Sargent (2001) and for
the mean-variancepreferencesof Markowitz(1952) and Tobin(1958). For this
reason, here we consider this technical extension.
Fortunately,in our setting we can still use the monotone continuity axiom
introduced by Arrow (1970) to derive a SEU representationwith a countably
additive subjectiveprobability(see Chateauneuf, Maccheroni,Marinacci,and
Tallon (2005)).
AXIOMA.8-Monotone Continuity: If f, g E FT,x E X, {En,1> E with
E D2E2 D ... and n,,>E, = 0, then f >- g implies that thereexists no > 1 such
that xE,,f >- g.
Next we state the countablyadditiveversion of Theorem 3. Here Ad = AU (X)
denotes the set of all countablyadditiveprobabilitiesdefined on a ur-algebraZ,
while A((q) denotes the subset of Ad that consists of all probabilities that are
absolutelycontinuous with respect to q; i.e., A'((q) = {p E d: p <<q)}.
THEOREM
13: Let > be an unboundedvariationalpreference.Thefollowing
conditionsare equivalent:
(i) Therelation satisfiesAxiom A.8.
(ii) For each t > 0, {p E A : c*(p) < t} is a (weakly)compactsubsetof AU.
VARIATIONALREPRESENTATIONOF PREFERENCES
1461
In this case, thereexistsq E Az such that,for all f, g E F,
(11)
f g
u(f
(f~~~(S(R
Amin
u(f)dp+c*(p)>
u(g)dp+c*(p).
qmin
Lemma 30 in Appendix B shows that even when the preference is not unbounded, Axiom A.8 still implies the countable additivityof the probabilities
involved in the representation.
In view of these results, we call the variational preferences that satisfy Axiom A.8 continuous.
pEA'Or'(q)
PEAO"(q)
3.5. ProbabilisticSophistication
In this section we characterizevariational preferences that are probabilistically sophisticated, an important property of preferences introduced by
Machina and Schmeidler (1992) that some authors (notably Epstein (1999))
identify with ambiguityneutrality(or absence of ambiguityaltogether).
Our main finding is that what makes a variational preference probabilistically sophisticated is a symmetry property of the ambiguity index. As a result, all classes of variational preferences (for example, multiple priors and
divergence preferences) contain a subclass of probabilisticallysophisticated
preferences characterizedby a suitable symmetryproperty of the associated
ambiguityindices.
Specifically, given a countably additive probability q on the o--algebraZ,
a preference relation > is probabilisticallysophisticated(with respect to q) if,
given f and g in F,
q(s E S: f(s) = x) = q(s E S: g(s) = x) for all x E X = f ~ g.
For example, if S is finite and q is uniform, then probabilistic sophistication
amounts to permutation invariance (i.e., each act is indifferent to all its permutations).
A preference satisfies (firstorder)stochasticdominance (with respect to q)
if, given f and g in F,
x) for all x E X =
x) < q(s c S:g(s)
f g.
q(s E S: f(s)
Notice that a preference that satisfies stochastic dominance is probabilistically
sophisticated.
To characterizeprobabilisticsophistication,we need to introduce a few well
known notions from the theory of stochastic orders (see, e.g., Chong and Rice
(1971) and Schmeidler (1979)). Define a partial order >-,, on AZ(q) by
p
p' iff fJf(
'dq
>cx
dq>
f'
J'
(dpf)dq
dq j~
1462
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
for every convex function 0 on R. This is the so-called convexorderon probability distributions,and by classic results of Rothschild and Stiglitz (1970) and
Marshalland Olkin (1979) we have p >-~x p' when the "masses"dp(s) are more
scatteredwith respect to dq(s) than the masses dp'(s).
An important property of the convex order >-c is that its symmetric part ~cx
coincides with the identical distributionof the densities with respect to q. That
is, given p and p' in A "(q),
(12)
p ex, '
iff
dp
q seS:
(s)<t
dq
=q( sS:
dp' (s)<t
VteR.
dq
For p in AU(q),the set O(p) =- {p' e A (q): p' -,, p} is called the orbitof p.
A function c: A -+ [0, oo] is rearrangementinvariant(with respect to q) if
dom cC AO(q)and, given p and p' in Ad(q),
P
c(p) = c(p'),
cx pi
whereas it is Shurconvex(with respect to q) if dom c C A'(q) and, given p and
p' in A (q),
(13)
p
pc'p'
=
c(p) > c(p').
A Shur convex function is clearly rearrangementinvariant,whereas the converse is, in general, false. A subset C of Ao(q) is Shur convex (with respect
to q) if its indicatorfunction 8c is Shur convex.7Finally,we say that q in Ad is
adequateif either q is nonatomic or S is finite and q is uniform.
We can now state our characterizationresult, which shows that rearrangement invarianceis the symmetrypropertyof ambiguityindices that characterizes probabilisticallysophisticatedpreferences.
THEOREM14: Let > be a continuous unbounded variationalpreference.If
q E Ao is adequate, then the following conditions are equivalent (with respect
to q):
(i) Therelation satisfiesstochasticdominance.
(ii) Therelation> is probabilisticallysophisticated.
invariant.
(iii) Thefunction c* is rearrangement
c*
Shur
The
is
convex.
function
(iv)
Moreover,for any variationalpreference>, (iv) implies (i) even if q is not adequate.
7Thatis, {p' e A•(q): p' -<cxp}
C for every p E C.
VARIATIONALREPRESENTATIONOF PREFERENCES
1463
The proof of Theorem 14 builds on the results of Luxemburg (1967) and
Chong and Rice (1971), as well as on some recent elaborations of these results
provided by Dana (2005).
As a first straightforwardapplication of Theorem 14, observe that if a set
of priors is Shur convex, then the corresponding (continuous) multiple priors
preferences are probabilisticallysophisticated and the converse is true in the
adequate case. More is true in this case: multiple priors preferences are probabilisticallysophisticated if and only if the indicator functions Sc of their sets
of priors C are rearrangementinvariant,that is, if and only if the sets C are
orbit-closed (O(p) c C for every p e C).8
To furtherillustrateTheorem 14, we now introduce a new class of variational
preferences that plays an importantrole in the rest of the paper. As before, assume there is an underlyingprobabilitymeasure q E Ad. Given a X-measurable
function w:S --+ R with infs,s w(s) > 0 and f w dq = 1, and a convex contin-
uous function &P
:R+ *R+
such that 0 (1) = 0 and limt,,,
(t)/t = 00, the
w-weighted 0 divergence of p E A with respect to q is given by
(14)
w(s)
Dw(pIIq)=
00,
A(q),
(s))
dqr(s), otherwise.
ifspe
Here w is the (normalized) weighting function, and in the special case of uniform weighting w(s) = 1 for all s E S, we just write DO(p IIq) and we get
back to standard divergences, which are a widely used concept of "distance
between distributions"in statistics and informationtheory (see, e.g., Liese and
Vajda(1987)). The two most important divergences are the relativeentropy(or
Kullback-Leibler divergence) given by 4 (t) = t In t - t + 1 and the relative Gini
concentration index (or X2 divergence) given by 0 (t) = 2-' (t - 1)2
The next lemma collects the most important properties of weighted divergences.
LEMMA15: A weighted divergence Dw(. 1q) : A -- [0, oo] is a grounded, con-
vex,and lowersemicontinuousfunction, and the sets
(15)
{p EA:D (p IIq) < t}
are (weakly)compactsubsetsof Ad(q) for all t E R. Moreover,Do(. IIq) :A --+ R
is Shurconvexwheneverw is uniform.
8Thisfact can be used to provide an alternativederivationof some of the results of Marinacci
(2002), which showed that multiple priorspreferences that are probabilisticallysophisticatedreduce to subjectiveexpected utilitywhen there exists even a single nontrivialunambiguousevent.
The direct proofs in Marinacci(2002) are, however, shorter and more insightfulfor the problem
with which that articlewas dealing.
1464
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
Thanks to the foregoing properties, preferences represented by the functional
(16)
V(f) =
u(f)dp +
minf
OD(p
I
q)}
where 0 > 0 and u: X -+ IRis an affine function, belong to the class of variational preferences. In view of their importance,we call them divergencepreferences; that is, > on Y is a divergence preference if
ft>g
min
>
u(f)dp+ OD(p IIq)
minEm
u(g)dp + OD(p
| q) }
THEOREM
16: Suppose u(X) is unbounded.Then divergencepreferencesare
continuousvariationalpreferenceswithindexof ambiguityaversiongivenby
=
c*(p) OD=(p
|| q) Vp EA.
In particular,thesepreferencesare probabilisticallysophisticatedwheneverw is
uniform.
Divergence preferences are a very importantclass of variationalpreferences
and we further study them in the next section. Unlike multiple priors preferences, they are in general smooth (see Proposition 23), a noteworthyfeature
for applications.
For a finite state space S, X = R, u(t) = t, and w uniform, some classes of
divergence preferences have been considered by Ben-Tal(1985) and Ben-Tal,
Ben-Israel, and Teboulle (1991). In the next section, we will study two significant examplesof divergencepreferences,correspondingto the relativeentropy
and to the relative Gini concentration index. Here it is important to observe
that, by Theorem 16, all divergence preferences represented by
V(f)
in
u(f)dp+OD(p(lq)p
,
are examples=Iqmm
of probabilisticallysophisticatedvariationalpreferences.'
the
next example shows that even under minimal nonuniformiHowever,
ties of the weighting function, divergencepreferences are in general not probabilisticallysophisticated and they exhibit Ellsberg-typebehavior. Therefore,
'Analogously, Theorem 14 and Lemma 15 show that all multiplepriorspreferenceswith sets of
priors {p e A'I(q):D4(p IIq) < r}j are probabilisticallysophisticated.These preferencesinclude
the constraintpreferences of Hansen and Sargent(2001), mentioned in footnote 2.
VARIATIONALREPRESENTATIONOF PREFERENCES
1465
the probabilisticsophisticationof divergence preferences cruciallydepends on
the uniformityof the weight w.
17: Considera standardEllsberg three color urn,with 30 red balls
EXAMPLE
and 60 balls either green or blue. As usual, consider the bets:
Red
fR
1
Green
0
fG
0
1
0
1
0
1
fRuB
fGUB
Blue
0
0
1
1
where fR pays one dollar if a red ball is drawn and nothing otherwise, fG pays
one dollar if a green ball is drawnand nothing otherwise, and so on. As is well
known, Ellsberg (1961) arguedthat most subjects rankthese acts as
(17)
fR >f
and
fRUB-<fGUB.
Consider a decision maker who has divergence preferences represented by
the preference functional V given by (16). Here it is natural to consider a
uniform q on the three states. Without loss of generality, set u(0) = 0 and
u(1) = 1. By Theorem 16, when w is uniform, it cannot be the case that
V(fR) > V(fG) and V(fRUB) < V(fGUB). However, consider the weighting function w:{R, G, B} -+ IRgiven by
w(R) = 1.01,
w(G) = 0.99,
and
w(B) = 1,
which is only slightlynonuniform.If we set 0 = 1 and take either the weighted
relative entropy 0 (t) = t ln t - t + 1 or the weighted Gini relative index P(t) =
2-' (t - 1)2, then some simple computations (availableupon request) show that
V(fR) > V(fG)
and V(fRUB) < V(fGUB),
thus deliveringthe Ellsberg pattern (17).
3.6. Smoothness
Most economic models are based on the optimization of an objective function. When this function is differentiable, solving the optimization problem
is easier and the solution has appealing properties. There is a well established
set of techniques (first-ordernecessaryconditions, envelope theorems, implicit
function theorems, and so on) that are extremely useful in both finding a solution of an optimizationproblem and characterizingits properties.For example,
they make it possible to carryout comparative statics exercises, a key feature
in most economic models.
1466
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
In view of all this, it is importantto studywhether our variationalpreference
functionals are differentiable. In this section (see Theorem 18), we fully characterize the differentiabilityproperties of variational preference functionals
and show that they are adequate for economic applications.
Throughout this section, we assume that X is the set of all monetary lotteries, that is, the set of all finitely supported probabilitymeasures on R. An
act f of F is monetaryif f(s) is a degenerate lottery for every s e S, that is,
if f(s) e IR(with the usual identificationof z IRwith the degenerate lottery
dz E X). The set of all monetary acts can thus be identified with Bo(2).
We consider a variationalpreference functional
V(f) = min u(f) dp+ c*(p)}
restrictedto Bo(I), that is, restrictedto monetaryacts. We also make the standard assumptionthat the associated utility function u is concave (thus reflecting risk aversion), strictlyincreasing,and differentiableon R.
To state our results, we need some standard notions of calculus in vector
spaces (see Rockafellar (1970) and Phelps (1992)). Given f E Bo(I), the diR
rectionalderivativeof V : Bo(E) -+ R at f is the functional V'(f; .): Bo()
-defined by
+ th) - V(f)
V(f
V'(f; h) = lim V(f
hV(f th)t
t4o
Bo().
The functional V is (Gateaux) differentiableat f if V'(f; -) is linear and supnorm continuous on Bo(2). In this case, V'(f; .) is the (Gateaux) differential
of V at f.
The superdifferentialof V at f is the set 8V(f) of all linear and supnorm
continuous functionals L: Bo(1) -*+ R such that
V'(f; h) < L(h)
VheBo(X).
In particular,dV(f) is a singleton if and only if V is differentiableat f. In this
case, dV(f) consists only of the differential V'(f; -) ; i.e., dV(f) = {V'(f; -)}.
We are now ready to state our result. It provides an explicit formula for
the superdifferentialdV(f) at every f E Bo(E) and a full characterizationof
differentiability,along with an explicit formula for the differential.
18: For all f e Bo(e),
THEOREM
(18)
dV(f)-=
u'(f)dr:reargmin
u(f) dp+c*(p)).
VARIATIONALREPRESENTATIONOF PREFERENCES
1467
In particular,V is everywheredifferentiableon Bo(X) if and only if c*is essentially
strictlyconvex.'0In this case,
(19)
V'(f; h) = f hu'(f) dr,
where{r} = argminpEa(fu(f) dp + c*(p)).
The strict convexity of the ambiguity index thus characterizes everywhere
differentiablevariationalpreference functionals. For example, Proposition 23
will show that divergence preferences have a strictly convex ambiguityindex,
provided 0 is strictly convex. By Theorem 18, they are everywhere differentiable.
Variationalpreferences that feature an index c that is not strictlyconvex are,
by Theorem 18, not everywhere differentiable in general. This is a large class
of preferences, which includes, but is much larger than, that of multiple priors
preferences.
However, although there are plenty of examples of variational preferences
that are not everywheredifferentiable,since the convex combination of a convex cost function and a strictlyconvex function is strictly convex, one can approximate arbitrarilywell any variational preference with another one that is
everywheredifferentiable.
In view of all this, the interest of Theorem 18 is both theoretical and practical. In applications, the explicit formulas (18) and (19) are very important because they make possible the explicit resolution of optimal problems
based on the variationalpreference functional V. It is worth observing that in
Maccheroni, Marinacci,and Rustichini (2006) we show that a version of these
formulasholds also for dynamicvariationalpreferences, the intertemporalversion of the variationalpreferences we are introducingin this paper.
On the theoretical side, Theorem 18 is interestingbecause in some economic
applications of ambiguity aversion, the lack of smoothness (and in particular the existence of "kinks"in the indifference curves) has played a key role.
For example, it has been used to justify nonparticipationin asset markets (see
Epstein and Wang (1994)). By fully characterizingthe differentiabilityof variational preferences, Theorem 18 clarifiesthe scope of these results. In particular, it shows that although kinks are featured by some important classes of
ambiguityaverse preferences, they are far from being a property of ambiguity
aversionper se. Indeed, we just observed in the preceding text that it is always
possible to approximateany variationalpreference arbitrarilyclosely with another one that is everywheredifferentiable.
10Here u'(f) dr denotes the functional on Bo(E) that associates f hu'(f) dr to every
h E Bo(2), and c is essentially strictly convex if it is strictly convex on line segments in
argminpeA(fu(f) dp + c*(p)). Clearly,if c is strictlyconvex, then a fortiori it is essenUfeB0(V)
tially strictlyconvex.
1468
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
4. SPECIAL CASES
In this section we study in some more detail two important classes of variational preferences: the multiple priors preferences of Gilboa and Schmeidler
(1989) and the divergence preferences we just introduced. In particular,we
show that two importantclasses of preferences-the multiplierpreferences of
Hansen and Sargent (2001) and the mean-variance preferences of Markowitz
(1952) and Tobin (1958), are special cases of divergence, and therefore variational, preferences.
4.1. MultiplePriorsPreferences
Begin with the multiple priors choice model axiomatized by Gilboa and
Schmeidler (1989). As we mentioned in Section 3.1, the multiple priorsmodel
is characterizedby Axiom A.2', a stronger version of our independence Axiom A.2. Next we show in greater detail the relationshipbetween Axiom A.2'
and the variationalformula (6).
In particular,when Axiom A.2' replaces Axiom A.2, the only probabilities
in A that "matter"in the representation (6) are those to which the decision
maker attributes "maximumweight," that is, those in argminc*. The set of
priors C used in the multiple priors model is then given by {p e A: c*(p) = 0}.
19: Let > be a variationalpreference.Thefollowing conditions
PROPOSITION
are equivalent:
(i) Therelation satisfiesAxiom A.2'.
(ii) For all f E T,
(20)
min
u(f)dp+c*(p)p)
=
mm
u(f)dp.
is also equivalentto the following
If, in addition, is unbounded,then (ii)inf=)
condition:
(iii) Thefunction c* takeson only values 0 and oc.
>
The characterizationof the multiple priors model via Axioms A.1, A.2', and
A.3-A.6 is due to Gilboa and Schmeidler(1989). Proposition 19 shows how the
multiple priors model fits in the representationwe established in Theorem 3.
As is well known, the standardSEU model is the special case of the multiple
priors model characterizedby the following strongerversion of Axiom A.5.
AXIOMA.5'-Uncertainty Neutrality: If f, g e F and a e (0, 1),
af + (1 - a)g f.
f~g g
In terms of our representation, by Theorem 3 and Proposition 19 we can
make the following statement:
VARIATIONALREPRESENTATIONOF PREFERENCES
1469
20: Let > be a variationalpreference.Thefollowing conditions
COROLLARY
are equivalent:
(i) The relation> satisfiesAxiom A.5'.
(ii) The relation - is SEU.
satisfiesAxiom A.2' and {p e A: c*(p) = 0} is a singleton.
(iii) The relation
If, in addition, is unbounded,then (iii) is also equivalentto the statement:
(iv) Thereexistsq e A such that c*(q) = 0 and c*(p) = oc for everyp 0 q.
4.2. DivergencePreferences
In the previous section we introduced divergence preferences so as to illustrate our results on probabilisticsophistication.Here we discuss their ambiguity attitudes.
Recall that a preference > on T is a divergence preference if
ftg
min
d
u(f)dp+
O6D'(p 1|q)}
> minf u(g)dp+ OD(p q) ,
where 0 > 0, q is a countably additive probability on the o--algebra Z,
u:X -* R is an affine function, and D q(.IIq): A -> [0, oo] is the w-weighted
4 divergence given by (14).11
By Theorem 16, all divergence preferences are examples of continuous variational preferences. As a result, to determine their ambiguityattitudes,we can
invoke Propositions 7 and 8. By the former result, divergence preferences are
ambiguity averse. As to comparative attitudes, the next simple consequence
of Proposition 8 shows that they depend only on the parameter 0, which can
therefore be interpreted as a coefficient of ambiguityaversion.
21: Given two (w, 4) divergencepreferences>- and >2, thefolCOROLLARY
are equivalent:
conditions
lowing
relation
The
(i)
>1 is moreambiguityaversethan
2*.
0 02 (provided = u2).
and
u,
u2
(ii)
ul
<_
According to Corollary21, divergence preferences become more and more
(less and less, resp.) ambiguityaverse as the parameter 0 becomes closer and
closer to 0 (closer and closer to oc, resp.).
The limit cases where 0 goes either to 0 or to oo are described by Proposition 12, which takes an especially stark form for divergence preferences under
"1Whenwe want to be specific about w and p, we speak of (w, 4) divergence preferences.
1470
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
some very mild assumptions.To see this, we need a piece of notation: given a
simple measurablefunction ?p:S -- R, set
essmin
seS
= max{t e R:q({s e S: (s) > t}) = 1}.
•p(s)
For example, when q has a finite support supp(q), we have
(21)
essmin~p(s)=
min ?p(s).
seS
sEsupp(q)
PROPOSITION
22: (i) If w e Lo(q), then for all f
lim min fu(f) dp + ODW(pq)
OtO pe"W(q)Ij
=
F,
essminu(f(s)).
seS
(ii) If 0 is strictlyconvex,thenfor all f e F,
lim min{
u(f)
dp+ODp(p
Otoo pEMO(q)
fU0J
jq)
=u(f)dq.
When w E L"(q), divergence preferences therefore tend more and more,
as 0 - 0, to rank acts according to the very cautious criterion given by
essmin,,s u(f(s)). In contrast, when ( is strictly convex, divergence preferences tend more and more, as 0 -+ o, to rank acts accordingto the SEU criterion given by f u(f) dq.
The next result, a simple consequence of Theorem 18, shows that divergence
preferences are smooth under the assumption that 0 is strictlyconvex (most
examples of divergence Dw satisfy this condition; see, e.g., Liese and Vajda
(1987)). This is an important feature of divergence preferences that makes
them especially suited for optimization problems and that also differentiates
them from multiple priors preferences, which are not everywhere differentiable (see, e.g., Epstein and Wang(1994, p. 295)).
As we did for Theorem 18, we assume that X is the set of all monetary
lotteries and we regard Bo(.) as the collection of all monetary acts. We also
assume that the utility function u is concave, strictlyincreasing, and differentiable on R.
23: If 4 is strictlyconvex, then Dw(.1I q):A(X) - [0, oc] is
PROPOSITION
its effectivedomain and the variationalpreferencefunctional
on
convex
strictly
V : Bo(E) --+R, given by
V(f=min
OD(p
0 II q)f
PEA(X)
f u(f)dp+
1~piJYfBZ)
Vfe
B0(B),
VARIATIONALREPRESENTATIONOF PREFERENCES
1471
is everywheredifferentiablefor all 0 > 0. In this case,
(22) V'(f;h)= f hu'(f)dr,
where{r} = argminpEA(fu(f) dp + ODw(pIIq)).
Consider, for example, multiplier preferences, a special class of divergence
preferences in which c*(p) = OR(p IIq). By some well known properties of the
relative entropy (see Dupuis and Ellis (1997, p. 34)), formula (22) takes, in this
case, the neat form
(f h)
exp(- (f) dq
dq
dq
h)exp(-)
( f;hu'(f)
f exp(-
0)
for all f, h E Bo(V).
Unlike the multiple priors case, for divergence preferences, we do not have
an additional axiom that, on top of Axioms A.1-A.6, would deliver them (for
multiplepriorspreferences, the needed extra axiomwas Axiom A.2'). We hope
that this will be achieved in later work and in this regard it is worth observing
that the ambiguityindex D'(p IIq) is additively separable, a strong structural
property.
After having established the main properties of divergence preferences, we
now move on to discuss two fundamental examples of this class of variational
preferences.
4.2.1. Entropicand multiplierpreferences
We say that a preference >- on F. is an entropicpreferenceif
f
g
(min
min
u(f) dp +ORW(p| q))
(f
u(g)dp+ORW(p| q)),
where 0 > 0, q e A , u: X -- R is an affine function, and RW"(I q) : A -+ [0, oo]
is the weighted relative entropy given by
RW(p 11q) =
s
w(s)
00,
5
dq
-q(s)log
if pE
dAc (q),
otherwise.
dq
q(s)
- -(s)
dq
+ 1 dq(s),
1472
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
Since the entropy RW(p IIq) is a special case of divergence Dv(p IIq) defined
in (14), where 4)(t) = t log t - t + 1, entropic preferences are an example of divergence preferences. Hence, by Theorem 16, they are continuous variational
preferences,with index of ambiguityaversion given by
c*(p) = ORW(pIIq) Vp E A.
When w is uniform, is probabilisticallysophisticatedand it features (a positive multiple of) the standardrelative entropy R(. 1Iq) as the index of ambiguity aversion. This is the case considered by Hansen and Sargent (see, e.g.,
Hansen and Sargent (2000, 2001)), who called this class of entropic preferences in which acts are ranked accordingto
V(f)
=min
(f u(f)dp+ OR(p| q))
multiplierpreferences.Although multiplierpreferences are probabilisticallysophisticated, Example 17 shows that this not the case for general entropic
preferences that have nonuniformweightingfunctions w. These "nonuniform"
entropic preferences thus provide a specification of preferences that can, in
general, produce Ellsberg-typebehavior-and so are ambiguityaverse according to all notions of ambiguityavailable in literature-but that also retain the
good analyticaltractabilityof multiplierpreferences.
As to the ambiguity attitudes featured by entropic preferences, by Corollary 21 they are characterizedby the parameter 0 as follows: the lower 0 is,
the more ambiguity averse is the entropic preference. The parameter 0 can
therefore be interpreted as a coefficient of ambiguityaversion.
We have shown how entropic,and hence multiplier,preferences are a special
case of divergence preferences. As we already observed, for divergencepreferences we do not have an additional axiom that, on top of Axioms A.1-A.6,
would deliver them (even though we have been able to point out some strong
structuralproperties that their ambiguityindices satisfy). On the other hand,
we view entropic preferences as essentially an analyticallyconvenient specification of variationalpreferences, much in the same way as, for example,CobbDouglas preferences are an analyticallyconvenient specificationof homothetic
preferences. As a result, in our setting there might not exist behaviorallysignificant axioms that would characterize entropic preferences (as we are not
aware of any behaviorallysignificantaxiom that characterizesCobb-Douglas
preferences). Similarconsiderationsapply to the Gini preferences that we will
introduce momentarily.
For their macroeconomic applications, Hansen and Sargent are mostly interested in dynamic choice problems. Although our model is static, in the
follow-up paper (Maccheroni, Marinacci,and Rustichini (2006)), we provide
a dynamicversion of it and, inter alia, we are able to provide some dynamic
VARIATIONALREPRESENTATIONOF PREFERENCES
1473
specificationsof multiplierpreferences that are time consistent. As is often the
case in choice theory, also here the analysisof the static model is key to paving
the way for its dynamicextension.
We close by discussing the related work of Wang (2003), who (in a quite
different setting) recentlyproposed an axiomatizationof a class of preferences
that include multiplierpreferences as special cases. He considered preferences
over triplets (f, C, q), where f is a payoff profile, q is a reference probability,
and C C A is a confidence region. For such preferences, he axiomatized the
representation
(23)
V(f, C, q)=min{u(f)
peCIf
dp +OR(p li q)J.
His modeling is very differentfrom ours: in our setup, preferences are defined
only on acts, and we derive simultaneouslyboth the utility index u and the ambiguityindex c; that is, uncertaintyis subjective.In Wang(2003), both C and q
are exogenous, and so uncertaintyis objective; moreover, agents' preferences
are defined on the significantlylarger set of all possible triplets that consist of
payoff profile, confidence region, and reference model.
In any case, observe that when (23) is viewed as a preference functional
on F, then it actually represents variational preferences that have as ambiguity index the sum of Sc and OR(-IIq). As a result, Wang'spreferences are a
special case of variationalpreferences once they are interpretedin our setting.
4.2.2. Gini and mean-variancepreferences
We say that a preference ? on F is a Ginipreferenceif
fg
min(f u(f) dp+?OGW(p11q))
min
u(g)+POGw(p||q)),
dp
•
where 0 > 0, q e An, u: X -- R
is an affine function, and G"(. IIq): A -- [0, co]
is the weighted relative Gini index given by
G"(p IIq)
=
w(s)dp
00c,
(s) - 1
dq(s),
if p
EA•(q),
otherwise.
Like the weighted relative entropy R"(p IIq), the Gini index GW(p 11q)
also is a special case of divergence Dw(p IIq) defined in (14), with 4(t) =
1474
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
2-1(t - 1)2.12 As a result, by Theorem 16, Gini preferences are continuousvari-
ational preferences, with ambiguityindex OG"(. IIq).
In particular, - is probabilisticallysophisticatedwhen w is uniform. In this
case, when X is the set of all monetary lotteries and u(t) = t for all t E R,
we call the restrictionof these preferences to the collection Bo(X) of all monetary acts monotone mean-variancepreferences,written >mmv; that is, for all
f, ge Bo(1),
mmV
f
g
i
Q
(ffdp+
OG(p I q))
gdp+OG(p
iq))
>amin(
Since the Gini index is, along with Shannon's entropy, a classic concentration
index, monotone mean-variance preferences are a natural example of divergence preferences. However, we are not considering them just for this: their
main interest lies in the close connection they have with mean-variance preferences.
In fact, consider the classic mean-variance preferences of Markowitz(1952)
and Tobin(1958) that are defined on Bo(X) by
(24)
f
-mv g
f dq
-
g dq
Var(f)
--
Var(g),
where Var is the variance with respect to q. These preferences are not
monotone unless their domain is suitably restricted to the set M on which
the (Gateaux) differential of the mean-variance functional f ? f fdq (1/20) Var(f) is positive (as a linear functional). The convex set M, called the
domain of monotonicity of >mv, is where these preferences do not violate the
monotonicityAxiom A.4; that is, where they are economically meaningful.
24: Thedomain of monotonicityM of >-mvis the set
THEOREM
jf
e Bo() :f -
f
f dq 0,
<
q-a.s.}.
Moreover,
ftmg
f d (p+
(min
'min(
G(q)
I
gdp + OG(p q))
12Theclassic Gini concentration index can be obtained by normalizationfrom the relative
index (withuniformw) in the same way Shannon'sentropycan be obtainedfrom relativeentropy.
VARIATIONALREPRESENTATIONOF PREFERENCES
1475
for all f, g E M.
By Theorem 24, mean-variance preferences coincide with monotone meanvariance preferences once they are restricted on their domain of monotonicity M, which is where they are meaningful.
Inter alia, this result suggests that monotone mean-variance preferences
are the natural adjusted version of mean-variance preferences that satisfy
monotonicity. This insight is developed at length in Maccheroni, Marinacci,
Rustichini, and Taboga (2004), and we refer the interested reader to that paper for a detailed analysisof this class of variationalpreferences.
Istitutodi Metodi Quantitativiand IGIER, UniversitaBocconi, Milano, 20100
Italy,
Dipartimentodi Statisticae MatematicaApplicataand CollegioCarloAlberto,
Universitui
di Torino,Torino,10122 Italy;[email protected],
and
Dept. of Economics, Universityof Minnesota,Minneapolis,MN 55455, U S.A.
ManuscriptreceivedAugust,2004;final revisionreceivedApril,2006.
APPENDIX A: NIVELOIDS
The set of all functions in Bo(2) (resp. B(2)) that take values in the interval
K C R is denoted by B0(2, K) (resp. B(X, K)).
When endowed with the supnorm,Bo(Y) is a normed vector space and B(2?)
is a Banach space. The norm dual of Bo0() (resp. B(.)) is the space ba(.)
R endowed with the
of all bounded and finitely additive set functions
:'X-for all ?cE B0(E) (resp.
total variationnorm, the dualitybeing (?P,/t) = f cpdlt
B(,Y)) and all / E ba(2) (see, e.g., Dundorf and Schwartz(1958, p. 258)).
As is well known,the weak* topologies o-(ba(.X),Bo(N)) and o-(ba(2), B(.))
coincide on A(Y). Moreover, a subset of A7(X) is weakly* compact if and only
if it is weakly compact (i.e., compact in the weak topology of the Banach space
ba (X)). For pI, I E B(s), we write ?p> tp if ?p(s) > f&(s)for all s E S.
if
A functionalI: 0 -- R, defined on a nonemptysubset P of B(2), is a niveloid
I((p) - I($i) < sup((p - 0)
for all ?p, f E 0; see Dolecki and Greco (1995). Clearly a niveloid is Lipschitzcontinuous in the supnorm. A niveloid I is normalizedif I(kls) = k for all
k e R such that k1s ec0. With a little abuse, we sometimes write k instead
of k1s.
LEMMA25: Let 0 E int(K).A functional I: BO0(.,K) -+ R is a niveloid if and
only if,for all cp, f E Bo(Z, K), k E K, and a E (0, 1),
(i) c > q impliesI(pc) > I(ti), and
(ii) I(atp + (1 - a)k) = I(acp) + (1 - a)k.
1476
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
In this case, I is concave if and only if
(iii) I(tf) = I(Q?p)implies I(aqf + (1 - a) p) > I((p).
Properties (i) and (ii) are called monotonicityand verticalinvariance,respectively. The proof of Lemma 25 can be found in Maccheroni, Marinacci,and
Rustichini (2005). If I:(P
I(I) = sup [I()
-*
R is a niveloid, then
+ inf(qi(s) - p(s))]
iV e B(I)
is the least niveloid on B(s) that extends I (see Dolecki and Greco (1995)).
Moreover:
* If ( is convex and I is concave, then I is concave.
* If (P+ R Bo(),13 then I is the unique niveloid on B(X) that extends I (see
Maccheroni,
_ Marinacci,and Rustichini (2005)).
If 0 is convex and I: (P -- R is a concave niveloid, direct application of
the Fenchel-Moreau theorem (see, e.g., Phelps (1992, p. 42)) to I guarantees
that
I(p)
=
minLEba(5)((p, /-) - I*(Aj)), where I*(1i) = infqB(Y)((q, p ) - I(0)) is the
Fenchel conjugate of I. If At is not positive, there exists qp> 0 such that
a((p, /t) - I(0) for all a >0, whence
(?, /t) < 0. Then (ap, ) - I(a?p)
=
=
If
+ b) =
,t(S)
1, choose q _ B(2). Then (41 + b, F) I*(QL) -00.
I(0q
and so I*(1i) = -oc. That is,
(q, /-) - I(jq) + b(CL(S) - 1) for all b
•R
(cp) = min ((?, p) - I*(p));
(25)
pEA(?)
see also Fllmer and Schied (2002, Theorem 4.12). Set I*(p) = I*(p) for each
pe
p E A(s) and set d,I(4o) = {p
A():I(qj)
- I(?p) < (q -
n,p) for each
The next two results are proved in Maccheroni, Marinacci, and Rustichini
(2005).
LEMMA26: Let 0 be a convexsubsetof B(s) that contains at least one constantfunction, and let I : -- R be a concaveand normalizedniveloid.Then:
(i) Foreach p e A(E), I*(p) = infEP((I, p) -I(-q)).
(ii) I* :A(X) -+ [-oo, 0] is concave and weakly*uppersemicontinuous.
p) - I*(p)) and it is not
(iii) For each Ee 0, ,I(p?) =
argminpEA(z•(((p,
empty.
= {I* = 0} = argmaxpeAJ()I*(p) for all k E R such that
(iv) dI(kls)
kls E (P.
13p?+ R is the set {~c+ b: 0E, b E R}. An importantspecial case in which P + R = Bo(E) is
unbounded.
when 0 = Bo(2, K) and K is•p
VARIATIONALREPRESENTATIONOF PREFERENCES
1477
(v) I* is the maximalfunctional R" A(X) - [-oc, 0] such that
(26)
inf ((p, p)- R(p))
I(po)= pEA(l)
Vo E P.
Moreover,if P + R 2 Bo(2), 1*is the uniqueconcave and weakly*upper
semicontinuous function R :A()) - [-oo, 0] such that (26) holds.
(vi) If (26) holds and T 0 is such that sups,,s (s) - infses ii(s) < b for all
j
, then
_
iqe
(27)
I(f) =
((p , p) - R(p))
inf
{peAd():R(p)>-b}
V i E T.
PROPOSITION27: Let I: Bo(2, K) --+ R be a normalized concave niveloid,
Thenthefollowing conditionsare equivawith K unboundedand L a ur-algebra.
lent:
E . withE1 D E2 D . and n,• En =
(i) If c, q E Bo(X, K), k E K,
{En•}>i
that thereexistsno > 1 such.that
0, then I (p) > I ( /) implies
I(klEn0
+ (lE)>
1).
(ii) The set {p E A(X) : I*(p) > b} is a weaklycompact subset of A((Y) for
each b< 0.
(iii) Thereexists q E d"(X) such that {p E A() :I*(p) > b} is a weaklycompact subsetof Ao(X, q) for each b < 0, and,for each 4 E Bo(I, K),
(28)
I(G) =
mi
pEA0(.,q)
APPENDIX
((, p) -
*(p)).
B: PROOFS OF THE RESULTS IN THE MAIN TEXT
The main result proved in this appendix is Theorem 3. Its proof proceeds
as follows: using Lemma 28, we first show that if - satisfies Axioms A.1-A.6,
then there exists a nonconstantaffine function u: X -- R and a normalized and
concave niveloid I: Bo(0, u(X))
R such that f >- g if and only if I(u(f))
>
-A delivers the desired variational
I(u(g)). Then (25) obtained in Appendix
representation I(u(f))
= minp,,(n)(f u(f) dp + c(p)).
We now move to the proofs. The standardproof of Lemma 1 is omitted.
LEMMA28: A binaryrelation > on F satisfiesAxioms A.1-A.4 and A.6 if
and only if thereexista nonconstantaffinefunction u :X -- IRand a normalized
niveloid I :Bo(., u(X)) --+ R such that
ft g
I(u(f))
> I(u(g)).
PROOF:Assume that > on . satisfiesAxioms A.1-A.4 and A.6. Let x, y E X
be such that x - y. If there exists z E X such that x + z
y + z, with-
1478
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
out loss of generality ix ?+ z >- y + 1z; by Axiom A.2 (we can replace z
with x to obtain),lx + x > y + x and (we can replace z with y to obtain) ix ?+ y >- y ?+ y, and we conclude that x >- y, which is absurd. Then
the hypotheses of the mixture space theorem (Hernstein and Milnor (1953))
are satisfied and there exists an affine function u:X -+ R such that x 'e y if
and only if u(x) > u(y). By Axiom A.6 there exist f, g e F such that f >-g. Let
x, y E X be such that x >- f(s) and g(s) > y for all s e S. Then x >
f
>- g > y
implies x >-y and u cannot be constant. Moreover, u is unique up to positive
affine transformationsand we can assume 0 E int(u(X)).
For all f e F, let x,y E X be such that x - f(s) >- y for all s E S. Then
x > f > y. By Axiom A.3, the sets {aE [0, 1]:ax + (1 - a)y >- f} and {a E
[0, 1]: ff ax + (1 - a)y) are closed; they are nonempty because 1 belongs to
the first and 0 belongs to the second; their union is the whole [0, 1]. Because
[0, 1] is connected, their intersection is not empty;hence, there exists P3 [0, 1]
such that fx + (1 - 3)y - f. In particular,any act f admits a certaintyequivalent xf E X.
f
and f
If
~
xf, set U(f) = u(xf).
The function U is well defined because
f
- xf
~ y, with xf, yf EX implies xf - yf and u(xf) = u(yf). Clearly,f > g
if and only if xf xg if and only if u(xf) > u(xg) if and only if U(f) > U(g).
Therefore, U represents>-.
If f F, then u(f) e Bo(1, u(X)). Conversely, if p E B0o(, u(X)), then
?p(s) = u(xi) if s e Ai for suitable xl, ..., XN EX and a partition {(A, A2,
S.., AN) of S in X. Therefore, setting f(s) = x, if s e Ai, we have ?q= u(f).
We can conclude that Bo(1, u(X)) =
E F). Moreover, u(f) = u(g) if
S if and only if f(s) - g(s) for all s e S,
and only if u(f(s)) = u(g(s)) for all s E{u(f):'f
and, by Axiom A.4, f -- g or, equivalently,U(f) = U(g).
Define I('p) = U(f) if cp= u(f). By what we have just observed, I:Bo(,
u(X)) -+ R is well defined. If ?c= u(f), 0f= u(g) e Bo(Z, u(X)), and p > q ,
then u(f(s)) > u(g(s)) for all s E S and f(s) - g(s) for all s e S, SOf g,
U(f) > U(g), and I(?p) = I(u(f))
= U(f) > U(g) = I(u(g)) = I(iq). There-
fore, I is monotonic. Take k e u(X), say k = u(x). Then I(k1s) = I(u(x)) =
U(x) = u(x) = k. Therefore, I is normalized.
Take a E (0, 1), ?P= u(f) e Bo(., u(X)), and k = u(xk) e u(X); denote
by x0 an element in X such that u(xo) = 0. Choose x, y E X such that x>
f (s) >-y for all s e S. Then ax +(1- a)xo > af (s) + (1- a)xo >- ay + (1- a)xo
for all s e S. The technique used in the second paragraphof this proof yields
the existence of p E [0, 1] such that P(ax + (1 - a)xo) + (1 - P)(ay + (1 a)xo) - af + (1 - a)xo, i.e., az + (1 - a)xo - af + (1 - a)xo, where z =
pix + (1 - P)y E X. Then, by Axiom A.2, az + (1 - a)xk - af + (1 - a)xk
and
I(ap? +(1 - a)k)
= I(u(af + (1 - a)xk))
=
u(az + (1 - a)Xk)
VARIATIONALREPRESENTATIONOF PREFERENCES
1479
= au(z) + (1 - a)k = au(z) + (1 - a)O + (1 - a)k
= u(az + (1 - a)xo) + (1 - a)k
= I(u(af + (1 - a)xo)) + (1 - a)k = I(ap?) + (1 - a)k.
By Lemma 25, I is a niveloid, and we alreadyproved that it is normalized.
Conversely, assume there exist a nonconstant affine function u: X -- R
and a normalized niveloid I: Bo(Z, u(X)) -WR such that f - g if and only
if I(u(f)) > I(u(g)).
Choose c E R such that 0 e int(u(X) + c) and set v = u + c. Define
J:
v(X)) --+ R by J((p) = I('p - c) + c. Notice that J is a normalized
Bo(,, and
niveloidl4
f >g
> I(u(g))
I(u(f))
SI(u(f)+
SI(v(f)
c > I(u(g) + c-c)
c-c)+
- c) + c > I(v(g) - c) + c
SJ(v(f))
+ c
> J(v(g)).
Clearly,> satisfies Axiom A.1.
If f, g E
x, y c X, and a E (0, 1), then av(h), (1 - a)v(z), av(h) + (1 •,
e
a)v(z)
Bo(Y, v(X)) for h = f, g and z = x, y. Moreover,
af + (1 - a)x > ag + (1 - a)x
=
=
=
=
=
+ (1 - a)v(x)) > J(av(g) + (1 - a)v(x))
J(av(f)) + (1 - a)v(x) > J(av(g)) + (1 - a)v(x)
J(av(f)) + (1 - a)v(y) > J(av(g)) + (1 - a)v(y)
J(av(f)
J(av(f)
+ (1 - a)v(y)) > J(av(g) + (1 - a)v(y))
af + (1 - a)y > ag + (1 - a)y
and Axiom A.2 holds.
If f, g, h E , a E [0, 1], and there exists a, E [0, 1] such that ac, - a and
af + (1 - an)g9 h for all n > 1, then v(anf + (1 - an)g) = av(f) + (1 an)v(g) converges uniformly to av(f) + (1 - a)v(g) = v(af + (1 - a)g). The
inequality J(v(anf + (1 - an)g)) > J(v(h)) for all n > 1 and the continuity
of J guarantee J(v(af + (1 - a)g)) > J(v(h)). Therefore, {a E [0, 1]: af +
14Infact, for all ?p,4 E Bo(2, v(X)),
J(p) - J(f)
= I(p( - c) + c - I((
- c) - c < sup((p - c) - (qf - c)) = sup(?p - qf).
Moreover,for all t E v(X), J(t) = I(t - c) + c = t - c + c = t.
1480
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
(1 - a)g - h} is closed. A similar argument shows that {a E [0, 1]: h > af +
(1 - a)g} too is closed, and Axiom A.3 holds.
Given f, g E T, f(s) > g(s) for all s E S if and only if J(v(f(s))) > J(v(g(s)))
for all s if and only if v(f(s)) > v(g(s)) for all s, then monotonicityof J yields
J(v(f)) > J(v(g)). This shows Axiom A.4.
Finally,because v is not constant and it represents on X, there exist x >- y
and Axiom A.6 holds too.
Q.E.D.
PROOFSOF THEOREM3 AND OF PROPOSITION6: Assume > satisfies Ax-
ioms A.1-A.6. By Lemma 28, there is a nonconstant affine function u:X -R
and a normalized niveloid I: Bo(X, u(X)) --+ R such that f - g if and only if
I(u(f))
>_I(u(g)).
Next we show that Axiom A.5
implies that I: B0o(, u(X)) -* R is concave.
Let ?q,1'fe B0o(, u(X)) be such that I(qp) = I((qi) and a E (0, 1). If f, g e .F
are such that op= u(f) and
ii
= u(g), then f - g and, by Axiom A.5, af + (1 -
a)g > f, that is,
I(a••
+ (1 - a)q/) = I(au(f) + (1 - a)u(g))
= I(u(af + (1 - a)g))
> I(u(f)) = I(p).
Lemma 25 guarantees the concavityof I.
The functional I: Bo(X, u(X)) --+R is, therefore, a concave and normalized
niveloid. For all p e A(I), set c*(p) = -I*(p). Lemma 26 guarantees that
c*(p) =
inf
qEBoB( ', u(X ))
((i, p) -I(q))=sup(u(xf)
fET
-
u(f)dp /
for all p E A(.) (where xf is a certaintyequivalent for f), that c* is nonnegative, grounded, convex, and weakly* lower semicontinuous,and that
I((p)
= min
+pc*(p)
(pdp
Y
Vp
Bo(,,u(X));
see also (25).
Let c: A(s) -+ [0, 00] be a grounded, convex, and weakly* lower semicontinuous function such that
fig
p(min
u(f)dp+ c(p)
> min
u(g)dp+ c(p))
VARIATIONALREPRESENTATIONOF PREFERENCES
1481
For all o = u(f) E Bo(X, u(X)),
I(w)=
I(u(f))
=
u(x) = mi
(f
= mm
PEA(-Y)
(/
p(fu(f)dp+c(p))
u(x)
dp +c(p)
= minm
peA(sY)(fdp
(J
+c(p).
Lemma 26 guarantees c* < c (this concludes the proof that (i) implies (ii)).
Moreover, if u(X) is unbounded, again Lemma 26 guarantees c = c* (this
proves Proposition 6).
For the converse, notice that u and I(cp) = minpeA(X)(f~pdp + c(p)) are,
respectively,a nonconstant affine function and a normalized niveloid that represents >. By Lemma 28, > satisfies Axioms A.1-A.4 and A.6. Concavityof I
Q.E.D.
guaranteesAxiom A.5.15
PROOF OF COROLLARY5: Let (uo, c;) represent > as in Theorem 3. If
(u, c*) is another representation of > (as in Theorem 3), by (6), u and uo are
affine representationsof the restrictionof - to X. Hence, by standarduniqueness results, there exist a > 0 and / E IRsuch that u = auo + P. By (7),
c*(p)= sup(u(xf)-
u(f) dp)
=sup
fE-F(
•auo(xf)+1-f(auo(f)+1)dp)
as desired. The converse is trivial.
=ac•*(p),
Q.E.D.
LEMMA29: Let > be a binaryrelationon X representedby an affinefunction
u :X -- R. The set u(X) is unbounded(eitherbelow or above) if and only if j
satisfiesAxiom A.7.
The standardproof is omitted. Proposition 7 is a consequence of Lemma 32.
PROOF OF PROPOSITION8: (ii) trivially implies (i). As to the converse, let
(ui, c7) represent i as in Theorem 3, i = 1, 2, and set Ii(cp)= minpac()(f pdpc7(p)) for all coE Bo(0, ui(X)).
15Iff - g and a E (0, 1), then
I(u(af + (1 - a)g)) = I(au(f) + (1 - a)u(g))
> aI(u(f)) + (1 - a)I(u(g)) = I(u(f)).
1482
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
By (8) and the fact that u1 and u2 are not constant, we can choose
= = For all f e ?, if f ~1x, then f >2 X;therefore, Ii(u(f)) = u(x) <
ul u2 u.
I2(u(f)). This implies I < I12and
=
-
sup
? EBo(1,u(X))
(Ip)
cl(P)
<
fJ/
( dp)
sup
?PEBo(X,u(X))
\f)
I2(P)-fqDdp)=c2(p)
for all p e A(?).
Q.E.D.
PROOFOF PROPOSITION
12: Observe that the functions c, are weakly*
lower semicontinuous on A and so is f u(f) dp + Cn(p) for each n. Using this
observation,we now prove (i) and (ii).
(i) The decreasing sequence {f u(f) dp + c,(p)}n pointwise convergesto
(29)
f
U(f) dp
(P)
•Undomcn
Hence, by Dal Maso (1993, Proposition5.7), this sequence F-converges to
/
u(f) dp +
c, (P)
Un,dom
the weakly* lower semicontinuousenvelope of (29). By Dal Maso (1993, Theorem 7.4), this implies
limmin
u(f)dp+cn(p) =minf
u(f)dp +
cn
5U,,dom
(ii) Since argmin c, = (p E A: cn(p) = 0), the increasingsequence If u(f) dp+
Dal Maso (1993, p. 47), F-converges)
c,(p))}, pointwise converges (and so, by
to
J
u(f) dp + Sn
argmincn
(p)
Because A is weak-* compact, Dal Maso (1993, Theorem 7.8) implies
limmin
n pA I
u(f ) dp + cn(P)
)d
=min
+
u(f) dp +
S
8FnnargminCn(P)
Q.E.D.
VARIATIONALREPRESENTATIONOF PREFERENCES
1483
PROOFOF THEOREM13: Let (u, c*) represent >- as in Theorem 3 and set,
for all cpe B0o(, u(X)), I(Qp)= minpeA( )(f dp + c*(p)). It is easy to check
if I•psatisfiescondition of
that - satisfiesAxiom A.8 on .F if and
only
(i) Proposition 27 on Bo(E, u(X)). Unboundedness of u(X) and the relation c* = -I* allow us to apply Proposition 27 and to obtain the desired equivalence. Q.E.D.
LEMMA
30: Let >- be a variationalpreferencethat satisfiesAxiom A.8. Then
inf
Sg
(f u(f) dp + c*(p)
>
u(f) dp+c*(p))
Vf, gEc.
Xinf(
PROOF:Let (u, c*) represent >- as in Theorem 3 and set I(p) =
minpEA(,)(f dp + c*(p)) for all cpE Bo(E, u(X)). Let p e int(Bo( , u(X))),
E, { 0, and e > 0 such that cp- se Bo(E, u(X)). Then
f
I(lEcp + (minp En-E)E,)
-
I(p) < I(
-
lE) - I( P)
< -ep(E,) < O
for all pee dI(q(). Consider a sequence {kj}jl in u(X) such kj < I(?O)and
kj f I(qo).By Axiom A.8, I satisfies (i) of Proposition27 on Bo(Y, u(X)). Then,
for all j > 1, there is no > 1 such that kj < I(cplE o + (min q E
Hence,
8)lEn0).
m
>
because
the
limn-o I(~plEc + (min - e)lEn)
sequence I(?plEc + (min c kj
limit
to
the
for
j -- oc, we obtain
e) 1En) is increasing. Passing
(30)
lim I(GplE + (minp - 8)1E,)
IE =
n---> oc
)
and p(E,) -- 0 (uniformlywith respect to pe dI(?p)); that is, OdI(p) is a
(weaklycompact) subset of A0(Y). By Lemma 26, minPEA(X)(f?pdp + c*(p)) is
attained for all 'p e int(Bo(Z, u(X))) in (dI(c) and hence in) Az(E). Hence,
I(cp)= (minf~pdp+c*(p)) and
J('p)=
)in(f
'pdpc+ c*(p)
coincide on int(Bo(Z, u(X))) and, being continuous, they coincide on Bo( ,
Q.E.D.
u(X)).
The results of Section 3.5 require some notation and preliminaries.For all
cp e L'(, q), set G,(t) = q({Qp> t}) for each t e R and set T~(b) = inf{t e
R: G,(t) < b} for each b e [0, 1].
1484
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
The function G, is the survivalfunction of ?c, and 7, is the decreasingrearrangementof cp.Two functions c', / e L'(Z, q) are equimeasurableif G,=
GQ(if and only if 7, = F,). We refer to Chong and Rice (1971) for a comprehensive study of equimeasurability.The preorder -<cx defined in Section 3.5
can be naturally regarded as a relation on L'(Z, q) by putting p -<,, ji if and
only if f P(cp)dq < f P(qj) dq for every convex 0 on R.
Analogously, the definition of Shur convexity (resp. rearrangementinvariance) can be spelled for functions T: L(Z, q) -- (-oo, 0o] by requiringthat
p -<cx
implies T(?9) < T(Iq) (resp. p
cx
implies T(o) = T(ql)).
Put ~p-<-<,cxq if and only if fo'1,(b) db < fo'F(b) db for all t E [0, 1]. Then
-,,<c
if and only if qp-<-<c
,
and
f
p dq =
f qjdq
(see, e.g., Chong (1976)).
Moreover, the equimeasurabilityrelation is the common symmetricpart -,cx
of <-<cx and -<cx. Simple manipulation of the results of Luxemburg (1967,
pp. 125-126) yields the following lemma:
LEMMA31-Luxemburg:
Let q be adequate. If T: Bo(Z, K) -+ (-00, 00] is
then
the
invariant,
rearrangement
function definedfor all p e L' (Z, q) by H (S) =
is
5'
Shur convex. If H is monotonic, then p
(p/
dq T(
-<-<cx
supEBO(Z,K) (f
q,))
H
H
implies (?) < (p').
PROOF:Let c' E L1'(, q). By Luxemburg(1967, Theorem 9.1) for all qJE
Bo(Z1,K),
(31)
maxJpfi'dq:
~'~
f' e
=
Bo(,, K)
fF,(t)Fq(t)
dt.
Therefore, H(p) < sup,B,,,(,K)( f0 ,(t)q,(t)dt - T(q,)). Again by (31),
e Bo(,, K) there exists fi' e Bo(V, K) with q,'
such that
q,
-- q'
=
then
dq. Since T(q')
f; F,(t)FI(t) dt =
f~,c'
T(q,),
for all
J
T(q) =
IF,(t)F,(t)dt-
p'dq-
T(qj) =
Jp'dq-
T(')
and
=
H(•o)
sup
qEBo(1,K)
0
F(t)Fq(t)dt-
T(q)
.
If ? -<,x 9?',an inequalityof Hardy(see, e.g., Chong and Rice (1971, pp. 57-58))
delivers H(Sp) < H(?o'). Let cp-<-<,cx '. By Chong (1976, Theorem 1.1), there
is a nonnegative Sc"e L' (2, q) such that Sc+ Sc"-<,, S'. If H is monotonic, then
H((p) < H(Sc + So")< H(cp').
Q.E.D.
1485
VARIATIONALREPRESENTATIONOF PREFERENCES
REMARK1: Analogously, if q is adequate and T: L'(2, q) -+ (-oo, oo] is
rearrangementinvariant (resp., q e A'z(2) and T is Shur convex), then the
function defined for all cpe Bo(X) by H(cp) - sup,eL1(2,q)(f cp dq - T(f)) is
Shur convex. If H is monotonic, then ?o
9' implies H(?p) H(p'). (If T is
Shur convex and q is not adequate, use -<-<•
Chong and Rice (1971,_Theorem 13.8)
rather than Luxemburg(1967, Theorem 9.1).) See also Dana (2005).
PROOFOF THEOREM14: For f e F, qf denotes the finite support probability on X defined by qf(x) = q(f-'(x)) for all x E X. We prove (i) =4 (ii) =
(iii)
(iv) = (i).
=:== (ii) This step can be proved by a routine argument.
(i)
(ii)
(iii) Let >-be a continuous unboundedvariationalpreference, which
=
is also probabilistically
sophisticated with respect to an adequate q E A?(E).
Let (u, c*) represent > as in Theorem 3 and set, for all p? Bo(2, u(X)),
I(p?) = minp A)s,(f
(p
dp + c*(p)). For all p E A,
*(p)=sup u(xf)fET
\
u(f) dp)=
Jf
/
sup (I(j)- f
EBo(2,u(X))
\
Jf
jdp).
Theorem 13 guarantees that {c* < oo} C Ao(2). If p e A'( )\A'(, q), there
exists A E Y such that p(A) > 0 and q(A) = 0. If u(X) is unbounded below, without loss of generality assume u(X) 2 (-oo, 0]. Let x, e u-l(-n) and
y E u-1(0). Consider the act f, = x,Ay and the constant act y. Since qf, = 6y =
qy, by probabilisticsophistication, f - y and I(-nlA) = I(u(fn)) = 0 for all
n> 1; therefore,
c*(p)
-
sup(I(-nlA)
nal
\(Jf
lA dp) =
00.
If u(X) is unbounded above, without loss of generality assume u(X) D [0, oo).
Since qf, Let x,, E u-l'(n) and y e u-'(0). Consider the act f, =
yAx,.
~
=
=
all
n
then
and
therefore,
n for
> 1;
I(nlAc)
I(u(fn))
fn x,
qx,,
c*(p) > sup (I(nlAc)
n>l
-
8x,
-
- n(1 - p(A))) = oo.
nlA dp)/ = sup(n
n>l
We conclude that {c* < oo}Al(
, q). In particular, 40 = ?o'q-a.s. implies
C
=
I(p) I(4').
Assume cp, E Bo(X, u(X)) are equimeasurable. Therefore (see, e.g.,
Chong and Rice (1971, p. 12)), there exist x1, ... , xe X, with u(x1) > ... >
u(x,), and two partitions {A1, A2, ..., A,} and {B1, B2, ..., Bj} of S in 2, with
q(Ai) = q(Bi) for all i = 1, 2, ..., n, such that q-a.s.
n
n
=
LU(Xi)1A,
i=1
and
i=1
=•u(xYi)lBi.
1486
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
If f(s) = xi for all s E Ai and g(s) = xi for all s E Bi, then f,g e IF and
qf = qg. By probabilisticsophisticationof -, we obtain f - g, whence I(?p) =
I(u(f)) = I(u(g)) = I(if) and I is rearrangementinvariant.Setting T(f) =
-I(-q#), T:Bo(2, -u(X)) -- R is rearrangementinvarianttoo. For all p E
Ao(E, q),
c*(p)
sup
(J
sup
(
4EBo(X,u(X))
dp)
dq
dq- T()
T(
-
,EBo(,,-u(X))
=
By Lemma 31, H(p?)
-ux(f (p dq - T(tI)) for all ?pc L'(V, q)
supo(EBO,(x
is Shur convex; therefore,
is
c*(p) Shur convex and a fortiori rearrangement
invariant.
(iii) =? (iv) Consider T: L'(C, q) -+ [0, co] defined as
(32)
T(f) =
c*(p,),
if J > 0 q-a.s., f/
00,
otherwise,
dq= 1,
where p, is the element of AO(I, q) such that dp,/dq =
set
S4
L'(X,
q):
i>
0
q-a.s.
and
/idq=
. Denote by P the
1.
Let q/'~cx i. Then q E P if and only if q' e P (see, e.g., Chong and Rice (1971,
pp. 15-16)). If 4,, 4' P, then T(4) = oo = T(q'). If 4, 4' e P, then q and 4'
are the Radon-Nikodym derivativesof p, and p,,. By the rearrangementinvariance of c*, T(4') = c*(po,) = c*(pq,)= T(qI) and so T is rearrangement
invariant.By Theorem 13, (p A: c*(p) < t} is a weakly compact subset of A7
for each t > 0; therefore, {1 E L'(Z, q): T(q ) < t} is a weakly compact subset
of L'(E, q) and a fortiori T is weakly lower semicontinuous. Since obviously
T is convex, Luxemburg(1967, Theorem 13.3) guarantees that T (hence c*) is
Shur convex.'6
(iv) =* (i) Let be a variational preference. Let (u, c) represent > as in
Theorem 3 and assume c is Shur convex (with respect to q).17 Set, for all p e
definitionscaveat:Luxemburg(1967, pp. 123-124) defined Shur convexityof T by
16Different
His Theorem13.3shows
invariance,
rearrangement
convexity,andweaklowersemicontinuity.
that if T has these features, then o -<,, so'implies T((o) < T(?p').
'7Noticethatwe arenot assumingthat> is unboundedor continuous,or thatc = c*,orthatq
is adequate.
VARIATIONALREPRESENTATIONOF PREFERENCES
1487
B0(o), I(p) = minpeA(d)(f?odp + c(p)). Then
-I() =-
-p pdq
inf
sup (
.N,
q/ELIq)
(fJ;,q}
c(p)
pdql-T(q),
where T is defined as in (32), replacing c* with c. Let I ' -<c, . If /, V P, then
T(q) = oc > T(f'). Else if e P, f q' dq = f if dq = 1 and ip' > 0 q-a.s. (see,
/,
e.g., Chong and Rice (1971, p. 62)); that is, q' E P. The Shur convexity of c
ensures that T(i') = c(py,) < c(pq,) = T(if). Thus, T is Shur convex. By Remark 1, H(cp) = -I(-?p) is Shurconvex. Next we show that >- satisfies stochastic dominance. Assume that q({s E S: f(s)
x}) q({s E S:g(s)
x}) for all
x E X. For all t E u(X), q({s E S: u(f(s)) > t}) >_<
q({s E S: u(g(s)) > tj), and
this is a fortiori true if t V u(X). We conclude that Gu(f), Gu(g),F >F ,
<
F Uf) Fu(g (see, e.g., Chong and Rice (1971, pp. 30-31)), and -u(f) -<-<x
-u(g). Since H is monotonic, Remark 1 yields -H(-u(f))
> -H(-u(g)), so
that I(u(f)) > I(u(g)) and f ?- g.
Q.E.D.
PROOFOF LEMMA15: Groundedness (in particular, D (q IIq) = 0) and
convexityare trivial,so is Shurconvexityif w is uniform.Weak*lower semicontinuity descends from the fact that the sets {p E A: Dw(p IIq) < t} are weakly
compact in AU(q) for all t E R. The remaining part of the proof is devoted to
show this compactness feature.
We first consider a simple Y-measurable function w with min,,s w(s) > 0,
without requiringf w dq = 1. Set w = minSEsw(s) and without loss of generality suppose t > 0. By definition, {p E A: D(p IIq) < t} A'(q), hence
_
0
=
A
[
)<t
t}.
t}
q)
q)
:Dw(p
{p
11
{p
II < 0-D4(E AU:D•(p
Denote the set on the right-handside by D. We show that (a) limq(B>+op(B) =
0 uniformly with respect to p E D and (b) if {ppnl}l
D and p,(B) -- p(B) for
all B E Z, then p E D. Then a classical result of Bartle,
_ Dunford, and Schwartz
D
Dunford
and Schwartz(1958,
that
is
guarantees
weakly compact (see, e.g.,
Chapter IV)).s
(a) Notice that D'(p IIq) < D'(p IIq) for all p E A", so that
S= {p e d":D'(p
IIq) <
t}
{p E A: D(p
IIq)
< t}= C.
1"Part(a) guaranteesthat D is relativelysequentiallyweakly compact;that is, every sequence
{p,n in D admits a weaklyconvergentsubsequence {p,j }. Part (b) guaranteesthat the limit of p,j
belongs to D; that is, D is sequentiallyweakly compact. The Eberlein-Smuliantheorem guaran-
tees that D is weakly compact.
1488
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
Next we show that
p(B) = 0 uniformlywith respect to p E C and a
limq(B0>o
fortioriwith respect to p E D. Let p E C. Clearly,
JA
U'w
dq<
_ wo?-dq )dq<t+1
Y
f
w \9/U•d q)
= 0+, there exists 6 > 0 such that
WO(T)
For all e > 0, because lim,,,,
_
b
<b
(b)
0<
forallAeX.
if
b>
,
2(t + 1)
w4
28'
in particular,04(b) > 0. For all B such that q(B) < 8 and all p E C,
p(B) =
dq +
-2nld
Sdq
W
qdq-=
18
< --q(B)
252 8nt
21
<-8+
++w
>
dq nU
.
dp
dq
( LP)
fL)
S1 dq
w
dq
dq
WC•Pdo
p
8Ed
2
dq
wI
n•>
2(t+1)J
-
(ddq<e.
dq
This concludes the proof of (a).
Let u (B) = fBw dq for all B E I. Notice that
(33)
[ fdp.d,
D'(p IIq)= I
dq
if p E zAU(q),
(q),
otherwise.
00,
Denote by H,, the set of all finite partitions 7r of S in X finer than rw,=
{w- (b) :b e w(S)}. For all r E Hw,,define on Ad the function
Dw(pl,
0
IIq,)=A
E,7 q(A)
q(A)
(A),
with the conventions p -A) = 0 if p(A) = 0 and p(
)t(A) =- o if p(A) > 0,
when q(A) = 0. Notice that because rTis finer than r,,,, then for all A e:r,
)
w(A) is a singleton (improperlydenoted by w(A)) and
(A)=
wdq = w(A)q(A).
CLAIM1: Wehave Dl(p IIq) = sup,,,
D'(p, IIq,) for all p E A'.
VARIATIONALREPRESENTATIONOF PREFERENCES
1489
PROOF:Let p <<q. Take an increasinglyfiner sequence 7rn in IH such that
dp/dq is (U, l irr,)-measurable.Then
q-a.e.
)1A
dp
q(A) IA -- dq
Continuityof 4 and the Fa(see, e.g., Billingsley (1995, p. 470)), hence ctL-a.e.
tou lemma imply
J
< liminf
()dp\
O(p(A)N)
AEi
q(A) (A
n
,Aq)
-
: (p
< sup
AtE!!w
(A).
The converse inequality is guaranteed if f 4)(L)d-=
coc.Else, if dq(L)is
Jensen
the
conditional
inequalityguarantees that, for all iTre ,,
/t-integrable,
EM?
where m =
?dq)
Ix/It(S)
Em(
Em7dqI
m-a.e.,
)
\dq
and E is the expectation. Hence,
=-Em(Em
((o
\)
V
(dp7
dq)
)
1
dp
>Em
d
-( E kEm (dp
for all r E H,, but
dqdp
(dq
E:m(A)O
AE~r:m(A)ZO
A(I(S)
Ar:
dq
m(A)JA
1
L(A)
(w(A)q(A)
AE7T:
q(A)0
1A
-
AE-i: q(A) wO
w(A)q(A)w(A)
= 1A
AET
dpd)
(A) A(S)Adq
q *(A)
dq
(A))
1490
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
Therefore,
f
dp
()=d.
t(S)Em( (dp)
-
(
dq
dq)(SE
>/-t(S)Em(C(
t
(A),
A1A(P(A)))
(q(A)
(p(A)A)
AE7T
so that
fS
dp
) L supAE.
riH
p()j )
q(A)
( A).
If p E Ao\Ar(q), there exists B E . such that p(B) > 0 and q(B) = 0, and we
can assume it belongs to some r 1Hw,. Therefore,
supp
reHU'
Ar,
(A)
PS(p(A)p(B)
p(B
Kq(A) i(A)> 4 q(B))
B)=
oc-=
D(p
w
11q).
This concludes the proof of the claim.
Q.E.D.
(b) For all Ir E H,, set D, = {p E A':DI(p,1 IIq,) < t}. We show that
- p(B) for all B E I imply p E D,. First notice that
D), and p,(B)
{Pn}n
p E AO•l(for the Vitali-Hahn-Saks theorem). For all A e 7Tsuch that q(A) = 0,
then p,(A) = 0 for all n > 1 (else Dw((p,), 1Iq,) = cx)and hence
Pnq(A)
(0) = (kp (A))
q(A))
q(A)
For all A E Trsuch that q(A) > 0, clearly pn(A)
AE7T
h(p(A)
q(A) j(A)
p(A).
We conclude that
(Pn(A)YA)<
= lim
A
q(A)
t,
as wanted. Now (b) descends from Claim 1 and the observationthat
p={pe dA:Dw(p IIq) < t}
= pe6":
sup D(p,
"rrEHw
II
q,)5
t =
N
rEn-w
,.
VARIATIONALREPRESENTATIONOF PREFERENCES
1491
This completes the proof when w is simple. Suppose now that w is any
2-measurable function with w = infSEsw(s) > 0. Then there exists a sequence
of simple 2-measurable functions w, such that w, T w and minses w,n(s) > w
for all n > 1. By Levi's monotone convergence theorem, D "(p | q) T DW(p |
q) for all p E A-(q), therefore
D= {pe A'(q):Dw(p
=p
IIq) < t}
A'(q):supDn(pII
nl
q)<
tJ
=
n
n>l
We conclude that D as well is weakly compact.
Q.E.D.
PROOFOF THEOREM16: Lemma 15 and Theorem 3 guarantee that diver-
gence preferences are variational.Unboundedness of u together with Proposition 6 guarantees that c*(-) = OD'(- 11q). Finally, Theorem 13 and Lemma 15
Q.E.D.
imply that > satisfiesAxiom A.8.
PROOF OF THEOREM 18: Set I(cp)= minpe,,A)(fpdp
+ c*(p)) for all
p B0(Z). Because -c* coincides with the Fenchel conjugate of I (see
Lemma 26) on A(2) and I*(/t) = - 00 for all/t e ba(Y)\A(2), then
d(?p)= argmin (
U
qpeBo(2)
U
II((p)=
dp + c*(p)) V
Bo(),
argminfu(f)dp+c*(p)
fEBo(2)
and I is Gateaux differentiableon Bo(2) if and only if P*is strictlyconcave on
line segments in UEBO(l) I(cp), that is, if and only if c* is essentially strictly
convex.19
Let f E Bo(X). Set A = {u'(f) drIr e argminpEa(fu(f) dp + c*(p))} and notice that A is the image of argminpEa(fu(f) dp + c*(p)) = 8I(u(f)) through
the map from A(2) to ba(2) that associates to p(.) in A(X) the bounded and
finitely additive set function
u'(f) dp in ba(2). Because this map is linear
4.)
and a(ba(2), Bo(2))-o-(ba(2), Bo(2))-continuous, and because dI(u(f)) is
weakly* compact and convex, then A is or(ba(X),Bo(,))-compact and convex.
19Thisis proved in Bauschke, Borwein, and Combettes (2001). Notice that our definition of
essentialstrictconvexityis weakerthantheirsandit guaranteesthatstrictconvexityimpliesessentialstrictconvexity.
1492
E MACCHERONI,M. MARINACCI,AND A. RUSTICHINI
By standardresults,
(34)
I'(u(f); u'(f)h)=
mm(f
u'(f)hdp
= minf h dp,
I-A
EA
'Vhe Bo().
J
Let h E Bo(2) and s E S. If h(s) 0 O,then
u(f + th)(s) = u(f (s) + th(s))
= u(f(s)) + u'(f(s))th(s) + o(th(s))
for th(s) -+ 0, that is,
lim
u(f (s) + th(s)) - u(f (s)) - u'(f (s))th(s)
-
th(s)
th(s)-0O
and so
(35)
lim
u(f (s) + th(s)) - u(f (s)) - u'(f (s))h(s)t
t
= 0.
t3,O
Clearly (35) holds also if h(s) = 0. Because f and h are simple, the preceding
limit is uniformwith respect to s e S. Therefore, for t 4 0,
+
<
I(u(f th)) - I(u(f))
-t
< (I(u(f + th)) - I(u(f)
+ I(u(f) + tu'(f)h) I(u(f + th)) - I(u(f)
t
n
/la
minhd
+ tu'(f)h)
minf
I(u(f))))/t-
h
di
+ tu'(f)h)
+ tu'(f)h) - I(u(f))
? I(u(f)
t
Ilu(f + th) - u(f) - tu'(f)hll
t
minf hd
XEA
+ o(1),
where the last inequalitydescends from the Lipschitzcontinuityof I and (34).
Uniformity of limit (35) then delivers
im I(u(f + th)) I(u(f)) -minm hd
tq0
t
/eA
d
VARIATIONALREPRESENTATIONOF PREFERENCES
1493
that is, OV(f) = A.
for all h E B0o() or V'(f; -) = min, Ef(.-)
d/u,
if
c*
then
is essentially strictlyconvex,
Now,
dI(u(f)) is a singleton for every
=
in
dr
and
Ir E (u(f))} is a singleton too. Conversely,
f B0o()
dV(f)
{u'(f)
assume that V is Gateaux differentiable on Bo(0) and per contra that c* is
not essentially strictly convex. Then there exists ?0E B0o() such that I(40p)
contains two distinct elements r, and r2. Because u: R - R is concave, then
it is unbounded below and there is b E R such that p?+ b E Bo(2, u(R)). In
particular,there exists f E Bo(V) such that u(f) = ??+ b and
rl,
min(fdp+c*(p)
r2E
I(4)-=-arg
(cf(+b)dp+c*(p))=
=-argmin
d(u(f)).
It follows that u'(f) dr1,u'(f) dr2 e
Since V is Gateaux differentiable
•V(f).
on Bo(X), then u'(f) dr1= u'(f) dr2,
that is,
(36)
f
hu'(f) dr =
f
hu'(f) dr2 VhE Bo().
Because u is strictlymonotonic, concave, and differentiable,then u'(z) # 0 for
= fJi dr2 for all &E Bo(Y), contradicting
all z E R and (36) implies f f
drQ.E.D.
rl r2.
PROOF OF PROPOSITION19: Let (u, c*) represent - as in Theorem 3 and
without loss of generality assume [-1, 1] u(X).
_ Theorem 1), there is a weakly*
(i) =, (ii) By Gilboa and Schmeidler (1989,
compact and convex set C A(2) such that u(xf) - minPEC
f u(f) dp for all
f E F and each xf - f. By Theorem
3,
_
(37)
c*(p) = sup(minf u(f)dq rf.T
?qECff
u(f) dp
d)
Vp
A().
Suppose p E C. Then c*(p) < 0. Because c* is nonnegative, we have c*(p) = 0.
Next, suppose po V C. By the separatinghyperplanetheorem, there is a simple
measurablefunction p: S -+ u(X) such that f cpdp > f1? dpo for each p E C.
Hence, taking f E T such that 40= u(f), minpc f u(f) dp - f u(f) dpo > 0,
which in turn implies c*(po) > 0. We conclude that c*(p) = 0 if and only if
p E C. Therefore, for all f E F,
min(fu(f)
pE(f)
dp +c*(p)) = u()
=
min
= pE{c*=0}
u(f) dp
minf
peCf
u(f) dp.
1494
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
(ii) =4 (i) and (iii) =- (ii) are trivial.Now assume u(X) is unboundedabove
(resp., below).
(ii) ?= (iii) For all pEA(E),
c*(p)= sup
fey
S
mm
in
u(f)dq-
u(f)dp
qEf{c*=O)f}
sup (minf pdq-
oPeBo(X,u(X))
\q
({c*=0}J
Jf
pdp.
Suppose, c*(po) > 0. There exist a nonnegative (resp. nonpositive), simple,
measurable function p: S -- u(X) and e > 0 such that minpeIc=01o
f dp f podpo > e, but nqpE Bo(V, u(X)) for all n E N and
c*(po)>
minf
n dp -
for all n E N. We conclude c*(po) =
>n
n
•pdpo
0
.
Q.E.D.
The proof of Corollary20 is omitted. Just notice that Axiom A.5' can be used
to obtain affinityof the functionalI that appearsin Lemma 28 in the same way
in which Axiom A.5 is used to obtain its concavityat the beginningof the proof
of Theorem 3.
LEMMA
32: Let - be a variationalpreferencerepresentedby (u, c*) as in Theorem 3 and let q e A(2). Thefollowingconditionsare equivalent:
(i) q correspondsto a SEU preferencethat is less ambiguityaversethan >-.
(ii) c*(q) = 0.
(iii) q e 81(k) for some (all) k e u(X), where I(?p) = minpa,,()(f(p dp +
E B0o(, u(X)).
c*(p)) for all Op
In particular,any variationalpreferenceis ambiguityaverse.
PROOF:(i) =? (ii) Suppose to is a SEU preference, with associated subjective probabilityq and nonconstant affine utility index uo such that > is more
ambiguityaverse than >o. By (8), we can assume uo= u. By Proposition8, and
c* < co, and by Corollary20, co(q) = 0; hence, 0 < c*(q) < c(*(q)= 0.
(ii) =? (iii) We have c*(q) = 0 if and only if supEB,,0(z,,,,x(IX(p) -
f ?pdq) =
0
if and only if I((p) Sf p dq for all 'p e Bo(2, u(X)) if and only if I(Gp)- I(k) <
f 'p dq - f k dq for all 'p e B0o(?,u(X)) and all k e u(X) if and only if q e
lI(k) for all k e u(X).
(iii) =4 (i) If q e dl(k) for some k e u(X), then I('p) - I(k) < f p dq k
f dq for all 'p e B0o(, u(X)) and then I('p) <f p dq for all 'p e B0o(, u(X)).
Denote by o0the SEU preference,with associated subjectiveprobabilityq and
utility index u. Notice that for all fe•F and x e X, f x implies I(u(f)) >
u(x), a fortiori f u(f) dq > u(x) and f -o x.
VARIATIONALREPRESENTATIONOF PREFERENCES
1495
Ambiguity aversion of >- now follows from the observation that
Q.E.D.
argminpEA() c*(p) is nonempty and minpEA(.) c*(p) =0.
PROOFOFPROPOSITION
22: (i) Let 0, 4 0. By Proposition 12,
limmin f
+p u(f) dp OnD(p 1q
= minff u(f) dp+
{D(-lIq)<oo}(P)J
If w E LO(q) and w = infs s w(s), then wDp(p I| q) < Dl(p IIq) < IlwllKx
D4(p IIq), which implies domDp(. IIq) = domDw(. IIq). In turn, this implies
limminf fu(f)dp
n
+ OnD(pl q)
peA IfJpedomD(.IlIq)
J
u(f) dp.
mDinf
q)
Next we show that, for all cpE Bo( ),
(38)
inf
f
pedomD4(.I q)J
= inf f dp.
pdp
peA"(C,q)J
with a, > ... > a, and
every p E A(X, q), define p,E Bo(2) by {Ai}il
Let cp=
ajlAj,
a partition of S in . For
•,l
(A>
i: q(Ai)>0
q•
) i:Ai+
i:
1Ai
pq(Ai)=0
It is easy to see that q, is nonnegative and f rp dq = 1. Call p' the element of
is a simple function, then DO(p' II
A(~(, q) such that dp'/dq = fp,. Since
l
p,
q) = f 4(frjp)dq E R, so that p' E domDo(.
I| q). Since f cpdp' = p dp, we
then have I{fp dp:p e A(X, q)I}C {f p dp:p E domD(. IIq)}, which yields
(38) since the converse inclusion is trivial.
It remains to prove that
(39)
inf f
J
pedAa(q)
dp = ess
seS
min•p(s). >
Here essmin,,s p(s) = min{ja
0}. Let i* E {1,...,
:q(Ai)
n} be such that
= min{ai: q(Ai) > 0} and let qA,. be the conditional distribution of q
aoi
Then qAi, E A?(q), and f p dqA, = ess min,,s 4?(s). This proves (39).
on
Ai,.
Let
6,, f oc. By Proposition 23, strict convexity of 4 implies that of
(ii)
D (p 11q) on its effective domain. Hence, argmin
1Iq) - {q} for
each n and so the result follows from Proposition 12. O,,D(p
Q.E.D.
1496
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
PROOF OF PROPOSITION23: By Theorem 18, it is enough to show that
Dw(. I q): A(,) -- [0, 00] is strictlyconvex on its effective domain. Let p, p' E
domD'(. IIq), p p', and a e (0, 1). Let pt(B) = fBwdq for all B E .
The assumption infses
w(s) > 0 guarantees that the measure pt is equivalent to the probability q. By (33), p, p' e domD(. IIq) if and only if
f()\
<
(dp'') d
d,, f
00
if and only if
d(L),
ity and nonnegativityof 0 guarantee that
< dp
(1 - a)
( ad
-q
dp
<a(OP( -(s)
dq
) e L1(X,
•(
). Convex-
dp'\ (s)
dq /
N
d
dp'
+ (1- a)o4 -(s) )VseS;
/(dq
strict convexity,p : p', and a E (0, 1) imply that the second inequalityis strict
on a set E E X with q(E) > 0, and hence g(E) > 0. Therefore, 0(aL
+ (1 -
a )) E (L
L1 , g) and
(s) I (1- a)4 d (s)0
dp
-a
S
S
dp +(1-a)dp'\d>0,
dq dF
q )-J.
that is, Dw(ap + (1 - a)p' IIq) < aD (p IIq) + (1 - a)Dw(p' IIq), as desired.
Q.E.D.
+ R defined by J(p) =
is
concave
and
Gateaux
differentiable.
dq
Concavity is
(20)-'
Varq(Gp)
f
trivial.Moreover, for all p, I e Bo(V) and t E IR,
PROOFOF THEOREM24: The functional J: -B
Bo()
~
J(
+ to) =
(s +
tip)dq - 1Varq(?q + tf)
20
(
+[J(1-i
0
+ tq)
J(
t--+O ~t
J(
+[f
p
Jpdq ))dq]t+J(p),
so that
(40)
J'(Gp;q) =
limt
f
-0
dq
dq.
That is, for all p e B0o(2),the Gateaux differentialof J at (pis representedby a
measure with Radon-Nikodym derivative with respect to q given by 1 - 1 ('p -
VARIATIONALREPRESENTATIONOF PREFERENCES
1497
f ?pdq). Therefore, J'(cp;-) is positive as a linear functional on B0o() if and
only if
q({sS:p(s)
-fJ
dq })1.
This relation characterizesthe elements of the domain of monotonicity M. In
Maccheroni, Marinacci,Rustichini,and Taboga(2004) we show that
(f dp+?OG(p1lq))
mJ()-•=qin
V(
M.
Q.E.D.
REFERENCES
ANSCOMBE,E J., AND R. J. AUMANN(1963): "'ADefinition of Subjective Probability," TheAnnals
of MathematicalStatistics,34, 199-205. [1453]
K. (1970):Essaysin the Theoryof Risk-Bearing.Amsterdam:North-Holland.[1460]
ARROW,
BAUSCHKE,H. H., J. M. BORWEIN,AND P. L. COMBETTES(2001): "Essential Smoothness, Essen-
tial StrictConvexity,and Legendre Functionsin Banach Spaces,"Communicationsin Contem-
porary Mathematics, 3, 615-647. [1491]
BEN-TAL,A. (1985): "The Entropic Penalty Approach to Stochastic Programming," Mathematics
of Operations Research, 10, 263-279. [1464]
BEN-TAL,A., A. BEN-ISRAEL,AND M. TEBOULLE(1991): "Certainty Equivalents and Information Measures: Duality and Extremal Principles," Journal of Mathematical Analysis and Applications, 157, 211-236. [1464]
BILLINGSLEY,P. (1995): Probability and Measure (Third Ed.). New York: Wiley. [1489]
CHATEAUNEUF,A., E MACCHERONI,M. MARINACCI,AND J.-M. TALLON(2005): "Monotone
Continuous Multiple Priors," Economic Theory, 26, 973-982. [1460]
CHEN, Z., AND L. G. EPSTEIN(2002): 'Ambiguity, Risk, and Asset Returns in Continuous Time,"
Econometrica, 70, 1403-1443. [1448]
CHONG,K. M. (1976): "Doubly Stochastic Operators and Rearrangement Theorems," Journal of
MathematicalAnalysis andApplications,56, 309-316. [1484]
CHONG, K. M., AND N. M. RICE (1971): "Equimeasurable Rearrangements of Functions,"
QueensPapersin Pure andAppliedMathematics,28. [1461,1463,1484-1487]
DALMASO,G. (1993):An Introductionto F-Convergence.Boston: Birkhauser.[1460,1482]
DANA, R.-A. (2005): "A Representation Result for Concave Schur Concave Functions," Mathematical Finance, 15, 613-634. [1463,1485]
DOLECKI,
S., ANDG. H. GRECO(1995): "Niveloids,"TopologicalMethodsin NonlinearAnalysis,
5, 1-22. [1475,1476]
DUNFORD, N., AND J. T. SCHWARTZ(1958): Linear Operators, Part I. New York: Wiley. [1475,
1487]
DUPUIS,P.,ANDR. S. ELLIS(1997):A WeakConvergenceApproachto the Theoryof LargeDeviations. New York: Wiley. [1471]
ELLSBERG,D. (1961): "Risk, Ambiguity, and the Savage Axioms," Quarterly Journal of Economics, 75, 643-669. [1447,1465]
EPSTEIN,L. G. (1999): 'A Definition of Uncertainty Aversion," Review of Economic Studies, 66,
579-608. [1451,1454,1461]
EPSTEIN, L. G., AND T. WANG (1994): "Intertemporal Asset Pricing under Knightian Uncer-
tainty,"Econometrica,62, 283-322. [1448,1467,1470]
FOLLMER,H., AND A. SCHIED(2002): Stochastic Finance. Berlin: de Gruyter. [1476]
1498
E MACCHERONI, M. MARINACCI,AND A. RUSTICHINI
Made Precise:A ComparativeFounP., ANDM. MARINACCI
GHIRARDATO,
(2002): '"Ambiguity
dation,"Journalof Economic Theory,102, 251-289. [1451,1454,1457,1459]
I., ANDD. SCHMEIDLER
GILBOA,
(1989): "MaxminExpected Utility with a Non-Unique Prior,"
Journalof MathematicalEconomics, 18, 141-153. [1447,1449,1452,1454,1468,1493]
HANSEN, L., AND T. SARGENT(2000): "Wanting Robustness in Macroeconomics," Mimeo, Uni-
versityof Chicago and StanfordUniversity.[1448,1472]
(2001): "Robust Control and Model Uncertainty,"American Economic Review,91,
60-66. [1448,1460,1464,1468,1472]
HERSTEIN,
I. N., AND J. MILNOR(1953): "An Axiomatic Approach to Measurable Utility,"
Econometrica,21, 291-297. [1478]
Hsu, M., M. BHATT,R. ADOLPHS,D. TRANEL, AND C. E CAMERER(2005): "Neural Systems
Responding to Degrees of Uncertaintyin Human Decision-Making,"Science, 310, 1680-1683.
[1452]
KEREN, G., AND L. E. GERRITSEN(1999): "On the Robustness and Possible Accounts of Ambi-
guity Aversion,"Acta Psychologica,103, 149-172. [1452]
I. (2001): "ProceduralRationalityin the MultiplePriorModel," Mimeo, Universityof
KOPYLOV,
Rochester. [1457]
A., ANDJ. PERNER(2003): "The Role of Competition and Knowledge in the EllsKOHBERGER,
berg Task,"Journalof BehavioralDecision Making,16, 181-191. [1452]
LIESE,E, AND I. VAJDA(1987): Convex Statistical Distances. Leipzig: Teubner. [1463,1470]
LUXEMBURG,W. A. J. (1967): "Rearrangement Invariant Banach Function Spaces," Queens Papers in Pure and Applied Mathematics, 10, 83-144. [1463,1484-1486]
MACCHERONI,E, M. MARINACCI,AND A. RUSTICHINI(2005): "Niveloids and Their Extensions,"
Mimeo. [1476]
(2006): "Dynamic Variational Preferences," Journal of Economic Theory, 128, 4-44.
[1467,1472]
MACCHERONI,E, M. MARINACCI,A. RUSTICHINI,AND M. TABOGA(2004): "Portfolio Selection
with Monotone Mean-Variance Preferences," ICER Working Papers, Applied Mathematics
Series 27-2004. [1475,1497]
MACHINA,M. J., AND D. SCHMEIDLER(1992): "A More Robust Definition of Subjective Proba-
bility,"Econometrica,60, 745-780. [1461]
MARINACCI,M. (2002): "Probabilistic Sophistication and Multiple Priors," Econometrica, 70,
755-764. [1463]
MARKOWITZ,H. M. (1952): "Portfolio Selection," Journal of Finance, 7, 77-91. [1450,1460,1468,
1474]
A. W.,ANDI. OLKIN(1979): Inequalities:Theoryof Majorizationand Its Applications.
MARSHALL,
New York: Academic Press. [1462]
PHELPS,R. R. (1992): ConvexFunctions,Monotone Operatorsand Differentiability.New York:
Springer-Verlag. [1466,1476]
R. T. (1970): Convex Analysis. Princeton, NJ: Princeton University Press. [1449,
ROCKAFELLAR,
1459,1466]
ROTHSCHILD,M., AND J. E. STIGLITZ(1970): "Increasing Risk: I. A Definition," Journal of Eco-
nomic Theory,2, 225-243. [1462]
RUSTICHINI,A. (2005): "Emotion and Reason in Making Decisions," Science, 310, 1624-1625.
[1452]
SCHMEIDLER,D. (1979): '"A Bibliographical Note on a Theorem of Hardy, Littlewood, and
Polya," Journal of Economic Theory, 20, 125-128. [1461]
(1989): "Subjective Probability and Expected Utility Without Additivity," Econometrica,
57, 571-587. [1451,1454]
TOBIN, J. (1958): "Liquidity Preference as Behavior Toward Risk," Review of Economic Studies,
25, 65-86. [1450,1460,1468,1474]
WANG, T. (2003): "A Class of Multi-Prior Preferences," Mimeo, University of British Columbia.
[1473]