Outline Statement of Problem Adaptive Utility Example Decision Making with Uncertain Preferences Brett Houlding Department of Mathematical Sciences, Durham University, UK Brett Houlding Decision Making with Uncertain Preferences Future Options Outline Statement of Problem Adaptive Utility Outline: Statement of Problem Decision Problems Historical Treatment Example Adaptive Utility Research Needed Interpretation of Utility Parameter Adaptive Utility Functions Commensurable Utility Functions Surprise and Selection Strategy Trial Aversion and Value of Information Example Future Options Brett Houlding Decision Making with Uncertain Preferences Example Future Options Outline Statement of Problem Adaptive Utility Example Future Options Decision Problems Decision Problems I A Decision Maker (DM) must select a decision from within a set of available options D. I Initially there is uncertainty over what will be the resulting outcome of each available decision. I Let R be the set of all possible outcomes. I Selecting a decision is to be seen as equivalent to selecting a distribution over R. How should the DM select her choice? Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Example Example A company currently uses service X , but has the option of instead using a better service Y for an additional cost of c1 . Service Y can either lead to an increase of s1 or s2 in performance, with prior beliefs that either outcome is equally likely. I Possible outcomes (c, s) display changes to status quo in cost c and performance s respectively. I Possible decisions are to use service X or to use service Y . I Service X leads to outcome (0, 0) for sure. I Service Y leads to outcome (−c1 , s1 ) with probability 0.5 and outcome (−c1 , s2 ) otherwise. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment St Petersburg Paradox I Consider the game where a coin is successively flipped. I For a throw of a head £2 is won and placed in a pot. I For each further throw of a head the money in the pot is doubled. I The pot is given to the player as soon as a tail is thrown. I How much would you pay to play this game? Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment St Petersburg Paradox I I There is (1/2)n chance of winning £2n for n = 1, 2, . . . P∞ P∞ The expected value of game is n=1 (1/2)n × 2n = n=1 1 = ∞. Maximising expected return is not always a good way to make decisions! Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment Bernoulli and Utility Hypothesis I Consider measuring a person’s happiness just like we may do their temperature. I Rather than in degrees Celsius we do this in utils. I The more utils a person has the happier they are. I Bernoulli (1738) suggested that giving someone twice as much money will not necessarily give them twice as many utils. I A utility function u : R → R need not be linear. Suggest choose decision maximising expected utility. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment Notation I d1 d2 means d1 at least as preferable as d2 . I d1 d2 means d1 strictly preferred to d2 . I αd1 + (1 − α)d2 with d1 and d2 decisions means with probability α d1 , otherwise d2 . Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment Axioms I A1 Completeness: is a complete relation and the set of feasible decisions D is a closed convex combination of lotteries. I A2 Transitivity: is a transitive relation. I A3 Archimedian: If d1 , d2 , d3 ∈ D are such that d1 d2 d3 , then there is an α, β ∈ (0, 1) such that αd1 + (1 − α)d3 d2 βd1 + (1 − β)d3 . I A4 Independence: For all d1 , d2 , d3 ∈ D and any α ∈ [0, 1], d1 d2 ⇔ αd1 + (1 − α)d3 αd2 + (1 − α)d3 . Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment VNM and Utility Theorem Von Neumann and Morgenstern (1947) showed that axioms A1-A4 are all that is required to prove there exists a unique utility function (up to a positive linear transformation) with the properties that: 1. For all d1 , d2 ∈ D, u(d1 ) ≥ u(d2 ) ⇔ d1 d2 . 2. For all d1 , d2 ∈ D and any α ∈ (0, 1), u(αd1 + (1 − α)d2 ) = αu(d1 ) + (1 − α)u(d2 ). This proves we should choose decision with maximum expected utility return. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Example Example In our example all that is needed to determine the solution is the utility function for changes in cost and performance. I Assume u[(c, s)] = c + s. I Then u(X ) = 0 and u(Y ) = 0.5(−c1 + s1 ) + 0.5(−c1 + s2 ) I Select Y if 0.5(s1 + s2 ) > c1 otherwise select X . Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Historical Treatment CD and Uncertain Utility? I This is all fine and Anscombe and Aumann (1963) later allowed for subjective probabilities over outcomes of decisions. I Accepted generally by academic community. Witsenhausen (1974) of Bell Labs used utilities in considering how to make decision policies when we accept future opinions may be different. I Yet can we always be certain of what our utilities will be? I Cyert and DeGroot (1975) argued that at times we may be unsure of our utilities (and hence our preferences). How then make decisions? Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Adaptive Utility Adaptive Utility I Introduced by Cyert & DeGroot (1975) in the form of parametric utility functions where DM could learn of parameter through Bayesian updating. I Generalises Bayesian Statistical Decision Theory and coincides in single decision case. I The unknown parameter θ could be used to represent unknown trade-off weights or measure of risk-aversion etc. I Of interest in Marketing Theory where a new brand becomes available and Clinical Trials with unknown drug attributes. I Also assumed of use in R&D and Experimental Design. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Surprise and Selection Strategy Reliability and Testing The effects of adaptive utility upon testing and reliability were considered by Houlding and Coolen (2007): I Permits subjective cost of system failure to remain uncertain. I When then should systems be fixed if this involves a known cost but unknown cost of faulty system? I Now observing a system failure can be beneficial as, although means they are more likely, they might be seen to be not as disastrous as was expected. I Decision to replace less reliable system can now be postponed as DM learns of utility for that reliability level. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Research Needed Research Needed Since the initial work of Cyert & DeGroot in 1975 little attention has been paid to developing adaptive utility. In particular there has been little or no attention paid to: I What is the interpretation of a utility parameter and how is it meaningful to compare different utility functions? I Can one develop a proof that adaptive utilities exist and that the DM should seek to maximise adaptive utility? I What are the implications for decision selection strategy? I What are effects on diagnostics such as value of information and risk aversion? Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Interpretation of Utility Parameter Interpretation of Utility Parameter I Let a DM’s state of mind θ characterise the true utility function. I Assume θ uncertain so as to represent uncertainty of utility function. I Each possible value for θ will induce a different utility function and hence represent different preferences over rewards and decisions. I Represent in notation as a conditioning argument u[·|θ]. I Model beliefs over θ through a prior probability distribution. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Adaptive Utility Functions Adaptive Utility Functions I We now have a set of possible utility functions u[·|θ1 ], u[·|θ2 ], . . . I Define an adaptive utility function ua (·) to be expected value of u[·|θ] with respect to beliefs about θ. Formally, ua (·) = Eθ u[·|θ] I I Repeated application of von Neumann and Morgenstern proves existence and uniqueness of function. I Furthermore, a DM should seek to choose decision with maximum expected adaptive utility. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Commensurable Utility Functions Commensurability I However, this requires it be meaningful to compare utility values conditioned on differing values of θ. I A classical utility function is only unique up to a positive linear transformation. I We need to scale classical utilities so they are commensurable, i.e., so that it is meaningful to make comparisons between terms such as u(r1 |θ1 ) and u(r2 |θ2 ). I Boutilier (2003) showed this was possible under assumption of extremum equivalence. I Demonstrated commensurability can even be achieved without extremum equivalence. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Surprise and Selection Strategy Surprise and Selection Strategy The effects of adaptive utility and accepting preferences are uncertain, but that they may be learned of, are: I No difference in a one-off decision. I Can model a situation where a DM is pleasantly surprised by outcome or upset by outcome. I Depending on length of decision sequence and prior beliefs, DM should select decisions that are likely to lead to unfamiliar outcomes. I This should be done even if DM expects that outcome will be less favourable than another. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Trial Aversion and Value of Information Trial Aversion and Value of Information The interpretations of utility diagnostics will now be different: I How much should DM pay for information about likely outcome of a decision if uncertain of preferences? I No general connection between value of information and level of utility uncertainty, but depends on individual case. I Trial Aversion is analogous to Risk Aversion (preference or avoidance of actuarially fair bets). I Consider u(r |θ) as a function of θ only. I Would call DM trial averse in region that u(r |θ) is concave. I Purely a diagnostic as never get to choose distribution over θ. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Example Example Return to Example and assume two possible utility functions: I u[(c, s)|θ] = c + θs I θ = θ1 = 1 with probability 0.5 otherwise θ = θ2 = 2. I Assume wish to maximise sum of utility over two periods. I Assume c1 = 2.3, s1 = 1 and s2 = 2 so that in a one off decision DM should stick with service X . Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Example Example I Information z about θ comes from observing either positive surprise or negative surprise from utility return after first period. I Let z = 1 for positive surprise and z = 0 for negative surprise. I If service X is chosen utility known for sure and no information gained. I If service Y is chosen we learn about θ. I Assume probability that Y truly leads to performance increase s if it was previously seen to do so is 0.9. I If service Y chosen we thus also learn of its true performance increase. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Example Influence Diagram (c,s) X or Y (c,s) X or Y z q Brett Houlding Decision Making with Uncertain Preferences u Outline Statement of Problem Adaptive Utility Example Example Through use of Bayes’ Theorem we find: I P(θ = 1|z = 1) = 0 I P(θ = 1|z = 0) = 2/3 I P(z = 1) = 1/4 I P(z = 0) = 3/4 Brett Houlding Decision Making with Uncertain Preferences Example Future Options Outline Statement of Problem Adaptive Utility Example Future Options Example Example Through dynamic programming we find: I Should choose service Y in first period even though under initial beliefs service X seems better. I Should choose service Y again in second period only if it was seen to lead to the larger increase in performance, otherwise choose service X . I This trivial example is equivalent to solving a decision tree with 25 end nodes. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Future Options Computational Issues I Adaptive Utility permits a DM to remain uncertain over preferences. I Bayesian sequential decision theory notoriously computationally intensive. I Suffers from so-called “curse of dimensionality” associated with numerical backward induction with large state space. I Involves solving a nested sequence of maximizations and expectations over multi-dimensional spaces. I Now also must incorporate extra computations for utility parameter and its associated learning. I Some integrals can not be found in close form. Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Future Options Computational Issues McDaid and Wilson (2001) on classical Bayesian sequential decision theory for testing of software: I “other testing plans are more reasonable ... but often difficult to implement because the expressions for the expected utility become difficult to calculate” I “The most desirable testing plan is the sequential test ... but its solution is not tractable” Brett Houlding Decision Making with Uncertain Preferences Outline Statement of Problem Adaptive Utility Example Future Options Future Options Future Options This work focuses on foundations and is not yet at application stage. Due to intense computation cost for application on interesting problems the further challenges include: I Identification of utility forms which are reasonable at modelling preferences and lead to tractable computations. I Investigation of simple and accurate techniques for eliciting necessary information concerning beliefs and possible preferences. Brett Houlding Decision Making with Uncertain Preferences
© Copyright 2026 Paperzz