8. Decision Making Under Uncertainty

8. Decision Making Under Uncertainty - Games Against
Nature
Up to now randomness (uncertainty) has not been directly taken
into account (except in life insurance).
For example, the risk of a loan not being repaid was measured in
terms of an appropriate increase in the interest.
The resulting calculations were carried out as if everything were
deterministic, but by applying a different interest rate.
1 / 32
Decision Making Under Uncertainty - Games Against
Nature
When a problem is assumed to be deterministic, it is clear that we
should maximise profits or minimise costs.
However, when profits (or losses) are random, then various criteria
are used for decision making, which reflect individuals’ attitude
towards risk.
2 / 32
Criteria for Decision Making Under Uncertainty
1. The minimax criterion.
2. The Savage (regret) criterion.
3. The Hurwicz criterion.
4. The Laplace criterion.
5. Maximisation of expected payoff.
3 / 32
Criteria for Decision Making Under Uncertainty
Suppose one should choose between
A. $40 for sure.
B. $100 if a coin toss results in heads, otherwise $0
Hence, the payoff of the individual my depend on the outcome of a
random experiment, here the result of a coin toss.
4 / 32
The Minimax Criterion
Using the minimax criterion, we make the decision which gives us
the optimal ”worst case scenario”.
Thus if xmin,D is the minimum payoff that is possible under action
D, then an individual makes the decision that maximises this
payoff.
In this case, by taking decision A, the decision maker’s minimum
(guaranteed) payoff is $40.
By taking decision B, the worst possible outcome is that the
decision maker gets nothing.
5 / 32
The Minimax Criterion
Hence, using the minimax criterion, the decision maker should take
decision A.
Note: When costs are considered, let cmax,D be the maximum
possible cost when decision D is taken (i.e. the worst possible
scenario).
Using the minimax criterion, the decision maker takes the decision
which minimises this ”worst case scenario” cost.
The minimax criterion is generally very conservative (risk averse).
For example, if the payoff under decision A were changed to $0.01,
the minimax criterion still recommends decision A.
6 / 32
The Savage (Regret) Criterion
This is a less conservative approach to decision making.
When payoffs are considered, the regret from making a decision
given the outcome of an experiment is the different between the
maximum payoff obtained given the outcome of the experiment
and the payoff actually obtained.
For example, when the coin toss results in tails, if the decision
maker takes the decision A (and thus gets $40), this is the
maximum possible payoff (i.e. the regret in this case is 0).
In this case, when the decision maker takes the decision B (and
thus gets $0), his regret is 40 (the difference between the payoff he
gets and the maximum possible given the result of the experiment).
7 / 32
The Savage (Regret) Criterion
On the other hand, when the result of the coin toss is heads, B is
the best possible decision, obtaining a payoff of $100.
Hence, in this case the regret from making decision B is 0.
In this case, by taking the decision A, the decision maker gets $40
rather than the maximum possible $100.
Hence, the regret from taking decision A in this case is 60.
The following two slides illustrate the minimax and Savage criteria.
8 / 32
Minimax Criterion
A
B
Tails
40
0
Heads
40
100
Min.
40
0
Note that since we are considering payoffs, the worst situation for
a given decision corresponds to the lowest payoff.
9 / 32
Savage Criterion
A
B
Tails
40
0
Heads
40
100
Regret (T)
0
40
Regret (H)
60
0
Max. Regret
60
40
Note that regret is a cost, the worst situation for a given decision
corresponds to the highest regret.
10 / 32
The Savage (Regret) Criterion
Under the Savage (Regret) Criterion, a decision maker should
make the decision which minimizes the maximum possible regret.
The maximum possible regret given decision A is 60.
The maximum possible regret given decision B is 40.
Hence, under the Savage (Regret) criterion, the decision maker
should take action B.
Note: It should be noted that when costs are considered, the
regret felt when an individual takes an action is equal to the cost
actually incurred minus the minimum costs possible given the
outcome of the experiment.
11 / 32
Relation Between the Savage (Regret) Criterion and the
Minimax Criterion
It should be noted that whatever way the original problem is
formulated in (using payoffs or costs), regret is a ”cost”.
The Savage criterion suggests minimising the maximum possible
regret.
Hence, the Savage criterion applies the minimax criterion to the
regret scores.
12 / 32
The Hurwicz Criterion
The Hurwicz criterion takes into account both the best and worst
possible outcomes.
The best outcome is weighed by a factor of α, 0 < α < 1, where α
can be interpreted as the index of optimism.
The worst outcome is weighed by a factor of 1 − α.
13 / 32
The Hurwicz Criterion
When payoffs are considered, a decision maker should maximise
the Hurwicz score of the decision taken, where the score for
decision D is given by
HD = αxmax,D + (1 − α)xmin,D ,
where xmax,D and xmin,D are the maximum and minimum,
respectively, possible payoffs under the action D.
Under action A, the decision maker always gets $40, thus
xmax,D = xmin,D = 40.
Under action B, xmax,D = 100, xmin,D = 0.
14 / 32
The Hurwicz Criterion
Suppose the index of optimism is α = 0.3. It follows that
HA =0.3 × 40 + (1 − 0.3) × 40 = 40
HB =0.3 × 100 + (1 − 0.3) × 0 = 30.
Hence, the decision maker should take action A.
15 / 32
The Hurwicz Criterion
When costs are considered, the Hurwitz score of decision D is
given by
HD = αcmin,D + (1 − α)cmax,D ,
where cmax,D and cmin,D are the maximum and minimum,
respectively, possible costs under the action D.
Note that the coefficient α is always associated with the best
situation (maximum payoff or, as in this case, least costs).
In the case of costs, the decision maker should minimise the
Hurwicz score of the decision taken.
16 / 32
The Hurwicz Criterion
When α = 0 (i.e. the decision maker is very pessimistic), the
Hurwicz criterion corresponds to the minimax criterion.
When α = 1 (i.e. the decision maker is very optimistic), the
Hurwicz criterion corresponds to choosing the action which gives
the maximum possible payoff of all those possible (minimum
possible cost).
In this way, we can model a range of decision makers, from the
most pessimistic to the most optimistic.
17 / 32
The Laplace Criterion
The Laplace criterion states that in the absence of any other
information, we should assume that the possible outcomes of the
experiment are equally likely.
This is the principle of insufficient information
In this case, we maximise the average of the possible payoffs for a
given decision (this is the expected payoff under the above
assumption).
In this case the expected payoff from decision B is
0+100
2
= 50.
The guaranteed payoff from taking decision A is 40.
18 / 32
The Laplace Criterion
Hence, the decision maker should take decision B.
When costs are considered, under the Laplace criterion, we
minimise the average of the possible costs.
19 / 32
Maximisation of Expected Payoff
However, in many cases we may have sufficient information to
estimate the probabilities of various outcomes from previous
experience
e.g. the results of coin tosses, die rolls, the probability of rain on a
day in June.
In this case, it makes sense to use this information, in order to
make a more informed decision.
20 / 32
Maximisation of Expected Payoff
Given that the coin is fair, then the Laplace criterion is equivalent
to the maximisation of the expected payoff (or minimisation of
expected costs, when costs are considered).
More generally, suppose that the probability of heads is p.
The expected reward from decision B is thus
100p + 0(1 − p) = 100p.
Thus the decision maker should take decision B when 100p > 40,
i.e. p > 0.4.
21 / 32
Decision Making Under Uncertainty
When the possible results of an experiment form a discrete set
(e.g. the result of a die roll, coin toss, winner of an election), then
the possible payoffs can be presented in a matrix.
Each row corresponds to a decision.
Each column corresponds to an outcome of the experiment.
22 / 32
Example 8.1
Suppose the time taken to travel to work using one of three modes
of transport: A, B, C , in three different traffic conditions: Good,
Average, Poor, are given below
A
B
C
Good
10
30
40
Average
30
40
40
Poor
90
60
40
23 / 32
Example 8.1
Derive the decision that should be taken under
1. The minimax criterion.
2. The Savage (regret) criterion.
3. The Hurwicz criterion with optimism index α = 0.4.
4. The Laplace criterion.
5. Minimisation of expected cost, given that the
probability of good and poor conditions are both
equal to 0.2.
24 / 32
Example 8.1
25 / 32
Example 8.1
26 / 32
Example 8.1
27 / 32
Example 8.1
28 / 32
Example 8.1
29 / 32
Example 8.1
30 / 32
Example 8.1
31 / 32
Example 8.1
32 / 32