Cardinal Scales for Health Evaluation

Decision Analysis
informs
Vol. 7, No. 3, September 2010, pp. 256–281
issn 1545-8490 eissn 1545-8504 10 0703 0256
®
doi 10.1287/deca.1100.0181
© 2010 INFORMS
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Cardinal Scales for Health Evaluation
Charles M. Harvey
Department of Decision and Information Sciences, University of Houston, Houston, Texas 77204,
[email protected]
Lars Peter Østerdal
Department of Economics, University of Copenhagen, DK-1353 Copenhagen K, Denmark,
[email protected]
P
olicy studies often evaluate health for an individual or for a population by using measurement scales that
are ordinal scales or expected-utility scales. This paper develops scales of a different type, commonly called
cardinal scales, that measure changes in health. Also, we argue that cardinal scales provide a meaningful and
useful means of evaluating health policies. Thus, we develop a means of using the perspective of early neoclassical welfare economics as an alternative to ordinalist and expected-utility perspectives.
Key words: health scales; population health; cardinal utility; neoclassical welfare economics; social welfare;
preference intensity
History: Received on October 2, 2009. Accepted on April 14, 2010, after 2 revisions. Published online in Articles
in Advance June 17, 2010.
1.
Introduction
called a difference relation, an intensity relation, or a
strength-of-preference relation, and a cardinal scale is
also defined as any scale that is unique up to a positive linear transformation, e.g., an expected-utility
function. Thus, it is not possible to adopt definitions
that agree with all those in the literature.
Each of the three types of binary relations may or
may not satisfy conditions that imply the existence
of a scale that represents the relation. For example, a
so-called lexicographic ordinal relation has no ordinal
scale. A disadvantage of the terms cardinal relation
and cardinal scale is that they do not indicate this
important distinction.
This paper presents models that contain cardinal
scales for the health of an individual and for the
health of members of a population. In the first part,
we argue that changes in individual health and in
population health can be meaningfully compared in a
policy study of population health, and thus cardinal
relations have an operational meaning in such a context. And as a stronger point, we argue that changes
in population health are often the relevant objects to
compare in such a study as a means to compare the
effectiveness of policy options.
In the second part of this paper, we develop cardinal scales for the health of an individual. Here we
A policy analyst who wishes to evaluate the effects
of a social policy on the health of an individual or
within a population has a variety of measurement
scales and supporting procedures to choose from.
Typically, the scale is an ordinal-utility scale or an
expected-utility scale.
Both types of scales represent binary relations that
compare health alternatives; an ordinal-utility scale
represents a relation on health outcomes such that
greater scale amounts correspond to better outcomes,
and an expected-utility scale represents a relation on
probability distributions of health such that greater
expected values of scale amounts correspond to better
distributions.
This paper develops models and procedures for a
different type of binary relation and scale; here, better
changes in health correspond to greater differences in
scale amounts. A relation that compares changes in
outcomes is commonly called a cardinal relation, and a
scale that represents a cardinal relation in this manner
is commonly called a cardinal scale.
But other terminology is used with these meanings,
and this terminology is used with other meanings. For
example, a cardinal relation (as defined here) is also
256
Harvey and Østerdal: Cardinal Scales for Health Evaluation
257
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
consider health outcomes of two types: health states
and health–duration pairs. By a health state, we mean
the health of a person, measured by one or more
variables. And by a health–duration pair, we mean a
health state and, as additional information, the duration of the health state. We do not consider health
outcomes that are sequences of health states or that
are streams of health states over an interval of time.
We present models in which a cardinal relation on
a person’s health outcomes satisfies certain conditions
if and only if it is represented by a cardinal scale that
has a certain form. For example, we show that a cardinal relation on health–duration pairs satisfies certain conditions if and only if it has a cardinal scale
that is a product of a cumulative discounting function and a scale defined on health states. An open
question is whether parallel results can be obtained
for health outcomes that are sequences of health
states or streams of health states over a bounded or
unbounded interval of time.
The third part of this paper discusses the aggregation of cardinal scales for the members of a population into a cardinal scale for the population as a
whole. We adapt to the context of health a model of
Harvey (1999) in which a cardinal relation for population welfare is represented by a cardinal scale that is
a weighted sum of cardinal scales for the population
members. This model is parallel to the well-known
model of Harsanyi (1955) that aggregates expectedutility functions for the members of a population into
an expected-utility function for the population.
The third part of this paper also discusses conditions on social values that imply simplifications in a
cardinal health model, e.g., equal weights, and it discusses assessment procedures by which a policy analyst can construct cardinal scales for individual and
population health in such a model.
The fourth part of this paper discusses procedures
by which a model that has been constructed can be
used to evaluate a change in population health, for
example, a change that is predicted to result from a
policy option. In brief, a policy analyst can calculate a
change in population health that is indifferent to the
given change and that is much simpler to report. Also,
he can compare the effectiveness of policy options by
comparing these simple equivalent changes. A utility
scale that is not a cardinal scale cannot provide such
an evaluation or comparison of policy options.
This paper concludes with a hypothetical application that evaluates a proposed therapy for the reduction of hypertension, commonly known as high blood
pressure.
The interest throughout this paper is prescriptive.
The models and procedures are intended for studies
of population health that are concerned with valuing
as well as predicting the effects of alternative policies. As in the case of expected-utility, there surely are
many types of violations of the conditions on values
that are presented here. That important subject we do
not address.
Most of the analytic results in this paper, as well as
in Harvey (1999), are based on work by the Danish
mathematician J. L. W. V. Jensen (1905, 1906). Proofs
of results are in the appendix.
2.
Meaningful Interpretations of a
Cardinal Relation
This section and the next discuss cardinal relations
and cardinal scales, respectively. These discussions
are intended as a background, although a very incomplete one, for the models and procedures presented in
later sections.
Pareto (1896) and Fisher (1918) pointed out long
ago that the term “utility” had two different meanings: (i) an older, hedonic meaning in utilitarianism
and classical welfare theory as the degree of pleasure and pain (or welfare, well-being, etc.) that a person experiences as part of a specified consequence, and
(ii) a newer, choice-oriented meaning in consumer
theory as the degree to which a consequence satisfies a person’s preferences. Pareto gave an illustration
in which the two meanings differ: a child may experience better health by taking medicine even though
the child much prefers not to take the medicine.
This distinction in the meaning of utility is logically
independent of that between a cardinal relation and an
ordinal relation. Historically, however, the two distinctions have sometimes been confused. An insufficient
recognition of the distinction in the meaning of utility
made by Pareto (1896) and Fisher (1918) (and a disagreement as to whether cardinal comparisons were
needed in economics) led to a schism between early
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
258
neoclassical welfare economists, who used hedonic
utility and cardinal relations, and so-called new welfare economists, who used choice-oriented utility and
ordinal relations. Cooter and Rappoport (1984, p. 507)
note that “[t]he term ‘ordinalist revolution’ refers to
the rejection of cardinal notions of utility and to the
general acceptance of the position that utility was not
comparable across individuals.” In recent years, there
has been a renewed interest in the hedonic meaning
of utility; see, e.g., Broome (1991a, b; 2004), Kahneman
et al. (1997), Kahneman (2000), Loewenstein and Ubel
(2008), and Dolan and Kahneman (2008).
In this paper, cardinal relations can have either a
hedonic or a choice-oriented interpretation. Below, we
argue that either interpretation is meaningful in a
study of population health—and thus one should be
able to use cardinal relations with either interpretation as a part of the study.
Explicitly or implicitly, a study of population health
assumes value judgments concerning health, often
called social values. Judgments that compare changes
in a person’s health can be made by the person himself or they can be made by someone else (an expert,
an agency, or an idealized person). Such an entity
often is called a social planner. Usually but not always,
the social planner will follow the hedonic evaluations
or the preferences of the affected individuals.
If a cardinal relation on changes in a person’s health
is regarded as comparing changes in what the person experiences, then the comparisons are possible in
principle, no matter how difficult they may be in practice. And if a cardinal relation on changes in health
for a population has a similar hedonic meaning, then
the comparisons of changes are possible in a similar
fashion.
But if a cardinal relation on changes in health is
regarded as the preferences of a person or a group, then
an important question is whether or not the comparisons are meaningful. Here, we argue that a cardinal
relation regarded as preferences can have meaning.
Other arguments for meaningfulness are presented in
von Winterfeldt and Edwards (1986, pp. 208–211).
The argument that cardinal relations as preferences
are not meaningful has two parts. The first part is to
observe that an individual facing a decision is at a
single initial position. Changes from that position can
Harvey and Østerdal: Cardinal Scales for Health Evaluation
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
be identified with their final outcomes, and comparisons of changes are not needed. It suffices to observe
the behavior of the individual in choosing outcomes.
The second part of the argument is as follows.
A classic technique of economic analysis is to specify
a set of alternatives to be compared so that it contains
alternatives that are not possible but are similar in
structure to the possible alternatives. Then, a binary
relation can be extended, at least in principle, from the
set of possible alternatives to the larger set of alternatives. But outcomes and changes in outcomes do not
have a similar structure; thus, it is not meaningful to
extend a relation from a set of outcomes to a set of
both outcomes and changes.
We think that this argument is valid but that for
prescriptive studies of population health it does not
preclude the meaningfulness of cardinal relations. To
explain, we must first describe what we mean by
such a study. We mean a study whose purpose is to
examine policy options. Part of the study is empirical, describing and predicting the consequences of
the options. And part is normative, assigning values to the consequences. The values may be hedonic, e.g., percents of infections prevented, or they may
be choice oriented, e.g., preferences in the form of
trade-offs between categories of injury. In either case,
the values are prescriptive: when hedonic, they reflect
choices by the policy analyst as to which harms and
benefits to include, and when choice oriented, they
are preferences that the analyst has assigned by a process of assumption and assessment (which the analyst
should make explicit) rather than by a direct reporting of choice behavior.
The actions of the policy options may be interventions in a single population (including the option of
nonintervention), they may be a single intervention
in many populations (including the case of a population at different times), or they may involve multiple populations and multiple types of intervention.
The consequences of the interventions are the changes
in health that they are estimated to have effected or
are predicted to effect. Often the initial distribution of
health in a population will not be uniform, i.e., not
everyone has the same initial health. And often when
more than one population is involved (as in the hypothetical application in §10), the populations will not
have the same initial distribution of health.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
259
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
Consider the second part of the argument that cardinal relations as preferences lack meaning, namely,
the argument that a relation cannot be extended from
outcomes to changes in outcomes. For studies of population health, the argument is valid but not relevant;
the consequences to be compared are changes. There
is no question of extending a relation that compares
outcomes to one that compares changes, that is, of
extending an ordinal relation to a cardinal relation.
The first part of the argument, like the second, is
valid but not relevant. Even when there is a single population, most likely the individuals are not
at the same initial position; their changes in health
will entail different initial positions as well as different final positions. And when there is more than one
population, then most likely the initial distributions
of health in the populations are different; changes in
distributions of health will entail different initial distributions of health as well as different final distributions of health.
The preferences in a prescriptive study of population health are not descriptive; they are introduced
according to a process other than direct observation of choice behavior. The process constitutes their
meaning, and the above arguments fail to deny that
meaning.
Actually, we take a stronger position than that
argued for above. We believe that cardinal relations
as preferences are meaningful even when there is
only one population and one initial distribution of
health. We will argue for this position indirectly by
making an analogy with preferences between uncertain outcomes. Suppose that a decision analyst wishes
to construct a prescriptive model of preferences for
a situation in which the consequences are prospects
having two outcomes, the first being a specified status quo position and the second being an outcome
in a specified set. To make the situation less realistic
but a better analogy, assume that the two outcomes
are equally likely. Here, the prospects can be identified with their second outcomes, and comparisons of
prospects are not needed. Perhaps the decision analyst will judge that an ordinal-utility model that compares the second outcomes is appropriate, or perhaps
he or she will judge that an expected-utility model
that compares probability distributions is appropriate.
But we doubt that the analyst would like to be told
that an expected-utility model is meaningless.
The motivating question for this paper is not
whether cardinal relations as preferences are meaningful; rather, it is whether they are useful in prescriptive models on population health. We believe that the
answer will depend on the situation that a policy analyst faces and that it also will and should depend on
the proclivities of the analyst. The goal of this paper,
and the focus of the following sections, is to provide
a few tools for an analyst who chooses to use cardinal
relations.
3.
Cardinal Scales
Suppose that h denotes a health outcome of any
type for an individual or for a population, and that
H denotes a set of health outcomes h. Suppose also
that a change from a health outcome h to a health
outcome h is denoted by h → h . We will refer to comparisons of these changes as preferences even though
they may also have a hedonic interpretation. A statement that a change h → h is at least as preferred as a
change k → k will be denoted by h → h k → k .
Cardinal relations and cardinal scales will be
defined as follows. A set of statements, h → h k → k , where the health outcomes are in a set H , will
be called a cardinal relation and will be denoted by .
We will say that is defined on the set H . Other
types of comparisons, e.g., “is preferred to” and “is
indifferent to,” will be defined in terms of . And a
function wh defined on H will be called a cardinal
scale for provided that h → h k → k if and only
if wh − wh ≥ wk − wk for any health outcomes
in H .
A cardinal relation induces a relation, called an
ordinal relation, that compares outcomes themselves.
Such a relation ord can be defined by h ord h if and
only if h → h h → h. A cardinal scale wh for is
an ordinal scale for ord (because h ord h if and only
if wh − wh ≥ wh − wh if and only if wh ≥
wh), but not every ordinal scale for ord is a cardinal
scale for . An ordinal scale is ordinally unique; that
is, if wh is an ordinal scale, then a function w ∗ (h
is also if and only if there exists a strictly increasing
function f w such that w ∗ (h = f wh for h in H .
As in the case of expected utility, a cardinal relation must satisfy certain conditions in order to have a
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
260
cardinal scale. The condition that corresponds to the
independence principle is often called dynamic consistency. It states that h → h k → k and h → h k → k imply h → h k → k for any health outcomes in the set H. Various models for the existence
of a cardinal scale have been constructed; see, e.g., Alt
(1936), Debreu (1960), Scott (1964), Pfanzagl (1968),
and Krantz et al. (1971). Except for the model of Scott
(1964), the conditions also imply that the cardinal
scale is cardinally unique; that is if wh is a cardinal
scale, then a function w ∗ h is also if and only if there
exist constants a > 0 and b such that w ∗ h = awh + b
for h in H .
The models of Alt (1936), Debreu (1960), and
Pfanzagl (1968) assume that the set H is topologically
connected in a given topology, and they establish that
a cardinal relation satisfies the conditions if and only
if it has a continuous cardinal scale, that is, a continuous function wh. Because H is connected, the range
of such a function is an interval.
It will simplify our models to assume that there
exist health outcomes that are not indifferent. This
will be the case if and only if any cardinal scale has a
nonpoint range.
In a policy study, the analyst should determine the
appropriateness of conditions that imply the existence
of a cardinal scale having the properties described
above. However, for the purpose of developing the
models in this paper, we will directly assume these
implications.
Definition 1. A cardinal or ordinal relation will be
called proper provided that it has a topologically continuous scale whose range is a nonpoint interval. Any
such scale will be called proper.
Lemma 1. If a cardinal relation is proper, then, (i) the
cardinal scales for are cardinally unique, (ii) any cardinal scale for is proper, and (iii) the relation ord is
proper.
If the health outcomes in a set H are described
by one or more variables whose domains are intervals, then the set H is (topologically) connected. And
if H is connected and satisfies conditions that
imply the existence of a continuous, nonpoint cardinal scale, then is proper. Here, the assumption
that the set H is connected can be weakened. If the
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
outcomes in H are described by one or more variables whose domains are intervals and (possibly) also
by one or more variables whose domains are finite
and have sufficiently fine gradations, then H has a
more general property, called “preferential connectedness.” Harvey (2008) shows that the assumption of
connectedness can be weakened to that of preferential
connectedness.
4.
Health States
This section and the next present models of cardinal
scales that evaluate changes in health for an individual. In this section, a health outcome is a state of a
person’s health. For example, the health outcome may
be a morbidity or an injury, or it may be the condition
of the person’s health in general. A health outcome
defined in this manner will be called a health state.
In a policy study, the health states are described
by one or more health variables (often called ratings,
dimensions, health scales, etc.). A health variable may
have either an interval domain or a finite domain. Typically, the set of health states to be compared is defined
as the product set of the domains of the variables.
However, we will not require this condition here.
In the discussion below, the term “ordinal scale”
will mean any scale (including an expected-utility
scale) that compares health states and has not been
verified to be a cardinal scale. Suppose that a policy
analyst wishes to construct a cardinal scale for a set
of health states. The method that we develop is for
the analyst first to obtain an ordinal scale and then
to either verify that the ordinal scale is cardinal or to
construct a cardinal scale that is based on the ordinal
scale.
An advantage of this approach is that it enables the
analyst to separate two different issues of preferences.
The ordinal scale represents the issue of trade-offs
between the health variables, e.g., for a given decrease
in one variable, how much increase in another variable is needed to produce an indifferent health state.
The cardinal scale can then be defined on the single ordinal scale, rather than on the health variables. It represents the issue of preference intensity
between the values of the ordinal scale, e.g., for a
given increase and a given third value, how much
does the third value need to increase to produce a
change that is indifferent to the given increase.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
261
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
A second advantage of the approach is that a wide
variety of ordinal scales are available for the analyst
to use. Often, such a scale is called “health-related
quality of life” or “quality of life” (see, e.g., Gold et al.
1996, Dolan et al. 1996, Dolan 2000). For brevity, we
will use the term quality scale to refer to any ordinal
scale when the health outcomes are health states.
Quality scales have been developed both for morbidity and trauma. Examples of this type of scale are
the Functional Capacity Index, the Quality of Wellbeing Scale (Kaplan and Anderson 1996), the Health
Utilities Indices (Torrance et al. 1982, 1995, 1996;
Feeny et al. 1996; Furlong et al. 1998), the Health and
Activities Limitation Index, the SF-36 metric for health
status measurement, and the EuroQoL quality of life
scales (Kind 1996, Dolan 1997, Richardson et al. 2001).
Procedures to assess a quality scale have been
developed, either as part of a health study or as
a separate undertaking. Examples of these assessment procedures (commonly called scales) include
time trade-off scales, standard gamble scales, person
trade-off scales, and rating scales; see, e.g., Torrance
(1976, 1986) and Dolan et al. (1996).
Below we develop models of cardinal scales that
depend on a predetermined quality scale. The quality scale can be an established scale, a scale that is
constructed as part of a study, or a health variable
if health states are described by a single variable.
A health state will be denoted by s, a set of health
states will be denoted by S, and a quality scale will
be denoted by qs.
Theorem 1. Suppose that a cardinal relation for a
set S of health states has a proper quality scale qs. Then,
is proper if and only if it has a cardinal scale of the
form ws = f qs), where f q is a continuous, strictly
increasing function defined on the range of qs.
A function f q as described in Theorem 1 will be
called a conversion function. It converts an ordinal scale
q to a cardinal scale f q, i.e., a scale in which differences are meaningful. Differences in the scale q will
be meaningful if and only if q is a cardinal scale itself.
Theorem 1 provides a means of reducing the task
of constructing a cardinal scale ws that is defined
on multivariable health states to that of constructing
a conversion function f q that is defined on a single
variable qs. The conditions below imply (and are
implied by) the existence of conversion functions f q
that have special forms, and thus are relatively simple
to construct.
Definition 2. A health state ŝ will be called middling health (or middling) relative to two health
states s and s provided that the changes s → ŝ and
ŝ → s are indifferent.
Cardinal neutrality for a quality scale qs. For any
health states s ŝ, and s , if qŝ = 12 qs + 12 qs , then ŝ
is middling relative to s and s .
Cardinal constancy for a quality scale qs. If
a health state ŝ is middling relative to two health
states s and s , and t t̂, and t are health states with
qt = qs + q, qt̂ = qŝ + q, and qt = qs + q,
for some number q, then t̂ is middling relative to t
and t .
Theorem 2. Suppose that is a proper cardinal relation, and qs is a proper quality scale for a set S of health
states. Then,
(i) has cardinal neutrality for qs if and only if qs
is a cardinal scale for ;
(ii) has cardinal constancy for qs if and only if has a cardinal scale of the
 form

r > 0
erqs

(1)
ws = qs
r = 0


−erqs r < 0
for some amount of the parameter r. For a health state ŝ
that is middling relative to two health states s and s , its
health quality qŝ is greater than, equal to, or less than
the average health quality, 12 qs + 12 qs , when r is greater
than, equal to, or less than zero, respectively.
The hypothetical application in §10 concerning
hypertension illustrates the use of cardinal constancy.
To oversimplify somewhat, here the quality scale is
blood pressure and the cardinal scale is the risk of
heart disease.
The proof of part (i) is based on Jensen’s (1905, 1906)
functional equation, which in the present notation is
f 12 q + 12 q = 12 f q + 12 f q . The proof of part (ii) is
based on a similar functional relationship introduced
in Harvey (1990) for models of risk attitudes.
5.
Health–Duration Pairs
In this section, a health outcome consists of a single health state and its duration. A health outcome
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
262
defined in this manner will be called a health–duration
pair. The duration of a health state can be defined as
the time from a specified event, e.g., an accident or
the onset of a disease, in which case different individuals in a population will have different initial times,
or it can be defined as the time from a common initial
time, e.g., a specified present time.
A health state will again be denoted by s, and its
duration in years will be denoted by t. Thus, a health–
duration pair will be denoted by (s t). A quality scale
will again be denoted by qs.
Most models of health–duration pairs are expectedutility models; see, e.g., Bleichrodt et al. (1997) for a
discussion of the advantages of such models. The simplest utility functions are of the form us t = tqs,
where the linear factor t represents risk neutrality
toward duration. In these models, the utility us t
of a health–duration pair (or the expected utility of a
probability distribution) is an amount that represents
an equivalent duration in a state of optimal health;
this amount is called “quality-adjusted life years”
(QALYs) or “healthy-years equivalents” (HYEs).
We regard the various QALY models due to Pliskin
et al. (1980) as the basic expected-utility models
for health–duration pairs. Alternative conditions
on preferences are presented in Bleichrodt et al.
(1997), Miyamoto et al. (1998), and Hazen (2004) for
expected-utility models, and in Doctor and Miyamoto
(2003) and Østerdal (2005) for deterministic models. QALY models for health–duration pairs are also
discussed in Loomes and McKenzie (1989), Broome
(1993), Johannesson et al. (1994), Gold et al. (1996),
and Dolan (2000).
Using an ordinal-utility approach, Mehrez and
Gafni (1989) developed QALY/HYE models. They
proposed a procedure in which a health–duration
pair is measured by assessing an indifferent health–
duration pair in which HYE years (i.e., the healthyyears equivalent) of optimal health are followed by
death. See also, e.g., Gafni et al. (1993), Loomes (1995),
and Johannesson (1995).
Describing health outcomes as health states or
as health–duration pairs is a simplification. It may
be appropriate because of data limitations, but it
entails assumptions (whether stated or not) regarding what occurs after the health state. In this section,
Harvey and Østerdal: Cardinal Scales for Health Evaluation
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
we assume that a common health state occurs afterward. An alternative assumption would be to assume
that what occurs afterward is stochastically independent of the health state; in §9, we examine a related
assumption of stochastic independence for a population of individuals.
This section develops cardinal scales for a cardinal
relation concerning health–duration pairs. We introduce various conditions on and its associated ordinal relation, and show that they imply (and are implied by) the existence of cardinal scales for that
have various properties or special forms. Throughout,
we assume that is a proper cardinal relation defined
on a product set S × T , where S is a set of health states
as in the previous section and T is a set of durations.
Suppose that s0 denotes the health state that occurs
after the health state in a health–duration pair. The
state s0 will be called the standard state. It may be
specified as, e.g., death, optimal health, or a return
to previous health. We assume that s0 is in the set S
of health states, and we make no assumption as to
whether S contains health states that are better than
or worse than s0 . In particular, when s0 is a state
of death, the set S can contain health states that are
worse than death. For expected-utility models having
states worse than death, see Miyamoto et al. (1998).
Ethical arguments have been made for an “equal
value of life” principle that society should not compare fatalities with nondeath health states; see, e.g.,
Harris (1987) and Nord (2001). In the case that s0
is a state of death, this principle is not consistent
with the models presented here. For reasons why
social preferences among health outcomes should
depend on degrees of health and duration; see, e.g.,
Singer et al. (1995), Williams (1997), and Hasman and
Østerdal (2004).
Health–duration pairs of the form (s0 t describe
the same health outcome because the state s0 occurs
after the time t. Thus, pairs of the form (s0 t should
be modeled as indifferent.
Moreover, for a fixed duration t, the cardinal relation defines a cardinal relation S t for health states
by s1 → s2 S t s3 → s4 provided that (s1 t → s2 t s3 t → s4 t). When the relations S t are the same
for any duration t, they can be regarded as a single
cardinal relation. We will denote such a cardinal relation by S and say that induces the relation S .
Harvey and Østerdal: Cardinal Scales for Health Evaluation
263
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
These two comments can be formalized as the following conditions on a cardinal relation :
(A) Any two health–duration pairs with the standard state s0 are (ordinally) indifferent.
(B) The relation restricted to health–duration
pairs with a fixed duration t is independent of t, and
the relation S induced by is a proper cardinal
relation.
Condition (B) is not appropriate when the set T
contains the duration t = 0. For it seems natural to
assume that any pairs (s 0) are indifferent. For positive durations, however, (B) seems appropriate, at
least for prescriptive uses in population health studies. In medical decision making, (B) can be violated by
the stated preferences of a patient; see, e.g., Miyamoto
et al. (1998).
Lemma 2. Suppose that is a proper cardinal relation
for a set S × T of health–duration pairs. Then, satisfies
conditions (A) and (B) if and only if there exists a cardinal
scale for of the form
ws t = atgs
(2)
where at is a positive, continuous function on the set T
of durations, and gs is a continuous function on the set
S of health states such that gs has an interval range and
gs0 = 0.
Next, we introduce three additional conditions on
a cardinal relation for health–duration pairs. These
conditions are concerned with the ordinal relation
that is associated with . They imply (and are
implied by) further properties of the functions at and
gs in (2).
Assume that T is the semi-infinite interval T =
0 . This choice avoids two issues: that of interpreting health outcomes of zero duration, and that of
assigning an upper bound on duration.
As before, imagine that a quality scale qs on the
set S of health states has been specified. We will
assume that qs is a proper ordinal scale for the cardinal relation S .
If a health state s is preferred to the standard
state s0 , then pairs (s t with greater durations t
should be preferred (because the standard state s0 will
occur after the state s. Similarly, if a state s is less preferred than s0 , then pairs (s t with lesser durations t
should be preferred.
A pair (s ) with a brief duration should be preferentially close to pairs with the standard state s0 in
the sense defined by condition (E) below. By contrast,
Bleichrodt et al. (1997) and Miyamoto et al. (1998)
define T = 0 ) and introduce a “zero-condition,”
which states that any two outcomes (s1 0) and (s2 0)
are indifferent. This condition and the condition that
ordinal preferences are continuous on the set S ×
0 ) imply condition (E) on the subset S × 0 ).
These ideas can be stated as conditions on the ordinal relation associated with a relation :
(C) qs is a proper ordinal scale for the cardinal
relation S defined by condition (B).
(D) For a health state s and two durations t1 < t2 ,
if s is preferred to s0 , then (s t2 is preferred to (s t1 ,
and if s is less preferred than s0 , then (s t2 is less
preferred than (s t1 ).
(E) For a duration t and two health states s1 and
s2 , if s1 and s2 are preferred to s0 , then there exists a
(small) duration such that (s2 ) is less preferred
than (s1 t), and if s0 is preferred to s1 and s2 , then
there exists a (small) duration such that (s2 ) is
preferred to (s1 t).
Theorem 3. Suppose that is a proper cardinal relation for a set S × T of health–duration pairs. Suppose also
that T = 0 ) and that qs is a function defined on the
set S. Then, and qs satisfy conditions (A)–(E) if and
only if there exists a proper cardinal scale for of the form
ws t = Dtf qs
(3)
where the following are true:
(i) f q is a continuous, strictly increasing function
defined on the range of the function qs, and f qs0 = 0.
Also, induces a proper cardinal relation S on changes
in health states, and f qs) is a proper cardinal scale
for S .
(ii) Dt is a continuous, strictly increasing function
defined on the union [0 ) of the interval T = 0 ) and
zero, and D0 = 0. Also, induces a proper cardinal relation T on changes in duration (as explained below), and
Dt is a proper cardinal scale for T .
Moreover, each of the functions Dt and f q is unique
up to a positive multiple.
We present part of the proof of (ii) here because
it provides a needed explanation. According to a
cardinal scale ws t) of the product form (3), if a
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
264
health state s is preferred to the standard state s0 (and
thus f qs > 0), then greater durations in s are preferred, and if s is less preferred than s0 (and thus
f qs < 0), then lesser durations in s are preferred.
Because the scale ws t) is proper and thus has a nonpoint range, at least one of these cases occurs.
With this motivation, we will define a cardinal relation T s for a fixed health state s that is preferred
to s0 by t1 → t2 T s t3 → t4 , provided that (s t1 →
s t2 s t3 → s t4 ). And for a fixed state s that is
less preferred than s0 , we will define T s by: t1 →
t2 T s t3 → t4 provided that (s t3 → s t4 s t1 →
s t2 ). In each of these cases, the function Dt is a cardinal scale for T s . Therefore, the cardinal relations
T s , where s is not indifferent to s0 are the same. We
define T as this common cardinal relation.
As shown below, the special conditions defined in
§4 for health states can be applied in the context of
health–duration pairs as conditions on a relation S ,
and they imply the same special forms of a conversion function f q. Thus, it suffices in this section to
discuss conditions on a cardinal relation that imply
special forms of a function Dt. Moreover, a restriction for a function, f q or Dt, is independent of a
restriction on the other function.
Cardinal timing neutrality. For any health state s,
durations t1 < t2 , and shift t > 0, the changes (s t1 →
s t2 ) and (s t1 + t → s t2 + t) are indifferent.
Cardinal timing aversion. For any health state s,
durations t1 < t2 , and shift t > 0, if s is preferred to
s0 , then (s t1 → s t2 ) is preferred to (s t1 + t →
s t2 + t), and if s is less preferred than s0 , then
(s t1 → s t2 ) is less preferred than (s t1 + t →
s t2 + t).
Cardinal constant timing aversion (I). Preferences
are timing averse, and for any s1 s2 t1 < t2 t > 0,
and t > 0, if two changes, (s1 t1 → s1 t1 + t ) and
(s2 t2 → s2 t2 + t , with the same change t in duration are indifferent, then the changes, (s1 t1 + t →
s1 t1 + t + t) and (s2 t2 + t → s2 t2 + t + t), are
indifferent.
Cardinal constant timing aversion (II). Preferences
are timing averse, and for any s t1 < t2 t3 < t4 , and
t > 0, if two changes, (s t1 → s t2 ) and (s t3 →
s t4 ), with the same health state s are indifferent, then the changes, (s t1 + t → s t2 + t) and
(s t3 + t → s t4 + t), are indifferent.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
As the terminology indicates, these conditions on
cardinal preferences between health–duration pairs
are parallel to well-known conditions on ordinal preferences between temporal sequences of events, that
is, conditions on discounting. Specifically, the conditions are parallel to conditions often called zero discounting, positive discounting, and constant positive
discounting.
Theorem 4. Suppose that is a proper cardinal relation for a set S × T of health–duration pairs. Suppose also that T = 0 ) and that qs is a function
defined on S. Assume that and qs satisfy conditions
(A)–(E), and thus by Theorem 3, has a cardinal scale,
ws t = Dtf qs), as described there. Define q0 = qs0 ).
Then,
(i) has cardinal neutrality for qs if and only if f q
can be chosen as f q = q − q0 ;
(ii) has cardinal constancy for qs if and only if f q
can be chosen as

rq
rq0

r > 0

e − e
f q = q − q0
(4)
r = 0


erq0 − erq r < 0
for some amount of the parameter r;
(iii) has cardinal timing neutrality if and only if Dt
can be chosen as Dt = t;
(iv) has cardinal timing aversion if and only if Dt
is strictly concave;
(v) cardinal constant timing aversions (I) and (II) are
equivalent, and they are satisfied if and only if Dt can
be chosen as Dt = r −1 1 − e−rt ) for some amount of the
parameter r > 0.
Moreover, the implications (i) and (ii) for the function
f q are independent of the implications (iii)–(v) for the
function Dt.
A function Dt is an indefinite integral, Dt =
du du, if and only if Dt is absolutely continu0
ous on any bounded subinterval of [0 ). Then, dt
is called a discounting function, and Dt is called a
cumulative discounting function.
We will call any continuous, strictly increasing
function Dt a cumulative discounting function even
though not every such function is absolutely continuous. Any linear or strictly concave function Dt such
as those in Theorem 4 is absolutely continuous.
t
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
In the case of cardinal timing neutrality, the discounting function dt can be chosen as dt =
D t = 1; in the case of cardinal constant timing aversion, dt can be chosen as dt = D t = e−rt r > 0;
and in the case of cardinal timing aversion, dt can
be chosen to be positive and strictly decreasing. In the
last case, dt may tend to infinity as t tends to zero,
and thus represent extreme discounting of the future
compared to the present.
6.
Cardinal Scales for
Population Health
Here we come to a division between medical decision
making, i.e., the choice of a medical treatment for an
identified individual, and health policy analysis, i.e.,
the evaluation of policy options concerning the health
of an identified population. We think that describing health as a health state or as a health–duration
pair is often appropriate for health policy analysis,
mainly because of the absence of extensive information on each individual in the population, but we
think that describing health as a sequence or a stream
of health states is appropriate for medical decision
making, mainly because of the availability of extensive information on the individual.
The objective of this paper is to develop models for health policy analysis. We conjecture, however, that the models in the previous sections could
serve as a partial basis for cardinal-utility models
for medical decision making. The medical decisionmaking models could be developed by combining
these models with discounting models for discreteand continuous-time outcomes in Harvey (1986; 1988;
1995; 1998a, b). The models in Harvey (1988) are
expected-utility models, and those in the other papers
are ordinal-utility models.
This section is concerned with the aggregation of
cardinal scale amounts for the individuals in a population into a cardinal scale amount for the population
as a whole. We present a cardinal-utility model that is
parallel to the expected-utility model due to Harsanyi
(1955). Except for one generalization (as explained
below), the model presented is a specialization for
the context of population health of the cardinal-utility
social welfare model in Harvey (1999).
An alternative cardinal-utility welfare model has
been developed by Dyer and Sarin (1979). However,
265
it is an “algebraic” model, and hence does not provide health scales that are continuous functions of the
health outcomes. For this reason, we use the “topological” model in Harvey (1999).
As in §2, the health outcomes can be of any type.
Suppose that the individuals in a population are
indexed by i = 1 n. A health outcome for an
ith individual will be denoted by hi , and the set of
such outcomes will be denoted by Hi . A distribution of health outcomes over the population will be
called a health distribution and will be denoted by h =
h1 hn ). We assume that the set of health distributions is the product set H = H1 × · · · × Hn .
The social welfare model in Harvey (1999) assumes
(as stated in terms of social health) that for each ith
individual there is a cardinal relation i that compares changes in (population) health distributions.
Because we wish to develop models for the presciptive purpose of examining social health policies, we
will assume that i reflects social values regarding health outcomes for the ith individual. Thus, i
depends only on the ith components hi in health distributions. Indeed, we define i as a cardinal relation
that compares changes hi → hi in health outcomes for
the ith individual. We will call i an individual cardinal
relation.
This simplification excludes models in which i
reflects the personal values of an individual who is
altruistic. Such preferences are included in the social
welfare model in Harvey (1999).
The social welfare model also assumes that there
is a cardinal relation for changes in health distributions. We will denote such a relation by P and call
it a population cardinal relation. A cardinal scale for an
individual cardinal relation i will be called an individual scale and will be denoted by wi hi , and a cardinal scale for a population cardinal relation P will be
called a population scale and will be denoted by wP (h).
The generalization of the social welfare model in
Harvey (1999) is to omit the assumption that each
set Hi i = 1 n, of health outcomes is topologically connected. Instead, we assume that the individual and population cardinal relations are proper. As
discussed in §3, this approach permits some of the
variables that describe the health outcomes to have
finite domains rather than interval domains provided
that each set Hi is “preferentially connected.”
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
266
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
The key assumption, both in Harvey (1999) and
here, is the condition below that connects a population cardinal relation to the individual cardinal
relations. This condition is analogous to Pareto conditions in additive-utility models and in expected-utility
models.
Cardinal Pareto agreement. For any changes
h → h and k → k in health distributions, if there is
an ith individual such that hj → hj is indifferent to
kj → kj for all j = i, then preferences between h → h
and k → k according to the population cardinal
relation P agree with preferences between hi → hi
and ki → ki according to the individual cardinal
relation i .
In other words, when society does not need to make
trade-offs between changes in the health of different
persons, then preferences between changes in public health agree with preferences between changes in
health for the one person who matters. In this sense,
the Pareto condition is a requirement of individual
sovereignty.
Suppose that (hi = h; hk k = i) denotes a health distribution in which the ith individual has the health
outcome h and the other individuals have the outcomes indicated. Cardinal Pareto agreement implies
that preferences between changes (hi = h; hk k = i →
hi = h ; hk k = i) according to the relation P agree
with preferences between changes h → h according
to the relation i (and thus are independent of the
health outcomes hk k = i).
Theorem 5. A proper population cardinal relation P
has cardinal Pareto agreement with a set of proper individual cardinal relations i i = 1 n, if and only if
for any individual scales wi hi ) there exist weights ai > 0,
i = 1 n, such that the function
wP h = a1 w1 h1 + · · · + an wn hn (5)
is a population scale. The weights ai in (5) are unique up
to a common positive multiple.
The above model compares changes in population
health by a scale that is a linear function of scales that
compare changes in health for the members of the
population. Hence, it provides a foundation for the
utilitarian principle that cardinal utility for a society
is the sum (or a weighted sum) of cardinal utilities
for the members of the society. The weights ai in the
linear function (5) will be called interpersonal weights,
and the model will be called a linear health model.
The result that cardinal Pareto agreement implies a
linear function (5) may seem surprising. The explanation lies in the difference between ordinal and
cardinal uniqueness. Pareto agreement for ordinal
preferences implies that a population ordinal scale is
a function of individual ordinal scales that is strictly
increasing in each scale, but it does not imply any
form of the function. As noted above, Pareto agreement for cardinal preferences implies that for fixed
outcomes hk k = i, a population scale wP (h) restricted
to health distributions of the form (hi = h; hk k = i)
is a cardinal scale for the individual cardinal relation i . For an individual scale wi hi , it follows that
for these health distributions, wP h = ai wi hi + bi ,
for some ai > 0 and bi . Theorem 5 can be viewed
as an extension of this uniqueness property. Moreover, the social welfare model developed by Harsanyi
(1955) for expected utility can be viewed in the same
manner.
Next, we digress to discuss the special but common
case in which there are only two health outcomes of
interest. We show that in this case a policy analyst
can assume that the individual cardinal relations i
are the same even though it is apparent that they are
not; indeed, the analyst does not even need to assess
a common individual scale.
Suppose that an analyst wishes to use a linear
health model in a policy study. Most likely, he or
she can envision a wide range of health outcomes for
the individuals in the defined population. Assume,
however, that the health distributions to be examined
in the study contain only two health outcomes. The
health distributions in the study differ in that different subgroups of the population have one outcome or
the other. Our impression is that this situation is common. It may occur for various reasons; e.g., the available data provide needed health information only on
the two outcomes, or the analyst decides that only the
two outcomes are relevant to the study.
Corollary 1. In a linear cardinal model, assume that
there exist health outcomes h0 and h1 such that they are
in each set Hi , and h1 is preferred to h0 by each individual. Then, there exist unique individual scales wi hi ) such
that wi h0 = 0 and wi h1 = 1 for i = 1 n. (Hence,
Harvey and Østerdal: Cardinal Scales for Health Evaluation
267
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
the individual scales wi hi are equal for h0 and are equal
for h1 .)
Suppose that wP h = a1 w1 h1 + · · · + an wn hn is
an associated population scale. For a health distribution h, define Sh = i = 1 n: hi = h1 }. Then, for
any health distribution whose only health outcomes
are h0 and h1 , the population scale wP h) reduces to
the form
ai (6)
wP h =
i∈Sh
In a model as described in Corollary 1, the only
assessment task for the analyst is to assess the interpersonal weights ai i = 1 n. Section 8 discusses a
procedure for determining the weights ai by assessing
so-called person trade-offs.
7.
Equal Individual Scales and Equal
Interpersonal Weights
This section discusses two simplifications of a linear
health model. First, we present a model in which conditions on preferences imply equal individual scales,
and then we present a model in which conditions on
preferences imply both equal scales and equal interpersonal weights. These special linear health models
will be much easier to implement than the general
model.
7.1. Equal Individual Scales
We expect that in most health policy studies the health
outcome sets Hi can be chosen as a common set H ,
and the individual cardinal relations i can be chosen as a common relation I . The set of possible outcomes for an individual can still vary significantly
from one person to another; for example, younger
people and older people may have quite different sets
of possible outcomes. What is assumed is that preferences between changes in outcomes (whether possible
or not) do not vary significantly from one person to
another. This condition does not mean that society is
indifferent between a change for one person and the
same change for another person.
Theorem 6. In a linear cardinal model, assume that the
health outcome sets Hi are equal and that the individual
cardinal relations i are equal. Then, for any individual
scale wI hi for the common relation i , there exist weights
ai > 0 i = 1 n, such that the function
wP h = a1 wI h1 + · · · + an wI hn (7)
is a population scale. The weights ai in (7) are unique up
to a common positive multiple.
For the purposes of comparing policy options and
of evaluating a single option, the gain in simplicity
of a population scale (7) over a population scale (5)
appears to outweigh its loss in generality; indeed, we
regard Theorem 6 as the basic model for purposes of
application. Sections 8 and 9 discuss reasons for this
opinion. The potential use of a population scale (7) to
evaluate a policy option as well as to compare policy
options is reflected in the terminology below.
Definition 3. A model as described in Theorem 6
will be called a population health evaluation model, and
a scale (7) in such a model will be called a population
evaluation scale.
7.2. Equal Interpersonal Weights
In this subsection, we ask when the interpersonal
weights in a population health evaluation model will
be equal. A common individual scale wI hi is cardinally unique, and for a given scale wI hi , the interpersonal weights are unique up to a positive multiple.
It follows that whenever there exist equal weights for
some population scale, then any weights for any population scale are equal. Hence, we can speak loosely
of “equal weights.”
The result below provides conditions on preferences that imply both equal individual cardinal relations and equal interpersonal weights. We will call a
model that satisfies these conditions an equal weights
model. In this model, the only assessment task is to
assess a common individual scale wI hi . For any such
scale, the corresponding function wP (h) is a population scale.
Theorem 7. Suppose that the health outcome sets Hi in
a linear cardinal model are equal, and that h0 is a fixed but
arbitrary health outcome. Then, the following conditions
on preferences are equivalent, and they are satisfied if and
only if the model is an equal weights model.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
268
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
(a) For any individuals i and j and any health outcome
h, the changes in health distributions
hi = h0! hk = h0 k = i
→ hi = h! hk = h0 k = i and
hj = h0! hk = h0 k = j
→ hj = h! hk = h0 k = j
(8)
are indifferent according to the population cardinal
relation.
(b) The individual cardinal relations i are equal, and
for any individuals i and j there exists a health outcome
h nonindifferent to h0 such that the changes (8) in health
distributions are indifferent according to the population
cardinal relation.
(c) For any individuals i and j and any health outcome
h, the health distributions
hi = h! hk = h0 k = i
and
hj = h! hk = h0 k = j
(9)
are indifferent according to the ordinal relation associated
with the population cardinal relation.
(d) The individual cardinal relations i are equal, and
for any individuals i and j there exists a health outcome h
nonindifferent to h0 such that the health distributions (9)
are indifferent according to the ordinal relation associated
with the population cardinal relation.
7.3.
Should Different Interpersonal Weights Be
Assigned to Different Age Groups?
What differences among the members of a population should imply unequal importance for the same
changes in health outcomes for different individuals?
One possibility is differences in age. Here we discuss
several reasons why different interpersonal weights
might be assigned to people of different ages.
First, age can be related to the social importance of
individuals. Murray (1994) and Murray and Acharya
(1997) argue that because working-aged adults make
greater economic contributions than do children or
seniors, their health has greater social importance.
The authors assign unequal weights to persons of different ages as part of a “disability-adjusted life years”
(DALY) scale for health outcome-streams. Often, in a
DALY scale the weight assigned to a person aged 25
is about twice the weight assigned to someone aged 6
or 67. See Anand and Hanson (1997) for a critical
review. In our opinion, it is not justified to infer
unequal social importance from unequal economic
roles. Moreover, U.S. health policies (e.g., health programs for children and Medicare) implicitly assign
greater weights to children and to seniors than to
working-aged adults.
Second, age can be related to concerns for equality. Greater age tends to imply greater lifetime health
and longevity. Williams (1997) argues for assigning weights that favor equality in people’s lifetime quality-adjusted life years. The weights resulting
from this “fair-innings argument” will be greater for
younger people than for older people.
Greater age also tends to imply lesser future health
and longevity. One can argue for assigning weights
that favor equality in people’s future quality-adjusted
life years. The weights resulting from this futureequality argument will be greater for older people
than for younger people.
Another criterion—one in the spirit of utilitarianism—is to assign weights that favor the sum of
improvements in health and longevity. This criterion
is meaningful for cardinal utility but not for ordinal or expected utility. First, assume that health outcomes are defined as health states—with the duration
of a health state and any ensuing states unknown. A
change in health states (e.g., from fatality to a state
of no health problem) is likely to produce a greater
improvement in a person’s health and longevity for
a younger person than for an older person. So when
health outcomes are defined as health states, the
resulting weights may be greater for younger people. But when health outcomes are defined as health–
duration pairs, a change in health outcomes entails
the same improvement in health and longevity for an
older person as for a younger person. So for health–
duration pairs, knowing the ages of the population
members does not provide a reason for assigning
unequal weights.
Typically, the comparison of policy options in an
equal weights model with health states will differ
from a comparison of the same options in an equal
weights model with health–duration pairs or health
outcome streams. In particular, fatalities will count the
same regardless of age in the model with health states,
whereas a fatality for an older person will count less
Harvey and Østerdal: Cardinal Scales for Health Evaluation
269
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
than a fatality for a younger person in the model with
health–duration pairs or health outcome streams. In
such a context, one cannot assign equal weights
both in the model with health states and in the
model with health–duration pairs or health outcome
streams. Similar observations have been made for
expected-utility models; see, e.g., Bordley (1994) and
Hammitt (2002).
Below, we define two other types of assessments
that an analyst can use to determine an individual
scale or interpersonal weights. In the case of an individual scale, the analyst can first use assessments of
the above type to imply that the scale has a functional
form or belongs to a parametric family and then use
assessments of the types defined below to determine
the scale.
8.
8.2.
Assessment Procedures
In this section and the next, we describe two types
of procedures concerning a cardinal scale; those in
this section are for constructing a cardinal scale, and
those in the next section are for using a cardinal scale.
In both sections, we assume that social values satisfy the conditions for a population health evaluation
model.
Here, we describe statements of social values that
a policy analyst can use to determine an individual
scale and interpersonal weights (and hence a population scale). The statements might be, for example, interview responses by a policy maker, modeling
judgments by the analyst, or assumptions of social
values that the analyst uses to evaluate policy options
conditional on the assumptions. For brevity, we will
refer to any such statements as assessments, and we
will refer to their use to determine a scale or weights
as assessment procedures.
We limit the discussion to assessment procedures
that require both individual and population cardinal
relations and Pareto agreement between them. We do
not discuss assessment procedures such as time tradeoffs that have been developed for noncardinal preference relations.
8.1. Assessing Conditions on Cardinal Preferences
One method of assessment is to assess a condition on cardinal preferences (such as those defined
in previous sections) that implies a property of an
individual scale or that implies equal interpersonal
weights. Some of the conditions regarding an individual scale imply that the scale has a functional form,
e.g., wI s = f qs and wI s t = Dtf qs; some
determine a scale, e.g., wI s = qs and wI s t =
tqs; and some imply that the scale belongs to a parametric family, e.g., wI s = erqs , r > 0, and wI s t =
r −1 1 − e−rt qs, r > 0.
Assessing Person Trade-Offs for Changes
in Health Quality
The assessments defined here state social trade-offs
between a change in health outcomes and the number
of people who experience the change. Similar assessments, called person trade-offs, have been proposed in
contexts that do not involve cardinal utility; see, e.g.,
Richardson (1994), Nord (1995), Murray and Lopez
(1996), and Green (2001). Østerdal (2009) showed that
an ordinal additive social welfare model based on person trade-off assessments must be of a certain type
in order to satisfy normative conditions such as the
Pareto principle.
For our discussion, first suppose that the health
outcomes are health states and that the model is an
equal weights model. Then, Theorems 1 and 7 imply
that there is a population scale of the form wP s =
f qs1 + · · · + f qsn ), where s = s1 sn denotes
a health distribution.
Select a best health state s 1 and a worst health state
0
s , and normalize f q such that f qs 1 = 1 and
f qs 0 = 0. Because the conversion function f q is
now unique, it remains to find more of its values and
then to interpolate between these values.
To find a value of f q, an analyst can select a health
state s that is easy to visualize and assess a number k
in one of the following statements.
(a) Society is indifferent between an improvement
from s 0 to s for the entire population and an improvement from s 0 to s 1 for k members of the population.
(b) Society is indifferent between a worsening from
s 1 to s for the entire population and a worsening
from s 1 to s 0 for n − k members of the population.
In statement (a) or (b), society is making a tradeoff between a degree of improvement or worsening
and a number of population members that experience
the improvement or worsening. Statement (a) implies
that nf qs − f qs 0 " = kf qs 1 − f qs 0 ))],
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
270
and statement (b) implies that nf qs 1 − f qs" =
n − kf qs 1 − f qs 0 ))]. Either equation implies
that the conversion function has the value f qs =
k/n. Thus, a value f qs of the conversion function can be interpreted as a proportion p = k/n of the
population.
The person trade-offs in (a) and (b) can also be
stated as comparisons of health distributions or as
comparisons of changes in health distributions. The
following statement is an example.
(c) Society is indifferent between a health distribution in which everyone in the population has the
health state s and a health distribution in which k
members have the health state s 1 and the remaining
n − k members have the health state s 0 .
Statements (a)–(c) are equivalent in the sense that
the assessed number k should be the same in each
one. As a question in behavioral economics, one
could investigate whether people do in fact assess
the same number k in these statements or exhibit
a systematic discrepancy similar to that between
people’s willingness-to-pay and willingness-to-accept
amounts.
When the interpersonal weights are unequal, one
can determine a population scale in two steps. First,
select a subpopulation in which the weights are equal
and use conditions on cardinal preferences and/or
assessments of person trade-offs to determine an individual scale, wI s = f qs). Then, order the population members in some fashion and use assessments
of person trade-offs analogous to (a) and (b) to determine a set of interpersonal weights.
To explain the second step for (a) or (b), suppose
that the phrase “for k members” in (a) is replaced
by “for the first k members,” and the phrase “for
n − k members” in (b) is replaced by “for the last
n − k members.” Suppose that wP s = a1 f qs1 +
· · · + an f qsn ) denotes a population scale with the
given ordering of the individuals and that the weights
are normalized such that a1 + · · · + an = c for a specified number c (e.g., c = 1 or c = n). Then, a modified assessment (a) or (b) implies that a1 + · · · + ak =
cf qs). Here, cf qs) is known. The analyst can
select a list of health states with different degrees of
health quality, and thus obtain a list of equations. The
weights can then be calculated by solving the simultaneous equations.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
Next, suppose that the health outcomes are health–
duration pairs, the weights are unequal, and social
values satisfy the conditions in Theorem 3. Then,
there is a population scale, wP s t = a1 Dt1 f qs1 + · · · + an Dtn f qsn ), where (s t = s1 t1 sn tn ) denotes a health distribution. The analyst can
select a duration t and consider health distributions in
which every health–duration pair has the duration t.
Then, the scale wP s t) reduces to a population scale
regarding health states, and the analyst can use the
assessment procedures described above to determine
a conversion function f q and interpersonal weights
ai . Finally, the analyst can use any of a wide variety of methods to determine a cumulative discounting
function Dt.
8.3.
Assessing Social Attitudes Toward Inequality
in Health Quality
In an assessment (a)–(c) as described above, an analyst selects a health state s and thus a health quality
q = qs, and assesses a number k and thus a proportion k/n. This procedure can be reversed; the analyst can select a number or a proportion and assess
a health state or a health quality. In such a reverse
person trade-off procedure, one assesses amounts, q =
f −1 k/n), and thereby finds a value, f q = k/n, of the
conversion function at each assessed amount q.
By using a reverse person trade-off procedure, an
analyst can select the proportion k/n = 12 . This proportion is relatively easy to visualize; half the population
has the best health state, s 1 , and the other half has the
worst health state, s 0 . The inverse amount q = f −1 12 is the health quality of a middling health state s relative to s 0 and s 1 ; that is, the changes, s 0 → s and
s 0 → s 1 , are indifferent according to the individual
relation.
A reverse person trade-off involves population
health and thus is an assessment of social values.
Below, we explain how it can be interpreted as an
assessment of social attitude toward inequality in
health quality. Suppose that q 1 = qs 1 ) and q 0 =
qs 0 ) are the best and worst amounts of health quality, respectively, and that there is a population scale,
wP s = f qs1 + · · · + f qsn ).
Suppose that s0 1 is a health distribution in which
half the population has the health quality q 1 , whereas
the other half has the health quality q 0 , and that savg
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
is a health distribution in which the entire population
has the average health quality, q avg = 12 q 0 + 12 q 1 . The
average health quality in both s0 1 and savg is q avg .
But savg has no inequality in health quality, whereas
s0 1 has an extreme degree of inequality in health
quality. Is savg preferred to s0 1 ?
To examine this question, suppose that sindiff is a uniform distribution that is indifferent to s0 1 and that
q indiff is the common health quality in sindiff . Then, the
comparison between the distributions savg and s0 1
is the same as that between the health quality q avg
in savg and the health quality q indiff in sindiff . If q avg >
q indiff , then the distribution savg with no inequality in
health quality is preferred to the distribution s0 1
with extreme inequality in health quality. In this sense,
society is averse to inequality in health quality with
respect to q 0 and q 1 . In the case that q avg > q indiff for
any pair of health qualities q 0 < q 1 , we will say that
society is averse to inequality in health quality. The
reader can check that the population relation satisfies
this condition if and only if the conversion function
f q is strictly concave.
Suppose that an analyst assesses that social values
have cardinal constancy with respect to a quality scale
qs. Then, by Theorem 2, the conversion function f q
has a linear-exponential form (1). A single reverse person trade-off suffices to determine the parameter r in
(1), and the value of r determines society’s attitude
toward inequality in health quality for any q 0 < q 1 .
By an extension of the above terminology, society is
averse, neutral, or prone to inequality in health quality when r is negative, zero, or positive, respectively.
9.
Evaluation Procedures
Suppose that in a study of population health, an analyst has identified a set of policy options, has estimated the changes h → h in population health that
the options may produce, and has constructed a population health evaluation model that represents social
values relevant to the study. This section describes
procedures by which he or she can use a population
scale in the model to evaluate a policy option, that
is, to calculate and report information on that particular option. Most of these procedures lead directly to
methods for comparing two options: the analyst simply determines the difference or the percentage difference in their evaluations.
271
These procedures are meaningful only in a context
of cardinal relations. For a given policy study, an analyst can choose a procedure that he or she judges to
be appropriate for the situation.
In most policy studies, the issue is how to modify current policy (or whether to modify it). Typically, each policy option (including the current policy)
implies a probability distribution of health distributions. In this paper, however, we discuss only the case
in which it suffices to replace any probability distribution by a predicted health distribution. Thus, we
focus on the difficulty in recommending a policy that
is due to the complexity of the health distributions.
We assume that the analyst models each policy
option as a change from a initial health distribution
to the health distribution that is predicted to result
from the option. To provide a more concrete discussion, we will assume that the initial distribution is the
health distribution that is predicted to result from the
current policy. The reader should note, however, that
the evaluation procedures that are discussed do not
depend on this choice; in particular, the analyst could
choose a simple, hypothetical health distribution as
the initial distribution.
Assume that the population scale is a weighted
average, wP h = a1 wI h1 + · · · + an wI hn , where
a1 + · · · + an = 1. The predicted health distributions for
the current policy and for a policy option will be
denoted by hcur and by hopt , respectively. Thus, the
analyst wishes to evaluate each change hcur → hopt .
The health outcomes can be of any type.
9.1. Reporting a Change in Health Distributions
One type of procedure for reporting a change hcur →
hopt for a policy option is to calculate an indifferent change h → h with simpler health distributions
and report the calculated change. The change h → h
can be calculated by the formula wP h − wP h =
wP hopt − wP hcur .
A variety of procedures are possible. For example, the analyst could select the initial health distribution h to be a uniform distribution (i.e., a distribution
h = h h) in which everyone has the same health
outcome h), and for each policy option calculate a
final distribution h that is also a uniform distribution.
The advantage of these procedures is that they do
not require the population scale to have a simple interpretation, e.g., as a quality scale or as a QALY scale.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
272
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
The reported information is a change in health distributions and not a difference in population scale
amounts. The disadvantage of the procedures is that
they lack the usual features of measurement; for example, they do not measure a change hcur → hopt as an
amount of a single variable.
9.2. Reporting a Subpopulation
Suppose that the health outcomes in a policy study
range from a worst health outcome, denoted by h0 ,
to a best health outcome, denoted by h1 , and that a
policy option implies a change hcur → hopt that is an
improvement, or at least not a worsening, in population health.
First, assume that the population health evaluation
model is an equal weights model. For each k = 0
n, suppose that hk denotes a health distribution in which k individuals have the health outcome
h1 , and the other n − k individuals have the health
outcome h0 . In particular, h0 denotes the health
distribution in which everyone has the worst outcome h0 , and hn denotes the health distribution in
which everyone has the best outcome h1 .
In such a situation, one can calculate for any change
cur
h → hopt a change h0 → hk that is approximately
indifferent to hcur → hopt in the sense that the difference, wP hk − wP h0 ), is approximately equal to the
difference, wP hopt − wP hcur ). Then, one can report
either the number k or the proportion k/n as an evaluation of the change hcur → hopt .
If the population health evaluation model is not an
equal weights model, then the population members
must be ordered in some fashion, e.g., by increasing
or decreasing age. Suppose that hk denotes a health
distribution in which the first k individuals have the
health outcome h1 . Then, a number k or a proportion
k/n can be calculated as above.
If the individual scale wI hi is normalized such
that wI h0 = 0 and wI h1 = 1, then in the case of
equal weights, k is determined by the first formula
that follows, and in the more general case of (possibly) unequal weights, k is determined by the second
formula:
k = nwP hopt − wP hcur a1 + · · · + ak = wP hopt − wP hcur These evaluation procedures can be used when the
population scale lacks a simple interpretation because
they use a scale measured in units which differ from
those of the population scale.
9.3. Reporting an Average Change in Health
We first discuss procedures for the case in which the
health outcomes are health states, and then discuss
procedures for the case in which the health outcomes
are health–duration pairs. In each case, a weighted
average population scale wP h) will serve as an evaluation scale. The key is to identify circumstances in
which the scale wP h) has a simple interpretation.
9.3.1. Evaluation Scales for Health States. Assume
that an individual scale has the form wI s = f qs),
as described in §4. For a health distribution s =
s1 sn ), suppose that q̄ denotes its weighted average quality scale amount, that is, q̄ = a1 qs1 + · · · +
an qsn .
If preferences have cardinal neutrality for a quality scale qs, then by Theorem 2 one can choose the
individual scale as wI s = qs. Then, wP s = a1 qs1 + · · · + an qsn = q̄. Hence, the scale wP s) has a simple
interpretation, namely, as an average health quality,
and thus it can be used as an evaluation scale. To
evaluate a change scur → sopt , an analyst can use the
formula
wP sopt − wP scur = q̄ opt − q̄ cur opt
cur
(10)
Because the difference, q̄ − q̄ , in average health
quality from scur to sopt equals the average change in
health quality from scur to sopt , the analyst can report
the average change in health quality calculated by (10)
as an evaluation of the change scur → sopt .
If preferences do not have cardinal neutrality for
qs, then the conversion function f q is nonlinear.
In this case, the population scale amount of a health
distribution may be unequal to its average health
quality, and different health distributions with the
same average health quality may be nonindifferent.
Moreover, a change scur → sopt in health distribu
tions may be preferred to a change scur → sopt , even
though the average change in health quality is less for
scur → sopt than it is for scur → sopt . For this reason,
it can be misleading to evaluate a change scur → sopt
by its average change in health quality when preferences do not have cardinal neutrality for the health
quality scale.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
273
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
9.3.2. Evaluation Scales for Health–Duration Pairs.
Assume that an individual scale has the product form wI s t = Dtf qs), as described in §5.
A health distribution will be denoted by (s t =
s1 t1 sn tn ).
First, consider the use of duration as an evaluation
scale. As mentioned in §5, the QALYs of a health–
duration pair are defined as follows. First, specify a
state s 1 of good health. Then, for any health–duration
pair (s t), find a health–duration pair (s 1 &) such
that (s t) and (s 1 &) are indifferent, i.e., society is
indifferent between a person spending a duration t in
a health state s and spending a (presumably shorter)
duration & in the (presumably better) health state s 1 .
Here, it follows that Dtf qs = D&f qs 1 ).
Typically, the scale of durations & defined in this
manner is called a QALY scale. We will call a duration & the QALY duration of a health–duration pair
(s t), where the redundant term “duration” is added
to emphasize that a QALY scale is a time scale.
Define = &1 &n ) and s1 = s 1 s 1 ).
Pareto agreement implies that a health distribution
(s t) is indifferent to the health distribution (s1 )
with a uniform health state of s 1 because for each
individual the corresponding health–duration pairs
are indifferent.
For convenience, we will assume that the conversion function f q is normalized such that
f qs 1 = 1. It follows that & = D−1 Dtf qs).
If preferences are cardinal timing neutral, and thus
present and future outcomes have the same importance, then by Theorem 4 one can choose Dt = t.
Then, & = tf qs). Suppose that &¯ denotes the average QALY duration of a health distribution (s t), that
is, &¯ = a1 &1 + · · · + an &n . Then, wP s t = wP s1 , =
a1 &1 f qs 1 + · · · + an &n f qs 1 = &.
¯ Therefore, the
population scale can be interpreted as an average
QALY duration, and thus it can be used as an evaluation scale. To evaluate a change (s tcur → s topt , an
analyst can use the formula
wP s topt − wP s tcur = &¯ opt − &¯ cur (11)
Because the difference, &¯ opt − &¯ cur , in average QALY
duration from (s tcur to (s topt equals the average
change in QALY duration from (s tcur to (s topt , the
analyst can report the average change in QALY duration as an evaluation of the change (s tcur → s topt .
If preferences are not cardinal timing neutral, and
thus future outcomes are discounted, then the cumulative discounting function Dt is nonlinear. In this
case, the population scale amount of a health distribution may be unequal to its average QALY duration, and different health distributions with the same
average QALY duration may be nonindifferent. Moreover, a change, scur → sopt , in health distributions may
be preferred to a change, scur → sopt , even though
the average change in QALY duration is less for
scur → sopt than it is for scur → sopt . In a model that
discounts future outcomes, it can be misleading to
evaluate a change scur → sopt by its average change in
QALY duration.
What should be done in the case of nonzero discounting? Here, we propose a procedure that is dual
to the QALY procedure discussed above in that it uses
health qualities rather than durations as an evaluation scale. To define a scale based on health quality, first select a duration t 1 that is to play the role
of the health state s 1 . Perhaps, t 1 is an extremely
long duration (what might be called a Methuselahn
duration). Then, for any health–duration pair (s t),
find a health–duration pair (s t 1 such that (s t)
and (s t 1 ) are indifferent, i.e., society is indifferent
between a person spending a duration t in a health
state s and spending a (presumably longer) duration
t 1 in the (presumably worse) health state s . Suppose
that ' denotes the health quality of the health state s ,
that is, ' = qs ). We will call ' the years-adjusted life
quality of the health–duration pair (s t), and we will
abbreviate this phrase as YALQ. To emphasize that a
YALQ scale is a health quality scale, we will call a
health quality ' a YALQ quality.
Define t1 = t 1 t 1 ) and s = s1 sn ).
Pareto agreement implies that a health distribution
(s t) is indifferent to the health distribution (s t1 )
with a uniform duration of t 1 because for each individual the corresponding health–duration pairs are
indifferent.
As in the discussion of health states, suppose that
social preferences have cardinal neutrality for qs.
Then, by Theorem 4 one can choose the individual
scale as wI s t = Dtqs − qs0 ), where s0 is the
standard state defined in §5. We will assume that
the quality scale qs has been normalized such that
qs0 = 0, and thus wI s t = Dtqs. We also assume
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
274
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
that the cumulative discounting function Dt has
been normalized such that Dt 1 = 1.
Suppose that '¯ denotes the average YALQ quality
of a health distribution (s t, that is, '¯ = a1 '1 + · · · +
an 'n . Then, wP s t = wP s t1 = a1 Dt 1 '1 + · · · +
an Dt 1 'n = '.
¯ Hence, wP s t) can be interpreted as
an average YALQ quality, and thus it can be used as
an evaluation scale. To evaluate a change (s tcur →
s topt , an analyst can use the formula
wP s topt − wP s tcur = '¯ opt − '¯ cur (12)
Because the difference, '¯ opt − '¯ cur , in average YALQ
quality from (s tcur to s topt equals the average
change in YALQ quality from (s tcur to s topt , the
analyst can report the average change in YALQ quality
calculated by (12) as an evaluation of s tcur → s topt .
10.
A Hypothetical Application for
Hypertension
This section illustrates how the models and procedures in this paper can be applied. The example we
present concerns the evaluation of a proposed therapy
for hypertension, known informally as high blood
pressure. The example is hypothetical, both in the
data from clinic trials and in the assessment of individual and population scales. It is also hypothetical
in that neither of us has expertise on hypertension.
We take most of our information from a summarizing report, Chobanian et al. (2003), published by the
American Heart Association.
Optimal blood pressure is defined as a systolic
blood pressure (SBP) of 115 mm Hg and a diastolic
blood pressure (DBP) of 75 mm Hg (or is defined
as SBP less than 120 mm Hg and DBP less than
80 mm Hg). Hypertension is defined as an SBP greater
than 140 mm Hg or a DBP greater than 90 mm Hg.
The World Health Organization estimates that suboptimal blood pressure is responsible for about twothirds of cerebrovascular disease and about one-half
of ischemic heart disease worldwide.
Blood pressure can be measured by two general
methods. The more accurate method, called ambulatory blood pressure monitoring (ABPM), is to have
a person wear a monitoring device during a 24-hour
period. The less accurate method, which we will call
office measuring (OM), is to measure the person’s
blood pressure in a medical office. As discussed in
Chobanian et al. (2003), there are significant and variable differences in the measurement results of the two
methods.
Stage I hypertension is defined in Chobanian et al.
(2003) as an SBP in the range of 140–160 mm Hg and
a DBP in the range of 90–100 mm Hg as measured
by the OM method. Because Stage I hypertension is
defined in terms of the OM method, individuals with
this diagnosis can have a wider range of actual SBP
and DBP amounts.
Suppose that as part of larger project a research
team wished to evaluate the effectiveness of a
proposed therapy for reducing blood pressure in
individuals with Stage I hypertension. The therapy
is a combination of a drug regimen and counseling
sessions to promote lifestyle changes. It was expected
to have tolerable but not insignificant side effects of
the drugs and disruption due to the lifestyle changes.
In part because the therapy includes lifestyle counseling in addition to drugs, it was anticipated to have
significantly different effects on blood pressure for
different populations defined by factors such as level
of education, age, and other health variables.
The goal of the subproject was to evaluate the ability
of the therapy to reduce the risks of cerebrovascular
disease and ischemic heart disease associated with
hypertension for prospective populations of individuals who are diagnosed with Stage I hypertension—and
if possible to compare the predicted risk reduction of
the therapy in different defined populations.
First, the researchers defined six categories of individuals, called Categories I–VI, based on information
other than blood pressure. Then, they conducted clinical trials of the therapy for persons with Stage I
hypertension. For each participant, extensive information was gathered, and the person was placed
in one of six sample populations, also called Categories I–VI, according to the information. Measurements of blood pressure were taken at the beginning
and at the end of a four month period (with different periods for different individuals). The more accurate ABPM method was used because the researchers
were interested in the ability of the therapy to reduce
an individual’s actual blood pressure. Each person’s
blood pressure measurements were aggregated into
a single SBP amount and a single DBP amount such
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
that greater SBP and DBP amounts indicated greater
health risks according to previous studies (see below).
A health outcome was defined as a health state
measured by a person’s calculated SBP and DBP.
Thus, s = (SBP, DBP). The data for each sample population consisted of an initial distribution sint of health
states s int at the beginning of the trial periods and a
final distribution sfin of health states s fin at the end
of the trial periods. Hence, the data from the clinical trials consisted of the information in the changes
fin
int
sint
k → sk k = 1 6. The initial distributions sk
were appreciably different; in particular, they had different averages of SBP and of DBP.
The researchers next consulted studies that related
blood pressure and risks of cerebrovascular disease
and ischemic heart disease. The studies had considered different types of risk, i.e., different measures of
the risk of different categories of morbidity and fatality. One type of risk was the probability of a major
stroke during a person’s lifetime. For persons with
Stage I hypertension, the risk measures seemed to
have linear relationships with one another, i.e., greater
differences in one risk measure seemed to correspond
with greater differences in the other risk measures.
Because of this concurrence between types of risk,
the researchers were able to define an individual cardinal relation I on changes s → s in blood pressure
in terms of the consequent changes in risk without
specifying the type of risk. For the same reason, they
were able to define a population cardinal relation P
on changes s → s in health distributions in terms of
changes in the sum of the risks for the population
members without specifying the type of risk.
The researchers judged that these cardinal relations
satisfied the conditions for the existence of cardinal scales, e.g., the condition of dynamic consistency
mentioned in §3. They also judged that the cardinal relations satisfied the conditions for a population health evaluation model with equal interpersonal
weights.
The blood pressure variables and the risk scales
measured harms and disutilities rather than goods
and utilities. Recognizing this, the researchers translated the population health evaluation model into a
harm/disutility model. In this discussion, however,
we will translate as we proceed.
275
The second part of the project was to assess individual and population scales. Based on the risk studies, the researchers judged that for fixed SBP and
DBP amounts, the health states (SBP + 20, DBP)
and (SBP, DBP + 10) are associated with about the
same risk, and that intermediate health states, e.g.,
(SBP + 10, DBP + 5), have about the same risk. Based
on these judgments, they assessed a linear quality
scale, qs = −20−1 (SBP − 115 − 10−1 (DBP − 75).
Later, the researchers decided for reasons of convenience to redefine the quality scale as one-half this
amount, namely, as qs = 12 −20−1 (SBP − 115 − 10−1
(DBP − 75)].
The researchers next observed that an increase in
SBP or DBP from a higher level has a greater effect
on risk than the same increase from a lower level;
e.g., an increase from 140 to 155 mm Hg is worse
than an increase from 115 to 130 mm Hg. Hence, the
functional dependence of risk on blood pressure is
convex over the observed range. By translating from
harms to benefits, the conversion function f q, which
represents the dependence of an individual scale on
qs, is concave.
The researchers modeled f q by verifying the condition of cardinal constancy, at least as a rough
approximation. This condition can be stated by fixing, for example, the variable DBP and considering
different amounts of the variable SBP. Then, cardinal constancy states that if two changes SBP → SBP∗
and SBP∗ → SBP incur equal changes in risk, then for
any amount the changes SBP + → SBP∗ + and
SBP∗ + → SBP + incur equal changes in risk.
By Theorem 2, cardinal constancy implies that
there is a conversion function f q having the linearexponential form in (1). The concavity of f q then
implies that there is a conversion function of the form
f q = −erq , and hence an individual scale of the form
wI s = −erqs , for some parameter amount r < 0.
To assess the parameter r, the researchers used the
studies relating blood pressure to risks. Chobanian
et al. (2003, p. 1210) states that, “[f]or every 20 mm
Hg systolic or 10 mm Hg diastolic increase in BP
[blood pressure], there is a doubling of mortality from
both ischemic heart disease and stroke.” This statement does not specify what change in DBP accompanies a 20 mm Hg increase in SBP or vice versa.
Because individuals with a 20 mm Hg higher SBP also
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
276
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
have a 10 mm Hg higher DBP much more often than
no higher DBP, we guess that the statement refers to
an increase of 20 mm Hg in SBP and an increase of
10 mm Hg in DBP. Such increases in SBP and DBP
correspond to a decrease of one unit in qs.
A risk measure is linearly related to the individual
scale; that is, risk = af q + b. Hence, the above discussion implies that af q − 1 + b = 2af q + b). To
solve this equation for q, differentiate with respect to
q to obtain f q − 1 = 2f q. Because f q = −erq , this
implies that r = −ln 2. Hence, by the definition of qs
wI SBP DBP
= − exp 12 ln 220−1 SBP − 115 + 10−1 DBP − 75" Because the researchers had constructed an equal
weights model, there was no need to assess interpersonal weights. Moreover, a population scale amount
wP s) did not depend on how the individuals in the
population were indexed.
The Category I–VI sample populations had different sizes, as would prospective populations of persons with Stage I hypertension. For that reason, the
researchers chose an averaging population scale, that
is, they chose the scale
wP s =
n
1
w SBPi DBPi n i=1 I
where n is the size of the population, (SBPi , DBPi ) is
the health state of the ith individual in the population,
and wI (SBPi , DBPi ) is as specified above.
The third part of the project was to use the population scale to evaluate (and if possible to compare) the
effectiveness of the therapy to reduce risk in prospective populations consisting of individuals with Stage I
hypertension who have the characteristics of Categories I–VI. Such a population was called a category k
population where k = 1 6. As described above,
fin
the clinical trials provided the data, sint
k → sk k =
1 6, from the six sample populations.
First, the researchers calculated evaluations of the
therapy. They considered each of the three types of
evaluation discussed in §9. Their conclusions were as
follows.
(1) The researchers specified the uniform distribution s0 with s0 = 150, 95) and for each k = 1 6 calfin
culated a uniform distribution sk such that sint
k → sk
is indifferent to s0 → sk . The calculated distributions
ranged from s1 = 137, 88.5) to s6 = 141, 90.5). The
researchers reported that for a category k population,
the reductions in risk due to the therapy would be
equivalent to the reductions in risk due to a uniform
reduction in blood pressure from s0 to sk .
(2) The researchers specified sl = 140, 90) and su =
160, 100) as extreme health states for Stage I hypertension, and for each k = 1 6 calculated a fraction
fin
pk such that the change sint
k → sk is indifferent to a
change from a uniform distribution with su to a distribution in which blood pressure is reduced from su
to sl for a fraction pk of the population and remains
at su for the remaining fraction of the population.
The calculated fractions ranged from p1 = 053 to p3 =
0.41. The researchers reported that for a category k
population, the reductions in risk due to the therapy
would be equivalent to the reductions in risk due to
an extreme reduction in blood pressure from su to sl
in a fraction pk of the population.
(3) The researchers observed that because the confin
version function f q is concave, a change sint
k → sk
would not be indifferent to the change from a uniform distribution with the average amounts of SBP
and DBP in sint
to a uniform distribution with the
k
average amounts of SBP and DBP in sfin
k . For this reason, they did not report the reductions in risk due to
the therapy in category k populations as equivalent
reductions in risk due to changes in average blood
pressure.
The researchers were also able to compare the
effectiveness of the therapy to reduce risk for pairs
of different category k populations. To do so, they
first calculated the differences, wk = wP sfin
k −
int
wP sk k = 1 6. The differences ranged from
w1 = 141 to w6 = 107.
The differences wk show that the therapy would
be most effective for Category I patients and least
effective for Category VI patients. The researchers
were also able to make quantitative comparisons. The
cardinal uniqueness of the population scale implies
that ratios wj /wk are independent of the particular
scale wP s). Moreover, the risk measures in the studies
of cerebrovascular disease and ischemic heart disease
are linearly related to the scales wP s) (because both
are linearly related to wI s). Thus, from the fact that
w1 was 32% greater than w6 , the researchers were
Harvey and Østerdal: Cardinal Scales for Health Evaluation
277
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
able to estimate that the reductions in risks would be
about 32% greater for a Category I population than
for a Category VI population and that the risk reductions would be intermediate for the other types. They
reported the percents as concise comparisons of effectiveness for the six types of prospective populations
of persons with Stage I hypertension.
Acknowledgments
The authors thank James K. Hammitt for helpful comments on a memo that initiated this paper, and Alec Morton for helpful comments on this paper. They also thank
the reviewer and associate editor for their comments, especially for the suggestion that they include a hypothetical
application. Charles Harvey is retired from the University
of Houston. His address is 5902 NW Pinewood Place, Corvallis, Oregon 97330.
Appendix. Proofs of Results
Proof of Lemma 1. Suppose that wh is a proper cardinal scale and zh is a cardinal scale (proper or not).
Let W and Z denote the ranges of wh and zh, respectively. Because wh and zh are ordinal scales, we have
zh = f wh for some strictly increasing function f w.
The domain W of wh is a nonpoint interval, and the range
Z of zh and f w can any set of numbers.
= 12 w + 12 w . Then, w
is in W ,
For w w in W , define w
and there exist h h̄ h in H such that wh = w wh̄ =
− w = w − w
implies h → h̄ ∼ h̄ → h ,
wh = w . Thus, w
w
which implies zh̄ − zh = zh − zh̄, which implies
= 12 f w + 12 f w ). Jensen (1905, 1906) proved that any
f w
continuous function defined on an interval that satisfies the
above equation is linear (see, e.g., Aczél 1966, p. 43), and
essentially the same argument can be used to show that any
increasing function that satisfies the equation is linear. The
further arguments are straightforward. Proof of Theorem 1. Because qs is proper, it is continuous and has a nonpoint interval range Iq .
If is proper, then it has a cardinal scale ws with a
nonpoint interval range Iw . Both ws and qs are ordinal
scales, and thus, ws = f qs), for some strictly increasing function f q. Moreover, f q is continuous because its
domain Iq and range Iw are intervals.
Conversely, suppose that has a cardinal scale, ws =
f qs), where the function f q is continuous and strictly
increasing. Then, the function ws is continuous, and thus
its range is an interval. Moreover, its range is nonpoint
because the range of the quality scale qs is nonpoint
and f q is a strictly increasing function defined on that
range. Proof of Theorem 2. We will prove only the forward
implications as the converse implications are straightforward to verify. By Theorem 1, has a cardinal scale f qs,
where the function f q is continuous and strictly increasing
function on the nonpoint interval range Iq of qs.
To show part (i), assume that has cardinal neutrality
for qs. For any amounts q q in Iq , there exist health states
s s ŝ with qs = q qs = q , and qŝ = 12 q + 12 q . Then,
qŝ = 12 qs + 12 qŝ, and thus s → ŝ ∼ ŝ → s by the condition of cardinal neutrality. It follows that f qŝ − f qs =
f qs − f qŝ), and thus f qŝ = 12 f qs + 12 f qs ).
Therefore, the function f q is a solution of Jensen’s functional equation, f 12 q + 12 q = 12 f q + 12 f q . Because f q
is continuous and strictly increasing, it follows that f q =
aq + b for some constants a > 0 and b. And because a positive linear transformation of a cardinal scale is a cardinal
scale, qs is a cardinal scale for .
To show part (ii), assume that the relation has cardinal
constancy for qs. Then, for any amounts q q̂ q and q +
q q̂ + q, q + q in Iq , there are health states s ŝ, s and
t t̂ t such that qs = q, qŝ = q̂ qs = q and qt = q +q,
qt̂ = q̂ + q, qt = q + q.
Assume that f q̂ = 12 f q + 12 f q . Then, f qŝ −
f qs = f qs − f qŝ, and thus ŝ is middling relative to s and s . By the condition of cardinal constancy,
it follows that t̂ is middling relative to t and t . Then,
f qt̂ − f qt = f qt − f qt̂, and thus f q̂ + q =
1
f q + q + 12 f q + q.
2
In summary, the function f q is a solution of the
following functional relationship:
f q̂ = 12 f q + 12 f q implies
f q̂ + q = 12 f q + q + 12 f q + q
(13)
This functional relationship is studied in Harvey (1990,
Theorem B1). In the present notation, it is shown that a
continuous, strictly increasing function f q is a solution
of (13) if and only if it has a linear-exponential form: f q =
a exprqs + b r > 0; f q = aqs + b r = 0; or f q =
−a exprqs + b r < 0, for some a > 0 b, and r. And
because a positive linear transformation of a cardinal scale
is a cardinal scale, has a cardinal scale of a form in (1).
Unlike the case for Jensen’s equation, the assumption that
f q is continuous cannot be omitted. Proof of Lemma 2. Assume that ws t) is a proper cardinal scale for . Let ws t = ws t denote the function
of s obtained from ws t by fixing a duration t. Because S
is proper by condition (B), Lemma 1 implies that each function ws t is a proper cardinal scale for S and that the
scales ws t are related by positive linear transformations.
Choose a duration t ∗ and define w ∗ s = ws t ∗ ). Each
function ws t is a positive linear transformation of w ∗ s,
that is, ws t = atw ∗ s + bt for some amounts at > 0
and bt that can depend on the duration t. But condition (A) implies that ws0 t = ws0 t ∗ ) for any t. Hence,
w ∗ s0 = ws0 t ∗ = ws0 t = atw ∗ s0 + bt, and thus
bt = atw ∗ s0 + w ∗ s0 . Therefore, ws t = ws t =
atw ∗ s − w ∗ s0 + w ∗ s0 .
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
278
Define zs t = ws t − w ∗ s0 . Then, zs t = atgs
where gs = w ∗ s − w ∗ s0 . Thus, zs t has the product
form (2). It is a proper cardinal scale for because it differs
from ws t by a constant. Hence, it is continuous, and thus
the functions at and gs are continuous. As noted above,
the function at is positive. And the function gs has an
interval range because w ∗ s is a proper cardinal scale for
the relation S . Moreover, gs0 = w ∗ s0 − w ∗ s0 = 0.
For the reverse implications, assume that a function
ws t = atgs is a proper cardinal scale for where
the functions at gs are as described. Then, ws0 t =
atgs0 = 0 for any t, and thus condition (A) is satisfied. It
remains to show that condition (B) is satisfied.
For any fixed duration t, the function, ws t = ws t =
atgs, is a cardinal scale for the cardinal relation S t
defined by restricting to health–duration pairs with the
duration t. Because the functions ws t = atgs differ
from one another by a positive multiple, gs is a cardinal
scale for each relation S t . Hence, the relations S t are the
same (and thus they can be denoted by S . By assumption,
the function gs is continuous and has an interval range. If
its range is a point, then gs = gs0 = 0 for any health state
s, and thus ws t = 0 for any health–duration pair (s t.
But this contradicts the assumption that ws t is proper
scale. Proof of Theorem 3. Assume that satisfies conditions (A)–(E). Then, it has a cardinal scale, ws t = atgs,
as described in Lemma 2. And as shown in the proof of
Lemma 2, gs is a proper cardinal scale for the cardinal relation S that is induced by . Hence, condition (C)
implies that gs = f qs for a continuous, strictly increasing function f q with a nonpoint interval range. Moreover,
f qs0 = gs0 = 0, and there exists a health state s with
f qs = 0.
Define a function Dt on the interval [0 ) by Dt =
at t in T = 0 ), and D0 = 0. By Lemma 2, Dt is positive and continuous on T . For any health state s that is not
indifferent to the standard state s0 Dt = f qs−1 ws t
is a cardinal scale for the cardinal relation T s that is
defined by . Thus, condition (D) implies that Dt is
strictly increasing on T = 0 ).
The function Dt is continuous and strictly increasing
on the larger interval [0 if and only if it tends to
zero as t tends to zero. Assume that there is a health
state s with f qs > 0. Then, for any , > 0, there exists
a health state s, with 0 < f qs, < f qsD1−1 ,. And
by condition (E), there exists a duration such that the
health–duration pair (s is less preferred than the health–
duration pair (s, 1). Hence, Df qs < D1f qs, ), and
thus D < ,. Hence, Dt tends to zero as t tends to
zero because Dt is an increasing function. The argument
is essentially the same if there is a health state s with
f qs < 0.
Proof of the converse implications consists of verifying each of the conditions (A)–(E). These verifications are
straightforward.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
It remains to show that the functions f q and Dt as
described are unique up to positive multiples. First, assume
that f ∗ q = af f q and D∗ t = aD Dt, where af > 0 and
aD > 0. Then, w ∗ s t = D∗ tf ∗ qs = aD af Dtf qs) is a
proper cardinal scale for , and the functions f ∗ q and
D∗ t have the properties (i), (ii). Conversely, assume that
w ∗ s t = D∗ tf ∗ qs is a proper cardinal scale for where f ∗ q and D∗ t have the properties (i) and (ii). Then,
f ∗ q is a proper cardinal scale for s , and D∗ t is a proper
cardinal scale for T . By Lemma 1, it follows that f ∗ q =
af f q+bf and D∗ t = aD Dt+bD , where af > 0 and aD > 0.
But f ∗ qs0 = f qs0 = 0 and D∗ 0 = D0 = 0, and thus
bf = 0 and bD = 0. Proof of Theorem 4. By assumption, has a proper cardinal scale, ws t = Dtf qs, where f q and Dt have
the properties in parts (i) and (ii) of Theorem 3. In particular, f q and Dt are proper cardinal scales for the
induced relations s and T . Theorem 3 implies that a function w ∗ s t = D∗ tf ∗ qs is a proper cardinal scale for ,
where f ∗ q and D∗ t have the properties in parts (i) and
(ii) of Theorem 3, if and only if f ∗ q = af f q and D∗ t =
aD Dt for some constants af > 0 and aD > 0. It follows that
the implications to be shown here for the function f q are
independent of those to be shown here for the function Dt.
We will prove only the forward implications because the
converse implications are straightforward to verify.
To show part (i), assume that the relation has cardinal
neutrality for qs. Then, qs is a proper cardinal scale for
S , and thus f ∗ qs = qs − q0 is a proper cardinal scale
for s . Hence, f ∗ q is a positive linear transformation of
the cardinal scale f q mentioned above. But f ∗ qs0 = 0
and f qs0 = 0, and thus f ∗ q = af f q for some constant
af > 0.
The proof of the forward implications in part (ii) is parallel to the above argument with a proper cardinal scale of
the form (1) in the role of the function qs.
To show part (iii), assume that the relation has cardinal timing neutrality. Because the scale, ws t = Dtf q,
is proper, there exists a health state s with f qs = 0.
But cardinal timing neutrality implies that Dt2 f qs −
Dt1 f qs = Dt2 + tf qs − Dt1 + tf qs for any
durations in the interval T . By dividing by f qs, this
equation implies that Dt2 − Dt1 = Dt2 + t − Dt1 + t.
Choosing t2 = t1 + t, it follows that Dt satisfies Jensen’s
functional equation. Thus (as noted in the proof of Theorem 1), Dt is linear, i.e., Dt = at + b, for some constants
a b. But Dt is strictly increasing with D0 = 0, and thus
Dt = at where a > 0.
To show part (iv), assume that has cardinal timing
aversion. By using arguments similar to those above, it follows that Dt2 − Dt1 > Dt2 + t − Dt1 + t for t1 < t2
and t > 0. In particular, with t2 = t1 + t, we have Dt2 >
1
Dt2 − t + 12 Dt2 + t. Hence, Dt is strictly concave.
2
To show part (v), first assume that the relation has constant cardinal timing aversion (I). Then, Dt1 + t f qs1 −
Dt1 f qs1 = Dt2 + t f qs2 − Dt2 f qs2 , for
Harvey and Østerdal: Cardinal Scales for Health Evaluation
279
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
t1 < t2 t > 0, implies Dt1 + t + tf qs1 − Dt1 +
tf qs1 = Dt2 + t + tf qs2 − Dt2 + tf qs2 ,
for t > 0. But there exists a health state s such that
f qs = 0, and thus for any R > 0 there are health states
s1 s2 with f qs2 /f qs1 = R. Choosing these health
states, it follows that
Dt1 + t − Dt1 Dt1 + t + t − Dt1 + t
=
Dt2 + t − Dt2 Dt2 + t + t − Dt2 + t
(14)
for any t1 < t2 t > 0, and t > 0. In Harvey (1998a,
Theorem 5.2), it is shown that if a strictly increasing, continuous function Dt is a solution of this functional equation,
then it is a linear-exponential function. By part (iv), Dt is
strictly concave, and thus it is a negative-exponential function. Because D0 = 0, this implies that Dt = a1 − e−rt for some r > 0 and a > 0.
Next assume that the cardinal relation has constant cardinal timing aversion (II). Then, Dt2 f qs −
Dt1 f qs = Dt4 f qs − Dt3 f qs, for s t1 < t2 , and
t3 < t4 , and thus, Dt2 + tf qs − Dt1 + tf qs =
Dt4 + tf qs − Dt3 + tf qs, for t > 0. Choose a
health state s such that f qs = 0, and choose durations
such that t2 = t3 and t2 − t1 = t4 − t3 . Define t = t1 t̂ =
t2 t = t4 . Then, the above implication states that Dt̂ =
1
Dt + 12 Dt implies Dt̂ + t = 12 Dt + t + 12 Dt + t.
2
Therefore, the function Dt is a solution of the functional
relationship (13), and thus it is a linear-exponential function. By the argument used in the previous case, it follows
that Dt = a1 − e−rt for some r > 0 and a > 0. Proof of Theorem 5. We will obtain Theorem 5 from
the social welfare model in Harvey (1999, Theorem 1), to
be called the SW model. More accurately, we will prove
Theorem 5 by using the proof of the SW model, because
Theorem 5 generalizes of the SW model in an important
respect. We can omit proofs of the converse implications
because they are straightforward to verify.
Suppose that the set C of social consequences in the SW
model is chosen as the product set H1 × · · · × Hn . As noted
in §6, an individual cardinal relation i for a set Hi corresponds to a cardinal relation, to be denoted by SW
i , for the
set C in the SW model. And a population cardinal relation
P for H is the same as a population cardinal relation SW
P
for C. The condition of Pareto agreement here is equivalent
to the conditions (A) and (B) in the SW model.
Consider condition (C) in the SW model. It states in part
that there exist continuous cardinal scales for the relations
i and P . It also states that the set C is connected, a condition that we do not require here. Instead, we assume that
the relations i and P are proper. Thus, we simply assume
the properties in Lemma 1 of the SW model, which state
that SW
and SW
are represented by cardinal scales that
i
P
are continuous with interval ranges. It follows by Lemma 1
in §3 that any cardinal scales wi hi and wP h) have these
properties.
Next, consider condition (D) in the SW model. It states
(in our terminology) that for any health distributions
hi i = 1 n, there is a health distribution h such that
h is indifferent to hi according to i i = 1 n. Here,
this condition is redundant. The reason is that for the
given distributions hi we can define a distribution h =
1
n
i
h1 hn where hi is the ith component of hi . Then,
for each i = 1 n, the health distribution h has the same
ith component as hi , and thus it is indifferent to hi according to i .
In summary, the assumptions in Theorem 5 imply that
the cardinal relations i and P have the properties of the
and SW
that are stated in the Lemma 1 of
relations SW
i
P
the SW model. As the reader can verify, it is these properties rather than conditions (C) and (D) that are used in the
proof of the SW model. Hence, by using that proof of the
SW model, it follows that for any cardinal scales wi hi and
wP h there exist constants ai > 0 i = 1 n, and b such
that
wP h = a1 w1 h1 + · · · + an wn hn + b
Therefore, for any given individual cardinal scales
wi hi i = 1 n, the function, vP h = wP h − b =
a1 w1 h1 + · · · + an wn hn , is a population cardinal scale of
the form (5).
Now we show that the constants ai in (5) are unique up
to a common positive multiple. For any 0 > 0, the function, vP0 h = 0vP h = 0a1 w1 h1 + · · · + 0an wn hn , is a cardinal scale for the relation P . Conversely, suppose that
a function vP0 h = a∗1 w1 h1 + · · · + a∗n wn hn is a cardinal
scale for P . Then, vP0 h = 0vP h + 1 for some constants
0 > 0 and 1, and thus, (a∗1 − 0a1 w1 h1 + · · · + a∗n −
0an wn hn = 1. The cardinal scales wi hi have nonpoint
ranges because the relations i are proper, and thus a∗i =
0ai i = 1 n (and then 1 = 0) because the terms on the
left-hand side can vary independently of one another. Proof of Theorem 6. Because the individual cardinal
relations i i = 1 n, are equal, they have the same cardinal scales. By choosing the scales wi hi i = 1 n, as
such a common scale, Theorem 6 follows as a special case
of Theorem 1. Proof of Corollary 1. Suppose that wi hi ) are individual scales. Then, wi h1 > wi h0 for each i = 1 n
because h1 is preferred to h0 by each individual. Thus,
the functions vi hi = wi hi − wi h0 /wi h1 − wi h0 )
are also individual scales. But vi h0 = 0 vi h1 = 1 for
i = 1 n, and cardinal uniqueness implies that vi hi are
the only scales with such values.
Suppose that vP h = a1 v1 h1 + · · · + an vn hn is a population scale associated with the individual scales vi hi . Then,
for a health distribution h such that hi = h1 for i in Sh)
and hi = h0 otherwise, we have vP h = i∈Sh ai as was to
be shown. Proof of Theorem 7. Properness and cardinal uniqueness imply that there are individual scales wi hi such
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
280
that wi h0 = 0 i = 1 n. Suppose that wP h = a1 w1 h1 + · · · + an wn hn is an associated population scale. Then,
wP h0 h0 = 0.
First, assume condition (a) or condition (c). Then, for
any health outcome h, the indifference (8) or the indifference (9) implies that wP hi = h; hk = h0 k = i = wP hj = h;
hk = h0 k = j, and thus ai wi h = aj wj h. For j = 1, it follows that wi h = a1 /ai w1 h. Thus, w1 h is an individual
scale for every individual cardinal relation. Suppose that
wI h denotes the scale w1 h and that vP h = 01 wI h1 +
· · ·+0n wI hn denotes an associated population scale. By the
previous argument, 0i wI h = 0j wI h for any i j. Hence,
0i = 0j for any i j.
Second, assume condition (b) or condition (d). Then, by
essentially the above argument there is a common individual scale wI hi such that wI h0 = 0. Suppose that wP h =
a1 wI h1 + · · · + an wI hn is an associated population scale.
For any i j, there exists a health outcome h such that h
is nonindifferent to h0 (and thus wI h = 0), and either
the indifference (8) or the indifference (9) is satisfied. Then,
wP hi = h; hk = h0 k = i = wP hj = h; hk = h0 k = j, and
thus ai wI h = aj wI h. Because wI h = 0, it follows that
ai = a j .
It is straightforward to verify the converse implications
that in an equals weights model the conditions (a)–(d) are
satisfied. References
Aczél, J. 1966. Lectures on Functional Equations and Their Applications.
Academic Press, New York.
Alt, F. 1936. Über die Messbarkeit des Nutzens. Zeitschrift für
Nationalökonomie 7(2. Helf) 161–169. Translation in J. S.
Chipman, L. Hurwicz, M. K. Richter, H. F. Sonnenschein,
eds. 1971. Preferences, Utility, and Demand. Harcourt Brace
Jovanovich, New York, 424–431.
Anand, S., K. Hanson. 1997. Disability-adjusted life years: A critical
review. J. Health Econom. 16(6) 685–702.
Bleichrodt, H., P. Wakker, M. Johannesson. 1997. Characterizing
QALYs by risk neutrality. J. Risk Uncertainty 15 107–114.
Bordley, R. F. 1994. Making social trade-offs among lives, disabilities, and cost. J. Risk Uncertainty 9 135–149.
Broome, J. 1991a. Utility. Econom. Philos. 7 1–12.
Broome, J. 1991b. Weighing Goods. Basil Blackwell, Oxford, UK.
Broome, J. 1993. Qalys. J. Public Econom. 50 149–167.
Broome, J. 2004. Weighing Lives. Oxford University Press, Oxford,
UK.
Chobanian, A. V., G. L. Bakris, H. R. Black, W. C. Cushman, L. A.
Green, J. L. Izzo, Jr., D. W. Jones. 2003. Seventh report of
the Joint National Committee on prevention, detection, evaluation and treatment of high blood pressure. Hypertension 42
1206–1252.
Cooter, R., P. Rappoport. 1984. Were the ordinalists wrong about
welfare economics? J. Econom. Literature 22 507–530.
Debreu, G. 1960. Topological methods in cardinal utility theory. K. J.
Arrow, S. Karlin, P. Suppes, eds., Mathematical Methods in the
Social Sciences. Stanford University Press, Stanford, CA, 16–26.
Doctor, J. N., J. M. Miyamoto. 2003. Deriving quality-adjusted life
years (QALYs) from constant-proportional time tradeoff and
risk posture conditions. J. Math. Psych. 47 557–567.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
Dolan, P. 1997. Modeling valuations for EuroQoL health states.
Medical Care 35 1095–1108.
Dolan, P. 2000. The measurement of health-related quality of life
for use in resource allocation decisions in health care. A. J.
Culyer, J. P. Newhouse, eds., Handbook of Health Economics,
Vol. 1. Elsevier Science, New York, 1724–1760.
Dolan, P., D. Kahneman. 2008. Interpretations of utility and their
implications for the valuation of health. Econom. J. 118 215–234.
Dolan, P., C. Gudex, P. Kind, A. Williams. 1996. Valuing health
states: A comparisons of methods. J. Health Econom. 15 209–231.
Dyer, J. S., R. K. Sarin. 1979. Group preference aggregation rules
based on strength of preference. Management Sci. 25 822–832.
Feeny, D. H., W. G. Torrance, W. J. Furlong. 1996. Health utilities
index. B. Spilker, ed. Quality of Life and Pharmacoeconomics in
Clinical Trials, 2nd ed. Lippincott-Raven, Philadelphia, 239–252.
Fisher, I. 1918. Is “utility” the most suitable term for the concept
that it is used to denote? Amer. Econom. Rev. 8 335–337.
Furlong, W., D. Feeny, G. W. Torrance, C. H. Goldsmith, S. DePauw,
Z. Zhu, M. Denton, M. Boyle. 1998. Multiplicative multiattribute utility function for the health utilities index Mark 3
(HUI3) system: A technical report. Working Paper 98-11,
McMaster University Centre for Health Economics and Policy
Analysis, Hamilton, ON, Canada.
Gafni, A., S. Birch, A. Mehrez. 1993. Economics, health, and health
economics: HYEs versus QALYs. J. Health Econom. 12 325–339.
Gold, M. R., J. E. Siegel, L. B. Russell, M. C. Weinstein. 1996. CostEffectiveness in Health and Medicine: Report of the Panel on Cost
Effectiveness in Health and Medicine. Oxford University Press,
New York.
Green, C. 2001. On the societal value of health care: What do we
know about the person trade-off technique? Health Econom.
10 233–243.
Hammitt, J. K. 2002. QALYs versus WTP. Risk Anal. 22 985–1001.
Harris, J. 1987. QALYfying the value of life. J. Medical Ethics
13 117–123.
Harsanyi, J. 1955. Cardinal welfare, individualistic ethics, and interpersonal comparisons of utility. J. Political Econom. 61 309–321.
Harvey, C. M. 1986. Value functions for infinite-period planning.
Management Sci. 32 1123–1139.
Harvey, C. M. 1988. Utility functions for infinite-period planning.
Management Sci. 34 645–665.
Harvey, C. M. 1990. Structural prescriptive models of risk attitude.
Management Sci. 36 1479–1501.
Harvey, C. M. 1995. Proportional discounting of future costs and
benefits. Math. Oper. Res. 20 381–399.
Harvey, C. M. 1998a. Proofs of results in “Value-functions for
continuous-time consequences.” Working paper, Department
of Decision and Information Sciences, University of Houston,
Houston.
Harvey, C. M. 1998b. Value-functions for continuous-time consequences. Working paper, Department of Decision and Information Sciences, University of Houston, Houston.
Harvey, C. M. 1999. Aggregation of individuals’ preference intensities into social preference intensity. Soc. Choice Welfare 16 65–79.
Harvey, C. M. 2008. Preferentially connected sets. Draft, University
of Houston, Houston.
Hasman, A., L. P. Østerdal. 2004. Equal value of life and the Pareto
principle. Econom. Philos. 20 19–33.
Hazen, G. B. 2004. Multiattribute structure for QALYs. Decision
Anal. 1(4) 205–216.
Jensen, J. L. W. V. 1905. Om konvekse funktioner og uligheder
imellem middelværdier. Matematisk Tidsskrift B 49–68.
Jensen, J. L. W. V. 1906. Sur les fonctions convexes et les inégalités
entre les valeurs moyennes. Acta Mathematica 30 175–193.
Johannesson, M. 1995. Quality-adjusted life-years versus healthyyears equivalents—A comment. J. Health Econom. 14 9–16.
Harvey and Østerdal: Cardinal Scales for Health Evaluation
INFORMS holds copyright to this article and distributed this copy as a courtesy to the author(s).
Additional information, including rights and permission policies, is available at http://journals.informs.org/.
Decision Analysis 7(3), pp. 256–281, © 2010 INFORMS
Johannesson, M., J. S. Pliskin, M. C. Weinstein. 1994. A note
on QALYs, time tradeoffs, and discounting. Medical Decision
Making 14 188–193.
Kahneman, D. 2000. Experienced utility and objective happiness: A moment-based approach. D. Kahneman, A. Tversky,
eds. Choices, Values, and Frames. Cambridge University Press,
Cambridge, UK, 187–208.
Kahneman, D., P. P. Wakker, R. Sarin. 1997. Back to Bentham?
Explorations of experienced utility. Quart J. Econom. 112
375–405.
Kaplan, R. M., J. P. Anderson. 1996. The general health policy
model: an integrated approach. B. Spiker, ed. Quality of Life and
Pharmacoeconomics in Clinical Trials, 2nd ed. Lippincott-Raven,
Philadelphia, 309–322.
Kind, P. 1996. The EuroQoL instrument: An index of healthrelated quality of life. B. Spiker, ed. Quality of Life and Pharmacoeconomics in Clinical Trials, 2nd ed. Lippincott-Raven,
Philadelphia, 191–202.
Krantz, D. H., R. D. Luce, P. Suppes, A. Tversky. 1971. Foundations
of Measurement, Vol. 1: Additive and Polynomial Representations.
Academic Press, New York.
Loewenstein, G., P. A. Ubel. 2008. Hedonic adaptation and the role
of decision and experience utility in public policy. J. Public
Econom. 92 1795–1810.
Loomes, G. 1995. The myth of the HYE. J. Health Econom. 14 1–7.
Loomes, G., L. McKenzie. 1989. The use of QALYs in health care
decision making. Soc. Sci. Medicine 28 299–308.
Mehrez, A., A. Gafni. 1989. Quality adjusted life years, utility theory, and healthy years equivalents. Medical Decision Making
9 142–149.
Murray, C. J. L. 1994. Quantifying disability: The technical basis
for disability-adjusted life years. Bull. World Health Organ. 72
429–445.
Murray, C. J. L., A. K. Acharya. 1997. Understanding DALYs.
J. Health Econom. 16 703–730.
Murray, C. J. L., A. D. Lopez, eds. 1996. The Global Burden of Disease,
Vol. 1. School of Public Health and WHO, Harvard University
Press, Boston.
Miyamoto, J. M., P. P. Wakker, H. Bleichrodt, H. J. M. Peters.
1998. The zero-condition: A simplifying assumption in QALY
measurement and multiattribute utility. Management Sci.
44 839–849.
281
Nord, E. 1995. The person-trade-off approach to valuing health care
programs. Medical Decision Making 15 201–208.
Nord, E. 2001. The desirability of a condition versus the well-being
and worth of a person. Health Econom. 10 579–581.
Østerdal, L. P. 2005. Axioms for health care resource allocation.
J. Health Econom. 24 679–702.
Østerdal, L. P. 2009. The lack of theoretical support for using person trade-offs in QALY-type models. Eur. J. Health Econom.
10 429–436.
Pareto, V. 1896. Cours d’é conomie politique. F. R. Lausanne, ed.
Oeuvres Completes, Vol. I, No. 26. Librairie Droz, Geneva, 1964.
Pfanzagl, J. 1968. Theory of Measurement. Wiley, New York.
Pliskin, J. S., D. S. Shepard, M. C. Weinstein. 1980. Utility functions
for life years and health status. Oper. Res. 28 206–224.
Richardson, J. 1994. Cost utility analysis: What should be measured? Soc. Sci. Medicine 39 7–21.
Richardson, J., G. Hawthorne, N. A. Day. 2001. A comparison of the
assessment of quality of life (AQoL) with four other generic
utility instruments. Ann. Medicine 33 358–370.
Scott, D. 1964. Measurement models and linear inequalities. J. Math.
Psych. 1 233–247.
Singer, P., J. McKie, H. Kuhse, J. Richardson. 1995. Double jeopardy
and the use of QALYs in health care allocation. J. Medical Ethics
21 144–150.
Torrance, G. W. 1976. Health status index models: A unified mathematical view. Management Sci. 22 990–1001.
Torrance, G. W. 1986. Measurements of health state utilities for economic appraisal. J. Health Econom. 5 1–30.
Torrance, G. W., M. H. Boyle, S. P. Horwood. 1982. Application of
multi-attribute utility theory to measure social preferences for
health states. Oper. Res. 30 1043–1069.
Torrance, G. W., W. J. Furlong, D. H. Feeny, M. H. Boyle. 1995.
Multi-attribute preference functions: Health utilities index.
PharmacoEconomics 7 503–520.
Torrance, G. W., D. H. Feeny, W. J. Furlong, R. D. Barr, Y. Zhang,
Q. Wang. 1996. Multi-attribute utility function for a comprehensive health status classification system: Health utilities
index mark 2. Medical Care 34 702–722.
von Winterfeldt, D., W. Edwards 1986. Decision Analysis and Behavioral Research. Cambridge University Press, Cambridge, UK.
Williams, A. 1997. Being Reasonable about the Economics of Health.
Edward Elgar, Cheltenham, UK.