INTERGENERATIONAL IMPARTIALITY: REPLACING

Journal of Agricultural and Environmental Ethics (2005) 18: 237–257
DOI 10.1007/s10806-005-1491-8
Ó Springer 2005
YEW-KWANG NG
INTERGENERATIONAL IMPARTIALITY:
REPLACING DISCOUNTING BY PROBABILITY WEIGHTING
(Accepted in revised form December 10, 2004)
ABSTRACT. Intergenerational impartiality requires putting the welfare of future
generations at par with that of our own. However, rational choice requires weighting
all welfare values by the respective probabilities of realization. As the risk of nonsurvival of mankind is strictly positive for all time periods and as the probability of
non-survival is cumulative, the probability weights operate like discount factors,
though justified on a morally justifiable and completely different ground. Impartial
intertemporal welfare maximization is acceptable, though the welfare of people in the
very far future has lower effects as the probabilities of their existence are also lower.
However, the effective discount rate on future welfare values (distinct from monetary
values) justified on this ground is likely to be less than 0.1% per annum. Such
discounting does not compromise environmental protection and sustainability unduly. The finiteness of our universe implies that the sum of our expected welfare to
infinity remains finite, solving the paradox of having to compare different infinite
values in optimal growth/conservation theories.
KEY WORDS: discounting, environmental ethics, impartiality, intergenerational,
intertemporal, probability, sustainable development, welfare
1.
INTRODUCTION
Environmental values, caused either by agricultural or non-agricultural
activities, typically involves costs and benefits long into the future, generations
from now. Thus the issues of intergenerational equity is usually involved. In
particular, should future values be discounted? The use of a high discount rate
favors the present generations at a possibly much bigger cost on the future
generations. Using a low or zero discount rate may be giving undue weights to
the far future. Especially for problems with an infinite time horizon, not using
any discounting may mean the absence of a solution. While even the whole
human species is unlikely to live forever, we do not know when will be our
extinction year. It is true that the Earth may become uninhabitable in less than
two billion years as a gradually warming Sun produces a runaway greenhouse
effect. But despite these, we cannot be absolutely certain that we cannot live
beyond 2 or even 20, 200 billion years. The possible scope of scientific advance
238
YEW-KWANG NG
in just the next few thousand years is unimaginable to us now. Imagine
someone just 3 or 4 hundred years ago. His expectation of what is possible by
the year 2005 would be very, very wide off the mark. Thus, how could we
completely rule out the possibility that, after some thousand or hundred of
thousand years from now, we may no longer have to rely on the Earth, the Sun,
or even the Milky Way for our survival. We may estimate that this is unlikely
even in a million years, but we cannot validly rule it out completely. In view of
this, an infinite time horizon with uncertainty is an appropriate one. On the
other hand, we cannot be absolutely certain of our survival even for the next
century. There is always a very small but positive probability that we may be
destroyed by a celestial collision, a nuclear war, or some causes we may not be
able to imagine. Thus, neither a finite nor an infinite time horizon with certainty is appropriate. Rather, an infinite time horizon with uncertainty is an
appropriate one. In other words, the probability of our surviving into the
future decreases with time but will never quite reach absolute zero for any
finite time.1
Ramsey (1928), the first mathematical analyst of optimal savings, regarded the discounting of future utilities as arising from the ‘‘weakness of
imagination’’ (p. 543) about the future. However, without discounting, the
sum/integral may go to infinity. How does one compare different paths that
all have an infinite present value of utility through time? (For philosophical
discussion of this see Segerberg, 1976; Vallentyne and Kagan, 1997). Ramsey
adopted an ingenious method of minimizing the short-fall from a posited bliss
level of utility. This was later overtaken by the overtaking criterion of Weizsacker (1965). (If the cumulated utility value, i.e., instantaneous utility
integrated to any given time, of stream A overtakes that of stream B at any
finite time and remains higher than that of B forever thereafter, stream A is to
be preferred to stream B.) However, both the Ramsey criterion and the
overtaking criterion are not complete (see Chichilnisky, 1996). Thus, studies
of optimal savings/growth typically use a positive discount rate.
The discounting of the future has been objected to by many ecologists,
economists, environmentalists, and sustainable development analysts (e.g.,
Doeleman, 1980; Broome, 1992; Cline, 1992; Peterson, 1993; Price, 1993;
Beckerman, 1995; Portney and Weyant, 1999; Islam, 2001).2 Many objections to discounting are caused by confusing the discounting of future dollar
1
The problem of intergenerational justice is a complex one (see, e.g., Beekman
(2004) and references therein). In this paper, the problem is viewed from the perspective of appropriate time discouting. Elsewhere (Ng, 2004), I also argue that the
problem of sustainability is more one of the disregard of the welfare of others now in
environmental disruption rather than the disregard of the welfare of future generations.
2
See also Spash (1993) on compensatory transfers.
INTERGENERATIONAL IMPARTIALITY
239
values and the discounting of future subjective values (utility or welfare), as
pointed out by Dasgupta (1995, pp. 119–120). The future generations should
be treated equally as our own generation, at least from the viewpoint of
intergenerational impartiality. However, this need not mean that future
economic values including consumption, incomes, and wealth should not be
discounted. One valid reason to discount future objective economic values
(in contrast to the subjective values like utility or welfare) is based on the
fact that the marginal productivity of capital is usually positive. A dollar of
product now can be invested to become more than a dollar in the future.3
Thus, a dollar now is worth more than a dollar in the future. Equivalently,
a dollar in the future is worth less than a dollar now and should thus be
discounted even if we treat the same unit of subjective utility or welfare as
the same whether it occurs (assumed to happen with perfect certainty) in the
present or in the future. This valid reason for discounting future dollars does
not justify the discounting of future subjective values, which is the concern
of this paper. Nevertheless, the absence of discounting future subjective
values (while allowing for the discounting of future dollar values) may be
sufficient to give rise to paradoxes of no solutions and incompleteness, when
the time horizon is infinite. Even where solutions exist, they may be rather
unacceptable, such as requiring the present generations to save more than
50% of the GNP under plausible parameters (Mirrlees, 1967). Adopting a
finite time horizon without discounting is not acceptable either.4 While the
extinction of our species is possible, we do not know when it may occur.
A definite terminal date is thus not reasonable. For any terminal date T, the
welfare of people who would or might live after T is ignored and the welfare
of people who might live in T minus epsilon is given a full weight. This
abrupt difference in treatment cannot be justified unless we know for sure
that mankind will be terminated precisely at time T. Even for those who
believe that the probability of our surviving 100 trillion years from now is
absolutely zero, so that ignoring the welfare of people more than 100 trillion
years from now is reasonable, it is still unreasonable to give full weight to
the welfare of people 99 or 90 or 9 trillion years from now whose probability
3
Another reason for discounting future dollar values is that a dollar of future
consumption is likely to have lower welfare value as the income levels in the future
will likely be larger. With optimal savings/capital accumulation, the two reasons give
the same discount rate. Newell and Pizer (2003) point out that the discount rate on
dollar values should be significantly lower than that suggested by the current practice
if the future path of the discount rate is uncertain and highly correlated.
4
Apart from the point discussed in the text, it also violates the non-dictatorship of
the present (Chichilnisky, 1996). ‘‘We shall say that a welfare function W. . . is a
dictatorship of the present, if W. . . disregards the utility levels of all generations from
some generation on.’’
240
YEW-KWANG NG
of survival is close to zero or very small. So how could we avoid the above
paradoxes and also avoid the possibly morally unacceptable discrimination
against the future generations. This paper suggests a solution.
The next section sets out a set of reasonable axioms and shows that
intertemporal impartial welfare maximization satisfies these axioms, that the
use of probability (of survival) weighting is consistent with intertemporal/
intergenerational impartiality and, due to the similar cumulative nature of
discounting and the survival probability, probability weighting can be used
in place of discounting to avoid the well-known paradoxes associated with
an infinite time horizon. This avoidance is not necessarily possible even with
probability weighting if the population size may increase. Section 3 argues
that the allowance for a variable population size does not change our
conclusions in Section 2. Despite some possibly misleading appearance at
certain places in the argument and my personal strong support for a utilitarian social welfare function (that the society should maximize the unweighted sum of individual welfares), the argument of this paper is not
based on utilitarianism.
2.
THE IMPARTIALITY OF PROBABILITY-WEIGHTED
WELFARE MAXIMIZATION
Consider different states of the world at various time period
t ¼ 0; 1; 2; . . . ; 1. As we may define the relevant time period as fine as we
like (instead of year, we may use second or micro-second), this periodic
treatment of time is not really restrictive. Moreover, making the time period
instantaneous and hence going into the continuous time approach only involves replacing the various summations below by integrations, without
changing the substance. For the present (i.e., for t ¼ 0), we start with a given
state x0 . (Starting with t ¼ 1 makes no difference. The state x need not be
confined to consumption and may include all relevant factors including the
stock of natural capital; cf. Heal, 1998.) For any given time t in the future
(i.e., for t > 0), alternative states xst may prevail. Let the number of alternative states that may prevail at time t be nt . Let the welfare value (or
expected welfare value if uncertainty is present) associated with x st be denoted as wðxst Þ. It is not necessary to assume that, given the state xst , the
welfare value wðxst Þ must also be known with certainty. For each xst , we may
have a probability distribution
of the different values of welfare v associated
R1
with it. Then wðxst Þ ¼ 1 fðvÞ:vðxst Þdv is the expected welfare of xst . (This is
equals to the sum over all possible states of the welfare value in each possible
state weighted by the probability of that state.) Let the probability associated with xst be denoted pst . Then the (unconditional) expected welfare at t is
INTERGENERATIONAL IMPARTIALITY
wut
¼
nt
X
pst :wðxst Þ;
241
ð1Þ
s¼1
Pt s
where ns¼1
pt ¼ 1 and the superscript u to wt just indicates that it is the
unconditional value. (See below on the conditional value.)
Some of the nt states xst at any given t may involve non-survival of the
human species. We may thus define the expected welfare at t conditional
upon our survival as
a
wt ¼
nt
X
s¼1
a
pst :wðxst Þ=
nt
X
pst ;
ð2Þ
s¼1
where the convention of listing all those states involving our survival first is
used, with nat standing for the number of states we do survive at period t (a
for ‘alive’). In other words, given the condition that we do survive at period
t, our expected welfare is given by wt . If the probability of our non-survival
at period t (as seen from time 0) is strictly positive and wt is strictly positive,
wt > wut if Axiom 1 below is accepted.
Axiom 1 (Zero welfare of non-survival): If xst involves our non-survival,
wðxst Þ ¼ 0:
This axiom is compelling if we agree that we are doing welfare calculus
for this human world. With the welfare of non-human beings (including
animals, on which see Ng, 1995, and life after death, on which I have no
expertise) not considered, if we (i.e., the whole human species) do not survive to period t, our welfare at period t must be zero.
It may be noted that this axiom does not imply that the welfare of people
who do survive is necessarily positive. If their welfare is negative, it can be
treated accordingly.
It is now well established that no reasonable social choice involving
interpersonal (including inter-generational) tradeoffs could be made without
using interpersonally comparable cardinal utility/welfare levels. (See Mueller, 1989, chapter 19 and Ng, 2000, chapter 2 that discuss the results of Sen,
1970 on the necessity of interpersonal comparisons and Kemp and Ng, 1976;
Parks, 1976; Roberts, 1980 and others on the necessity of cardinal utility).5
Most welfare economic analysts accept what Bergson (1938) calls ‘‘the
5
In fact, we need what may be called fully cardinal utility/welfare levels, i.e., ratioscale measurability (where only the unit of measurement used is arbitrary) rather
than just functions unique up to increasing affine (linear) transformations that do not
have a well-defined zero point. It may also be noted that it is only economists
unfamiliar with the necessity of interpersonal cardinal utility who have hesitation/
objection to its use. Philosophers (e.g., Vallentyne and Kagan, 1997, p. 7) use
interpersonal cardinal welfare indices with well-defined zeros readily.
242
YEW-KWANG NG
fundamental value premise of individual preference’’ (called welfarism by
John Hicks and Amartya Sen) that the objective variables are only important, at least ultimately speaking, only through their effects on individual
preferences (represented by utilities) or welfare (the differences between
utility and welfare are ignored here; on which see Ng, 2000, chapter 4).
Then, without some interpersonal comparisons of utility (representing
preference) or welfare, we cannot say, e.g., that this generation has not left
enough natural or man-made capital for the future generations. No intertemporal (or interpersonal at a given point in time for that matter) ethics is
really possible. In fact, this is still largely true even if we accept welfarism
only partially; that welfare is important, even if not the only important
thing. Thus, I have no need to apologize for the use of cardinal welfare (also
used by other analysts in this area) in the next axiom. (Ordinalism is however sufficient for the objective theories of consumption, demand, and
general equilibrium.)
Axiom 2 (Intertemporal parity): Once expressed in the interpersonally comparable cardinal indices, the welfare level at all times should be treated at par
with that at any other.
This requires that the welfare of people in say the 300th century
should not be treated as less (or more) important than the welfare of
people now. This axiom is sufficient to ensure intertemporal (including
inter-generational) equity. The only qualification to this axiom is that, if
the welfare level refers to representative welfare of an individual, a
complication may arise if the number of people at different times differ.
We will have to find some aggregate welfare measure for a changing total
population that is intertemporally comparable. But this may be regarded
as a separate problem of how to evaluate social welfare when the population is changing, a problem really beyond the main focus of this
paper. Social optimality involving different sets of population raises wellknown paradoxes. As I have discussed these in Ng (1989) (which argues
in favor of classical utilitarianism maximizing the equally weighted sum
of individual welfares at the ideal moral level but allowing for our possible bias in favor of our own welfares; but see Blackorby et al., 1997 and
Carlson, 1998 on other attempts at the problem and the associated difficulties), I will abstract from this issue by ignoring possible changes in
population size here. (Section 3 shows that that allowing for a variable
population size does not affect our conclusions.) In this section, we take
the population size as unchanged until the time of extinction when the
population size abruptly drops to zero. This makes Axiom 2 compelling
INTERGENERATIONAL IMPARTIALITY
243
as the same welfare value of the same number of people should be
treated equally, as otherwise there is no intertemporal parity.
Axiom 3 (probability weighting): In the presence of uncertainty, any welfare
level should be weighted by the probability of its realization.6
This is just the requirement of rationality for choices involving uncertainty and is familiar to all economists and decision theorists as it is used in
the calculation of expected utility of any action where the utility of any
possible state has to be multiplied by the probability of realization of that
state. This is also done at the beginning of this section for the calculation of
expected welfare and is widely accepted as a compelling condition for rational choice.7 A commentator pointed out that, when some outcomes involve some very bad results, they may be avoided irrespective of the
expected gain. For example, insurance companies do not sell insurances if it
means that the company would go bankrupt should the worse outcome be
realized, no matter how improbable it is. Since bankruptcy probably means
the unemployment of most of the employees of the company and since
unemployment has very high disutility or negative welfare (Winkelmann
and Winkelmann, 1998), the above practice of many insurance companies
may be a good rule-of-thumb method to approximate expected welfare
maximization. Similarly, some environmentalists may find it morally
appealing to eliminate alternatives that involve extinction or terrible suffering for the future generations. If my argument below that the annual risk
of extinction that is beyond our control is very small for a long time to
come, the impartial maximization of expected welfare does not differ significantly from this somewhat overprudent (from the viewpoint of impartial
expected welfare maximization) policy. In other words, both policies would
imply that measures that could (even at very low, though not vanishing,
probability) cause enormous harm or extinction should be avoided even at
very big costs on the present generations.
6
A reviewer of this paper suggests that the essence, though not the precise
statement, of this axiom may be traced to Kant who wrote ‘‘human nature is such
that it cannot be indifferent even to the most remote epoch which may eventually
affect our species, so long as this epoch can be expected with certainty’’ (see Reiss,
1970/1991, p. 50).
7
Some economists regard the use of expected utility as just a convention and has
no normative significance and that the utility indices used need not be the subjective
cardinal utility of the Neoclassical economists. Under some compelling axioms, I
have shown the normative significance and the subjective (fully) cardinal utility/
welfare nature in Ng (1984).
244
YEW-KWANG NG
Axiom 4 (Positive risk): The probability of our non-survival at any future time
is strictly positive.
This is reasonable for both an individual and the whole species; the
axiom here is meant for the species. For an individual, she may be certain
of her survival now. However, for the future, no one can be sure for
certain. Fatal accidents may strike at any time. Consider the victims in
the World Trade Center on 11 September 2001. The same is true for our
species as celestial collisions and other unexpected events may happen,
though the time scale involved is much larger and the probability of nonsurvival is much smaller. The probability of my non-survival 100 years
from now is close to 100%, at least over 90%, but the same for the
whole human species is close to zero (but strictly positive), at least less
than 10%.
Axiom 5 (Non-absolute expected risk): The expected risk of our non-survival
at any year, given survival to the previous year, is smaller than one.
While the risk of non-survival for any given non-degenerate (i.e., not
just a point in time but a discrete period like a year) period is strictly
positive as required in the previous axiom, it is compelling that it is not
absolute in the sense of absolute certainty of non-survival for a particular
year, even given survival to the previous year. It is true that, given the
likely non-perpetual survival, we will become extinct in some future year.
However, we do not know when this particular year will be. Moreover,
we cannot rule out the (even if only slight) possibility of our future
ingenuity in postponing extinction very dramatically such as by moving
to another planet in another solar system. Thus, in terms of expected risk,
this axiom is compelling.
Axiom 6 (cumulative risk): The probability of non-survival is cumulative over
time.
If the probability of my not surviving the first decade from now is 10%
(I hope that this is an exaggeration for ease of calculation rather than
reflecting the true probability), the probability of my surviving this first
decade is 90%. Given that I do survive the first decade, suppose that the
probability of my not surviving the second decade would be 20% and my
survival probability would be 80%. However, looking from now before we
know whether I will survive even the first decade or not, my surviving the
second decade is only 90% times 80% ¼ 72% or my non-survival probability is 28%. This is what is meant by the cumulative nature of
INTERGENERATIONAL IMPARTIALITY
245
non-survival. For the simple case of a constant rate of per-period nonsurvival risk of r, the probability of survival decreases with time t at ð1 rÞt .
(For the instantaneous/continuous case, it is ert .) However, the cumulative
nature of non-survival is general and applies also to variable rates of nonsurvival risk.
Figure 1
For the next axiom, we need a definition of semi-constant or nondecreasing risk (of non-survival). The case of constant risk r is a special case
of semi-constancy in risk. However, semi-constant or non-decreasing risk is
more general and allows the risk to vary with t but with the probability of
survival eventually decreases to no larger than the case of constant risk for
some positive and constant risk r > 0. In Figure 1, curve A depicts the
probability (measured along the vertical axis) of survival (expected at time 0)
to the respective time (on the horizontal axis) for a case of constant risk.
Curve B depicts a case of non-constant risk but with the probability of
survival eventually decreases to a level no higher than A after some point
t ¼ T and remains lower through to infinity. This is called semi-constant or
non-decreasing risk.
246
YEW-KWANG NG
Definition. If the probability of survival decreases (not necessarily at a
constant rate) and eventually stays no higher than the probability of survival
of a constant (positive) risk at all time after some T through to infinity, it is
called a case of semi-constant or non-decreasing risk.
Axiom 7 (semi-constant or non-decreasing risk): The risk of non-survival is
semi-constant or non-decreasing with time.8
First, consider the non-survival risk faced by a typical individual. After
the (life) birth of an individual, the probability that she is alive right then
is 100%. The probability that she will die within a year is the infant
mortality rate. The probability of death at any period is strictly positive
but very small between the age of a few years old and middle age. After
about age 60 or 70 (depending on the public health condition of the
country concerned), the risk increases significantly. Looking at the time of
birth, the probability of survival to the respective future age/time decreases strictly throughout due to the positive risk of death at any time.
(See Figure 2.) However, the rate of decrease may increase or decrease.
Typically, this rate is very high soon after birth but decreases to a very
low figure around youth. It increases after middle/old age. The absolute
risk is likely to decrease again at some very old age, due to the fact that
when the survival probability (expected at birth) has decreased to a very
low figure, say 5% for age 90, there is not much scope for further
absolute decrease. Even if, given survival to age 90, there is as high as
20% of non-survival in the year of age 90, the absolute decrease is only
1% point from 5% to 4%. In contrast, the absolute decrease at age 75
could be 4% points from 40% to 36%, implying however a lower relative
decrease of just 10% in comparison to the relative decrease of 20% at age
90. Put it differently (though not strictly equivalently except under the
additional assumption of no changes in age structure and the profile of
life expectancy), an average person aged 90 is more likely to die within a
year than a person aged 75, but there are more people dying at aged 75
than at age 90 simply because there are less 90 years-old around. (The
figures are all hypothetical, though selected to be reasonable.) For our
purpose here, it is the relative (or conditional) risk that is relevant that is
likely to keep increasing with age after middle age. Although the rate of
absolute decrease in the survival probability decreases at very old age, the
probability of survival still decreases very fast, approaching zero well
before the age of 200. Thus, the risk faced by an individual satisfies
Axiom 7.
8
Axioms 5 and 7 are needed only for Proposition 4.
INTERGENERATIONAL IMPARTIALITY
247
Figure 2
For the whole human species, the risk of non-survival is much smaller
than that of an average (in fact any) person. Whether that risk increases or
decreases is also debatable and a general pattern like that for an average
individual discussed above may be absent. In the absence of more specific
knowledge except that the risk is strictly positive at any period and rather
small, the expectation that the risk is roughly constant may be acceptable,
making the survival probability decreases at ð1 rÞt or ert as already noted.
This is represented by curve A in Figure 1 and is taken as a special case
satisfying Axiom 7. However, our requirement is more general than this,
allowing cases like curve B, which involves some sections of decreasing risk
but not strongly and persistently enough to make the curve staying above any
curve of constant positive risk throughout (i.e., through to infinity). In reality,
one may argue that we will sooner or later be faced with increasing risk when
our sun is expected to run out of energy, though this is likely to occur only
billions of years away. Thus, while we cannot rule out some periods of
decreasing risk, the decrease in risk cannot be expected to be persistent enough to make Axiom 7 inapplicable. Thus, Axiom 7 is compelling.
Axiom 8 (Finite welfare): The welfare level at any time is finite.
Common sense and biological limits suggest that no one can have infinite
welfare. Under our current premise of a constant population size (but see
248
YEW-KWANG NG
the next section for a variable population size), it is also compelling that for
the whole species, the welfare level is also finite at any one time.. Moreover,
allowing for infinite welfare gives rise to paradoxes (including the wellknown St. Petersburg paraxdox; see Arrow, 1964). Other analysts (e.g.,
Chichilnisky, 1996; Vallentyne and Kagan, 1997, p. 7) also reasonably assumes finite utility/welfare (or ‘‘goodness’’) levels.
Axiom 8 allows us to follow Chichilnisky in normalizing welfare levels to
be bounded above by a constant that, without loss of generality, is taken to be
one. For example, if we take the upper welfare limit to be 10 to the power of
999,999,999,999 quintrillion times that of our present welfare value (assumed
positive and non-infinitesimal; or replace ‘‘our present welfare value’’ by ‘‘the
highest welfare value that we would presently enjoy if everyone on earth is as
happy as the happiest person’’), we have the slight inconvenience of having to
use small numbers in the order of 0.000 . . . (999,999,999,999 quintrillion of
them) 1 to describe the size of our present welfare. However, conceptually, this
is not a problem at all. If one is unsure that a welfare limit that large is
unsurpassable, one may just replace the ‘‘quintrillion’’ above by ‘‘quintrillion
to the power of [quintrillion to the power of (quintrillion to . . . and so on for a
quintrillion times’’ and also use the original English (where a billion is a
million millions and a trillion is a billion billions and so on) rather than the
American (where a billion is a thousand millions) definition of quintrillion.
And of course we may complicate the above process for another quintrillion
times, if necessary to satisfy skeptics, without reaching infinity.
Axiom 9 (Welfare relevance): The welfare levels now and in the future are
relevant to our choice.
There are economists and philosophers (e.g., Amartya Sen, being both)
who regard that there are principles important over and above and independent of their contributions to welfare, while I regard welfare as ultimately the only important thing (Ng, 2000, chapter 3). Injustice is the denial
of due welfare. In a short-term analysis, the allowance for other things is
important as the violation of certain desirable principles now may be detrimental to welfare in the longer run. In our present analysis of an infinite
time horizon, ultimate exclusive concern with welfare is defensible. However, Axiom 9 only requires some, rather than exclusive, concern with
welfare and hence is non-controversial. (The proofs of all propositions below are provided in Appendix A.)
Proposition 1 (Intertemporal impartial welfare maximization). Where nonwelfare considerations are unchanged, the criterion of maximizing the unweighted or equally weighted sum/integral of (unconditional and comparable)
welfare
249
INTERGENERATIONAL IMPARTIALITY
W¼
1
X
wut
ð3Þ
t¼0
satisfies the above set of axioms.
Proposition 2 (Impartiality of probability-weighted welfare maximization).
The maximization of (3) is equivalent to the maximization of
W¼
1
X
ð4Þ
pt :wt
t¼0
where wt is the (conditional) welfare defined in (2) and pt probability of our survival at t (expected at t = 0).
Pnat
s¼1
pst is the
The above two propositions means that weighting future welfare levels
by the (lower) probabilities of their realization is consistent with impartiality
between the present and the future generations.
Proposition 3 (Discounting replaced by probability weighting). We may
achieve the ‘‘effective’’ discounting of future welfare levels by the appropriate
use of probability weighting without violating the impartiality between the
present and the future.
Proposition 4 (Insignificance of the very distant future). In intertemporal
impartial welfare maximization (3) or (4), there exists a sufficiently large
positive integer N such that the welfare levels of people at and after year N
from now have no significant effects on current decisions.
Insignificance here does not mean irrelevance. (Thus, Proposition 4 does
not violate Axiom 9.) Where the welfare comparison for the periods before
N happens to be virtually identical, the welfare levels of people at the year N
and thereafter forever do affect the relative size of intertemporal sums of any
two welfare streams. Thus, the welfare of people very far in the future is still
relevant. If it is not relevant, it will not affect such comparisons.
Some conservationists/environmentalists may find Proposition 4 disturbing as it implies that there exists a large number N such that the
welfare of people after year N from now should not affect current decisions significantly. However, this N could be extremely large. In fact, if
my argument that the probability of our extinction in any century in the
next several hundreds if not thousands of centuries will be very low, N
should be extremely large. Provided environmentalists continue to work
to enlighten voters and governments in the world to achieve international
cooperation for greater environmental protection (on a way to help
achieve this, see Ng and Liu, 2003), we are very unlikely to face a high
250
YEW-KWANG NG
probability (but definitely positive) of extinction for a long time to come.
Nevertheless, however small the expected probabilities, they cumulate to
become a large figure after a long time. Since our solar system and even
our universe will not last forever, this is what it should be. We may live
for a trillion (for the optimists, replace that with quintrillion in the
English sense) years, but the probability of that is rather small. If the
welfare of people quintrillion years from now has a very low chance to be
realized, it is proper that it should not significantly affect current decisions
that have significant effects on the welfare of current people (who certainly exist) and people in the nearer future (who will more likely exist).
Environmentalists should worry more about the environmental disruption
that may threaten our survival or impose great suffering or welfare forgone even just in the next few decades, centuries, millenniums, and possibly millions of years rather than about the welfare of people trillions of
years from now after we might have possibly migrated to other planets in
other solar systems or universes. The former more important concern is
not compromised by Proposition 4. For example, consider the concrete
example of the storage of nuclear waste that will likely leak after X years.
If the probability that there will be people X years from now to suffer the
leakage is not negligible, we should take that expected costs seriously,
though some allowance could be made for the possible development of
methods to deal with the leak before X years. If X = one quintrillion
such that the probability that there will be people then to suffer the leak
is negligible, we should not be too much concerned.
3.
THE CASE OF A VARIABLE POPULATION SIZE
For simplicity and to concentrate on the pure intertemporal problem, the
previous section abstracts away the complication of a variable population
size. This makes Axiom 8 (Finite Welfare) clearly compelling. This section
discusses informally the more general case where the population size may
change.
If the population size increases, the acceptability of Axiom 8 may be
suspect. In particular, if the population size increases exponentially at a rate
higher than the rate of risk of extinction (i.e., non-survival), then the expected welfare may increase with time. This will in fact be so if (but not only
if) the welfare of the representative individual is non-decreasing and the
total or aggregate welfare evaluation function is classical utilitarian. The
classical utilitarian welfare function w (dropping the time subscript) is
the equally weighted (or unweighted) sum (over all individuals) of individual
welfares and equals the representative individual welfare value wr (where r
INTERGENERATIONAL IMPARTIALITY
251
stands for ‘‘representative’’) times the number of individual np (p for population), or w ¼ np wr . Of course, classical utilitarianism need not be the only
objective function acceptable (I strongly argue in its favor in Ng (2000,
chapter 5). However, the basic arguments of this paper is not based on this
utilitarian ethics.) However, this is the one that is most likely (out of all
reasonable ones proposed) to cause the violation of Finite Welfare, our case
of defending Axiom 8 using it as a reference point, if valid, must apply a
fortiori if some other reasonable aggregate welfare evaluation function is
used instead, as discussed later.
If w increases (whether it comes from the increase in population size np
or in the representative welfare wr or a combination, and whether under
the classical utilitarian aggregate welfare evaluation function or some
other variant) at a rate higher than the rate of extinction risk, even the
probability-weighted (or risk-discounted) welfare value increases with
time. Then, the summation of wt (with the time subscript put back)
through to time infinity must then still gives us an infinite value, failing to
provide a solution to the paradox of having to choose between different
infinite values. In fact, even if the probability-weighted welfare value does
not increase but just remains constant or even just fails to decrease sufficiently quickly with time, an infinite value may still be involved. If either
np or wr increases indefinitely with time, then welfare w need not be finite
and our convention of taking a finite value as an upper bound for w can
then not be justified and Axiom 8 (Finite Welfare) is not acceptable. We
thus have to defend Finite Welfare even in the presence of a variable
population.
Our case rests mainly on the observation that our limited world cannot
sustain an infinite number of individuals. Not only is our Earth finite (in
mass, space, and time), our Solar System and even our universe is also
finite. Thus, even if we could eventually colonize other planets, our
number cannot be infinite. Even using the classical utilitarian welfare
function w ¼ np wr , if neither np nor wr can be infinite, w must also be
finite. That the finiteness of the universe implies the finiteness of the
population size is clear; it in fact also implies the finiteness of wr , the
representative level of welfare. Our ability to enjoy is related to the size
and complexity of our brain, which is also limited by the finiteness of our
universe, even allowing for the widest dreams of genetic engineering and
brain stimulation (on the latter, see Ng, 2000, App. A; on the finiteness of
wr , see also the discussion after Axiom 8 in the previous section).While w
may increase at a rate higher than the rate of extinction risk, making even
the probability-weighted welfare value increase with time (in fact, I firmly
believe that this is the case now and for the fairly long foreseeable future,
thanks mainly to the advancement of knowledge, but partly based on the
252
YEW-KWANG NG
possibly debatable presumption9 that countries in the world will eventually
come together to seriously tackle the problems of environmental
protection), this increase cannot be maintained at an undiminished rate
indefinitely simply because of the finiteness of our universe. Moreover,
since we will eventually become extinct, the rate of our extinction risk will
eventually surpass that of the rate of increase in welfare, even if still
positive, making the expected welfare eventually declining. Although, as
discussed in Section 2, we do not know now when our extinction risk will
eventually increase dramatically and may continue to use a model of a
constant rate of extinction risk, the observation above means that, at least
for the analysis of very long-term issues (only for such issues could the
paradox of different infinite values arise), if the rate of extinction risk,
population growth rate, and the rate of increase in the representative
welfare are chosen that make the probability-weighted welfare sum to
infinity, the result is inconsistent with the finiteness of our universe and
our welfare. Some of these rates must be revised to give a finite sum. In
particular, the rate of population growth (or increase in representative
welfare) could be too high for long-term sustainability or the rate of
extinction risk too low. Only combination of values that give a finite sum
could be (though not necessarily be) valid. With this understanding, it
means that any formulation that gives an infinite value is not acceptable.
With this result, the paradox of having to compare different infinite values
is solved.
To see that the finiteness of w must apply with stronger force if some
non-classical utilitarian aggregate welfare evaluation function is used
instead, consider the critical-level utilitarianism of Blackorby and Donaldson (1984; see also Blackorby et al., 1997) where the number of individuals
times the average excess of individual welfare over a positive critical level is
where W
is the given positive critical level.
maximized, i.e., w ¼ np ðwr WÞ
For the case of classical utilitarianism, w increases with np at the rate wr . For
the case of critical-level utilitarianism, w increases with np at the rate
which is smaller than wr . In fact, w is always of a lower value
ðwr WÞ,
under critical-level utilitarianism than under the classical utilitarianism.
Thus, if welfare w is finite under the classical utilitarian aggregate welfare
evaluation function, it must be a fortiori also finite under the critical-level
utilitarian one. Now consider another possible revision (see Ng, 1984, who
is, however, not in favor of it at the level of ideal morality) of classical
utilitarianism that dampen down the effect of number by having a
9
This presumption is particularly challengeable in view of US’s refusal to sign the
Kyoto agreement, which, though imperfect, in an important step in the right
direction.
INTERGENERATIONAL IMPARTIALITY
253
‘‘dampening’’ function on np , making the aggregate welfare evaluation
function w ¼ fðnp Þwr where f is increasing but concave in np with f 0 <; ¼ 1
everywhere. Then, again, an increase in np increases w less than in the case of
classical utilitarianism, especially at large np . Again, if welfare w is finite
under the classical utilitarian aggregate welfare evaluation function, it must
be a fortiori also finite under the number-dampened utilitarianism. In fact,
even if we were to go the other way (though no one is in favor of that) and
adopt an ‘‘expanding’’ function on number, w would still be finite, provided
that the expanding function reasonably does not give an infinite value to a
finite np (If it does, what value could it then give to an even higher np ?).
From the above discussion, it may be seen that, even when we allow the
population size to be variable, our arguments, including the various propositions, in Section 3, remain valid.
4.
CONCLUDING REMARKS
The main points and implications of our arguments may be briefly summarized.
Impartiality requires putting the welfare of future generations at par
with that of our own. However, rational choice requires weighting all
welfare values by the respective probabilities of realization.
As the risk of non-survival of mankind is strictly positive for all time
periods and as the probability of non-survival is cumulative, the
probability weights operate like discount factors (and the rates of
extinction risk like the discount rates), though justified on a morally
justifiable and completely different ground than that of using discount
rates.
Recognizing the finiteness of our universe in mass, space, and time, and,
hence, the finiteness of our population size, welfare level, and life (of
mankind) expectancy, even though our expected welfare has to be
summed to time infinity due to our lack of information on our extinction
time, the sum of expected welfare to infinity must remain finite, effectively solving the paradox of having to compare different infinite values
in the theories of optimal growth, capital accumulation, and resource
conservation.
Although a morally valid ground for effective discounting of future welfare
values is possible, the per annum discount rate should be very small, as the
probability of our not surviving the next one hundred years is likely to be
much less than 10%, the per annum effective discount rate or probability
adjustment factor should be much less than 0.1%. (However, note that this
254
YEW-KWANG NG
is the discount rate on future welfare, not on future incomes, consumption,
or other dollar values the discount rate on which is more related to the
appropriate rate of interest, as discussed in Section 1.) With such a low rate,
the objective of environmental protection and sustainability will not be
unduly compromised.
APPENDIX
A
Proof of Proposition 1. None of the above axioms are violated by (3) and in
particular Axiom 2 (intertemporal parity) is satisfied by the equal weighting
in (3).10
Proof of Proposition 2. From (4) and (2), we have
W¼
1
X
pt :wt ¼
t¼0
1
X
"
pt
a
nt
X
s¼1
t¼0
#
a
pst :wðxst Þ=
nt
X
pst
;
s¼1
which gives, from the definition of pt ,
W¼
" na
1 X
t
X
t¼0
#
pst :wðxst Þ
which, from Axiom 1, equals
W¼
"
nt
1 X
X
t¼0
;
s¼1
s¼1
#
pst :wðxst Þ
¼
1
X
wut ;
t¼0
where the last equality follows from (1).11
Proof of Proposition 3: Due to the cumulative nature of the probability
of the risk of non-survival (Axiom 6) and the positive risk (Axiom 4), the
probability weights act like the discount factor, decreasing over time at the
rate of non-survival risk. In particular, for the specific case of a constant
risk, it decreases exponentially, being ð1 rÞt or ert where r is the perperiod or instantaneous rate of non-survival risk.
Proof of Proposition 4: From Axiom 3, the welfare levels at all times have
to be weighted by the probability of their realization. From Axiom 4, this
probability decreases at a positive rate at all times. From Axiom 8 and the
normalization wt 1, it is obvious that, for the special case of a constant
10
Axiom 2 requires parity but does not specify a specific form of parity. The
equally weighted sum is clearly a form of parity. For the argument that this is in fact
the appropriate form of parity, see Ng, 2000, chapter 5. Here we do not have to agree
on this specific form by agreeing to take (2) as only a, instead of the, form satisfying
parity.
11
Note that both (1) and (2) are justified by Axiom 3.
INTERGENERATIONAL IMPARTIALITY
255
risk, for every positive e, there exists some t ¼ N, where N is some sufficiently large integer, such that the following is true:
1
X
ð5Þ
pt :wt < e
t¼N
From Axiom 5 (non-absolute risk), pt for t N are strictly positive,
meaning that the existence of people at and after N is possible. From Axiom
7, (5) remains true for the more general cases.12 Now, consider comparisons
where non-welfare factors are unchanged. (This qualification is not needed
for those accepting welfarism.) Then, (5) means that unless the welfare
comparison for the periods before N happens to be virtually identical, the
welfare levels of people at the year N and thereafter forever do not affect the
relative size of intertemporal sums of any two welfare streams and hence do
not affect current decisions based on such comparisons. Since the probability weighted welfare levels for period N and after sum to an extremely
small number, the effect on current decisions is also insignificant.
ACKNOWLEDGMENT
The motivation for writing this paper is sparked by my drafting of the theme
paper on welfare economics and sustainable development with Dr. Ian Wills
for the Encyclopedia of Life Support Systems by UNESCO. I am also
grateful to an anonymous referee for some helpful comments.)
REFERENCES
Arrow, K. J., Aspects of the Theory of Risk-Bearing. Yrjö Jahnsson Lectures, Yrjö
Jahonssonin Säätio, Helsinki, 1964.
Beckerman, W., ‘‘How Would You Like Your ‘Sustainability,’ Sir? Weak or Strong?
A Reply to My Critics,’’ Environmental Values 4(2) (1995), 169–179.
Beekman, V., ‘‘Sustainable Development and Future Generations,’’ Journal of
Agricultural and Environmental Ethics 17(1) (2004), 3–22.
12
Without Axiom 7 or something similar to replace it, the existence of the required
N in the proof of Proposition 4 is not assured. For example, if the utility/welfare level
increases asymptotically towards its upper bound (or remain at this upper bound)
and the discount factor (probability weight or probability of survival here) decreases
asymptotically towards a finite value, e.g., 0.1, the required N does not exist.
Chichilnisky (1996, p. 253) uses a different but similarly effective assumption that the
discount factors, when summed to infinity, is still less than infinity, in place of our
Axiom 7. We could also use this assumption in place of Axiom 7. However, I find
Axiom 7 intuitively clearer in its meaning.
256
YEW-KWANG NG
Bergson (Burk), A., ‘‘A Reformulation of Certain Aspects of Welfare Economics,’’
Quarterly Journal of Economics 52 (1938), 310–334.
Blackorby C., W. Bossert, and D. Donaldson, ‘‘Critical-level Utilitarianism and the
Population-Ethics Dilemma,’’ Economics and Philosophy 13 (1997), 197–230.
Blackorby C. and D. Donaldson, ‘‘Social Criteria for Evaluating Population
Change,’’ Journal of Public Economics 25(1–2) (1984), 13–33.
Broome, J., Counting the Cost of Global Warming (White Horse Press London,
1992).
Carlson, E., ‘‘Mere addition and the two trilemmas of population ethics,’’ Economics
and Philosophy 14 (1998), 283–306.
Chichilnisky, G., ‘‘An Axiomatic Approach to Sustainable Development,’’ Social
Choice and Welfare 13 (1996), 231–257. Reprinted in G. Chichilnisky (ed.),
Mathematical Economics. Vol 2 (Elgar Cheltenham, UK and Northampton, MA,
1998).
Cline, W. R., The Economics of Global Warming (Institute for International Economics Washington DC, 1992).
Dasgupta, P., ‘‘Optimal Development and the Idea of Net National Product.’’ In
Goldin and Winters (eds.), The Economics of Sustainable Development (Cambridge
University Press Cambridge, 1995).
Doeleman, J. A., ‘‘On the Social Rate of Discount: The Case for Macroenvironmental Policy,’’ Environmental Ethics 2 (1980), 45–58.
Heal, G., ‘‘Interpreting Sustainability.’’ In G. Chichilnisky, G. Heal, and A. Vercelli
(eds.), Sustainability: Dynamics and Uncertainty (Kluwer Academic Publishers
Dordrecht, 1998).
Kant, I., ‘‘Idea for a Universal History with a Cosmopolitan Purpose.’’ In Hans
Reiss (ed.), (1970/1991), Political Writings (Cambridge U Press, Cambridge, 1979).
Kemp, M. C. and Y.-K. Ng, ‘‘On the Existence of Social Welfare Functions, Social
Orderings and Social Decision Functions.’’ Economica 43 (1976), 59–66.
Islam, S. M. N., ‘‘Ecology and Optimal Economic Growth.’’ In M. Munasinghe, O.
Sunkel, and C. de Miguel (eds.), The Sustainability of Long-term Growth (Edward
Elgar Cheltenham, UK, 2001).
Mirrelees, J. A., ‘‘Optimal Growth When Technology is Changing,’’ Review of
Economic Studies 34 (1967).
Mueller, D., Public Choice II (Cambridge University, Press Cambridge, 1989).
Newell, R. G. and W. A. Pizer, ‘‘Discounting the Distant Future: How much do
Uncertain Rates Increase Valuations?’’ Journal of Environmental Economics and
Management 46 (2003), 52–71.
Ng, Y.-K., ‘‘Expected Subjective utility: Is the Neumann-Morgenstern Utility the
Same as the Neoclassical’s?’’ Social Choice and Welfare 1 (1984), 177–186.
Ng, Y.-K., ‘‘Social Criteria for Evaluating Population Change: An alternative to the
Blackorby-Donaldson criterion,’’ Journal of Public Economics 29(3) (1986), 375–
381.
Ng, Y.-K., ‘‘What Should We Do about Future Generations? The impossibility of
Parfit’s theory X,’’ Economics and Philosophy 5 (1989), 135–253.
Ng, Y.-K., ‘‘Towards Welfare Biology: Evolutionary Economics of Animal Consciousness and Suffering,’’ Biology and Philosophy 10 (1995), 255–285.
Ng, Y.-K., Efficiency, Equality, and Public Policy: With a Case for Higher Public
Spending (Macmillan, Basingstoke, Hampshire, 2000).
INTERGENERATIONAL IMPARTIALITY
257
Ng, Y.-K., ‘‘Sustainable Development: A Problem of Environmental Disruption
now instead of Intertemporal Ethics.’’ Sustainable Development 12 (2004), 150–
160.
Ng Y.-K. and Liu P. T, ‘‘Global Environmental Protection: Solving the International Public Goods Problem by Empowering the United Nations through
cooperation with WTO,’’ International Journal of Global Environmental Issues 3(4)
(2003), 409–417.
Ng, Y.-K. and I. Wills, ‘‘Welfare Economics and Sustainable Development.’’ In
Knowledge for Sustainable Development – An Insight into the Encyclopedia of Life
Support Systems, (UNESCO Publishing/Eolss Publishers, Paris, France, Oxford,
UK, 2002), Vol. 3, 485–506.
Parks, Robert P., ‘‘An Impossibility Theorem for fixed Preferences: A Dictatorial
Bergson-Samuelson Welfare Function,’’ Review of Economic Studies 43 (1976),
447–450.
Peterson, E. W., ‘‘Time Preference, the Environment and the Interests of Future
Generations.’’ Journal of Agricultural and Environmental Ethics 6 (1993), 107–126.
Pigou, A. C., The Economics of Welfare (Macmillan, London, 1929).
Portney, P. R. and J. P. Weyant (eds.), Discounting and Intergenerational Equity
(Resources for the Future, Washington, DC, 1999).
Price, C., Time, Discounting and Value (Blackwell Publishers, Oxford, 1993).
Reiss, H (ed.), Kant: Political Writings (Cambridge University Press, Cambridge,
1970–1971).
Roberts, K., ‘‘Social Choice Theory: The Single-Profile and Multi-Profile
Approaches,’’ Review of Economic Studies 47 (1980), 441–450.
Segerberg, K., ‘‘A Neglected Family of Aggregation Problems in Ethics,’’ Noûs,
10(2) (1976), 221–244.
Sen, A. K, Collective Choice and Social Welfare (NorthHolland, Amsterdam, 1970).
Spash, C. L., ‘‘Economics, Ethics, and Long-Term Environmental Damages,’’
Environmental Ethics 15 (1993), 117–132.
World Commission on Environment and Development, Our Common Future (Oxford University Press, New York, 1987).
Ramsey, F. P., ‘‘A Mathematical Theory of Saving,’’ Economic Journal 38 (1928),
543–559.
Vallentyne, P. and S. Kagan, ‘‘Infinite Value and Finitely Additive Value Theory,’’
Journal of Philosophy XCIV (1997), 5–26.
Weizsacker, C. C. V. ‘‘Existence of Optimal Programs of Accumulation for an
Infinite Time Horizon,’’ Review of Economic Studies 32 (1965), 85–104.
Winkelmann, L. and R. Winkelmann, ‘‘Why Are The Unemployed so Unhappy?’’
Economica 65 (1998), 1–15.
Dept. of Economics
Monash University
Clayton, Melbourne, Australia 3800
Phone: 613-99052309
Fax: 613-99055476
E-mail: [email protected]