Single-Generation Estimates of Individual Fitness as Proxies for

vol. 163, no. 4
the american naturalist
april 2004
Single-Generation Estimates of Individual Fitness as Proxies for
Long-Term Genetic Contribution
Jon E. Brommer,1,* Lars Gustafsson,2 Hannu Pietiäinen,1 and Juha Merilä3
1. Bird Ecology Unit, Department of Ecology and Systematics, P.O.
Box 65 (Viikinkaari 1), University of Helsinki, FIN-00014
Helsinki, Finland;
2. Department of Animal Ecology, Evolutionary Biology Centre,
Norbyvägen 18d, SE-75236 Uppsala, Sweden;
3. Evolutionary Genetics Research Unit, Department of Ecology
and Systematics, P.O. Box 65 (Viikinkaari 1), University of
Helsinki, FIN-00014 Helsinki, Finland
Submitted February 14, 2003; Accepted September 12, 2003;
Electronically published April 19, 2004
abstract: Individual fitness is a central evolutionary concept, but
the question of how it should be defined in empirical studies of
natural selection remains contentious. Using founding cohorts from
long-term population studies of two species of individually marked
birds (collared flycatcher Ficedula albicollis and Ural owl Strix uralensis), we compared a rate-sensitive (lind) and a rate-insensitive (lifetime reproductive success [LRS]) estimate of individual fitness with
an estimate of long-term genetic fitness. The latter was calculated as
the number of gene copies present in the population after more than
two generations, as estimated by tracing genetic lineages and accounting for the fact that populations were not completely closed.
When counting fledglings, rate-insensitive estimates of individual
fitness correlated better than rate-sensitive estimates with estimated
long-term genetic contribution. When counting recruits, both classes
of estimates performed equally well. The results support the contention that simple, rate-insensitive measures of fitness, such as LRS,
provide a valid and good estimate of fitness in evolutionary studies
of natural populations.
Keywords: individual fitness, life history, evolution, natural selection,
reproductive timing, rate sensitivity.
Fitness is a concept pivotal to any evolutionary inference
(Endler 1986). However, fitness is not necessarily easily
defined (Stearns 1976), and its definition is, in fact, highly
context dependent (Metz et al. 1992; Mylius and Diek* Corresponding author; e-mail: [email protected].
Am. Nat. 2004. Vol. 163, pp. 505–517. 䉷 2004 by The University of Chicago.
0003-0147/2004/16304-30038$15.00. All rights reserved.
mann 1995). Nevertheless, most authors agree that fitness
equates to some measure of genetic contribution to future
generations (Endler 1986; Charlesworth 1994). One aspect
of fitness, which complicates its practical use, is that it can
be measured on different hierarchical levels. For example,
the distinction between individual- and population-level
fitness permeates the life-history literature (table 1; McGraw and Caswell 1996; Link et al. 2002).
Theoretical studies usually consider fitness on the population level or at least of a group of individuals (clones)
with a certain life history (Roff 1992; Stearns 1992). Indeed, the recent development of invasibility as the master
fitness concept (Metz et al. 1992; Rand et al. 1994; Geritz
et al. 1998; van Dooren and Metz 1998; reviewed in Brommer 2000) highlights the growth rate of a distinct group
of mutants within a resident population. Empirical studies,
on the other hand, use an individual-based estimate of
fitness (Lande and Arnold 1983; Endler 1986; Roff 1997;
Lynch and Walsh 1998). Unfortunately, individual- and
population-level fitness estimates of growth rate do not
necessarily measure the same properties; for example, the
average individual fitness of all individuals is not necessarily the population growth rate (Lenski and Service 1982;
Bennington and McGraw 1995; Murray 1997; Cooch et
al. 2002). Some authors have argued that estimates of individual fitness are fundamentally hampered in their description of fitness (a population parameter) because they
are based on a sample of size one (Lenski and Service
1982; Link et al. 2002).
Two broad classes of fitness estimates can be distinguished. Rate-sensitive fitness estimates take into account
both the number of offspring produced and the age of the
parent when these offspring were produced. Rate-insensitive estimates only represent the number of offspring
produced and ignore the timing of reproduction. Examples
of the latter include the population measure R0 and its
individual analog lifetime reproductive success (LRS; table
1). Rate-sensitive estimates of fitness are, for example, lpop
(or r) and its individual analogue lind (table 1). The intrinsic rate of increase lpop is the fitness measure of choice
in most theoretical studies (e.g., Stearns 1992). In contrast,
506
The American Naturalist
Table 1: Population estimates of fitness and their individual analogues
Name
Net reproductive ratio
Lifetime reproductive success
Intrinsic rate of increase
Age-discounted LRS
Description
Expected number of
same-sex offspring
Total number of offspring produced
Per time-unit increase
in number
Propensity of rate of
increase
Symbol
Level
Timing?
Example
R0
Population
No
Heesterbeek 2002
LRS
Individual
No
Clutton-Brock 1988
lpop (per)
Population
Yes
Stearns 1992
lind
Individual
Yes
McGraw and Caswell 1996
Note: Presented are short descriptions and the symbols for four commonly used estimates of fitness, the level on which these are used to describe
fitness, and whether they incorporate reproductive timing. Examples and discussion of their use can be found in the references given.
empirical studies usually use the rate-insensitive LRS (e.g.,
Roff 1997). Below, we explain the basis of individual fitness
estimates in more detail.
Although the rate-sensitive estimate of individual fitness
lind was introduced only recently, it has been used in several studies (reviewed in Brommer et al. 2002c). Because
lind incorporates the timing of reproduction, it ranks the
estimated fitness of individual life histories very differently
than the rate-insensitive LRS (Käär and Jokela 1998; Brommer et al. 2002c; fig. 1). Rate sensitivity will discount offspring production at later ages and thereby put a premium
on starting to reproduce early in life and make the relationship between LRS and lind curvilinear (e.g., fig. 1).
Hence, long-lived individuals with high LRS will typically
have a similar lind as individuals with a lower LRS. Especially when the fitness consequences of a differential
onset of reproduction are examined, lind will typically reveal strong selection for the earliest possible reproductive
age, whereas LRS will show little or no selection (e.g.,
McGraw and Caswell 1996; Brommer et al. 1998; Oli et
al. 2002).
The occurrence of conflicting patterns of selection, depending on which of the two different estimates of individual fitness is used, leads to the question of which fitness
estimate one should adhere to. Many authors have felt that
the rate-sensitive lind captures fitness significantly better
than the rate-insensitive estimate LRS (Käär and Jokela
1998; Korpelainen 2000; Cooch et al. 2002; Oli et al. 2002).
However, for an objective assessment of the accuracy of
any single-generation estimate of fitness, single-generation
estimates of fitness need to be compared with a more
complete estimate of individual fitness.
In this article, we use long-term data of marked individuals to explore the relationship between single-generation estimates of fitness (such as LRS and lind) and an
estimate of the long-term genetic contribution an individual makes to future generations. We use data on two
species with a markedly different life history: the collared
flycatcher Ficedula albicollis, a short-lived passerine, and
the Ural owl Strix uralensis, a long-lived bird of prey. By
tracing genetic lineages, we estimate the future number of
gene copies individuals in certain founding cohorts will
have left in the population after several generations. We
assume that our estimate of gene copies present in the
population after several generations describes an individual’s fitness better than estimates based only on a single
generation (such as LRS and lind). We then ask whether
rate-sensitive estimates of individual fitness are indeed better correlates of the estimated number of gene copies
found in future generations than rate-insensitive estimates
of fitness.
Material and Methods
Collared Flycatcher
Collared flycatchers were studied in the southern part of
the island of Gotland, Sweden. This article deals with data
collected during 1980–1999. Collared flycatchers are relatively short-lived, mainly monogamous small passerines
that in our study population have relatively short natal
and breeding dispersal distances (Pärt 1990; Pärt and Gustafsson 1991). The birds in the study population use nest
boxes provided in forest patches as breeding sites, and
most of the adults and their offspring have been individually marked and captured yearly since the study started.
We only used data from 12 central woodland plots and
discarded data from peripheral ones. Various experiments
have been performed in this population (e.g., Gustafsson
and Sutherland 1988). For more details on the general
methods and the dynamics of this population, see Gustafsson (1989) and Merilä and Sheldon (2000).
Ural Owl
From 1977 on, Ural owls were studied in a 1,500-km2 area
in southern Finland. The owls bred in boxes (n p 150–
180) that were 3–5 km apart. Ural owls are monogamous
Estimating Fitness 507
the incubation or early nestling period. All offspring were
ringed when they were 2–3 wk old. Unringed new females
were aged by their plumage characteristics as either 1, 2,
or 12 yr old (Pietiäinen and Kolunen 1986). This population has not been experimented with during the course
of this study. For more details on the methods and the
dynamics of this population, see Pietiäinen (1989) and
Brommer et al. (2002a).
Single-Generation Fitness Estimates and
Reproductive Timing
From the life-history data, we distilled several fitness measures. We distinguished between fledglings (offspring that
left the nest) and recruits (offspring that were recorded
breeding later in life). We only used local (within study
area) recruits because the capture probability of birds outside the study area is less efficient and is not standardized
between years. We only considered female recruits because
practically all females (but not all males) are trapped in
both species, genetic father⫺offspring relatedness is uncertain in the collared flycatcher (about 15% of offspring
are of extrapair origin; Sheldon and Ellegren 1999), and
paternal and maternal fitness cannot be separated in the
monogamous Ural owl, leading to pseudoreplication. Considering only the female part of a population is common
in life-history theory and is a valid simplification in case
the dynamics of both sexes do not differ (Stearns 1992).
A rate-sensitive estimate of fitness (lind) was calculated
as the unique positive real root in the equation
1p
冘
x
Figure 1: Estimates of lind plotted against estimates of lifetime reproductive success (LRS) for 1,414 collared flycatcher females. Census was
based either on fledglings (A, lind(fp) vs. lifetime fledgling production
[LFP]) or on recruits (B, lind(rp) vs. lifetime recruit production [LRP]).
The fitness values of females that started to breed as 1-yr-olds are indicated with an open dot, and the fitness of those that started to breed
at later ages are indicated with filled dots. Lifetime fledgling production
is calculated as half the total number of fledglings produced in a female’s
lifetime, and LRP is calculated as half the number of female recruits
produced in a lifetime. Note that measures of LRS and lind are equal for
values of 0 and 1 and that many points may overlap for these fitness
values.
site-tenacious birds with a low divorce rate: they basically
live and breed in the same territory their whole reproductive life (Saurola 1987). Practically all females were
trapped each year by netting them from the box during
1 ⫺x
fl ,
2 x ind
(1)
where x is the age of the individual considered and fx the
production of offspring (McGraw and Caswell 1996). Offspring production is scaled by the parent-offspring relatedness of 1/2 (McGraw and Caswell 1996). Again, we calculated estimates of lind based on counting offspring fx as
fledglings (lind(fp)), and on recruits (lind(rp)). In practice,
equation (1) is most easily solved by using matrix algebra
and an appropriate software package. The value lind is the
dominant eigenvalue of a matrix that has for each age x
the fx /2 values in its first row and 1 in its subdiagonal
(McGraw and Caswell 1996).
Measures of LRS were counted as half the total sum of
fledglings (lifetime fledgling production [LFP]) or half the
sum of recruits (lifetime recruit production [LRP]) that
an individual produced during its entire breeding career.
We followed the above outlined practice for calculating
lind and also incorporated the factor 1/2 in calculating
measures of LRS. In this way, when two offspring (either
fledglings or female recruits) are produced, LRS p
508
The American Naturalist
l ind p 1 (e.g., fig. 1). Note that this way of calculating
measures of LRS differs from what is usual in the literature
but that scaling offspring production in this manner does
not affect the proportional relationship between the performance of two different life histories for a linear measure
such as LRS, whereas lind is highly sensitive to such scaling
(Brommer et al. 2002c).
Long-Term Fitness
We calculated a third long-term estimate of fitness by following lineages of descendants. This estimate is based on
the concept of future descendants (Houston and McNamara 1999) and estimates the number of gene copies of
a focal individual y years after it hatched. Long-term fitness
wg(y) is based on the sum of all descendants (including
the focal individual itself) weighted by their relatedness of
1/2 to the focal individual. Hence,
冘()
G
wg(y) p
1 G
D ,
2 y
(2)
where the factor (1/2)G denotes the relatedness (for diploids) between the focal individual and a descendant of
the Gth generation (self: G p 0, relatedness p 1; F1 offspring: G p 1, relatedness p 1/2; F2 offspring: G p 2,
relatedness p 1/4; etc.) and DyG the number of descendants in the Gth generation alive in year y. Again, when
counting the number of descendants we only considered
females for the reasons outlined above.
Estimating long-term fitness requires taking into account uncertainty due to dispersal. The study populations
are not closed, and a certain fraction of F1 offspring disperses away and may recruit outside the study population
in each season. Their descendants cannot be identified.
Hence, the number of observed descendants of a focal
individual in the study population continually diminishes
as time proceeds, although the actual number of descendants could (in principle) be stable or even increasing
during the given time period. Because the uncertainty in
the number of descendants builds up over time, the explanatory power of the observed number of descendants
will become increasingly low. We incorporated this uncertainty by using a simulation approach in three steps.
First, for each reproductive event, we assumed there was
a probability d of F1 offspring to successfully disperse outside the study population and recruit there. We estimated
d by iteration. We calculated the cohort-specific rate of
population increase lcoh by solving the Euler-Lotka equation 1 p 冘 (1 ⫹ d) Fx L x l⫺x
coh, where Fx is the average number of local female recruits produced by a female of age
x and Lx the proportion of females in the cohort still alive
at age x. In calculating population growth rate, we only
considered females as usual in life-history theory (Stearns
1992). We assumed that the long-term growth rate of the
population (i.e., the harmonic mean of the lcoh’s including
dispersal over all the study years) would be 1 (e.g., Tuljapurkar 1989) and chose parameter d accordingly. Second,
outside-recruitment probability d was assumed to be a
random binomially distributed probability per brood produced. Hence, for each reproductive event, there were a
number (0, 1, 2, 3, …) of simulated outside recruits. To
any such “invisible” individual we randomly assigned the
life history (i.e., age-specific production of fledglings and
recruits) of an individual of the same birth cohort from
our data set. In the same fashion, random life histories
were drawn for all remaining putative descendants. Hence,
we make the strong assumption that the environment outside the study area allows for similar reproductive output
as inside the study area. Third, step 2 was repeated 250
times. For each run, we calculated the estimated number
of descendants as wg(y) p 冘 (1/2)G DyG ⫹ 冘 (1/2)G E (DyG),
where the first term is as in equation (2), and the second
term, E (DyG), is the expected number of descendants of
generation G at yr y, based on the simulation approach
described above.
In the collared flycatcher study population, some individuals were subjected to an experimental alteration of
a life-history component at some point in their life. Because of carryover effects of such manipulations (Gustafsson and Sutherland 1988; Gustafsson and Pärt 1990),
we only used the life-history data for such an individual
until the year in which it was experimented with. The
remaining life history (i.e., age-specific production of recruits and parental survival) of such an experimental bird
was, starting with the year of the experiment, assumed to
follow the life history of a randomly drawn nonexperimental bird of equal age breeding in the same year.
In the Ural owl, we randomized the age distribution of
birds that could not be aged reliably (i.e., those that were
aged by plumage characteristics as being 3 yr or older when
they first bred). We assumed that their ages followed the
average distribution of age at first breeding of ringed birds
that started to breed at 3 yr or older (62% at 3 yr; 21%
at 4 yr; 10% at 5 yr; 7% at 6 yr; Brommer et al. 1998).
We included this uncertainty by randomly assigning these
unreliably aged birds a certain age at first breeding for
each randomization round. Because of age discounting,
rate-sensitive individual fitness is rather insensitive to
changes in the age at first breeding for birds that started
to breed as 3-yr-olds or older.
Data and Analysis
Because we are interested in counting descendants over a
multigeneration time span, we focused on individuals in
Estimating Fitness 509
early (founding) cohorts of the population. We considered
females, which had not been subjected to any experiment
and were unrelated, from all cohorts (i.e., individuals
hatched) before 1988 in the Ural owl and from 1980 to
1989 for the collared flycatcher. We calculated all fitness
estimates described above for these individuals. We considered single-generation estimates of individual fitness
(LRS, lind) for each life-history stage (fledgling, recruit)
separately. Single-generation estimates were correlated
with the estimated number of gene copies left after y years
(y p 1, 2, 3, …). We used Spearman rank correlations
with significance corrected according to the number of
years tested (for k years, a p 0.05/k) and sample size set
by the number of individuals considered. We calculated a
correlation for each run separately and present box plots
in correlation diagrams. To aid in comparing results for
the two different life histories of the species used, we considered their generation time. Generation time was calculated as the average age of females producing a clutch
(Stearns 1992; Charlesworth 1994) during the whole study
period. The sensitivity of our results to uncertainty in our
dispersal parameter d was addressed by a sensitivity analysis. Uncertainty in d may be considered as testing the
sensitivity of our results to violations of our assumptions
that the total population is stable over time or that the
performance of recruits outside the study area is the same.
Results
The consequences of incorporating rate sensitivity into
estimates of individual fitness are readily apparent. In the
collared flycatcher, most females (78%, 1,103/1,414)
started to breed as 1-yr-olds. Such an early start gives high
estimates of lind, especially when fledglings are considered
(open vs. closed dots in fig. 1A). A similar fitness was
rarely achieved by individuals that postponed for at least
1 additional year, although early and late first breeders did
not differ in their total production of fledglings (fig. 1A;
t p 1.0, df p 1, 412, P p .3). In terms of recruits, ratesensitive (lind(rp)) and rate-insensitive (LRP) estimates of
individual fitness were highly correlated (fig. 1B). In general, lind will increase only modestly after a certain number
of offspring have been produced (fig. 1A; cf. Brommer et
al. 2002c for data on Ural owl). There was a clear difference
in the life histories of the two species of bird (table 2).
The collared flycatchers started to breed earlier, had a
shorter generation time, and produced fewer fledglings
(but more recruits) during their lives than Ural owls (table
2).
We estimated the long-term fitness of founding females
and simulated to account for uncertainty in our fitness
estimates. In the Ural owl data set, the age at first breeding
was randomized for 62/176 females that started to breed
Table 2: Descriptive data (mean Ⳳ SE) comparing the life
histories of the collared flycatcher and the Ural owl
Trait
Sample size
Generation time
Age at first production
LFP
lind(fp)
LRP
lind(rp)
Collared flycatcher
1,414
1.8 Ⳳ .02
1.3 Ⳳ .01
2.8 Ⳳ .06
1.9 Ⳳ .03
.15 Ⳳ .008
.17 Ⳳ .009
Ural owl
176
6.4 Ⳳ
3.5 Ⳳ
4.3 Ⳳ
1.2 Ⳳ
.10 Ⳳ
.09 Ⳳ
.1
.1
.3
.03
.02
.02
Note: Generation time is the average age of reproducing females during
the study period. Fitness estimates include half the sum of offspring
produced (counted either as fledglings—lifetime fledgling production
[LFP]—or as female recruits—lifetime recruit production [LRP]) and
age-discounted production of offspring (fledglings: lind(fp); female recruits: lind(rp)). See text for details on how these measures are defined.
at age 12 yr but could not be reliably aged. In the flycatcher
data set, offspring that were subjected to an experimental
alteration of their life history were assigned a random life
history. For both species, the results are qualitatively the
same if these individuals are fully omitted from the analysis. Measures of LRS based on the production of fledglings
proved to be good correlates of long-term genetic contribution to the population in both species (fig. 2A, 2C). The
rate-sensitive estimates of individual fitness lind generally
performed less well. In the short-lived collared flycatcher,
LFP was a good correlate of long-term fitness for 4–6 yr
(i.e., two to three times the generation time) but became
insignificant in the long run (fig. 2A). Similarly, in the
long-lived Ural owl, LFP remained a good correlate of
long-term genetic contribution for up to 15 yr (i.e., about
two times the generation time of this species). For both
life histories, fitness estimates based on recruits proved to
be better correlates of long-term genetic contribution than
fitness estimates based on fledglings, especially in the long
run (fig. 2). The LRP and lind based on recruits are highly
correlated because these two estimates of individual fitness
are identical for most parents (LRP and lind(rp) are equal
for values of 1 and 0; fig. 1B). In the collared flycatcher,
lind(rp) tended to correlate slightly better with our estimate
of long-term genetic contribution than LRP after about
two generations (3–5 yr), but there were no differences in
the strength of correlations of LRP and lind(rp) with longterm fitness in either species after more than two generations (fig. 2B, 2C).
One reason that rate-sensitive estimates of fitness based
on fledglings correlated less well with long-term fitness
than rate-insensitive estimates was due to the fact that the
rate-sensitive estimate discounted reproduction later in the
life of the parent. The correlation between the two fitness
estimates and long-term fitness after about two generations (4 yr for the flycatcher and 12 yr for the Ural owl)
510
The American Naturalist
Figure 2: Correlogram of long-term fitness and measures of lifetime reproductive success and lind for collared flycatcher (CF, panels A and B) and
Ural owl (UO, panels C and D) data without dispersal. Data are presented separately for fitness estimates calculated on the basis of fledglings (A,
C) and recruits (B, D). Box plots indicate the median, quartiles, and spread of 250 simulations. In the Ural owl data set, the age at first breeding
was randomized for 62/176 females that started to breed at age 12 yr but could not be reliably aged. In the flycatcher data set, offspring that were
subjected to an experimental alteration of their life history were assigned a random life history. Filled boxes indicate correlations for the rate
insensitive fitness estimate (lifetime fledgling production, lifetime recruit production); open boxes indicate the rate-sensitive fitness estimates (lind).
The dashed line indicates the Bonferroni-corrected significance threshold of these correlations.
showed a clear contrast (fig. 3). The difference in longterm fitness between females with high LFP and those with
lower LFP produced a good correlation between these two
traits (fig. 3A, 3C). However, because of age discounting,
females with a high LFP had a similar lind(fp) (cf. fig. 1A).
As a consequence the correlation between lind(fp) and
long-term fitness was lower, especially for the long-lived
Ural owl (fig. 3C, 3D).
The decrease in correlations between the estimates of
individual fitness and long-term fitness as time goes by
Estimating Fitness 511
Figure 3: Correlation between long-term fitness (number of gene copies after approximately two generations) and two single-generation estimates
of individual fitness for the collared flycatcher (A, B) and the Ural owl (C, D). The number of gene copies was estimated for the flycatcher after 4
yr and for the Ural owl after 12 yr. Data were chosen from the randomized set so as to have the average correlation (see fig. 2). This estimate of
long-term fitness was plotted against lifetime fledgling production (panels A and C) and lind based on fledglings (lind(fp); panels B, D).
(fig. 2) stems from the loss of genetic lineages. Loss of a
genetic lineage is inevitable because we only considered
the female part of the breeding population, but it is also
partly due to dispersal outside the study area. The flycatcher pedigree allowed following some lineages for up
to seven generations (i.e., F6 offspring), and the Ural owl
pedigree allowed for lineages of three generations. Nevertheless, few maternal lineages remained in the popula-
tion after three times the generation time. Hence, correlations decreased, as there was little contrast across
individuals in their long-term fitness. We incorporated an
estimate of the individuals lost through dispersal in order
to check the validity of our correlations of estimates of
individual fitness and genetic contributions on a longer
timescale (i.e., larger than twice the generation time). Results show that when dispersal was incorporated, estimates
512
The American Naturalist
Figure 4: Correlogram of long-term fitness and measures of lifetime reproductive success and lind for collared flycatcher (CF, panels A and B) and
Ural owl (UO, panels C and D) including hypothetical dispersal outside the study area. Data are presented separately for fitness estimates calculated
on the basis of fledglings (A, C) and recruits (B, D). Symbols and dashed line as in figure 2. Simulation included the hypothetical life histories of
a fraction d of F1 offspring assumed to successfully recruit outside the study area. For Ural owls, d p 0.1 , and for flycatchers, d p 0.125 . In addition,
the randomizations described in the caption of figure 2 are performed.
based on LRS still performed equally well (in case of recruits; fig. 4B, 4D) or better (in case of fledglings; fig. 4A,
4C) than rate-sensitive estimates of individual fitness. Especially for the long-lived Ural owl, the differences between
LFP and lind remained substantial in the long term (fig.
4C). In both species, there was no difference in the strength
of correlation of LRP and lind(rp) with our estimate of
long-term fitness at any timescale once dispersal was incorporated (fig. 4B, 4D).
The differences between the correlations of the two types
of fitness estimates and our estimate of long-term fitness
were robust to variation in the probability d to recruit
outside the study area (fig. 5). The correlations of lind
based on fledglings and LFP with estimated long-term fit-
Estimating Fitness 513
ness increased in unison with increasing values of d.
Hence, conclusions appear robust to violations of our assumptions underlying the simulation of dispersal (equal
performance inside and outside the study area, stable population growth), which would be reflected in either underor overestimation of the dispersal parameter d. When dispersal was incorporated, estimates of individual fitness
based on recruits tended to correlate less with long-term
fitness than estimates based on fledglings (fig. 4), and the
strength of this correlation decreased with increasing values of d (fig. 5). Closer inspection of the correlations
showed that this reduction was due to individuals without
local recruits having (simulated) descendants. Hence, this
result was a direct consequence of simulating outside recruitment as a binomial function of brood size and showed
the potential error in assigning any individual a fitness
value of 0.
Discussion
Fitness is typically thought to be related to the rate of
spread of a gene in the population (Charlesworth 1994).
We have estimated an individual’s genetic contribution to
future generations spanning three generations in the Ural
owl and seven generations in the collared flycatcher. We
have compared a rate-insensitive (LRS) and a rate-sensitive
(lind) estimate of individual fitness based on a census of
fledglings and of recruits with this long-term estimate of
individual fitness. Our aim was to provide an empirical
test for how well an individual’s genetic contribution to
future generations can be captured by single-generation
estimates based on data on the production of F1 descendants and what the importance of reproductive timing is
for estimating individual fitness. For both the short-lived
(collared flycatcher) and the long-lived (Ural owl) life histories, our results show that single-generation estimates of
fitness capture the long-term genetic contribution to the
population. Further, when counting fledglings, rate-insensitive estimates of individual fitness correlate better than
rate-sensitive estimates with estimated long-term genetic
contribution. When counting recruits, both classes of estimates perform equally well. Hence, we find little evidence
that rate-sensitivity as incorporated by lind significantly
improves estimates of individual fitness.
Although all estimates of fitness are based on the production of offspring, studies differ substantially in the time
at which offspring are counted. Measures of LRS have been
calculated by counting offspring that are still dependent
on their parents (Clutton-Brock 1988; Newton 1989), independent offspring (Käär and Jokela 1998; Gaillard et al.
2000; Korpelainen 2000; Kruuk et al. 2000), or breeding
offspring (Clutton-Brock 1988; Merilä and Sheldon 2000).
Our results support the intuitive notion that estimates of
Figure 5: Sensitivity analysis for dispersal parameter d. The parameter
d was varied by 30% with steps of 5% from its original value (see legend
to fig. 4). Average correlation with long-term fitness after about two
generations (4 yr for the collared flycatcher and 12 yr for the Ural owl)
is indicated for lifetime fledgling production (filled circle), lind (filled
square), and lifetime recruit production (open circle). Confidence intervals
(95%) are based on 100 simulation runs each.
LRS based on offspring that are recorded breeding (recruits) correlate better with estimated genetic contribution
to the population than estimates based on dependent offspring (fledglings). However, recruitment is notoriously
difficult to measure, and recruitment outside the study
area is one reason that our estimate of the number of
descendants alive in the local population decreases over
time. We have here captured some uncertainty in recruit-
514
The American Naturalist
ment due to dispersal. In doing so, we have made the
assumption that the reproductive output of individuals
that have recruited outside the study area is equal to those
that stayed. This assumption probably does not hold: collared flycatchers that move away from their natal site have
reduced breeding performance (Pärt and Gustafsson
1991), and Ural owls that immigrate from outside the
study area have a poorer lifetime performance than local
individuals (H. Pietiäinen, unpublished data). This would
mean that we have underestimated losses due to dispersal.
Nevertheless, our sensitivity analysis shows that the difference in the correlations of measures of LRS and measures of lind with (dispersal-corrected) long-term fitness
is qualitatively not affected by the extent of dispersal.
Our simulations have by no means captured all possible
uncertainty. In particular, demographic stochasticity in the
sex ratio of offspring creates error. Long-term fitness is
based on females only, and single-generation estimates
should therefore also be based on females only. In terms
of fledglings, we can only consider both sexes. Although
sex ratios in birds do not deviate much from 50% (Sheldon
1998), stochastic differences across individuals in the proportion of daughters produced per lifetime will lead to
error in the correlation of single-generation and our longterm estimates of fitness. Significantly, LFP and lind based
on fledglings are not equally biased by demographic stochasticity in the sex ratio of fledglings; lind is less sensitive
than LFP to changes over most of the range of lifetime
production (e.g., fig. 1; Brommer et al. 2002a). Of all
offspring produced, chance events may also make it so
that only sons are recruited. Because we only consider
maternal lineages, male-only recruitment at some point
along a lineage would lead to the erroneous conclusion
that the genetic lineage goes extinct. A diploid individual
will have genetically replaced itself in the population if it
recruits two offspring during its lifetime. Hence, not incorporating both sexes adds to the disappearance of our
genetic lineages. Other uncertainty in our long-term estimate of fitness includes the way we incorporated dispersal
in a uniform way. Estimates of lifetime recruitment have
been shown to be biased with respect to the spatial position
of the site of birth within the study area because loss rates
are higher at the edge than in the core (Lambrechts et al.
1999).
Interpreting our result in terms of how fitness should
be quantified depends on the accuracy of our estimates of
individual fitness. The most critical requirement for an
estimate of individual fitness is to separate the genetic
signal from the noise. Environmental and demographic
stochasticity form two aspects of noise that affect both our
single-generation and long-term estimates of individual
fitness. The profound effect of stochasticity on estimates
of individual fitness is evident from studies that have
shown that the low heritability of estimates of individual
fitness is caused by high environmental or nonadditive
variance and not by low additive genetic variance (e.g.,
Kruuk et al. 2000; Merilä and Sheldon 2000). Given such
demographic and environmental noise, it is encouraging
that we can detect a correlation of single-generation estimates of fitness with our estimate of long-term fitness
(r is about 0.3–0.6 after two to three generations depending
on when the offspring is counted). Nevertheless, this correlation may be lower in other populations or species than
the ones used in this article. In a comparison with 38
populations of 18 species of birds, the two bird populations
used in this article were relatively stable (they had a high
estimated time to extinction), mainly due to their relatively
high growth rate (a function of local recruitment and life
span) and relatively low environmental stochasticity (B.
Saether et al., unpublished manuscript). In addition, demographic stochasticity in the Ural owl population was
lowest of all 38 populations (but about average for the
collared flycatcher population). In general, stochastic effects therefore seem to play a more important role in other
bird populations than the populations used in this article,
and the correlation of individual fitness and long-term
performance may therefore be even less in other populations than what we have recorded here.
Benton and Grant (2000) showed in a theoretical study
that rate-sensitive population estimates of fitness performed worse than rate-insensitive estimates in populations that fluctuated around equilibrium (due to either
overcompensatory density dependence or environmental
stochasticity). Such population dynamics are probably
common in natural populations. Hence, for natural scenarios, rate sensitivity does not seem to improve estimates
of either population-level fitness (Benton and Grant 2000)
or individual-level fitness (this article). These results stand
in strong contrast to the predominant view in the current
literature where rate-sensitive estimates of individual fitness are considered better estimates than rate-insensitive
estimates independent of when offspring are counted
(Käär and Jokela 1998; Korpelainen 2000; Cooch et al.
2002; Oli et al. 2002). We see three main reasons why
sensitivity to reproductive timing may not increase the
accuracy of individual fitness estimates.
First, the fitness payoff from offspring produced late in
life is too strongly de-emphasized by rate sensitivity. Offspring of both first and later clutches contribute equally
to LRS, but rate sensitivity discounts reproduction by the
age at which an offspring (either fledgling or recruit) was
produced. Large values of LRS do not translate into large
values of lind. Lifetime reproductive success and lind have
a curvilinear relationship; individuals with an LRS above
a certain critical value differ little in their lind (Käär and
Jokela 1998; Brommer et al. 2002c; this article). Our results
Estimating Fitness 515
show that individuals that contribute most in the long
term have high values of LFP. However, lind based on the
production of fledglings insufficiently contrasted these
successful individuals. In fact, these successful individuals
are the long-lived ones, as LFP is largely determined by
life span (Clutton-Brock 1988; Newton 1989; Ural owl:
Brommer et al. 1998; collared flycatcher: Gustafsson 1989).
Our result thus reflects the classic notion that the optimal
life-history strategy is to live long and produce as many
offspring as possible (Law 1979). Notably, this notion is
not supported by rate-sensitive estimates that discount
reproductive output by parental age, as individual fitness
then rapidly converges to a semiasymptotic value (fig. 1;
Käär and Jokela 1998; Brommer et al. 2002c). Hence, additional reproduction at later ages erroneously contributes
little to the value of age-discounting fitness estimates.
Second, in addition to the sheer number of offspring
produced, offspring may vary in their (reproductive) value.
The inability to accurately incorporate such variation in
offspring value hampers all single-generation estimates of
individual fitness. Part of an offspring’s value includes
genetic effects (e.g., Sheldon et al. 1997 for evidence of
genetically determined quality differences among collared
flycatcher offspring), although it will also reflect environmental conditions. For example, in birds, within-season
variation in the timing of laying affects the value of offspring produced (Ural owl: Brommer et al. 2002b; collared
flycatcher: Sheldon et al. 2003). In addition, among-year
variation in environmental conditions generates a cohort
effect (e.g., Lindström 1999), which will create pronounced
differences in performance across offspring produced in
different reproductive seasons during the life of a parent.
In contrast to single-generation estimates of individual
fitness, our estimate of long-term genetic contribution
does include offspring value because it incorporates not
only the performance of the parent but also the performance of its offspring and all further descendants. In fact,
in theoretical studies reproductive value is measured by
tracing the number of descendants after several generations (Houston and McNamara 1999). Rate sensitivity’s
emphasis on reproduction early in life may produce severe
errors if offspring value increases as a function of parental
age. Typically, a parent will gain advantages during its
lifetime (e.g., in terms of experience [e.g., Nol and Smith
1987] or a better territory [Newton 1989]) that may be
transferred to its offspring. Parents may also increase their
reproductive effort with age because their residual reproductive value is reduced (terminal reproductive effort, e.g.,
Schaffer 1974; see Gustafsson and Pärt 1990 for collared
flycatcher). Offspring produced early in the life of the
parent (especially in the first breeding attempt) may therefore be of poorer value than offspring produced later. Incorporating timing by weighing offspring production with
parental age (as lind does) may thus both overestimate the
fitness contribution of early reproduction and underestimate the fitness contributions of later broods. Lifetime
reproductive success also does not consider variation in
offspring value, but measures of LRS may still provide a
better fitness estimate than lind because rate insensitivity
will buffer it against erroneously inflating the fitness contributions of offspring of different value produced during
the life of the parent.
Third, population dynamical feedback is largely ignored.
Only in a growing population is reproduction early in life
favored by selection, while delaying reproduction to later
in life is favored in a shrinking population (Stearns 1992).
Populations will typically fluctuate, alternatively increasing
and decreasing in size. Hence, reproductive timing will
certainly be an important selective force at times, but its
magnitude (and sign) may change on a shorter timescale
than the life span of an individual parent. Fluctuations in
population dynamics shorter than a generation may mean
that the importance of reproductive timing is unlikely to
be captured as a function of age alone (as lind assumes).
Measures of LRS or lind based on either fledglings or recruits do not incorporate such population dynamical feedback. Lifetime reproductive success may provide a better
estimate of individual fitness, possibly because its rate insensitivity will make it less sensitive when fitness is averaged out over several periods of change in population
size that occur during the life of a parent. In this respect,
further work on estimating individual fitness is needed,
and an individual-based theoretical exploration of the difference in predictive power of the various estimates of
individual fitness that we used in this article will be
interesting.
In conclusion, we have shown that by using long-term
individual-based data, one can trace genetic lineages in
order to gain a more objective insight into the predictive
power of proxy estimates of fitness. Such proxy estimates
are essential for quantification of individual performance
in a population. Contrasting the performance of individuals is an intrinsic element in evolutionary biology and in
applied ecological problems such as conservation biology.
Here, we have addressed a classic dichotomy in the measurement of fitness and focused on comparing two broad
classes of individual fitness, rate sensitive and rate insensitive. We find that the latter often performs equally well
or better than the former, depending on when in the organism’s life cycle offspring are counted. Further studies
utilizing species with different life histories are, of course,
needed to verify the generality of our results and
conclusions.
516
The American Naturalist
Acknowledgments
We thank J. M. McNamara (November 23, 2001) and T.
Benton for providing food for thought. Comments by two
anonymous reviewers substantially improved the quality
of this article. We are very grateful to all those who have
over the years helped out in the field. Our research was
supported by the Swedish Natural Science Research Council (L.G., J.M.), the Academy of Finland (J.E.B., J.M., H.P.),
and the Finnish Cultural Foundation (J.E.B.).
Literature Cited
Bennington, C. C., and J. B McGraw. 1995. Phenotypic
selection in an artificial population of Impatiens pallida:
the importance of the invisible fraction. Evolution 49:
317–324.
Benton, T. G., and A. Grant. 2000. Evolutionary fitness in
ecology: comparing measures of fitness in stochastic,
density-dependent environments. Evolutionary Ecology
Research 2:769–789.
Brommer, J. E. 2000. The evolution of fitness in life-history
theory. Biological Reviews of the Cambridge Philosophical Society 75:377–404.
Brommer, J. E., H. Pietiäinen, and H. Kolunen. 1998. The
effect of age at first breeding on Ural owl lifetime reproductive success and fitness under cyclic food conditions. Journal of Animal Ecology 67:359–369.
Brommer, J. E., H. Pietiäinen, and H. Kokko. 2002a. Cyclic
variation in seasonal recruitment and the evolution of
the seasonal decline in Ural owl clutch size. Proceedings
of the Royal Society of London B 269:647–654.
Brommer, J. E., H. Pietäinen, and H. Kolunen. 2002b.
Reproduction and survival in a variable environment:
Ural owls (Strix uralensis) and the three-year vole cycle.
Auk 119:194–201.
Brommer, J. E., J. Merilä, and H. Kokko. 2002c. Reproductive timing and individual fitness. Ecology Letters 5:
802–810.
Charlesworth, B. 1994. Evolution in age-structured populations. Cambridge University Press, Cambridge.
Clutton-Brock, T. H., ed. 1988. Reproductive success. University of Chicago Press, Chicago.
Cooch, E. G., E. Cam, and W. Link. 2002. Occam’s shadow:
levels of analysis in evolutionary ecology—where to
next? Journal of Applied Statistics 29:19–48.
Endler, J. A. 1986. Natural selection in the wild. Princeton
University Press, Princeton, N.J.
Gaillard, J. M., M. Festa-Bianchet, D. Delorme, and J.
Jorgenson. 2000. Body mass and individual fitness in
female ungulates: bigger is not always better. Proceedings of the Royal Society of London B 267:471–477.
Geritz, S. A. H., É. Kisdi, G. Meszéna, and J. A. J. Metz.
1998. Evolutionary singular strategies and the adaptive
growth and branching of the evolutionary tree. Evolutionary Ecology 12:35–57.
Gustafsson, L. 1989. Collared flycatcher. Pages 75–88 in I.
Newton, ed. Lifetime reproduction in birds. Academic
Press, London.
Gustafsson, L., and T. Pärt. 1990. Acceleration of senescence in the collared flycatcher Ficedula albicollis by reproductive costs. Nature 347:279–281.
Gustafsson, L., and W. J. Sutherland. 1988. The costs of
reproduction in the collared flycatcher Ficedula albicollis. Nature 335:813–815.
Heesterbeek, J. A. P. 2002. A brief history of R0 and a
recipe for its calculation. Acta Biotheoretica 50:189–204.
Houston, A. I., and J. M. McNamara. 1999. Models of
adaptive behaviour. Cambridge University Press,
Cambridge.
Käär, P., and J. Jokela. 1998. Natural selection on agespecific fertilities in human females: comparison of
individual-level fitness measures. Proceedings of the
Royal Society of London B 265:2415–2420.
Korpelainen, H. 2000. Fitness, reproduction and longevity
among European aristocratic and rural Finnish families
in the 1700s and 1800s. Proceedings of the Royal Society
of London B 267:1765–1770.
Kruuk, L. E. B., T. H. Clutton-Brock, J. Slate, J. M. Pemberton, S. Brotherstone, and F. E. Guinness. 2000. Heritability of fitness in a wild mammal population. Proceedings of the National Academy of Sciences of the
USA 97:698–703.
Lambrechts, M. M., J. Blondel, A. Caizergus, P. C. Dias,
R. Pradel, and D. W. Thomas. 1999. Will estimates of
lifetime recruitment of breeding offspring on small-scale
study plots help us to quantify processes underlying
adaptation? Oikos 86:147–151.
Lande, R., and S. J. Arnold. 1983. The measurement of
selection on correlated characters. Evolution 37:1210–
1226.
Law, R. 1979. Ecological determinants in the evolution of
life histories. Pages 81–103 in R. M. Anderson, B. D.
Turner, and L. R. Taylor, eds. Population dynamics.
Blackwell, Oxford.
Lenski, R. E., and P. M. Service. 1982. The statistical analysis of population growth rates calculated from schedules of survivorship and fecundity. Ecology 63:655–662.
Lindström, J. 1999. Early development and fitness in birds
and mammals. Trends in Ecology and Evolution 14:343–
348.
Link, W. G., E. G. Cooch, and E. Cam. 2002. Model-based
estimation of individual fitness. Journal of Applied Statistics 29:207–224
Lynch, M., and B. Walsh. 1998. The genetical analysis of
quantitative traits. Sinauer, Sunderland, Mass.
McGraw, J. B., and H. Caswell. 1996. Estimation of in-
Estimating Fitness 517
dividual fitness from life-history data. American Naturalist 147:47–64.
Merilä, J., and B. C. Sheldon. 2000. Lifetime reproductive
success and heritability in nature. American Naturalist
155:301–310.
Metz, J. A. J., R. M. Nisbet, and S. A. H. Geritz. 1992.
How should we define “fitness” for general ecological
scenarios? Trends in Ecology & Evolution 7:198–202.
Murray, B. G. 1997. Population dynamics of evolutionary
change: demographic parameters as indicators of fitness.
Theoretical Population Biology 51:180–184.
Mylius, S. D., and O. Diekmann. 1995. On evolutionary
stable life histories, optimization and the need to be
specific about density dependence. Oikos 74:218–224.
Newton, I., ed. 1989. Lifetime reproduction in birds. Academic Press, London.
Nol, E., and J. N. M. Smith. 1987. Effects of age and
breeding experience on seasonal reproductive success in
the song sparrow. Journal of Animal Ecology 56:301–
313.
Oli, M. K., G. R. Hepp, and R. A. Kennamer. 2002. Fitness
consequences of delayed maturity in female wood
ducks. Evolutionary Ecology Research 4:563–576.
Pärt, T. 1990. Natal dispersal in the collared flycatcher:
possible causes and reproductive consequences. Ornis
Scandinavica 21:83–88.
Pärt, T., and L. Gustafsson. 1991. Breeding dispersal in the
collared flycatcher (Ficedula albicollis): possible causes
and reproductive consequences. Journal of Animal Ecology 58:305–320.
Pietiäinen, H. 1989. Seasonal and individual variation in
the production of offspring in the Ural owl Strix uralensis. Journal of Animal Ecology 58:905–920.
Pietiäinen, H., and H. Kolunen. 1986. Age determination
of breeding Ural owls. Ornis Fennica 63:26–27.
Rand, D. A., H. B. Wilson, and J. M. McGlade. 1994.
Dynamics and evolution: evolutionarily stable attractors, invasion exponents and phenotypic dynamics.
Philosophical Transactions of the Royal Society of London B 343:261–283.
Roff, D. A. 1992. The evolution of life histories: theory
and analysis. Chapman & Hall, New York.
———. 1997. Evolutionary quantitative genetics. Chapman & Hall, New York.
Saurola, P. 1987. Mate and nest-site fidelity in Ural and
tawny owls. Pages 81–86 in R. W. Nero, R. J. Clark,
R. J. Knapton, and R. H. Hamre, eds. Biology and conservation of northern forest owls. Symposium proceedings. USDA Forest Service General Technical Report
RM-142, Winnipeg, Manitoba.
Schaffer, W. M. 1974. Selection for optimal life histories:
the effect of age structure. Ecology 55:291–303.
Sheldon, B. C. 1998. Recent studies of avian sex ratios.
Heredity 80:397–402.
Sheldon, B. C., and H. Ellegren. 1999. Sexual selection
resulting from extrapair paternity in collared flycatchers.
Animal Behaviour 57:285–298.
Sheldon, B. C., J. Merilä, A. Qvarnström, L. Gustafsson,
and H. Ellegren. 1997. Paternal genetic contribution to
offspring condition predicted by relative size of male
secondary sexual character. Proceedings of the Royal
Society of London B 264:297–302.
Sheldon, B. C., L. E. B. Kruuk, and J. Merilä. 2003. Natural
selection and inheritance of breeding time and clutch
size in the collared flycatcher. Evolution 57:406–420.
Stearns, S. C. 1976. Life history tactics: a review of the
ideas. Quarterly Review of Biology 51:3–47.
———. 1992. The evolution of life histories. Oxford University Press, Oxford.
Tuljapurkar, S. 1989. An uncertain life: demography in
random environments. Theoretical Population Biology
35:227–294.
van Dooren, T. J. M, and J. A. J. Metz. 1998. Delayed
maturation in temporally structured populations with
non-equilibrium dynamics. Journal of Evolutionary Biology 11:41–62.
Associate Editor: Ben C. Sheldon