A general expression for the reproductive value

Behavioral
Ecology
The official journal of the
ISBE
International Society for Behavioral Ecology
Behavioral Ecology (2016), 27(5), 1296–1303. doi:10.1093/beheco/arw044
Original Article
A general expression for the reproductive
value of information
Rebecca K. Pike,a John M. McNamara,b and Alasdair I. Houstona
aSchool of Biological Sciences, University of Bristol, Tyndall Avenue, Bristol BS8 1TQ, UK and bSchool
of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, UK
Received 22 September 2015; revised 2 March 2016; accepted 7 March 2016; Advance Access publication 1 April 2016.
Information transfer and utilization is ubiquitous in nature. Animals can increase their reproductive value by changing their behavior in
light of new information. Previous work has shown that the reproductive value of information can never be negative given an animal
behaves optimally. Statistical decision theory uses Bayes’ theorem as a mathematical tool to model how animals process information
gained from their environment. We use this technique with an optimality model to establish a new expression for the value of information when behavior is chosen from a continuous range of possibilities. Our expression highlights that the value of information is
proportional to the rate of change of behavior with information. We illustrate our approach using the cooperative behavior between a
male and a female raising their common young. We show that the value of knowing about one’s partner can be quantified and establish
the value of information to a member of the pair when the continuous trait is how long to spend caring for their young. However, the
applications of this expression are wider reaching than parental care decisions and can be used to analyze the behavior of individuals
across a variety of species and contexts.
Key words: Bayes’ theorem, flexibility, information, optimality model, parental care.
INTRODUCTION
Animals are constantly bombarded with cues and signals, and
throughout, we will refer to these cues and signals as observations.
We do this without prejudice that observations may or may not be
actively sought by the animal. When we refer to information, we
use this in an informal sense as the receipt of an observation.
An observation can be thought of as a random variable. If there
is no variance in possible observations, then an animal would not
need to take an observation; the animal would know the information they were about to receive. Before an observation is made, how
valuable is the observation?
Stephens (1989) emphasizes to biologists that an observation is
valuable to the individual if it has the possibility of changing the
behavior of the individual. The value of an observation is not
determined by variance in the possible states of the environment
but rather by the variance in behavior as a result of making the
observation (Gould 1974).
Animals are known to make informed decisions based on the
observations of their environment (Maynard Smith and Harper
1995, 2003), and many organisms show remarkable flexibility in
life-history traits on receipt of some observation that correlates
with their environment.
Address correspondence to R.K. Pike. E-mail: [email protected].
© The Author 2016. Published by Oxford University Press on behalf of
the International Society for Behavioral Ecology. All rights reserved. For
permissions, please e-mail: [email protected]
For instance, Daphnia pulex switch from asexual to sexual reproduction depending on population density (Berg et al. 2001).
Empirical studies (Collins et al. 1994; Luttbeg 1996) and theoretical studies (Getty 1996; Mazalov et al. 1996; Collins et al. 2006)
of female mating tactics suggest that assessing potential mates can
improve estimates of the quality distribution and an individual
male’s quality, improving the female’s choice of whether she should
accept or reject him as a mate. In each example, the individual
changes its behavior in response to making an observation.
Animals gather imperfect information to update their opinion about the true state of nature, a process that can be modeled using the quantitative framework of statistical decision
theory (McNamara and Houston 1980; Dall et al. 2005). At the
heart of the approach is Bayes’ theorem that provides a method
of determining probabilities based on observations. A Bayesian
animal behaves as if it has prior knowledge of the state of the
world (from personal experience or genetic information), which
it uses coupled with a new observation to arrive at a revised,
posterior opinion concerning the state of the world. We are not
suggesting animals consciously complete these calculations or
reach their optimal strategy in this way. Instead, observed behavior may approximate theoretically derived optimal solutions. We
use statistical decision theory as a framework to provide a mathematically rigorous way to model the information gain in animals
(McNamara and Houston 1980).
Pike et al. • The value of information
1297
Behavior can be chosen from a continuous (e.g., how long to
spend being vigilant as opposed to foraging) or discrete (e.g., accept
or reject a mate) range of possibilities. In our model, we make the
assumption that an individual can choose from a continuous range
of behaviors. We argue that there are many cases in nature where
it is more realistic to view behavior as a continuous rather than
discrete trait. Vampire bats, Desmodus rotundus, vary the quantity of
a blood meal shared depending on the degree of relatedness and
expected future reciprocation (Wilkinson 1984). Males and females
in various species of birds adjust the amount of time devoted to
incubation or the amount of food brought to nestlings in response
to the efforts of their mates (Nice 1937; Kendeigh 1952; Skutch
1976; Harrison et al. 2009). Allowing varying degrees of cooperation has been an important assumption in the theoretical study of
the evolution of cooperation (Killingback et al. 1999; Killingback
and Doebeli 2002; McNamara and Doodson 2015).
We derive a new expression to quantify the value of information. Following McNamara and Dall (2010), we define the value of
information as the average increase of an individual’s reproductive value if they behaved optimally after an observation compared
with before. They consider a discrete set of behavioral options and
thus the curve of the resulting payoff function contains a “kink.”
We consider behavior chosen from a continuous range of possibilities where the subsequent optimal payoff function is differentiable
everywhere and is thus smooth. A general expression for the value
of information is given for an animal in a world that can be in one
of 2 states. We then consider an animal in a world that can be in
any of n possible states. The value of gaining information about a
partner is explored with a case study involving a male and female
raising their young. The female must make the decision of how
much parental effort to exert given her information on the type of
male she is paired with.
MODEL
Consider a world that can be in one of 2 states. The world is in
state 0 with probability 1 − p and in state 1 with probability p.
Now consider an individual in this world with a continuous trait T.
We assume that the individual is uncertain of the true state of the
world but as a result of selection acting in the past or from personal
experience, the individual behaves as if it knows p (Dall et al. 2005;
McNamara et al. 2006). Let W0(T) and W1(T) denote the payoff to
an individual with trait T when the world is in state 0 and state 1,
respectively. The expected payoff W(T, p) to an individual depends
on the trait T and the prior p and is given by
W (T , p ) = (1 − p )W0 (T ) + pW1 (T ).
(1)
The optimal trait T*(p) that maximizes the expected payoff in
Equation 1 is a function of p and satisfies the derivative condition
∂W *
(T ( p ), p ) = 0.
∂T
(2)
(
(3)
The payoff to an individual that behaves optimally is then
)
W * ( p ) = W T * ( p ), p .
We assume the individual gains information about the true state of
nature from observation or otherwise. This information is coupled
with the individual’s prior knowledge to give an updated posterior
probability p̂ on the state of nature, which is a random variable
because the animal gains information by observing the random
variable X. Allowing X to take this form prevents the animal from
knowing what information the observation will give. If an animal
knew the information, they were about to receive they would not
need to take an observation. In our analysis below, we assume that
whatever the value of pˆ , the individual then behaves optimally
given this value.
We build on the result of McNamara and Dall (2010) who
describe the value of information as the average increase in reproductive value to the individual after gaining information about the
state of the world. This difference is
I (p )=E[W * ( pˆ ) ] − W * ( p ).
(4)
* ˆ
The first term on the right hand side, E[W ( p )], is the expected
payoff to the individual if they gain information by observation
before choosing their optimal action given this information. The
second term on the right hand side, W*(p), is the payoff given the
individual follows the optimal strategy given that the prior knowledge p is available when the individual makes its choice of action.
McNamara and Dall (2010) show that E[ pˆ ] = p ; that is, the
mean of the posterior probability that state 1 is the true state
equals the prior probability that state 1 is the true state. They
also show W* is a convex function of p. Because E[ pˆ ] = p and
due to the convexity of the optimal payoff function W*(p), then
E[W * ( pˆ )] ≥ W * ( p ). Therefore, I(p) is non-negative. We exploit the
fact that the trait is continuous to extend the result from Equation 4
*
by approximating the term E[W ( pˆ )] using the Taylor expansion
to give the equation
E[W * ( pˆ )] = E[W * ( p )] + E[( pˆ − p ) ⋅W * ′ ( p )]
1

+ E  ( pˆ − p )2W * ′′ ( p ) + o(( pˆ − p )2 )
2


(5)
where the last term on the right hand side of Equation 5 represents
terms of order ( pˆ − p )3 and above. The second term on the right
hand side in Equation 5 disappears because E[ pˆ ] = p. Because W*
is a constant, using routine laws of expectations, we can rewrite
Equation 5 as
1
E[W * ( pˆ )] = W * ( p ) + Var( pˆ ) ⋅W * ′′ ( p ),
(6)
2
disregarding terms of order 3 or higher. Therefore, by Equations 4
and 6, we have
1
I ( p ) = Var( pˆ ) ⋅W * ′′ ( p ).
(7)
2
Expression 7 represents the value of information for small variance.
As we can see, 2 terms are important in quantifying the value of
information, the variance of the posterior probability and the curvature of W*(p). The 2nd derivative of the optimal payoff function
can be calculated directly using Equations 1–3 as
W * ′′ ( p ) = W1′ (T * ( p )) − W0′ (T * ( p )) T * ′ ( p )


(8)
(see Appendix 1 for proof). Substituting Equation 8 into Equation
7, we have
1
I ( p ) = Var( pˆ ) W1′ (T * ( p )) − W0′ (T * ( p )) T * ′ ( p ).


2
(9)
Expression 9 is the new result that the value of information is the
product of the variance of the posterior, the difference between the
rate of change of the payoff with trait T for each state and the rate
of change of a trait with information. If the rate of change of T*(p)
is 0, then there is no change to behavior after a gain in information
and there is no value to information. Only when the term T*(p) is
nonzero, is there a change in behavior as a result of information
gain and therefore value to information.
Behavioral Ecology
1298
GENERAL CASE
The 2D model can be generalized to derive an expression for the
value of information when we consider a world that can be in one
of any n distinct states. Let pi be the probability that the true state
of the world is θi (where i = 1,…, n). We assume that an individual
who is uncertain about the true state of the world has the prior
n
knowledge vector p = ( p1 ,…, pn ) where ∑ i =1 pi = 1. Suppose the
individual gains information by observing the random variable X.
The conditional probability that the observed value of X is x given
the true state of the world is θi is denoted by fi(x). Averaging over all
possible values of the world, the unconditional probability that the
observed value of X is x is f ( x ) = p1 f1 ( x ) + ... + pn f n ( x ) . Applying
Bayes’ rule, if the observation is x, the posterior probability that the
true state of the world is θi is
p f (x )
(10)
qi ( x ) = i i
f (x )
and the updated knowledge vector is defined as
q( x ) = (q1 ( x ), …, q n ( x )).
(11)
Observing the random variable X is seen as determining the value
of the random knowledge vector Q = (Q 1 ,…,Q n ) where Q = q(X).
It can be shown that E(Q) = p where E {Q i } = pi (McNamara
and Dall 2010). That is, the mean of the posterior probability for
any possible state of the world θi being the true state of the world
equals the prior probability that it is the true state. We assume an
individual possessing a continuous trait T has reproductive value
Wi(T) when the world is in state θi. Following from Equation 1, the
payoff function of an individual with trait T and knowledge vector
p, representing the individual’s expected reproductive value, is
(12)
W (T , p ) = pW
1 1 (T ) +  + pnWn (T ).
T*(p) denotes the trait that maximizes payoff given that the knowledge vector is p. The expected reproductive value under this optimal trait is
W ( p ) =W (T ( p ), p ) = p1W1 (T ( p )) +… + pnWn (T ( p )). (13)
To proceed, for mathematical convenience, we do not restrict p to
a probability and so we do not assume that ∑ i pi necessarily sums
to 1. The results derived (Expression 21) will therefore hold for all p.
n
More specifically, they will also hold for the case where ∑ i =1 pi = 1 .
*
*
*
*
Given X is a random variable and W*(p) is a constant, E[W*(Q)]
can be approximated by the second order Taylor expansion of W
about p,
*
n ∂W
E[W * (Q )] = W * ( p ) + ∑ i =1
( p )E[(Q i − pi )]
∂pi
n
1 n
∂2W *
+ ∑ i =1 ∑ j =1
( p )E[(Q i − pi )(Q j − p j )] (14)
2
∂pi ∂p j
+o
(∑
n
i =1
∑
n
j =1
)
(Q i − pi )(Q j − p j ) .
Because E {Q i } = pi , then E [(Q i − pi )] = 0 . Thus, the second
term on the right hand side in Equation 14 is equal to 0. Also,
because this is true, we have
∑ ∑
n
n
i =0
j =0
E[(Q i − pi )(Q j − p j )]
= ∑ i = 0 ∑ j = 0 E (Q i − E{Q i })(Q j − E{Q j })
= Cov(Q i ,Q j ),
n
n
(15)
where Cov(Q i ,Q j ) is the covariance between the estimated probabilities of states of the world i and j. Thus, by Equations 14 and
15, we have
E[W * (Q )] = W * ( p ) +
n
∂2W *
1 n
Cov (Q i ,Q j )
( p ) (16)
∑
i =1 ∑ j =1
∂pi ∂p j
2
when terms of order higher than 3 are disregarded. Because the
value of information is the difference between E[W*(Q)] and W*(p)
from Equation 4, we have
n
∂2W *
1 n
I ( X ) = ∑ i =1 ∑ j =1 Cov (Q i ,Q j )
( p ).
(17)
∂pi ∂p j
2
Calculating the 2nd partial derivative in Expression 17 as
∂2W *
∂T *
( p ) = Wi ′ (T * ( p ))
( p)
∂pi ∂p j
∂p j
(18)
gives I(X) in the form
n
∂T *
1 n
I ( X ) = ∑ i =1 ∑ j =1 Cov (Q i ,Q j )Wi ′ (T * ( p ))
( p ).
(19)
∂p j
2
Expression 19 is the new result for the value of information when
the world can be in any one of n distinct states. As in Expression 9,
∂T *
( p ) in Expression 19 is nonzero is there a
only when the term
∂p j
change in behavior as a result of information gain and thus a value
to information. We can write Expression 19 schematically as
1
I ( X ) = W ′T CT * ′
2
(20)
where W ′T is the transpose of the vector W ′ = (W1′ ,…,Wn ′ ), C is
*
*
*
the covariance matrix and T * ′ is the vector T ′ = (T1 ′ ,…,Tn ′ ).
The value of information is therefore the inner product of the rate
of change of the payoff function with the optimal trait T*(p), the
covariance matrix of the random knowledge vector Q and the rate
of change of a trait with information.
Further manipulation (see Appendix 2) gives I (X) in the form
(
′ *
1 Var ∑ i =1Wi (T ( p ))Q i
I (X ) = −
2 ∑ n pW ′′ (T * ( p ))
n
i =1 i
)
(21)
i
Expression 21 is equivalent to Expression 9 when the world can be
in one of 2 distinct states (see Appendix 3). When n = 2, then the
covariance term in Expression 19 is perfectly negatively correlated,
that is − Var( pˆ ) = Cov (1 − pˆ , pˆ ) . Increasing the probability that the
state of the world is in state 1 decreases the probability the world is
in state 2 by exactly the same amount. This is not always the case
when n > 2 because there is more than 1 degree of freedom.
CASE STUDY: OPTIMAL WAIT TIME AT THE
NEST WHEN THE TYPE OF PARTNER IS
UNKNOWN
Parent birds raising young constantly change their behavior in
response to information gained about the effort of their mate
(Skutch 1976; Harrison et al. 2009). With the life history of long
lived, monogamous sea birds (e.g., some albatross species) in mind,
we apply our expression to find the value of information to an individual about their partner.
Consider a male and female pair of a socially monogamous bird
species where incubation and feeding of the young is shared by both
parents. During the breeding season, one parent must be present
with the young at a nest site at all times from laying to fledgling.
Although one parent stays with the young, the other leaves to forage for themselves and their offspring. We assume that the female
Pike et al. • The value of information
1299
takes the first incubation shift and the male leaves to forage. We also
assume a constant high probability of survival. There are 2 types of
male in the population. Type 1 will return to the nest after foraging
whereas type 0 will not. The female is uncertain which type of male
she is paired with. The probability that the female is paired with a
type 1 male is p, and the probability that the female is paired with
a type 0 male is 1 − p. It is assumed the female behaves as if she
knows p. The female possesses a continuous trait T that determines
how long she waits at the nest for the male to return from foraging before abandoning the nest herself. Waiting carries a cost and
so waiting too long could be detrimental to future breeding success.
The female may face increased vulnerability to predators or suffer a reduction of body condition that may affect her reproductive
success in future years. However, abandoning the offspring leads to
complete breeding failure for the current season. The female follows
strategy T ≥ 0 if she abandons the nest at time T. If the male is of
type 1, the female receives a reward value of V if the male returns
before time T and receives a reward value of L if the male returns
after time T. If the male is of type 0, the female receives reward L if
she leaves at time T. If the male fails to return and the female does
not leave, the costs incurred to the female result in her death and the
payoff to the female is 0. We assume that the reward values V and
L satisfy the inequality V > L. We further assume that the female
incurs a cost c per unit of time she waits for the male to return. The
return time of a male of type 1 is described as a continuous random
variable X supported on the interval (0, ∞) and is modeled by the
exponential distribution, X ~ exp(λ), with rate parameter λ > 0.
When the female’s strategy is T, the expected payoff to the
female paired with a type 0 male is
W0 (T ) = L − cT
(22)
and the payoff to a female paired with a type 1 male averaging over
his possible return times is
T
∞
0
T
W1 (T ) = ∫ f X ( x )[V − cx ]dx + ∫ f X ( x )[ L − cT ]dx
(23)
where f X ( x ) = λe − λx . The first term on the right hand side of
Expression 23 represents the payoff to the female if X < T and
the second term represents the payoff to the female if X > T.
Expression 23 can be solved to give
c
c

W1 (T ) = e − λT  L − V +  + V − .
(24)


λ
λ
From Equation 1, the expected payoff to the female who adopts
strategy T given p is L − cT

W (T , p ) = (1 − p )( L − cT ) + p e − λT

c
c

 L − V +  + V −  (25)
λ
λ
There exists a critical value of p that we define as
c
pcrit =
.
(26)
λ (V − L )
The strategy that satisfies the derivative condition of Equation 2
and maximizes Equation 25 is
T * ( p) =
 p   λ (V − L ) − c )  
1
Ln 

  .
c
λ  1 − p  

(27)
This is the optimal wait time of the female given p. Only when p >
pcrit, is T*(p) > 0. Otherwise T*(p) = 0 and the female should abandon the nest immediately. Figure 1a shows how the optimal wait
time depends on p. If the female follows the optimal strategy then
by definition (Equation 3), she receives the optimal payoff W*(p)
(see Figure 1b). From Equations 25 and 27, the 2nd differential of
the optimal payoff function is
c
W * ′′ ( p ) = 2
.
(28)
p (1 − p )λ
As stated above, we assume that the female behaves as if she knows
the prior probability. If the female gains further information about
the type of male she is paired with by observation, the value of this
information to the female can be calculated directly using Equation
28 as
I ( p ) ∝ W * ′′ ( p ),
(29)
because the behavior of the value of information function I(p)
is proportional to the function W * ′′ ( p ) for a given variance.
Figure 1c illustrates the value of information as a function of p.
Unlike models that reward the focal individual for simply “knowing
the truth” (Grafen 1990), the function I(p) values information only
if a gain in knowledge changes the female’s behavior. The value in
information is the reproductive value gained when this information
is used to change behavior. There is no change to the female’s optimal behavior for a small gain in information about her partner’s
type below pcrit and so information is not valuable. When p is strictly
greater than pcrit, the function I(p) is positive, and information is
valuable because the consequence of gained information always
changes the female’s optimal wait time. The function I(p) becomes
most valuable for values of p closest to 1; thus, there is value in
gaining information about your partner even if you know with a
high likelihood which type they are. When the value of information
is dependent on variance in behavior, a greater change to behavior
will render information more valuable (see Figure 1a).
In some contexts, p can be taken as given (when the prior p is
determined from courtship with a specific male) and in others a
self-consistent solution would be required.
Consider a population where females pair with a randomly chosen male at the beginning of each breeding season. Let 40% of the
males in the population be of type 1. Through a period of observation (e.g., courtship), the female gains information about the type
of male she is paired with to give a prior p. We are interested in
how valuable further observation would be given p. In this case, the
reward value L is a constant because the expected future reproductive success of the female will always be the same despite the type
of male the female is paired with in the current breeding season.
For more complex cases, for example, when a female retains her
mate across breeding seasons, her expected future reproductive
success (L) will depend on p. There are also complications when a
female can choose a male from one of multiple populations where
the proportion of type 1 males differ in each population. If p varies, then so will the costs and benefits. In this scenario, the life history of each population would need to be considered to determine
parameter values. In such cases, one would need to consider selfconsistency (Houston and McNamara 2005). A self-consistent version of this model would be a useful direction for future work.
DISCUSSION
Using statistical decision theory techniques and an optimality
model, we establish a new expression to quantify the reproductive
value of information when behavior is continuously variable. Our
formulae highlight that the value of information is determined by
Behavioral Ecology
1300
(a)
18
16
14
c = 0.1
c = 0.3
c = 0.6
c = 0.9
12
T*(p)
10
8
6
4
2
0
0
0.2
0.4
p
0.6
0.8
1
0.6
0.8
1
(b) 4
3.5
c = 0.1
c = 0.3
c = 0.6
c = 0.9
W*(p)
3
2.5
2
1.5
1
0.5
0
0.2
0.4
p
(c) 45
c = 0.1
c = 0.3
c = 0.6
c = 0.9
40
35
W*"(p)
30
25
20
15
10
5
0
0
0.2
0.4
p
0.6
0.8
1
Figure 1.
(a) The optimal wait time against p. The optimal wait time of the female increases with the probability that the male is of type 1. For p less than or equal to
pcrit, the female should abandon the nest immediately. For p above the critical value of pcrit, it is optimal to wait an intermediate amount of time until p = 1
when the female should wait indefinitely. c is the cost incurred per unit of time the female waits for the male to return. (b) Reproductive value given the
optimal strategy against p. Reproductive value, characterized by the optimal payoff function, increases monotonically with p. W*(p) is a constant in the region
0 < p < pcrit as the optimal wait time for the female is always 0. (c) W*(p) against p, which is proportional to the value of information function I(p). W*(p) is
constant and 0 in the region 0 < p < pcrit, and we can say information is not valuable here. W*(p) is positive for p strictly greater than pcrit and information is
valuable. W*(p) tends to infinity as p tends to 1; thus, information is most valuable for high values of p. There is a discontinuity at p = pcrit. Parameter values
for (a), (b), and (c) are λ = 0.4, V = 4, and L = 1. pcrit = 0.833, 0.25, 0.5, and 0.75 for c = 0.1, 0.3, 0.6, and 0.9, respectively.
Pike et al. • The value of information
a possible change in behavior as a result of an observation. We first
derive an expression for the value of information for a 2-state world
then generalize the model for a world that can be in multiple states.
We illustrate our approach with a case study based on the cooperative behavior between a male and female raising their common
young where an incubating female must decide how long to wait
for her foraging partner to return before abandoning the nest.
Our formulae highlight that the value of information is determined by its potential to change behavior, a point made by
Stephens (1989). In our formula, the variance of the posterior plays
a crucial role as opposed to Stephens’ (1989) criterion that is in
terms of the variance of behavior.
In our model, because ∑ i pi = 1, the set of all possible knowledge vectors p = ( p1 ,…, pk ) is an n − 1 dimensional space Sn − 1.
The optimal payoff function W*(p) is a defining surface over Sn − 1,
and this surface is the reproductive landscape. Following McNamara
and Dall (2010), we use the fact that E[Q] = p (with Q being the
random knowledge vector) and due to the convexity of the reproductive landscape W*(p), then E[W*(Q)] ≥ W*(p). That is, the
height of the reproductive landscape at the prior p is less than or
equal to its average height at the posterior probabilities whose mean
is p. The variance in the posterior determines the average increase
in reproductive value of the individual after gaining information
about the state of the world, which is the value of information.
Stephens quantifies the value of information as the difference
between a perfectly informed individual and an individual with no
information beyond the prior. The value of attending to a potential observation that gives partial information about the state of
the world is solved by calculating the value of perfect information
about the observable states of the cue. In our model, we assume,
perhaps more realistically, that an individual gains partial information about the state of the world and an observation will update the
prior distribution.
For an animal’s behavior to be influenced by a gain in information, the animal must be flexible with their choice of action, always
taking the best action given their current knowledge. In more technical terms, assuming an animal behaves optimally, it is flexible in
its choice of action if it can change its behavior based on the random variable pˆ.
As with other theoretical models of flexible effort adjustment (Axelrod and Hamilton 1981; Sherratt and Roberts 1998;
Killingback and Doebeli 2002), we consider an adjustable trait in
response to the state of the world. The alternative is for the trait to be
genetically determined and unchangeable by the animal (McNamara
et al. 2008). Dall et al. (2005) argue, however, that animals do not
possess complete flexibility but instead follow mechanistic rules which
limit this flexibility, an example being a period of learning in which
behavioral flexibility occurs being followed by a lack of learning for
the rest of the animal’s life (Lotem and Halpern 2010). The value of
information can still be defined in this case but we would expect it to
be less than if behavior were completely flexible.
Behavioral flexibility is encapsulated by the multiplier term T*(p)
in Expression 9 and the vector T * ′ in Expression 19. Both terms
represent the rate of change of the optimal trait with information.
If there is no change to behavior after a gain in information, then
both terms are equal to 0 and there is no value to the information.
Only when there is flexibility in behavior is the rate of change of
the optimal trait nonzero and information is valuable. We conclude
that there is only value to information if the animal possesses flexibility to change its behavior in response to a gain in information.
Our new expression to quantify the value of information uses
Bayesian statistical theory to model the process of an animal
1301
updating its knowledge. An alternative measure of information is
mutual information (Shannon 1948; Shannon and Weaver 1949),
which describes uncertainty reduction measured in terms of
entropy (Cover and Thomas 1991).
Donaldson-Matasci et al. (2010) show the connection between
mutual information and the decision theoretic measure of information within the context of evolution in an uncertain environment.
They show that the fitness value of a developmental cue, when
measured as the increase in long-term growth rate of a lineage,
equals reduction in uncertainty as described by mutual information.
The analysis of Donaldson-Matasci et al. (2010) is concerned
with the case in which the environment as a whole fluctuates, so
that all individuals in a population are subject to the same environmental conditions. In this setting, a cue provides potential information on the current environment and the value of a cue is the
resultant expected increase in geometric mean fitness. In contrast,
we are concerned with the situation in which there are no fluctuations in the environment as a whole, but different individuals are
in different situations, for example, some have partners of type 1,
whereas others do not. In our analysis, a cue provides potential
information about a certain individual in the environment largely
independent of other individuals in the population, and the value
of observing a cue is the resultant expected increase in reproductive
value. Thus, Donaldson-Matasci et al. (2010) are concerned with
environmental stochasticity (aggregate risk), whereas our analysis is
concerned with demographic stochasticity (idiosyncratic risk).
Although Donaldson-Matasci et al. (2010) establish a link
between mutual information and the decision theoretic measure of
information, it is not clear that our context would provide the same
connection. For our context, reproductive value is the appropriate
measure of information.
It is important to note the continuous nature of the trait in our
model. We argue that using a continuous trait in the derivation of
the expression for the value of information allows for more realistic biological applications. For example, males vary the continuous
level of effort invested in a clutch dependent on paternity (Møller
and Cuervo 2000) such as in the mating system of the dunnock
(Prunella modularis) (Davies 1992). In some species, males allocate
sperm according to the expected reproductive value of a copulation
based on the level of sperm competition associated with a female
(Wedell et al. 2002) and on female mate quality (Hunter et al. 2000;
Pizzari et al. 2003). In principle, we can use our expression to find
the value of information about paternity to a male or the value of
information about different types of female.
To apply a biological context to our proof, we illustrated our
expression for the value of information using the cooperative
behavior between a male and a female bird raising their common
young. It is assumed that the female is in possession of a continuous,
flexible trait that allows her to vary the amount of time devoted to
incubation of the young. We quantified the value of information to
the female about the type of male she is paired with and the effect
of information on her behavior. We showed that as the probability the male was of the type that returns to the nest after foraging
increased, the female’s optimal wait time for his return was longer.
We showed that information was valuable to the female if a gain of
information changed her behavior.
In our illustration, we assumed a fixed cost to the female per unit
of time. More realistically for some contexts, a mortality rate parameter could be used. However, the use of such a parameter would
not affect the shape of the value of information curve and therefore
the interpretation of results. The Taylor series is used to approximate the payoff function W*(p). Stephens (1989, p. 137) discusses
Behavioral Ecology
1302
the limitations of using such an approximation method in a similar
context.
Our illustration has shown that there is value in knowing your
partner. If information is valuable to an animal, we would expect an
animal to pay a cost of gathering it. Through courtship, for example,
a female can gather information about a desired quality in a mate.
Our approach allows us to quantify the value of this information.
The general case of our expression is also applicable to a variety
of foraging problems where we can consider an animal in a world
of n distinct habitats with a different value of a parameter in each.
For example, we could consider an animal with a choice of n patches,
each with a different level of prey abundance, where the animal is
flexible in the amount of time it dedicates to a patch (McNamara and
Houston 1985). The animal may gain information about the level of
prey in an individual patch simply by spending time in the patch. We
can determine how valuable this information is to the animal.
There are a wide variety of species and contexts in which an individual possesses a flexible, continuous trait where information would
be useful to them. Our results can be extended by rederiving our
expression for the value of information with the assumption that
the state of nature is a continuum. The probabilities would become
density functions and the summation, integrals. For example, the
optimal clutch size of a female given the state of the environment
could be predicted, where the environment is modeled with a continuous distribution. This would render further opportunities for our
expression to be applied to even more biological scenarios.
APPENDIX 2
Deriving Equation 21
The definition of T*(p) is extended to this setting, still defined as
the value of T that maximizes W(T, p). Differentiating W*(p) partially with respect to pi gives
∂W *
∂W *
∂T *
∂W *
( p) =
(T ( p ), p )
( p) +
(T ( p ), p ).
∂pi
∂T
∂pi
∂pi
(2.1)
The first term on the right hand side in Equation 2.1 is equal to 0
from Equation 2. The remaining term on the right hand side can
be simplified because from Equation 13, we have
n
*
W (T * ( p ), p ) = ∑ pW
i
i (T ( p ))
(2.2)
i =1
and when partially differentiating by pi then all pj when j ≠ i are
held constant. Thus, we resolve
∂W *
(T ( p ), p ) = Wi (T * ( p ))
(2.3)
∂pi
Differentiating Equation 2.3 partially with respect to pj gives
∂2W *
∂T *
( p ) = Wi ′(T * ( p ))
( p ).
∂pi ∂p j
∂p j
n
∑ pW ′(T
( p )) = 0
(2.5)
and differentiating this equation partially with respect to pj gives
i =1
FUNDING
This work was supported by the Engineering and Physical Sciences
Research Council (Ep/1032622/1 to R.K.P. and J.M.M.) and the
European Research Council Advanced Grant (250209 to A.I.H.).
n
(1.1)
∂2W
dT *
∂2W
(T * ( p ), p )
( p ) + 2 (T * ( p ), p ).
∂T ∂p
dp
∂p
(1.3)
The second term on the right hand side of Equation 1.2 is equal
to 0 because the 2nd partial derivative of W from Equation 1
with respect to p evaluated at the optimal strategy T*(p) is equal
to 0. The remaining 2nd partial derivative in Equation 1.2 can be
calculated directly from Equation 1 to give
∂2W
∂
(T * ( p ), p ) =
(W1 (T ) − W0T ).
(1.4)
∂T ∂p
∂T
Thus, by Equations 1.2 and 1.3, we have
∂T *
( p ) = 0,
∂p j
k
k
Thus, by Equations 2.4 and 2.7, we have
k =1
Differentiating W*(p) partially with respect to p gives
∂W *
∂W *
dT *
W *′ ( p ) =
(T ( p ), p ) +
(T ( p ), p )
( p ).
(1.2)
∂p
∂T
dp
The second term on the right hand side of Equation 1.1 is equal to
0 by Equation 2. Differentiating Equation 1.1 again partially with
respect to p gives
k =1
which rearranges to
(1.5)
(2.6)
(2.7)
Wi (T * ( p ))W j’ (T * ( p ))
∂2W *
( p) = −
.
n
∂pi ∂p j
∑ k =1 pkWk ″(T * ( p ))
From Equation 3, the optimal payoff function is defined as
W * ′′ ( p ) = [W1′ (T ) − W0′ (T )]T * ′ ( p ).
*
i
W j ′ (T * ( p ))
∂T *
( p) = − n
.
∂p j
∑ p W ′′ (T * ( p ))
APPENDIX 1
Calculating W*”(p) directly from W*(p)
W * ′′ ( p ) =
i
W j ′ (T * ( p )) + ∑ pkWk ′′ (T * ( p ))
Handling editor: Shinichi Nakagawa
W * ( p ) = W (T ∗ ( p ), p ).
(2.4)
Because W(T, p) is at its maximum at T = T*(p), we have
(2.8)
Combining Equations 2.8 and 17, we get
*
*
n Wi ′ (T ( p ))W j ′ (T ( p ))
1 n
Cov (Q i ,Q j ).
∑
∑
n
(2.9)
2 i =1 j =1 ∑ pW ′′ (T * ( p ))
i
i
i
=
1
Using Cov(X + Y, Z) = Cov(X, Z) + Cov(Y, Z) and because the
covariance of a random variable with itself is the variance of the
random variable, we have
I (X ) = −
∑ ∑
n
n
i =1
j =1
Wi ′ (T * ( p ))W j ′ (T * ( p ))Cov (Q i ,Q j )
(∑
= Var (∑
= Cov
Wi ′ (T * ( p ))Q i , ∑ j =1W j ′ (T * ( p ))Q
n
n
i =1
n
)
Wi ′ (T * ( p ))Q i .
i =1
By Equations 2.9 and 2.10,
(
which gives the result in Expression 21.
i =1
) (2.10)
)
′ *
1 Var ∑ i =1Wi (T ( p ))Q i
I (X ) = −
,
*
n
2
∑ piWi ′′(T ( p ))
n
j
(2.11)
Pike et al. • The value of information
1303
APPENDIX 3
Expression 9 is equivalent to Expression 21
when the world can be in 2 distinct states
Setting i = 2 in Expression 21 gives
I (X ) = −
(
)
′ *
′ *
1 Var Q 0W0 (T ( p )) + Q 1W1 (T ( p ))
.
2
′′ *
p0W0′′ (T * ( p )) + pW
1 1 (T ( p ))
(3.1)
Because ∑ i Q i = 1, then Q0 can be set as 1 − Q and Q1 can be set
as Q. Therefore, the numerator in Expression 3.1 becomes
[W1′ (T * ( p )) − W0′ (T * ( p ))]2 Var (Q )
(3.2)
because Wi ′ (T * ( p )) is a constant. As ∑ i pi = 1, then p0 can be set
as 1 − p and p1 can be set as p and the payoff function W(T, p) has
the same form as Equation 1. Differentiating W(T, p) partially with
respect to T gives
W0′ (T ) + p[W1′ (T ) − W0′ (T )].
(3.3)
Evaluating Equation 3.3 at T = T*(p) and making use of Equation
2 gives
(3.4)
W0’ (T * ( p )) + p W1′ (T * ( p )) − W0′ (T * ( p )) = 0


Differentiating Equation 3.4 totally with respect to p gives
−[(1 − p )W0′′ (T * ( p )) + pW1′′ (T * ( p ))] T * ′ ( p )
= W1′ (T * ( p )) − W0′ (T * ( p)))
Combining Equations 3.1, 3.2, and 3.5,
(3.5)
1 [W1′ (T * ( p )) − W0′ (T * ( p ))]2 Var(Q )T * ′ ( p )
2
W1′ (T * ( p )) − W0′ (T * ( p ))
1
= VarQ [W1′ (T * ( p )) − W0′ (T * ( p ))]T * ′ ( p ). 2
I (X ) =
(3.6)
which matches the original derivation of I(p) in Expression 9
when X = p and Q = pˆ .
REFERENCES
Axelrod R, Hamilton WD. 1981. The evolution of cooperation. Science.
211:1390–1396.
Berg LM, Pálsson S, Lascoux M. 2001. Fitness and sexual response to population density in Daphnia pulex. Freshw Biol. 46:667–677.
Collins SA, Hubbard C, Houtman AM. 1994. Female mate choice in the
zebra finch: the effect of male beak colour and male song. Behav Ecol
Sociobiol. 35:21–25.
Collins EJ, McNamara JM, Ramsey DM. 2006. Learning rules for optimal selection in a varying environment: mate choice revisited. Behav Ecol. 17:799–809.
Cover TM, Thomas JA. 1991. Elements of information theory. New York:
Wiley.
Dall SR, Giraldeau LA, Olsson O, McNamara JM, Stephens DW. 2005.
Information and its use by animals in evolutionary ecology. Trends Ecol
Evol. 20:187–193.
Davies NB. 1992. Dunnock behaviour and social evolution. Oxford series in
ecology and behaviour. Oxford: Oxford University Press.
Donaldson-Matasci MC, Bergstrom CT, Lachmann M. 2010. The fitness
value of information. Oikos. 11:219–230.
Getty T. 1996. Mate selection by repeated inspection: more on pied flycatchers. Anim Behav. 51:739–745.
Gould JP. 1974. Risk, stochastic preference and the value of information. J
Econ Theory. 8:64–84.
Grafen A. 1990. Biological signals as handicaps. J Theor Biol.
144:517–546.
Harrison F, Barta Z, Cuthill I, Szekely T. 2009. How is sexual conflict over parental care resolved? A meta-analysis. J Theor Biol.
22:1800–1812.
Houston and McNamara. 2005. John Maynard Smith and the importance
of consistency in evolutionary game theory. Biol Philos. 20:933–950.
Hunter FM, Harcourt R, Wright M, Davis LS. 2000. Strategic allocation of
ejaculates by male Adelie penguins. Proc Biol Sci. 267:1541–1545.
Kendeigh SC. 1952. Parental care and its evolution in birds. Ill Biol
Monogr. 22:1–356.
Killingback T, Doebeli M. 2002. The continuous prisoner’s dilemma and
the evolution of cooperation through reciprocal altruism with variable
investment. Am Nat. 160:421–438.
Killingback TM, Doebeli M, Knowlton N. 1999. Variable investment, the
continuous prisoner’s dilemma, and the origin of cooperation. Proc Biol
Sci. 266:1723–1728.
Lotem A, Halpern JY. 2010. Co-evolution of learning and data acquisition
mechanisms: a model for cognitive evolution. Philos Trans R Soc Lond B
Biol Sci. 367:2686–2694.
Luttbeg B. 1996. A comparative Bayes tactic for mate assessment and
choice. Behav Ecol. 7:451–460.
Maynard Smith J, Harper DGC. 1995. Animal signals: models and terminology. J Theor Biol. 177:305–311.
Maynard Smith J, Harper DGC. 2003. Animal signals. Oxford: Oxford
University Press.
Mazalov V, Perrin N, Dombrovsky Y. 1996. Adaptive search and information updating in sequential mate choice. Am Nat. 148:123–137.
McNamara JM, Barta Z, Fromhage L, Houston AI. 2008. The coevolution
of choosiness and cooperation. Nature. 451:189–192.
McNamara JM, Dall SRX. 2010. Information is a fitness enhancing
resource. Oikos. 119:231–236.
McNamara JM, Doodson P. 2015. Reputation can enhance or suppress
cooperation through positive feedback. Nat Commun. 6:6134.
McNamara JM, Green RF, Olsson O. 2006. Bayes theorem and its application in animal behaviour. Oikos. 112:243–251.
McNamara JM, Houston AI. 1980. The application of statistical decision
theory to animal behaviour. J Theor Biol. 85:673–690.
McNamara JM, Houston AI. 1985. A simple model of information use in
the exploitation of patchily distributed food. Anim Behav. 33:553–560.
Møller AP, Cuervo JJ. 2000. The evolution of paternity and paternal care in
birds. Behav Ecol. 11:472–485.
Nice MM. 1937. Studies in the life history of the song sparrow I. A population study of the song sparrow. Linn Soc NY Trans. 4:1–247.
Pizzari T, Cornwallis CK, Lovlie H, Jakobsson S, Birkhead TR. 2003.
Sophisticated sperm allocation in male fowl. Nature. 426:70–74.
Shannon CE. 1948. A mathematical theory of communication. Bell Syst
Tech J. 27:379–423, 623–656.
Shannon CE, Weaver W. 1949. The mathematical theory of communication. Urbana (IL): The University of Illinois Press.
Sherratt TN, Roberts G. 1998. The evolution of generosity and choosiness
in cooperative exchanges. J Theor Biol. 193:167–177.
Stephens DW, Krebs JR. 1986. Foraging theory. Princeton (NJ): Princeton
University Press.
Skutch AF. 1976. Parent birds and their young. Austin (TX): University of
Texas Press.
Wedell N, Gage MJG, Parker GA. 2002. Sperm competition, male prudence and sperm-limited females. Trends Ecol Evol. 17:313–320.
Wilkinson GS. 1984. Reciprocal food sharing in the vampire bat. Nature.
308:181–184.