Evolutionary branching in a finite population

Genetics: Advance Online Publication, published on October 26, 2012 as 10.1534/genetics.112.144980
1
2012-1023
2
Evolutionary branching in a finite population:
3
Deterministic branching versus stochastic branching
4
Joe Yuichiro Wakano1, 2 and Yoh Iwasa3
5
6
1. Meiji Institute for Advanced Study of Mathematical Sciences, Meiji University,
7
Kawasaki 214-8571, Japan
8
2. Japan Science and Technology, PRESTO, Japan
9
3. Department of Biology, Kyushu University, Fukuoka 812-8581, Japan
10
11
1
Copyright 2012.
12
13
Running title:
Evolutionary branching in a finite population
14
15
Keywords: adaptive dynamics, genetic drift, stochastic differential equation,
16
mutation-selection-drift balance, population size
17
18
19
Corresponding author:
20
|
21
22
23
24
25
26
Joe Yuichiro Wakano
Meiji Institute for Advanced Study of Mathematical Sciences, Meiji University,
1-1-1 Higashi Mita, Tama-ku, Kawasaki, Kanagawa, Japan 214-8571
Email: Joe Yuichiro Wakano <[email protected]>
Fax: +81-44-934-7660
27
28
29
30
(to appear in Genetics)
31
2
32
Abstract
33
Adaptive dynamics formalism demonstrates that, in a constant environment, a
34
continuous trait may first converge to a singular point followed by spontaneous
35
transition from a unimodal trait distribution into a bimodal one, which is called
36
"evolutionary branching". Most previous analyses of evolutionary branching have been
37
conducted in an infinitely large population. Here, we study the effect of stochasticity
38
caused by the finiteness of the population size on evolutionary branching. By analysing
39
the dynamics of trait variance, we obtain the condition for evolutionary branching as the
40
one under which trait variance explodes. Genetic drift reduces the trait variance and
41
causes stochastic fluctuation. In a very small population, evolutionary branching does
42
not occur. In larger populations, evolutionary branching may occur, but it occurs in two
43
different manners: in deterministic branching, branching occurs quickly when the
44
population reaches the singular point, whilst in stochastic branching, the population
45
stays at singularity for a period before branching out. The conditions for these cases and
46
the mean branching-out times are calculated in terms of population size, mutational
47
effects, and selection intensity, and are confirmed by direct computer simulations of the
48
individual-based model.
49
3
50
51
1. Introduction
In addition to models using quantitative genetics (Lande 1981, Barton &
52
Turelli 1991, Pomiankowski et al. 1991, Iwasa et al. 1991), adaptive dynamics
53
formalism has revealed diverse behaviours of evolutionary dynamics associated with
54
continuous traits in various organisms (Metz et al. 1992, Dieckmann & Law 1996,
55
Geritz et al. 1997, Geritz et al. 1998). This approach allows us to analyse the evolution
56
of many ecological traits for which frequency-dependent selection is prevalent without
57
specifying the underlying genetic systems controlling the trait. Even in a perfectly
58
constant environment, a population may show intriguing temporal behaviour. For
59
example, if the trait evolves by accumulation of mutations of small magnitude and if the
60
fitness is frequency dependent, a continuous trait may show convergence to a singular
61
point followed by spontaneous splitting of a unimodal trait distribution into a bimodal
62
(or multimodal) one, referred to as "evolutionary branching" (Metz et al. 1992, Metz et
63
al. 1996, Geritz et al. 1997). Evolutionary branching is predicted to occur at an
64
evolutionarily singular point that is approaching stable (or convergence stable; CS) but
65
that is not an ESS, and is observed in individual-based simulations in many models for
66
the evolution of ecological traits.
67
In evolutionary branching, an evolving population spontaneously changes its
68
trait distribution from unimodal to bimodal (or multimodal). Although evolutionary
69
branching has been regarded as a model of sympatric speciation by some authors
4
70
(Dieckmann & Doebeli 1999, Doebeli & Dieckmann 2000), the relationship between
71
evolutionary branching in adaptive dynamics formalism and speciation events has been
72
the focus of much debate (for review, Waxman & Gavrilets 2005).
73
Most previous analyses of adaptive dynamics have assumed an infinite
74
population size and that the fitness landscape acts as the deterministic driving force of
75
evolution. However, in a finite population, stochastic fluctuations are unavoidable. For
76
example, an individual with a lower fitness may be able to produce more offspring just
77
by chance. Thus, a finite population can evolve against the selection gradient. As a
78
result, stochasticity allows a population to jump from a low local peak to a higher peak.
79
Hence, stochastic fluctuation can be an important factor in the outcome of evolution.
80
The importance of stochasticity, or genetic drift, has been studied in detail in the context
81
of quantitative population genetics (e.g., Lande 1976, Tazzyman & Iwasa 2010). On the
82
other hand, in population genetics, random drift is known to decrease the genetic
83
diversity of a population (e.g., Ewens 2004), which tends to slow down the action of
84
natural selection. Hence, the effect of genetic drift is important to understand
85
evolutionary dynamics, including evolutionary branching, in finite populations. In fact,
86
Claessen et al. (2007) analysed evolutionary branching in a model including three
87
species, and reported that genetic drift tends to cause a delay in evolutionary branching.
88
However, a more detailed study of the relationship between evolutionary branching and
89
genetic drift is needed.
5
90
In this paper, we study the effect of stochastic fluctuation caused by the
91
finiteness of population size (random genetic drift) on evolutionary branching. As an
92
illustrative example, we adopt a model of a public goods game studied by Doebeli et al.
93
(2004) who investigated evolutionary branching by using a corresponding
94
individual-based simulation. The game model deals with a situation that is both
95
biologically and socially interesting: a self-organised differentiation of players into a
96
highly cooperative group and a non-cooperative group. Concerning the genetics, we in
97
effect assume that the population is haploid and that the focal trait is controlled by a
98
locus with many alleles. By focusing on this simplest case, we can analyse the effect of
99
random genetic drift to the evolutionary outcome most clearly. We first show the results
100
of individual-based simulations using different population sizes. We show that genetic
101
drift is especially important when the population size is small, and that evolutionary
102
branching does not necessarily occur even when the standard theory of adaptive
103
dynamics predicts that it should. We also derive deterministic and stochastic versions of
104
the dynamics of the trait variance. We reveal that there are two different manners in
105
which evolutionary branching occurs: in deterministic branching, branching may occur
106
quickly after the population reaches the singular point, whilst in stochastic branching,
107
the population stays at a singularity for a period before branching out suddenly. We
108
derive the conditions and mean branching-out times for these cases, and confirm our
109
predictions by direct computer simulations of the individual-based model.
6
110
111
2. Evolutionary Branching in a Game in a Finite Population
112
2.1 Individual-based model
113
As a full stochastic model of a population under natural selection, mutation and
114
random genetic drift, we consider the following stochastic process. The population
115
consists of N individuals reproducing asexually. Each individual has a genetically
116
determined trait value z  [0,1] . At each discrete time step t, an individual i obtains a
117
fecundity Fi from pairwise interactions with the other N-1 individuals. Each
118
interaction is a two-player game where a payoff given to an individual i matched with
119
an individual j is given by f ( zi , z j ) . Its fecundity Fi is denoted by
120
121
Fi   f ( zi , z j ) .
(1)
j i
122
123
Our framework is valid for any payoff function f ( zi , z j ) . As an example, we adopt a
124
model by Doebeli et al. (2004) in which trait z represents the amount of investment into
125
public goods shared by both players. In this game, the payoff function is represented by
126
127
f ( zi , z j )  1  B( zi  z j )  C ( zi ) ,
(2a)
128
129
where the benefit function is
7
130
B ( z1  z 2 )  b1 ( z1  z 2 )  b2 ( z1  z 2 ) 2 ,
131
(2b)
132
133
and the cost function is
134
C ( z )  c1 z  c2 z 2 .
135
(2c)
136
137
To keep the population size constant, the fitness of an individual with trait zi is
138
denoted as
139
w( zi ) 
140
Fi
.
(1 / N ) F j
(3)
j
141
142
When the population size is infinitely large, Doebeli et al. (2004) has shown
z* 
c1  b1
4b2  2c2
143
that
144
QCS  4b2  2c2  0 and evolutionarily stable when QES  2b2  2c2  0 . The singular
145
strategy is referred to as an evolutionary branching point when QCS  0 and QES  0 .
146
They have observed evolutionary branching in an individual-based simulation of a large
147
population (N=10,000).
the
singular
strategy
8
is
convergence
stable
when
148
In a finite population, the model includes random genetic drift, and the details
149
of the updating rule and the order of different stages in population dynamics becomes
150
important. Here, we assume a Wright-Fisher process. At each time step, fecundity is
151
calculated for all N individuals. Then all individuals die and are replaced by their
152
offspring which are sampled multinomially, with the probability of an individual to be
153
the parent of each offspring individual proportional to the fecundity of the focal
154
individual.
155
After reproduction, there is a small chance of mutation  , and the mutant has
156
a trait value slightly different from that of the parent. To be specific, it follows a normal
157
distribution with a mean equal to the parent's and a variance  2 . Mutants are
158
constrained to be between 0 and 1, which confines trait z within an interval [0,1].
159
160
2.2 Evolutionary branching depends on the population size.
161
All simulations were run with parameter values for which a unique singular
162
point exists that is convergence stable and evolutionarily non-stable (i.e. the
163
evolutionary branching point). Thus, evolutionary branching is expected when the
164
population size is infinitely large.
165
When the population size is large (N = 8,000), evolutionary branching occurs
166
as soon as the trait evolves to reach the convergent stable (CS) value z* (Figure 1(a)).
167
For an intermediate population size (N = 1,000), the population lingers around the CS
9
168
value, z*, before it finally shows evolutionary branching. The waiting time until the
169
evolutionary branching varies considerably among different simulation runs with the
170
same parameter values (Figure 1(b) and (c)). For a small population size (N=200), we
171
do not observe evolutionary branching and the population seems to stay around z*, even
172
though z* is not evolutionarily stable (Figure 1(d), see also Wakano and Lehmann
173
2012).
174
The evolutionary branching dynamics are characterised by an abrupt increase
175
or explosion of the variance of the focal phenotype (the trait variance). If all the
176
parameters for the interactions are the same, a small population does not branch, an
177
intermediately large population shows delayed branching, and a large population
178
branches immediately. The intuitive reason for the smallness of the population size to
179
suppress evolutionary branching is that genetic diversity is reduced by genetic drift
180
every generation. In the following sections, we provide an approximate dynamics for
181
the trait variance to understand these simulation results.
182
183
184
3. Dynamics of Trait Variance
The full (individual-based) model is analytically intractable. Therefore, we
185
derive a reduced simplified dynamical system here, which is tractable and gives
186
predictions on the behaviour of the full model. One generation in the reduced model
187
consists of two substeps.
10
188
189
190
Substep 1: Natural selection and mutation
Let  (z ) denote a distribution of trait z, which is normalised to satisfy
191
  ( z )dz  1 . When a fitness function
192
sufficiently weak, the trait distribution in offspring pool after mutation, denoted by
193
 (z ) , can be approximated by the following replicator equation plus a mutation term.
w(z ) is given and selection and mutation are
194
195
(z)  (z)  w(z)  w (z) 
 2 2
2 z 2
(z) ,
(4)
196
197
where w is the population average of fitness values. The second term on the
198
right-hand side represents the effect of selection and the third term represents the effect
199
of mutation. The Taylor expansion of fitness function around the average trait value z
200
is denoted as
201
202
w( z )  w( z )  w1 ( z  z ) 
w2
(z  z)2 ,
2
(5)
203
204
where the coefficients w1 and w2 are determined by the current state z . The
205
coefficient w1 represents the selection gradient. The coefficient w2 indicates
206
stabilising selection if it is negative, and indicates disruptive selection if it is positive.
11
207
The derivation of w2 from the payoff function f ( zi , z j ) is explained in Appendix A.
208
For parameter values used in our simulations, we have w2  0.1 , implying that
209
disruptive selection is operating. The population average and the variance of trait values
210
in the offspring trait distribution  (z ) are respectively denoted as
211
212
213
w2
m3 ,
2
w
w
2
2
Var [ z ]  m2  ( w1  2 )m2  w1m3  2 m4   2 ,
2
2
Av [ z ]  z  w1m2 
(6a)
(6b)
214
215
where mk denotes the kth centralised moment of  (z ) . The above approximation is
216
derived by neglecting m5 and higher order moments (see Appendix B).
217
Substep 2: Random sampling (or random genetic drift)
218
From the offspring pool  ( z ) , N individuals are independently sampled to
219
become adults. Hence, even for a fixed  (z ) , different distributions of adult
220
phenotypes are realised according to a probability distribution (probability measure)
221
over all possible trait distributions. After sampling N individuals independently from
222
 ( z ) , the expectation of the average trait value is identical to the average trait value in
223
 (z ) . In contrast, the expectation of the trait variance is decreased by a factor (N-1)/N
224
(Evens 2004). Hence, we have the following:
225
12
E[ Av[ Z ]]  Av [ z ]
226
E[Var[ Z ]] 
.
N 1
Var [ z ]
N
(7)
227
228
229
Combining the two sub-steps, the expected changes in the population average
and the variance in one generation are given by
230
231
E[Av[ Z ]]  w1m2 
232
E[Var[ Z ]] 
w2
m3 ,
2
(8a)
N 1 
w

2
2
2
m2  w1m3  w1 m2  2 (m4  m2 )   2   m2 .

N 
2

(8b)
233
234
We are particularly interested in the dynamics of the variance of the trait distribution.
235
To analyse Eq. (8) further, we here assume that the trait distribution is close to a normal
236
distribution whose mean and variance obey the dynamics given by Eq. (8). For normal
237
distributions, m3  0 and m4  3m2 . Using these relationships, we have the following
238
recursive equations for the mean and variance of the trait.
2
239
240
z  w1m2
241
m2 
(9a)
N 1
1
N 1 2
(w 2  w12 )m2 2  m2 
 .
N
N
N
(9b)
242
243
The first, the second, and the third terms on the right-hand side of Eq. (9b) represent the
13
244
effect of natural selection, the loss of diversity due to genetic drift, and the increase of
245
diversity due to mutation, respectively. Without selection, the variance m2 converges
246
to N 1 2 .
247
248
4. Condition for the Explosion of Trait Variance
249
As long as the selection gradient does not vanish ( w1  0 ), the mean trait value
250
changes according to Eq. (9a), which is a standard result of population genetics (Ewens
251
2004) and corresponds to the canonical equation in adaptive dynamics theory
252
(Dieckmann & Law 1996, Champagnat et al. 2006). If a singular point z*, where the
253
selection gradient vanishes ( w1  0 ), is convergence stable, then the system approaches
254
the singular point with time (Geritz et al. 1997). Once the mean trait (i.e., the first
255
moment of trait distribution) reaches the singularity, it will not change any longer.
256
However, the trait variance (i.e., the second centralised moment of the distribution) may
257
change with time. See Figure 2 for a schematic illustration. Here we focus on the
258
recursion for the trait variance:
259
260
m2 
1
N 1
N 1
2
w2 m2  m2 
 2
N
N
N
(10)
261
262
which indicates that the one generational change in the trait variance is equal to the sum
263
of three terms: the natural selection term, the genetic drift term and the mutation term.
14
264
The right-hand side of Eq. (10) represents the "force" acting on the trait variance
265
(Figure 3). Note that the orders of the three terms in Eq. (10) are m2 2 , m 2 , and 1,
266
respectively. When the variance is small, the mutation (the third term) is dominant, and
267
it increases the trait variance. As the variance increases, the genetic drift (the second
268
term) increases its magnitude and it reduces the trait variance. The balance between
269
these two processes would result in the equilibrium m2 *  N 2 , which represents the
270
variance determined by mutation-drift balance. When the trait variance is large,
271
selection (the first term) becomes dominant. Selection decreases the trait variance when
272
w 2  0 (stabilising selection), but it increases the trait variance when w 2  0
273
(disruptive selection).
274
Combing these, we have three different cases, depending on the parameters.
275
Case (i) When w2  0 , stabilising selection is at work. Then the dynamics of the trait
276
variance (Eq. (10)) has a single positive equilibrium m2 * 
277
which is globally stable, and the trait variance converges to this value for any initial
278
value. At this equilibrium, the trait variance is maintained by mutation, which increases
279
variance, and by genetic drift and stabilising selection, which decrease variance.
280
Especially when the stabilising selection is weak, trait variance converges to the level
281
maintained by mutation-drift balance ( m 2*  N  1 2 ) when w 2 is small.
15
1  1  4( N  1) 2 w2  2
,
2( N  1) w2
1
, disruptive selection is at work, but it is not
4 ( N  1) 2
282
Case (ii) When 0  w2 
283
very strong. Then the dynamics of the trait variance have two positive equilibria: the
284
smaller one is stable, but the larger one is unstable. If the trait variance starts from a
285
small value, it converges to the smaller equilibrium, where the trait variance is
286
maintained mainly by the mutation-drift balance. If the trait variance starts from a larger
287
value, then disruptive selection dominates, leading to the unlimited growth of the trait
288
variance.
289
Case (iii) When w2 
290
the trait variance no longer has a stable equilibrium. Starting from any initial value, the
291
trait variance keeps increasing without limit.
2
1
, disruptive selection is strong. The dynamics of
4 ( N  1) 2
2
292
Figure 4 shows a bifurcation diagram of the variance dynamics given by Eq.
293
(10), with the stabilising/disruptive selection coefficient ( w2 ) as a bifurcation parameter.
294
Figure 5 shows a bifurcation diagram in which the population size (N) is a bifurcation
295
parameter. We have a similar bifurcation diagram where mutational effects (  2 ) serve
296
as a bifurcation parameter (not shown). We regard the explosion of the trait variance in
297
the dynamics Eq. (10) as evolutionary branching. Then, these bifurcation diagrams
298
show that evolutionary branching occurs as a result of a saddle-node bifurcation.
299
Evolutionary branching is predicted to occur at a non-ES singular point when disruptive
16
300
selection intensity, population size, mutation rate, and mutation step size are sufficiently
301
large. Specifically, deterministic dynamics Eq. (10) predict that evolutionary branching
302
occurs when 4 2 ( N  1) 2 w2  1 holds.
303
304
5.
Stochastic Calculation of the Explosion of Variance
305
The deterministic dynamics of the trait variance represented by Eq. (10)
306
indicates only the expectation of one generational change. There must be stochasticity
307
caused by the genetic drift. Results of an individual-based model, such as those
308
illustrated in Figure 1, clearly show stochasticity. To account for stochasticity, the
309
deterministic difference equation Eq. (10) needs to be modified by an additional term.
310
In this section, we derive the stochastic differential equation model considering the
311
stochasticity of trait variance dynamics.
312
313
The deterministic dynamics Eq. (10) can be approximated by the following
ordinary differential equation (ODE):
314
d
m2  f (m2 ) ,
dt
315
(11a)
316
317
where
318
17
319
( N  1) w2 x 2  x  ( N  1)  2
f ( x) 
.
N
(11b)
320
321
If stochasticity has no correlation over generations, and if the paths can be approximated
322
as continuous, then we can incorporate the Brownian motion term, and the differential
323
equation Eq. (11) becomes the following stochastic differential equation (SDE):
324
325
dM 2  f ( M 2 )dt 
2
M 2 dB ,
N
(12)
326
327
where dB denotes Brownian motion and M 2 denotes a random variable representing
328
the trait variance. f ( M 2 ) is the systematic change per generation in variable M 2 and
329
is given by Eq. (11b).
330
The second term on the right-hand side of Eq. (12) represents stochasticity in
331
trait variance caused by random genetic drift (see Appendix C for derivation). The
332
stochastic fluctuation term depends on the trait variance itself (geometric Brownian
333
motion), because the amplitude of fluctuation in the trait variance in each generation of
334
offspring is large when the trait variance in each parental generation is large. The
335
stochastic term also depends on population size, because fluctuation is averaged out in
336
large populations. Hence, random genetic drift becomes less important in larger
18
337
populations. The system is similar to but different from the motion of a particle moved
338
by "force" shown in Figure 3 under heat noise.
339
340
341
342
5.1 Stationary distribution
The corresponding Fokker-Planck equation (or forward Kolmogorov equation)
to Eq. (12) is given by
343
344


1 2 2 2

p(m2;t)  
f (m2 ) p(m2 ;t)

2  m 2 p(m 2 ;t)  ,

2 m2 N
t
m2
(13)
345
346
(Feller, 1971), where pm2 ;t dm2  Pr m2  M 2 t   m2  dm2  is the probability
347
density for the trait variance m2 . The stationary distribution of the trait variance is
348
349
~
p ( x)  C x 3e( N 1) a ( x ) ,
(14a)
350
351
where C is a normalising constant and
352
353
a ( x) 
w2 x 2   2
.
x
(14b)
19
354
355
Figure 6 illustrates an example of stationary distribution of the trait variance, showing
356
that the system tends to show slightly smaller trait variance than the locally stable value
357
in Eq. (11), because the fluctuation is weaker when the variance is smaller (geometric
358
Brownian motion).
359
360
Comparison with the results of simulations:
361
We compared the stationary distribution of the trait variance between the
362
theoretical prediction given by Eq. (14) and the observed distribution obtained from the
363
individual-based simulation (Section 2). Figure 7 shows good agreement between the
364
diffusion approximation (i.e., the SDE model) and the corresponding simulation result
365
with a small population size. In particular, the approximation captures the shift of the
366
mode — the peak of the distribution lies at the value smaller than N 2  0.0008 . The
367
average trait variance in an interval shown in Figure 7 is close to N 2 for both
368
analytic prediction (Eq. (14)) and simulation results.
369
370
5.2. Waiting time for evolutionary branching
371
In Case (ii), there are stable and unstable equilibria for the trait variance, the
372
latter being larger than the former. If the initial value of trait variance is smaller than the
20
373
unstable equilibrium, the deterministic model Eq. (11) predicts that variance would be
374
maintained indefinitely. In contrast, if the initial value is larger than the unstable
375
equilibrium, it quickly increases and grows without limit. Hence, whether trait variance
376
eventually explodes depends on the initial condition. In the presence of stochasticity,
377
however, the trait variance would not be maintained at smaller values than the unstable
378
equilibrium; instead, it would fluctuate around the stable equilibrium for a period, but
379
then eventually increase beyond the unstable equilibrium and diverge to infinity. Hence,
380
under the stochastic differential equation, the trait variance is predicted to explode
381
eventually even if the initial value is small. The timing of this explosion is controlled by
382
the stochasticity.
383
The mean waiting time prior to explosion is a typical "first passage time"
384
problem in diffusion theory. This has also been used in calculating the mean time of
385
fixation in population genetics (Ohta and Kimura 1968), and the mean time to
386
extinction in conservation biology (e.g. Lande 1993; Hakoyama and Iwasa, 2000), and
387
physiology (e.g. Ricciardi, 1977).
388
To study the expectation of waiting time, we assume that a system has a
389
reflecting boundary at m2  l  0 and an absorbing boundary at m2  r (note that the
390
left boundary can never be achieved because the deterministic "force" is increasing the
391
variance while stochastic fluctuation vanishes at m2  0 , i.e., Eq. (12) is singular at
392
m2  0 ). Let T ( ) be the expected waiting time until the system reaches the right
21
393
absorbing boundary m2  r when the initial value of the trait variance is m2 (0)   .
394
To obtain T ( ) , we use the following backward Kolmogorov equation
395
d
2 d2
T ( ) 
T ( ) ,
d
N d 2
 1  f ( )
396
(15)
397
398
with the boundary condition T (r )  0 and T ' (l )  0 (see Appendix E). The solution
399
is
400
401
T ( )  N 
r

l
y
x 3 ye( N 1)( a ( x )a ( y )) dxdy
(16)
402
403
where a(x) is given by Eq. (14b). The expected waiting time is smaller when
404
population size is larger, disruptive selection is stronger, or mutational effects are larger
405
(see Appendix D).
406
407
Comparison with the results of simulations:
408
We compared the mean waiting times observed in the computer simulations of
409
the individual-based model and the prediction by the approximate stochastic differential
410
equation model given by Eq. (16). Once the system reaches a convergence stable point,
411
the trait variance m2 is expected to be close to the locally stable value N 2 (see
22
412
also our results on stationary distribution). Thus, we numerically calculate T ( N 2 )
413
and compare the corresponding result in individual-based simulation (Figure 8). We also
414
derive the probability distribution of the first passage time by using Kolmogorov's
415
backward equation in Appendix E, but the fit of the shape of the probability distribution
416
of waiting times to the distribution observed in individual-based simulations is less than
417
perfect (Figure 9).
418
As we decrease N, the waiting time increases exponentially, and it eventually
419
becomes impossible to escape from the locally stable unimodal state in a reasonable
420
time (Figure 8). This implies that evolutionary branching is unlikely to occur in a small
421
population.
422
When N is very large, genetic drift can be neglected and the system is
423
approximated by the ODE; dm2 / dt  w2 m2   2 . Thus, the waiting time can be
424
approximated by the time required to move from m2  N 2 to the right boundary
425
m2  r . This time is about 1500 for N = 1000 and 960 for N = 2000. This calculation, in
426
conjunction with Figure 8, shows that the stochastic fluctuation plays a dominant role
427
even in a population of 1000 individuals.
2
428
Our reduced model measures genetic diversity in trait distribution. An
429
alternative measure of genetic diversity is the number of segregating alleles, which
430
depends only on  and is predicted by Ewen's sampling formula when selection is
431
absent (Ewens 2004). Figure 10 shows the effect of mutation rate  and mutation step
23
432
size  2 . Simulations confirmed our prediction that only the value of  2 matters,
433
even though the number of segregating alleles in simulation with N = 1800 was about
434
50 and 150 for   0.0025 and 0.01, respectively. It is also confirmed that smaller
435
mutational effects require a longer waiting time until evolutionary branching.
436
437
6. Discussion
438
In this paper, we have studied the effect of stochastic fluctuation due to finite
439
population size on evolutionary branching. We have derived an ordinary differential
440
equation and then a stochastic differential equation for the dynamics of trait variance.
441
The analysis has revealed that there are two different manners in which evolutionary
442
branching occurs: deterministic branching and stochastic branching. In case of
443
deterministic branching, evolutionary branching occurs relatively soon after the mean
444
trait reaches the branching point. This occurs when disruptive selection intensity w2
445
exceeds a threshold value: w2  1 /(4  2 ( N  1) 2 ) , which can be approximated by
446
1 N  2 w2  2 . This condition for deterministic branching implies that the intensity of
447
random genetic drift 1 N is greater than disruptive selection intensity w 2 and the
448
mutational effects  2 (see later). If the opposite inequality holds, i.e., when
449
disruptive selection is weaker than the threshold, then a unimodal distribution is locally
450
stable in the deterministic model and stochastic fluctuation is necessary for the
451
population to undergo evolutionary branching. We call this latter type of evolutionary
24
452
branching "stochastic branching". We observed a relatively long waiting time until
453
evolutionary branching occurs in simulations. The distribution and expectation of the
454
waiting time were mathematically calculated and these predictions were consistent with
455
computer simulations of the individual-based model. Considering the several
456
assumptions we have adopted, the consistency is better than expected, although not
457
perfect. As population size or mutational effect decreases, the waiting time
458
exponentially increases. Mathematically, this is because these factors appear in the
459
exponent in the expression of the expected waiting time (Eq. 16). Although the increase
460
of the waiting time is a continuous phenomenon resulting from the underlying
461
stochasticity in individual-based simulations, we can understand evolutionary branching
462
by classifying it as either deterministic branching in which stochastic fluctuation is
463
unnecessary or to stochastic branching in which stochastic fluctuation is necessary.
464
Our analysis has revealed that evolutionary branching is almost impossible in
465
some biological situations in which the effective population size or mutational effect is
466
sufficiently small. As a population size increases, our condition for deterministic
467
evolutionary branching approaches and converges to the standard branching point
468
condition. The condition can be written as 4w2 N 2  2  1 , which can be used as an
469
approximate indicator; for example a population with size N = 108 with mutational
470
effects  2  1014 is predicted to show deterministic branching when disruptive
471
selection intensity is w2  O(1) , although such individual-based simulation is almost
25
472
impossible.
473
The condition separating deterministic and stochastic branching is understood
474
intuitively by considering the force suppressing variance (genetic drift) and the two
475
forces promoting the variance (disruptive selection and mutation). The balance between
476
them determines which type of branching occurs. The condition for deterministic
477
branching 1 N  2 w2  2 can be rewritten as (1 N ) 2  4w2   2 which means
478
(genetic drift)2 < 4 × (selection effect) × (mutation effect).
479
This implies that deterministic branching occurs when the intensity of genetic drift is
480
smaller than double the geometric average of the selection and mutation intensities. It is
481
important to note that the condition is determined by N 2 instead of N .
482
In order to mimic the full individual-based model by the deterministic and
483
stochastic equations of the trait variance, we have adopted several simplifying
484
assumptions in deriving the dynamics: i.e. trait distribution is assumed to be close to
485
normal, and the trait variance is small enough to justify series expansion. From
486
simulation results, it is clear that the trait distribution finally becomes multimodal when
487
evolutionary branching occurs. Thus, our analytic results are only valid for the initial
488
growth of the trait variance where unimodal distribution (approximated by a normal
489
distribution) becomes broader at the singular point. This initial explosion is followed by
490
highly non-linear and complex dynamics that realises a multimodal distribution.
491
Nevertheless, the agreement of our analytic prediction and simulation results suggests
26
492
that our reduced model captures the onset of evolutionary branching.
493
Based on individual-based simulations, Claessen et al. (2007) reported delayed
494
evolutionary branching and proposed two reasons: random genetic drift and extinction
495
of incipient branches. In contrast to Claessen et al. (2007), our model has no fluctuation
496
in the total population size. Thus, the delayed or absent evolutionary branching is
497
caused by demographic stochasticity— the fluctuation in the number of offspring each
498
individual produces. Our simple model has enabled us to derive the condition for
499
evolutionary branching mathematically, with random genetic drift taken into
500
consideration, which has been only verbally proposed before. In addition, our
501
mathematical analysis and simulation results have revealed that the mutation rate and
502
step size are as important as population size.
503
In this paper, to illustrate the effect of genetic drift on the evolutionary
504
branching, we used as an example a pairwise interaction game that incorporated a
505
Wright-Fisher update. We conjecture that the extension of our model to include a Moran
506
update or to a multi-player game is readily possible. In the case of including a Moran
507
update, the loss of genetic diversity (Eq.7) should be appropriately modified and also
508
the unit of time should be rescaled since N Moran steps correspond to a single
509
Wright-Fisher step. The result of our model should be easily extended to
510
resource-competition models, which have been intensively studied in adaptive dynamics
511
(e.g., Dieckmann & Doebeli 1999, Sasaki & Dieckmann 2011, Mirrahimi et al. 2012).
27
512
We have focused on the dynamic processes in which the population escapes
513
from the locally stable value of the trait variance, while keeping its trait distribution as a
514
unimodal distribution. Conceptually, if we allow all possible trait distributions, we
515
might have two locally stable states, one of which corresponds to a unimodal state and a
516
second that corresponds to a fully branched state. Stochastic branching in the present
517
model only considers the case in which a system located at the unimodal state escapes
518
from this local optimum by stochastic fluctuation (random genetic drift). However, a
519
branched state can also evolve back to the unimodal state by genetic drift; thus, a
520
stochastic analysis including both escaping from and evolving back to the unimodal
521
distribution would give us an even deeper understanding of evolutionary branching.
522
28
523
Acknowledgements
524
This work was supported by Japan Science and Technology PRESTO to J.Y.W., a
525
Grant-in-Aid for Scientific Research (B) from JSPS to Y.I., and also by the support of
526
the Environment Research and Technology Development Fund (S9) of the Ministry of
527
the Environment, Japan. We would like to thank the following people for their very
528
helpful comments: C. Hauert, Y. Kobayashi, C. Miura, W. Nakahashi, H. Ohtsuki, and A.
529
Sasaki.
530
29
531
Literature Cited
532
Barton, N. H., and M. Turelli. 1991. Natural and sexual selection on many loci.
533
Genetics 127:229-255.
534
Champagnat, N., R. Ferriere, and S. Meleard. 2006. Unifying evolutionary dynamics:
535
from individual stochastic processes to macroscopic models. Theor. Popul.
536
Biol. 69:297-321.
537
538
539
540
541
542
Claessen, D., J. Andersson, L. Persson, and A.M. de Roos. 2007. Delayed evolutionary
branching in small populations. Evol. Ecol. Res. 9:51-69.
Dieckmann, U., and R. Law. 1996. The dynamical theory of coevolution: a derivation
from stochastic ecological processes. Jour. Math. Biol. 34:579-612.
Dieckmann, U., and M. Doebeli. 1999. On the origin of species by sympatric speciation.
Nature 400: 354-357.
543
Doebeli, M., and U. Dieckmann. 2000. Evolutionary branching and sympatric
544
speciation caused by different types of ecological interactions. Amer. Nat.
545
156:S77-S101.
546
547
Doebeli, M., C. Hauert, and T. Killingback. 2004. The evolutionary origin of
cooperators and defectors. Science 306:859-862.
548
Ewens, W. J. 2004. Mathematical Population Genetics. Springer-Verlag, New York.
549
Geritz, S. A. H., J. A. J. Metz, E. Kisdi, and G. Meszena. 1997. Dynamics of adaptation
550
and evolutionary branching. Phys. Rev. Lett. 78:2024-2027.
30
551
Geritz, S.A.H., E. Kisdi, G. Meszena, and J. A. J.Metz. 1998. Evolutionarily singular
552
strategies and the adaptive growth and branching of the evolutionary tree. Evol.
553
Ecol. 12:35-57.
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
Feller, W. 1971. Introduction to probability theory and its application. 3rd. ed. vol. 2.
John Wiley, New York.
Hakoyama, H., and Y. Iwasa. 2000. Extinction risk of a density-dependent population
estimated from a time series of population size. J. Theor. Biol. 204:337-359.
Iwasa, Y., A. Pomiankowski, and S. NEE. 1991. The evolution of costly mate
preferences. 2. The handicap principle. Evolution 45: 1431-1442.
Karlin, S., and H. M. Taylor. 1981. A second course in stochastic processes. Academic
Press. London.
Kimura, M. 1957. Some problems of stochastic processes in genetics. Annal. Math. Stat.
28: 882-901.
Lande, R. 1976. Natural selection and random genetic drift in phenotypic evolution.
Evolution 30:314-334.
Lande, R. 1981. Models of speciation by sexual selection on polygenic traits.
Proc.Nat.Acad.Sci.U.S.A. 78: 3721-3725.
Lande, R. 1993. Risks of population extinction from demographic and environmental
stochasticity and random catastrophes. Am. Nat. 142:911-927.
Metz, J.A.J., R.M. Nisbet, and& S.A.H. Geritz. 1992. How should we define "fitness"
31
571
for general ecological scenarios? Trends Ecol. Evol. 7: 198-202.
572
Metz, J.A.J., S.A.H. Geritz, G. Meszena, F.J.A. Jacobs, and J.S. van Heerwaarden.
573
Adaptive dynamics, a geometrical study of the consequences of nearly faithful
574
reproduction. In: Stochastic and Spatial Structures of Dynamical Systems. S.J.
575
van Strien & S.M. Verduyn Lunel, eds. (1996) pp. 183-231. A'dam,
576
North-Holland
577
578
579
580
581
582
583
584
Mirrahimi, S., B. Perthame, and J.Y. Wakano. 2012. Evolution of species trait through
resource competition. J. Math. Biol. 64:1189-1223.
Pomiankowski, A., Y. Iwasa, and S. NEE. 1991. The evolution of costly mate
preferences. 1. Fisher and biased mutation. Evolution 45: 1422-1430.
Ricciardi, L. M. 1977. "Diffusion processes and related topics in biology", Lecture
notes in biomathematics. vol. 14, Springer-Verlag, Berlin.
Sasaki, A., and U. Dieckmann. 2011. Oligomorphic dynamics for analyzing the
quantitative genetics of adaptive speciation. J. Math. Biol. 63:601-635.
585
Tazzyman, S.J., and Y. Iwasa. 2000. Sexual selection can increase the effect of random
586
genetic drift - a quantitative genetic model of polymorphism in Oophaga pumilio,
587
the strawberry poison-dart frog. Evolution 64:1719-1728.
588
Wakano, J.Y., and L. Lehmann. 2012. Evolutionary and convergence stability for
589
continuous phenotypes in finite populations derived from two-allele models. J.
590
Theor. Biol. 310: 206-215.
32
591
592
Waxman, D., and S. Gavrilets. 2005. Twenty questions on adaptive dynamics. J. Evol.
Biol. 18:1139-1154.
593
33
594
Figure Legends
595
596
Figure 1. Evolutionary dynamics. (a) A simulation run with N = 8000. (b,c) two
597
different simulation runs with N = 1000 (d) A simulation run with N = 200.
598
Parameters:   0.01,   0.02, (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
599
A schematic illustration of phenotype distribution  ( z ) and fitness
600
Figure 2.
601
function w(z ) when the mean trait value z has reached the singular strategy z*. The
602
trait variance is Var[ ( z )]  m2 so the standard deviation is
m2 .
603
604
Figure 3. "Force" plotted against the trait variance based on Eq.10. Dotted, solid, and
605
thick curves represent stabilising selection ( w2 = - 0.2), weak disruptive selection ( w2 =
606
0.2), and strong disruptive selection ( w2 = 0.6), respectively. They also correspond to
607
Case i), ii), and iii) in the main text.
608
Parameters: N=400,   0.01,   0.02
609
610
Figure 4. Stable (thick) and unstable (broken) branches of the trait variance m2 with
611
disruptive selection intensity w2 as a bifurcation parameter. Arrows indicate the
612
direction of systematic change in m2 .
613
Parameters: N=1000,   0.01,   0.02
34
614
615
Figure 5. Stable (thick) and unstable (broken) branches of the trait variance m2 with
616
population size N as a bifurcation parameter. A Log plot is shown. Arrows indicate
617
the direction of systematic change in m2 .
618
Parameters:   0.01,   0.02, (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
619
620
Figure 6. The stationary distribution ~
p (m2 ) of the trait variance m2 (Eq.14). A
621
Log-Log plot is shown, and thus the probability density is much higher at the peak than
622
it appears. The locally stable value of the variance according to Eq.11 is m2 *  0.00081,
623
which is close to N 2 =0.0008 (dashed line). Note that a local peak lies at a value
624
smaller than 0.0008 because of geometric Brownian motion.
625
Parameters: N=200,   0.01,   0.02, (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
626
627
Figure 7. The stationary distribution of the trait variance m2 . Curves represent an
628
analytic result (Eq. 14), while bars are a histogram of realised trait variances in a
629
simulation run for 10 6 time steps. The dashed line represents N 2 =0.0008. In the
630
simulation, the initial trait distribution was set as monomorphic at z =0.2.
631
transition from this initial state to the stationary state was negligible as it took only
632
about 5000 time steps.
633
Parameters: N=200,   0.01,   0.02, (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
35
The
634
635
Figure 8
636
A Log-plot of waiting times for evolutionary branching for different population sizes
637
(N). The thick and thin curves represent analytic result based on a SDE model (Eq.16)
638
and that based on an ODE model (Eq.11), respectively. Each open circle represents the
639
result of a single run in individual-based simulation. The first time when the trait
640
variance exceeded 0.1 is shown. In the simulation, the initial trait distribution was set as
641
monomorphic at z =0.2 and each run consisted of 106 time steps. Simulation runs in
642
which the variance did not explode are plotted at 10 6 . Time required to reach a singular
643
point in the simulation was roughly 5000 when N = 200 and 2000 when N = 1800,
644
which is included to draw open circles. Thus, open circles are expected to lie slightly
645
above the curve.
646
Params:   0.01,   0.02, (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
647
648
Figure 9. The distribution of waiting times. Curves represent an analytic result (see
649
Appendix E), while bars represent a histogram of the waiting time obtained in 1000
650
different runs of individual-based simulation.
651
Parameters: N = 1000,   0.01,   0.02, (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
652
653
Figure 10.
A Log-plot of waiting times for evolutionary branching for different
36
654
population sizes and mutation parameters. The upper panel (   0.0025,   0.02 )
655
depicts a result for a smaller mutation rate than in Figure 8, while the lower panel
656
(   0.01,   0.01 ) depicts a result for a smaller mutation step size. As  2  10 6 is
657
the same, the analytic prediction curves are the same. The two simulation results are
658
also undistinguishable.
659
Parameters: (b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
660
661
662
663
37
664
Appendices
665
666
Appendix A: Fitness function and the coefficients
667
Because individuals are playing a game, their payoff, and thus their fitness
668
depend not only on an individual's own trait but also on their opponents' traits. In
669
principle, it is impossible to determine a fitness function for an individual with trait z
670
by an average trait z because more information is needed (i.e., the distribution of the
671
trait  (z ) ). However, when the trait variance Var[ ( z )]  m2 is small, fitness is
672
mainly determined by the deviation of the trait value from the average, z  z , and can
673
be approximated by
674
w( z )  w( z )  w1 ( z  z ) 
w2
(z  z)2
2
(A-1)
675
where the derivatives w1 and w2 are evaluated at a situation for which all individuals
676
(except self) carry the average trait z . In other words, Eq.(A-1) represents invasion
677
fitness. The coefficients are functions of z , so that w1  w1 ( z ) and w2  w2 ( z ) .
678
According to adaptive dynamics, a singular point z* satisfies w1 ( z*)  0 and it is
679
convergence stable if
680
a singular point is convergence stable but not evolutionarily stable, evolutionary
681
branching is known to occur.
682
d
w1 ( z )
 0 . It is evolutionarily stable if w2 ( z*)  0 . When
dz
z z*
Here we mainly focus on a case in which the average trait remains at z  z * ,
38
683
and thus we treat w1 and w2 as if they were constant coefficients. They can be
684
explicitly calculated for finite N, but when N >100 it is well approximated by the value
685
obtained by an infinite population limit. In this limit, fitness function is denoted as
w( z ) 
686
687
690
(A-2)
and thus
688
689
f ( z , z*)
,
f ( z*, z*)
w1 


1


f ( z , z*)
f ( z*, z*)  z
z  z* 
(A-3a)
w2 
 2

1
 2 f ( z , z*)


f ( z*, z*)  z
z  z* 
(A-3b)
and
691
hold. To calculate the intensity of disruptive selection, not only the sign but also the
692
magnitude is important. For the model of Doebeli et al. (2004), this becomes
693
w2 
2(b2  c2 )
f z*, z *
(A-4)
c1  b1
. We numerically confirmed that the value of w2 can be well
4b2  2c2
694
where z* 
695
approximated
696
(b1 , b2 , c1 , c2 )  (6.0,1.4,4.56,1.6)
697
magnitude of disruptive selection depends on b1 and c1 , as well as b2 and c2 . In the
by
w2  0.1
for
the
parameter
values
and for the population sizes N >100. The
39
698
vicinity of our choice of parameter values used in all figures, disruptive selection is
699
stronger for smaller values of b1 and larger values of c1 .
700
701
Appendix B: Derivation of deterministic approximation
702
When both selection and mutation are weak, their effects can be approximated
703
as additive and we obtain the replicator equation Eq.(4). By integrating Eq.(5), which is
704
the Taylor expansion of w(z) around
z   z ( z )dz ,
705
706
we have
w    ( z )w( z )dz  w( z ) 
707
708
709
710
711
712
(B-1)
w2
m2
2
(B-2)
where
mk   ( z  z ) k ( z )dz
(B-3)
is the kth centralised moment around z . Putting this into Eq.(4), we have
w
 2  2
 w

 ( z )   ( z )   2 m2  w1 ( z  z )  2 ( z  z ) 2  ( z, t ) 
 ( z ) . (B-4)
2
2 z 2
 2

Imposing zero-flux boundary conditions  / z  0 , we have
  ( z )dz  1
713
714
w2
m3
2
w
 w

2
2
  ( z )( z  z ) dz  1  22 m2 m2  w1m3  22 m4  
  ( z )( z  z )dz  w m
1
2

We denote statistical values of the offspring distribution by
40
(B-5)
Av [ z ]    ( z ) zdz
715
716
(B-6)
Var [ z ]    ( z )( z  Av [ z ]) 2 dz
then we have Av [ z ]  z  w1m2 
717
w2
m3 , which is Eq.(6a).
2
To calculate the variance in the offspring pool, we use
Var [ z ]    ( z )( z  z )  ( Av [ z ]  z )  dz
2
718
719
   ( z ) z  z  dz  ( w1m2 
2
(B-7)
w2
m3 ) 2
2
and we have
Var [ z ]  m2 
720
721
When
722
Var [ z ]  m2  ( w1 
we
w2 2
w
w
m2  w1m3  2 m4   2  ( w1m2  2 m3 ) 2 .
2
2
2
neglect
2
the
fifth
and
higher
order
terms,
(B-8)
we
have
w2
w
2
)m2  w1m3  2 m4   2 , which is Eq.(6b).
2
2
723
724
Appendix C: Derivation of SDE approximation
725
We want to approximate the dynamics by an SDE with the following form
726
dM 2  f ( M 2 )dt  g ( M 2 ) dB
(C-1)
727
where dB denotes Brownian motion and M 2 denotes a random variable representing
728
the variance in trait distribution. To obtain g (x) , we use an established result in
729
statistics that a random variable
730
731
N
Z z
X   i

v 
i 1 
2
(C-2)
obeys the chi-squared distribution where Z i 's are N independent random variables each
41
732
of which is sampled from a normal distribution with mean z and variance v 2 . The
733
variance of the chi-squared distribution is 2N. When N is large, we can approximate z in
734
Eq.(C-2) by Z . The trait variance M 2 can be written by
M2 
735
1
N
 Z
N
i 1
i Z 
2
v2
X,
N
(C-3)
736
M 2 is a random variable, due to random genetic drift. Its variance over probability
737
distribution is given by
2
 v2 
2v 4
.
Var[ M 2 ]  2 N   
N
N
738
(C-4)
739
According to our model assumptions, we approximate the variance v 2 by the trait
740
variance in offspring distribution  ( z ) . Then, we have the following approximation:
Var[ M 2 ] 
741
742
2m2
N
2
(C-5)
which yields g ( x)  2 x 2 / N and Eq.(12).
743
By a more detailed calculation, we can show
744
Var[ M 2 ] 
2( N  1)m2
,
N2
2
(C-6)
745
which results in ~
p ( x)  x  ( 3 N 2 ) /( N 1) e Na ( x ) , but for our purpose Eq.(C-5) is precise
746
enough.
747
42
748
749
Appendix D: Properties of waiting times
Here, we show that the expectation of waiting time
T  N
750
r

 l
y
x  3 ye ( N 1)( a ( x )  a ( y )) dxdy
(D-1)
751
is a decreasing function of w2 , N, and  2 . First, T is an increasing function of
752
a( x)  a( y ) , which satisfies
1 1
a ( x)  a ( y )   w2 ( y  x)   2   
x y
753
(D-2)
754
Since y>x holds in the region of integration in Eq.(D-1), the waiting time T is a
755
decreasing function of w2 and  2 . To show the dependence on N, we use
r y
d
T    x 3e ( N 1)( a ( x )a ( y )) B ( x, y ) dxdy
 l
dN
756
757
758
(D-3a)
where
B ( x, y )  y  N
( x  y )( wxy   2 )
x
(D-3b)
759
When N is large, B(x,y) has the same sign as x-y, which is negative. Thus, T is a
760
decreasing function of N for large N. In addition, since a ( x)  a( y ) is a negative
761
constant independent of N, we have
762
lim N  T  0
(D-4)
763
with all the other parameters fixed at positive values. This is true even for the vanishing
764
mutation rate (i.e. N 2  const . when N   ). However, it does not necessarily
765
go to zero if N (a ( x)  a( y )) does not go to   . For example, if Nw2 and N 2
43
766
remain constant as N   , the waiting time T can be an increasing function of N.
767
768
Appendix E: Distribution of waiting times
769
Let  (x ) be an indicator function such that  ( r )  1 and  ( x )  0 for
770
l  x  r . Then a random variable  ( M 2 (t )) is unity if absorption at m2  r has
771
occurred until time t, and it is zero otherwise. Let u ( , t )  E  ( M 2 (t )) where E
772
denotes the expectation in the stochastic process with the initial value M 2 (0)   . Then
773
the following Kolmogorov backward equation holds (Karlin & Taylor 1981)
 2u
u 1
u
 g ( ) 2
 f ( )

 2
t
774
(E-1)
775
Let Q be a random variable representing the first passage time, then it is intuitively clear
776
that
777
u( t,)  Pr[Q  t | m 2 (0)  ]
(E-2)
778
is the probability that absorption at m2  r has occurred on or before time t. The
779
probability density that absorption occurs exactly at time t is given by u / t . A similar
780
logic and calculation were adopted in a classical paper studying the fixation probability
781
in population genetics (Kimura 1957). The expected first passage time is
782
783

T ( )   t
0
u
dt .
t
(E-4)
After some calculations, we obtain
44
784
 1  f ( )
d
2 d2
T ( ) 
T ( )
d
N d 2
785
which is Eq.(15). To obtain the distribution of waiting time, we numerically solve the
786
partial differential equation Eq.(E-1) with domain   [l , r ] . We impose a reflecting
787
boundary at left and Dirichlet boundary at right u (t , r )  1 . In the numerical algorithm,
788
the initial value is set as u (0, )  0 except u (0, r )  1 . The FTCS method with small
789
dt and d values gives the numerical solution u~ (t , ) . To check the validity of this
790
result, we numerically put it into Eq.(E-4) and compared the result with that of the
791
numerically obtained value of Eq.(16). The expected waiting times obtained by the two
792
different approaches were almost the same. The distribution of the waiting time t from
793
the initial trait variance N 2 is given by u~ (t , N 2 ) , which is shown in Figure 9.
794
45
Wakano & Iwasa, Figure 1
 (z )
w(z )
m2
z
z*
Wakano & Iwasa, Figure 2
Case iii)
Case ii)
Case i)
Wakano & Iwasa, Figure 3
Case i)
no
branching
m2
0.05
0.04
Case ii)
stochastic
branching
0.03
Case iii)
deterministic
branching
w2 * 
1
4  2 N 2
0.02
0.01
-0.04 -0.02 0.00
0.02
0.04
0.06
0.08
0.10
w2
Wakano & Iwasa, Figure 4
stochastic
branching
1
deterministic
branching
0.1
m2
0.01
0.001
N *  2  2 / w2
10 -4
200
400
600
800
1000
N
Wakano & Iwasa, Figure 5
1000
~
p (m2 )
10
0.1
0.001
0.0001
0.001
0.01
0.1
m2
Wakano & Iwasa, Figure 6
frequency
m2
Wakano & Iwasa, Figure 7
Wakano & Iwasa, Figure 8
waiting time
Wakano & Iwasa, Figure 9
Wakano & Iwasa, Figure 10