INVESTIGATION A Test for Selection Employing Quantitative Trait Locus and Mutation Accumulation Data Daniel P. Rice*,1 and Jeffrey P. Townsend*,†,2 *Department of Ecology and Evolutionary Biology and †Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520 ABSTRACT Evolutionary biologists attribute much of the phenotypic diversity observed in nature to the action of natural selection. However, for many phenotypic traits, especially quantitative phenotypic traits, it has been challenging to test for the historical action of selection. An important challenge for biologists studying quantitative traits, therefore, is to distinguish between traits that have evolved under the influence of strong selection and those that have evolved neutrally. Most existing tests for selection employ molecular data, but selection also leaves a mark on the genetic architecture underlying a trait. In particular, the distribution of quantitative trait locus (QTL) effect sizes and the distribution of mutational effects together provide information regarding the history of selection. Despite the increasing availability of QTL and mutation accumulation data, such data have not yet been effectively exploited for this purpose. We present a model of the evolution of QTL and employ it to formulate a test for historical selection. To provide a baseline for neutral evolution of the trait, we estimate the distribution of mutational effects from mutation accumulation experiments. We then apply a maximum-likelihood-based method of inference to estimate the range of selection strengths under which such a distribution of mutations could generate the observed QTL. Our test thus represents the first integration of population genetic theory and QTL data to measure the historical influence of selection. I DENTIFYING which quantitative traits have been subject to strong selection and which have evolved under neutral or nearly neutral conditions is a challenging and important task for evolutionary biologists (Boake et al. 2002). To this end, biologists are devoting increased attention to the genetic basis and evolutionary causes of quantitative variation (Barton and Keightley 2002; Lai et al. 2007; Barton and De Vladar 2009). On one hand, substantial progress has been made in revealing the genetic architecture of quantitative traits (Zimmerman et al. 2000; Ashton et al. 2001; Mackay 2001; Gleason et al. 2002; Verhoeven et al. 2004; Mackay and Lyman 2005; Brem and Kruglyak 2005; Lai et al. 2007; Gleason et al. 2009). On the other hand, it has been difficult to connect the action of microevolutionary forces detected in studies of contemporary populations to their macroevolutionary effects (Grant and Grant 2002). Consequently, few Copyright © 2012 by the Genetics Society of America doi: 10.1534/genetics.111.137075 Manuscript received November 23, 2011; accepted for publication January 17, 2012 Supporting information is available online at http://www.genetics.org/content/ suppl/2012/01/31/genetics.111.137075.DC1. 1 Present address: Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford St., Cambridge, MA 02138. 2 Corresponding author: Department of Ecology and Evolutionary Biology, Yale University, Osborne Memorial Labs 226, 165 Prospect St., PO Box 208106, New Haven, CT 06520. E-mail: [email protected] attempts to detect natural selection currently exploit the growing body of knowledge about the sizes and directions of quantitative trait locus (QTL) effects (for counterexamples, see Rieseberg et al. 2002; Albertson et al. 2003; Lexer et al. 2005). Progress toward understanding the basis of quantitative genetic variation is likely to come from studying allelic variation at specific QTL (Barton and Keightley 2002). Identifying the genetic architecture of quantitative traits begins with mapping QTL to broad genomic regions and ends with the molecular definition of quantitative trait loci alleles. This high degree of resolution has been achieved, for instance, for some QTL in Drosophila (Zimmerman et al. 2000; Mackay 2001; Palsson and Gibson 2004; Palsson et al. 2005). QTL mapping may be coupled with further genetic dissection (e.g., fine-scale mapping, disequilibrium mapping, transgenic manipulation; Mackay 2001) to characterize the specific loci targeted by selection. Weinig et al. (2003) demonstrated that the QTL architecture behind herbivory tolerance was one of many loci of small effect and simultaneously elucidated locusspecific evolutionary constraints, demonstrating that linking molecular genetic tools to quantitative genetic analysis and field studies in ecologically relevant settings can clarify the role of specific loci in the evolution of quantitative traits. Genetics, Vol. 190, 1533–1545 April 2012 1533 These QTL studies not only generate maps from genotype to phenotype, but also yield information that may be applied to the study of phenotypic evolution (Erickson et al. 2004). Behind every genetically based phenotypic difference lies an evolutionary history. Despite the evident relevance of QTL data to the study of the evolutionary process, very little theory has been developed to link data from QTL studies to population genetic parameters via the methods of molecular evolution. This lack of theoretical tools has inhibited progress in explaining the process underlying the evolution of phenotypes that are genetically dissected in empirical QTL studies. While most tests for selection depend directly on molecular data (McDonald and Kreitman 1991; Depaulis et al. 2003; Schlenke and Begun 2003; Mousset et al. 2004; Schlenke and Begun 2004; Nurminsky et al. 2005; Schlenke and Begun 2005), Orr (1998a) presented two innovative attempts to integrate QTL data and population genetic theory into tests for selection: the QTL sign test and the QTL effect size test. These tests employ a null model of the evolutionary process that produces QTL. They have been applied multiple times to empirical data (Rieseberg et al. 2002; Albertson et al. 2003; Lexer et al. 2005; Orr 2010). However, the QTL sign test has been deemed substantially flawed by its high false-positive rate (Anderson and Slatkin 2003) and the QTL effect size test has been shown to provide very low power to detect selection (Anderson and Slatkin 2003). Moreover, the tests neither model selection explicitly nor account for the distribution of mutational effects, both of which are important to accurately model the “filtering” action of natural selection (Orr 1998b). Accordingly, understanding the evolutionary significance of QTL observations requires evaluation of both the origin and fate of heritable variation. In particular, the evolutionary forces that result in the genetic differences we observe must be estimated (Barton and Keightley 2002). Here we develop a test for a history of directional selection on quantitative traits and a method for estimating the strength of that selection. Using mutation accumulation (MA) data and the QTL effect distribution derived from a cross between two divergent populations, we infer the selective history of the trait. While existing tests (Orr 1998) only model neutral evolution and assume selection to be at work whenever data does not fit the model’s predictions under neutrality, we model selection explicitly and consider neutrality as one potential inference on a continuum. Such a flexible model is a key first step toward unraveling the complex link between molecular evolution and quantitative trait variation. Theory Overview Our framework for inference integrates information from mutation accumulation experiment (Figure 1, A and B), population genetic theory on selection (Figure 1, G–J), the consequent expected outcome of selection (Figure 1, K–N), 1534 D. P. Rice and J. P. Townsend and the empirically obtained QTL effect size distribution (Figure 1, O–R). In our schematic example, mutation accumulation is depicted as resulting in increased variance of the phenotype with no directional bias (Figure 1A) or, alternatively, as resulting in an increased variance in phenotype with a downward bias (Figure 1B). From phenotypic observations obtained along the time course of a mutation accumulation experiment (Figure 1, A and B), the distribution of mutational effects on phenotype can be inferred (Figure 1, C–F). Data as in Figure 1A yield an inferred distribution of mutational effects that is symmetrical around no change (Figure 1, C and D); in contrast, data as in Figure 1B yield asymmetry (Figure 1, E and F). We multiply the mutation effect distribution by the calculated probability of fixation as a function of the strength of selection corresponding to the phenotypic effect (Figure 1, G–J). For a neutral phenotype, fixation probabilities are equivalent across all phenotypes (Figure 1, G and I), whereas under selection, fixation probabilities depend upon the phenotype conferred (Figure 1, H and J). The product of the mutation effect distribution and the fixation probability distribution is the probability distribution of fixed phenotypic differences resulting from substitutions at quantitative trait loci (Figure 1, K–N). Sampling a number of substitutions from this distribution and summing their effects (e.g. Figure 1, O–R, outlines), we compare the cumulative effects with those of the observed QTL (Figure 1, O–R, bars). The accord between the sampled effects and the observed effects varies with the estimated mutation effect distribution and selective regime considered: the accord will be better when the level of selection is appropriate for generating fixed effects whose cumulative effects are close to those of the observed QTL. A likelihood function (Figure 1, S and T) quantifies the fit of the model and provides a test of the hypothesis of neutrality and an estimate of the strength of selection. In our schematic example, the MA and QTL data together could support an effectively neutral model (Figure 1, A, C, D, G, H, K, L, O, P, S). Alternatively, parameterizing with the same QTL data (Figure 1, O–R, bars), but with different MA data (Figure 1, A and B) could yield rejection of neutrality and thus, a conclusion that the quantitative trait has been subject to a history of selection. To develop this inferential framework, we first present a method for estimating the distribution of mutational effects with data from mutation accumulation experiments, which are a well-established approach to inferring the effects of mutations (Keightley, 1994; Garciadorado 1997; Shaw et al. 2002; Halligan and Keightley 2009). In these experiments, one or more lines of the organism under study are kept under unselective conditions, subjected to serial population size bottlenecks (Lopez and Lopez-Fanjul 1993). This enforced genetic drift facilitates the accumulation of mutations with minimal interference from selection. A quantitative trait may be assayed at the end of the experiment or at regular intervals to track its change over time in each population. Using the change in the trait means and Figure 1 A depiction of our approach toward characterizing the neutral or selected evolution of phenotype based on mutation accumulation and quantitative trait locus data for a phenotype. (A) A constructed example of phenotypic change due to unbiased mutation within seven mutation accumulation lines. (B) A constructed example of phenotypic change due to downwardly biased mutation within seven mutation accumulation lines. (C and D) The distribution of mutational effects inferred from A. (E and F) The distribution of mutational effects inferred from B. (G and I) A depiction of a probability of fixation of novel mutations that is invariant across potential phenotypic values. (H and J) A depiction of probability of fixation of novel mutations that increases with the phenotypic value. (K) The product of the functions depicted in C and G: in this case, a symmetrical distribution of fixed mutations with a zero mode. (L) The product of D and H: in this case, an asymmetrical distribution of fixed mutations with a positive mode and positive skewness. (M) The product of the functions depicted in I and M: in this case, a symmetrical distribution with a negative mode. (N) The product of the functions depicted in F and J: in this case, an asymmetrical distribution with a zero mode and positive skewness. (O) An example of a single sample of mutations (outlines) drawn from the distribution depicted in K, compared to the experimentally observed positive and negative QTL (shaded bars): in this case, the sample matched the QTL well. (P) An example of a single sample of mutations (outlines) drawn from the distribution depicted in L, compared to the experimentally observed positive and negative QTL (shaded bars): in this case, the sample matched the positive QTL fairly well, but matched the negative QTL poorly. (Q) An example of a single sample of mutations (outlines) drawn from the distribution in M, compared to the experimentally observed positive and negative QTL (shaded bars): in this case, the sample matched the positive QTL poorly, but matched the negative QTL fairly well. (R) An example of a single sample of mutations (outlines) drawn from the distribution in N, compared to the experimentally observed positive and negative QTL (shaded bars): in this case, the sample matched the positive and negative QTL well. The mutations drawn here “happen” to exactly match the QTL effects depicted in O, but in fact sampling from the fixed mutation distribution is highly stochastic and requires numerically integrating over many samples to accurately yield the likelihood that the fixed mutation distribution underlies the experimentally observed QTL. (S) A plot of the likelihood across a range of values of the strength of selection: in this case, samples drawn from the distribution in K are identified as fitting better than samples drawn from the distribution in L, indicating neutral evolution of the phenotype. Parts of the plot that lie above the dashed line correspond to a 95% CI. The plot at the y-axis corresponds to neutrality and falls within the CI, so we cannot reject a hypothesis of selective neutrality for this phenotype. (T) A plot of the likelihood across a range of values of the strength of selection: in this case, samples drawn from the distribution in N are identified as fitting better than samples drawn from the distribution in M, indicating evolution of the phenotype driven by natural selection. The plot at the y-axis corresponds to neutrality and falls outside the CI, so we can reject a hypothesis of selective neutrality for this phenotype and estimate that the strength of selection corresponds to the level of selection identified by the peak of the plot. the number of generations between observations, we estimate a Gaussian distribution of mutational effects (Turelli 1984; Sawyer et al. 2003; Jones et al. 2007). We then derive the phenotypic effect fixation function from components of classical population genetic theory (Kimura 1962). The product of this function with the mutational effect distribution provides a distribution of QTL effect sizes. Because these components are all probabilistic, comparison of their product to estimated QTL effect sizes and the standard errors thereof constitutes a likelihood-based QTL, Mutation Accumulation, and Selection 1535 approach to inferring the historical regime of genetic drift or natural selection. We apply this test to an example trait, sensory bristle number in the fruit fly Drosophila melanogaster. The genetic architecture of Drosophila sensory bristle number is well studied (Lai et al. 1994; Long et al. 1995, 1996; Dilda and Mackay, 2002; Robin et al. 2002; MacDonald et al. 2005; Mackay and Lyman 2005) and the selective history on this trait has been the subject of much debate (Mather 1941; Robertson 1955; Robertson 1967; Kearsey and Barnes 1970; Mackay et al. 1994,). Estimating the mutational effect distribution To estimate the distribution of the phenotypic effects of mutations on the trait, we analyze the time course of a mutation accumulation experiment. Although the appropriate formal distribution of the mutational effects for any given trait is unknown (Keightley and Lynch 2003), moments of the distribution of mutation effects on a trait can be inferred from the changes in the trait over time. For analytical tractability, we follow Turelli (1984), Sawyer et al. (2003), and Jones et al. (2007) and assume that the effects of mutations on a trait follow a Gaussian distribution 2 2 1 f ðx; m; sÞ ¼ pffiffiffiffiffiffiffiffiffiffiffiffi e2 ðx2mÞ =2s ; 2 2ps (1) with mean m and standard deviation s. This notation and other notation to be used in the derivation of the model are summarized in Table 1. The sum of k independent normally distributed random variables with the same mean and variance is normally distributed with mean km and variance ks2 . Assuming independence of mutational effects, the distribution of the total effect of k mutations is 2 1 2 f ðx; m; s; kÞ ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi e2 ðx2kmÞ =2ks : 2p ks2 (2) The distribution of the total effect of the mutations that accumulate in g generations is the sum of the distributions of the effect of k mutations, as k varies from zero to infinity, weighted by the probability that k mutations occur. If mutations are independent, the number of mutations affecting the trait that occur in g generations follows a Poisson distribution, Pðnumber of mutations ¼ kÞ ¼ ðguÞk 2gu e ; k! where u is the per generation rate of mutations affecting the trait. The sum over k of Equation 2 weighted by Equation 3 is ! N X 2 2 ðguÞk 2gu 1 pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi e2 ðx2kmÞ =2ks : hðx; g; u; m; sÞ ¼ e k! 2p ks2 k¼1 (4) Equation 4 is an infinite sum and cannot readily be calculated exactly. In practice, however, accurate approximations for small gu may be achieved by sums of only a small number of terms normalized by the total of all terms considered. We included terms so that, for a given gu, the total Poisson weight of the terms was at least 0.999. Inference of historical selection from QTL and MA data Inference of historical selection from QTL and MA data requires specification of the relationship between the strength of selection and the distribution of effect sizes of mutations that fix in the population. To calculate the probability that a mutation with a particular effect would fix, we apply the solutions of selection–diffusion equations. Assuming that these equations hold, the probability of a new mutation fixing in the population is given by Kimura (1962), PðfixationÞ ¼ 1 2 e2s ; 1 2 e22Ns x m s k u g s N c C n H ¼ (H1, H2,. . ., Hn) q ¼ (q1, q2,. . ., qn) e ¼ (e1, e2,. . ., en) 1536 D. P. Rice and J. P. Townsend (5) where s is the selection coefficient and N is the population size. Following Lande (1983) and Chevin and Hospital (2008), we assume that the selective value of the mutation is proportional to its quantitative effect on the trait. To avoid Table 1 Notation Parameter (3) Description Effect size of a mutation or substitution Mean effect size of spontaneous mutations Standard deviation of the effect size of spontaneous mutations Number of mutations occurring between MA measurements Per-generation mutation rate affecting the trait Number of generations between MA measurements Selection coefficient Effective population size Strength of selection: slope of the relationship between s and x Normalizing constant to make Equation 8 a probability density function The number of loci detected Vector of the number of substitutions responsible for the QTL, where hi is the number of substitutions responsible for QTL i Estimated effect sizes of the QTL Standard errors of the effect sizes of the QTL making assumptions about the optimal value of the trait, we assume that the fitness gradient did not change as the value of the trait changed. This assumption stands in contrast to some existing models of adaptive walks (Orr 1998; Martin and Lenormand 2006) that use a Gaussian fitness landscape. It is also a conservative assumption for our purposes: a walk toward a nearby optimum on a peaked fitness landscape will fix mutations of smaller effect than in our model, diminishing the signature of selection that our method is sensitive to. Additionally, we assume that adaptation proceeds by successive fixation of beneficial mutations (Atwood et al. 1951), as opposed to simultaneous selection at many loci. If many loci are selected simultaneously, the strength of selection on a particular locus will tend to decrease over time (Chevin and Hospital 2008), rather than remain constant. Hence, the assumption of successive fixation is also conservative. Furthermore, we assume that the mutation has no other effects on fitness beyond its effect on the trait in question. Thus, for a mutation with phenotypic effect x, s ¼ cx; (6) where c is a constant whose sign and magnitude determine the direction and strength of selection, respectively. A positive value of c indicates selection for an increase in the trait and a negative value of c indicates selection for a decrease in the trait. When c = 0, the trait is neutral (i.e., the selection coefficient for a mutation of any effect size is zero). Substituting Equation 6 into Equation 5 yields 8 1 2 e2cx > < ; c 6¼ 0 22Ncx ; PðfixationÞ ¼ 1 2 e > 1 : ; c¼0 2N measurement in each line. We also use an independent estimate of the per-generation mutation rate of the trait. To calculate probabilities of fixation of mutants drawn from the effect distribution, we also use the historical effective population size of the population in question. To construct a likelihood function for the parameters underlying the observed change in the quantitative trait, we use the effect size and standard error of QTL identified by crossing individuals from two divergent populations that differ in the trait value. Mutation effects distribution Moments of the distribution of mutation effects on a trait were inferred from the changes in the trait value over the course of a mutation accumulation experiment. Given the change in the trait means, the number of generations between observations, and an independent estimate of the per-generation mutation rate for the trait, a likelihood function was specified for the parameters m and s of the distribution of mutation effects as X ln hðxi ; gi ; u; m; sÞ; (9) Lðm; sÞ ¼ i where L(. . .) is the log likelihood, h(. . .) is from Equation 4, and xi and gi are the change in the mean value of the trait and number of generations between each observation. The values of m and s for which this function is maximized are the maximum-likelihood estimates of the parameters of the Gaussian distribution of mutational effect sizes. Estimating selection (7) which provides the probability that a mutation of size x will fix in the population. The probability distribution of the phenotypic effects of substitutions is determined by the distribution of mutational effects multiplied by the relative fixation probability of those effects. Accordingly, multiplying Equation 1 by Equation 7 yields the distribution of effect sizes of substitutions, given that they have fixed 1 2 e2cx 1 2 ðx2mÞ2 =2s2 pffiffiffiffiffiffiffiffiffiffiffiffi e (8) f ðxÞ ¼ C 1 2 e22Ncx 2ps2 To make f(x) a probability distribution, Equation 8 contains a normalization constant, C, chosen to ensure that the integral of f(x) over all x is equal to one. Methods Overview of the data required To infer a mutation effects distribution, we use the time course of a mutation accumulation experiment, specifically the change in trait value and number of generations between each We also applied a maximum-likelihood framework to estimate the strength of historical selection on the trait. The phenotypic effect of each QTL is the result of one or more underlying substitutions, so we maximize the joint likelihood of c and the vector H = (H1, H2, . . ., Hn), where Hi is the number of substitutions at the ith locus and can be any positive integer, and n is the number of loci. There is no explicit constraint on the number of mutations at each locus, nor on the directions of the underlying mutational effects, only the constraint of the specified total effect. Thus, our method can properly accommodate amalgamations of several QTL that are detected as single QTL by a QTL analysis (cf. Mackay 2001; Mackay and Lyman 2005; Studer and Doebley 2011), even if the cryptic QTL are in opposing directions, just as it accommodates multiple mutations of opposing direction. The likelihood of c and H given the QTL data are the product of the joint likelihood of c and H given each locus individually: lðc; H; q; eÞ ¼ n Y lðc; Hi ; qi ; ei Þ: (10) i Here q and e are the effects and standard errors of the observed QTL. The joint likelihood of c and H given a single locus is QTL, Mutation Accumulation, and Selection 1537 Z lðc; Hi ; qi ; ei Þ ¼ N 2N gðx; qi ; ei ÞFðx; c; Hi Þdx; (11) where g(. . .) is the probability density function of a Gaussian distribution with mean qi and standard deviation ei, and F(. . .) is the probability density function of the sum of Hi substitutions under strength of selection c. The latter is calculated by Hi convolutions of f(x) from Equation 8 with itself. The integral in Equation 11 has no closed-form solution and is difficult to evaluate by typical numerical approaches for most values of Hi. We effectively approximated the likelihood by Monte Carlo sampling from the distribution given by F(x) and calculating average value of g(x) evaluated for each sampled effect: + * 1 2 ðxj 2qi Þ2 =2ei 2 : lðc; Hi ; qi ; ei Þ g xj ; qi ; ei j ¼ pffiffiffiffiffiffiffiffiffiffiffiffi e 2pei 2 j (12) Angle brackets represent averaging over all Monte Carlo samples xj. As the number of Monte Carlo samples increases, the right-hand side of Equation 12 approaches the true likelihood. To calculate the likelihood of a given value of c, we fixed c at a value and evaluated the right-hand side of Equation 12 for a range of vectors of H. For each locus i, we started with Hi = 1 and then increased Hi until we found its unimodal maximum likelihood value. We took the likelihood of c to be the product over i of these likelihoods (Equation 10). We calculated the likelihood of a dense grid of values of c in this way to obtain ĉ, the maximum-likelihood value. To test the hypothesis of neutral evolution, we performed a likelihoodratio test with a null hypothesis of c = 0 and an alternate hypothesis of c = ĉ. Sources of data We applied our method to QTL and MA data for sensory bristle number in D. melanogaster. QTL effect size data were extracted from Dilda (2002), who had applied four different artificial selection regimes on bristle traits. Populations HST and LST were selected for high and low numbers of sternopleural bristles, respectively, while populations HAB and LAB were selected for high and low numbers of abdominal bristles. The QTL effects and standard errors from each population are presented in Figures 3A, 4A, 5A, and 6A. These QTL were assayed by crossing the artificially selected populations with a wild-type laboratory strain. Because the QTL for this trait were assayed using artificially selected strains rather than phenotypically divergent strains from wild populations, our example serves to illustrate the use of the test rather than to make an inference about the selective history of the trait in the wild. Mutation accumulation data for sternopleural bristle number and abdominal bristle number were extracted from 1538 D. P. Rice and J. P. Townsend Paxman (1957) and Lopez and Lopez-Fanjul (1993), respectively. Paxman (1957) propagated six lines of flies to accumulate mutations for 40 generations and measured the average number of sternopleural bristles in each line at 5 time points. The distribution of trait changes in the mean sternopleural bristle number of each line between each measurement is depicted in Figure 2A. Lopez and Lopez-Fanjul (1993) maintained 93 MA lines for 61 generations and measured the change in the number of abdominal bristles in each line at the end of the experiment. The distribution of trait changes is depicted in Figure 2B. The per-generation rate of mutation affecting sensory bristle number in Drosophila is estimated to be 0.03 (Mackay and Lyman 2005). Estimating the distribution of mutational effects To estimate the mean and standard deviation of the distribution of mutation effects m and s, we calculated their maximum-likelihood values for each type of bristle. For each bristle trait, we evaluated Equation 9, letting u = 0.03 (Mackay and Lyman 2005). We let xi and gi be the trait changes and numbers of generations between measurements for all measurements in all lines. Implementing a simple hill-climbing algorithm in MATLAB, we calculated the joint maximum-likelihood estimates for m and s (see supporting information, File S1). We estimated the standard deviations of our estimates by finding the values of m and s whose log likelihood was equal to the maximum log likelihood minus one (Hudson 1971). Estimating selection For each of the four QTL data sets, we calculated the likelihood of a range of values of c using an effective populations size N = 50 (Gurganus et al. 1999) and the maximum-likelihood estimates of m and s for the appropriate bristle trait. For each value of c and Hi, we generated 6.5 · 104 Monte Carlo samples from the distribution of substitution effects in MATLAB and calculated the likelihood according to Equation 12 (see File S2). To calculate the significance of the likelihood-ratio test, we computed the test statistic lðc ¼ 0Þ (13) D ¼ 2 2 log lðc ¼ ^cÞ D is approximately x2-distributed with 1 d.f. (Hudson 1971). It follows that the 95% confidence interval includes values of c with likelihoods within 2 log units of the maximum. False-positive rate To estimate the false-positive rate of the test, we simulated QTL data under a neutral model of evolution. For each of the four data sets, we found the maximum-likelihood value of H for c = 0. For the ith observed locus, we generated a neutral locus by summing Hi effects drawn from the distribution of mutation effects for the appropriate bristle trait. Figure 2 Frequency distribution of changes in bristle traits during mutation accumulation. (A) Change in sternopleural bristle number between measurements in a mutation accumulation experiment by Paxman (1957). Six lines of D. melanogaster were raised in unselective conditions in which they were subjected to population bottlenecks for 40 generations. The mean sternopleural bristle number was assayed five times over the course of the experiment. (B) Change in abdominal bristle number in a mutation accumulation experiment by Lopez and Lopez-Fanjul (1993). Ninety-three mutation-accumulation lines of D. melanogaster were maintained for 61 generations and then assayed for mean abdominal bristle number. Although it entailed extensive computation, we generated 100 sets of simulated neutral QTL for each data set. We applied the likelihood-ratio test for selection to these simulated data and recorded the proportion of neutral data sets for which the neutral hypothesis was rejected. 0.05. For abdominal bristles (Lopez and Lopez-Fanjul 1993), our estimate of m was 20.25 6 0.04 and our estimate of s was 0.34 6 0.05. We calculated the MLE ĉ for the four QTL data sets, in each case applying the corresponding mutation-effect distribution. Undetected QTL Selection in the HST population A well-documented bias in QTL studies is the failure to detect small-effect loci (Beavis 1998; Bost et al. 2001; Xu 2003). To assess whether undetected QTL might have influenced our estimate, we applied the analytical approach of Otto and Jones (2000) to estimate the number and average effect of undetected loci in each data set. We then reran our analysis including in each data set the estimated number of additional QTL, with their effect sizes following an exponential distribution with the estimated mean. Based on the nine QTL detected in the HST population (Figure 3A) and the mutation-effect distribution estimated for sternopleural bristle number (Figure 3B), the likelihoodratio test (x2 = 38.1, P , 0.001) rejects a neutral hypothesis for an increase in sternopleural bristle number. The MLE for c, ĉ, was 0.018, with a 95% confidence interval (CI) ranging from 0.013 to 0.022 (Figure 3C). None of the corresponding one hundred simulated neutral data sets based on the HST data yielded a false positive. Applying Otto and Jones (2000) to assess whether undetected QTL might have influenced our HST result yielded an estimate that four loci were not detected and that these loci had an average effect of 0.388. Including four simulated QTL following an exponential distribution with mean 0.388, the MLE ĉ remained 0.018 and the CI ranged from 0.013 to 0.022 (Figure 3D), the same result as in our main analysis. To assess the sensitivity of our test to the estimates of the mutation-effect distribution parameters, we perturbed m by 1 SD in each direction and applied the test. For m = 20.144, ĉ = 0.027 with a CI ranging from 0.021 to 0.031, while for m = 20.022, ĉ = 0.012 with a CI ranging from 0.007 to 0.016 (Figure 3E). Similarly perturbing s, we found that for s = 0.183, ĉ = 0.029, with a CI ranging from 0.025 to 0.034, and for s = 0.288, ĉ = 0.005, with a CI from 0.001 to 0.01 (Figure 3F). Because confidence intervals for ĉ did not include zero, rejection of the hypothesis of neutrality underlying the evolution of the HST QTL data are robust to moderate perturbations in m and s. Sensitivity to the mutation effect distribution To assess the sensitivity of our test to the estimates of the mutation-effect distribution parameters, we perturbed m by 1 SD in each direction and applied the test. We then similarly perturbed s and compared the resulting estimates and confidence intervals to our primary results. Power To estimate the power of the test with a diverse set of putatively realistic QTL distributions, we applied the test to all subsets of the QTL from each of the four populations, using the appropriate mutation-effect distribution parameters in each case. We examined how the proportion of times the test rejected neutrality varied with the number of loci included in the subset. Results The distribution of mutation effects We estimated the maximum-likelihood mean and standard deviation of the Gaussian distribution of mutational effects (Equation 9) on the basis of the MA data of each type of sensory bristle. For sternopleural sensory bristles (Paxman 1957), the maximum-likelihood estimate (MLE) of m was 20.08 6 0.06 (61 SD) and the MLE of s was 0.23 6 Selection in the LST population On the basis of the four QTL detected in the LST population (Figure 4A) and the mutation-effect distribution estimated for sternopleural bristle number (Figure 4B), a likelihoodratio test (x2 = 4.88, P = 0.027) rejects a neutral hypothesis for a decrease in sternopleural bristle number. In this case, QTL, Mutation Accumulation, and Selection 1539 Figure 3 Selection in the HST population. (A) The effects and standard errors of sternopleural bristle QTL detected in the HST population selected for high sternopleural bristle number. (B) The maximum-likelihood estimate of the distribution of mutational effects for sternopleural bristles. (C) Log likelihood vs. c, the strength of selection, given the HST QTL data and the maximum-likelihood distribution of mutational effects. The horizontal line indicates the likelihood threshold for a 95% confidence interval. (D) Log likelihood vs. c, given the HST QTL data plus four imputed unobserved loci. (E) Log likelihood vs. c, given the HST QTL data, with the estimate of m perturbed by adding 1 SD (crosses, dashed threshold line) and subtracting 1 SD (circles, solid threshold line). (F) Log likelihood vs. c given the HST QTL data, with the estimate of s perturbed by adding 1 SD (crosses, dashed threshold line) and by subtracting 1 SD (circles, solid threshold line). the log-likelihood curve does not have a maximum, but instead increases asymptotically as ĉ decreases (Figure 4C). Conservatively using the largest sampled likelihood as the maximum likelihood and applying the likelihood-ratio test, we found that ĉ , 20.005 with 95% confidence. None of the 100 simulated neutral data sets based on the LST data yielded a false positive. We estimated that four loci were not detected and that these loci had an average effect of 20.284. When we included these simulated QTL in the data, we again found ĉ , 20.005 (Figure 4D). Perturbing m, we found that for m = 20.144, ĉ , 0.007, and for m = 20.022, ĉ , 20.017 (Figure 4E). Perturbing s, we found that for s = 0.183, ĉ , 0.001, and for s = 0.288, ĉ , 20.007 (Figure 4F). Because confidence intervals for ĉ included zero, rejection of the 1540 D. P. Rice and J. P. Townsend Figure 4 Selection in the LST population. (A) The effects and standard errors of sternopleural bristle QTL detected in the LST population, selected for low sternopleural bristle number. (B) The maximum-ikelihood estimate of the distribution of mutational effects for sternopleural bristles. (C) Log likelihood vs. c, the strength of selection, given the LST QTL data and the maximum-likelihood distribution of mutational effects. The horizontal line indicates the likelihood threshold for a 95% confidence interval. (D) Log likelihood vs. c, given the LST QTL data plus four imputed unobserved loci. (E) Log likelihood vs. c, given the LST QTL data, with the estimate of m perturbed by adding 1 SD (crosses, dashed threshold line) and subtracting 1 SD (circles, solid threshold line). (F) Log likelihood vs. c given the LST QTL data, with the estimate of s perturbed by adding 1 SD (crosses, dashed threshold line) and by subtracting 1 SD (circles, solid threshold line). hypothesis of neutrality underlying the evolution of the LST QTL data were not robust to moderate perturbations in m and s. Selection in the HAB population On the basis of the three QTL detected in the HAB population (Figure 5A) and the mutation-effect distribution estimated for abdominal bristle number (Figure 5B), a likelihood-ratio test (x2 = 34.7, P , 0.001) rejects the neutral hypothesis for an increase in the number of abdominal bristles. As in the LST population, the log-likelihood curve does not have a maximum, instead increasing asymptotically as ĉ increases (Figure 5C). Conservatively using the largest observed likelihood as the maximum likelihood and applying the likelihood ratio test, we found that ĉ . 0.031 with 95% confidence. Only 1 of the 100 simulated neutral data sets based on the HAB data yielded a false positive. We estimated that five loci were not detected and that these loci had an average effect of 0.245. When we included these simulated QTL in the data, we found that ĉ . 0.037 (Figure 5D). Perturbing m, we found that for m = 20.294, ĉ .0.037, and for m = 20.215, ĉ . 0.026 (Figure 5E). Perturbing s, we found that for s = 0.30, ĉ . 0.034, and for s = 0.392, ĉ . 0.029 (Figure 5F). Because confidence intervals for ĉ did not include zero, rejection of the hypothesis of neutrality underlying the evolution of the HAB QTL data are robust to moderate perturbations of m and s. Selection in the LAB population On the basis of the three QTL detected in the LAB population (Figure 6A), and the mutation-effect distribution estimated for abdominal bristle number (Figure 6B), a likelihood-ratio test (x2 = 4.00, P = 0.046) rejects a neutral hypothesis for a decrease in abdominal bristle number (Figure 6C). Conservatively using the largest observed likelihood as the maximum likelihood and applying the likelihood-ratio test, we found that ĉ , 0 with 95% confidence. Only 1 of the 100 simulated neutral data sets based on the LAB data yielded a false positive. We estimated that 13 loci were not detected and that these loci had an average effect of 20.240. When we included these simulated QTL in the data, ĉ was 0 with a 95% CI ranging from 20.032 to 0.015 (Figure 6D). Perturbing m, we found that for m = 20.294, ĉ , 0.006, and for m = 20.215, ĉ , 20.004 (Figure 6E). Perturbing s, we found that for s = 0.30, ĉ , 0.003, and for s = 0.39, ĉ , 20.001 (Figure 6F). Because confidence intervals for ĉ included zero, rejection of the hypothesis of neutrality underlying the evolution of the LAB QTL data are not robust to perturbations in m and s. Power We examined the power of subsets of the observed QTL to yield a positive result. The proportion of cases in which neutrality could be rejected increased monotonically with the number of QTL in the subset (Figure 7). For single QTL and the observed mutation-effect distribution, the test rejected neutrality in only 43% of cases, while for data sets with six or more of these loci, the test always rejected neutrality. Discussion We have derived a new method for detecting the level of historical selection on quantitative traits. First, we showed how to estimate the parameters of a Gaussian distribution of mutational effects from the change in the trait over the course of a mutation-accumulation experiment. Then, we derived the distribution of the effect sizes of mutations that fix as a function of the regime of selection or neutrality applied to the trait. We demonstrated how to integrate these Figure 5 Selection in the HAB population. (A) The effects and standard errors of abdominal bristle QTL detected in the HAB population (selected for high abdominal bristle number. (B) The maximum-likelihood estimate of distribution of mutational effects for abdominal bristles. (C) Log likelihood vs. c, the strength of selection, given the HAB QTL data and the maximum-likelihood distribution of mutational effects. The horizontal line indicates the likelihood threshold for a 95% confidence interval. (D) Log likelihood vs. c, given the HAB QTL data plus five imputed unobserved loci. (E) Log likelihood vs. c given the HAB QTL data, with the estimate of m perturbed by adding 1 SD (crosses, dashed threshold line) and by subtracting 1 SD (circles, solid threshold line). (F) Log likelihood vs. c given the HAB QTL data, with the estimate of s perturbed by adding 1 SD (crosses, dashed threshold line) and by subtracting 1 SD (circles, solid threshold line). two results to yield a maximum-likelihood estimate of the strength of historical selection given a set of observed QTL effect sizes and the results of a mutation-accumulation experiment. Finally, we applied the test to a well-studied quantitative trait, detecting a history of artificial selection on four cases of sensory bristle numbers in D. melanogaster. The test we have derived is the first test for selection on phenotype that incorporates information about a trait’s genetic architecture as well as the phenotypic change induced by mutation. It is also the first test to include an explicit model of how neutrality and selection shape the loci that affect a trait. Finally, our test is the first that will allow biologists not only to distinguish between neutrally evolving traits and selected traits but also to estimate the range of QTL, Mutation Accumulation, and Selection 1541 Figure 7 Power vs. the number of loci detected. We applied the likelihood-ratio test to all subsets of the four QTL data sets, with the appropriate mutation-effect distribution. The y-axis represents the proportion of subsets with a given number of loci for which the test rejected the hypothesis of neutrality. Figure 6 Selection in the LAB population. (A) The effects and standard errors of abdominal bristle QTL detected in the LAB population. (B) The maximum-likelihood estimate of distribution of mutational effects for abdominal bristles. (C) Log likelihood vs. c, the strength of selection, given the LAB QTL data and the maximum-likelihood distribution of mutational effects. The horizontal line indicates the likelihood threshold for a 95% confidence interval. (D) Log likelihood vs. c, given the LAB QTL data plus 13 imputed unobserved loci. (E) Log likelihood vs. c given the LAB QTL data, with the estimate of m perturbed by adding 1 SD (crosses, dashed threshold line) and subtracting 1 SD (circles, solid threshold line). (F) Log likelihood vs. c given the LAB QTL data, with the estimate of s perturbed by adding 1 SD (crosses, dashed threshold line) and subtracting 1 SD (circles, solid threshold line). strengths of selection that are most likely to have produced the current phenotypic value. As such, it represents a theoretical and practical improvement on existing methods to detect selection on quantitative traits. Application of the test to sensory bristle number in D. melanogaster demonstrated its ability to detect a history of selection. Likelihood-ratio tests demonstrated sufficient power to reject neutrality in four out of four populations that had been subject to artificial selection. For the HST data, we were able to identify a MLE for a coefficient determining the strength of selection, as well as to describe its 95% CI. For the other three data sets, we were able to estimate upper or lower bounds for the strength of selection. Because the HST data contained the largest number of QTL 1542 D. P. Rice and J. P. Townsend (9), this difference in precision is expected. The HST data set was also the only one that contained some QTL with effects opposite to the actual and inferred direction of selection. Similarly, by applying the test to subsets of the data, we demonstrated that the test’s power increased with an increased number of QTL. We also assessed the false-positive rate by applying the test to 100 simulated neutral data sets based on each set of real QTL. In no case did more than 1 of the 100 simulated data sets yield a false positive. We assessed the sensitivity of our test to the estimates of the mutation-effect distribution parameters by perturbing m and s by 1 SD in each direction and reapplying the test. For the HST and HAB data, rejection of neutrality was robust to these perturbations. However, for the LST data and the LAB data, the result was no longer significant in the case of a more negative m and in the case of a smaller s. This loss of significance speaks to a perhaps unsurprisingly higher sensitivity of the test in cases in which selection and mutation are acting in opposite directions (as they are in HST and HAB) than in cases in which selection and mutation are acting in the same direction. Our low false-positive rate across both cases in which selection and mutation are acting in concert or in conflict indicates that this difference in sensitivity is in accord with the actual inferential difficulty and not a problem of a biased test construction. A well-documented bias in QTL studies is the tendency to fail to detect small-effect loci (Beavis 1998; Bost et al. 2001; Xu 2003). To assess whether undetected QTL might have influenced our estimates, we applied the analytical approach of Otto and Jones (2000) to estimate the number and effect of undetected QTL and included simulated loci following these parameters in our analysis. For the three data sets in which the test rejected neutrality, this modification did not change the significance of the result and had little effect on the estimates of c. However, when we included the simulated loci in the LAB data set, where we had already bordered on insufficient power to detect the action of selection, the MLE for c was shifted to 0. For this data set in particular, the analysis indicated that there were many more undetected loci (13) than observed QTL (5). Furthermore, the simulated loci had a mean of 20.240, almost identical to the estimate of m for abdominal bristles, making them effectively neutral for the purposes of our test. Therefore, application of Otto and Jones (2000) imputation can have a considerable impact when the expected number of unobserved loci is much larger than the number of detected QTL. Wider application of this test awaits the availability of more MA and QTL data. Unfortunately, there are few traits for which both MA and QTL analysis have been performed. Practical and ethical considerations limit the range of species for which mutation-accumulation experiments may be performed. On the other hand, numerous examples of applications of Orr’s 1998 test to empirical data (Rieseberg et al. 2002; Albertson et al. 2003; Lexer et al. 2005; Orr 2010) demonstrate that tests for selection on phenotypic traits are in demand. Although both types of experiments can be labor intensive and time consuming, there are many cases in which either MA or QTL data are already available, and only the corresponding partner experiment need be completed to apply this test for historical selection. Several expansions of our model would improve its scope and precision. First, the model presented here implicitly rather than explicitly incorporates time. Because the generation and fixation of mutations take time, the amount of time a trait has had to evolve is an important factor in how many substitutions are responsible for each QTL effect. Explicitly incorporating time into the model should provide more information for determining in which vector H is most likely. Since H affects the likelihood of c, this additional discriminatory power would yield more precise estimates of the strength of selection. However, explicit rather than implicit incorporation of time in our framework is complex. Our test assumes that the probability of fixation of a new mutation follows Equation 5, which has been derived from various models of evolution (Kimura 1962; Moran 1960; Gillespie 1974), and is a fairly general result, accommodating a variety of evolutionary scenarios for which the effective population size captures all relevant complications. Examples include cyclically varying population size (Otto and Whitlock 1997), population subdivision with symmetric migration (Pollack 1966), and deleterious mutations at completely linked loci (Charlesworth 1994). Probability of fixation results that could be used in place of Equation 5 under more diverse evolutionary scenarios are reviewed by Patwa and Wahl (2008). Because we model the sequential mutation and fixation of alleles at different loci, our test applies to QTL identified by crossing related populations and estimates the selective regime responsible for the trait difference between the populations. The method is not applicable, as currently implemented, to QTL identified by crossing individuals from the same population. However, since selection should shape the standing variation within a population, a related test that applies to QTL variation in a single population is conceivable. Accordingly, one potential weakness of the test is that our model assumes that QTL between the divergent populations are constructed from novel mutations, without allowing some proportion of fixed differences to arise from standing variation in the ancestral population. QTL arising from standing variation fix faster and are likely to be of smaller effect size than QTL arising from novel mutations (Barrett and Schluter 2008). Consequently, our model assumption that all variation arises as new mutation is conservative for the purpose of our test: within our framework, smaller QTL effect sizes will be more consistent with neutrality. Finally, distributions of mutational effect other than the Gaussian distribution used here should be explored. Implementing the model with other distributions would allow assessment of its sensitivity to this assumption and provide flexibility to its use as new information comes to light about the distributions of mutational effects found in nature. The central challenge in this endeavor is that other distributions lack the convenient property of the normal distribution that the sum of normally distributed random variables is also normally distributed. As currently implemented, our approach relies on this fact for tractability (Equation 2). As a theoretical result, this test for selection on quantitative traits is important because it bridges population genetic theory and genetic architecture as revealed by QTL. Furthermore, the test for selection outlined here provides an improvement over currently available methods for determining the regime of historical selection or neutrality that generated observed quantitative differences in phenotype. Results of the test applied to four data sets known to be the result of artificial selection for sensory bristle number in D. melanogaster are highly encouraging, because they indicate that the test has sufficient power to be used on moderately sized QTL data sets and also that the false-positive rate is very low. Applying the test to QTL data generated by crossing two individuals representative of diverged wild populations will yield conclusions about the history of selection responsible for the observed trait differences. Finally, our model provides a novel framework that holds promise for expansion in future work to incorporate such factors as time, varying probabilities of fixation, and diverse shapes of the distribution of mutational effects. Acknowledgments We thank Trudy Mackay and Carlos Lopez-Fanjul for graciously and helpfully digging deep in their files to provide key details of their published experiments. DPR was partially funded while performing this work by a Yale College Dean’s Research Fellowship, a Yale-Howard Hughes Medical Institute Future Scientist Summer Fellowship, and a National Science Foundation Graduate Research Fellowship DGE-1144152. The final computations were run on the Odyssey cluster supported by the Faculty of Arts and Sciences Science Division Research Computing Group at Harvard University. QTL, Mutation Accumulation, and Selection 1543 Literature Cited Albertson, R., J. Streelman, and T. Kocher, 2003 Genetic basis of adaptive shape differences in the cichlid head. J. Hered. 94: 291–301. Anderson, E., and M. Slatkin, 2003 Orr’s quantitative trait loci sign test under conditions of trait ascertainment. Genetics 165: 445–446. Ashton, K., A. Wagoner, R. Carrillo, and G. Gibson, 2001 Quantitative trait loci for the monoamine-related traits heart rate and headless behavior in Drosophila melanogaster. Genetics 157: 283–294. Atwood, K. C., L. K. Schneider, and F. J. Ryan, 1951 Periodic selection in Escherichia coli. Proc. Natl. Acad. Sci. USA 37: 146–155. Barrett, R. D. H., and D. Schluter, 2008 Adaptation from standing genetic variation. Trends Ecol. Evol. 23: 38–44. Barton, N., and H. De Vladar, 2009 Statistical mechanics and the evolution of polygenic quantitative traits. Genetics 181: 997–1011. Barton, N., and P. Keightley, 2002 Understanding quantitative genetic variation. Nat. Rev. Genet. 3: 11–21. Beavis, W. D., 1998 QTL analyses: power, precision and accuracy, pp. 145–162 in Molecular Dissection of Complex Traits, edited by A. H. Paterson. CRC Press, Boca Raton, FL. Boake, C., S. Arnold, F. Breden, L. Meffert, M. Ritchie et al., 2002 Genetic tools for studying adaptation and the evolution of behavior. Am. Nat. 160: S143–S159. Bost, B., D. de Vienne, F. Hospital, L. Moreau, and C. Dillmann, 2001 Genetic and nongenetic bases for the L-shaped distribution of quantitative trait loci effects. Genetics 157: 1773–1787. Brem, R. B., and L. Kruglyak, 2005 The landscape of genetic complexity across 5,700 gene expression traits in yeast. Proc. Natl. Acad. Sci. USA 102: 1572–1577. Charlesworth, B., 1994 The effect of background selection against deleterious mutations on weakly selected, linked variants. Genet. Res. 63: 213–227. Chevin, L.-M., and F. Hospital, 2008 Selective sweep at a quantitative trait locus in the presence of background genetic variation. Genetics 180: 1645–1660. Depaulis, F., S. Mousset, and M. Veuille, 2003 Power of neutrality tests to detect bottlenecks and hitchhiking. J. Mol. Evol. 57 (Suppl. 1): S190–S200. Dilda, C., 2002 The genetic architecture of Drosophila sensory bristle number. Ph.D. dissertation, North Carolina State University, Raleigh, NC. Available from http://www.proquest.com, publication number AAT 3346505, accessed November 22, 2011. Dilda, C., and T. Mackay, 2002 The genetic architecture of Drosophila sensory bristle number. Genetics 162: 1655–1674. Erickson, D., C. Fenster, H. Stenoien, and D. Price, 2004 Quantitative trait locus analyses and the study of evolutionary process. Mol. Ecol. 13: 2505–2522. Garciadorado, A., 1997 The rate and effects distribution of viable mutation in Drosophila: minimum distance estimation. Evolution 51: 1130–1139. Gillespie, J. H., 1974 Natural selection for within-generation variance in offspring number. Genetics 76: 601–606. Gleason, J., S. Nuzhdin, and M. Ritchie, 2002 Quantitative trait loci affecting a courtship signal in Drosophila melanogaster. Heredity 89: 1–6. Gleason, J., R. James, C. Wicker-Thomas, and M. Ritchie, 2009 Identification of quantitative trait loci function through analysis of multiple cuticular hydrocarbons differing between Drosophila simulans and Drosophila sechellia females. Heredity 103: 416–424. Grant, P., and B. Grant, 2002 Unpredictable evolution in a 30year study of Darwin’s finches. Science 296: 707–711. 1544 D. P. Rice and J. P. Townsend Gurganus, M. C., S. V. Nuzhdin, J. W. Leips, and T. F. C. Mackay, 1999 High-resolution mapping of quantitative trait loci for sternopleural bristle number in Drosophila melanogaster. Genetics 152: 1585–1604. Halligan, D. L., and P. D. Keightley, 2009 Spontaneous mutation accumulation studies in evolutionary genetics. Annu. Rev. Ecol. Evol. Syst. 40: 151–172. Hudson, D. J., 1971 Interval Estimation from the likelihood function. J. R. Statist. Soc. B Methodol. 33: 256–262. Jones, A., S. Arnold, and R. Burger, 2007 The mutation matrix and the evolution of evolvability. Evolution 61: 727–745. Kearsey, M. J., and B. W. Barnes, 1970 Variation for metrical characters in Drosophila populations. 2. Natural selection. Heredity 25: 11. Keightley, P. D., 1994 The distribution of mutation effects on viability in Drosophila melanogaster. Genetics 138: 1315–1322. Keightley, P., and M. Lynch, 2003 Toward a realistic model of mutations affecting fitness. Evolution 57: 683–685. Kimura, M., 1962 On probability of fixation of mutant genes in a population. Genetics 47: 713. Kimura, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, New York. Lai, C., R. Lyman, A. Long, C. Langley, and T. Mackay, 1994 Naturally-occuring variation in bristle number and DNA polymorphisms at the scabrous locus of Drosophila melanogaster. Science 266: 1697–1702. Lai, C., J. Leips, W. Zou, J. Roberts, K. Wollenberg et al., 2007 Speed-mapping quantitative trait loci using microarrays. Nat. Methods 4: 839–841. Lande, R., 1983 The response to selection on major and minor mutations affecting a metrical trait. Heredity 50: 47–65. Lexer, C., D. Rosenthal, O. Raymond, L. Donovan, and L. Rieseberg, 2005 Genetics of species differences in the wild annual sunflowers, Helianthus annuus and H. petiolaris. Genetics 169: 2225–2239. Long, A., S. Mullaney, L. Reid, J. Fry, C. Langley et al., 1995 Highresolution mapping of genetic factors affecting abdominal bristle number in Drosophila melanogster. Genetics 139: 1273–1291. Long, A., S. Mullaney, T. Mackay, and C. Langley, 1996 Genetic interactions between naturally occurring alleles at quantitative trait loci and mutant alleles at candidate loci affecting bristle number in Drosophila melanogaster. Genetics 144: 1497–1510. Lopez, M., and C. Lopezfanjul, 1993 Spontaneous mutation for a quantitative trait in Drosophila melanogaster. 2. Distribution of mutant effects on the trait and fitness. Genet. Res. 132: 117– 126. MacDonald, S., T. Pastinen, and A. Long, 2005 The effect of polymorphisms in the Enhancer of split gene complex on bristle number variation in a large wild-caught cohort of Drosophila melanogaster. Genetics 171: 1741–1756. Mackay, T., 2001 Quantitative trait loci in Drosophila. Nat. Rev. Genet. 2: 11–20. Mackay, T., and R Lyman, 2005 Drosophila bristles and the nature of quantitative genetic variation. Philos. Trans. R. Soc. B Biol. Sci. 360: 1513–1527. Mackay, T. F. C., J. D. Fry, R. F. Lyman, and S. V. Nuzhdin, 1994 Polygenic mutations in Drosophila melanogaster: estimates from response to selection of inbred strains. Genetics 136: 937–951. Martin, G., and T. Lenormand, 2006 The fitness effects of mutations across environments: a survey in light of fitness landscape models. Evolution 60: 2413–2427. Mather, K., 1941 Variation and selection of polygenic characters. J. Genet. London 41: 159–193. McDonald, J., and M. Kreitman, 1991 Adaptive protein evolution at the adh locus in Drosophila. Nature 351: 652–654. Moran, P. A. P., 1960 The survival of a mutant gene under selection II. J. Aust. Math. Soc. 1: 485–491. Mousset, S., N. Derome, and M. Veuille, 2004 A test of neutrality and constant population size based on the mismatch distribution. Mol. Biol. Evol. 21: 724–731. Nurminsky, D., F. Depaulis, S. Mousset, and M. Veuille, 2005 Detecting Selective Sweeps with Haplotype Tests: Selective Sweep. Springer, New York. Orr, H., 1998a Testing natural selection vs. genetic drift in phenotypic evolution using quantitative trait locus data. Genetics 149: 2099–2104. Orr, H. A., 1998b The population genetics of adaptation: the distribution of factors fixed during adaptive evolution. Evolution 52: 935–949. Orr, H. A., 2010 The population genetics of beneficial mutations. Philos. Trans. R. Soc. B. Biol. Sci. 365: 1195–1201. Otto, S. P., and C. D. Jones, 2000 Detecting the undetected: estimating the total number of loci underlying a quantitative trait. Genetics 156: 2093–2107. Otto, S. P., and M. C. Whitlock, 1997 The probability of fixation in populations of changing size. Genetics 146: 723–733. Palsson, A., and G. Gibson, 2004 Association between nudeotide variation in Egfr and wing shape in Drosophila melanogaster. Genetics 167: 1187–1198. Palsson, A., J. Dodgson, I. Dworkin, and G. Gibson, 2005 Tests for the replication of an association between Egfr and natural variation in Drosophila melanogaster wing morphology. BMC Genet. 6: 44. Patwa, Z., and L. M. Wahl, 2008 The fixation probability of beneficial mutations. J. R. Soc. Interface 5: 1279–1289. Paxman, G. J., 1957 A study of spontaneous mutation in Drosophila Melanogaster. Genetica 29: 39–57. Pollak, E., 1966 On the survival of a gene in a subdivided population. J. Appl. Probab. 3: 142–195. Rieseberg, L., A. Widmer, A. Arntz, and J. Burke, 2002. Directional selection is the primary cause of phenotypic diversification. Proc. Natl. Acad. Sci. USA 99: 12242–12245. Robertson, A., 1955 Selection in animals: synthesis. Cold Spring Harb. Symp. Quant. Biol. 20: 225–229. Robertson, A., 1967 The nature of quantitative genetic variation Drosophila melanogaster. Proceedings of the Mendel Centennial Symposium of the Genetics Society of Heritage from Mendel, September 7–11, 1965, Fort Collins, CO, pp. 265–280. Robin, C., R. Lyman, A. Long, C. Langley, and T. Mackay, 2002 hairy: a quantitative trait locus for Drosophila sensory bristle number. Genetics 162: 155–164. Sawyer, S., R. Kulathinal, C. Bustamante, and D. Hartl, 2003 Bayesian analysis suggests that most amino acid replacements in Drosophila are driven by positive selection. J. Mol. Evol. 57 (Suppl. 1): S154–S164. Schlenke, T., and D. Begun, 2003 Natural selection drives drosophila immune system evolution. Genetics 164: 1471–1480. Schlenke, T., and D. Begun, 2004 Strong selective sweep associated with a transposon insertion in Drosophila simulans. Proc. Natl. Acad. Sci. USA 101: 1626–1631. Schlenke, T., and D. Begun, 2005 Linkage disequilibrium and recent selection at three immunity receptor loci in Drosophila simulans. Genetics 169: 2013–2022. Shaw, F. H., C. J. Geyer, and R. G. Shaw, 2002 A comprehensive model of mutations affecting fitness and inferences for Arabidopsis thaliana. Evolution 56: 453–463. Studer, A., and J. F. Doebley, 2011 Do large effect QTL fractionate?: a case study at the maize domestication QTL teosinte branched1. Genetics 188: 673–681. Turelli, M., 1984 Heritable genetic variation via mutation-selection balance: Lerch’s zeta meets the abdominal bristle. Theor. Popul. Biol. 25: 138–193. Verhoeven, K., T. Vanhala, A. Biere, E. Nevo, and J. Van Damme, 2004 The genetic basis of adaptive population differentiation: a quantitative trait locus analysis of fitness traits in two wild barley populations from contrasting habitats. Evolution 58(2): 270–283. Weinig, C., L. Dorn, N. Kane, Z. German, S. Hahdorsdottir et al., 2003 Heterogeneous selection at specific loci in natural environments in Arabidopsis thaliana. Genetics 165: 321–329. Xu, S., 2003 Theoretical basis of the Beavis effect. Genetics 165: 2259–2268. Zimmerman, E., A. Palsson, and G. Gibson, 2000 Quantitative trait loci affecting components of wing shape in Drosophila melanogaster. Genetics 155: 671–683. Communicating editor: C. D. Jones QTL, Mutation Accumulation, and Selection 1545 GENETICS Supporting Information http://www.genetics.org/content/suppl/2012/01/31/genetics.111.137075.DC1 A Test for Selection Employing Quantitative Trait Locus and Mutation Accumulation Data Daniel P. Rice and Jeffrey P. Townsend Copyright © 2012 by the Genetics Society of America DOI: 10.1534/genetics.111.137075 Supporting Files File S1: Matlab Code 1 (Estimating mu and sigma) File S2: Matlab Code 2 (Calculating likelihood) Files S1 and S2 are available for download at http://www.genetics.org/content/suppl/2012/01/31/genetics.111.137075.DC1. 2 SI D. P. Rice and J. P. Townsend
© Copyright 2026 Paperzz