Genetic Differentiation Among Oregon Lake Populations of the Daphnia pulex Species Complex D. J. Straughan and N. Lehman Gene flow among invertebrate populations inhabiting bodies of nonflowing freshwater such as ponds or lakes must at some stage involve transport across habitat unsuitable for adult stages. Consequently the potential for interpopulational differentiation is high in these species, yet empirical studies of lake populations of Cladocerans such as Daphnia have failed to reveal high levels of genetic distinctiveness among populations and have led to much speculation about how these populations exchange genes and remain cohesive evolutionary units. In this study we surveyed 42 Oregon lake populations of Daphnia from the D. pulex species complex for genetic variation within the mitochondrial DNA control region. We have used this data to test the relative abilities of various ecological factors to explain the observed patterns in genetic differentiation among lakes. Despite limited genetic variation detected among our samples—11 very similar RFLP-defined mtDNA genotypes from 388 individuals—analyses of nucleotide variance using analogs to Wright’s F statistics indicate that when multilake populations are defined in terms of the river drainage basin to which they belong, strong and significant amounts of among-population genetic variation can be detected at this locus (FST estimates between 0.5 and 0.6). In contrast, we fail to detect consistent significant amongpopulation variation when populations are defined on the basis of regional physical geography, bird migratory flyways, or lake trophic status. The manner in which the data are compiled, that is, whether RFLPs or nucleotide sequences are used, has little effect on the overall conclusions, yet it is clear that nucleotide sequence data would lower the standard errors of FST estimates. We propose that periodic widescale flooding during the late Pleistocene may be an important mechanism to homogenize genetic differences among lake Daphnia continent-wide south of the southern-most extent of Pleistocene glaciation. From the Department of Biological Sciences, California State University, Long Beach, Long Beach, California (Straughan) and the Department of Biological Sciences, University at Albany, SUNY, 1400 Washington Ave., Albany, NY 12222 ( Lehman). We thank D. Decker for valuable discussions and help in all phases of this project; M. Joshi, S. Smith, D. Fera, and D. Tieman for technical assistance; and J. Hicks, M. Pfrender, and S. Straughan for help in obtaining lake samples. This work was supported by California State University, Long Beach and the University at Albany, SUNY. Address correspondence to Niles Lehman at the address above or e-mail: [email protected]. q 2000 The American Genetic Association 91:8–17 8 The freshwater microcrustacean Daphnia (water fleas) is a common and abundant inhabitant of North American lakes and freshwater ponds. Although population densities in these lentic environments can reach very high values, free-swimming Daphnia are rare or absent from most lotic environments such as rivers and streams. Consequently the mechanisms of gene flow among lake or pond populations, and invasion of newly created habitats are largely unknown. It has often been postulated that the desiccation-resistant resting stage of Daphnia, hard dizygotic encasings known as ephippia, can be transported from waterbody to waterbody either by wind, moving water, or on the feet of migratory birds such as waterfowl (Crease et al. 1997; Dodson and Frey 1991; Talling 1951; Weider et al. 1996). Yet many hypotheses concerning gene flow among populations remain untested with highresolution molecular studies. The waterbodies of Oregon provide a perfect milieu in which to examine proposals concerning the population-genetic (and phylogenetic) patterns of Daphnia. Although the state may appear to be an arbitrary political construction, it is actually well defined geographically. It is bounded on the west by the Pacific Ocean. It is bounded on the north and east in large part by the Columbia and Snake Rivers, which are major hydrologic features of the Pacific Northwest. Its southern boundary is less well defined, but does include the Siskiyou mountain range, which partitions the weather patterns and flora sharply, with wetter forms to the north and drier forms to the south, in California. Within the boundaries of the state lie an extreme range of geologic features and ecotones. High alpine, marsh, fertile low- land plains, sand dune, volcanic flow channel, glacial valley, high desert plain, low desert basin, and subalpine forest habitats are all present and abundantly represented. Within the state are some of the driest as well as some of the wettest places in the United States. Accordingly, Oregon can be divided into 10 distinct geomorphic regions ( Loy et al. 1976), each of which contains many permanent water bodies ( lakes and reservoirs) and numerous temporary ponds. The lakes that are found in Oregon have been formed by a wide variety of processes, from man-made (i.e., artificial) reservoirs, to tectonic basins, to lakes formed by landslides and river activity (Johnson et al. 1985). In addition to these geomorphic regions, Oregon has many rivers, and 18 major river drainage basins can be delineated (Johnson et al. 1985). Migratory flyways of birds tend to transect this region in a north/south fashion following general geographic features ( Evanich 1990). The boundaries of the geomorphic regions, river drainage basins, and migratory flyways as defined by Evanich (1990) and Johnson et al. (1985) are indicated in Figure 1. The control region of the mitochondrial DNA (mtDNA) genome has become the locus of preference for a wide range of population- and species-level evolutionary studies in animals. The rapid rate of nucleotide substitution in this region, combined with desirable characteristics of animal mtDNA in general ( high copy number in the cell, uniparental inheritance, lack of recombination, strong conservation of gene order, and amino acid sequence over many taxa) has made it amenable to investigations that require a significant amount of polymorphism unconfounded by allelic heterogeneity (Moritz et al. 1987). Availability of the polymerase chain reaction (PCR) with ‘‘universal’’ oligonucleotide primers anchored in flanking RNA regions that are highly conserved has allowed many investigators to generate sequence and/or RFLP data quickly with very little prior knowledge of the target species’ evolutionary history ( Hillis et al. 1996). And despite potential pitfalls of mtDNA analysis such as interspecific introgression (e.g., Lehman et al. 1991, Spolsky and Uzzell 1984), nuclear integration (e.g., Zhang and Hewitt 1996), occasional heteroplasmy (e.g., Bermingham et al. 1986; Rand and Harrison 1989 and references therein), and lack of concordance of mtDNA gene trees with species trees (e.g., Avise 1989; Takahata 1989), the control region promises to remain an important Figure 1. Population delineations in this study. Dots indicate the cities of Portland, Eugene, and Medford, from north (top) to south ( bottom). (A) The ten geomorphic regions of Oregon ( Loy et al. 1976): CR 5 Coast Range, KL 5 Klamath, WM 5 Willamette, WC 5 Western Cascades, HC 5 High Cascades, DU 5 Deschutes-Umatilla, BM 5 Blue Mountains, HL 5 High Lava Plains, OW 5 Owyhee, BR 5 Basin and Range. (B) The eighteen major river drainage basins in Oregon (Johnson et al. 1985): NC 5 North Coast, MC 5 Mid Coast, SC 5 South Coast, WM 5 Willamette, UM 5 Umpqua, RG 5 Rogue, SN 5 Sandy, HD 5 Hood, DE 5 Deschutes, KL 5 Klamath, JD 5 John Day, GS 5 Goose and Summer, UT 5 Umatilla, GR 5 Grande Range, PO 5 Powder, MR 5 Malheur River, ML 5 Malheur Lake, OW 5 Owyhee. The shaded region drains into California. (C) The four north-south bird migratory flyways ( Evanich 1990): CO 5 Coast, VA 5 Valley, CE 5 Central, EA 5 East. source of fine-scale evolutionary information for many laboratories. The control region encompasses less than 5% of most animal mtDNA genomes ( Brown 1985) and represents the predominant fraction of untranslated mtDNA in multicellular animals. Thus, in contrast to the balance of the mtDNA, which is partially subject to strong stabilizing selection imposed by the critical nature of the genes that are encoded therein, the control region is relatively unconstrained in terms of nucleotide sequence or length. A few promoter-like elements, such as the origin of replication and the D-loop, are known to be embedded in the control region, but even these show poor phylogenetic conservation even among related taxa. Surveys of a diversity of species have shown that the control region accumulates mutations at very high, but variable, rates ranging from 1% to 20% per million years (Stanley et al. 1996) and that small insertions and deletions (‘‘indels’’) are common mutational events. Consequently the control region varies greatly in total size among animals and contributes greatly to overall mtDNA length variation in animal taxa (Moritz et al. 1987). Previous analyses of mtDNA variation in the Daphnia pulex species complex (Colbourne et al. 1998; Crease et al. 1997; Dufresne and Hebert 1997; Lehman et al. 1995; Van Raay and Crease 1994; Weider and Hobaek 1997; Weider et al. 1996) suggest that many Daphnia ‘‘species’’ may have polyphyletic or paraphyletic origins of their mtDNA lineages. Accordingly, few consistent phylogeographic patterns have emerged that would prove useful in the study of gene flow across large physical distances (Crease et al. 1997). In particular, lake populations of members of the pulex complex, often classified as a distinct species D. pulicaria, have exhibited poorly differentiated populations at many geographic scales, as assayed by either allozyme electrophoresis (Cerny and Hebert 1993) or by total mtDNA genome RFLP analysis (Crease et al. 1997). The purpose of the current study is twofold. The first is to examine whether the mtDNA control region has utility in a fine-scale examination of population-genetic variation for testing of specific gene-flow mechanism hypotheses among Oregon lake populations. Specifically, the significance of among-population variation in mtDNA genotypes will be estimated when populations are defined on the basis of river drainage basin, geomorphic region, bird migratory flyway, or lake trophic status. The second is to compare various levels of genetic resolution for their efficacies in such hypotheses testing. Specifically, it will be examined whether different conclusions are reached depending on the manner in which the genetic data are obtained: RFLP data, direct nucleotide sequence data, or a hybrid of the two. Materials and Methods Sample Collection Daphnia samples were collected in the summers of 1996 and 1997. In total, 90 Oregon lakes were successfully sampled Straughan and Lehman • Daphnia Population Genetics 9 with the goal of an even geographic distribution about the state. Eighty-four of these lakes were sampled in 1996, eight of which were resampled in 1997 along with six that had not been sampled previously. Sampling was performed so as to cover all defined geomorphic regions and river drainage basins ( Figure 1), although lake accessibility did restrict optimal sampling uniformity. At the time of sampling, all collected Daphnia were treated as though they were from the pulex complex; no samples were excluded from subsequent genetic analysis despite morphological variation observed within and among lakes. Samples were collected using a small-mouthed plankton net ( BioQuip, Gardena, CA) either from a kayak or from shore. In general, sampling was attempted from the deepest regions of the lake, following the bathymetric maps of Johnson et al. (1985). In the majority of cases sampling was performed from only one location, but each involved multiple net pulls across several meters of lake depth so that individuals were unlikely to be clutchmates. Daphnia were concentrated on mesh screens and preserved in 70% ethanol at 48C until genetic analysis. DNA Extraction, Amplification, and Sequencing Genomic DNA was liberated from preserved Daphnia using the Chelex method (Morin et al. 1993; Walsh et al. 1991). Single individuals were placed in 600 ml microcentrifuge tubes containing 300 ml 10% Chelex 100 resin ( BioRad Laboratories) in ultrapure water. Lid locks were secured to the microcentrifuge tubes and the tubes were autoclaved at 1218C at 10 psi for exactly 20 min. Upon completion of the sterilization cycle, the autoclave chamber was fast exhausted and the microcentrifuge tubes were placed on wet ice. These samples were stored indefinitely (up to 1 year) at 48C before amplification. Immediately prior to amplification via the PCR (Mullis et al. 1986), the tubes were vortexed gently and quickly spun in a microcentrifuge. To amplify high-quality DNA from the mtDNA control region, a nested PCR approach was taken. Initial amplification was achieved using the 12S primer (59TAACCGCGACGGCTGGCAC-39), which anneals in the 12S rRNA gene immediately adjacent to the control region, and the fMet primer (59-GGGCATGAACCCACTAGCTT-39), which anneals to the fMet tRNA immediately adjacent to the control region on the opposite side ( Lehman et al. 1995). These reactions utilized 10 ml Chelex supernatant in 10 The Journal of Heredity 2000:91(1) 50 ml reaction volumes. Forty amplification cycles were performed with an annealing temperature of 508C to generate products approximately 1000 nucleotide pairs in length. Following successful external amplification, the PCR products were diluted 1:1000 in water and 1 ml was used as a template for amplification with the DPUDL-1 (59-CAATCTAGAGCCAAAGCCAGATTCA-39) and DPUDL-2 (59-CCTCTGCAGGTAGCCCTTTAATCAGGCATC-39) primers. Twenty-five amplification cycles were performed with an annealing temperature of 538C to generate products approximately 767 nucleotide pairs in length. All amplifications were accomplished in 50 ml reaction volumes using 8.75 pmol of each primer and 1.25 units AmpliTaq DNA polymerase (Perkin-Elmer) in a PTC-100 thermocycler (MJ Research). For subsequent RFLP analysis, no post-PCR purification was necessary. For subsequent sequence determination, the PCR products were subject to the GeneClean ( Bio 101) process for desalting and were rehydrated in 25 ml of water prior to sequence analysis. Successful DPUDL-1/2 PCR products were subjected to digestion with a panel of seven restriction endonucleases (BamHI, DraI, HhaI, MseI, PalI, RsaI, and Sau96I; New England BioLabs) and the resulting fragments were then separated on either 2% or 3% agarose gels, and bands were visualized by ethidium bromide staining and UV transillumination. Digests were performed using 6–10 ml of unpurified PCR product, 2 ml of manufacturer’s supplied 103 buffer, and 2–8 units of enzyme in 20 ml total volume at 378C for at least 2 h. The seven-enzyme fragment profiles were compiled to generate composite mtDNA genotypes for 2–20 individual Daphnia per lake. After all individuals were unambiguously genotyped with this panel of enzymes, and nucleotide sequences were obtained for representative individuals (see below), a subset of all individuals in the study were genotyped with an eighth restriction enzyme, Tsp509I. This was done to probe for additional variation within one of the defined genotypes (see Results). For each of the composite seven-enzyme RFLP genotypes, a nearly complete nucleotide sequence of the 767 bp DPUDL1/2 PCR product was obtained by direct sequencing of this product from at least one individual. Nucleotide sequencing was performed manually on the double-stranded PCR products as follows. Four to 7 ml of GeneCleaned DNA were mixed with 100 pmol primer, boiled 5 min and snapcooled in a dry-ice/ethanol slurry. Se- quencing was then carried out utilizing [35S]dATP, the modified T7 DNA polymerase (Sequenase, version 2.0, US Biochemical Corp.), and dideoxynucleotide triphosphates. Reaction products were separated on 6% polyacrylamide 0.4 mm gels containing 7 M urea and 0.53 TBE buffer in 38 cm 3 50 cm Sequi-Gen apparatuses ( BioRad Laboratories) and visualized by autoradiography. Five primers were used to completely span the control region: DPUDL-1, DPUDL-2, DCRI-1 (59-TCGGAACCACTTTAGCGCAAG-39) and DCRI-2 (59GTTTAAACCCCTTTTCTAATTTTTGGACC-39) which anneal approximately 100 nucleotides internal to the DPUDL-1 and DPUDL-2 primers, respectively, and DSP-2 (59-GAAAATTAGTATTATAATC-39), which anneals in the middle of the control region and is necessary to resolve precisely the homopolymer runs of cytosine (see Results). These sequences were compared to the published D. pulex control-region sequence from Amana Pond, Iowa (Van Raay and Crease 1994). Sequences were aligned first by use of the Jotun-Hein method contained within the MegAlign module of the Lasergene DNA software package ( DNASTAR, Inc.) and then refined by eye. Observed transitions, transversions, and insertions/deletions (indels) among the aligned sequences, along with percent A 1 T content of the sequences and absolute (uncorrected) numbers of pairwise nucleotide differences, were computed manually. Population Genetic Analyses The nucleotide sequences were used to construct restriction maps with high certainty and these maps could then be used to produce a presence-absence matrix of restriction sites for all individuals genotyped through the RFLP technique. This matrix was input into the HAPLO2 computer program ( Lynch and Crease 1990) in order to calculate estimated pairwise nucleotide substitutions per site among genotypes [d̂xy, corrected from observed values using the method of Jukes and Cantor (1969)], and to estimate the proportion of total mtDNA genetic variance at the nucleotide level that was due to among-population differentiation (NST). In addition, the traditional GST statistic, which unlike NST, treats all genotypes as being equally distinct from one another, was calculated by hand using the weighting method of Nei (1987). Populations were defined in these computer analyses in several different fashions: all lakes within a geomorphic region, all lakes within a river drainage basin, and all lakes within a north-south fly- way ( Figure 1). As a control for spurious results from pooling individuals from many different lakes in a single population, an additional analysis was performed in which populations were constructed by pooling lakes of a given trophic status (oligotrophic, mesotrophic, eutrophic, and hypereutrophic; status as of 1985, Johnson et al.) regardless of geographic location. Genetic-structure analysis using the NST statistic was performed by treating the raw data in three ways. First, the presence-absence matrix of restriction sites was used to estimate d̂xy values among genotypes. This approach, indicated by the label ‘‘RFLP’’ in the Results, uses the maximum-likelihood estimates of Nei and Tajima (1983) to compute the expected pairwise sequence divergences of two genotypes based on the number and type (‘‘4-cutter,’’ ‘‘6-cutter,’’ etc.) of restriction enzymes employed to genotype individuals. As such, this approach does not utilize all the sequence information present in each PCR product. However, because all the individuals in the study were typed with all seven restriction enzymes, but only a small fraction of these individuals had the nucleotide sequences of their PCR products determined completely, the RFLP approach will detect any sequence variation within those individuals at the relevant restriction sites. In contrast, a second approach to data treatment was the utilization of the complete nucleotide sequence data for each genotype, based on the sequences obtained from representative individuals from each genotype. Here, all nucleotide sequence data within the PCR product is utilized, but an assumption is made that all individuals with a given RFLP-based genotype have identical nucleotide sequences, even at nucleotides not assayed by the restriction enzymes. This approach is indicated by the label ‘‘Full Seqs’’ in the Results. A third approach, a hybrid of the first two and indicated by the label ‘‘RFLP/Seq’’ in the Results, was to input the actual nucleotide sequences at the restriction sites into the matrix used to calculate d̂xy values. For example, if genotype 1 had an RFLP presence-absence description for three MseI restriction sites of ‘‘111’’, this would be converted into an RFLP/Seq genotype of ‘‘TTAATTAATTAA’’, because the recognition sequence of MseI is 59-TTAA-39. Similarly, if genotype 2 had the RFLP genotype of ‘‘1–1’’, its RFLP/Seq genotype might be ‘‘TTAATTAGTTAA’’; the actual no-cut sequences were determined by the represen- tative sequences of each RFLP genotype. This hybrid approach potentially improves the estimates of d̂xy values among genotypes over the straight RFLP approach without making assumptions about sequences not covered by restriction sites. For each method of analysis, a subset of the data was analyzed in which artificial lakes (reservoirs) were excluded, with only naturally formed lakes being included. Results Of the 90 lakes sampled, successful DPUDL-1/2 PCR amplifications were obtained from individual Daphnia from 42 of these lakes ( Table 1). These 42 lakes included representatives from 9 of the 10 geomorphic regions, 14 of the 18 river drainage basins, and all of the four defined migratory flyways and four trophic status levels. A roughly uniform distribution of success was achieved geographically, with fewer successes in the southeastern portion of the state, which is drier and supports fewer permanent waterbodies, and more successes along the coast and in the Cascade Range ( Figure 2). Among the unsuccessful lakes, failure was approximately evenly split between lakes in which no Daphnia could be found, and in which Daphnia that did not consistently generate DPUDL-1/2 PCR products. The latter situation could be a consequence of either sample degradation prior to genetic analysis or the predominance of Daphnia not in the pulex group. The DPUDL PCR primers work much less efficiently when nonpulex-group Daphnia are used as templates ( Lehman et al. 1995). For example, amplification with these primers is not reliable from D. middendorffiana, D. tenebrosa, and most notably D. galeata, which is a common lake inhabitant in other portions of North America. From 2 to 20 individuals from each successful lake were genotyped with the seven restriction enzyme panel (mean 9.2 individuals per lake); a total of 388 individual Daphnia were typed. Among these individuals, a total of 11 distinct RFLP genotypes were detected ( Table 2). These genotypes were all relatively similar to one another, with no more than nine restriction site differences separating the most divergent types (15- and 17; Table 3). Three of the genotypes (11-, 12-, and 15-) exhibited a deletion in the DPUDL-1/2 region that was detectable in most RFLP gel electrophoreses, and later demonstrated by sequence analysis (see below) to be a synapomorphic 50 bp deletion in the middle of the control region. Scoring this deletion as the loss of two restriction sites, a parsimony network of site differences among all genotypes was constructed by hand ( Figure 3). The fact that no single most-parsimonious network could be constructed (genotypes 12- and 17 can articulate in more than one equiparsimonious way), combined with the small numbers of restriction-site differences separating most genotypes, indicates that these genotypes are all closely related, perhaps being relatively recent derivatives of one another. The strong interconnectivity of the network suggests further that some of the site gains and losses may be homoplaseous. Three of these genotypes (7, 8, and 9) were relatively widespread ( Figure 2), being detected in all populations, regardless of how the populations were defined. The remaining eight genotypes were much rarer, being present in at most 11 of the 388 genotyped individuals and being restricted to single populations ( Table 1). A rankorder frequency histogram of all 11 genotypes shows the discontinuity in frequencies between the common and rare genotypes ( Figure 4). Notably, the three common genotypes represent the ‘‘core’’ of the connectivity network, with the most common genotype by far (genotype 7) being at the center, at least as depicted in Figure 3. A haplotype diversity of 0.641 was calculated from all genotype frequencies; this figure represents the mtDNA equivalent of heterozygosity among all individuals ( Nei 1987). With the exception of one lake (MER), no apparent variation in genotype frequencies was observed when the same lake was sampled in successive years ( Table 1). This was confirmed by R 3 C G-tests (with the Williams correction) of genotype frequency independences year to year for each individual lake. Only Mercer Lake revealed a significant (P , .05) frequency shift between the 1996 and 1997 samplings. This shift could represent a selection-driven temporal genotypic turnover as detected in some pond Daphnia ( Lynch 1987), or an extremely rapid colonization of this lake by an invading genotype, but the small sample sizes (n # 8 in each year) allow for sampling error effects to occur in individual lakes and preclude any definitive conclusions to be drawn. Nucleotide sequences of approximately 679 bp of the DPUDL-1/2 PCR product from the eleven genotypes (GenBank accession numbers AF165861-AF165872) were aligned Straughan and Lehman • Daphnia Population Genetics 11 Table 1. Summary of lake and RFLP data in this study Lake GeoDrainage morphic Fly basin region way Trophic status Source Sunset (SUL) Devil’s ( DVL) NC MC CR CR CO CO Eu Eu Natural Natural Elbow ( ELB) Mercer (MER) MC MC CR CR CO CO Unknown Meso Natural Natural Munsel (MUL) Sutton (STL) Triangle ( TRL) North Tenmile ( NTL) Tenmile ( TML) Eel ( EEL) Saunders (SAL) Marie (MAL) Hills Creek ( HCR) Scott (SCL) Gold (GOL) Lemolo ( LEM) Applegate (AGR) Lost Creek ( LCL) Emigrant ( EMR) Upper Squaw ( USL) Laurance ( LAL) Wickiup (WKR) Pine Hollow (PHR) Lava ( LAV) MC MC MC SC SC SC SC SC WM WM WM UM RG RG RG RG HD DE DE DE CR CR CR CR CR CR CR CR WC HC HC WC KL KL KL KL HC HC DU HC CO CO CO CO CO CO CO CO VA CE CE VA VA VA VA VA CE CE CE CE Meso Eu Meso Eu Eu Meso Meso Unknown Meso Oligo Meso Meso Meso Meso Eu Meso Meso Meso Meso Oligo Natural Natural Natural Natural Natural Natural Natural Natural Artificial Natural Natural Artificial Artificial Natural Artificial Natural Natural Artificial Artificial Natural Ochoco (OCR) Prineville (PRR) Paulina (PAL) DE DE DE HL BM HL CE CE CE Eu Eu Meso Artificial Artificial Natural East ( EAL) Hosmer ( HOL) DE DE HL HC CE CE Meso Meso Natural Natural Little Cultus ( LTL) Hyatt ( HYR) Upper Klamath ( KLL) DE KL KL HC KL BR CE VA CE Oligo Eu Hyper Natural Artificial Natural Gerber (GRR) Howard Prairie ( HPR) Willow Creek (WIR) McKay (MCR) Mangone (MGL) Morgan (MRL) Phillips (PIR) Wolf Creek (WCR) Unity ( UNR) Antelope (ANR) Totals KL KL UT UT JD GR PO PO PO OW BR KL DU DU BM BM BM BM BM OW EA VA EA EA EA EA EA EA EA EA Eu Meso Meso Eu Meso Eu Meso Eu Eu Eu Artificial Artificial Artificial Artificial Natural Natural Artificial Artificial Artificial Artificial Year sampled N 1996 1996 1997 1996 1996 1997 1996 1996 1997 1996 1996 1997 1997 1997 1996 1997 1997 1996 1996 1996 1996 1996 1996 1996 1996 1996 1997 1996 1996 1996 1997 1996 1996 1997 1997 1996 1996 1997 1996 1996 1996 1996 1996 1996 1996 1996 1996 1996 7 7 12 5 8 7 8 8 8 8 8 12 12 9 6 9 10 4 8 8 8 6 8 8 8 8 12 9 8 8 10 6 8 8 8 10 8 9 8 6 8 2 10 8 9 9 7 7 388 7 8 7 3 4 5 8 1 8 7 9 2 3 12– 13 15– 16 17 1 8 8 7 1 9 1 4 5 7 1 5 3 2 3 2 7 6 2 1 8 4 6 12 1 5 8 8 8 6 8 3 1 8 12 8 3 1 5 11– 6 8 12 9 5 9 10 3 1 2 6 3 5 6 6 5 2 1 3 5 5 1 1 1 1 10 8 9 9 7 7 207 55 88 1 8 8 6 11 2 Abbreviations for drainage basins, geomorphic regions, and flyways correspond to those in Figure 1. Abbreviations for trophic status are as follows: Oligo 5 oligotrophic, Meso 5 mesotrophic, Eu 5 eutrophic, Hyper 5 hypereutrophic. Because the statuses for ELB and MAL could not be unambiguously determined, these two lakes were left out of the analyses involving trophic status. The numbers in the 11 right columns indicate the numbers of each genotype found in each lake by a seven-enzyme RFLP screen of the mtDNA control region. in comparison to the published Iowa D. pulex sequence of the same nucleotide span ( Van Raay and Crease 1994). This ,680 bp region begins 12 bp downstream of the 39 end of the DPUDL-1 primer-binding site, at the first nucleotide of the control region immediately adjacent to the 39 end of the 12S rRNA gene, and this region ends approximately 24 bp upstream of the 39 end of the DPUDL-2 primer-binding site, at the last nucleotide of the control region ( Van Raay and Crease 1994). The general features of this region in any of these sequences do not differ radically from the D. pulex sequence. The A 1 T content of the D. pulex sequence is 66.6%, while the A 1 T con- 12 The Journal of Heredity 2000:91(1) tents of these sequences ranges from 63.0% (genotype 17) to 66.9% (genotype 8). Distinct runs of homopyrimidines are evident near the 12S rRNA bordering side of the sequences. There is a pair of direct repeats of 29 nucleotide pairs in length in every sequence and a putative stem-loop structure located between nucleotide positions 542 and 609 from the 59 end of the control region. Pairwise sequence divergences for the DPUDL-1/2 fragment as a whole range from 0.29% to 3.5% (counting each indel, regardless of length, as a single nucleotide change), and average pairwise nucleotide substitutions per site (d̂xy) values range from 0.0032 to 0.0451 ( Table 3). These val- ues are average or even below average for typical within-species comparisons of the mtDNA control region in animals (Stanley et al. 1996), and support the assessment that all samples in this study are of the same species. The most divergent sequence is genotype 17, which was detected in only two individuals from Gerber Reservoir. Notably this genotype is also a rare inhabitant of some coastal ponds [genotype S82 of Lehman et al. (1995)] and may represent a ‘‘pond’’ genotype found in a lake (c.f. Cerny and Hebert 1993; Crease et al. 1997; Pfrender ME, Spitze K, and Lehman N, manuscript in review). Failure of a phylogenetic analysis by PAUP to recover a sin- Figure 2. Map of the 42 lakes included in the genetic survey and geographic distributions of the major mtDNA RFLP-based genotypes. Genotypes 10–17 are lumped as ‘‘other.’’ ferences. When all genotypes are included, all of these NST values are significantly different from zero using the D statistic (D 5 [NST/SE]2, where SE is the standard error in NST), which is expected to have a chisquare distribution with one degree of freedom ( Lynch and Crease 1990). By comparison, the GST value for drainage basins is 0.567, indicating that nucleotide differences among the genotypes have a minimal effect on the degree of genetic partitioning as would be predicted from the high sequence similarity of all genotypes. To test whether this differentiation was a consequence of the rare genotypes that are typically found in only one drainage basin, the analysis was rerun with all genotypes removed except the common ones (7, 8, and 9). Of interest, this operation actually increased the NST value to gle most parsimonious tree of the 11 genotypes or to give a significant phylogenetic signal by the g1 statistic ( Hillis and Huelsenbeck 1992) further supports the homoplaseous nature of many of the observed nucleotide differences (data not shown). Analyses of population-genetic partitioning using the NST statistic are summarized in Table 4. The most obvious conclusion from this summary is that when populations are defined by the river drainage basin, genetic subdivision is the greatest and most significant. When full datasets are used, values of NST range from 0.484 to 0.592 when populations are defined this way, meaning that between 48% and 59% of all genetic differentiation at the nucleotide level is attributable to amongpopulation (among drainage basin) differences as opposed to within-population dif- Table 2. Mitochondrial DNA control region RFLP genotypes detected among Oregon lake Daphnia Genotype Type clone BamHI GGATCC DraI TTTAAA HhaI GCGC MseI TTAA PalI GGCC RsaI GTAC Sau96I GGNCC 7 8 9 10 11– 12 12– 13 15– 16 17 USL A-1 PHR A-2 TML A-1 DVL A-9 HPR C-4 PAL A-2 HYR A-2 PAL A-4 MCR A-6 MCR A-3 SCP-82 A A A A F– F F– F F– A A B B B B B– B B– B B– B D A A A A A– A A– A A– B A A A L A A– A A– A L– A E F F F F F– F F– A F– F A F B F F F– F F– F F– F F C C C B B– C C– C B– C C Restriction fragment patterns are assigned an arbitrary letter, with the exception that patterns exhibiting no cuts within the 767 bp DPUDL-1/2 PCR product are designated ‘‘F.’’ The recognition sequences for the enzymes used are indicated below their names. The negative signs indicate the presence of a 50 bp deletion in the middle of the control region for three of the genotypes. Actual RFLP patterns are available upon request from the authors. 0.715 or 0.665, depending on whether RFLPs or full sequences were used as the raw data. However, if genotype 9 is removed from the dataset instead, the NST value drops to 0.327 and becomes not significantly different from zero (P . .05). These manipulations suggest that genotype 9 is the single greatest contributor to population differentiation among drainage basins. Examination of the geographic distribution of genotypes ( Table 1 and Figure 2) reveals that genotype 9 is common along the Pacific Coast and in the northeastern portion of the state, but rare inbetween. To help ensure that a subtle difference between ‘‘western’’ and ‘‘eastern’’ forms of genotype 9 did not go undetected by our seven-enzyme RFLP analysis (a difference that would actually increase the magnitude and significance of the NST value), a subset of lakes containing high frequencies of genotype 9 (MUL, MER, DVL, and EEL from the west, and MRL, WCR, WIR, PIR, PRR, and UNR from the east) was further assayed with the enzyme Tsp509I. This enzyme has the recognition sequence 59-AATT-39 and would be expected to cut genotype 9 at 10 sites, surveying approximately an additional 34 nucleotides for variation. No variation among these lakes was detected with this enzyme, however, lending confidence that the NST values obtained with the original RFLP screen were robust. Removing all artificial lakes from the analysis was performed to ensure that recent anthropomorphic influences were not artificially influencing the results. All artificial lakes in this study were constructed by human activity since 1900 and must contain immigrant populations of Daphnia from other waterbodies. Visual examination of Table 1 shows no obvious differences in genotype frequencies between natural and artificial lakes. For example, the combined frequency of the three common genotypes (7, 8, and 9) is only marginally significantly different between the two types of lakes by a G test of independence (P 5 .04), but when the putative ‘‘pond’’ genotype 17 is removed from the analysis, the difference becomes nonsignificant (P . .05). For the eight rare genotypes, each genotype was found only in natural or only in artificial lakes, but three genotypes were restricted to natural and five to artificial lakes, so one might conclude that recently formed waterbodies harbor a disproportionate amount of this class of genetic variation (see Discussion). Removal of artificial lakes from the NST calculations has little effect on genetic parti- Straughan and Lehman • Daphnia Population Genetics 13 Table 3. Observed pairwise divergences among mtDNA genotypes 7 8 9 10 11– 12 12– 13 15– 16 17 7 8 9 10 11– 12 12– 13 15– 16 17 — 0.0192 0.0241 0.0241 0.0241 0.0258 0.0373 0.0258 0.0340 0.0290 0.0350 1 — 0.0241 0.0241 0.0225 0.0291 0.0356 0.0291 0.0356 0.0290 0.0383 1 2 — 0.0032 0.0225 0.0274 0.0323 0.0307 0.0340 0.0274 0.0417 1 2 2 — 0.0225 0.0274 0.0323 0.0307 0.0340 0.0274 0.0417 4 5 5 3 — 0.0307 0.0160 0.0307 0.0144 0.0274 0.0400 1 2 2 2 3 — 0.0242 0.0032 0.0209 0.0160 0.0367 3 4 4 4 1 2 — 0.0274 0.0080 0.0208 0.0451 2 3 3 3 4 1 3 — 0.0242 0.0193 0.0334 5 6 4 4 1 4 2 5 — 0.0208 0.0451 1 2 2 2 5 2 4 3 6 — 0.0383 6 7 7 7 8 7 7 6 9 7 — Above the diagonal, number of observed pairwise restriction site differences between genotypes. Below the diagonal, average pairwise nucleotide substitutions per site between full-sequence genotypes (d̂xy values). tioning by drainage basin. This manipulation lowers the NST value from 0.523 to 0.508 for the RFLP data, and from 0.592 to 0.533 in the Full Seqs data. By contrast, if natural lakes are removed from the analysis and only the 17 artificial lakes are analyzed, the NST value is lowered to 0.391, which is now only of marginal significance (P 5 .05) as a consequence of the reduced population size. Among-population differentiation is much weaker and less significant if populations are defined on the basis of geomorphic region, bird migratory flyways, or lake trophic status ( Table 4). Defining populations by geomorphic regions produces the highest NST values of the three, with the RFLP-based value at 0.265, but this is only marginally significantly different from zero (P 5 .046). The Full Seqs value for geomorphic regions is slightly greater at 0.283, and because of the lowered standard error values engendered by using complete nucleotide sequences, this value is significantly different from zero based on the D statistic. However, it should be recalled that using full sequences assumes that each RFLP-defined genotype is identical at all sites not surveyed by restriction enzymes, and thus some within-population variation could go undetected that could reduce the NST estimation. RFLP-based NST calculations for flyway and trophic status are 0.201 and 0.141, respectively, and neither is significantly nonzero. Figure 3. Parsimony network of mtDNA control region RFLP genotypes discovered in Oregon lake Daphnia. Each cross-hatch indicates a single gain or loss of a restriction site. The synapomorphic 50 bp deletion detected in genotypes 11-, 12-, and 13- is scored as the loss of two restriction sites. Multiple equiparsimonious paths to genotypes 12- and 17 demonstrate the strong interconnectivity of this network and the homoplaseous nature of some of these restriction site mutations. The Iowa D. pulex genotype (1, dashed circle) is included for reference, but the restriction enzymes utilized in this study were chosen in part based on their ability to differentiate among Oregon lake genotypes and hence underestimate the genetic distance between Iowa pond and Oregon lake samples. 14 The Journal of Heredity 2000:91(1) Figure 4. Rank order frequency histogram of the 11 mtDNA control region genotypes. Because the NST values tend to increase as the geographic area of the populations decreases, it is important to demonstrate that the significance of the drainage basin results is not an artifact of an isolation-bydistance phenomenon. Populations that are not subdivided at larger geographic scales can show (inconsequential) subdivision if the scale is made small enough to ensure isolation of rarer genotypes (e.g., Lehman and Wayne 1991). For example, in the present study, if each of the 42 lakes is considered a separate population, NST rises to 0.73. To test further the significance of drainage basin subdivision, Oregon, being roughly rectangular, was artificially partitioned into rectangular sections by overlaying 3 3 6 and 6 3 3 grids. The resulting 18 sections average in size exactly as do the 18 river drainage basins, but encompass different sets of lakes. Auspiciously, both the 3 3 6 grid, making tall, thin sections, and the 6 3 3 grid, making short, fat sections, produce 14 lake-containing populations to mirror the 14 drainage basins that contain lakes in this study. Both grid-generated populations produce markedly lower NST values (3 3 6: 0.383; 6 3 3: 0.299) with RFLP data than do the drainage basin populations. Thus, despite the fact that all three constructions are of the same number and average size, and even contain many of the same lake group- Table 4. Population subdivision among mtDNA genotypes in Oregon lake Daphnia Population division Data format Lakes included Genotypes included NST SE in NST D Significance RFLPs RFLPs RFLPs RFLPs RFLPs RFLP/seqs Full seqs Full seqs Full seqs Full seqs Left CR seq Mid CR seq Rt CR seq All All All Natural Artificial All All All All Natural All All All All (11) 7, 8, 9 only All except 9 All (6) All (8) All (11) All (11) 7, 8, 9 only All except 9 All (6) All (11) All (11) All (11) 0.523 0.715 0.327 0.508 0.391 0.484 0.592 0.665 0.402 0.533 0.614 0.530 0.621 0.139 0.184 0.193 0.104 0.203 0.148 0.029 0.056 0.041 0.062 0.033 0.061 0.036 14.2 15.1 2.88 24.1 3.71 10.6 417 143 94.5 72.8 345 74.9 297 P, P, N.S. P, P5 P, P, P, P, P, P, P, P, .01 .01 RFLPs RFLPs RFLPs RFLP/seqs Full seqs Full seqs All Natural Artificial All All Natural All All All All All All (11) (6) (8) (11) (11) (6) 0.265 0.426 0.302 0.230 0.283 0.395 0.132 0.070 0.169 0.040 0.055 0.060 4.05 37.5 3.20 32.4 26.4 43.0 P5 P, N.S. P, P, P, .046 .01 RFLPs RFLPs RFLPs RFLP/seqs Full seqs Full seqs All Natural Artificial All All Natural All All All All All All (11) (6) (8) (11) (11) (6) 0.201 0.223 0.274 0.173 0.152 0.219 0.120 0.161 0.143 0.049 0.041 0.049 2.82 1.92 3.68 12.3 14.1 20.3 N.S. N.S. P5 P, P, P, RFLPs RFLPs RFLP/seqs Full seqs Full seqs All Natural All All Natural All All All All All (11) (6) (11) (11) (6) 0.141 0.214 0.125 0.132 0.156 0.078 0.111 0.035 0.025 0.026 3.27 3.72 12.8 28.4 35.6 N.S. N.S. P , .01 P , .01 P , .01 Drainage Basins Geomorphic regions .01 .05 .01 .01 .01 .01 .01 .01 .01 .01 .01 .01 .01 Flyways .05 .01 .01 .01 Trophic status ings as a consequence of geographic proximity, the information contained in the boundaries of river drainage basins has a strong positive effect on genetic subdivision. In all cases, including the drainage basin case, the RFLP/Seqs datasets generate NST values that are lower than those from the RFLP datasets ( Table 4). The SE values for the RFLP/Seqs analyses are also lower, as expected. By contrast, use of the Full Seqs dataset affects the NST value in no consistent fashion, raising it in the drainage basin and geomorphic region cases, lowering it in the flyway case, and leaving it essentially unaffected in the trophic status case. In the drainage basin case, focusing on the left and right portions of the control region will increase the NST values over that from the entire 679 bp span studied, while focusing on the middle portion of the control region lowers the NST value in accordance with observations that the D-loop region of the control region is more constrained than its flanking regions (c.f. Moritz et al. 1987). Discussion Genetic partitioning among mtDNA genotypes of lake populations of members of the D. pulex species complex appears to be the greatest among river drainage basins in Oregon. All other manners in which the total population was subdivided showed systematically lower among-population differentiation with analogues of Wright’s F statistics. When geomorphic regions were used to construct populations, the NST values were all significant, but typically one-half of the corresponding drainage basin values. The NST and GST values for mtDNA genotypes among river drainage basins range from roughly 0.5 to 0.6, indicating that about half of the total amount of variation, as detected by a seven-enzyme RFLP screen of roughly 750 bp of the control region, is a consequence of variation among the river drainage basins of the state. Gerrymandering the state into regions of physical-geographical similarity shows marginal levels of among-population differentiation, on the order of 20– 30%. However, comparison of Figure 1a and b reveals a significant amount of covariation between lakes found in drainage basins and geomorphic regions. Thus the among-population variation detected in the latter could be in fact an orthogonal effect of the former. It is not unreasonable to assume that migration among ‘‘permanent’’ waterbodies such as lakes could be mediated by transport of ephippia through lotic environments. With a low specific gravity and a hydrophobic surface, ephippia usually float on the surface of the water ( Dodson and Frey 1991). Thus they would be subject to dispersal through the inlet and outlet streams of lakes, which the lakes of Oregon typically possess. In fact Dobson and Frey (1991) note that Cladocerans, the group to which Daphnia belong, are occasionally found as adults in quiet moving water and in marginal vegetation bordering streams. Notably, 2 of the 11 genotypes detected in this study (12 and 13) were confined to the twin caldera lakes of an extinct volcano (Paulina Lake and East Lake), to which there are no inlet streams, a situation that would favor the in situ accumulation of mutations. While the present study does not rule out gene flow aided by waterfowl, fish, wind, or groundwater, it identifies the interconnectivity of lakes by surface water as the most evident source of among-population migration. The similarity of mtDNA genotypes and low overall sequence divergences detected among lake mtDNA genotypes could be the result of geologically recent events that affected the region. Oregon lies below the southern extent of Pleistocene glaciation (Pielou 1991) and thus its lakes were not covered by the Cordilleran ice sheet. However, the region, especially what is now southern Washington and northern Oregon, was subjected to repeated mass floodings from the release of water from the glacial lakes Columbia and Missoula. These large lakes were present during the Pleistocene and contained more than 2000 km3 of water to the north and east of modern Oregon, and would periodically flood toward the Pacific Ocean with the sudden failures of the ice dams that maintained them (Pielou 1991). The last such event occurred approximately 10,000 years before present and could readily explain the genetic homogenization that is depicted in the distributions of the common, and similar, mtDNA genotypes 7, 8, and 9. Furthermore, Lake Missoula spanned what is now the continental divide and had a variable drainage basin that probably periodically included the modern Mississippi River Valley through predecessors of the Missouri River. Thus if Daphnia migration was aided by river drainages, it is not surprising that poor differentiation between Oregon lake and Midwest lake genotypes was discovered using an RFLP analysis of whole Straughan and Lehman • Daphnia Population Genetics 15 mtDNA genomes (NST 5 0.36 6 0.24; Crease et al. 1997). Periodic large-scale flooding and river transport of cladoceran ephippia thus can help to explain not only the populationgenetic patterns (or lack thereof ) reported in this study, but the commonality of paraphyly in the D. pulex species complex at both the population and species levels. A companion study to this report where Oregon pond populations were analyzed for genetic differentiation suggested that the D. pulex (i.e., pond) and D. pulicaria (i.e., lake) populations in this region are very closely related and that the lake populations may actually provide a genetic source for the more ephemeral pond populations through flooding events (Pfrender ME, Spitze K, and Lehman N, manuscript in review). Mass floodings could actually impede speciation in preexisting lakes, while augmenting speciation overall, by creating new waterbodies that are unoccupied habitats. An alternate scenario to river transport and flooding as sources for gene flow would be that recent human activity has disrupted the prehistoric patterns of genetic differentiation. Although this cannot be ruled out by the existing data, it is numerically unlikely. The lakes in this study each contain in excess of 1 million m3 of water (average, 100 million m3), the smallest being Scott Lake at 1.2 million m3. Assuming an average density of 1 animal/cm3 for Daphnia in lakes where they exist in moderate to high abundance (unpublished observations), as is the case in all 42 lakes reported here, an minimum estimate of 1.2 3 1012 animals/lake can be made. Even if stratigraphic variation within lakes were to lower the average density two to three orders of magnitude below 1/ cm3, these lakes would still support more than 1 billion individuals. In the absence of selection, such large population sizes should be rather insensitive to drift-driven genotypic turnover by innoculums on the order of tens or hundreds as would be expected by lake-to-lake watercraft (or waterfowl) transport of live Daphnia. Selection has been demonstrated to occur among nuclear genotypes in pond Daphnia as monitored by allozyme markers ( Lynch 1987), but no reports exist of selective differences among mtDNA genotypes in Daphnia. In contrast, flooding and river transport have the potential to affect Daphnia distributions over longer periods of time, larger land areas, and continually over time. Lakes are often thought of as ‘‘per- 16 The Journal of Heredity 2000:91(1) manent’’ waterbodies, especially in comparison to temporary ponds, but even the natural lakes in this study are of geologically recent origin, being formed by landslides, volcanic activity, tectonic activity, river quiescence, and glacial scouring in the high Cascades (Johnson et al. 1985). Many have inlet and outlet streams that connect them to other lakes. For example, the nearby Hyatt and Howard Prairie Reservoirs are part of the same stream system that eventually drains into the Klamath River, and these two lakes were the only ones to harbor the rare mtDNA RFLP genotype 5. It would be of interest to compare more lakes found along the same specific waterway, as gene flow by river transport would be expected to have a strong downstream component. In this study, however, an attempt was made to sample the state uniformly, and thus nearby lakes along the same waterway were avoided during sampling. Similarly, lakes in the northern Rocky Mountains, such as Montana, Alberta, and British Columbia, which may have been source populations for Oregon lakes if the latter were indeed populated by Pleistocene floods, should possess a higher clonal diversity and greater differentiation among mtDNA genotypes than those detected in this study. Also somewhat suggestive of river transport is the diversity among artificial lakes in this study. All of these lakes were formed by the creation of dams along major Oregon rivers during the past century and thus have been recently populated. When only artificial lakes are analyzed, all RFLP-based NST values become marginally significant or nonsignificant ( Table 4). This indicates that, to a certain extent, no single factor tested is responsible for gene flow into these lakes. Moreover, five of the eight rarer genotypes are found among the 17 artificial lakes, while only three of the rare genotypes are found among the 25 natural lakes. Yet a closer inspection of the length of time these lakes have been in existence reveals a striking pattern. Of the four oldest artificial lakes in this study (OCR, HYR, GRR, and MCR), all constructed during the 1920s, only OCR fails to harbor a unique rare genotype not found elsewhere. The other artificial lakes were formed by dam projects that were completed after 1939. Newer reservoirs thus are populated by common Oregon genotypes, and because they are all suddenly created in the midst of a river flow, lotic transport would be the most apparent source of immigrants. The longer a reservoir exists, the more likely it becomes that novel genotypes are brought in by human, wind, and/or animal transport. The natural lakes on the other hand could represent an even longer-term stasis state that is dominated by mass floodings which tend to homogenize populations and reestablish the predominance of the common genotypes in the region, in this case genotypes 7, 8, and 9. Again, testing of this scenario awaits study of additional lakes. Because of taxon-specific rate heterogeneities and the high frequencies of insertions and deletions, it is hard to calibrate a molecular clock for the mtDNA control region ( Lynch and Jarrell 1993). Thus it is difficult to estimate how long ago the genotypes in this study diverged from one another. In a comparison study, Weider et al. (1996) determined nucleotide sequences from a 254 bp region of the mtDNA control region from Greenland and Iceland pond isolates of D. pulex. Mean pairwise sequence divergence values in that study ranged from 2% to 10% from genotypes that likely diverged during the Pleistocene, providing further indirect evidence that Oregon lake Daphnia genotypes diverged through mutational processes in the last 10,000–50,000 years. For context, an analysis of mtDNA proteincoding gene ( ND5) evolution in Daphnia places the split of the D. pulex group from its closest relatives (D. melanica and D. middendorffiana) at 1.4–1.6 million years ago (Colbourne et al. 1998), and RFLP variation within this gene among 14 North American lake populations of pulex group members suggests the existence of two distinct genotype clades that likely diverged during the mid-Pleistocene ( Dufresne and Hebert 1997). From Table 4 it can be seen that the way in which the data are formatted has little effect on the general results. The NST values estimated from RFLPs, RFLP/Seqs, or Full Seqs are similar in all cases, never differing by more than about 0.1. The main distinction among these formats is the estimations of the standard errors, which appear to drop when Full Seqs are used. However, this increase in statistical accuracy must be weighed against uncertainties generated when full sequences are predicted from RFLP-based population surveys. Clearly the optimal approach would be to obtain complete nucleotide sequences from each individual in the survey—an increasingly plausible venture with advances in rapid sequencing technologies. The similarities in NST values across data formats generated in this study may be a consequence of the low overall divergences among genotypes and it remains to be seen whether this situation will be observed in more diverse datasets. The results of this study indicate that even with low levels of nucleotide sequence divergence among a small number of genotypes, a fine-scale mtDNA analysis can be used successfully to test hypotheses about patterns of population differentiation. Among-population variation reaches its highest proportion when populations are defined in terms of river drainage basins and its lowest when they are defined in terms of lake trophic status, which has very little geographical correlation. Taken together, these observations suggest that there is a pattern to recent gene flow among Daphnia-inhabited waterbodies, but that without a detailed analysis of large numbers of samples, the pattern may go undetected. References Avise JC, 1989. Gene trees and organismal histories: a phylogenetic approach to population biology. Evolution 43:1192–1208. Bermingham E, Lamb TE, and Avise JC, 1986. Size polymorphisms and heteroplasmy in the mitochondrial DNA of lower vertebrates. J Hered 77:249–252. Brown WM, 1985. The mitochondrial DNA genome of animals. In: Evolutionary genetics (MacIntyre RJ, ed). New York: Plenum Press; 95–130. Cerny M and Hebert PDN, 1993. Genetic diversity and breeding system variation in Daphnia pulicaria from North American lakes. Heredity 71:497–507. Colbourne JK, Crease TJ, Weider LJ, Hebert PDN, Dufresne F, and Hobaek A, 1998. Phylogenetics and evolution of a circumarctic species complex (Cladocera: Daphnia pulex). Zool J Linn Soc 65:347–366. Crease TC, Spitze K, Lee S-K, Yu S-L, Lehman N, and Lynch M, 1997. Allozyme and mtDNA variation in populations of the Daphnia pulex complex from both sides of the Rocky Mountains. Heredity 79:242–251. Dodson SI and Frey DG, 1991. Cladocera and other Branchiopoda. In: Ecology and classification of North American freshwater invertebrates ( Thorp JH and Covich AP, eds). New York: Academic Press; 723–786. Dufresne F and Hebert PDN, 1997. Pleistocene glaciations and polyphyletic origins of polyploidy in an arctic cladoceran. Proc R Soc Lond B 264:201–206. biology and systematics. Annu Rev Ecol Syst 18:269– 292. Mullis K, Falcoma F, Scharf S, Snikl R, Horn G, and Erlich H, 1986. Specific amplification of DNA in vitro: the polymerase chain reaction. Cold Spring Harbor Symp Quant Biol 51:260–273. Evanich JE, 1990. The birder’s guide to Oregon. Portland: Portland Audubon Society. Nei M and Tajima F, 1983. Maximum likelihood estimations of the number of nucleotide substitutions from restriction sites data. Genetics 105:207–217. Hillis DM, Moritz C, and Mable BK, 1996. Molecular systematics, 2nd ed. Sunderland, MA: Sinauer Associates. Nei M, 1987. Molecular evolutionary genetics. New York: Columbia University Press. Hillis DM and Huelsenbeck JP, 1992. Signal, noise, and reliability in molecular phylogenetic analysis. J Hered 83:189–195. Pielou EC, 1991. After the ice age: the return of glaciated life to North America. Chicago: University of Chicago Press. Johnson DM, Petersen RR, Lycan DR, Sweet JW, Neuhaus ME, and Schaedel AL, 1985. Atlas of Oregon lakes. Corvallis: Oregon State University Press. Jukes TH and Cantor CR, 1969. Evolution of protein molecules. In: Mammalian protein metabolism (Munro HN, ed). New York: Academic Press; 21–123. Rand DM and Harrison RG, 1989. Molecular population genetics of mtDNA size variation in crickets. Genetics 121:551–569. Spolsky CM and Uzzell T, 1984. Evolutionary history of the hybridogenetic hybrid frog Rana esculenta as deduced from mtDNA analyses. Mol Biol Evol 3:44–56. Lehman N and Wayne RK, 1991. Analysis of coyote mitochondrial DNA genotype frequencies: estimation of the effective number of alleles. Genetics 128:405–416. Stanley H, Casey S, Carnahan JM, Goodman S, Harwood J, and Wayne RK, 1996. Worldwide patterns of mitochondrial DNA differentiation in the harbor seal (Phoca vitulina). Mol Biol Evol 13:368–382. Lehman N, Eisenhawer A, Hansen K, Mech LD, Peterson RO, Gogan PJP, and Wayne RK, 1991. Introgression of coyote mitochondrial DNA into sympatric North American gray wolf populations. Evolution 45:104–119. Takahata N, 1989. Gene genealogy in three related populations: consistency probability between gene and population trees. Genetics 122:957–966. Lehman N, Pfrender ME, Morin PA, Crease TJ, and Lynch M, 1995. A hierarchical molecular phylogeny within the genus Daphnia. Mol Phylogenet Evol 4:395– 407. Loy WB, Patton CP, and Plank RD, 1976. Atlas of Oregon. Eugene: University of Oregon. Lynch, M, 1987. The consequences of fluctuating selection for isozyme polymorphisms in Daphnia. Genetics 115:657–669. Lynch M and Crease TJ, 1990. The analysis of population survey data on DNA sequence variation. Mol Biol Evol 7:377–394. Talling JF, 1951. The element of chance in pond populations. Naturalist 1951:157–170. Van Raay TJ and Crease TJ, 1994. Partial mitochondrial DNA sequence for the crustacean Daphnia pulex. Curr Genet 25:66–72. Walsh PS, Metzger DA, and Higuchi R, 1991. Chelex 100 as a medium for simple extraction of DNA for PCRbased typing from forensic material. BioTechniques 10: 506–513. Weider LJ and Hobaek A, 1997. Postglacial dispersal, glacial refugia, and clonal structure in Russian/Siberian populations of the arctic Daphnia pulex complex. Heredity 78:363–372. Lynch M and Jarrell PE, 1993. A method for calibrating molecular clocks and its application to animal mitochondrial DNA. Genetics 135:1197–1208. Weider LJ, Hobaek A, Crease TJ, and Stibor H, 1996. Molecular characterization of clonal population structure and biogeography of arctic apomictic Daphnia from Greenland and Iceland. Mol Ecol 5:107–118. Morin PA, Wallis J, Moore JJ, Chakraborty R, and Woodruff DS, 1993. Non-invasive sampling and DNA amplification for paternity exclusion, community structure, and phylogeography in wild chimpanzees. Primates 34: 347–356. Zhang D-X and Hewitt GM, 1996. Nuclear integrations: challenges for mitochondrial DNA markers. Trends Ecol Evol 11:247–251. Moritz C, Dowling TE, and Brown WM, 1987. Evolution of animal mitochondrial DNA: relevance for population Received February 22, 1999 Accepted September 14, 1999 Corresponding Editor: Robert Wayne Straughan and Lehman • Daphnia Population Genetics 17
© Copyright 2026 Paperzz