Amplification of noncoding chloroplast DNA for

Molecular Phylogenetics and Evolution 36 (2005) 509–522
www.elsevier.com/locate/ympev
AmpliWcation of noncoding chloroplast DNA for phylogenetic
studies in lycophytes and monilophytes with a comparative example
of relative phylogenetic utility from Ophioglossaceae
Randall L. Small a,¤, Edgar B. Lickey a, Joey Shaw a, Warren D. Hauk b
a
Department of Ecology and Evolutionary Biology, The University of Tennessee, Knoxville, TN 37996, USA
b
Department of Biology, Denison University, Granville, OH 43023, USA
Received 14 June 2004; revised 6 April 2005
Available online 1 June 2005
Abstract
Noncoding DNA sequences from numerous regions of the chloroplast genome have provided a signiWcant source of characters
for phylogenetic studies in seed plants. In lycophytes and monilophytes (leptosporangiate ferns, eusporangiate ferns, Psilotaceae, and
Equisetaceae), on the other hand, relatively few noncoding chloroplast DNA regions have been explored. We screened 30 lycophyte
and monilophyte species to determine the potential utility of PCR ampliWcation primers for 18 noncoding chloroplast DNA regions
that have previously been used in seed plant studies. Of these primer sets eight appear to be nearly universally capable of amplifying
lycophyte and monilophyte DNAs, and an additional six are useful in at least some groups. To further explore the application of
noncoding chloroplast DNA, we analyzed the relative phylogenetic utility of Wve cpDNA regions for resolving relationships in Botrychium s.l. (Ophioglossaceae). Previous studies have evaluated both the gene rbcL and the trnLUAA–trnFGAA intergenic spacer in this
group. To these published data we added sequences of the trnSGCU–trnGUUC intergenic spacer + the trnGUUC intron region, the
trnSGGA–rpS4 intergenic spacer + rpS4 gene, and the rpL16 intron. Both the trnSGCU–trnGUUC and rpL16 regions are highly variable
in angiosperms and the trnSGGA–rpS4 region has been widely used in monilophyte phylogenetic studies. Phylogenetic resolution was
equivalent across regions, but the strength of support for the phylogenies varied among regions. Of the Wve sampled regions the
trnSGCU–trnGUUC spacer + trnGUUC intron region provided the strongest support for the inferred phylogeny.
 2005 Elsevier Inc. All rights reserved.
Keywords: Botrychium; Chloroplast DNA; Ferns; Lycophytes; Ophioglossaceae; Pteridophytes; Monilophytes
1. Introduction
Chloroplast DNA (cpDNA) sequences are the primary source of characters for phylogenetic studies in
plants. Many early studies focused on protein-coding
gene sequences such as rbcL and were designed to elucidate phylogenetic relationships among higher-level taxa
(e.g., Chase et al., 1993). Subsequently, the potential utility of noncoding regions of the chloroplast genome was
recognized for lower-level (intergeneric, interspeciWc,
*
Corresponding author. Fax: +1 865 974 2258.
E-mail address: [email protected] (R.L. Small).
1055-7903/$ - see front matter  2005 Elsevier Inc. All rights reserved.
doi:10.1016/j.ympev.2005.04.018
and intraspeciWc) studies (e.g., Taberlet et al., 1991).
Noncoding regions such as introns and intergenic spacers often display more variation on a per site basis than
coding regions, presumably due to fewer functional
constraints.
In angiosperm systematics the application of noncoding cpDNA sequences to low-level phylogenetic studies
is now routine (e.g., Shaw et al., 2005; and references
therein). A large number of diVerent noncoding regions
of the chloroplast genome have been investigated in
angiosperms, some of which are highly variable while
others show relatively little variation (Shaw et al., 2005).
These investigations have been facilitated by the large
510
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
number of complete chloroplast genome sequences that
are available from a wide phylogenetic array of angiosperms. The availability of these genome sequences has
provided the opportunity to develop universal angiosperm PCR primers in conserved coding regions that
Xank the more variable noncoding regions.
Molecular systematic studies in lycophytes and
monilophytes (leptosporangiate ferns, eusporangiate
ferns, Psilotaceae, and Equisetaceae; see Pryer et al.,
2004) have generally relied on a subset of the sequences
used in angiosperm systematics. The gene rbcL has been
used extensively in studies for both higher-level and
lower-level taxa (Dubuisson, 1997; Dubuisson et al.,
1998, 2003; Gastony and Johnson, 2001; Geiger and
Ranker, 2005; Hasebe et al., 1993, 1994, 1995; HauXer
and Ranker, 1995; HauXer et al., 1995, 2003; Hauk,
1995; Hauk et al., 2003; Hennequin et al., 2003; Kato
and Setoguchi, 1998; Korall and Kenrick, 2002, 2004;
Little and Barrington, 2003; Murakami et al., 1999; Nakazato and Gastony, 2003; Pinter et al., 2002; Pryer, 1999;
Pryer et al., 2001a,b, 2004, 1995; Ranker et al., 2003,
2004; Sano et al., 2000; Schneider et al., 2002, 2004a,c;
Skog et al., 2004; Wolf, 1995; Wolf et al., 1999). Other
genes such as atpB (Pryer et al., 2001a, 2004; Ranker
et al., 2003, 2004; Wolf, 1997) and rpS4 (Guillon, 2004;
Hennequin et al., 2003; Pryer et al., 2001a, 2004; Sanchez-Baracaldo, 2004a,b; Schneider et al., 2002, 2004c;
Smith and CranWll, 2002) have also been employed.
Among noncoding cpDNA regions, relatively few have
been used in lycophyte and monilophyte studies with the
trnLUAA–trnFGAA intergenic spacer (Taberlet et al.,
1991) being the most widely used by far (Eastwood et al.,
2004; Geiger and Ranker, 2005; HauXer et al., 2003;
Hauk et al., 2003; Pinter et al., 2002; Ranker et al., 2003;
Rouhan et al., 2004; Schneider et al., 2004a,c; Skog et al.,
2002, 2004; Smith and CranWll, 2002; Su et al., 2005; Van
den Heede et al., 2003; Wikstrom et al., 1999) as it is in
angiosperms (Shaw et al., 2005). The trnSGGA–rpS4
intergenic spacer has also been used in a number of
recent studies (Guillon, 2004; Hennequin et al., 2003;
Perrie et al., 2003; Rouhan et al., 2004; Sanchez-Baracaldo, 2004a,b; Schneider et al., 2004b,c; Skog et al.,
2004; Smith and CranWll, 2002). The relatively rare use
of noncoding regions in lycophyte and monilophyte systematics is due in part to the necessary reliance on PCR
primers developed in angiosperm systematics. Unlike
angiosperms, only three complete chloroplast genomes
are available for lycophytes and monilophytes: Adiantum capillus-veneris (Wolf et al., 2003; GenBank Accession No. NC_004766), Huperzia lucidula (Wolf et al.,
2005; GenBank Accession No. AY660566), and Psilotum
nudum (Wakasugi et al., unpublished data, GenBank
Accession No. NC_003386). Despite the availability of
potential primers for numerous regions, many of the
PCR primers published for angiosperm studies may not
work in lycophytes or monilophytes due either to
sequence diVerences in the primer binding sites or rearrangements of the chloroplast genome.
Shaw et al. (2005) evaluated the ampliWcation and phylogenetic utility of 21 diVerent noncoding cpDNA regions
in a wide range of seed plant lineages. The purpose of the
present study was to evaluate the potential applicability of
these regions in lycophytes and monilophytes. To that end
we surveyed 30 species that represent the phylogenetic
breadth of lycophyte and monilophyte lineages (Hasebe
et al., 1995; Pryer et al., 1995, 2001a, 2004). Using these
exemplars we determined whether or not a subset of the
PCR primers used in the Shaw et al. (2005) study would
work in lycophytes and monilophytes. Several additional
regions were surveyed that were not included in the Shaw
et al. (2005) study, and in some cases new primers were
developed speciWcally for lycophytes and monilophytes.
Finally, to evaluate the relative phylogenetic utility of
some of these regions we ampliWed and sequenced three
cpDNA regions for members of Botrychium s.l. and Helminthostachys (Ophioglossaceae). Previous phylogenetic
studies in Ophioglossaceae have employed data from
rbcL and the trnLUAA–trnFGAA intergenic spacer (Hauk
et al., 2003). To complement these data and assess relative phylogenetic utility of diVerent regions we ampliWed
and sequenced two cpDNA regions that are particularly
useful in seed plants: the trnSGCU–trnGUUC intergenic
spacer + the trnGUUC intron (hereafter trnS–trnG–trnG);
and the rpL16 intron. In addition we ampliWed and
sequenced the trnSGGA–rpS4 intergenic spacer + rpS4
gene because it has become widely employed in monilophyte molecular systematics (Guillon, 2004; Hennequin
et al., 2003; Perrie et al., 2003; Rouhan et al., 2004; Sanchez-Baracaldo, 2004a,b; Schneider et al., 2004b,c; Skog
et al., 2004; Smith and CranWll, 2002).
2. Materials and methods
2.1. Plant materials
Thirty species representing a broad phylogenetic
range of lycophyte and monilophyte lineages were
included in the study (Table 1). These included representatives of all three lycophyte families (Isoëtaceae, Lycopodiaceae, and Selaginellaceae) as well as a range of
monilophyte families (eusporangiate ferns including Psilotaceae and Equisetaceae, and leptosporangiate ferns).
Materials were either from Weld collections or greenhouse grown plants. DNAs of Cyatheaceae species were
provided by D. Conant (Lyndon State College, VT).
Species of Ophioglossaceae chosen for detailed analysis
represent Helminthostachys and Botrychium s.l., the latter now segregated into Botrychium s.s., Sceptridium, and
Botrypus (Hauk et al., 2003; Table 1). Based on the
Ophioglossaceae phylogeny of Hauk et al. (2003)
Helminthostachys zeylanica was chosen as the outgroup.
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
511
Table 1
Lycophyte and monilophyte taxa sampled for cpDNA ampliWcation, and Ophioglossaceae (Botrychium s.l. and Helminthostachys) species sampled
for DNA sequencing
Family
Taxon
Source
Voucher
Lycophytes
1 Lycopodiaceae
2 Selaginellaceae
3 Isoëtaceae
Huperzia lucidula
Selaginella arenicola
Isoëtes Xaccida
Carter Co., Tennessee, USA
Lake Co., Florida, USA
Wakulla Co., Florida, USA
R. Small 162
J. Beck 6004
R. Small 296
Eusporangiate Ferns
4 Equisetaceae
5 Psilotaceae
6 Ophioglossaceae
7 Marattiaceae
Equisetum sp.
Psilotum nudum
Ophioglossum vulgatum
Angiopteris evecta
Greenhouse
Greenhouse
Greenhouse
Greenhouse
R. Small 284
R. Small 285
R. Small 286
R. Small 287
Leptosporangiate Ferns
8 Osmundaceae
9 Hymenophyllaceae
10 Schizaeaceae
11 Marsileaceae
12 Salviniaceae
13 Cyatheaceae
14 Cyatheaceae
15 Pteridaceae
16 Pteridaceae
17 Pteridaceae
18 Pteridaceae
19 Dennstaedtiaceae
20 Aspleniaceae
21 Woodsiaceae
22 Woodsiaceae
23 Woodsiaceae
24 Woodsiaceae
25 Dryopteridaceae
26 Dryopteridaceae
27 Dryopteridaceae
28 Davalliaceae
29 Davalliaceae
30 Polypodiaceae
Osmunda cinnamomea
Trichomanes petersii
Lygodium japonicum
Marsilea quadrifolia
Salvinia sp.
Cnemidaria horrida
Cyathea arborea
Adiantum pedatum
Cheilanthes lanosa
Pellaea atropurpurea
Ceratopteris richardii
Dennstaedtia punctilobula
Asplenium platyneuron
Cystopteris protrusa
Onoclea sensibilis
Deparia achrostichoides
Athyrium felix–femina
Dryopteris marginalis
Cyrtomium sp.
Polystichum acrostichoides
Nephrolepis sp.
Davallia sp.
Polypodium appalachianum
Graham Co., North Carolina, USA
Graham Co., North Carolina, USA
Greenhouse
Greenhouse
Greenhouse
Graham Co., North Carolina, USA
Blount Co., Tennessee, USA
Knox Co., Tennessee, USA
Greenhouse
Blount Co., Tennessee, USA
Blount Co., Tennessee, USA
Graham Co., North Carolina, USA
Sevier Co., Tennessee, USA
Graham Co., North Carolina, USA
Graham Co., North Carolina, USA
Graham Co., North Carolina, USA
Greenhouse
Sevier Co., Tennessee, USA
Greenhouse
Greenhouse
Graham Co., North Carolina, USA
E. Lickey 03–30
E. Lickey 03–27
R. Small 288
R. Small 289
R. Small 290
D. Conant 4859
D. Conant 4822
E. Lickey 03–25
E. Lickey 03–22
R. Small 295
R. Small 291
E. Lickey 03–24
R. Small 283
E. Lickey 03–28
E. Lickey 03–33
E. Lickey 03–22
E. Lickey 03–31
E. Lickey 03–26
R. Small 292
E. Lickey 03–34
R. Small 293
R. Small 294
E. Lickey 03–29
Botrychium campestre
Botrychium simplex
Botrychium lunaria
Botrychium lanceolatum
Sceptridium dissectum
Sceptridium japonicum
Sceptridium lunarioides
Botrypus virginianus
Botrypus strictus
Helminthostachys zeylanica
Iowa, USA
Mt. Ashland, Oregon, USA
Marathon, Ontario, Canada
Chippewa Co., Michigan, USA
Chapel Hill, North Carolina, USA
Japan
Dale Co., Alabama, USA
Alger Co., Michigan, USA
Japan
Japan
Farrar s.n., ISC
Hauk 619, NCU
Hauk 564, NCU
Hauk 571, NCU
Hauk 621, NCU
Sahashi s.n., TOHO, DEN
Watkins 29, ISC
Hauk 575, NCU
Sahashi s.n., TOHO, DEN
Sahashi s.n., TOHO, DEN
Ophioglossaceae
Numbers correspond to lanes in Fig. 2. Voucher specimens are deposited at the University of Tennessee Herbarium (TENN) unless otherwise noted.
2.2. Molecular methods
DNAs obtained speciWcally for this study were
extracted from leaf material (stem material from Equisetum and Psilotum) using the Plant DNeasy Mini Kit
(Qiagen); DNA extraction and PCR ampliWcation protocols for Ophioglossaceae were previously described by
Hauk et al. (2003). PCR ampliWcation was performed in
25 L reactions with the following components: 1 L
total genomic DNA (»10–100 ng), 1£ PCR buVer (PanVera/TaKaRa), 200 M each dNTP, 3.0 mM MgCl2
(except for trnS–trnG–trnG which used 1.5 mM MgCl2),
0.2 g/L bovine serum albumin, 0.1 mM each primer,
and 0.625 U rTaq DNA polymerase (PanVera/TaKaRa).
PCR ampliWcation primers are described in Table 2. All
PCR experiments included a negative control (no DNA)
reaction to monitor for contamination. Most regions
were PCR ampliWed using the following cycling conditions: 30 cycles of 95 °C 1 min, 50 °C 1 min followed by a
slow ramp (1 °C/8 s) to 65 °C, 65 °C 4 min. The trnS–
trnG–trnG region was ampliWed using a 2-step PCR
cycling protocol: 30 cycles of 94 °C 1 min, 66 °C 4 min.
The trnT–trnL spacer, trnL intron, trnL–trnF spacer, and
rpS16 regions were ampliWed using the following cycling
512
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
Table 2
Chloroplast DNA regions ampliWed with primers
Region
Primers
Reference
psbA–trnHGUG spacer
trnHGUG: CGC GCA TGG TGG ATT CAC AAT CC
psbA: GTT ATG CAT GAA CGT AAT GCT C
trnK-3914F: TGG GTT GCT AAC TCA ATG G
trnK-2R: AAC TAG TCG GAT GGA GTA G
rpS16-F-fern: AAR CGR TRT GGT AGR AAG CAA
rpS16-R-fern: CGR GAT TGR RCA TCA ATT GCA A
trnSGCU: AGA TAG GGA TTC GAA CCC TCG GT
3⬘ trnGUUC: GTA GCG GGA ATC GAA CCC GCA TC
a
trnG5⬘ 2G: GCG GGT ATA GTT TAG TGG TAA AA
a
trnG5⬘ 2S: TTT TAC CAC TAA ACT ATA CCC GC
atpF-F: TAT YTT GGA RAG GGA GTG T
atpF-R-fern: TTA RGY TTA TCA GTA GCT TCT
trnCGCAF: CCA GTT CRA ATC YGG GTG
psbMR: ATG GAA GTA AAT ATT CTY GCA TTT ATT GCT
psbMF: AGC AAT AAA TGC RAG AAT ATT TAC TTC CAT
trnDGUCR: GGG ATT GTA GYT CAA TTG GT
rpoB: CKA CAA AAY CCY TCR AAT TG
trnCGCAF: CCA GTT CRA ATC YGG GTG
ycf3.x1.F: GCW TTT ACY TAT TAY AGA GAT G
ycf3.x3.R: TNG AAT GGC CTG TTC TCC
trnSGGA: TTA CCG AGG GTT CGA ATC CCT C
rps4.5⬘: ATG TCS CGT TAY CGA GGA CCT
a2: CAA ATG CGA TGC TCT AAC CT
b: TCT ACC GAT TTC GCC ATA TC
c: CGA AAT CGG TAG ACG CTA CG
d: GGG GAT AGA GGG ACT TGA AC
e: GGT TCA AGT CCC TCT ATC CC
f: ATT TGA ACT GGT GAC ACG AG
trnVUAC: GGC TAT ACG GRY TYG AAC CGT A
trnMCAU: CCT ACT ATT GGA TTY GAA CCA ATG ACT C
trnPUGG: TGT AGC GCA GCY YGG TAG CG
petG2: CAA TAY CGA CGK GGY GAT CAA TT
5⬘ rpS12: ATT AGA AAN RCA AGA CAG CCA AT
rpL20: CGY YAY CGA GCT ATA TAT CC
psbB: TCC AAA AAN KKG GAG ATC CAA C
psbH: TCA AYR GTY TGT GTA GCC AT
rpL16-F-fern: ATG CTT AGT GTG YGA CTC GTT
rpL16-R-fern: TCC SCN ATG TTG YTT ACG AAA T
b
8R: GCT ATG CTT AGT GTG TGA CTC
b
1067F: CTT CCT CTA TGT TGT TTA CG
Tate and Simpson (2003)
Sang et al. (1997)
Johnson and Soltis (1994)
Johnson and Soltis (1994)
This paper
This paper
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
This paper
This paper
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
This paper
This paper
Shaw et al. (2005)
Souza-Chies et al. (1997)
Cronn et al. (2002)
Taberlet et al. (1991)
Taberlet et al. (1991)
Taberlet et al. (1991)
Taberlet et al. (1991)
Taberlet et al. (1991)
This paper
This paper
This paper
This paper
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
Shaw et al. (2005)
This paper
This paper
Asmussen (1999)
Asmussen (1999)
trnKUUU intron/matK gene
rpS16 intron
trnSGCU–trnGUUC spacer + intron
atpF intron
trnCGCA–psbM spacer
psbM–trnDGUC spacer
trnCGCA–rpoB spacer
ycf3 introns
trnSGGA–rpS4 spacer + gene
trnTUGU–trnLUAA spacer
trnLUAA intron
trnLUAA–trnFGAA spacer
trnVUAC–trnMCAU intron + spacer
trnPUGG–petG spacer
rpL20-rpS12 spacer
psbB-psbH spacer
rpL16 intron
a
b
Internal primers used for sequencing the trnS–trnG region in Ophioglossaceae only.
rpL16 primers used for ampliWcation and sequencing of Botrychium s.s. and S. dissectum.
conditions: 30 cycles of 94 °C 1 min, 50 °C 1 min, 72 °C
2 min.
The chloroplast regions we screened (Table 2) include
both introns and intergenic spacers. The speciWc regions
we chose to screen were based on (1) the results of studies
of seed plant cpDNA (Shaw et al., 2005); (2) reference to
the literature for noncoding cpDNA regions used in previous lycophyte and monilophyte studies; and (3) examination of the noncoding regions found in the completely
sequenced Adiantum and Psilotum chloroplast genomes.
These regions are all found in the large single copy (LSC)
region of angiosperm chloroplast genomes, and most are
also found in the LSC of the Adiantum and Psilotum chloroplast genomes as shown in Fig. 1. Some rearrangements
(e.g., inversions, translocations) of monilophyte chloroplast genomes relative to angiosperm chloroplast genomes
are apparent upon comparison of the cpDNA genome
maps of Psilotum and Adiantum with typical angiosperms
such as Nicotiana (Wakasugi et al., 1998) (Fig. 1). Further,
some genes are present in some species’ chloroplast
genomes, but not in others (Fig. 1). Most primers used in
this study (Table 2) were previously described from angiosperm studies. A few primer sets, however, were designed
speciWcally for this study (atpF, ycf3, trnP–petG, trnM–
trnV, rpL16, rpS16; see Table 2).
For sequencing of trnS–trnG–trnG, trnS–rpS4, and
some rpL16 in Ophioglossaceae (Botrypus virginianus,
B. strictus, Sceptridium japonicum, and Helminthostachys
zeylanica), PCR products were cleaned prior to sequencing using the ExoSAP-IT kit (United States Biochemical). PuriWed PCR products were sequenced with the ABI
Prism Big Dye Terminator cycle sequencing kit v. 3.1 and
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
513
Fig. 1. Comparative maps of the Large Single Copy (LSC) region of the two completely sequenced monilophyte chloroplast genomes (Psilotum and
Adiantum) relative to a typical angiosperm (Nicotiana) chloroplast genome. Gene acronyms are shown in order from the top (junction of Inverted
Repeat B and LSC) to bottom (junction of LSC and Inverted Repeat A) only to show gene order and presence—no indication of size of regions is
inferred. Noncoding cpDNA regions ampliWed for this study are shown as black boxes on the Nicotiana map. DiVerences between gene arrangement
or presence/absence are shown on the map and are indicated by letter: (A) no genes exist between accD and rbcL in Nicotiana, but a trnRCCG gene is
found here in Psilotum and a trnSeCUCA gene (coding for the modiWed amino acid selenocysteine) in Adiantum. (B) A trnTUGU gene is found here in
both Psilotum and Nicotiana, but is missing in Adiantum. (C) An inversion of the trnT GGU–psbD–pbsC–trnSUGA–ycf9–trnGGCC region is present in
both Adiantum and Psilotum relative to Nicotiana. (D) An inversion and translocation of the trnC GCA–ycf6–psbM region is found in Adiantum relative to Nicotiana and Psilotum. (E) The trnDGUC gene has been translocated in Adiantum relative to Nicotiana and Psilotum. (F) The ycf12 gene is
present in Psilotum and Adiantum, but missing in Nicotiana. (G) The psaM gene is present in Psilotum, but missing in Adiantum and Nicotiana. (H)
The trnSCGA gene is present in Psilotum, but missing in Adiantum and Nicotiana. (I) The chlB gene is present in Adiantum, but missing in Psilotum and
Nicotiana. (J) The rpS16 gene is present in Adiantum and Nicotiana, but missing in Psilotum. (K) The trnKUUU gene plus the matK gene which is
encoded in the trnK intron are present in both Psilotum and Nicotiana, but the trnK exons are missing in Adiantum. (L) The psbA–trnHGUG region is
present in all three chloroplast genomes, but has been translocated into the inverted repeat in Adiantum.
run on an ABI Prism 3100 automated sequencer (University of Tennessee Molecular Biology Resource Facility).
Sequencing electropherograms were assembled and
edited using Sequencher 4.1.2 (GeneCodes). The rpL16
sequences of Botrychium s.s., and Sceptridium dissectum
were cloned using the Qiagen PCR Cloning kit according
to the manufacturers recommendations (Valencia, CA).
The rpL16 sequence of S. lunarioides was sequenced
directly from ampliWed product puriWed using a QIAquick PCR PuriWcation kit (Valencia, CA). These templates were sequenced with the ABI Prism BigDye
Terminator Cycle Sequencing Reaction Kit and run on
an ABI 373XL Stretch DNA sequencer.
2.3. Analyses
To evaluate the ampliWcation success of each of the 18
noncoding regions in the 30 lycophyte and monilophyte
lineages, PCR ampliWcation products were run on 1.5%
agarose gels and digitally documented. A subset of the
PCR products was sequenced to conWrm their identity.
For each cpDNA region that was successfully ampliWed
three of the PCR products were sequenced. In most cases
PCR products from one lycophyte, one eusporangiate
fern, and one leptosporangiate fern were sequenced.
To assess the utility of the trnS–trnG–trnG, trnS–
rpS4, and rpL16 regions in Botrychium s.l. relative to
514
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
available data from rbcL and trnL–trnF a number of
diVerent approaches were used. First, for each data set
descriptive statistics were calculated (sequence length,
number and percentage of variable characters, number
and percentage of phylogenetically informative characters). In addition, phylogenetic analyses of each data set
were performed individually to compare levels of resolution and support (branch lengths, bootstrap and decay
values, consistency and retention indices).
Sequences were initially aligned using Clustal_X
(Thompson et al., 1997), and alignments were manually
reWned in MacClade 4.0 (Maddison and Maddison,
2000). For phylogenetic analysis gaps in the alignment
were treated as missing data, but the individual gaps
were subsequently coded as binary characters and
added to the end of the sequence matrix. Phylogenetic
analyses were performed using the optimality criterion
of maximum parsimony in PAUP* 4.0b10 (SwoVord,
2002). Exhaustive searches were conducted to Wnd all
maximally parsimonious trees, bootstrap support was
estimated using 1000 bootstrap replicates with branch
and bound searches, and decay analyses were conducted
with a reverse-constraints approach as implemented in
TreeRot v. 2 (Sorenson, 1999). One 52 bp region of the
rpL16 data set that consisted almost entirely of varying
lengths of runs of A and G nucleotides was excluded
from phylogenetic analysis due to ambiguous
alignment.
3. Results
3.1. AmpliWcation of noncoding cpDNA in lycophytes and
monilophytes
Eighteen primer sets (Table 2) were screened for their
ability to amplify noncoding cpDNA regions in 30 lycophyte and monilophyte species (Table 1). Of those 18
primer sets screened, eight primer sets showed good
ampliWcation (a single strong band) in most species. Six
other primer sets showed good ampliWcation in a subset
of species screened. Finally, four primer sets produced
either no ampliWcation products or resulted in the ampliWcation of multiple weak products or smears. Fig. 2
shows representative gel pictures for those regions that
ampliWed in at least some species. This information is
summarized in Fig. 3.
One particular taxon (Selaginella) was problematic in
these ampliWcation experiments. Despite trying ampliWcation from DNA of three diVerent Selaginella species
(S. apoda, S. arenicola, and S. kraussiana) we consistently had diYculty getting good ampliWcation from
Selaginella even for those cpDNA regions that worked
in all other species tested (see lane 2 of Fig. 2).
To conWrm that the target region was ampliWed using
these PCR primers and conditions we sequenced a sub-
set of the ampliWcation products and used BLAST
(Altschul et al., 1990) to search GenBank for matching
sequences. In all cases the sequenced PCR product
matched sequences in GenBank from the appropriate
cpDNA region.
It should be noted that the PCR conditions used in
these ampliWcation experiments were those we have
found to be generally useful across a wide range of templates and primers. Given the large number of taxa and
cpDNA regions, we did not attempt to optimize reaction
conditions for each region. It is apparent from evaluation of Fig. 2 that in some cases multiple PCR products
were ampliWed or ampliWcation was weak in some taxa.
Further optimization of PCR conditions (e.g., annealing
temperature, MgCl2 concentration) would likely
improve the ampliWcation of those regions. Additionally,
several region-speciWc issues also became apparent during the course of this investigation and are discussed in
the following paragraphs.
The trnKUUU intron/matK gene region is widely used
in seed plant systematics, but did not amplify in our
experiments. As discussed by Wolf et al. (2003), while
the matK gene is present in Adiantum, a large inversion
(Hasebe and Iwatsuki, 1990) has an endpoint near
matK and no trnK exons have been detected in
Adiantum.
The trnCGCA–rpoB region in Adiantum has undergone
a small inversion relative to its orientation in angiosperm chloroplast DNA (Fig. 1). As a result, the trnCGCA
gene is in a reverse orientation in Adiantum relative to
angiosperms. To account for this in our ampliWcation
experiments we used a primer on the opposite strand of
trnCGCA relative to the primer usually used in angiosperms (see e.g., Shaw et al., 2005).
The trnL intron and trnL-F intergenic spacer has been
used in a previous phylogenetic study in Huperzia (Wikstrom et al., 1999). The length of the trnL intron + trnL-F
spacer reported by Wikstrom et al. (1999) from H. lucidula, however, is signiWcantly shorter than the size of the
corresponding PCR products obtained in this study. The
combined trnL intron + trnL-F spacer sequence (GenBank Accession No. AJ224591) used by Wikstrom et al.
(1999) is 833 bp. In our ampliWcation experiments the
trnL intron from H. lucidula is ca. 500 bp (which agrees
with the GenBank accession), but the trnL-F spacer is ca.
1500 bp (Fig. 2). This apparent discrepancy is due to the
use of only a partial sequence by Wikstrom et al. (1999;
and N. Wikstrom, pers. comm.). Further, the size of these
regions in the complete chloroplast genome sequence for
H. lucidula (Wolf et al., 2005; GenBank Accession No.
AY660566) is consistent with our results.
3.2. Phylogeny of Botrychium s.l.
Sequences of the trnS–trnG intergenic spacer + the
trnG intron, the rpL16 intron, and the trnS–rpS4
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
515
Fig. 2. Gel photos showing the ampliWcation success of the noncoding cpDNA regions tested in 30 lycophyte and monilophyte species. Only those
regions in which ampliWcation for at least some species was successful are shown. Lane numbers are the same across all photos and match the numbers given in Table 1. In each gel photo a molecular weight marker is shown at each end and in the middle [band sizes in decreasing order: 2.68, 2.0,
1.5, 1.2, 1.0 kb (brighter band), 0.9–0.1 kb in 0.1 kb increments].
516
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
Fig. 3. Summary of ampliWcation success of the 18 noncoding cpDNA regions tested in 30 lycophyte and monilophyte taxa. Black boxes indicate a
single strong band ampliWed for this region from this species. Grey boxes indicate that a weak band ampliWed, or that multiple bands ampliWed for
this region from this species. Blank boxes indicate that no ampliWcation product was observed for this region from this taxon.
spacer + rpS4 gene were obtained for nine species of
Botrychium s.l. and the outgroup Helminthostachys
zeylanica. These newly generated sequences have been
deposited in GenBank (Accession Nos. AY870407–
AY870436). The species chosen for this analysis (Table
1) are a subset of the species included in larger analyses
of the family (Hauk et al., 2003) and represent all of the
major “botrychioid” clades recovered in those analyses.
Phylogenetic analyses of the three new sequence data
sets (rpL16, trnS–rpS4, trnS–trnG–trnG) and the equivalent data sets from the previously published analyses
(rbcL, trnL–trnF) were performed independently. Phylogenetic analyses recovered a single most parsimonious
tree from each data set except for rpL16 from which
three equally parsimonious trees were recovered. All
data sets recovered an identical topology (Fig. 4) with
the exception of the B. simplex/B. lunaria/B. campestre
clade. Two of the data sets (rbcL, trnL–trnF) found a
topology of (B. lunaria (B. simplex, B. campestre)); two
of the data sets (trnS–trnG–trnG, trnS–rpS4) found a
topology of (B. campestre (B. lunaria, B. simplex)); the
strict consensus tree of the three trees recovered in the
rpL16 analysis had a polytomy with relationships among
these three species unresolved. The strict consensus tree
resulting from comparison of trees recovered from the
independent data sets is shown in Fig. 4, as are the support measures for each node from the diVerent data sets
(character state changes, bootstrap values, decay values
for each node).
Data set characteristics (sequence length, number of
variable and parsimony-informative nucleotide substitutions and indels, consistency index, retention index, and
tree length) are described in Table 3. While Table 3
shows each noncoding region separately for comparison
(e.g., trnS–trnG spacer and trnG intron; trnS–rpS4 spacer
and rpS4 gene) as well as combined into ampliWed units
(e.g., trnS–trnG spacer + trnG intron; trnS–rpS4
spacer + rpS4 gene) the following descriptions focus on
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
517
Fig. 4. Consensus phylogenetic tree from analyses of sequence data from Wve cpDNA regions for Botrychium s.l. + Helminthostachys. Relative measures of support (s, steps; b, bootstrap; d, decay) for each of the numbered nodes are shown for each of the Wve data sets.
Table 3
Characteristics of the Wve cpDNA sequence data sets for Botrychium s.l.
Data set
Aligned sequence
length (range)
nucleotides
Number (%)
variable
nucleotide
substitutions
Number (%)
informative
nucleotide
substitutions
Number of
indels
(informative
indels)
Consistency
index/retention
index
Tree
length
rbcL gene
trnL–trnF spacer
rpL16 intron
trnS–rpS4 spacer + gene
trnS–rpS4 spacer
rpS4 gene
trnS–trnG spacer + intron
trnS–trnG spacer
trnG intron
1330 (1321–1330)
369 (305–368)
791 (726–747)
956 (938-949)
379 (360–372)
577 (577–577)
1830 (1699–1771)
1047 (924–991)
760 (749–757)
158 (11.9%)
173 (46.9%)
227 (28.7%)
246 (25.7%)
139 (36.6%)
107 (18.5%)
458 (25.0%)
278 (26.6%)
180 (23.7%)
58 (4.4%)
60 (16.3%)
81 (10.2%)
77 (8.1%)
40 (10.6%)
37 (6.4%)
181 (9.9%)
119 (11.4%)
62 (8.2%)
0 (0)
19 (5)
29 (2)
11 (1)
11(1)
0 (0)
38 (6)
28 (4)
10 (2)
0.87/0.76
0.85/0.74
0.88/0.81
0.88/0.77
0.86/0.70
0.92/0.85
0.83/0.73
0.81/0.72
0.86/0.74
191
227
282
297
177
120
597
371
227
The trnS–rpS4 spacer + gene and trnS–trnG spacer + trnG intron data sets were each analyzed together, but are shown both separated into individual
units and together here for comparison.
the combined data sets because these were used for the
phylogenetic analyses. Consistency and retention indices are generally similar across data sets, ranging from
0.83–0.88 to 0.73–0.81, respectively. The data sets vary
widely in size (aligned length) with trnL–trnF being the
smallest (369 nt) and trnS–trnG–trnG being the largest
(1830 nt). Numbers and percentages of variable and parsimony-informative sites also varied considerably across
data sets. The lowest numbers and percentages of both
variable and parsimony-informative sites were obtained
with rbcL, as expected given the conserved nature of this
gene. Among the other sequenced regions the trnL–trnF
intergenic spacer provided the highest percentage of variable (46.9%) and parsimony-informative (16.3%) sites,
while at the same time providing the lowest overall num-
bers of variable (173) and parsimony-informative (60)
sites. The trnS–trnG–trnG region provided the greatest
number of both variable (458) and parsimony-informative
(181) sites, with percentages similar to the other regions.
As expected, the number of variable and parsimonyinformative sites in a given data set is associated with the
overall sequence length of the data set. In an analysis of
cpDNA sequence variation in seed plants Shaw et al.
(2005) showed that sequence length accounted for anywhere from 22 to 83% of the variation in the number of
variable characters observed in a data set. To assess the
relationship between sequence length and the number of
variable and parsimony-informative characters in our
Botrychium s.l. + Helminthostachys data sets we
regressed sequence length by number of both variable
518
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
Fig. 5. Scatter plot of sequence length vs. numbers of variable and parsimony-informative characters for individual data sets. 䊏, indicates
parsimony-informative characters in noncoding regions; 䉬, indicates
variable characters in noncoding regions; 䊉, indicates parsimonyinformative (PI) characters in the gene rpS4; , indicates variable
(var) characters found in the gene rpS4; , indicates parsimony-informative characters in rbcL; 䉱, indicates variable characters in rbcL. A
line of best Wt was calculated for sequence length vs. variable characters in the noncoding regions (upper line), and sequence length vs. parsimony-informative characters in the noncoding regions (lower line).
and parsimony-informative sites for the noncoding
regions sequenced (Fig. 5). For this analysis each region
was separated into individual noncoding regions (trnL–
trnF spacer, trnS–rpS4 spacer, rpL16 intron, trnG intron,
and trnS–trnG spacer). This analysis indicates that 81%
of the variation in the number of variable sites and 79%
of the variation in the number of parsimony-informative
sites is explained by sequence length. Equivalent data for
the genes rbcL and rpS4 are also shown in Fig. 5,
although these data were not included in the regression
analyses.
4. Discussion
4.1. AmpliWcation of noncoding cpDNA in lycophtes and
monilophytes
Most lycophyte and monilophyte molecular phylogenetic studies have relied on a small number of cpDNA
sequences, namely the gene rbcL, the trnL–trnF intergenic spacer, and the trnS–rpS4 intergenic spacer + rpS4
gene. In many cases, these data sets have provided suYcient phylogenetic resolution, while in other cases, especially in studies of very closely related species or
intraspeciWc variation, insuYcient resolution is obtained
due to a paucity of phylogenetically informative characters. This situation is similar to angiosperm studies
where a few popular regions are predominantly used. A
recent study in seed plants (Shaw et al., 2005) demonstrated that several rarely used cpDNA regions were
generally much more variable than the widely used
regions. The present study was undertaken to assess the
potential applicability of some of these same regions in
lycophyte and monilophyte studies.
The PCR-ampliWcation experiments shown in Fig. 2
and summarized in Fig. 3 demonstrate that a wide variety of cpDNA regions can be ampliWed in a broad range
of lycophytes and monilophytes. Eight regions ampliWed
universally or nearly universally (psbA–trnH, trnS–trnG–
trnG, trnS–rpS4, trnL, trnL–trnF, trnM–trnV, trnP–petG,
and rpL16). Six other regions ampliWed well in a subset
of taxa (rpS16, atpF, trnC–rpoB, psbM–trnC, trnD–
psbM, and ycf3). Finally, four regions ampliWed poorly
or not at all from most taxa (trnK/matK, psbB–psbH,
rps12–rpL20, and trnT–trnL).
4.2. Relative phylogenetic utility of Wve data sets in
Botrychium s.l.
Tso test the relative phylogenetic utility of diVerent
cpDNA sequences in resolving relationships, we
analyzed representative species of Botrychium
s.l. + Helminthostachys. Previously published work
(Hauk et al., 2003) used rbcL and trnL–trnF sequences to
address relationships in a larger analysis of Ophioglossaceae. Both of these data sets provided similar and compatible resolution of relationships although support for
clades varied between data sets. To complement and
compare these published data sets we generated data for
nine species of Botrychium s.l. + Helminthostachys from
three additional cpDNA regions: the rpL16 intron, the
trnS–rpS4 intergenic spacer + rpS4 gene, and the trnS–
trnG intergenic spacer + trnG intron. With the exception
of the Botrychium s.s. clade, phylogenetic resolution was
comparable across all data sets (Fig. 4).
Relative levels of support, on the other hand, as measured by branch lengths, bootstrap values, and decay values varied widely between data sets (Fig. 4). Bootstrap
values were generally similar across data sets for those
nodes that are strongly supported in all data sets (e.g.,
nodes 1, 2, and 3 in Fig. 4). For those nodes that are relatively weakly supported in some data sets, however,
bootstrap values varied considerably. For example, node
6 in Fig. 4 (the S. japonicum + S. dissectum clade) has
bootstrap values of 58, 84, 74, 98, and 91% in rbcL, trnL–
trnF, rpL16, trnS–rpS4, and trnS–trnG–trnG, respectively.
Branch lengths and decay values varied even more
widely among data sets than did bootstrap values. For
every node the trnS–trnG–trnG data set provided the
longest branches (i.e., the most character support). Often
the diVerences in branch lengths are dramatic. For example, node 3 in Fig. 4 has branch lengths of 10, 15, 18, 14,
and 42 in rbcL, trnL–trnF, rpL16, trnS–rpS4, and trnS–
trnG–trnG, respectively. Decay values follow a similar
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
pattern with node 3 having decay values of +10, +10,
+14, +10, and +33 in rbcL, trnL trnF, rpL16, trnS–rpS4,
and trnS–trnG–trnG, respectively.
Thus, with respect to the recovered topology all data
sets provide similar results and nearly complete resolution of most relationships. Comparisons of levels of support for the phylogeny, however, reveal diVerences
among data sets and clearly show that some data sets
provide greater support for inferred relationships than
others. Overall, the trnS–trnG–trnG data set provides the
greatest character support, and generally the highest
bootstrap and decay values.
4.3. Relationship between sequence length and variation
There is, of course, an association between sequence
length and the number of phylogenetically informative
characters that a particular region can be expected to
provide. This association is borne out in an analysis of
sequence length vs. numbers of variable and phylogenetically informative characters (Fig. 5). As sequence length
increases, the number of both variable and phylogenetically informative characters also increases with r2 D 0.81
for variable characters and r2 D 0.79 for phylogenetically
informative characters. As expected, the genes rbcL and
rpS4 provide fewer variable or phylogenetically informative characters per unit of sequence compared to the
noncoding regions, presumably due to greater functional
constraints on these genes (Fig. 5).
Although a strong association exists between
sequence length and numbers of variable or phylogenetically informative characters, there remain diVerences in
the numbers of characters that are not accounted for by
sequence length alone (i.e., ca. 20% of the variation). A
portion of this variation is clearly stochastic due to our
Wnite sample size, but some of this variation may be due
to intrinsic diVerences in the phylogenetic utility of the
diVerent regions (see Shaw et al., 2005). These diVerences
in relative levels of variation may reXect the presence of
conserved elements within some noncoding regions such
promoter or regulatory motifs in intergenic spacers, or
conserved secondary structures in introns. Fig. 5 shows a
line of best Wt for both the variable and phylogenetically
informative characters. In the comparison of variable
characters there are three data sets that lie above the line
of best Wt (i.e., have greater than predicted variable characters per unit of sequence): the trnL–trnF spacer, the
rpL16 intron, and trnS–trnG spacer. Two data sets lie
below the line and thus have lower than predicted variable characters per unit of sequence: the trnS–rpS4
spacer and the trnG intron. Further, there are pairs of
sequences with similar lengths, but relatively diVerent
numbers of variable characters. The trnL–trnF data set
was 369 nt long with 173 variable characters while the
trnS–rpS4 data set was 379 nt long, yet contained only
139 variable characters. In other words, the trnS–rpS4
519
data set contained only 80% of the number of variable
characters found in the trnL–trnF data set despite the
fact that they are almost identical in length. Similarly,
the rpL16 intron and trnG intron were 791 and 760 nt
long, with 227 and 180 variable characters, respectively
(i.e., trnG has 79% of the number of variable characters
of rpL16 despite similar lengths). A similar pattern is
seen in the line of best Wt for sequence length vs. phylogenetically informative characters (Fig. 5).
Finally, it should be noted that the genes rbcL and
rpS4 both show considerably lower numbers of variable
and phylogenetically informative characters than the
noncoding regions of similar length (Fig. 5). An advantage of using coding sequences is that they are trivial to
align relative to the sometimes challenging task of aligning noncoding regions. This advantage is clearly outweighed, however, by the lower numbers of variable
characters found in these regions, at least for analyses of
closely related species.
4.4. Choosing an appropriate region for analysis
The addition of the cpDNA noncoding regions
identiWed here to the arsenal of tools available to pteridologists considerably expands the potential sources of
information available for phylogenetic inference. This
leads directly to the question of which particular region
or regions should be employed in any given study.
The analysis of Shaw et al. (2005) identiWed considerable variability in the amount of sequence variation
detected in diVerent noncoding cpDNA regions among
seed plants. In the study of Shaw et al. (2005) the analyzed regions were grouped into “tiers” with “tier 1”
regions providing the greatest number of variable characters, “tier 2” regions providing fewer, and “tier 3”
regions providing the least. Based on these analyses it
was clear that the tier 1 regions should be explored Wrst
for any particular study as they are the most likely to
provide the greatest number of characters. It was also
noted, however, that no one cpDNA region was universally the most informative, and that considerable variation existed among plant lineages as to which cpDNA
region was the most informative. In other words, one
region may be the most informative in one lineage, while
a diVerent region may be the most informative in a
diVerent lineage. The analyses discussed above show that
among the regions surveyed here for Botrychium
s.l. + Helminthostachys, the trnL–trnF spacer, the rpL16
intron, and the trnS–trnG spacer provide greater than
predicted levels of variation while the trnS–rpS4 spacer
and trnG intron provide lower than predicted levels of
variation. Comparative data to determine whether or
not this is generally true across lycophytes and monilophytes are not yet available.
These observations lead to the conclusion that a preliminary survey of several potential cpDNA regions in
520
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
the taxa of interest is a critical step in identifying which
cpDNA region or regions are likely to provide the most
variation in a given lineage (Shaw et al., 2005). Such a
preliminary study can be performed with as few as three
taxa where sequence data from numerous cpDNA
regions are generated from the three exemplars and relative levels of variation are compared across regions for
these three taxa (Shaw et al., 2005). The region or regions
showing the highest level of variation in this preliminary
survey are those most likely to also provide the greatest
number of phylogenetically informative sites in a
broader analysis (Shaw et al., 2005).
4.5. Conclusions
The data and analyses presented here show that numerous cpDNA noncoding regions can be ampliWed in a wide
range of lycophytes and monilophytes, which expands the
number of potential sequences to choose from for phylogenetic studies in these lineages. Comparative sequence
analysis in Botrychium s.l. + Helminthostachys shows that
phylogenetic resolution is consistent among the Wve data
sets employed, but that levels of support for the inferred
phylogeny vary across data sets. In this particular example, coding sequences such as the genes rbcL and rpS4,
provide relatively low levels of sequence variation per unit
of sequence compared to noncoding regions. Among the
noncoding regions sampled the trnL–trnF spacer, the
rpL16 intron, and the trnS–trnG intergenic spacer provide
greater levels of variability per unit of sequence than the
trnS–rpS4 spacer and the trnG intron. These data, taken
together with the conclusions of Shaw et al. (2005), indicate that preliminary studies of the relative phylogenetic
utility of a given cpDNA region should be performed
prior to mounting a full-scale sequencing eVort using any
one region.
Acknowledgments
We thank Dave Conant (Lyndon State College) for
providing DNA of Cyatheaceae species; and the
National Science Foundation, the Hesler Fund from the
University of Tennessee Herbarium, and the Denison
University Research Foundation for funding that supported this research. Paul Wolf and two anonymous
reviewers provided valuable feedback that improved the
manuscript.
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J., 1990.
Basic local alignment search tool. J. Mol. Biol. 215, 403–410.
Asmussen, C.B., 1999. Toward a chloroplast DNA phylogeny of the
tribe Geonomeae (Palmae). Mem. N.Y. Bot. Gard. 83, 121–129.
Chase, M.W., Soltis, D.E., Olmstead, R.G., Morgan, D., Les, D.H.,
Mishler, B.D., Duvall, M.R., Price, R.A., Hills, H.G., Qui, Y.-L.,
Kron, K.A., Rettig, J.H., Conti, E., Palmer, J.D., Manhart, J.R.,
Sytsma, K.J., Michaels, H.J., Kress, W.J., Karol, K.G., Clark,
W.D., Hedren, M., Gaut, B.S., Jansen, R.K., Kim, K.-J., Wimpee,
C.F., Smith, J.A., Furnier, G.R., Strauss, S.H., Xiang, Q.-Y.,
Plunkett, G.M., Soltis, P.S., Swensen, S.M., Williams, S.E., Gadek,
P.A., Quinn, C.J., Eguiarte, L.E., Golenberg, E., Learn, G.H., Graham, S.W., Barrett, S.C.H., Dayanandan, S., Albert, V.A., 1993.
Phylogenetics of seed plants: an analysis of nucleotide sequences
from the plastid gene rbc L. Ann. Missouri Bot. Gard. 80, 528–
580.
Cronn, R.C., Small, R.L., Haselkorn, T., Wendel, J.F., 2002. Rapid
diversiWcation of the cotton genus (Gossypium: Malvaceae)
revealed by analysis of sixteen nuclear and chloroplast genes. Am. J.
Bot. 89, 707–725.
Dubuisson, J.Y., 1997. rbcL sequences: A promising tool for the molecular systematics of the fern genus Trichomanes (Hymenophyllaceae)? Mol. Phylogenet. Evol. 8, 128–138.
Dubuisson, J.Y., Hebant-Mauri, R., Galtier, J., 1998. Molecules and
morphology: conXicts and congruence within the fern genus
Trichomanes (Hymenophyllaceae). Mol. Phylogenet. Evol. 9, 390–
397.
Dubuisson, J.Y., Hennequin, S., Douzery, E.J.P., CranWll, R.B., Smith,
A.R., Pryer, K.M., 2003. rbcL phylogeny of the fern genus Trichomanes (Hymenophyllaceae), with special reference to neotropical
taxa. Int. J. Plant Sci. 164, 753–761.
Eastwood, A., Cronk, Q.C.B., Vogel, J.C., Hemp, A., Gibby, M., 2004.
Comparison of molecular and morphological data on St. Helena:
Elaphoglossum. Plant Syst. Evol. 245, 93–106.
Gastony, G.J., Johnson, W.P., 2001. Phylogenetic placements of Loxoscaphe thecifera (Aspleniaceae) and Actiniopteris radiata (Pteridaceae) based on analysis of rbcL nucleotide sequences. Am. Fern J.
91, 197–213.
Geiger, J.M.O., Ranker, T.A., 2005. Molecular phylogenetics and historical biogeography of Hawaiian Dryopteris (Dryopteridaceae).
Mol. Phylogenet. Evol. 34, 392–407.
Guillon, J.-M., 2004. Phylogeny of horsetails (Equisetum) based on
chloroplast rps4 gene and adjacent noncoding sequences. Syst. Bot.
29, 251–259.
Hasebe, M., Ito, M., Kofuji, R., Ueda, K., Iwatsuki, K., 1993. Phylogenetic relationships of ferns deduced from rbcL gene sequence. J.
Mol. Evol. 37, 476–482.
Hasebe, M., Iwatsuki, K., 1990. Chloroplast DNA from Adiantum
capillus-veneris L, a fern species (Adiantaceae)–clone bank, physical
map and unusual gene localization in comparison with angiosperm
chloroplast DNA. Curr. Genet. 17, 359–364.
Hasebe, M., Omori, T., Nakazawa, M., Sano, T., Kato, M., Iwatsuki,
K., 1994. rbcL gene sequences provide evidence for the evolutionary
lineages of leptosporangiate ferns. Proc. Natl. Acad. Sci. USA 91,
5730–5734.
Hasebe, M., Wolf, P.G., Pryer, K.M., Ueda, K., Ito, M., Sano, R., Gastony, G.J., Yokoyama, J., Manhart, J.R., Murakami, N., Crane,
E.H., HauXer, C.H., Hauk, W.D., 1995. Fern phylogeny based on
rbcL nucleotide sequences. Am. Fern J. 85, 134–181.
HauXer, C.H., Grammer, W.A., Hennipman, E., Ranker, T.A., Smith,
A.R., Schneider, H., 2003. Systematics of the ant-fern genus Lecanopteris (Polypodiaceae): testing phylogenetic hypotheses with
DNA sequences. Syst. Bot. 28, 217–227.
HauXer, C.H., Ranker, T.A., 1995. rbcL sequences provide phylogenetic insights among sister species of the fern genus Polypodium.
Am. Fern J. 85, 361–374.
HauXer, C.H., Windham, M.D., Rabe, E.W., 1995. Reticulate evolution
in the Polypodium vulgare complex. Syst. Bot. 20, 89–109.
Hauk, W.D., 1995. A molecular assessment of relationships among
cryptic species of Botrychium subgenus Botrychium (Ophioglossaceae). Am. Fern J. 85, 375–394.
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
Hauk, W.D., Parks, C.R., Chase, M.W., 2003. Phylogenetic studies of
Ophioglossaceae: evidence from rbcL and trnL-F plastid DNA
sequences and morphology. Mol. Phylogenet. Evol. 28, 131–151.
Hennequin, S., Ebihara, A., Ito, M., Iwatsuki, K., Dubuisson, J.Y.,
2003. Molecular systematics of the fern genus Hymenophyllum s.l.
(Hymenophyllaceae) based on chloroplastic coding and noncoding
regions. Mol. Phylogenet. Evol. 27, 283–301.
Johnson, L.A., Soltis, D.E., 1994. mat K DNA sequences and phylogenetic reconstruction in Saxifragaceae s. str. Syst. Bot. 19, 143–156.
Kato, M., Setoguchi, H., 1998. An rbcL-based phylogeny and heteroblastic leaf morphology of Matoniaceae. Syst. Bot. 23, 391–400.
Korall, P., Kenrick, P., 2002. Phylogenetic relationships in Selaginellaceae based on rbcL sequences. Am. J. Bot. 89, 506–517.
Korall, P., Kenrick, P., 2004. The phylogenetic history of Selaginellaceae based on DNA sequences from the plastid and nucleus:
extreme substitution rates and rate heterogeneity. Mol. Phylogenet.
Evol. 31, 852–864.
Little, D.P., Barrington, D.S., 2003. Major evolutionary events in the
origin and diversiWcation of the fern genus Polystichum (Dryopteridaceae). Am. J. Bot. 90, 508–514.
Maddison, D.R., Maddison, W.P., 2000. MacClade 4: Analysis of phylogeny and character evolution. v. 4.0, Sinauer Associates, Sunderland, MA.
Murakami, N., Nogami, S., Watanabe, M., Iwatsuki, K., 1999. Phylogeny of Aspleniaceae inferred from rbcL nucleotide sequences. Am.
Fern J. 89, 232–243.
Nakazato, T., Gastony, G.J., 2003. Molecular phylogenetics of Anogramma species and related genera (Pteridaceae: Taenitidoideae).
Syst. Bot. 28, 490–502.
Perrie, L.R., Brownsey, P.J., Lockhart, P.J., Large, M.F., 2003. Evidence
for an allopolyploid complex in New Zealand Polystichum (Dryopteridaceae). NZ. J. Bot. 41, 189–215.
Pinter, I., Bakker, F., Barrett, J., Cox, C., Gibby, M., Henderson, S.,
Morgan-Richards, M., Rumsey, F., Russell, S., Trewick, S., Schneider, H., Vogel, J., 2002. Phylogenetic and biosystematic relationships in four highly disjunct polyploid complexes in the subgenera
Ceterach and Phyllitis in Asplenium (Aspleniaceae). Org. Div. Evol.
2, 299–311.
Pryer, K.M., 1999. Phylogeny of marsileaceous ferns and relationships
of the fossil Hydropteris pinnata reconsidered. Int. J. Plant Sci. 160,
931–954.
Pryer, K.M., Schneider, H., Smith, A.R., CranWll, R., Wolf, P.G., Hunt,
J.S., Sipes, S.D., 2001a. Horsetails and ferns are a monophyletic group
and the closest living relatives to seed plants. Nature 409, 618–622.
Pryer, K.M., Smith, A.R., Hunt, J.S., Dubuisson, J.Y., 2001b. rbcL data
reveal two monophyletic groups of Wlmy ferns (Filicopsida:
Hymenophyllaceae). Am. J. Bot. 88, 1118–1130.
Pryer, K.M., Schuettpelz, E., Wolf, P.G., Schneider, H., Smith, A.R.,
CranWll, R., 2004. Phylogeny and evolution of ferns (monilophytes)
with a focus on the early leptosporangiate divergences. Am. J. Bot.
91, 1582–1598.
Pryer, K.M., Smith, A.R., Skog, J.E., 1995. Phylogenetic relationships
of extant ferns based on evidence from morphology and rbcL
sequences. Am. Fern J. 85, 205–282.
Ranker, T.A., Geiger, J.M.O., Kennedy, S.C., Smith, A.R., HauXer,
C.H., Parris, B.S., 2003. Molecular phylogenetics and evolution of
the endemic Hawaiian genus Adenophorus (Grammitidaceae). Mol.
Phylogenet. Evol. 26, 337–347.
Ranker, T.A., Smith, A.R., Parris, B.S., Geiger, J.M.O., HauXer, C.H.,
Straub, S.C.K., Schneider, H., 2004. Phylogeny and evolution of
grammitid ferns (Grammitidaceae): a case of rampant morphological homoplasy. Taxon 53, 415–428.
Rouhan, G., Debuisson, J.-Y., Rakotondrainibe, F., Motley, T.J., Mickel, J.T., Labat, J.-N., Moran, R.C., 2004. Molecular phylogeny of
the fern genus Elaphoglossum (Elaphoglossaceae) based on chloroplast non-coding DNA sequences: contributions of species from
the Indian Ocean area. Mol. Phylogenet. Evol. 33, 745–763.
521
Sanchez-Baracaldo, P., 2004a. Phylogenetics and biogeography of the
neotropical fern genera Jamesonia and Eriosorus (Pteridaceae). Am.
J. Bot. 91, 274–284.
Sanchez-Baracaldo, P., 2004b. Phylogenetic relationships of the subfamily Taenitidoideae, Pteridaceae. Am. Fern J. 94, 126–142.
Sang, T., Crawford, D.J., Stuessy, T.F., 1997. Chloroplast phylogeny,
reticulate evolution, and biogeography of Paeonia (Paeoniaceae).
Am. J. Bot. 84, 1120–1136.
Sano, R., Takamiya, M., Ito, M., Kurita, S., Hasebe, M., 2000. Phylogeny of the lady fern group, tribe Physematieae (Dryopteridaceae),
based on chloroplast rbcL gene sequences. Mol. Phylogenet. Evol.
15, 403–413.
Schneider, H., Russell, S.J., Cox, C.J., Bakker, F., Henderson, S., Rumsey, F., Barrett, J., Gibby, M., Vogel, J.C., 2004a. Chloroplast phylogeny of asplenioid ferns based on rbcL and trnL-F spacer
sequences (Polypodiidae, Aspleniaceae) and its implications for
biogeography. Syst. Bot. 29, 260–274.
Schneider, H., Smith, A.R., CranWll, R., Hildebrand, T.J., HauXer, C.H.,
Ranker, T.A., 2004b. Unraveling the phylogeny of polygrammoid
ferns (Polypodiaceae and Grammitidaceae): exploring aspects of
the diversiWcation of epiphytic plants. Mol. Phylogenet. Evol. 31,
1041–1063.
Schneider, H., Janssen, T., Hovenkamp, P., Smith, A.R., CranWll, R.,
HauXer, C.H., Ranker, T.A., 2004c. Phylogenetic relationships in
the enigmatic Malesian fern Thylacopteris (Polypodiaceae, Polypodiidae). Int. J. Plant Sci. 165, 1077–1087.
Schneider, H., Smith, A.R., CranWll, R., HauXer, C.H., Ranker, T.A.,
Hildebrand, T., 2002. Gymnogrammitis dareiformis is a polygrammoid fern (Polypodiaceae) - resolving an apparent conXict between
morphological and molecular data. Plant Syst. Evol. 234, 121–136.
Shaw, J., Lickey, E.B., Beck, J.T., Farmer, S.B., Liu, W., Miller, J., Siripun, K.C., Winder, C.T., Schilling, E.E., Small, R.L., 2005. The tortoise and the hare II: relative utility of 21 noncoding chloroplast
DNA sequences for phylogenetic analysis. Am. J. Bot. 92, 142–166.
Skog, J.E., Mickel, J.T., Moran, R.C., Volovsek, M., Zimmer, E.A.,
2004. Molecular studies of representative species in the fern genus
Elaphoglossum (Dryopteridaceae) based on cpDNA sequences of
rbcL, trnL-F, and rps4 trnS. Int. J. Plant Sci. 165, 1063–1075.
Skog, J.E., Zimmer, E.A., Mickel, J.T., 2002. Additional support for
two subgenera of Anemia (Schizaeaceae) from data for the chloroplast intergenic spacer region trnL-F and morphology. Am. Fern J.
92, 119–130.
Smith, A.R., CranWll, R.B., 2002. Intrafamilial relationships of the thelypteroid ferns (Thelypteridaceae). Am. Fern J. 92, 131–149.
Sorenson, M.D., 1999. TreeRot v. 2, Boston University, Boston, MA.
Souza-Chies, T.T., Bittar, G., Nadot, S., Carter, L., Besin, E., Lejeune,
B., 1997. Phylogenetic analysis of Iridaceae with parsimony and distance methods using the plastid gene rps4. Plant Syst. Evol. 204,
109–123.
Su, Y.-J., Wang, T., Zheng, B., Jiang, Y., Chen, G.-P., Ouyang, P.-Y.,
Sun, Y.-F., 2005. Genetic diVerentiation of relictual populations of
Alsophila spinulosa in southern China inferred from cpDNA trnL-F
noncoding sequences. Mol. Phylogenet. Evol. 34, 323–333.
SwoVord, D.L., 2002. PAUP*. Phylogenetic Analysis Using Parsimony
(*and Other Methods). v. 4.0b10, Sinauer Associates, Sunderland,
MA.
Taberlet, P., Gielly, L., Pautou, G., Bouvet, J., 1991. Universal primers
for ampliWcation of three non-coding regions of chloroplast DNA.
Plant Mol. Biol. 17, 1105–1109.
Tate, J., Simpson, B., 2003. Paraphyly of Tarasa (Malvaceae) and
diverse origins of the polyploid species. Syst. Bot. 28, 723–737.
Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., Higgins,
D.G., 1997. The Clustal_X windows interface: Xexible strategies for
multiple sequence alignment aided by quality analysis tools.
Nucleic Acids Res. 25, 4876–4882.
Van den Heede, C.J., Viane, R.L.L., Chase, M.W., 2003. Phylogenetic
analysis of Asplenium subgenus Ceterach (Pteridophyta: Asplenia-
522
R.L. Small et al. / Molecular Phylogenetics and Evolution 36 (2005) 509–522
ceae) based on plastid and nuclear ribosomal ITS DNA sequences.
Am. J. Bot. 90, 481–495.
Wakasugi, T., Sugita, M., Tsudzuki, T., Sugiura, M., 1998. Updated map
of tobacco chloroplast DNA. Plant Mol. Biol. Rep. 16, 231–241.
Wikstrom, N., Kenrick, P., Chase, M., 1999. Epiphytism and terrestrialization in tropical Huperzia (Lycopodiaceae). Plant Syst. Evol. 218,
221–243.
Wolf, P.G., 1997. Evaluation of atpB nucleotide sequences for phylogenetic studies of ferns and other pteridophytes. Am. J. Bot. 84, 1429–
1440.
Wolf, P.G., 1995. Phylogenetic analyses of rbcL and nuclear ribosomal
RNA gene sequences in Dennstaedtiaceae. Am. Fern J. 85, 306–327.
Wolf, P.G., Karol, K.G., Mandoli, D.F., Kuehl, J., Arumuganathan, K.,
Ellis, M.W., Mishler, B.D., Kelch, D.G., Olmstead, R.G., Boore, J.L.,
2005. The Wrst complete chloroplast genome sequence of a lycophyte, Huperzia lucidula (Lycopodiaceae). Gene 350, 117–128.
Wolf, P.G., Rowe, C.A., Sinclair, R.B., Hasebe, M., 2003. Complete
nucleotide sequence of the chloroplast genome from a leptosporangiate fern, Adiantum capillus-veneris L. DNA Res. 10, 59–65.
Wolf, P.G., Sipes, S.D., White, M.R., Martines, M.L., Pryer, K.M.,
Smith, A.R., Ueda, K., 1999. Phylogenetic relationships of the enigmatic fern families Hymenophyllopsidaceae and Lophosoriaceae:
evidence from rbcL nucleotide sequences. Plant Syst. Evol. 219,
263–270.