First Use of Combined Large-Subunit and Small

Testing the New Animal Phylogeny: First Use of Combined Large-Subunit
and Small-Subunit rRNA Gene Sequences to Classify the Protostomes
Jon Mallatt and Christopher J. Winchell
School of Biological Sciences, Washington State University
Although the small-subunit ribosomal RNA (SSU rRNA) gene is widely used in the molecular systematics, few
large-subunit (LSU) rRNA gene sequences are known from protostome animals, and the value of the LSU gene for
invertebrate systematics has not been explored. The goal of this study is to test whether combined LSU and SSU
rRNA gene sequences support the division of protostomes into Ecdysozoa (molting forms) and Lophotrochozoa, as
was proposed by Aguinaldo et al. (1997) (Nature 387:489) based on SSU rRNA sequences alone. Nearly complete
LSU gene sequences were obtained, and combined LSU 1 SSU sequences were assembled, for 15 distantly related
protostome taxa plus five deuterostome outgroups. When the aligned LSU 1 SSU sequences were analyzed by
tree-building methods (minimum evolution analysis of LogDet-transformed distances, maximum likelihood, and
maximum parsimony) and by spectral analysis of LogDet distances, both Ecdysozoa and Lophotrochozoa were
indeed strongly supported (e.g., bootstrap values .90%), with higher support than from the SSU sequences alone.
Furthermore, with the LogDet-based methods, the LSU 1 SSU sequences resolved some accepted subgroups within
Ecdysozoa and Lophotrochozoa (e.g., the polychaete sequence grouped with the echiuran, and the annelid sequences
grouped with the mollusc and lophophorates)—subgroups that SSU-based studies do not reveal. Also, the mollusc
sequence grouped with the sequences from lophophorates (brachiopod and phoronid). Like SSU sequences, our
LSU 1 SSU sequences contradict older hypotheses that grouped annelids with arthropods as Articulata, that said
flatworms and nematodes were basal bilateralians, and considered lophophorates, nemerteans, and chaetognaths to
be deuterostomes. The position of chaetognaths within protostomes remains uncertain: our chaetognath sequence
associated with that of an onychophoran, but this was unstable and probably artifactual. Finally, the benefits of
combining LSU with SSU sequences for phylogenetic analyses are discussed: LSU adds signal, it can be used at
lower taxonomic levels, and its core region is easy to align across distant taxa—but its base frequencies tend to be
nonstationary across such taxa. We conclude that molecular systematists should use combined LSU 1 SSU rRNA
genes rather than SSU alone.
Introduction
The small-subunit ribosomal RNA (SSU rRNA or
18S) gene is the gene used most widely for molecular
phylogenetic analysis, especially for evaluating deeplevel relationships among organisms (Adoutte et al.
2000; Van de Peer et al. 2000). A key study using this
gene (Aguinaldo et al. 1997) provided evidence that living triploblastic or bilateralian animals fall into three
groups: the classically recognized (1) deuterostomes and
protostomes; and a division of protostomes into (2) Ecdysozoa, and (3) Lophotrochozoa. The Ecdysozoa, all
of which molt and lack motile locomotory cilia, include
priapulids, kinorhynchs, nematodes, nematomorphs, tardigrades, onychophorans, and arthropods. The Lophotrochozoa, originally identified by Halanych et al.
(1995) and named for those members that have a trochophore larva or a feeding structure called a lophophore,
include the annelids, molluscs, brachiopods, phoronids,
and bryozoans, plus flatworms and related minor
groups, such as rotifers, gastrotrichs, and gnathostomulids (Garey and Schmidt-Rhaesa 1998; Giribet et al.
2000; also see Balavoine 1998). The Ecdysozoa and Lophotrochozoa hypotheses challenge traditional thinking.
Besides proposing a previously unanticipated relation
among the molting animals, they contradict the old
Key words: large-subunit ribosomal RNA, 28S rRNA, small-subunit ribosomal RNA, phylogeny, protostome, Ecdysozoa.
Address for correspondence and reprints: Jon Mallatt, Box
644236, School of Biological Sciences, Washington State University,
Pullman, Washington 99164-4236. E-mail: [email protected].
Mol. Biol. Evol. 19(3):289–301. 2002
q 2002 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038
American-textbook view that flatworms (which lack a
coelom) branched from the base of the bilateralian tree,
followed by nematodes and other pseudocoelomate phyla (Barnes 1980, chapt. 3).
In the past few years, the Ecdysozoa and Lophotrochozoa hypotheses have been accepted by many molecular systematists, anatomists, developmental biologists, and paleontologists (Schmidt-Rhaesa et al. 1998;
Adoutte et al. 2000; Conway Morris 2000; Giribet et al.
2000; Peterson, Cameron, and Davidson 2000; Peterson
and Eernisse 2001). Although challenged by a few researchers (Foster and Hickey 1999; Wagele et al. 1999;
Hausdorf 2000), these hypotheses have accumulated
support from other gene sequences, namely from Hox
genes (de Rosa et al. 1999), b-thymosin (Manuel et al.
2000), and from expanded sets of SSU rRNA sequences
(Garey and Schmidt-Rhaesa 1998; Giribet et al. 2000;
Peterson and Eernisse 2001).
Despite the impact the Ecdysozoa and Lophotrochozoa hypotheses have had, they were founded on incomplete rRNA information. That is, the SSU gene is
just the smaller part of a gene family that also contains
the large-subunit rRNA genes (LSU: 28S, 5.8S, and the
little used 5S genes), which also are valuable for deeplevel phylogenetic analysis (Hillis and Dixon 1991).
Given these facts, LSU rRNA genes should be a powerful tool with which to test the Ecdysozoa and Lophotrochozoa hypotheses—for if the larger rRNA genes fail
to support them, these hypotheses will be severely compromised. In this study, we sequenced the LSU rRNA
genes from 15 distantly related taxa of protostomes and
used the combined LSU and SSU sequences to test the
289
290
Mallatt and Winchell
Ecdysozoa and Lophotrochozoa hypotheses. Furthermore, because complete LSU sequences are almost unknown for protostomes (De Rijk et al. 1995), our new
sequences should provide a foundation for other complete rRNA gene studies in the future.
Materials and Methods
This is the fourth in a series of studies using nearly
complete LSU rRNA gene sequences for animal phylogeny, and most of the methodology is discussed in the
three previous papers (Mallatt and Sullivan 1998; Mallatt, Sullivan, and Winchell 2001; Winchell et al. 2002).
Specimens and Sequences
Information on the sources of the animals, voucher specimen numbers, precise parts of the genes used,
the new GenBank accession numbers (AF342785–
AF342805), and sequences taken from the literature, are
all available as supplementary material at the Molecular
Biology and Evolution website (http://www.smbe.org).
The 15 protostome species used were Amphiporus sp.
(nemertean), Chordodes morgani (nematomorph), Eisenia fetida (oligochaete), Halicryptus spinulosus (priapulan), Limulus polyphemus (arthropod), Peripatoides
novaezealandiae (onychophoran), Phascolopsis gouldii
(sipunculan), Phoronis ijimae (5vancouverensis) (phoronid), Placopecten magellanicus (mollusc), Proceraea
cornuta (polychaete), Sagitta elegans (chaetognath),
Stylochus zebra (platyhelminth), Terebratalia transversa (brachiopod), Trichinella spiralis (nematode), and
Urechis caupo (echiuran), and the five deuterostome
outgroups were Branchiostoma floridae (cephalochordate), Ciona intestinalis (tunicate), Florometra serratissima (echinoderm), Hydrolagus colliei (vertebrate), and
Ptychodera flava (hemichordate). In general, we used
taxa whose SSU rRNA genes are slowly evolving, as
did Aguinaldo et al. (1997). However, not all their taxa
were available to us, and only two of the protostome
species were the same in both studies (T. transversa and
T. spiralis). In using deuterostomes as the outgroup, we
assume that deuterostomes are the sister group of all
protostomes, an assumption supported by many recent
studies (Balavoine and Adoutte 1998; Giribet et al.
2000; Peterson and Eernisse 2001). Following the recommendation of Giribet et al. (2000), no diploblast outgroups were used because diploblast LSU and SSU sequences are very different from those of triploblasts
(Medina et al. 2001) and seem too dissimilar to assure
accuracy of the triploblast tree. The five outgroup taxa—
Branchiostoma, Ciona, Florometra, Hydrolagus, and
Ptychodera (taken from Winchell et al. 2002)—were
chosen primarily to yield uniform sampling across the
deuterostomes (one or two representatives per phylum)
and secondarily as the least divergent and most complete
sequences available. Previously sequenced LSU and
SSU genes from the protostomes Caenorhabditis elegans, Drosophila melanogaster, and Aedes albopictus
were not used because of high divergence (long branches: De Rijk et al. 1995). By contrast, LSU from the
chaetognath Sagitta was sequenced and used here, de-
spite the likelihood of high divergence because taxonomists have requested more gene sequences from this
enigmatic phylum (Garey and Schmidt-Rhaesa 1998;
Giribet et al. 2000). To assure that the divergent Sagitta
would not disrupt the phylogenetic analyses, such analyses were performed twice, first without Sagitta and
then with Sagitta included.
DNA extraction, primers, PCR amplification procedures, purification, sequencing, and fragment assembly were as described by Mallatt and Sullivan (1998)
and Winchell et al. (2002). The gene sequences were
imported into Seqlab, a Macintosh X Window application (Smith et al. 1994). The concatenated 28S, 5.8S,
and 18S rRNA genes were aligned by eye, with the
alignment rigidly based on secondary structure using
the LSU rRNA model of Xenopus laevis (Schnare et
al. 1996) and the SSU models of X. laevis and Strongylocentrotus purpuratus (Gutell 1994). Our alignments are available from EMBL (ftp://ftp.ebi.ac.uk/
pub/databases/embl/align) under the accession numbers
ALIGNp000087 and ALIGNp000088.
As in our past studies of deep-level phylogeny, the
entire conserved core of the 28S gene was readily alignable across taxa and was used; the divergent domains,
comprising over one-third of the 28S gene, were excluded (see table 7.1 in Mallatt, Sullivan, and Winchell
2001; Hassouna, Michot, and Bachellerie 1984). In the
SSU genes, about 15% of the sites could not be aligned
across taxa and were excluded. Overall, we used 2,348,
149, and 1,517 aligned sites from the 28S, 5.8S, and
18S genes, respectively, for a total of about 4,000 sites
in the analysis.
Phylogenetic Analyses
Three data sets were analyzed: (1) combined LSU
1 SSU genes; (2) SSU only; and (3) LSU only. We
combined the genes because the incongruence length
difference test yielded a value of P , 0.27, thereby suggesting that the LSU and SSU sequences were not significantly more incongruent than random partitions (but
see Sullivan [1996] for a criticism of that test as an
arbiter of data combination).
The search strategies for inferring optimal phylogenetic trees included equally weighted maximum parsimony (MP), maximum likelihood (ML), and minimum
evolution (ME) using LogDet-Paralinear distances (Lake
1994; Lockhart et al. 1994; Swofford et al. 1996). We
executed each of these optimality criteria with PAUP*
version 4.0 beta 4a (Swofford 2000). To obtain the best
tree using ML, we followed an iterative search strategy
(Swofford et al. 1996; Mallatt and Sullivan 1998), in
which the GTR 1 I 1 G model was found to fit the data
best.
Of the three tree-building methods used, ME analysis with LogDet distances is emphasized over ML and
MP because it is designed to give the best results when
the gene sequences differ in base composition across
taxa—as our sequences do (see Results). This LogDet
method only crudely accounts for heterogeneity in rates
of evolution across nucleotide sites, although it has been
Phylogeny of Protostomes
shown to yield results similar to more complex models
of rate heterogeneity (Waddell, Penny, and Moore 1997;
J. Sullivan, personal communication). The way it accounts for rate heterogeneity is by using a value called
Pinv, the proportion of invariable sites in the genes. Because Pinv cannot be measured directly, we used the ML
method of calculating this parameter, even though ML
slightly overestimates Pinv (see Winchell et al. 2002); if
the ML-derived value for Pinv led to many trees with
undefined values in 1,000 ME bootstrap replicates, it
was lowered slightly until it led to fewer than 10 such
undefined trees. To determine how sensitive the ME results were to variation in estimates of Pinv, we also used
the following as Pinv values; namely, the proportion of
sites that were constant in our set of protostome and
deuterostome sequences (Pc) and the arbitrarily decreasing range of Pinv values: 0.55, 0.45, 0.35, 0.25, and 0.15.
To measure nodal support for our ME, ML, and
MP trees, nonparametric bootstrap analyses (Felsenstein
1985) were performed with 1,000 replicates for the ME
and MP searches and 100 replicates for the ML
searches.
Spectral Analysis
Spectral analysis, a method that quantifies both
support (S) and conflict (C) within gene sequences for
groups of taxa (or splits), is an alternative to tree-based
analyses (Lento et al. 1995; Penny et al. 1999). In this
part of the study, the LogDet-transformed distances were
analyzed by the Spectrum program (Charleston 1998,
http://taxonomy.zoology.gla.ac.uk/;mac/spectrum/spectrum), with Pinv set at the same ML-based value as in
the ME-tree calculation. (Again, using Pinv is a crude
way to account for heterogeneous rates of evolution
across nucleotide sites.) This Spectrum program must
be supplied with a threshold support value, which must
be chosen according to several criteria: It must be low
enough to yield many splits but not so many as to overload the display buffer and crash the program; also, this
threshold value must yield standardized conflict values
(see subsequently) in roughly the same size range as the
support values so that both support and conflict are interpretable when plotted together on a histogram (as in
fig. 4). A threshold of 0.0003 substitutions per site was
chosen because it fit these criteria best. Conflict values
were standardized according to Lento et al. (1995).
Spectral analysis is not yet a statistical method, so
it does not provide an absolute basis for accepting one
split and rejecting another. To allow better comparisons
between such competing splits, we developed a ‘‘relative support-versus-conflict index’’ (I). This index was
calculated as the ratio, (S1/S2)/(C1/C2), where S1 and
C1 are the support and conflict values for the split being
evaluated, and S2 and C2 are the corresponding values
for the competing (mutually exclusive) split that had the
highest level of support. Although it is not statistical,
this index has a clear meaning: a split with an index of
n (say, 2) has n times more conflict-adjusted support
than does the nearest alternative split. To provide a standard, we identified many splits within deuterostomes
291
(Winchell et al. 2002) that are supported by morphological or independent molecular evidence and found all
such splits to have I-values over 4 (amphioxus plus vertebrates, hemichordates plus echinoderms, lamprey plus
hagfish, holothuroideans plus echinoideans); based on
this precedent from deuterostomes, we called protostome splits valid if they likewise had I-values over 4.
Results
Nucleotide Frequencies
Table 1 shows the proportions of A, C, G, and T
nucleotides in the combined LSU 1 SSU genes of the
taxa used in this study. Note that the Peripatoides and
Sagitta sequences are richest in C and G, whereas Trichinella is rich in A and T. As shown at the bottom of the
table, for the basic set of 19 taxa—that is, with Sagitta
left out—the chi-square test of homogeneity of base frequencies across taxa indicates strong nonstationarity (P
K 0.01), and this becomes even more extreme when
Sagitta is included. On the other hand, when the CGrich Sagitta and Peripatoides sequences are both omitted, the 18 remaining sequences are stationary.
Most of the nonstationarity of base frequencies in
the rRNA genes must be in the LSU gene because the
SSU sequences alone were always stationary (test results not shown).
Phylogenetic Trees
Figure 1 shows the trees from the basic set of 19
taxa (i.e., without Sagitta) as calculated from the combined LSU 1 SSU, the SSU-only, and the LSU-only
gene sequences by the ME, ML, and MP methods. Brief
examination of these trees reveals that SSU and LSU
produce similar phylogenies, and that relative branch
lengths are roughly similar in both the SSU and LSU
trees. That is, taxa with rapidly evolving SSU sequences
(longest branches) usually have rapidly evolving LSU
sequences, and slowly evolving (short-branched) SSU
sequences go with slowly evolving LSU sequences. This
suggests that the SSU and LSU rRNA genes evolve by
a similar set of rules.
In the combined-gene tree of figure 1A, the protostome, Lophotrochozoa, and Ecdysozoa clades are
strongly supported, with bootstrap values of 92% or
higher. Support is also evident (ME bootstrap values
over 60%) for the following seven groups within Lophotrochozoa and Ecdysozoa: (1) polychaete Proceraea
1 echiuran Urechis; (2) brachiopod Terebratalia 1 phoronid Phoronis 1 mollusc Placopecten; (3) Phoronis 1
Placopecten; (4) oligochaete Eisenia 1 Proceraea 1
Urechis 1 Terebratalia 1 Phoronis 1 Placopecten; (5)
all Ecdysozoa other than the priapulan Halicryptus; (6)
Peripatoides 1 arthropod Limulus; and (7) nematomorph Chordodes 1 nematode Trichinella. Comparison
of figure 1B and C shows that the SSU gene seems to
contain more signal for the higher-order groups, Lophotrochozoa and Ecdysozoa (which have SSU-ME bootstrap values of 73% and 81%, respectively, compared
with only 42% and 32% in the LSU tree), whereas the
LSU gene contains more signal for the seven lower-
292
Mallatt and Winchell
Table 1
Proportions of Nucleotide Types in the SSU and LSU rRNA Gene Alignments Used in the Analysis (28S divergent
domains and other nonalignable sites are excluded). The Number of Sites Used (28S 1 18S 1 5.8S) is Listed at Far
Right
Taxon
A
C
G
T
Number of
Sites Used
Amphiporus . . . . . . .
Chordodes . . . . . . . .
Eisenia . . . . . . . . . . .
Halicryptus. . . . . . . .
Limulus . . . . . . . . . .
Peripatoides . . . . . . .
Phascolopsis. . . . . . .
Phoronis . . . . . . . . . .
Placopecten . . . . . . .
Proceraea . . . . . . . . .
Sagitta. . . . . . . . . . . .
Stylochus . . . . . . . . .
Terebratalia. . . . . . . .
Trichinella . . . . . . . .
Urechis . . . . . . . . . . .
Branchiostoma . . . . .
Ciona . . . . . . . . . . . .
Florometra . . . . . . . .
Hydrolagus. . . . . . . .
Ptychodera . . . . . . . .
Mean. . . . . . . . . . . . .
0.2656
0.2740
0.2687
0.2714
0.2627
0.2441
0.2616
0.2681
0.2718
0.2619
0.2445
0.2720
0.2658
0.2858
0.2685
0.2568
0.2668
0.2596
0.2574
0.2582
0.2643
0.2189
0.2115
0.2163
0.2134
0.2250
0.2451
0.2259
0.2161
0.2159
0.2213
0.2499
0.2154
0.2165
0.2080
0.2089
0.2283
0.2111
0.2219
0.2267
0.2296
0.2213
0.2826
0.2778
0.2829
0.2815
0.2871
0.3045
0.2890
0.2826
0.2760
0.2846
0.3060
0.2785
0.2828
0.2679
0.2817
0.2958
0.2903
0.2890
0.2945
0.2942
0.2865
0.2328
0.2366
0.2321
0.2337
0.2253
0.2063
0.2234
0.2332
0.2363
0.2323
0.1996
0.2342
0.2350
0.2383
0.2409
0.2191
0.2318
0.2295
0.2213
0.2181
0.2280
3,855
3,981
3,818
3,887
4,013
3,795
3,910
3,860
3,863
4,013
3,975
3,993
3,996
3,860
3,993
3,899
3,800
3,927
4,003
3,912
3,918
NOTE.—x2 test of homogeneity of base frequencies: For 19 of the above taxa, Sagitta omitted: x2 5 84.2, P 5 0.0053. For all 20 of the above taxa: x2 5
126.3, P 5 0.00000037. For 18 taxa, Sagitta and Peripatoides omitted: x2 5 51.3, P 5 0.46.
order groups within Lophotrochozoa and Ecdysozoa—
all of which have ,50% bootstrap support in the SSU
tree.
To check the validity of the ME bootstrap values
in figure 1, ME bootstrap analyses were rerun upon the
combined LSU 1 SSU sequences over a wide range of
Pinv values (table 2). As seen in this table, support varied
with Pinv for only a few groups, and the high bootstrap
values for the main groups—protostomes, Lophotrochozoa, Ecdysozoa—seemed unaffected by Pinv. At or just
below the ML estimate of Pinv 5 0.62, there is bootstrap
support .60% for every group listed in the table.
Figure 2 shows the LSU 1 SSU tree calculated
from the taxa used in figure 1, except with the chaetognath Sagitta added. The highly divergent Sagitta sequence, represented by the longest branch in the tree,
groups with that of Peripatoides, which is also divergent
and CG rich. The Peripatoides 1 Sagitta group has an
85% ME bootstrap value in this combined gene tree,
and it also occurs in the SSU-only and LSU-only trees
(not illustrated) with bootstrap values of 88% and 62%,
respectively. The other groups in figure 2 resemble those
in figure 1A, except that there are slightly lower bootstrap values for the protostome, Lophotrochozoa, and
Ecdysozoa clades. Also, within Ecdysozoa, Limulus no
longer groups strongly with Peripatoides, and Halicryptus no longer branches off first. The relationships within
Lophotrochozoa, by contrast, seem unaffected by the
presence of Sagitta (cf. fig. 1).
Figure 3 shows the LSU 1 SSU tree that results
when both the Sagitta and Peripatoides sequences are
excluded. With the data set now exhibiting stationarity
of base frequencies (see previously), this tree should
have the most accurate likelihood and parsimony bootstrap values. Thus, it is noteworthy that both ML and
MP strongly support the protostome, Lophotrochozoa,
and Ecdysozoa groups. Within Lophotrochozoa, the relations are essentially the same as in the nonstationary
trees of figures 1A and 2. So few ecdysozoans remain,
however, that the relationships among crown ecdysozoans (Chordodes, Trichinella, and Limulus) are not resolved with this abbreviated set of taxa.
When this manuscript was in preparation, James
Garey analyzed our combined LSU 1 SSU sequences
with Bootstrapper’s Gambit using LogDet-Paralinear
distances and corrections for site to site variation (Lake
1995; Cameron, Garey, and Swalla 2000)—a method
that can better account for heterogeneous rates of evolution across nucleotide sites than can methods that use
Pinv. The Gambit program could only accommodate 15
sequences, so Peripatoides, Sagitta, Trichinella, Hydrolagus, and Ciona were omitted. Within Lophotrochozoa,
the following groups and bootstrap values (1,000 replicates) were attained: Eisenia 1 Proceraea 1 Urechis 1
Terebratalia 1 Phoronis 1 Placopecten: 75%; Terebratalia 1 Phoronis 1 Placopecten: 68%; Eisenia 1 Proceraea 1 Urechis: 50%. These results are similar to
those generated by the ME analysis using Pinv in figure
1A.
Spectral Analysis
Without Sagitta
The results reported in this section were derived
from the 19 core taxa without Sagitta (although adding
Sagitta did not change the following findings). Figure 4
Phylogeny of Protostomes
293
FIG. 1.—Phylogenetic trees for 14 protostomes (chaetognath Sagitta not included) and five deuterostome outgroups calculated from rRNA
gene sequences under the ME criterion using LogDet-Paralinear distances and estimated values for the proportion of invariable sites (Pinv: see
Materials and Methods). The percentages of bootstrap replicates supporting each node are shown and represent, from top to bottom (or right to
left): minimum evolution (ME), maximum likelihood (ML), and maximum parsimony (MP)-based values. Bootstrap values are not shown if the
ME values were ,50%, except for Lophotrochozoa and Ecdysozoa where all bootstrap values are shown. A, Combined LSU 1 SSU rRNA
gene tree: Pinv 5 0.62. B, SSU tree: Pinv 5 0.61. C, LSU tree: Pinv 5 0.59.
98
98
98
99
98
98
97
Pinv
0.15 . . . . . . . . .
0.25 . . . . . . . . .
0.35 . . . . . . . . .
0.45 . . . . . . . . .
0.55 . . . . . . . . .
0.62a . . . . . . . . .
0.64b . . . . . . . . .
93
95
95
95
95
92
87
Lophotrochozoa
(%)
91
93
94
95
94
93
87
Ecdysozoa
(%)
43
41
46
51
59
65
67
NOTE.—The Pinv values of 0.62, and a bit less than 0.62, are most likely to be correct (see text).
aP
inv estimated with the GTR 1 I maximum likelihood model (see fig. 1A).
b The actual proportion of constant sites in the 19 sequences used to calculate P .
inv
Protostomes
(%)
Proceraea,
Urechis
(%)
81
81
79
81
76
76
71
Phoronis,
Terebratalia,
Placopecten
(%)
Proceraea, Urechis,
Eisenia, Phoronis,
Terebratalia,
Placopecten (%)
72
68
70
70
67
61
51
Phoronis,
Placopecten
(%)
83
81
81
85
83
81
78
85
88
88
88
85
77
59
Peripatoides,
Chordodes,
Trichinella,
Limulus (%)
88
90
87
82
75
65
55
Peripatoides,
Limulus
(%)
Table 2
Bootstrap Values for 10 Key Groups in Figure 1, Computed Using LogDet Distances of LSU 1 SSU rRNA Sequences over a Range of Invariable Sites (Pinv)
90
91
89
82
74
64
52
Chordodes,
Trichinella
(%)
294
Mallatt and Winchell
FIG. 2.—Phylogenetic tree calculated from combined LSU 1 SSU
rRNA gene sequences for 15 protostomes. Same as in figure 1A, except
with the chaetognath Sagitta included. Pinv 5 0.59.
FIG. 3.—Phylogenetic tree calculated from combined LSU 1 SSU
rRNA gene sequences for 13 protostomes. Same as in figure 1A, except with both Sagitta and Peripatoides omitted. Pinv 5 0.64.
Phylogeny of Protostomes
295
FIG. 4.—Lento plot of support and conflict values for the 20 most highly supported splits (taxonomic groups) that were computed by
spectral analysis from LSU 1 SSU LogDet distances among the 19 taxa listed in figure 1 (Sagitta not included: Pinv 5 0.62). Units on the yaxis are substitutions per site, and the precise numerical values are listed in column 1 of table 3. Black bars represent splits that are present in
the ME tree (fig. 1A), whereas gray bars represent splits not in that tree: A 5 Chordodes 1 Trichinella, B 5 Peripatoides 1 Limulus, C 5
Lophotrochozoa, D 5 Trichinella 1 Limulus, E 5 protostomes, F 5 Peripatoides 1 Chordodes 1 Trichinella 1 Limulus, G 5 Peripatoides 1
Chordodes, H 5 Ecdysozoa, I 5 Proceraea 1 Urechis, J 5 Terebratalia 1 Phoronis 1 Placopecten, K 5 Eisenia 1 Proceraea 1 Urechis 1
Terebratalia 1 Phoronis 1 Placopecten, L 5 Chordodes 1 Trichinella 1 Limulus, M 5 Phoronis 1 Placopecten, N 5 deuterostomes 1
Peripatoides 1 Halicryptus 1 Chordodes 1 Limulus, O 5 Phascolopsis 1 Stylochus, P 5 Peripatoides 1 Halicryptus 1 Limulus, Q 5
Proceraea 1 Urechis 1 Eisenia, R 5 Stylochus 1 Amphiporus, S 5 Peripatoides 1 Halicryptus 1 Chordodes 1 Limulus, T 5 Peripatoides
1 Halicryptus 1 Trichinella 1 Limulus.
is a Lento plot of support and conflict values for various
splits calculated by spectral analysis of the LogDet distances from the combined LSU 1 SSU sequences. The
precise support and conflict values are listed in column
1 of table 3, as S1 and C1 values, respectively. Spectral
analysis showed most support for the groups that were
also supported by bootstrap analysis, although it should
be noted that most splits containing ecdysozoans have
high conflict values because of the several highly divergent sequences in this superphylum.
Because ranking splits by raw support values can
be misleading, we used a relative support-versus-conflict
index, I, that also takes conflict into account when comparing splits (see Materials and Methods). As shown in
column 1 of table 3, this index exceeded four for the
following eight splits, which are thus considered most
likely to be valid: Lophotrochozoa; Protostomes; Peripatoides 1 Chordodes 1 Trichinella 1 Limulus; Ecdysozoa; Proceraea 1 Urechis; Terebratalia 1 Phoronis
1 Placopecten; Eisenia 1 Proceraea 1 Urechis 1 Terebratalia 1 Phoronis 1 Placopecten; Phoronis 1 Placopecten. Note that all these groups were also identified
by .60% bootstrap values in figure 1. Also note that
the Chordodes 1 Trichinella and Peripatoides 1 Limulus splits had low indices (I # 1.2, table 3), despite
their high support values; their low indices resulted from
a relatively high support for the alternative split of
Trichinella 1 Limulus.
To assess which of the gene components—LSU or
SSU—contributed more to the splits, these two components were analyzed separately (compare columns 2
and 3 in table 3). On the basis of the relative sizes of
their support-versus-conflict indices, the SSU gene contributes more than LSU to the higher-order splits of Lophotrochozoa, protostomes, and Ecdysozoa (that is, the
SSU index exceeds the LSU index for each of these
groups). By contrast, LSU contributes more to the other,
lower-order, taxonomic splits (F, I–K, and M in the table) because the LSU index exceeds the SSU index for
these splits. Recall this is the same pattern exhibited by
bootstrap analysis (fig. 1A–C).
With Sagitta
When the sequence from the chaetognath Sagitta
was included, spectral analysis of the combined LSU 1
SSU data favored a Peripatoides 1 Sagitta pairing. The
support value for this pairing was greater (S 5 11.4 3
1023 cf. 2.6 3 1023) and the conflict value less (C 5
5.2 3 1023 cf. 11.6 3 1023) than for the best alternative
split of Peripatoides 1 Limulus, yielding a relative support-versus-conflict index of 10.0.
The main findings of this study are summarized in
figure 5.
296
Mallatt and Winchell
Table 3
Support and Conflict Values for the Various Groups (splits) of Protostomes Shown in Figures 4 and 1, Calculated by
Spectral Analysis
Split in Figure 4
A. Chordodes 1 Trichinella. . . . . . . . . .
B. Peripatoides 1 Limulus . . . . . . . . . .
C. Lophotrochozoa . . . . . . . . . . . . . . . . .
E. Protostomes . . . . . . . . . . . . . . . . . . . .
F. Peripatoides 1 Chordodes 1
Trichinella 1 Limulus . . . . . . . . . . . .
H. Ecdysozoa . . . . . . . . . . . . . . . . . . . . .
I. Proceraea 1 Urechis. . . . . . . . . . . . . .
J. Terebratalia 1 Phoronis 1
Placopecten . . . . . . . . . . . . . . . . . . . . .
K. Eisenia 1 Proceraea 1 Urechis 1
Terebratalia 1 Phoronis 1
Placopecten . . . . . . . . . . . . . . . . . . . . .
M. Phoronis 1 Placopecten . . . . . . . . . .
O. Phascolopsis 1 Stylochus. . . . . . . . .
Q. Proceraea 1 Urechis 1 Eisenia . . . .
Column 1: LSU 1 SSU
S1/S2@C1/C2 5 I
Column 2: SSU only
S1/S2@C1/C2 5 I
Column 3: LSU only
S1/S2@C1/C2 5 I
S1/S2: 7.26/5.89a 5 1.2
C1/C2: 6.71/6.76a 5 1.0
I 5 1.2
S1/S2: 6.51/5.89a 5 1.1
C1/C2: 6.53/6.76a 5 1.0
I 5 1.1
S1/S2: 6.22/0.85c 5 7.3
C1/C2: 3.06/12.7c 5 0.24
I 5 30.4
S1/S2: 5.8/1.73c 5 3.4
C1/C2: 4.1/11.5c 5 0.36
I 5 9.4
S1/S2: 3.91/6.82a 5 0.6
C1/C2: 5.3/4.0a 5 1.3
I 5 0.4
S1/S2: 1.98/6.82a 5 0.3
C1/C2: 7.0/4.0a 5 1.8
I 5 0.2
S1/S2: 4.79/1.01d 5 4.8
C1/C2: 4.2/11.5d 5 0.37
I 5 13
S1/S2: 6.37/1.15f 5 6.1
C1/C2: 2.5/11.1f 5 0.23
I 5 26.5
S1/S2: 6.97/5.08b 5 1.4
C1/C2: 5.6/6.5b 5 0.86
I 5 1.6
S1/S2: 8.08/5.08b 5 1.6
C1/C2: 4.4/6.5b 5 0.7
I 5 2.3
S1/S2: 3.66/3.66c 5 1.0
C1/C2: 4.01/6.0c 5 0.67
I 5 1.5
S1/S2: 3.35/2.42c 5 1.4
C1/C2: 4.1/7.12c 5 0.58
I 5 2.4
S1/S2: 4.54/1.73c 5 2.6
C1/C2: 6.82/11.5c 5 0.59
I 5 4.4
S1/S2: 4.22/1.73c 5 2.4
C1/C2: 6.25/11.5c 5 0.54
I 5 4.4
S1/S2: 2.92/0.81h 5 3.6
C1/C2: 0.72/2.57h 5 0.28
I 5 12.9
S1/S2: 0.3/1.63g 5 0.2
C1/C2: 5.4/7.0g 5 0.77
I 5 0.3
S1/S2: 6.28/1.0d 5 6.3
C1/C2: 3.9/11.5d 5 0.34
I 5 18.6
S1/S2: 1.41/1.33i 5 1.1
C1/C2: 6.8/2.6i 5 2.6
I 5 0.4
S1/S2: 4.21/3.66c 5 1.2
C1/C2: 7.9/6.0c 5 1.3
I 5 0.9
S1/S2: 2.43/3.66c 5 0.7
C1/C2: 7.7/6.0c 5 1.3
I 5 0.6
S1/S2: 2.5/0.74h 5 3.4
C1/C2: 0.4/1.3h 5 0.3
I 5 11.1
S1/S2: 2.0/0.47j 5 4.3
C1/C2: 0.41/2.49j 5 0.16
I 5 26.1
S1/S2: 0.6/1.33i 5 0.5
C1/C2: 2.7/2.6i 5 1.04
I 5 0.5
S1/S2: 1.69/1.17k 5 1.4
C1/C2: 0.86/0.92k 5 0.93
I 5 1.5
S1/S2: 1.96/0.55l 5 3.6
C1/C2: 1.4/5.4l 5 0.26
I 5 13.9
S1/S2: 1.82/0.37o 5 4.9
C1/C2: 0.1/0.9o 5 0.11
I 5 45
S1/S2: 1.65/1.23r 5 1.3
C1/C2: 3.0/3.25r 5 0.9
I 5 1.5
S1/S2: 1.45/0.81h 5 1.8
C1/C2: 1.53/2.57h 5 0.595
I 5 3.03
S1/S2: 0.35/3.39m 5 0.1
C1/C2: 9.6/2.2m 5 4.3
I 5 0.2
S1/S2: 1.49/0.61p 5 2.4
C1/C2: 1.0/3.2p 5 0.3
I58
S1/S2: ,0.3/3.4m 5 ,0.1
C1/C2: —t
I 5 low
S1/S2: ,0.3/2.4s 5 ,0.13
C1/C2: —t
I 5 low
S1/S2: 2.33/0.46n 5 5.1
C1/C2: 0.34/2.0n 5 0.17
I 5 30
S1/S2: 2.34/0.51q 5 4.6
C1/C2: 0.36/1.65q 5 0.22
I 5 21
S1/S2: 2.05/3.66c 5 0.6
C1/C2: 3.6/6.0c 5 0.6
I51
S1/S2: 2.71/1.17k 5 2.3
C1/C2: 0.8/0.9k 5 0.9
I 5 2.9
NOTE.—Splits with the highest I values in column 1 (I . 4) are the most likely to be valid (I 5 relative support-versus-conflict index).
NOTE.—S1 and C1 are support and conflict values (in units of substitutions per site, 3 1023) for the split listed, whereas S2 and C2 are the corresponding
values for the most highly supported contradictory split. Such second-best splits, as indicated by the symbols for footnotes, are:
a Trichinella 1 Limulus; b Peripatoides 1 Chordodes; c Stylochus 1 Trichinella; d Proceraea 1 Peripatoides; e Deuterostomes 1 Peripatoides 1 Halicryptus 1
Chordodes 1 Limulus; f Ecdysozoans 1 Branchiostoma; g Halicryptus 1 Chordodes 1 Trichinella; h Urechis 1 Terebratalia 1 Phoronis 1 Placopecten 1 Eisenia;
i Urechis 1 Terebratalia; j Proceraea 1 Urechis 1 Phoronis 1 Placopecten; k Eisenia 1 Terebratalia; l Phascolopsis 1 Stylochus 1 Eisenia; m Phascolopsis 1
Eisenia; n Terebratalia 1 Phoronis 1 Placopecten 1 Amphiporus; o Terebratalia 1 Phoronis; p Urechis 1 Placopecten; q Terebratalia 1 Placopecten 1 Eisenia;
r Stylochus 1 Amphiporus; s All lophotrochozoans except Proceraea. t Conflict values could not be calculated when support values were less than 0.3 3 1023,
although the conflicts for such weakly supported groups can be assumed to be high.
Discussion
Ecdysozoa and Lophotrochozoa
The main goal of this study was to test whether
combined LSU 1 SSU rRNA gene sequences support
the division of protostomes into Lophotrochozoa and
Ecdysozoa, as Aguinaldo et al. (1997) proposed based
on SSU sequences alone. The answer is an emphatic
yes: Adding the LSU to the SSU data increased the ME
bootstrap support for these two clades markedly, from
73% and 81% in our SSU-only tree (fig. 1B) to 92%
and 93%, respectively, in the combined LSU 1 SSU
tree (fig. 1A). It is noteworthy that although we used
taxa whose SSU genes contained less bootstrap support
for Lophotrochozoa (73%) and for Ecdysozoa (81%)
than did the taxa used by Aguinaldo et al. (.90%), adding the LSU sequences raised our bootstrap values to
.90%, as seen in that previous study. Furthermore, our
combined LSU 1 SSU sequences continued to support
the existence of Ecdysozoa and Lophotrochozoa when
the disruptive sequence of the chaetognath Sagitta was
included (fig. 2), when the taxa were limited to those
yielding stationary base frequencies (fig. 3), and when
the sequences were analyzed by spectral analysis (fig. 4
and table 3). Thus, this study not only supports the di-
Phylogeny of Protostomes
FIG. 5.—Summary tree showing the findings of this study, derived
from combined LSU 1 SSU rRNA gene sequences. Numbers represent
ME bootstrap percentages (from fig. 1A) over I values (relative support-versus-conflict indices from column 1 of table 3); and key morphological characters are mapped onto some important nodes. Note
that the position of chaetognaths within protostomes is treated as uncertain (?). As for sipunculans, evidence from mitochondrial genes
suggests they are akin to annelids (Boore and Staton 2002), although
our rRNA data do not reveal this. For documentation of the morphological characters see Nielsen (1995, spiral cleavage and protostomy);
Schmidt-Rhaesa (1998, aflagellate sperm); Schmidt-Rhaesa et al.
(1998, dorsal heart); Giribet et al. (2000, haemal system); McHugh
(2000, chaetae), and Peterson and Eernisse (2001, protrusile chaetae).
vision of protostomes into Ecdysozoa and Lophotrochozoa but also strengthens it. We also found that SSU is
best at resolving deeper nodes in the protostome tree,
whereas LSU is best at revealing more apical nodes.
Subgroups of Taxa
Together, the LSU 1 SSU sequences seemed to resolve some subgroups within Ecdysozoa and Lophotrochozoa. Identifying such subgroups is a major goal of
invertebrate taxonomy, toward which SSU alone has
made only a small contribution, despite the availability
of hundreds of SSU sequences from a wide range of
taxa (Garey and Schmidt-Rhaesa 1998; Adoutte et al.
2000; Giribet et al. 2000). For example, SSU-based phylogenies show extensive paraphyly of molluscs, annelids, and some other phyla within the Lophotrochozoa
(Aguinaldo et al. 1997; Halanych 1998; Winnepenninckx, Van de Peer, and Backeljau 1998; Giribet et al.
2000). Given the small number of taxa used in our study
and the fact that our bootstrap values were only in the
60% and 70% range, our proposal that LSU 1 SSU
sequences can resolve subgroups within Ecdysozoa and
Lophotrochozoa is made with caution. Nonetheless,
297
these subgroups were also supported by our spectral
analysis, and most of them have been proposed previously based on more extensive morphological and molecular evidence. To be specific, such subgroups include:
(1) echiurans 1 polychaetes (evidenced by morphology
and elongation factor 1a genes; McHugh 1997, 2000),
(2) a lophophorate 1 mollusc 1 annelid clade (from
fossil evidence; Conway Morris and Peel 1995), which
is independent of flatworms (Garey and Schmidt-Rhaesa
1998), and (3) priapulids as basal ecdysozoans (based
on extensive and conserved SSU sequences; Peterson
and Eernisse 2001; Garey 2001; but see Schmidt-Rhaesa
et al. 1998).
Despite these apparent successes, the sipunculan
(Phascolopsis) sequence suggests a failure. Morphology
places sipunculans with annelids and molluscs, and their
mitochondrial genes relate them to annelids (Boore and
Staton 2002). Our data missed this relation, probably
because sipunculan rRNA sequences are divergent
(Winnepenninckx, Van de Peer, and Backeljau 1998; Giribet et al. 2000) and attracted artifactually to the longbranched flatworm sequence (fig. 1A). This sipunculan
example shows that adding LSU to SSU does not yield
a perfect phylogeny—and adding sequences from more
protostomes will undoubtedly yield more anomalies and
other examples of paraphyly. Even so, the value of combining LSU and SSU is demonstrated by the fact that
our LSU 1 SSU data resolved known subgroups that
our SSU data did not (compare fig. 1A and B).
Our combined-gene analysis joins the lophophorate
and mollusc sequences (Terebratalia, Phoronis, and Placopecten). This echoes an old view that allied brachiopods with molluscs, a view not held by modern experts
(Hyman 1959; p. 516), who either consider brachiopods
and molluscs closer to some polychaete than to each
other (Conway Morris 1998, p. 188) or else consider
lophophorates basal to all other lophotrochozoans, including molluscs (Valentine 1997; Peterson and Eernisse
2001). In one sense our evidence is questionable, in that
it places the phoronid closer to the mollusc than to the
brachiopod. This placement is unconvincing, considering that phoronids and brachiopods are similar animals
united by a lophophore and similar coeloms (Brusca and
Brusca 1990, chapt. 21). In another sense, however, our
evidence does support a lophophorate 1 mollusc clade,
in that it places the sequences of both lophophorates
closer to that of the mollusc than to any other protostome (fig. 5). If the lophophorate 1 mollusc clade
should prove to be real, then brachiopod and mollusc
shells could be homologous, rather than just analogous
as is now believed. Shell structure and pores, shell muscles, and shell formation from a mantle are similar in
both phyla, although these shells have different relations
to the body axis (Barnes 1980, p. 914; Pechenik 1985,
p. 378; Meglitsch and Schram 1991, pp. 212, 514; Nielsen 1995, p. 352; Reindl, Salvenmoser, and Haszprunar
1995).
Results Explained
Despite our modest taxon sampling, there are three
reasons why our results should contain true signal and
298
Mallatt and Winchell
not be the result of sampling error. First, there seems to
be a real benefit in adding LSU sequences to the traditionally used SSU sequences. The evidence for this, as
mentioned previously, is that LSU added resolution
within the Lophotrochozoa and Ecdysozoa, whereas no
such resolution existed in the SSU tree (compare fig. 1B
with A). This is the ‘‘adding more characters’’ approach
to increasing phylogenetic resolution (Mitchell, Mitter,
and Regier 2000), and its apparent success in this study
suggests that the characters in the LSU genes are indeed
valuable additions.
The second reason for the resolution attained must
be our use of the LogDet-based distance method because
the ML and MP methods did not identify the subgroups
of Ecdysozoa and Lophotrochozoa that ME did (compare the ME, ML, and MP bootstrap values in fig. 1A).
The effectiveness of this LogDet method for retrieving
well-known clades from rRNA data has been noted before (Aguinaldo et al. 1997; Mallatt, Sullivan, and Winchell 2001; Garey 2001). Its effectiveness may stem not
only from the fact that this method performs best when
nucleotide frequencies are nonstationary across taxa but
also because all 12 types of nucleotide substitutions are
free to occur at different rates, unlike in the ML or MP
methods (Swofford et al. 1996, pp. 459–461). Spectral
analysis of the LogDet distances also yielded high taxonomic resolution. Spectral analysis has a theoretical advantage over tree-building methods because in the treebuilding process support values from characters that favor contradictory trees cancel one another so that information is lost, whereas in spectral analysis all these
values are retained to produce total support values; that
is, in spectral analysis, no support signal is lost. As mentioned previously, another advantage of spectral analysis
is that it allows conflict values to be taken into account,
whereas tree-building methods do not.
The third reason for the apparent resolving power
of this study is that most of the taxa used had slowly
evolving sequences. There is much debate in molecular
phylogenetics over whether the best way to resolve difficult phylogenies is to use just the most slowly evolving
sequences (the approach of Aguinaldo et al. 1997 and
Garey and Schmidt-Rhaesa 1998) or to add more and
more sequences to split up the long branches (the dominant approach used with SSU gene sequences; Eernisse
1997; Giribet et al. 2000). We favor the former approach
because we believe that long-branch attraction (Felsenstein 1978) remains a major confounding problem in
SSU-based studies, that using slowly evolving sequences minimizes this problem, and that the currently dominant approach of adding taxa for the sake of thoroughness has not improved resolution much, perhaps because
new long branches have been added as often as the older
branches were broken up. Even so, our philosophy is to
avoid using the most divergent sequences rather than to
use only the most conserved ones, and we certainly believe that many more taxa than the 15 used here are
needed to resolve the relationships within Ecdysozoa
and Lophotrochozoa. This idea that the judicious omission and addition of taxa improves phylogenetic estimation was championed, in more depth, by Kim (1998).
Position of Chaetognaths
The phylogenetic position of chaetognath worms is
a longstanding problem. Their anatomical characters are
confusing (Bone, Kapp, and Pierrot-Bults 1991, p. 2),
and their rRNA sequences—the only molecular information available from this phylum—are highly divergent. The SSU sequences of chaetognaths have always
grouped with other rapidly evolving protostome sequences—rhabditid nematodes, gastrotrichs, gnathostomulids, and recently, with onychophorans (Halanych
1996, 1998; Eernisse 1997; Littlewood et al. 1998; Zrzavy et al. 1998; Giribet et al. 2000)—with a lack of consistency that suggests artifactual long-branch attraction.
Both our phylogenetic tree (fig. 2) and spectral analysis
placed Sagitta with Peripatoides, which we suspect is
another artifact of long-branch attraction, here between
two highly divergent and CG-rich sequences (note that
even LogDet cannot correct for the most extreme cases
of nucleotide nonstationarity; Foster and Hickey 1999).
As evidence supporting this suspicion, when we omitted
the Peripatoides sequence and reran the LSU 1 SSU
bootstrap analysis, Sagitta no longer joined the Ecdysozoa but appeared as an independent protostome lineage (in ME and MP trees: not shown). Thus, the position of chaetognaths is left unresolved in our summary
figure 5.
Testing Some Other Hypotheses
The Ecdysozoa and Lophotrochozoa hypotheses
contradict several classical, morphology-based hypotheses of triploblast relationships: that annelids and arthropods are interrelated as Articulata (based on similarities in body segmentation; Schmidt-Rhaesa et al.
1998) and that the flatworm line, then the nematode line,
branched from the base of the triploblast tree before the
appearance of deuterostomes and protostomes (Barnes
1980, chapt. 3). In parts 1–3 of table 4, these three classical hypotheses are tested by spectral analysis of our
combined LSU 1 SSU-gene data. None of these hypotheses has even one-twentieth as much support as the
Ecdysozoa and Lophotrochozoa hypotheses that oppose
it. Further evidence against these classical hypotheses
has been provided by many recent molecular and anatomical studies (Ruppert 1991; Eernisse, Albert, and
Anderson 1992; Balavoine 1998; Boore and Brown
2000; Peterson and Eernisse 2001).
The findings of this study can also test two hypotheses about deuterostomes. The first is that nemerteans are related to chordates (Jensen 1988). SSU-based
evidence speaks against a nemertean-deuterostome link
(Winnepenninckx, Backeljau, and De Wachter 1995;
Sundberg, Turbeville, and Harlin 1998), as does our
combined LSU 1 SSU evidence. In addition to the high
bootstrap support for Amphiporus as a protostome and
lophotrochozoan (fig. 1A), spectral analysis indicates
that the support for Amphiporus 1 deuterostomes is
only one-ninth as high as for Amphiporus-as-lophotrochozoan, and the conflict is three times greater (see part
4 in table 4).
Phylogeny of Protostomes
299
Table 4
Testing Older Hypotheses of Protostome Relationships Based on Spectral Analysis of
LogDet Distances from LSU 1 SSU Genes (taxa of figs. 1 and 4 were used)
Classical Hypothesis
Preferred Alternate Split, This Study
1. Articulata (Annelids 1 Panarthropoda: Eisenia 1
Proceraea 1 Urechis 1 Peripatoides 1 Limulus): S
5 0.08
2. Flatworm (Stylochus) not a Protostome: S 5 20.5
3. Nematode (Trichinella) not a Protostome: S 5 0.2
4. Nemertean (Amphiporus) 1 Deuterostomes: S 5
0.7; C 5 210.1
5. Lophophorates (Terebratalia, Phoronis) 1 Deuterostomes: S 5 20.2
1. Lophotrochozoa: S 5 6.2
Ecdysozoa: S 5 4.2
2. Stylochus in Lophotrochozoa: S 5 6.2
3. Trichinella in Ecdysozoa: S 5 4.2
4. Amphiporus in Lophotrochozoa: S 5
6.2; C 5 23.1
5. Terebratalia and Phoronis in Lophotrochozoa: S 5 6.2
NOTE.—Support (S) and conflict (C) values are substitutions per site (31023). Conflict values could not be calculated
when levels of support were below the threshold of 0.3 3 1023 (see Materials and Methods).
The second hypothesis about deuterostomes is that
they are related to lophophorates (brachiopods, phoronids, bryozoans). Lophophorates have been classified as
deuterostomes based on their development and the fact
that their lophophore resembles a similar feeding structure in pterobranch hemichordates (Hyman 1959; Brusca
and Brusca 1990, p. 797). However, molecular evidence
from many genes now indicates that lophophorates are
protostomes (Halanych et al. 1995; de Rosa et al. 1999;
Saito, Kojima, and Endo 2000). Furthermore, as seen in
part 5 of table 4, spectral analysis gives the lophophorate
1 deuterostome hypothesis negative support.
Relative Value of the LSU Versus the SSU Gene in
Molecular Phylogeny
Over 60 LSU rRNA sequences are now known
from animals (listed in Mallatt and Sullivan 1998; Medina et al. 2001; Winchell et al. 2002). On the basis of
our experience with these sequences, it is possible to
compare the relative strengths and weaknesses of the
LSU and SSU genes for phylogenetic analysis. Overall,
the LSU gene seems to contain less phylogenetic signal,
at least for resolving the deepest branches of the animal
tree (compare fig. 1C to B; also see Winchell et al.
2002). This means that LSU should not be used alone
for deep-level phylogeny but should always be combined with SSU, and indeed, in our experience, the combined LSU 1 SSU sequences always give better resolution than do SSU sequences alone. Although the LSU
gene does add signal, it requires about three times more
work to amplify and sequence than the SSU gene. However, the fact that the conserved core region of the 28S
gene is both well defined and informative about deeplevel animal taxonomy makes aligning LSU genes simpler and more objective than aligning SSU genes.
The divergent domains of the 28S gene, although
not used in this higher-level phylogenetic analysis, can
be included for lower-level analyses all the way down
to the species or even subspecies level (Littlewood
1994; Mallatt and Sullivan 1998; Jarman et al. 2000;
Litvaitis et al. 2000; Winchell 2001). The SSU gene, by
contrast, has few rapidly evolving regions and provides
little resolution at lower taxonomic levels.
Finally, as already mentioned, a weakness of protostome and deuterostome LSU gene sequences is that
they are susceptible to nonstationarity of base frequencies. By contrast, all the SSU data sets with which we
have dealt were stationary.
Conclusions
The combined LSU 1 SSU rRNA gene sequences
used in this study uphold and strengthen the hypotheses
of Aguinaldo et al. (1997), based on SSU sequences
alone, that protostome animals consist of Ecdysozoa and
Lophotrochozoa (fig. 5). More intriguingly, our admittedly preliminary findings suggest that the LSU 1 SSU
sequences can help resolve the topology within Ecdysozoa and Lophotrochozoa, which is one of the main
goals of animal systematics. For example, the results
suggest an unexpected clade of lophophorates and molluscs. More bilateralian LSU sequences are greatly
needed.
Although additional, independent genes must also
be used, LSU rRNA gene sequences could help to free
systematists who now use SSU from the frustrating cycle that many are experiencing, i.e., a cycle of adding
progressively more SSU sequences without much improvement in taxonomic resolution. Widespread taxonomic sampling of protostome LSU sequences should
begin, and if the findings of the present study are true
indicators, the new studies should emphasize slowly
evolving over rapidly evolving sequences and should
include LogDet-based distance methods for maximum
efficiency. We foresee a time when no rDNA-based phylogenetic study will be considered complete unless it
includes both the LSU and SSU genes.
Acknowledgments
Thanks are extended to James Garey for critiquing
the manuscript and running the Gambit analysis, to Jack
Sullivan for methodological discussions, to Gary Thorgaard for sharing lab facilities, to Kevin Pullen of the
Charles R. Conner Museum of Zoology at Washington
State University for help with the voucher specimens,
to D. T. J. Littlewood for advice on which flatworm to
sequence, to J. M. Turbeville for the Amphiporus SSU
sequence, and to all those who provided specimens.
300
Mallatt and Winchell
LITERATURE CITED
ADOUTTE, A., G. BALAVOINE, N. LARTILLOT, O. LESPINET, B.
PRUD’HOMME, and R. DE ROSA. 2000. The new animal phylogeny: reliability and implications. Proc. Natl. Acad. Sci.
USA 97:4453–4456.
AGUINALDO, A. M. A., J. M. TURBEVILLE, L. S. LINFORD, M.
C. RIVERA, J. R. GAREY, R. A. RAFF, and J. A. LAKE. 1997.
Evidence for a clade of nematodes, arthropods, and other
moulting animals. Nature 387:489–492.
BALAVOINE, G. 1998. Are platyhelminthes coelomates without
a coelom? An argument based on the evolution of Hox
genes. Am. Zool. 38:843–858.
BALAVOINE, G., and A. ADOUTTE. 1998. One or three Cambrian radiations? Science 280:397–398.
BARNES, R. D. 1980. Invertebrate zoology. 4th edition. Saunders, Philadelphia.
BOORE, J. L., and W. M. BROWN. 2000. Mitochondrial genomes of Galathealinum, Helobdella, and Platynereis: sequence and gene arrangement comparisons indicate that pogonophora is not a phylum and annelida and arthropoda are
not sister taxa. Mol. Biol. Evol. 17:87–106.
BOORE, J. L., and J. L. STATON. 2002. The mitochondrial genome of the sipunculid Phascolopsis gouldii supports its
association with Annelida rather than Mollusca. Mol. Biol.
Evol. (in press).
BONE, Q., H. KAPP, and A. C. PIERROT-BULTS. 1991. The biology of chaetognaths. Oxford University Press, New York.
BRUSCA, R. C., and G. J. BRUSCA. 1990. Invertebrates. Sinauer,
Sunderland, Mass.
CAMERON, C. B., J. R. GAREY, and B. J. SWALLA. 2000. Evolution of the chordate body plan: new insights from phylogenetic analyses of deuterostome phyla. Proc. Natl. Acad.
Sci. USA 97:4469–4474.
CHARLESTON, M. 1998. Spectrum version 2.3.0. Available
from http://taxonomy.zoology.gla.ac.uk/;mac/spectrum/
spectrum.
CONWAY MORRIS, S. 1998. The crucible of creation. The Burgess Shale and the rise of the animals. Oxford University
Press, Oxford.
———. 2000. The Cambrian ‘‘explosion’’: slow-fuse or megatonnage? Proc. Natl. Acad. Sci. USA 97:4426–4429.
CONWAY MORRIS, S., and J. S. PEEL. 1995. Articulated halkieriids from the Lower Cambrian of North Greenland and
their role in early protostome evolution. Philos. Trans. R.
Soc. Lond. B 347:305–358.
DE RIJK, P., Y. VAN DE PEER, I. VAN DEN BROECK, and R. DE
WACHTER. 1995. Evolution according to large ribosomal
subunit RNA. J. Mol. Evol. 41:366–375.
DE ROSA, R., J. K. GRENIER, T. ANDREEVA, C. E. COOK, A.
ADOUTTE, M. AKAM, S. B. CARROLL, and G. BALAVOINE.
1999. Hox genes in brachiopods and priapulids and protostome evolution. Nature 399:772–776.
EERNISSE, D. J. 1997. Arthropod and annelid relationships reexamined. Pp. 43–56 in R. A. Fortey and R. H. Thomas,
eds. Arthropod relationships. The Systematic Association,
Chapman and Hall, London.
EERNISSE, D. J., J. S. ALBERT, and F. E. ANDERSON. 1992.
Annelida and Arthropoda are not sister taxa—a phylogenetic analysis of spiralian metazoan morphology. Syst. Biol.
41:305–330.
FELSENSTEIN, J. 1978. Cases in which parsimony and compatibility methods will be positively misleading. Syst. Zool.
27:401–410.
———. 1985. Confidence limits on phylogenies: an approach
using the bootstrap. Evolution 17:368–376.
FOSTER, P. G., and D. A. HICKEY. 1999. Compositional bias
may affect both DNA-based and protein-based phylogenetic
reconstructions. J. Mol. Evol. 48:284–290.
GAREY, J. R. 2001. Ecdysozoa: the relationship between Cycloneuralia and Panarthropoda. Zool. Anz. 240:321–330.
GAREY, J. R., and A. SCHMIDT-RHAESA. 1998. The essential
role of ‘‘minor’’ phyla in molecular studies of animal evolution. Am. Zool. 38:907–917.
GIRIBET, G., D. L. DISTEL, M. POLZ, W. STERRER, and W. C.
WHEELER. 2000. Triploblastic relationships with emphasis
on the acoelomates and the position of Gnathostomulida,
Cycliophora, Plathelminthes, and Chaetognatha: a combined approach of 18S rDNA sequences and morphology.
Syst. Biol. 49:539–562.
GUTELL, R. R. 1994. Collection of small subunit (16S- and
16S-like) ribosomal RNA structures. Nucleic Acids Res. 22:
3502–3507.
HALANYCH, K. M. 1996. Testing hypotheses of chaetognath
origins: long branches revealed by 18S ribosomal DNA.
Syst. Biol. 45:223–246.
———. 1998. Considerations for reconstructing metazoan history: signal, resolution, and hypothesis testing. Am. Zool.
38:929–941.
HALANYCH, K. M., J. D. BACHELLER, A. M. A. AGUINALDO,
S. M. LIVA, D. M. HILLIS, and J. A. LAKE. 1995. Evidence
from 18S ribosomal DNA that the lophophorates are protostome animals. Science 267:1641–1643.
HASSOUNA, N., B. MICHOT, and J.-P. BACHELLERIE. 1984. The
complete nucleotide sequence of mouse 28S rRNA gene:
implications for the process of size increase of the large
subunit rRNA in higher eukaryotes. Nucleic Acids Res. 12:
3563–3583.
HAUSDORF, B. 2000. Early evolution of the bilateralia. Syst.
Biol. 49:130–142.
HILLIS, D. M., and M. T. DIXON. 1991. Ribosomal DNA: molecular evolution and phylogenetic inference. Q. Rev. Biol.
66:411–453.
HYMAN, L. H. 1959. The invertebrates, Vol. 5. Smaller coelomate groups. McGraw-Hill, New York.
JARMEN, S. N., S. NICOL, N. G. ELLIOTT, and A. MCMINN.
2000. 28S rDNA evolution in the eumalacostraca and the
phylogenetic position of krill. Mol. Phylogenet. Evol. 17:
26–36.
JENSEN, D. J. 1988. Hubrecht, Macfarlane, Jensen and Willmer:
on the nature and testability of four versions of the nemertean theory of vertebrate origins. Hydrobiologia 156:99–
104.
KIM, J. 1998. Large-scale phylogenies and measuring the performance of phylogenetic estimators. Syst. Biol. 47:43–60.
LAKE, J. A. 1994. Reconstructing evolutionary trees from DNA
and protein sequences: paralinear distances. Proc. Natl.
Acad. Sci. USA 91:1455–1459.
———. 1995. Calculating the probability of multitaxon evolutionary trees: bootstrappers gambit. Proc. Natl. Acad. Sci.
USA 92:9662–9666.
LENTO, G. M., R. E. HICKSON, G. K. CHAMBERS, and D. PENNY. 1995. Use of spectral analysis to test hypotheses on the
origin of pinnipeds. Mol. Biol. Evol. 12:28–52.
LITTLEWOOD, D. T. J. 1994. Molecular phylogenetics of cupped
oysters based on partial 28S rRNA gene sequences. Mol.
Phylogenet. Evol. 3:221–229.
LITTLEWOOD, D. T. J., M. J. TELFORD, K. A. CLOUGH, and K.
ROHDE. 1998. Gnathostomulida—an enigmatic metazoan
phylum from both morphological and molecular perspectives. Mol. Phylogenet. Evol. 9:72–79.
LITVAITIS, M. K., J. W. BATES, W. D. HOPE, and T. MOENS.
2000. Inferring a classification of the Adenophorea (Nem-
Phylogeny of Protostomes
atoda) from nucleotide sequences of the D3 expansion segment (26/28S rDNA). Can. J. Zool. 78:911–922.
LOCKHART, P. J., M. A. STEEL, M. D. HENDY, and D. PENNY.
1994. Recovering evolutionary trees under a more realistic
model of sequence evolution. Mol. Biol. Evol. 11:605–612.
MALLATT, J., and J. SULLIVAN. 1998. 28S and 18S rDNA sequences support the monophyly of lampreys and hagfishes.
Mol. Biol. Evol. 15:1706–1718.
MALLATT, J., J. SULLIVAN, and C. J. WINCHELL. 2001. The
relationships of lampreys to hagfishes: a spectral analysis
of ribosomal DNA sequences. Pp. 106–118 in P. AHLBERG,
ed. Major events in early vertebrate evolution: palaeontology, phylogeny, and development. Taylor and Francis,
London.
MANUEL, M., M. KRUSE, W. E. G. MULLER, and Y. LE PARCO.
2000. The comparison of beta-thymosin homologues among
metazoa supports an arthropod-nematode link. J. Mol. Evol.
51:378–381.
MCHUGH, D. 1997. Molecular evidence that echiurans and pogonophorans are derived annelids. Proc. Natl. Acad. Sci.
USA 94:8006–8009.
———. 2000. Molecular phylogeny of the annelida. Can. J.
Zool. 78:1873–1884.
MEDINA, M., A. G. COLLINS, J. D. SILBERMAN, and M. L. SOGIN. 2001. Evaluating hypotheses of basal animal phylogeny using complete sequences of large and small subunit
rRNA. Proc. Natl. Acad. Sci. USA 98:9707–9712.
MEGLITSCH, P. A., and F. R. SCHRAM. 1991. Invertebrate zoology. 3rd edition. Oxford University Press, New York.
MITCHELL, A., C. MITTER, and J. C. REGIER. 2000. More taxa
or more characters revisited: combining data from nuclear
protein-encoding genes for phylogenetic analysis of Noctuoidea (Insecta: Lepidoptera). Syst. Biol. 49:202–224.
NIELSEN, C. 1995. Animal evolution, interrelationships of the
living phyla. Oxford University Press, Oxford, U.K.
PECHENIK, J. A. 1985. Biology of the invertebrates. Prindle,
Weber & Schmidt, Boston, Mass.
PENNY, D., M. HASEGAWA, P. J. WADDELL, and M. D. HENDY.
1999. Mammalian evolution: timing and implications from
using the LogDeterminant transform for proteins of differing amino acid composition. Syst. Biol. 48:76–93.
PETERSON, K. J., R. A. CAMERON, and E. H. DAVIDSON. 2000.
Bilaterian origins: significance of new experimental observations. Dev. Biol. 219:1–17.
PETERSON, K. J., and D. J. EERNISSE. 2001. Animal phylogeny
and the ancestry of bilaterians: inferences from morphology
and 18S rDNA gene sequences. Evol. Dev. 3:170–205.
REINDL, S., W. SALVENMOSER, and G. HASZPRUNAR. 1995.
Fine structural and immunocytochemical investigations of
the caeca of Argyrotheca cordata and Argyrotheca cuneata
(Brachiopoda, Terebratulida, Terebratellacea). J. Submicrosc. Cytol. Pathol. 27:543–556.
RUPPERT, E. E. 1991. Introduction to the aschelminth phyla: a
consideration of mesoderm, body cavities, and cuticle. Pp.
1–17 in F. W. HARRISON and E. E. RUPPERT, eds. Microscopic anatomy of invertebrates, Vol. 4. Wiley-Liss, New
York.
SAITO, M., S. KOJIMA, and K. ENDO. 2000. Mitochondrial COI
sequences of brachiopods: genetic code shared with protostomes and limits of utility for phylogenetic reconstruction.
Mol. Phylogenet. Evol. 15:331–344.
301
SCHMIDT-RHAESA, A. 1998. Phylogenetic relationships of the
nematomorpha—a discussion of current hypotheses. Zool.
Anz. 236:203–216.
SCHMIDT-RHAESA, A., T. BARTOLOMAEUS, C. LEMBURG, U.
EHLERS, and J. R. GAREY. 1998. The position of the arthropoda in the phylogenetic system. J. Morphol. 238:263–
285.
SCHNARE, M. N., S. H. DAMBERGER, M. W. GRAY, and R.
GUTELL. 1996. Comprehensive comparison of structural
characteristics in eukaryotic cytoplasmic large subunit
(23S-like) ribosomal RNA. J. Mol. Biol. 256:701–719.
SMITH, S. W., R. OVERBEEK, C. R. WOESE, W. GILBERT, and
P. M. GILLEVET. 1994. The genetic data environment: an
expandable GUI for multiple sequence analysis. CABIOS
10:671–675.
SULLIVAN, J. 1996. Combining data with different distributions
of among-site rate variation. Syst. Biol. 45:375–380.
SUNDBERG, P., J. M. TURBEVILLE, and M. S. HÄRLIN. 1998.
There is no support for Jensen’s hypothesis of nemerteans
as ancestors to the vertebrates. Hydrobiologia 365:47–54.
SWOFFORD, D. L. 2000. PAUP*: phylogenetic analysis using
parsimony (* and other methods). Version 4.0 beta 4a. Sinauer Associates, Sunderland, Mass.
SWOFFORD, D. L., G. J. OLSEN, P. J. WADDELL, and D. M.
HILLIS. 1996. Phylogenetic inference. Pp. 407–514 in D. M.
HILLIS, C. MORITZ, and B. K. MABLE, eds. Molecular systematics, 2nd edition. Sinauer, Sunderland, Mass.
VALENTINE, J. W. 1997. Cleavage patterns and the topology of
the metazoan tree of life. Proc. Natl. Acad. Sci. USA 94:
8001–8005.
VAN DE PEER, Y., S. L. BALDOUF, W. F. DOOLITTLE, and A.
MEYER. 2000. An updated and comprehensive rRNA phylogeny of (crown) eukaryotes based on rate-calibrated evolutionary distances. J. Mol. Evol. 51:565–576.
WADDELL, P. J., D. PENNY, and T. MOORE. 1997. Hadamard
conjugations and modeling sequence evolution with unequal rates across sites. Mol. Phylogenet. Evol. 8:398–414.
WAGELE, J. W., T. ERIKSON, P. LOCKHART, and B. MISOF. 1999.
The Ecdysozoa: artifact or monophylum? J. Zool. Syst.
Evol. Res. 37:211–223.
WINCHELL, C. J. 2002. Combining large- and small-subunit
ribosomal RNA genes for phylogenetic comparison: robust
results for the deuterostome animals and the elasmobranch
fishes. M.S. thesis, Washington State University, Pullman.
WINCHELL, C. J., J. SULLIVAN, C. B. CAMERON, B. J. SWALLA,
and J. MALLATT. 2001. Evaluating hypotheses of deuterostome evolution with new LSU and SSU ribosomal DNA
phylogenies. Mol. Biol. Evol. (in press).
WINNEPENNINCKX, B., T. BACKELJAU, and R. DE WACHTER.
1995. Phylogeny of protostome worms derived from 18S
rRNA sequences. Mol. Biol. Evol. 12:641–649.
WINNEPENNINCKX, B. M. H., Y. VAN DE PEER, and T. BACKELJAU. 1998. Metazoan relationships on the basis of 18S
rRNA sequences: a few years later. . . Am. Zool. 38:888–
906.
ZRZAVY, J., S. MIHULKA, P. KEPKA, A. BEZDEK, and D. TIETZ.
1998. Phylogeny of the Metazoa based on morphological
and 18S ribosomal DNA evidence. Cladistics 14:249–285.
RICHARD THOMAS, reviewing editor
Accepted November 21, 2001