ARTICLE IN PRESS Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx www.elsevier.com/locate/ympev Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata David Moreira a,¤, Sophie von der Heyden b,1, David Bass b, PuriWcación López-García a, Ema Chao b, Thomas Cavalier-Smith b a Unité d’Ecologie, Systématique et Evolution, UMR CNRS 8079, Université Paris-Sud, Bâtiment 360, 91405 Orsay Cedex, France b Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK Received 26 July 2006; revised 11 October 2006; accepted 2 November 2006 Abstract Resolution of the phylogenetic relationships among the major eukaryotic groups is one of the most important problems in evolutionary biology that is still only partially solved. This task was initially addressed using a single marker, the small-subunit ribosomal DNA (SSU rDNA), although in recent years it has been shown that it does not contain enough phylogenetic information to robustly resolve global eukaryotic phylogeny. This has prompted the use of multi-gene analyses, especially in the form of long concatenations of numerous conserved protein sequences. However, this approach is severely limited by the small number of taxa for which such a large number of protein sequences is available today. We have explored the alternative approach of using only two markers but a large taxonomic sampling, by analysing a combination of SSU and large-subunit (LSU) rDNA sequences. This strategy allows also the incorporation of sequences from non-cultivated protists, e.g., Radiozoa ( D radiolaria minus Phaeodarea). We provide the Wrst LSU rRNA sequences for Heliozoa, Apusozoa (both Apusomonadida and Ancyromonadida), Cercozoa and Radiozoa. Our Bayesian and maximum likelihood analyses for 91 eukaryotic combined SSU+LSU sequences yielded much stronger support than hitherto for the supergroup Rhizaria (Cercozoa plus Radiozoa plus Foraminifera) and several well-recognised groups and also for other problematic clades, such as the Retaria (Radiozoa plus Foraminifera) and, with more moderate support, the Excavata. Within opisthokonts, the combined tree strongly conWrms that the Wlose amoebae Nuclearia are sisters to Fungi whereas other Choanozoa are sisters to animals. The position of some bikont taxa, notably Heliozoa and Apusozoa, remains unresolved. However, our combined trees suggest a more deeply diverging position for Ancyromonas, and perhaps also Apusomonas, than for other bikonts, suggesting that apusozoan zooXagellates may be central for understanding the early evolution of this huge eukaryotic group. Multiple protein sequences will be needed fully to resolve basal bikont phylogeny. Nonetheless, our results suggest that combined SSU+LSU rDNA phylogenies can help to resolve several ambiguous regions of the eukaryotic tree and identify key taxa for subsequent multi-gene analyses. © 2006 Published by Elsevier Inc. Keywords: SSU rDNA; LSU rDNA; Eukaryotic phylogeny; Combined analysis; Rhizaria; Retaria; Excavata 1. Introduction For nearly two decades, eukaryote phylogeny has been studied using SSU rDNA2 as preferred, almost exclusive, * Corresponding author. Fax: +33 1 6915 4697. E-mail address: [email protected] (D. Moreira). 1 Present address: Department of Botany and Zoology, University of Stellenbosch, Private Bag X1, Matieland 7602, South Africa. 2 Abbreviations used: BP; bootstrap proportion; LSU rDNA; large-subunit ribosomal DNA; ML; maximum likelihood; PP; posterior probability; SSU rDNA; small-subunit ribosomal DNA. 1055-7903/$ - see front matter © 2006 Published by Elsevier Inc. doi:10.1016/j.ympev.2006.11.001 Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS 2 D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx marker (for a historical overview see Cavalier-Smith, 2002; Philippe and Adoutte, 1998; Roger, 1999). Although some of the earliest eukaryotic molecular phylogenies were constructed using partial LSU rDNA sequences (Baroin et al., 1988; Perasso et al., 1989), this marker was curiously soon abandoned under the overwhelming impetus given to the use of SSU rDNA (Sogin, 1991). Nevertheless, in recent years a number of studies have demonstrated that any individual marker, including SSU rRNA, has a series of inherent problems that can engender artefactual results when inferring the global phylogeny of eukaryotes. Among them are an insuYcient number of informative sites, mutational saturation, and uneven evolutionary rates across sites and taxa (Philippe and Adoutte, 1998; Philippe et al., 2000; Stiller and Hall, 1999). This has prompted eVorts to apply a total-evidence approach to the reconstruction of eukaryotic phylogeny by combining diVerent markers into one analysis by direct concatenation (Baldauf et al., 2000; Moreira et al., 2000; Rodríguez-Ezpeleta et al., 2005) or likelihood summation (Bapteste et al., 2002). This has provided strong support for groups whose monophyly was suspected, such as Plantae (Moreira et al., 2000; Rodríguez-Ezpeleta et al., 2005) and Conosa (Bapteste et al., 2002), as well as an improved resolution of internal metazoan phylogeny (Philippe et al., 2005). A major limitation of this approach is the very reduced taxonomic sampling for which a large collection of gene sequences is available, so that many important phyla are not represented. This problem has been addressed in diVerent ways, one of the most eVective being the massive sequencing of expressed sequence tags (EST) from diVerent protist species (Bapteste et al., 2002; de Koning et al., 2005; Rodríguez-Ezpeleta et al., 2005). Nevertheless, we are still very far from a comprehensive representation of eukaryotic diversity. In addition, this approach is diYcult to apply to organisms that cannot be maintained in pure, or nearly pure, culture or that resist the extraction of high quality mRNA. Finally, the computational burden of analysing huge multi-gene data sets increases substantially with the addition of more taxa. The diYcult trade-oV between the merits of a large amount of sequence information for each taxon and a wide taxonomic sampling is crucial to the quest for a robust eukaryotic phylogeny. From a theoretical point of view, an alternative to the analysis of increasingly longer sequences for a relatively limited taxon sampling is the use of a larger taxonomic diversity, even if the number of sites remains relatively small. In fact, several simulation studies have shown that in many cases the addition of taxa can be more beneWcial than the addition of sites to increase the accuracy of the phylogenetic tree, even for complex cases prone to be aVected by artefacts such as long-branch attraction (LBA) (Graybeal, 1998; Hillis, 1996). For the historical reason mentioned above and its easy retrieval (by PCR and direct sequencing), SSU rDNA is the marker with the richest taxonomic representation available today. It is therefore a good candidate to take part in a combination of few genes for a wide taxon collection. We have decided to associate it with a second marker that has been relatively little explored, the LSU rDNA, considering that both markers may reasonably be accommodated within a single model of sequence evolution for a combined analysis. LSU rDNA oVers the additional advantage of being generally adjacent to SSU rDNA in the genome so that both markers can be easily retrieved from uncultured protists from metagenomic analyses (López-García et al., 2002). With that aim, we have signiWcantly improved the taxonomic sampling of LSU rDNA, providing 14 new sequences, enabling us to include six key eukaryotic groups on LSU trees for the Wrst time: Cercozoa, Radiozoa ( D classical Radiolaria minus Phaeodarea, which appear to be Cercozoa: Polet et al., 2004; Yuasa et al., 2006), centrohelid Heliozoa and an unknown micro-heliozoan, Apusomonadida, Ancyromonadida and Jakobea. Maximum likelihood (ML) and, especially, Bayesian analyses of SSU+LSU rDNA data sets yielded phylogenetic trees with much better resolution for a wide diversity of eukaryotic groups than in the corresponding separate SSU and LSU rDNA analyses. These trees provide support for several groups (e.g., Conosa, Excavata) that is comparable to that produced by very large amino acid data sets (Bapteste et al., 2002; Hampl et al., 2005). We also retrieved intriguing results, in particular strong support for the monophyly of the Retaria (Radiozoa and Foraminifera) and indications of a deep divergence of Ancyromonas (Apusozoa) from other bikonts. These results suggest that the combination of both rDNA sequences may provide a valuable marker for reconstructing global eukaryote phylogeny, with the advantage of the usually easy acquisition of new LSU rDNA sequences and the already extremely rich SSU rDNA data base. Combined SSU+LSU rDNA trees can be particularly useful for identifying key taxa for subsequent EST projects and multi-gene analyses. 2. Materials and methods 2.1. Protist cultures and DNA extraction We used culture conditions already described to grow the two centrohelid heliozoa Sphaerastrum fockii, Pterocystis sp. (von der Heyden, 2004) and an unknown marine “microheliozoan” (Cavalier-Smith and Chao, 2003a), the two cryptomonads Cryptomonas ( D Chilomonas) paramecium and Goniomonas sp. ATCC 50108 (von der Heyden et al., 2004), the four cercozoans Paracercomonas marina Cavalier-Smith and Bass ATCC 50344 (formerly misidentiWed as Cercomonas longicauda: see Karpov et al., 2006), Cercomonas sp. strain C-59, Dimorpha-like sp. ATCC 50522 and Thaumatomastix sp. ATCC 50250 (Bass and CavalierSmith, 2004; Cavalier-Smith and Chao, 2003b), and the two apusozoans Apusomonas proboscidea and Ancyromonas ‘sigmoides’ ATCC 50267 (Cavalier-Smith and Chao, 2003c). Cells were collected and DNA puriWed as described previously (Cavalier-Smith and Chao, 1995). Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx 2.2. Environmental samples and fosmid libraries A metagenomic library of large DNA fragments (»35 kbp) was constructed from sea plankton collected from mesopelagic (500 m) Antarctic water, after Wltration to retain only the 0.2–5 m biomass fraction. Details on this library and the screening for fosmid clones with SSU rDNA genes are in López-García et al. (2002). The two clones (KW16 and HA2) containing SSU rDNA genes related to radiolaria were examined also for the presence of LSU rDNA genes (see below). 2.3. LSU rDNA ampliWcation and sequencing For both the protist DNA and the environmental fosmid clones, the LSU rDNA genes were ampliWed by PCR using speciWc primers 28S-1F (ACCCGCTGAATTT AAGCAT) and 28S-4R (TTCTGACTTAGAGGCGTTC AG). PCRs were carried out under the following conditions: 30 cycles (denaturation at 94 °C for 15 s, annealing at 55 °C for 30 s, extension at 72 °C for 2 min) preceded by 2 min denaturation at 94 °C, and followed by 10 min extension at 72 °C. The amplicons were directly sequenced using the same primers and a set of internal primers, including 28S-568F (TTGAAACACGGACCAAGGAG), 28S-2F (GCAGATCTTGGTGGTAG), 28S-1611R (CTTGGAS ACCTGMTGCGG), and 28S-3R (CACCTTGGAGACC TGCT). All sequences have been submitted to GenBank under Accession Nos. AY752987–AY752993 and DQ386164–DQ386169. 2.4. Reclinomonas americana LSU rDNA sequence assembly A set of partial LSU rDNA sequences (»400–800 bp) of the jakobid R. americana was generously provided by Drs. B. Franz Lang, Gertraud Burger and Michael W. Gray. These sequences were obtained through the Protist EST Program (PEP: http://megasun.bch.umontreal.ca/pepdb/ pep_main.html). The partial sequences were assembled into a nearly complete LSU rDNA sequence manually. 3 available sequences were done by neighbour-joining (NJ) using MUST (Philippe, 1993). This helped to identify the slowest evolving species (those with the shortest branches) within each taxonomic group, which were subsequently analysed in more detail using maximum likelihood (ML) and Bayesian methods. For some groups (in particular Conosa and Excavata), most species showed long branches, so that we were constrained to include them in our analyses. The conserved positions of the alignment used for phylogenetic reconstruction were chosen using GBLOCKS (Castresana, 2000). For both the ML and Bayesian methods we applied a general time reversible (GTR) model of nucleotide substitution, with a six-category discrete approximation of a distribution plus invariable sites (GTR++I model), which was selected using MODELTEST (Posada and Crandall, 1998) as the best of 56 testable substitution models. ML trees were reconstructed with PHYML v. 2.4.4 (Guindon and Gascuel, 2003) and bootstrapped using 1000 replicates. Bayesian trees were inferred using MrBAYES 3 (Ronquist and Huelsenbeck, 2003). All the Markov chain Monte Carlo searches were run with 4 chains for 1,000,000 generations, with trees being sampled every 100 generations. The Wrst 2000 trees were discarded as “burnin”, keeping only trees generated well after those parameters stabilised. Stabilisation of the chain parameters (tree likelihood, shape parameter and proportion of invariant sites) was studied with the program TRACER (Rambaut and Drummond, 2003). Additional Bayesian trees were also reconstructed using other model speciWcations: GTR++I with allowance for covarions, GTR++I with intersite rate autocorrelation, and GTR++I with covarions and intersite rate autocorrelation. For each data set and model speciWcation three independent runs were carried out and the tree topologies obtained were compared to check whether they were congruent or not using the Approximately Unbiased (AU) test (Shimodaira, 2002) implemented in the program CONSEL (Shimodaira and Hasegawa, 2001). Sequence alignments are available upon request. 3. Results and discussion 2.5. Data set construction 3.1. The eukaryotic LSU rDNA tree The 14 new sequences were added to a data set of LSU rDNA sequences (»350) retrieved from GenBank (http:// ncbi.nlm.nih.gov/) and aligned using the ED program of the package MUST (Philippe, 1993). A SSU+LSU rDNA data set containing 110 taxa was constructed by concatenation of the aligned SSU and LSU rDNA sequences from each species. When both markers were not available for the same species, we concatenated the sequences from two closely related species (see Supplementary material Table 1). We have substantially improved the broad-scale taxonomic sampling of the LSU rDNA with 14 new sequences, especially for groups of uncertain phylogenetic aYnity: Cryptomonada, Centrohelida, Ancyromonadida, Apusomonadida, Cercozoa and Radiozoa. Bayesian analysis of a LSU rDNA data set (91 taxa, 1513 sites) yields a tree that supports with strong posterior probability (PP D 1) the monophyly of major clades such as opisthokonts, Conosa and Rhizaria (Fig. 1). In general, support for the diVerent groups is stronger in the LSU rDNA tree than in the corresponding SSU rDNA tree (see Supplementary material Fig. 1). This is particularly noticeable for Conosa (PP D 1 vs. 0.63) and Heterokonta (PP D 1 vs. 0.72), whereas other groups, such as Excavata, are polyphyletic in the SSU 2.6. Phylogenetic analysis Preliminary phylogenetic analyses of the individual LSU and concatenated SSU+LSU rDNA data sets including all Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS 4 D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx Fig. 1. Unrooted Bayesian phylogenetic tree constructed with an LSU rDNA data set (91 taxa, 1513 sites; GTR++I plus covarion model). New sequences obtained in this work are highlighted in bold. Several long branches were shortened to one fourth of their actual length (labelled 1/4). Numbers at nodes are posterior probabilities and ML bootstrap proportions (only those higher than 0.50 and 50%, respectively, are shown). Other evidence (Richards and Cavalier-Smith, 2005) indicates that the root of the eukaryote tree lies between unikonts (opisthokonts plus Amoebozoa) and bikonts (all other eukaryotes, including kingdoms Plantae and Chromista). However, as unikonts do not appear monophyletic on this tree or Fig. 2, both are arbitrarily drawn as if rooted between opisthokonts and other eukaryotes. Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx rDNA tree but monophyletic in the LSU tree. We do not Wnd any case where SSU rDNA provides much stronger support than LSU rDNA for a given node. In all our trees, posterior probabilities are higher than the corresponding ML bootstrap proportions (ML BP), in agreement with the general trend observed in the comparisons between these two measures of statistical support (Douady et al., 2003; Erixon et al., 2003; Suzuki et al., 2002). The SSU and LSU rDNA trees diVer in the position of several taxa, but all of these concern weakly supported nodes (PP < 0.80). In fact, an approximately unbiased (AU) test (Shimodaira, 2002) shows that the two tree topologies are not signiWcantly diVerent (P D 0.78). To maximise the phylogenetic information, we therefore also analysed a concatenation of both markers. We considered that both could be accommodated in a single concatenated analysis with the same substitution model, GTR++I, since the two exhibit similar values for several model parameters: similar nucleotide frequencies, parameter values of the distribution of 0.53 and 0.54, and proportion of invariant sites of 0.13 and 0.16 for the SSU and LSU rDNA, respectively. That homogeneity could help circumvent problems encountered when markers with signiWcantly diVerent substitution patterns are analysed together (Bapteste et al., 2002; Pupko et al., 2002). 3.2. The eukaryotic SSU+LSU rDNA tree Concatenation of SSU and LSU rDNA sequences allows the use of over 2500 well conserved sites for phylogenetic reconstruction, excluding all columns that contained a gap in more than one taxon (the exact number depends on the set of species analysed). For a few taxa (Didymium sp., rotaliid foraminifer, Fucus sp., Mallomonas sp., Goniomonas sp. and Spathidium sp.), we concatenated SSU and LSU rDNA sequences from two closely related species because both gene sequences were not available for exactly the same species. To study the impact of the taxonomic sampling, we constructed phylogenetic trees using a variety of species combinations. Fig. 2 shows an unrooted Bayesian tree with the largest representation of eukaryotic phyla available for these markers (91 taxa, 2574 sites). Several deep nodes of this SSU+LSU rDNA tree show only moderate support, which indicates that the concatenation of the two markers is not enough to obtain a global eukaryotic phylogeny with a good resolution of all nodes. Nevertheless, as in the individual LSU rDNA tree, various eukaryotic supergroups are supported by strong posterior probabilities (PP D 1), in particular opisthokonts, Conosa and Rhizaria. The internal phylogeny of opisthokonts shows several interesting relationships. The nucleariid Wlose amoeba Nuclearia simplex branches robustly as sister taxon to the Fungi (PP D 1, ML BP D 95). This result conWrms, with a larger taxonomic sampling, a previous analysis of opisthokont phylogeny also based on concatenated SSU+LSU rDNA sequences (Medina et al., 2003) and refuted a putative aYnity with choanoXagellates based on 18S rRNA 5 alone (Cavalier-Smith, 2000). The sisterhood of fungi and nucleariid amoebae also agrees with a recent multi-protein sequence analysis (elongation factor 1, actin, -tubulin and heat shock protein 70) of opisthokont internal relationships (Steenkamp et al., 2005). Our phylogenetic trees also agree with this multi-protein analysis for the position of Ichthyosporea, which emerge within a clade also including Metazoa ( D Animalia) and ChoanoXagellatea (Steenkamp et al., 2005). This position is also retrieved by phylogenetic analyses of a data set of concatenated proteins encoded by mitochondrial DNA (Lang et al., 2002). In our concatenated SSU+LSU rDNA trees Ichthyosporea (represented by Ichthyophonus hoferi) branch robustly (PP D 1, ML BP D 99) at that position (Fig. 2). This relationship is also retrieved using ML tree reconstruction, and it is not aVected by the use of diVerent combinations of metazoan sequences, nor even by the inclusion of the fast evolving sequences characteristic of bilaterian animals (not shown). A second species frequently classiWed as an ichthyosporean, Capsaspora owczarzaki (Hertel et al., 2002), branches strongly (PP D 1, ML BP D 86) as sister to choanoXagellates in our concatenated analysis. This agrees with a recent analysis of combined SSU+LSU rDNA+actin sequences (Ruiz-Trillo et al., 2004), although in that analysis the clade comprising C. owczarzaki, one ichthyosporean and one choanoXagellate was sister to animals only when the sequence evolution model took covarions into account. As mentioned above, our analyses retrieve a more stable topology, with C. owczarzaki always grouping with choanoXagellates in a clade sister to animals independently of the sequence evolution model applied. This increased stability could be due to the inclusion of a second choanoXagellate species in our analyses. Moreover, AU tests carried out with the SSU+LSU data set show that topologies where the Ichthyosporea, C. owczarzaki and the ChoanoXagellatea are constrained to branch as sisters of N. simplex + Fungi are signiWcantly worse (P < 0.05). The aYnity of C. owczarzaki with choanoXagellates appears to be supported mostly by the LSU rDNA since, in individual analyses (Fig. 1), this marker supports this relationship strongly (PP D 0.98, ML BP D 87), whereas the SSU rDNA yields trees where C. owczarzaki is sister of I. hoferi with weaker support (PP D 0.73, ML BP D 54; Supplementary material Fig. 1). Increasing the sampling of LSU rDNA sequences for other nucleariid and ichthyosporean species, as well as for other opisthokonts of uncertain aYnity, such as ministeriids, corallochytreans and Eccrinales (Cafaro, 2005; Cavalier-Smith and Chao, 2003c; Steenkamp et al., 2005), will help test this possible reorganisation of opisthokont phylogeny. Conosa, one of the two subphyla of phylum Amoebozoa (Cavalier-Smith, 2004), is another group well supported by the SSU+LSU rDNA analyses (Fig. 2). The two conosan subgroups Archamoebae (represented by species of the genera Phreatamoeba and Entamoeba) and Mycetozoa (Dictyostelium, Didymium and Physarum) emerge in all Bayesian trees as sisters with strong support (PP D 1), which is weaker for the ML analyses (ML BP D 65). Inspection of the bootstrapped Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS 6 D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx Fig. 2. Unrooted Bayesian phylogenetic tree constructed with a SSU+LSU rDNA data set (91 taxa, 2574 sites; GTR++I plus covarion model). Taxa for which the SSU and LSU rDNA sequences belonged to diVerent but related species are labelled as ‘composite’ (see Supplementary Material Table 1 for details). Several long branches were shortened to one fourth of their actual length (labelled 1/4). Triangles indicate the alternative positions for the species A. sigmoides that were examined using the AU test. Numbers at nodes are posterior probabilities and ML bootstrap proportions (only those higher than 0.50 and 50%, respectively, are shown). ML trees reveals that this is due to the branching of the foraminiferan sequence within the Conosa in a proportion of these trees (not shown). Good support is also found for the sisterhood of Alveolata and Heterokonta (Fig. 2), both by the Bayesian (PP D0.97) and ML analyses (BP D 81). In contrast, for a number of groups the concatenated SSU+LSU rDNA data set did not improve signiWcantly the results obtained by analysing the genes separately. One case concerned Plantae, which encompass the three primary photosynthetic eukaryotic groups: Glaucophyta, Rhodophyta and Viridaeplantae (Moreira et al., 2000; RodríguezEzpeleta et al., 2005). Plantae emerge mixed with some chromistan groups in our trees; thus neither Plantae nor Chromista is recovered as holophyletic. One possible Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx source of problems could be the biased nucleotide composition of the rhodophyte sequences, as shown by a chi-square test (Schmidt et al., 2002), and their faster evolutionary rate (longer branches) compared with several of the glaucophytes and green plants. The probably spurious attraction between glaucophytes and cryptophytes, which has already been noticed in several SSU rDNA analyses (e.g., CavalierSmith and Chao, 2003a; Nikolaev et al., 2004), appears also to be important in hampering the inference of monophyletic Plantae and Chromista. This attraction seems mostly due to the SSU part of the SSU+LSU rDNA concatenation (see Supplementary material Fig. 1), since the LSU rDNA alone did not support any robust location for these two groups (Fig. 1). We observe a similarly inconstant position for centrohelid heliozoa, which very often branch with weak support close to cryptophytes and glaucophytes (Fig. 2) or as an independent lineage. Even the monophyly of Heliozoa (centrohelids plus the microheliozoan) remains uncertain, since the undetermined marine “microheliozoan” (which is probably not a centrohelid) branches far from the two centrohelids (Pterocystis sp. and Sphaerastrum fockii) in many single-gene trees (Supplementary material Fig. 2), despite being consistently together on the concatenated trees. It appears that the two heliozoan branches group together weakly on 18S rRNA trees when long branch taxa are excluded and more nucleotides used (Cavalier-Smith and Chao, 2003c; Nikolaev et al., 2004), but are separate, with the microheliozoan weakly grouping instead with red algae, when long branch taxa and fewer nucleotides are included (Supplementary Fig. 1 and Cavalier-Smith, 2004). Their grouping together as a weakly supported heliozoan clade on the concatenated trees that include long branch taxa (Fig. 2), and with distinctly stronger support when long branch taxa are excluded and over 900 more nucleotides used (Supplementary Fig. 2), suggests that the relationship is real, but additional gene sequences are required to demonstrate it convincingly. Neither our combined SSU+LSU rDNA analyses nor the Wrst protein phylogenies including centrohelid sequences (Nikolaev et al., 2004; Sakaguchi et al., 2005) have resolved the phylogenetic position of this group within bikonts. Albeit with only moderate support (PP D 0.72, ML BP D 67), our combined tree did reproducibly recover the sister relationship between Rhodophyta and Viridaeplantae, which is only occasionally found on 18S rRNA trees and was absent from a recent tree using 143 nuclear proteins (Rodríguez-Ezpeleta et al., 2005), probably because of the very long rhodophyte branch, but is clearly supported by a shared gene replacement of fructose bis-phosphate aldolase (Patron et al., 2004). We also used the SSU+LSU rDNA data set to investigate the phylogeny of Excavata, represented in our analyses by jakobid, diplomonad, Trichomonas and euglenozoan sequences. This was a diYcult analysis because the long branches of most excavates were prone to be attracted by those of Amoebozoa. Data sets excluding Amoebozoa sup- 7 port the monophyly of Excavata with strong PP (1) but weak ML BP (56) (not shown). When Amoebozoa are included, excavates still emerge as a monophyletic group though with weaker support (PP D 0.87, ML BP D 17) (Fig. 2). This suggests that the SSU+LSU rDNA data set contains just enough phylogenetic information to support the monophyly of excavates even if other fast evolving groups are included. However, only the Bayesian analysis is able to retrieve this result, whereas the ML phylogenetic trees yield a mix of all the long branching amoebozoa and excavate sequences within an unsupported (ML BP D 28) group (not shown). Within excavates, our analyses strongly support (PP D 1, ML BP D 85) the monophyly of the phylum Metamonada (Cavalier-Smith, 2003a), represented by several diplomonads and the parabasalian Trichomonas vaginalis (Fig. 2). This agrees with a recent multi-gene (six nuclear protein-coding genes) phylogeny, which also conWrms the inclusion of Carpediemonas within Metamonada (Simpson et al., 2006). In our SSU+LSU rDNA trees metamonads branch weakly (PP D 0.64, ML BP D 32) as sisters of Euglenozoa, and the jakobid Reclinomonas americana occupies a sister position to all the other excavates, whereas, in that multi-gene analysis, Euglenozoa were sisters of Reclinomonas and Heterolobosea (Naegleria) with quite robust support (ML BP D 85). Interestingly, the support in our analyses for the monophyly of all Excavata is comparable to that found in another recent multi-marker analysis (SSU rDNA + 8 protein genes) which, in contrast to our data sets, did not include any other long-branching taxa that might interfere with the long branches of the excavate species (Hampl et al., 2005). 3.3. The monophyly of Rhizaria and the position of Foraminifera: evidence for the Retaria? The determination of new LSU rDNA sequences from several cercozoan species and two environmental clones related to the radiozoan groups Taxopodida and Polycystinea allows testing the reliability of grouping these clades, together with Foraminifera, within the infrakingdom Rhizaria (Cavalier-Smith, 2002; Nikolaev et al., 2004). In the recent past, SSU rDNA analyses have provided contradictory results and variable support for this supergroup, especially because of the extremely fast evolutionary rate of the foraminiferan sequences that induced LBA artefacts. Thus, some ML analyses (which did not include radiolarian sequences) were able to retrieve a relationship between Foraminifera and Cercozoa only when other fast-evolving taxa were removed (Berney and Pawlowski, 2003). Nevertheless, more recent analyses (both including radiolarian sequences) did retrieve the monophyly of Rhizaria, even in the presence of several fast-evolving groups, but with only moderate (ML BP D 72; Nikolaev et al., 2004) or very low (distance ME BP D 20: Cavalier-Smith and Chao, 2003c) bootstrap values. Our Bayesian global eukaryotic phylogenetic trees constructed using both the LSU and SSU+LSU rDNAs Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS 8 D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx provide much stronger support than hitherto for the holophyly of Rhizaria (PP 1). They also support (PP D 0.92–0.97) a clade comprising Foraminifera and Radiozoa (represented in our analyses by environmental polycystinean- and taxopodid-related sequences) within the Rhizaria (Figs. 1 and 2). The monophyly of Rhizaria and the Foraminifera/Radiozoa grouping are also both reasonably supported by our SSU+LSU rDNA ML tree (78% and 75%). This Foraminifera/Radiozoa clade contradicts some previous single-gene trees that suggested that the closest relatives of Foraminifera might be species of the endomyxan cercozoan genus Gromia, which are marine Wlose testate amoebae. Both SSU rDNA and RNA polymerase II phylogenies retrieved a relationship with Gromia, although with only moderate bootstrap support (Berney and Pawlowski, 2003; Longet et al., 2003). However, radiozoan sequences were not included in those analyses; when Radiozoa were included, Foraminifera grouped with polycystines in SSU rRNA distance analyses with strong (Cavalier-Smith and Chao, 2003b) or weak support (Cavalier-Smith and Chao, 2003c). A more recent work incorporating several radiozoan sequences in a supertree analysis combining SSU rDNA and actin phylogenies (Nikolaev et al., 2004) also retrieved sisterhood of Foraminifera and Gromia. However, that was essentially due to the results of the SSU rDNA analysis, which were incompatible with those of the actin phylogeny because Foraminifera showed two deeply divergent actin paralogues, one related to Gromia and the other to the polycystine Radiolaria. Another source of information was provided by the extremely well conserved protein polyubiquitin. One- or two-amino acid insertions have been found in the ubiquitin monomer-monomer junctions of the polyubiquitin sequences of Cercozoa (including Gromia) and Foraminifera (Archibald et al., 2003; Bass et al., 2005). These insertions appear to be absent in the rest of eukaryotes studied, including the polycystine Radiozoa and the Acantharea (Bass et al., 2005). Given the extraordinary degree of evolutionary conservation of polyubiquitin, the insertions shared by Foraminifera and Cercozoa were considered a potentially good character for the sisterhood of these two groups to the exclusion of Radiozoa (Bass et al., 2005). However, the case of Phaeodarea shows that these insertions may be not as stable as previously thought. Although Phaeodarea were considered for a long time to be members of the Radiolaria (Haeckel, 1887), recent phylogenetic analyses based on SSU rDNA sequence comparison have robustly shown that Phaeodarea belong in Cercozoa within the subphylum Filosa (Polet et al., 2004; Yuasa et al., 2006). Nevertheless, their polyubiquitin sequences lack the insertions between the ubiquitin monomers found in other Cercozoa (Bass et al., 2005). If Phaeodarea really branch within Cercozoa, they have lost the insertion, which could therefore also be the case for Radiozoa, making this character unreliable for excluding their possible closer relationship to Foraminifera. We have studied this problem using a SSU+LSU rDNA data set that included a partial 5⬘-end LSU rDNA sequence (1507 bp) of Gromia oviformis (Pawlowski et al., 1994). Addition of this sequence to our concatenated SSU+LSU rDNA data set reduced the number of conserved sites useful for phylogenetic inference (to a Wnal number of »2200, depending on the set of species retained), but the concatenation was still signiWcantly larger than the separated SSU and LSU rDNA data sets. As in the global eukaryotic phylogeny (Fig. 2), a Bayesian phylogenetic tree of all rhizarian sequences plus several slow evolving outgroup species retrieves with strong statistical support (PP D 1, ML BP D 100) the monophyly of Foraminifera plus Radiozoa (Fig. 3A). G. oviformis emerges with equal support at the base of the other cercozoan sequences. The same result is obtained when two additional partial foraminiferan SSU+LSU rDNA sequences are included, despite an even smaller number of usable sites (2077; Fig. 3B). DiVerent combinations of outgroup sequences yield the same results (not shown). However, if Radiozoa are removed from the data sets, Foraminifera turn out to branch with moderate support (PP D 0.87, ML BP D 78) as sisters of G. oviformis (Fig. 3C). Overall these results favour Foraminifera being more closely related to Radiozoa than to Gromia, and suggest that the placement of Foraminifera within the endomyxan Cercozoa, seen in the absence of radiozoan sequences, is artefactual. A speciWc relationship between Foraminifera and Radiozoa was already advanced by Cavalier-Smith (1999), who erected an infrakingdom Retaria to accommodate these two groups (Cavalier-Smith, 1999); later Retaria was reduced in rank to phylum (Cavalier-Smith, 2002), making Radiozoa and Foraminifera subphyla (Cavalier-Smith, 2003a). However, although our results support this, the conXicting evidence in the literature with diVerent taxon samples makes it desirable to verify the monophyly of Retaria using sequences of several conserved proteins from diVerent members of the Rhizaria, and to determine whether Radiozoa are sisters of Foraminifera (Fig. 3A) or ancestral to them (Fig. 3B). This relationship should also be tested by new LSU rRNA sequences from other deeply branching cercozoan groups, such as Haplosporidia and Phytomyxea (Nikolaev et al., 2004; Reece et al., 2004). If retarian monophyly were conWrmed, this would imply that the amino acid insertion in the polyubiquitin sequence was reversed in Radiozoa. However, since all available radiolarian polyubiquitin sequences have been obtained from specimens directly collected from seawater samples, it cannot be excluded that they actually belonged to undetected contaminant organisms (Bass et al., 2005). The possibility that the root of the Rhizaria lies between the monophyletic phyla Retaria and Cercozoa (with Gromia a deeply branching member of the endomyxan Cercozoa) suggests that the ancestral rhizarian might even have been endowed with an organic and/or mineralised test in its trophic phase. Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx 9 Fig. 3. Bayesian phylogenetic trees of the Rhizaria constructed with SSU+LSU rDNA data sets: (A) 9 rhizarian taxa, 2189 sites; (B) 11 rhizarian taxa, 2077 sites; (C) 7 rhizarian taxa, 2186 sites. GTR++I plus covarion model. Taxa for which the SSU and LSU rDNA sequences belonged to diVerent but related species are labelled as ‘composite’ (see Supplementary Material Table 1 for details). Several long branches were shortened to one sixth of their actual length (labelled 1/6). Numbers at nodes are posterior probabilities and ML bootstrap proportions. All the trees were rooted using nine slowly evolving sequences (Cyanophora paradoxa, Gnetum gnemon, Saccharomyces cerevisiae, Suberites Wcus, Prymnesium patelliferum, Perkinsus andrewsi, Prorocentrum micans, Skeletonema pseudocostatum and Pelagomonas calceolata) as outgroup. 3.4. The positions of Ancyromonadida and Apusomonadida (Apusozoa) remain unresolved All our SSU+LSU rDNA analyses place the zooXagellate Ancyromonas with the opisthokonts and Conosa, often branching weakly between these two groups (Fig. 2). This agrees with the Wrst SSU rDNA phylogenies reconstructed for this organism (Atkins et al., 2000; Cavalier-Smith, 2000). Support for such separation of Ancyromonas from other bikonts was strong in the Bayesian analyses (PP D 1) but weaker in the ML trees (ML BP D 71). The latter appears to be mainly due to the attraction of Ancyromonas towards Apusomonas proboscidea observed in several ML trees, also discussed by Atkins et al. (2000). Cavalier-Smith (1998a,b) placed Ancyromonas (Ancyromonadida: Cavalier-Smith, 1998a) and Apusomonadida (Apusomonas plus Amastigomonas) in Apusozoa, later grouping both orders with Hemimastigida (e.g., Hemimastix, Spironema, for which sequence data are not yet available) as the class Thecomonadea within a more narrowly deWned phylum Apusozoa, which now also includes the class Diphyllatea (Cavalier-Smith, 2000, 2003a,b). Although on most early SSU rDNA trees Ancyromonas and Apusomonadida were separate (but often close) deep branches among the bikonts (Cavalier-Smith, 2000, 2002, 2004), on some well sampled gamma-corrected ML and distance trees they do weakly group together (Cavalier-Smith, 2003a,b; Nikolaev et al., 2006). It has therefore been unclear whether Thecomonadea and Apusozoa are monophyletic or not (Atkins et al., 2000; Cavalier-Smith and Chao, 2003a). To test that further we analysed an additional sequence data set excluding all long-branch species, which allowed maximising the number of conserved positions useful for phylogenetic reconstruction (905 additional sites for a total of 3479). Bayesian analyses of this data set also support strongly (PP D 1) the proximity of Ancyromonas to the opisthokonts and no relationship with A. proboscidea (Supplementary material Fig. 2). In our Bayesian trees, A. proboscidea branches with moderate support (PP 7 0.87) within a group containing Rhizaria, Alveolata and Heterokonta (Fig. 2 and Supplementary Fig. 2). In contrast, the corresponding ML trees (not shown) join Ancyromonas and A. proboscidea in a weak monophyletic group (ML BP < 53) sister to the opisthokonts. This leaves open the possibility that these two Xagellates are indeed related (Cavalier-Smith, 1998a,b, 2000, 2003a,b), but additional sequence data are necessary to solve this question. In fact, an AU test carried out on the widest SSU+LSU rDNA data set (91 taxa, Fig. 2) does not reject the possibility that these two species form a monophyletic group at the base of the bikonts (P D 0.086). We have also tested topologies where Ancyromonas alone, or with A. proboscidea, branches as sister to Conosa, Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS 10 D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx Excavata, Viridaeplantae, Cryptomonada, Haptophyta, Heliozoa, Glaucophyta, Rhizaria, Rhodophyta, Heterokonta or Alveolata. All of them are rejected (P < 0.05), supporting the deep divergence of Ancyromonas (alone or with Apusomonas) from all these bikonts in our trees. This tendency of Ancyromonas to group with opisthokonts and conosans is intriguing (see also Nikolaev et al. (2006) where Thecomonadea as a whole weakly occupy this position on the SSU rRNA tree, but other putative Apusozoa—Diphylleia and Breviata—group weakly with the excavates). Several lines of evidence suggest that Amoebozoa are the closest relatives of opisthokonts (Richards and Cavalier-Smith, 2005; Stechmann and Cavalier-Smith, 2003). Although we do not retrieve this result in our SSU+LSU rDNA trees, because Ancyromonas appears even closer to opisthokonts, Amoebozoa are the second closest group to opisthokonts, and statistical support for the relative positions of Ancyromonas and Amoebozoa with respect to opisthokonts is weak (PP D 0.73, ML BP < 54). Thus our analyses are compatible with the very probable sisterhood of Amoebozoa and opisthokonts. If we accept that the root of the eukaryotic tree is between opisthokonts + Amoebozoa (i.e., unikonts) and the rest of the phyla (bikonts), one could argue that the aYnity of Ancyromonas (or Ancyromonas plus apusomonads: Nikolaev et al., 2006) towards the opisthokonts and conosans might arise because this lineage is really sister to all other bikonts. Because of its very deep branching near the unikont-bikont divergence, Ancyromonas may be a very important model for the study of early evolution of eukaryotes as a whole or bikonts in particular. We stress that it has not yet been established whether Ancyromonas possesses the DHFR-TS fusion gene characteristic of bikonts (Richards and Cavalier-Smith, 2005; Stechmann and Cavalier-Smith, 2003) or the myosin II gene characteristic of unikonts (Richards and Cavalier-Smith, 2005). We are actively pursuing this question to clarify its position by these molecular cladistic markers. Also important for clarifying the basal branching of bikonts and the evolutionary aYnities and proper circumscription of Apusozoa will be sequences for other Ancyromonas species and for Micronuclearia, a non-Xagellate protozoan with cortical and mitochondrial ultrastructural resemblances to it (Mikrjukov and Mylnikov, 2001), and LSU rRNA sequences for Breviata, which SSU rRNA trees weakly group with apusomonads or Ancyromonas, or with Diphylleia and/or loukozoan excavates (Cavalier-Smith, 2003a, 2004; Cavalier-Smith and Chao, 2003c; Nikolaev et al., 2006; Walker et al., 2006). (e.g., Baldauf et al., 2000; Hampl et al., 2005; Moreira et al., 2000; Steenkamp et al., 2005), especially thanks to the construction of the Wrst expressed sequence tag (EST) data bases for several protist species (Bapteste et al., 2002; Rodríguez-Ezpeleta et al., 2005) and the Wrst phylogenetic inferences made possible by the complete sequencing of a few protist genomes (e.g., Eichinger et al., 2005). However, despite great eVorts, especially for massive EST sequencing, this approach is yielding results at a still relatively slow pace so that, today, we have a huge number of sequences but for only a very small number of protist species (Rodríguez-Ezpeleta et al., 2005). In addition to this multi-gene way, a complementary approach to the challenge of reconstructing the evolutionary history of eukaryotes could be the use of fewer markers but a much richer taxonomic sampling (Graybeal, 1998). Obviously, SSU rDNA oVers the best sampling, but we have shown that it can be eYciently combined with LSU rDNA that, like its small subunit counterpart, has the advantage of being easy to amplify by PCR and to sequence, even from uncultured organisms (López-García et al., 2002). Hence, with relatively modest eVort it should be possible to gather a SSU+LSU rDNA database with a rich taxonomic sampling very rapidly. It will certainly not resolve the entire eukaryotic phylogeny (see, for example, the ambiguous results for centrohelid heliozoa) but it can provide a more reliable and more comprehensive backbone that could be useful, among other things, to identify key species for EST projects. 4. Conclusions References For several decades, inference of global eukaryotic phylogeny has been mainly based on single-marker analyses that, in most cases, have proved to be aVected by a number of tree reconstruction artefacts (Philippe and Adoutte, 1998). This promoted the preferential use of a multi-marker approach using increasingly larger multi-gene data sets Archibald, J.M., Longet, D., Pawlowski, J., Keeling, P.J., 2003. A novel polyubiquitin structure in cercozoa and foraminifera: evidence for a new eukaryotic supergroup. Mol. Biol. Evol. 20, 62–66. Atkins, M.S., McArthur, A.G., Teske, A.P., 2000. Ancyromonadida: a new phylogenetic lineage among the protozoa closely related to the common ancestor of metazoans, fungi, and choanoXagellates (Opisthokonta). J. Mol. Evol. 51, 278–285. Acknowledgments DM and PLG thank the French Centre National de la Recherche ScientiWque (CNRS), section ‘Dynamique de la Biodiversité’ for an ATIP grant. TCS thanks the UK Natural Environment Research Council (NERC) for research grants and a Professorial Fellowship. S.H. and D.B. thank NERC for research studentships. T.C.S. and D.M. thank the Canadian Institute for Advanced Research Evolutionary Biology Program for Fellowship and Associateship support. We thank Franz Lang, Gertraud Burger and Michael Gray for providing Reclinomonas sequences. Appendix A. Supplementary data Supplementary data associated with this article can be found, in the online version, at doi:10.1016/j.ympev. 2006.11.001. Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx Baldauf, S.L., Roger, A.J., Wenk-Siefert, I., Doolittle, W.F., 2000. A kingdom-level phylogeny of eukaryotes based on combined protein data. Science 290, 972–977. Bapteste, E., Brinkmann, H., Lee, J.A., Moore, D.V., Sensen, C.W., Gordon, P., DuruXe, L., Gaasterland, T., Lopez, P., Muller, M., Philippe, H., 2002. The analysis of 100 genes supports the grouping of three highly divergent amoebae: Dictyostelium, Entamoeba, and Mastigamoeba. Proc. Natl. Acad. Sci. USA 99, 1414–1419. Baroin, A., Perasso, R., Qu, L.H., Brugerolle, G., Bachellerie, J.P., Adoutte, A., 1988. Partial phylogeny of the unicellular eukaryotes based on rapid sequencing of a portion of 28S ribosomal RNA. Proc. Natl. Acad. Sci. USA 85, 3474–3478. Bass, D., Cavalier-Smith, T., 2004. Phylum-speciWc environmental DNA analysis reveals remarkably high global biodiversity of Cercozoa (Protozoa). Int. J. Syst. Evol. Microbiol. 54, 2393–2404. Bass, D., Moreira, D., Lopez-Garcia, P., Polet, S., Chao, E.E., von der Heyden, S., Pawlowski, J., Cavalier-Smith, T., 2005. Polyubiquitin insertions and the phylogeny of Cercozoa and Rhizaria. Protist 156, 149–161. Berney, C., Pawlowski, J., 2003. Revised small subunit rRNA analysis provides further evidence that Foraminifera are related to Cercozoa. J. Mol. Evol. 57 (Suppl. 1), S120–S127. Cafaro, M.J., 2005. Eccrinales (Trichomycetes) are not fungi, but a clade of protists at the early divergence of animals and fungi. Mol. Phylogenet. Evol. 35, 21–34. Castresana, J., 2000. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540– 552. Cavalier-Smith, T., 1998a. Neomonada and the origin of animals and fungi. In: Coombs, G.H., Vickerman, K., Sleigh, M.A., Warren, A. (Eds.), Evolutionary relationships among Protozoa. Chapman and Hall, London, pp. 375–407. Cavalier-Smith, T., 1998b. A revised six-kingdom system of life. Biol. Rev. Camb. Philos. Soc. 73, 203–266. Cavalier-Smith, T., 1999. Principles of protein and lipid targeting in secondary symbiogenesis: euglenoid, dinoXagellate, and sporozoan plastid origin and the eukaryote family tree. J. Eukaryot. Microbiol. 46, 347–366. Cavalier-Smith, T., 2000. Fleagellate megaevolution. In: Leadbeater, B.S.C., Green, J.C. (Eds.), The Xagellates: Unity, diversity and evolution. Taylor and Francis Limited, London, pp. 361–390. Cavalier-Smith, T., 2002. The phagotrophic origin of eukaryotes and phylogenetic classiWcation of Protozoa. Int. J. Syst. Evol. Microbiol. 52, 297–354. Cavalier-Smith, T., 2003a. The excavate protozoan phyla Metamonada Grassé emend. (Anaeromonadea, Parabasalia, Carpediemonas, Eopharyngia) and Loukozoa emend. (Jakobea, Malawimonas): their evolutionary aYnities and new higher taxa. Int. J. Syst. Evol. Microbiol. 53, 1741–1758. Cavalier-Smith, T., 2003b. Protist phylogeny and the high-level classiWcation of Protozoa. Eur. J. Protistol. 39, 338–348. Cavalier-Smith, T., 2004. Only six kingdoms of life. Proc. Roy. Soc. Lond. B 271, 1251–1262. Cavalier-Smith, T., Chao, E.E., 1995. The opalozoan Apusomonas is related to the common ancestor of animals, fungi and choanoXagellates. Proc. Roy. Soc. Lond. B. 261, 1–6. Cavalier-Smith, T., Chao, E.E., 2003a. Molecular phylogeny of centrohelid heliozoa, a novel lineage of bikont eukaryotes that arose by ciliary loss. J. Mol. Evol. 56, 387–396. Cavalier-Smith, T., Chao, E.E., 2003b. Phylogeny and classiWcation of phylum Cercozoa (Protozoa). Protist 154, 341–358. Cavalier-Smith, T., Chao, E.E., 2003c. Phylogeny of Choanozoa, Apusozoa, and other protozoa and early eukaryote megaevolution. J. Mol. Evol. 56, 540–563. de Koning, A.P., Tartar, A., Boucias, D.G., Keeling, P.J., 2005. Expressed sequence tag (EST) survey of the highly adapted green algal parasite, Helicosporidium. Protist 156, 181–190. Douady, C.J., Delsuc, F., Boucher, Y., Doolittle, W.F., Douzery, E.J., 2003. Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability. Mol. Biol. Evol. 20, 248–254. 11 Eichinger, L., Pachebat, J.A., Glockner, G., Rajandream, M.A., Sucgang, R., Berriman, M., Song, J., Olsen, R., Szafranski, K., Xu, Q., Tunggal, B., Kummerfeld, S., Madera, M., Konfortov, B.A., Rivero, F., Bankier, A.T., Lehmann, R., Hamlin, N., Davies, R., Gaudet, P., Fey, P., Pilcher, K., Chen, G., Saunders, D., Sodergren, E., Davis, P., Kerhornou, A., Nie, X., Hall, N., Anjard, C., Hemphill, L., Bason, N., Farbrother, P., Desany, B., Just, E., Morio, T., Rost, R., Churcher, C., Cooper, J., Haydock, S., van Driessche, N., Cronin, A., Goodhead, I., Muzny, D., Mourier, T., Pain, A., Lu, M., Harper, D., Lindsay, R., Hauser, H., James, K., Quiles, M., Madan Babu, M., Saito, T., Buchrieser, C., Wardroper, A., Felder, M., Thangavelu, M., Johnson, D., Knights, A., Loulseged, H., Mungall, K., Oliver, K., Price, C., Quail, M.A., Urushihara, H., Hernandez, J., Rabbinowitsch, E., SteVen, D., Sanders, M., Ma, J., Kohara, Y., Sharp, S., Simmonds, M., Spiegler, S., Tivey, A., Sugano, S., White, B., Walker, D., Woodward, J., Winckler, T., Tanaka, Y., Shaulsky, G., Schleicher, M., Weinstock, G., Rosenthal, A., Cox, E.C., Chisholm, R.L., Gibbs, R., Loomis, W.F., Platzer, M., Kay, R.R., Williams, J., Dear, P.H., Noegel, A.A., Barrell, B., Kuspa, A., 2005. The genome of the social amoeba Dictyostelium discoideum. Nature 435, 43–57. Erixon, P., Svennblad, B., Britton, T., Oxelman, B., 2003. Reliability of Bayesian posterior probabilities and bootstrap frequencies in phylogenetics. Syst. Biol. 52, 665–673. Graybeal, A., 1998. Is it better to add taxa or characters to a diYcult phylogenetic problem? Syst. Biol. 47, 9–17. Guindon, S., Gascuel, O., 2003. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52, 696– 704. Haeckel, E., 1887. Report on radiolaria collected by H.M.S. Challenger during the years 1873–1876. Rep Sci Res Voyage H.M.S. Challenger 1873–76 18, 1–1803. Hampl, V., Horner, D.S., Dyal, P., Kulda, J., Flegr, J., Foster, P.G., Embley, T.M., 2005. Inference of the phylogenetic position of oxymonads based on nine genes: support for Metamonada and Excavata. Mol. Biol. Evol. 22, 2508–2518. Hertel, L.A., Bayne, C.J., Loker, E.S., 2002. The symbiont Capsaspora owczarzaki, nov. gen. nov. sp., isolated from three strains of the pulmonate snail Biomphalaria glabrata is related to members of the Mesomycetozoea. Int. J. Parasitol. 32, 1183–1191. Hillis, D.M., 1996. Inferring complex phylogenies. Nature 383, 130–131. Karpov, S.A., Bass, D., Mylnikov, A.P., Cavalier-Smith, T., 2006. Molecular phylogeny of Cercomonadidae and kinetid patterns of Cercomonas and Eocercomonas gen. nov (Cercomonadida, Cercozoa). Protist 157, 125–158. Lang, B.F., O’Kelly, C., Nerad, T., Gray, M.W., Burger, G., 2002. The closest unicellular relatives of animals. Curr. Biol. 12, 1773–1778. Longet, D., Archibald, J.M., Keeling, P.J., Pawlowski, J., 2003. Foraminifera and Cercozoa share a common origin according to RNA polymerase II phylogenies. Int. J. Syst. Evol. Microbiol. 53, 1735–1739. López-García, P., Rodríguez-Valera, F., Moreira, D., 2002. Towards the monophyly of Haeckel’s Radiolaria: 18S rRNA environmental data support the sisterhood of Polycystinea and Acantharea. Mol. Biol. Evol. 19, 118–121. Medina, M., Collins, A.G., Taylor, J.W., Valentine, J.W., Lipps, J.H., Amaral-Zettler, L., Sogin, M.L., 2003. Phylogeny of Opisthokonta and the evolution of multicellularity and complexity in Fungi and Metazoa. Int. J. Astrobiol. 2, 203–211. Mikrjukov, K.A., Mylnikov, A.P., 2001. A study of the Wne structure and the mitosis of a lamellicristate amoeba, Micronuclearia podoventralis gen. et sp. nov. (Nucleariidae, Rotosphaerida). Eur. J. Protistol. 37, 15–24. Moreira, D., Le Guyader, H., Philippe, H., 2000. The origin of red algae and the evolution of chloroplasts. Nature 405, 69–72. Nikolaev, S.I., Berney, C., Fahrni, J.F., Bolivar, I., Polet, S., Mylnikov, A.P., Aleshin, V.V., Petrov, N.B., Pawlowski, J., 2004. The twilight of Heliozoa and rise of Rhizaria, an emerging supergroup of amoeboid eukaryotes. Proc. Natl. Acad. Sci. USA 101, 8066–8071. Nikolaev, S.I., Berney, C., Petrov, N.B., Mylnikov, A.P., Fahrni, J.F., Pawlowski, J., 2006. Phylogenetic position of Multicilia marina and the evolution of Amoebozoa. Int. J. Syst. Evol. Microbiol. 56, 1449–1458. Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001 ARTICLE IN PRESS 12 D. Moreira et al. / Molecular Phylogenetics and Evolution xxx (2006) xxx–xxx Patron, N.J., Rogers, M.B., Keeling, P.J., 2004. Gene replacement of fructose-1,6-bisphosphate aldolase supports the hypothesis of a single photosynthetic ancestor of chromalveolates. Eukaryot. Cell. 3, 1169–1175. Pawlowski, J., Bolivar, I., Guiard-MaYa, J., Gouy, M., 1994. Phylogenetic position of foraminifera inferred from LSU rRNA gene sequences. Mol. Biol. Evol. 11, 929–938. Perasso, R., Baroin, A., Qu, L.H., Bachellerie, J.P., Adoutte, A., 1989. Origin of the algae. Nature 339, 142–144. Philippe, H., 1993. MUST, a computer package of Management Utilities for Sequences and Trees. Nucleic Acids Res. 21, 5264–5272. Philippe, H., Adoutte, A., 1998. The molecular phylogeny of Eukaryota: solid facts and uncertainties. In: Coombs, G., Vickerman, K., Sleigh, M., Warren, A. (Eds.), Evolutionary relationships among Protozoa. Chapman and Hall, London, pp. 25–26. Philippe, H., Lopez, P., Brinkmann, H., Budin, K., Germot, A., Laurent, J., Moreira, D., Muller, M., Le Guyader, H., 2000. Early-branching or fast-evolving eukaryotes? An answer based on slowly evolving positions. Proc. Roy. Soc. Lond. B Biol. Sci. 267, 1213–1221. Philippe, H., Lartillot, N., Brinkmann, H., 2005. Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia. Mol. Biol. Evol. 22, 1246–1253. Polet, S., Berney, C., Fahrni, J., Pawlowski, J., 2004. Small-subunit ribosomal RNA gene sequences of Phaeodarea challenge the monophyly of Haeckel’s Radiolaria. Protist 155, 53–63. Posada, D., Crandall, K.A., 1998. MODELTEST: testing the model of DNA substitution. Bioinformatics 14, 817–818. Pupko, T., Huchon, D., Cao, Y., Okada, N., Hasegawa, M., 2002. Combining multiple data sets in a likelihood analysis: which models are the best? Mol. Biol. Evol. 19, 2294–2307. Rambaut, A., Drummond, A., 2003. TRACER: A program for analysing results from Bayesian MCMC programs. Version 1.2, Oxford. Reece, K.S., Siddall, M.E., Stokes, N.A., Burreson, E.M., 2004. Molecular phylogeny of the Haplosporidia based on two independent gene sequences. J. Parasitol. 90, 1111–1122. Richards, T.A., Cavalier-Smith, T., 2005. Myosin domain evolution and the primary divergence of eukaryotes. Nature 436, 1113–1118. Rodríguez-Ezpeleta, N., Brinkmann, H., Burey, S.C., Roure, B., Burger, G., LoVelhardt, W., Bohnert, H.J., Philippe, H., Lang, B.F., 2005. Monophyly of primary photosynthetic eukaryotes: green plants, red algae, and glaucophytes. Curr. Biol. 15, 1325–1330. Roger, A.J., 1999. Reconstructing early events in eukaryotic evolution. Am. Nat. 154, S146–S163. Ronquist, F., Huelsenbeck, J.P., 2003. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574. Ruiz-Trillo, I., Inagaki, Y., Davis, L.A., Sperstad, S., Landfald, B., Roger, A.J., 2004. Capsaspora owczarzaki is an independent opisthokont lineage. Curr. Biol. 14, R946–R947. Sakaguchi, M., Nakayama, T., Hashimoto, T., Inouye, I., 2005. Phylogeny of the centrohelida inferred from SSU rRNA, tubulins, and actin genes. J. Mol. Evol. 61, 765–775. Schmidt, H., Strimmer, K., Vingron, M., von Haeseler, A., 2002. TREEPUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18, 502–504. Shimodaira, H., 2002. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. 51, 492–508. Shimodaira, H., Hasegawa, M., 2001. CONSEL: for assessing the conWdence of phylogenetic tree selection. Bioinformatics 17, 1246–1247. Simpson, A.G., Inagaki, Y., Roger, A.J., 2006. Comprehensive multigene phylogenies of excavate protists reveal the evolutionary positions of “primitive” eukaryotes. Mol. Biol. Evol. 23, 615–625. Sogin, M.L., 1991. Early evolution and the origin of eukaryotes. Curr. Opin. Genet. Dev. 1, 457–463. Stechmann, A., Cavalier-Smith, T., 2003. The root of the eukaryote tree pinpointed. Curr. Biol. 13, R665–R666. Steenkamp, E.T., Wright, J., Baldauf, S.L., 2005. The protistan origins of animals and fungi. Mol. Biol. Evol. 23, 93–106. Stiller, J., Hall, B., 1999. Long-branch attraction and the rDNA model of early eukaryotic evolution. Mol. Biol. Evol. 16, 1270–1279. Suzuki, Y., Glazko, G.V., Nei, M., 2002. Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics. Proc. Natl. Acad. Sci. USA 99, 16138–16143. von der Heyden, S., 2004. Testing ubiquitous dispersal and freshwater/ marine divergence in free-living protist groups. DPhil Thesis, University of Oxford. von der Heyden, S., Chao, E., Cavalier-Smith, T., 2004. Genetic diversity of goniomonads: an ancient divergence between freshwater and marine species. Eur. J. Phycol. 39, 343–350. Walker, G., Dacks, J.B., Embley, T.M., 2006. Ultrastructural description of Breviata anathema, n. gen., n. sp., the organism previously studied as “Mastigamoeba invertens”. J. Eukaryot. Microbiol. 53, 65–78. Yuasa, T., Takahashi, O., Dolven, J.K., Mayama, S., Matsuoka, A., Honda, D., Bjørklund, K.R., 2006. Phylogenetic position of the small solitary phaeodarians (Radiolaria) based on 18S rRNA sequences by single cell PCR analysis. Marine Micropal. 59, 104–114. Please cite this article in press as: Moreira, D. et al., Global eukaryote phylogeny: Combined small- and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata, Mol. Phylogenet. Evol. (2006), doi:10.1016/j.ympev.2006.11.001
© Copyright 2026 Paperzz