Cell, Vol. 107, 419–425, November 16, 2001, Copyright 2001 by Cell Press Reconstructing/Deconstructing the Earliest Eukaryotes: How Comparative Genomics Can Help Joel B. Dacks and W. Ford Doolittle1 Program in Evolutionary Biology Canadian Institute for Advanced Research Department of Biochemistry and Molecular Biology Dalhousie University Halifax, Nova Scotia Canada We could reconstruct the evolution of eukaryote-specific molecular and cellular machinery if some living eukaryotes retained primitive cellular structures and we knew which eukaryotes these were. It’s not clear that either is the case, but the expanding protist genomic database could help us in several ways. Introduction Almost thirty years ago, the Canadian microbiologist Roger Stanier (Stanier, 1974) asserted that “the differences in structure and function between prokaryotic and eukaryotic cells are many and profound, and no contemporary biological group has a cellular organization that can plausibly be interpreted as intermediate.” Microbiologists’ understanding now is more nuanced in several ways. We divide Life into three domains (Bacteria, Archaea, and Eukarya) rather than two (prokaryotes and eukaryotes). We recognize an astonishing diversity of form and function within and between the two prokaryotic domains, and the defining positive prokaryotic characters recognized by Stanier are no longer universal among them. And increasingly we can confirm the presence in Bacteria or Archaea or both of “primitive” forms of key macromolecules and processes previously thought to be eukaryote specific. For instance, bacterial MreB, first claimed as a homolog to eukaryotic actin only on the basis of predicted structural similarity in ATPase domains (Bork et al., 1992) has recently been shown to form “cytoskeletal” actin-like filaments in vivo (Jones et al., 2001) and in vitro (van den Ent et al., 2001). Nevertheless, differences at the level of cell structure and function remain “many and profound.” It is still reasonable to speak of the prokaryote-eukaryote transition as one of cellular evolution’s greatest leaps. It is still sensible to ask how and when the various complex macromolecular machines (cytoskeletons, endomembrane systems, spliceosome, membrane-bounded organelles, and many others) that all the best-known eukaryotes have and all known prokaryotes lack—or possess only in inchoate form—came into being. One can approach this question from the bottom up, looking to diverse prokaryotes for homologs of components and subassemblies of complex eukaryotic cellular machines (as in the case of actin and MreB), or from the top down, searching among eukaryotes for simpler versions of such machines (as we will describe below). Either way, there are principles of logic and parsimony that can help in inferring when in cellular evolution com1 Correspondence: [email protected] Review plex cellular features and their component parts first appeared. In the simplest cases, features found in homologous form in all descendants of a common ancestor must, barring lateral transfer, have been present in that ancestor (must be primitive or ancestral features). More complex cases involve traits found in some descendants (for instance, as designated “⫹” in species B of Figure 1A) but not in others (species A). These could either have been gained (in the lineage leading to B) or lost (in the lineage leading to A) after their divergence from their common ancestor (X). One can tell which by looking at “outgroups” (O). If (and only if) some of these show the ⫹ trait, parsimony demands that X was ⫹, and that A has lost this feature. To apply such reasoning, it is essential to know the phylogeny, and to make interesting and sensible conclusions about early eukaryotic cellular evolution, it is essential to have a deeply branched eukaryotic phylogeny. Too many eukaryotic features have been assumed to be universal (and thus primitive) for all eukaryotes simply because they are “present from yeast to man.” There are likely very many early eukaryotic branches (many outgroups) below the common ancestor of yeast and man, whatever the true structure of the eukaryotic tree. Before molecular methods achieved their current dominance, however, attempts to reconstruct phylogeny were often conflated with attempts to infer when complex traits and their components first appeared. For instance, it seemed reasonable to suggest that the simplest contemporary eukaryotes branched off the main trunk of the eukaryotic tree very early—because the first eukaryotic cells obviously must have been simpler than modern ones and simplicity is unlikely to be a secondarily derived feature. Among such simple contemporary eukaryotes were the diplomonads (including the intestinal parasite Giardia), parabasalids (including the sexually transmitted disease-causing agent Trichomonas), oxymonads, microsporidia (such as the human pathogen Encephalitozoon), pelobionts (Mastigamoeba), entamoebae, and the heteroloboseid amoebae such as Naegleria. These protists are variously deficient in such basic eukaryotic cellular components as mitochondria, peroxisomes, and Golgi dictyosomes. For much of the last decade, molecular phylogeny seemed to support an early branching for many of these simple protists. (Note that we should call such eukaryotes “early branching” or “deeply diverging” rather than “ancestral” or “primitive,” since no contemporary cell can be any other’s ancient ancestor, and none is likely to retain primitive states of all important characters.) Small subunit (SSU) ribosomal RNA gene sequences, which have served as the gold standard molecular measure for reconstructing phylogenetic relationships, showed the amitochondriate diplomonads, microsporidia, and parabasalids at the base of the eukaryotic tree (Figure 1B). Above them, mitochondriate protist lineages such as Heterolobosea and Euglenozoa emerged, followed by the unresolved cluster of animals, fungi, plants, and other lineages (including some protists) commonly called the “eukaryotic crown” (Sogin, 1991). Cell 420 Figure 1. Parsimony Diagram and Representative Tree of Eukaryotes (A) demonstrates the principles of parsimony in evolutionary deduction (see text). (B) is a redrawn version of the tree of eukaryotes based on SSU rDNA from Sogin (1991). This tree, and the early and sequential divergence of several protist groups it implies, have provided an invaluable stimulus and guide to much evolutionary research in the last two decades. However, the agnostic representation in Figure 2 may be more consistent with the bulk of available reliable data. Now however, things are much less certain. In some cases, the simplicity of supposed deeply diverging protist lineages is obviously due to secondary loss of complex cell structures, likely a consequence of adopting a parasitic lifestyle. Although surely the first eukaryotes were simpler in structure than modern cells, there is no compelling evidence that the last common ancestor of all surviving eukaryotes was. Nor is there any agreed upon and robust deep eukaryotic phylogeny. While the lack of resolution may be largely a reflection of our methods of phylogenetic analysis, some feel that eukaryotes diversified so rapidly (in a “Big Bang” radiation) that the order of the early branchings will never be known (Philippe et al., 2000a). Here, we summarize the problems, and consider whether comparative genomics might hold the answers. Our take on these issues is not unlike (and was influenced by) that presented earlier by Embley and Hirt (1998) and by Roger (1999), and of necessity leaves out much: for instance, the complex supportive arguments for early mitochondria based on hydrogenosomes (Tachezy et al., 2001). Primitive Simplicity versus Secondary Simplification The deepest of the deeply diverging eukaryotes were until recently thought to be those mitochondrion-lacking protists that Cavalier-Smith (1983) called “Archezoa”— comprising, among others, the diplomonads (Giardia), parabasalids (Trichomonas), and microsporidia. The prevailing belief about mitochondria since Margulis (1970) had been that these eukaryotic organelles are the degenerate descendants of endosymbiotic bacteria engulfed by some early amitochondriate eukaryote (the host). It was logical to propose that the immediate ancestor of this eukaryote left other surviving descendants that never harbored such endosymbionts or never converted them to mitochondria. Those descendants would have been the direct ancestors of modern archezoa. This appealing “Archezoa Hypothesis” now seems likely to be false, based on two types of evidence: First, several gene sequences (Hirt et al., 1999; Keeling and Doolittle, 1996; Van de Peer et al., 2000) disagree with SSU rDNA on the deep branching of microsporidia (placing them instead among or as sister to fungi), while the deep divergences of diplomonads and parabasalids obtained with several molecular datasets are now seen as “long branch artifacts” (see below). Second, all three lineages harbor, in their nuclear genomes, at least one gene that obviously derived from mitochondrial genomes. Initial evidence for this came in 1996, when four groups simultaneously described a mitochondrial-type cpn60 gene in nuclear DNA of Trichomonas vaginalis (Bui et al., 1996; Germot et al., 1996; Horner et al., 1996; Roger et al., 1996). The same gene was since found in the diplomonads Giardia lamblia (Roger et al., 1998) and Spironucleus barkhanus (Horner and Embley, 2001), and thorough phylogenetic analyses place all such “archezoal” cpn60s with a mitochondrial cluster (Horner and Embley, 2001). Of course, their functions cannot be “mitochondrial”: current thinking is that, at least in parabasalids, cpn60 serves the hydrogenosome—a hydrogen-generating organelle that likely shares a common origin with mitochondria. Several other amitochondriate lineages have been shown to have nuclear genes of mitochondrial origin (Clark and Roger, 1995; Germot et al., 1997) or to be embedded in groups that have likely mitochondrial homologs (Dacks et al., 2001), and thus (by the logic of Figure 1A) likely also to be secondarily deficient in these organelles. At the moment, it is unclear that there are any living direct descendants of the premitochondriate “host,” if such ever existed. Thus, parsimony says that the last common ancestor of all currently known eukaryotes had mitochondria. Deep protists should also, according to prevailing theory, lack introns. Not only would this be consistent with the general observation that more complex organisms generally have more introns (Logsdon, 1998), it would jibe with the popular theory that spliceosomal introns derive from group II introns were first introduced into eukaryotes by the premitochondrial symbiont (CavalierSmith, 1991). But if all known eukaryotes once had mitochondria, did they also all once have (indeed do they all still have) spliceosomal introns? Microsporidia surely do: introns have now been described in some of their genes (Biderre et al., 1998) and U2 and U6 spliceosomal RNAs are present (Fast et al., 1998). Although no introncontaining genes have yet been described in diplomonads or parabasalids, a gene for the essential and highly conserved splicing component PRP8 has been found in Trichomonas (Fast and Doolittle, 1999) and a PRP8 gene and several other splicing factors are listed at the Giardia Genome Review 421 Table 1. Presence of Endomembrane Genes in Diverse Eukaryotic Genome Projects Higher taxon organism Syntaxin Snap25 Rab ARFGap COPI Sec1 Ykt6p Fungi Land Plants Animal Diplomonad Kinetoplastid Apicomplexa Apicomplexa Slime molds Entamoebae Red Algae Stramenopiles Green algae Saccharomyces Arabidopsis Human Giardia Trypanosoma Plasmodium Theileria Dictyostelium Entamoeba Porphyra Phytophthora Chlamydomonas **** **** **** *** * *** * **** * ** ** ** **** **** **** NI NI NI NI NI NI NI NI NI **** **** **** *** **** *** **** **** **** ** * **** **** *** **** ** * *** * NI * NI ** **** **** **** **** * *** ***-a * NI * ** * ** **** **** **** ** * *** * ** * NI ** ** **** **** **** ** * * * ** * ** ** ** This summarizes searches as of Oct 2001. **** indicates a published reference, *** means that a homolog is listed in GenBank, ** denotes an identified BLAST hit listed on the relevant genomic project web page, * indicates that our search identified a putative homolog by BLAST with a score less than p ⫽ 0.05. NI ⫽ Not Identified: a homolog could not be reliably identified by any of the above criteria. a ⫽ there is a COPI homolog identified in GenBank for the apicomplexan Toxoplasma. This data was obtained by searching, using keywords/BLASTing either GenBank or the following genomics projects, which we thank for their contributions: Giardia ⫽ http://hermes.mbl.edu/baypaul/Giardia-HTML/index2.html, Chlamydomonas and Porphyra ⫽ http://www.kazusa.or.jp/en/plant/database.html, Dictyostelium ⫽ http://www.csm.bio1.tsukuba.ac.jp/cDNAproject.html, Phytophthora ⫽ http://www.ncgr.org/pgc/indes.html and Theileria ⫽ http://www.tigr.org/tdb/e2k1/tpa1/. While preliminary lists of this type can allow general statements about evolutionary events, to make robust and detailed conclusions using genomic data, it is also important to obtain near complete open reading frames (ORF) of putative proteins identified by BLAST. Potentially misleading BLAST results of partial sequences aside, the full ORF can be used for phylogentic analysis, or to identify those rare evolutionary events discussed below. The verified sequence will also allow for comparative analysis of functionally characterized residues. genome project web site. The few gene sequences available for other putatively early eukaryotes such as Reclinomonas (Archibald et al., 2001; Edgcomb et al., 2001) and Mastigamoeba (Stiller et al., 1998) contain introns, sometimes at high density. Although one awaits proof of spliceable introns in genes of Giardia and Trichomonas, the odds are that all known eukaryotes have—or once had—these elements in their genes, and the wherewithal to remove them. A number of putatively deeply diverging lineages including Heterolobosea, diplomonads, Entamoeba, Mastigamoeba, and oxymonads lack recognizable stacked Golgi bodies, and some were once thought to have diverged prior to this key eukaryotic innovation (CavalierSmith, 1983). In the case of Giardia (Lujan et al., 1995) and Entamoeba (Ghosh et al., 1999), however, there is evidence that a Golgi-like structure can be induced, and that some Golgi enzymes are present in the cell. In the case of such a complex and variable system, mapping components and subassemblies to eukaryote phylogeny might provide a picture of progressive complexification. The diversity of Rab proteins (GTPase proteins involved in vesicular transport) has been examined, with homologs identified in animals, fungi (Armstrong, 2000), and plants (Borg and Poulsen, 1994) as well as diverse protists (Bush et al., 1993; Field and Boothroyd, 1995; Janoo et al., 1999; Rodriguez et al., 2000). However, the majority of proteins involved in the endomembrane system have not been examined through comparative genomics. Table 1 summarizes our recent preliminary searches of completed and in-progress genomes for several crucial genes involved in the endomembrane system. Although Snap-25 homologs were not found outside animals, plants, and fungi, most proteins searched did have at least one homolog present in di- verse taxa. The last common ancestor of all extant eukaryotes likely had complicated vesicular transport machinery. These are but three examples of systems once thought to be primitively absent in deeply diverging lineages that seem now more likely to have been secondarily lost in such cells, or cryptically present in all eukaryotes. Others, including the protein folding apparatus (Archibald et al., 2000) and translation elongation release factors (Inagaki and Doolittle, 2000), are also well documented. The last common eukaryote ancestor would probably not be thought primitive, were it available for examination today. Either (1) the “many and profound” differences between eukaryotes and prokaryotes appeared very quickly in evolution, (2) many transitional stages have left no survivors, or (3) those survivors have yet to be identified. Problems with Phylogeny Another barrier to reconstructing the prokaryoteeukaryote transition is uncertainty about the phylogenetic relationships of the lineages that are known. Many protein sequences from eukaryotes, developed to be reliable markers of eukaryotic organismal evolution, do not reproduce the small subunit rDNA tree. Similar disharmony among prokaryotic phylogenies based on different genes may frequently be due to lateral gene transfer (Doolittle, 1999), and eukaryotes are surely not immune to transfer. Nevertheless, such incongruence in eukaryotes has most often been attributed to failures in methods of phylogenetic reconstruction. For computational reasons, early models of phylogenetic analysis involved radical and unrealistic simplifications. These included the assumption that all sites in a protein vary equally, and that all changes between nucleotides or Cell 422 amino acids occur with the same frequency (Swofford et al., 1996). Most importantly, however, was the faulty assumption that the same gene in different organisms evolves at a constant rate. This led to an artifact called long branch attraction (LBA) (Felsenstein, 1978). In phylogenetic reconstruction, sequences that evolve at higher rates are artificially attracted to each other and (for the same reasons) to sequences that are very different because they diverged very long ago. For eukaryotic trees rooted with bacterial or archaeal outgroup sequences (which present long branches because they are indeed anciently diverged), the result will be the artifactual placement of rapidly evolving sequences at the root of the tree. The recognition of this artifact, and attempts to compensate for it with more sophisticated computer algorithms and biologically accurate models of sequence evolution, have thrown into doubt the ancient nature of many of the organisms once thought to be deeply diverging. Microsporidia were the first to go. Phylogenies of tubulins and RNA polymerase as well as reanalyses of other markers have shown that the microsporidia are not ancient eukaryotes, but either degenerate members or very close relatives of the fungi (Hirt et al., 1999; Keeling and Doolittle, 1996; Van de Peer et al., 2000). The ancient nature of the parabasalids and diplomonads has also come into question: these organisms do present long branches in many phylogenetic reconstructions (Hirt et al., 1999; Stiller and Hall, 1999). However, no alternative placement has been suggested for these taxa and so their status as deeply diverging lineages is only in doubt, not disproved. In doubt as well, though, is the very structure of the eukaryotic tree. Philippe et. al. (2000b) have argued that long branch attraction can provide false support for the ladder-like structure of sequentially emerging taxa in the eukaryotic phylogenetic tree, seen in Figure 1B. They show that correction for this artifact produces a multifurcated tree, whose deep branching order is unresolved (Philippe et al., 2000a). This, they propose, is the result of a real biological phenomenon, not methodological failure. Their “Big Bang” hypothesis suggests that many of the extant eukaryotic lineages evolved very rapidly from one another: the branching pattern is unresolved because, for most markers, not enough mutations occurred between branchings to allow reconstruction of the order of events (Philippe et al., 2000a). Such a view is of course consistent with (although not required by) the conclusion drawn above that extant characterized eukaryotes diverged from an already complex eukaryotic ancestor. However, a rapid radiation does not necessarily mean that the phylogeny of eukaryotes is an insurmountable or unresolvable question. How Genomics Might Help with Phylogeny Scientists almost always assert that what is needed is “more data.” Here, we need more data of at least three sorts. First, one can look harder for surviving direct descendants of the primitively simple eukaryotes that must once have thrived in ancient anaerobic habitats, according to all currently fashionable theories of cell evolution (Lopez-Garcia and Moreira, 1999; Margulis, 1970; Figure 2. Eukaryotic Relationships Based on Consensus Data This scheme takes into account (Baldauf and Palmer, 1993; Baldauf et al., 2000; Cavalier-Smith and Chao, 1996; Dacks et al., 2001; Fast et al., 2001; Moreira et al., 2000; Simpson and Patterson, 2001), among others. Taxa with full genome sequencing or GSS projects underway are shown in Blue, while those with EST projects only are shown in Red. Taxa with both EST and full genomic initiatives are shown in purple. Martin and Muller, 1998). Searching by microscopic methods in benthic environments have uncovered interesting new organisms such as Reclinomonas (Flavin and Nerad, 1993), Trimastix (Brugerolle and Patterson, 1997), Malawimonas (O’Kelly and Nerad, 1999), and Carpediemonas (Simpson and Patterson, 1999). For prokaryotes, it is estimated that we only know 1% of the actual biodiversity that exists globally (Pace, 1997). For eukaryotes the fraction might be higher than that since diversity can be more easily seen, but by how much? Recent environmental PCR studies of ocean water have revealed (through their SSU rDNA sequences) novel eukaryotic groups (Lopez-Garcia et al., 2001). When environmental genomic sampling methods (such as characterizing BAC clones from whole communities or ecosystems [Beja et al., 2000]) become widely applied, our appreciation of eukaryotic diversity, not only phylogenetic but genomic, physiological, and structural diversity, will expand enormously. Second, the multiplicity of new individual gene sequences that emerge from genome projects and genome surveys (such as single-pass and EST analyses) will allow different and more compelling sequencebased phylogenetic reconstructions. As we have seen in the past few years, single strong pieces of evidence that are consistent with previous inconclusive data can help us lock down phylogenetic positions. Sequences from different genes will be useful for reconstructing relationships between different organisms. The debate about whether red and green algae share a recent common ancestor seemed played to a stalemate, until sequences of the EF2 gene strongly placed them together (Moreira et al., 2000). Similarly adding an additional organism can help to tack down a taxon that had gone phylogenetically adrift, as in the case of the oxymonad and Trimastix relationship (Dacks et al., 2001). More genome sequences will help us find more insertion/deletions such as the EF1␣ insertion that unites animals and Genome Review 423 fungi (Baldauf and Palmer, 1993). These can give great confidence because they rely on positive evidence for extremely unlikely events, in addition to whatever phylogenetic signal their sequences provide. Increasing gene sequence from genomic projects also allows us to combine many genes into concatenated multigene datasets. This approach has been used successfully a number of times in the past few years to yield robust evidence for relationships that were previously contested (Martin et al., 1998; Baldauf et al., 2000). Some of these new or newly affirmed relationships are illustrated in Figure 2, which summarizes what we feel to be the most wellsupported relationships among eukaryotes, based on many sorts of data. Third, data from EST and partial and complete genome projects for many protists will allow comparative analyses of their (partial or complete) gene complements, or “proteomes,” in popular parlance. There are several ways in which information on gene content could augment sequence-based phylogenetic analyses of individual or concatenated genes, by revealing unique rare gene duplication, fusion, or transfer events. These can convincingly link existing groups to single inferred ancestors, just as insertions and deletions within genes can do. For instance, Philippe and collaborators (2000a, 2000b) recently identified a fusion of dihydrofolate reductase (DHFR) and thymidylate synthase (TS) genes that unites plants, alveolates, and Euglenozoa. In a more complex case, Fast et al. (2001) propose that the plastidtargeted GAPDH (glyceraldehyde-3-phosphate dehydrogenase) genes common to apicomplexa, dinoflagellates, heterokonts, and perhaps cryptomonads derive from a unique duplication and retargetting of cytosolic GAPDH at the base of this group, which also includes the secondarily aplastidic ciliates. Unambiguous cases of such unique genetic events may not only serve to unite groups, but also to root the eukaryotic tree. Rooting requires events that unite all eukaryotic taxa save one; that one, provided it convincingly shares the pre-event character state with prokaryotic outgroups, can then be taken as the deepest branch. Keeling and Palmer (2000) recently found two deletions in enolase shared by all known eukaryotes save parabasalids (such as Trichomonas). Parabasalids are themselves like archaea in enolase structure, and on this basis these authors nominated parabasalids as the deepest eukaryotes. Any single such one-of-a kind event can in principle establish relationships between the unconnected groups of Figure 2, but molecular evolutionists are as often wrong as right in specific cases. Thus, it is comforting that genomic data will likely provide scores of such examples. Protist genomics is in its infancy, about where prokaryotic genomics was six years ago when the first complete sequences began to appear. What evolutionary microbiologists will soon have are phylogenetic trees for tens of thousands of genes and partial or complete gene contents (through EST, genome surveys and complete sequences) for dozens of protists about many of whose evolutionary relationships we can currently only guess. As these data accumulate, comparative analyses of the “parts lists” of eukaryotic molecular machines and cellular organelles much more detailed and complete than Table 1 will be assembled. Our conclusion that that last common eukaryotic ancestor was a complex cell notwithstanding, there will surely be taxonspecific differences in parts lists that support more inclusive groupings of the taxa in Figure 2. Conversely, as such groupings (and the eukaryotic root) become established in multiple overlapping analyses, one can undertake a detailed reconstruction of the evolutionary history of specific eukaryotic cellular structures. If we do find new eukaryotic lineages that are primitively simple, then the “many and profound” differences that distinguish eukaryotes from prokaryotes can be tracked to their origins. If we don’t, we can content ourselves with detailing the many elaborations wrought in such structures over one to two billion years of eukaryotic cellular radiation. From a practical standpoint, until we have a wellresolved phylogeny, it is necessary to sample as many eukaryotes as possible. EST projects provide evidence for protein presence and expression levels. Full genome sequences are going to be harder to come by, but, importantly, provide information about protein absence, crucial when ascribing simplified protein complements to potentially early-diverging lineages. Although genomics projects are currently clustered in the better-known taxa, as a wider diversity is covered, we will be able to glean more and more information about eukaryotic biology. By looking at the detailed phylogenies and paralog complements from diverse eukaryotes, we may be able to reconstruct, on a molecular level, aspects of our earliest eukaryotic ancestors’ cell biology. Acknowledgments We would like to thank A.J. Roger, J.M. Logsdon, A.G.B. Simpson, and R.A. Singer for helpful discussion, and the latter two also for critical reading of the manuscript. This work was supported by Canadian Institutes of Health Research (C.I.H.R.) grant MT4467 to W.F.D. J.B.D. is supported by a C.I.H.R. Doctoral Research Award and W.F.D. by a Canada Research Chair. References Archibald, J.M., Logsdon, J.M., Jr., and Doolittle, W.F. (2000). Origin and evolution of eukaryotic chaperonins: phylogenetic evidence for ancient duplications in CCT genes. Mol Biol Evol 17, 1456–1466. Archibald, J.M., Doolittle, W.F., and O’Kelley, C.J. (2001). The chaperonin genes of jakobid and jakobid-like flagellates: implications for eukaryotic evolution. Mol. Biol. Evol., in press. Armstrong, J. (2000). Membrane traffic between genomes. Genome Biol 104, 1–4. Baldauf, S.L., and Palmer, J.D. (1993). Animals and fungi are each other’s closest relatives: congruent evidence from multiple proteins. Proc. Natl. Acad. Sci. USA 90, 11558–11562. Baldauf, S.L., Roger, A.J., Wenk-Siefert, I., and Doolittle, W.F. (2000). A kingdom-level phylogeny of eukaryotes based on combined protein data. Science 290, 972–977. Beja, O., Aravind, L., Koonin, E.V., Suzuki, M.T., Hadd, A., Nguyen, L.P., Jovanovich, S.B., Gates, C.M., Feldman, R.A., Spudich, J.L., et al. (2000). Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. Science 289, 1902–1906. Biderre, C., Metenier, G., and Vivares, C.P. (1998). A small spliceosomal-type intron occurs in a ribosomal protein gene of the microsporidian Encephalitozoon cuniculi. Mol Biochem Parasitol 94, 283–286. Borg, S., and Poulsen, C. (1994). Molecular analysis of two Ypt/ Rab-related sequences isolated from soybean (Glycine max) DNA libraries. Plant Mol. Biol. 26, 175–187. Cell 424 Bork, P., Sander, C., and Valencia, A. (1992). An ATPase domain common to prokaryotic cell cycle proteins, sugar kinases actin and hsp 70 heat shock proteins. Proc. Natl. Acad. Sci. USA 89, 7290– 7294. Brugerolle, G., and Patterson, D.J. (1997). Ultrastructure of Trimastix convexa Hollande, an amitochondriate anaerobic flagellate with a previously undescribed organization. Eur. J. Prot. 33, 121–130. Bui, E.T., Bradley, P.J., and Johnson, P.J. (1996). A common evolutionary origin for mitochondria and hydrogenosomes. Proc. Natl. Acad. Sci. USA 93, 9651–9656. Bush, J., Franek, K., Daniel, J., Spiegelman, G.B., Weeks, G., and Cardelli, J. (1993). Cloning and characterization of five novel Dictyostelium discoideum rab-related genes. Gene 136, 55–60. Cavalier-Smith, T. (1983). A 6-kingdom classification and a unified phylogeny. In Endocytobiology II, H.E.A. Schenk and W. Schwemmler, eds. (Berlin: Walter de Gruyter), pp. 1027–1034. Cavalier-Smith, T. (1991). Intron phylogeny: a new hypothesis. Trends Genet. 7, 145–148. Cavalier-Smith, T., and Chao, E.E. (1996). Sarcomonad ribosomal RNA sequences, rhizopod phylogeny, and the origin of euglyphid amoebae. Arch. Protistenkd. 147, 227–236. Clark, C.G., and Roger, A.J. (1995). Direct evidence for secondary loss of mitochondria in Entamoeba histolytica. Proc. Natl. Acad. Sci. USA 92, 6518–6521. Dacks, J.B., Silberman, J.D., Simpson, A.G., Moriya, S., Kudo, T., Ohkuma, M., and Redfield, R.J. (2001). Oxymonads are closely related to the excavate taxon Trimastix. Mol. Biol. Evol. 18, 1034–1044. Doolittle, W.F. (1999). Phylogenetic classification and the universal tree. Science 284, 2124–2129. Edgcomb, V.P., Roger, A.J., Simpson, A.G., Kysela, D.T., and Sogin, M.L. (2001). Evolutionary relationships among “jakobid” flagellates as indicated by alpha- and beta-tubulin phylogenies. Mol. Biol. Evol. 18, 514–522. Embley, T.M., and Hirt, R.P. (1998). Early branching eukaryotes? Curr. Opin. Genet. Dev. 8, 655–661. Fast, N.M., and Doolittle, W.F. (1999). Trichomonas vaginalis possesses a gene encoding the essential spliceosomal component, PRP8. Mol. Biochem. Parasitol. 99, 275–278. Fast, N.M., Kissinger, J.C., Roos, D.S., and Keeling, P.J. (2001). Nuclear-encoded, plastid-targeted genes suggest a single common origin for apicomplexan and dinoflagellate plastids. Mol. Biol. Evol. 18, 418–426. Fast, N.M., Roger, A.J., Richardson, C.A., and Doolittle, W.F. (1998). U2 and U6 snRNA genes in the microsporidian Nosema locustae: evidence for a functional spliceosome. Nucleic Acids Res. 26, 3202– 3207. Felsenstein, J. (1978). Cases in which parsimony or compatibility methods will be positively misleading. Systematic Zoology 25, 401–410. Field, M.C., and Boothroyd, J.C. (1995). Trypanosoma brucei: molecular cloning of homologues of small GTP-binding proteins involved in vesicle trafficking. Exp. Parasitol. 81, 313–320. Flavin, M., and Nerad, T.A. (1993). Reclinomonas americana N. G., N. Sp., a new freshwater heterotrophic flagellate. J. Eukaryot. Microbiol. 40, 172–179. Germot, A., Philippe, H., and Le Guyader, H. (1996). Presence of a mitochondrial-type 70-kDa heat shock protein in Trichomonas vaginalis suggests a very early mitochondrial endosymbiosis in eukaryotes. Proc. Natl. Acad. Sci. USA 93, 14614–14617. Germot, A., Philippe, H., and Le Guyader, H. (1997). Evidence for the loss of mitochondria in Microsporidia from a mitochondrial-type HSP70 in Nosema locustae. Mol. Biochem. Parasitol. 87, 159–168. dence from the largest subunit of RNA polymerase II and other proteins. Proc. Natl. Acad. Sci. USA 96, 580–585. Horner, D.S., and Embley, T.M. (2001). Chaperonin 60 phylogeny provides further evidence for secondary loss of mitochondria among putative early-branching eukaryotes. Mol. Biol. Evol. 18, 1970–1975. Horner, D.S., Hirt, R.P., Kilvington, S., Lloyd, D., and Embley, T.M. (1996). Molecular data suggest an early acquisition of the mitochondrion endosymbiont. Proc. R. Soc. Lond. B Biol. Sci. 263, 1053–1059. Inagaki, Y., and Doolittle, W.F. (2000). Evolution of the eukaryotic translation termination system: origins of release factors. Mol. Biol. Evol. 17, 882–889. Janoo, R., Musoke, A., Wells, C., and Bishop, R. (1999). A Rab1 homologue with a novel isoprenylation signal provides insight into the secretory pathway of Theileria parva. Mol. Biochem. Parasitol. 102, 131–143. Jones, L.J., Carballido-Lopez, R., and Errington, J. (2001). Control of cell shape in bacteria: helical, actin-like filaments in Bacillus subtilis. Cell. 104, 913–922. Keeling, P.J., and Palmer, J.D. (2000). Parabasalian flagellates are ancient eukaryotes. Nature 405, 635–636. Keeling, P.J., and Doolittle, W.F. (1996). Alpha-tubulin from earlydiverging eukaryotic lineages and the evolution of the tubulin family. Mol Biol Evol 13, 1297–1305. Logsdon, J.M., Jr. (1998). The recent origins of spliceosomal introns revisited. Curr. Opin. Genet. Dev. 8, 637–648. Lopez-Garcia, P., and Moreira, D. (1999). Metabolic symbiosis at the origin of eukaryotes. Trends Biochem. Sci. 24, 88–93. Lopez-Garcia, P., Rodriguez-Valera, F., Pedros-Alio, C., and Moreira, D. (2001). Unexpected diversity of small eukaryotes in deep-sea Antarctic plankton. Nature 409, 603–607. Lujan, H.D., Marotta, A., Mowatt, M.R., Sciaky, N., LippincottSchwartz, J., and Nash, T.E. (1995). Developmental induction of Golgi structure and function in the primitive eukaryote Giardia lamblia. J. Biol. Chem. 270, 4612–4618. Margulis, L. (1970). Origin of eukaryotic cells; evidence and research implications for a theory of the origin and evolution of microbial, plant, and animal cells on the Precambrian earth (New Haven: Yale University Press). Martin, W., and Muller, M. (1998). The hydrogen hypothesis for the first eukaryote. Nature 392, 37–41. Martin, W., Stoebe, B., Goremykin, V., Hapsmann, S., Hasegawa, M., and Kowallik, K.V. (1998). Gene transfer to the nucleus and the evolution of chloroplasts. Nature 393, 162–165. Moreira, D., Le Guyader, H., and Phillippe, H. (2000). The origin of red algae and the evolution of chloroplasts. Nature 405, 69–72. O’Kelly, C.J., and Nerad, T.A. (1999). Malawimonas jakobiformis n. gen., n. sp. (Malawimonadidae n. fam.): A Jakoba-like heterotrophic nanoflagellate with discoidal mitochondrial cristae. J. Euk. Micro. 46, 522–531. Pace, N.R. (1997). A molecular view of microbial diversity and the biosphere. Science 276, 734–740. Philippe, H., Germot, A., and Moreira, D. (2000a). The new phylogeny of eukaryotes. Curr. Opin. Genet. Dev. 10, 596–601. Philippe, H., Lopez, P., Brinkmann, H., Budin, K., Germot, A., Laurent, J., Moreira, D., Muller, M., and Le Guyader, H. (2000b). Earlybranching or fast-evolving eukaryotes? An answer based on slowly evolving positions. Proc. R. Soc. Lond. B Biol. Sci. 267, 1213–1221. Roger, A.J. (1999). Reconstructing early events in eukaryotic evolution. Am. Nat. 154, S146–S163. Rodriguez, M.A., Garcia-Perez, R.M., Garcia-Rivera, G., LopezReyes, I., Mendoza, L., Ortiz-Navarrete, V., and Orozco, E. (2000). An Entamoeba histolytica rab-like encoding gene and protein: function and cellular location. Mol. Biochem. Parasitol. 108, 199–206. Ghosh, S.K., Field, J., Frisardi, M., Rosenthal, B., Mai, Z., Rogers, R., and Samuelson, J. (1999). Chitinase secretion by encysting Entamoeba invadens and transfected Entamoeba histolytica trophozoites: localization of secretory vesicles, endoplasmic reticulum, and Golgi apparatus. Infect. Immun. 67, 3073–3081. Roger, A.J., Clark, C.G., and Doolittle, W.F. (1996). A possible mitochondrial gene in the early-branching amitochondriate protist Trichomonas vaginalis. Proc. Natl. Acad. Sci. USA 93, 14618–14622. Hirt, R.P., Logsdon, J.M., Jr., Healy, B., Dorey, M.W., Doolittle, W.F., and Embley, T.M. (1999). Microsporidia are related to Fungi: evi- Roger, A.J., Svard, S.G., Tovar, J., Clark, C.G., Smith, M.W., Gillin, F.D., and Sogin, M.L. (1998). A mitochondrial-like chaperonin 60 Genome Review 425 gene in Giardia lamblia: Evidence that diplomonads once harbored an endosymbiont related to the progenitor of mitochondria. Proc. Natl. Acad. Sci. USA 95, 229–234. Simpson, A.G., and Patterson, D.J. (2001). On core jakobids and excavate taxa: the ultrastructure of Jakoba incarcerata. J. Euk. Micro. 48, 480–492. Simpson, A.G.B., and Patterson, D.J. (1999). The ultrastructure of Carpediemonas membranifera: (Eukaryota), with reference to the “excavate hypothesis”. Eur. J. Prot. 35, 353–370. Sogin, M.L. (1991). Early evolution and the origin of eukaryotes. Curr. Opin. Gen. Dev. 1, 457–463. Stanier, R.Y. (1974). The origins of photosynthesis in Eukaryotes. In Evolution in the Microbial World, M.J. Carlile and J. J. Skehel, eds. (Cambridge, UK: Cambridge University Press), pp. 219–240. Stiller, J.W., and Hall, B.J. (1999). Long-branch attraction and the rDNA model of early eukaryotic evolution. Mol Biol Evol 16, 1270– 1279. Stiller, J.W., Duffield, E.C., and Hall, B.D. (1998). Amitochondriate amoebae and the evolution of DNA-dependent RNA polymerase II. Proc. Natl. Acad. Sci. USA 95, 11769–11774. Swofford, D.L., Olsen, G.J., Waddell, P.J., and Hillis, D.M. (1996). Phylogenetic Inference. In Molecular Systematics, D.M. Hillis, C. Moritz and B. K. Mable, eds. (Sunderland, Mass.: Sinauer Associates Inc.), pp. 407–514. Tachezy, J., Sanchez, L.B., and Muller, M. (2001). Mitochondrial type iron sulfur cluster assembly in the amitochondriate eukaryotes Trichomonas vaginalis and Giardia intestinalis, as indicated by the phylogeny of IscS. Mol Biol Evol 18, 1919–1928. Van de Peer, Y., Ben Ali, A., and Meyer, A. (2000). Microsporidia: accumulating molecular evidence that a group of amitochondriate and suspectedly primitive eukaryotes are just curious fungi. Gene 246, 1–8. van den Ent, F., Amos, L.A., and Lowe, J. (2001). Prokaryotic origin of the actin cytoskeleton. Nature 413, 39–44.
© Copyright 2026 Paperzz