Inlernalionol Journal/or Parasilo/og.y, Vol. 26, No. 6, pp. 589417, 1996 ~0 1996 Australian Society for Parasitology. Pubhshed by Elsevia Science Ltd Printed in Great Britain 002&7519/96 $15.00+0.00 SOO20-7519(96)00044-6 Copyright Pergamon PII: INVITED REVIEW Phylogenetic Tree-building DAVID Molecular Parasitology A. MORRISON Unit, University of Technology Sydney, Westbourne Street, Gore Hill, NS W 2065, Australia (Received 13 October 1995; accepted 3 March 1996) Abstract-Morrison D. A. 1996. Phylogenetic tree-building. Iniernationul Journal fir Parasitology 26: 589-617. Cladistic analysis is an approach to phylogeny reconstruction that groups taxa in such a way that those with historically more-recent ancestors form groups nested within groups of taxa with moredistant ancestors. This nested set of taxa can be represented as a branching diagram or tree (a cladogram), which is an hypothesis of the evolutionary history of the taxa. The analysis is performed by searching for nested groups of shared derived character states. These shared derived character states dehne monophyletic groups of taxa (clades), which include all of the descendants of the most recent common ancestor. If all of the characters for a set of taxa are congruent, then reconstructing the phylogenetic tree is unproblematic. However, most real data sets contain incongruent characters, and consequently a wide range of tree-building methods has been developed. These methods differ in a variety of characteristics, and they may produce topologically distinct trees for a single data set. None of the currently-available methods are simultaneously efficient, powerful, consistent and robust, and thus there is no single ideal method. However, many of them appear to perform well under a wide range of conditions, with the exception of the UPGMA method and the fnvariants method. Copyright 0 1996 Australian Society for Parasitology. Published by Elsevier Science Ltd. Key words: Phylogeny; evolution; cladistics; cladograms. English of Hennig (1966). These methods are now usually referred to as “cladistics”, and the evolutionary diagrams they produce “cladograms”, to distinguish them from all prior phylogenetic studies, many of which were neither explicit nor repeatable (note that I am using the word cladistics to include a wide class of explicit and repeatable phylogenetic analyses, which may be a broader definition than would be accepted by many phylogeneticists, who would restrict the term to the more strictly Hennigian methods). The current interest in cladistic analyses has inevitably led to a proliferation of data analysis techniques; and the apparent plethora of methods for phylogenetic analysis is typical of a young science still coming to terms with both its aspirations and its constraints. It is therefore important for practitioners to understand the limitations of the available techniques as well as to appreciate their capabilities (Felsenstein, 1982). Unfortunately, the current wide choice among possible phylogenetic methods seems to be daunting INTRODUCTION If we accept the proposition that evolution exists, then meaningful comparisons among organisms must ultimately include a phylogenetic context (Maddison & Maddison, 1992). This is because the evolutionary relationships among a group of taxa constrain any other possible relationships that might exist. It is thus not surprising that in biology there has been a long history of attempts to deal with the reconstruction of genealogical history (Nelson & Platnick, 1981; Mayr, 1982; Stevens, 1994), notwithstanding the difficulties associated with producing testable hypotheses about historically unique events. It is, however, only in the last 30 years that widespread attempts have been made by systematists to produce explicit and repeatable methods for the construction of phylogenetic trees (Felsenstein, 1982), notably with the translation from German into Tel: +61-2-3304159; Fax: +61-2-3304003; E-mail: [email protected]. 589 590 D. A. Morrison for many people, and they thus acquire little knowledge of the relative advantages and disadvantages of the various methods. This is unfortunate, because the choice of data-analysis methods should be based on their apparent appropriateness for the data at hand, rather than on the local availability of computer programs or on historical inertia (Hillis, Allard & Miyamoto, 1993). It is my purpose here to review the phylogenetic inference methods that are currently available, and to indicate what is currently known about their strengths and weaknesses. This will allow a more informed decision to be made when assessing which of the methods might be appropriate for a particular data set. Along the way, I will attempt to make clear some of the aspects of phylogenetic analysis that are obviously misunderstood by non-specialists, and to dispel a few widely-held misconceptions. My discussion will focus on molecular sequence data (particularly DNA and RNA), since trees derived from this source are increasingly those with which parasitologists are working (e.g., Nadler, 1990), there being a limit to the usefulness of phenotypic characteristics for constructing phylogenetic trees for most parasites. Indeed, it is the proliferation of molecular data sets that is introducing many people from outside of systematics to the science of phylogeny (Miyamoto & Cracraft, 1991; Hillis, Huelsenbeck & Cunningham, 1994a); and it is for this reason that phylogenetic principles need to be clearly introduced to nonexperts. Furthermore, many of the recent advances in cladistics have been motivated by attempts to deal with problems that are specific to molecular data (Penny et al., 1990), and the similarities and differences between traditional and molecular phylogeny thus need to be emphasized. I do not intend to describe the tree-building techniques in any great detail, partly because most of them are covered in the excellent review by Swofford & Olsen (1990) and in the books by Nei (1987) and Li & Graur (1991), and partly because nothing is more intimidating to most biologists than mathematics. I concentrate much more on those aspects of phylogenetic analysis that are likely to be of most practical and theoretical interest to nonexperts. I start by summarizing the principles of cladistic analysis, before proceeding to a review of the alternative methods for constructing cladograms. A number of introductory reviews covering similar topics, but from different perspectives, include those of Felsenstein (1988), Olsen (1988) Sneath (1989), Beanland & Howe (1992), Hillis et al. (1993) and Stewart (1993); and there is also the worthwhile introductory book by Forey et al. (1992). PHYLOGENETIC ANALYSIS Cladistics The study of evolutionary processes has often been considered to be unscientific because it deals with historically unique events (Popper, 1957). Hypotheses concerning these events are thus not universal (in either space or time) and, therefore, they are considered to be untestable in the contemporary world. The sociological development of phylogenetic analysis has consequently been based largely on the erection of what have been called “evolutionary scenarios” describing the presumed genealogical history of the organisms under study. The number of such scenarios that may be created is, of course, limited solely by the imagination of the researcher, and none of the scenarios are likely to be open to falsification. Cladistic analysis can thus be seen as an attempt to base phylogenetic analysis on a more objective footing, where the phylogenetic hypotheses are explicitly stated, along with the evidence supporting (and contradicting) them, and are then subjected to quantitative testing. Its practitioners therefore claim that cladistics is designed to make phylogenetic analysis into an hypothetico-deductive science, where explicit hypotheses are subjected to repeatable attempts at falsification. Note that the claimed advantages of cladistic analysis are not intended to denigrate pre-cladistic biologists, nor is there any suggestion that these biologists did not apply their minds to phylogenetic questions. However, it is clear that pre-cladistic phylogenetic analyses were not necessarily based on repeatable methods that produced explicit hypotheses of evolutionary relationship which could be subjected to falsification (Nelson & Platnick, 1981). Furthermore, the taxonomic groups produced by pre-cladistic biologists were not necessarily monophyletic (see below), and therefore did not always reflect evolutionary history. Post-Darwinian biologists have had the unenviable task of producing taxonomic schemes that should, in theory, reflect evolutionary history, without any theoretical framework for how they should go about discovering what the evolutionary history actually was (Stevens, 1994). Cladistics is an attempt to provide this theoretical framework. Cladistic analysis as an approach to phylogeny attempts to group taxa on the basis of their ancestry. In the analysis, taxa are grouped in such a way that those with historically more-recent ancestors form groups nested within groups of taxa with moredistant ancestors. The analytical technique is based on a widely-held view of the mode of the evolutionary process: species are lineages undergoing Invited Review divergent evolution with modification of their intrinsic attributes, the attributes being transformed through time from ancestral to derived states. Thus, if a species is a group of populations, then if 1 group of populations acquires a new (derived, advanced or apomorphic) character state while the rest do not (i.e. they retain the ancestral, primitive or plesiomorphic state) then these populations constitute a new species. This new species then forms a separate historical lineage that diverges from the other populations, and maintains its own historical tendencies and fate. Evolution can thus be thought of as a branching sequence of historical lineages, and indeed the only diagram in Darwin (1859) represents precisely such a branching sequence. When a character changes from an ancestral state to a derived state in a lineage it is historically unique (a novelty), and it will be passed on to all of the descendants of that lineage (even if the character is later modified into something else). Therefore, the branching sequence of evolution can be deduced by searching for nested groups of shared derived character states (synapomorphies) among the taxa being analysed. So, if a derived character state is observed in 2 or more taxa, then we can hypothesize that they share this apomorphy because they are descended from a common ancestor, and that they inherited the apomorphy from that ancestor. The possession of a shared ancestral state (a symplesiomorphy) tells us nothing about the phylogeny of the taxa, since this state was inherited from an ancestor that is also held in common with those taxa possessing the derived state. Thus, cladistic analysis is simply the search for nested sets (a hierarchy) of synapomorphies among the taxa. Each synapomorphy represents an ancestral lineage that has diverged from its related lineages, thus being contemporary evidence for a prior evolutionary event (Table 1). Clearly, the only characters that are of use for a cladistic analysis of a group of contemporary taxa are those features that reflect their evolutionary history. There is, of course, no simple method for determining which features these are, but at least the method forces the practitioner to be explicit about the characters that have been chosen for the analysis. Homology For this analytical technique to work, homologous rather than analogous character states must be compared across the taxa (Hall, 1994). That is, for all of the taxa we must compare like with like, particularly with reference to the evolutionary origin of the attribuies. Characters and their states are thus hypotheses of evolutionary homology. As an example, 591 for Character 16 in Table 1 possession of denticles, dermal scales, epidermal scales, feathers and hair are all considered to be homologous character states of a single character. In practice, characters and their states are postulated as homologous on the basis of their structural, positional, ontogenetic, compositional and/or functional correspondences; and they are postulated between different taxa so as to maximize the number of one-to-one correspondences of their parts (Stevens, 1984). The choice of characters to be included in a cladistic analysis may be somewhat arbitrary, but can include intrinsic attributes such as morphology, anatomy, embryology, behaviour, physiology, ultrastructure, cytology, biochemistry, immunology, ecology and even geography. The important points are that the characters used in the analysis are hypothesized to reflect the evolutionary history of the taxa, and that the character states of a single character are hypothesized to have a unique evolutionary origin. Concepts of homology are often intuitively obvious when dealing with, for example, morphological data, and these concepts can be put into practice through a detailed study of ontogeny (e.g., the homologies postulated in Table 1 are thus mostly unproblematic); and the alignment of molecular sequences is the direct equivalent of these homology assessments (Mindell, 1991). The concepts of homology in molecular and morphological studies are thus fundamentally the same (de Pinna, 1991; 1993). So, alignment of molecular Williams, sequences involves a series of hypotheses of homology among the taxa, with 1 hypothesis of homology for each position (nucleotide or amino acid) in the sequence. It is important to recognize this, because it is clear that often very little attention is paid by molecular biologists to this point to a morphologist the assessment of homology is an important (and time-consuming) component of phylogenetic analysis, but to a molecular biologist the assessment of homology is apparently often an afterthought. The correct formulation of hypotheses of homology is just as important for molecular data as it is for morphological data; and it is clear from the literature that many of the so-called controversies about the phylogeny of particular groups of parasites are based as much on disagreements about sequence alignment as on disagreements about actual evolutionary events (e.g., Ellis & Morrison, 1995). Unfortunately, for molecular data there is little possibility of further investigations (such as ontogeny) to assess homology, and so in practice homology assessment is very different for molecular studies (Mindell, 1991). Positional homology can be represented by either identical character states none no no no no no no scale-less 16 Body covering Em al denticles no no no no no no none no no no no no yes yes Sharks no no no no no none no no no no no no dermal scales yes yes yes yes yes Ermal scales no no no no no smooth epidermis II0 no none yes yes yes yes no no no none no no no no no yes yes Frogs yes yes yes Teleosts Lungfish no none no no no no no no smooth epidermis yes yes yes Yes yes yes yes Salamanders no yes none no no no no no no epidermal scales yes yes yes yes yes yes Turtles no yes two yes yes no no no no epidermal scales yes yes yes yes yes yes Lizards no yes two yes yes no no no no epidermal scales yes yes yes yes yes yes Snakes no yes two no no yes Yes yes no epidermal yes yes yes yes yes Crocodiles phenotypic data for extant vertebrates that might be useful for reconstructing their evolutionary history no yes one no no no no no yes hair feathers yes yes yes yes yes yes Mammals no yes two no no yes yes yes yes yes yes yes yes yes yes Birds These 16 characters each have at least 2 character states that are hypothesized to be homologous, one of which is postulated to be ancestral (the states “no”, “none”, and “scale-less”). The other (one or more) states are postulated to be derived from this ancestral state, and the taxa that share these derived states are then hypothesized to have inherited these states from a common ancestor. These shared derived character states (synapomorphies) thus group the taxa into sets that reflect their evolutionary history. Note that Character I has only I state in this data set, because the derived state has been inherited by all of these taxa, and it thus forms a synapomorphy for the group (vertebrates) as a whole. Furthermore, Characters 9 and I6 have more than 2 homologous states, thus implying that the ancestral state has been modified more than once during the evolutionary history of these taxa; each of these derived states forms its own set of taxa. A possible cladogram derived from these data is shown in Fig. I. The data are from Romer (1971) and Maddison & Maddison (1992). Note that this non-parasitological example has been chosen because the homology assessmentsand the resulting cladogram are not controversial, which is something that cannot be said for any parasitological example known to me. IlO no no no no no no yes Lampreys 1 Internal skeleton 2 Jaws 3 Ossified skeleton 4 Internal nostrils 5 Atria1 septum 6 Four limbs 7 Teeth pedicillate 8 Amniotic egg 9 Temporal fenestrae IO Hemipenes 11 Suspensorium streptosylous I2 Antorbital fenestrae 13 Lateral fenestrae ossified I4 Gizzard I5 Homeothermy Character Table I-Some Invited (nucleotides or amino acids) in all sequences, substitutions in 1 or more sequences (representing point mutations), or insertions/deletions (indels) in 1 or more sequences. The most problematic aspect of sequence alignment is the positioning of indels, and this problem becomes more acute for more divergent sequences. It is worthwhile in this context to distinguish between gaps, which are introduced into the sequences by the alignment procedure, and indels, which are actual mutation events (Olsen, 1988); clearly, the objective is to introduce into the sequences only gaps that truly represent indels. There are 3 possible scenarios for the degree of difficulty of sequence alignment. Firstly, there may be relatively few indels, in which case a robust sequence alignment can usually be produced by hand. Such a situation is shown, for example, by protein-coding parts of mtDNA (Miyamoto & Cracraft, 1991) and the plant rbcL gene (Chase et al., 1993). Secondly, the sequence may represent a molecule for which there is an a priori biological model of secondary structure in which certain active sites must be maintained; the alignment is then constrained by the base-pairing of the model (Olsen, 1988). Such a situation is shown, for example, by the 5s (Specht, Wolters & Erdman, 1990), 12s (Gutell et al., 1985) and 18s (Van de Peer et a/., 1994) rRNA genes. Thirdly, there may be many indels and no a priori structure model. Under these circumstances it is usual to use a mathematical algorithm to produce the alignment. These algorithms all attempt to produce a sequence alignment that optimizes some chosen criterion of match between the individual sequences (cost). That is, the sequences are compared using a pattern-matching process that searches for correspondence between the elements of the sequences, introducing gaps into the sequences as required to maximize some criterion for optimality of the correspondence. There are many algorithms currently available (see Waterman, 1989; Doolittle, 1990; Chan, Wong & Chiu, 1992; and McClure, Vasi & Fitch, 1994), which optimize a variety of mathematical functions measuring the overall alignment cost. When there are more than 2 sequences, most of these algorithms use exact procedures (see below for a definition) to align the sequences pair-wise, but then use heuristic procedures (see below for a definition) to braid these alignments into a multiple alignment. Thus, these procedures do not guarantee to produce the globally optimal alignment; nor do they guarantee that the optimal alignment (even if they could find it) represents the true alignment (Thorne & Kishino, 1992). I am not going to discuss these various algorithms here, because this is an area of active research and Review 593 very little is known about their theoretical or practical limitations (Thorne & Kishino, 1992; McClure et al., 1994). However, 1 important generalization may be made: the differences between the alignments produced by the various algorithms is often less than the differences produced by varying the “gap weights”. These weights refer to the relative cost of inserting a new gap into a sequence or extending an already-existing gap, and there is no way of determining analytically what these weights should be (Rinsma-Melchert, 1993). Most of the computer programs that implement the alignment algorithms have default values for the weights that are designed to produce “biologically interesting” results, and very few molecular biologists seem to be willing to deviate from these default choices. It is, however, clear that to simply report that a particular computer program was used to align the sequences is meaningless (since the work cannot be verified) unless the weight values are also reported (Wheeler, 1995). In dealing with the problematic nature of sequence alignment, molecular biologists often delete parts of their sequences from the cladistic analysis. The rationale for this is that those parts of the sequences that cannot be aligned reliably should be excluded from the estimation of the phylogeny (Olsen, 1988). This is probably very sensible (Smith, 1994), but unfortunately there is often no objective criterion given for deciding which parts of the alignment are ambiguous, the decisions usually being made by “visual inspection”. Gatesy, DeSalle & Wheeler (1993) have suggested that those parts of the alignment that are sensitive to the gap weights (i.e. where the alignment varies significantly when the gap weights are changed) may constitute unreliable hypotheses of homology and may therefore be candidates for exclusion; and Ellis & Morrison (1995) have shown that for some organisms it is the doublestranded parts of rRNA that may contain most of the phylogenetic information when the sequences are aligned according to secondary structure. Thus, there is considerable room for further research into the problems of sequence alignment. It is also important to recognize that there is also a more general level of homology assessment for molecular data. The sequences being compared must themselves be homologous rather than analogous (Fitch, 1970). Thus, only orthologous sequences will reflect the historical relationships of species, while paralogous sequences (e.g., the 2 sequences that result from a gene duplication) will reflect only gene history, xenologous sequences (e.g., recentlyincorporated sequences such as result from horizontal gene transfer) will only partly reflect gene history, and plerology (e.g., the inter-mixture of exons and 594 D. A. Morrison introns) will only reconstruct a composite gene history (Patterson, 1988; Williams, 1993). All molecular studies thus rely on the assumption that the sequences from the taxa being compared are orthologous, and this may be a dubious assumption for distantly related taxa (Sneath, 1989). Any sequence for which orthology has not been established should be omitted from the analysis (Olsen, 1988). Furthermore, for orthologous sequences there is the implicit assumption that the sequences being compared are actually from the organisms being studied, rather than from some other co-habiting organism. For example, it is now recognized for the ITS gene that many of the published sequences purporting to be from conifers and ferns are actually those of fungi, while the published Mimulus (monkey-flower) sequences are actually those of green algae (P. Weston, personal communication). As a tinal observation on molecular data, it is worth emphasizing the distinction between species trees and gene trees. The result of cladistic analysis of molecular data is a gene tree (provided that the entire gene has been sequenced), hypothesizing relationships among the genes or genomes that have been sampled, whereas a species tree reflects the actual evolutionary pathways (Pamilo & Nei, 1988). The gene tree may be fundamentally incongruent with the true species phylogeny, with the genome tree, and with other gene trees (e.g., Cao et al., 1994; Cummings, Otto & Wakeley, 1995), due to various phenomena such as allelic polymorphism, introgression, lineage sorting, unequal rates of speciation and gene mutation, lateral transfer, hybridization, or mistaken orthology (Pamilo & Nei, 1988; Penny et al., 1990; Doyle, 1992; de Queiroz, Donoghue & Kim, 1995). Thus, for the reconstruction of phylogenetic history, a single gene may be, in practice, no more useful than a single morphological character (Doyle, 1992). It may, therefore, be unwise to assume tacitly in molecular studies that the number of characters being used to reconstruct a phylogeny is equivalent to the number of positions in the sequence. Furthermore, a living organism is an integrated functioning whole, not just a collection of unrelated genetic attributes. Thus, an organism is a collection of interactions between genes, and between genes and their environment (i.e. a phenotypic whole), and it is the organism as a whole that takes part in the evolutionary process. Consequently, there is no more reason for genetics to reflect phylogeny than for anything else to do so (de Queiroz et al., 1995). In fact, morphological characters may be a better reflection, because they integrate many genetic and phenotypic characters. Polarity Having determined the homology of the character states, the key to cladistic analysis is the distinction between derived character states and ancestral states (character polarity). It is important to note that this is a local concept that applies only to a particular set of taxa. By this I mean that a character state is only considered to be derived relative to a specified ancestral state, and it may well be the ancestral state for a further derived state. As an example, for Character 16 in Table 1 possession of epiderma1 scales is a derived state relative to possession of dermal scales, but is an ancestral state relative to possession of either feathers or hair. Thus, possession of epidermal scales is a synapomorphy for Turtles+Lizards+Snakes+Crocodiles+ Birds+Mammals, while possession of feathers is a synapomorphy for Birds and possession of hair is a synapomorphy for Mammals; this character does not supply a synapomorphy for the grouping only of Turtles+Lizards+Snakes+Crocodiles. Possession of epidermal scales is thus a synapomorphy at a more general level than is possession of either feathers or hair. There is thus a hierarchical relationship between ancestral and derived character states, and recognizing synapomorphies therefore involves determining the correct level of generality of homologies. Clearly then, the success of a cladistic analysis rests on the correct determination of the relative apomorphy of the character states, and numerous criteria have been proposed for doing this (Crisci & Stuessy, 1980; Stevens, 1980; Bryant, 1992; Nixon & Carpenter, 1993). Most of these criteria rely on illogical arguments or on assumptions that are either false or untestable; and it is worthwhile recognizing only 2 objective possibilities: the direct method; and outgroup analysis. These are complementary methods, and both may thus be used in any 1 data set. For the direct method, the hierarchical relationship between ancestral and derived character states is observed directly, and it does not require any preexisting hypotheses of character polarity. On the other hand, the outgroup analysis method does not directly observe character polarity, and it relies upon an hypothesis of the relationship of the taxa under study to their near relatives. The direct method (Weston, 1988) states that: if 1 character state is possessedby all of the taxa that also possess the alternative state, and in addition it is possessedby some taxa that do not possessthe other state, then it is postulated to be the ancestral state. For example, open gill slits are possessed by all chordates in at least the embryonic stage, but in tetrapod chordates these close early in development; Invited Review 595 Fig. 1. A phylogenetic tree derived from the character data shown in Table 1. Almost all of the characters shown in Table 1 have derived states that are congruent with this cladogram; that is, the synapomorphies form a perfect series of nested sets. For example, Characters 10 and 11 form a set consisting of the Lizards+Snakes, while Characters 12-14 form a set of the Birds+Crocodiles; these characters are plotted on the branch referring to the hypothesized ancestor where the derived character state arose (and the other characters can be plotted in similar fashion). The only exception to this congruence concerns Character 15, for which 2 evolutionary origins of the derived state must be postulated (i.e. it appears twice on the cladogram): that is, there is an apparent convergence on this cladogram, where the derived state is postulated to have arisen in 2 separate ancestors. This implies a mistaken hypothesis of homology (a homoplasy) for homeothermy in Birds and Mammals. Other cladograms could be constructed in which this convergence does not occur (i.e. in which Character 15 is a synapomorphy uniting Birds+Mammals), but in all of these cases at least 2 of the other characters must then be homoplasious. thus, all chordates possess open gill slits but only some possess both open gill slits (early in development) and closed gill slits (later in development). Consequently, possession of closed gill slits is hypothesized to be derived relative to possession of open gill slits. This type of argumentation can be applied to many types of characters (Nelson, 1978; Weston, 1988, 1994), but it is probably of limited utility for molecular data. However, Weston (1994) has successfully applied the direct method to gene duplications (paralogy) to polarize the a and b subunits of ATPase based on taxa from the archaebacteria, eubacteria and eukaryotes, thus providing a “root” for the tree of life. The outgroup analysis (or indirect) method (Watrous & Wheeler, 1981) states that: if a character state is found in both the ingroup (the group of taxa under study) and also in the outgroup (the sister group of taxa), then it is postulated to be the ancestral state. For this method to provide unequivocal evidence of character polarity, the outgroup should consist of at least 2 sequential sister groups (Maddison, Donoghue & Maddison, 1984). For example in Fig. 1, if we were interested in the phylogeny of the tetrapods (the ingroup thus consisting of Frogs+Salamanders+Turtles+Lizards+ Snakes+Crocodiles+Birds+Mammals), then the relevant sister groups (the outgroup) would be at least the Lunglish and the Teleosts. Thus for Character 8 in Table 1, possession of an amniotic egg is hypothesized to be apomorphous relative to the lack of such an egg because all members of the outgroup and some members of the ingroup lack it, while only some members of the ingroup possess it. This type of argumentation relies on the existence of a corroborated higher level phylogeny for the taxa being studied, because we need a priori knowledge of the sister groups of the ingroup. Such higher level phylogenies in turn may rely on other outgroup comparisons, and so on in a regress back to the origin of life. Ultimately, we must rely on the direct method for at least some of the characters in some of the analyses. Outgroup analysis is the most common type of argumentation in cladistic analysis, but it is now usually implemented in a different way from its original formulation, particularly when analysing molecular data (Smith, 1994). Instead of first determining character state polarity and then producing the cladogram, it is now more common to produce an 596 D. A. Morrison unrooted cladogram (a network) based on simultaneous analysis of all of the characters of both the ingroup and the outgroup and then to determine the root of the tree using the position where the outgroup joins the ingroup (see below). This type of method was first described by Farris (1972), and Nixon & Carpenter (1993) provide a recent detailed summary. In this method, there are no guarantees that any particular number of outgroups or any particular choice of outgroups will ensure that the cladogram accurately reflects the evolutionary history (Li et al., 1987; Smith, 1994; Adachi & Hasegawa, 1995). In practice, the phylogenetic inferences will be more robust if more outgroup taxa are chosen rather than fewer, and the more closely-related these taxa are to the ingroup (Hendy & Penny, 1989; Wheeler, 1990; Nixon & Carpenter, 1993). For molecular data, a distantly-related outgroup may have surpassed the point of saturation of base substitutions, and there will thus be a loss of phylogenetic signal through evolutionary time as a result of random sequence noise (Smith, 1994). The outgroup may thus, in effect, be random, and as such it will simply join the rest of the cladogram on the longest internal branch in the ingroup and will itself have a long terminal branch (Wheeler, 1990). The best strategy may be for the outgroup to consist of several taxa from the sister group to the ingroup (Smith, 1994). Conclusion Ideally, all hypotheses of homology and relative apomorphy will be congruent with one another. That is, the sets of synapomorphies for a group of taxa will form a perfect nested hierarchy, and the polarization of the character states provides a root for the tree. The construction of a cladogram from the data would then be unproblematic (Fig. 1); and it was indeed under these circumstances that Hennig (1966) introduced his phylogenetic method. In most of the real world, there are, however, always characters that are incongruent with each other (homoplasies). These incongruent characters are postulated to result from either reversals, where a derived character state reverts to the ancestral state, parallelisms, where the same derived character state arises in separate evolutionary lineages, or convergences, where superficially similar character states have arisen in separate lineages. Homoplasies are thus mistaken hypotheses of homology. In order to deal with the common existence of homoplasies, a range of tree-building techniques has been developed in cladistics. Each technique implements a different stratagem for how homoplasies are to be treated, and each technique may thus produce a different hypothesis of phylogenetic history for the taxa being analysed. If a data set is perfectly congruent (i.e. all of the characters reflect the same speciation and phylesis events), then all of the techniques will produce the same cladogram, which will be the cladogram produced by the original method of Hennig (1966). However, apparent incongruences almost always exist in the real world, and therefore a range of methods has been developed to try to detect the phylogenetic pattern that underlies the apparent contradictions. Cladistics is basically axiomatic, in the sense that if the assumptions (the axioms) are accepted then the rest (the corollaries) follows directly from them. Thus, like all axiomatic propositions, if one or more of the assumptions are invalid, then the rest of the edifice falls with them. The key assumption is that evolution is mainly a divergent process. Homoplasies are thus assumed to represent “errors”, rather than evidence in favor of some form of alternative evolutionary process (i.e. that does not produce a tree-like set of relationships), such as hybridization, endosymbiosis, recombination, gene duplication (producing pseudogenes), or lateral transfer. You might like to keep this in mind when you next attempt to interpret an apparently fully-bifurcating cladogram, as such alternative evolutionary processes are increasingly being recognized as relatively common. Phylogenetic trees For the purposes of cladistics, it is assumed that the phylogenetic pattern of evolutionary history can be represented as a branching diagram like a tree (a cladogram), with the terminal branches (or leaves) linking the taxa being analysed and the internal branches (or internodes) linking hypothesized ancestral taxa (nodes) (Fig. 1). The cladogram is thus usually constructed purely from a knowledge of contemporary taxa, but it implies the existence of ancestors with particular characteristics. Traditionally, the descendants of an ancestor on a cladogram are called daughters, while the siblings after a speciation event are called sisters (so a descendant is a daughter relative to its ancestor and a sister relative to its other sibling). Note that if either of the daughters undergoes further speciation then the sister to a particular contemporary taxon may actually be a group of contemporary taxa (i.e. all of the descendants of its sister); thus the sister to the Turtles in Fig. 1 is the group formed by the Lizards+Snakes+ Crocodiles+Birds+Mammals. Two types of evolutionary information can be represented on a cladogram. Firstly, the speciation Invited Review 591 Fig. 2. The same phylogenetic tree as shown in Fig. I, but with the branch lengths drawn proportional to the number of characters being transformed from an ancestral to a derived state along that branch (the number of character state changes is also marked on each branch). This cladogram thus shows both the phylesis and the speciation implied by the data in Table I. events in evolutionary history are represented by the branching sequence of the tree. Secondly, the phylesis events may be represented by the relative length of the branches on the tree (in which case it is sometimes referred to as a phylogram). Thus there is an explicit distinction between those evolutionary events that modify a taxon without causing new taxa to originate (phylesis) and those evolutionary events that do lead to the origin of new taxa (speciation). Both of these events are involved in evolutionary divergence among taxa. However, only the branching sequence (the topology) is essential to a cladogram, and indeed this is the only information shown in Fig. 1; the equivalent tree showing both speciation and phylesis is shown in Fig. 2. This is not to say that the phylesis events are not of interest, but merely that their pattern is not an essential part of the interpretation of the tree; and indeed only some of the phylesis can be shown on the tree (i.e. only for those characters being used) while all of the speciation may be included. Interestingly, the book of Darwin (1859) is basically about phylesis (the transformation of 1 species into another) rather than about speciation (the formation of 2 daughter species from 1 ancestral species), in spite of its title. The branching sequence of a cladogram organizes taxa into monophyletic groups (clades), and the best way to discuss the evolutionary pattern represented by a cladogram is in terms of which taxa form monophyletic groups on the tree. A monophyletic group is: a group of taxa descended from a single ancestor and which includes all of the taxa descended from this ancestor. Groups that do not include all of the descendants of their most recent common ancestor are called paraphyletic groups. Thus in Fig. 1 Crocodiles+Birds form a monophyletic group, as do Lizards+Snakes+Crocodiles+Birds, and also Lizards+Snakes+CrocodiIes+Birds+Mammals, but Lizards+Snakes+Crocodiles form a paraphyletic group (because their most recent common ancestor also has Birds as a descendant); the Class Reptilia as usually defined is thus not a monophyletic group (it would have to include all of the members of the traditional Class Aves and Class Mammalia as well). Simple though it may seem, it is my impression that the recognition of monophyletic groups, and thus the description and interpretation of the evolutionary pattern shown by a cladogram, appears to be one of the more difficult aspects of cladistics for non-experts (see also the specific examples discussed by Penny et al., 1990 and Brooks & McLennan, 1992). It should also be recognized that any 1 cladogram can actually be drawn as a tree in many different ways (Fig. 3). This is because the cladogram only specifies the temporal order of the branching D. A. Morrison Fig. 3. Several ways of presenting the same rooted phylogenetic tree. (A) Shows a cladogram of the hypothetical relationships among 5 taxa; (B) is the same tree as in A but with 1 of the branches (marked) rotated; (C) is the same tree as in (B), but with another one of the branches also rotated; (D) is the same tree as in (C) but with another 2 of the branches rotated. All 4 of these trees represent exactly the same evolutionary history for the 5 taxa involved - that is, they specify exactly the same cladistic relationships among the taxa (e.g. Taxonl is shown as the sister to Taxon in all of the trees, and Taxon is the sister to this pair). There are a further 11 ways of drawing exactly this same tree, as well. sequence daughter ancestor from the is drawn root to the tips, on the tree to the and left which of the and which is drawn to the right has no evolutionary meaning. Consequently, the taxa can be rotated around each internal branch on the tree without in any way affecting the meaning of the cladogram. The best analogy here is to think of a mobile hanging from the ceiling, and to think about the way in which its parts can rotate freely. The mobile does not change its basic form no matter how much the parts are rotated, and similarly the phylogenetic information in the cladogram does not change no matter which way we rotate the branches. It is thus very easy for non-experts to think that 2 trees are different, when in fact the cladogram still specifies exactly the same evolutionary relationship among the taxa (Fig. 3). The distinction between rooted and unrooted trees is very important (Fig. 4), but it is also often problematical for non-experts. As noted above, the most common method of cladistic analysis (particularly for sequence data) involves producing an unrooted tree and then rooting the tree using the outgroup. This rooting of the tree then indicates the direction of evolutionary change on the cladogram; and this allows hypotheses of relative character-state polarity (ancestral vs derived) to be produced, and also allows inferences about the composition of monophyletic groups to made. So, unrooted trees do not indicate character polarity, nor do they unequivocally show monophyletic groups. This is because a group that appears to be monophyletic on an unrooted tree will be paraphyletic if the tree is rooted within this group. For example, in Fig. 4 the group Taxonl+Taxon2+ Taxon is only indicated as monophyletic in Tree E, because this is the only tree of those shown in which the root lies outside of this group. Any specified unrooted tree can actually be rooted on any of its branches (Fig. 4), which is why it is important to have an objective criterion (such as an outgroup, or some characters polarized by the direct method) for determining the position of the root. Invited Review 599 A B C Fig. 4. Several of the rooted phylogenetic trees that can be derived from a single unrooted phylogenetic tree. (A) Shows an unrooted cladogram of the hypothetical relationships among 5 taxa; (B) is the rooted cladogram that results from rooting the tree in (A) on the terminal branch leading to Taxonl; (C) is the rooted cladogram that results from rooting the tree in (A) on the internal branch connecting Taxonl+TaxonZ with the other taxa; (D) is the rooted cladogram that results from rooting the tree in (A) on the terminal branch leading to Taxon3; (E) is the rooted cladogram that results from rooting the tree in (A) on the terminal branch leading to Taxon5. The unrooted tree in (A) could also be rooted on the terminal branch leading to Taxon2, on the terminal branch leading to Taxon4, and on the internal branch connecting Taxon4+TaxonS with the other taxa. There are thus as many possible rooting points on an unrooted tree as there are branches on the tree. Note that the unrooted tree does not specify the direction of evolutionary change, while each of the rooted trees specifies a different direction of evolutionary change. It is worthwhile pointing out here that, in spite of its limitations, an unrooted tree is, in itself, still a valuable phylogenetic commodity. This is because it limits the possible cladograms (rooted trees) that are supported by the data. For example, for 5 taxa there are 105 possible rooted trees, and yet the unrooted tree shown in Fig. 4 supports only 7 of these possible trees (i.e. we have rejected 98 possibilities). Furthermore, it will be possible to answer many phylogenetic questions using only the unrooted tree. For example, none of the rooted trees that can be derived from the unrooted tree shown in Fig. 4 has Taxonl+Taxon3 as a monophyletic group (since Taxon must also be included). Therefore, if this grouping was one of the hypotheses of interest, then we can test it adequately without ever needing to 600 D. A. Morrison Table 2-Some characteristics of commonly-used tree-building methods Method Procedure” Datab Taxa’ Data* 1 Unweighted pair-group 2 Neighbor-joining 3 Neighborliness 4 Minimum-evolution 5 Distance-Wagner 6 Least-squares 7 Maximum-parsimony 8 Weighted-parsimony 9 Compatibility 10 Maximum-likelihood 11 Invariants 12 Spectral analysis ak ak ak opt opt opt opt opt opt opt opt opt dist dist dist dist dist dist orig orig orig orig orig orig all all quart all all all all all all all quart all morph morph morph morph morph morph morph morph morph molec molec molec “Method chooses uses an algorithm bMethod analyses taxa (dist). ‘Method performs dMethod suitable studies (molec). a tree based on an optimality criterion (opt) (i.e. a search method) or simply (alg) (i.e. a constructive method). the original data (orig) or data that have been converted to distances between calculations on all taxa (all) or only for both morphological and molecular on subsets of taxa (quart). studies (morph) or only for molecular determine the root of the tree. Consequently, the production of an unrooted tree is a very big step towards the goal of producing a cladogram. Finally, it is worthwhile remembering that a cladogram represents a set of hypotheses concerning evolutionary history. There are hypotheses about: character homology; character polarity; and phylogenetic relationships of taxa. All of these hypotheses are stated explicitly by the cladogram itself, irrespective of whether they have actually been shown on the diagram. The cladogram does not constitute an experimental test of the evolutionary history of the taxa, but it is a test of hypotheses of possible phylogenetic patterns. Becausethese hypotheses of pattern are stated explicitly, they are open to objective tests of their robustness. Furthermore, the hypotheses of pattern are open to further objective testing through cladistic analyses of other data setsfor the same set of taxa. This may be as close as we will ever get to a the data matrix directly, while others require that the data have been converted to distances between taxa before the analysis (i.e. a matrix of all possible pair-wise distances between the taxa is calculated). Furthermore, some of the methods that use an optimality criterion evaluate that criterion over all taxa, while some methods evaluate the criterion on subsets of the taxa (usually 4 taxa, called quartets). Some methods have been developed specifically for sequence data, while others may be applied to any type of data. The more commonly-used methods (Huelsenbeck, 1995a) and their characteristics are summarized in Table 2 , and are discussed below. For those methods that have an optimality criterion, it is important to recognize that they are based on a double-level optimization, where first the optimality criterion is optimized for a given tree topology, and then the tree that optimizes these optimal values is selected (i.e. first you make each tree as hypothetico-deductive good Methods for constructing science of phylogeny. trees Tree building. The tree-building methods that have been developed vary in many properties (Swofford & Olsen, 1990; Penny, Hendy & Steel, 1992). Some methods choose the tree from among all of those possible that either maximizes or minimizes some optimality criterion (search methods), while other methods merely follow an algorithm (a pre-defined sequence of operations) to produce a single tree (constructive methods). Some of the methods analyse as you can, and then you find the tree that is the best). Methods thus need to be known that accomplish the optimization for a specified tree, and then methods need to be known for searching among the trees for the optimal one. The first class of methods will be unique for each tree-building procedure, but the second class of methods is more general. There are several strategies for finding the optimal tree (or trees). Exact methods are those that guarantee to find the optimal solution, while heuristic methods are computationally-efficient strategies that should produce a solution that is at least close to the Invited Review optimal one even if it does not find the optimum. The most efficient (see below) of the exact methods involve the use of the branch-and-bound strategy (Hendy & Penny, 1982). The need for heuristic methods is simply one of practicality - the number of possible trees to be tested by the exact methods increases exponentially with increasing numbers of taxa (Felsenstein, 1978a). Heuristic search strategies first try to find a tree that is a good estimate of the final solution, usually by sequentially adding the taxa to the growing tree, adding each taxon in the optimal place on the tree. Most heuristic strategies then search through those trees that are similar to this initial tree in order to find a better solution, usually using branch swapping (Swofford & Olsen, 1990). The important point to note is that heuristic searches do not necessarily find the optimal solution, nor do they necessarily get particularly close to it. It is therefore important to accurately describe any heuristic procedures used in an analysis, so that the likely success of the method can be assessed. There are many methods for calculating the distances for those methods that require them. Distances specify a relationship between pairs of taxa, with taxa that have identical states for all of the characters having zero distance between them, and taxa that have no character states in common having an arbitrarily large distance. Some data that are useful for cladistic purposes, such as those from immunology, nucleic acid hybridization and breeding experiments, are automatically expressed as pair-wise distances (West & Faith, 1990); but most character data need to be mathematically transformed into distances. The basic assumption when analysing distances is that the distances used provide an estimate of the evolutionary divergence among the taxa. The wide array of possible general measures of inter-taxon relationship is summarized by Clifford & Stephenson (1975). The most commonly used direct distance measures are the manhattan (sum of the character-state differences for all of the characters) and euclidean (square-root of the sum of the squared differences) distances, although the former may have more biological meaning (West & Faith, 1990). For data with a mixture of multi-state characters (i.e. where different characters have different numbers of states) it will be necessary to standardize the data first, and the most popular standardized distance is the gower (range-standardized manhattan) distance. It is also possible to convert measures of similarity into distances (Swofford & Olsen, 1990), and there are many published versions of such measures (Hubalek, 1982). For DNA/RNA sequence data, the distance between taxa can be estimated by the total number of 601 nucleotide differences between them (Gojobori, Moriyama & Kimura, 1990). However, rather than just count the number of observed differences in nucleotides between 2 taxa, it is usual to recognize that there are likely to have been many superimposed substitution/indel events through evolutionary time (e.g., multiple substitutions at any one nucleotide position), and that these superimposed events cannot be observed in the contemporary taxa. Many methods have thus been proposed to try to correctly estimate sequence divergence from observed nucleotide differences (see Gojobori et al., 1990; Pollock & Goldstein, 1995; Rzhetsky & Nei, 1995), based on models of nucleotide evolution that vary from very simple to quite sophisticated. These methods work by estimating the expected amount of nucleotide divergence between the sequences, assuming some specified probability that nucleotide changes will occur during an evolutionary timeperiod (such as that the number of changes follows a Poisson frequency distribution), usually by diIIerentially weighting the various transitions and transversions that are possible among the 4 nucleotide character-states (A, C, G, T/U) to reflect their relative probability of occurrence. The differences observed between the taxa are thus “corrected” for unobserved multiple substitutioniindel events, and the distances derived from these corrected data should provide a better estimate of the evolutionary divergence among the taxa. The studies to date indicate that the simplest of the various models of nucleotide evolution that is likely to apply to real data sets is the 2-parameter model of Kimura (1980), but that it may not be applicable when the sequences are distantly related and/or the total number of nucleotides examined is large (Rzhetsky & Nei, 1995). Part of the problem here is that the rate of nucleotide substitution varies within evolutionary hneages (Li er al., 1987) as well as among sites within a sequence (Olsen, 1987; Shoemaker & Fitch, 1989). Consequently, the nucleotide evolution models may be better if the variance of the estimates is reduced (Pollock & Goldstein, 1993, and if they assume that the rate of nucleotide substitution varies from site to site according to a gamma frequency distribution (Jin & Nei, 1990). Many other methods have also been proposed specifically for calculating distances from proteincoding DNA sequences, based on comparing codons rather than nucleotides. They recognize the distinction between non-synonymous and synonymous nucleotide substitutions (i.e. those that do and do not result in changes in the amino acid coded for), and include both weighted and unweighted pathways methods (Gojobori et al., 1990). Distances may also 602 D. A. Morrison be calculated directly from amino acid sequences (Ota & Nei, 1994). Furthermore, a number of methods have been proposed specifically for converting frequency data (such as allelic or genotypic frequencies) to distances, based on different genetic models (Rogers, 1991), and also for calculating distances from restriction endonuclease data (Nei & Li, 1979). Tree-building methods. Here, I provide only a brief introduction to the procedures employed by the various tree-building methods. More details of most of these methods, along with worked examples, are provided by Swofford & Olsen (1990), and Charleston, Hendy & Penny (1994) provide a brief mathematical exposition. The Unweighted pair-group method using arithmetic average (UPGMA) is an algorithmic method that analyses distance data (Michener & Sokal, 1957; Sokal & Michener, 1958). It groups the taxa based on increasing distances between the taxa, starting by grouping the 2 taxa with the smallest distance, and then progressively adding more-distant taxa to the group or to new groups. It assumes that the data are ultrametric; that is, a tree can be constructed so that the observed distance between any 2 taxa (i.e. in the original data matrix) is equal to the sum of the branch lengths joining them (i.e. the distance as measured along the branches connecting the 2 taxa on the tree), and that the tree is rooted so that all of the taxa are equidistant from the root. Neighbor-joining (Saitou & Nei, 1987; Studier & Keppler, 1988) is another algorithmic method that analyses distance data. It is similar to UPGMA in that it starts by grouping the 2 taxa with the smallest distance, and then progressively adds more-distant taxa to the group or to new groups. However, it differs in modifying the distance matrix at each step, so that each pair of groups is adjusted on the basis of their average divergence from all other groups. The method thus relaxes the ultrametric assumption, and assumes only that the data are additive; that is, a tree can be constructed so that the observed distance between any 2 taxa (i.e. in the original distance matrix) is equal to the sum of the branch lengths connecting them on the tree (patristic distance or pathlength). The Neighborliness procedure (Sattath & Tversky, 1977; Fitch, 1981) is the final algorithmic method discussed that analyses distance data. It is based on evaluating the 4-point metric (Dobson, 1974), a relaxed definition of additivity for a set of 4 taxa. It specifies the conditions for clustering 4 taxa into 2 pairs, based on the relative ranking of the patristic distances between all possible pairs of the taxa. Neighborliness is thus a quartet method, evaluating the 4-point metric for all possible subsets of 4 taxa (quartets) to decide how the 4 taxa should be clustered, and then building clusters of the taxa that are consistent with the largest fraction of the quartets (i.e. the tree is the one for which the 4-point metric is satisfied the largest number of times). Other algorithmic methods that analyse distance data have been described by Farris (1977), Chakraborty (1977), Klotz & Blanken (1981), Li (1981), Tateno, Nei & Tajima (1982), Hasegawa, Kishino & Yano (1985), and Charleston et al. (1994); but none of these has been widely used. The Minimum-evolution procedure (Edwards & Cavalli-Sforza, 1964) analyses distance data using an optimality criterion. The tree chosen is the one that minimizes the sum of the lengths of all of the branches on the tree, these branch lengths being estimated using linear algebra on the observed distances (i.e. from the original distance matrix) between 3 taxa inter-connected by a common node. Rzhetsky & Nei (1993) provide an exact method for optimizing each tree; and Rzhetsky & Nei (1992) have suggested an heuristic method for finding the optimal tree by pointing out that the neighbor-joining algorithm is guaranteed to find the Minimum-evolution tree for 4 taxa. The Distance-Wagner method (Farris, 1972) also analyses distance data using an optimality criterion. The optimality criterion is the same as for the Minimum-evolution procedure: the desired tree is the one that minimizes the sum of all of the branch lengths on the tree. However, in this case all of the patristic distances (along the tree branches) connecting any 2 taxa are constrained to be greater than the observed distances between these taxa, whereas the Minimum evolution procedure assumes that some of the patristic distances will be greater and some smaller than the observed distances. Exact methods for optimizing each tree have been described (Beyer et aZ., 1974; Waterman et al., 1977), and heuristic procedures for tinding the optimal tree are usually used (Farris, 1972; Swofford, 1981; Faith, 1985). The Weighted least-squares (Fitch & Margoliash, 1967) and the Unweighted least-squares (Cavalli-Sforza & Edwards, 1967) are related methods that analyse distance data using an optimality criterion. They both choose the tree that minimizes the error (i.e. the measured discrepancy) between the observed distances between the taxa (i.e. the original distance matrix) and the patristic distances connecting the taxa on the tree (pathlength). So, for each possible tree they compare the sum of all of the squared differences between the distances in Invited Review the original matrix and those on the tree, and they then choose the tree that minimizes this sum. The 2 methods differ in the formula they actually use to measure the discrepancy between the observed and patristic distances, the unweighted method assuming that all of the distance estimates have the same absolute error and the weighted method assuming that they are have the same percentage error. Exact methods for optimizing each tree have been described (Kidd & Sgaramella-Zonta, 1971; Beyer et aZ., 1974; Waterman PZ al, 1977), and heuristic procedures for finding the optimal tree are usually used. The Maximum-parsimony method (Farris, 1970; Fitch, 1971) is the first method to be discussed that analyses the original data matrix using an optimality criterion. It is probably the most popular method among cladists, and indeed many cladists restrict the definition of cladistics to include this method alone. The optimal tree is the one that has the minimum number of inferred character state changes (i.e. shortest tree length or fewest evolutionary steps). Exact methods for optimizing each tree have been described (Fitch, 1971; Swofford & Maddison, 1987). Exact methods for finding the optimal tree include an exhaustive search (i.e. testing every possible tree topology) and branch-and-bound methods, but heuristic methods must be used for analyses with more than about 20 taxa (Swofford & Olsen, 1990). Special methods have been developed for frequency data, such as allelic or genotypic frequencies (Swofford & Berlocher, 1987). The Weighted-parsimony method (Farris, 1969; Sankoff, 1975) is a generalization of the Maximumparsimony method that allows the characters and character states to be weighted so that they contribute unequally to the calculation of optimality (note that character weighting and character-state weighting are separate ideas; Williams & Fitch, 1989). The optimal tree is the one that has the minimum sum of the weights. The rationale for this is that the unequal weights reflect unequal probabilities of change in the characters (Felsenstein, 1981 b), the weighting usually being applied so that evolutionarily conservative characters or states can contribute more to the reconstruction of the phylogeny (Olsen, 1987). The magnitude of the weights to be applied can be decided either a priori (i.e. decided before the analysis) or a posteriori (i.e. calculated after an initial cladogram has been constructed) (Neff, 1986). Several general a priori weighting schemes have been proposed, such as Dollo-parsimony (Le Quesne, 1974), where no parallelisms or convergences are permitted, and Camin-Sokal-parsimony (Camin & Sokal, 1965), where no reversals are allowed, as well as compatibility weighting (Penny & Hendy, 1986; 603 Sharkey, 1989). However, many a priori weighting schemes have also been developed specifically for molecular sequences, where the genetic mechanisms responsible for nucleotide substitutions make it seem unreasonable to assume equal weights for all changes. Differential weights can be applied to (see Hillis et al., 1993): (1) different sequence positions (character weighting), thus emphasizing functional inequalities along the sequences (e.g., different codon positions in translated genes, stems vs loops in structural RNAs, hypervariable positions in mtDNA); or (2) the same sequence positions (character-state weighting), thus emphasizing mutational biases (e.g., transitions vs transversions, relative substitution frequencies, base composition, synonymous vs non-synonymous changes). A posteriori weighting schemes include successive approximations (Farris, 1969), quadratic weighting (Fitch & Yasunobu, 1975), transformation series (Mickevich, 1982), dynamic weighting (Williams & Fitch, 1989) and homoplasy weighting (Goloboff, 1993). Exact methods for optimizing each tree using weights have been described (Sankoff & Cedergren, 1983). The Compatibility method (Le Quesne, 1969, 1982) also analyses the original data matrix using an optimality criterion. The optimal tree is the one that has the largest set (clique) of mutually compatible characters (i.e. characters for which there is no homoplasy on the tree). The method proceeds by testing the pair-wise compatibility of all of the characters, and then examining all possible cliques of characters. The method thus relies on the theorem that a group of characters which are pair-wise compatible are jointly compatible. This is always true for both polarized and unpolarized 2-state characters (Estabrook & McMorris, 1980), but for unpolarized multi-state characters it is possible for a collection of pair-wise compatible characters not to have any phylogeny for which they are all compatible (Fitch, 1975). Branch-and-bound methods are available for finding the maximal cliques. The Maximum-likelihood procedure (Felsenstein, 1973a,b) is another method that analyses the original data matrix using an optimality criterion. So far, practical likelihood methods have only been developed for analysing nucleotide sequence data (Felsenstein, 198la), protein sequence data (Kishino, Miyata & Hasegawa, 1990), and restriction site data (Felsenstein, 1992), with restricted maximumlikelihood methods also available for continuous characters (Felsenstein, 198lc). The optimal tree is the one that maximizes the statistical likelihood that the specified evolutionary model produced the observed data. For example, for a given model of nucleotide evolution (the most commonly-used one is 604 D. A. Morrison the generalized Kimura 2-parameter model described by Kishino & Hasegawa, 1989), formulae are derived that describe the probability that an initial nucleotide will be transformed into a specified nucleotide during an evolutionary time-period. The likelihood for each nucleotide position is then equal to the prior probability of finding the initial nucleotide at that position multiplied by the probability of transformation. The likelihood of the divergence of 2 sequences during the time-period is then the product of the likelihoods at each position, and the overall likelihood for a tree is the product of the likelihoods along the branches. No exact methods for optimizing each tree have been described, and iterative (“hill-climbing”) methods are used (Fukami & Tateno, 1989). Exact methods for finding the optimal tree involve an exhaustive search (since suitable branch-and-bound methods are unknown), and so heuristic methods (Felsenstein, 1981a) are usually used for analyses with more than about 5 taxa. The Invariants (or Evolutionary parsimony) method (Lake, 1987; Cavender & Felsenstein, 1987) also analyses the original data matrix using an optimality criterion, although it was developed specifically for analysing nucleotide sequence data. It is based on evaluating the number of operator invariants (linear for the method of Lake; quadratic for the method of Cavender & Felsenstein) for 4 taxa, which specifies the conditions for clustering the 4 taxa into 2 pairs, based on the relative number of transversions on the internal branch of the unrooted tree to those on the terminal branches. The use of Invariants is thus a quartet method, evaluating the invariants for all possible subsets of 4 taxa (quartets) to decide how the 4 taxa should be clustered, and then building clusters of the taxa that are consistent with the largest fraction of the quartets (Lake, 1988). Exact methods are used for the evaluation of the invariants. Spectral analysis (Penny, Hendy & Henderson, 1987; Hendy & Penny, 1993) is the final procedure that analyses the original data matrix using an optimality criterion. It has been developed only for nucleotide data, originally with 2-state characters (Hendy, 1989) and more recently with 4 states (Hendy, Steel & Penny, 1994). This procedure first uses the hadamard transformation (Hendy & Charleston, 1993) together with a model of sequence evolution (such as the 2-character-state model of Cavender, 1978a,b) to adjust (or correct) the observed data for unobserved character-state changes (e.g., multiple substitutions at a sequence position), thus producing a matrix of expected data (i.e. expected number of character-state changes under the evolutionary model adopted). These corrected data are then used to estimate the degree of support for every possible subset of taxa, by summing the number of the expected character-state changes that are compatible with each subset. The Closest-tree method (Hendy, 1991; Steel et d, 1992), a compatibility-type method based on minimizing the sum-of-squares, is then used to select the optimal tree as the one with the largest set of mutually compatible subsets of taxa. Exact methods for finding the optimal tree include branch-and-bound methods (Hendy & Penny, 1989), with a limit of about 20 taxa. Characteristics. Clearly, it is impossible to compare the many tree-building methods based on their relative ability to detect the phylogenetic pattern underlying the apparent incongruences in real data sets, because the real phylogenetic pattern is almost never known. Instead, many other criteria have been proposed to assess the relative usefulness of the various tree-building methods, although many of these criteria have not yet been effectively evaluated for most of the methods. Those criteria about which we know the most are outlined here - no single method meets all of the criteria (Penny et al., 1990, 1992). The Computational Eficiency of a method refers to the relative speed with which a solution is produced (Penny et al., 1992). All of the algorithmic procedures are relatively efficient, as the computation time does not increase dramatically as the number of taxa is increased. On the other hand, none of the optimality methods is efficient, because the number of trees from which the optimal one needs to be chosen (i.e. at the second step in the double-optimization) increases exponentially as the number of taxa increases (Felsenstein, 1978a). Finding the optimal tree using exact methods is an example of the large class of NP-complete mathematical problems (Graham & Foulds, 1982), for which no efficient procedure is known (or is expected ever to be known). So, for large data sets (both in terms of number of taxa and number of characters) heuristic methods must be used for all of the optimality methods if a solution is to be produced within a reasonable time. This means that finding the optimal solution cannot be guaranteed for the analysis of large data sets - just how large “large” is depends on the computing power (both software and hardware) available for the analysis and the patience of the cladist. This situation is compounded for data sets with a relatively low phylogenetic signal (see below), which are likely to have multiple optimal trees that are structurally quite different (Maddison, 1991) -these “islands” of trees may not easily be found by heuristic techniques (Page, 1993). Invited Review 605 The Maximum-likelihood method is even more quartet methods ignore that part of the information inefficient than all of the other methods that use that comes from the incompatibilities among the optimality criteria. For Maximum-likelihood, the quartets, while Compatibility analysis assumes that first step in the double-optimization (i.e. optimizing incompatible characters have no phylogenetic each tree topology) has no simple analytical solution information at all (Hill, 1975). Furthermore, the (Fukami & Tateno, 1989), unlike all of the other Parsimony and Compatibility methods do not use methods, and so computationally-intensive iterative characters that have constant states across all of the procedures must be used. This problem is exacer- taxa, nor characters where only 1 of the states bated by the fact that there may be more than 1 way occurs in more than 1 taxon (autapomorphies); and of optimizing a particular tree topology (Steel, 1994). these may form a large proportion of sequence data These problems are then compounded by the lack of (for example, I looked at an arbitrary selection of a branch-and-bound algorithm for the second step in 20 molecular studies published in 1994, and found the double-optimization; and so there is a severelimit that on average 84% of the nucleotide positions to the sizeof the data set that it is practical to analyse were uninformative for Parsimony/Compatibility by the Maximum-likelihood method. analysis using these 2 criteria). The MaximumThe Power of a method refers to the response of likelihood, Invariants and Spectral-analysis methods the method to increasing amounts of data (Olsen, ignore characters with missing character-state data 1988) - with an increasing number of characters in for any of the taxa, including indels for sequence the original data matrix we expect a method to data, which may also constitute a sizable proconverge to a single tree (i.e. a limit is reached portion of the original data matrix. The Invariants beyond which further increases in the data do not method is also based entirely on transversion substichange the tree produced by the method) (Penny & tutions, and there may not be enough in a data set Hendy, 1986). A powerful method is one in which the to effectively infer the branching pattern, or the convergence occurs with a relatively small number of frequency of transitions may confound the pattern characters (Hillis, Huelsenbeck & Swofford, 1994b). (Jin & Nei, 1990; West & Faith, 1990); it thus appears to have very low power (Hillis et al., Power is also related to the amount of information in the original data matrix that is actually used to 1994a,b). construct the tree (Penny et al., 1992, 1993) - all of The Consistencyof a tree-building method refers to the tree-building methods omit some of the character its ability to converge to the correct tree (Felsenstein, information, and so the power of these methods may 1988). That is, with an increasing number of characbe reduced. However, it is important to note that not ters in the data matrix a consistent method converges all of the character information is necessary for the to the single correct tree, and a method is inconsistent reconstruction of an evolutionary tree, and that if there are any possible situations where it converges therefore it is not loss per se but loss of relevant to the wrong tree (i.e. there are systematic or noninformation that is important (Shoemaker & Fitch, random errors). Consistency is related to the assump1989; Penny et al., 1993). tions (e.g., the evolutionary model) made by the Compared with the character-based methods, all tree-building method - if the assumptions are not of the distance methods involve a very large loss of met for a particular data set then consistency cannot information in converting the original data into be expected. distances. This is because there will be multiple There are fundamental assumptions underlying all character data sets that produce exactly the same of the tree-building methods, such as that evolution distance matrix (Felsenstein, 1982; Penny, 1982), and proceeds as a continuous-time markov process this problem becomes worse for increasing numbers (Beanland & Howe, 1992), whether they are explicit of taxa and characters (Steel, Hendy & Penny, 1988). or not for a particular type of data (e.g Rodriguez Moreover, for sequence data it is not clear how to et aZ., 1990). The most notable assumption is that calculate distances when the aligned sequences con- character-state changes are independent and identitain terminal length variations, indels or ambiguous cally distributed (Cavender, 1978a; Olsen, 1987; nucleotides (since the models of nucleotide evolution Shoemaker & Fitch, 1989; Sanderson, 1995); which are based on estimating rates of transitions and means that no character-state change affects any transversions only) - including or excluding these other character-state change, and that all characterfeatures in the calculations can produce differ- state changes are equally likely. If evolution has ent distance estimates (Swofford & Olsen, 1990; occurred then this assumption is generally not true, Beanland & Howe, 1992). because of the common history of the taxa during The character-based methods also do not use all at least some evolutionary period. The distance of the data (Penny et aZ., 1990). For example, the methods will be particularly sensitive to this problem 606 D. A. Morrison (Felsenstein, 1988), although all of the methods suffer from it (Felsenstein, 1982). The distance methods will also be inconsistent if the data are not additive (Felsenstein, 1988), as there is no exact solution for estimating the tree under these circumstances (Swofford & Olsen, 1990). Additivity of taxonomic data is unlikely because the distances are only estimates of evolutionary divergence, and are therefore expected to vary randomly around the true divergence value (i.e. the data will be additive only if there is no homoplasy). One of the consequences of this is that trees with negative branch lengths are theoretically possible when using the Neighbor-joining, Minimumevolution and Least-squares methods (Swofford & Olsen, 1990). UPGMA adds the further constraint that the data are ultrametric; that is, that evolutionary divergence among the taxa occurs at a uniform rate through time (such as occurs with a molecular clock). This is also unrealistic. All of the methods will also be inconsistent if the amount of homoplasy is large, because homoplasy makes unrelated taxa appear to be more similar (Olsen, 1987; Swofford & Olsen, 1990). This will be a particular problem if the frequency of characterstates varies between taxa (Penny et al., 1990); for example, for sequence data similar GC-content often unites unwanted branches on a tree (Sidow & Wilson, 1990, 1991; Lockhart et al., 1992; Marshall, 1992; Hasegawa & Hashimoto, 1993) and all tree-building methods are sensitive to this (Lockhart e? al., 1992, 1994). Methods may also be inconsistent when there are juxtaposed long and short branches on the tree (e.g., unequal rates of phylesis or speciation), as these branches will tend to join together (often referred to as “long branches attracting”). This has long been recognized as a particular problem for the Parsimony and Compatibility methods (Cavender, 1978a; Felsenstein, 1978b; Hendy & Penny, 1989; Zharkikh & Li, 1993; Takezaki & Nei, 1994), but we now recognize that it also applies to all of the distance methods as well as to the Invariants and Closest-tree methods (Jin & Nei, 1990; Penny et al., 1990; de Bry, 1992; Huelsenbeck & Hillis, 1993; Steel, Hendy & Penny, 1993; Zharkikh & Li, 1993; Hillis et al., 1994b). The Maximum-likelihood method appears to suffer least from this problem (Hillis et al., 1994b). The Robustness of a tree-building method refers to how much the assumptions of the method can be violated before the method becomes inconsistent (Penny et al., 1990). This may be the most important criterion, because the idealized assumptions of the methods are likely to be violated by real data sets (Huelsenbeck, 1995a,b). Many of the consistency problems listed above may thus not necessarily preclude use of the various methods if they are robust. The distance methods, for example, may be quite robust to the problem of non-additivity if the distance estimates are corrected for multiple character-state changes and the problems of negative branch-lengths are addressed (Felsenstein, 1988); and even UPGMA does not require perfectly ultrametric data (Colless, 1970). The consistency problems associated with homoplasy can also potentially be solved (i.e. the methods can be made robust) by any method that provides a framework for making corrections for multiple character-state changes (Penny er al., 1993), such as the use of transformations and corrected distances (e.g., for unequal nucleotide composition, use of the LogDet transformation, Lockhart et al., 1994; compositional statistics, Sidow & Wilson, 1990, 1991; or character weighting, Marshall, 1992). Consistency can often be restored (i.e. the methods made robust) when there are juxtaposed long and short branches by adding taxa to the analysis that join the tree on the long branches (Hendy & Penny, 1989; Penny et al., 1990; Lento et al., 1995), or by using character weighting schemes that shorten the branches (Swofford & Olsen, 1990; Hillis et al., 1993). However, the preferred method is to make corrections for multiple character-state changes using transformations (Jin & Nei, 1990; de Bry, 1992; Steel et al., 1993; Huelsenbeck, 1995a). Note that the attempts to improve the consistency of the tree-building methods by using corrections and transformations apply only to molecular data, because it is possible to use a consistent model of character-state changes that will apply to all of the characters in any 1 data set. It is not obvious how to appropriately correct or transform other types of data, since each character will have its own model of character-state change. A popular approach to studying the robustness of the various methods is by using simulated data, where the original tree and the characteristics of the data are known, and the behavior of the tree-building methods under these conditions is evaluated (e.g., Tateno et al., 1982; Li et a!., 1987; Sourdis & Krimbas, 1987; Kim & Burgman, 1988; Saitou & Imanishi, 1989; Jin & Nei, 1990; Rohlf et al., 1990; Nei, 1991; Sidow & Wilson, 1991; Hasegawa & Fujiwara, 1993; Huelsenbeck & Hillis, 1993; Charleston et al., 1994; Hillis et al., 1994a,b; Kuhner & Felsenstein, 1994; Tateno, Takezaki & Nei, 1994; Gaut & Lewis, 1995; Huelsenbeck, 1995a,b; Tillier & Collins, 1995; Yang, 1995). There is clearly a large conceptual gap between simulated data and real data (Miyamoto & Cracraft, 1991; Hillis et al., 1994a), and most of the simulations apply only to the Invited Review branching order of a limited number of taxa under specific evolutionary models. The jury is still out on the results (cf. Felsenstein, 1988), but these studies show that many of the tree-building methods are actually quite effective (i.e. robust) over a wide range of evolutionary conditions. The most robust of the methods appears to be the Maximum-likelihood procedure, with UPGMA and the Invariants method being the least robust. The robustness of the other methods seems to depend very much on the ability to correctly utilize transformations for multiple character-state changes or to appropriately apply character or character-state weighting; clearly this requires a priori knowledge about the evolutionary process in the taxa being studied that is unlikely to be available. It is unfortunate that Spectral analysis has not been incorporated into most of the simulation studies to date, in order to test whether its theoretical advantages are met in practice. Finally, there are methodological and philosophical considerations that are often used to compare the techniques (Miyamoto & Cracraft, 1991). In particular, the algorithmic procedures are often criticized because they provide no method for ranking alternative tree topologies (i.e. there is no criterion to compare trees and decide how “good” any of them are as estimates of phylogeny) (Swofford & Olsen, 1990). This is a serious limitation if we treat a cladogram as being an attempt to estimate the true phylogeny, because we are using incomplete information and there is thus a degree of uncertainty about the correctness of our estimate (Felsenstein, 1988). It is thus always a good idea to assess how much better the optimal tree is than the next-most-optimal tree (or set of trees) (Swofford, 1991), especially as these trees may not be independent of each other (Penny et al., 1995). After tree construction. We can never know whether our cladogram actually represents the true phylogenetic history of the taxa concerned. Therefore, there are 3 further issues that need to be dealt with once a phylogenetic tree has been constructed using 1 or more of the methods outlined above: assessing the magnitude of the phylogenetic signal in the cladogram; assessing the robustness of the monophyletic groups represented on the cladogram; and comparing the structure of multiple cladograms from the same set of taxa. Although these issues are extremely important for cladistic analysis, they are only briefly introduced here because a full coverage would require a separate (although shorter) review. Even randomly-generated data will lead to the construction of a cladogram if any of the treebuilding techniques is applied to them. It is therefore 607 important to quantitatively assess the magnitude of the phylogenetic signal (i.e. that part of the character variation that is potentially informative about the evolutionary history) in a set of data - is the support for a particular tree any better than would be expected from a random data set? Quantifying this information also allows cladograms to be compared effectively with each other - a cladogram with a stronger phylogenetic signal is to be preferred. Four classes of techniques have been developed: optimality measurements (for those methods that use an optimality criterion), such as consistency and homoplasy indices (Archie, 1989; Farris, 1989, 1991) for the Parsimony methods, likelihood values for the Maximum-likelihood methods, and sums-of-squares for the Least-squares methods; skewness indices (Huelsenbeck, 1991; Hillis & Huelsenbeck, 1992); analytical statistical tests (Felsenstein, 1988; Li & Gouy, 1991; Li & Zharkikh, 1995); and randomization (or permutation) statistical tests (Goloboff, 1991; KBllersjij et aZ., 1992; Ahoy, 1994; Faith & Ballard, 1994). A cladogram is interpreted in terms of the monophyletic groups of taxa that it hypothesizes. It is therefore important to quantitatively assess the robustness of all of the groups indicated by the branching sequence of a cladogram (i.e. the degree of support for each branch in the tree) - is the support for a particular group any better than would be expected from a random data set? Many classes of techniques have been developed, including: analytical procedures, such as confidence limits, branch-length variances and likelihood-ratio tests (Li & Gouy, 1991; Li & Zharkikh, 1995); resampling procedures, such as the bootstrap (Li & Zharkikh, 1994, 1995; Sanderson, 1995), the jacknife (Felsenstein, 1988) and topology-dependent permutation (Faith, 1991); and non-statistical procedures, such as the decay index (Bremer, 1994), clade stability (Davis, 1993), and spectral signals (Hendy & Penny, 1993). It is unlikely that different tree-building methods will always produce identical trees for the same data set, given their different underlying assumptions, and it is also possible for multiple optimal trees to be produced by a single tree-building method. Furthermore, we may wish to compare cladograms produced using different sets of characters (since congruence among data sets may constitute the strongest achievable evidence for the true phylogeny; Penny & Hendy, 1986), and these may differ as a result of sampling error, different stochastic processes, or different branching histories (de Queiroz et al., 1995). It is therefore usually necessary to quantitatively compare the structure of multiple trees from the same set of taxa. For example, which monophyletic groups 608 D. A. Morrison are supported by (congruent among) which set of trees? - monophyletic groups that are supported by several trees are to be preferred. Several classesof techniques have been developed (most of them reviewed by Swofford, 1991): consensus trees; largest common pruned trees; consensus indices; tree comparison metrics; partition tests (Farris et al., 1995); and randomization tests (Rodrigo et aZ., 1993). A parasitological example. To explore the practical consequences for phylogeny reconstruction of the wide range of available tree-building methods, it is worthwhile exploring a concrete example. This parasitological example illustrates and emphasizes many of the points made in earlier sections, as well as providing a brief step-by-step guide to a cladistic analysis. Methods. First, we need to consider the background to the problem that we intend to address with the cladistic analysis. The members of the genus Sarcocystis are parasitic protozoans with heteroxenous life cycles, in which both hosts are mammals. The sexual stage is in the definitive host (often a predator species) and the asexual stage is in the intermediate host (often a prey species), with the sporocysts being passed in the faecesof the definitive host and ingested by the intermediate host. Studies of this genus may be of economic importance, because some of the species are important pathogens of livestock and humans. The current classification of the phylum Apicomplexa (Sporozoa) is based largely on ultrastructural analysis, on details of the life-cycles, and on the location of developmental stages. The internal arrangement of the phylum is currently rather unstable, at least partly because of a lack of knowledge of the life-cycles for many taxa. Cox (1994) treats the coccidial parasites as a class (the Coccidea), with Sarcocystis in the order Eimeriida. The family Sarcocystidae is often subdivided into 2 subfamilies, the Sarcocystinae (3 genera, 100 species) and the Toxoplasmatinae (2-3 genera, 15 species) (Levine, 1985). However, the phylogenetic relationships within the Sarcocystidae remain ambiguous, at least partly because of difficulties in assessing the homology of phenotypic characters. It is possible that molecular data might alleviate these problems, although there has been little consistency among molecular studies to date. While the family as a whole is generally considered to be monophyletic, several analyses of small-subunit ribosomal RNA (ssrRNA) sequences have questioned the monophyly of Sarcocystis itself (Barta, Jenkins & Danforth, 1991; Tenter, Baverstock & Johnson, 1992; Ellis et al., 1994). In particular, these studies suggest a strong correlation between the parasite phylogeny and that of their definitive hosts, with species of the Sarcocystidae forming 2 clades based on the use of either canids or felids as their definitive hosts. Thus Surcocystis(felid or canid hosts) is only monophyletic if both Toxoplasma (felid host) and Neospora (unknown host) are included within it. Second, we need to consider the data that are available to tackle this problem (i.e. the ingroup). There are currently complete ssrRNA sequences available for 8 species of the Sarcocystidae (Sarcocystis arieticanis, S. fusiformis, S. gigantea, S. muris, S. neurona, S. tenella, Neospora caninum, Toxoplasma gondii), which can be used to test the monophyly of Surcocystis. Several published sequences are available for some of the species (9 for T. gondii, 2 for N. caninum), and for the purposes of this example consensus sequences were derived for these species (after alignment; see below) using the MacClade 3.04 computer program (Maddison & Maddison, 1992). The standard IUPAC ambiguity codes were used for those few nucleotide positions with more than 1 possible character-state in the sequence (1 for S. muris, 1 for N. caninum, 26 for T. gondii). Thus, we have a representative sample of the family (e.g., species with both felid and canid hosts), and we have taken inter-individual variation into account as far as possible. Third, we need to consider how the cladogram will be rooted (i.e. the outgroup), sincemonophyly can be assessed only on a rooted tree. The members of the Eimeriida that are most closely-related to the Sarcocystidae (see Levine, 1985) are the members of Eimeriu (family Eimeriidae), for which 7 species have published ssrRNA sequences (E. acervulina, E. brunetti, E. maxima, E. mitis, E. necatrix, E. praecox, E. tenella). For the outgroup, a single consensus sequence was derived from the sequencesfor these 7 Eimeria species(after alignment) using the MacClade program, with the standard IUPAC ambiguity codes being used for those 68 nucleotide positions with more than 1 character-state in the consensus sequence. These ambiguous character-states can be dealt with effectively by some analysis techniques, while others treat them as missing data. For this data set this different treatment has very little effect, because in almost all casesthe Sarcocystis sequences are invariant for these characters (i.e. the variability is only within the Eimeria phylogeny itself and it thus does not influence the reconstruction of the Sarcocystis phylogeny). Fourth, we need to consider character homologies (i.e. sequence alignment). There are 9 sequences in Invited Review the analysis, 8 for the ingroup and 1 for the outgroup. The ssrRNA sequences for all of these species were aligned using the secondary structure model of Van de Peer et al. (1994) - all of the sequences were obtained from this computer database. For the cladistic analysis, the single-stranded regions were omitted, as Ellis & Morrison (1995) have indicated that it is the double-stranded region that contains most of the phylogenetic information for these taxa. There are thus 694 nucleotide positions in the analysis. This procedure effectively deals with both the problems associated with positions of equivocal alignment (since there are very few indels in the double-stranded region), as well as those associated with relative character weighting based on structural/ functional inequalities along the sequences. Fifth, we need to consider the tree-building methods. Four tree-building methods were used (covering the full range of possibilities), in order to test the sensitivity of the analysis to this source of variability. Two of these were distance-based methods, 1 of them algorithmic (Neighbor-joining) and the other with an optimality criterion (Unweighted leastsquares). The other 2 methods were character-based with optimality criteria, one developed specifically for nucleotide sequences (Maximum-likelihood) and one not (Weighted-parsimony). The Phylip 3.57 computer program (Felsenstein, 1995) was used for the first 3 methods, employing the heuristic search with global branch-swapping for those methods with an optimality criterion. The PAUP 3.1.1 computer program (Swofford, 1993) was used for the final method, employing the branch-and-bound search. Note that in order to test the monophyly of Surcocystis, the only part of the cladogram that needs to be interpreted is the branching order (not the branch lengths), since we will be independently assessing the degree of support for monophyly (see below). Sixth, we need to consider the evolutionary model or models that we wish to employ. Two models of character-state change were used, in order to test the sensitivity of the analysis to this source of variability. The first model assumed that all nucleotide changes were equally probable at each sequence position, while the second model assumed that transitions were twice as likely as transversions (for the trees shown in Fig. 5 the transition : transversion ratio is about 1 S, as determined by the MacClade program). For the maximum-likelihood method, the transition : transversion ratio was set as part of the specified model. For the distance-based methods, the Kimura distance measure was used to estimate sequence divergence, and the transition : transversion ratio was set to either 1 or 2. For the Parsimony method, a step- 609 matrix was used, with transitions weighted as 1 and transversions weighted as either 1 or 2. The nucleotide frequencies are approximately equal for all of the sequences, and so there is probably no need to correct for GC-bias (the GC content varies from 50 to 60% across the sequences). Next, we need to consider the magnitude of the phylogenetic signal in the data (i.e. the robustness of the cladograms). For those methods with an optimality criterion, permutation tests were used. Twohundred permutations of the data were produced for each model and each tree-building method, and the percentage of these replicates that produced trees that were at least as optimal as the observed tree was calculated. The Phylip program implements this option for its tree-building methods. For the Parsimony analyses, some of the permutations were performed using the RandomCladistics 3.0 computer program (Siddall, 1994); for the remainder, the Phylip program was used to create the permuted replicates and these were then individually fed into the PAUP program. Finally, we need to decide how we are going to assess the magnitude of the support that the data have for the monophyly of Sarcocystis. Bootstrap resampling was used, which is implemented in both the Phylip and PAUP programs. Two-hundred bootstrap replicates were produced for each model and each tree-building method, and the percentage of the trees derived from these replicates that indicated monophyly of Surcocystis was calculated. Results. Three cladograms resulted from the 8 analyses (Fig. 5), with each of the optimality methods finding a single most-optimal tree for each analysis. The Neighbor-joining and Maximum-likelihood analyses produced the same tree irrespective of the model used (Table 3), this tree supporting monophyly of Sarcocystis. However, the Weighted-parsimony method produced different trees for the 2 models (Table 3), both supporting monophyly of Surcocystis, but differing in the relative placement of S. muris and S. neurona. However, for the model in which transitions and transversions were equally probable, the tree in Fig. 5A is only one step longer (i.e. one more characterstate change) than the tree in Fig. 5B, and so there is actually very little conflict between the 2 methods. This result emphasizes the importance of considering sub-optimal trees in a cladistic analysis. The Unweighted least-squares method produced the same tree irrespective of the model used {Table 3), but this tree does not support monophyly of Surcocystis, as S. muris and S. neurona are removed from the clade. The contradiction of monophyly in 610 D. A. Morrison , Eimeria T. gondii gondii N. caninum caninum S. murk neurona S. neurona muris S. gigantea gigantea S. fusiformis fusiformis S. tenella S. tenella \/ S. arieticanis C / \S. arieticanis Eimeria S. muris S. neurona T. gondii N. caninum S. gigantea S. fusiformis S. tenella S. arieticanis Fig. 5. The 3 phylogenetic trees produced from the cladistic analysis of Sarcocysfis Sarcocystis ssrRNA. Table 3 shows which analysis produced which of these trees. Trees (A) and (B) support monophyly of Sarcocystis, whereas tree (C) does not, this cladogram is supported only by a branch with negative length (i.e. the branch from the ancestor of the ingroup to the ancestor of T. gondii+ N. caninum+S gigantea+S. fusiformis+S. tenella+S. arieticanis). This situation is not realistic, and so the tree cannot be considered seriously as a possible representation of the real evolutionary history of these taxa. If this tree-building method is constrained to use only positive branch lengths (which is an option in the Phylip program), then the cladogram produced does indeed support monophyly of Sarcocystis using both models (they produce the cladogram shown in Fig. 5A). The permutation tests indicate that there is very strong phylogenetic structure in these ssrRNA data, as no permutations were found that produced trees that were anywhere near as optimal as those produced by the original data set (Table 3). The phylogeny of Sarcocystis can thus be usefully evaluated using this data set. The bootstrap resampling indicates relatively strong support for the Sarcocystis clade (i.e. Sarcocystis is monophyletic in more than 85% of the replicates) for all of the methods in which it occurs (Table 3). This support is apparently stronger (i.e. the bootstrap percentage is higher) when transitions are considered to be more likely than transversions. Even the Unweighted least-squares method provides some support for monophyly, because the negative branchlength that contradicts the monophyly is actually quite short. It is thus reasonable to conclude from this analysis that the ssrRNA sequence data do support monophyly of Sarcocystis, and thus also support the way in Invited Table 3-Results Tree-building method” Neighbor-joining ti=tv ti=2tv Unweighted least-squares ti=tv ti=2tv Weighted-parsimony ti=tv ti=hv Maximum-likelihood ti=tv ti=2tv of the cladistic Review analysis Optimality measure Treeb -e -e 611 of Surcocystis ssrRNA Permutation percentagec Bootstrap percentaged A A -e 85 91.5 0.00241’ 0.00335r C c 0 0 25 42.5 3298 473h B A 0 0 91 98 - 2499.7’ -2502.1’ A A 0 0 99 99.5 e “ti=tv, transitions and transversions equal; ti=2tv. transitions twice as likely as transversions. ‘As shown in Fig. 5. ‘Percentage of the permutation replicates that produced trees at least as optimal as the observed tree. dPercentage of the bootstrap replicates that support monophyly of Sarcocystis. ‘No optimality criterion, and so no calculations possible. ‘Sum of squares. aNumber of character-state changes. ‘Number of weighted character-state changes. ‘Log-likelihood. which the Sarcocystidae is currently divided into 2 subfamilies (if the recently-described Neospora is included in the Toxoplasmatinae) (note that in a Popperian sense the data actually “fail to disprove” the hypothesis rather than “support” it). This conclusion is robust both to variations in the evolutionary model and to the tree-building method used. It is not, however, necessarily robust to the sequence alignment used (see Ellis & Morrison, 1995). There does, however, seem to be some uncertainty with respect to the placement of Smuris and S. neurona (which is the source of non-monophyly detected by the previous studies of ssrRNA referred to above). This uncertainty is only likely to be resolved by including in the analysis other species that are closely-related to these 2 species (i.e. that will join the tree on the terminal branches leading to these 2 species). This is the obvious direction to take for future research into the question posed for the analysis presented here. Some advice for phylogeny reconstruction. The following is a list of topics that need to be considered when carrying out a cladistic analysis. The topics are more-or-less in the order in which they will need to be considered in practice. Include in the analysis all of the taxa for which there are data. Adding or deleting any taxon may affect the outcome of the analysis. This is one of the biggest sources of problems when analysing molecular data. Include samples from more than 1 individual or population if possible. This is the norm when assessing morphological characters, because if intra-taxon variability exists then it will be confounded with intertaxon variability in the analysis. For molecular data, a consensus sequence can be used for each taxon. The ingroup must be monophyletic for the cladogram to accurately reflect phylogenetic history. If the analysis shows that the ingroup and outgroup taxa are inter-mixed in the cladogram, then it is not possible to root the cladogram in such a way that the ingroup is monophyletic. You should thus seriously consider your original rationale for assuming monophyly. Long branches on a cladogram (indicating apparently extensive phylesis) should be avoided if this is possible. They can often be shortened by including taxa that join the tree on these branches (i.e. the branches actually show speciation rather than phylesis, but they are not doing so because the relevant taxa are not in the data set). The outgroup should consist of more than 1 taxon closely-related to the ingroup. The more distantlyrelated is the outgroup then the more arbitrary is the apparent position of the root on the cladogram. 612 D. A. Morrison For molecular data, use entire sequencesif they are available. There is no reason to expect part of a gene sequence (effectively a sample) to reflect accurately the total gene sequence, unless all of the nucleotide positions agree on the phylogenetic branching order. Homology assessments should be taken very seriously. For molecular data, this means that sequence alignment should be given careful consideration, using models of secondary structure if they are available. If computer programs are used, then vary the weighting parameters available in order to detect those parts of the sequence alignment that are not robust, and delete these regions from the analysis. Assess the characteristics of the data before the cladogram is constructed (e.g., GC-content for nucleotide data), in order to choose rationally among the available tree-construction algorithms. Use several cladogram-construction algorithms if this seemsto be appropriate for the data. If heuristic tree-search methods are used then describe them precisely, so that the likely successof having found the optimal tree can be assessed. For nucleotide data, correct for multiple character-state changes if possible. Assessthe magnitude of the phylogenetic signal in the cladogram. Assessthe robustness of the monophyletic groups represented on the cladogram. Quantitatively compare the cladograms if there is more than one (e.g., multiple optimal trees, trees from different cladogram-construction algorithms, or trees from different data sets). Remember that the cladogram represents a series of hypotheses that are amenable to further test. It is not a representation of reality, nor is it the final word on the phylogeny of the taxa analysed. Conclusion. Many biologists treat data analysis in a manner that is completely antithetical to their apparent attitude to the other, more biological aspects of their scientific pursuits (cf. Felsenstein, 1988). In particular, there is a widespread view among non-experts that there should be only 1 correct method for phylogenetic data analysis, and that it is about time that the experts told us what it is. This view is clearly rather naive, because it assumes that all phylogenetic data sets have identical characteristics. This is patently untrue, as it will be for UN data sets(whether biological or not) that incorporate an historical component. Furthermore, currently no methods are simultaneously efficient, powerful, consistent and robust (Penny et al., 1990, 1992), and so there is no single ideal method. The future development of cladistic techniques will presumably follow the well-documented path taken by statistics, in which a series of mathematical tests have been developed (and are still being actively developed) that make different assumptions about the nature of the data to be analysed, so that after the data have been evaluated an informed decision can be made about which test is the most appropriate for the particular task at hand. Thus, out of the current plethora of phylogenetic techniques should emerge a subset of mathematical methods that are known to be robust and consistent under certain circumstances, and it will be from amongst these methods that an objective choice (based on efficiency and power) will be made by the cladist. The current vogue for simply presenting the results of analyses using several tree-building methods is thus both creditable and naive. It is creditable because it emphasizes our current uncertainty about cladistic analysis; and it is naive because there appears to be no rationale being used for the choice from among the available methods. Clearly, what must be done for each phylogenetic data set is for the nature of the data to be evaluated, and then for a reasoned choice to be made about which methods might be appropriate for subsequent analysis, along with an assessment of the support for the phylogenetic hypotheses generated. Two recent studies that incorporate a carefully-reasoned evaluation of the data, followed by a series of phylogenetic analyses that are themselves then subjected to reevaluation, are those of Lento et al. (1995) and Penny et al. (1995), and these could serve as useful models. Acknowledgements-Thanks to Steve Barker, Peter Baverstock, Dave Booth, Mike Crisp, John Ellis, Dan Faith, Alan Johnson and John Trueman for commenting on an earlier version of the manuscript; and to John Ellis for providing the ssrRNA data file. REFERENCES Adachi J. & Hasegawa M. 1995. Phylogeny of whales: dependence of the inference on species sampling. Molecular Biology and Evolution 12: 177-I 79. Alroy J. 1994. Four permutation tests for the presence of phylogenetic structure. Systematic Biology 43: 430-437. Archie J. W. 1989. Homoplasy excess ratios: new indices for measuring levels of homoplasy in phylogenetic systematics and a critique of the consistency index. Systematic Zoology 38: 2533269. Barta J. R., Jenkins M. C. & Danforth H. D. 1991. Evolutionary relationships of avian Eimeria species among other apicomplexan protozoa: monophyly of the Apicomplexa is supported. Molecular Biology and Evolution 8: 345-355. Beanland T., J. & Howe C. J. 1992. The inference of evolutionary trees from molecular data. Comparative Biochemistry and Physiology 102B: 643-659. Invited Beyer W. A., Stein M. L., Smith T. F. & Ulam S. M. 1974. A molecular-sequence metric and evolutionary trees. Mathematical Biosciences 19: 9-25. Bremer K. 1994. Branch support and tree stability. Cladistics 10: 295-304. Brooks D. R. & McLennan D. A. 1992. The evolutionary origin of Plasmodium falciparum. Journal of Parasitology 78: 564566. Bryant H. N. 1992. The polarization of character transformations in phylogenetic systematics: role of axiomatic and auxiliary assumptions. Systematic Zoology 40: 433-5. Camin J. H. & Sokal R. R. 1965. A method for deducing branching sequences in phylogeny. Evolution 19: 31 l-326. Cao Y., Adachi J., Janke A., Plabo S. & Hasegawa M. 1994. Phylogenetic relationships among eutherian orders estimated from inferred sequences of mitochondrial proteins: instability of a tree based on a single gene. Journal of Molecular Evolution 39: 519-527. Cavalli-Sforza L. L. & Edwards A. W. F. 1967. Phylogenetic analysis: models and estimation procedures. American Journal of Human Genetics 19: 233-257 (also published in Evolution 32: 550-570). Cavender J. A. 1978. Taxonomy with confidence. Mathematical Biosciences 40: 271-280. Cavender J. A. 1978. Erratum. Mathematical Biosciences 44: 308. Cavender J. A. & Felsenstein J. 1987. Invariants of phylogenies in a simple case with discrete states. Journal of Classification 4: 57-71. Chakraborty R. 1977. Estimation of time of divergence from phylogenetic studies. Canadian Journal of Genetics and Cytology 19: 217-223. Chan S. C., Wong A. K. C. & Chiu D. K. Y. 1992. A survey of multiple sequence comparison methods. Bulletin of Mathematical Biology 54: 563-598. Charleston M. A., Hendy M. D. & Penny D. 1994. The effects of sequence length, tree topology, and number of taxa on the performance of phylogenetic methods. Journal of Computational Biology 1: 133-I 5 1. Chase M. W., Soltis D. E., Olmstead R. G., Morgan D., Les D. H., Mishler B. D., Duvall M. R., Price R. A., Hills H. G., Qiu Y.-L., Kron K. A., Rettig J. H.. Conti E., Palmer J. D., Manhart J. R., Sytsma K. J., Michaels H. J., Kress W. J., Karol K. G., Clark W. D., Hedrin M., Gaut B. S., Jansen R. K., Kim K.-J., Wimpee C. F., Smith J. F., Furnier G. R., Strauss S. H., Xiang Q.. Plunkett, G. M.. Soltis P. S., Swensen S. M., Williams S. E., Gadek P. A., Quinn C. J., Eguiarte L. E., Golenberg E.. Learn G. H., Graham S. W., Barrett S. C. H.,bayanandan S. & Albert V. A. 1993. Phylogenetics of seed plants: an analysis of nucleotide sequences from the plastid gene rbcL. Annals of the Missouri Botanical Garden 80: 5288580. Clifford H. T. & Stephenson W. 1975. An Introduction to Numerical Classification. Academic Press, New York. Colless D. H. 1970. The phenogram as an estimate of ohvlogenv. Svstematic Zoologv 19: 352-362. Co; F. E. G. -1994. The evolutionary expansion of the Sporozoa. International Journal for Parasitology 24: 1301-1316. Crisci J. V. & Stuessy T. F. 1980. Determining primitive character states for phylogenetic reconstruction. Systematic Botany 5: 112-135. Cummings M. P., Otto S. P. & Wakeley J. 1995. Sampling properties of DNA sequence data in phylogenetic analysis. Molecular Biology and Evolution 12: 814-822. Review 613 Darwin C. 1859. On the Origin of Species by Means of Natural Selection. John Murray, London. Davis J. I. 1993. Character removal as a means for assessing the stability of clades. Cladistics 9: 201-210. deBry R. W. 1992. The consistency of several phylogenyinference methods under varying evolutionary rates. Molecular Biology and Evolution 91 537-551. de Pinna M. C. C. 1991. Concepts and tests of homology -. in the cladistic paradigm. Clad&tics 7: 367-394. de Queiroz A., Donoghue M. J. & Kim J. 1995. Separate versus combined analysis of phylogenetic evidence. Annual Review of Ecology and Systematics 26: 657-68 1. Dobson A. J. 1974. Unrooted trees for numerical taxonomy. Journal of Applied Probability 11: 32-42. Doolittle R. F. (Editor) 1990. Molecular evolution: computer analysis of protein and nucleic acid sequences. Methods in Enzymology 183: 303-502. Doyle J. J. 1992. Gene trees and species trees: molecular systematics as one-character taxonomy. Systematic Botany 17: 144163. Edwards A. W. F. & Cavalli-Sforza L. L. 1964. Reconstruction of evolutionary trees. In: Phenetic and Phylogenetic Classification (Edited by Heywood V. H. & McNeil J.). pp. 67-76. Systematics Association, London. Ellis J. & Morrison D. 1995. Effects of sequence alignment on the phylogeny of Sarcocyscis deduced from 18s rDNA seauences. Parasitologv Research 81: 696699. Ellis .I., Lutoh K., Baverstock PTR., Brindley P. J., Nimmo K. A. & Johnson A. M. 1994. The phylogeny of Neospora car&urn. Molecular Biochemistry and Parasitology 64: 303-3 11. Estabrook G. F. & McMorris F. R. 1980. When is one estimate of evolutionary relationships a refinement of another? Journal of Mathematical Biology 10: 367-373. Faith D. P. 1985. Distance methods and the approximation of most-parsimonious trees. Syscernaric Zoology 34: 312-325. Faith D. P. 1991. Cladistic permutation tests for monophyly and nonmonophyly. Systematic Zoology 40: 366375. Faith D. P. & Ballard J. W. 0. 1994. Length differences and topology-dependent tests: a response-to KIllersjo et al. Cladistics 10: 57&t. Farris J. S. 1969. A successive approximations approach to character weighting. Systematic Zoology 18: 374385. Farris J. S. 1970. Methods for computing Wagner trees. Systematic Zoology 19: 83-92. Farris J. S. 1972. Estimating phylogenetic trees from distance matrices. American Naturalist 106: 645-668. Farris J. S. 1977. On the phenetic approach to vertebrate __ classitication. In: Major Patterns in Vertebrate Evolution (Edited bv Hecht M. D.. Goodv P. C. & Hecht B. M.). pp. 8233850. Plenum Press, New York. Farris J. S. 1989. The retention index and the resealed consistency index. Cladistics 5: 417419. Farris J. S. 1995. Excess homoplasy ratios. Ciadistics 7: 81-91. Farris J. S., KHllersjG M., Kluge A. G. & Bult C. 1995. Testing significance of incongruence. Cladistics 10: 315319. Felsenstein J. 1973. Maximum-likelihood estimation of evolutionary trees from continuous characters. American Journal of Human Genetics 25: 471492. Felsenstein J. 1973. Maximum-likelihood and minimumsteps methods for estimating evolutionary trees from data on discrete characters. Systematic Zoology 22: 24&249. 614 D. A. Morrison Felsenstein J. 1978. The number of evolutionary trees. Systematic Zoology 21: 27-33. Felsenstein J. 1978. Cases in which parsimony and compatibility methods will be positively misleading. Systematic Zoology 27: 401410. Felsenstein J. 1981. Evolutionary trees from DNA sequences: a maximum likelihood approach. Journal of Molecular Evolution 17: 368-316. Felsenstein J. 1981. A likelihood approach to character weighting and what it tells us about parsimony and compatibility. Biological Journal of the Linnean Society 16: 183-196. Felsenstein J. 1981. Evolutionary trees from gene frequencies and quantitative characters: finding maximum likelihood estimates. Evolution 35: 122991242. Felsenstein J. 1982. Numerical methods for inferring evolutionary trees. Quartet+ Review of Biology 51: 379404. Felsenstein J. 1988. Phylogenies from molecular sequences: inference and reliability. Annual Review of Genetics 22: 521-565. Felsenstein J. 1992. Phylogenies from restriction sites: a maximum-likelihood approach. Evolution 46: 159-173. Felsenstein J. 1995. PHYLIP (Phylogeny Inference Package). University of Washington, Seattle. Fitch W. M. 1970. Distinguishing homologous from analogous proteins. Systematic Zoology 19: 99-l 13. Fitch W. M. 1971. Toward defining the course of evolution: minimum change for a specific tree topology. Systematic Zoology 20: 406416. Fitch W. J. 1975. Toward finding the tree of maximum parsimony. In: Proceedings of the Eighth International Conference on Numerical Taxonomy (Edited by Estabrook G. F.), pp. 189-230. Freeman, San Francisco. Fitch W. M. 1981. A non-sequential method for constructing trees and hierarchical classifications. Journal of Molecular Evolution 18: 3&37. Fitch W. M. & Margoliash E. 1967. Construction of phylogenetic trees. Science 155: 279-284. Fitch W. M. & Yasunobu K. T. 1975. Phylogenies from amino acid sequences with gaps. Journal of Molecular Evolution 5: l-24. Forey P. L., Humphries C. J., Kitching I. J., Scotland R. W., Siebert D. J. & Williams D. M. 1992. Cladistics. A Practical Course in Systematics. Clarendon Press, Oxford. Fukami K. & Tateno Y. 1989. On the uniqueness of the maximum likelihood method for estimating molecular trees: uniqueness of the likelihood point.- Journal of Molecular Evolution 31: 5 1 I-523. Gatesy J., DeSalle R. & Wheeler W. 1993. Alignmentambiguous nucleotide sites and the exclusion of systematic data. Molecular Phvlogenetics and Evolution 2: 152-157. Gaut B. S. & Lewis P. 0. 1995. Success of maximum likelihood phylogeny inference in the four-taxon case. Molecular Biology and Evolution 12: 152162. Gojobori T., Moriyama E. N. & Kimura M. 1990. Statistical methods for estimating sequence divergence. Methods in Enzymology 183: 431-550. Goloboff P. A. 1991. Homoplasy and the choice among cladograms. Cladistics 7: 215-232. Golobolf P. A. 1993. Estimating character weights during tree search. Cladistics 9: 83-91. Graham R. L. & Foulds L. R. 1982. Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time. Mathematical Biosciences 60: 133-142. Gutell R. R., Weiser B., Woese C. R. & Noller H. F. 1985. Comparative anatomy of 16-S-like ribosomal RNA. Progress in Nucleic Acids Research and Molecular Biology 32: 155-216. Hall B. (Editor) 1994. Homology: The Hierarchical Basis of Comparative Biology. Academic Press, San Diego. Hasegawa M., Kishino H. & Yano T. 1985. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution 22: 160-174. Hasegawa M. & Fujiwara M. 1993. Relative efficiencies of the maximum likelihood, maximum parsimony, and neighbor-joining methods for estimating protein phylogeny. Molecular Phylogenetics and Evolution 2: l-5. Hasegawa M. & Hashimoto T. 1993. Ribosomal RNA trees misleading? Nature 361: 23. Hendy M. D. 1989. The relationship between simple evolutionary tree models and observable sequence data. Systematic Zoology 38: 310-321. Hendy M. D. 1991. A combinatorial description of the closest tree algorithm for finding evolutionary trees. Discrete Mathematics 96: 5 l-58. Hendy M. D. & Charleston M. A. 1993. Hadamard conjugation: a versatile tool for modelling nucleotide sequence evolution. New Zealand Journal of Botany 31: 23 l-237. Hendy M. D. & Penny D. 1982. Branch and bound algorithms to determine minimal evolutionary trees. Mathematical Biosciences 59: 277-290. Hendy M. D. & Penny D. 1989. A framework for the quantitative study of evolutionary trees. Systematic Zoology 38: 297-309. Hendy M. D. & Penny D. 1993. Spectral analysis of phylogenetic data. Journal of Classification 10: 5-24. Hendy M. D., Steel M. A. & Penny D. 1994. A discrete Fourier analysis for evolutionary ttees. Proceedings of the National Academy of Sciences of the U.S.A. 91: 3339-3343. Hennig W. 1966. Phylogenetic Systematics. University of Illinois Press, Urbana IL. (Translated by Davis D. D. & Zangerl R. from Hennig W. 1950. Grund:iige einer Theorie der Phylogenetischen Systematik. Deutscher Zentralverlag. Berlin.). Hill L. R. 1975. Problems arising from some tests of Le Quesne’s concept of uniquely derived characters. In: Proceedings af the Eighth International Conference on Numerical Taxonomy (Edited by Estabrook G. F.), pp. 375-398. Freeman, San Fmncisco. Hillis D. M. & Huelsenbeck J. P. 1992. Signal, noise. and reliability in molecular phylogenetic analyses. Journal of Heredity 83: 189-195. Hillis D. M., Allard M. W. & Miyamoto M. M. 1993. Analysis of DNA sequence data: phylogenetic inference. Methods in Enzymology 224: 456487. Hillis D. M.. Huelsenbeck J. P. & Cunningham C. W. 1994. Application and accuracy of molecular phylogenies. Science 264: 671-677. Hillis D. M., Huelsenbeck J. P. & SwotTord D. L. 1994. Hobgoblin of phylogenetics? Nature 369: 363-364. Hubalek Z. 1982. Coefficients of association and similarity based on binary (presence-absence) data: an evaluation. Biological Reviews 51: 669689. Invited Huelsenbeck J. P. 1991. Tree-length distribution skewness: an indicator of phylogenetic information. Systematic zoology 40: 257-270. Huelsenbeck J. P. 1995. Performance of phylogenetic methods in simulation. Systematic Biology 44: 1748. Huelsenbeck J. P. 1995. The robustness of two phylogenetic methods: four-taxon simulations reveal a slight superiority of maximum likelihood over neighbor joining. Molecular Biology and Evolution 12: 843-849. Huelsenbeck J. P. & Hillis D. M. 1993. Success of phylogenetic methods in the four-taxon case. Sysrematic Biology 4L: 247-264. Jin L. & Nei M. 1990. Limitations of the evolutionary parsimony method of phylogenetic analysis. Molecular Biology & Evolution 7: 82-102. Kallersjii M., Farris J. S., Kluge A. G. & Bult C. 1992. Skewness and vermutation. Cladistics 8: 275-287. Kidd K. K. & Sgaramella-Zonta L. A. 1971. Phylogenetic analysis: concepts and methods. American Journal of Human Genetics 23: 235-252. Kim J. & Burgman M. A. 1988. Accuracy of phylogeneticestimation methods under unequal evolutionary rates. Evolution 42: 596602. Kimura M. 1980. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. Journal of Molecular Evolution 16: 11 I-120. Kishino H. & Hasegawa M. 1989. Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in Hominoidea. Journal of Molecular Evolution 29: 170-179. Kishino H., Miyata T. & Hasegawa M. 1990. Maximum likelihood inference of protein phylogeny, and the origin of chlorovlasts. Journal of Molecular Evolution 30: 151-160. Klotz L., C. & Blanken R. L. 1981. A practical method for calculating evolutionary trees from sequence data. Journal of Theoretical Biology 91: 261-272. Kuhner M. K. & Felsenstein J. 1994. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Molecular Biology and Evolution 11: 459468. Lake J. A. 1987. A rate-independent technique for analysis of nucleic acid sequences: evolutionary parsimony. Molecular Biology and Evolution 4: 167-191. Lake J. A. 1988. Origin of the eukaryotic nucleus as determined by rate-invariant analysis of rRNA. Nature 331: 184188. Le Quesne W. J. 1969. A method of selection of characters in numerical taxonomy. Sysfematic Zoology 18: 201-205. Le Quesne W. J. 1974. The uniquely evolved character concept and its cladistic application. Sysfemaric Zoology 23: 513-517. Le Quesne W. J. 1982. Compatibility analysis and its applications. Zoological Journal of the Linnean Society 74: 267-275. Lento G. M., Hickson R. E., Chambers G. K. & Penny D. 1995. Use of spectral analysis to test hypotheses on the origin of pinnipeds. Molecular Biology and Evolution 12: 28852. Levine N. D. 1985. Phylum II. Apicomplexa Levine 1970. In: An Illustrated Guide to the Protozoa (Edited by Lee J. J., Hutner S. H. & Bovee E. C.), pp. 322-374. Society of Protozoologists, Lawrence. Review 615 Li W.-H. 198 1. Simple method for constructing phylogenetic trees from distance matrices. Proceedings of the National Academy of Sciences of the U.S.A. 78: 1085-1089. Li W.-H. & Gouy M. 1991. Statistical methods for testing molecular phylogenies. In: Phylogenetic Analysis of DNA Sequences (Edited by Miyamoto M. M. & Cracraft J.), pp. 2499277. Oxford University Press, New York. Li W.-H. & Graur D. 1991. Fundamentals of Molecular Evolution. Sinauer Associates, Sunderland. Li W.-H. & Zharkikh A. 1994. What is the bootstrap technique? Systematic Biology 43: 424430. Li W.-H. & Zharkikh A. 199.5. Statistical tests of DNA phylogenies. Systematic Biology 44: 49-63. Li W.-H., Wolfe K. H., Sourdis J. & Sharp P. M. 1987. Reconstruction of phylogenetic trees and estimation of divergence times under nonconstant rates of evolution. Cold Spring Harbor Symposium on Quantitative Biology 52: 847-856. Lockhart P. J., Penny D., Hendy M. D., Howe C. J., Beanland T. J. & Larkum A. W. D. 1992. Controversy on chloroplast origins. Federation of European Biochemical Societies Letters 301: 127-131. Lockhart P. J., Steel M. A., Hendy M. D. & Penny D. 1994. Recovering evolutionary trees under a more realistic model of sequence evolution. Molecular Bio1og.v and Evolution 11: 605-612. Maddison D. R. 1991. The discovery and importance of multiple islands of most-parsimonious trees. Systematic Zoology 40: 315-328. Maddison W. P. & Maddison D. R. 1992. MacClade: Analysis of Phylogeny and Character Evolution. Sinauer Associates, Sunderland. Maddison W. P., Donoghue M. J. & Maddison D. R. 1984. Outgroup analysis and parsimony. Systematic Zoology 33: 83-103. Marshall C. R. 1992. Substitution bias, weighted parsimony, and amniote phylogeny as inferred from 18s rRNA sequences. Molecular Biology and Evolution 9: 37c373. Mayr E. 1982. The Growth of Biological Thought. Harvard University Press, Harvard. McClure M. A., Vasi T. K. & Fitch W. M. 1994. Comparative analysis of multiple protein-sequence alignment methods. Molecular Biologv and Evolution 11: 571-592. Michener C. D. & Sokal R. R. 1957. A quantitative ;;:;;;I to a problem in classification. Evolution 11: Mickevich M. F. 1982. Transformation series analysis. Systematic Zoology 31: 461478. Mindell D. P. 1991. Aligning DNA sequences: homology and phylogenetic weighting. In: Phylogenetic Analysis of DNA Sequences (Edited by Miyamoto M. M. & Cracraft J.), pp. 73-89. Oxford University Press, New York. Miyamoto M. M. & Cracraft J. 1991. Phylogenetic inference, DNA sequence analysis, and the future of molecular systematics. In: Phylogenetic Analysis of DNA Sequences (Edited by Miyamoto M. M. & Cracraft J.), pp. 3317. Oxford University Press, New York. Nadler S. A. 1990. Molecular approaches to studying helminth population genetics and phylogeny. international Journai for Parasitology 20: 11-29. Neff N. A. 1986. A rational basis for a priori character weighting. Systematic Zoology 35: 110-123. Nei M. 1987. Molecular Evolutionary Genetics. Columbia University Press. New York. 616 Nei D. A. Morrison M. 1991. Relative efficiencies of different tree-making methods for molecular data. In: Phylogenetic Analysis of DNA Sequences (Edited by Miyamoto M. M. & Cracraft J.), pp. 90-128. Oxford University Press, New York. Nei M. & Li W.-H. 1979. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proceedings of the National Academy of Sciences of the U.S.A. 16: 526995213. Nelson G. J. 1978. Ontogeny, phylogeny, palaeontology, and the biogenetic law. Systematic Zoology 27: 324345. Nelson G. J. & Platnick N. I. 1981. Systematics and Biogeography. Cladistics and Vicariance. Columbia University Press, New York. Nixon K. C. & Carpenter J. M. 1993. On outgroups. Cladistics 9: 4133426. Olsen G. J. 1987. Earliest phylogenetic branchings: comparing rRNA-based evolutionary trees inferred with various techniques. Cold Spring Harbor Symposium on Quantitative Biology 52: 825-837. Olsen G. J. 1988. Phylogenetic analysis using ribosomal RNA. Methods in Enzymology 164: 793-812. Ota T. & Nei M. 1994. Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites. Journal of Molecular Evolution 38: 642-643. Page R. D. M. 1993. On the islands of trees and the efficiency of different methods of branch swapping in finding most-parsimonious trees. Systematic Biology 42: 200-210. Pamilo P. & Nei M. 1988. Relationship between gene trees and species trees. Molecular Biology and Evolution 5: 568-583. Patterson C. 1988. Homology in classical and molecular biology. Molecular Biology and Evolution 5: 603-625. Penny D. 1982. Towards a basis for classification: the incompleteness of distance measures, incompatibility analysis and phenetic classification. Journal of Theoretical Biology 96: 129-142. Penny D. & Hendy M. D. 1986. Estimating the reliability of evolutionary trees. Molecular Biology and Evolution 3: 403417. Penny D., Hendy M. D. & Henderson I. M. 1987. Reliability of evolutionary trees. Cold Spring Harbor Symposium on Quantitative Biology 52: 8577862. Penny D., Hendy M. D., Zimmer E. A. & Hamby R. K. 1990. Trees from sequences: panacea or Pandora’s box? Australian Systematic Botany 3: 21-38. Penny D., Hendy M. D. & Steel M. A. 1992. Progress with methods for constructing evolutionary trees. Trends in Ecology and Evolution 7: 73-19. Penny D., Watson E. E., Hickson R. E. & Lockhart P. J. 1993. Some recent progress with methods for evolutionary trees. New Zealand Journal of Botany 31: 275-288. Penny D., Steel M., Waddell P. J. & Hendy M. D. 1995. Improved analyses of human mtDNA sequences support a recent African origin of Homo sapiens. Molecular Biology and Evolution 12: 863-882. Pollock D. D. & Goldstein D. B. 1995. A comparison of two methods for constructing evolutionary distances from a weighted contribution of transition and transversion differences. Molecular Biology and Evolution 12: 713-717. Popper K. R. 1957. The Poverty of Historicism. Routledge & Kegan Paul, London. Rinsma-Melchert I. 1993. The expected number of matches in optimal global sequence alignments. New Zealand Journal of Botany 31: 219-230. Rodrigo A. G., Kelly-Borges M., Bergquist P. R. & Bergquist P. L. 1993. A randomisation test of the null hypothesis that two cladograms are sample estimates of a parametric phylogenetic tree. New Zealand Journal of Botany 31: 257-268. Rodriguez F., Oliver J. L., Marin A. & Medina J. R. 1990. The general stochastic model of nucleotide substitution. Journal of Theoretical Biology 142: 485-501. Rogers J. S. 1991. A comparison of the suitability of the Rogers, Modified Rogers, Manhattan, and CavalliSforza and Edwards distances for inferring phylogenetic trees from allele frequencies. Systematic Zoology 40: 63-73. Rohlf F. J., Chang W. S., Sokal R. R. & Kim J. 1990. Accuracy of estimated phylogenies: effects of tree topology and evolutionary model. Evolution 44: 1671-1684. Romer A. S. 1971. The Vertebrate Body, 4th edition. Saunders, Philadelphia. Rzhetsky A. & Nei M. 1992. A simple method for estimating and testing minimum-evolution trees. Molecular Biology & Evolution 9: 945-967. Rzhetsky A. & Nei M. 1993. Theoretical foundation of the minimum-evolution method of phylogenetic inference. Molecular Biology and Evolution IO: 1073-1095. Rzhetsky A. & Nei M. 1995. Tests of applicability of several substitution models for DNA sequence data. Molecular Biology and Evolution 12: 131-151. Saitou N. & Imanishi T. 1989. Relative efficiences of the Fitch-Margoliash, maximum-parsimony, maximumlikelihood, minimum-evolution, and neighbor-joining methods of phylogenetic tree construction in obtaining the correct tree. Molecular Biology and Evolution 6: 514-525. Saitou N. & Nei M. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 4: 406425. Sanderson M. J. 1995. Objections to bootstrapping phylogenies: a critique. Systematic Biology 44: 299-320. Sankoff D. 1975. Minimal mutation trees of sequences. SIAM Journal of Applied Mathematics 28: 3542. Sankoff D. & Cedergren R. J. 1983. Simultaneous comparison of three or more sequences related by a tree. In: Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparisons (Edited by Sankotf D. & Kruskal J. B.), pp. 253-263. Addison-Wesley, London. Sattath S. & Tversky A. 1977. Additive similarity trees. Psychometrika 42: 319-345. Sharkey M. J. 1989. A hypothesis-independent method of character weighting for cladistic analysis. Cladistics 5: 63-86. Shoemaker J. S. & Fitch W. M. 1989. Evidence from nuclear sequences that invariable sites should be considered when sequence divergence is calculated. Molecular Biologv and Evolution 6: 210-289. Siddall %. E. 1994. Randomcladistics. University of Toronto, Toronto. Sidow A. & Wilson A. C. 1990. Compositional statistics: an improvement of evolutionary parsimony and its application to deep branches in the tree of life. Journal of Molecular Evolution 31: 51-68. Sidow A. & Wilson A. C. 1991. Compositional statistics evaluated by computer simulations. In: Phylogenetic Analysis of DNA Sequences (Edited by Miyamoto M. M. Invited & Cracraft J.), pp. 129-146. Oxford University Press, New York. Smith A. B. 1994. Rooting molecular trees: problems and strategies. Biological Journal of the Linnean Society 51: 279-292. Sneath P. H. A. 1989. Analysis and interpretation of sequence data for bacterial systematics: the view of a numerical taxonomist. Systematic and Applied Microbiology 12: 15-31. Sokal R. R. & Michener C. D. 1958. A statistical method for evaluating systematic relationships. University of Kansas Science Bulletin 28: 1409-1438. Sourdis J. & Krimbas C. 1987. Accuracy of phylogenetic trees estimated from DNA sequence data. Molecular Biology and Evolution 4: 159-l 66. Specht T., Wolters J. & Erdman V. A. 1990. Compilation of 5s rRNA and 5s rRNA gene sequences. Nucleic Acids Research Suppl. 18: 2215-2235. Steel M. A. 1994. The maximum likelihood point for a phylogenetic tree is not unique. Systematic Biology 43: 560-564. Steel M. A., Hendy M. D. & Penny D. 1988. Loss of information in genetic distances. Nature 336: 118. Steel M. A., Hendy M. D. & Penny D. 1993. Parsimony can be consistent! Systematic Biology 42: 581-587. Steel M. A., Hendy M. D., Szbkely L. A. & Erd(is P. L. 1992. Spectral analysis and a closest tree method for genetic sequences. Applied Mathematics Letters 5: 63-67. Stevens P. F. 1980. Evolutionary polarity of character states. Annuaf Review of Ecology and Systemafics 11: 333-358. Stevens P. F. 1984. Homology and phylogeny: morphology and systematics. Systema& Bot&y 9: 395409. Stevens P. F. 1994. The Development of Biological Systematics: Anroine-Laurent de Jusiieu, Nature, and the kafural System. Columbia University Press, New York. Stewart C.-B. 1993. The powers and pitfalls of parsimony. Nature 361: 603-607. Studier J. A. & Keppler K. J. 1988. A note on the neighbor-joining algorithm of Saitou and Nei. Molecular Biology and Evolution 5: 729-731. Swofford D. L. 1981. On the utility of the distance Wagner procedure. In: Advances in Cladistics (Edited by Funk V. A. & Brooks D. R.), pp. 2543. New York Botanical Garden, New York. Swofford D. L. 1991. When are phylogeny estimates from molecular and morphological data incongruent? In: Phylogenetic Analysis of DNA Sequences (Edited by Miyamoto M. M. & Cracraft J.), pp. 295-333. Oxford University Press, New York. Swofford D. L. 1993. PA UP: Phylogenetic Analysis Using Parsimonv. Smithsonian Institution, Washington. Swofford D. L. & Berlocher S. H. 1987. Inferring evolutionary trees from gene frequency data under the-principle of maximum oarsimonv. Svstematic Zoology 36: 293-325. Swofford D. i. & Mahdiion W. P. 1987ykeconstructing ancestral character states under Wagner parsimony. Mathematical Biosciences 87: 199-229. Swofford D. L. & Olsen G. J. 1990. Phylogeny reconstruction. In: Molecular Sysfematics (Edited by Hillis D. M. & Moritz C.), pp. 411-501. Sinauer Associates, Sunderland. Takezaki N. & Nei M. 1994. Inconsistency of the maximum parsimony method when the rate of nucleotide substitution is constant. Journal oj’ Molecular Evolution 39: 210-218. Review 617 Tateno Y., Nei M. & Tajima F. 1982. Accuracy of estimated phylogenetic trees from molecular data. I. Distantly related trees. Journal of Molecular Evolution 18: 387404. Tateno Y., Takezaki N. & Nei M. 1994. Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rates vary with site. Molecular Biology and Evolution 11: 261-277. Tenter A. M., Baverstock P. R. & Johnson A. M. 1992. Phylogenetic relationships of Sarcocysfis species from sheep, goats, cattle and mice based on ribosomal RNA sequences. International Journal for Parasitology 22: 503-513. Thorne J. L. & Kishino H. 1992. Freeing phylogenies from artifacts of alignment. Molecular Biology & Evolution 9: 1148-1162. Tillier E. R. M. & Collins R. A. 1995. Neighbor joining and maximum likelihood with RNA sequences: addressing the interdependence of sites. Molecular Biology and Evolution 12: 7-15. Van de Peer Y., Van den Broeck I., De Rijk P. & De Wachter R. 1994. Database on the structure of small ribosomal subunit RNA. Nucleic Acids Research 21 3488-3494. Waterman M. S., Smith T. F., Singh M. & Beyer W. A. 1977. Additive evolutionary trees. Journal of Theoretical Biology 64: 199-213. * Waterman M. S. 1989. Mathematical Methods for DNA Sequences. CRC Press, Boca Raton. Watrous L. E. & Wheeler Q. D. 1981. The outgroup comparison method of character analysis. Systematic zoology 30: l-l 1. West J. G. & Faith D. P. 1990. Data, methods and assumptions in phylogenetic inference. Australian Systematic Botany 3: 9-20. Weston P. H. 1988. Indirect and direct methods in systematics. In: Ontogeny and Sysfematics (Edited by Humphries C. J.), pp. 27-56. Columbia University Press, New York. Weston P. H. 1994. Methods for rooting cladistic trees. In: Models in Phvlopenv Reconstruction (Edited bv Scotland R. W., Siebeit b. j. & Williams D: M.), pp- 125-155. Clarendon Press, Oxford. Wheeler W. C. 1990. Nucleic acid sequence phylogeny and random outgroups. Cladisfics 6: 363-367. Wheeler W. C. 1995. Sequence alignment, parameter sensitivity, and phylogenetic analysis of molecular data. Systematic Biology 44: 321-331. Williams D. M. 1993. A note on molecular homology: multiple patterns from single datasets. Cladistics 9: 233-245. Williams P. L. & Fitch W. M. 1989. Finding the minimal change in a given tree. In: The Hierarchy of Life: Mole&es and Morphology in Phylogenetk knal&is (Edited bv Fernholm B.. Bremer K. & Jiirnvall H.), pp. 453470. Elsevier, Amsterdam. Yang 2. 1995. Evaluation of several methods for estimating phylogenetic trees when substitution rates differ over nucleotide sites. Journal of Molecular Evolution 40: 689-697. Zharkikh A. & Li W.-H. 1993. Inconsistency of the maximum-parsimony method: the case of five taxa with a molecular clock. Systematic Biology 42: 113-125.
© Copyright 2026 Paperzz