Syst. Biol. 52(4):439–459, 2003 c Society of Systematic Biologists Copyright ° ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150390218132 Phylogenetics of Advanced Snakes (Caenophidia) Based on Four Mitochondrial Genes CHRISTOPHER M. R. K ELLY,1,2 NIGEL P. B ARKER,3 AND M ARTIN H. VILLET 1 1 Department of Zoology and Entomology, Rhodes University, Grahamstown, South Africa Department of Zoology, Oxford University, Oxford OX1 3PS, England; E-mail: [email protected] 3 Molecular Ecology and Systematics Facility, Department of Botany, Rhodes University, Grahamstown, South Africa 2 Abstract.—Phylogenetic relationships among advanced snakes (Acrochordus + Colubroidea = Caenophidia) and the position of the genus Acrochordus relative to colubroid taxa are contentious. These concerns were investigated by phylogenetic analysis of fragments from four mitochondrial genes representing 62 caenophidian genera and 5 noncaenophidian taxa. Four methods of phylogeny reconstruction were applied: matrix representation with parsimony (MRP) supertree consensus, maximum parsimony, maximum likelihood, and Bayesian analysis. Because of incomplete sampling, extensive missing data were inherent in this study. Analyses of individual genes retrieved roughly the same clades, but branching order varied greatly between gene trees, and nodal support was poor. Trees generated from combined data sets using maximum parsimony, maximum likelihood, and Bayesian analysis had medium to low nodal support but were largely congruent with each other and with MRP supertrees. Conclusions about caenophidian relationships were based on these combined analyses. The Xenoderminae, Viperidae, Pareatinae, Psammophiinae, Pseudoxyrophiinae, Homalopsinae, Natricinae, Xenodontinae, and Colubrinae (redefined) emerged as monophyletic, whereas Lamprophiinae, Atractaspididae, and Elapidae were not in one or more topologies. A clade comprising Acrochordus and Xenoderminae branched closest to the root, and when Acrochordus was assessed in relation to a colubroid subsample and all five noncaenophidians, it remained associated with the Colubroidea. Thus, Acrochordus + Xenoderminae appears to be the sister group to the Colubroidea, and Xenoderminae should be excluded from Colubroidea. Within Colubroidea, Viperidae was the most basal clade. Other relationships appearing in all final topologies were (1) a clade comprising Psammophiinae, Lamprophiinae, Atractaspididae, Pseudoxyrophiinae, and Elapidae, within which the latter four taxa formed a subclade, and (2) a clade comprising Colubrinae, Natricinae, and Xenodontinae, within which the latter two taxa formed a subclade. Pareatinae and Homalopsinae were the most unstable clades. [Advanced snakes; Bayesian analysis; Caenophidia; Colubroidea; data combination; maximum likelihood; maximum parsimony; mitochondrial DNA; supertrees.] Snakes have conventionally been divided into two snakes.” Dowling and Duellman (1978) and Dowling major monophyletic groups, the Scolecophidia (blind et al. (1983, 1996) placed it within the Colubroidea, snakes) and the Alethinophidia (true snakes) (Rieppel, associated with Homalopsinae or Natricinae. Recent 1979; Cadle, 1987; McDowell, 1987). Monophyly of both mtDNA-based studies (e.g., Kraus and Brown, 1998; blind snakes and true snakes is supported by a variety Gravlund, 2001) have not resolved this issue, but of osteological, soft anatomy, and ecological characters Slowinski and Lawson (2002), using both mtDNA and (Lee and Scanlon, 2002). Various divisions of true snakes nuclear gene sequences, resolved Achrochordus as basal have been attempted. Hoffstetter (1955) split them into to the Colubroidea with moderate to good bootstrap the Henophidia (primitive snakes) and the Caenophidia support. (advanced snakes), an arrangement followed by Romer Colubroidea is substantially the most speciose snake (1956), Underwood (1967), and Rieppel (1979). How- lineage, incorporating approximately 80% (2,500 species) ever, there is little support for monophyly of primitive of recognized extant snake diversity (Dowling and snakes (McDowell, 1987; Rieppel, 1988). In this article, Duellman, 1978). The proper subdivision of Colubroidea taxa known to be nonmonophyletic will be distinguished and the relationships of its constituent clades are subby enclosure in single quotation marks. Currently, three jects of continued debate and conflict (Cadle, 1987; successively less inclusive clades are recognized within McDowell, 1987). For instance, Underwood (1967) recogtrue snakes: the Macrostomata, the Caenophidia, and nized six colubroid families and 15 subfamilies based on the Colubroidea. Lee and Scanlon (2002) listed the oste- a wide variety of morphological characters, Dowling and ological, soft anatomy, and ecological synapomorphies Duellman (1978) split the group into three families and of these clades. Immunological evidence (Cadle, 1988) 13 subfamilies using hemipenial morphology, maxillary and mitochondrial DNA (mtDNA) sequence data (Heise dentition, and vertebral characters, and McDowell (1987) et al., 1995; Kraus and Brown, 1998; Gravlund, 2001) identified four families and 18 subfamilies on the basis of generally lend additional support to the monophyly of morphological and immunological data. Currently, four these three subgroups of true snakes. Noncaenophidian families are formally recognized: Atractaspididae (one macrostomatans are referred to here as “basal macros- or two subfamilies, 3% of colubroid diversity), Viperidae tomatans.” (two to four subfamilies, 10%), Elapidae (two subfamiAs currently defined, Caenophidia (advanced snakes) lies, 12%), and “Colubridae” (approximately 10 subfamicomprises a sister relationship between Colubroidea and lies, 75%) (Greene, 1997; Pough et al., 2001). In this study, the genus Acrochordus (Acrochordidae) (Rieppel, 1988; we adopted a composite arrangement of the Colubroidea Greene, 1997; Pough et al., 2001, Lee and Scanlon, 2002). (Table 1) adapted from Smith et al. (1977), Heymans However, the placement of Acrochordus is contentious. (1982), Groombridge (1986), McDowell (1987), Cadle Hoffstetter and Gayrard (1965), Underwood, (1967), and (1987, 1988), Underwood and Kochva (1993), Dowling Heise et al. (1995) placed Acrochordus in the “primitive et al. (1996), Greene (1997), Slowinski et al. (1997), and 439 440 VOL. 52 SYSTEMATIC BIOLOGY TABLE 1. Taxa used in this study, their gene representation, and sources of sequences (GenBank accession number or author). Sequences are full length unless otherwise specified. The classification scheme employed is explained in the text. Multiple representatives of single genera were assembled into supraspecific terminals at the generic level, and Typhlops lumbricalis and Rhamphotyphlops braminus were assembled into a supraspecific terminal at the family level. Numbers in parentheses after taxon names indicate the data set (1, 2, or both) for each taxon. Sources of sequences Classification Scolecophidia Typhlopidae Typhlops lumbricalis (2) Rhamphotyphlops braminus (2) Alethinophidia Macrostomata Tropidophiidae Tropidophis haitianus (2) Tropidophis wrighti (2) Loxocemidae Loxocemus bicolour (2) Pythonidae Python reticulatus (2) Boidae Boa constrictor (1, 2) Caenophidia Acrochordidae Acrochordus granulatus (1, 2) Acrochordus javanicus (1, 2) Colubroidea Atractaspididae Aparallactinae Aparallactus guentheria (1) Aparallactus werneri (1) Atractaspininae Atractaspis bibronii (1) Atractaspis corpulenta (1) Atractaspis micropholis (1) “Colubridae” Calamariinae Oreocalamus hanitschi (1) Colubrinae Boiga cynodon (1) Boiga dendrophila (1) Coluber constrictor (1) Crotaphopeltis hotamboeia (1) Dasypeltis medici (1) Dasypeltis scabraa (1) Dendrelaphis calligastra (1) Dendrelaphis pictus (1) Dinodon semicarinatum (1, 2) Elaphe flavolineata (1) Elaphe obsoleta (1) Gastropyxis smaragdina (1) Lycodon capucinus (1) Lycodon laoensis (1) Unplaced Grayia ornata (1) Homalopsinae Cerberus rhynchops (1) Enhydris enhydris (1) Enhydris plumbea (1) Natricinae Macropisthodon rudis (1) Natrix natrix (1) Nerodia fasciata (1) Nerodia rhombifera (1) Nerodia taxispilota (1) Thamnophis butleri (1) Thamnophis elegans (1) Pareatinae Aplopeltura boa (1) Pareas nuchalis (1) Psammophiinae Hemirhagerrhis nototaenia (1) Hemirhagerrhis viperinusa (1) Cytb ND4 12Sr RNA 16S rRNA Z46444 (0.32 kb) Z46475 Z46445 (0.38 kb) Z46476 U69845 Z46456 (0.38 kb) Z46486 U69860 Z46448 (0.38 kb) Z46478 Z46470 Z46495 Z46472 Z46502 Z46597 Z46499 Z46468 Z46525 L01765 (0.37 kb) Gravlund, 2001 (0.28 kb) Gravlund, 2001 (0.28 kb) L01770 Gravlund, 2001 Gravlund, 2001 (0.36 kb) U69865 (0.78 kb) U69869 (0.76 kb) U69746 (0.80 kb) Forstner et al., 1995 AF217841 (0.80 kb) U49296 AY235730 (0.77 kb) U49315 U49314 AF039261 (0.65 kb) U49306 AF217818 (0.80 kb) U49303 U49300 AY235729 (0.79 kb) AF139569 NC001945 U49304 NC001945 U49301 AF283643 (0.80 kb) NC001945 NC001945 Z46469 (0.36 kb) AF158435 (0.29 kb) Z46493 AF158504 (0.34 kb) Z46455 Z46485 AF158434 (0.29 kb) AF158503 (0.34 kb) Z46458 Z46492 AF158461 (0.28 kb) AF158530 (0.34 kb) Z46452 Z46481 Gravlund, 2001 (0.28 kb) Gravlund, 2001 U49317 U49327 U49328 U49326 AF172392 (0.30 kb) L33334 (0.30 kb) U49322 U49324 AF305671 (0.31 kb) U49312 U49311 AY235725 (0.75 kb) (Continued on next page) 2003 441 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES TABLE 1. Continued Sources of sequences Classification Malpolon monspessulanusa (1) Psammophis condenarus (1) Psammophis mossambicusa (1) Psammophylax variabilisa (1) Rhamphiophis acutusa (1) Rhamphiophis oxyrhynchus (1) Xenoderminae Achalinus rufescens (1) Xenodermus javanicus (1) Xenodontinae Alsophis portoricensis (1) Dipsas catesbyi (1) Farancia abacura (1, 2) Helicops angulatus (1) Helicops pictiventris (1) Heterodon nasicus (1) Heterodon simus (1) Hydrodynastes bicinctus (1) Liophis reginae (1) Xenodon neuwiedi (1) Xenodon servus (1) Lamprophiinae Lamprophis fuliginosus (1) Lamprophis inornatusa (1) Mehelya capensis (1) Mehelya nyassaea (1) Pseudoxyrophiinae Leioheterodon madagascariensis Madagascarophis colubrine (1) Unplaced Homoroselaps lacteus (1) Elapidae Elapinae Aspidelaps scutatus (1) Boulengerina annulata (1) Boulengerina sp. (1) Bungarus fasciatus (1) Dendroaspis polylepis (1) Dendroaspis sp. (1) Elapsoidea nigra (1) Maticora bivirgata (1) Micruroides euryxanthus (1) Micrurus diastema (1, 2) Micrurus fulvius (1, 2) Naja kaouthia (1) Naja naja (1) Ophiophagus hannah (1) Hydrophiinae Hydrophis semperi (1) Laticauda colubrina (1) Viperidae Azemiophinae Azemiops feae (1) Causinae Causus defilippi (1) Causus rhombeatus (1) Crotalinae Agkistrodon contortrix (1) Calloselasma rhodostoma (1) Crotalus horridus (1, 2) Crotalus viridis (1, 2) Trimeresurus albolabris (1) Trimeresurus tokarensis (1) Viperinae Bitis arietansa (1) Vipera ammodytes (1) Vipera ursinii (1) a Cytb ND4 AY235721 AY235722 AY235724 (0.47 kb) AY235723 (0.38 kb) 12Sr RNA 16S rRNA Gravlund, 2001 (0.28 kb) Z46450 Gravlund, 2001 (0.33 kb) Z46479 Gravlund, 2001 (0.28 kb) Gravlund, 2001 Z46443 Z46738 U49319 U49320 U49308 U69832 (0.80 kb) U49307 U49310 Forstner et al., 1995 AF158448 (0.29 kb) Z46459 Z46467 AF158408 (0.29 kb) AF158517 (0.34 kb) Z46496 Z46491 AF158478 (0.34 kb) AF158428 (0.29 kb) AF158494 (0.32 kb) AF158430 (0.29 kb) AF158433 (0.29 kb) AF158479 (0.34 kb) AF158501 (0.34 kb) AF217840 (0.80 kb) AF236814 (0.30 kb) Z46449 Z46474 Z46457 Z46489 AY235727 (0.77 kb) Gravlund, 2001 (0.28 kb) Gravlund, 2001 AY235726 U49318 U49313 AF217833 (0.80 kb) AF217828 (0.80 kb) AF217829 (0.80 kb) U96790 U96792 Z46466 Z46501 U96795 U96804 U96800 Z46433 Z46454 Z46483 Z46484 AF217842 (0.80 kb) Z46453 Z46451 Z46482 Z46480 AF217822 (0.80 kb) AF217834 (0.80 kb) U96798 U96799 AF217830 (0.80 kb) AF217832 (0.80 kb) U49297 AF217820 (0.80 kb) AF217812 (0.80 kb) AF217823 (0.80 kb) AF217839 (0.80 kb) AF217835 (0.80 kb) AF182539 (0.17 kb) U49298 U41865 AF057187 AF057234 AF057186 AF057233 Z46473 AF057190 Z46460 Z46524 AF057237 Z46497 U41866 AF039268 (0.64 kb) AF171918 (0.66 kb) U41868 U41878 AF147860 (0.68 kb) AF171909 (0.66 kb) U41882 U41890 AY235728 AF182552 (0.17 kb) Taxa for which Cytb data were generated in this study. Z46446 Z46739 Z46471 L01768 (0.37 kb) Z46498 L01769 442 SYSTEMATIC BIOLOGY Pough et al. (2001). The authorship of names for the higher categories used here is given in these papers. Unless otherwise specified, subsequent reference to higher taxa will follow Table 1. Underwood and Kochva (1993) found the Atractaspididae to be monophyletic based on a variety of morphological characters. This family has been poorly represented in most molecular studies, but its monophyly is tentatively supported by mtDNA data (Kraus and Brown, 1998). However, on the basis of serum albumin divergence estimates, Cadle (1994) questioned the monophyly of Atractaspididae. Viperidae appears to be monophyletic based on morphology (Liem et al., 1971) and mtDNA (Heise et al., 1995; Kraus and Brown, 1998; Gravlund, 2001), as does Elapidae (morphology: McDowell, 1968; McCarthy, 1985; mtDNA: Kraus and Brown, 1998; Slowinski and Keogh, 2000). “Colubridae,” by far the largest and most diverse colubroid family, is demonstrably nonmonophyletic (Heise et al., 1995; Kraus and Brown, 1998; Gravlund, 2001). Two basic issues have plagued attempts at elucidation of relationships within advanced snakes. The first is the difficulty of identifying monophyletic subdivisions. Such groupings have traditionally been based on morphological data (Bogert, 1940; Underwood, 1967; Bourgeois, 1968; Dowling and Duellman, 1978). However, because snakes lack limbs all of the morphological variation associated with such appendages is absent in snakes, resulting in a restricted set of potential morphological characters (Dowling et al., 1996). The great diversity of advanced snakes has led to difficulty in assessment of these characters across the entire group (Kraus and Brown, 1998), and high degrees of homoplasy are associated with many such characters (Kraus and Brown, 1998). Because of these factors, few wellsupported clades have been recognized, leading to conflict in proposed classifications. Recent mtDNA-based studies on select groups of advanced snakes have contributed substantially to identification of monophyletic groups (Slowinski and Keogh, 2000; Vidal et al., 2000), but much more such research is required. Second, the generation of valid hypotheses of relationship also is problematic. Recent molecular studies (Vidal et al., 2000; Lenk et al., 2001) have contributed to resolution of relationships within putative monophyletic subsets of advanced snakes. However, all existing phylogenies of the group as a whole differ to some extent in their topology, and little consensus has been reached on the placement of even well-supported clades. Many researchers agree that Acrochordidae is sister to the Colubroidea (Cadle, 1988; Rieppel, 1988, Lee and Scanlon, 2002), within which Viperidae is basal (Cadle, 1988; Rieppel, 1988; Heise et al., 1995). However, McDowell (1979, 1987) and Heise et al. (1995) considered Acrochordidae to be far distant from the Colubroidea, and the basal position of the Viperidae within Colubroidea was disputed by Kraus and Brown (1998) and Gravlund (2001). Thus, even these general arrangements are questionable. VOL. 52 The aims of this study were (1) to identify monophyletic higher taxa within advanced snakes, (2) to assess the support for currently recognized higher taxa (Table 1), (3) to generate robust hypotheses of relationship within (where possible) and between the monophyletic taxa identified, and (4) to investigate the position of Acrochordus in relation to colubroids and “basal macrostomatans.” We applied four different methods of phylogenetic analysis: supertree consensus (Gordon, 1986; Sanderson et al., 1998), maximum parsimony (MP), maximum likelihood (ML), and Bayesian inference (Mau, 1996; Rannala and Yang, 1996; Mau and Newton, 1997; Yang and Rannala, 1997; Larget and Simon, 1999; Mau et al., 1999). These analyses were based on DNA sequence fragments from four (aims 1–3) or three (aim 4) mitochondrial genes (investigated singly and in various combinations). We included representatives from most proposed families and subfamilies of advanced snakes, from three of the “basal macrostomatan” families, and from the blind snakes. Many morphological and molecular assessments of relationships among advanced snakes have been hindered by limited character inclusion and by taxon sampling problems. The effects of taxon sampling have been reviewed by Lecointre et al. (1993) and, in the current context, by Kraus and Brown (1998). Few new sequence data were generated in this study, but our analyses brought together for the first time published data from many different reports. We have thus included significantly more taxa and characters than have been used in previous studies. M ATERIALS AND M ETHODS Character and Taxon Sampling It is well known that phylogenetic reconstruction benefits from increased character inclusion, provided all characters share a common evolutionary history. This is generally because of the increase in signal-to-homoplasy ratio that occurs as more data are added, with a consequent increase in the likelihood of resolving hidden phylogenetic signal (e.g., de Queiroz et al., 1995; Nixon and Carpenter, 1996). In the molecular paradigm, it is advantageous to include data from multiple genes because the evolutionary rates of different genes often vary (Crozier, 1990). Each gene may thus help to resolve relationships at different hierarchical levels. In this study, we included as many data from as many genes as were available. Mitochondrial DNA phylogenies of many vertebrate groups are strongly influenced by taxon sampling (Philippe and Douzery, 1994; Milinkovitch et al., 1996; Kraus and Brown, 1998; Matthee et al., 2001). We thus favored maximum taxon inclusion despite the consequent introduction of large quantities of missing data. Elimination of incomplete taxa a priori removes potentially important character interactions from the analysis (Novacek, 1992; Kearney, 1998) and falsely assumes that all incomplete taxa will be equally detrimental to analyses (Kearney, 1998). Anderson (2001) noted that because the stability of a taxon on a tree depends on topology, 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES elimination of the contribution of a taxon to topology prevents it from determining a stable position. Consequently, the inclusion of incomplete taxa is required to maintain a well-resolved tree. Our complete data set comprised 98 species (91 advanced snakes, 5 “basal macrostomatans,” and 2 blind snakes) represented by varying combinations of four mitochondrial gene fragments: cytochrome b (Cytb; maximum 804 bp), 12S ribosomal RNA (rRNA; maximum 421 bp and 31 gap characters), 16S rRNA (maximum 381 bp and 7 gap characters), and NAD dehydrogenase subunit 4 (ND4; maximum 694 bp) (Table 1). We included representatives from four of the (approximately) seven “basal macrostomatan” families and all families and subfamilies of advanced snakes proposed to date except Calliopheinae (Elapidae) and Pseudoxenodontinae (“Colubridae”) (McDowell, 1987), which contain one and two genera, respectively. Terminal Taxa Whenever possible, we applied the exemplar approach (Yeates, 1995; Bininda-Emonds et al., 1998), in which species are used as terminal entities. However, to maximize gene representation per terminal in analyses involving gene combinations (and thus minimize missing data), it was often necessary to combine data from multiple species to form supraspecific terminals (Bininda-Emonds et al., 1998) at the generic level and, in one case (Typhlopidae), at the family level. The greatest potential fault with this approach (the assumption of monophyly of supraspecific taxa; Yeates, 1995; Kron and Judd, 1997; Bininda-Emonds et al., 1998) was not considered a problem in this study. Splitting of the Data Set The first three aims of this article involved assessment of relationships within advanced snakes rather than relationships between advanced snakes and “basal macrostomatans.” To achieve these aims we created data set 1 with the following characteristics: (1) inclusion of data from all four genes, (2) inclusion of all advanced snake taxa, and (3) exclusion of all blind snakes and all “basal macrostomatan” taxa except Boa constrictor, which was used as the outgroup. We excluded these two groups of snakes (1) because ND4 sequence data was lacking for all “basal macrostomatans” except Boa and we wanted to have an outgroup with no missing data and (2) because we wanted to reduce the chances of long-branch attraction as far as possible and some “basal macrostomatans” and blind snakes are only distantly related to advanced snakes. In data set 1 with one outgroup and 62 ingroup terminals, 27 taxa are supraspecific, 22 lack Cytb characters, 13 lack 12S characters, 19 lack 16S characters, 29 lack ND4 characters, and 16 (including the outgroup) are represented by all four genes. Our final aim was to investigate the position of Acrochordus in relation to colubroids and “basal macrostomatans.” We created data set 2 with the following characteristics: (1) exclusion of ND4 data because these data 443 were lacking for all “basal macrostomatans,” (2) inclusion of only a representative colubroid sample because analysis of data set 1 showed that Acrochordus was not nested within the Colubroidea, (3) inclusion of all “basal macrostomatans,” and (4) inclusion of blind snakes as the outgroup. In data set 2 with one outgroup and nine ingroup terminals, five taxa are supraspecific, and all are represented by the Cytb, 12S, and 16S genes. Molecular Techniques Mitochondrial Cytb gene sequences from 10 species (Appendix) were generated for the present study. The remaining data were obtained from GenBank or from the literature (Table 1). Total genomic DNA was extracted from liver, muscle, or blood samples using a modified Chelex 100 protocol (Walsh et al., 1991) and was used as a template in polymerase chain reaction (PCR) amplifications (Saiki et al., 1986). For each species, a fragment of the Cytb gene a maximum of 804 bp long was amplified with primers L14724 (50 -CGA AGC TTG ATA TGA AAA ACC ATC GTT G-30 ) (Irwin et al., 1991) and H15547b (50 -AAT AGG AAG TAT CAT TCT GGT TTA ATG-30 ) (Campbell, 1997, unpubl.). Reactions were conducted in 50- or 100-µl volumes typically with 35 cycles of 45 sec at 95◦ C (denaturation), 45 sec at 52◦ C (annealing), and 3 min at 72◦ C (extension), followed by a final 10-min extension at 72◦ C. PCR products were purified using the QIAquick PCR Purification Kit (Quiagen). Sequencing was carried out with an ABI 3100 Genetic Analyzer, using the BigDye Terminator Cycle Sequencing Kit (Applied Biosystems) according to the manufacturer’s protocols. Both strands were sequenced for most taxa. Primers used for sequencing included the amplification primers and two internal primers: SNKP2 (50 -TGA (GA)GA CAA ATA TCA TTC-30 ; this study) and SNKP3 (50 -GAA TGA TAT TTG TC(C/T) TCA-30 ; this study). New sequences were edited and contiguous sequences were assembled using the program Sequencher 3.0 (GeneCodes Corporation). Sequence Alignment All sequences were aligned manually using the software DAPSA 4.0 (Harley, 1996). The arrangement of Vidal et al. (2000), based on secondary structure, was used as a guide for alignment of 12S sequences. Sequence Characteristics Unless otherwise specified, sequence characteristics were investigated using data set 1 only, and analyses were performed using PAUP∗ 4.0b8 (Swofford, 2000). When using methods based on explicit models of DNA evolution, we employed ModelTest 3.04 (Posada and Crandall, 1998) to identify the model of DNA substitution that best fit the data. Investigations of mutational saturation were conducted for each gene by plotting pdistance versus corrected estimates (based on the model identified by ModelTest) of proportional sequence divergence. In all cases transitions (TIs) and transversions 444 VOL. 52 SYSTEMATIC BIOLOGY (TVs) were considered separately, and protein-coding genes were further divided into first, second, and third codon positions. Phylogenetic signal in each single gene and in data sets 1 and 2 was investigated using the g1 statistic for skewness of tree length distributions (Hillis and Huelsenbeck, 1992), estimated from 100,000 random trees, and the published critical values. However, a significant result may occur if phylogenetic signal is confined to limited portions of the topology. This possibility was tested in data set 1 by deleting all clades with bootstrap support of ≥50% and then recalculating the g1 statistic. The compliance of mtDNA with the molecular clock hypothesis was tested by applying the likelihood ratio test (Felsenstein, 1988) to a subset of data set 1 comprising all four genes and including only those taxa represented by all genes. Data Combination Most phylogenetic analyses were conducted on combinations of the data from all four genes included in this study. Data combination is justified here for the following reasons. First, the entire animal mitochondrial genome is inherited as a single unit, so different mitochondrial genes are not independent estimates of organismal phylogeny (Moore, 1995; Page, 2000). Consequently, all four genes should share the same evolutionary history and should retrieve the same phylogenetic relationships. Second, no strong conflict in signal was revealed by separate analyses of individual genes, thus providing no reason to suspect that these genes differ in the phylogenetic relationships they retrieve. Phylogenetic Analyses Data set 1 was analyzed using supertree consensus methods, MP, ML, and Bayesian inference. Analysis of data set 2 involved all of the above methods except supertree consensus. Subset Analysis and Supertree Consensus (Data Set 1) Data set 1 included many missing data entries. In phylogeny reconstruction, missing characters can result in almost random placement of affected taxa (Nixon and Wheeler, 1992), leading to a proliferation of best trees and a consequent reduction in the resolution of consensus trees (Huelsenbeck, 1991; Novacek, 1992). To avoid this problem, we divided data set 1 into 15 subsets: each single gene and all combinations of two, three, and four genes. Each subset included only those terminal taxa represented by all of the subset’s constituent genes, thus avoiding inclusion of missing data. All data subsets were analyzed using MP (see Maximum parsimony section below). The data subset combining all four genes was analyzed using both MP and ML (see Maximum likelihood section below). The matrix representation with parsimony (MRP) supertree consensus method (Baum, 1992; Ragan, 1992), implemented in the program RadCon (Thorley and Page, 2000), was then used to combine the resultant most-parsimonious trees (MPTs). Supertree consensus methods facilitate fusion of trees with differ- ent but overlapping terminal sets. Nodes of the input trees were weighted according to their bootstrap support (Ronquist, 1996), with 70–79% bootstrap support weighted 3, 80–89% weighted 6, and 90–100% weighted 9 (base weight = 1). Following the suggestion of BinindaEmonds and Bryant (1998), Camin–Sokal parsimony (prohibiting reversals) and Fitch parsimony (allowing reversals) were assessed as alternatives. The most appropriate analysis was identified by tree comparison using the Shimodaira–Hasegawa (SH) test (see Hypothesis Testing section below) (Goldman et al., 2000). MP, ML, and Bayesian Analyses Maximum parsimony.—Gaps in alignments were coded using the complex indel coding system (Simmons and Ochoterena, 2000), which allows all available information to be utilized. This method treats gaps that have different 50 and/or 30 termini as presence/absence characters and is based on the application of six rules. Three weighting schemes for nucleotide data were investigated for MP analysis of data sets 1 and 2 and of the subsets of data set 1. Scheme 1 involved equal weighting, in which all nucleotide positions were weighted equally regardless of type or position. Scheme 2 involved differential position weighting, in which saturated substitution subsets were downweighted according to their TI:TV ratio, as estimated by ML under the HKY85 + I + 0 (Hasegawa et al., 1985) model of nucleotide substitution. Scheme 3 involved following the suggestion of Lenk et al. (2000), in which the optimal DNA substitution model identified by ModelTest was transferred to an MP framework by construction of an asymmetric step matrix based on the inverse values of the instantaneous substitution rate matrix (Q matrix) of the model, rounded to the nearest integer, and adjusted to satisfy the triangle inequality (Sankoff, 1975). Each of these three schemes was applied with (denoted “a”) and without subsequent weighting based on the successive approximation method (Farris, 1969), using the rescaled consistency index of equally weighted characters as a measure of homoplasy. Trees from each scheme were compared using the SH test, and the most suitable weighting scheme was thus identified. Where there was no significant difference between the trees from different schemes (P ≥ 0.05), the scheme producing the fewest MPTs was chosen. All MP searches were performed with 1,000 random sequence-addition replicates and tree bisection– reconnection (TBR) branch swapping using PAUP∗ 4.0b8 and including only informative characters. We used heuristic searches in most cases, but data set 2 and some of the subsets of data set 1 were small enough to make exhaustive searches possible. Nonparametric bootstrap analyses (Felsenstein, 1985) with 1,000 pseudoreplicates and 10 random sequence-addition replicates were conducted, applying character weights where relevant. All characters were sampled with equal probability. The stability of taxa across sets of bootstrap trees was assessed using the leaf stability (LS) method (Thorley and Wilkinson, 1999) implemented in RadCon. 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES Maximum likelihood.—ML analysis was restricted to data sets 1 and 2 and to the subset of data set 1 comprising all four genes and only those taxa represented by all genes. The general time reversible model with some sites assumed to be invariable and with variable sites assumed to follow a discrete gamma distribution (GTR + I + 0; Rodrı́guez et al., 1990; Yang, 1994) was selected by ModelTest as the optimal model of nucleotide substitution for all datasets. PAUP∗ 4.0b8 was used to implement heuristic ML searches with TBR branch swapping and with starting trees generated by neighbor joining (NJ; Saitou and Nei, 1987). Parameters under the GTR + I + 0 model were set according to the corresponding ModelTest outputs. Computational restrictions precluded bootstrap analysis within the ML paradigm. Bayesian inference.—Bayesian phylogenetic analyses of data sets 1 and 2 were conducted with MrBayes 3.0b3 (Huelsenbeck and Ronquist, 2001), using the GTR + I + 0 model of nucleotide substitution. Model parameter values were treated as unknown and were estimated in each analysis. Random starting trees were used, and analyses were run for 1 million generations, sampling the Markov chains every 100 generations. We used three methods to ensure that our analyses were not trapped in local optima: (1) each analysis was run twice, starting from different random trees, and the log-likelihood (lnL) values at stationarity were compared for convergence (Huelsenbeck and Bollback, 2001); (2) Metropolis-coupled Markov chain Monte Carlo was applied, allowing a more extensive exploration of parameter space (Huelsenbeck and Ronquist, 2001), with one cold and three incrementally heated Markov chains, applying the MrBayes default heating values; and (3) topologies and posterior clade probabilities from different runs were compared for congruence (Huelsenbeck and Imennov, 2002). We plotted the log-likelihood scores of sample points against generation time and considered stationarity of Markov chains to have been reached when the log-likelihood values reached a stable equilibrium (Huelsenbeck and Ronquist, 2001). All sample points prior to stationarity were discarded as burn-in values, and remaining points were used to generate a 50% majority rule consensus tree, with each clade posterior probability value represented by the proportion of sample points recovering that specific clade. Hypothesis Testing It was often necessary to statistically compare alternative phylogenetic hypotheses, e.g., to facilitate choice of the best weighting scheme in MP analyses. For such comparisons, we used the SH test with 10,000 bootstrap pseudoreplicates and the resampling estimated log-likelihood approximation. This test was applied in place of the Kishino–Hasegawa (KH) test (Kishino and Hasegawa, 1989) because the KH test is invalid when topologies being compared are specified a posteriori (Shimodaira and Hasegawa, 1999). One potential problem with the SH test is that it may appear conservative 445 (Shimodaira, 2002), but in the current context this was not considered a serious issue. R ESULTS Alignment and Sequence Characteristics Sequences generated in this study were deposited in GenBank under accession numbers AY235721–AY235730 (see Table 1). Alignment of protein coding sequences (Cytb and ND4) was unambiguous. No gaps were introduced into the ND4 alignment, and the Cytb alignment required a single triplet gap near the beginning of the Cytb gene of Lamprophis inornatus. For the 16S sequences, alignment was unambiguous except in two highly variable regions corresponding to loops (sites 2145–2170 and 2183–2189 in the mitochondrial genome of Dinodon semicarinatum), which were excluded from the alignment. The 12S and 16S alignments required the introduction of 31 and 7 gaps, respectively (data set 1), and 26 and 3 gaps, respectively (data set 2). Data set 1 comprised 2,338 characters (with approximately 35% missing data), 1,303 variable characters, and 1,071 parsimony-informative characters. Data set 2 comprised 1,623 characters (with insignificant missing data), 780 variable characters, and 503 parsimony-informative characters. Sequence properties were determined based on data set 1 only. Character information and nucleotide composition of sequences are summarized in Table 2. These observations reflect patterns obtained by other authors for mitochondrial genes (Campbell, 1997, unpubl.; Slowinski and Keogh, 2000; Lenk et al., 2001). Table 3 includes minimum, maximum, and mean corrected (based on the scheme identified by ModelTest) sequence divergences between advanced snake taxa after exclusion of terminals with extensive missing data. These values suggest that Cytb and ND4 genes evolve at comparable rates, as do 12S and 16S, but that the mitochondrial RNA genes evolve much more slowly than do the protein-coding genes. This finding is in agreement with other published findings (Crozier, 1990). Plots of uncorrected versus corrected sequence divergence (not shown) were linear for both substitution types in the 12S and 16S sequences and at first and second coding positions in the Cytb and ND4 sequences. In both of the protein genes, third-position TVs deviated from linearity but maintained a positive gradient. However, for third-position TIs the relationship had a final gradient of effectively zero, suggesting that this class of substitution is saturated. The TI:TV ratios for third positions were estimated at 64:1 (Cytb) and 16:1 (ND4) (data set 1) and at 36:1 (Cytb) (data set 2), so third-position TIs were downweighted in the relevant MP weighting schemes by factors of 64, 16, and 36, respectively. Significant phylogenetic signal (P < 0.01) was present in each single gene (g1 = −0.213 for Cytb, −0.376 for ND4, −0.430 for 12S, −0.440 for 16S) in data set 1 both before (63 terminals, g1 = −0.375) and after (36 terminals, g1 = −0.302) removal of clades with bootstrap support >50% and in data set 2 (10 terminals, g1 = −0.600). The likelihood-ratio test strongly rejected the hypothesis that 446 VOL. 52 SYSTEMATIC BIOLOGY TABLE 2. Nucleotide composition and character information for each gene fragment, based on data set 1. For third positions in protein-coding genes, character information is given for transitions (TI) and transversions (TV) separately. There was no significant change in numbers of variable and parsimony-informative characters when calculations were repeated without the outgroup. Tests of base homogeneity across taxa were based on variable sites only. Base frequencies Gene Cytb ND4 12S 16S a Codon position or data type 1 2 3 1 2 3 Bases Gaps Bases Gaps No. (%) characters A C G T Probability of base homogeneity across taxaa 0.33 0.22 0.38 0.36 0.16 0.46 0.38 0.23 0.27 0.38 0.27 0.31 0.32 0.24 0.18 0.12 0.05 0.18 0.12 0.05 0.18 0.26 0.39 0.19 0.20 0.41 0.17 0.20 1.000 1.000 0.000 1.000 1.000 0.000 0.999 0.35 0.21 0.19 0.25 1.000 Total Variable Parsimony informative 268 268 87 TI, 181 TV 231 231 58 TI, 174 TV 421 31 381 7 128 (48%) 74 (28%) 86 TI, 177 TV (98%) 131 (57%) 68 (29%) 58 TI, 172 TV (99%) 229 (54%) 31 (100%) 142 (37%) 7 (100%) 98 46 83 TI, 172 TV 110 47 52 TI, 169 TV 173 21 99 1 Based on a chi-square test. mtDNA conforms to molecular clock assumptions (n = 16, χ 2 = 36.12, P < 0.001). Phylogenetic Analyses of Data Set 1 Subset analysis.—Relevant information on each data subset is given in Table 4. Optimal MP analysis of individual genes produced a single MPT in each case (not shown). These analyses retrieved roughly comparable clades, but branching order varied considerably between gene trees. However, none of the differences between these trees were strongly supported by bootstrap values, and one of the most striking features of all four phylogenies was extremely poor bootstrap support, especially for backbone nodes. This result parallels trends in bootstrap or decay index support obtained in similar mtDNA sequence–based studies involving both snake phylogenetics (Kraus and Brown, 1998; Slowinski and Keogh, 2000; Vidal et al., 2000; Gravlund, 2001) and the phylogenetics of other groups (e.g., artiodactyl mammals: Matthee et al., 2001). However, there was a marked increase in bootstrap support (Table 4) and tree congruence in analyses involving gene combinations. Predictably, for analyses of data subsets the proportion of nodes with bootstrap support of ≥50% was significantly correlated positively with the number of variable and parsimony-informative characters and negatively with the number of terminal taxa (P < 0.05). Also, it is significant to note that the MPT from analysis of the Cytb + ND4 data subset (the only genes representing TABLE 3. Sequence divergences between ingroup taxa for each gene (after exclusion of terminals with extensive missing data), corrected using the results of ModelTest analyses. Corrected pairwise distances Gene Maximum Minimum Mean Cytb ND4 12S 16S 2.27 2.95 0.70 0.39 0.33 0.16 0.05 0.04 1.26 1.26 0.31 0.15 Aparallactus) contained a monophyletic Atractaspididae, with 99% bootstrap support. Figure 1 illustrates the single trees from MP and ML analysis of the data subset comprising all four genes and including only those taxa represented by all genes. Almost all nodes in the MP tree have support values significantly >50%. In the retrieval of major clades and the relationships between these clades, the ML tree is entirely consistent with the optimal topologies from all analyses of data set 1 (Figs. 2–5), and the MP tree differs only in the placement of Atractaspididae, despite extensive taxon sampling problems. Supertree consensus.—In the MRP supertree consensus analysis, Fitch parsimony performed better than Camin–Sokal parsimony, although this difference was not significant (P > 0.05). Figure 2 illustrates the tree with the highest log-likelihood from the set of 13 mostparsimonious supertrees retrieved. Maximum parsimony.—Weighting Scheme 1a (equally weighted data with successive approximation) was identified as the optimal scheme for MP analysis of data set 1. This scheme was superior to all other weighting schemes, significantly so for all except Scheme 1 (without successive approximation) (P < 0.05). Figure 3 illustrates the tree with the highest log-likelihood from the set of 11 MPTs retrieved. Eighty percent of nodes have ≥50% bootstrap support, and 58% of nodes have ≥70% bootstrap support. LS values for the MP bootstrap tree profile are given in Table 5. Pareatines (Aplopeltura boa and Pareas nuchalis) and homalopsines (Enhydris and Cerberus rhynchops) were considerably less stable than other taxa. Maximum likelihood.—ML analysis of data set 1 produced a single tree (Fig. 4). Bayesian inference.—The two independent Bayesian analyses of data set 1 converged on similar log-likelihood scores and reached stationarity at no later than 200,000 generations (plots not shown). The first 2,000 sample points were thus discarded as burn-in, leaving 8,000 samples for construction of a 50% majority rule consensus tree. The consensus topologies of both runs were almost identical, and the posterior probability values supporting congruent nodes were highly correlated (not shown), 2003 447 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES TABLE 4. Subsets of data set 1 (single genes and all combinations of two, three, and four genes) for supertree consensus. Weighting schemes are equal weighting (1), differential codon position weighting in which saturated substitution subsets were downweighted according to their TI:TV ratio (2), and differential weighting based on the output of ModelTest (3). Those schemes incorporating further weighting by successive approximation are marked with an “a.” Those marked with an asterisk were significantly better than competing schemes (α = 0.05), as assessed using the SH test. Where differences between weighting schemes were not significant, the scheme producing the fewest MPTs was chosen. Bootstrap analyses applied character weights where relevant. No. characters Data subset Cytb ND4 12S 16S Cytb, ND4 Cytb, 12S Cytb, 16S ND4, 12S ND4, 16S 12S, 16S Cytb, ND4, 12S Cytb, ND4, 16S Cytb, 12S, 16S ND4, 12S, 16S All genes No. terminals Variable Parsimony informative 41 34 50 44 18 38 31 22 23 43 16 16 31 22 16 465 429 260 149 797 707 574 613 524 398 981 890 808 736 1086 399 378 194 100 636 570 468 479 418 288 742 681 631 559 809 Best weighting scheme Scheme No. MPTs 3a 2 1 1 1a∗ 1a 1a 1a 1a∗ 1 1a 1a 1a 1a∗ 1a 1 2 102 64 1 1 1 1 1 2 1 1 1 1 1 providing further evidence that the analyses converged. Figure 5 illustrates the 50% majority rule consensus tree from one of the runs. Forty-six percent of nodes have posterior probability values ≥95%. Phylogenetic relationships.—These results are based on Figures 2–5 (MRP, MP, ML, and Bayesian analyses of data set 1). All analyses supported monophyly of advanced snakes with respect to Boa constrictor, and clades 1–9 (Figs. 2–5) were always retrieved (ignoring Homoroselaps, the placement of which appeared to be relatively random). In clade 2 (Viperidae), there was much support for monophyly of the subfamily Crotalinae and of the Old World (Trimeresurus and Calloselasma) and New World (Crotalus and Agkistrodon) crotalinae taxa. Viperinae was not monophyletic. In clade 6, MP and ML analyses provided moderate support for monophyly of the Pseudoxyrophiinae, but Lamprophiinae and Atractaspididae were only monophyletic in the MRP tree. The MP, ML, and Bayesian topologies placed Aparallactinae in the Lamprophiinae, as sister to Mehelya, and Atractaspininae was variously associated with the Elapidae, Lamprophiinae, and Homoroselaps. Elapidae was monophyletic in the MP and ML topologies. Two elapid clades were present throughout, Maticora + Micrurus + Micruroides and Naja + Boulengerina, and a Dendroaspis + Hydrophis clade was retrieved in the MRP, MP, and ML analyses. The elapid subfamilies Elapinae and Hydrophiinae were not monophyletic. In clade 7 (Natricinae excluding Macropisthodon), the African and European genus Natrix was basal to a New World (Nerodia and Thamnophis) clade (Thamnophiini; Rossman and Eberle, 1977). The Southeast Asian species Macropisthodon rudis was consistently placed within Colubrinae. Chosen scheme (if different from best scheme) Scheme No. MPTs 2a 1a 1a 1 1 1 1a 1 Proportion of nodes with bootstrap support ≥50% 0.47 0.61 0.36 0.32 0.93 0.71 0.75 0.89 0.90 0.60 0.92 0.85 0.86 0.95 1.00 In clade 8 (Xenodontinae), there was much support for monophyly of the Xenodontini (e.g., Jenner, 1981, unpubl.), here represented by Xenodon, and Liophis. In the MP and ML topologies, the South American taxa (Hydrodynastes, Helicops, Alsophis, Xenodon and Liophis) and Central American taxa (Dipsas) formed a clade distinct from Farancia and Heterodon (North American representatives). In the ML and Bayesian topologies, the South, Central, and North American taxa separated into distinct clades. Homoroselaps emerged within the Xenodontinae in the MRP and MP topologies. In clade 9, the Colubrinae was monophyletic (excluding Dendrelaphis, with Gastropyxis in the Bayesian topology), and Macropisthodon, Dendrelaphis, and Calamariinae (Oreocalamus) were always closely associated. Within Colubrinae, all trees contained the clades Lycodon + Dinodon and Crotaphopeltis (Dasypeltis, Boiga), and in the MP, ML, and Bayesian topologies these two clades were collectively monophyletic. Grayia usually emerged in a relatively basal position in clade 9. There was widespread agreement between the four analyses regarding relationships between these major clades, and the basal portion of the tree was identical for all reconstructions: Boa((Acrochordidae, Xenoderminae) (Viperidae (remaining Colubroidea))). In the MRP, ML, and Bayesian topologies, Pareatinae was the next to branch off. Clade 4 and clade 6 were always sister groups, clade 7 and clade 8 were sister groups in the MRP, MP, and ML topologies, and clades 7–9 were always collectively monophyletic. Phylogenetic Analyses of Data Set 2 Maximum parsimony.—Weighting Scheme 1 (equally weighted data without successive approximation) was identified as the optimal scheme for MP analysis of 448 SYSTEMATIC BIOLOGY VOL. 52 FIGURE 1. The single best phylograms from MP and ML analysis of all four genes combined, including only those terminal taxa with no missing genes. (a) MP, applying the optimal weighting scheme (Scheme 1a): consistency index = 0.60, retention index = 0.68. (b) ML, applying the GTR + I + 0 model of DNA substitution with parameter values estimated by ModelTest: −ln L = 16585.7. Numbers are bootstrap (MP) support values for the associated nodes. Supraspecific terminals are labeled with generic names only. Labels next to the tree are subfamily and family names (Table 1). 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES 449 FIGURE 2. The most likely of 13 best phylograms (generated using Fitch parsimony) from the MRP supertree consensus method, using as input the MPTs resulting from optimal weighted analyses of all subsets of data set 1: consistency index (CI) = 0.64, retention index (RI) = 0.86. Nodes on the input trees were weighted according to their bootstrap support. When assessed in relation to the original unweighted sequence data, CI = 0.25, RI = 0.31. Open circles mark nodes that collapse in the strict consensus, and asterisks mark nodes shared with the MP hypothesis (Fig. 3). Branch lengths are proportional to the number of unambiguous changes in the original sequence data. Supraspecific terminals are labeled with generic names only. Labels next to the tree are subfamily names (Table 1), and the associated numbers correspond to clade numbers in the text. data set 2. Figure 6a illustrates the single MPT retrieved. Seventy-one percent of nodes have ≥50% bootstrap support, and 29% of nodes have ≥70% bootstrap support. Maximum likelihood.—ML analysis of data set 2 produced a single tree (Fig. 6b). Bayesian inference.—The two independent Bayesian analyses of data set 2 converged on similar loglikelihood scores and reached stationarity at no later than 50,000 generations (plots not shown). The first 500 sample points were thus discarded as burn-in, 450 SYSTEMATIC BIOLOGY VOL. 52 FIGURE 3. The most likely of 11 best phylograms from MP analysis of data set 1, applying the optimal weighting scheme (Scheme 1a): consistency index = 0.36, retention index = 0.54. Open circles mark nodes that collapse in the strict consensus. Asterisks mark nodes shared with the ML hypothesis (Fig. 4) (ignoring Homoroselaps). Numbers on the tree are bootstrap support values for the associated nodes, and branch lengths are proportional to the number of unambiguous changes. Supraspecific terminals are labeled with generic names only. Labels next to the tree are subfamily names (Table 1), and the associated numbers correspond to clade numbers in the text. leaving 9,500 samples for construction of a 50% majority rule consensus tree. The consensus topologies of both runs were identical, and the posterior probability values supporting congruent nodes were highly correlated (not shown). Figure 6c illustrates the 50% majority rule consensus tree from one of the runs. Sixty- seven percent of nodes have posterior probability values ≥95%. Phylogenetic relationships.—In their placement of Acrochordus as sister to the Colubroidea, the three topologies were identical. Support for this relationship was poor in the MP reconstruction but high in the Bayesian analysis. 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES TABLE 5. The stability of terminal taxa (LS values) across the MP bootstrap tree profile of data set 1 (1,635 rooted trees) returned by the program RadCon. Genus (leaf) Boa Acrochordus Achalinus Xenodermus Crotalus Agkistrodon Trimeresurus Calloselasma Vipera Azemiops Bitis Causus Micrurus Micruroides Naja Boulengerina Maticora Dendroaspis Aspidelaps Bungarus Dasypeltis Boiga Dinodon Lycodon Crotaphopeltis Ophiophagus Coluber Elapsoidea Elaphe Hydrophis Laticauda Aparallactus Mehelya Helicops Atractaspis Farancia Alsophis Heterodon Madagascarophis Gastropyxis Dipsas Leioheterodon Grayia Lamprophis Dendrelaphis Nerodia Thamnophis Macropisthodon Natrix Hydrodynastes Xenodon Liophis Homoroselaps Psammophylax Hemirhagerrhis Rhamphiophis Malpolon Psammophis Oreocalamus Enhydris Cerberus Aplopeltura Pareas LS (difference) 1.000 0.997 0.996 0.996 0.982 0.982 0.982 0.982 0.982 0.982 0.982 0.982 0.884 0.884 0.884 0.884 0.884 0.882 0.882 0.881 0.881 0.881 0.881 0.881 0.880 0.880 0.879 0.879 0.879 0.878 0.877 0.877 0.876 0.874 0.874 0.874 0.874 0.873 0.871 0.870 0.869 0.867 0.864 0.858 0.858 0.854 0.854 0.853 0.852 0.850 0.848 0.848 0.817 0.810 0.810 0.810 0.810 0.809 0.801 0.717 0.717 0.634 0.634 451 D ISCUSSION Utility of mtDNA In this study, individual genes were often incongruent in their retrieval of phylogenetic relationships (although none of this incongruence was strongly supported by bootstrap values), and support overall was extremely poor, especially for internal nodes. In contrast, different gene combinations retrieved congruent relationships and nodal support was generally high. The significant positive and negative correlations of bootstrap support with the number of variable and parsimony-informative characters and the number of terminal taxa, respectively, suggests that support (and by implication accuracy) is largely dependent on the character: terminal taxon ratio (a conclusion reached by Bremer et al., 1999, for the Rubiaceae). This is likely to apply especially to data with high levels of homoplasy, so we infer that all of the genes used here are substantially homoplastic at the hierarchical levels investigated. The pattern of nodal support was similar in many of the trees; support was moderate or high up to the level of the Viperidae. Beyond this similarity, support was generally restricted to terminal and subterminal nodes, with little support for many backbone nodes. This pattern was consistent for both the rapidly evolving (Ctyb and ND4) and the more slowly evolving (12S and 16S) genes, suggesting that the relatively rapid overall rate of mtDNA evolution is not a factor hampering retrieval of higher relationships. Some researchers have suggested that the Miocene radiation of Colubroidea was extremely rapid (e.g., Cadle, 1994; Greene, 1997). An explosive colubroid radiation above the level of the Viperidae would account in part for the patterns and high levels of homoplasy observed here. Any rapid radiation will result in short branch lengths linking radiated taxa if the rate of evolution of characters used in tree construction is not speeded by the radiation. There is no reason to expect the rate of evolution of mtDNA to increase during rapid radiation. Additionally, if the radiation were relatively ancient (as postulated here), there would be loss of phylogenetic signal along the short branches due to multiple hits (substitutional saturation), resulting in homoplasy. Single mitochondrial genes are not optimal for investigations of higher relationships among advanced snakes, but the effectiveness of mtDNA increases with increasing character:taxon ratios. Despite extensive taxon sampling problems, the trees in Figure 1 are almost entirely consistent with all of the optimal topologies from analyses of data set 1. Thus, the failure of previous molecular studies to accurately retrieve higher relationships among advanced snakes may have been due to insufficient character inclusion rather than taxon sampling problems. Caenophidian Phylogenetic Relationships Many of the groups for which we have found evidence of monophyly are well supported by morphological characters and have been retrieved in other molecular 452 SYSTEMATIC BIOLOGY VOL. 52 FIGURE 4. The single ML phylogram from analysis of data set 1 under the GTR + I + 0 model of DNA substitution, with parameter values estimated by ModelTest: −ln L = 34275.4. Asterisks mark nodes shared with the MP hypothesis (Fig. 3) (ignoring Homoroselaps). Branch lengths are proportional to the number of substitutions per site. Supraspecific terminals are labeled with generic names only. Labels next to the tree are subfamily names (Table 1), and the associated numbers correspond to clade numbers in the text. studies. The Viperidae, Xenoderminae, Pareatinae, Psammophiinae, Pseudoxyrophiinae, Homalopsinae, Natricinae, Xenodontinae, and Colubrinae (with redefinition) fall into this category (e.g., Rossman and Eberle, 1977; McDowell, 1987; Cadle, 1988, 1994; Heise et al., 1995; Dowling et al., 1996; Kraus and Brown, 1998; Vidal et al., 2000; Gravlund, 2001). Our results serve to reinforce these clades. Clade 1.—Consequent upon the presence of a wellsupported basal (Acrochordus + Xenoderminae) clade in all of our topologies, we suggest removal of Xenoderminae from the Colubroidea. Kraus and Brown 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES 453 FIGURE 5. The 50% majority rule consensus tree from Bayesian analysis of data set 1 under the GTR + I + 0 model of DNA substitution: mean −ln L = 34339.7. Asterisks mark nodes shared with the ML hypothesis (Fig. 4) (ignoring Homoroselaps). Numbers on the tree are posterior probability values for the associated nodes, and branch lengths are averaged over all input trees. Supraspecific terminals are labeled with generic names only. Labels next to the tree are subfamily names (Table 1), and the associated numbers correspond to clade numbers in the text. (1998) also reached this conclusion (using the ND4 data included here), and Dowling (1989, unpubl.) placed Xenoderminae with Homalopsinae in his superfamily “Acrochordoidea.” Clade 2.—Within Viperidae, Azemiops feae has been considered either basal (Liem et al., 1971) or related to Asian representatives of the Crotalinae (e.g., Parkinson, 1999). However, all analyses of data set 1 strongly support placement of Azemiops in a clade with Viperinae and Causinae. Causinae was suggested by Groombridge (1984) to be sister to the remaining viperids, and Dowling et al. (1996) found Causinae to lie outside a Viperinae + Crotalinae clade. However, Causinae and Viperinae are united by several synapomorphic morphological 454 SYSTEMATIC BIOLOGY VOL. 52 FIGURE 6. The best phylograms from MP, ML, and Bayesian analyses of data set 2. (a) MP (single tree), applying the optimal weighting scheme (scheme 1, without subsequent successive approximation): length = 1,514, consistency index = 0.53, retention index = 0.32. (b) ML (single tree), applying the optimal model (GTR + I + 0) and parameters identified by ModelTest: −ln L = 8969.2. (c) Bayesian analysis (50% majority rule consensus tree), applying the GTR + I + 0 model of DNA substitution: mean −ln L = 8979.4. Numbers are bootstrap (MP) and posterior probability (Bayesian analysis) values for the associated nodes. Supraspecific terminals are labeled with generic or family names only. Labels next to the tree are higher taxon names (Table 1). 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES characters (Underwood, 1999), and our results suggest a link between these taxa, although not the sister relationship assumed by Lenk et al. (2001). Despite much support for our groupings, we consider our taxon sampling within the Viperidae to be insufficient to clarify subfamily relationships. The basal position of Viperidae within the Colubroidea, as obtained here, is supported by the immunological results of Cadle (1988). The preferred topology of Kraus and Brown (1998, generated from ND4 sequences using TVs only) places Viperidae in a terminal position within Colubroidea, whereas their NJ tree based on all variable sites is consistent with the results of the present study. Our saturation analysis of ND4 sequences indicates that only third-position TIs are substantially saturated. We thus consider Kraus and Brown’s exclusion of other transitions to be in error. Clade 3.—Pareatinae is a morphologically well-defined group (McDowell, 1987), but its affinities are obscure. In our MRP, ML, and Bayesian topologies, Pareatinae branched immediately after the Viperidae and was sister to the remaining Colubroidea. However, support for this relationship was generally poor and pareatines were relatively unstable in the MP bootstrap tree profile (Table 5). Similarly, Slowinski and Lawson (2002) were unable to confidently resolve the relationships of this group other than to ascertain that it tends to occupy a basal position in the Colubroidea, sometimes associating with the Viperidae. We can thus make no conclusions about placement of the Pareatinae. Clade 4.—Psammophiinae is also morphologically well defined (Bogert, 1940; Underwood, 1967; Bourgeois, 1968; Dowling and Duellman, 1978); however, the relationship of Psammophiinae to other taxa is disputed. On the basis of retinal characters, hemipenial morphology (Underwood, 1967), and minimal molecular evidence (George and Dessauer, 1970; Minton and Salanitro, 1972), Rasmussen (1985) included them in the Colubrinae. However, our results support no such association, and Cadle (1994) found significantly greater albumin divergence among psammophiine genera than among colubrines, suggesting a much earlier divergence of the former group compared with the latter (assuming a molecular clock). McDowell (1987) included psammophiines in his Boaedontinae (roughly comparable to the Lamprophiinae of this study), but this position is also incongruent with the present results. All of our analyses placed Psammophiinae around the middle of the tree, as sister to clade 6. Clade 5.—Homalopsinae has been associated with the Acrochordidae and the Natricinae (e.g., Dowling and Duellman, 1978), with Dasypeltis (now considered a colubrine) (Underwood, 1967; Rasmussen, 1985), with the Boiginae (now part of the Colubrinae) (Underwood, 1967), and with Viperidae (Rasmussen, 1985; Gravlund, 2001). The present results support none of these relationships but offer no solid alternatives. The retrieval of Homoroselaps as sister to the Homalopsinae in our Bayesian analysis is almost certainly erroneous. The affinities of Homalopsinae remain obscure. 455 Clade 6.—Several key issues relate to clade 6. The first issue is the monophyly of the Atractaspididae. The Atractaspididae as here constituted was originally proposed by Bourgeois (1968) on the basis of skull morphology (her subfamily Aparallactinae, within the “Colubridae”). Heymans (1975) identified the same grouping on the basis of jaw musculature and erected the family Atractaspididae (Heymans, 1982), within which Atractaspis was assigned to the subfamily Atractaspidinae (our Atractaspininae). The remaining genera were placed in the subfamily Aparallactinae. Since then, the monophyly of Atractaspididae (with respect to Atractaspis) has been debated, and the placement of Aparallactus within Atractaspididae has been questioned. Underwood and Kochva (1993) identified two unique morphological synapomorphies for the Atractaspididae: wedge-shaped dorsal process of the premaxilla and lateral position of the rectal glands. In our MRP topology, the Atractaspididae was monophyletic and was sister to the remainder of clade 6. The MPT from analysis of the Cytb + ND4 data subset (the only genes representing Aparallactus) also contained a monophyletic Atractaspididae, with 99% bootstrap support (in the absence of lamprophiine representatives). However the MP, ML, and Bayesian trees placed Aparallactus sister to Mehelya (Lamprophiinae). This placement is in agreement with the findings of McDowell (1987), who placed Aparallactus in his Boaedontinae (roughly comparable to our Lamprophiinae), and with the immunological data of Cadle (1994), which revealed no close relationship between Aparallactus and Atractaspis. We consider our sample of atractaspid taxa insufficient to resolve this issue, and solid conclusions on the status of the Atractaspididae await inclusion of additional related taxa. The second issue is the position of Homoroselaps. Prior to McDowell’s (1968) study, this southern African genus was considered an elapid on the basis of its proteroglyphous maxilla. McDowell (1968) argued for its transfer to the Aparallactinae, but Underwood and Kochva (1993) returned it to the Elapidae based on a variety of morphological characters. However, Underwood and Kochva stated that the majority of evidence indicated a primary dichotomy between Homoroselaps and the Elapidae. In a comprehensive mtCytb study of the Elapidae, Slowinski and Keogh (2000) found Homoroselaps to lie outside a clade containing the remaining elapids and found no association betweenHomoroselaps and their xenodontine representatives (Farancia and Heterodon). Our placements of Homoroselaps within the Xenodontinae (MRP and MP topologies) and as sister to the Homalopsinae (Bayesian topology) are almost certainly erroneous, but our ML tree suggests that the affinities of Homoroselaps may lie with the Atractaspididae. The third issue is the monophyly of and relationships between the Lamprophiinae and Pseudoxyrophiinae. As constituted here, Lamprophiinae + Pseudoxyrophiinae would be roughly equivalent to the Boaedontinae of McDowell (1987) and would also be similar to the 456 SYSTEMATIC BIOLOGY former Lycodontinae of Underwood (1967), Dowling and Duellman (1978), Dowling et al. (1983), and Cadle (1987). The type genus of Lycodontinae (Lycodon) is a colubrine, so this name has been relegated to the synonymy of Colubrinae (Cadle, 1994; Kraus and Brown, 1998). Lamprophiinae and Pseudoxyrophiinae contain most noncolubrine, nonnatricine, nonpsammophiine African snakes (Lamprophiinae) and Madagascan snakes (Pseudoxyrophiinae) belonging to the “Colubridae.” We know of no unique morphological synapomorphies supporting either of these groupings. McDowell (1987) tentatively included Mehelya in his Boaedontinae but noted that the salivary glands, hemipenes, and vertebrae were atypical. He also tentatively included Aparallactus on the basis of skull musculature. Our results certainly suggest that Mehelya is a lamprophiine and that it is closely related to Aparallactus. Based on immunological comparisons, Cadle (1994) found evidence of monophyly of a sample of three African lamprophiines (Lamprophis, Mehelya, and Lycodonomorphus) but retrieved no association between this clade and Leioheterodon (a pseudoxyrophiine). Our analyses recovered no especially close relationship between the Lamprophiinae and Pseudoxyrophiinae, and the MRP and Bayesian analyses placed Pseudoxyrophiinae within the Elapidae. Thus, inclusion of additional taxa from the Lamprophiinae, Pseudoxyrophiinae, Atractaspididae, and Elapidae will be necessary to resolve the status and composition of the Lamprophiinae and Pseudoxyrophiinae. The fourth issue is the composition of and relationships within the Elapidae. Our failure to demonstrate monophyly of the Elapinae and Hydrophiinae may result from taxon sampling problems rather than from real disagreements. We consider our sample of elapid taxa too small for solid conclusions about intraelapid relationships to be made. However, the strongly supported association of Dendroaspis and Hydrophis is interesting in its possible implications and requires further investigation. At the least it suggests that the African mambas (Dendroaspis) are not typical elapines, an inference supported by hemipenial morphology (Branch, pers. comm.). The fifth issue is the age of the elapid, atractaspid, and lamprophiine radiations. Each of these three taxa has been considered by various authors to be an ancient colubroid lineage: Elapidae (McDowell, 1987; Gravlund, 2001), Atractaspididae (Rasmussen, 1985; McDowell, 1987; Underwood and Kochva, 1993; Cadle, 1994; Gravlund, 2001), and Lamprophiinae (Cadle, 1994; Gravlund, 2001). All of our analyses, however, suggest that these taxa group together in a relatively terminal position on the colubroid tree and are not extremely early radiations. Clade 7.—Natricinae is characterized mainly by hemipenial synapomorphies; the entire hemipenis is spinose, usually with large asymmetrically placed spines at the base. Calyces are usually absent, and the sulcus spermaticus is forked and centripetal or is simple (McDowell, 1987). Most studies have found the Natricinae to be monophyletic, with the New World taxa (Thamnophiini) and Old World taxa grouping separately VOL. 52 (Rossman and Eberle, 1977; Dowling et al., 1996; Kraus and Brown, 1998; Fig. 6). All of our topologies contained a monophyletic core natricine group but placed the one Asian taxon (Macropisthodon rudis) with the Colubrinae. However, in a suboptimal ML tree (not significantly different from the best topology) Macropisthodon emerged as sister to the remaining Natricinae. The Natricinae are usually found to be closest to the Colubrinae. Dowling et al. (1996) and Cadle (1994) retrieved this relationship from protein comparisons and immunological data, respectively. However, our analyses suggest that Natricinae is most closely allied to the Xenodontinae. This result was also obtained by Dowling et al. (1983), although the natricine–colubrine link was retrieved as an equally plausible alternative. However, because Macropisthodon grouped with the colubrines in our best trees, addition of further Asian taxa to our analysis may shift the Natricinae towards a sister relationship with the Colubrinae. Clade 8.—Xenodontinae is an extremely large (>600 species) and heterogeneous New World assemblage (Greene, 1997) characterized mainly by a forked sulcus spermaticus or, if the sulcus is single, a unicapitate hemipenis (Jenner, 1981, unpubl.). Based on albumin immunological data, Cadle (1984, 1985) recognized two major lineages: the South American xenodontines and the Central American xenodontines. However, he found no clear evidence of relationship between the two, and the North American representatives showed no affinities with either group. However, using mtDNA 12S and 16S gene fragments and a comprehensive sample of taxa, Vidal et al. (2000) found Xenodontinae to be monophyletic and divided into North, Central, and South American clades, with the latter two being collectively monophyletic. Within the South American group, the tribes Xenodontini, Hydropsini, and Pseudoboini were monophyletic. Most xenodontines possess derived type IIIb retinas, whereas pseudoboine retinas are of the ancestral type I, which suggests that retinal composition is an extremely labile character (possibly strongly correlated with lifestyle) and is thus of little use in snake phylogenetics. Our results supported monophyly of the Xenodontini and of Xenodontinae as a whole. The MP, ML, and Bayesian hypotheses agreed that the South and Central, American forms are collectively monophyletic with respect to North American taxa, and in the ML and Bayesian topologies the South, Central, and North American taxa separated into distinct clades. The present results and those of Vidal et al. (2000) suggest that xenodontines are derived colubroids. This finding is in discord with claims by numerous other authors that the Xenodontinae represents a cluster of extremely old New World colubroids (e.g., Cadle, 1984, 1985; Dowling et al., 1983; Dowling et al., 1996). However, Kraus and Brown (1998) suggest some inconsistencies associated with these studies that may explain the observed discrepancies. Our retrieval of Homoroselaps within the Xenodontinae is almost certainly erroneous. Clade 9.—Colubrinae is one of the most often retrieved clades in phylogenetic analyses of snakes. It is generally diagnosed by the apparently synapomorphic presence of 2003 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES asymmetric hemipenes, although not all colubrine taxa exhibit this condition (e.g., McDowell, 1987; Kraus and Brown, 1998). Most studies agree that Colubrinae occupies a terminal position in the Colubroidea (e.g., Cadle, 1988, 1994; Heise et al., 1995; Dowling et al., 1996), a result also obtained by us. We agree with McDowell (1987), Cadle (1994), and Kraus and Brown (1998) that Lycodon (and probably other members of Dowling and Duellman’s, 1978, tribe Lycodontini) belongs in the Colubrinae. Further, our analyses consistently retrieved a close and well-supported association between Dinodon (also part of the tribe Lycodontini) and Lycodon, a result mirrored by Vidal et al. (2000). We thus suggest that a tribe Lycodontini be recognized within the Colubrinae, comprising (until more data are brought to bear on the issue) the taxa included in the Lycodontini of Dowling and Duellman (1978). Another grouping retrieved in all of our analyses of data set 1 was Crotaphopeltis(Dasypeltis, Boiga). These taxa (excluding Dasypeltis) have been included in the subfamily Boiginae of Bourgeois (1968) and the tribe Boigini of Dowling and Duellman (1978) and were found by Gravlund (2001) to form a clade. Dasypeltis possesses a suite of highly specialized adaptations linked to its exclusive diet of birds’ eggs, and its placement on the basis of morphological characters has been problematic. Dowling and Duellman (1978) recognized its colubrine affinities and placed it in their tribe Colubrini of the Colubrinae. However, Dasypeltis has often been assigned its own family (Dasypeltinae; Bourgeois, 1968) and has even been associated with the Homalopsinae (Underwood, 1967; Rasmussen, 1985). Our results and those of Gravlund (2001, data included here) clearly support inclusion of Dasypeltis in the Colubrinae, and we suggest recognition of Dowling and Duellman’s tribe Boigini, expanded to include Dasypeltis. Our MP, ML, and Bayesian trees showed a sister relationship between the Lycodontini and Boigini of this study, a result also obtained by Heise et al. (1995), Vidal et al. (2000), and Gravlund (2001). Based on only two taxa, Dendrelaphis (Asian) and Gastropyxis (African), we find no support for the tribe Philothamnini (Dowling and Duellman, 1978). The choice of Oreocalamus as our sole representative of the Calamariinae was unfortunate because the placement of this genus is uncertain (Coborn, 1991). McDowell (1987) considered Calamariinae to be a morphologically well-defined group of unknown affinities. Our results consistently placed it in close association with the Colubrinae, normally grouped with Dendrelaphis and Macropisthodon. Oreocalamus and Dendrelaphis are both restricted to East Asia, so their proposed relationship may be worth further pursuit (Kraus and Brown, 1998). Colubrine affinities of the Calamariinae were also suggested by Dowling and Duellman (1978), who recognized the taxon as a tribe of their Colubrinae. The enigmatic genus Grayia has previously been associated with the Colubrinae based on immunological data (Cadle, 1994). This is surprising given the large differences in hemipenial morphology between the two taxa: Grayia has a symmetrical hemipenis with a forked sulcus spermaticus, whereas Colubrinae is characterized 457 by an asymmetric organ. Our data are insufficient to resolve the exact relationship between Grayia and the Colubrinae, but in all our trees Grayia occupied a relatively basal position. We found no association of Grayia with the Natricinae (McDowell, 1987; Cadle, 1994) or the Xenodontinae (McDowell, 1987). Position of Acrochordus Acrochordidae and the Colubroidea share a variety of morphological synapomorphies, but multiple morphological synapomorphies of the Colubroidea act to separate these two taxa (Lee and Scanlon, 2002), and Schwaner and Dessauer (1982) found substantial differences in the transferrins of these two groups. Consequently, Acrochordidae and Colubroidea have generally been considered sister taxa (Rieppel, 1988; Greene, 1997; Pough et al., 2001; Lee and Scanlon, 2002), a relationship also retrieved by Slowinski and Lawson (2002). Our analyses strongly support a sister relationship between Acrochordus + Xenoderminae and the Colubroidea, and based on Figure 6 we reject claims by other authors (McDowell, 1987; Heise et al., 1995) that Acrochordidae may occupy a position closer to the root of the Alethinophidia. In this study, we found no association between Acrochordidae and either Natricinae or Homalopsinae (contra Dowling and Duellman, 1978; Dowling et al., 1983, 1996). ACKNOWLEDGMENTS We thank Bill Branch, Don Broadley, and Ulrich Joger for provision of tissue samples. Peter Gravlund allowed us to use some of his 12S and 16S sequence data not yet available in GenBank. Peter Lenk explained the method for conversion of ModelTest results to MP weighting schemes, the PAUP∗ support group explained some of the intricacies of that program, and Bill Branch and Don Broadley provided useful biological information. Chris Simon, L. Lacey Knowles, and two anonymous reviewers provided constructive criticism. C.M.R.K. thanks TLJC and Jan Legge for continued support and encouragement. This study was funded by National Research Foundation (South Africa) grants to N.P.B. and M.H.V. and by the Rhodes Trust (C.M.R.K.). R EFERENCES ANDERSON, J. S. 2001. The phylogenetic trunk: Maximal inclusion of taxa with missing data in an analysis of the Lepospondyli (Vertebrata, Tetrapoda). Syst. Biol. 50:170–193. B AUM , B. R. 1992. Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining gene trees. Taxon 41:3–10. BININDA-EMONDS , O. R. P., AND H. N. B RYANT . 1998. Properties of matrix representation with parsimony analyses. Syst. Biol. 47:497– 508. BININDA-EMONDS , O. R. P., H. N. B RYANT , AND A. P. R USSELL. 1998. Supraspecific taxa as terminals in cladistic analysis: Implicit assumptions of monophyly and a comparison of methods. Biol. J. Linn. Soc. 64:101–133. B OGERT , C. M. 1940. Herpetological results of the Vernay Angola Expedition. Part 1. Snakes, including an arrangement of African Colubridae. Bull. Am. Mus. Nat. Hist. 77:1–107. B OURGEOIS , M. 1968. Contribution à la morphologie comparée du crâne des Ophidiens de l’Afrique Centrale. Publ. Univ. Off. Congo 13:1–239. B REMER, B., R. K. J ANSEN, B. O XELMAN, M. B ACKLUND , H. LANTZ, AND K. K IM . 1999. More characters or more taxa for a robust phylogeny— Case study from the coffee family (Rubiaceae). Syst. Biol. 48:413–435. CADLE, J. E. 1984. Molecular systematics of xenodontine colubrid snakes. III. An overview of xenodontine phylogeny and the history of New World snakes. Copeia 1984:641–652. 458 SYSTEMATIC BIOLOGY CADLE, J. E. 1985. The Neotropical colubrid snake fauna: Lineage components and biogeography. Syst. Zool. 34:1–20. CADLE, J. E. 1987. Geographic distributions: Problems in phylogeny and zoogeography. Pages 77–105 in Snakes: Ecology and evolutionary biology (R. A. Seigel, J. T. Collins, and S. S. Novak, eds.). Macmillan, New York. CADLE, J. E. 1988. Phylogenetic relationships among advanced snakes: A molecular perspective. Univ. Calif. Publ. Zool. 119:1–77. CADLE, J. E. 1994. The colubrid radiation in Africa (Serpentes: Colubridae): Phylogenetic relationships and evolutionary patterns based on immunological data. Zool. J. Linn. Soc. 110:103–140. COBORN, J. 1991. The atlas of snakes of the world. T.F.H. Publications, Neptune City, New Jersey. CROZIER , R. H. 1990. From population genetics to phylogeny: Uses and limits of mitochondrial DNA. Aust. Syst. Bot. 3:111–124. DE Q UEIROZ, A., M. J. D ONOGHUE, AND J. K IM . 1995. Separate versus combined analysis of phylogenetic evidence. Annu. Rev. Ecol. Syst. 26:657–681. D OWLING , H. G., AND W. E. D UELLMAN. 1978. Systematic herpetology: A synopsis of families and higher categories. HISS Publications, New York. DOWLING , H. G., C. A. HAAS , S. B. HEDGES , AND R. HIGHTON. 1996. Snake relationships revealed by slow-evolving proteins: A preliminary survey. J. Zool. 241:1–28. DOWLING , H. G., R. HIGHTON, G. C. M AHA, AND L. R. M AXSON. 1983. Biochemical evaluation of colubrid snake phylogeny. J. Zool. 201:309–329. FARRIS , J. S. 1969. A successive approximations approach to character weighting. Syst. Zool. 18:374–385. FELSENSTEIN, J. 1985. Confidence limits on phylogenies: An approach using the bootstrap. Evolution 39:783–791. FELSENSTEIN, J. 1988. Phylogenies from molecular sequences: Inferences and reliability. Annu. Rev. Genet. 22:521–565. G EORGE, D. W., AND H. C. D ESSAUER. 1970. Immunological correspondence of transferrins and the relationships of colubrid snakes. Comp. Biochem. Physiol. 33:617–627. G OLDMAN, N., J. P. ANDERSON, AND A. G. R ODRIGO . 2000. Likelihoodbased tests of topologies in phylogenetics. Syst. Biol. 49:652– 670. GORDON, A. 1986. Consensus supertrees: The synthesis of rooted trees containing overlapping sets of leaves. J. Classif. 3:335–348. GRAVLUND , P. 2001. Radiation within the advanced snakes (Caenophidia) with special emphasis on African opistoglyph colubrids, based on mitochondrial sequence data. Zool. J. Linn. Soc. 72:99–114. GREENE, H. W. 1997. Snakes: The evolution of mystery in nature. Univ. California Press, Berkeley. GROOMBRIDGE, B. C. 1984. The facial carotid artery in snakes (Reptilia: Serpentes): Variations and possible cladistic significance. Amphib.Reptilia 5:145–155. GROOMBRIDGE, B. C. 1986. Phyletic relationships among viperine snakes. Pages 219–222 in Studies in herpetology; Proceedings of the 3rd Ordinary General Meeting of the Societas European Herpetologica (Z. Rocek, ed.). Charles Univ., Prague. HARLEY, E. H. 1996. DAPSA, version 4.0. A program for DNA and protein sequence alignment. Univ. Cape Town, Cape Town. HASEGAWA, M., H. K ISHINO , AND T. YANO . 1985. Dating of the human– ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22:160–174. HEISE, P. J., L. R. M AXSON, H. G. D OWLING , AND S. B. HEDGES . 1995. Higher-level snake phylogeny inferred from mitochondrial DNA sequences of 12S rRNA and 16S rRNA genes. Mol. Biol. Evol. 12:259– 265. HEYMANS , J. C. 1975. La musculature mandibulaire et le groupe parotidien des Aparallactinae et Atractaspinae (Serpentes Colubridae) a majorite fouisseurs. Rev. Zool. Afr. 89:889. HEYMANS , J. C. 1982. Contribution a la phylogenèse des ophidians de l’Afrique centrale. Ann. Soc. R. Zool. Belg. 112:79–87. HILLIS , D. M., AND J. P. HUELSENBECK . 1992. Signal, noise, and reliability in molecular phylogenetic analyses. J. Hered. 83:189–195. HOFFSTETTER, R. 1955. Squamates de type moderne. Pages 606–662 in Traı̂té de paléontology, Volume 5 (J. Piveteau, ed.). Masson, Paris. VOL. 52 HOFFSTETTER, R., AND Y. G AYRARD . 1965. Observations sur l’ostéologie et la classification des Acrochordidae (Serpentes). Bull. Mus. Natl. Hist. Nat. 36:677–696. HUELSENBECK , J. P. 1991. When are fossils better than extant taxa in phylogenetic analysis? Syst. Zool. 40:458–469. HUELSENBECK , J. P., AND J. P. BOLLBACK . 2001. Empirical and hierarchical Bayesian estimation of ancestral states. Syst. Biol. 50:351–366. HUELSENBECK , J. P., AND N. S. IMENNOV . 2002. Geographic origin of human mitochondrial DNA: Accommodating phylogenetic uncertainty and model comparison. Syst. Biol. 51:155–165. HUELSENBECK , J. P., AND F. RONQUIST . 2001. MrBayes: Bayesian inference of phylogeny. Department of Biology, Univ. Rochester, Rochester, New York. I RWIN, D. M., T. D. K OCHER, AND A. C. WILSON. 1991. Evolution of the cytochrome b gene of mammals. J. Mol. Evol. 32:128–144. J ENNER, J. V. 1981. A zoogeographic study and the taxonomy of the xenodontine colubrid snakes. Unpublished Ph.D. dissertation, New York University. KEARNEY, M. 1998. Systematics of the amphisbaenian family Rhineuridae: Missing data and resolution. J. Vertebr. Paleontol. 18(suppl.):55A. K ISHINO , H., AND M. HASEGAWA. 1989. Evaluation of the maximum likelihood estimate from the evolutionary tree topologies from DNA sequence data, and the branching order in Hominoidea. J. Mol. Evol. 29:170–179. K RAUS , F., AND W. M. B ROWN. 1998. Phylogenetic relationships of colubroid snakes based on mitochondrial DNA sequences. Zool. J. Linn. Soc. 122:455–487. KRON, K. A., AND W. S. J UDD . 1997. Systematics of the Lyonia group (Andromedeae, Ericaceae) and the use of species as terminals in higher-level cladistic analyses. Syst. Bot. 22:479–492. LARGET , B., AND D. L. S IMON. 1999. Markov chain Monte Carlo algorithms in the Bayesian analysis of phylogenetic trees. Mol. Biol. Evol. 16:750–759. LECOINTRE, G., H. PHILIPPE, H. L. V ÂN LÊ, AND H. LE G UYADER . 1993. Species sampling has a major impact on phylogenetic inference. Mol. Phylogenet. Evol. 2:205–224. LEE, M. S. Y., AND J. D. S CANLON. 2002. Snake phylogeny based on osteology, soft anatomy and ecology. Biol. Rev. 77:333–401. LENK , P., S. K ALYABINA, M. WINK , AND U. J OGER. 2001. Evolutionary relationships among the true vipers (Reptilia: Viperidae) inferred from mitochondrial DNA sequences. Mol. Phylogenet. Evol. 19:94– 104. LIEM , K. L., H. M ARX, AND G. R. R ABB . 1971. The viperid snake Azemiops: Its comparative cephalic anatomy and phylogenetic position in relation to Viperinae and Crotalinae. Fieldiana Zool. 59:65– 126. M ATTHEE, C. A., J. D. B URZLAFF, J. F. TAYLOR, AND S. K. D AVIS . 2001. Mining the mammalian genome for artiodactyl systematics. Syst. Biol. 50:367–390. MAU, B. 1996. Bayesian phylogenetic inference via Markov chain Monte Carlo methods. Ph.D. Dissertation, Univ. Wisconsin, Madison. MAU, B., AND M. NEWTON. 1997. Phylogenetic inference for binary data on dendrograms using Markov chain Monte Carlo. J. Comput. Graph. Stat. 6:122–131. MAU, B., M. NEWTON, AND B. LARGET . 1999. Bayesian phylogenetic inference via Markov chain Monte Carlo methods. Biometrics 55:1– 12. MCCARTHY, C. J. 1985. Monophyly of elapid snakes (Serpentes: Elapidae): An assessment of the evidence. Zool. J. Linn. Soc. 83:79–93. MCD OWELL, S. B. 1968. Affinities of the snakes usually called Elaps lacteus and E. dorsalis. Zool. J. Linn. Soc. 47:561–578. MCD OWELL, S. B. 1979. A catalogue of the snakes of New Guinea and the Solomons, with special reference to those in the Bernice P. Bishop Museum. Part III. Boinae and Acrochordoidea. J. Herpetol. 13:1–92. MCD OWELL, S. B. 1987. Systematics. Pages 3–50 in Snakes: Ecology and evolutionary biology (R. A. Seigel, J. T. Collins, and S. S. Novak, eds.). Macmillan, New York. MILINKOVITCH, M. C., R. G. LED UC, J. ADACHI , F. FARNIR, M. G EORGES , AND M. HASEGAWA. 1996. Effects of character weighting and species sampling on phylogeny reconstruction: A case study based on DNA sequence data in cetaceans. Genetics 144:1817–1833. 2003 459 KELLY ET AL.—PHYLOGENETICS OF ADVANCED SNAKES M INTON, S. A., AND S. K. S ALANITRO . 1972. Serological relationships among some colubrid snakes. Copeia 1972:246–252. MOORE, W. S. 1995. Inferring phylogenies from mtDNA variation: Mitochondrial gene trees versus nuclear gene trees. Evolution 49:718–726. NIXON, K. C., AND J. M. CARPENTER. 1996. On simultaneous analysis. Cladistics 12:221–241. NIXON, K. C., AND Q. D. WHEELER. 1992. Extinction and the origin of species. Pages 119–143 in Extinction and phylogeny (M. J. Novacek and Q. D. Wheeler, eds.). Columbia Univ. Press, New York. NOVACEK , M. J. 1992. Fossils, topologies, missing data and the higher level phylogeny of eutherian mammals. Syst. Biol. 41:58–73. PAGE, R. D. M. 2000. Extracting species trees from complex gene trees: Reconciled trees and vertebrate phylogeny. Mol. Phylogenet. Evol. 14:89–106. PARKINSON, C. L. 1999. Molecular systematics and biogeographical history of pitvipers as determined by mitochondrial ribosomal sequences. Copeia 1999:576–586. PHILIPPE, H., AND E. D OUZERY. 1994. The pitfalls of molecular phylogeny based on four species, as illustrated by the Cetacea/Artiodactyla relationships. J. Mammal. Evol. 2:133–152. POSADA, D., AND K. A. CRANDALL. 1998. ModelTest: Testing the model of DNA substitution. Bioinformatics 14:817–818. POUGH, F. H., R. M. ANDREWS , J. E. CADLE, M. L. CRUMP, A. H. S AVITZKY, AND K. D. WELLS . 2001. Herpetology, 2nd edition. Prentice-Hall, Englewood Cliffs, New Jersey. R AGAN, M. A. 1992. Phylogenetic inference based on matrix representation of trees. Mol. Phylogenet. Evol. 1:53–58. RANNALA, B., AND Z. YANG . 1996. Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference. J. Mol. Evol. 43:304–311. RASMUSSEN, J. B. 1985. A re-evaluation of the systematics of the African rear-fanged snakes of Bogert’s groups XIII–XVI, including a discussion of some evolutionary trends within Caenophidia. Pages 531– 548 in Proceedings of the International Symposium on African Vertebrates (K.-L. Schuchmann, ed.). Zoologisches Forschungsinstitut und Museum Alexander Koenig, Bonn. RIEPPEL, O. 1979. A cladistic classification of primitive snakes based on skull structure. Z. Zool. Syst. Evolutionsforsch. 17:140–150. RIEPPEL, O. 1988. A review of the origin of snakes. Evol. Biol. 22:37–130. RODR ı́ GUEZ, F., J. F. OLIVER, A. M AR ı́ N, AND J. R. M EDINA. 1990. The general stochastic model of nucleotide substitution. J. Theor. Biol. 142:485–501. R OMER, A. S. 1956. Osteology of the reptiles. Univ. Chicago Press, Chicago. R ONQUIST , F. 1996. Matrix representation of trees, redundancy and weighting. Syst. Biol. 45:247–253. R OSSMAN, D. A., AND W. G. EBERLE. 1977. Partition of the genus Natrix, with preliminary observations on evolutionary trends in natricine snakes. Herpetologica 33:34–43. SAIKI , R. K., T. L. B UGAWAN, G. T. HORN, K. B. M ULLIS , AND H. A. ERLICH. 1986. Analyses of enzymatically amplified β-globin and HLADQα DNA with allele-specific oligonucleotide probes. Nature 324:163–166. S AITOU, N., AND M. NEI . 1987. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406– 425. S ANDERSON, M. J., A. PURVIS , AND C. HENZE. 1998. Phylogenetic supertrees: Assembling the trees of life. Trends Ecol. Evol. 13:105–109. SANKOFF, D. 1975. Minimum mutation trees of sequences. J. Appl. Math. 21:35–42. S CHWANER, T. D., AND H. C. D ESSAUER. 1982. Comparative immunodiffusion survey of snake transferrins focused on the relationships of the natricines. Copeia 1982:541–549. SHIMODAIRA, H. 2002. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. 51:492–508. SHIMODAIRA, H., AND M. HASEGAWA. 1999. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16:1114–1116. SIMMONS , M. P., AND H. O CHOTERENA. 2000. Gaps as characters in sequence-based phylogenetic analyses. Syst. Biol. 49:369–381. SLOWINSKI , J. B., AND J. S. K EOGH. 2000. Phylogenetic relationships of elapid snakes based on cytochrome b mtDNA sequences. Mol. Phylogenet. Evol. 15:157–164. SLOWINSKI , J. B., A. K NIGHT , AND A. R. R OONEY. 1997. Inferring species trees from gene trees: A phylogenetic analysis of the Elapidae (Serpentes) based on the amino acid sequences of venom proteins. Mol. Phylogenet. Evol. 8:349–362. SLOWINSKI , J. B., AND R. LAWSON. 2002. Snake phylogeny: Evidence from nuclear and mitochondrial genes. Mol. Phylogenet. Evol. 24:194–202. SMITH, H. M., R. B. S MITH, AND H. L. S WAIN. 1977. A summary of snake classification (Reptilia: Serpentes). J. Herpetol. 11:115–121. S WOFFORD , D. L. 2000. PAUP∗ : Phylogenetic analysis using parsimony (∗ and other methods), version 4. Sinauer, Sunderland, Massachusetts. T HORLEY, J. L., AND R. D. M. PAGE. 2000. RadCon: Phylogenetic tree comparison and consensus. Bioinformatics 16:486–487. T HORLEY, J. L., AND M. WILKINSON. 1999. Testing the phylogenetic stability of early tetrapods. J. Theor. Biol. 200:343–344. UNDERWOOD , G. 1967. A contribution to the classification of snakes. British Museum of Natural History, London. UNDERWOOD , G. 1999. Morphological evidence on the affinities of vipers. Kaupia 8:3–8. UNDERWOOD , G., AND E. K OCHVA. 1993. On the affinities of the burrowing asps Atractaspis (Serpentes: Atractaspididae). Zool. J. Linn. Soc. 107:3–64. VIDAL, N., S. G. K INDL, A. WONG , AND S. B. HEDGES . 2000. Phylogenetic relationships of xenodontine snakes inferred from 12S and 16S ribosomal RNA sequences. Mol. Phylogenet. Evol. 14:389–402. ° R WALSH, P. S., D. A. M ETZGER, AND R. HIGUCHI . 1991. Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. BioTechniques 10:506–513. YANG , Z. 1994. Estimating the pattern of nucleotide substitution. J. Mol. Evol. 39:105–111. YANG , Z., AND B. RANNALA. 1997. Bayesian phylogenetic inference using DNA sequences: A Markov chain Monte Carlo method. Mol. Biol. Evol. 14:717–724. YEATES , D. K. 1995. Groundplans and exemplars: Paths to the tree of life. Cladistics 11:343–357. First submitted 3 January 2002; reviews returned 13 September 2002; final acceptance 29 March 2003 Associate Editor: L. Lacey Knowles APPENDIX 1: LOCALITY D ATA AND VOUCHER NUMBERS OF TAXA FOR WHICH CYTB S EQUENCES WERE G ENERATED Accession numbers refer to snake specimens rather than tissue samples. Collection acronyms are as follows: CAS, California Academy of Sciences; CMRK, Christopher M. R. Kelly; HLMD, Hessisches Landesmuseum Darmstadt; NMZB, Natural History Museum, Bulawayo, Zimbabwe; PEM, Port Elizabeth Museum, South Africa. Aparallactus guentheri, East Usambara Mountains, Tanzania (PEM R5678); Dasypeltis scabra, Moebase, northern Mozambique (PEM R13479); Hemirhagerrhis viperinus, Kaokoveld, Namibia (CAS (AMB5989)); Malpolon monspessulanus, Telmit, Morocco (HLMD-RA1180); Psammophis mossambicus, Namagure, northern Mozambique (PEM R13217); Psammophylax variabilis, Mulanje Massif, Malawi (CMRK M01); Rhamphiophis acutus, Luzamba, Angola (PEM R13485); Lamprophis inornatus, Kasuga, Eastern Cape, South Africa (PEM R5676); Mehelya nyassae, Kwekwe, Zimbabwe (NMZB 16570); Bitis arietans, Grahamstown, South Africa (PEM R5661).
© Copyright 2026 Paperzz