Points of View

Points of View
Syst. Biol. 53(2):333–342, 2004
c Society of Systematic Biologists
Copyright ISSN: 1063-5157 print / 1076-836X online
DOI: 10.1080/10635150490423962
Accepting Partnership by Submission? Morphological Phylogenetics
in a Molecular Millennium
R ONALD A. J ENNER
University Museum of Zoology, University of Cambridge, Downing Street, Cambridge CB2 3EJ, United Kingdom; E-mail: [email protected]
Organismic and morphological approaches to biology
have lately come under attack from a variety of corners
in both teaching and research. However, the main motivations for these attacks are not scientific but are instead
reflective of economic pressures on teaching and the current fashionability of molecular approaches in research.
For example, over the last few decades the number of
students taking courses in invertebrate zoology in North
American universities has declined significantly. This decline has largely been abetted by the abandonment of
classes with small enrolments as increasingly more students are encouraged to take classes in the more fashionable disciplines of molecular and cell biology (Fautin and
Watling, 1999). More recently, the fact that systematics is
a time-consuming science dependent upon the mastering of specialist knowledge has been cited as a motivation
for redirecting research toward DNA taxonomy (Pennisi,
2003; Tautz et al., 2003), and the utility of morphology for
phylogeny reconstruction has recently been questioned
on several grounds (see Hillis and Wiens, 2000; Baker
and Gatesy, 2002).
The most recent challenge to the value of morphology for phylogenetics was published in a recent issue of
this journal (Scotland et al., 2003). After a critical look at
morphological phylogenetics, Scotland et al. (2003:543)
reached a strong conclusion: “We disagree that morphology offers any hope for the future to resolve phylogeny
at lower or higher taxonomic levels.” Instead, they advocated a “more limited role for morphological data in phylogeny reconstruction” (2003:545), which consists chiefly
of studying fewer morphological characters in the context of molecular phylogeny. Because the validity of these
arguments would impinge upon morphological phylogenetic research ranging from algae to elephants, a detailed response to Scotland et al. ( hereinafter SEA) is
necessary.
SEA adduced several arguments in support of their
contention that morphology has little if anything to offer to the future of phylogeny reconstruction and contended that we should be investing our time and effort
into obtaining more molecular phylogenetic data. Here, I
evaluate the main arguments of SEA and expose several
weaknesses in their reasoning, the correction of which
leaves the importance and future promise of morphology for phylogenetics fully intact.
NUMBERS OF CHARACTERS , PHYLOGENETIC ACCURACY,
AND CLADE S UPPORT
The coordinating themes of the critique of SEA are
their concerns for phylogenetic accuracy and quantitative clade support. SEA merged these two aspects of
phylogenetic analysis into a universal yardstick that they
used to assess the relative value of molecules and morphology. SEA (2003:539–540) claimed that in general both
accuracy and support are positively correlated with the
number of phylogenetic characters. First, SEA (2003:539)
concluded that “the number of characters needed in simulation studies to recover accurate trees is an order of
magnitude greater than that available from morphology.” Second, SEA (2003:540) claimed that “the low character/taxon ratio in many morphological studies itself
precludes high support values.” Do these findings provide an irrefutable basis for discarding morphological
evidence from phylogeny reconstruction?
Phylogenetic Accuracy
SEA judged the relative worth of molecular and morphological data with respect to the accuracy of phylogeny reconstruction. They asserted that increasing
phylogenetic accuracy generally depends on increasing
the number of characters, as is summarized in their
figure 1a. As SEA made clear throughout their article, their main contention is that morphological data
are inferior to molecular data in contributing accurate
phylogenetic signal.
Despite comparing the performance of morphology
versus that of molecules with respect to phylogenetic accuracy, SEA failed to discuss how phylogenetic accuracy
333
334
SYSTEMATIC BIOLOGY
can be estimated. Phylogenetic accuracy can only be assessed when the true phylogeny is known prior to the
analysis. Phylogenetic accuracy is therefore assessed either by simulation studies or by study of known phylogenies (laboratory strains) (Hillis et al., 1994a, 1994b;
Hillis, 1995; Givnish and Sytsma, 1997). Consequently,
given the typical limitations of available molecular or
morphological evidence for a particular set of taxa for
which we do not know the phylogeny, we simply cannot
know whether our phylogenetic estimate is accurate, i.e.,
represents the true phylogeny. This realization seriously
compromises the value of phylogenetic accuracy as an
empirical guide in most phylogenetic studies (Siddall
and Kluge, 1997; Siddall, 1998; Farris, 1999).
SEA’s arguments about phylogenetic accuracy are
based on simulation studies, summarized in their
figure 1a. SEA (2003:542) stated that “DNA sequence data
at least offer the unique potential of scoring large numbers of unambiguous characters and character states.”
SEA stated (2003:541) that in contrast to this bounty of
sequence data, simulations show that “the number of
unambiguously coded morphological characters for any
study is finite . . . and less than the number typically required to accurately reconstruct phylogenies in simulation studies.” However, simulation studies can reliably
assess phylogenetic accuracy only when the data evolve
precisely according to the assumed model of evolution
(Hillis et al., 1994a; Hillis, 1995). If the data do not evolve
according to the specified model, then the phylogeny
may not be accurately reconstructed, and the number of
characters necessary to accurately reconstruct the phylogeny cannot be correctly estimated. The number of
characters needed to accurately reconstruct a phylogeny
varies according to the nature of the data at hand, the
number of taxa, and the chosen model of evolution (Hillis
et al., 1994a, 1994b, 2003; Hillis, 1996, 1998; Rosenberg
and Kumar, 2003). Therefore, it is very difficult to generalize the number of characters needed to accurately
reconstruct phylogeny in given instances. Models for sequence evolution have been selected as much (or more)
for their computational convenience as for their approximation to reality (Farris, 1999:201, 202; but see Sullivan and Swofford, 2001). SEA wrote (2003:541) that “the
exact processes underlying nucleotide substitution are
more complex than the simple models used in phylogenetic reconstruction.” This admission makes SEA’s appeal to accuracy largely a moot point because widely
employed methods cannot guarantee phylogenetic accuracy. With respect to morphology, SEA (2003:542) emphasized “our current inability to incorporate models
of morphological evolution into phylogeny reconstruction methods,” which makes it extremely difficult to determine the number of morphological characters necessary to accurately reconstruct phylogeny in specific
cases. However, the development of new models for
dealing with morphological data in a likelihood context (Lewis, 2001) may improve our ability to combine
morphological and molecular data in single phylogenetic
analyses.
VOL. 53
In repeatedly contrasting molecules and morphology,
SEA assumed that these are two distinct and clearly
alternative categories of data for phylogeny reconstruction. The greater potential of molecules to contribute to
increased phylogenetic accuracy is then chiefly a result of
the genome representing a much bigger reservoir of potentially useful unambiguous phylogenetic data. However, is this anything more than a truism?
Millions or billions of genomic nucleotides offer a
vastly bigger pool of potentially useful phylogenetic
variation than the phenotype of even the morphologically most complex eukaryotes. By the same argument,
one might as well conclude that because even entire mitochondrial genomes are typically dwarfed by the size
of the nuclear genome, they are therefore inferior phylogenetic indicators. However, the quality of a data set
and its size are two distinct attributes, and even limited
data sets may contribute phylogenetic accuracy. In practice, molecular phylogenetic studies are typically limited to one or a few genes, the analysis of which is far
more problematic than hinted at by SEA. Such limited
data frequently fail to provide enough information for
a robust and accurate phylogenetic estimate. For example, despite intense taxon sampling, the 18S ribosomal
RNA locus has so far yielded no stable and maximally
accurate hypothesis of metazoan relationships (Peterson
and Eernisse, 2001; Giribet, 2002). This result is not surprising because different genes may evolve at different
rates and may consequently be suited best for resolving
different levels of a phylogeny. In a recent study, Rokas
et al. (2003) made this potential particularly clear. They
performed a phylogenetic analysis of 106 orthologous
genes from seven species of yeast from the genus Saccharomyces. Their results (Rokas et al., 2003:798) suggested
that “data sets consisting of single or a small number of
concatenated genes have a significant probability of supporting conflicting topologies.” Thus, single molecular
data sets are potentially equally fallible with respect to
accurate phylogeny reconstruction as are morphological
data sets. Moreover, “the support for a given branch was
strongly dependent on the gene analysed” (Rokas et al.,
2003:799), which indicates that different genes are complementary in providing support for different parts of the
phylogeny. In line with these findings, workers increasingly have adopted multigene approaches to phylogeny
reconstruction to benefit from complementing data partitions that provide resolution at different levels in the
phylogeny (Colgan et al., 1998; Edgecombe et al., 2000,
2002; Baker et al., 2001; Giribet et al., 2001; Giribet, 2002;
Collin, 2003; Rokas et al., 2003), whereas each data set
alone, be it molecular or morphological, may have only
limited resolving power. So far, having a large proportion of the genome sequenced is an ideal that has only
been realized for a few species.
However, even a multigene approach may not necessarily solve all problems when the number of genes
is limited. Different genes may yield conflicting phylogenetic signals on all taxonomic levels across the tree
of life for a variety of biological and analytical reasons,
2004
POINTS OF VIEW
including choice of optimality criterion, comparison of
paralogous genes, taxon sampling, mutational saturation and long branch attraction, changing substitution
rates of a site across a gene and/or across a phylogeny,
compositional biases, lateral gene transfer, similar selective regimes leading to molecular convergence, and
lineage sorting, and the same gene may yield conflicting phylogenies when analyzed on the nucleotide or
amino acid levels (Miyamoto and Fitch, 1995; Maddison,
1997; Naylor and Brown, 1998; Hillis and Wiens, 2000;
Lockhart et al., 2000; Sota and Vogler, 2001; Sullivan
and Swofford, 2001; Taggart et al., 2001; Gribaldo and
Philippe, 2002; Lopez et al., 2002; Rydin et al., 2002; Shaw,
2002; Simmons et al., 2002a, 2002b; Machado and Hey,
2003; Rokas et al., 2003).
Consequently, not all parts of the genome necessarily uniformly and positively contribute to phylogenetic
accuracy, and their combined use in a single phylogenetic analysis may therefore not lead to increased accuracy. This makes SEA’s dichotomy between all molecular sequence data on the one hand and morphology
on the other not particularly meaningful. SEA’s conclusion (2003:539) “that a main constraint of morphologybased phylogenetic inference concerns the limited number of unambiguous characters available for analysis”
is equally applicable to many available molecular data
sets. Clearly, the well-known limitations of individual
molecular data sets have not led to the general dismissal
of molecular evidence. The acknowledged limitations of
morphological data sets should therefore no more be
used as an argument to dismiss the value of morphology
for phylogeny reconstruction. Even considering the potential of large amounts of genomic information, we do
not know whether we can reasonably expect genomes
to furnish the required amount of data to resolve all
phylogenetic problems on all phylogenetic levels. SEA
(2003:543) acknowledged as much: “an honest observer
would have to agree that even whole genomes for all
species will probably not yield a fully resolved, highly
confident tree.”
Quantitative Clade Support
SEA (2003:540) argued that morphological phylogenetic studies generally have “too few morphological
characters to provide confidence in any given estimate
of phylogeny.” SEA generalized the findings of Bremer
et al. (1999) and a few other studies in their figure 1b
and claimed (2003:540) that these results “demonstrated
explicitly that the character/taxon ratio for morphological studies is such that bootstrap percentages are
likely to be low.” This statement holds true for the
morphological data sets of two plant families analyzed
by Bremer et al. (1999). These data sets had low numbers
of characters: 3.2 and 2.6 characters/taxon for Rubiaceae
and Apocynaceae, respectively. Bremer et al. (1999) cited
the low number of available characters as the cause of
the low support measures for the phylogenetic analyses
of these data sets. The bigger molecular data sets had
335
better support values. I appreciate the potential value
of statistical support measures, leaving aside difficulties
of interpreting commonly used support values such as
bootstrap percentages (reviewed by Sanderson, 1995;
Siddall, 2002) and difficulties of justifying the use of
probabilistic arguments in historical inference in the
first place (Siddall and Kluge, 1997; Swofford et al., 2001;
Kluge, 2002). Morphology may often provide more
coarse grained phylogenetic resolution than molecular
data because of lower character/taxon ratios, but this
does not necessarily lead to lower quantitative support
measures. It is not only the number of characters but the
distribution of homoplasy that will determine how well
a given data set supports a phylogeny. Therefore, one
cannot uncritically generalize the findings of Bremer et
al. (1999) that small morphological data sets are poorer
performers than larger molecular data sets.
Kress et al. (2001) analyzed a morphological data set
for the Zingiberales (a group of monocotyledons including bananas and gingers) with 2.8 characters/taxon and
found that 57% of the ingroup nodes were supported
with bootstrap values >75%. In contrast, two of three
analyzed molecular data sets for this group generated
lower percentages of ingroup nodes with bootstrap values >75%, despite a much larger number of characters. It is unwarranted to generalize from a few morphological data sets discussed by SEA to the conclusion
(2003:540) that high support values are precluded “in
many morphological studies.” Whether high support
values are generated depends entirely on the morphological data set under consideration. In a recent comprehensive morphological analysis of snake phylogeny, Lee
and Scanlon (2002) found that >70% of the ingroup nodes
had bootstrap values of >75%. Similarly, in a morphological phylogenetic analysis of arthropod relationships
Edgecombe et al. (2000) also found that >70% of the ingroup nodes had bootstrap support values of >75%, and
high quantitative support values for morphological analyses are certainly not rare (Wiens and Hollingsworth,
2000; Damgaard and Sperling, 2001; Kress et al., 2001;
Gatesy et al., 2003). SEA presented no convincing evidence that morphological phylogenetic evidence is generally undesirable because “the low character/taxon
ratio in many morphological studies itself precludes high
support values” (2003:540). Data sets of single molecules
may provide contributions to clade support as limited
at that provided by morphological data sets. The important point is that the available character space should be
sampled as comprehensively as possible, and morphology may contribute valuable evidence in combination
with molecular data.
Although SEA acknowledged (2003:545) the potential
value of combined analyses of both molecular and morphological data, they failed to mention that in the study
of Bremer et al. (1999), from which they adapted their
Figure 1a, the support measures of the combined analyses of molecules and morphology were significantly
better than those of either the morphological or molecular data sets alone. Similar positive contributions of
336
SYSTEMATIC BIOLOGY
morphology to quantitative clade support measures in
combined analyses have been observed for a host of
other taxa across the tree of life (Flores-Villela et al., 2000;
Kress et al., 2001; Winterton et al., 2001; Edgecombe and
Giribet, 2002; Giribet et al., 2002; Stach and Turbeville,
2002; Jeffery et al., 2003; Schulmeister, 2003; Wahlberg
and Nylin, 2003). The positive contribution of morphology to combined analyses may also become apparent
from other measures, such as partitioned branch support values (Baker et al., 1998; Gatesy and Arctander,
2000; Damgaard and Sperling, 2001; Murrell et al.,
2001; Gatesy, 2002; Meier and Baker, 2002; Remsen and
O’Grady, 2002; Gatesy et al., 2003; Wahlberg and Nylin,
2003).
Even though morphological data sets may contain relatively small numbers of characters in comparison to
the total amount of included molecular data, morphology may contribute more support and stability to the
combined analysis when considered per character (Baker
et al., 1998; Gatesy, 2002; Meier and Baker, 2002). For example, Baker et al. (1998) and Meier and Baker (2002)
found that morphology may be up to twice as informative as molecules per included character in a combined
analysis across a wide range of different taxa.
Morphology also may contribute to combined data
analyses in the form of hidden clade support that becomes apparent only when data sets are combined
(Gatesy and Arctander, 2000; Damgaard and Sperling,
2001; Murrell et al., 2001; Gatesy, 2002; Gatesy et al.,
2003). Thus, morphology may be partially incongruent with molecular data when analyzed separately, but
when combined the total evidence may generate a wellsupported phylogeny. For the same reason combination
of apparently conflicting molecular data sets may lead to
a single well-supported phylogeny (Rokas et al., 2003).
In some cases, morphology and molecules supply
complementary evidence, where each type of data may
contribute to different parts of the phylogeny (Giribet
and Wheeler, 2002). A strength of morphology is that
characters informative at many different levels in the
phylogeny are habitually included in a single data set,
whereas the average substitution rate of a given gene
may limit its utility to particular levels in a phylogeny.
One or a few genes may not provide sufficient phylogenetic signal across all levels in a phylogeny, especially
when the included taxa span a broad range of divergence dates. In other cases, morphology may contribute
unique phylogenetic signal where molecules are completely mute (Mattern and McLennan, 2000). In some
combined analyses, morphology may increase support
of certain nodes while lowering that of others ( Klompen
et al., 2000; Stach and Turbeville, 2002), and in other cases
morphology may not add any significant support to combined data analyses (Sorhannus, 2001). Although conflict
between morphological and molecular evidence may occur and some workers remain skeptical about combining
morphological and molecular data into a single phylogenetic analysis (SEA did not object to combined analysis
of morphological and molecular data; 2003:545), these
VOL. 53
examples clearly show that morphological data may improve clade support values.
HOMOLOGY ASSESSMENT AND CHARACTER CODING
SEA nominated the limited number of unambiguous
characters as the main shortcoming of morphological
phylogenetic data. To make matters even worse, SEA
indicated that any attempt to solve this problem by increasing the number of morphological characters will be
doomed to failure. First, SEA (2003:545) concluded that
“much of the useful morphological diversity has already
been scrutinized.” Although this statement may be true
for well-studied taxa such as the seed plants discussed
by SEA, there is no indication that it holds true for other
poorly studied taxa. The widespread use of new morphological data in phylogenetic analyses of many different taxa leaves no doubt that much phenotypic variation
remains to be explored (Lee, 1995; Klass, 2001; Hooge
et al., 2002; Giribet et al., 2002; Sørensen, 2002; Strong,
2003).
Second, SEA (2003:541) argued that the need to properly code and define morphological characters will only
add “a level of subjectivity and interpretation” to the
analysis, which merely “increases the level of ambiguous
or problematic characters.” SEA (2003:542) concluded:
“For morphological studies comprising a relatively high
number of characters, both character coding and character conceptualization become increasingly important
variables that may have a negative impact on a study
as more characters are added (Fig. 1).” SEA buttressed
their conclusions with figures 1c and 1d, which show that
ambiguity of both homology assessment and character
coding will quickly increase with the addition of more
characters.
These arguments erect nothing but a straw man. SEA
fabricated these graphs without marshalling any evidence to support them. There is nothing to support SEA’s
contention that increasing the number of characters in a
morphological data set will necessarily lead to increasingly ambiguous homology determination and character coding. The only example SEA (2003:542) presented
where an increase in the number of morphological characters “made no significant difference to the results”
scarcely provides the basis for these generalizations applicable across the tree of life. In contrast, I expect an increase in the quality of morphological phylogenetic data
with time as the development of new analytical techniques opens up unexplored sources of data and facilitates the validation of older information. For example,
in metazoan phylogenetics powerful techniques such as
transmission electron microscopy, molecular developmental biology, and cell lineage tracing are contributing to an improvement of the morphological data sets as
new data are added and old mistakes are corrected (e.g.,
Giribet, 2002; Jenner, 2004).
The main problem of SEA’s interpretation here is that
they assumed that ease of homology assessment and
character coding can be regarded as proxies of a single,
2004
POINTS OF VIEW
general phylogenetic signal in a data set and are as such
related as single values to the number of characters in
an analysis. However, data sets do not contain a single
phylogenetic signal. The phylogenetic signal in a data set
is hierarchically structured, and different characters are
significant on different phylogenetic levels. Certain characters are therefore relevant for placing certain taxa but
are irrelevant for placing others. Consequently, there is
no simple relationship between an increase in the number of morphological characters and the ambiguity of
character coding and homology assessment that can be
captured in a graph, such as figures 1c and 1d, and that
is valid throughout the whole phylogeny.
SEA’s arguments about defining and coding characters
are reflective of a deplorable appreciation of the morphological scholarship that is at the heart of phylogenetics.
This attitude mirrors a worrying statement in the chapter on animal phylogeny in the recently published new
edition of the widely used textbook on invertebrate zoology by Brusca and Brusca (2003:874): “The process of a
priori character assessment is perhaps the weakest link
in morphological phylogenetic biology.” However, character assessment is the only empirical anchor of phylogenetics. If properly conducted, comparative morphological study will be the greatest strength of a phylogenetic
analysis, and when performed less than critically it may
be the greatest weakness. The objectivity of morphological phylogenetic analyses lies in intersubjective testing,
in which previously compiled data are reanalyzed and
reevaluated for their congruence with other characters
(Kluge, 1997, 1998; Ax, 1999). SEA’s recommendations
of greater explicitness in character selection and care in
morphological studies will naturally contribute to the
quality of morphological phylogenetic analyses.
With respect to character coding and character conceptualization in molecular phylogenetics, SEA (2003:541)
concluded that for “aligned sequence data, there is no
ambiguity in assessing character states,” and “there is no
ambiguity that the unit of comparison is the nucleotide.”
In contrast, the definition of homologous characters and
the coding of character states in morphological phylogenetics is much more problematic. However, the important issue is not only whether two nucleotides at the
same position are identical but also whether they are
historically identical, i.e. homologous. Nucleotides are
characters of relatively low complexity, and the character state space for nucleotides is much more restricted
than that for morphology. In certain circumstances, this
restriction creates a considerable danger that the same
nucleotide has evolved independently in the same position. This realization has been an incentive to develop
models of evolution that estimate the probability that
the same nucleotides at a site are historically identical,
to explore the value of more complex molecular characters (Swofford et al., 1996; Rokas and Holland, 2000),
and to develop alignment methods that use comparative
molecular anatomical information (secondary and tertiary structure) to improve the probability of recovering
homology (Kjer, 1995; Hickson et al., 2000). In contrast,
337
morphology generally presents a richer space of more
complex characters that allows a more fine-grained comparison of potential homology, which may help explain
why in certain cases morphology may be qualitatively
superior to molecules when considered per character
(Baker et al., 1998; Gatesy, 2002; Meier and Baker, 2002).
The definition and coding of morphological characters is certainly a difficult subject, and for a given set
of taxa alternative approaches to homology assessment
and character coding may indeed lead to different phylogenies. However, as the articles cited by SEA indicate,
there is no easy solution to this problem, and subjective choices are unavoidable. One way to deal with this
issue is to perform a sensitivity analysis with different
character codings to see how the phylogeny changes
with changing character codings (Jenner, 2002; Rieppel
and Kearney, 2002; Simmons and Geisler, 2002). This approach at least allows a better understanding of the relationship between assumptions and hypothesis.
Most important in the context of this critique, however, is that despite the admitted difficulties of morphological character analysis, there is no evidence to suggest
that morphology generally performs more poorly than
molecules in estimating phylogeny (Hillis and Wiens,
2000; Baker and Gatesy, 2002). SEA (2003:543) cited an
example where molecular evidence from three genes is
taken to suggest that phylogenetic analyses of morphology and fossils have led to inaccurate phylogenetic hypotheses of the angiosperms. Although this statement
may be true, we cannot automatically conclude that morphology always suggests the wrong answer whenever
molecules and morphology conflict. In many cases different molecular analyses may conflict with each other
(Sota and Vogler, 2001; Taggart et al., 2001; Rydin et al.,
2002; Shaw, 2002; Machado and Hey, 2003; Rokas et al.,
2003), and morphology may be more reliable than some
molecular data partitions (Wiens and Hollingsworth,
2000; Damgaard and Sperling, 2001). The findings of
Rokas et al. (2003) potently illustrate that data from one
or a small number of different genes may lead to a robustly supported phylogeny that may be in direct conflict with an alternative phylogeny based on another set
of genes, even for relatively closely related species. In
the context of such limited amounts of data we simply
cannot know whether we have accurately reconstructed
the true phylogeny. We can only sample character space
as comprehensively as possible and find the most wellcorroborated hypothesis.
SEA (2003:541) claimed that these problems of “subjectivity and interpretation” are absent from molecular
data, because “areas of ambiguity [in sequence alignment] can be excluded.” As recent research has shown,
to choose this way of least resistance may be thoroughly
misleading, and this short statement seriously underplays the degree of subjectivity and interpretation associated with molecular phylogenetics. Apart from the
choice of included taxa, “subjectivity and interpretation”
in molecular phylogenetics may reside in the choice of
the gene, the decisions of how to determine homology
338
SYSTEMATIC BIOLOGY
when insertion and deletion events or exon shuffling occur, the choice of phylogenetic analysis parameters, the
choice of whether nucleotides or amino acids are analyzed for protein-coding genes, and the choice of the optimality criterion. Different decisions for these variables
may result in different phylogenies (Giribet and Wheeler,
2002; Giribet et al., 2002; Simmons et al., 2002a, 2002b;
Rokas et al., 2003). Horizontal gene transfer and gene
duplication may add further difficulties to homology assessment in molecular phylogenetic analyses. Moreover,
SEA’s suggestion to simply exclude all information associated with ambiguously aligned sequences may also
cause misleading results, as illustrated by the following
example.
For more than a decade, 18S ribosomal DNA (rDNA)
sequences suggested that birds were most closely related
to mammals (hematotherm hypothesis). This hypothesis
was in conflict with results derived from a large amount
of traditional (morphological and paleontological) and
other molecular data, which instead united birds with
crocodilians. After different workers analyzed the 18S
data in various different ways, they concluded that this
was an example of different molecules giving significantly different estimates of phylogeny. However, in a
recent study Xia et al. (2003) convincingly showed that
the conflict between 18S data and the traditional and
other molecular data was an artifact attributable to two
main factors: misalignment of sequences and inappropriate estimation of base frequency parameters. Crucial to
the resolution of this paradox was the incorporation of
information from the most variable regions of the 18S
molecule that were most difficult to align unambiguously. This study clearly showed that restricting the data
set to only the least unambiguous sites might produce
a thoroughly misleading phylogeny. The problem that
“different workers will perceive and define characters in
different ways” (2003:541) is therefore certainly not limited to morphological data. The recent development of
methods to deal with ambiguously aligned regions or
to circumvent multiple sequence alignment altogether
(for review, see Lee, 2001) further undermines the fundamental dichotomy between the value and treatment of
molecular and morphological data as portrayed by SEA.
VALUE OF M ORPHOLOGY IN PHYLOGENETICS
SEA (2003:544) stated that “a continued role for morphology in phylogeny reconstruction seems a reasonable
expectation,” and they envisioned the role of morphology in systematics in one of three ways. In the first form,
which is the one strongly recommended by SEA, fewer
morphological features are studied in depth and are then
mapped onto a molecular phylogeny “on the basis that
morphological characters can be diagnostic for nodes
on molecular trees” (2003:545). Each morphological character is studied separately with respect to a node in the
molecular phylogeny, and characters found to be incongruent “are not incorporated into the phylogenetic hypothesis” (2003:545). SEA (2003:545) claimed that this approach “is akin to Patterson’s (1982) congruence test.”
VOL. 53
If one adheres to cladistics as a falsificationist research
program, the problem here is the claim of congruence.
Patterson (1982) was clearly concerned about the congruence of characters against all others. Standard cladistic
analysis operates on the basis of character congruence,
in which the congruence of all characters is assessed simultaneously. Only this approach will guarantee finding
the globally most-parsimonious cladogram that embodies maximal explanatory power. By comparing individual morphological characters against an already resolved
molecular phylogeny, one is not assessing overall congruence of characters, despite SEA’s confusing invocation of Patterson’s congruence test. If 99 of 100 characters were incongruent with the molecular phylogeny but
entirely congruent among themselves, SEA’s approach
would nevertheless disregard them as homoplasies. This
approach is more akin to clique analysis, the logical basis of which for phylogeny reconstruction was long ago
questioned (Farris, 1983). This approach purposely removes any testing power from the phylogenetic analysis, which is in direct conflict with the falsificationist
premises of cladistics.
Even if morphological characters were not directly
used to reconstruct a phylogeny but, as suggested by
SEA, were merely mapped onto a molecular phylogeny
to “provide evidence for a more limited number of monophyletic taxa” (2003:545), we would still have to confront exactly the same problems of character definition
and character coding that SEA nominated as the greatest weaknesses of morphological phylogenetic studies.
Before characters can be mapped onto a molecular phylogeny as a limited number of additional synapomorphies, they must be defined. Just as decisions about character coding determine on what level in the phylogeny a
character is informative, so too is the mapping of a character in a certain place in a phylogeny inextricably linked
with particular assumptions about character coding. For
example, SEA (2003:541) cited the famous “no tails, red
tails, blue tails” example to illustrate that even for relatively simple characters it may be difficult to choose
an appropriate coding method. For these data, multistate coding does not capture the variation of a tail being present versus absent (see discussion by Hawkins
et al., 1997). Consequently, if we map the presence of
tails as a synapomorphy on a particular level in a molecular phylogeny, we necessarily imply that the observed
variation is not conceptualized as a multistate character.
Thus, even though decisions about character coding are
not explicitly made in a data matrix, implicit decisions
cannot be avoided.
SEA recommended the study of fewer rather than
more characters because rigorous character study is too
time consuming. Today, the high pace of publication in
the field of phylogenetics is principally set by molecular
studies, where automated methods allow much faster
data collection than can be achieved when studying
comparative morphology. Unfortunately, this publication pressure may in some cases lead to the publication
of phylogenetic studies based on hastily and uncritically
compiled morphological data sets of low quality (Jenner,
2004
POINTS OF VIEW
2001, in press). However, we want to understand as
much as we can about the evolution of morphology, and
molecules, including features for which homology assessment is difficult. That difficult science can be time
consuming can hardly be a scientific argument for studying fewer rather than more characters.
SEA’s second and third proposed strategies for incorporating morphology into phylogenetics are actually the
same. As a second approach, SEA advocated a continuation of current practice. That is, morphological and
molecular data both have their value, and they are analyzed both separately and in combination to study evolution. The third strategy is similar except only “those
characters that are unproblematic in terms of homology
assessment and character coding are selected” (2003:545)
and analyzed together with the molecular data. These
two approaches actually provide the same possibilities as
the first recommended approach, and as discussed with
respect to combined analyses, more in addition. SEA realized that careful morphological study is valuable, and
I could not agree more.
However, SEA’s recommendation to select only unambiguously homologous morphological characters is
problematic. All primary homology decisions and character state assignments are provisional and can in principle be corroborated and refuted. By concluding that the
definition and coding of even simple characters can be
problematic (the tails example), SEA themselves show
that prior to a phylogenetic analysis there are no “unambiguously coded morphological characters” (2003:541).
Workers who attempted to separate problematic from
supposedly more reliable characters prior to a phylogenetic analysis confirm this. Their results show that the
phylogenetic utility of characters is very difficult to predict a priori (Jenner and Schram, 1999: Table 3; Schander
and Sundberg, 2001; Collin, 2003; Gatesy et al., 2003;
Strong, 2003).
By concluding (2003:543) that morphology cannot offer “any hope for the future to resolve phylogeny at lower
or higher taxonomic levels,” SEA gave very little consideration to fossils. Although fossils were briefly discussed in connection with taxon sampling, SEA did not
reach any firm conclusion about their value for phylogenetics other than that their contribution to phylogenetic accuracy is uncertain in view of the problems of
homology assessment and character coding. However,
the limited amount of molecular and morphological evidence typically available to us does not allow us to determine whether the true phylogeny has been accurately
reconstructed in the first place. As an additional problem of including fossils in phylogenetic analyses, SEA
(2003:543) cited the “large amounts of missing data.” In
contrast, Wiens (2003:536) concluded on the basis of simulation studies that “the proportion of missing data cells
in the incomplete taxa is a poor indicator of their impact
on phylogenetic accuracy.” Accuracy is more related to
the distribution of missing data in the matrix and the
number of characters that can be scored for the incomplete taxa. As Baker and Gatesy (2002:171) concluded;
“If morphological evidence is ignored, the phylogeny of
339
over 99% of life is ignored.” In certain groups, such as
the certartiodactyls, >85% of the genera are known only
from fossils (Gatesy, 2002). The importance of fossils for
reconstructing phylogeny can perhaps best be illustrated
by the importance of stem groups ( Wills et al., 1998; Budd
and Jensen, 2000; Budd, 2002; Holmer et al., 2002; Lee and
Scanlon, 2002; Mallatt and Chen, 2003; Ruta et al., 2003).
Stem groups exemplify the most basal divergences of extant taxa, and their value in bridging the often considerable morphological gaps between disparate body plans
is of central importance to reconstructing the full diversity of the tree of life. With the exception of ancient DNA,
rigorous morphological phylogenetic analysis is the only
way in which fossils can be incorporated into the phylogeny of life. Failure to do so will leave phylogenies of
extant organisms forever uprooted.
SEA (2003:545) admitted the value of reciprocal illumination (Hennig, 1966) in which independent data can
shed unique light on a problem. However, they apparently fail to realize that by restricting morphological phylogenetic analyses, the power of reciprocal illumination
is crippled. Phylogenetic analyses of different data sets
provide valuable and unique independent perspectives
on both phylogenetic questions and questions of character evolution. The existence of a dialogue between morphological and molecular hypotheses guarantees continued attention and excitement in phylogenetic research
on many taxa on many different phylogenetic levels. For
example, a recent phylogenetic analysis of 18S rDNA sequences in asellote isopods failed to yield results strongly
expected on the basis of careful morphological study
(Wägele et al., 2003). This discordance stimulated a more
detailed look at the molecular data, yielding several interesting insights that may otherwise have remained
invisible. Similarly, Vidal and Hedges (2002) recently analyzed snake phylogeny with nuclear and mitochondrial
genes. They found that the morphologically strongly
supported clade of macrostomatans (Lee and Scanlon,
2002) is contradicted by molecular evidence. Because
some workers have claimed that most apparent conflict
between molecular and morphological phylogenies is
due to weak support for either or both of the estimates
and that strongly supported and misleading morphological phylogenies are relatively rare (Hillis and Wiens,
2000; Wiens et al., 2003), such a potential case for strong
conflict is worth exploring. It allows us to explore the
possibility that in this case morphological phylogenetic
analyses may suffer from the confounding effect of concerted homoplasy, possibly in the characters related to
wide gape size. This reassessment may teach us something important about the extent of parallel evolution in a
complex of adaptive features. These insights into evolution are dependent upon the compilation, analysis, and
comparison of independently compiled molecular and
morphological data sets.
Science derives its greatest strength from its multifaceted nature. Artificial restriction of approaches
can lead only to impoverishment of science. Many
researchers see the merits of analyzing molecular and
morphological data sets separately and in combination to
340
SYSTEMATIC BIOLOGY
compare the results (Littlewood et al., 1999; Edgecombe
et al., 2000; Flores-Villela et al., 2000; Janies, 2001; Kress
et al., 2001; Wheeler et al., 2001; Winterton et al., 2001;
Edgecombe and Giribet, 2002; Giribet and Wheeler, 2002;
Giribet et al., 2002; Stach and Turbeville, 2002; Jeffery
et al., 2003; Schulmeister, 2003; Wahlberg and Nylin,
2003). As SEA admitted, study of morphological characters is necessary anyway because one goal of phylogenetic analysis is to understand character evolution. We
might as well submit the collected data to as many potentially fruitful uses as possible.
By discouraging morphological phylogenetics, we
take away a main formal impetus for morphological
comparative research. Combined with the continuing
marginalization of organismic biology in academic curricula (Fautin and Watling, 1999), this attitude introduces
the real danger that potentially interested students will
not be attracted to systematic biology, and this dearth of
students may seriously undermine the future and versatility of our science for largely nonscientific reasons.
SEA failed to present any convincing scientific arguments for narrowing the focus of phylogenetics to mainly
molecular research. In writing that morphology “may
not be able to resolve the full branch structure of the
tree of life,” SEA (2003:543) merely presented an empty
statement, because no single data set, molecular or morphological, can reasonably be expected to be the Holy
Grail of phylogenetics, as they themselves underscore by
stating (2003:543) that even whole genomes “will probably not yield a fully resolved, highly confident tree.”
The only scientifically valid reason that I can think of for
excluding or severely limiting the use of morphological
evidence for phylogeny reconstruction as proposed by
SEA is when we know a priori that these data will be
positively misleading. SEA failed to show that we have
this knowledge. Instead, they presented a few examples
from botanical phylogenetics where the contribution of
morphological data to resolving the phylogeny was limited. However, as shown here, these examples scarcely
provide enough foundation to question the general value
of morphology for other parts of the tree of life. In the
absence of evidence to the contrary, I am convinced that
an equal partnership between molecular and morphological phylogenetics is our best bet for future progress.
ACKNOWLEDGMENTS
I gratefully acknowledge helpful comments on the manuscript by
Tim Collins, Chris Simon, Gavin Naylor, Richard Olmstead, Jonathan
Bennett, and an anonymous reviewer. A Marie Curie Individual Fellowship of the European Community program Improving Human
Potential under contract number HPMF-CT-2002-01712 supports my
work.
R EFERENCES
Ax, P. 1999. Das System der Metazoa II. Ein Lehrbuch der phylogenetischen Systematik. Gustav Fischer Verlag, Stuttgart, Germany.
Baker, R. H., and J. Gatesy. 2002. Is morphology still relevant? Pages
163–174 in Molecular systematics and evolution: Theory and practice
(R. DeSalle, G. Giribet, and W. Wheeler, eds.). Birkhäuser Verlag,
Basel, Switzerland.
VOL.
53
Baker, R. H., G. S. Wilkinson, and R. DeDalle. 2001. Phylogenetic utility
of different types of molecular data used to infer evolutionary relationships among stalk-eyed flies (Diopsidae). Syst. Biol. 50:87–105.
Baker, R. H., X. B. Yu, and R. DeSalle. 1998. Assessing the relative contribution of molecular and morphological characters in simultaneous
analysis trees. Mol. Phylogenet. Evol. 9:427–436.
Bremer, B., R. K. Jansen, B. Oxelman, M. Backland, H. Lantz, and K.-J.
Kim. 1999. More characters or more taxa for a robust phylogeny—
Case study from the coffee family (Rubiaceae). Syst. Biol. 48:413–435.
Brusca, R. C., and G. J. Brusca. 2003. Invertebrates, 2nd edition. Sinauer,
Sunderland, Massachusetts.
Budd, G. E. 2002. A palaeontological solution to the arthropod head
problem. Nature 417:271–275.
Budd, G. E., and S. Jensen. 2000. A critical reappraisal of the fossil record
of the bilaterian phyla. Biol. Rev. 75:253–295.
Colgan, D. J., A. McLauchlan, G. D. F. Wilson, S. P. Livingston, G. D.
Edgecombe, J. Macaranas, G. Cassis, and M. R. Gray. 1998. Histone
H3 and U2 snRNA DNA sequences and arthropod molecular evolution. Aust J. Zool. 46:419–437.
Collin, R. 2003. The utility of morphological characters in gastropod
phylogenetics: An example from the Calyptraeidae. Biol. J. Linn. Soc.
78:541–593.
Damgaard, J., and F. A. H. Sperling. 2001. Phylogeny of the water strider genus Gerris Fabricius (Heteroptera: Gerridae) based on
COI mtDNA, EF-1α nuclear DNA and morphology. Syst. Entomol.
26:241–254.
Edgecombe, G. D., and G. Giribet. 2002. Myriapod phylogeny and
the relationships of Chilopoda. Pages 143–168 in Biodiversidad,
taxonomı́a y biogeografı́a de artrópodos de México: Hacia una
sı́ntesis de su conocimiento (J. Llorente Bousquets, J. J. Morrone,
and H. Ponce Ulloa, eds.). Prensas de Ciencias, UNAM, Mexico City,
Mexico.
Edgecombe, G. D., G. Giribet, and W. C. Wheeler. 2002. Phylogeny of
Henicopidae (Chilopoda: Lithobiomorpha): A combined analysis of
morphology and five molecular loci. Syst. Entomol. 27:31–64.
Edgecombe, G. D., G. D. F. Wilson, D. J. Colgan, M. R. Gray, and G.
Cassis. 2000. Arthropod cladistics: Combined analysis of histone H3
and U2 snRNA sequences and morphology. Cladistics 16:155–203.
Farris, J. S. 1983. The logical basis of phylogenetic analysis. Pages 7–36
in Advances in cladistics, Volume 2 (N. I. Platnick and V. A. Funk,
eds.). Columbia Univ. Press, New York.
Farris, J. S. 1999. Likelihood and inconsistency. Cladistics 15:199–204.
Fautin, D. G., and L. Watling. 1999. Review of textbooks of invertebrate
zoology. Am. Zool. 39:818–824.
Flores-Villela, O., K. M. Kjer, M. Benabib, and J. W. Sites. 2000. Multiple data sets, congruence, and hypothesis testing for the phylogeny
of basal groups of the lizard genus Sceloporus (Squamata, Phrynosomatidae). Syst. Biol. 49:713–739.
Gatesy, J. 2002. Relative quality of different systematic datasets for
certartiodactyl mammals: Assessments within a combined analysis framework. Pages 45–67 in Molecular systematics and evolution:
Theory and practice (R. DeSalle, G. Giribet, and W. Wheeler, eds.).
Birkhäuser Verlag, Basel, Switzerland.
Gatesy, J., G. Amato, M. Norell, R. DeSalle, and C. Hayashi. 2003. Combined support for wholesale taxic atavism in gavialine crocodylians.
Syst. Biol. 52:403–422.
Gatesy, J., and P. Arctander. 2000. Hidden morphological support for
the phylogenetic placement of Pseudoryx nghetinhensis with bovine
bovids: A combined analysis of gross anatomical evidence and DNA
sequences from five genes. Syst. Biol. 49:515–538.
Giribet, G. 2002. Current advances in the phylogenetic reconstruction of
metazoan evolution. A new paradigm for the Cambrian explosion?
Mol. Phylogenet. Evol. 24:345–357.
Giribet, G., G. D. Edgecombe, and W. C. Wheeler. 2001. Arthropod
phylogeny based on eight molecular loci and morphology. Nature
413:157–161.
Giribet, G., G. D. Edgecombe, W. C. Wheeler, and C. Babbitt. 2002. Phylogeny and systematic position of Opiliones: A combined analysis of
chelicerate relationships using morphological and molecular data.
Cladistics 18:5–70.
Giribet, G., and W. Wheeler. 2002. On bivalve phylogeny: A high-level
analysis of the Bivalvia (Mollusca) based on combined morphology
and DNA sequence data. Invertebr. Biol. 121:271–324.
2004
POINTS OF VIEW
Givnish, T. J., and K. J. Sytsma. 1997. Consistency, characters, and the
likelihood of correct phylogenetic inference. Mol. Phylogenet. Evol.
7:320–330.
Gribaldo, S., and H. Philippe. 2002. Ancient phylogenetic relationships.
Theor. Popul. Biol. 61:391–408.
Hawkins, J. A., C. E. Hughes, and R. W. Scotland. 1997. Primary homology assessment, characters and character states. Cladistics 13:275–
283.
Hennig, W. 1966. Phylogenetic systematics. Univ. Illinois Press, Urbana.
Hickson, R. E., C. Simon, and S. W. Perrey. 2000. The performance of several multiple-sequence aligment program in relation to secondarystructure features for an rRNA sequence. Mol. Biol. Evol. 17:530–539.
Hillis, D. M. 1995. Approaches for assessing phylogenetic accuracy.
Syst. Biol. 44:3–16.
Hillis, D. M. 1996. Inferring complex phylogenies. Nature 383:130–131.
Hillis, D. M. 1998. Taxonomic sampling, phylogenetic accuracy, and
investigator bias. Syst. Biol. 47:3–8.
Hillis, D. M., J. P. Huelsenbeck, and C. W. Cunningham. 1994a. Application and accuracy of molecular phylogenies. Science 264:671–677.
Hillis, D. M., J. P. Huelsenbeck, and D. L. Swofford. 1994b. Hobgoblin
of phylogenetics? Nature 369:363–364.
Hillis, D. M., D. D. Pollock, J. A. McGuire, and D. J. Zwickl. 2003. Is
sparse taxon sampling a problem for phylogenetic inference? Syst.
Biol. 52:124–126.
Hillis, D. M., and J. J. Wiens. 2000. Molecules versus morphology in
systematics: Conflicts, artifacts, and misconceptions. Pages 1–19 in
Phylogenetic analysis of morphological data (J. J. Wiens, ed.). Smithsonian Institution Press, Washington, D.C.
Holmer, L. E., C. B. Skovsted, and A. Williams. 2002. A stem group brachiopod from the Lower Cambrian: Support for a Micrina (halkieriid)
ancestry. Palaeontology 45:875–882.
Hooge, M. D., P. A. Haye, S. Tyler, M. K. Litvaitis, and I. Kornfield.
2002. Molecular systematics of the Acoela (Acoelomorpha, Platyhelminthes) and its concordance with morphology. Mol. Phylogenet.
Evol. 24:333–342.
Janies, D. 2001. Phylogenetic relationships of extant echinoderm
classes. Can. J. Zool. 79:1232–1250.
Jeffery, C. H., R. B. Emlet, and D. T. J. Littlewood. 2003. Phylogeny and
evolution of developmental mode in temnopleurid echinoids. Mol.
Phylogenet. Evol. 28:99–118.
Jenner, R. A. 2001. Bilaterian phylogeny and uncritical recycling of
morphological data sets. Syst. Biol. 50:730–742.
Jenner, R. A. 2002. Boolean logic and character state identity: Pitfalls of
character coding in metazoan cladistics. Contrib. Zool. 71:67–91.
Jenner, R. A. In press. The scientific status of metazoan cladistics: Why
current research practice must change. Zool. Scr.
Jenner, R. A. 2004. Towards a phylogeny of the Metazoa: Evaluating alternative phylogenetic positions of Platyhelminthes, Nemertea, and
Gnathostomulida, with a critical reappraisal of cladistic characters.
Contrib. Zool. 73:3–163.
Jenner, R. A., and F. R. Schram. 1999. The grand game of metazoan
phylogeny: Rules and strategies. Biol. Rev. 74:121–142.
Kjer, K. M. 1995. Use of rRNA secondary structure in phylogenetic
studies to identify homologous positions: An example of alignment
and data presentation from the frogs. Mol. Phylogenet. Evol. 4:314–
330.
Klass, K. D. 2001. Morphological evidence on blattarian phylogeny:
”Phylogenetic histories and stories” (Insecta, Dictyoptera). Dtsch.
Entomol. Z. 48:223–265.
Klompen, J. S. H., W. C. Black IV, J. E. Keirans, and D. E. Norris. 2000.
Systematics and biogeography of hard ticks, a total evidence approach. Cladistics 16:79–102.
Kluge, A. G. 1997. Testability and the refutation and corroboration of
cladistic hypotheses. Cladistics 13:81–96.
Kluge, A. G. 1998. Sophisticated falsification and research cycles: Consequences for differential character weighting in phylogenetic systematics. Zool. Scr. 26:349–360.
Kluge, A. G. 2002. Distinguishing ”or” from ”and” and the case for
historical identification. Cladistics 18:585–593.
Kress, W. J., L. M. Prince, W. J. Hahn, and E. A. Zimmer. 2001. Unraveling the evolutionary radiation of the families of the Zingiberales
using morphological and molecular evidence. Syst. Biol. 50:926–
944.
341
Lee, M. S. Y. 1995. Historical burden in systematics and the interrelationships of ”parareptiles.” Biol. Rev. 70:459–547.
Lee, M. S. Y. 2001. Unalignable sequences and molecular evolution.
Trends Ecol. Evol. 16:681–685.
Lee, M. S. Y., and J. D. Scanlon. 2002. Snake phylogeny based on osteology, soft anatomy and ecology. Biol. Rev. 77:333–401.
Lewis, P. O. 2001. A likelihood approach to inferring phylogeny from
discrete morphological characters. Syst. Biol. 50:913–925.
Littlewood, D. T. J., K. Rohde, and K. A. Clough. 1999. The interrelationships of all major groups of Platyhelminthes: Phylogenetic evidence
from morphology and molecules. Biol. J. Linn. Soc. 66:75–114.
Lockhart, P. J., D. Huson, U. Maier, M. J. Fraunholz, Y. Van de Peer,
A. C. Barbrook, C. J. Howe, and M. A. Steel. 2000. How molecules
evolve in Eubacteria. Mol. Biol. Evol. 17:835–838.
Lopez, P., D. Casane, and H. Philippe. 2002. Heterotachy, an important
process of protein evolution. Mol. Biol. Evol. 19:1–7.
Machado, C. A., and J. Hey. 2003. The causes of phylogenetic conflict
in a classic Drosophila species group. Proc. R. Soc. Lond. B 270:1193–
1202.
Maddison, W. P. 1997. Gene trees in species trees. Syst. Biol. 46:523–
536.
Mallatt, J., and J. Y. Chen. 2003. Fossil sister group of craniates: Predicted
and found. J. Morphol. 258:1–31.
Mattern, M. Y., and D. A. McLennan. 2000. Phylogeny and speciation
of felids. Cladistics 16:232–253.
Meier, R., and R. Baker. 2002. A cladistic analysis of Diopsidae (Diptera)
based on morphological and DNA sequence data. Insect Syst. Evol.
33:325–336.
Miyamoto, M. M., and W. M. Fitch. 1995. Testing the covarion hypothesis of molecular evolution. Mol. Biol. Evol. 12:503–513.
Murrell, A., N. J. H. Campell, and S. C. Barker. 2001. A total-evidence
phylogeny of ticks provides insights into the evolution of life cycles
and biogeography. Mol. Phylogenet. Evol. 21:244–258.
Naylor, G. J. P., and W. M. Brown. 1998. Amphioxus mitochondrial DNA,
chordate phylogeny, and the limits of inference based on comparisons of sequences. Syst. Biol. 47:61–76.
Patterson, C. 1982. Morphological characters and homology. Pages 21–
74 in Problems of phylogenetic reconstruction (K. A. Joysey and A.
E. Friday, eds.). Academic Press, London.
Pennisi, E. 2003. Modernizing the tree of life. Science 300:1692–1697.
Peterson, K. J., and D. J. Eernisse. 2001. Animal phylogeny and the
ancestry of bilaterians: Inferences from morphology and 18S rDNA
gene sequences. Evol. Dev. 3:170–205.
Remsen, J., and P. O’Grady. 2002. Phylogeny of Drosophilinae (Diptera:
Drosophilidae), with comments on combined analysis and character
support. Mol. Phylogenet. Evol. 24:249–264.
Rieppel, O., and M. Kearney. 2002. Similarity. Biol. J. Linn. Soc. 75:59–82.
Rokas, A., and P. W. H. Holland. 2000. Rare genomic changes as a tool
for phylogenetics. Trends Ecol. Evol. 15:454–459.
Rokas, A., B. L. Williams, N. King, and S. B. Carroll. 2003. Genome-scale
approaches to resolving incongruence in molecular phylogenies. Nature 425:798–804.
Rosenberg, M. S., and S. Kumar. 2003. Taxon sampling, bioinformatics,
and phylogenomics. Syst. Biol. 52:119–124.
Ruta, M., M. I. Coates, and D. L. J. Quicke. 2003. Early tetrapod relationships revisited. Biol. Rev. 78:251–345.
Rydin, C., M. Källersjö, and E. M. Friis. 2002. Seed plant relationships
and the systematic position of Gnetales based on nulcear and chloroplast DNA: Conflicting data, rooting problems, and the monophyly
of conifers. Int. J. Plant Sci. 163:197–214.
Sanderson, M. J. 1995. Objections to bootstrapping phylogenies: A critique. Syst. Biol. 44:299–320.
Schander, C., and P. Sundberg. 2001. Useful characters in gastropod
phylogeny: Soft information or hard facts? Syst. Biol. 50:136–141.
Schulmeister, S. 2003. Simultaneous analysis of basal Hymenoptera
(Insecta): Introducing robuts-choice sensitivity analysis. Biol. J. Linn.
Soc. 79:245–275.
Scotland, R. W., R. G. Olmstead, and J. R. Bennett. 2003. Phylogeny
reconstruction: The role of morphology. Syst. Biol. 52:539–548.
Shaw, K. L. 2002. Conflict between nuclear and mitochondrial DNA
phylogenies of a recent species radiation: What mtDNA reveals and
conceals about modes of speciation in Hawaiian crickets. Proc. Natl.
Acad. Sci. USA 99:16122–16127.
342
SYSTEMATIC BIOLOGY
Siddall, M. E. 1998. Success of parsimony in the four-taxon case: Longbranch repulsion by likelihood in the Farris zone. Cladistics 14:209–
220.
Siddall, M. E. 2002. Measures of support. Pages 80–101 in Techniques
in molecular systematics and evolution (R. DeSalle, G. Giribet, and
W. Wheeler, eds.). Birkhäuser Verlag, Basel, Switzerland.
Siddall, M. E., and A. G. Kluge. 1997. Probabilism and phylogenetic
inference. Cladistics 13:313–336.
Simmons, M. P., H. Ochoterena, and J. V. Freudenstein. 2002a. Amino
acid vs. nucleotide characters: Challenging preconceived notions.
Mol. Phylogenet. Evol. 24:78–90.
Simmons, M. P., H. Ochoterena, and J. V. Freudenstein. 2002b. Conflict
between amino acid and nucleotide characters. Cladistics 18:200–206.
Simmons, N. B., and J. H. Geisler. 2002. Sensitivity analysis of different methods of coding taxonomic polymorphism: An example from
higher-level bat phylogeny. Cladistics 18:571–584.
Sørensen, M. V. 2002. On the evolution and morphology of the rotiferan
trophi, with a cladistic analysis of Rotifera. J. Zool. Syst. Evol. Res.
40:129–154.
Sorhannus, U. 2001. A ”total evidence” analysis of the phylogenetic
relationships among the photosynthetic stramenopiles. Cladistics
17:227–241.
Sota, T., and A. P. Vogler. 2001. Incongruence of mitochondrial and nuclear gene trees in the carabid beetles Ohomopterus. Syst. Biol. 50:39–
59.
Stach, T., and J. M. Turbeville. 2002. Phylogeny of the Tunicata inferred from molecular and morphological characters. Mol. Phylogenet. Evol. 25:408–428.
Strong, E. E. 2003. Refining molluscan characters: Morphology, character coding and a phylogeny of the Caenogastropoda. Zool. J. Linn.
Soc. 137:447–554.
Sullivan, J., and D. L. Swofford. 2001. Should we use model-based
methods for phylogenetic inference when we know that assumptions
about among-site rate variation and nucleotide substitution pattern
are violated? Syst. Biol. 50:723–729.
Swofford, D. L., G. J. Olsen, P. J. Waddell, and D. M. Hillis. 1996. Phylogenetic inference. Pages 407–514 in Molecular systematics, 2nd
edition (D. M. Hillis, C. Moritz, and B. K. Mable, eds.). Sinauer,
Sunderland, Massachusetts.
Swofford, D. L., P. J. Waddell, J. P. Huelsenbeck, P. G. Foster, P. O.
Lewis, and J. S. Rodgers. 2001. Bias in phylogenetic estimation and its
VOL. 53
relevance to the choice between parsimony and likelihood methods.
Syst. Biol. 50:525–539.
Taggart, T. W., B. I. Crother, and M. E. White. 2001. Palm-pitviper (Bothriechis) phylogeny, mtDNA, and consilience. Cladistics 17:355–370.
Tautz, D., P. Arctander, A. Minelli, R. H. Thomas, and A. P. Vogler. 2003.
A plea for DNA taxonomy. Trends Ecol. Evol. 18:70–74.
Vidal, N., and S. B. Hedges. 2002. Higher-level relationships of snakes
inferred from four nuclear and mitochondrial genes. C. R. Biol.
325:977–985.
Wägele, J.-W., B. Holland, H. Dreyer, and B. Hackethal. 2003. Searching
factors causing implausible non-monophyly: ssu rDNA phylogeny
of Isopoda Asellota (Crustacea: Peracarida) and faster evolution in
marine than in freshwater habitats. Mol. Phylogenet. Evol. 28:536–
551.
Wahlberg, N., and S. Nylin. 2003. Morphology versus molecules: Resolution of the positions of Nymphalis, Polygonia, and related genera
(Lepidoptera: Nymphalidae). Cladistics 19:213–223.
Wheeler, W. C., M. Whiting, Q. D. Wheeler, and J. M. Carpenter. 2001.
The phylogeny of the extant hexapod orders. Cladistics 17:113–169.
Wiens, J. J. 2003. Missing data, incomplete taxa, and phylogenetic accuracy. syst. Biol. 52:528–538.
Wiens, J. J., P. T. Chippindale, and D. M. Hillis. 2003. When are phylogenetic analyses misled by convergence? A case study in Texas cave
salamanders. Syst. Biol. 52:501–514.
Wiens, J. J., and B. D. Hollingsworth. 2000. War of the iguanas: Conflicting molecular and morphological phylogenies and long-branch
attraction in iguanid lizards. Syst. Biol. 49:143–159.
Wills, M. A., D. E. G. Briggs, R. A. Fortey, M. Wilkinson, and P. H. A.
Sneath. 1998. An arthropod phylogeny based on fossil and recent
taxa. Pages 33–105 in Arthropod fossils and phylogeny (G. D. Edgecombe, ed.). Columbia Univ. Press, New York.
Winterton, S. L., L. Yang, B. M. Wiegmann, and D. K. Yeates. 2001.
Phylogenetic revision of Agapophytinae subf.n. (Diptera: Therevidae) based on molecular and morphological evidence. Syst. Entomol.
26:173–211.
Xia, X., Z. Xie, and K. M. Kjer. 2003. 18S Ribosomal RNA and tetrapod
phylogeny. Syst. Biol. 52:283–295.
First submitted 31 July 2003; reviews returned 5 October 2003;
final acceptance 26 November 2003
Associate Editor: Tim Collins
Syst. Biol. 53(2):342–355, 2004
c Society of Systematic Biologists
Copyright ISSN: 1063-5157 print / 1076-836X online
DOI: 10.1080/10635150490423971
Inconsistencies in Arguments for the Supertree Approach: Supermatrices versus
Supertrees of Crocodylia
J OHN G ATESY,1 R ICHARD H. B AKER,2 AND CHERYL HAYASHI 1
1
2
Department of Biology, University of California, Riverside, California 92521, USA; E-mail: [email protected] (J.G.)
Evolutionary Genomics Department, DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, California 94598, USA
Multiple data sets are now available for examining the
phylogenetic relationships of certain well-studied taxa.
Integrating this information into unified statements of
evolutionary history is a major challenge in the field of
systematics, with supertree and supermatrix methods
emerging as competing strategies for this task. In the supermatrix approach, all relevant character data are joined
into a single data set, and a total evidence analysis of this
large matrix is used to test competing phylogenetic hypotheses (Kluge, 1989). In supertree methods, characters
first are partitioned into separate data sets, these data
sets are analyzed individually, and source trees derived
from the preliminary analyses then are combined to yield
large synthetic trees (Sanderson et al., 1998).
Gatesy et al. (2002) highlighted some problems with
previously published supertrees (Purvis, 1995; BinindaEmonds et al., 1999; Jones et al., 2002) in the context
of a recent supertree analysis of mammals (Liu et al.,
2001): (1) the supertree data set of Liu et al. included
a variety of unnecessary appeals to authority, such as
2004
POINTS OF VIEW
assumptions of monophyly and poorly justified source
phylogenies; (2) this supertree data set also included extensive duplications of data; (3) the supertree method utilized by Liu et al., matrix representation with parsimony
(MRP; Baum, 1992; Ragan, 1992), ignores/misinterprets
hidden character support in different source data sets;
(4) these issues seem to explain much of the topological
incongruence between the trees presented by Liu et al.
and previous supermatrix results (e.g., Matthee et al.,
2001); (5) other published mammalian supertree analyses (Purvis, 1995; Bininda-Emonds et al., 1999; Jones
et al., 2002) also are characterized by these same problems; and (6) such supertrees are not efficient summaries of previous systematic work and should not be
the basis for critical tests of evolutionary hypotheses.
Many of the potential difficulties with supertree analysis listed above had been noted previously by proponents of the supertree method (e.g., Purvis, 1995;
Bininda-Emonds and Bryant, 1998; Sanderson et al.,
1998; Bininda-Emonds et al., 1999; Bininda-Emonds and
Sanderson, 2001; Jones et al., 2002; Pisani and Wilkinson,
2002), but these problems unfortunately characterize the
great majority of published supertrees, up to and including those in the most current studies (e.g., Kennedy
and Page, 2002; Pisani et al., 2002; Salamin et al.,
2002).
Bininda-Emonds et al. (2003) published a response
to Gatesy et al. (2002) that defended the supertree approach and outlined a series of corrections to the problems summarized above. Here, we comment on this
response within the context of a comprehensive supertree/supermatrix analysis of Crocodylia (alligators,
caiman, crocodiles, and gavials). We use an empirical
comparison of supertree and supermatrix approaches
for identical sets of characters and taxa to highlight basic differences between the two competing methods, and
suggest that the solutions presented by Bininda-Emonds
et al. (2003) do not correct for deficiencies in previous implementations of supertree procedures.
S UPERMATRICES AND S UPERTREES OF CROCODYLIA
Crocodylia, the relictual sister group of Aves, is composed of ∼22 extant species and their close fossil relatives. The majority of generic diversity is extinct, and
many extant crocodylian species are highly endangered (Brochu, 2003). A comprehensive phylogeny of
Crocodylia would provide a useful framework for estimation of conservation priorities and for synthesis of
comparative data in this group. Supermatrix analyses
of Crocodylia have been completed (Poe, 1996; Brochu,
1997; Brochu and Densmore, 2001; Gatesy et al., 2003),
but a supertree study has not been attempted. We used
published data and 41 new sequences from four nuclear
genes (∼2,000 base pairs [bp] of recombination activating protein-1 [RAG-1], ∼680 bp of ATP7A, ∼560 bp of
brain-derived neurotrophic factor [BDNF], and ∼635 bp
of lactate dehydrogenase-A [LDH-A]) to construct MRP
supertrees and more comprehensive supermatrix topologies for Crocodylia.
343
Methods
Crocodylian DNA samples were those utilized by
Gatesy et al. (2003). New data for exons from RAG-1,
ATP7A, and BDNF (see Fig. 2) were amplified and sequenced using the PCR methods and primers described
by Gatesy et al. (2003) and Murphy et al. (2001). New
data for intron 7 and exon 8 of LDH-A (see Fig. 2) were
amplified by PCR and sequenced using the primers
LAI7 F1 (5 -TGGCTGAAACTGTTATGAAGAACC-3 )
and LAI7 R1 (5 -TGGATTCCCCAAAGTGTATCTG-3 ).
The PCR conditions were 94◦ for 1 min, 53◦ for 1 min,
and 72◦ for 1 min for 50 cycles.
Data from the literature and the NCBI database and
new DNA sequences were organized into 17 characterbased data sets (Figs. 1–3). These included morphological traits, sections of five nuclear genes, five fragments
of mitochondrial DNA (mtDNA), two restriction fragment length polymorphism (RFLP) matrices, two allozyme data sets, chromosomal morphology, and nest
type (Cohen and Gans, 1970; Greer, 1970; Campbell, 1972;
Densmore, 1983; Desjardins and Morais, 1990; Densmore
and White, 1991; Gatesy and Amato, 1992; Brooks and
McLennan, 1993; Gatesy et al., 1993, 2003, Poe, 1996;
Brochu, 1997, 2002, 2003, in prep.; Harlid et al., 1997;
Janke and Arnason, 1997; Caccone et al., 1999; Groth and
Barrowclough, 1999; Mindell et al., 1999; Brochu and
Gingerich, 2000; Roman and Bowen, 2000; Janke et al.,
2001; White and Densmore, 2001; Ray and Densmore,
2002; Harshman et al., 2003; Wu et al., unpubl., Zhu
et al., unpubl.). The last two data sets included only
three traits and one trait, respectively. However, because
the chromosomal and nest characters did not fit logically into any of the other character sets, they were
each considered separate data sets. DNA sequence alignments were either published (White and Densmore,
2001; Gatesy et al., 2003; Harshman et al., 2003) or implied optimization alignments derived from analyses in
POY (Gladstein and Wheeler, 1997) using the following
commands: oneasis, random 10, slop 5, checkslop 10,
molecularmatrix, impliedalignment, indices, stats, time,
spewbinary, trailinggap 3, internal gaps weighted 3, and
substitutions weighted 2. For alignments of nuclear loci,
some adjacent gaps in POY output were consolidated.
Four data sets that were “uncombinable” with character
data also were considered (Fig. 2). These were published
phenograms based on globin peptide differences and albumin immunological distances (Densmore, 1983) and
two Brooks parsimony analysis trees based on the coevolution of crocodylian parasites (Brooks and McLennan,
1993). Delimitation of data sets loosely followed the protocol outlined by Bininda-Emonds et al. (2003, in press)
for partitioning data in supertree analyses.
Supermatrix construction was performed as described
by Gatesy et al. (2003). Morphological character evidence
was included for extinct crocodyliform taxa (61 operational taxonomic units [OTUs]), and morphological,
molecular, and behavioral character data were incorporated for extant crocodylian taxa (22 OTUs). Outgroups
were exemplars from Testudines (turtles) and Aves
(birds), and monophyly of the ingroup, Crocodyliformes
344
SYSTEMATIC BIOLOGY
VOL. 53
FIGURE 1. Strict consensus tree of minimum length topologies for the morphological data set of 164 parsimony-informative characters. The
number of optimal trees and minimum tree length are indicated. Thick shaded branches connect the 23 extant taxa; all other taxa are extinct.
Dark shaded circles at nodes mark clades that were inconsistent with the strict consensus of optimal topologies for the supermatrix data set of
all character data (see Fig. 3). Subgroups of Crocodylia, as defined by Brochu (2003; see Gatesy et al., 2003), are delimited by brackets to the right
of species names. The strict consensus was rooted with Paleognathae (Aves).
2004
POINTS OF VIEW
345
FIGURE 2. Strict consensus trees derived from minimum-length topologies for 16 character-based data sets of extant crocodylians and four
published topologies based on “uncombinable” data. The number of optimal trees and minimum tree length for each of the character-based
data sets are indicated. The “uncombinable” data sets are either phenetic or based on coevolutionary studies of parasites. Dark shaded circles at
nodes mark groupings that were inconsistent with the strict consensus of optimal topologies for the supermatrix data set (see Fig. 3). Trees with
outgroup taxa were rooted with Testudines or Aves (Neognathae + Paleognathae). Topologies that lacked these taxa were drawn to maximize
congruence with the supermatrix topology (Fig. 3). Genera are abbreviated: C. = Caiman; M. = Melanosuchus; P. = Paleosuchus; A. = Alligator;
Cr. = Crocodylus; O. = Osteolaemus; T. = Tomistoma; G. = Gavialis. Allozymes “A” are allozymes coded for alligatoroids, and allozymes “C” are
the same allozymes coded for nonalligatoroid crocodylians. All data for LDH-A and ATP7A were sequenced for this study. The RAG-1 sequence
for Testudines (Geochelone yniphora) and all BDNF sequences except that for Alligator mississippiensis also are new with this study.
346
SYSTEMATIC BIOLOGY
VOL. 53
FIGURE 3. Strict consensus tree of minimum-length topologies for the supermatrix of all character data (1,464 informative characters; 6,924
total characters). The number of optimal trees and minimum tree length are indicated. Characters in the supermatrix are listed across the top
of the figure. Two segments each of RAG-1 and cytochrome b have been sequenced, and these are denoted as A and B. Open circles to the left
of taxon names indicate characters sampled for each taxon, and the large shaded box shows missing data. Open circles at nodes indicate novel
clades that were supported by the supermatrix analysis but were not implied by any combination of source trees from the separate analyses of
individual data sets (Figs. 1, 2). Dark shaded circles at nodes indicate clades that were not stable to exclusion of the unmapped RFLP character
data, which might not be logically independent (see discussion by Gatesy et al., 2003). Subgroups of Crocodylia to the right of species names
were defined as in Figure 1, but because the clades have phylogenetic definitions (Brochu, 2003), the content of these taxa differed from that in
Figure 1. The strict consenus was rooted with Testudines. The supermatrix data set (SuperMatrixCroc.dat) is available on the Systematic Biology
website.
2004
347
POINTS OF VIEW
(see Brochu, 2003), was assumed in all phylogenetic analyses. The parsimony search of the supermatrix in PAUP∗
4.0b10 (Swofford, 1998) was heuristic, with all characters
unordered, equal weighting of transformations, indels
treated as missing data, 100 random taxon addition replicates, tree bisection–reconnection branch swapping, and
the “amb-” option in effect. A strict consensus tree was
derived from all optimal topologies. Misrepresentation
of homologous sites is unavoidable in restriction fragment data that are not mapped (see Swofford and Olsen,
1990). Therefore, an additional search was executed to
determine the stability of systematic results to exclusion
of the unmapped restriction fragment data (Densmore
and White, 1991).
The same taxa and characters in the supermatrix were
encoded into an MRP supertree data set. In the MRP
framework, individual characters are not directly interpreted as phylogenetic evidence. Instead, topologies
supported by different data sets (source trees) are encoded into a matrix and used to reconstruct the evolutionary history of a particular group (Baum, 1992; Ragan,
1992). Separate parsimony analyses were conducted for
the 17 character-based data sets. A strict consensus tree
was derived from optimal topologies for each data set,
and each resolved group (Figs. 1, 2) was coded as a separate nodal character in the MRP supertree data set. Published topologies for the four “uncombinable” data sets
also were encoded into the MRP supertree data set of
86 OTUs. To facilitate comparison with the supermatrix analysis, the MRP matrix was first analyzed with
the “uncombinable” data sets excluded. PAUP∗ searches
were conducted under the same conditions, with equal
weighting of all nodal characters.
Results
The supermatrix analysis produced 720 optimal
topologies (minimum length = 4,272) and a wellresolved strict consensus (Fig. 3); when individual gaps
were treated as informative characters, the same set of
trees was obtained. No groups supported by the present
analysis conflicted with the previous, more limited supermatrix analysis of Gatesy et al. (2003). Incorporation
of four nuclear loci, two segments of mt DNA, RFLPs, allozymes, and 16 new ingroup taxa did not alter the basic
structure of the crocodylian tree; relationships among extant species were completely consistent with those presented by Gatesy et al. (2003). Overall, 11 groups that
emerged in the supermatrix analysis (Fig. 3) were not implied by any combination of trees supported by separate
analyses of the 17 data sets in the supermatrix (Figs. 1,
2). Such novel clades are an expected outcome of total
evidence analyses (Barrett et al., 1991).
The MRP supertree analysis of the character-based
data sets yielded 103,633 optimal topologies with a minimum length of 216 steps (Fig. 4). Because the placements
of many extinct taxa were not stable, basal relationships
of Crocodylia were poorly resolved in the strict consensus of minimum-length trees. Five more nodes were resolved in the supermatrix search relative to the MRP supertree search, and six clades supported by the supertree
data set were strictly incongruent with the supermatrix
analysis of the same data (Figs. 3, 4). Some of the best trees
for the supertree data set were very poor fits to the supermatrix. For example, the topology shown in Figure 4,
which was optimal according to the MRP supertree analysis, included 18 nodes that strictly conflicted with supermatrix results and demanded an extra 43 steps beyond the minimum length from the supermatrix data set
(Fig. 3). This difference in number of steps was significant
according to the Wilcoxon signed-rank test of Templeton
(1983), with P ≤ 0.0001. Simply constraining the supermatrix to retain all groups strictly supported by the supertree analysis resulted in a cost of 21 extra steps beyond minimum length for the supermatrix (4,293 steps).
A comparison of minimum-length trees with and without this supertree constraint showed that the difference
in character steps was significant (P ≤ 0.0001). Unlike
the supermatrix analysis, no emergent novel clades were
strictly supported by the MRP supertree data set.
I NCONSISTENCIES IN ARGUMENTS FOR S UPERTREES
Several arguments in defense of the approach used in
previous supertree analyses of mammals have been published recently (Bininda-Emonds and Sanderson, 2001;
Bininda-Emonds et al., 2002, 2003, in press). Within
the context of our supertree/supermatrix analyses of
Crocodylia, we suggest that many of these rationalizations are poorly justified.
Unambiguous Data Duplication
Springer and DeJong (2001), Gatesy et al. (2002),
and Gatesy and Springer (in press) noted that many
published MRP supertrees of mammals (Purvis, 1995;
Bininda-Emonds et al., 1999, Liu et al., 2001; Jones et al.,
2002) included unequivocal duplications of data, i.e., redundant sampling of the same characters for the same
taxa in the same data matrix. Most published supertrees
include clear-cut duplications of primary character evidence, the ultimate source data for both supertree and
supermatrix analyses. However, any unambiguous duplication of characters in a data matrix is problematic
because it entails arbitrary and unjustified character
weighting. Even if duplication of data had no effect on inferred relationships, it would still not be justified because
it would surely influence interpretations of support for
competing topologies.
Bininda-Emonds et al. (2003) partly dismissed the difficulty of data duplication in past supertree analyses by
arguing that supermatrix analyses suffer from similar
problems. They noted (2003:725):
Clearly, data duplication exists [in previously published supertrees].
This duplication violates a key assumption of phylogenetic analysis,
namely that the source data are independent. However, this assumption is routinely violated in analyses based on primary character
data. For instance, several characters are often described for a single
morphological structure. In molecular studies, secondary structure
(e.g., stem regions in tRNAs, protein folding) or codon position in
coding DNA mean that nonindependent compensatory mutations
may accompany primary ones. The combination of phenotypic and
genotypic data (i.e., morphological and molecular, respectively) in
348
SYSTEMATIC BIOLOGY
VOL. 53
FIGURE 4. Strict consensus tree of minimum-length topologies for the MRP supertree data set of 190 informative nodal characters that
included all character-based source trees (left), and one of the minimum-length trees for this MRP matrix that is significantly incongruent with
trees supported by the supermatrix (right). The number of optimal trees and the minimum tree length are indicated. For the topology on the right,
tree length for the supermatrix of all character data (see Fig. 3) also is shown. Dark shaded circles at nodes mark clades that were inconsistent
with the strict consensus for this supermatrix data set. Open circles at nodes indicate novel clades that were not supported by supermatrix
analysis (Fig. 3) and were not implied by any combination of source trees from the separate analyses of individual data sets (Figs. 1, 2). When
the five mtDNA data sets were merged and considered one data set, MRP supertree analysis strictly supported slightly different relationships
within Crocodylinae. Crocodylus cataphractus grouped with other extant Crocodylus and C. palaeindicus to the exclusion of Osteolaemus and all
other crocodylines. Furthermore, C. rhombifer and C. moreletii formed a monophyletic group. When the four “uncombinable” source trees (see
Fig. 2) were included in the analysis, relationships were consistent with the strict consensus shown; the basal polytomy among major crocodylian
taxa remained. However, the immunological data set for albumin is partially redundant with the allozyme data set, which also summarized
molecular variation in albumin. Subgroups of Crocodylia are to the right of species names, and trees were rooted with Testudines. The MRP
supertree data set (SuperTreeCroc.dat) is available an the Systematic Biology website.
a supermatrix approach must represent duplication at some level.
Thus, issues of data duplication and nonindependence are not limited to the supertree approach.
This reasoning is not adequate to justify data duplication in supertree analysis for three reasons. First, the unambiguous duplication of evidence in many published
supertrees (e.g., Purvis, 1995; Bininda-Emonds et al.,
1999; Liu et al., 2001; Jones et al., 2002) is quite different
from the potential phylogenetic correlation of characters
in a supermatrix analysis. The former is an unequivocal,
arbitrary redundancy that is introduced by the investigator. Phylogenetic correlations among characters are not
so easily observed and can be detected only through detailed systematic analysis. For example, replication of the
mitochondrial 12S ribosomal DNA (rDNA) sequences in
our crocodylian supermatrix analysis would arbitrarily
increase the impact of this data set on the final hypothesis
of relationships. This step clearly would be unwarranted
and avoidable. In contrast, phylogenetic character correlation is not trivial to detect and accommodate into the
tree building process (see Yang, 1997; Wollenberg and
2004
POINTS OF VIEW
Atchley, 2000; O’Keefe and Wagner, 2001). For 12S rDNA,
interdependence among sites in the stem regions of this
gene may vary widely from one site to another site, over
time, and among lineages (Kraus et al., 1992; Gatesy et al.,
1994).
Second, the unambiguous duplication of evidence in
many published supertrees (e.g., Purvis, 1995; BinindaEmonds et al., 1999; Liu et al., 2001; Jones et al., 2002)
also is quite different from the potential redundancy between some genotypic and phenotypic characters in a
supermatrix analysis. The former is an unequivocal redundancy. By contrast, it generally is not obvious when
specific phenotypic characters are redundant with particular genotypic characters in an analysis.
Third, the problem of clear-cut data duplication that is
evident in most published supertrees exists in addition
to rather than instead of any nonindependence among
primary character data and any redundancies between
phenotypic and genotypic characters. Because the source
trees used in supertree analysis are generated from primary character data, supertrees are affected by the same
character correlations and redundancies that are present
in supermatrix analysis. The explicit data duplications
in many published supertree analyses simply add an
extra, more egregious form of nonindependence among
characters. Such redundancies are not justified and are
easily avoided. For example, our character-based MRP
supertree data set for Crocodylia had no clear-cut duplications of evidence (see Fig. 4), and other published supertree analyses also have avoided redundancy (Daubin
et al., 2001).
Definition of Pseudoindependent Phylogenetic Hypotheses
In both supertree and supermatrix analysis, primary
character data (hypotheses of homology) are critically
important; the definition of characters and character
states by the systematist determines final systematic results (Patterson, 1982). However, in supertree analysis,
delimitation of data sets also is crucial. Contrasting definitions of data set have the potential to influence systematic results in the supertree framework but not in the
supermatrix approach. For example, when all mtDNA
sequences were considered one data set, as opposed to
the five data sets outlined in our MRP supertree analysis
of Crocodylia, different phylogenetic conclusions were
obtained (see Fig. 4).
In recent publications, Bininda-Emonds et al. (2003,
in press) outlined an “explicitly defined set of rules” to
identify “phylogenetic hypotheses that can reasonably
be viewed as being independent” (2003:725) and therefore appropriate for supertree analysis.
1. “Delimiting pseudo-independent evolutionary ‘packets’ based on genes may be defensible given
the recognition of the gene tree/species tree dichotomy. . . . Non-overlapping data sets (e.g. different genes) are considered to be independent, even if
they appear on a single heritable unit like mt DNA”
(2003:725).
349
2. “Different portions of the same gene are not independent for an overlapping set of taxa, even if these gene
portions do not overlap at all” (2003:725).
3. “We also hold unique combinations of genes to be independent sources and independent from data sets
containing subsets of all the genes in the combination”
(in press).
4. “Different morphological data sets are equivalent to
novel combinations of genes and so are considered to
be independent of one another unless one data set is
contained completely within another” (in press).
5. “Trees for non-overlapping taxon sets, even if they are
derived from the same set of characters, are independent by practical necessity” (2003:725).
These five rules allow for extensive duplications of
character data in supertree analyses and are not logically
consistent for the reasons listed below.
The recognition that a given gene tree may differ from
the species tree is not sufficient justification for asserting the primacy of genes as independent evolutionary
units. In any particular case study, there could be substantial independence among characters within genes
as well as nonindependence among genes. This point
has been made by some of the authors themselves, who
stated (Bininda-Emonds et al., 2002:284),
The nonindependence of molecular data owing to linkage associations between genes is often underestimated. For example, it is often
argued that mt DNA with its many genes constitutes a single phylogenetic data source because it forms a single heritable unit that is
not normally subject to recombination.
This view is incompatible with rule 1.
Rules 1 and 3 are inconsistent with each other. Rule
1 states that separate genes are the independent units
of analysis, but rule 3 states that the same gene can be
included multiple times in the same analysis because
of novel phylogenetic signals that might emerge when
different genes are analyzed together. These conflicting
rules make the protocol for defining independent phylogenetic hypotheses arbitrary and prone to extensive data
duplication. If all novel combinations of genes were allowed into supertree analysis, it would have been acceptable for us to include the combined molecular tree from
Gatesy et al. (2003) in our crocodylian MRP supertree
analysis. The tree from Gatesy et al. (2003) overlaps with
∼3,000 nucleotides and four genes in our supertree data
set. We contend that such duplications of genetic data
are not justified in phylogenetic analysis.
According to our view of systematics, it is assumed
that each character state in a morphological data set is
encoded by at least one independent genetic difference.
Thus, by proxy many “genes” are included in a single
morphological data set. The primacy of genes as the independent units of analysis in supertree studies (rule 1)
is, therefore, not compatible with the fact that morphological matrices also were considered independent units
by Bininda-Emonds et al. (2003, in press).
Rule 4 permits extensive duplications of morphological characters. For example, Brochu (1997) analyzed a
350
SYSTEMATIC BIOLOGY
VOL. 53
FIGURE 5. Three minimum-length topologies based on a fragment of the RAG-1 gene (∼2,000 bp). There is no overlap in taxa between trees a
and b. Tree c includes all of the taxa in trees a and b. Thirteen identical nucleotide substitutions unequivocally supported the grouping of Caiman
+ Alligator in each of these trees. These changes were at the following positions: 55, C → T; 310 C → T; 311, A → G; 445, G → A; 467, C → T; 593,
A → C; 706, A → G; 940, T → C; 1370, G → A; 1691, T → C; 1741, A → G; 1891, G → A; 1906, A → G. Despite the lack of taxonomic redundancy
in trees a and b, there is redundancy in the character data.
morphological data set of 164 characters for 61 crocodyliform taxa. In our supertree analysis, we used a recently
revised version of Brochu’s data set that included 164
morphological characters for 83 crocodyliform taxa. The
data sets shared 162 characters and 60 taxa in common,
so each data set had some unique characters and unique
taxa. According to rule 4, these grossly overlapping data
sets are pseudoindependent, and it would be valid to include each source tree into the same supertree analysis
because neither data set “is contained completely within
another” (Bininda-Emonds et al., in press). Morphological data sets for Crocodylia with slightly less overlap in
characters (e.g., Norell, 1989; Wu et al., 1996; Buscalioni
et al., 2001) also would be considered pseudoindependent according to rule 4.
Together, rules 3 and 4 basically allow any set of characters to be independent from all other data sets, as
long as rule 2 is not broken. By our calculation, if these
rules were taken literally, minimally eight previously
published analyses that included 12S rDNA data for
crocodylians would be considered independent source
trees (Poe, 1996; Brochu, 1997; Brochu and Densmore,
2001; Gatesy et al., 2003, this study). We do not think it is
prudent to arbitrarily octuplicate characters in systematic studies.
Data sets for nonoverlapping taxon sets, even if they
are derived from the same set of characters, are considered independent by practical necessity (rule 5), but
such nonoverlapping taxon sets can include duplications of character evidence. For example, a source tree
for the nuclear gene RAG-1 that included only Caiman
crocodilus + Alligator mississippiensis + Crocodylus rhombifer + Paleognathae would be phylogenetically redundant with a source tree for RAG-1 that included only
Caiman latirostris + Alligator sinensis + Crocodylus cataphractus + Neognathae (see Fig. 5). These data sets have
no overlap of taxa, yet each matrix supports Caiman +
Alligator, and there is duplication of character evidence;
the same nucleotide substitutions in each of these two
data sets unequivocally support the same grouping of
genera (Fig. 5). Complete nonoverlap of taxa does not
guarantee avoidance of data duplication in supertree
analysis; this data duplication is not a practical necessity and is completely avoided in supermatrix analyses.
Source Data Quality, Quantity, and Availability
Gatesy et al. (2002) pointed out that some published supertrees included source trees based on dubious data (see
discussions by Purvis, 1995; Bininda-Emonds et al., 1999;
Springer and DeJong, 2001; Jones et al., 2002; Gatesy and
Springer, in press) and that incorporation of problematic source topologies makes these published supertrees
weak phylogenetic statements. Bininda-Emonds et al.
(2003:725) countered that data quality is a more general issue in systematics and is not specific to supertree
2004
POINTS OF VIEW
analysis: “The use of poor data may compromise the results in any phylogenetic analysis (i.e. including a supermatrix analysis), and researchers should ensure that all
data used are of the highest achievable quality.” We agree
with this statement but still think that in an effort to incorporate numerous source trees and to achieve broad taxonomic sampling many previous supertree analyses (e.g.,
Purvis, 1995; Bininda-Emonds et al., 1999; Liu et al., 2001;
Jones et al., 2002; Kennedy and Page, 2002) failed to exclude dubious data, especially in comparison with published supermatrix studies. For example, Liu et al. (2001)
employed source trees based on reviews by authorities,
Kennedy and Page (2002) utilized a morphological tree
that was not derived from an explicit data set/analysis,
and other supertree studies have included topologies
based on informal taxonomies (e.g., Jones et al., 2002).
Purvis (1995) and Bininda-Emonds et al. (1999) downweighted suspect source trees in their mammalian supertree analyses and noted that this procedure had little
effect on topology. However, a more stringent test for
the effect of weak source trees is to give these topologies zero weight in the supertree analysis. When such
searches were executed by Purvis (1995), resolution in
his supertree for Primates decreased significantly. Thus,
many of the relationships in the preferred supertree of
Purvis (1995) had no basis in terms of explicit hypotheses
of homology and were the result of dubious source trees.
In the construction of our crocodylian supermatrix/supertree data sets, we were confronted by a number of data quality issues. For example, how should we
code ambiguous regions of sequence alignments for mitochondrial rDNA genes? Are the chromosomal traits
characterized in enough detail to warrant inclusion? Are
unmapped RFLPs valid evidence? Are outlier sequences
PCR contaminants (e.g., the unpublished Crocodylus porosus sequence for mitochondrial cytochrome b [NCBI
AF306452] apparently is a Homo sapiens sequence)? We
suspect that some researchers will disagree with our decisions regarding which data to include or exclude. However, each of these data quality issues was dealt with at
the character level, and we suggest that future supertree
studies also should engage in character analysis at this
fine scale instead of simply recording topologies reported
by other researchers. The supermatrix approach naturally facilitates the reassessment of primary character
data because these data actually are analyzed in this
framework.
Several authors have argued that an advantage of supertree analysis is that systematic data, which cannot
be combined into a supermatrix, can be analyzed in a
supertree analysis (e.g., Purvis, 1995; Sanderson et al.,
1998; Bininda-Emonds et al., 1999, 2002, 2003; Kennedy
and Page, 2002). However, among the wide range of
phylogenetic evidence available to modern systematists,
only coevolutionary parasite trees, reviews that are based
on the opinions of authorities, classifications that have
no clear empirical basis, DNA/DNA hybridization data
that do not distinguish paralogy from orthology (Marks
et al., 1988), immunological distances that are nearly
obsolete, and other vague phenetic measures such as
351
genomic fingerprints are excluded from supermatrices.
For our crocodylian supertree data set, inclusion of the
four “uncombinable” source trees (Fig. 2) had little effect
on the supertree topology. The polytomy at the base of
Crocodylia remained, and relationships generally were
consistent with the strict consensus tree shown in Figure
4. Exclusion of the vague uncombinable data from our
supermatrix did not seem like much of a loss. Overall, the
amount of available quality data that cannot be incorporated into a supermatrix is so small that this can hardly
serve as a major justification for the use of supertrees.
Proponents of supertrees have argued that extended
taxonomic coverage is one of the primary benefits of using source trees that are incompatible with supermatrix
analysis. For example, the use of taxonomies in supertree
analysis can yield comprehensive phylogenetic hypotheses that include all extant species in a group (e.g., Jones
et al., 2002), but the empirical basis for a particular taxonomy might not be clearly stated. Many classifications
do not result from explicit analyses of documented data
matrices and are based simply on the cumulative knowledge (or whims) of particular authorities (see Gatesy and
Springer, in press). In the 21st century, phylogenetic analyses of primary character data from hundreds of taxa are
quite feasible (e.g., Soltis et al., 1999). Therefore, we suggest that if no explicit character information exists for a
particular taxon, the most useful response from a systematist would be to collect new character data for that
taxon, not to include that taxon in a supertree analysis.
Until a framework of relationships based on actual character data is presented, broad comparative evolutionary
studies seem premature.
Hidden Support, Novel Clades, and the MRP
Supertree Method
An interesting property of some supertree methods
and the supermatix approach is that novel clades can
emerge with the combination of diverse data sets. Such
novel clades either contradict all separate analyses of individual data sets or are not implied by any combination
of trees supported by the separate analyses. In supermatrix analysis, the basis for this hidden support is well
characterized (Barrett et al., 1991; Gatesy et al., 1999; Lee
and Huggal, 2003) and is consistent with the most basic
principles of cladistic parsimony analysis (Farris, 1983)
and likelihood calculations (Felsenstein, 1981). In MRP
supertree analysis, the phenomenon is considered an artifact of an illogical methodology (Goloboff and Pol, 2002;
Pisani and Wilkinson, 2002).
Bininda-Emonds and Sanderson (2001:574) suggested
that “MRP is necessarily an approximation of a total evidence solution, intended primarily for those situations
when the latter technique cannot be applied. . . . At least
in theory, MRP supertree construction might also be expected to be a reasonable approximation.” To us, an important issue is whether MRP, or any supertree method,
can evaluate secondary phylogenetic signals that are
hidden in separate data sets. If this is not the case, how
can an MRP supertree study approximate a supermatrix
352
SYSTEMATIC BIOLOGY
analysis where a large percentage of the character support is hidden?
Several recent supermatrix studies have shown that
significant character support can be hidden in different
data sets. For example, in an analysis of five data sets
for Bovidae (cattle, sheep, and antelope), ∼85% of the
character support was emergent (Gatesy and Arctander,
2000), and in an analysis of 17 data sets for Cetacea
(whales), ∼45% of the support was hidden in the separate character partitions (Gatesy et al., 1999). Likewise,
a recently published supermatrix analysis of three data
sets showed extensive hidden support for relationships
among the four major extant clades of Crocodylia. Approximately 70% of the character support was emergent,
and several novel clades were recovered in the combined
analysis (Gatesy et al., 2003).
Our supermatrix analysis of Crocodylia was topologically consistent with the previous, more limited analysis
of Gatesy et al. (2003) and resolved 11 novel clades (Fig. 3)
that were not implied by any combination of trees supported by the separate analyses of individual data sets.
For example, a grouping of Paratomistoma courtii closer
to Gavialis gangeticus than to Tomistoma schlegelii was favored in the supermatrix analysis, but this relationship
was contradicted by the separate analysis of the morphological partition and was not implied by any combination of topologies derived from the individual data sets
(Figs. 1, 2).
In contrast to the supermatrix analysis, MRP supertree
analysis of the same data strictly supported no emergent
groupings (Fig. 4). This result could be interpreted as
an indication that hidden signals in the primary character data were ignored and/or distorted. Examination
of optimal topologies for the supertree data set showed
that individual trees expressed many novel clades, but
these were not necessarily the emergent relationships
supported by supermatrix analysis. Some of the optimal
supertrees were significantly incongruent with topologies supported by the supermatrix, and the basal fossil
taxa were placed in a bewildering array of equally parsimonious alternative resolutions (Fig. 4). In this empirical
test case, MRP supertree analysis clearly did not distill
the same hidden signals that emerged in supermatrix
analysis (Fig. 3), and MRP was not a good approximation of total evidence.
Almost all published supertree analyses have utilized
the MRP method (e.g., Purvis, 1995; Bininda-Emonds
et al., 1999; Daubin et al., 2001; Liu et al., 2001; Jones et al.,
2002; Kennedy and Page, 2002; Salamin et al., 2002), but
the logical basis for this procedure is unclear. In parsimony and likelihood analyses of supermatrices, the fit of
each character to competing topologies can be calculated
precisely, and there is a clear connection between implied
evolutionary character transformations and the choice
among competing phylogenetic hypotheses (Farris, 1983;
de Queiroz and Poe, 2001). MRP and all other previously
described supertree methods lack these critical properties. In MRP analysis, detailed heterogeneous information encoded by the character data is lost (Pisani and
Wilkinson, 2002), and support that is hidden in different
VOL. 53
data sets by conflicting patterns of homoplasy is completely ignored or misinterpreted. Sometimes MRP analysis yields novel emergent clades that also are supported
by supermatrix analysis (Bryant, in press), sometimes
MRP mimics strict consensus results and conflicts with
supermatrix results (Pisani and Wilkinson, 2002), and in
other situations MRP analysis can produce novel systematic hypotheses that conflict with all data sets that compose the supertree data set and that are not supported
by supermatrix analysis of all data sets. Thus, MRP supertree analysis is an unpredictable systematic “black
box” (Rodrigo, 1993, 1996; Goloboff and Pol, 2002; Pisani
and Wilkinson, 2002; Bryant, in press). Wilkinson et al.
(2001:300) concluded that MRP supertrees “resolve conflicts without promoting any understanding. . . . In the
light of the known behavior of MRP, it remains to be
demonstrated that there is any convincing justification
for this approach to supertree construction.”
Bininda-Emonds and Sanderson (2001) suggested that
MRP is well grounded in basic graph and network theory, but MRP is not well grounded in phylogenetic theory. MRP contrasts with certain supertree methods, such
as gene tree analysis, that reconcile source tree incompatibility in terms of evolutionary events such as gene duplication, gene loss, and horizontal transfer (Maddison,
1997). Slowinski and Page (1999:818) argued that MRP “is
flawed because homoplasy in this context has no obvious
biological meaning.” Thus, even if there is no duplication
of character data, the resolution of incongruence among
different source trees in an MRP data set has not been
justified adequately (see Rodrigo, 1993, 1996; Slowinski
and Page, 1999; Wilkinson et al., 2001; Goloboff and Pol,
2002; Bryant, in press).
Given the lack of a clearly articulated evolutionary
framework for MRP supertree analysis, proponents of
this method have turned to computer simulations to support the general utility of the MRP method. These simulations have shown that under certain circumstances MRP
supertree analysis can approximate total evidence supermatrix results and produce accurate phylogenetic trees
(Bininda-Emonds and Sanderson, 2001). It is important
to recognize the specific assumptions of the simulations,
however, and to compare these assumptions with conditions in empirical supertree studies.
In the simulations, all component data sets had the
same number of characters, the rate of evolution for
each data set on each branch was identical, all characters were multistate nucleotides, and a single model
of evolution was utilized (Bininda-Emonds and Sanderson, 2001). None of these conditions were duplicated in
published MRP supertree data sets (e.g., Purvis, 1995;
Bininda-Emonds et al., 1999; Liu et al., 2001; Jones
et al., 2002; Kennedy and Page, 2002). More importantly, unlike published supertrees, the simulated data
of Bininda-Emonds and Sanderson (2001) did not include character redundancies, poor data, unnecessary
assumptions of monophyly, and other appeals to authority (see Springer and de Jong, 2001; Gatesy et al.,
2002; Gatesy and Springer, in press). Until these inconsistencies between theoretical and actual supertree data
2004
POINTS OF VIEW
sets are sorted out, by simulating informal taxonomies,
review papers, and extensive character duplications on
a computer, or by removing these problems from published MRP supertree data sets, the relevance of the simulations to empirical studies is questionable. Therefore,
we disagree with the conclusion of Bininda-Emonds and
Sanderson (2001:575), based on their simulation results,
that, “published supertrees should be judged and used
with about the same degree of confidence that a total
evidence tree might.” We see no logical relationship between most published supertrees and these simulations.
Supertrees as Summaries of Previous Work
Gatesy et al. (2002) noted that it would be difficult
to argue that a relatively crude supertree data set is a
better description of past systematic research than a detailed supermatrix. In contrast, Bininda-Emonds et al.
(2003:727–728) recently countered that
Gatesy et al. held supertrees to be “imprecise summaries of previous work.” It is unclear to us why this might be the case. We
contend that supertrees are extremely precise summaries of previous work. . . . Because the supertree approach works at the level of
the source trees, it can easily compare the phylogenetic stability of
a taxon or a species with the amount of research effort it has received. Thus unlike supermatrix approaches, supertrees can more
easily highlight taxa that have been well studied, but the relationships within which remain controversial.
The logical basis of phylogenetic robustness measures
in supertree analysis is unclear (see Purvis, 1995; Pisani
et al., 2002; Pisani and Wilkinson, 2002). Alternatively,
in a supermatrix context interpretation of stability, as
measured by a standard support index (e.g., decay index; Bremer, 1994), relative to research effort, as indicated
by the amount of character data for a taxon (see Fig. 3),
is extremely straightforward. If a supermatrix contains
many character data for certain taxa and a clade that contains those taxa has a low decay index, then that relationship might reasonably be called controversial. For supermatrices, interpretations of research effort and stability
are quite straightforward, and unlike supertrees the distribution of character support among data sets can be
assessed in this framework (for examples, see Baker and
DeSalle, 1997; Gatesy et al., 1999, 2003; Lee and Hugall,
2003).
A supermatrix contains all of the information in a supertree data set but without confusing, imprecise data
redundancies and without inclusion of dubious phylogenetic information. Our character-based MRP data set
for Crocodylia can be derived directly from our supermatrix of Crocodylia, because all of the information in
the supertree data set is contained in the supermatrix
data set. However, the reverse does not hold. Supertree
data sets do not encode as much information as do supermatrices, and supermatrices always will be more general/efficient summaries of primary systematic data, i.e.,
characters scored for taxa.
Trees based on supermatrix analysis are accountable
because the actual characters that support particular relationships are directly accessible to all researchers. In
many previous supertree analyses, phylogenetic results
353
are not accountable for three reasons. First, the primary
character data (hypotheses of homology), which are the
ultimate source data for phylogenetic trees, are not represented in supertree data sets. Second, many source
trees encoded into previously published supertree data
sets have no recorded empirical basis and are not accountable themselves. For example, taxonomies that lack
documented support from any data source have been
included in several published supertree matrices (e.g.,
Jones et al., 2002). Third, the MRP method, which has
been utilized in most published supertree analyses, does
not resolve conflicts among source trees in any justifiable
way, again making support for clades unaccountable (see
Rodrigo, 1993, 1996; Wilkinson et al., 2001; Goloboff and
Pol, 2002; Pisani and Wilkinson, 2002).
CONCLUSIONS
We reassessed several difficulties in previously published supertree analyses that were noted by Gatesy et al.
(2002) and contend that solutions to these problems, as
presented by Bininda-Emonds et al. (2003), are not valid
for various reasons.
Unaccountable data, such as taxonomies with no empirical basis, should not be considered acceptable phylogenetic data in past or future supertrees. Given the ease
with which character data can be collected, it is unnecessary to use taxonomies and other outdated information
in modern systematic analyses. If character data for the
taxa of interest do not yet exist, the most reasonable response from a systematist would be to collect new character data, not to make a supertree that includes the taxa
of interest.
Homologous characters are the ultimate source data
for both supermatrix and supertree studies. Unnecessary
duplications of these character data are not justified in
supertree analysis. The definition of independent source
trees according to conflicting criteria (i.e., the gene is the
independent unit of analysis, and any combination of
genes or data sets also is an independent unit) is illogical.
This framework (Bininda-Emonds et al., 2003, in press)
has no safeguards for reducing duplications of evidence
that are present in many published supertree studies.
Bininda-Emonds et al. (2003) correctly noted that systematic data collection protocols can eliminate duplication of
characters in supertree analysis, but the framework outlined by these authors did not achieve this goal.
In supermatrix analysis, the delimitation of characters and character states determines final systematic results. In supertree analysis, the delimitation of characters, character states, and data sets determines final
systematic results (see Fig. 4). Thus, the definition of
data sets (pseudoindependent phylogenetic hypotheses
of Bininda-Emonds et al., 2003) is critical to the supertree approach. This is not the case for the supermatrix approach and makes the supertree approach more
assumption laden relative to the supermatrix approach.
Genes are linked in linear, continuous arrays (chromosomes); specifying individual genes as independent
units of analysis in systematics is thus unjustified and
354
SYSTEMATIC BIOLOGY
at the minimum controversial (see Miyamoto and Fitch,
1995; Kluge, 1997; Maddison, 1997; Siddall, 1997).
MRP, by far the most popular method for building
large supertrees, ignores/misinterprets hidden character support in different data sets and resolves conflicts among source trees in erratic and unjustified
ways (Wilkinson et al., 2001). All methods of supertree
construction that have been proposed thus far ignore/misinterpret secondary phylogenetic signals in different data sets.
Unrealistic simulations have shown that MRP supertree analyses can mimic total evidence analyses
(Bininda-Emonds and Sanderson, 2001). However, the
conditions of published simulations do not match the
conditions of most published MRP supertree studies
(e.g., Purvis, 1995; Bininda-Emonds et al., 1999, Liu et al.,
2001; Jones et al., 2002; Kennedy and Page, 2002; Pisani
et al., 2002; Salamin et al., 2002). Duplications of data,
informal taxonomies, and reviews were not modeled,
therefore the simulations have few implications for assessing the accuracy of most published supertrees.
Published mammalian supertrees (Purvis, 1995;
Bininda-Emonds et al., 1999, Liu et al., 2001; Jones et al.,
2002) are not efficient summaries of previous systematic work. For a comparable sample of primary character
data, a supermatrix contains all of the information in a
supertree data set and much more. Many previously published supertrees are both imprecise, i.e., not minutely
exact, and unaccountable and should not be the basis for
critical tests of evolutionary hypotheses.
ACKNOWLEDGMENTS
NSF grants to J.G. (EAR-0228629, DEB-9985847, DEB-0213171, and
DEB-0212572) provided funding for this work, and R.B. was supported
by an NIH National Research Service Award #1F32GM67463-01. O.
Bininda-Emonds, R. Page, D. Pol, C. Simon, and M. Steel offered helpful
comments. We are especially grateful to C. Brochu, L. Densmore, J.
Harshman, S. Poe, and P. White for rapidly sending us data matrices
by e-mail. G. Amato, P. Brazaitis, Miami Metro Zoo, and the Wildlife
Conservation Society provided DNA and tissue samples. We also thank
M. Lee for suggesting that the BDNF and ATP7A primers from Murphy
et al. (2001) might be useful for resolving crocodylian relationships.
R EFERENCES
Baker, R., and R. DeSalle. 1997. Multiple sources of character information and the phylogeny of Hawaiian drosophilids. Syst. Biol. 46:654–
673.
Barrett, M., M. Donoghue, and E. Sober. 1991. Against consensus. Syst.
Zool. 40:486–493.
Baum, B. 1992. Combining trees as a way of combining data sets for
phylogenetic inference, and the desirability of combining gene trees.
Taxon 41:3–10.
Bininda-Emonds, O., and H. Bryant. 1998. Properties of matrix representation with parsimony analyses. Syst. Biol. 47:497–508.
Bininda-Emonds, O., J. Gittleman, and A. Purvis. 1999. Building large
trees by combining phylogenetic information: A complete phylogeny
of the extant Carnivora (Mammalia). Biol. Rev. 74:143–175.
Bininda-Emonds, O., J. Gittleman, and M. Steel. 2002. The (super)tree
of life: Procedures, problems, and prospects. Annu. Rev. Ecol. Syst.
33:265–289.
Bininda-Emonds, O., K. Jones, S. Price, R. Grenyer, M. Cardillo, M.
Habib, A. Purvis, and J. Gittleman. 2003. Supertrees are a necessary
not-so-evil: A comment on Gatesy et al. Syst. Biol. 52:724–729.
Bininda-Emonds, O., K. Jones, S. Price, R. Grenyer, M. Cardillo, and
A. Purvis. In press. Garbage in, garbage out: Some practical issues
VOL.
53
in supertree construction. In Phylogenetic supertrees: Combining information to reveal the tree of life (O. Bininda-Emonds, ed.). Kluwer
Academic, Dordrecht, The Netherlands.
Bininda-Emonds, O., and M. Sanderson. 2001. Assessment of the accuracy of matrix representation with parsimony analysis supertree
construction. Syst. Biol. 50:565–579.
Bremer, K. 1994. Branch support and tree stability. Cladistics 10:295–
304.
Brochu, C. 1997. Morphology, fossils, divergence timing, and the phylogenetic relationships of Gavialis. Syst. Biol. 46:479–522.
Brochu, C. 2002. Thecachampsoides minor and early gavialoid history:
Coastal Atlantic origins of longirostrine crocodylians. J. Vertebr. Paleontol. 22:39A.
Brochu, C. 2003. Phylogenetic approaches toward crocodylian history.
Annu. Rev. Earth Planet. Sci. 31:357–397.
Brochu, C., and L. Densmore. 2001. Crocodile phylogenetics: A summary of current progress. Pages 3–8 in Crocodilian biology and evolution (G. Grigg, F. Seebacher, and C. Franklin, eds.). Surrey Beatty
and Sons, Chipping Norton, New South Wales.
Brochu, C., and P. Gingerich. 2000. New tomistomine crocodylian from
the middle Eocene (Bartonian) of Wadi Hitan, Fayum Province,
Egypt. Contrib. Mus. Paleontol. Univ. Mich. 30:251–268.
Brooks, D., and D. McLennan. 1993. Parascript: Parasites and the language of evolution. Smithsonian Institution Press, Washington, D.C.
Bryant, H. In press. The cladistics of matrix representation with parsimony analysis. In Phylogenetic supertrees: Combining information
to reveal the tree of life (O. Bininda-Emonds, ed.). Kluwer Academic,
Dordrecht, The Netherlands.
Buscalioni, A., F. Ortega, D. Weishampel, and C. Jianu. 2001. A revision
of the crocodyliform Allodaposuchus precedens from the Upper Cretaceous of the Hateg Basin, Romania. Its relevance in the phylogeny
of Eusuchia. J. Vertebr. Paleontol. 21:74–86.
Caccone, A., J. Gibbs, V. Ketmaier, E. Suatoni, and J. Powell. 1999. Origin
and evolutionary relationships of giant Galapagos tortoises. Proc.
Natl. Acad. Sci. USA 96:13223–13228.
Campbell, H. 1972. Ecological or phylogenetic interpretations of
crocodilian nesting habits. Nature 238:404–405.
Cohen, M., and C. Gans. 1970. The chromosomes of the order
Crocodilia. Cytogenetics 9:81–105.
Daubin, V., M. Gouy, and G. Perriere. 2001. Bacterial molecular phylogeny using supertree approach. Genome Inf. 12:155–164.
Densmore, L. 1983. Biochemical and immunological systematics of the
order Crocodilia. Pages 397–465 in Evolutionary biology, Volume 16
(M. K. Hecht, B. Wallace, and G. T. Prance, eds.). Plenum Press, New
York.
Densmore, L., and P. White. 1991. The systematics and evolution of
the Crocodilia as suggested by restriction endonuclease analysis of
mitochondrial and nuclear ribosomal DNA. Copeia 1991:602–615.
De Queiroz, K., and S. Poc. 2001. Philosophy and phylogenetic inference: A comparison of likelihood and parsimony methods in the
context of Karl Popper’s writings on corroboratio Syst. Biol. 50:305–
321.
Desjardins, P., and R. Morais. 1990. Sequence and gene organization
of the chicken mitochondrial genome: A novel gene order in higher
vertebrates. J. Mol. Biol. 212:599–635.
Farris, J. 1983. The logical basis of phylogenetic analysis. Pages 7–36
in Advances in cladistics, Volume 2 (N. Platnick and V. Funk, eds.).
Columbia Univ. Press, New York.
Felsenstein, J. 1981. Evolutionary trees from DNA sequences: A maximum likelihood approach. J. Mol. Evol. 17:368–376.
Gatesy, J., and G. Amato. 1992. Sequence similarity of 12S ribosomal
segment of mitochondrial DNAs of gharial and false gharial. Copeia
1992:241–244.
Gatesy, J., G. Amato, M. Norell, R. DeSalle, and C. Hayashi. 2003. Combined support for wholesale taxic atavism in gavialine crocodylians.
Syst. Biol. 52:403–422.
Gatesy, J., and P. Arctander. 2000. Hidden morphological support for
the phylogenetic placement of Pseudoryx nghetinhensis with bovine
bovids: A combined analysis of gross anatomical evidence and DNA
sequences from five genes. Syst. Biol. 49:515–538.
Gatesy, J., R. DeSalle, and W. Wheeler. 1993. Alignment-ambiguous nucleotide sites and the exclusion of systematic data. Mol. Phylogenet.
Evol. 2:152–157.
2004
POINTS OF VIEW
Gatesy, J., C. Hayashi, R. DeSalle, and E. Vrba. 1994. Rate limits for
mispairing and compensatory change: The mitochondrial ribosomal
DNA of antelopes. Evolution 48:188–196.
Gatesy, J., C. Matthee, R. DeSalle, and C. Hayashi. 2002. Resolution of
a supertree/supermatrix paradox. Syst. Biol. 51:652–664.
Gatesy, J., P. O’Grady, and R. Baker. 1999. Corroboration among data
sets in simultaneous analysis: Hidden support for phylogenetic relationships among higher level artiodactyl taxa. Cladistics 15:271–
313.
Gatesy, J., and M. Springer. In press. A critique of matrix representation with parsimony supertrees. In Phylogenetic supertrees: Combining information to reveal the tree of life (O. Bininda-Emonds, ed.).
Kluwer Academic, Dordrecht, The Netherlands.
Goloboff, P., and D. Pol. 2002. Semi-strict supertrees. Cladistics 18:514–
525.
Gladstein, D., and W. Wheeler. 1997. POY: Phylogeny reconstruction
via direct optimization of DNA data, version 2.0 American Museum
of Natural History, New York.
Greer, A. 1970. Evolutionary and systematic significance of crocodilian
nesting habits. Nature 227:523–524.
Groth, J., and G. Barrowclough. 1999. Basal divergences in birds and
the phylogenetic utility of the nuclear RAG-1 gene. Mol. Phylogenet.
Evol. 12:115–123.
Harlid, A., A. Janke, and U. Arnason. 1997. The mtDNA sequence of the
ostrich and the divergence between paleognathous and neognathous
birds. Mol. Biol. Evol. 14:754–761.
Harshman, J., C. Huddleston, J. Bollback, T. Parsons, and M. Braun.
2003. True and false gharials: A nuclear gene phylogeny of
Crocodylia. Syst. Biol. 52:386–402.
Janke, A., and U. Arnason. 1997. The complete mitochondrial genome
of Alligator mississippiensis and the separation between recent Archosauria (birds and crocodiles). Mol. Biol. Evol. 14:1266–1272.
Janke, A., D. Erpenbeck, M. Nilsson, and U. Arnason. 2001. The mitochondrial genomes of the iguana (Iguana iguana) and the caiman
(Caiman crocodylus): Implications for amniote phylogeny. Proc. R. Soc.
Lond. B 268:623–631.
Jones, K., A. Purvis, A. MacLarnon, O. Bininda-Emonds, and N.
Simmons. 2002. A phylogenetic supertree of the bats (Mammalia:
Chiroptera). Biol. Rev. 77:223–259.
Kennedy, M., and R. D. M. Page. 2002. Seabird supertrees: Combining
partial estimates of procellariiform phylogeny. Auk 119:88–108.
Kluge, A. 1989. A concern for evidence and a phylogenetic hypothesis
of relationships among Epicrates (Boidae, Serpentes). Syst. Zool. 38:7–
25.
Kluge, A. 1997. Testability and the refutation and corroboration of
cladistics hypotheses. Cladistics 13:81–96.
Kraus, F., L. Jarecki, M. Miyamoto, S. Tanhauser, and P. Laipis. 1992.
Mispairing and compensational changes during the evolution of mitochondrial ribosomal DNA. Mol. Biol. Evol. 9:770–774.
Lee, M., and A. Hugall. 2003. Partitioned likelihood support and the
evaluation of data set conflict. Syst. Biol. 52:15–22.
Liu, F., M. Miyamoto, N. Freire, P. Ong, M. Tennant, T. Young, and K.
Gugel. 2001. Molecular and morphological supertrees for eutherian
(placental) mammals. Science 291:1786–1789.
Maddison, W. 1997. Gene trees in species trees. Syst. Biol. 46:523–536.
Marks, J., C. Schmid, and V. Sarich. 1988. DNA hybridization as a guide
to phylogeny: Relations of the Hominoidea. J. Hum. Evol. 17:769–
786.
Matthee, C., J. Burzlaff, J. Taylor, and S. Davis. 2001. Mining the mammalian genome for artiodactyl systematics. Syst. Biol. 50:367–390.
Mindell, D., M. Sorenson, D. Dimcheff, M. Hasegawa, J. Ast, and T. Yuri.
1999. Interordinal relationships of birds and other reptiles based on
whole mitochondrial genomes. Syst. Biol. 48:138–152.
Miyamoto, M., and W. Fitch. 1995. Testing species phylogenies and
phylogenetic methods with congruence. Syst. Biol. 44:64–76.
Murphy, W., E. Eizrik, W. Johnson, Y. Zhang, O. Ryder, and S. O’Brien.
2001. Molecular phylogenetics and the origins of placental mammals.
Nature 409:614–618.
Norell, M. 1989. The higher level relationships of the extant Crocodylia.
J. Herpetol. 23:325–335.
355
O’Keefe, F., and P. Wagner. 2001. Inferring and testing hypotheses
of cladistic character dependence by using character compatibility.
Syst. Biol. 50:657–675.
Patterson, C. 1982. Morphological characters and homology. Pages 21–
74 in Problems of phylogenetic reconstruction (A. Joysey and A. Friday, eds.). Academic Press, London.
Pisani, D., and M. Wilkinson. 2002. Matrix representation with parsimony, taxonomic congruence, and total evidence. Syst. Biol. 51:151–
155.
Pisani, D., A. Yates, M. Langer, and M. Benton. 2002. A genus-level
supertree of the Dinosauria. Proc. R. Soc. Lond. B 269:915–921.
Poe, S. 1996. Data set incongruence and the phylogeny of crocodilians.
Syst. Biol. 45:393–414.
Purvis, A. 1995. A composite estimate of primate phylogeny. Philos.
Trans. R. Soc. Lond. B 348:405–421.
Ragan, M. 1992. Phylogenetic inference based on matrix representation
of trees. Mol. Phylogenet. Evol. 1:53–58.
Ray, D., and L. Densmore. 2002. The crocodilian mitochondrial control
region: General structure, conserved sequences, and evolutionary
implications. J. Exp. Zool. 294:334–345.
Rodrigo, A. 1993. A comment on Baum’s method for combining phylogenetic trees. Taxon 42:631–636.
Rodrigo, A. 1996. On combining cladograms. Taxon 45:267–274.
Roman, J., and B. W. Bowen. 2000. The mock turtle syndrome: Genetic
identification of turtle meat purchased in the south-eastern United
States of America. Anim. Conserv. 3:61–65.
Salamin, N., T. Hodkinson, and V. Savolainen. 2002. Building supertrees: An empirical assessment using the grass family (Poaceae).
Syst. Biol. 51:136–150.
Sanderson, M., A. Purvis, and C. Henze. 1998. Phylogenetic supertrees:
Assembling the trees of life. Trends Ecol. Evol. 13:105–109.
Siddall, M. 1997. Prior agreement: Arbitration or arbitrary. Syst. Biol.
46:766–770.
Slowinski, J. B., and R. D. M. Page. 1999. How should species phylogenies be inferred from sequence data. Syst. Biol. 48:814–825.
Soltis, P., D. Soltis, and M. Chase. 1999. Angiosperm phylogeny inferred from mutiple genes as a tool for comparative biology. Nature
402:402–404.
Springer, M., and W. DeJong. 2001. Which mammalian supertree to
bark up? Science 291:1709–1711.
Swofford, D. 1998. PAUP*: Phylogenetic analysis using parsimony (∗ and other methods), version 4.0b8. Sinauer, Sunderland,
Massachusetts.
Swofford, D., and G. Olsen. 1990. Phylogeny reconstruction. Pages 411–
501 in Molecular systematics, 1st edition (D. Hillis and C. Moritz,
eds.). Sinauer, Sunderland, Massachusetts.
Templeton, A. 1983. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution
of humans and the apes. Evolution 37:221–244.
White, P., and L. Densmore. 2001. DNA sequence alignments and
data analysis methods: Their effect on the recovery of crocodylian
relationships. Pages 29–37 in Crocodilian biology and evolution
(G. Grigg, F. Seebacher, and C. Franklin). Surrey Beatty and Sons,
Chipping Norton, New South Wales.
Wilkinson, M., J. L. Thorley, D. T. J. Littlewood, and R. A. Bray. 2001.
Towards a phylogenetic supertree of Platyhelminthes? Pages 292–
301 in Interrelationships of the Platyhelminthes (D. T. J. Littlewood
and R. A. Bray, eds.). Taylor and Francis, London.
Wollenberg, K., and W. Atchley. 2000. Separation of phylogenetic and
functional associations in biological sequences by using the parametric bootstrap. Proc. Natl. Acad. Sci. USA 97:3288–3291.
Wu, X., D. Brinkman, and A. Russell. 1996. A new alligator from the Upper Cretaceous of Canada and the relationships of early eusuchians.
Palaeontology 39:351–375.
Yang, Z. 1997. PAML: A program package for phylogenetic analysis by
maximum likelihood. Comput. Appl. Biosci. 13:555–556.
First submitted 30 March 2003; reviews returned 12 June 2003;
final acceptance 20 November 2003
Associate Editor: Mike Steel
356
SYSTEMATIC BIOLOGY
VOL. 53
Syst. Biol. 53(2):356–359, 2004
c Society of Systematic Biologists
Copyright ISSN: 1063-5157 print / 1076-836X online
DOI: 10.1080/10635150490440396
Trees Versus Characters and the Supertree/Supermatrix “Paradox”
O LAF R. P. B ININDA-EMONDS
Lehrstuhl für Tierzucht, Technical University of Munich, Alte Akademie 12, 85354 Freising-Weihenstephan, Germany;
E-mail: [email protected]
In a pair of recent articles, Gatesy and colleagues
(Gatesy et al., 2002, 2004; also Gatesy and Springer, 2004)
have strongly criticized several recent supertree studies.
In so doing, they have pointed out important, but correctable, shortcomings in how the supertree approach
was applied in specific instances, and have helped to
fine-tune the methodology of this comparatively young
field.
However, their equally strong critiques of the MRP
method, if not the supertree approach as a whole,
derive from a faulty basis for comparison. Gatesy
et al. (2004:XXX) state (correctly) that primary character data are “the ultimate source data for both supertree and supermatrix analyses” (also p. XXX), and
use this statement to justify comparing both approaches
on this level. However, because any connection between the primary character data and the supertree
analysis is highly indirect—a feature of supertree construction that they also criticize—it is invalid to judge
supertrees according to criteria designed for characterbased phylogenetic reconstruction. Instead, the (MRP)
supertree approach should be judged with respect to
the data that it uses directly, namely the phylogenetic
hypotheses presented in the source trees. As I hope to
show, recognizing that supertree and supermatrix analyses operate at different levels blunts most of Gatesy
et al.’s criticisms of the supertree approach, thereby
resolving the “paradox” they mention in their earlier
paper.
S OURCE TREE COLLECTION AND D ATA D UPLICATION
As part of our efforts to construct a supertree for all extant species of mammal, we drew up a list of guidelines to
help us decide which source trees were suitable for inclusion (summarized in Bininda-Emonds et al., 2003, 2004).
These guidelines were based on the same two major
issues raised independently by Gatesy and colleagues:
data duplication and source tree quality.
As noted by Gatesy et al. (2004), our rules still allow
for the duplication of the primary character data among
source trees. However, we do not hold this to necessarily
be problematic. Duplication can occur at this level and
still result in independent phylogenetic hypotheses because a phylogenetic tree is composed of more than the
data going into it (Bininda-Emonds et al., 2003, 2004). All
assumptions made in the analysis (e.g., the alignment,
any weighting schemes, the model of evolution used)
as well as the form of the analysis itself (i.e., the opti-
mization criterion used) can impact on the resultant phylogeny. We raised the example previously where different assumptions of rooting for virtually the same data set
gave very different hypotheses about the phylogenetic
relationships among cetaceans (see Bininda-Emonds et
al., 2003). Another cogent example of the effect any auxiliary assumptions can have on our phylogenetic hypotheses is the detailed study of Maddison et al. (1999) on the
phylogeny of carabid beetles, where different manipulations of the same base data set produced very different
trees. Even the large molecular supermatrix of Madsen
et al. (2001) yielded a different set of relationships when
reanalyzed under a different set of assumptions by Malia
et al. (2003).
In short, our guidelines specified a level of primary
data duplication that we held still resulted in reasonably
independent phylogenetic hypotheses. Others will undoubtedly disagree, including Gatesy et al., for whom
all primary data duplication is problematic. In the end,
what is important is for the researcher to assess data independence in the supertree analysis at the appropriate
level, and this is at the level of the source tree and not
the primary character data. Moreover, as we stressed, the
rules were not designed to be applied literally and inflexibly, but to be interpreted according to the data at hand
and the specific question being asked (Bininda-Emonds
et al., 2004:277). This is in line with conventional phylogenetic analyses, where hard-and-fast rules with respect
to which data to include, how to process them (e.g., aligning molecular data or scoring morphological data), and
how to weight or analyze them are extremely rare.
T HE T HEORETICAL B ASIS OF MRP
S UPERTREE CONSTRUCTION
Gatesy et al. (2004; also Gatesy and Springer, 2004)
argued that MRP lacks a logical basis and, as such, constitutes a systematic “black box” that is inappropriate
for phylogeny reconstruction. In part, their perception
of the lack of a logical basis to MRP derives from their
attempts to judge it according to inappropriate criteria.
However, they also reiterate previous criticisms (e.g.,
Rodrigo, 1993, 1996; Slowinski and Page, 1999) that
the use of parsimony as an optimization criterion in
MRP is unfounded because any “homoplasy” on a supertree cannot be interpreted in a biologically meaningful way (i.e., as instances of convergence, parallelism, or
reversal).
2004
POINTS OF VIEW
However, incongruence in a supertree analysis is simply that, and there is no reason to equate it with homoplasy. In its purest form, the principle of parsimony
makes no statements regarding either homoplasy or incongruence having to be biologically interpretable. It
merely asserts that the preferred hypothesis is the one
that minimizes the number of ad hoc assumptions (i.e., the
simplest possible solution, loosely speaking). As such,
the use of parsimony in MRP has the same logical basis as that for analyzing character data, namely to find
the solution with the minimum amount of incongruence
(as measured by the objective function of a parsimony
analysis) to the data being analyzed. Homoplasy is instead a post hoc explanation that biologists use to explain incongruence in character data, the same as when
specific instances of incongruence are held to represent
faulty hypotheses of homology on the part of the investigator. Because (MRP) supertree analysis does not
analyze character data, there is no need to invoke the
idea of homoplasy, nor require incongruence to have a
biological meaning (although it can in supertree methods such as gene-tree parsimony; Slowinski and Page,
1999).
Gatesy et al. (2004) noted that MRP supertrees at times
variously resemble or contradict the results of either supermatrix or taxonomic congruence analyses, and use
this “inconsistent” behavior as evidence for the blackbox nature of MRP. The flaw in the argument is seen
easily: one could use it to show that parsimony is also
a black box because it produces results that are sometimes closer to phenetic methods like NJ and sometimes
to probabilistic methods like ML or Bayesian analysis.
The reality is that different methods will converge on the
same answer at different times because of the nature of
the data being analyzed and not because of any black-box
qualities to the method.
Nor does the fact that most conventional characterbased support measures (e.g., bootstrap frequencies or
Bremer support) are invalid when applied to MRP supertrees invalidate the entire approach or cast its logical basis into doubt (as implied by Gatesy et al., 2004).
Instead, it merely argues that appropriate supertreespecific support measures be developed that operate
at the level of trees and not characters. Several such
measures already exist: triplet- and quartet-fit similarity
measures (Page, 2002; Piaggio-Talice et al., 2004), or the
QS index (Bininda-Emonds, 2003).
HIDDEN S UPPORT
The inability of all supertree methods to account fully
for hidden support in the character data is an accepted
limitation, but a necessary tradeoff, of the combining of
tree topologies in a supertree approach. As such, the
validity of any novel clades in a supertree analysis is
open to question (Pisani and Wilkinson, 2002; Gatesy
et al., 2004). Fortunately, however, such clades appear
to be exceptionally rare, at least for MRP supertrees.
Simulation results indicate that novel clades occurred
predominantly, but still at a frequency of <0.2%, when
357
just two source trees were combined (Bininda-Emonds,
2003). This merely indicates that phylogenies should not
be constructed from limited data, be they source trees or
character data. Most supertrees have been constructed
from many more source trees than under these limiting
conditions, and no novel clades have been reported for
any of the major supertree studies (see Bininda-Emonds,
2003), including that of Gatesy et al. (2004).
However, I would argue against the assertion of
Gatesy et al. (2004) that novel clades in a supermatrix
analysis are always justified because they derive from
hidden support in character data. Consider the example
they cited with approval regarding the novel clade associated with Paratomistoma courtii. In the supermatrix
tree, this species clusters as the sister group to the remaining species of Gavialinae. However, this position
conflicts with its placement deep within Tomistominae
from the analysis of the morphological data set, the only
one to specify the position of this fossil species. Although
a subsignal within this data set does cluster P. courtii
within gavialines (J. Gatesy, pers. comm.), it is unclear
why hidden support is preventing this species in the
supermatrix tree from remaining as sister to the clade
comprising Gavialosuchus eggenbergensisi, Tomistoma lusitanica, and Tomistoma schlegeli, a clade present in both the
morphological and supermatrix trees. Instead, the novel
placement of P. courtii appears to be an artifact of missing data arising from this case of “taxon sampling.” The
lack of other, largely molecular, data for P. courtii means
that its final position is determined largely by being optimized on the scaffold imposed on it by the much more
numerous remaining data, where 13 of the 17 data sets
favor a sister group relationship between gavialines and
tomistomines (in contrast to the morphological data set).
In a sense, P. courtii is being “left behind” while the extant
species with more data sort themselves out. The end result is that its final position does not reflect the only data
available for it. Such artifacts need not be limited to fossil
species either, but to any poorly sampled species. Given
the patchy distribution of molecular data (see Sanderson
et al., 2003), it is likely that some novel clades arising because of hidden support in purely molecular data sets
might be equally suspect, and should not be accepted
uncritically.
D ATA Q UALITY
Gatesy et al. (2002, 2004) counter that the greater degree of taxonomic completeness that supertrees make
possible comes at the cost of having to include what they
hold to be source trees of poor quality (taxonomies in
particular), whether as a result of “dubious data” or invalid analytical techniques. They therefore question the
utility of several supertree studies as a framework under
which to study evolutionary phenomena. The potential
corollaries of this criticism to supertree construction are
twofold: 1) that trees derived from poor data or analyses are necessarily inaccurate, and 2) that any inaccuracy
is detrimental to the resulting supertree. However, evidence suggests otherwise in both cases.
358
SYSTEMATIC BIOLOGY
The history of phylogenetic analysis is arguably one of
broadly congruent results rather than widespread disagreement. Specific exceptions abound, of course, but
I would suggest that the relative amount and degree
of conflict has been overplayed. For example, I have
shown elsewhere that estimates of phylogeny within the
mammalian order Carnivora are indistinguishable statistically for the most part (Bininda-Emonds, 2000). This
included phylogenies derived from good and poor data
or analyses (including taxonomies and other “data-free”
phylogenies). Naturally, this finding applies only to the
Carnivora. However, it implies that source trees derived
from “dubious” data or techniques should not be excluded automatically, but subjected to the same process
of data assessment that Gatesy et al. (2002, 2004) advocate, and which is possible in a supertree framework
(contrary to their claims).
An implicit assumption in phylogenetic analysis is
that phylogenetic signal is coherent and will outweigh
any non-phylogenetic “signals,” which are random or at
least less coherent. This is, in fact, the principle underlying signal enhancement and hidden support. However,
it also explains why the inclusion of poor source trees
need not be detrimental to a supertree analysis: any inaccurate information from such source trees should be
outweighed by the coherent phylogenetic signal from the
remaining source trees. Indirect support for this derives
from the observation that heavily downweighting poor
source trees had little appreciable effect in most supertree
studies (e.g., Purvis, 1995; Bininda-Emonds et al., 1999;
Jones et al., 2002; Stoner et al., 2003). Altogether, differential weighting schemes and other sensitivity analyses
would seem to therefore be reasonable counterstrategies
to ascertain any effects owing to the inclusion of poor
data.
On a more practical note, Gatesy et al. (2004) argue
that the lack of good data for a species should be taken
as a sign to collect some. I agree wholeheartedly. But,
what do we do in the meantime, especially in the face of
the looming biodiversity crisis? In view of the increasing importance of phylogenetic studies to conservation
biology (Purvis et al., in press), the stand of Gatesy et al.
that poorly known species be excluded from phylogenetic (super)trees and the comparative analyses based
on them has serious consequences. Most methods that
can identify threatened, species-poor clades, and therefore possible factors correlating with this threat, rely on
complete taxon sampling (see Gittleman et al., 2004). Yet,
threatened species are precisely those for which good
data are often lacking (see McKinney, 1999; Mace et al.,
2003; Gittleman et al., 2004), and this will likely be true for
some time to come, especially for less charismatic organisms (e.g., most invertebrate groups). Surely it is better
for conservation biologists to draw inferences now, and
ones that are based on phylogenies that only might be inaccurate as a result of using poor data, rather than to wait
and start conservation efforts after enough good phylogenetic data has been amassed (when it might be too
late). Gatesy et al. (2004) rightly note that their complete
VOL.
53
phylogeny of Crocodylia will have important implications for the conservation of this group, but complete
phylogenies such as this, even for other equally small
groups, remain very much the exception.
T HE FUTURE: G LOBAL CONGRUENCE
AND D IVIDE-AND -CONQUER
The recognition that the supertree and supermatrix
approaches analyze different data using different assumptions and methods has an important consequence.
Contrary to Gatesy et al. (2004:XXX), these approaches
should be seen as being complementary, and not competing, strategies for phylogenetic reconstruction in a
manner akin to the global congruence approach (sensu
Lapointe et al., 1999). Where these different approaches
both support the same set of relationships for a comparable set of studies, we can have increased confidence in
the reality of those relationships. By contrast, relationships upon which the approaches disagree, especially
poorly supported relationships, should be examined
more closely for possible causes of this conflict (e.g., data
used or assumptions made in either approach, or true
conflict), and targeted for additional data collection and
analysis.
Gatesy and colleagues have done much to improve
the phylogenetic database by generating and collating a
tremendous amount of character data on two very different taxonomic groups. However, they have also now
produced two sets of twin supertree-supermatrix analyses that could be profitably compared to help elucidate
the phylogenetic relationships of the respective groups,
especially outstanding areas of uncertainty. Together, the
global solution provided by the supertree and supermatrix approaches is stronger than the solution from either
analysis alone.
Even so, the full promise of this complementarity
has yet to be realized. As part of a divide-and-conquer
strategy—whereby a large phylogenetic problem is broken down into many smaller, computationally easier
ones, the results of which are later combined—supertree
construction could play a vital role in the analysis of very
large supermatrices. Preliminary results indicate this to
be the case (Sanderson et al., 2003; Roshan et al., 2004).
Therefore, the complementary nature of the supertree
and supermatrix approaches will become increasingly
important as we tackle ever-larger portions of the Tree of
Life for analysis.
ACKNOWLEDGMENTS
I thank John Gatesy, John Gittleman, Kate Jones, Andy Purvis, and
David Williams for their helpful comments, and the German research
program BMBF, through the “Bioinformatics for the Functional Analysis of Mammalian Genomes” (BFAM) project, for financial support.
R EFERENCES
Bininda-Emonds, O. R. P. 2000. Factors influencing phylogenetic inference: A case study using the mammalian carnivores. Mol. Phylogenet. Evol. 16:113–126.
2004
POINTS OF VIEW
Bininda-Emonds, O. R. P. 2003. Novel versus unsupported clades: Assessing the qualitative support for clades in MRP supertrees. Syst.
Biol. 52:839–848.
Bininda-Emonds, O. R. P., J. L. Gittleman, and A. Purvis. 1999. Building
large trees by combining phylogenetic information: A complete phylogeny of the extant Carnivora (Mammalia). Biol. Rev. 74:143–175.
Bininda-Emonds, O. R. P., K. E. Jones, S. A. Price, M. Cardillo, R.
Grenyer, and A. Purvis. 2004. Garbage in, garbage out: Data issues
in supertree construction. Pages 267–280 in Phylogenetic supertrees:
Combining information to reveal the Tree of Life (O. R. P. BinindaEmonds, ed.). Kluwer Academic, Dordrecht, the Netherlands.
Bininda-Emonds, O. R. P., K. E. Jones, S. A. Price, R. Grenyer, M.
Cardillo, M. Habib, A. Purvis, and J. L. Gittleman. 2003. Supertrees
are a necessary not-so-evil: A comment on Gatesy et al. Syst. Biol.
52:724–729.
Gatesy, J., R. H. Baker, and C. Hayashi. 2004. Inconsistencies in arguments for the supertree approach: Supermatrices versus supertrees
of Crocodylia. Syst. Biol. 53:XXX–XXX.
Gatesy, J., C. Matthee, R. DeSalle, and C. Hayashi. 2002. Resolution of
a supertree/supermatrix paradox. Syst. Biol. 51:652–664.
Gatesy, J., and M. S. Springer. 2004. A critique of matrix representation with parsimony supertrees. Pages 369–388 in Phylogenetic supertrees: Combining information to reveal the Tree of Life
(O. R. P. Bininda-Emonds, ed.). Kluwer Academic, Dordrecht, the
Netherlands.
Gittleman, J. L., K. E. Jones, and S. A. Price. 2004. Supertrees: Using complete phylogenies in comparative biology. Pages 439–460 in Phylogenetic supertrees: Combining information to reveal the Tree of Life
(O. R. P. Bininda-Emonds, ed.). Kluwer Academic, Dordrecht, the
Netherlands.
Jones, K. E., A. Purvis, A. MacLarnon, O. R. P. Bininda-Emonds, and
N. B. Simmons. 2002. A phylogenetic supertree of the bats
(Mammalia: Chiroptera). Biol. Rev. 77:223–259.
Lapointe, F.-J., J. A. W. Kirsch, and J. M. Hutcheon. 1999. Total evidence, consensus, and bat phylogeny: A distance based approach.
Mol. Phylogenet. Evol. 11:55–66.
Mace, G. M., J. L. Gittleman, and A. Purvis. 2003. Preserving the Tree
of Life. Science 300:1707–1709.
Maddison, D. R., M. D. Baker, and K. A. Ober. 1999. Phylogeny of
carabid beetles as inferred from 18S ribosomal DNA (Coleoptera:
Carabidae). Syst. Entomol. 24:103–138.
Madsen, O., M. Scally, C. J. Douady, D. J. Kao, R. W. DeBry, R. Adkins,
H. M. Amrine, M. J. Stanhope, W. W. de Jong, and M. S. Springer.
359
2001. Parallel adaptive radiations in two major clades of placental
mammals. Nature 409:610–614.
Malia, M. J., Jr., D. L. Lipscomb, and M. W. Allard. 2003. The misleading
effects of composite taxa in supermatrices. Mol. Phylogenet. Evol.
27:522–527.
McKinney, M. L. 1999. High rates of extinction and threat in poorly
studied taxa. Conserv. Biol. 13:1273–1281.
Page, R. D. M. 2002. Modified mincut supertrees. Pages 537–552 in Algorithms in Bioinformatics, Second International Workshop, WABI,
2002, Rome, Italy, September 17–21, 2002, Proceedings (R. Guigó and
D. Gusfield, eds.). Springer, Berlin.
Piaggio-Talice, R., J. G. Burleigh, and O. Eulenstein. 2004. Quartet supertrees. Pages 173–191 in Phylogenetic supertrees: Combining information to reveal the Tree of Life (O. R. P. Bininda-Emonds, ed.).
Kluwer Academic, Dordrecht, the Netherlands.
Pisani, D., and M. Wilkinson. 2002. Matrix representation with parsimony, taxonomic congruence, and total evidence. Syst. Biol. 51:151–
155.
Purvis, A. 1995. A composite estimate of primate phylogeny. Philos.
Trans. R. Soc. Lond. B 348:405–421.
Purvis, A., J. L. Gittleman, and T. M. Brooks. In press. Phylogeny and
conservation. Cambridge University Press, Cambridge.
Rodrigo, A. G. 1993. A comment on Baum’s method for combining
phylogenetic trees. Taxon 42:631–636.
Rodrigo, A. G. 1996. On combining cladograms. Taxon 45:267–274.
Roshan, U., B. M. E. Moret, T. L. Williams, and T. Warnow. 2004. Performance of supertree methods on various data set decompositions.
Pages 301–328 in Phylogenetic supertrees: Combining information
to reveal the Tree of Life (O. R. P. Bininda-Emonds, ed.). Kluwer
Academic, Dordrecht, the Netherlands.
Sanderson, M. J., A. C. Driskell, R. H. Ree, O. Eulenstein, and
S. Langley. 2003. Obtaining maximal concatenated phylogenetic
data sets from large sequence databases. Mol. Biol. Evol. 20:1036–
1042.
Slowinski, J. B., and R. D. M. Page. 1999. How should species
phylogenies be inferred from sequence data? Syst. Biol. 48:814–
825.
Stoner, C. J., O. R. P. Bininda-Emonds, and T. M. Caro. 2003. The
adaptive significance of coloration in lagomorphs. Biol. J. Linn. Soc.
79:309–328.
First submitted 31 December 2003; final acceptance 29 January 2004
Associate Editor: Mike Steel