PACT: an efficient and powerful algorithm for generating area

Journal of Biogeography (J. Biogeogr.) (2005) 32, 755–774
SPECIAL
PAPER
PACT: an efficient and powerful
algorithm for generating area cladograms
Maggie Wojcicki and Daniel R. Brooks*
Department of Zoology, University of
Toronto, Toronto, ON, Canada
ABSTRACT
Aim To introduce and describe the functioning of a new algorithm, phylogenetic
analysis for comparing trees (PACT), for generating area cladograms that provide
accurate representation of information contained in taxon–area cladograms.
Methods PACT operates in the following steps. Convert all phylogenies to
taxon–area cladograms. Convert all taxon–area cladograms to Venn diagrams.
Choose any taxon–area cladogram from the set of taxon–area cladograms to be
analysed and determine its elements. This will be the template area cladogram.
Select a second taxon–area cladogram. Determine its elements. Document which
elements in the second tree occur in the template tree (denoted by ‘Y’) and which
do not (denoted by ‘N’). Each ‘Y’ indicates a match with previous pattern and
these are combined. Each ‘N’ is a new element and is attached to the template area
cladogram at the node where it is linked with a Y. This requires two rules: (1)
‘Y + Y ¼ Y’ (combine common elements) as long as they are connected at the
same node; and (2) ‘Y + N ¼ YN’ (add novel elements to the template area
cladogram at the node where they first appear). Once the novel elements in the
second taxon–area cladogram have been added to the template area cladogram,
see if any of them can be further combined. This requires three additional rules:
(1) ‘Y(Y) ¼ Y(Y)’ (do not combine Y’s if they are attached at different nodes on
the template area cladogram); (2) ‘Y + YN ¼ YN’ (Y is part of group YN); and
(3) ‘YN + YN ¼ YNN’ (Y is the same for each, but each N is different). Repeat
for all available taxon–area cladograms.
Results Three exemplars demonstrate that PACT provides the most accurate
area cladograms for vicariance-driven biotic diversification, dispersal-driven
biotic diversification and taxon pulse-driven biotic diversification. PACT can also
be used as an a priori method of biogeographical analysis.
Main conclusions PACT embodies all the strong points and none of the
weaknesses of previously proposed methods of historical biogeography. It is most
useful as an a posteriori method, but it is also superior to all previous a priori
methods because it does not specify costs, or weights or probabilities, or
likelihoods of particular biogeographical processes a priori and is thus sensitive to
clade-specific historical contingencies.
*Correspondence: Daniel R. Brooks,
Department of Zoology, University of Toronto,
25 Harbord Street, Toronto, ON M5S 3G5,
Canada.
E-mail: [email protected]
Keywords
Cladistic biogeography, community evolution, dispersal, evolutionary radiations, historical biogeography, phylogenetic analysis, phylogeny, speciation,
taxon pulse, vicariance.
Formal methods of historical biogeographical analysis using
phylogenetic trees began appearing more than 25 years ago
(Platnick & Nelson, 1978). Today, two classes of methods for
documenting historical biogeographical patterns exist. All
begin by converting phylogenetic trees into taxon–area
cladograms (Morrone & Carpenter, 1994; Enghoff, 1996),
ª 2005 Blackwell Publishing Ltd www.blackwellpublishing.com/jbi
doi:10.1111/j.1365-2699.2004.01148.x
INTRODUCTION
755
M. Wojcicki and D. R. Brooks
replacing the name of each species with notations indicating its
geographical distribution. Cladistic biogeographical methods,
also called a priori methods (Van Veller et al., 1999, 2000,
2001, 2002; Van Veller & Brooks, 2001), produce the most
parsimonious pattern of area relationships for one or more
taxon–area cladograms, given the constraints of an a priori
model (assumptions 1 and 2 associated with various modifications of component analysis; costs associated with eventbased models such as DIVA; probabilities associated with
likelihood models such as Jungles). Phylogenetic biogeographical methods, also called a posteriori methods (Van Veller et al.,
1999, 2000, 2001, 2002; Van Veller & Brooks, 2001), utilize
simultaneous comparisons among multiple clades to generate
the most parsimonious pattern of area relationships without
reference to an a priori model. All but one of the a posteriori
methods is similar to the a priori methods in permitting each
area to occur only once.
All methods recognize three classes of biogeographical
patterns: (1) complete matching between the general pattern
and any given taxon–area cladogram, usually interpreted as
indicating vicariance, but recognized by some as possibly being
the result of sequential speciation by colonization in each clade
(Fig. 1); (2) incomplete matching, suggesting extinction in one
of the lineages (Fig. 2); and (3) duplication of all or part of the
pattern, suggesting sympatric speciation in the common
ancestor of the duplicated lineages (also known as lineage
duplication) (Fig. 3).
Three additional types of patterns have been considered
complicating factors that obscure the general area relationships. One of these is speciation by dispersal on the part of one
or more members of the co-occurring clades (peripheral
isolates allopatric speciation), introducing unique area relationships (Fig. 4). The remaining two types of patterns
represent cases in which more than one phylogenetic event
affects the same area, producing reticulated area relationships:
(1) two or more separate speciation events within a clade each
resulting in at least two non-sister species inhabiting the same
area (Fig. 5); and (2) post-speciation dispersal leading to the
occurrence of the same species in more than one area (also
known as the widespread species problem) (Fig. 6). All the
methods discussed above, except secondary BPA (Brooks et al.,
2001), treat examples of these phenomena as exceptions to a
presumed single general pattern.
Figure 1 A taxon–area cladogram showing a particular set of area
relationships involving areas A, B, C and D, stipulated to be the
general pattern (left), and a second taxon–area cladogram showing
the same area relationships as the general pattern (right). Letters ¼ areas.
756
Figure 2 A taxon–area cladogram showing a particular set of area
relationships involving areas A, B, C and D, stipulated to be the
general pattern (left), and a second taxon–area cladogram showing
area relationships among areas A, C and D, interpreted as having
lost, through extinction (also known as lineage sorting) a species
occurring in area B that was the sister species of the common
ancestor of the species occurring in areas C and D (right). Letters ¼ areas.
Analysis of complex biogeographical patterns has been
hampered by the lack of an algorithm for producing area
cladograms that permits reticulated area relationships. Wiley
(1986, 1988a,b) and Zandee & Roos (1987) introduced what
they called assumption 0 to historical biogeographical studies.
Assumption 0 stipulates that all the information in each
taxon–area cladogram must be used in a biogeographical
analysis and that the area relationships depicted in the final
area cladogram must be logically consistent with the phylogenetic relationships depicted in every taxon–area cladogram
used to construct the area cladogram. Assumption 0 cannot
be fully satisfied for cases of reticulated area relationships
using any method that requires each area to appear only once
in a general area cladogram. Brooks (1990) therefore proposed
modifying one a priori method, BPA, to allow for reticulated
area relationships. Empirical studies using this modification of
BPA suggest that the historical biogeography of most biotas is
a complex and historically unique combination of most, if not
all, of the above six classes of patterns. In particular, it appears
that the majority of areas of endemism have reticulated
histories (Brooks & McLennan, 2001; Brooks et al., 2001;
Green et al., 2002; McLennan & Brooks, 2002; Spironello &
Brooks, 2003; Bouchard et al., in press; Halas et al., 2005). No
other methods of historical biogeography have been modified
to allow for reticulated area relationships. BPA can be
implemented using standard methods in phylogenetic analysis
(Brooks & McLennan, 2001, 2002), but only with laborious
manipulations of the data. All taxon–area cladograms need to
be converted into binary matrices, and each area duplication
requires that the matrix be re-formulated. This re-formulation
produces large numbers of pseudo-missing data codes representing the areas not affected by the unique events requiring
the duplication (Brooks & McLennan, 1991, 2002). Performing such an analysis for complex data sets is thus timeconsuming. In addition, because one cannot specify the
number and types of area duplications that will be needed a
priori, some have been led to believe that the duplication
convention in BPA is idiosyncratic rather than algorithmic
(e.g. Ronquist, 2002; Siddall & Perkins, 2003).
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
Figure 3 A taxon–area cladogram showing a particular set of area
relationships involving areas A, B, C and D, stipulated to be the
general pattern (left), and a second taxon–area cladogram of two
major parts, each of which shows the same area relationships as
the first taxon–area cladogram, interpreted as having experienced
a sympatric speciation event (lineage duplication) in the common
ancestor of the clade (right). Letters ¼ areas.
Figure 4 A taxon–area cladogram showing a particular set of area
relationships involving areas A, B, C and D, stipulated to be the
general pattern (left), and a second taxon–area cladogram showing
area relationships among areas A, B, C, D and E, with the addition
in area E of a sister species of the species occurring in area B in the
second taxon–area cladogram, interpreted as an instance of peripheral isolates speciation (allopatric speciation by dispersal)
(right). Letters ¼ areas.
Figure 6 A taxon–area cladogram showing a particular set of area
relationships involving areas A, B, C, and D, stipulated to be the
general pattern (left), and a second taxon–area cladogram showing
the same area relationships as the first taxon–area cladogram,
except that the species occurring in area A also occurs in area D;
and a general area cladogram representing the area relationships
supported by both taxon–area cladograms (right). The species
occurring in areas A and D is interpreted as a case of post-speciation dispersal from area A to area D. Letters ¼ areas.
must be used, and the final area cladogram must be logically
consistent with all input taxon–area cladograms), and does
not presuppose any degree of vicariance. Unlike BPA,
however, this algorithm does not require that the taxon–area
cladograms be converted into matrices, nor does it require
two-stage analysis. The inspiration for this algorithm comes
from considering Venn diagram representations of taxon–
area cladograms as strings of hierarchically organized characters. The algorithm uses the string input to build a tree-like
data structure that can be searched for points of agreement
and disagreement with additional input taxon–area cladograms (Cormen et al., 2001). We assume that the history
of the geographical context of speciation, dispersal and
extinction for any assemblage of clades comprises a long
and complex combination of strings. We also assume that
no single clade contains the complete information, even
about its own particular history. By combining the
partial information from each of many clades, however, we
can reconstruct substantial parts of the biogeographical
record of life by integrating information from multiple
clades. As the hierarchical organization of the strings of
characters stems from phylogenetic relationships, we refer to
this algorithm as phylogenetic analysis for comparing trees
(PACT).
MECHANICS OF THE ALGORITHM
Figure 5 A taxon–area cladogram showing a particular set of area
relationships involving areas A, B, C and D, stipulated to be the
general pattern (left), and a second taxon–area cladogram showing
the same area relationships, with the addition of a species in area A
that is the sister species of the species occurring in area D, indicating that the species occurring in area A arose from two different
ancestors. Area A is thus said to have a reticulated history (right).
Letters ¼ areas.
In this contribution, we present an algorithm for deriving
area cladograms from phylogenies which, like BPA, satisfies
assumption 0 (all information in all taxon–area cladograms
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
The following exemplars represent three different biogeographical scenarios: (1) vicariance-driven diversification, (2)
dispersal-driven diversification and (3) taxon pulse-driven
diversification.
Exemplar 1: Vicariance-driven diversification
Step 1. Convert all phylogenetic trees of interest into taxon–
area cladograms. This is accomplished by replacing the names
of the species with the areas they inhabit.
Step 2. Convert the taxon–area cladograms into Venn diagrams
(Table 1). The Venn diagrams comprise two classes of
757
M. Wojcicki and D. R. Brooks
elements, ‘leaves’ and ‘nodes’. A leaf is a single area, and a node
is any grouping of at least two areas. Nodes are represented by
inclusive open [‘(’] and closed [‘)’] parentheses in the Venn
diagram. When a given species inhabits more than one area, a
leaf designates each of the areas and all the areas inhabited by
that species are contained within a single node.
Step 3. Choose any taxon–area cladogram from the set of
taxon–area cladograms to be analysed and determine its
elements. We will refer to this as the template area cladogram.
Template area cladogram (taxon–area cladogram 1 in Table 1):
(A(B(CD)))
PACT reads the Venn diagram representing the second
taxon–area cladogram from left to right, element by element.
Each time a closed parenthesis [‘)’] is encountered, indicating
a grouping of at least two areas, PACT moves backwards,
until it reaches an open parenthesis [‘(’], collecting the data
for the grouping thus created. Next, PACT represents the
grouping signified by the inclusive parentheses by a node,
which is a data structure designating a grouping and which is
used in integrating the taxon–area cladogram with the
template area cladogram. In this case, the first closed
parenthesis is reached after D. PACT then reads backwards
(to the left) collecting leaves and nodes until it reaches the
open parenthesis, in this case C + D. Once the data collection
is complete a node containing the leaves CD replaces the
parentheses around C and D. If we called that node ‘X’, the
Venn diagram would now be (A(BX)). PACT continues
reading to the left, searching for the next open parenthesis.
The next open parenthesis bracket forming a node, ‘Y’,
containing the leaf B and X. The taxon–area cladogram
is now modified to (AY), a grouping that receives its own
node, ‘Z’. The template area cladogram is now represented by
four leaves and three nodes: A, B, C, D, Z[A(B(CD)))],
Y[(B(CD))] and X[(CD)].
Step 4. Select a second taxon–area cladogram. Determine its
elements as in step 1 and then compare each of them with the
template area cladogram.
Template area cladogram: (A(B(CD)))
Taxon–area cladogram 2 (Table 1): (A(B(CD)))
PACT reads the second taxon–area cladogram in the same
manner as it reads the template area cladogram. In this case,
the first closed parenthesis is reached after D. PACT then reads
to the left, collecting leaves and nodes until it reaches the open
Table 1 Nine taxon–area cladograms represented as Venn diagrams
1
2
3
4
5
6
7
8
9
(A(B(CD)))
(A(B(CD)))
(A(CD))
((A(B(CD)))(A(B(CD))))
(A((BE)(CD)))
(A(B(C(DA))))
(A(BE))
(A(CD))
(A(A(B(CD))))
758
parenthesis, in this case it collects leaves C + D. Once the data
collection is complete, the parentheses around C and D are
replaced by a node containing the leaves CD. The next open
parenthesis forms a node, containing the leaf B and the node
(CD). Finally, the last open parenthesis forms a node
containing the leaf A and the node (B(CD)). The taxon–area
cladogram is now represented by four leaves and three
nodes: A, B,C, D, (A(B(CD))), (B(CD)) and (CD). The
next step is to integrate the taxon–area cladogram with the
template area cladogram. This is accomplished by maximizing
the matches between their respective leaves and nodes,
and then adding novel elements by creating novel nodes
at appropriate levels in the template area cladogram. Next,
PACT re-reads the elements of the taxon–area cladogram,
comparing them with the elements of the template area
cladogram. Each element in the input taxon–area cladogram
that also occurs in the template area cladogram is designated
with a ‘Y’; any element of the input taxon–area cladogram
that is not found in the template area cladogram is designated
with a ‘N’:
ðAðBðCDÞÞÞ Y; A Y þ ðBðCDÞÞ Y; B Yþ
ðCDÞ Y; C Y þ D Y
This produces the first and most basic rule of the algorithm,
the ‘Y + Y ¼ Y’ rule. In this case, each element of the input
taxon–area cladogram is congruent with an element in the
template area cladogram (all elements in tree 2 are Y’s), so
trees 1 and 2 can be combined completely. The general area
cladogram resulting from the combination of trees 1 and 2 is
thus (A(B(CD))) (Fig. 7).
PACT performs this search in the sequence in which groups
appear in the input taxon–area cladogram to be combined
with those in the template area cladogram. This speeds up the
process of analysing the new cladogram and making combinations and addition to the template. In the case above, for
example, PACT would have recognized that, because
(A(B(CD))) ¼ Y in the input taxon–area cladogram, all
elements in the input taxon–area cladogram corresponded to
elements in the template area cladogram, and would have
made the combination immediately.
Step 5. Add a third taxon–area cladogram (tree 3) and repeat
steps 2 and 3, comparing it with the tree resulting from the
combination of the previous steps.
Template area cladogram: (A(B(CD)))
(A(B(CD))); A + (B(CD)); B + (CD); C + D
Taxon–area cladogram 3 (Table 1): A(CD))
(A(CD)) ) N; A ) Y + (CD) ) Y; C ) Y + D ) Y
Figure 7 PACT-derived area cladogram for taxon–area cladograms 1–4 in Table 1. Letters ¼ areas.
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
In this case, there is a mismatch between the template area
cladogram and the input taxon–area cladogram at the initial
level, indicated by N. At this point no decision can be made as
to why the mismatch occurs, so PACT does not produce any
changes and moves on. All remaining elements in taxon–area
cladogram 3 are ‘Y’, so we can combine them with the template
area cladogram. At this point we can begin to consider the
mismatch, but we discover that in this case, the entire input
taxon–area cladogram has been combined with the template
area cladogram. The ‘N’ seems to have disappeared. The reason
for this is that the template area cladogram differs from the
input taxon–area cladogram only by containing information
not found in the input taxon–area cladogram. The absence
of B in the input taxon–area cladogram does not affect
the placement of B in the template area cladogram, and
thus does not affect the topology of the area cladogram. The
general area cladogram for trees 1 + 2 + 3 is still (A(B(CD)))
(Fig. 7).
Step 6. Add the next tree (4) and repeat steps 2–3.
Template area cladogram: (A(B(CD)))
(A(B(CD))); A + (B(CD)); B + (CD); C + D
Taxon–area cladogram 4 (Table 1): (A(B(CD)))(A(B(CD))))
(A(B(CD)))(A(B(CD)))) ) N; (A(B(CD))) ) Y; A ) Y +
(B(CD)) ) Y;
B ) Y + (CD) ) Y;
C ) Y + D ) Y;
(A(B(CD))) ) Y; A ) Y + (B(CD)) ) Y; B ) Y + (CD) ) Y;
C)Y+D)Y
Once again, the only N occurs at the level of the entire input
taxon–area cladogram, and that N disappears once the lower
levels are combined with the template. In this case, the input
taxon–area cladogram appears more complex than the template, but only because it contains two identical representations of the template area cladogram. This is the diagnostic
signature of lineage duplication, sympatric speciation within an
ancestor producing two co-occurring lineages. This does not
affect the pattern of relationships among areas, so the general
area cladogram for trees 1 + 2 + 3 + 4 is still (A(B(CD)))
(Fig. 7).
Step 7. Add the next tree (5) and repeat steps 2–3.
Template area cladogram: (A(B(CD)))
(A(B(CD))); A + (B(CD)); B + (CD); C + D
Taxon–area cladogram 5: (A((BE)(CD)))
(A((BE)(CD))) ) N; A ) Y + ((BE)(CD)) ) N; (BE) ) N +
(CD) ) Y; B ) Y + E ) N; C ) Y + D ) Y
Reading from the left, PACT encounters (BE); B in the input
taxon–area cladogram is Y, and because B and BE are
connected at the same node, both B’s can be combined. E,
which is not found in the template area cladogram, is thus a
novel (‘N’) element, and added to the template area cladogram
at that point, creating a (BE) grouping (a new node) in the
template. The next closed parenthesis is encountered at (CD);
both C and D as well as the grouping CD are Y in the template
area cladogram, so there is no change at this point. The next
closed parenthesis is ((BE)(CD)). This combination already
exists in the template area cladogram due to the modification
made earlier in which E was added to the template area
cladogram. Finally, PACT encounters A, which is Y, and is
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
combined with the template. The resulting area cladogram is
(A((BE)(CD))) (Fig. 8).
Step 8. Add the next tree (6) and repeat steps 2–3, comparing it
with the tree produced by 1 + 2 + 3 + 4 + 5 (Fig. 8).
Template area cladogram: (A((BE)(CD))))
(A((BE)(CD)))); A + ((BE)(CD)); (BE) + (CD); B + E; C + D
Taxon–area cladogram 6: (A(B(C(DA))))
(A(B(C(DA)))) ) N; A ) Y* + (B(C(DA))) ) N; B ) Y +
(C(DA)) ) N; C ) Y + (DA) ) N; D ) Y + A ) Y*
This case is directly analogous to the previous one. Reading
from left to right, the algorithm initially encounters (AD),
which is not found in the template area cladogram (‘N’). The
‘A’ in (AD) is thus considered a novel element (‘N’) and the
input taxon–area cladogram is modified to
ðAðBðCðDAÞÞÞÞ N; A Y þ ðBðCðDAÞÞÞ N; B Yþ
ðCðDAÞÞ N; C Y þ ðDAÞ N; D Y þ A N
Next, PACT encounters (CD) in the template tree and
(C(DA)) in the input taxon–area cladogram tree. C is a
common element in both cladograms, and can be combined.
This leaves D in the template area cladogram and (DA) in
the input taxon–area cladogram connected at the same node.
This means that both D’s can be combined, creating a
(C(DA)) grouping (and new node) in the template area
cladogram. At the next level, we find the grouping (BE) in
the template area cladogram and the leaf B in the input
taxon–area cladogram. As in step 7, above, both B’s can be
combined, leaving the grouping (BE) in the template area
cladogram intact. At the next level, we encounter leaf A in
both cladograms, which are combined. This confirms
PACT’s initial assessment of Y for leaves A (basal most),
B, C and D. The input taxon–area cladogram contains a
novel grouping (DA) not found in the template and the
template contains a grouping (BE) not seen in the input
taxon–area cladogram. The resulting area cladogram is
(A((BE)(C(DA)))) (Fig. 9).
The situation presented by taxon–area cladograms 5 and 6,
above, represent cases of what we call the ‘Y + YN ¼ YN’ rule.
For clade 5, ‘Y’ ¼ B and ‘YN’ ¼ BE; for clade 6 ‘Y’ ¼ D and
‘YN’ ¼ DA. Next, we consider taxon–area cladograms 7 and 8
in Table 1 on their own, in order to demonstrate a final
combination rule.
Step 9. Choose one area cladogram to be the template (we
choose 7 in this case, but one could also choose 8 without
changing the results).
Taxon–area cladogram 7: (A(BE))
A(BE)); A + (BE); B + E
Figure 8 PACT-derived area cladogram for taxon–area cladograms 1–5 in Table 1. Letters ¼ areas.
759
M. Wojcicki and D. R. Brooks
Figure 9 PACT-derived area cladogram for taxon–area cladograms 1–6 in Table 1. Letters ¼ areas.
Taxon–area cladogram 8: (A(CD))
(A(CD)) ) N; A ) Y + (CD) ) N; C ) N + D ) N
‘A’ is the only common element (Y) in both taxon–area
cladograms. The groups (BE) and (CD) contain no elements in
common, but each is connected at a node with A. In this case,
although many dichotomous area cladograms consistent with
the data are possible, we have no evidence supporting any
particular one. Therefore, the resultant area cladogram is
(A(BE)(CD)) (Fig. 10). This is an example of what we call the
‘YN + YN ¼ YNN’ rule, where A ¼ ‘Y’, (BE) ¼ ‘N’ and
(CD) ¼ ‘N’.
Step 10. We can now combine the area cladogram for taxon–
area cladograms 7 and 8 (Fig. 10) with the template area
cladogram (Fig. 9).
Template area cladogram for clades 1–6: (A((BE)(C(DA))))
Area cladogram for clades 7–8: (A(BE)(CD))
(A(BE)(CD)) ) N; A ) Y + (BE) ) Y + (CD) ) N; B ) Y +
E ) Y; C ) Y + D ) Y
D in the area cladogram for clades 7–8 and (DA) in the
template area cladogram is a case of the ‘Y + YN ¼ YN’ rule,
so D in the area cladogram for areas 7–8 is combined with D in
(DA) in the template area cladogram); therefore (CD) in the
area cladogram for clades 7–8 is combined with (C(DA)) in the
template. A and (BE) are both Y, so they are combined. At this
point, all elements in the area cladogram for clades 7–8
have been integrated with the template area cladogram
(A((BE)(C(DA)))) (Fig. 9).
Step 11. Finally, we add taxon–area cladogram 9 in Table 1 to
the template area cladogram.
Template area cladogram: (A((BE)(C(DA))))
(A((BE)(C(DA)))); A + ((BE)(C(DA))); (BE) + (C(DA));
C + (DA); B + E; D + A
Taxon–area cladogram 9: (A(A(B(CD))))
(A(A(B(CD)))) ) N;
A ) Y* + A(B(CD)) ) N;
A ) Y* + (B(CD)) ) N; B ) Y + (CD) ) N; C ) Y + D ) Y
Once again, reading from left to right, PACT first encounters (CD). We begin with CD in the input taxon–area
cladogram + (C(DA)) in the template area cladogram. This
is a case of the ‘Y + YN ¼ YN’ rule, so D in the taxon–area
Figure 10 PACT-derived area cladogram for taxon–area cladograms 7–8 in Table 1. Letters ¼ areas.
760
cladogram is combined with (DA) in the template area
cladogram. Next, (B(CD)) in the input taxon–area cladogram,
now considered (B(C(DA))), is connected at the same node in
the template area cladogram as ((BE)(C(DA))). (C(DA)) is the
same in both cases, so they are combined, leaving B and (BE),
another case of the ‘Y + YN ¼ YN’ rule. At the next node, the
template area cladogram and the taxon–area cladogram are
both A, so they are combined.
Finally, the input taxon–area cladogram has an additional
A, originally designated Y, because A occurs twice in the
template area cladogram and twice in the taxon–area cladogram. At this point, we have already accounted for both A’s in
the template area cladogram, so PACT must still account for
the second A in the input taxon–area cladogram. One
possibility is that the two A’s in the taxon–area cladogram
are paraphyletic because they represent an episode of sympatric speciation (lineage duplication), in which case both could
be combined. PACT does not combine these A’s, for a
methodological and a biological reason, respectively. First,
single areas are not sufficient grounds for grouping or
combining areas. PACT will not create groupings of areas in
the absence of any evidence of groupings. Secondly, sympatric
speciation is not the only possible explanation for the
paraphyletic status of the area A’s. Combining the two A’s
would be tantamount to making a choice in favour of
sympatric speciation in the face of ambiguity, rather than
waiting for additional data (more taxon–area cladograms) to
resolve the ambiguity.
This provision in PACT prevents over-combining data; we
call it the ‘Y(Y)’ ¼ ‘Y(Y)’ or ‘Y(Y) „ Y’ rule, or ‘do not
combine single common areas attached to different nodes’. All
current methods, including secondary BPA, violate this rule.
Three short exemplars underscore the importance of this final
rule. Figure 11 depicts two taxon–area cladograms, which
share only a single area (A). Because there are no area
groupings in common, PACT does not make any combinations.
The same is true for the exemplar in Fig. 12, in which each
taxon–area cladogram contains the same areas, but has no
groupings of areas in common; once again, PACT makes no
combinations.
By representing these sorts of ambiguity with duplications,
PACT differs from other methods of historical biogeography,
which may create ambiguity by over-combining data. In
matrix representation methods, including BPA, this is called
inclusive ORing, which is known to create other systemic
analytical problems (Cressey et al., 1983; Brooks & McLennan,
1991, 2002). Consider the taxon–area cladograms ((AC)B) + (A(AB)). PACT produces (A((AC)B)) for these two
taxon–area cladograms. If we combine the A’s in taxon–area
cladogram 2, the result would be ((AC)B). Now add a third
taxon–area cladogram, (A(CB)). The PACT result is still
(A((AC)B)), supporting an interpretation that all three taxon–
area cladograms are parts of a single complex pattern, one part
of which is missing in each. If we had combined the A’s in
taxon–area cladogram 2, however, the result would be an
unresolved polytomy (ACB). At this point, all methods,
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
Figure 11 PACT does not make combinations based on single areas. ‘A’ is shared between the two taxon–area cladograms on the left, but
the other elements are unique; thus, there is no evidence to support combining the ‘A’s’. The PACT-derived area cladogram is shown on the
right. Letters ¼ areas.
Figure 12 PACT does not make combinations based on single areas. All four areas are shared between the two taxon–area cladograms on
the left, but their relative relationships are different; thus, there is no evidence supporting any combinations, producing the PACT-derived
area cladogram on the right. Letters ¼ areas.
including secondary BPA, would infer that the taxon–area
cladograms had no information in common, although cladistic
(a priori) methods could invoke various assumptions, probabilities, costs, or likelihoods to support one of the possible
resolutions of the polytomy.
PACT thus treats the basal-most A as a new element added
to the template area cladogram, which is modified to
(A(A((BE)(C(DA))))) (Fig. 13). All available taxon–area cladograms have now been incorporated, resulting in the final area
cladogram (Fig. 13; Table 2 summarizes PACT). Some may
notice at this point that either of the two basal A’s in taxon–
area cladogram 9 could be considered the same as the basal A
in the template area cladogram. We will return to this issue
later, but at this point we will note only that this ambiguity
does not affect the construction of the area cladogram, only the
mapping of particular species onto the area cladogram when
we begin to derive evolutionary inferences from the area
cladogram.
Exemplar 2: Dispersal-driven diversification
We believe that a general method for historical biogeographical
analysis should be able to distinguish biotic diversification
driven primarily by vicariance from those driven primarily by
dispersal, rather than simply searching for support for
vicariance. We next present a two-clade exemplar, in which
each clade colonized the same set of areas (B, C, D and E) from
different source areas (A and F). We explain the dispersal
scenario supported by this exemplar more fully in the
discussion.
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Figure 13 PACT-derived area cladogram for taxon–area cladograms 1–9 in Table 1. Letters ¼ areas.
Template area cladogram (Fig. 14): (A(B(D((CE)(CD)))));
A + (B(D((CE)(CD)))); B + (D((CE)(CD))); D + ((CE)(CD));
(CE) + (CD); C + E; C + D
Taxon–area cladogram 2 (Fig. 15): (F(C(DB)((D(BE))(CD))));
F + (C(DB)((D(BE))(CD)));
C + (DB) + ((D(BE))(CD));
(D(BE)) + (CD); D + (BE); B + E; C + D
Reading taxon–area cladogram 2 from left to right, PACT
reaches the first closed parenthesis following B and reads
backwards, forming node[DB]. Taxon–area cladogram 2 is
modified to (F(C node[DB]((D(BE))(CD)))). The closest D
and B are a node apart in the template area cladogram and
they are both single areas (leaves) grouped with nodes.
Therefore D ) N, B ) N and node[DB] ) N. PACT cannot
place this node in the template area cladogram at this point,
so there is no change to the template area cladogram. The
next closed parenthesis in taxon–area cladogram 2 follows E.
Reading backwards the node[BE] is formed and taxon–area
cladogram 2 is modified to (F(C node[DB]((D node
[BE])(CD)))). The closest B and E are several nodes apart
in the template area cladogram. B is a leaf grouped with a
node and E is a leaf grouped with a leaf C. If node[BE]
761
M. Wojcicki and D. R. Brooks
Table 2 Summary of the PACT algorithm
1. Convert all phylogenies to taxon–area cladograms.
2. Convert all taxon–area cladograms to Venn diagrams.
3. Choose any taxon–area cladogram from the set of taxon–area cladograms to be analysed, and determine its elements. This will be the template area
cladogram. The initial choice of a template area cladogram does not affect the final outcome (the analysis will proceed more quickly if you begin with
the most complex taxon–area cladogram).
4. Select a second taxon–area cladogram. The order in which you add taxon–area cladograms does not affect the final outcome (the analysis will
proceed more quickly if you begin with the most complex taxon–area cladograms). Determine its elements as in step 1. Then document which
elements in the second tree occur in the template tree (denoted by ‘Y’), and which do not (denoted by ‘N’). Each ‘Y’ indicates a match with previous
pattern, and these are combined. Each ‘N’ is a new element, and is attached to the template area cladogram at the node where it is linked with a Y. To
get to this point, we invoke two rules:
‘Y + Y ¼ Y’ (combine common elements) as long as they are connected at the same node
‘Y + N ¼ YN’ (add novel elements to the template area cladogram at the node where they first appear)
5. Once the novel elements in the second taxon–area cladogram have been added to the template area cladogram, see if any of them can be further
combined. At this point, we invoke three additional rules:
‘(Y(Y) ¼ (Y(Y)’ (do not combine Y’s if they are attached at different nodes on the template area cladogram)
‘Y + YN ¼ YN’ (Y is part of group YN)
‘YN + YN ¼ YNN’ (Y is the same for each, but each N is different)
6. Add the remaining taxon–area cladograms by repeating steps 4–5 for all remaining taxon–area cladograms. When novel elements (groups of more
than one area) are combined with an template area cladogram and their relationships with other groups existing in the template area cladogram are
not accounted for in the input taxon–area cladogram – they are denoted as a novel element to these unaccounted for groups which may, in fact, be
grouped together with them on a later occasion.
would be in a node with C then it would be a case of
Y + YN ¼ YN, where B is a new element. PACT reads to
the next closed parenthesis and backwards resulting in
node[D + BE], therefore the previous combination is a case
of the ‘YN + YN ¼ YNN’ rule and results in node[BCE].
The template area cladogram is modified to contain
node[BCE]. Leaf D and node[BCE] are a node apart in
the template area cladogram therefore the D grouped with
node[BCE] in taxon–area cladogram 2 is a new element and
is a case of Y + YN ¼ YN. Taxon–area cladogram 2 is
modified to (F(C node[DB](node[D + BCE](CD)))) and the
template area cladogram is also modified at node[BCE] to
node[D + BCE] and all the nodes that contained node[BCE]. The next closed parenthesis is reached after D,
forming node[CD]. C ) Y, D ) Y and node[CD] ) Y in the
template area cladogram; therefore, it is not changed and
taxon–area cladogram 2 is modified to (F(C node[DB]
(node[D + BCE] node[CD]))). The next closed parenthesis
is reached after node[CD], forming node[DBCE + CD],
node[CD] ) Y, node[D + BCE] ) Y and node[CD + DBCE] ) Y in the template area cladogram; therefore, it is
not changed and taxon–area cladogram 2 is modified to
Figure 14 Taxon–area cladogram for a group colonizing areas B,
C, D and E from area A. Letters ¼ areas.
762
Figure 15 Taxon–area cladogram for a group colonizing areas
B, C, D and E from area F. Letters ¼ areas.
(F(C node[DB] node[DBCE + CD])). Reading to the left,
the next closed parenthesis is node[C + DB + DBCECD].
C ) N, node[DB] ) N and node[DBCECD] ) Y. In the
template area cladogram, node[DBCECD] is grouped
with leaf D. These are combined, forming node
[C + DB + D + DBCECD]. The combination of node[DB] + leaf D is simplified to node[DB]; the template area
cladogram is modified at node[D + DBCECD] to node
[DB + C + DBCECD] and taxon–area cladogram 2 is
modified to (F node[DB + C + DBCECD]). The last closed
parenthesis forms node[F + DBCDBCDCD]. F ) N and
node[DB + C + DBCECD] ) Y in the template area cladogram. In the template area cladogram node[DB + C + DBCECD] is grouped with leaf B. Therefore, the combination
results in node[F + B + DBCDBCDCD]. In the template
area cladogram, node[B + DBCDBCDCD] is modified to
node[F + B + DBCDBCDCD]. PACT is finished combining
taxon–area cladogram 2 with the template, resulting in the
new template area cladogram (A(BF((DB)C((D(EBC))(DC))))))
(Fig. 16).
The BPA result for this exemplar, which also satisfies
assumption 0, is slightly different (Fig. 17). This is because
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
BPA roots both clades at the base of the area cladogram. This
implies that both clades became associated with the areas in
which they occur at the same time. When we make that
assumption, we obtain the area cladogram shown in Fig. 17.
This result combines the basal-most B from clade 1 with the
BC grouping in clade 2, rather than combining the basal-most
C from clade 1 with the BC grouping in clade 2. While this is a
possibility, there is no direct evidence supporting such an a
priori rooting assumption, based on only two taxon–area
cladograms, so PACT produces a more conservative, and
preferable, result. BPA could produce the same result as PACT
if at least three clades were analysed, two of which showed an
unambiguous rooting with area A.
Exemplar 3: Taxon pulse-driven diversification
All a priori and a posteriori methods, including secondary
BPA, assume that general patterns are the result of
speciation resulting from the formation of geographical
barriers (vicariance) and that speciation resulting from active
dispersal across barriers does not produce general patterns,
because the ability to disperse across existing barriers and to
form founder populations should be constrained by cladespecific vagility and population biology. The exemplars
depicted in Table 1 all conform to this general expectation.
Wiley (1981) noted that some circumstances, such as
colonization of islands, might produce general distribution
patterns based on dispersal rather than vicariance, and
Endler (1982) suggested that such correlated dispersal
patterns might be common. More recently, Bouchard &
Brooks (in press) demonstrated that the evolution of
flightlessness in insects endemic to areas within the Australian Tropical Rainforest had no effect on how widespread
species were or on modes of speciation.
Erwin (1979, 1981, 1985, 1991) and Erwin & Adis (1982)
pointed out that the maximum vicariance model does not
account for common ancestors becoming widespread enough
to be affected by vicariance, or for post-vicariance dispersal
by multiple members of the same biota setting the stage for
new vicariant events. What happens, however, when a
previously existing barrier, one that produced vicariant
Figure 16 Area cladogram for taxon–area cladograms shown in
Figs 13 and 14. Letters ¼ areas.
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Figure 17 BPA area cladogram for taxon–area cladograms shown
in Figs 13 and 14. Letters ¼ areas.
speciation among members of multiple clades, breaks down?
Erwin asserted that this should produce multiple clades
dispersing from their areas of origin, i.e. general biogeographical distribution patterns based on dispersal, or biotic
expansion, including many widespread taxa. Halas et al.
(2005) recently suggested some protocols for discerning
taxon pulse-driven diversification, or complex biotic evolution driven by biotic expansion alternating with episodes of
vicariance. In part, their protocol requires a method for
generating area cladograms that is sensitive to complex
biogeographical histories, widespread taxa, and analysis of
multiple clades. They also noted that a rigorous assessment
of the generality of taxon pulse radiations could not be
undertaken easily in the absence of an efficient algorithm for
analysing multiple clades. We next present a 17-clade
exemplar (Table 3) that embodies a taxon pulse radiation,
to demonstrate that PACT is the kind of efficient algorithm
discussed by Halas et al. In this exemplar, we use a
telegraphic version of the discussions of the first two
exemplars, to save space. For example, when we state that
an element of an input taxon–area cladogram ‘is combined’,
it should be understood that we mean ‘is combined with the
template’.
Step 1.
Template area cladogram (taxon–area cladogram 1 Table 3):
(((TH)(IS))(R(E(AL))))
Taxon–area cladogram 2 (Table 3): (((TH)(IS))(T(R(EE))))
(1) T ) Y;H ) Y; TH ) Y; I ) Y; S ) Y; IS ) Y;
TH + IS ) Y,
so
((TH)(IS))
is
combined.
(2)
E ) Y + E ) Y, so (EE) becomes E. R ) Y but (RE) ) N.
In the template area cladogram, E is grouped with AL and R
is grouped with EAL, so R and E are single areas a node
apart. PACT cannot determine if either or both are the same
as E + R in the input taxon–area cladogram, so they are
considered new areas, forming a new element RE. At this
point, we do not know where RE it fits in the template area
cladogram. (3) T ) N, so this is a new element and there is
still not enough information to place T and RE. (4) When
the PACT considers THIS + TRE, TRE can be placed at the
same node as THIS + REAL. TRE and REAL are novel
763
M. Wojcicki and D. R. Brooks
Table 3 Taxon–area cladograms for 17 clades and eight areas
(called T, H, I, S, R, E, A and L), represented as Venn diagrams
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
Clade
1 (((TH)(IS))(R(E(AL))))
2 (((TH)(IS))(T(R(EE))))
3 (((TH)(IS))((IS)(R(E(AL)))))
4 (((TH)(IS))((IS)(T(R(EE)))))
5 (((TH)(IS))((T(HE))(T(R(EE)))))
6 (((TH)(IS))((T(HE))(R(E(AL)))))
7 (((TH)(IS))((IS)((T(HE))(R(E(AL))))))
8 ((TH)((IS)((T(HE))(T(R(EE))))))
9 ((T(HE))((R((E(AL)))(T(R(EE)))))
10 ((R(E(AL)))(T(R(EE))))
11 ((T(IS))(R(E(AL))))
12 (((TH)(IS))(T(RE)))
13 ((TH)((IS)(R(E(AL)))))
14 (H((IS)(T(R(EE)))))
15 ((T(HE))(T(R(EE))))
16 (T((IS)((HE)(T(R(EE))))))
17 ((H(IS))(T(R(EE))))
elements in relation to each other and are marked as such.
The template area cladogram is modified to (((TH)(IS))
(T(RE))(R(E(AL)))).
Step 2.
Template area cladogram: (((TH)(IS))(T(RE))(R(E(AL))))
Taxon–area cladogram 3 (Table 3): (((TH)(IS))((IS)
(R(E(AL)))))
(1) T ) Y; H ) Y; TH ) Y; I ) Y; S ) Y; IS ) Y;
TH + IS ) Y: ((TH)(IS)) is combined. I ) N; S ) N;
IS ) N; No known placement at this point. (2) A ) Y;
L ) Y; AL ) Y; E ) Y; E + AL ) Y; R ) Y; R + EAL ) Y
combined and form (R(E(AL))) in the template area cladogram. (3) IS + REAL, now IS can be placed at same node as
REAL. (4) THIS + ISREAL ) N; the information here is that
IS is grouped with REAL but not with THIS therefore THIS,
REAL and IS cannot share the same node. (5) Does TRE share
a node with IS and REAL or THIS and ISREAL? It is not
known at this point. IS and REAL are grouped together but IS
is marked as a novel element where the relationship with TRE
is unknown (i.e. it is possible that TRE may be grouped with
either IS and REAL or may remain as it is in the trichotomy of
ISREAL, TRE and THIS). The template area cladogram is thus
modified to (((TH)(IS))((T(RE))(IS)(R(E(AL))))).
Step 3.
Template area cladogram: (((TH)(IS))((IS)(T(RE))(R
(E(AL)))))
Taxon–area cladogram 4 (Table 3): (((TH)(IS))((IS)(T
(R(EE)))))
(1) T ) Y;H ) Y; TH ) Y; I ) Y; S ) Y; IS ) Y;
TH + IS ) Y: ((TH)(IS)) is combined. (2) I ) Y; S ) Y;
IS ) Y. (3) E ) Y; E ) Y; E + E ¼ E; R ) Y; R + E ) Y;
T + RE ) Y. (4) IS + TRE ) N; IS and TRE share the same
764
node, therefore IS is no longer marked as a novel element with
no relationship to TRE forming ((IS)(T(RE))(R(E(AL)))).
ISTRE is therefore represented as ISTREREAL in the template
area cladogram. (5) THIS + ISTRE-N, but ISTRE is represented by the newly formed ((IS)(T(RE))(R(E(AL)))) in the
template area cladogram; therefore, THIS + ISTREREAL ) Y.
The template area cladogram is modified to (((TH)(IS))((IS)
(T(RE))(R(E(AL))))).
Step 4.
Template area cladogram: (((TH)(IS))((IS)(T(RE))(R
(E(AL)))))
Taxon–area cladogram 5 (Table 3): (((TH)(IS))((T(HE))(T
(R(EE)))))
(1) T ) Y;H ) Y; TH ) Y; I ) Y; S ) Y; IS ) Y;
TH + IS ) Y: ((TH)(IS)) ) Y and is combined. (2) H ) N;
E ) Y; HE ) N; in the template area cladogram one E is
grouped with AL, which would immediately lead to this E
being N if these were all the Es in the template area
cladogram, but there is also E grouped with R. At this point
it is not known whether there is R in cladogram 5 therefore
no decision is made. (3) T ) Y, but T is grouped with RE,
no decision has been made as to whether R is in cladogram
5, so no decision can be made at this point. (4) E ) Y;
E ) Y; E + E ¼ E; R ) Y; R + E ) Y; and R has been found
in cladogram 5 that is also grouped with E, therefore the E
grouped with H in cladogram 5 is considered N. (5) T ) Y,
(T(RE)) ) Y; T has been found in cladogram 5 grouped
with RE as it is in the template area cladogram therefore T
grouped with HE in cladogram 5 is also set to N. (6)
(T(HE)) + (T(RE)); (T(HE)) is added to the template area
cladogram at the same node as (T(RE)). (7) THIS + THETRE ) N; THIS and THETRE are more than one level apart
and cannot be added. REAL (and all its components) are
not represented in cladogram 5, therefore the novel element
THE is marked as novel with its relationship to REAL and
IS unknown. The template area cladogram is thus modified
to (((TH)(IS))((IS)((T(HE))(T(RE)))(R(E(AL)))).
Step 5.
Template area cladogram: (((TH)(IS))((IS)(T(HE)) (T(RE))
(R(E(AL))))
Taxon–area cladogram 6 (Table 3): (((TH)(IS))((T(HE))(R
(E(AL)))))
(1) T ) Y;H ) Y; TH ) Y; I ) Y; S ) Y; IS ) Y;
TH + IS ) Y: ((TH)(IS)) ) Y and is combined. (2) H ) Y;
E ) Y;(HE) ) Y; T ) Y; (T(HE)) ) Y and is combined. (3)
R ) Y; E ) Y; A ) Y; L ) Y; (AL) ) Y; (E(AL)) ) Y;
(R(E(AL))) ) Y and is combined. (4) ((T(HE))(R(E
(AL)))) ) N, (T(HE)) shares a node with (T(RE)) in the
template area cladogram, but in cladogram 6 (T(HE)) shares a
node with (R(E(AL))). In the template area cladogram
(R(E(AL))) shares a node with (T(HE)) + (T(RE)) and THE
is no longer marked as a novel element with no known
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
relationship to REAL. The template area cladogram is thus
modified to (((TH)(IS))((IS)((T(RE))(T(HE))((R(E(AL)))))
Step 6.
Template area cladogram: (((TH)(IS))((IS)((T(RE))(T(HE))
((R(E(AL)))))
Taxon–area
cladogram
7
(Table 3):
(((TH)(IS))((IS)((T(HE))(R(E(AL))))))
(1) T ) Y;H ) Y; TH ) Y; I ) Y; S ) Y; IS ) Y;
TH + IS ) Y: ((TH)(IS)) ) Y and is combined. (2) I ) Y;
S ) Y; (IS) ) Y and is combined. (3) H ) Y; E ) Y;
(HE) ) Y; T ) Y; (T(HE)) ) Y and is combined. (4) A ) Y;
L ) Y; (AL) ) Y;E ) Y; (E(AL)) ) Y; R ) Y; (R(E(AL))) ) Y
and is combined. (5) ((T(HE)(R(E(AL)))) ) N but
((T(HE))(R(E(AL)))(T(RE))) is in the template area cladogram; THE and REAL are not grouped together although they
are at the same level because THE is no longer a novel element
in relation to REAL (see step 5) and THEREAL is represented
as THEREALTRE in the area cladogram. ((IS)((T(HE))(R
(E(AL))))) ) N but THEREAL is represented as THEREALTRE
in the template area cladogram and therefore IS + THEREAL
is represented as IS + THEREALTRE in the template area
cladogram. (6) THIS + ISTHEREAL ) N; but ISTHEREAL is
represented as ISTHEREALTRE in the template area cladogram and THIS + ISTHEREALTRE ) Y. The template area
cladogram is unchanged.
Step 7.
Template area cladogram: (((TH)(IS))((IS)((T(RE))(T(HE))
((R(E(AL)))))
Taxon–area cladogram 8 (Table 3): ((TH)((IS)((T(HE))(T(R(EE))))))
(1) T ) Y; H ) Y; (TH) ) Y and is combined. (2) I ) Y;
S ) Y; (IS) ) Y and is combined. (3) H ) Y; E ) Y;
(HE) ) Y; T ) Y; (T(HE)) ) Y and is combined. (4) E ) Y;
E ) Y; E + E ¼ E; R ) Y; (RE) ) Y; T ) Y; (T(RE)) ) Y and
is combined. (5) ((T(HE)(T(RE))) ) N; one node apart but
neither TRE nor THE are novel in relationship to each other,
their relationship has been resolved and therefore there is
no change; THETRE is represented by THEREAL. (6)
((IS)((T(HE))(T(RE)))) is represented as ISTHEREALTRE in
the template area cladogram. (7) TH + ISTHETRE is represented in THIS + ISTHEREALTRE. The template area cladogram is unchanged.
R ) Y; (R(E(AL))) ) Y and is combined. (3) E ) Y; E ) Y;
E + E ¼ E; R ) Y; (RE) ) Y;T ) Y; (T(RE)) ) Y and is
combined. (4) ((T(RE))(R(E(AL)))) ) N; this grouping is
connected to the template area cladogram at the same node as
(T(HE) + (R(E(AL)) + (T(RE)), so (R(E(AL)) and (T(RE))
can be combined and they are no longer known as novel
elements in relation to each other as they had been marked in
step 1. The template area cladogram is modified to
(((TH)(IS)((IS)((T(HE))((R(E(AL)))(T(RE)))))).
Step 9.
Template area cladogram: (((TH)(IS)((IS)((T(HE))(R(E(AL)))(T(RE))))))
Taxon–area cladogram 10 (Table 3): ((R(E(AL)))(T(R(EE))))
(1) A ) Y; L ) Y; (AL) ) Y and is combined. (2) E ) Y
(E(AL)) ) Y; R ) Y; (R(E(AL))) ) Y and is combined. (3)
E ) Y; E ) Y; E + E ¼ E; R ) Y; (RE) ) Y;T ) Y;
(T(RE)) ) Y and is combined. (4) ((R(E(AL)))(T(R(EE))))
) Y and is combined. The template area cladogram is
unchanged.
Step 10.
Template area cladogram: (((TH)(IS)((IS)((T(HE))(R(E(AL)))(T(RE))))))
Taxon–area cladogram 11 (Table 3): ((T(IS))(R(E(AL))))
(1) I ) Y; S ) Y; (IS) ) Y; T ) Y; (T(IS)) ) N, but T
connects at the same node as (TH), making this an example of
the Y + YN ¼ YN rule, thus TIS is represented as THIS in the
template area cladogram. (2) A ) Y; L ) Y; (AL) ) Y;E ) Y;
(E(AL)) ) Y; R ) Y; (R(E(AL))) ) Y and is combined. (3)
TIS + REAL are more than one node apart and are represented
as THIS + ISTHEREALTRE in the template area cladogram.
The template area cladogram does not change.
Step 11.
Template
area
cladogram:
(((TH)(IS)((IS)((T(HE))
(R(E(AL)))(T(RE))))))
Taxon–area cladogram 12 (Table 3): (((TH)(IS))(T(RE)))
(1) T ) Y;H ) Y; (TH) ) Y; I ) Y; S ) Y; (IS) ) Y;
(TH) + (IS) ) Y: ((TH)(IS)) ) Y and is combined. (2)
T ) Y; R ) Y; E ) Y; (RE) ) Y; (T(RE)) ) Y and is combined. (3) THIS + TRE are more than one node apart in and
are represented as THIS + ISTHEREALTRE in the template
area cladogram. The template area cladogram is unchanged.
Step 8.
Template area cladogram: (((TH)(IS)((IS)((T(HE))(R(E
(AL)))(T(RE)))))
Taxon–area cladogram 9 (Table 3): ((T(HE))((R(E(AL)))
(T(R(EE)))))
(1) H ) Y; E ) Y; (HE) ) Y; T ) Y; (T(HE)) ) Y and is
combined. (2) A ) Y; L ) Y; (AL) ) Y; E ) Y; (E(AL)) ) Y;
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Step 12.
Template
area
cladogram:
(((TH)(IS)((IS)((T(HE))
(R(E(AL)))(T(RE))))))
Taxon–area cladogram 13 (Table 3): ((TH)((IS)(R(E(AL)))))
(1) T ) Y; H ) Y; (TH) ) Y and is combined. (2) I ) Y;
S ) Y; (IS) ) Y and is combined. (3) A ) Y; L ) Y; (AL) ) Y;
765
M. Wojcicki and D. R. Brooks
E ) Y; (E(AL)) ) Y; R ) Y; (R(E(AL))) ) Y and is combined.
(4) ((IS) and (R(E(AL)))) are more than one node apart in the
template and are represented by ((IS)((T(HE))((R(E(AL)))
(T(RE))))). (5) TH + ISREAL are more than one node apart in
the template and are represented by THIS + ISTHEREALTRE.
The template area cladogram is unchanged.
Step 13.
Template
area
cladogram:
(((TH)(IS)((IS)((T(HE))
(R(E(AL)))(T(RE))))))
Taxon–area cladogram 14 (Table 3): (H((IS)(T(R(EE)))))
(1) I ) Y; S ) Y; (IS) ) Y and is combined. (2) E ) Y; E ) Y;
E + E ¼ E; R ) Y; (RE) ) Y; T ) Y; (T(RE)) ) Y and is
combined. (3) IS + TRE ) N; more than one node apart and
represented in ISTHEREALTRE. (4) H ) Y; H + ISTRE ) N; if
ISTRE is represented by ISTHEREALTRE, H represented in THIS;
therefore the combination is represented in THIS + ISTHEREALTRE. The template area cladogram is unchanged.
Step 14.
Template area cladogram: (((TH)(IS)((IS)((T(HE))(R(E(AL)))(T(RE))))))
Taxon–area cladogram 15 (Table 3): ((T(HE))(T(R(EE))))
(1) H ) Y; E ) Y; (HE) ) Y; T ) Y; (T(HE)) ) Y and is
combined. (2) E ) Y; E ) Y; E + E ¼ E; R ) Y; (RE) ) Y;
T ) Y;(T(RE)) ) Y and is combined. (3) ((T(HE)(T(RE))) )
N; this combination is one node apart in the template, but the
relationship of TRE and THE is resolved (only if one of these
were novel with an unknown relationship to the other would
such a combination be made as a polytomy). The template area
cladogram is unchanged.
Step 15.
Template area cladogram: (((TH)(IS)((IS)((T(HE))(R(E(AL)))(T(RE))))))
Taxon–area cladogram 16 (Table 3): (T((IS)((HE)(T(R(EE))))))
(1) I ) Y; S ) Y; (IS) ) Y and is combined. (2) H ) Y;
E ) Y; (HE) ) Y and is combined. (3) E ) Y; E ) Y;
E + E ¼ E; R ) Y; (RE) ) Y; T ) Y;(T(RE)) ) Y and is
combined. (4) ((HE)(T(RE))) ) N; are more than one node
apart but are represented in THEREALTRE and is combined.
(5) IS + HETRE is therefore represented in IS + THEREALTRE and is combined. (6) T ) Y; T + ISHETRE is represented
in THIS + ISHETRE and is combined. The template area
cladogram is unchanged.
(1) I ) Y; S ) Y; (IS) ) Y; H ) Y; H(IS) ) N, but H + IS is
represented as TH + IS in the template area cladogram and is
combined. (2) E ) Y; E ) Y; E + E ¼ E; R ) Y; (RE) ) Y;
(T(RE)) ) Y and is combined. (3) HIS + TRE is represented
as THIS + ISTHEREALTRE in the template and is combined.
The template area cladogram is unchanged.
The area cladogram derived by PACT for the 17 taxon–area
cladograms listed in Table 3 is shown in Fig. 18.
DISCUSSION
A number of features make PACT an efficient and robust
algorithm. First, one need not convert taxon–area cladograms
into binary matrices for analysis. Secondly, at any given step,
PACT compares only two trees, the template area cladogram
and the taxon–area cladogram being integrated with the
template. Thirdly, area combinations and duplications arise
naturally from the analysis; reticulated area relationships are
neither ruled out, as in all a priori methods, primary BPA and
CCA, nor do they have to be inferred post hoc, as in secondary
BPA. Fourthly, PACT makes combinations with the template
going from the terminal levels of the input taxon–area
cladogram to the base. This feature saves time, and is the
means by which PACT is kept from assuming that all input
taxon–area cladograms were associated with each other from
the beginning. It corresponds to dual assumptions that,
whenever there is ambiguity, we should seek to make
combinations, but at the same time, we should do so beginning
with the most recently evolved taxa, because geographical
distributions of recently evolved species should, ceteris paribus, better reflect the conditions under which the species
evolved than geographical distributions of ancient species.
Finally, PACT treats four of the six classes of biogeographical patterns discussed in the introduction (depicted in Figs 2
& 4–6) as variations on a common theme called ‘partial
congruence’. For PACT, however, ‘partial congruence’ means
partial congruence with respect to a final, complex pattern of
area relationships, which may not be expressed fully by any of
the input taxon–area cladograms. None of the taxon–area
cladograms in any of the three exemplars shows the final
Step 16.
Template
area
cladogram:
(((TH)(IS)((IS)((T(HE))
(R(E(AL)))(T(RE))))))
Taxon–area cladogram 17 (Table 3): ((H(IS))(T(R(EE))))
766
Figure 18 Area cladogram derived from taxon–area cladograms
in Table 3. Letters ¼ areas.
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
pattern in its entirety, so it would be impossible to predict the
area cladogram a priori for any of those cases. Consequently,
we believe that PACT integrates the most robust aspects of a
priori and a posteriori methods while avoiding their various
drawbacks.
Interpreting the area cladogram
Obtaining an area cladogram is a necessary, but not sufficient,
step in explaining evolutionary phenomena having a geographical context. These include speciation, dispersal and
extinction patterns for the species represented by the taxon–
area cladograms, evolutionary radiations and patterns of
community assemblage (Brooks & McLennan, 2002), including species–area relationships (Halas et al., 2005). PACT does
not invoke or prohibit any evolutionary processes. As a result,
area cladograms produced by PACT are excellent for such
comparative phylogenetic studies.
Exemplar 1
Inferring evolutionary processes from patterns depicted by
area cladograms begins with mapping each taxon–area cladogram onto the final area cladogram, giving a unique
numerical code to each of the branches (including the terminal
ones) for each taxon–area cladogram. Figs 19–27 depict the
taxon–area cladograms for exemplar 1 (Table 1) coded in this
manner. Figure 28 is the final area cladogram with the codes
for those taxa mapped onto it, and with the inferences of
speciation modes, dispersal and extinction highlighted. In this
exemplar, the area cladogram (A(B(CD))) accounts for nearly
90% (63 of 71) of the data points represented by the nine
taxon–area cladograms, strongly suggesting vicariance. The
remaining eight events involve two species occurring in area E
and two cases of reticulated histories involving area A. The
absence of clades 3 and 8 in area B and of clade 7 in areas C
and D are most parsimoniously explained as episodes of
extinction. Alternatively, the presence of species 37 and
55 occurring in area E and species 48 occurring in area A are
most parsimoniously interpreted as episodes of peripheral
isolates speciation through dispersal from area B and area D,
respectively. The occurrence of species 63 in area A may be a
peripheral isolates speciation event, part of an older vicariance
sequence, or part of a sympatric speciation event also involving
species 64 and ancestor 70; given the data at hand, we cannot
make a stronger inference.
Exemplar 2
Figures 29 and 30 depict the taxon–area cladograms for
exemplar 2 (Figs 15 and 16) coded for mapping onto the
final area cladogram. Figure 31 is the final area cladogram with
the codes for those taxa mapped onto it. In this case, each clade
occurs throughout areas B, C, D and E, but there is no single
pattern among those areas that accounts for the majority of
species occurrences. In addition, each clade appears to have
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
become associated with areas B, C, D and E from different
sources (A and F), and at slightly different times. Finally, the
two clades appear to share area relationships in less than 50%
(nine of 19) branches on the area cladogram in Fig. 31.
Consequently, this area cladogram represents a scenario of
colonization of a common set of areas by two different clades
from two different sources, which became historically linked as
a result of colonization of a common set of areas. In such cases,
it is not possible to make a parsimonious inference of
extinction, because ‘colonization + extinction’ (two events)
will always be less parsimonious than ‘never colonized’ (one
non-event).
Exemplar 3
Figures 32–48 depict the taxon–area cladograms for exemplar
3 (Table 3) coded for mapping onto the final area cladogram.
Figure 49 is the final area cladogram (Fig. 18) with the codes
for those taxa mapped onto it. In this exemplar, the area
cladogram comprises general sets of area relationships,
suggesting vicariance, but those area relationships include
eight reticulations involving 75% of the areas (two duplications each for T and E, one each for I, H, R and S), indicating
dispersal. Fourteen (82%) of the clades appear to have been
associated with each other from the beginning of the biogeographical scenario represented by the area cladogram,
suggesting vicariance, whereas clades 9, 10 and 15 became
associated with the areas at later times, again indicating
dispersal. Lastly, Fig. 50 depicts the area cladogram for
exemplar 3, with three rectangles heuristically depicting
increasing temporal scale. In this case, the shortest temporal
scale (smallest rectangle) encompasses only four areas, T, H, I
and S, two of which (I and S) exhibit reticulated relationships.
As we expand the temporal scale (medium rectangle), we add
an additional area (E), but also add reticulated relationships
for areas T and H. Finally, at the longest temporal scale (largest
rectangle), we add three additional areas, R, A and L, and also
additional area reticulations for areas E and R. This mixture of
vicariance and dispersal corresponds to a taxon pulse radiation
(Erwin, 1979; for recent phylogenetic biogeographical documentation of taxon pulse radiations, see Spironello & Brooks,
2003; Bouchard et al., in press; Halas et al., 2005).
The limits of pattern analysis
Although there is a single PACT result for any set of one or
more taxon–area cladograms, there may be more than one
equally parsimonious mapping for particular species on that
result, and thus more than one equally parsimonious inference
of processes. For example, taxon–area cladogram 9 in exemplar
1 (Table 1) has two basal A’s, dictating that the final area
cladogram has two basal A’s. The issue of which of those A’s is
part of the general pattern and which is a unique element is
ambiguous. A similar case arises in analysis of taxon–area
cladogram 14 (Table 3); PACT places ‘T(RE)’ as part of the
group T(R(EE))), but T(RE)) can be placed in three other
767
M. Wojcicki and D. R. Brooks
Figure 19–27 Taxon–area cladograms in Table 1 numbered for mapping on the area cladogram depicted in Fig. 11. Letters ¼ areas. (19)
Taxon–area cladogram 1. (20) Taxon–area cladogram 2. (21) Taxon–area cladogram 3. (22) Taxon–area cladogram 4. (23) Taxon–area
cladogram 5. (24) Taxon–area cladogram 6. (25) Taxon–area cladogram 7. (26) Taxon–area cladogram 8. (27) Taxon–area cladogram 9.
configurations on the final area cladogram without violating
assumption 0 (Fig. 51).
As we noted earlier, these ambiguities do not affect the
way in which PACT builds the area cladogram. Such
ambiguities, therefore, indicate complexities in the data, not
a shortcoming of the algorithm. That is, there are limitations on what we can robustly infer from patterns alone. For
these cases, we must have additional information, specifically
information about the ages of the speciation events not
based on biogeographical patterns (see also Brooks &
McLennan, 2002). Two additional sources of information
have been suggested, the fossil record (Lieberman, 2000,
2003a,b) and molecular clock estimates (Lieberman, 2001,
2003a,b; Donoghue & Moore, 2003), each of which has its
768
own potential drawbacks. For example, the age of a taxon
represented by a fossil is only its minimum age of origin.
Conversely, when reinforcement has played a role in
completing speciation, molecular clock estimates may overestimate the age of a speciation event due to increased rates
of divergence resulting from selection acting on potential
isolating mechanisms (Brooks & McLennan, 2001).
PACT and model-based approaches
PACT can mimic a priori methods of analysis, if the model
upon which the a priori method is based is presented in the
form of an area cladogram. That area cladogram can then be
used as the template area cladogram, and the taxon–area
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
Figure 28 Area cladogram 1 depicted in
Fig. 11 with information derived from all
nine clades in Table 1 mapped onto it. Letters ¼ areas.
Figure 32 Taxon–area cladogram 1 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 29–31 Mapping information from taxon–area cladograms
onto the general area cladogram. (29) Taxon–area cladogram
depicted in Fig. 15 numbered for mapping on the area cladogram
depicted in Fig. 17. (30) Taxon–area cladogram depicted in Fig. 16
numbered for mapping on the area cladogram depicted in Fig. 17.
(31) Area cladogram depicted in Fig. 17 with information derived
from both clades mapped onto it. Letters ¼ areas.
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Figure 33 Taxon–area cladogram 2 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
cladogram(s) can be compared with it using PACT as
described above. Each element of any input taxon–area
cladogram that can be combined with a template element
769
M. Wojcicki and D. R. Brooks
Figure 34 Taxon–area cladogram 3 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 35 Taxon–area cladogram 4 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 36 Taxon–area cladogram 5 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
represents a corroboration of the model, and each element
that must be added to the template represents a specific
falsification of the model. Used in this manner, PACT is
superior to all other a priori methods of biogeographical
analysis because it does not specify costs, or weights, or
probabilities, or likelihoods of particular biogeographical
770
Figure 37 Taxon–area cladogram 6 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 38 Taxon–area cladogram 7 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 39 Taxon–area cladogram 8 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
processes a priori, and is thus sensitive to clade-specific
historical contingencies.
CONCLUSIONS
If the evolution of the biosphere were the result of a single
dispersal event, followed by vicariance and extinction (lineage
sorting), there would be one species per area on this planet. If the
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
Figure 43 Taxon–area cladogram 12 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 40 Taxon–area cladogram 9 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 41 Taxon–area cladogram 10 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 44 Taxon–area cladogram 13 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 42 Taxon–area cladogram 11 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 45 Taxon–area cladogram 14 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
evolution of the biosphere were the result of episodes of
sympatric speciation (lineage duplication), followed by a single
dispersal event, followed by vicariance and extinction (lineage
sorting), each biota would be a clade. In either case, a priori and a
posteriori methods would give the same answer, and all biotic
diversification could be represented in a simple area cladogram.
Nowhere on this planet does either of these possibilities occur,
however. Therefore, we believe that maximum vicariance is
falsified as the null hypothesis for historical biogeography. With
what do we replace it?
Biogeographers have long thought that lengthening temporal scale tends to increase spatial scale. As the temporal scale
lengthens, the spatial scale will increase through dispersal into
new areas. At the same time, there may be dispersal into areas
already inhabited by other clade members, producing reticulated area relationships (e.g. Fig. 50), making the relationship
between temporal and spatial scale complex. Such complex
area relationships are a feature of the taxon pulse radiation
hypothesis first proposed by Erwin (1979). Recent empirical
studies using BPA have documented taxon pulse radiations for
single clades (Spironello & Brooks, 2003) and multiple clades
(Bouchard et al., in press; Halas et al., 2005). The taxon pulse
hypothesis, embodying general patterns based on dispersal as
well as vicariance, would seem to be the logical replacement, at
this point in time, for the doctrine of maximum vicariance.
If area reticulations are not noise or error, but rather
information that must be documented and explained, the
methodological bases for historical biogeography become clear.
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
771
M. Wojcicki and D. R. Brooks
Figure 46 Taxon–area cladogram 15 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Figure 48 Taxon–area cladogram 17 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
based on vicariance from those based on biotic expansion
and (3) analyse as many different clades as possible. All these
integrated tasks require an efficient and powerful algorithm
such as PACT.
ACKNOWLEDGEMENTS
Figure 47 Taxon–area cladogram 16 in Table 3 numbered for
mapping on the area cladogram depicted in Fig. 18. Letters ¼ areas.
Historical biogeographers need to (1) discover and evaluate
both general patterns and their exceptions, including reticulated area relationships, (2) distinguish general patterns
We wish to thank Patrice Bouchard (Agriculture Canada), Brian
Crother, Sarah Temple, Erica Perrer, Erik Johnson and Cathy
DiBenedetto (Southeastern Louisiana University), Ashley Dowling (University of Michigan), Eric Hoberg (US National
Parasite Collection, USDA, Beltsville, Maryland), Bruce Lieberman (University of Kansas), Deborah McLennan (University
of Toronto), Soren Nylin, Niklas Janz and Niklas Wahlberg
(Stockholm University), Marco Van Veller (Wageningen Uni-
Figure 49 Area cladogram for exemplar 3 (Fig. 18). Numbers in boldface below refer to numbers on the area cladogram; numbers following
‘¼’ refer to numbered branches from the taxon–area cladograms in Figs 32–48; numbers with an asterisk (*) represent absences of the clade
from that part of the area cladogram. 1 ¼ 1, 16, 31, 50, 69, 90, 111, 136, 193, 206, 219, 260*; 2 ¼ 2, 17, 32, 51, 70, 91, 112, 137, 207, 220, 234*,
277*; 3 ¼ 3, 18, 33, 52, 71, 92, 113, 194, 208, 278; 4 ¼ 4, 19, 34, 53, 72, 93, 114, 195, 209, 279; 5 ¼ 35, 54, 115, 138, 221, 235, 261; 6 ¼ 36, 55, 116,
139, 222, 236, 262; 7 ¼ 73, 94, 117, 140, 157, 247; 8 ¼ 74, 95, 118, 141, 158, 248, 263; 9 ¼ 75, 96, 119, 142, 159, 249, 264; 10 ¼ 20, 37, 97, 120,
160, 178, 196, 223; 11 ¼ 21, 38, 98, 121, 161, 179, 197, 224; 12 ¼ 22, 39, 99, 122, 162, 180, 198, 225; 13 ¼ 23, 40, 100, 123, 163, 181, 199,
226; 14 ¼ 5, 56, 76, 143, 164, 182, 210, 237, 250, 265, 280; 15 ¼ 6, 57, 77, 144, 165, 183, 211, 238, 251, 266, 281; 16 ¼ 7, 8, 11, 58, 59, 63, 78, 79, 84,
104, 145, 146, 148, 166, 167, 171, 184, 185, 188, 212*, 239, 240, 241, 252, 253, 255, 267, 268, 269, 282, 283, 285; 17 ¼ 9, 24, 41, 60, 81, 102, 124,
154*, 213, 231*, 234*, 260*, 277*; 18 ¼ 10, 25, 42, 61, 82, 103, 125, 202, 214, 284; 19 ¼ 80, 101, 126, 147, 168, 254, 271*; 20 ¼ 26, 44,
127, 169, 186, 200, 227; 21 ¼ 27, 45, 128, 170, 187, 201, 228; 22 ¼ 12, 64, 85, 105, 149, 172, 189, 215, 242, 256, 270, 286; 23 ¼ 14, 65, 86, 107, 151,
174, 191, 217, 244, 258, 272, 288; 24 ¼ 29, 46, 130, 173, 190, 204, 230; 25 ¼ 83, 106, 129, 150, 175, 257, 271*; 26 ¼ 42, 62, 131, 152, 229, 243, 273;
27 ¼ 13, 28, 47, 66, 133, 154*, 203, 216,231*, 234*, 260*, 287; 28 ¼ 176, 192; 29 ¼ 109, 132, 153, 177, 259, 274; 30 ¼ 48, 67, 88, 134, 155,
232, 275; 31 ¼ 15, 30, 49, 68, 89, 110, 135, 156, 205, 218, 233, 246, 276, 289. Letters ¼ areas.
772
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
Phylogenetic analysis for comparing trees
Figure 50 Area cladogram depicted in Figs 18 and 49, with
rectangles representing an heuristic view of increasing temporal
scale associated with increasing spatial scale as well as increasing
historical reticulations of areas. The smallest rectangle indicates
five areas and two reticulations, the medium-sized rectangle
indicates eight areas and three reticulations, and the largest rectangle indicates eight areas and eight reticulations. Letters ¼ areas.
Figure 51 Area cladogram for 17 taxon–area cladograms in
Table 3 indicating possible mappings of the (T(RE)) portion of
taxon–area cladogram 14, denoted by areas in larger fonts. PACT
selects the most compact grouping possible, placing all elements of
(T(RE)) within a single clade of areas on the area cladogram, but
other placements (italicized letters) are logically consistent with
the area cladogram. This ambiguity does not affect the construction of the area cladogram. Letters ¼ areas.
versity) and Rino Zandee (Leiden University) for patient and
effective feedback during the development of PACT. We also
thank approximately 75 students in classes at the University of
Toronto and Stockholm University during the autumn of 2003,
who were the first undergraduates to be taught PACT. DRB
acknowledges funding from the Natural Sciences and Engineering research Council (NSERC) of Canada.
REFERENCES
Bouchard, P. & Brooks, D.R. (in press) Effect of vagility
potential on dispersal and speciation in rainforest insects.
European Journal of Evolutionary Biology.
Bouchard, P., Brooks, D.R. & Yeates, D.K. (in press) Mosaic
macroevolution in Australian wet tropics arthropods:
community assemblage by taxon pulses. Rainforest: past,
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd
present, future (ed. by C. Moritz and E. Bermingham).
University of Chicago Press, Chicago, IL.
Brooks, D.R. (1990) Parsimony analysis in historical biogeography and coevolution: methodological and theoretical
update. Systematic Zoology, 39, 14–30.
Brooks, D.R. & McLennan, D.A. (1991) Phylogeny, ecology and
behavior – a research program in comparative biology. University of Chicago Press, Chicago, IL.
Brooks, D.R. & McLennan, D.A. (2001) A comparison of
a discovery-based and an event-based method of historical biogeography. Journal of Biogeography, 28, 757–
767.
Brooks, D.R. & McLennan, D.A. (2002) The nature of diversity
– an evolutionary voyage of discovery. University of Chicago
Press, Chicago, IL.
Brooks, D.R., Van Veller, M.G.P. & McLennan, D.A. (2001) How
to do BPA, Really. Journal of Biogeography, 28, 343–358.
Cormen, T.H., Leiserson, C.E., Rivest, R.L. & Clifford, S.
(2001) Introduction to algorithms. McGraw-Hill, Toronto.
Cressey, R.F., Collette, B. & Russo, J. (1983) Copepods and
scombrid fishes: A study in host-parasite relationships.
Fisheries Bulletin, 81, 227–265.
Donoghue, M.J. & Moore, B.R. (2003) Toward an integrative
historical biogeography. Integrative and Comparative Biology, 43, 261–270.
Endler, J.A. (1982) Problems in distinguishing historical from
ecological factors in biogeography. American Zoologist, 22,
441–452.
Enghoff, H. (1996) Widespread taxa, sympatry, dispersal, and
an algorithm for resolved area cladograms. Cladistics, 12,
349–364.
Erwin, T.L. (1979) Thoughts on the evolutionary history of
ground beetles: hypotheses generated from comparative
faunal analyses of lowland forest sites in temperate and
tropical regions. Carabid beetles: their evolution, natural
history, and classification (ed. by T.L. Erwin, G.E. Ball and
D.R. Whitehead), pp. 539–592. W. Junk, The Hague.
Erwin, T.L. (1981) Taxon pulses, vicariance, and dispersal:
an evolutionary synthesis illustrated by carabid beetles.
Vicariance biogeography – a critique (ed. by G. Nelson and
D.E. Rosen), pp. 159–196. Columbia University Press,
New York.
Erwin, T.L. (1985) The taxon pulse: a general pattern of lineage
radiation and extinction among carabid beetles. Taxonomy,
phylogeny, and zoogeography of beetles and ants (ed. by G.E.
Ball), pp. 437–472. W. Junk, Dordrecht.
Erwin, T.L. (1991) An evolutionary basis for conservation
strategies. Science, 253, 750–752.
Erwin, T.L. & Adis, J. (1982) Amazonian inundation forests:
their role as short-term refuges and generators of species
richness and taxon pulses. Biological diversification in the
tropics (ed. by G. Prance), pp. 358–371. Columbia University
Press, New York.
Green, M., Van Veller, M.G.P. & Brooks, D.R. (2002) Assessing
modes of speciation: range asymmetry and biogeographical
congruence. Cladistics, 18, 112–124.
773
M. Wojcicki and D. R. Brooks
Halas, D., Zamparo, D. & Brooks, D.R. (2005) A historical
biogeographical protocol for studying biotic diversification
by taxon pulses. Journal of Biogeography, 32, 249–260.
Lieberman, B.S. (2000) Paleobiogeography. Plenum/Kluwer
Academic Press, New York, NY.
Lieberman, B.S. (2001) Applying molecular phylogeography to
test paleoecological hypotheses: a case study involving Amblema plicata (Mollusca: Unionidae). Evolutionary paleoecology (ed. by W.D. Allmon and D.J. Bottjer), pp. 83–103.
Columbia University Press, New York, NY.
Lieberman, B.S. (2003a) Unifying theory and methodology in
biogeography. Evolutionary Biology, 33, 1–25.
Lieberman, B.S. (2003b) Paleobigeography – the relevance of
fossils to biogeography. Annual Review of Ecology, Evolution
and Systematics, 34, 51–69.
McLennan, D.A. & Brooks, D.R. (2002) Complex histories of
speciation and dispersal: an example using some Australian
birds. Journal of Biogeography, 29, 1055–1066.
Morrone, J.J. & Carpenter, J.M. (1994) In search of a method
for cladistic biogeography: an empirical comparison of
Component Analysis, Brooks Parsimony Analysis, and
Three-area Statements. Cladistics, 10, 99–153.
Platnick, N.I. & Nelson, G. (1978) A method of analysis
for historical biogeography. Systematic Zoology, 27,
1–16.
Ronquist, F. (2002) Parsimony analysis of coevolving species
associations. Tangled trees (ed. by R.D.M. Page), pp. 22–64.
University of Chicago Press, Chicago, IL.
Siddall, M.E. & Perkins, S.L., (2003) Brooks parsimony analysis: a valiant failure. Cladistics, 19, 554–564.
Spironello, M. & Brooks, D.R. (2003) Dispersal and diversification in the evolution of Inseliellium, an archipelagic dipteran group. Journal of Biogeography, 30, 1563–1573.
Van Veller, M.G.P. & Brooks, D.R. (2001) When simplicity is
not parsimonious: inductive and hypothetico-deductive
approaches in historical biogeography. Journal of Biogeography, 28, 1–11.
Van Veller, M.G.P., Zandee, M. & Kornet, D.J. (1999) Two
requirements for obtaining valid common patterns under
different assumptions in vicariance biogeography. Cladistics,
15, 393–406.
774
Van Veller, M.G.P., Kornet, D.J. & Zandee, M. (2000) Methods
in vicariance biogeography: assessment of the implementation of assumptions zero, 1 and 2. Cladistics, 16, 319–345.
Van Veller, M.G.P., Kornet, D.J. & Zandee, M. (2001) A posteriori and a priori methodologies for testing hypotheses of
causal processes in vicariance biogeography. Cladistics, 7,
248–259.
Van Veller, M.G.P., Zandee, M. & Kornet, D.J. (2002) Testing
hypotheses regarding general patterns in vicariance biogeography with a posteriori and a priori methods. Cladistics, 18,
207–217.
Wiley, E.O. (1981) Phylogenetics – the theory and practice of
phylogenetic systematics. Wiley & Sons, New York.
Wiley, E.O. (1986) Methods in vicariance biogeography. Systematics and evolution (ed. by P. Hovenkamp), pp. 283–306.
University of Utrecht Press, Utrecht.
Wiley, E.O. (1988a) Parsimony analysis and vicariance biogeography. Systematic Zoology, 37, 271–290.
Wiley, E.O. (1988b) Vicariance biogeography. Annual Review
of Ecology and Systematics, 19, 513–542.
Zandee, M. & Roos, M.C. (1987) Component-compatibility in
historical biogeography. Cladistics, 3, 305–332.
BIOSKETCHES
Maggie Wojcicki is a Research Assistant in the laboratory of
Prof. D.R. Brooks, University of Toronto. She plans to begin a
doctoral program in Computer Science in the near future.
Daniel R. Brooks is Professor of Zoology, University of
Toronto, specializing in the systematics and evolution of
parasitic helminths. He is currently coordinating the inventory
of eukaryotic parasites of vertebrates, Area de Conservacion
Guanacaste, Costa Rica. He is co-author of Phylogeny, ecology
and behavior: A research programme in comparative biology
(1991), Parascript: Parasites and the language of evolution
(1993) and The nature of diversity: An evolutionary voyage of
discovery (2002).
Editor: Philip Stott
Journal of Biogeography 32, 755–774, ª 2005 Blackwell Publishing Ltd