Dynamic Genome Organization and Gene Evolution by Positive

Dynamic Genome Organization
Geminivirus (Geminiviridae)
and Gene Evolution
James M. Bradeen,’ Marja C. P. Timmermatq2
Waksman
Institute,
Rutgers, The State University
by Positive Selection in
and Joachim Messing
of New Jersey
Geminiviruses
(Geminiviridae)
are a diverse group of plant viruses differing from other known plant viruses in
possessing circular, single-stranded
DNA. Current classification divides the family into three subgroups, defined in
part by genome organization, insect vector, and plant host range. Previous phylogenetic assessments of geminiviruses
have used DNA and/or amino acid sequences from the replication-associated
and coat protein genes and have relied
predominantly
on distance analyses. We used amino acid and DNA sequence data from the replication-associated
and coat protein genes from 22 geminivirus types in distance and parsimony analyses. Although the results of our
analyses largely agree with those reported previously, we could not always predict viral relationships based on
genome organization, plant host, or insect vector. Loss of correlation of these traits with phylogeny is likely due to
improved sampling of geminivirus types. Unrooted parsimony trees suggest multiple independent origins for the
monopartite genome. Genome organization is therefore a dynamic character. Estimates of nonsynonymous
and
synonymous nucleotide substitutions for extant and inferred ancestral sequences were used to evaluate hypotheses
that the replication-associated
and coat protein sequences evolve to accommodate
plant host and insect vector
specificities, respectively.
Results suggest that plant host specificity does not solely direct replication-associated
protein evolution but that coat protein sequence does evolve in response to insect vector specificity. Genome
organization and, possibly, plant host specificity are not reliable taxonomic characters.
Introduction
Worldwide,
virus infection of agronomically
important plant species accounts for millions of dollars in
lost productivity
each year. Among plant viruses, geminiviruses constitute an important group, infecting both
monocot and dicot plant species. Disease symptoms are
diverse but frequently
include foliage discolorations
such as streaks (e.g. Digitaria streak, Miscanthus streak,
and maize streak viruses) and mosaic patterns (e.g. Abutilon mosaic, bean calico mosaic, Chloris striate mosaic,
and tomato golden mosaic viruses), stunted plant development, and twisted, curled, or cupped foliage. While
90% of known plant viruses have RNA genomes, geminiviruses
contain circular, single-stranded
DNA genomes and replicate via double-stranded
DNA intermediates.
All geminiviruses
are characterized
by a
twinned (geminate) capsid morphology
and have monopartite (single) or bipartite (double) genomes (Timmermans, Das, and Messing 1994). Proposed correlations between genome organization,
insect vector specificity, and plant host infectivity have raised interesting
questions about geminivirus
evolution.
Current nomenclature
describes the viral family
Geminiviridae
with three subgroups (Briddon and Markham 1995). Geminiviral
subgroups were originally described solely by genome organization,
plant host, and
insect vector specificities.
Recently, the classifications
were broadened to incorporate isolates with unique combinations of characters. Subgroup I contains viruses with
monopartite
genomes
that are leafhopper-transmitted
’Present address: USDA-ARS
University of Wisconsin, Madison.
* Present address:
Department
Key words: coat protein,
and Department
of Biology,
geminivirus,
of Horticulture,
Yale University.
nucleotide
substitution.
Address
for correspondence
and reprints: Joachim Messing,
Waksman Institute, PO. Box 759, Rutgers University, Piscataway, New
Jersey 08855. E-mail: [email protected].
Mol.
Biol. Evol. 14( 11):1114-l 124. 1997
0 1997 by the Society for Molecular Biology and Evolution.
1114
ISSN: 0737-4038
and predominantly
infect monocots. Subgroup II contains viruses with monopartite
genomes that are leafhopper-transmitted
but infect dicots. Subgroup III contains bipartite and monopartite viruses. These viruses are
whitefly-transmitted
and infect dicots. Geminiviral
subgroups are thought to be monophyletic.
Howarth and
Goodman (1986) suggest that progenitor geminiviruses
were monopartite (subgroup I) viruses and that bipartite
(subgroup III) viruses later arose by acquisition of dicot
infectivity and whitefly transmissibility.
Evidence supporting this hypothesis includes similarity in overall genome organization
between the two subgroups; amino
acid similarities between coding regions of the two genomes of bipartite viruses, suggesting a common origin
for the two genomes; and distance analysis of the replication-associated
protein (Kikuno et al. 1984; Howarth
and Goodman 1986). Subgroup II viruses may have arisen either from subgroup I viruses by the acquisition of
dicot infectivity or from subgroup III viruses by loss of
the second genome.
Apparent dichotomies
dividing viral isolates into
monocot- and dicot-infecting
groups have been noted in
distance analyses of the replication-associated
amino
acid sequence. Similarly, dichotomies dividing viral isolates into leafhopperand whitefly-transmitted
groups
have been noted in distance analyses of the coat protein
amino acid sequence. These observations have led to the
hypothesis that these geminiviral
genes may evolve to
accommodate
plant host and insect vector specificities,
respectively
(Howarth and Vandemark 1989).
In the current study, we examine the evolutionary
relationships
of 39 geminivirus
isolates representing
22
viral types. Inclusion of viral isolates from each of the
three subgroups, including newly described viruses with
unique combinations
of characteristics,
allows evaluation of current geminiviral
classification,
determination
of the phylogenetic
importance of characteristics
used
in part to define geminiviral taxa, and estimation of mo-
Genome
lecular phylogenies for currently described geminiviruses. Additionally,
comparison of ancestral DNA sequences inferred from parsimony analyses evaluates proposals
of evolutionary
response in particular geminivirus genes
to accommodate
plant host or insect vector specificities.
Materials
and Methods
Replication-associated
and coat protein DNA sequences were obtained from GenBank (table 1). Replication-associated
and coat protein sequences were identified using published accounts, GenBank information,
or comparison with known coding regions. Replicationassociated protein sequence introns were identified and
removed prior to analysis (Schalk et al. 1989; Mullineaux, Guerineau,
and Accotto 1990). DNA sequence
translation, DNA and protein sequence alignments, and
percent dissimilarity
calculations
were done using GCG
version 7.2 (Genetics Computer Group, Inc., Madison,
Wis.). Sequence dissimilarity
calculations
and aligned
sequences were analyzed using PHYLIP version 3.5~
(Felsenstein
1989) and PAUP version 3.1.1 (Swofford
1993), respectively. PHYLIP analyses include neighborjoining and 10 replications each of Fitch and Kitsch with
Jumble option using percent dissimilarity
among amino
acid sequences. Sums of squares from the best Fitch and
Kitsch trees were used to test the hypothesis of a molecular clock (Felsenstein
1984). PAUP analyses included a heuristic search with 100 replications
of random
taxon addition (to identify topographical
“islands”
inconsistent
with initially
identified
trees [Maddison
199 11) and heuristic bootstrap (Felsenstein
1985) and
heuristic decay analyses (Bremer 1988; Donoghue et al.
1992) using both amino acid and DNA sequences. When
multiple most-parsimonious
trees were identified, consensus trees were computed. Trees consistent with specific evolutionary
hypotheses were generated by heuristic search with 100 replications
of random taxon addition using specified topological
constraints.
Shortest
identified constraint trees were compared with most-parsimonious trees by nonparametric
(Templeton 1983) and
parametric
(Kishino and Hasegawa
1989) procedures.
Nonsynonymous
(K,) and synonymous
(KS) nucleotide
substitutions (Li 1993) were estimated for pairs of extant
and inferred ancestral replication-associated
and coat
protein DNA sequences. &/KS ratios and plots of K,
versus KS were used to evaluate for data trends indicative of gene evolution via positive selection.
Results and Discussion
The bipartite geminiviruses
contain six conserved
open reading frames (ORFs), and the monopartite geminiviruses contain four. Among the homologous
genes,
the replication-associated
and the coat protein genes are
the most conserved (Howarth and Goodman 1986) and
have been used extensively
for evolutionary
analysis
(Howarth and Goodman 1986; Howarth and Vandemark
1989; Timmermans,
Das, and Messings 1994; Padidam,
Beachy, and Fauquet 1995a). We have estimated phylogenies using both of these genes in an expanded set
and Gene Evolution
in Geminivirus
of geminivirus
isolates, using both distance and parsimony analyses.
Distance analysis assumes random sampling of all
geminivirus types. Because geminiviruses
differ in their
agronomic importance and because research efforts are
disproportionately
expended toward agronomically
important types (Matthews 1985), random sampling may
be an inappropriate assumption. It is noteworthy that the
viruses included in this study, which includes most geminiviral types molecularly characterized to date, are predominantly infectious agents of economically
important
crop families (Fabaceae, Malvaceae, Poaceae, Solanaceae). These plant hosts are not closely related species
but diverse families including both monocots and dicots.
(BCTV [abbreviations
for isolates are listed in table 11,
which is known to infect 44 different dicot families, was
originally
described as a disease agent of the agronomically important sugar beet [Beta vulgar-is; Mumford
19741.) The insect vectors, like the hosts, represent divergent groups. Geminiviruses
are therefore diverse in
genome arrangement,
plant host specificity, and insect
vector specificity. That this diverse group of viruses infects only agronomically
important plant families is unlikely.
The relationship of geminiviruses
to other plant viruses is unclear (Matthews
1985). Most plant virus
groups are RNA-encoded.
Among the DNA-encoded
plant viruses, geminiviruses
are unique in that they contain single-stranded
DNA. As noted previously,
the
characteristic geminate structure of geminiviruses
distinguishes them morphologically
from other plant viruses.
Frischmuth,
Zimmat, and Jeske (1990) described two
ORFs on DNA A of AbMV with apparent prokaryotic
features, features lacking in other geminiviruses.
The authors made no claims about the phylogenetic importance
of this finding. Koonin and Ilyina (1992) noted conserved sequence motifs between the replication initiator
proteins of certain eubacterial plasmids and the geminivirus replication-associated
protein and suggested that
this similarity may be of phylogenetic
origin. However,
at the present time, no concrete evolutionary
lineage can
be drawn between these otherwise dissimilar taxa. Because of the morphological
and genetic uniqueness
of
geminiviruses,
attempts to identify an appropriate outgroup for polarizing
characters proved unsuccessful.
Like certain distance analyses,
midpoint
rooting assumes a molecular clock, an assumption
inconsistent
with our observations
(unpublished
data) and those of
Howarth and Vandemark (1989). We have therefore chosen to present unrooted phylogenies.
Conclusions based
on unrooted parsimony analyses are independent of true
tree polarity.
Distance and parsimony analyses are in agreement
with those presented previously
(Howarth and Vandemark 1989; Timmermans,
Das, and Messing 1994; Padidam, Beachy, and Fauquet 1995a) and with each other,
with the exception of certain TYLCV isolates, as discussed below. Parsimony analyses of the amino acid and
DNA sequences are likewise in agreement. Only parsimony analyses of the DNA sequences are presented
here. In all analyses, two broadly defined, well-support-
Table 1
Geminivirus
Isolate Characteristics
Virus
Abutilon mosaic ............
African cassava mosaic. ......
Bean calico mosaic ..........
Bean dwarf mosaic
Bean golden mosaic
Beet curly top
..........
.........
..............
Chloris striate mosaic ........
Digitaria streak .............
Maize streak. ...............
Miscanthus streak ...........
Mungbean yellow mosaic. ....
Panicum streak .............
Potato yellow mosaic ........
Squash leaf curl. ............
Sugarcane streak ............
Tobacco yellow dwarf. .......
Tomato golden mosaic .......
Tomato leaf crumple. ........
Tomato leaf curl
Tomato mottle
Tomato yellow
Wheat dwarf
............
..............
leaf curl ......
... ......... ...
Abbreviation”
GenBank
Accession No.
Genomeb
AbMV
ACMV
BCaMV- 1 rep
BCaMV-2 rep
BCaMV-3 rep
BDMV
BGMV- 1
BGMV-2
BGMV-3
BGMV-4
BCTV- 1
BCTV-2
CSMV
DSV
MSV- 1
MSV-2
MiSV
MYMV
PanSV
PYMV
SLCV
ssv
TYDV
TGMV
TLCrV-I cp
TLCrV-2 rep
TLCV- 1
TLCV-2
TLCV-3
ToMoV
TYLCV- 1
TYLCV-2
TYLCV-3
TYLCV-4
TYLCV-5
TYLCV-6
TYLCV-7
WDV- 1
WDV-2
Xl5983
x 17095
uOO121
L27264
L22758
M88179
Ml0070
LO1635
M88686
M9 1604
U023 11
M24597, X04 144
M2002 1
M23022
YOO5 14
X0 1633, K02026
DO1030
D14703
X60168
DO0940
M38183
S64567
M81103
K02029
L34747
L27267
u15015, u15017
U15016
S5325 1
L14460
X61 153
Xl5656
X763 19
22575 1
228390
L27708
M59838
X02869
X82104
B
B
B
B
B
B
B
B
B
B
M
M
M
M
M
M
M
B
M
B
B
M
M
B
B
B
B
B
M
B
M
M
M
M
M
M
B
M
M
d BCaMV and TLCrV include one or more isolates for which only the replication
b Genome organization: M, monopartite; B, bipartite.
‘ Insect vector: L, leafhopper; W, whitefly.
associated
Host
Malvaceae
Euphorbaceae,
Fabaceae
Fabaceae
Fabaceae
Fabaceae
Fabaceae, Malvaceae
Fabaceae, Malvaceae
Fabaceae
Fabaceae
Fabaceae
44 dicot families
44 dicot families
Poaceae
Poaceae
Poaceae
Poaceae
Poaceae
Fabaceae
Poaceae
Solanaceae
Cucurbitaceae
Poaceae
Fabaceae, Solanacceae
Solanaceae
Fabaceae, Solanaceae
Fabaceae, Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Solanaceae
Poaceae
Poaceae
Vectof
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Circulifer tenellus (L)
Circulifer tenellus (L)
Nesoclutha pallida (L)
Nesoclutha declivata (L)
Cicadulina mbila (L)
Cicadulina mbila (L)
Unknown
Unknown
Cicadulina mbila (L)
Bemisia tabaci (W)
Bemisia tabaci (W)
Cicadulina mbila (L)
Orosius argentatus (L)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Bemisia tabaci (W)
Psammotettix alienus (L)
Psammotettix alienus (L)
protein (“rep”) or the coat protein (“cp”) gene sequence is available.
Reference
Frischmuth, Zimmat, and Jeske (1990)
Morris et al. (1990)
Loniello et al. (unpublished data)
Loniello et al. (unpublished data)
Loniello et al. (unpublished data)
Gilbertson et al. (1991)
Howarth et al. (1985)
Gilbertson et al. (1991)
Gilbertson et al. (199 1)
Gilbertson et al. (1991)
Stenger et al. (1994)
Stanley et al. (1986)
Andersen et al. (1988)
Donson et al. (1987)
Lazarowitz (1988)
Mullineaux et al. (1984)
Chatani et al. (1991)
Morinaga, Ikegami, and Miura (1993)
Briddon et al. (1992)
Coutts et al. (199 1)
Lazarowitz and Lazdins (199 1)
Hughes, Rybicki, and Kirby (1993)
Morris et al. (1992)
Hamilton et al. (1984)
Paplomatas et al. (1994)
Loniello et al. (unpublished data)
Padidam, Beachy, and Fauquet (19956)
Padidam, Beachy, and Fauquet (1995b)
Dry et al. (1993)
Abouzid, Polston, and Hiebert (1992)
Kheyr-Pour et al. (1992)
Navot et al. (1991)
Antignus and Cohen (1994)
Noris et al. (1994)
Crespi et al. (unpublished data)
Reina et al. (unpublished data)
Rochester et al. (1994)
MacDowell et al. (1985)
Bendahmane et al. (unpublished data)
Genome
ed groups of isolates are evident. The first includes only
monopartite,
monocot-infecting,
leafhopper-transmitted
viruses (MSV, DSV, SSV, and PanSV) and will henceforth be referred to as the “African streak virus clade,”
following the precedent of Hughes, Rybicki, and Kirby
(1993). The second group includes both mono- and bipartite, dicot-infecting,
predominantly
whitefly-transmitted viruses and will henceforth
be referred to as the
“BGMV clade.” The constitution
of the BGMV clade
differs slightly for replication-associated
and coat protein analyses (compare figs. 1 and 2). We cannot rule
out the possibility that this discrepancy
may be due to
viral recombination,
the replication-associated
and coat
protein sequences
originating
from separate lineages.
Observed tree topologies are consistent with the possibility that TYLCV-1 and TYLCV-6 are viral recombinants. Viral recombination
has been suggested for an
Indian isolate of TLCV (Padidam, Beachy, and Fauquet
199%) and for BCTV (Padidam, Beachy, and Fauquet
199%). Gilbertson et al. (1993) demonstrate that “pseudorecombinants”
(bipartite geminiviruses
composed of
the A genome of one bipartite geminivirus
and the B
genome of a distantly related bipartite geminivirus)
can
function to cause disease symptoms in host plants. The
authors speculate that pseudorecombination
followed by
genetic recombination
between the A and B genomes
might be an important means of geminivirus
evolution.
Agreement between the replication-associated
and coat
protein analyses suggests no evidence for viral recombinants, except for TYLCV-1 and -6.
Dynamic
Genome
Organization
Parsimony analyses of both the replication-associated and the coat protein sequences (figs. 1 and 2) reveal
complex
relationships
between
mono- and bipartite
geminiviruses.
Regardless of the true tree polarity, monopartite viruses do not constitute a monophyletic
clade.
For example, TYLCV isolates, with the exception of
TYLCV-7, are monopartite viruses. However, these isolates are most closely related to bipartite (subgroup III)
geminiviruses
and are only distantly related to other
monopartite viruses (subgroups I, II; figs. 1 and 2). Furthermore, in neither the replication-associated
nor the
coat protein parsimony trees do the monopartite TYLCV
isolates constitute
a monophyletic
clade, suggesting
multiple origins for the monopartite
viruses. Most-parsimonious trees for the replication-associated
and coat
protein DNA sequences were compared with the shortest
trees consistent
with the assumption
that monopartite
and bipartite viruses constitute
separate monophyletic
clades. Constraint trees differed significantly (a = 0.05)
from most-parsimonious
trees. (Tree lengths were 6,458
vs. 6,5 12 steps for the replication-associated
protein
DNA most-parsimonious
and constraint trees, respectively, and 4,392 vs. 4,540 steps for the coat protein
DNA most-parsimonious
and constraint
trees, respectively.)
Parsimony
dictates that some of the monopartite
isolates in this study arose from bipartite isolates by
reversion. These derivatives may include the monopartite TYLCV and TLCV isolates. Consistent
with this
and Gene Evolution
in Geminivirus
1117
possibility,
genome
organization
of the monopartite
TYLCV and TLCV isolates more closely resembles that
of bipartite than that of other monopartite geminiviruses
(Navot et al. 1991; Kheyr-Pour et al. 1992; Dry et al.
1993). Genome organization
is dynamic and is not a
reliable taxonomic character.
Unrooted parsimony analyses support the possibility that the monopartite TYDV and BCTV isolates arose
from a dicot-infecting
bipartite virus by loss of the second genome (reversion), or directly from the progenitor
monopartite
group by acquisition of dicot-infecting
capabilities. If the African streak virus clade is the true
progenitor type, the replication-associated
and coat protein parsimony trees (figs. 1 and 2) predict TYDV to be
monopartite-derived.
Morris et al. (1992) concluded that
TYDV genome organization more closely resembles genome organization
of subgroup I monopartite than that
of bipartite viruses, consistent with this possibility. Recent modifications
to geminivirus
classification
include
assignment
of TYDV to subgroup I, despite its dicot
host specificity (Briddon and Markham 1995). If the African streak virus clade is the progenitor type, the replication-associated
protein parsimony trees (fig. 1) predict and the coat protein parsimony trees (fig. 2) allow
BCTV to be bipartite-derived.
Genome characteristics of
BCTV resemble those of both monopartite and bipartite
viruses (Stanley et al. 1986; Timmermans,
Das, and
Messing 1994). Our results support deemphasis of genome organization
as a taxonomic character for geminiviruses.
Kikuno et al. (1984) suggested, based on ORF amino acid similarity between the two genomes of cassava
latent virus, that bipartite geminiviruses
(subgroup III)
arose from a single genome progenitor. Howarth and
Goodman
(1986) noted similar ORF similarities
for
BGMV and TGMV isolates and concluded,
based on
mutation-distance
analysis of the replication-associated
protein, that subgroup I monopartite viruses arose prior
to bipartite (subgroup III) viruses and that the monopartite condition may be a primitive feature. Lack of an
appropriate outgroup for rooting geminivirus parsimony
trees precludes confirmation
or refutation of this hypothesis. The circumstantial
evidence is nevertheless
compelling.
Furthermore,
we find it conceptually
appealing that evolution should proceed from the less complicated monopartite
genome to the more complex bipartite genome. We speculate, in agreement with Howarth and Goodman (1986), that evolution likely proceeded
from some subgroup
I monopartite
virus
(perhaps a member of the African streak virus clade) to
a relatively more complex bipartite virus (subgroup III).
Bipartite viruses then evolved from this progenitor, occasionally giving rise to monopartite viruses. These derived monopartites
retain the host and vector specificities of their bipartite progenitors.
Gene Evolution
by Positive
Selection
In addition to genome organization,
plant host and
insect vector specificities
have been used to classify
geminiviruses.
Howarth and Vandemark
( 1989) concluded, based on distance dendrogram topology that rep-
Predominantly
Dicot-infecting
Monocot-infecting
/
ss
TyLCV-1
III M W D (Solanaceae)
BCaMV-1
III B W D (Fabaceae)
- -
’
I
I
I
I
-
I M L M (Poaceae)
I M L M (Poaceae)
I M L M (Poaceae)
DSV -
I M ? M (Poaceae)
ssv
I M L D (FabaceaeEolanaceae)
PanSV
I M L M (Poaceae)
TYDV
I M L M (Poaceae)
Misv
__
I
I
African
StreakVirus
Clade
;
,
I
I
I
__
I
M L M (Poaceae)
r
56
24)
WDV- 1
I M L M (Poaceae)
WDV-2
I M L M (Poaceae)
CSMV
1
>4
- (3) 55
1
(4)*
(5) *
L
(91 T
100
(10)
I
y
I
(13)
I
SLCV
III B W D (Cucurbitaceae)
BGMV-2
III B W D (Fabaceae)
BGMV4
III B W D (Fabaceae)
BGMV- 1
III B W D (Fabaceae)
AbMV
III B W D (Malvaceae)
ToMoV
III B W D (Solanaceae)
I
TLCrV-2
III B W D (Fabaceae)
I
BDMV
III B W D (Fabaceae)
PYMV
III B W D (Solanaceae)
BGMV-3
III B W D (Fabaceae)
; BGMV
I Clade
TGMV
III B W D (Solanaceae)
I
TLCV- 1
III B W D (Solanaceae)
TLCV-2
III B W D (Solanaceae)
TLCV-3
III B W D (Solanaceae)
TYLCV-7
III B W D (Solanaceae)
MYMV
III B ? D (Fabaceae)
TYLCV-6
III M W D (Solanaceae)
I
TYLCV-4
III M W D (Solanaceae)
I
TYLCV-5
III M W D (Solanaceae)
TYLCV-2
III M W D (Solanaceae)
TYLCV-3
III M W D (Solanaceae)
ACMV
III B W D (Fabaceae)
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
<5(
a4
L
95
A
(11)
93
A
(14:
I
I
I
I
I
I
I
I
I
I
I
BCTV-
1
BCTV-2
II M L D (44 Families)
II M L D (44 Families)
I
I
I
I
Genome
lication-associated
protein evolution may be constrained
by plant host (monocot vs. dicot) specificity. Analysis
of the replication-associated
protein DNA sequences
(fig. 1) reveals a tendency for viral isolates to associate
based on plant host specificity (mono- vs. dicot), with
the exception of the subgroup I virus TYDV. While the
replication-associated
amino acid sequence parsimony
analysis allows (but does not require) association
of
TYDV with other dicot-infecting
viruses (data not
shown), the replication-associated
protein amino acid
distance (data not shown) and DNA parsimony (fig. 1)
analyses argue for a closer relationship between TYDV
and the monocot-infecting
MiSV and WDV than between TYDV and other dicot-infecting
viruses. (Coat
protein analysis concurs
[fig. 21.) However, shortest
identified constraint trees forcing monocot- and dicotinfecting viruses to form monophyletic
clades did not
differ significantly
(cx = 0.05) from most-parsimonious
trees for either the replication-associated
(6,478 vs.
6,458 steps, respectively)
or the coat protein (4,407 vs.
4,392 steps, respectively)
DNA sequences.
Among certain dicot-infecting
viruses, ability to infect particular plant families is para- or polyphyletically
derived.
For example,
isolates
of the pseudotaxon
TYLCV which infect solanaceous
plants occupy positions basal to all (replication-associated
protein, fig. 1)
or most (coat protein, fig. 2) other dicot-infecting
viruses
(exclusive
of TYDV). This suggests that Solanaceaeinfecting viruses may be more closely related to viruses
infecting Cucurbitaceae,
Euphorbaceae,
Fabaceae, Malvaceae, and, for the replication-associated
protein, up to
44 other dicot families, than they are to other Solanaceae-infecting
viruses! Constraint trees consistent with
the assumption
that viruses infecting the Solanaceae
constitute a monophyletic
clade were generated from the
replication-associated
and coat protein DNA sequences.
Shortest identified
constraint
trees (6,822 and 4,7 16
steps for the replication-associated
and coat protein
DNA sequences, respectively)
differed significantly
(cx
= 0.05) from most-parsimonious
trees (6,458 and 4,392
steps for the replication-associated
and coat protein
DNA sequences, respectively).
Similarly, shortest identified constraint trees consistent with the assumption that
viruses infecting the Fabaceae constitute a monophyletic
clade (6,7 19 and 4,526 steps for the replication-associated and coat protein DNA sequences, respectively) differed significantly
from most-parsimonious
trees. Regardless of true tree polarity, among the geminivirus isolates used in this study, the ability to infect either the
Solanaceae or the Fabaceae cannot be a monophyletic-
and Gene Evolution
in Geminivirus
1119
ally derived trait. In contrast, shortest identified constraint trees consistent with the assumption that viruses
infecting the Poaceae constitute a monophyletic
clade
(6,478 and 4,407 steps for the replication-associated
and
coat protein DNA sequences, respectively) did not differ
significantly
from most-parsimonious
trees, suggesting
that the Poaceae-infecting
viruses included in this study
could have arisen from a common origin that is distinct
from that of any dicot-infecting
virus. Further sampling
of geminiviruses
might allow future evaluation of the
ability to infect other plant families.
Using distance analyses, Howarth and Vandemark
(1989) concluded that coat protein sequence evolution
may be constrained by insect vector specificity. Consistent with this possibility, our distance (not shown) and
parsimony (fig. 2) analyses of the coat protein amino
acid and DNA sequences reveal a dichotomy based on
insect vector specificity (whitefly- vs. leafhopper-transmitted). (The replication-associated
parsimony analysis
mostly agrees [fig. 11.)
We used estimates of nonsynonymous
and synonymous nucleotide substitutions
to evaluate the hypotheses of evolutionary
response of the replication-associated and coat proteins to accommodate
plant host and
insect vector specificities, respectively. In the absence of
positive selection, synonymous
nucleotide substitutions
(those preserving amino acid sequence) are expected to
survive more frequently than nonsynonymous
substitutions (those altering amino acid sequence). An increased
ratio of nonsynonymous-to-synonymous
changes suggests selective pressure in favor of amino acid sequence
alteration. Little is known about the rate of divergence
among or the time of origin of geminiviruses.
Definition
of evolutionary
events, as indicated by comparatively
high KJK, ratios, can become obscured over time by
accumulation
of synonymous changes. In the absence of
information
about the rate of divergence or the time of
origin, we have examined K, and KS estimates from both
extant and inferred
ancestral
sequences.
The latter
should provide a more accurate estimate by removing
substitutions
occurring
along terminal
branches.
Because similarity between certain pairs of coat protein
sequences is insufficient to allow meaningful estimates
of K, and KS (Li 1993), we have also limited our analyses of this gene to within- and between-group
comparisons
using the phylogenetically
defined African
streak virus and BGMV clades and viral sequences at
or near the proposed “switch point” between leafhopper- and whitefly-transmitted
groups. Although our analyses are unrooted, topology predicts for the replication-
t
FIG. l.-One
of two most-parsimonious
trees (6,458 steps, consistency index = 0.384, retention index = 0.601) identified by 100 heuristic
search replications with random taxon addition of the replication-associated
protein DNA sequence. Most-parsimonious
trees differed only in
the presence/absence
of the branch separating CSMV and other predominantly
monocot-infecting
viruses (indicated by a decay index of cl).
Isolate abbreviations
are listed in table 1. Numbers above branches are bootstrap percentages. Bootstrap values of less than 50% are indicated
as “(50.”
Numbers below branches are decay indices. Branches present in consensus trees four steps longer than the most-parsimonious
trees
are indicated as “>4.”
Numbers in parentheses at individual nodes represent inferred ancestral DNA sequences used to evaluate possible
evolutionary
response of the replication-associated
protein sequence to plant host specificity. Virus subgroups and characteristics
are listed
adjacent to isolate abbreviations.
“Genome” organization is monopartite (M) or bipartite (B); “Insect Vector” is leafhopper (L) or whitefly (W);
“Plant Host” is dicot (D) or monocot (M). Plant host families are listed in parentheses. Isolates are divided into dicot-infecting and predominantly
monocot-infecting
groups.
Whitefly-transmitted
Leafhopper-transmitted
__
I
I
African
Streak Virus
Clade
’
:
I
I
I
L_
II M L D (44 Families)
BCTV- 1
II M L D (44 Families)
BCT’V-2
100
>4
TYLCV-6
III M W D (Solanaceae)
TYLCV-7
III B W D (Solanaceae)
TLCrV- 1
III B W D (Solanaceae)
ToMoV
III B W D (Solanaceae)
__
I
I
I
I
I
I M L M (Poaceae)
PanSV
AbMV
III B W D (Malvaceae)
I M L M (Poaceae)
MSV-1
PYMV
III B W D (Solanaceae)
MSV-2
BDMV
III B W D (Fabaceae)
BGMV-3
III B W D (Fabaceae)
I
88
(5) >4
SLCV
III B W D (Cucurbitaceae)
I
.
TGMV
III B W D (Solanaceae)
I M L M (Poaceae)
I
M L M (Poaceae)
(11)
DSV
I M L M (Poaceae)
ssv
;
(12)
I M L M (Poaceae)
I M L M (Poaceae)
I M L D (Solanaceae)
I M L M (Poaceae)
CSMV
I M L M (Poaceae)
WDV-2
I M ? M (Poaceae)
MiSV
I
I
I
I
I
I
I
I
TYDV
WDV-1
(6)$
I
BGMV-2
III B W D (Fabaceae)
BGMV4
III B W D (Fabaceae)
BGMV-1
III B W D (Fabaceae)
TYLCV-2
III M W D (Solanaceae)
TYLCV-3
III M W D (Solanaceae)
TYLCV-1
III M W D (Solanaceae)
TYLCV-5
III M W D (Solanaceae)
TYLCV4
III M W D (Solanaceae)
I
(7) *
100
>4
I
(9) *
m
>4
(14)
r
92
%I
+-
+
; BGMV
I Clade
I
I
I
I
I
I
I
I
(19) L
(18)
I
I
ACMV
III B W D (Fabaceae)
I
TLCV-3
III M W D (Solanaceae)
I
TLCV-1
III B W D (Solanaceae)
I
TLCV-2
III B W D (Solanaceae)
I
MYMV
III B ? D (Fabaceae)
I
+
(15)
I
<50
3
I
l
,,I
FIG. 2.-One
of two most-parsimonious
trees (4,392 steps, consistency index = 0.438, retention index = 0.685) identified by 100 heuristic search replications with random taxon addition of the
coat protein DNA sequence. Most-parsimonious
trees differed only in the presence/absence
of the branch separating AbMV and PYMV (indicated by a decay index of cl). Isolate abbreviations are
listed in table 1. Numbers above branches are bootstrap percentages. Bootstrap values of less than 50% are indicated as “~50.” Numbers below branches are decay indices. Branches present in consensus
trees four steps longer than the most-parsimonious
trees are indicated as “>4.” Numbers in parentheses at individual nodes represent inferred ancestral DNA sequences used to evaluate possible
evolutionary response of the coat protein sequence to insect vector specificity. Virus characteristics
are as in figure 1. Isolates are divided into leafhopper-transmitted
and whitefly-transmitted
groups.
Genome
associated protein that the dicot- and/or predominantly
monocot-infecting
viruses arose after the dicot/monocot
switch point. Similarly, for the coat protein, the leafhopper- and/or whitefly-transmitted
viruses arose after
the leafhopper/whitefly
switch point. Because of the
possibility of accumulation
of synonymous
changes after the switch points, K, and KS values near (and basal
to) the switch points will be underestimated.
Elevated
KJK, ratios near (or basal to) the switch points therefore
indicate that, despite subsequent synonymous
changes,
the effects of a defining evolutionary
event are still molecularly evident. Conclusions based on K,IK, ratios and
K, versus KS plots for extant and inferred ancestral sequences were identical for both the replication-associated and coat protein genes. Here we present only K,
versus KS plots for inferred ancestral sequences.
If the replication-associated
protein amino acid sequence were evolving in response to plant host specificity, a dramatic increase in the frequency of nonsynonymous relative to synonymous
changes would be expected near the switch point between predominantly
monocot-infecting
and predominantly
dicot-infecting
virus groups. In figure 1, this switch point occurs along
the branch separating nodes 9 and 10. Pairwise estimates
of KJK, ratios for all inferred ancestral replication-associated protein DNA sequences were examined (not
shown). Values near the monocot/dicot
switch point are
in general higher than the population
average (0.56),
with the average KJK, value near (i.e., immediately adjacent to or one node removed from) the monocot/dicot
switch point being 0.85. However, Ka/Ks values reach a
maximum not near the switch point but in comparison
of sequences 32 and 33 (1.43). These ancestral sequences are embedded within the dicot-infecting
group and
are most closely related to BGMV strains. Furthermore,
some KJK, values within the predominantly
monocotinfecting clade are equal to or greater than those near
the monocot/dicot
switch point. Figure 3 shows plotted
pairwise data points for all inferred ancestral replicationassociated protein DNA sequences. Accompanying
data
trend lines suggest no substantial
differences between
within-monocot-infecting,
within-dicot-infecting,
and
K, and KS estimates.
between-dicot/monocot-infecting
These observations
suggest that, although there may be
some evolutionary
response of the replication-associated
protein gene to plant host specificity (as defined by Howarth and Vandemark [1989]; i.e., monocot vs. dicot),
other factors are likely involved. This contradicts
the
apparent dichotomy observed in our distance and parsimony analyses (exclusive of TYDV) and in the distance analysis of Howarth and Vandemark (1989). Alternatively, synonymous
substitutions
accumulated after
the switch point may be sufficient to obscure a more
pronounced
effect. Padidam,
Beachy,
and Fauquet
(199%~) concluded that host range does corroborate evolutionary relationships
predicted by total genome DNA
sequence and may not be a reliable taxonomic character.
If the coat protein amino acid sequence were evolving in response to insect vector specificity, a dramatic
increase in the frequency of nonsynonymous
changes
relative to synonymous
changes would be expected near
and Gene Evolution
in Geminivirus
MM
DD
DM
KS
FIG. 3.-Plot
of K, versus KS (nonsynonymous
vs. synonymous
nucleotide substitutions) estimates for all pairwise comparisons of inferred ancestral replication-associated
protein DNA sequences. Data
symbols and trend lines indicate values within dicot-infecting
(B and
DD), within monocot-infecting
(A and MM), and between dicot- and
monocot-infecting
(0 and DM) isolates.
the switch point between leafhopper- and whitefly-transmissible virus groups. In figure 2, this switch point occurs along the branch separating nodes 11 and 12. Distances among extant and ancestral coat protein DNA
sequences are in general higher than those observed
among extant and ancestral replication-associated
protein DNA sequences (data not shown), suggesting that
protein is more highly conthe replication-associated
served.
Pairwise estimates of K, and KS were examined for
inferred ancestral coat protein DNA sequences (data not
shown). K,IK, values peak near the leafhopper/whitefly
switch point and all switch point values (average 0.82)
substantially
exceed African streak virus (average 0.28)
and BGMV (0.15) clade estimates. Figure 4 shows plotted pairwise data points for inferred ancestral coat protein DNA sequences.
Accompanying
data trend lines
suggest substantial
differences
between
leafhopper/
whitefly switch point values and within African streak
virus and BGMV clade estimates. These observations
support the hypothesis that the coat protein sequence
evolves in response to insect vector specificity. It is interesting to note that observed K,IK, values are consistent with a gradual progression from leafhopper transmission to whitefly transmission.
Sequence 12 is the earliest inferred
whitefly-transmitted
ancestor. This sequence
may be oniy inefficiently
transmitted
by
whiteflies. Subsequent further nonsynonymous
changes
in DNA sequence give rise to sequences 13 and 14, as
demonstrated
by high K.JKs ratios in comparisons
of
these sequences with sequence 12 (0.86 for sequence 12
vs. 13 and 1.13 for sequence 12 vs. 14). (The reverse
pattern, progression from whitefly transmission
to leafhopper transmission,
is less likely, as it requires a grad-
1122
Bradeen
et al.
LW
LL
ww
1
0
50
100
150
200
250
KS
vs. synonymous
FIG. 4.-Plot
of K, versus KS (nonsynonymous
nucleotide substitutions) estimates for pairwise comparisons of inferred
ancestral coat protein DNA sequences near the leafhopper/whitefly
switch point and within the African streak virus and BGMV clades.
Data symbols and trend lines indicate values within the leafhoppertransmitted African streak virus clade (m and LL), within the whiteflytransmitted BGMV clade (A and
and near the leafhopper/whitefly switch point (0 and LW).
ual loss of whitefly transmissibility
in the progenitor
clade prior to acquisition of leafhopper transmissibility.)
Consistent with our conclusion,
Briddon et al. (1990)
demonstrated,
using an ACMV construct modified by
the replacement of its native coat protein gene with that
of BCTV, that the coat protein gene alone is sufficient
to alter insect vector specificity. Geographical
isolation
was likely involved in the divergence and adaptation of
the coat protein sequence from the earliest whiteflytransmitted
viruses (Padidam,
Beachy, and Fauquet
1995a).
Average within-group
K,IK, values are notably
higher for African streak (leafhopper)
virus clade sequences (0.28) than for BGMV (whitefly) clade sequences (0.15). Padidam, Beachy, and Fauquet (1995a)
suggested that the coat protein is less constrained evolutionarily in leafhopper- than in whitefly-transmitted
viruses. The opposite may be true. Lower K,IK, values for
within-BGMV
comparisons could reflect the fact that all
BGMV viruses are vectored by the same whitefly species (Bemisia tabaci), whereas African streak clade viruses are vectored by at least two different genera of
leafhopper (Cicadulina and Nesoclutha).
This suggests
that the coat protein may be important not only in specifying leafhopper versus whitefly transmissibility,
but
also in specifying individual genera or species of insect
vector. Alternatively,
the coat protein sequence may be
constrained in leafhopper-transmitted
types by factors in
addition to insect vector specificity. Consistent with this
possibility, it has been demonstrated
for several whitefly-transmitted
viruses that an intact coat protein is not
necessary for infectivity following mechanical or agrobacterium
inoculation
(Gardiner et al. 1988; Klinken-
berg, Ellwood, and Stanley 1989; Padidam, Beachy, and
Fauquet 1995b). In contrast, the coat protein is important for systemic viral spread and symptom development
for leafhopper-transmitted
viruses (Boulton et al. 1989;
Briddon et al. 1989; Lazarowitz et al. 1989; Woolston
et al. 1989).
Geminiviruses
are as diverse as the agronomic diseases for which they are responsible. Classification
systems in the past have relied on genome organization,
plant host, and insect vector. Evidence presented here
suggests that genome organization
and plant host specificity may not necessarily reflect evolutionary
relationships. Our research supports recent modifications
in
geminivirus
classification
that includes deemphasis
of
these characteristics.
Future taxonomic efforts may rely
heavily on distance and parsimony analyses of the replication-associated
and coat protein sequences. Evidence
that the coat protein evolves in response to insect vector
specificity suggests that geminiviruses
might be useful
specimens for the study of gene evolution.
Acknowledgments
The authors wish to thank Dr. Gregory King and
Dr. Vineet Bafna for help with computer programming
and Dr. Robert Vrijenhoek
for helpful advice and insight. This work was supported by grant DE-FG0295ER20194 from the U.S. Department of Energy to J.M.
LITERATURE
ABOUZID,
CITED
A. M., J. E. POLSTON,
and E. HIEBERT.
1992.
The
nucleotide sequence of tomato mottle virus, a new geminivirus isolated from tomatoes in Florida. J. Gen. Virol. 73:
3225-3229.
ANDERSEN, M. T, K. A. RICHARDSON, S. HARBISON, and B.
A. M. MORRIS. 1988. Nucleotide sequence of the geminivirus chloris striate mosaic virus. Virology 164:443-449.
Y., and S. COHEN. 1994. Complete nucleotide sequence of an infectious clone of a mild isolate of tomato
yellow leaf curl virus (TYLCV). Phytopathology
84:707712.
BOULTON, M. I., H. STEINKELLNER,J. DONSON, P G. MARKHAM, D. I. KING, and J. W. DAVIES. 1989. Mutational analysis of the virion-sense genes of maize streak virus. J. Gen.
Virol. 70:2309-2323.
BREMER, K. 1988. The limits of amino acid sequence data in
angiosperm phylogenetic reconstruction. Evolution 42:795803.
BRIDDON, R. W., P LUNNESS, L. C. L. CHAMBERLIN, M. S.
PINNER, H. BRUNDISH, and I? G. MARKHAM. 1992. The nucleotide sequence of an infectious insect-transmissible
clone
of the geminivirus Panicurn streak virus. J. Gen. Virol. 73:
ANTIGNUS,
1041-1047.
BRIDDON, R. W. and E? G. MARKHAM. 1995. Geminiviridae.
Pp. 158-l 65 in E A. MURPHY, C. M. FAUQUET, M. A.
MAYO, A. W. JAVIS, S. A. GHABRIAL, M. D. SUMMERS, G.
l? MARTELLI, and D. H. L. BISHOP, eds. The classification
and nomenclature
of viruses: sixth report of the International Committee on Taxonomy of Viruses. Archives of Virology. Suppl. 10. Springer Verlag, Wein.
BRIDDON, R. W., M. S. PINNER, J. STANLEY, and I? G. MARKHAM. 1989. The coat protein of beet curly top virus is essential for infectivity. Virology 172:628-633.
Genome
-
1990. Geminivirus coat protein gene replacement alters insect specificity. Virology 177:85-94.
CHATANI, M., Y. MATSUMOTO, H. MIZUTA, M. IKEGAMI, M. I.
BOULTON, and J. W. DAVIES. 1991. The nucleotide sequence
and genome structure of the geminivirus miscanthus streak
virus. J. Gen. Virol. 72:2325-233 1.
COUTTS, R. H. A., R. S. COFFIN, E. J. E ROBERTS, and W. D.
0. HAMILTON. 1991. The nucleotide sequence of the infectious cloned DNA components of potato yellow mosaic virus. J. Gen. Virol. 72:1515-1520.
DONOGHUE, M. J., R. G. OLMSTEAD, J. E SMITH, and J. D.
PALMER. 1992. Phylogenetic
relationships
of Dipsacales
based on &CL sequences. Ann. MO. Bot. Gard. 79:333-345.
DONSON, J., G. F? ACCOTTO, M. I. BOULTON, I? M. MULLINEAUX, and J. W. DAVIES. 1987. The nucleotide sequence
of a geminivirus from Digitaria sanguinalis. Virology 161:
160-169.
DRY, I. B., J. E. RIGDEN, L. R. KRAKE, I? M. MULLINEAUX,
and M. A. REZAIAN. 1993. Nucleotide sequence and genome organization of tomato leaf curl geminivirus. J. Gen.
Virol. 74:147-151.
FELSENSTEIN, J. 1984. Distance methods for inferring phylogenies: a justification. Evolution 38: 16-24.
-.
1985. Confidence limits on phylogenies: an approach
using the bootstrap. Evolution 39:783-29 1.
1989. PHYLIP-phylogeny
inference package. Version-3.2. Cladistics 5: 164-166.
FRISCHMUTH, T., G. ZIMMAT, and H. JESKE. 1990. The nucleotide sequence of abutilon mosaic virus reveals prokaryotic
as well as eukaryotic features. Virology 178:461-468.
GARDINER, W. E., G. SUNTER, L. BRAND, J. S. ELMER, S. G.
ROGERS, and D. M BISARO. 1988. Genetic analysis of tomato golden mosaic virus: the coat protein is not required
for systemic spread or symptom development. EMBO J. 7:
899-904.
GILBERTSON, R. L., J. C. FARIA, S. E HANSON, E J. MORALES,
I? AHLQUIST, D. F? MAXWELL, and D. R. RUSSELL. 1991.
Cloning of the complete DNA genomes of four bean-infecting geminiviruses
and determining their infectivity by
electric discharge particle acceleration. Phytopathology
81:
980-985.
GILBERTSON, R. L., S. H. HIDAYAT, E. J. PAPLOMATAS, M. R.
ROJAS, Y. Hou, and D. I? MAXWELL. 1993. Pseudorecombination between infectious cloned DNA components of tomato mottle and bean dwarf mosaic geminiviruses. J. Gen.
Virol. 74:23-3 1.
HAMILTON, W. D. O., V. E. STEIN, R. H. A. COUTTS, and K.
W. BUCK. 1984. Complete nucleotide sequence of the infectious cloned DNA components of tomato golden mosaic
virus: potential coding regions and regulatory sequences.
EMBO J. 3:2197-2205.
HOWARTH, A. J., J. CATON, M. BOSSERT, and R. M. GOODMAN.
1985. Nucleotide sequence of bean golden mosaic virus and
a model for gene regulation in geminiviruses.
Proc. Natl.
Acad. Sci. USA 82:3572-3576.
HOWARTH, A. J., and R. M. GOODMAN. 1986. Divergence and
evolution of geminivirus genomes. J. Mol. Evol. 23:323329.
HOWARTH, A. J., and G. J. VANDEMARK. 1989. Phylogeny of
geminiviruses.
J. Gen. Virol. 70:27 17-2727.
HUGHES, E L., E. I? RYBICKI, and R. KIRBY. 1993. Complete
nucleotide sequence of sugarcane streak Monogeminivirus.
Arch. Virol. 132: 17 l-l 82.
KHEYR-POUR, A., M. BENDAHMANE, V. MATZEIT, G. I? AcCOTTO, S. CRESPI, and B. GRONENBORN. 1992. Tomato yellow leaf curl virus from Sardinia is a whitefly-transmitted
and Gene Evolution
in Geminivirus
1123
monopartite
geminivirus.
Nucleic Acids Res. 19:67636769.
KIKUNO, R., H. TOH, H. HAYASHIDA, and T MIYATA. 1984.
Sequence similarity between putative gene products of geminiviral DNAs. Nature 308:562.
KISHINO, H., and M. HASEGAWA. 1989. Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in
Hominoidea. J. Mol. Evol. 29:170-179.
KLINKENBERG,E A., S. ELLWOOD, and J. STANLEY. 1989. Fate
of African cassava mosaic virus coat protein deletion mutants after agroinoculation.
J. Gen. Virol. 70: 1837-1844.
KOONIN, E. V., and T. V. ILYINA. 1992. Geminivirus replication
proteins are related to prokaryotic plasmid rolling circle
DNA replication initiator proteins. J. Gen. Virol. 73:27632766.
LAZAROWITZ, S. G. 1988. Infectivity and complete nucleotide
sequence of the genome of a South African isolate of maize
streak virus. Nucleic Acids Res. 16:229-249.
LAZAROWITZ, S. G., and I. B. LAZDINS. 1991. Infectivity and
complete nucleotide sequence of the cloned genomic components of a bipartite squash leaf curl geminivirus with a
broad host range phenotype. Virology 180:58-69.
LAZAROWITZ, S. G., A. J. PINDER, V. D. DAMSTEEGT, and S.
G. ROGERS. 1989. Maize streak virus genes essential for
systemic spread and symptom development.
EMBO J. 8:
1023-1032.
LI, W. 1993. Unbiased estimation of the rates of synonymous
and nonsynonymous
substitution. J. Mol. Evol. 36:96-99.
MACDOWELL, S. W., H. MACDONALD, W. D. 0. HAMILTON,R.
H. A. COUTTS, and K. W. BUCK. 1985. The nucleotide sequence of cloned wheat dwarf virus DNA. EMBO J. 4:
2173-2180.
MADDISON, D. R. 1991. The discovery and importance of multiple islands of most parsimonious
trees. Syst. Zool. 40:
3 15-328.
MATTHEWS, R. E. F. 1985. Viral taxonomy for the nonvirologist. Annu. Rev. Microbial. 39:45 l-474.
MORINAGA, T., M. IKEGAMI, and K. MIURA. 1993. The nucleotide sequence and genome structure of mung bean yellow
mosaic geminivirus. Microbial. Immunol. 37:47 l-476.
MORRIS, B., L. COATES, S. LOWE, K. RICHARDSON, and I?
EDDY. 1990. Nucleotide sequence of the infectious cloned
DNA components of African cassava mosaic virus (Nigerian strain). Nucleic Acids Res. l&197-198.
MORRIS, B. A. M., K. A. RICHARDSON, A. HALEY, X. ZHAN,
and J. E. THOMAS. 1992. The nucleotide sequence of the
infectious cloned DNA component of tobacco yellow dwarf
virus reveals features of geminiviruses
infecting monocotyledonous plants. Virology 187:633-642.
MULLINEAUX, I? M., J. DONSON, B. A. M. MORRIS-KRSINICH,
M. I. BOULTON, and J. W. DAVIES. 1984. The nucleotide
sequence of maize streak virus DNA. EMBO J. 3:30633068.
MULLINEAUX, I? M., E GUERINEAU, and G. I? ACCOTTO. 1990.
Processing
of complementary
sense RNAs of Digitaria
streak virus in its host and in transgenic tobacco. Nucleic
Acids Res. l&7259-7265.
MUMFORD, D. L. 1974. Purification of curly top virus. Phytopathology 64: 136-139.
NAVOT, N., E. PICHERSKY, M. ZEIDAN, D. ZAMIR, and H.
CZOSNEK. 199 1. Tomato yellow leaf curl virus: a whiteflytransmitted geminivirus with a single genomic component.
Virology 185:151-161.
NORIS, E., E. HIDALGO, G. F? ACCOTTO, and E. MORIONES.
1994. High similarity among the tomato yellow leaf curl
virus isolates from the West Mediterranean
Basin: the nu-
1124 Bradeen et al.
cleotide sequence of an infectious clone from Spain. Arch.
Virol. 135: 165-170.
PADIDAM, M., R. N. BEACHY, and C. M. FAUQUET. 199%.
Classification and identification of geminiviruses
using sequence comparisons. J. Gen. Virol. 76:249-263.
-.
199%. Tomato leaf curl geminivirus from India has a
bipartite genome and coat protein is not essential for infectivity. J. Gen. Virol. 76:25-35.
PAPLOMATAS, E. J., V. I? PATEL, Y. Hou, A. 0. NOUEIRY, and
R. L. GILBERTSON. 1994. Molecular characterization
of a
new sap-transmissible
bipartite genome geminivirus infecting tomatoes in Mexico. Phytopathology
84: 12 15-l 224.
ROCHESTER, D. E., J. J. DEPAULO, C. M. FAUQUET, and R. N.
BEACHY. 1994. Complete nucleotide sequence of the geminivirus tomato yellow leaf curl virus, Thailand isolate. J.
Gen. Virol. 75:477-485.
SCHALK, H. J., V. MATZEIT, B. SCHILLER, J. SCHELL, and B.
GRONENBORN. 1989. Wheat dwarf virus, a geminivirus of
graminaceous plants, needs splicing for replication. EMBO
J. 8:359-364.
STANLEY, J., I? G. MARKHAM, R. J. CALLIS, and M. S. PINNER.
1986. The nucleotide sequence of an infectious clone of the
geminivirus beet curly top virus. EMBO J. 5:1761-1767.
STENGER, D. C. 1994. Complete nucleotide sequence of the
hypervirulent
CFH strain of beet curly top virus. Mol.
Plant-Microbe Interact. 7: 154-157.
SWOFFORD, D. L. 1993. PAUP: phylogenetic
analysis using
parsimony. Version 3.1.1. Smithsonian Institution, Washington, D.C.
TEMPLETON, A. R. 1983. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of humans and the apes. Evolution
37:221-244.
TIMMERMANS,M. C. l?, 0. I? DAS, and J. MESSING. 1994. Geminiviruses and their uses as extrachromosomal
replicons.
Annu. Rev. Plant Physiol. Plant Mol. Biol. 45:79-l 12.
WOOLSTON, C. J., H. V. REYNOLDS, N. J. STACEY, and F? M.
MULLINEAUX. 1989. Replication of wheat dwarf virus DNA
in protoplasts and analysis of coat protein mutants in protoplasts and plants. Nucleic Acids Res. 17:6029-6041.
JULIAN I? ADAMS, reviewing
Accepted
August
1, 1997
editor