Journal of General Virology (1993), 74, 2225-2231. Printed in Great Britain 2225 Complete nucleotide sequence of pepper huasteco virus: analysis and comparison with bipartite geminiviruses I. Torres-Pacheco,~- J. A. Garz6n-Tiznado,~" L. Herrera-Estrella and R. F. Rivera-Bustamante* Departamento de Ingenier[a Gendtica, Centro de Investigaci6n y de Estudios Avanzados del LP.N., Unidad Irapuato, Apartado Postal no. 629, Irapuato, Gto. Mdxico The complete nucleotide sequence of pepper huasteco geminivirus (PHV) isolated in Northern Mexico was determined. The PHV genome consists of two circular ssDNA molecules of 2631 bases (PHV A) and 2589 bases (PHV B). PHV has a genome organization typical of a bipartite geminivirus with four open reading frames (ORFs) (AR1, ALl, AL2 and AL3) in component A and two (BR1 and BL1) in component B. An unexpected ORF was found in the complementary sense strand of PHV A. This ORF, termed AL5, is found entirely inside, but in the opposite orientation to AR1 (encoding the coat protein). AL5 shows some homology to equivalent but smaller ORFs predicted in other geminiviruses. Phylogeny trees based on pairwise comparisons ofAR1, AL2, AL3, BL1 and BR1 predicted proteins placed PHV among the western hemisphere geminiviruses. A phylogeny tree based on ALl (replicase-encoding OR]=), on the other hand, placed PHV with eastern hemisphere geminiviruses, i.e. African cassava mosaic virus and the Sardinia and Israel isolates of tomato yellow leaf curl virus. Possible mechanisms for the 'hybrid or transition nature' of PHV are discussed. Introduction Genetic analyses have suggested the possible function for some ORF products. A R 1 0 R F is considered to be the coat protein gene, although to our knowledge direct evidence of this has been reported for only two cases [e.g. sequencing of the African cassava mosaic virus (ACMV) coat protein and immunological identification of the tomato golden mosaic virus (TGMV) AR1 gene product after expression in Escherichia coli] (Stanley & Gay, 1986; Murayama et al., 1991). The products of the ALl, AL2 and A L 3 0 R F s have been implicated in several steps of the replication process (Elmer et al., 1988; Sunter et al., 1990). In addition, the AL2 protein has been reported as a possible trans-activator needed for expression of AR1 and BR1 genes (Sunter & Bisaro, 1991). The ORFs found in component B, BL1 and BR1, on the other hand, have been suggested as being responsible for the movement of the virus throughout the plant (Lazarowitz, 1992; von Arnim & Stanley, 1992). Recently we reported the cloning of both components of pepper huasteco virus (PHV), a new whiteflytransmitted bipartite geminivirus affecting pepper crops in northern Mexico (Garz6n-Tiznado et al., 1993). This non-mechanically transmissible geminivirus has been detected in several horticultural areas in the country, usually in association with other non-characterized geminiviruses. These complex viral diseases are currently the most important phytopathological problem in Mexican horticulture. Monomeric clones of PHV DNAs (A + B) are infectious with a 60 to 70 % efficiency when Geminiviruses are a group of plant viruses that have a small 'twinned' isometric particle and a genome containing one or two ssDNA molecules. Most geminiviruses can be classified into four subgroups according to their host-range, insect vector and genome organization. All bipartite geminiviruses described so far show similar genomic organization with four open reading frames (ORFs) in component A and two ORFs in component B. Component A has one ORF (AR1) in the positive or viral sense and three ORFs (ALl, AL2 and AL3) in the negative or complementary sense. In component B, one ORF is found in each orientation (BR1 and BL1). In both cases, the ORFs extend in opposite directions from a 180 to 200 base common region shared by both components. Some sequences in this common region have been reported to interact with the respective putative replicase (ALl protein) and are unique for each geminivirus except for a conserved 30 base sequence element with a potential hairpin structure (Lazarowitz, 1992). Nucleotidesequencesof PHV are availablefromthe EMBLdatabase with the followingaccession numbers: PHV A, X70418; PHV B, X70419. ? Permanent address: Instituto Nacional de InvestigacionesForestales y Agropecuarias,ApartadoPostal no. 112,Celaya,Gto. Mrxico. 0001-1698 © 1993 SGM Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13 2226 L Torres-Pacheco and others introduced into pepper plants by a biolistic procedure. Neither A nor B D N A produces any symptoms when inoculated alone (Garzdn-Tiznado et al., 1993). Here, we report the complete nucleotide sequence of PHV DNAs. We also compared PHV predicted proteins with those of previously reported geminiviruses in an attempt to address the question of the possible origin of this important geminivirus problem in Mexico. Methods Sequencing of PHV. All standard procedures utilized during the cloning and sequencingof viral DNA were according to Sambrook et al. (1989). Viral inserts contained in plasmids pIGV21 and pIGV22 were sequenced using Sequenase 2.0 kits from United States Biochemicals according to the manufacturer's instructions. In some cases, restrictionfragments from pIGV21 and pIGV22 were subcloned into Bluescript SK+ (Stratagene) for sequencing. Synthetic oligonucleotides were used as primers to sequence regions where no convenient restriction sites could be found. Oligonucleotides were produced in a DNA SynthesizerModel 381Afrom Applied Biosystems. Computer analysis. Sequences were assembled and analysed using GeneWorks2.2 and PCGene6.5 (Intelligenetics)and MacDNAsis 2.0 (Hitachi) softwarepackages. The followinggeminivirussequenceswere obtained directly from the EMBL and GenBank databases and used for comparisons: ACMV (X17095, X17096) (Stanley & Gay, 1986); abutilon mosaic virus, AbMV (X15983, X15984) (Frischmuth et al., 1990); beet curly top virus, BCTV(X04144)(Stanleyet al., 1986);bean golden mosaic virus, Puerto Rico isolate, BGMV-PR (D00200, D00201) (Howarth et al., 1985), Brazil isolate, BGMV-BZ (M88686, M88687), and Guatemala isolate, BGMV-GA (M91604, M91605) (Gilbertson et al., 1991); bean dwarf mosaic virus, BDMV (M88179, M88180) (Hidayat et al., 1993); potato yellow mosaic virus, PYMV (D00940, D00941) (Coutts et al., 1991); TGMV (K02029, M73794) (Hamiltonet al., 1984;yon Arnim & Stanley, 1992);tomato yellowleaf curl virus, Sardinia, TYLCV-S (X61153) and Israel isolates, TYLCVI (X15656) (Kbeyr-Pour et al., 1991; Navot et al., 199t); squash leaf curl virus, SqLCV (M38182, M38183) (Lazarowitz& Lazdins, 1991). Phylogeny trees were produced with the GeneWorks software package which uses the UPGMA (unweightedpair group method with arithmetic mean) procedure (Nei, 1987). The program algorithm does pairwise comparisons and first clusters the two most similar sequences and calculates the average similarityof those sequences. The program then clusters the next two most similar objects. Results and Discussion P H V genornic organization Sequence analysis revealed PHV as a typical bipartite geminivirus whose genome is divided in two singlestranded molecules termed PHV A and PHV B. Components A and B are 2631 and 2589 bases long, respectively. Fig. 1 presents the nucleotide sequence o f both components. Also shown is the 30 base stem-loop region shared by all geminiviruses infecting dicotyledons (GGCCAT/ACCGNT/AA/TTAATATTACCGGA/TTGGCC). Nucleotide 1 in both cases corresponds to the first base of the 30 base conserved region. This region contains the typical loop structure T A A T A T T A C , found in hairpin structures of all monocotyledon- and dicotyledon-infecting geminiviruses, and is believed to be involved in virus replication (Lazarowitz, 1987). Similarly to all previously reported geminiviruses, the base composition of PHV showed that thymidine is present at the highest percentage, whereas cytosine is the lowest in occurrence. Individually, the base composition of each component was: PHV A, 26.25 % A, 19.45 % C, 23.82% G and 30.47% T; PHV B, 28.23% A, 18.35% C, 21.71% G and 31.71% T. The genomic organization of PHV is similar to that of all bipartite geminiviruses. Fig. 2 illustrates the coding capacity of PHV. Component A contains five putative ORFs and component B contains two. The ORFs are found in both orientations: viral or positive-sense and complementary or negative-sense. The nomenclature used for PHV ORFs is based on analogies with previously reported geminiviruses. Analysis o f P H V ORFs Computer analysis suggests a subdivision for bipartite geminiviruses into western (WH) and eastern (EH) hemisphere geminivirus subgroups. W H geminiviruses included BGMV, T G M V and AbMV whereas E H geminiviruses were represented by A C M V and TYLCV (Howarth & Vandemark, 1989). The analysis was based on pairwise homologies of coat proteins (ARl-encoded) and the product of the A L 1 0 R F (putative replicase). To determine the situation of PHV among these subgroups, the predicted protein from PHV AR1 was compared pairwise with the equivalent proteins from several other geminiviruses (Table i). PHV AR1 showed a high degree of similarity (82 to 90%) with the W H geminivirus subgroup, closest to the Guatemalan isolate of B G M V (BGMV-GA) (Gilbertson et al., 1991). The degree of similarity decreased (68 %) when the PHV AR1 protein was compared with the equivalent proteins from members of the E H subgroup: A C M V and the Sardinia and Israel isolates of T Y L C V (TYLCV-S and TYLCV-I, respectively) (Kheyr-Pour et al., 1991; Navot et al., 1991). A phylogeny tree based on pairwise homology data showed an evident grouping of the viruses placing PHV, as expected, with the W H cluster (Fig. 3 a). Alignments of coat proteins from all viruses included in Fig. 3 revealed several highly conserved domains (data not shown). In contrast, no significant homology was detected when the PHV AR1 protein was compared with coat proteins from geminiviruses transmitted by leafhoppers, e.g. BCTV (29 %), maize steak virus and wheat dwarf virus (data not shown). A similar analysis was applied to the putative replicase, encoded by the A L l ORF, and several interesting Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13 Complete nucleotide sequence of PHV 2227 A GGC•AT•CGTTAT••AATATTACCGGATGGCCGAC•G•TTA•CTTATCTATCCGTA•TGCTTTATTTGAATTAAAGATGTTACTTTTATGCTATCCAATGAA 100 GCGTAGCGTCTGGGAAGCTTAGTTATCAGTTCCAGACGTGGGGACCAAGTAGTGTATGA•CACTTTA.•-•-GACTGTCAGCTTTATAAATTGAAATTAAAAC 200 ATAAGTGGT~CATGTACCTTTAATTCA.6~TGCCTAAGCGTGATGCTCCTTGGCGATTAACGGCGGGGACCGCCAAGATTAGCCGAACTGGCAATAATTC ACGGGCTCTTATCATGGGCCCGAGTACTAGCAGGGCCTCAGC-•TGGGTTAATCGCCCAATGTACAGGAAGCCCCGGATTTATCGTATGTACAGAACTCCG 400 GATGTGCCGAAAGGTTGTGAAGGTCCCTGTAAGG.•TCAATCG.•TFGAACAACGACATGACGTCTCTCATGTTGGTAAGGTTAT•TGTATATCCGACGTAA 500 CTCGTGGTAATGGTATTACCCATCGTGTTGGCA/~CGATTCTGCG~TAAGTCTGTCTATATTCTGGGCAAAATCTGGATGGATGAAAATATTAAGTTGAA 600 GAACCATACCAACAGTGTCATGTTTTGGTTG GTTAGGGATAGGAGACCCTACGGTACGCCTATGGA'I"ITTGGCCAAGTCTTTAACATGTATGACAACGAG700 •••AGTA••GCTACTGTGAAGAACGATCTTCGGGATCGTTATCAAGTTATGCATAGATTcTATGCTAAGGTCACTGGTGGGCAATATGCAAGCAACGAGC 800 AAGCCTTGGTTAGGCGTTTCTGGAAGGTGAACAACCATGTTGTGTATAACCATCAAGAAGCTGGGAAATATGAGAACCACACGGAGAATGCGCTGTTATT 900 GTATATGGCATGTACTCATGCATCTAATCCCGTGTATGCAACACTCAAAATTCGGGTCTATTTTTATGACTCGATAATGAATTAATAAAGTTTGTATA•T I000 ATTTCATGATTCTCAAGTACAGCATTGACATAACG1-~TGTCTGTAGCAAACGAAACAGCCCTAA1TACATTGTTAACTGAAATAAGACCTAAGTTATCTA11~ GATAAAACATGACAAGCAATTTAAATcTATTTAAGTAAATCTGCCCAGAAATCGTCGTCAACGTCGTCCAGACTTGGAAGTTGAAGTAGGCTTTGTGGAG1200 ACCCAACGCTGTCCTCATGTTGTGGTTrGCTCTGACTTGAATGTGAAATACCCGGCTGCGTGTGTACATTGGCGTCTCCACTAGCCGTATTTTGAAATAT1300 AGGGGATTTCGAAGCTCCCAGATAAAAACGCCATTCGCTGCTTGAGCTGCAGTGATGGGTACCCCGGTGCGTAAATCCATTGTTAACACAGTTAATATGT 1400 ATATAAATTGAACAGCCGCAAGCGAGATcAATCCTTCTACGTCGTATCTGTCTCTTTGCAAATCTATGGCGAAGTTTGACTTCCGGTGGTGAAGATAGCT1500 TCTTCGATGGTGACGTAGATGGCGTTT1-•TTGGACCCAGTCATTGAGGCTCCTATTTTTCTCTTCGCTGAGGTAGTCTTTATAGGAAGACTGGGGGCCCG 1600 GATTGCAAAGAAAGATAGTGGGTATCCCACCTTTAATTTGAATrGGTTTGCCGTATTTGCAGTTTGA1TGCCAATCCCGTTGGGCCCCCATAAACTCTrT1700 AAAGTGCTTAACATAATGCGGAGGGATGTCATCAATGACGTTATACCATGCATrATTGGAGTAGAT•TTTGGGCTGAGATCCATATGACCACATATGTAA 1800 TTGTGTGGGCCGAGACTTCGGGCCCATAATGT•TTGCCTGTCCGTGAAGGACCCTCGACCACTAATGACAATGGTCTCATTGGCCGCGCAGCGGCATCAC 19~ ATACATTATCAGACACCCATTGTGTCA1-FATTGCAGGCACATTATTAAAGGACGCCTGTTGAAATGGAGGAACCCACGGTTCCGGGGGAGTTTGGAATAT2000 CCGATTAGCGTTTGACACAATGTTATGAAATTGGAGGAAGAAATGCTGAGGTTGTTCTTCCTTTATGATCTGCAGAGCTTCTTCTGCAGATGCTGAATTTZIO0 AACGCCTFAGCATATGTGTCATTAGCAGACTGCTGTCCTCCTCTAGCAGATCTGCCGTCTATTTGGAATTCTCCCCATTCTACGGTATCGCCGTCTTTGTZZ~ CGATGTACGTCTTGACGTCGGAGCTTGATTTAGCTCCCTGAATGTTCGGATGGAAATGTGCTGATCTGGTAGAGGATACGAGGTCAAAGAATCGGTTGTTZ300 CGTGCATFGGTATTTT~CTT~GAA~TGAATAAG~A~GTGCAGATGAGG.~TGCCCATCTTCATGAGATTCTrTGCA/AATTTTGATGTACTTCTTGTTTACC2400 GGCGTCGAGAGG.~TTTGTAGTTGAGCGAGACGCTCTTCT~TGGAAATGGAA~AT~GTGGATAGGTGAGGAAATAATTCTTGGCATTTAAACGAAATCGTT 25~ TAGGTAATGGCATAT-~-FGTAATAAGAGAGGTGTACACCGATTGGAGCTCTTTAACCTGGGCTTATTGTATCGGTGTATTGGTAGCCAATATATAGTATATZ6~ GGGAGTTATCTAGGATCTTCGTACACGTGAG Z63'I B GGCCATCCGTTATAATA`•-•-A•CGGATGG••GA•CGCTT•CACTCTCTTTCCTTTGGGACAGCTGGCGCGCACTATGTA-FrATGTTTACGTGGCATCATGT 100 GGGTCG.i-[GGATGAA~CCAATCGCG~GCCTTCAT-~TCAAATTAAAGTGTGTGT~CATACATCGAGAAATGTGTAATGACGTGGAGCGTTCTCCACCATrC ZOO CTGAATCGTTAGATAATTGTTTGAC•AGGACCACAGCTGTCAT•TGGGACCACACGTCCTTTGGGACCACCACTATAATGATAATGTTTCCTGTTA•TGC 300 GGTCCACGTGGTCCAATTAAATTGCACCTCGCGAGTCTACATATCCACAATTTTGAATATCCTATTCTATAAAATGGCTTCCAT•TTTATATTCAAAATT400 ATATTCACATCTCTTTTAATATATATTTATCTTTAAGCAATTTAATATGTATTCTACTAGATTTAGACGTGGGTTATCCTATGTTCCACGGCGTTATAAT 500 CCA~GTAATTATGGTTTTAAACGTACATTCGTCGTTAAACGTG~TGATGCTAAA~GACGTCAGACTCAAGTGAAGAAACTAACAGAAGATGTTAAAATGT 600 CATCACAACGCAT•CATGAAAATCAATATGGTCCAGAATTTGTCATGGCGCATAATACAGCAATATCTACATTCATCAATTATCCCCAACTGTGTAAGAC 700 TCAG•CCAATCGTAGTAGGTCATATAT•.AAG-•TAAAATCGTTACATFTTAAGGGAACCTTAAAGATCGAACGTGTTGGGTCTGAGGTAAATATGGCTGGG 800 TTAAATCCGAAGATTGAGGGTGTGTTTACTGTGGT•-TTAGTTGTTGACCGTAAGCCACA.FrTGAATCCTACTGGTAACTTGCTACAGTTTGACGAGTTATO~0 TTGGTGCAAGAATTCACAGTCTAGGGAACTTAGCCGTTACCCCGGCGTTGAAAGAACGGTTCTACATACTGCATGTGTTGAAGCGAGTTATCTCCGTTGA 1000 GAAGGATAGTATGATGCTGGAC CTAGAAGGATCCAC'I-I'GTCTCTCTAGTCGGCGTTATAATTGTTG GTCTACATTTAAGGACC'I-I'GATCCTTCGTCATGT"11~ AACGGCGTCTATGATAATATAAGCAAAAACGCCATA-~TAGTTTATTATTGTTGGATGTCGGATGCTATGTCTAAGGC~TCCACAT-~TGTATCA-~--~TGATT1200 TGGACTATTTTGGTTAAGAAATAATTGACTTGC GTAGTTTGCTCATATTTGTATTTTGTCACAAAATAAAATATTATTATCTTAGCGACTTCGGTTGTGT 1300 CGGATTACAATTACTGTTAATACATTCATGGACCGTAGTCCTTACAAGCTCATTCAACTGGGCCAAG GACATAGTTATATTTGATTGAGAGC GTGTTAGA1400 CCCACTTGTGATGCTGAATCACCTGGGTCCAAAA CACTTCCGCCTAACTGATGAAGATCTTTATACGGATGTAATGCGCTATGTCCTTGGTTGTCAG CAT 1500 CTGTGTGAGTGGTTCCTATGGTGCTTCTACAAGCCCAGGATTCACCTGGTTTTAATTCAATTGGGCCTGTAATGCCGAACC-•TGACATGGATGCTGACCT 1600 CAATGG'ITITCTCTCCCACCTGCCGTAGTCCACATGTGTAAAGTCCACATCGTTATGGGTGAACTGTTTCGATAAAATCTTCACCGTC GGAGCCCGGAAA1700 GGTATATCCACGGAGTG.•-••TAGCTGTGGACAACTTCAATTTCCCTTTGAACTTGGCAAAATGGGTGTTCTGATGTACGTTAGTATCGGAGACTCTGTAAT 181~ ATAGCTTCCAGGGTATGGGGTCCTTCAAGGAGAAGAAGGATGCTGAGAAATAATGGAGATCGATGTTACATCTTAGTGGAAATGTCCAAGAAGCTTGTAA1900 TGATTCATTGTCTGTCATTCGTTTGTCATGGATTTCCACTATGACCGACCCAGTGGCGT.•TATCGGAACTTGCTGTCTATACTCGATAACG CAATGGTCA20~ ATTTTCATACAGCTACGACTAAGTCTGGCAGCGTACTGCGACGCCGTTGACGGAAATTGAAGTATTATCTCCGTTAAGTCATGAGAGAGCTGATATTCAT 2100 CTCTATGTGACTCTATATAA-•-•.GAATGCGCTAGGAGGATTCGCCAACCATGAATCCATATATGAAAATTTGGCAGCGCACGTGAAGGCTTACGGAGTCTG ?_.2~ AATCTGGTAATAAGAAGCTATACCTAACAATGTTAATGGTAATGAAAATGACAAATTACTATrTGCTGAAAGAGTTCAAAAATAAATGCTTACTTAGTTA 2300 TTAAGATATTGCTATTAGCAGCAACAATATATGAGGAAACCGGTGAGGATGAAAGCAAAAGCGTCTTCAGAAGACAGAGCAGAAAGAATTGGTATGAATA2400 ATr'AAATGAACAGGCAGTGTCGTTATATAGAAGATCATTGTGTTTTAGAGAGAGAAAATITTGCAGTGG CATTI'GTGTAATATGGAGGGGTACACCGATTZ5~ GGAGCTCTTT~d~CCTGGGCTTA-~TGTAT~GGTGTATTGGTAGCCAATATATAGTATATGGGAGTTATCTAGGATCTTCGTACACGTGGA 2589 Fig. 1. Nucleotide sequence of the viral sense strand of both components of PHV (A and B). The underlined regions (30 bases) correspond to the consensus found in all bipartite geminiviruses. In both cases, nucleotide 1 corresponds to the first base of the 30-mer consensus. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13 I. Torres-Pacheco and others 2228 (a) ALl ~ L TYLCV-S TYLCV-I ACMV 5 ~ ARll ~" ~ BGMV-PR BGMV-GA SqLCV TGMV BDMV PYMV BGMV-BZ PHV AbMV R1 AL3 Fig. 2. Proposed genomic organization for PHV. Arrows represent ORFs in both orientations. CR represents the 180 base common region for both components which contains the 30 base stem-loop structure found in all bipartite geminiviruses. ORF coordinates are: AR1,230 to 982; ALl, 2513 to 1467; AL2, 1543 to 1130; AL3, 1380 to 985; AL5, 853 to 341; BR1,447 to 1214; BL1, 2158 to 1280. .[_~~ (b) T a b l e 1. Amino acid sequence similarities (%) between putative translation products o f P H V and selected geminiviruses* U I AbMV ACMV BCTV BDMV BGMV-PR BGMV-BZ BGMV-GA PYMV SqLCV TGMV TYLCV-I TYLCV-S AR1 ALl AL2 AL3 BR1 BL1 82 68 29 87 88 86 90 87 88 86 68 68 65 65 61 67 64 61 66 63 48 66 68 69 54 50 30 59 48 50 49 52 51 54 50 49 67 48 43 66 67 67 68 66 66 67 50 48 61 37 NA'~ 67 68 67 67 65 53 68 NA NA 63 46 NA 82 82 76 84 81 77 78J; NA NA * All values were calculated using the MacDNAsis program. Other programs (GeneWorks and UWGCG) gave similar results. ~ NA, Not available; TYLCV and BCTV do not have component B. ;~ For TGMV BL1 the sequence used was as reported by von Arnim & Stanley (1992) because the sequence originally reported by Hamilton et al. (1984) contained a mistake that produced a truncated BL1 protein. o b s e r v a t i o n s w e r e o b t a i n e d ( T a b l e 1 a n d Fig. 3 b). F i r s t , PHV presented the highest similarity with both isolates o f T Y L C V , S a r d i n i a , 6 9 % , a n d Israel, 6 8 % . T h e similarity to the WH geminivirus cluster was slightly l o w e r ( 6 1 % to 6 7 % ) . W h e n a n A L l p r o t e i n - b a s e d phylogeny tree was constructed, PHV clearly clustered w i t h t h e E H g e m i n i v i r u s e s ( A C M V a n d T Y L C V ) (Fig. 3 b). A l i g n m e n t o f A L 1 p r e d i c t e d p r o t e i n s s u p p o r t e d t h e c l a s s i f i c a t i o n d i s p l a y e d in t h e p h y l o g e n y tree. F i g . 4 s h o w s a d o m a i n o f 40 a m i n o a c i d s f o r s e v e r a l r e p l i c a s e s t h a t c o n t a i n s s o m e r e s i d u e s f o u n d in E H g e m i n i v i r u s e s ( A C M V , T Y L C V - I a n d T Y L C V - S ) a n d p r e s e n t also in P H V , b u t n o t f o u n d in a n y o t h e r W H g e m i n i v i r u s . I n a d d i t i o n to this d o m a i n , o t h e r r e s i d u e s c o n s e r v e d b e t w e e n P H V a n d E H g e m i n i v i r u s e s a r e V(218), C(241), TGMV BGMV-BZ BDMV PYMV BGMV-GA AbMV BCTV TYLCV-I ACMV PHV SqLCV Fig. 3. Phylogeny trees for several geminiviruses. (a) Tree based on pairwise homologies of the coat proteins. (b) Tree based on the ALl products (replicase). Protein sequences were obtained directly from databases as described in Methods. The AbMV ALl sequence was corrected as suggested by Gilbertson et al. (1993). The figures are not to the same scale. When the standard error value for a branch point overlaps the standard error value of a point below or above it, it is not possible to say which branched first. This was expressed as collapsed nodes that produced tri- and higher -chotomies. PHV TYLCV-S TYLCV-I ACMV BGMV-BZ PYMV BGMV-GA BGMYLPR BDMV TGMV SqLCV AbMV BCTV Consensus (255-294) (253-294) (253-292) (254-293) (256-297) (256-297) (256-297) (256-297) (256-297) (257-297) (252-293) (256-297) (256-297) AWYNVZDDZPPHYVK--HFK EFMGAQRDWQ SNCKYGKPI( 294 AWYNVZDDVDPHYLK--HFK EFMGAQRDWQ 5NTKYGKPI( 294 AWYNVTDDVDPHYLK--HFK EFMGAQRDWQ SNTKYGKPI( 292 AWYNVZDDVDPHYI.K--HFK EFM6SQRDI~ SNTIOfGKPV( 293 AEYNVZDDZAPHYLKLK}MK ELMGAQKDWQ SNCKYgKPV( 297 VEYNVTDDVAPQYLKLKHWK ELLGAQRDI~ SNCKYGKPV( 297 VEYNVZDDZS PNYLKLKI'~K ELIGAQY,DIVQ SNCXYGKPV( 297 VEYNVZDDZSPiCYLKLKHWK ELIGAQKDM~ SNCKYGKPV( 29 ? VEYNVZODVA PHYLKLKHWK ELZGAqK~ SNCK~GKtW( 297 VEYNVIDOV'r PQYLKLKHWK EL'rGAQRDWQ TNCKYGKPV( 297 VKYNVZDDVAPHYLKLK~K ELZGAQRDWQ SNCKYGKPV( 293 VEYNVIDDVA PHYLKLK~K ELLGAQKDWQ SNCKLAKPV( 297 VEYNVZDDVDPTYLK~KFmK HLIGAQKEWQ TNLKYGKPRV 297 **YNVIDD-- P-YLK**H*K E**GAQRDWQ SN*KYGKP*Q 48 Fig. 4. Alignment of a 40 amino acid domain of selected ALl proteins. A 40 amino acid domain from several ALl proteins was selected and aligned by computer. The consensus sequence is shown in the bottom line. The asterisks (*) mark residues conserved among PHV, ACMV and TYLCV and different to the WH consensus. Dashes (-) mark positions where more than three different amino acids can be found (highly variable). Letters indicate residues conserved in all geminiviruses. T(299), F(301) a n d P(307). A m o r e d e t a i l e d analysis, however, showed that the ALl protein also of PHV has s o m e r e s i d u e s f o u n d o n l y in W H g e m i n i v i r u s e s Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13 Complete nucleotide sequence of P H V [PEPW(177 to 180), N(191), C(275)]. Although the ALl protein comparison suggests a close relationship between PHV and EH geminiviruses, the analyses of the other ORFs as well as the fact that PHV ALl also possesses some WH characteristics do not allow a definitive overall clustering of PHV. These results propose PHV as a hybrid or transition geminivirus between the WH and EH types. Trees obtained using other software packages (e.g. PCGene CLUSTAL program) showed minor variations for the position of some geminiviruses inside the main clusters. However in all cases the clustering of PHV remained identical. In any case, the main purpose of the tree is to illustrate similarities between sequences encoded by a specific ORF. Implications for an evolutionary path will require more rigorous analysis. In contrast to the results found with the AR1 protein where the similarity between PHV and SqLCV was one of the highest (88 %), the similarity between products of the PHV ALl and SqLCV ALl ORFs was the lowest (48 %), even lower than the value obtained with equivalent proteins from BCTV (61%) (Stanley et al., 1986). When SqLCV was included in the ALl phylogeny tree it was found to stand by itself forming a separate branch, whereas in the AR1 phylogeny tree it was clearly clustered with the WH geminiviruses. BCTV, which is a subgroup II geminivirus transmitted by leafhoppers, was included in the comparisons used for the ALl tree as an internal control of the method, and clustered with the WH bipartite subgroup as reported previously (Howarth & Vandemark, 1989). Phylogeny trees based on similarities between products of ORFs AL2 and AL3 displayed similar results to the AR1 phylogeny tree. The clustering of PHV was similar in all three cases and the only difference was the degree of similarity that PHV showed with the rest (Table 1). Computer analysis revealed two possible initiation codons, in the same reading phase, for the PHV AL2 ORF. The first one generates a protein of 208 amino acids. The second initiation codon is found 210 bases downstream producing a protein with only 138 amino acids. Sequence and size comparisons with AL2 proteins from other geminiviruses suggested that the smaller protein is the actual product. Point mutation experiments are being carried out to dismiss the possibility of any initiation of translation at the first ATG codon. An unexpected fifth ORF (ORF AL5) was detected in component A. The nomenclature of AL5 (instead of AL4) was suggested to avoid confusion with the previously described A L 4 0 R F for other geminiviruses. This ORF is located entirely inside ORF AR1 (coat protein) but in the opposite orientation. At present it is not known whether this ORF is transcribed and translated. To verify whether this situation is unique for 2229 PHV or whether similar proteins could be found in other geminiviruses, the sequences of all previously reported geminiviruses were analysed. The survey included ORFs producing proteins smaller than 10000 Mr which has been a common size limit for geminiviral proteins. Several viruses (BGMV, BDMV and PYMV) had similar but smaller ORFs. Although the overall pairwise similarity was relatively low in all cases (50 %), computer alignments of the predicted AL5 proteins displayed some conserved regions. A genetic study is also underway to determine the importance of AL5. For the B L 1 0 R F , PHV showed the largest degree of similarity (84%) to BGMV-GA and the lowest to ACMV (42%). A similar situation, although with a lower degree of similarity, was found for BR1 where BGMV-PR, TGMV (68 %) and BDMV, BGMV-BZ, BGMV-GA (67 %) showed the higher degree of similarity to PHV and the lowest to ACMV (37%). Phylogeny trees placed all WH geminiviruses in one homogeneous cluster leaving only ACMV on an independent branch (data not shown). The AR1, ALl, BR1 and B L 1 0 R F s have typical eukaryotic promoters with CAAT and TATA boxes and a cap signal. The distances between TATA boxes and the cap signal are similar although not identical. The position of the CAAT box showed more divergence. Perhaps the nearby stem-loop structure (e.g. for ALl) could have some effect on the structure of the promoter. The position of the poly(A) signal was also different since the one for AR1 is virtually on the stop codon whereas BRI has its signal a few bases downstream. This might be a reflection of the fact that the AR1 stop codon overlaps with that of AL3 resulting in the absence of an intergenic region (see Fig. 2). Component B, on the other hand, presents an intergenic region of 59 bases. The poly(A) signal for ALl was not clearly identified. In addition, no typical regulatory sequences could be found for ORFs AL2, AL3 and AL5. The expression of these overlapping ORFs has yet to be determined. The overall analysis of the PHV genome showed it to be a WH geminivirus. However, the fact that the PHV ALl protein is more closely related to the equivalent proteins from ACMV and TYLCV suggests the possibility of some type of recombination between geminiviruses. There are several non-characterized geminiviruses in the same geographical area infecting pepper and tomato crops (Brown et al., 1986; Brown & Nelson, 1988, 1989; Brown & Poulos, 1990). It is possible that a virus similar to TYLCV, at the molecular level, might be present in the area. Mixed infections are common and several diseases have been reported to be caused by a mixture of geminiviruses (Brown et al., 1989; Brown & Nelson, 1989). The presence of several geminiviruses in the same plant provides the opportunity for recombination. This Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13 2230 L Torres-Pacheco and others opportunity is increased in this area since possible hosts for the viruses are found in all seasons. The relatively low degree of similarity (68 %) between PHV and TYLCV replicases suggests that this possible recombination event is not a recent one, and that the viruses have had enough time to diverge to the present level. Several experiments in which components of bipartite geminiviruses were exchanged (pseudorecombination) have been recently reported (Gilbertson et al., 1993; yon Arnim & Stanley, 1992). This pseudorecombination mechanism can now be added to the intermolecular recombination reported for ACMV (Etessami et al., 1989; Klinkenberg et at., 1989; Stanley & Townsend, 1986). Both mechanisms, independently or acting in a concerted manner, might be important in nature for the generation of new viruses. The unexpected clustering of SqLCV also supports a recombination hypothesis. SqLCV AR1 is highly similar to those of WH geminiviruses whereas SqLCV ALl is clustered completely on its own. An alternative explanation is simply that PHV, ACMV and TYLCV share a closer common ancestor and have diverged in adapting themselves to the surrounding environment (e.g. vectors and host plants in the respective geographical area). The characterization of more geminiviruses from the same area where PHV was isolated could provide new information about their phylogenetic relationships. That Mexico is a centre of diversity for both pepper and tomatoes might be of importance for the number of geminiviruses being found in this area. Nevertheless, the sudden outbreak of diseases caused by geminiviruses in the late 1980s still remains unexplained. We thank B. Jimenez for her generous help during the sequencing of PHV and A. Alvarez for critical reading of the manuscript. We would also like to thank the anonymous referees for their critical comments towards improvement of the manuscript. This research was supported by a grant from CONACYT-M6xico (1150-N9202) to R.F.R.-B. I.T.-P. also acknowledges fellowship support from CONACYT. References BROWN, J. K. & NELSON, M. R. (1988). Transmission, host range, and virus-vector relationships of chino del tomate virus, a whiteflytransmitted geminivirus from Sinaloa, M~xico. Plant Disease 72, 866-869. BROWN, J.K. & NELSON, M.R. (1989). Two whitefly-transmitted geminiviruses isolated from pepper affected with tigrd disease. (Abstract). Phytopathotogy 79, 908. BROWN, J. K. & POULOS, B. T. (1990). Serrano golden mosaic virus: a newly identified whitefly-transmitted geminivirus of peppers and tomato in the United States and Mexico. Plant Disease 7,1, 720. BROWN, J.K., GOLDSTEIN, D.E. & NELSON, M.R. (1986). Partial characterization of a geminivirus isolated from tomato with yellow leaf curl symptoms. (Abstract). Phytopathotogy 76, 842. BROWN, J.K., CAMPODONICO, O.P. & NELSON, M.R. (1989). A whitefly-transmitted geminivirus from peppers with tigr~ disease. (Abstract). Plant Disease 73, 610. COUTTS, R. H.A., COFFIN, R.S., ROBERTS, E . J . F . & HAMILTON, W. D. O. (1991). The nucleotide sequence of the infectious cloned DNA components of potato yellow mosaic virus. Journal of General Virology 72, 1515 1520. ELMER, J. E., BRAND, L., SUNTER, G., GARDINER, W. E., BISARO,D. M. & ROGERS, S.G. (1988). Genetic analysis of the tomato golden mosaic virus. II. The product of the ALl coding sequence is required for replication. Nucleic Acids Research 16, 7043-7060. ETESSAMI,P., WATTS, J. & STANLEY,J. (1989). Size reversion of African cassava mosaic virus coat protein gene deletion mutants during infection of Nicotiana benthamiana. Journal of General Virology 70, 277-289. FRISCttMUTH, T., ZIMMAT, G. & JESKE, H. (1990). The nucleotide sequence of abutilon mosaic virus reveals prokaryotic as well as eukaryotic features. Virology 178, 461~68. GARZON-TIZNADO, J. A., TORRES-PACHECO, I., ASCENCIO-IBANEZ,J. T., HERRERA-ESTm~LLA, L. & RWERA-BUSTAMANTE, R.F. (1993). Inoculation of peppers with infectious clones of a new geminivirus by a biolistic procedure. Phytopathology 83, 514~521. GILBERTSON, R.L., FARIA, J.C., HANSON, S.F., MORALES, F.J., AHLQUIST, P., MAXWELL,D. P. & RUSSELL, D. R. (1991). Cloning of the complete DNA genomes of four bean-infecting geminiviruses and determining their infectivity by electric discharge particle acceleration. Phytopathology 81,980-985. GILBERTSON,R. L., HIDAYAT, S. H., PAPLOMATAS,E. J., ROJAS, M. R., Hou, Y.-M. & MAXWELL, D.P. (1993). Pseudorecombination between infectious cloned DNA components of tomato mottle and bean dwarf mosaic geminiviruses. Journal of General Virology 74, 23-31. HAMILTON,W. D. O., STEIN,V. E., COUTTS,R. H. A. & BUCK, K. W. (1984). Complete nucleotide sequence of the infectious cloned DNA components of tomato golden mosaic virus : potential coding regions and regulatory sequences. EMBO Journal 3, 2197-2205. HIDAYAT, S. H., GILBERTSON, R. L., HANSON, S. F., MORALES, F.J., AHLQUIST, P., RUSSELL, D. R. & MAXWELL, D. P. (1993). Complete nucleotide sequence of the infectious cloned DNAs of bean dwarf mosaic geminivirus. Phytopathotogy 83, 181 187. HOWARTH, A. J. & VANDEMARK, G. J. (1989). Phylogeny of geminiviruses. Journal of General Virology 70, 271 22727. HOWARTH, A.J., CATON, J., BOSSERT, M. & GOODMAN, R. (1985). Nucleotide sequence of bean golden mosaic virus and a model for gene regulation in geminivirus. Proceedings of the National Academy of Sciences, U.S.A. 82, 3572-3576. KHEYR-POUR, A., BENDAHMANE, M., MATZEIT, V., ACCOTTO, G. P., CRESPI, S. & GRONENBORN, B. (199 l). Tomato yellow leaf cuff virus from Sardinia is a whitefly-transmitted monopartite geminivirus. Nucleic Acids' Research 19, 6763-6769. KLINKENBERG, F.A., ELLWOOD, S. & STANLEY, J. (1989). Fate of African cassava mosaic virus coat protein deletion mutants after agroinoculation. Journal of General Virology 70, 1837 1844. LAZAROWITZ, S. G. (1987). The molecular characterization of geminiviruses. Plant Molecular Biology Reporter 4, 177-192. LAZAROWITZ, S. G. (1992). Geminiviruses: genome structure and gene function. Critical Reviews in Plant Sciences 11, 322349. LAZAROWlTZ, S. G. & LAZDINS, I. B. (1991). Infectivity and complete nucleotide sequence of the cloned genomic components of a bipartite squash leaf curl gelninivirus with a broad host range phenotype. Virology 180, 58 69. MURAYAMA, A., YAMAYA,Y. & IKEGAMI,M. (1991). Identification of the coat protein gene of bean golden mosaic geminivirus. Intervirology 32, 258 261. NAVOT, N., PICHERSKY, E., ZEIDAN, M., ZAMIR, D. & CZOSNEK, H. (1991). Tomato yellow leaf curl virus: a whitefly-transmitted geminivirus with a single genomic component. Virology 185, 151-161. NEI, M. (1987). Molecular Evolutionary Genetics. New York: Columbia University Press. SAMBROOK, J., FRITSCH, E.F. & MANIATIS, T. (1989). Molecular Cloning: A Laboratory Manual, 2nd edn. New York: Cold Spring Harbor Laboratory. STANLEY, J. & GAY, M. R. (1986). Nucleotide sequence of cassava latent virus. Nature, London 301, 260-262. STANLEY, J. & TOWNSEND, R. (1986). Infectious mutants of cassava latent virus generated in vivo from intact recombinant DNA clones Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13 Complete nucleotide sequence o f P H V containing single copies of the genome. Nucleic Acids Research 14, 5981 5998. STANLEY, J., MARKHAM, P. G., CALLIS, R. J. & PINNER, M. S. (1986). The nucleotide sequence of an infectious clone of the geminivirus beet curly top virus. EMBO Journal 5, 1761-1767. SLrNTER, G. & BISARO,D. M. (1991). Transactivation in a geminivirus: AL2 gene product is needed for coat protein expression. Virology 180, 416-419. SUNTER, G., HARTITZ, M.O., HORMUZDI, S.G., BROUGH, C.L. & 2231 BISARO, D.M. (1990). Genetic analysis of tomato golden mosaic virus: ORF AL2 is required for coat protein accumulation while ORF AL3 is necessary for efficient DNA replication. Virology 179, 69-77. VON ARNIM, A. t~; STANLEY,J. (1992). Determinants of tomato golden mosaic virus symptom development located on DNA B. Virology 186, 286-293. (Received 15 March 1993; Accepted 8 June 1993) Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Sun, 18 Jun 2017 14:44:13
© Copyright 2026 Paperzz