Journal of General Virology (1993), 74, 2437-2443. Printed in Great Britain 2437 Nucleotide sequence evidence for the occurrence of three distinct whitefly-transmitted geminiviruses in cassava Y. G. H o n g , 1 D. J. Robinson I and B. D. Harrison 2. 1 Scottish Crop Research Institute, Invergowrie, Dundee DD2 5DA and 2Department of Biological Sciences, University of Dundee, Dundee DD1 4HN, U.K. The complete nucleotide sequence of the D N A of Indian cassava mosaic virus (ICMV) and a key part of that of a group B isolate of African cassava mosaic virus from Malawi (ACMV-M) were determined and compared at the nucleotide and encoded amino acid levels with the published sequences of an ACMV group A isolate (ACMV-K) and other whitefly-transmitted geminiviruses (WTGs). The D N A of ICMV consists of two circular single-stranded molecules, DNA-A [2815 nucleotides (nt)] and DNA-B (2645 nt), which differ substantially in sequence from the genome components of ACMV-K (DNA-A 70 %, DNA-B 47 % sequence identity) and other WTGs. ICMV DNA-A contains eight open reading frames (ORFs) encoding proteins of > 100 amino acid residues, of which four ORFs (one genome sense, three complementary sense) are comparable to those of other WTGs. DNA-B contains one O R F in each sense, as in other WTGs. None of the putative viral proteins are more similar in amino acid sequence to the proteins of A C M V - K than to those of another WTG. The coat protein of ACMV-M is more like that of tomato yellow leaf curl virus from Sardinia (86% sequence identity) than those of ICMV or ACMV-K. The intergenic regions of ACMV-K, ACMV-M and ICMV DNAs differ in size, and largely in sequence, except for two 30 to 40 nt sequences which are also conserved in other W T G s and can form stem-loop structures. The intergenic region of ICMV D N A contains three copies of a 41 nt sequence, and that of ACMV-M D N A contains an imperfect repeat of a 34 nt sequence which resembles the repeated sequence in ICMV DNA. The differences between ACMV-K, ACMV-M and ICMV are considered great enough to justify their separation as isolates of three distinct W T G s : African cassava mosaic virus, East African cassava mosaic virus and Indian cassava mosaic virus. Introduction isolates could be assigned to three discrete clusters on the basis of their patterns of reaction (Harrison & Robinson, 1988). Evidence for these groupings has been strengthened by serological tests on virus isolates from cassava in 11 additional countries in tropical Africa or the Indian subcontinent (Harrison et al., 1991b; M. M. Swanson & B. D. Harrison, unpublished). Group A isolates react with at least 15 of the MAbs and occur in West Africa, Burundi, Chad and Uganda, and in the western parts of Kenya and Tanzania. Group B isolates react with five to nine of the MAbs and are found in Malawi, Madagascar, Zimbabwe and the eastern parts of Kenya and Tanzania. Group C isolates occur in India and Sri Lanka. They react with only two or three of the MAbs and are considered to represent a separate geminivirus, which was named Indian cassava mosaic virus (ICMV; Harrison et al., 1991b). These groupings are associated with biological differences. For example, whereas group A isolates are readily transmitted to, and maintained in, Nicotiana benthamiana, in which they accumulate best at approx. Cassava mosaic, the most economically important virus disease of a root crop in Africa, is caused by infection with a whitefly-transmitted geminivirus (WTG), African cassava mosaic virus (ACMV; Bock & Woods, 1983). A similar disease occurring in southern India is likewise associated with a WTG, particles o f which are serologically related to those of ACMV (Bock & Harrison, 1985). However, serological relationships among different W T G s are common, as shown by tests with polyclonal antisera (Roberts et al., 1984) or monoclonal antibodies (MAbs; Harrison et al., 1991a; Swanson et al., 1992a, b). Tests on 87 virus isolates from mosaicaffected cassava in 10 countries with 17 MAbs to the West Kenyan type strain of ACMV showed that the The ICMV nucleotide sequences presented in this paper have been submitted to the EMBL database and assigned the accessionnumbers Z24758 (DNA-A) and Z24759 (DNA-B). 0001-1764 © 1993 SGM Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13 2438 Y. G. H o n g , D. J. R o b i n s o n a n d B. D. H a r r i s o n 23 °C, g r o u p B isolates (such as A C M V - C ) are transmitted to N . b e n t h a m i a n a with difficulty a n d only rarely at t e m p e r a t u r e s below 30 °C, a n d they are h a r d or impossible to m a i n t a i n i n this species. Also, they a t t a i n only low c o n c e n t r a t i o n s in N . benthamiana, even at 30 °C. G r o u p C isolates can be m a i n t a i n e d in N. b e n t h a m i a n a b u t m u l t i p l y best at 30 °C ( R o b i n s o n et al., 1984; H a r r i s o n et al., 1987). The groupings are also s u p p o r t e d by the results of nucleic acid h y b r i d i z a t i o n tests with probes for the two circular s s D N A molecules which m a k e u p the viral genome. T h u s whereas a D N A p r o b e for D N A - A ( D N A - 1 ) o f the g r o u p A type isolate of A C M V (from western K e n y a , d e n o t e d A C M V - K ) detected all isolates tested, a full-length p r o b e for D N A B ( D N A - 2 ) reacted only weakly, if at all, with the D N A of isolates in groups B or C ( R o b i n s o n et al., 1984; H a r r i s o n et al., 1987). The D N A - A p r o b e also detected other W T G s whereas the D N A - B p r o b e did n o t (Roberts et al., 1984). W e have n o w explored the relationships a m o n g the virus isolates f r o m cassava by nucleotide sequence c o m p a r i s o n s . T h e sequences of the two D N A species of A C M V - K (group A) were d e t e r m i n e d b y Stanley & G a y (1983). W e have d e t e r m i n e d the complete sequences of the two D N A species o f a n isolate of I C M V , together with key regions of the g e n o m e of a g r o u p B isolate o f A C M V from M a l a w i ( A C M V - M ) . T h e results provide further evidence o f the distinctness of the three isolates, a n d show that the differences a m o n g t h e m are c o m p a r a b l e in degree to those a m o n g different geminiviruses. They also show that the nucleotide sequences o f I C M V a n d A C M V - M c o n t a i n previously u n d e s c r i b e d repeats in the n o n - c o d i n g region which occurs in b o t h D N A - A a n d D N A - B . Methods Virus sources. The stock culture of ICMV was obtained from a vegetativelypropagated cassava plant derived from a stake collected in 1986 at Trivandrum, Kerala, India. The virus was transmitted by inoculation of sap to N. benthamiana and maintained in this host by mechanical inoculation. ACMV isolate M (ACMV-M) was from a cassava plant collected at Jalawe, Rumphi, Malawi, and propagated vegetatively in the glasshouse at Dundee. The viruses were kept under licence from the Scottish Office Agriculture and Fisheries Department. Extraction o f I C M V DNA. ICMV ssDNA was extracted as described by Robinson et al. (1984) from purified virus particles (Sequeira & Harrison, 1982). ICMV dsDNA was extracted from infected N. benthamiana leaves and further purified by CsCI gradient centrifugation (Sambrook et al., 1989). Preparation of total DNA from cassava infected with ACMV-M. About 50 mg cassava leaf tissue was ground with a pestle and mortar, and 250 gl of ice-coldextraction buffer (50 mM-Tris-HC1pH 8.0, 5 mMEDTA, 0"1% BSA, 1"25M-NaC1, 0.1% 2-mercaptoethanol) and 25 gl 10 % SDS were added. After incubation at 65 °C for 5 min, the mixture was extracted once each with phenol :chloroform (4:1, v/v) and with chloroform. Sodium acetate (3 M, pH 5"2, 0'1 vol.) and two volumes of ethanol were added to the aqueous phase, which was kept at - 7 0 °C for 20 rain. DNA was pelleted by centrifugation at 12000g for 10 rain. The DNA was washed with 70% ethanol, dried in a vacuum and redissolved in 200 gl of TE (10 mN-Tris HCI, 1 mM-EDTA, pH 8.0). Cloning o f l C M V DNA. DNA extracted from ICMV particles was made double-stranded with the Klenow fragment of Escherichia coli DNA polymerase I, using as primer an oligonucleotide [5' d(GTAATATTA) 3'] which is complementary to a sequence that is conserved within the intergenic region of all geminivirusessequenced so far. The products were cut with EcoRI and cloned in pUC19. A part of ICMV DNA-B not represented among these clones was synthesized by PCR, using primers deduced from the adjacent sequences, and was cloned in pUC19 after digestion with restriction enzymes. Full-length DNA clones of both DNA-A and DNA-B were also constructed by inserting dsDNA, purified from infected plants, in the BglII and BamHI sites, respectively, of pUC19. PCR and cloning of the coat protein gene and common region of ACMV-M. A virus-specific DNA fragment was amplified from total DNA from infected plant tissue by PCR using two degenerate primers whose sequences were derived from conserved regions in published sequences of WTG genomes. The 20-mer, P1 (5' d[GG(C/T)CACGA(T/C/A)TTAAT(T/G)AG(G/A)GAI3'), corresponded to residues 397 to 416, and the 24-mer,P2 (5' d[GTTGAT(C/T)ATGTATTGT(A/ T)(T/C)(G/C/A)ATGTA] 3"), to residues 1276 to 1299 in ACMV-K DNA-A (Stanley & Gay, 1983). Reaction mixtures (100 gl) contained 1 to 2 pg total DNA, 75 pmol each of P1 and P2, 100 gM of each dNTP, 2.5 mM-MgCI~,50 mM-KC1, 0.1 mg/ml gelatin, 10 mM-Tris HC1 pH 8.0 and 2-5 units of Taq DNA polymerase (Cambio). The reaction mixture was overlaid with 50 gl of light oil, and PCR was initiated with 1 cycle of 94 °C for 2 rain, 54 °C for 1.5 rain, 72 °C for 2 min, followed by 30 cycles of 94 °C for 30 s, 54 °C for 1 min and 72 °C for 1.5 min, and finally by 1 cycle of 72 °C for 5 min, in a Cambio Intelligent Heating Block. After electrophoresis in a 1.0% LMT agarose (Gibco BRL) gel, the PCR product was recovered by extraction with phenol--chloroform, then treated successively with proteinase K and T4 DNA polymerase (Sambrook et al., 1989), and cloned into the HincII site of pUC19. DNA sequencing. Nucleotide sequences were determined by the dideoxynucleotide chain termination method (Sanger et al., 1977) which used Klenow fragment polymerase (Pharmacia) or Sequenase (US Biochemicals). Single-stranded DNA, obtained from subclones in M 13rap18 or M 13rap19, and double-stranded plasmid DNA were used as templates. Compressions in some sequences were resolved by replacing dGTP with 7-deaza-dGTP in the sequencing reaction. Data analysis. Sequences were aligned and analysed using the UWGCG computer programs (Devereux et al., 1984). Sequencesof the followinggeminiviruseswere used in the comparisons: abutilon mosaic (AbMV; Frischmuth et al., 1990), ACMV-K (Stanley & Gay, 1983), ACMV Nigerian isolate (ACMV-N; Morris et al., 1990), bean golden mosaic (BGMV; Howarth et al., 1985), potato yellowmosaic (PYMV; Coutts et al., 1991), squash leaf curl (SqLCV; Lazarowitz & Lazdins, 1991), tomato golden mosaic (TGMV; Hamilton et al., 1984), tomato leaf curl-Australia (TLCV-A; Dry et al., 1993), tomato yellowleaf curlIsrael (TYLCV-I; Navot et al., 1991) and TYLCV-Sardinia (TYLCVS; Kheyr-Pour et al., 1992) viruses. Results Organization of the ICMV genorne I C M V was f o u n d to have two species of circular s s D N A , D N A - A a n d D N A - B . Each species was completely sequenced in b o t h directions. D N A - A was sequenced Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13 Three geminiviruses from cassava (a) ,- * ,kTA ~ 340 ~ ~ . r c N A = ~ X It P a J~ D = n Z Z e = T P c a c B K V ~ ]~ X R , L N ~ Y D J P Y = 400 J It 400 x x v e T v n v ~ x Q s w ~ e e e z s x x e e w Y e x Y n e e o v p s o c s o p C K VQ v K g Z D F G B V V N N F D W ~ y A J I~ N A CA ~ T y K N J p Z J J H Z L g K Z WNp Q & I N & L e V lq V Y D V V H R A T SN I G X VlIC ~ Z X ~ S ~'~ NX P ~ T A T V K I ~ N E R le $e ~q V ~ X Z N R I~ y V ~ V Y y ~ fir ~ D (~ V V T levx J R Q X r l, ~v D R Y Q V O K Jt G X * ~ N ' ~ ' A O A A ~ A ' e ~ A z Q KN N Z z fl C Z v Ni I • fl P R V X Y ~ Y l P l~ g lq C G A Jt -- :--- L p C oN Q T H ~ P o t ~ P V D Q • R Y C x • O K W N A ~ V ~ Q Y B ~g I~ T I[ N A OQ L ~ L Y Y ~ N L 1~20 X I L W S O • v : ~ v 'r z ~ D 0 N T Z .T, 1~ o Y R J D ~ Y y ~ L x J V I " / ~ A q ' N ~ ~ k ' I ~ C ' G • CA Q X X L V P D N v V B 1(10 Q Z R X K F R W X 0 Z O R L H K B M O ,~ R ~ A P V A A AE ~ R D 0 p VQ ~0 I N Pi, ~'t 0B ~ ~' fJ B L P ~' p N E y (b) 5 ~ W y L X T Lo R Z I " ~'t'Xq'A,k p A G l d It i ~ R dR y R dt K D Q F G p D p T • H N ~ N L VN Z J le T O T V ~ • K Y B A T K PL P JY P O L F O I~ A o 0 D ~ Z ~ VII ~ J8 k g 3F p ~ Q P L ~ : ¥ • ~ Vtl' ~ J v N Z I~ ~ T JC 1~ : ~ O - ~ ~ T "4 K HA n K , II V O X ~ G , O • ~ w II ~ p V R]P y K dr J ° N m • • C ~ - ~ L ~ I( , ~ e V B ~ . y T K ~ Y N x x x a A y ~r p I~ • T r r pit n It X L A R It N .~ F G O D ~ A R ~. ~' 5" Y J~ T y ~ • X PV X T]e L dl O ~ ~ V G D Y • D • S P if L O L L F T C V LV Jt D K Z N A B E DR P TV G P Z 0D L~Y R R F O , ~ O R Y P Z W T g F K D D G O ~, IB D V J • 0 L | ]i( Y C K Y V T t Y r C Z A X ~ c ~ v T ~ S Z Z D ,'L~'.A,r . ,z.~',Je,,~~ N ~ D , ~ __ Y A D L '~ ~ CAq,CC"Z,'pCAJ k L ~ & HO M ' V L 120 K it ' '/ ~ ~ z ~ ' A 0 D ~ k 't'~A'rk OOGAT 1080 1200 JI lOOO ~ 2400 2520 2645 Fig. 1. Nucleotide and encoded amino acid sequences of I C M V D N A - A (a) and ICMV DNA-B (b). Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13 2439 2440 Y. G. Hong, D. J. Robinson and B. D. Harrison 2815/1 (((~ A A+ ~ A R 0 ALlt(I 2000"~~/AR1 A-)1 2000"v~ ~_~/,./~ lOOO £ AL5 AL4 -1ooo AL3 2645/1 2645/1 2000 shown in black (Fig. 2). The amino acid sequences of the putative proteins encoded by these ORFs are shown in Fig. 1. 2815/1 R IBR 1 1000 2000 ~ ~ 0 0 0 Fig. 2. Arrangement of ORFs in ICMV DNA-A (2815 nt) and DNAB (2645nt) shown in the genome (+) and complementary( - ) senses. ORFs in black (ALl, AL2, AL3, AR1, BL1 and BR1) are conservedin most, and those in grey (AL0, AL4, AL5 and AR0) are conservedin a few other WTGs. Potential promoter sequences (V), polyadenylation signals (V) and the common region (CR) are also indicated. from three contiguous cDNA clones bounded by EcoRI sites. Sequences at the junctions were confirmed by sequencing appropriate fragments of a full-length clone of DNA-A. More than half of DNA-B was sequenced from two cDNA clones which abutted at an EcoRI site. The sequences at the junction, and of the gap remaining, were determined from clones of the products obtained by PCR using primers based on the known sequences. Subsequently the part of a full-length-clone that overlapped both ends of the larger cDNA clone was sequenced. The sequences from the full-length clone and of the PCR-derived product differed in only four out of 1630 nucleotides, and only three of these differences resulted in an amino acid change. Where such nucleotide differences occurred, the version occurring in the fulllength clone is given precedence. The final sequences [DNA-A, 2815 nucleotides (nt); DNA-B, 2645 nt] are shown in Fig. 1. As in other WTGs, the sequences of DNA-A and DNA-B are different except for the common (intergenic) region (labelled CR in Fig. 2), which is shared. The first nucleotide in this common region is defined as number 1 in each sequence. In total, the two DNA species contain 10 open reading frames (ORFs), encoding polypeptides of more than 100 amino acid residues, in either the genome ( + ) sense or complementary ( - ) sense strand (Fig. 2). The ORFs shown in black in Fig. 2 are found in several other WTGs, and those in dark grey are shared with a few other WTGs (such as ACMV-K). Potential promoter sequences (TATA or CAAT boxes) and polyadenylation signals (A/GATAA) are available for all the ORFs Comparison of DNA sequences of I C M V and other geminiviruses When the nucleotide sequences are compared with those of other geminiviruses, such as ICMV, ACMV-K, BGMV from Puerto Rico and TYLCV-S (which lacks a DNA-B), the sequence of ICMV DNA-A is equally similar (70 %) to the comparable genome components of ACMV-K and TYLCV-S, but is less similar to BGMV DNA-A (62%), especially at nucleotides 1 to 500 but also at other points in the sequence. As with other WTGs, much less similarity is found among the DNA-B sequences: ICMV and ACMV-K, 47 %, and ICMV and BGMV, 46%. Moreover, no part of DNA-B is well conserved although the region at about nucleotides 1800 to 2000 is somewhat more conserved than the rest of the sequence. Similarities of I C M V gene products with those of other whitefly-transmitted geminiviruses The putative products of the six ORFs (black in Fig. 2) found in ICMV and 10 other WTGs were compared. Table 1 shows the percentage amino acid sequence identities obtained in these comparisons. Among the proteins encoded by ICMV DNA, those derived from genes AR1 and ALl are the most strongly conserved when compared with the proteins of other WTGs. The proteins encoded by the ICMV genes are most similar to those of TYLCV-I (AR1), TLCV-A (ALl, AL2, AL3), SqLCV (BR1) and TGMV (BL1). Thus none of the encoded proteins is more similar to those of ACMV-K than to the proteins of another WTG. As expected from comparisons of the nucleotide sequences, the products of the ORFs in DNA-B are much less conserved between ICMV and other WTGs than those of the DNA-A ORFs (Table 1). Also, the amino acid sequence identities obtained by comparing ICMV with other WTGs were all markedly less than those obtained by comparing ACMVK and ACMV-N (93 to 97% for different ORFs). In addition, it was found that although TLCV-A, TYLCVI and TYLCV-S have only one genome segment (Navot et al., 1991; Kheyr-Pour et al., 1992; Dry et al., 1993), the proteins encoded are as similar to those of ICMV and other WTGs with bipartite genomes as are the proteins of many of the bipartite genome viruses to one another. In general, the data summarized in Table 1 indicate that ICMV is substantially different from other WTGs and support the separation of ICMV and ACMVK/ACMV-N as two distinct viruses. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13 Three geminiviruses from cassava Table 1. Percentage amino acid sequence identity of putative products of ORFs of some whitefly-transmitted geminiviruses compared with I C M V Viruses compared ORF ICMV/other ICMV/ACMV-K ACMV-K/ACMV-N WTGs*(range) AR1 ALl AL2 AL3 BR1 BL1 78 73 59 63 29 42 96 97 94 93 93 97 74-79 52-81 4%70 49-68 2%33 3646 * AbMV, BGMV, PYMV, SqLCV, TGMV, TLCV-A, TYLCV-I and TYLCV-S. 1 ACMV -M . . 5~ K'RPGD I I I STPVS . . KVRRRLNFD~ 50 p~I~IRVVAPTVRVT* RS KLWTNRPM AC~/V-K ............. G ............. R..AT .... H. ~N.KRA.V ACMV-N ............. G ............. R..AT .... H..N.KRA.M ICMV ..... A ....... G ............. SS.AAV 51 ACMV-M . . YRKPKMYRMYRS AC~4V- K ACMV-N . .... T ......... I.R .......... .... M ......... I.R .......... N...RW A(3WIV - M GI TRRVGKRWCVKS I01 . . . F ........ L. ICKVI..,T..P F ........ L. ICKVI.. 150 KKQNRTNRVMFFLVRDRRPYGPS I .......... ACMV-N .L ......... I ...................... IC~V .L .......................... 151 . . ,T° .I . .L ......... ACMV- ,T..P F.S.H..V.I.K.M.I.. I YI LGKIWMDENT L..T ........ N.I.Y.L ....... NA N...Y.L ...... IT'/" TK .... S ........... . I i00 A(~MV - K ACMV-M ..... YEQRDDVKHTGMVRCVSDVARGS ..................... VDK* . 200 PQDFGQVFNMFDNEPSTATVENDLRDRy~/LRKFYATVVGGPSGMKEQAL K ...... ACMV-N ...... I(I~V ..... I ............ I ....... F ...... I ............ IE ...... F ...... E ................ 201 Af3vIV - M . . MH ......... . H ...... H...I LYC ...... ........... WH...T..QYAS . K .... Y.L.H..T ACMV- N .... Y.L.H..T IC~V .R .... V..Y ACMV-M YFYDAVTN ACMV-K .... SIG. AC~V-N .... SIG. ICMV .... S.S. ..... 250 VKRFFRI~QEQAKI'ENHTENALLLY~%CTHAS~TLK ACMV- IR I ..... AG ............................... ..... AG ............................... .... Q..AG .......... (82 %) and ICMV (80 %). Of the viruses listed in Table l, PYMV has the coat protein with the least similar sequence (71% identity) to that of ACMV-M. Comparison of the sequences of the four W T G isolates from cassava shows that amino acid changes among them occur at many points in the coat protein, except that there are very few changes between the sequences of A C M V - K and A C M V - N (Fig. 3). The sequence data therefore support the serological data (Harrison et al., 1991 b) in distinguishing the ACMV group B isolate from group A and ICMV isolates. All four coat protein ORFs, like those of several other WTGs, end with two stop codons, TAATAA. Nucleotide sequences of I C M V and A C M V - M common regions .... .... K.*QS . PDVPKGCEGPCKVQS ICf4V ...... 2441 M .................... 258 Fig. 3. Differencesamong the amino acid sequencesof the coat proteins of ACMV-M, ACMV-K, ACMV-N and ICMV. The asterisks mark gaps inserted to maintain alignment. Coat protein sequence of A C M V - M A cloned 1600 nt fragment of ACMV-M DNA, which contained both the coat protein gene (ORF AR1) and the intergenic region, was sequenced in both directions and each nucleotide was determined at least four times. The deduced amino acid sequence of ACMV-M coat protein (Fig. 3) is similar in size (257 residues) to the sequences of the coat proteins of ICMV (256 residues), and A C M V - K and ACMV-N (both 258 residues). However, among WTGs, the ACMV-M sequence is most similar to the comparable sequences of TYLCV-S (86 % identity) and TYLCV-I (85 %), and somewhat less similar to the sequences of ACMV-N (83 %), A C M V - K Another indication of the extent of relationship between two WTGs can be obtained by comparing the nucleotide sequences of the common regions in their DNA. In different geminiviruses these sequences are substantially different, apart from a sequence of about 30 nt that is capable of forming the stem-loop structure which is conserved in all geminiviruses. Among the cassava geminiviruses, the common region differs considerably in length. In A C M V - K it is 196 nt whereas DNA-A and DNA-B of ICMV share a stretch of 404 nt (Fig. 4) with only four differences. In ACMVM, however, the common region (defined as the sequence between O R F A L l and AR0; see Fig. 2) is about 225 nt. Although some regions of similarity occur outside the stem-loop sequence, the sequences of the common regions of the three isolates are substantially different (Fig. 4). Nevertheless the sequence similarities of the common regions of the cassava geminiviruses (ACMVK / I C M V 64%, A C M V - K / A C M V - M 63%, ACMVM / I C M V 59 %) are mostly somewhat greater than those between any o f the cassava viruses on the one hand and other WTGs, such as BGMV, T G M V and TYLCV-S, on the other (44 to 59 %). The differences in size of the common regions of the cassava geminivirus DNAs are associated with the occurrence of repeated sequences. In ICMV, both DNAA and DNA-B have a 41 nt sequence which occurs three times with only two nucleotide changes (Fig. 4). Similarly, in ACMV-M, a 34 nt sequence is repeated with only five changes (Fig. 4). The common regions of all previously sequenced geminivirus DNAs contain a sequence predicted to form a stable stem-loop structure. The equivalent structure, including the sequence T A A T A T T A C in its loop, is also found in ICMV and ACMV-M ( - 8 4 - 8 k J/tool; Fig. 4). In addition, formation of another such structure in the D N A of both viruses and of A C M V - K can be predicted Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13 2442 Y. G. Hong, D. J. Robinson and B. D. Harrison Af~IV - K (A) 1 . . . . ...... CTC.AACTAGAGACACTCTTGAGCATCTCCTCCTATTAATTC-GAG II I ITITIII I 11 11111 I IIIIIill A C I M V - M (A) ATG CGAGCGAATTC-GAGACACCTCGGTTGATGTCCTCTACTGAATTq~AG IC~MV (A) TTCTCTCTC A(3MV- K (A) 51 ACATTATATAGGTGTCT A C I M V - M (A) ACAATATATAGTTGTCT IC~V(A) T CAATATATACATGAGTAC CAAA~TAGATGTAAATAATGGAA i01 . . ...... TAA~TTAAAAGG . A , C ~ V - K (A) ACMV-M(A) TTTTCAAAATTTGAACCAAAA i I 11 II[I li CTCAATCGGTACTCAGATAGTTrAGCC [ II C C CATATTAGGTAC i00 . .C T A A A T G G C A T T C T T G T A A T A A G ~ C T r Ill IIIIII1 IIIII I fill11111 IIIIIIIII 50 i 11111 I I I 1 •C C A A A T G G C A T A A T G G T A A T T A T G T A G A T C T A II I ~IIlllI[lIIl IIII II I .... . CTCAAAA~CAGAACACCCAAGGGG 150 I[llllIl fill II t lIIIllll I(3MV (A) ... A T A T A A ~ T r C A A A A G C A C M V - K (A) 151 CCAACCGTATAATATTAC .G C II . . . . . . . . . . . . . . . . . . . . . . . . GG Itl 11 I1 ........................ CGGTTC-GCC CCGCCCCC GG 200 ............... llI lllllllliIIIIIIII llIII llIII . ~ C M V - M (A) C CATC CGTATAATATTAC IC~/V (A) CCATCCGTATAATATTACCGGA~CCCCCC'~CTTTATGGTGGTC C GGR TGGCt"GCGCCCGRAARA G R R . G~'~gGA . A C M V - K (A) 201 ............................. ACMV-M . .CCC~CTAGGRTGOCCO~GRARTGTGGTCCt"61~CC_~C [llIIIlllllllI[llHllllilIlllilll I III[l . TTTAATGTGGTC . 250 CCCGCGCAC llllllllIlllll]lll (A) IIIIII [[[IIIIIIIIIII I [IIIIIIII I ICSIV (A) CCCCCCAOgTGGA~gGcogCGCCCCCCTt~'TTATGGTGGTC~CCCCC~Og ACI~/V- K (A) 251 TACT~ 300 I If A C I M V - M (A) %~fGTT I C ~ V (A) TGGA TGGCCGCGCCCCCCG 301 . . . . C CCCCACTCAGAACGCTCGCTCAAAGCTTGTATATGTGTGGTCCCCCTTT 350 I C M V (A) 351 400 I C M V (A) I%AGTACTTC-GGCAACAAGTCGR~C-CCTGTACAATGTGGGACC~-Fx'~'ACTA IC~4V (A) 401 AACGA~Ff ill ~'~-~u ~ ~ CG2~-aAGATGT 435 CCCTGAGTCT~TCCACGGTTTCCGGTG Fig. 4• Comparisonof the nucleotidesequencesof the common regions of ACMV-K DNA-A [ACMV-K(A)], ACMV-M DNA-A [ACMVM(A)] and ICMV DNA-A [ICMV(A)]. The sequences are aligned to show identities. The conserved sequences that can form the two stem-loop structures are underlined. The sequences that occur three times in ICMV DNA, and twicein ACMV-M DNA, are shown in bold italics• (Fig• 4). This contains the sequence T A T A T A in the loop and is somewhat less stable (--42.4 to - 5 4 . 6 kJ/mol). Discussion The genome organization of ICMV is typical for WTGs with bipartite genomes, and contains six ORFs that are equivalent to those of other viruses of this kind. In addition it contains two ORFs (AR0 and AL0) that are found both in A C M V - K and in the monopartite genome WTG, TYLCV-S, but do not occur in all WTGs. Despite these general similarities, the nucleotide sequences o f ICMV D N A - A and DNA-B, respectively, are only 62 to 70 % and < 50 % similar to the sequences of A C M V - K and other WTGs. Also, the putative gene products of ICMV are substantially different (29 to 81% amino acid identity) from those of other WTGs. These differences, together with the large and consistent differences between the epitope profiles of their particles (Harrison & Robinson, 1988), and the biological differences already mentioned (Harrison et al., 1987), support the view that ICMV and A C M V - K are distinct WTGs. Their distinctness is further emphasized by the inability of ICMV and A C M V - K to form viable pseudo-recombinants by reassortment of their genome segments (J. Stanley, personal communication), and by the remarkably small amount of variation in epitope profile found e i t h e r among ACMV isolates from West Africa or among ICMV isolates from India (Harrison & Robinson, 1988)• All these comparisons indicate that ICMV and ACMVK are consistently different, and justify the status of two distinct viruses. Among WTGs in general, it is becoming increasingly difficult to decide whether two virus isolates are best considered strains of the same virus or distinct viruses. The problem is exacerbated by the difficulty in establishing the host ranges o f the many non-sap-transmissible WTGs. These host ranges must perforce be determined by inoculation with virus-carrying vector whiteflies and can be artificially restricted by the feeding preferences of the vector, with the result that apparently non-overlapping host ranges can in some instances be shown, by more exhaustive work (such as tests by agroinoculation), to overlap or even to coincide• Also, comparison of the amino acid sequences of W T G proteins (Table 1) shows that great similarities can exist between viruses that are biologically very different. One pointer to the status of the relationship between two W T G isolates is whether or not they can form pseudo-recombinants. Another seems to be provided by comparison of the nucleotide sequences of their common (intergenic) regions. Although this sequence typically includes two regions which can form stem-loop structures, the loops of which are strongly conserved, other regions differ greatly in different WTGs. When judged on the size or sequence of their common regions, both ICMV and ACMV-M are substantially more different from each other and from A C M V - K than would be expected for strains of the same virus, reinforcing the conclusions drawn from comparisons of the putative viral polypeptides. G o o d taxonomic grounds therefore exist for giving them different names, and we suggest that isolates closely similar to ACMV-M should be called East African cassava mosaic virus (EACMV). However, although it is important for virologists and diagnosticians to distinguish between ACMV, EACMV and ICMV, the value o f this distinction to those concerned with cassava as a crop is uncertain• This will depend on whether the three viruses differ in other properties, such as relative transmissibility by different whitefly biotypes or species, interaction with virus Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13 Three geminiviruses from cassava resistance genes in cassava or occurrence in plant species other than cassava. Accepting that the function o f some viral gene products, and possibly of some viral non-coding nucleotide sequences, necessitates interactions with host components, the occurrence of related viruses in the same plant species provides an opportunity to assess the evidence of adaptation of virus to host species. Among the putative viral polypeptides of A C M V - K and ICMV, none are more similar to one another than to those of WTGs from hosts other than cassava. However, the common regions of ACMV, EACMV and ICMV share, as well as the stem-loop regions (nucleotides 147 to 179 and 52 to 62 in Fig. 4) found in other WTGs, some additional motifs which are not found in other WTGs. These are A A A T G G C A T (residues 71 to 79), AATT T G A A (residues 108 to 115) and G T G G T C C C C (residues 235 to 243). Whether these sequences interact with cassava-specific materials remains to be discovered. Perhaps the most unexpected features of the nucleotide sequences of ICMV and EACMV are the three repeats of 41 nucleotides in the common region of ICMV and the twice (imperfectly) repeated similar sequence in EACMV. The function of these reiterations is unknown. We thank P. F. McGrath for providing the source of)~CMV-M and J. Stanley for allowing us to quote his unpublished results. The work was funded in part by the EC [Contract TS2A-0137-C(CD)]. References BOCK, K. R. & HARRISON, B. D. (1985). African cassava mosaic virus. AAB Descriptions of Plant Viruses, no. 297. BOCK, K.R. & WOODS, R.D. (1983). Etiology of African cassava mosaic disease. Plant Disease 67, 994-995. COUTTS, R. H. A., COFFIN, R.S., ROBERTS, E . J . F . & HAMILTON, W. D. O. (1991). The nucleotide sequence of the infectious cloned DNA components of potato yellow mosaic virus. Journal of General Virology 72, 1515 1520. DEVEREUX,J., HAEBERLI,P. & SMITHI]?S,O. (1984). A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Research 12, 387 395. DRY, I.B., RIGDEN, J.E., KRAKE, L.R., MULLINEAUX, P.M. & REZAIAN, M.A. (1993). Nucleotide sequence and genome organization of tomato leaf curl geminivirus. Journal of General Virology 74, 147 151. FRISCHMUTH, T., ZIMMAT, G. & JESKE, H. (1990). The nucleotide sequence of abutilon mosaic virus reveals prokaryotic as well as eukaryotic features. Virology 178, 461-467. HAMILTON,W. D. O., STEIN,V. E., COUTTS,R. H. A. & BUCK, K. W. (1984). Complete nucleotide sequence of the infectious cloned DNA components of tomato golden mosaic virus : potential coding regions and regulatory sequences. EMBO Journal 3, 2197-2205. 2443 HARRISON, B.D. & ROBINSON, D.J. (1988). Molecular variation in vector-borne plant viruses: epidemiological significance. Philosophical Transactions of the Royal Society of London B 321, 447-462. HARRISON, B. D., LENNON, A. M., MASSALSKI,P. R., ROBINSON, D. J. & THOMAS, J.E. (1987). Geographical variation in geminivirus isolates associated with cassava mosaic disease. Report of the Scottish Crop Research Institute for 1986, 179-180. HARRISON, B. D., MUNIYAPPA,V., SWANSON,M. M., ROBERTS,I. M. & ROBINSON, D. J. (1991 a). Recognition and differentiation of seven whitefly-transmitted geminiviruses from India and their relationships to African cassava mosaic and Thailand mung bean yellow mosaic viruses. Annals of Applied Biology 118, 299-308. HARRISON, B. D., SWANSON,M. M., MCGRATH, P. F. & FARGETTE, D. (1991b). Patterns of antigenic variation in whitefly-transmitted geminiviruses. Report of the Scottish Crop Research Institute for 1990, 88 90. HOWARTH, A. J., CATON, J., BOSSERT, M. & GOODMAN, R. M. (1985). Nucleotide sequence of bean golden mosaic virus and a model for gene regulation in geminiviruses. Proceedings of the National Academy of Sciences, U.S.A. 82, 3572-3576. KHEYR-PouR, A., BENDAHMANE, M., MATZEIT, V., ACCOTTO, J.P., CRESPI, S. & GRONENBORN, B. (1992). Tomato yellow leaf curl virus from Sardinia is a whitefly-transmitted monopartite geminivirus. Nucleic Acids Research 19, 6763-6769. LAZAROWITZ, S. G. &; LAZDINS, I. B. (1991). Infectivity and complete nucleotide sequence of the cloned genomic components of a bipartite squash leaf curl geminivirus with a broad host range phenotype. Virology 180, 58-69. MORRIS, B., COATES, L., LOWE, S., RICHARDSON,K. & EDDY, P. (1990). Nucleotide sequence of the infectious cloned DNA components of African cassava mosaic virus (Nigerian strain). Nucleic Acids Research 18, 19~198. NAVOT, N., PICHERSKY, E., ZEIDAN, M., ZAMIR, D. & CZOSNEK, H. (1991). Tomato yellow leaf curl virus: a whitefly-transmitted geminivirus with a single genomic molecule. Virology 185, 151 161. ROBERTS, I. M., ROBINSON,D. J. & HAgglSON, B. D. (1984). Serological relationships and genome homologies among geminiviruses. Journal of General Virology 65, 1723 1730. ROBINSON, O. J., HARRISON, B. D., SEQUEIRA,J. C. • DUNCAN, G. H. (1984). Detection of strains of African cassava mosaic virus by nucleic acid hybridisation and some effects of temperature on their multiplication. Annals" of Applied Biology 105, 483-493. SAMBROOK, J., FRITSCH, E.F. & MANIATIS, T. (1989). Molecular Cloning: A Laboratory Manual, 2nd edn. New York: Cold Spring Harbor Laboratory. SANGER, F., NICKLEN, S. & COULSON, A. R. (1977). DNA sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences, U.S.A. 74, 5463-5467. SEQLrEIRA, J.C. & HARRISON, B.D. (1982). Serological studies on cassava latent virus. Annals of Applied Biology 101, 33~'2. STANLEY, J. & GAY, M. R. (1983). Nucleotide sequence of cassava latent virus DNA. Nature, London 301,260-262. SWANSON, M.M., VARMA, A., MUNIYAPPA, V. ~¢ HARRISON, B.O. (1992a). Comparative epitope profiles of the particle proteins of whitefly-transmitted geminiviruses from nine crop legumes in India. Annals of Applied Biology 120, 425-433. SWANSON,M.M., BROWN, J. K., POULOS, B.T. & HARRISON,B. D. (1992b). Genome affinities and epitope profiles of whitefly-transmitted geminiviruses from the Americas. Annals' of Applied Biology 121,285-296. (Received 15 April 1993; Accepted 12 July 1993) Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Wed, 14 Jun 2017 16:53:13
© Copyright 2026 Paperzz