Journal of General Virology (1991), 72, 2895-2903. Printed in Great Britain 2895 Diverse groups of plant RNA and DNA viruses share related movement proteins that may possess chaperone-like activity Eugene V. Koonin, 1 Arcady R. Mushegian,2*~ Eugene V. Ryabov 2 and Valerian V. Dolja2:~ ~Institute o f Microbiology, U S S R A c a d e m y o f Sciences, Moscow 117811 and ZA. N. Belozersky Laboratory, Moscow State University, Moscow 119899, Russia Amino acid sequences of plant virus proteins mediating cell-to-cell movement were compared to each other and to protein sequences in databases. Two families of movement proteins have been identified, the members of which show statistically significant sequence similarity. The first, larger family (I) encompasses the movement proteins of tobamo-, tobra-, caulimo- and comoviruses, apple chlorotic leaf spot virus (ACLSV) and geminiviruses with bipartite genomes. Thus this family includes viruses which move by two methods, those requiring the coat protein for the cell-to-cell spread (comoviruses) and those not having this requirement (tobamoviruses). The previously unsuspected relationship between the movement proteins of RNA and DNA viruses having no RNA stage in their life cycle (geminiviruses) suggested that their movement mechanisms might be similar. The second, smaller family (II) consists of the movement proteins of tricornaviruses (bromoviruses, cucumoviruses, alfalfa mosaic virus and tobacco streak virus) and dianthoviruses. Alignment of the sequences of family I movement proteins highlighted two motifs, centred at conserved Gly and Asp residues, respectively, which are assumed to be crucial for the movement protein function(s). Screening the amino acid sequence database revealed another conserved motif that is shared by a large subset of family I movement proteins (those of caulimo- and comoviruses, and ACLSV) and the family of cellular 90K heat shock proteins (HSP90). Based on the analogy to HSP90, it is speculated that many plant virus movement proteins may mediate virus transport in a chaperone-like manner. Introduction infected cells, and more specifically within the plasmodesmata (Tomenius et al., 1987; Moser et al., 1988). Moreover, an increased size exclusion limit of plasmodesmata has been revealed in transgenic plants expressing the 30K protein (Wolf et al., 1989). These observations appear to constitute conclusive evidence for the involvement of the 30K protein in virus movement. In other plant virus systems the evidence for transport function is less solid. Nevertheless, by the inoculation of plant cells with subsets of genome segments of multicomponent viruses, artificial mutagenesis of individual genes and complementation of movement by distantly related viruses, the movement function has been assigned to specific gene products of tricornaviruses (Allison et al., 1989), calulimoviruses (Stratford & Covey, 1989), comoviruses (Wellink & van Kammen, 1989), hordeiviruses (Petty & Jackson, 1990) and bipartite geminiviruses (Davies & Stanley, 1989). The complementation experiments seem particularly revealing, demonstrating that there is probably much in common in the movement mechanisms of distantly related viruses (Taliansky et al., 1982; Malyshenko et al., 1989). Recently, it has been demonstrated that the 30K protein of TMV and the putative caulimovirus move- A growing body of evidence indicates that cell-to-cell spread of virus within the infected plant is mediated by specific virus-encoded proteins (reviewed by Hull, 1989; Atabekov & Taliansky, 1990; Citovsky & Zambryski, 1991). The best studied example is the 30K protein of tobacco mosaic virus (TMV), the participation of which in virus movement has been suggested by several types of experiments. (i) TMV mutants Ni2519 and Lsl, bearing mutations in the 30K gene (Zimmern & Hunter, 1983; Meshi et al., 1987), display temperature-sensitive (ts) cell-to-cell movement, but can be rescued by a wild-type helper which apparently provides a product acting in trans (Taliansky et al., 1982; Malyshenko et al., 1989). (ii) Transgenic plants expressing the 30K protein have been shown to complement movement-defective TMV mutants at non-permissive temperatures (Deom et al., 1987). (iii) The 30K protein is localized to the cell walls of "~Present address: Department of Plant Pathology,Universityof Kentucky, Lexington,Kentucky40546, U.S.A. Presentaddress:Departmentof Biology,TexasA & M University, College Station, Texas 77843, U.S.A. 0001-0341 © 1991 SGM Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 2896 E. V. Koonin and others m e n t p r o t e i n b i n d s s R N A a n d s s D N A ( C i t o v s k y et al., 1990, 1991). C i t o v s k y & Z a m b r y s k i (1991) p r o p o s e d t h a t s i n g l e - s t r a n d e d nucleic a c i d b i n d i n g m i g h t be a c o m m o n p r o p e r t y o f p l a n t virus m o v e m e n t p r o t e i n s w h i c h is i n d i s p e n s a b l e for t h e i r function. C o n s i d e r a b l e effort has b e e n p u t into c o m p a r i n g a m i n o a c i d sequences o f ( p u t a t i v e ) m o v e m e n t p r o t e i n s o f p l a n t viruses. M o s t o f this w o r k s t a r t e d w i t h the s e q u e n c e o f the best s t u d i e d m o v e m e n t p r o t e i n , the 3 0 K p r o t e i n o f T M V . V a r i o u s degrees of s i m i l a r i t y b e t w e e n the 30K p r o t e i n a n d the p u t a t i v e m o v e m e n t p r o t e i n s o f t o b r a v i r u s e s ( B o c c a r a et al., 1986), c o m o v i r u s e s a n d n e p o v i r u s e s ( M e y e r et al., 1986), a n d u n e x p e c t e d l y c a u l i m o v i r u s e s (Hull et al., 1986; H a s e g a w a et al., 1989), h a v e b e e n r e p o r t e d . A t t e m p t s h a v e also b e e n m a d e , w i t h s o m e w h a t conflicting results, to e s t a b l i s h r e l a t i o n s h i p s b e t w e e n the p u t a t i v e m o v e m e n t p r o t e i n s of b r o m o - , cucumo-, ilar- a n d d i a n t h o v i r u s e s (Savitry & M u r t h y , 1983; X i o n g & L o m m e l , 1989; A l l i s o n et al., 1989). Several o b s e r v a t i o n s h a v e b e e n r e p o r t e d on a p p a r e n t sequence s i m i l a r i t i e s b e t w e e n virus m o v e m e n t p r o t e i n s a n d v a r i o u s cellular p r o t e i n s ( Z i m m e r n , 1983; Hull et al., 1986; M a r t i n e z - I z q u i e r d o et al., 1987). It is n o t easy to assess t h e actual value o f m o s t o f these o b s e r v a t i o n s b e c a u s e (i) c o m p a r i s o n s usually were m a d e using only one r e p r e s e n t a t i v e sequence o f the m o v e m e n t p r o t e i n s o f a virus g r o u p (e.g. t o b a m o v i r u s e s ) a n d / o r a cellular p r o t e i n family, a n d (ii) in m a n y cases only very s h o r t s e g m e n t s were aligned. Recently, M e l c h e r (1990) p u b l i s h e d a m o r e t h o r o u g h c o m p u t e r - a s s i s t e d analysis of the s e q u e n c e s o f t o b a m o - , tobra-, c a u l i m o - a n d t r i c o r n a v i r u s m o v e m e n t proteins, c o n f i r m i n g a significant s i m i l a r i t y b e t w e e n the p r o t e i n s o f the first three groups, a n d a m u c h m o r e r e m o t e r e l a t i o n s h i p b e t w e e n t h e m a n d the t r i c o r n a v i r u s e s . H o w e v e r , p u t a t i v e c o n s e r v e d m o t i f s t h a t m i g h t be helpful in further analysis o f virus m o v e m e n t p r o t e i n s a n d t h e i r t e n t a t i v e cellular h o m o l o g u e s h a v e n o t b e e n clearly defined. W e sought to e x t e n d this analysis by s y s t e m a t i c a l l y c o m p a r i n g the sequences o f the v a s t m a j o r i t y o f p l a n t virus p r o t e i n s i m p l i c a t e d in cell-to-cell m o v e m e n t , a n d by s c r e e n i n g a m i n o a c i d sequence d a t a b a s e s for t h e i r possible cellular homologues. O u r analysis h i g h l i g h t e d an u n e x p e c t e d r e l a t i o n s h i p b e t w e e n a large collection o f p l a n t virus m o v e m e n t p r o t e i n s a n d the f a m i l y o f u b i q u i t o u s cellular 9 0 K h e a t shock p r o t e i n s (HSP90), w h i c h possess m o l e c u l a r c h a p e r o n e activity, i.e. t h e y are i n v o l v e d in i n t r a c e l l u l a r p r o t e i n trafficking ( W e l c h , 1990). T h e p o s s i b i l i t y o f a s i m i l a r a c t i v i t y for the virus m o v e m e n t p r o t e i n s is discussed. Methods Amino acid sequences. The amino acid sequences of the (putative) movement proteins of the following viruses were analysed. Tobamo- viruses: TMV, TMV strain Om (TMVOm), TMV tomato strain (TMVTo), TMV cowpea strain (TMVCo) and cucumber green mottle mosaic virus (CGMMV). Tobraviruses: pea early browning virus (PEBV) and tobacco rattle virus strain SY (TRV). Geminiviruses: tomato golden mosaic virus (TGMV), bean golden mosaic virus (BGMV) and cassava latent virus (CLV). Caulimoviruses: cauliflower mosaic virus strains D/H, Strasbourg and CM1841 (CaMVD, -S and -C), figwort mosaic virus (FMV), carnation etched ring virus (CERV) and soybean chlorotic mottle virus (SCMV). Comoviruses: red clover mottle virus (RCMV) and cowpea mosaic virus (CPMV). Cucomovirus: cucumber mosaic virus (CMV). Apple chlorotic leaf spot virus (ACLSV; referred to as a clostero virus, but at present included in capilloviruses) and beet yellows closterovirus (BYV). Bromoviruses: cowpea chlorotic mottle virus (CCMV), broad bean mosaic virus (BBMV) and brome mosaic virus (BMV). Ilarviruses: tobacco streak virus (TSV) and alfalfa mosaic virus (AIMV). Dianthoviruses: red clover necrotic mosaic virus (RCNMV). Nepoviruses: tobacco black ring virus (TBRV) and Hungarian grapevine chrome mosaic virus (GCMV). The sequences were from the SWISSPROT database (Release 14), except for those of ACLSV (German et al., 1990) and BYV (Agranovsky et al., 1991a). Computer-assisted sequence analysis. Initial pairwise comparison of the sequences was by the DOTHELIX program, generating a diagonal plot and allowing precise delineation of sequence segments displaying similarity with the highest deviation from the random expectation (Leontovich et al., 1990). Sequence database screening was by the SMART program, implementing a simplified version of the DOTHELIX algorithm for comparison of a given 'probe' sequence with all sequences in the database. Programs DOTHELIX and SMART are modules of the GENEBEE program package for biopolymer sequence analysis (Brodsky et al., 1991). To determine the boundaries of the segments to be aligned, which is critical for obtaining the optimal alignment, the results of DOTHELIX comparisons were used. Multiple sequence alignments were generated in a stepwise manner using the OPTAL program (Gorbalenya et al., 1989). The statistical evaluation of alignment by OPTAL includes calculation of two values: distance, D = -ln[(S ° - Sr)/(Sm - Sr)] x 100 (Feng et al., 1985) and the adjusted alignment score, AS = (S° - Sr/a), where S° is the observed alignment score, Sr is the mean score obtained upon 25 random simulations of the alignment procedure, Sm is the average of the scores obtained upon comparison of each of the two compared sequences (alignments) with itself, and cr is the standard deviation of the score. Generally, D values of less than 200 and AS values of more than 7 standard deviations (S.D.)can be considered a firm indication of a genuine relationship between two sequences or alignments; values of 200 to 270 and 4 to 7 S.D. can be considered significant, provided there is some additional (i.e. functional) evidence of the suspected relationship (Doolittle, 1986; and unpublished observations). Throughout this study the log odds matrix of Dayhoff (Dayhoffet al., 1983) was used for calculation of sequence comparison scores. Protein secondary structure predictions were by the PROTEIN2 program of the GENEBEE package implementing the modified algorithm of Gamier (Gibrat et al. 1987). Results and Discussion Grouping o f plant virus m o v e m e n t proteins by sequence similarity O u r first a p p r o a c h was to g e n e r a t e local s i m i l a r i t y plots for p a i r s o f (putative) m o v e m e n t p r o t e i n s e n c o d e d b y viruses o f different g r o u p s to d e l i n e a t e s i m i l a r regions a n d to e s t a b l i s h t h e a p p r o x i m a t e h i e r a r c h y o f relation- Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 Plant virus movement proteins TMV TMVOm Tbg/To "f~Co CGMMV PEBV TRV 30K 3OK 30K 30K 30K 29K 29K TGMV BGMV CLV 21K 33K 33K 21 21 21 23 24 32 26 K I LPSM f t p V K s V M c S K V D K I -MVHENESL s - E V N L L - K - g V - K L ID ....... sK I LPSMf t pVKs VMcSKVDKI -MVHENESLs-EVNLL-K- gV-KL I D. . . . . . . sKLLPSM f t pVKsVMVSKVDKI -MVHENESLs - E V N L L - K - g V - K L I E . . . . . . . gK - F K w K - a p S R v C s I v q s D T I s M t A N G r S L f - T f D V L - K - D V 1 K h a E. . . . . . . EY K-LPNyfmILKiLsITdfSvV-KaQsyESLV-pVkLL-R-gV-DLTK ....... HLMVdGArKrpKYfHRrRETvLsNVAGSLTEh-kLgIfTi-EdvRNVK ....... sY LfVdGRrKrpKYfHRPRETvLnhVGGkkSEh-kLDVfDQ-rdyKMIK. . . . . . . sY 2 S q L A c P p N A F N Y I e S n R - D E y - Q L s h D L T E I - i L Q f p - S - T A S Q L S.a. . . . . . 2 S q L A N P p N A F N Y I e S q R - D E y - Q L s h D L T E I - I L Q f p - S - T A S Q L S.a. . . . . . CaMVD CaM'¢S CaMVC FMV ORF1 ORF1 ORF1 ORF1 CERV ORFI SCMV ORF1 RCMV 49K CPMV 5SK 70 KaFSRK-NEILYCVSTKELSV-DIHDATgKV-yLPLITKeEINKRLsslkpe-vRR 7 0 KaFSRK-NEILYCVSTKELSV-DIHDATgKV-yLPLITKeEINKRLsslkpe-vRK 70 KaFSRK-NEILYCVSTKELSV-DIHDATgKV-yLPLITReEINKRLsslkpe-vRK 68 TaFT~RK-NEIFYCVSTKEMSV-DIkDvSgQV-yLPLITKqEIQQKLMkidpS-vRs 61KaFSRK-NhIYYhVnYKEfSV-DIcDThgKN-yLPLVTKSEIKKNLDkikdeKvRs 45 nVFSRd-NILKFgLMTgEVQI-pIEQTDgsV-fLAtINKeQITKRIskieeK-Og~R 131 TsLPGG R E V E Y - - K h l D A g h - L L A D T N v V I - D V P L V P Q I a A r t p T D y n f g T s r D K IS9 S k A A A G - M G F I n - ~ K h m L T g N - I L A Q p T T V L - D I P V t k D k T L a m a S D - f i R K E N L A C L S V 50K 34 KkASDL-MIhWNeEVFKVNpE~D/AGDgfRLaSIPVIpSsEVQaVI_RK . . . . : : ::: : "G" * TMV TMVOm TMVTo TMVCo CGP~IV RL RF 0 MdtSvPvIsSDYIHSaR-TEy-KLtNDESp[-TLQfp-S-TLERtrV ....... HI Re "D" • ! i !! 30K 30K 30K 30K 30K 29K 29K g y v C L - a G ]VVt g e w - N L P D N C F G G V s V C L V D K R M - - E R a D E A tL G S Y - y t a A A k k R F q g y v C L - a G IVVt g e w - N L P D N C r G G V s V C L V D K R M - - E R a D E A t LGSY- y t a A A k R R F q g y v C L - V G i VVsgew-NLPDNCr GGVs VCMVDKRM - - ERaDEA t LGSY -y £a A A k k R F q TyvDV - L G v V L s g q w - L L P K G T p G S A E I IL L D S R L - - - K g K a S V L A V F - n c r A A T q E F q Lyv t L - L G v W s g V w - N V P E s C r G G A t VALVDTRM-- hSVAEG t I cKF- sAPAt v rEFS Kf i r I-VGiq Ik V t S - h L P R D T S G f LQ IDL IDSRL tO g P d ~ s k V L O R F - V A k A C D n t sV af i k I-VGvqLVVt S - h L P a D T p G f IQ IDLLDSRLt ekRKKgkt IQ R F - K A F A C D n c s V TCMV I~GMV CLV 21K 33K 33K SrSCMkIdhcVl e fRqQVPiNaTGSVvVEIhDKRM---TDNESLqASWtFpvrCNiDLh SrSCMkI dhc VI eyRqQVPi NaTGSVvVEI hDKRM-- -TDNESLqASWt Fp i rCNiDLh MGKCMkVdhWI eyRnQVPfNaQGSViVTI RDTRL---SDEQqdqAqF t FpiGCNvDLh CaMVD CablVS CaMVC FMV CERV SCMV ORFI ORFI ORF1 ORF1 ORF1 TmSMVIlLGAVk l LLKA(]FRNGID t p Ik I AL IDDR l - - N S R R D c L L G a a - K G N L A y G K F M T m S M V R L G A V k ILLKAQFRNG iD t p I k IALI DDR I - - N S R R D c L L G a a - K G N L A y G K F M Im S M V H L G A V k ILLKAQFRNG iDtp Ik IAL IDDR I - - N S R R D c L L G a a - K G N L A y G K F M K i S M I H L G A V k I L L T A Q F R Q G iD t S V k M A L I D D R I - - v N R K D S L L G a a - R G N L A y G K F M TISD IH F G A I k V L I K A r E R E G i N s D IkMAL IDDR I - -TDRQDS ILGaa - H G N L v y G K F M LIRyVH I s tLQVL IKStFIKG-Dt p L E L T L R D N R L - - 1NLEESK I A V g - H G N L K y G K M K PEBV . TRV RCMV 0RF1 49K SATaLbIVGAI E W I Q S y a s s E C D 1 mAGMMLVDTfh - -SRpEN AI r sVY- I v P IRgGmFM CPMV 58K KTSaI HIGAIE I I I QgfaspEgD i mGGfLLVDS1h - -TDLANAI r s I F-VAPMRzGRpV ACLSV SOK S T n I VHWGALs I s IDALFRK - - N a G s G V C y V D N R w - - E T f E Q A M L Q K F - H f N L d S G s a T : ::: : : : : : : : : : ! TMV TMVOm TMVTo TMVCo CGMMV PEBV TRV 30K 30K 30K 30K 30K 291( 29K ! • ! FkVvPNYAIT%QD-AMkNvWQVLMNIrN~/k-MsAG--Fc FkVvPNYAITtOD-AMkNvWQVLVNIrNVk-MsAG--Fc FkVvPNYGITtKD-AekNIWQlrLVNIkNVk-MsAG--Yc FLISPgYSLTcaD-ALkI~FEIsCNVIDLp-VI(DG--FT VrFiPNYSWaaD-ALrDPWSLFVRLSNVg-IKDG--FH VQYkFDYmVStRE-NIaDIWKIGtVaVNVp-VvDd--cy aQYkVEYSIStQE-NV!DvWKVGCIsEgVp-VcDG--ty ! ~L--sLeF ~L--sLeF ~L--sLeF ~L--sVel >L--tLeV ~F--sVeV ~F--sleV 109 109 lOS 122 I01 82 82 TC44V 21K BGMV 33K CLV 33K Y-FSSsF-fSLKD-pI--PWKLYYKVcDsN-VhQrtHFakFkGKLkL 43 Y-FSSsF-fSI-KD-pl--PWKLYYRVSDtN-VhQrtHFakFkGKLkL 150 Y - F S a s Y - f S ID D - N V - - P W Q L L Y K V E D s N -VKNG I tFaq IkAKLkL 157 CaMVD CaMVS CaMVC FMV CERV SCMV ORFI ORF] ORFI ORFI ORFI ORFI FtVyPkFGISLNTqrLNQTLSLIhDFENkNLMNkGDkVmtltyIVGY FtVyPkFGISLNTqrLNQTLSLIhDFENkNLMNkGDkVmtltyVVGY FtVyPkFGISLNTqrLNQTLSLIhDFENkNLMNkGDkVmtItylVGY Ft V y P k F A L S L Q S k N L D K T L S f Ih Q F E r k D L M K t G D k V F tVtyL IGY FtVyPkYTTSILDqrLDRTLaflhhFErNDLMRkGDkVFsItyLVAY RCMV CPblV 49K 58K r aL c F p n T L V p M D s D INNr F K V V F s L p N N D F I ~ G sk 1GHv s I nMaGc 121 r V V T F p n T L a p V S c D L N N r F K L I C s L p N C D I V Q G s Q v A e v s V n V a G c 128 ACLSV 5OK FDVnLQIGLSLKDIDLDRSIILnYKFLrrNFMKEGNHAFslsyRInY LVtSPNFpVSLDDpGLsNS]SVAVMFENLNFkfEsyplSvrVGNMcF 102 102 102 i01 102 103 215 Fig. 1. Alignment of conserved domains of family I virus movement proteins. Protein sequences of distinct virus groups, tobamo-/tobra-, gemini-, caulimo- and comoviruses, and ACLSV, are separated by blank lines. The numbers of amino acid residues between the protein termini and the aligned segments are indicated. Double asterisks indicate identical or similar residues in all sequences; single asterisks indicate residues identical or similar in the sequences of tobamo-, tobra- and geminivirus proteins; colons indicate residues identical or similar in caulimovirus, comovirus and ACLSV sequences; exclamation marks indicate residues identical or similar in tobamo-, tobra- 2897 ship. Then multiple alignments were generated in a stepwise manner using this hierarchy. In this way it was confirmed that the 30/29K movement proteins of tobamo- and tobraviruses form a tight group, aligning over their entire length (about 300 amino acid residues) with a convincing AS of 7-4 S.D. Quite unexpectedly, it was shown that this group is related to the BL1 proteins, encoded by the B D N A components of bipartite geminiviruses and implicated in virus movement (Etessami et al., 1988; Brough et al., 1988). A statistically significant alignment (AS about 6-1 S.D.) was obtained for the N-terminal regions of approximately 140 amino acid residues, which constitute about 8 0 ~ of the smallest geminivirus protein. Comparisons of the putative caulimovirus movement proteins (ORF 1 products) revealed an approximately 180 residue region of significant similarity with the 58/48K proteins of comoviruses (AS of approximately 7.1 S.D.) and the 50K protein of ACLSV (AS about 6.9 S.D. upon comparison of the ACLSV sequence with the alignment of caulimo- and comoviral proteins). The two alignments thus obtained were combined to yield a composite alignment of more than 5-7 S.D. above the random expectation (Fig. 1). Even more convincing scores (about 6.5 S.D.) could be obtained upon aligning the sequences of tobamovirus/tobravirus and caulimovirus proteins separately. We define the set of proteins in the alignment in Fig. 1 as family I of plant virus movement proteins. Within this family, the scores for the alignments reflecting newly observed non-trivial relationships, such as those between the geminivirus and tobamovirus/ tobravirus proteins, or between caulimovirus and comovirus proteins, were comparable to that for the obvious alignment between the tobamo- or tobravirus proteins themselves. As in other vast protein families, not all of the groups constituting family I showed convincing similarity to each other when compared directly. In each of such cases the relationship could be established by comparison with a third group. An important example is the comparison of the movement proteins of tobamo- and tobraviruses, and comoviruses. Upon direct alignment, these proteins showed only a marginally significant similarity (4.1 S.D. when compared within the boundaries shown in Fig. 1). However, both groups were very similar to the movement proteins of caulimoviruses. Together with the and caulimovirus proteins (in each case one exception in the tobamovirus or caulimovirus sequence was allowed). Residues identical or similar in sequences of different groups are shown in upper case. Amino acid residues were grouped as follows: A, G; S, T; I, L, M, V; F, Y, W; K, R; D, E, N, Q. The two conserved motifs are designated as indicated in the text as is the scheme of the alignment generation. The ASs were over 5.5 s.o. and the distances (see Methods) less than 225 for each alignment step. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 2898 E. V. Koonin and others CCMV BBMV BMV CMV 32K 48 GMytNLssNNrlnyiDLV-pkNTGsraHqLFkSEFeE-'Rh'pss-G-vL-IprvLVFL--VrTt 32K 47 GrytELkaNrrlnhlDLV-pkvhGmkmLSMFrSEydk--GnVpss-G-vLnlprlLiYL--VrTs 3ZK 49 GcynyLksNEpRnylDhV-pkShvsawLSw~TSKydk--GeLpsR-G-fMnVpriVcFh--VrT% 32K 40 GrhhwMraDNalsvRpLV-pQvTsnnLLSfFkSgyda--C~LrsK-G-yMsVpqvLcW--trTg AIMV TSV 32K 37 sMsI-LgpNQIKICtQLVIsNva~d~VVLSLVSkEkksIIr~IpKIC4~rMy%~hhSalYLIy Mpnl 32K 2g sgsVDLNmg-IBRCaafp-AENTGafLcELtTkEtksFIGKfsdKVrgrVf'DhaVIhMmyIPvl RCNMV 37K 16 GVaVsLNrytdwKCRs---GvSeAPLIpasMmSKItdY-AKttaK-GNsVaLNyThV~slaPTI -D,,? TBRV GCMV 93K B4K 617 593 aa LYafcViMwghssDAEtaslcgagvYlgdnqaavlELPLvcSyl-gnslEd-fda aa VYafcViMwghssNAEQaslsgayvYlgdgeasvlQLPLlcgyi-gnalEd~4ea HSP90 GRP94 HSP82 H~Pg2 HSP90 HSP83 HSP90 human mouse D.m. maize cabbage T. crucei S.C 363 408 347 348 333 337 342 aa LYvRRVFIND--~sCDELipEYL-NF-IRgVvdSEDLPLNISRE-mLQQsKILkv aa LYvRRVFItd---DFHDMmpkYL-NF-VKgVvdSDDLPLNVSRE-TLQQhKLLkv aa LYvRRVFIMd---NCEDLIpEYL-NF-NKgVvdSEDLPLNISRE-mLQOJ~VLkv aa LYvRRVFIMd---NCEELIpEWL-gF-VKgVvdSDDLPLNISRE-TLQQNKILkv LYvRRVFIMd---NCEDIlpEYL-gF-VKglvdSEDLPLNISRE-TLQQNKILkv LYvRRVFIMd---NCEDLcpEWL-gF-LRgVvdSEDLPLNISRE-NLQQNKILkv LYvRRVFItd---EAEDLIpEWL-SF-VKgVvdSEDLPLNLSRE-mLQQNKIMkv LYvqRVFIMd--~DAEQfmpNYL-rF-VRgLidSsDLPLNVSRE-ILQDstVtrN HTPG E . c o l i CCMV BBMV BMV CMV tVTeSGSVTIRLYDLisAssveILEpVdGtQEATipissLpAiVcFspSYdcpmQMig---NRhrcf-GLVT hsSTvGSIT,RLVDtycAsdsclLEAIdG-QEfTvDIssLpcmlgFspTYdcklEMvD---GRrgcf-GIVT dsAe~SITVsLcDsGkAAragVLEAIdN-QEATIQIs#d~pAIIaltpSYdcpmEVI~dsGRNFcf-GIaT stdaeGSLkIyLaDLGdkE .... LspldG-QcVTlhnheLpAllsFqpTYdcpmELvg---NRhrcf-AWV AIMV TSV IkSS~SITLKLfNeAtGE---LVDVdTD~hDAI-Qaclf-A-grYprSi-LakDaak---GHDIkl--VVh InTThAiaeLKLRNLAtGD---eLyGgTk-vNLn-kafIL-t-MtWprSI-faeaVhN---hKEIYIgGtVS 298 aa aa aa aa CaMV-D ORF1 71 CaMV-S ORFI 71 aa CaMV-C ORF1 FMV ORFI C~V DRF1 71 68 aa 62 aa .... y C v s - t k E - L - S v d I h d a t g k v y L P L - l T K E - E I - - N l ~ L s s aFsl~neIL .... yCvs-tkE-L-SvdIhdatgkvyLPL-ITKE-EI--NKRLss aFsP~neIL .... yCvs-tkE-L-SvdIhdatgkvyLPL-ITRE-EI--NKttLss aFkRKneIf. . . . yCvs-tkE-M-SvdIKdVsgqvyLPL-IlKQ-EI--QqKLmk aFsRKnh[y. . . . yhvN-ykE-f-SvdIcdthgkNyLPL-VTKs-EI--kKnLdk 238 aa LvagiLYdM--cfEYNtLkStYLkNlesfdeflSlyLPL-LSEvfSMNw~papD BYV 64K aa aa aFsRKnelL "LPL" RCNMV gVAipGhVTVeLINpnveG---pfQVMSG~QtLS-wspGa-G-kpclmiFsVhhQLns---DHEpFr--VRI CCMV BBMV BMV CMV qLdGvlsSgSTVvMSHAYWSaNfrSKpNnYkQyapmyKYV-epfDrLKr-LSrkq-LKN--YV eLnGvlgeghTVAIVHAYWkaMfrTKpgnYtRvkpaAKFI-ApfDrLKQ-LSSGq-I/<DafFI qLSGvVgTtGSVAVtHAYWQaNfkaKpNnYklhGpatiMV-MpfDFI/RQ-LDkks-LKN--YI erhGylgygGTtAsVcsnWQaQfSSKnNnYthAAaaktLV-LpyNrLAE-hSkps-avA-rLL 69 62 68 112 AIMV TSV aVas-tnmn~aVGVLypiWEdELSrK-QileRGADflKFpiAktEpVRDLLNaGK-LtD--FV caSs-VPahAkIGMwypIWSekVSiK-QIYqNTiD/hK-tEA~etFtpl]41SSdKeM~s--LL 82 78 RCNMV tnTG-IPTkkSyArcHAYWgfDVg~-hrYyKS-EpARLIELevgYqRTLLSSiKaVeA--YV ,22 : THRV GCMV Fig. 2. Alignment of conserved domains of family II virus movement proteins. The distinct groups (separated by blanks) are represented by the proteins of the bromo-/cucumo-, liar- and dianthoviruses. Asterisks indicate identical or similar residues in all sequences. The ASs were over 6.0 S.D. and the distances less than 220 for each step. The tentative counterpart of the 'D' motif of family I plant virus movement proteins is highlighted. Other designations are as in Fig. 1. HSP90 GRP94 HSP82 HSPg2 HSP90 HSP83 HSP90 93K B~K "c" : :: yKRsLVlsTV-fFgKsGlfaC~NvfgitavEFTdy-MptsyGgitHERDSw . . . . . qaML l l S 7K~sLV1sTc-fFgtsGlspGQNmfgitavEFTey-LptsyGgitHERDS ..... nOeL 114 human mouse D, m maize IRKNIVKKcLELFSELA-edKENyK-KFFEAFSKN-LK-LG--IHEDSTN--RRRLsELL IRKkLVRKTLDMIKKIA-eeKyN-d-TFwkEFgTN-VK-LG--VIEDhSN--RtRLakLL IRKNLVKKTMELIeeLT-edKENyK-KFyDQFSKN-LK-LG--VHEDSNN--RaKLaDfL IRKNLVKKcIEMFfelA-enKDDyA-KFyDaFSKN-IK-LG--IHEDSQN--RaKLaDLL c a b b . IRKNLVKKcMELFfelA-enKEDyA-KFyEaFSKN-LK-LG--IHEDSQN--RtKIaELL T. c IRKNIVI</qALELFeeLA-GnKEDyK-KFyEQFSKN-VK-LG--IHEDSSN--RKKLmELL S.c. IRKNIVKKIIEaFnelA-edsEQfe-KFysaFSKN-IK-LG--VHEOTQN--R~aLakLL 269 291 268 266 264 265 Z64 HTPG E. c o l i LRNaL~tKRvL0J~LeKLAkddAEkyq-TF~QQFglv-tH~-E~--paEDfaN--QeaIakLL226 CaMV-D ORFI CaMV-S ORFI CaMV-C ORFI FMV ORFI LKpE-V}~RTMsMV-hLG-AvKilIKaqFFNgidTp-IK-IA--LIBDrIN--sRR-dcLL LKpE-VBKTMsMV-hLG-AvKIIIKaqFrNgldTp-IK-IA--LIDDriN--sRR-dcLL LKpE-VBKIMsMV-hLG-AvKIIIKaqFrNgidTp-IK-IA--LIDDriN--sRR-dcLL IDps-VRsklsMl-hLG-AvKilltaqFrOgldTs-VK-MA--LIDDriv--NRK-dsLL Bh~/ V R l l f e l d A a E L L I K V p - t l n m h d s - I F I - - Y - K N K L R y L e s y f e D D S N E I l K v K V d s L L208 64K 162 162 162 161 "D" conservation of distinct motifs (discussed below), this suggests that each of these groups belongs to family I and are therefore related to each other. Family II is composed of the putative movement proteins of tricornaviruses (32K to 33K products of RNA 3) and dianthoviruses (35K product of R N A 2), yielding a statistically significant alignment (6.6 S.D.) on a 200 residue span (Fig. 2). Interestingly, the proteins of the two subdivisions of tricornaviruses (bromo-/cucumoviruses and A1MV/ilarviruses) were no more similar to each other than to the dianthovirus protein. Our attempts to produce an overall alignment of the movement proteins belonging to families I and II were not convincing, yielding alignment scores insufficient to prove that the two families are actually related to each other ( < 3 S.D.). One more group of plant virus proteins more or less definitely associated with cell-to-cell movement included those of nepoviruses (Meyer et al., 1986). We were unable to produce statistically significant alignments between the sequences of nepovirus proteins and any of those included in families I or II. Search for cellular proteins related to the plant virus movement proteins Representative sequences of each group of plant virus movement proteins were compared with all sequences in the SWlSSPROT amino acid sequence database Fig. 3. Alignment of the sequences of (putative) virus movement proteins with those of a conserved domain of HSP90. Double asterisks indicate residues identical or conserved in all aligned sequences; single asterisks indicate identical or conserved residues in HSP90 and the respective group of virus proteins (one exception in HSP90s or in caulimovirus proteins was allowed). Identical and similar residues in cellular and virus protein sequences are in upper case. The adjusted score for the alignment of the HSP90 sequences with those of caulimovirus ORF 1 product was 5.4 S.D., with a distance of 184.4. Including the BYV sequence segment in the alignment yielded a score of 4 S.D. (distance 264-9). The nepovirus sequences were aligned separately with those of HSP90, and the resulting alignments were reconciled by hand. Other designations are as in Fig. I. Abbreviations: D m . , Drosophila melanogaster; S.c., Saccharomyces cerevisiae; T. crucei, Trypanosoma crucei; E. coli, Escherichia coil (Release 14) using the SMART program (see Methods). No particularly close relationships could be revealed. The observed modest similarities were subjected to a detailed evaluation using the programs DOTHELIX and OPTAL. The only result deserving attention was obtained for an approximately 100 amino acid residue conserved segment of HSP90, which yielded a significant score (5.4 S.D.) upon comparison with the alignment of the caulimovirus movement proteins. Counterparts to this segment, though with lower similarity, have been identified in the putative movement proteins of nepoviruses, and in a partially sequenced 64K protein of unknown function encoded by BYV (Agranovsky et al., 1991 a; Fig. 3). The region conserved in HSP90 and virus Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 Plant virus movement proteins MOTIF "G . . VHENESLS--EVNL-L-K VHENESLS--EVNL-L-K VHENESLS--EVNL-L-K TANGRSLF--TFDV-L-K AQSYESLV--PVKL-L-R VAGSLTEH--KLGI-FTI VGGKKSEH--KLDV-FDQ TGMV BGHV CLV Z1K 331< 33K 23 23 21 LSHDLTEI--ILQF-P-S 12 CM-KID-HCVIEFR LSHDLTEI--ILQF-P-S 12 ~ - K I D - H C V I E Y R LTNDESPI--TLQF-P-S 12 (~4-KVD-HVVIEYR 10 W--VE--IHDKRM 103 I0 W--VE--IHDKI~4 211 I0 VI--VT--IRDTRL 220 CaI4VD CaMVS CaNVC YMV CERV SC)~V ORFI QRF1 ORF1 ORFI ORF1 ORFI 91 91 91 88 82 66 IHDATGKV--YLPL-ITK IRBATGKU--YLPL-ITK IHDATGKV--YLPL-ITR IKDVSGQV--YLPL-ITK ICDTHGKN--YLPL-VTK IEQTDGSV--FLAT-INK I0 I0 10 i0 10 9 RCMV CPMV 49K SSK ISO ]98 LADTNWI--DVPL-VPQ 20 AL-HVG-AIEVVIQ I0 AG--MM -LVDTFH 189 LAQPTTVL -DIPV-TKD 20 A I - H I G - A I E I I I Q lO GG- FL--LVDSLH 196 ACLSV 50K 55 IAGDGFRL-ASIPV IPS 14 IV-HWG-ALSISID 8 SG--VC -YVDNRW 283 TBRV GCHV 93K 84K 643 619 LGDNQAAV-LELPLVCSY LGDQEASV-LQLPLLCGY BYV 64K 262 ESFDCFLS-LYLPL 385 430 369 370 355 359 364 32Q IRGVVDSE--DLPLNISR VKGVVDSD--DLPLNVSR ~GWDSE--DLPLNISR VKGVVDSD -DLPLNISR VKGIVDSE--DLPLNISR LRGVVDSE--DLPLNISR VKGVVDSE--DLPLNLSR VRGLIBSS--BLPLNVSR CONSENSUS I VS--VC--LVDKRM VS--VC--LVDKRM VS--VC--MVDKRM AE--II--LLDSRL AT--VA--LVDTRM LQ--ID--LIDSRL IQ--ID--LLDSRL IK--IA--LIDDRI ~K--IA--LIDDRI IK--IA--L1DDRI VK--MA -LIDDRI IK--MA--LIDDRI LE--LT--LRDNRL 171 171 167 183 163 150 146 176 176 176 169 170 171 20 FFGKSGLFAGQNVF 21 FFGTSGLSPGQNMF I0 MPTSYGGITHERDS I0 LPTSYGGITHERDS 120 119 LSE 25 LLLKVP-TINMHD~ 7 LR-YLESYFEDDSN 219 9 9 9 9 9 9 9 9 277 299 276 274 272 373 272 23~ LPL * R K MOTIF MV-HLG-AVKILLK MU-HLG-EVKILLK MV-HLG-AVKILLK MI-HLG-AVKILLT DI-HFG-AIKVLIK YV-HIS-TLQVLIK 9 9 9 9 9 9 9 D" 43 43 43 44 45 55 48 19 19 19 19 20 19 CL--AG-LWTGEW CL--AG-LWTGE~ CL--VG-LWSGEt4 DV--LG-VVL.~QW TL--LG-VVVSGVW RI--VG-IQIKVTS KI--VG-VQLWTS . 30K 30K 30K 30K 30K 29K 29K HSP90 human GRP94 mouse HSP82 D.m. HSPSZ maize HsPgO eabb. HSP83 T. c, HsPgo S.c. ~TPG E . c o ] i I0 I0 10 12 10 12 12 . TMV TMVOm IMVTo TMVCo CGbfftV PEBV TRV 21LFSELA-EDKENYK 21MIKKIA-EEKYN-D 21LIEELT-EDKENYK 21 MFFEIA-ENKDDYA 21 LFFEIA-ENKEDYN 21LFEELA-GNKEDYK 21 AFNEIA-EDSEQFE 2Z MLEKLKKDDAEKYQ ** IG II A LK--LG--IHEDST VK--LG--VIEDHS LK--LG--VHEDSN IK--LG--IHEDSQ LK--LG -IHEDSQ VK -LG--IHEDST IK- LG--VHEDTQ LK--EG--PAEDFA • OG A leD R E "LPL" Fig. 4. Conservedmotifsin (putative)virusmovementproteinsand in HSP90. CONSENSUSindicatesthe residuesprsentin at least 16ofthe 31 sequencesaligned;asterisksindicatehydrophobicresidues(I, L, V, M, F, Y, W, C and A). proteins constitutes a part of the conserved domain of family I movement proteins (see Fig. 1 and discussion below). Conserved motifs in virus movement proteins Inspection of the alignment of family I movement proteins revealed at least two short, closely spaced segments which should be considered conserved motifs (Fig. 1 and 4). In tobamo- and tobravirus proteins these motifs were located inside the most conserved N-terminal region I described by Saito et al. (1988). The N-proximal 'G' motif included a Gly residue that is conserved in the vast majority of the sequences of the family, although it is replaced by Asp in geminivirus proteins. The flanking regions in all proteins are significantly enriched in hydrophobic residues. The distal 'D' motif included the Asp residue, which is the only invariant residue in the entire alignment; this residue is flanked by a hydrophobic segment to the N-terminal side and is separated by one residue from a highly conserved Arg to the C-terminal side. Despite our inability to demonstrate any overall similarity between the two families of movement proteins, one of the conserved sequence segments of family II resembled the 'D' motif (Fig. 2), also noted by Melcher (1990). 2899 Several mutants of TMV which exhibit the ts movement phenotype (Zimmern & Hunter, 1983; Meshi et al., 1987), or overcome the virus spread block in a tomato Tm2 line (Meshi et al., 1989), have been characterized. The mutations responsible for these phenotypes have been mapped to the conserved domain but fall outside the motifs delineated above. This is compatible with the apparent modulatory effect of these mutations on 30K protein activity. Inspection of the alignment of the amino acid sequences of HSP90 with caulimovirus and nepovirus movement proteins and the 64K protein of BYV (Fig. 3) highlighted another motif with the sequence LPL (hereafter LPL motif). Secondary structure predictions for HSP90 and the movement proteins yielded similar patterns, with a clear preponderance of s-helices and a loop around the LPL motif (not shown). When HSP90 sequences were compared with the aligned sequences of the movement proteins of tobamo-, tobra-, caulimo- and comoviruses, and ACLSV, in boundaries slightly different from those in Fig. 3, an alignment of reasonable statistical significance (about 5 S.D.) was produced. It became clear that modified forms of the LPL motif are present in the putative movement proteins of comoviruses and ACLSV, and degenerate counterparts of this motif could be tentatively identified in the tobamovirus and tobravirus sequences (Fig. 4). Thus the degree of conservation of different motifs varies in virus movement proteins and HSP90; the LPL motif is most prominent in HSP90 and caulimovirus proteins, whereas 'G' and 'D' motifs are best conserved aong tobamo-, tobra- and caulimovirus proteins (Fig. 4). Notably, caulimovirus movement proteins combine all three motifs in their non-degenerate forms. The conserved domains delimited above occupy somewhat different locations in the virus movement proteins (Fig. 5). In relatively small 22K to 38K proteins (gemini-, tobamo-, tobra-, caulimo-, tricorna- and dianthoviruses) these domains reside in N-terminal regions, with variable C-terminal extensions. Larger 48K to 64K proteins (comoviruses, nepoviruses, ACLSV and BYV) have both N- and C-terminal extensions. Functional implications of the alignment of movement proteins Recently, two types of plant virus movement, represented by tobamoviruses and comoviruses, were postulated. They differ primarily in that virus capsid proteins are required for the second, but not for the first type (Wellink & van Kammen, 1989; van Lent et al., 1990). By demonstrating that the movement proteins of tobamoviruses and comoviruses both belong to family I we have shown that, despite this important distinction, Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 2900 E. V. Koonin and others Tobamo- 'Tobraviruses 'LPL" "G" "D' .............................. 'G ~ 'D" II II TGMV Geminiviruses I1 iI I CLV BGMV [J Caulimoviruses | Comoviruses [ | II ACLSV | Nepoviruses | [I BYV (Closteroviruses) | HSP90 (eukaryotic) | HTPG 7_ _._J E. coli | 'LPL' 'G' 'D" Fig. 5. Locationsoftheconserveddomains (black)in the(putative)plant virus movement proteins and HSP90. Movement proteins are designated by rectangles drawn approximately to scale; the regions aligned in this paper are enclosed in the broken box. The three conserved motifs described in the text are shown in those proteins in which they could be clearly identified. the mechanism of movement protein action may be similar. Binding of ssRNAs and ssDNAs by the movement proteins of TMV and CaMV has recently been demonstrated (Citovsky et al., 1990, 1991). Interestingly, the CaMV protein showed a strong preference for R N A (Citovsky et al., 1991). Thus in both positive-strand R N A viruses and caulimoviruses, which replicate by reverse transcription of the complete RNA transcript of the viral genome, virus spread may occur in the R N A [ribonucleoprotein (RNP)] form (Citovsky & Zambryski, 1991). This is compatible with the significant sequence similarity between the movement proteins of caulimoviruses and several groups of positive-strand R N A viruses revealed in this and previous studies. The relationship identified between the movement proteins of R N A viruses, caulimoviruses and geminiviruses suggests that geminivirus proteins may bind ssDNA, and that the mechanism of movement occurring in the form of ssRNA and ssDNA may have much in common. We have searched the sequences of the movement proteins for known nucleic acid-binding motifs and compared them to the sequences of single-stranded nucleic acid-binding proteins. This analysis failed to reveal any convincing similarity. This may be not very surprising because, for example, the bacterial and phage ssDNA-binding proteins themselves show poor sequence conservation (Prasad & Chiu, 1987). Initial deletion mutagenesis analysis of the TMV 30K protein appeared to localize the RNA-binding site to an approximately 30 amino acid residue segment of this protein around the 'G' motif (Citovsky et al., 1990). However, recent detailed studies revealed a more complex situation, indicating that this site is probably formed by two non-contiguous regions of the polypeptide. Deletion of short segments encompassing either the 'G' or 'D' motif did not abolish R N A binding (V. Citovsky, personal communication). Thus, it is unlikely that any of these motifs is the primary determinant of R N A (DNA) binding by the movement proteins, although their importance for the specificity of this binding cannot be excluded. Alternatively, one or both of these motifs might be involved in some quite different activity of the movement proteins. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 Plant virus movement proteins The structural similarity between some of the plant virus movement proteins and HSP90 might imply a functional analogy. Although the functions of HSP90 are relatively poorly understood, they are believed to facilitate intracellular transport of steroid hormone receptors (Kost et al., 1989) and retroviral tyrosine protein kinases (Ziemecky et al., 1986). Association of HSP90 with actin (Koyatsu et al., 1986) and tubulin (Sanchez et al., 1988) cytoskeletal frameworks may also indicate that they are involved in intracellular protein sorting. Thus, it seems likely that HSP90, similarly to the. more thoroughly studied HSP70, possesses molecular chaperone activity. Along with intracellular protein sorting and transport, molecular chaperones mediate cotranslational protein folding (Rothman, 1989; Beckman et al., 1990) and assembly of multimeric proteins (Pelham, 1990). They have also been found in association with various RNPs (Welch, 1990). Plant virus movement is a complex process and may require any, or all, of these activities. Obviously, movement includes not only the passage of virus in the form of virions or other R(D)NPs through the plasmodesmata, but also their intracellular routing from the site of assembly to the plasmodesmata. Both stages are plausible targets for chaperone activity. One attractive possibility is that virus-encoded chaperones might be involved in promoting assembly of specific virus R(D)NPs, or of some virus-induced intraceUular structures (e.g. unusual cytoskeletal and/or membrane complexes). Various structures of this type have been described in many virus-plant systems, e.g. in caulimovirus-infected turnip cells (Linstead et al., 1988). Recently, tubular structures have been observed in comovirus-infected cells and their specific association with the 48/58K putative movement protein has been demonstrated (van Lent et al., 1990). The chaperones might also mediate disassembly of the complex structure of the native plasmodesmata. Obliteration of plasmodesmata fine structure upon virus infection has been highlighted in several reports (Kitajima & Lauritis, 1969; Weintraub et al., 1976). Functional modification of plasmodesmata, i.e. an increase in their size exclusion limit, has been documented in TMV-infected cells (Wolf et al., 1989). The relationship of the present results to those o f previous comparisons Attempts to establish relationships between the sequences of movement proteins of various viruses have been quite numerous. The present results differ from those reported in many studies in which only short sequence segments were aligned. On the other hand, the alignment of tobamo-, tobra- and caulimovirus proteins is compatible with the alignment published by Melcher 2901 (1990). The latter alignment, however, failed to highlight the conservation of the 'G' motif in caulimovirus sequences. We failed to confirm the statistical significance of the alignment of tricornavirus movement proteins with those of tobamo-, tobra- and/or caulimoviruses, despite extensive efforts to align these sequences either in the boundaries indicated by Melcher or with various shifts. Although we have confirmed the relationship between the sequences of the putative movement proteins of ACLSV and caulimoviruses reported previously (German et al., 1990), our alignment is different. The divergences probably should be attributed to the fact that neither Melcher (1990) nor German et al. (1990) attempted to delineate similar sequence segments by local similarity searches prior to generation of the alignment. Rather, these investigators attempted to align complete protein sequences, a procedure that leads quite frequently to erratic alignments. We were unable to confirm claims of similarities between proteins of some virus groups, in particular the N-terminal domains of the potyvirus polyproteins (Domier et al., 1987), and the proteins discussed here. Zimmern (1983) noticed a degree of similarity between the 'G' motif and the surrounding sequences of tobamovirus 30K proteins and the sequence of the 'LAGLIDADG' box conserved in yeast mitochondrial intronencoded proteins, the maturases. We failed to confirm the significance of this observation in statistical terms using aligned maturase sequences (E. Koonin, unpublished observations) and those of various groups of virus movement proteins, in accord with the recent analysis by Melcher (t990). However, because the maturases mediate splicing by facilitating proper R N A folding (Burke, 1988), the similarity described by Zimmern may reflect a common structural feature. Concluding remarks The above sequence comparisons suggest that caulimovirus movement proteins are relatively closely related both to HSP90 and to proteins of several virus groups. The observed pattern of sequence relationships might be indicative of 'star' evolution, with the caulimovirus movement proteins most closely resembling the hypothetical common ancestor constituting the centre of evolutionary radiation. It is tempting to speculate that this common ancestor could be a captured cellular HSP90-related gene. Of special interest is the unique gene organization in BYV. One of the putative proteins encoded by this virus genome displays highly significant sequence similarity with the HSP70 family (Agranovsky et al., 1991 b). Thus, as suggested by the present observations, the BYV Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 2902 E. V. Koonin and others genome may encode two HSP homologues in tandem, and it seems plausible to speculate that they may act in a concerted manner in virus intra- and/or intercellular movement. The HSP90-related movement proteins of other viruses may function in conjunction with cellular HSP70s. C o n c e r t e d a c t i o n by cellular H S P 7 0 a n d H S P 9 0 has been p o s t u l a t e d r e c e n t l y for r e g u l a t i o n o f steroid h o r m o n e r e c e p t o r a c t i v i t y ( K o s t et al., 1989). C o m p a r i s o n o f the two groups of m o v e m e n t p r o t e i n s identified here to the classification suggested by A t a b e k o v & T a l i a n s k y (1990) shows t h a t our f a m i l y I differs s u b s t a n t i a l l y o w i n g to the a d d i t i o n o f g e m i n i viruses a n d A C L S V , a n d the l a c k o f poty- a n d n e p o viruses; f a m i l i e s II are identical. F u r t h e r , we d i d not include in our c o m p a r i s o n s a set o f t h r e e p r o t e i n s f r o m potex-, carla-, h o r d e i - a n d furoviruses p r o p o s e d to be i n v o l v e d in virus cell-to-cell m o v e m e n t ( M o r o z o v et al., 1989; P e t t y & J a c k s o n , 1990; Beck et al., 1991) b e c a u s e n o n e o f these p r o t e i n s c o n t a i n s d e t e c t a b l e s e q u e n c e h o m o l o g y to f a m i l y I or II m o v e m e n t proteins. T h i s fact m a y be i n d i c a t i v e o f the differences in the m e c h a n i s m o f m o v e m e n t o f the triple gene b l o c k - c o n t a i n i n g a n d o t h e r p l a n t viruses. T h e r e is i n c r e a s i n g e v i d e n c e t h a t m o v e m e n t o f m a n y p l a n t viruses m a y be affected b y c e r t a i n d o m a i n s o f p r o t e i n s h a v i n g o t h e r functions in virus m u l t i p l i c a t i o n , e.g. the c a p s i d p r o t e i n s o f c o m o v i r u s e s ( W e l l i n k & v a n K a m m e n , 1989), b r o m o v i r u s e s ( S a c h e r & A h l q u i s t , 1989; A l l i s o n et al., 1990) a n d m o n o p a r t i t e g e m i n i v i r u s e s ( L a z a r o w i t z et al., 1989), a n d the b r o m o v i r u s R N A p o l y m e r a s e ( T r a y n o r et aL, 1991). I n s p e c t i o n o f the a m i n o a c i d sequences o f these d o m a i n s d i d n o t r e v e a l o b v i o u s c o u n t e r p a r t s to the m o t i f s c o n s e r v e d in the m o v e m e n t proteins. H o w e v e r , d e t a i l e d a n a l y s e s o f these sequences were n o t u n d e r t a k e n . A s w i t h a n y results o f c o m p u t e r - a s s i s t e d c o m p a r i s o n s , m u c h c a u t i o n m u s t be e x e r t e d in the f u n c t i o n a l i n t e r p r e t a t i o n o f the r e l a t i o n s h i p s b e t w e e n p l a n t virus m o v e m e n t proteins. T h i s should be stressed in view o f the fact t h a t s o m e o f the a l i g n m e n t s r e p o r t e d here were o f m o d e r a t e statistical significance. D i r e c t e x t r a p o l a t i o n o f the o b s e r v a t i o n s m a d e for one p r o t e i n (the 30K p r o t e i n o f T M V ) to o t h e r p r o t e i n s w i t h d i s t a n t l y r e l a t e d sequences c a n h a r d l y be e n c o u r a g e d . N e v e r t h e l e s s , it is o u r hope t h a t the findings p r e s e n t e d here will be helpful in d e s i g n i n g e x p e r i m e n t s to c h a r a c t e r i z e m o v e m e n t proteins biochemically. The authors are grateful to Professor J. G. Atabekov for constant interest and encouragement, to Drs V. Citovsky and A. E. Gorbalenya for useful discussions, and to Drs J. C. Carrington, M. E. Taliansky and A. V. Karasev for critical reading of the manuscript. Thanks are also due to Drs V. Citovsky, U. Melcher and T. Candresse for communicating their results prior to publication. References AGRANOVSKY,A. A., BOYKO,V. P., KARASEV,A. V., LUNINA,N. A., KOONIN,E. V. & DOLJA,V. V. (1991a). Nucleotide sequence of the T-terminal half of beet yellows closterovirus RNA genome: unique arrangement of eight virus genes. Journal of General Virology 72, 15 23. AGRANOVSKY,A. A., BOYKO,V. P., KARASEV,m. V., KOONIN,E. V. & DOLJA, V. V. (1991b). The putative 65K protein of beet yellows closterovirus is a homologue of HSP70 heat shock proteins. Journalof Molecular Biology 217, 603-610. ALLISON,R. F., JANDA,M. & AHLQUIST,P. (1989). Sequence ofcowpea chlorotic mottle virus RNA 2 and 3 and evidence of a recombination event during bromovirus evolution. Virology 172, 321-330. ALLISON,R., THOMPSON,C. & AHLQUIST,P. (1990). Regeneration of a functional RNA virus genome by recombination between deletion mutants and requirement for cowpea chlorotic mottle virus 3a and coat genes for systemic infection. Proceedings of the National Academy of Sciences, U.S.A. 87, 1820-1824. ATABEKOV,J. G. & TALIANSKY,M. E. (1990). Expression of plant virusspecific transport function by various viral genornes. Advances in Virus Research 38, 201-248. BECK, D. L., GUILFORD, P. J., VOOT, D. M., ANDERSEN, M. T. & FORSTER,R. L. S. (1991). Triple gene block proteins of white clover mosaic potexvirus are required for transport. Virology 183, 695702. BECKMAN,R. P., MIZZEN, L. A. & WELCH, W. J. (1990). Interaction of HSP70 with newly synthesized proteins: implications for protein folding and assembly. Science 38, 850-853. BOCCARA,i . , HAMILTON,W. D. O. & BAULCOMBE,n. C. (1986). The organization and interviral homologies of genes at the 3' end of tobacco rattle virus. EMBO Journal 5, 223 229. BRODSKY,L. I., DRACHE'V',A. L., TATUZOV,R. L. & CHUMAKOV,K. M. (1991). GENEBEE: a package of computer programs for biDpolymer sequence analysis. Biopolimery i Kletka 7 (in press). BROUGH, C. L., HAYES, R. J., MORGAN, A. J., COUTTS, R. H. A. & BUCK, K. W. (1988). Effect of mutagenesis in vitro on the ability of cloned tomato golden mosaic virus DNA to infect Nicotiana benthamiana plants. Journal of General Virology 69, 503514. BURKE, J. M. (1988). Molecular genetics of group I introns - RNA structures and protein factors required for splicing. A review. Gene 73, 273-294. CITOVSKY,V. & ZAMBRYSKI,P. (1991). How do plant virus nucleic acids move across plasmodesmata? BioEssays (in press). CITOVSKY,V., KNORR, D., SHUSTER,G. St. ZAMBRYSKI,P. (1990). The P30 movement protein of tobacco mosaic virus is a single-strand nucleic acid binding protein. Cell 60, 637-647. CITOVSKY,V., KNORR, n. & ZAMBRYSKI,P. (1991). Gene I, a potential cell-to-cell movement locus of cauliflower mosaic virus, encodes an RNA binding protein. Proceedings of the National Academy of Sciences, U.S.A. 88, 2476-2480. DAVIES, J. W. & STANLEY,J. (1989). Geminivirus genes and vectors. Trends in Genetics 5, 77-81. DAYHOFF,M. O., BARKER,W. C. & HUNT, L. T. (1983). Establishing homologies in protein sequences. Methods in Enzymology 91, 524-549. DEOM,C. i . , OLIVER,M. J. & BEACHY,R. N. (1987). The 30 kilodalton gene product of tobacco mosaic virus potentiates virus movement. Science 237, 389-394. DOMIER,L. L., SHAW,J. G. & RI-IOADS,R. E. (1987). Potyviral proteins share amino acid sequence homology with picorna-, como- and caulimoviral proteins. Virology 158, 20-27. DOOLITTLE,R. F. (1986). On URFs and ORFs. Mill Valley: University Science Books. ETE~AMI, P., CALLIS, R., ELLWOOD, S. & STANLEY, J. (1988). Delimitation of essential genes of cassava latent virus DNA 2. Nucleic Acids Research 16, 4811-4829. FEN6, D. F., JorrNSON, M. S. & DOOLITrLE, R. F. (1985). Aligning amino acid sequences: comparison of commonly used methods. Journal of Molecular Evolution 21, 112-125. Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55 Plant virus movement proteins GERMAN, S., CANDRESSE,T., LANNEAU,M., HUET, J. C., PERNOLET, J. C. & DUNEZ, J. (1990). Nucleotide sequence and genomic organization of apple chlorotic leaf spot virus. Virology 179, 104112. GInRAT, J.-F., GARNIER,J. & ROBSON,B. (1987). Further developments of protein secondary structure predictions using information theory~ New parameters and consideration of residue pairs. Journal of Molecular Biology 198, 425-443. GORBALENYA,A. E., BLINOV, V. M., DONCHENKO,A. P. & KOONIN, E. V. (1989). An NTP-binding motif is the most conserved sequence in a highly diverged monophyletic group of proteins involved in positive strand RNA viral replication. Journalof Molecular Evolution 28, 256-268. HASEGAWA,A., VERVER,J., SHIMADA,A., SAITO,M., GOLDBACH,R. & VAN KAMMEN, A. (1989). The complete nucleotide sequence of soybean chlorotic mottle virus DNA and the identification of a novel promoter. Nucleic Acids Research 17, 9993-10013. HULL, R. (1989). The movement of viruses in plants. Annual Review of Phytopathology 27, 213 240. HULL, R., SAEDLER, J. & LONGSTAFF, M. (1986). The sequence of carnation etched ring virus DNA: comparison with cauliflower mosaic virus and retroviruses. EMBO Journal 5, 3083-3090. KITAJIMA, E. W. & LAURITIS, J. A. (1969). Plant virions in plasmodesmata. Virology 37, 681-685. KOST, S. S., SMITH, D. F., SULLIVAN,W. P., WELCH, W. J. & TOFT, O. O. (1989). Binding of heat shock proteins to the avian progesterone receptor. Molecular and Cellular Biology 9, 3829-3838. KOYATSU,S., NISHIDA, E., KADOWAKI,T., MATSUZAKI,F., IIDA, K., HARADA, F., KASUGA, M., SAKAI, H. & YAHARA, I. (1986). Two mammalian heat shock proteins, HSP90 and HSP100, are actinbinding proteins. Proceedings of the National Academy of Sciences, U.S.A. 83, 8054-8058. LAZAROWITZ,S. G., PINDER, A. J., DARMSTEEGT,V. n. & ROGERS, S. G. (1989). Maize streak virus genes essential for systemic spread and symptom development. EMBO Journal 8, 1023-1032. LEONTOVICH, A. M., GORBALENYA,A. E. & BRODSKY,L. I. (1990). Generation of a complete local similarity map for two biopolymer sequences (DOTHELIX program of the GENEBEE package). Biopolymery i Kletka 6, 12-17. LINSTEAD,P. J., HILLS,G. J., PLASKITT,K. A., WILSON,I. G., HARKER, C. L. & MAULE,A. J. (1988). The subcellular location of the gene 1 product of cauliflower mosaic virus is consistent with a function associated with virus spread. Journal of General Virology 69, 1809 1818. MALYSHENKO, S. I., KONDAKOVA, O. A., TALIANSKY, M. E. & ATABEKOV, J. G. (1989). Plant virus transport function: complementation by helper viruses is non-specific. Journal of General Virology 70, 2751-2757. MARTINEZ-IZQUIERDO,J. A., FUTTERER,J. & HOHN, T. (1987). Protein encoded by ORF 1 of cauliflower mosaic virus is part of viral inclusion body. Virology 160, 527-530. MELCHER, U. (1990). Similarities between putative transport proteins of plant viruses. Journal of General Virology 71, 1009-1018. MESHI, T., WATANABE,J., SAITO, T., SUGIMOTO,A., MAEDA, T. & OKADA,Y. (1987). Function of the 30 kD protein of tobacco mosaic virus: involvement in cell-to-cell movement and dispensability for replication. EMBO Journal 6, 2557-2563. MESHI,T., MOTOYOSHI,F., MAEDA,T., YOSHIOWOKA,S. & WATANABE, Y. (1989). Mutations in the tobacco mosaic virus 30 kD protein gene overcome Tm-2 resistance in tomato. The Plant Cell 1, 515-522. MEYER, i . , HEMMER,O., MAYO, i . A. & FRITSCH, C. (1986). The nucleotide sequence of tobacco black ring virus RNA-2. Journal of General Virology 67, 1257-1271. MOROZOV,S. Yu., DOLJA, V. V. & ATABEKOV,J. G. (1989). Probable reassortment of genetic elements among elongated RNA-containing plant viruses. Journal of Molecular Evolution 29, 52-62. MOSES, O., GAGEY, M.-J., GODEFROY-COLBURN,T., STUSSI-GARAUD, G., ELLWART-TscHi3RTZ,M., NITSCI~O, H. & MUNDRY, K.-W. (1988). The fate of the transport protein of tobacco mosaic virus in systemic and hypersensitive tobacco hosts. Journal of General Virology 69, 1367-1373. 2903 PELHAM, H. R. B. (1990). Function of the Hsp70 protein family: an overview. In Stress Proteins in Biology and Medicine, pp. 287-289. Edited by R. I. Morimoto, A. Tissieres & G. Georgopulos. New York: Cold Spring Harbor Laboratory. PEarY, I. T. D. & JACKSON,A. O. (1990). Mutational analysis of barley stripe mosaic virus RNA ~. Virology 179, 712-718. PRASAD,B. V. V. & CHIU, W. (1987). Sequence comparison of singlestranded DNA binding proteins and its structural implications. Journal of Molecular Biology 193, 579-584. ROTHMAN,J. E. (1989). Polypeptide chain binding proteins: catalysis of protein folding and related processes in cells. Cell 59, 591-601. SACHER, R. & AHLQUIST, P. (1989). Effects of deletions in the N-terminal basic arm of brome mosaic virus coat protein on RNA packaging and systemic infection. Journalof Virology63, 4545-4552. SAITO, T., IMAI, Y., MESHI, T. & OKADA, Y. (1988). Interviral homologies of the 30K proteins of tobamoviruses. Virology 167, 653-656. SANCHEZ, E. R., REDMOND,T., SCHERRER,L. C., BRESNICK,E. H., WELCH, W. J. & PRATT, W. B. (1988). Evidence that the 90-kilodalton heat shock protein is associated with tubulincontaining complex in L cell cytosol and in intact PtK cells. Molecular Endocrinology 2, 756-760. SAVITRY,H. S. & MURTHY,M. R N. (1983). Evolutionary relationship of alfalfa mosaic virus with cucumber mosaic virus and brome mosaic virus. Journal of Bioscience 5, 183-187. STRATFORD, R. & COVEY, S. N. 0989). Segregation of cauliflower mosaic virus symptom genetic determinants. Virology 172, 451-459. TALIANSKY,M. E., MALYSHENKO,S. I., PSHENNIKOVA,E. S., KAPLAN, I. B., ULANOVA,E. F. & ATABEKOV,J. G. (1982). Plant virus-specific transport function. I. Virus genetic control required for systemic spread. Virology 122, 318 326. TOMENIUS, K., CLAI~HAM,D. & MESHI, T. (1987). Localization by immunogold cytochemistry of the virus-coded 30 kD protein in plasmodesmata of leaves infected with tobacco mosaic virus. Virology 160, 363-371. TRAYNOR,P., YOUNG,B. M. & AHLQUIST,P. (1991). Deletion analysis of brome mosaic virus 2a protein: effects on RNA replication and systemic spread. Journal of Virology 65, 2807-2815. VAN LENT, J., WELL1NK,J. & GOLDBACH,R. (1990). Evidence for the involvement of the 58K and 48K proteins in the intercellular movement of cowpea mosaic virus. Journal of General Virology 71, 219-223. WEINTRAUB,M., RAGETTI, H. W. V. & LEUNG, E. (1976). Elongated virus particles in plasmodesmata. Journalof Ultrastructure Research 56, 351-364. WELCH, W. J. (1990). The mammalian stress response: cell physiology and biochemistry of stress proteins. In Stress Proteins in Biology and Medicine, pp. 101-167. Edited by R. I. Morimoto, A. Tissieres & G. Georgopulos. New York: Cold Spring Harbor Laboratory. WELLINK, J. & VAN KAMMEN, A. (1989). Cell-to-cell transport Of cowpea mosaic virus requires both the 58K/48K proteins and the capsid proteins. Journal of General Virology 70, 2279-2286. WOLF, S., DEOM, C. M., BEACHY, R. N. & LUCAS, W. J. (1989). Movement protein of tobacco mosaic virus modifies plasmodesmatal size exclusion limit. Science 246, 377-379. XIONG, Z. & LOMMEL,S. A. (1989). The complete nucleotide sequence and genome organization of red clover necrotic mosaic virus RNA 1. Virology 171, 543-554. ZIEMECKY,A., CATELLI,M. G., JOAB,I. & MONCHARMONT,B. (1986). Association of the heat shock protein HSP90 with steroid hormone receptors and tyrosine kinase oncogene product. Biochemical and Biophysical Research Communications 138, 1298-1307. ZIMMERN, D. (1983). Homologous proteins encoded by yeast mitochondrial genome and by a group of RNA viruses from plants. Journal of Molecular Biology 171, 345-352. ZIMMERN,n. & HUNTER,T. (1983). Point mutation in the 30 kD open reading frame of TMV implicated in temperature sensitive assembly and local lesion spreading of mutant Ni2519. EMBO Journal 2, 1893-1990. (Received 22 April 1991; Accepted 30 July 1991) Downloaded from www.microbiologyresearch.org by IP: 88.99.165.207 On: Mon, 31 Jul 2017 15:03:55
© Copyright 2026 Paperzz