Volume 12 Number 5 1984 Nucleic Acids Research A duster of six tRNA genes in Drosophila mitochondrial DNA that indudes a gene for an unusual tRNA» G Y Douglas O.Clary and David R.Wolstenholme Department of Biology, University of Utah, Salt Lake City, UT 84112, USA Received 22 November 1983; Accepted 1 February 1984 ABSTRACT Genes for URF3, tRNA ala , tRNA ar 9, tRNA asn , tRNAf^J, tRNA91u, tRNAPhe, and the carboxyl terminal segment of the URF5 gene have been identified within a sequenced segment of the mtDNA molecule of Drosophila yakuba. The genes occur in the order given. The URF5 and tRNAPne genes are transcribed 1n the same direction as replication while the URF3 and remaining five tRNA genes are transcribed in the opposite direction. Considerable differences exist 1n the relative arrangement of these genes 1n Q_. yakuba and mammalian mtDNA molecules. In the tRNAjjgy gene an eleven nucleotide loop, within which secondary structure formation seems unlikely, replaces the dihydrouridine arm, and both the variable loop (six nucleotides) and the T'f'C loop (nine nucleotides) are larger than 1n any other D. yakuba tRNA gene. As available evidence is consistent with AGA codons specifying serine rather than arginine in the Drosophila mitochondria! genetic code, the possibility 1s considered that the 5'GCU anticodon of the D_. yakuba tRNAj^y gene can recognize AGA as wel1 as AGY codons. INTRODUCTION Within nucleotide sequences of the mitochondrial DNA (mtDNA) molecule of Drosophila yakuba we have identified and reported the genes for two ribosomal RNAs, twelve tRNAs and eight polypeptides (cytochrome c oxidase (CO) subunits I, II and III, ATPase subunit 6, Unidentified Reading Frames (URF) 1, 2 and 3 and A6L) (1-4). Nucleotide sequences of sections of the JJ. melanogaster mtDNA molecule, corresponding to some of the sequenced sections of the JJ_. yakuba mtDNA molecule have also been determined (1,2,5). While the genes found 1n Drosophila mtDNAs are the same as the genes found 1n a number of mammalian mtDNAs (6-8), their arrangement within the DrosophUa and mammalian mtDNA molecules differs considerably (1-5). Also, evidence obtained from comparisons of sequences of Drosophila and mammalian polypeptide genes suggests that 1n the Drosophila mitochondrial genetic code the triplet AGA, which specifies arginine in the standard genetic code, and is used only as a rare termination codon 1n some mammalian mtDNAs (6-8), may specify serine (3- © IRL Press Limited, Oxford, England. 2367 Nucleic Acids Research 5). Further, the COI gene of Drosophila mtDNA appears to be unique in that it may contain a four nucleotide initiation codon (4,5). In this paper we report a sequence of the _D_. yakuba mtDNA molecule that contains a cluster of six tRNA genes (tRNA a l a , tRNA a r 9, tRNA a s n , tRNAJjj^, tRNA9 lu and tRNAP he ) bounded by the genes for URF3 and URF5. As shown previously, the URF3 gene of JL yakuba mtDNA is one of a series of polypeptide genes (URF2, COI, COI I, URFA6L, ATPase6, COIII and URF3) which occur in the same relative order, and are transcribed in the same direction, in D_. yakuba and mammalian mtDNAs (2-4). However, relative to the URF3 gene, all of the six presently described 0_. yakuba mt-tRNA genes, as well as the URF5 gene, are found 1n different locations in the mammalian mtDNA molecule. Further, the structure of the tRNA predicted from the tRNAJ^y gene contains some unusual features both in comparison to other Drosophila mt-tRNAs, and to its mammalian counterpart. MATERIALS AND METHODS Experimental d e t a i l s regarding i s o l a t i o n of mtDNA from D_. yakuba (Stock 2371.6, Ivory C o a s t ) , preparation and i d e n t i f i c a t i o n of pBR322 and pBR325 clones of £ . yakuba mtDNAs, r e s t r i c t i o n enzyme d i g e s t i o n s , e l e c t r o p h o r e s i s , c l o n i n g of fragments i n t o M13mp8 or M13mp9 DNA, p u r i f i c a t i o n of s i n g l e stranded and double-stranded Ml3 DNAs and preparation of v i r a l DNAs c o n t a i n i n g partial d e l e t i o n s of cloned r e s t r i c t i o n fragments of mtDNA are given or r e f e r r e d to i n ( 1 , 3 ) . A l l DNA sequences were obtained from M13mp8- or M13mp9-cloned fragments by the e x t e n s i o n - d i d e o x y r i b o n u c l e o t i de t e r m i n a t i o n procedure (9) using [ a dATP (600 C1/mmol; New England Nuclear) as described ( 1 ) . P] The sequencing s t r a t e g i e s used are given 1n F i g . 1 . Computer analyses of DNA sequences were c a r r i e d out using the programs r e f e r r e d t o in (3). RESULTS AND DISCUSSION The sequence of a continuous 1629 n u c l e o t i d e section of the £ . yakuba mtDNA molecule, determined using the s t r a t e g i e s i n d i c a t e d 1n F1g. 1 , i s shown i n F1g. 2. Within the sequence are f i v e regions each of which can f o l d I n t o the c h a r a c t e r i s t i c secondary s t r u c t u r e of a tRNA, w i t h anticodons them to be the genes f o r tRNA 3). a1a ar , tRNA 9, tRNA asn A s i x t h r e g i o n , which l i e s between the tRNA , tRNA9 asn lu indicating and tRNAPhe (F1g. and tRNA g1u genes, can also f o l d i n t o a s t r u c t u r e which resembles a tRNA, but t h i s s t r u c t u r e lacks a 2368 Nucleic Acids Research O-R-* EcoRI | C H/ndlll | C | D B A IE |D|E| —' 1 H/ndlll J IECORI loo be A B ^ ~— B -J H/ndlll Figure 1. A map of the _D_. yakuba mtDNA molecule showing the relative locations of the A+T-rich region (crosshatched) the two rRNA genes (dotted), the origin (0) and direction (R) of replication, and EcoRI and Hindlll sites and fragments (A-E and A-F respectively), (see (1) for details and references). The bar under the map Indicates the segment sequenced. This segment is expanded below and the restriction sites and strategies employed to obtain the entire nucleotide sequence are shown. The origin of each sequence 1s as follows: a_, the EcoRI-E fragment. b_ and c_, the two ends of the 1.5 kb Hindi II-EcoRI subfragment of the pBR325-c!oned EcoRI-A fragment. d_, the Hindlll-E fragment which was subcloned from the cloned EcoRI-A fragment. Unlabelled sequences with arrows pointing to the right are DNasel-EcoRI deletions (cloned 1n M13mp9) of the 1.5 kb Hindi II-EcoRI subfragment of the EcoRI-A fragment. Unlabelled sequences with arrows pointing to the l e f t are TJNasel-BamHI deletions (cloned in M13mp9) of the H1ndIII-D fragment. The small vertical terminal arrows on the lower bar indicate the extent of the sequence shown in Fig. 2, and the solid arrowhead shows the 5'-3' direction of the sequence. standard dihydrouridine arm. Because 5'GCT 1s found in the anticodon position of this structure, and for other reasons discussed below, this sequence is Interpreted as being a gene for tRNA|gy. The tRNAP"e gene Is transcribed in the same direction as that in which replication proceeds around the molecule, while the remaining tRNA genes are transcribed in the opposite direction (F1g. 4). The sequence to the l e f t of the tRNAa^a gene contains an open reading frame of 351 nucleotides which would be transcribed in the same direction as the tRNAa1a gene and which has been Identified as the URF3 gene; this nucleotide sequence and the amino acid sequence predicted from i t are 52.4% and 41.9% homologous, respectively, to the corresponding sequences of the mouse URF3 gene (8). The 5' end of the D. yakuba URF3 gene is located 2369 Nucleic Acids Research . URF3 — • AK y »—»• I F S I I I I AGTATAGATAlATTTTTTCTATTATTATTATTCCTTCAGTAATCTTATTAATCACAACTCTTGTTATATTTTTAGC 75 S I L S K K A L I D R E K s S P F E C G F D P K S TTCAATTTTATCAAAAAAAGCTTTAATTGATCGAGAAAAAAGATCACCTTTTGAATGTGGATTTGACCCTAAATC 150 S S R L P F S L R F F L I T t l F L I F D V E I A TTCTTCTCGATTACCATTTTCATTACGATTTTTTTTAATCACTATTATCTTTTTAATTTTTGATGTAGAAATTGC 22 5 L I L P M I I I L K Y S N I M I U T I T S I L F I TTTAATTCTTCCTATAATTATTATTTTAAAATATTCTAATATTATAATTTGAACAATTACTTCGATTATTTTTAT 300 L I L L I G L Y H E W N Q G M L N U S N ••* TTTAATTTTATTAATTGGGCTATACCATGAATGAAATCAAGGTATATTAAATTGATCAAATTAATAAATATTTAA 37 5 |AGGCTTGTAGTTAATTATAACATTTGATTTCCATTCAAAAACTATTGAATATTCAATCTACCTTA|TATATATATA 450 tRNAar8—• '_ [ '_ ' TATATATATATATAATTfcAATATGAAGCGATTAATTGCAGTTAGTTTCGACCTAACCTTAGCTATTATATACCCT tlWA a8n — » . ' [ ' [ 525 ^ g § TATTTTtrTAATTGAAGCCAAAAAGAGGCGTATCACTGTTAATGATATAATTCAGTATAAACTCCAATTAAGlGAAG 600 [ TATGGTGATCAAGTAAAAGCTGCTAACTTTTTTCTTTTAATGGTTAAATTCCATTTATACTTCTUTTTATATAGT 67 5 TTAAAATAAAACCTTACATTTTCATTGTAATAATAAAATAATTTATTTTTATAAATTJACTATAATTAATTCACTA 750 | | '_ '_ '_ - * — "tRNAPhe ^rATTCAAAGATTAATTAATCTCCATAACATCTTCAGTCTCATACTCTAAATATAAGCTATTTGAAllATAAAAATA 1=1 * L F L 825 ATAAAAAACTAAATAAAATTATAATTCAAAATACAAATAATATTAAATAAATTTTTAAACTATTATTATGTATCA L F S F L I M I W F V F L M L Y I K L S N N H M L F L T K S T F S K V V f L N Q S G L Y K N Y L Y P L Y H F Q I G M G G F Y Y E S W G Q 900 D T ATCATATTGAACCTAAAAATAATGTTAAATTATAATTTAATAGAGATTTATTCAAAGAATATAAATTTCTCATAG W M S G L F L T L N Y N L L S K N L S Y L N s I S 1125 TTATATAAGGAAAAGCAAAAATTAACCAATTTAACATTCTACCTCCAATAATTCTTATAAATAATAAGCCTAGTA 1275 H Y P F P F I L G R L M V W S E Y V L R P S Y S W H L M S G C I I J D N L M N L S C C N V T L G T S F F Y L H F L L G L H L D G T M S Y F F S F M N I TATTTCTAATTCTAACAATTTCTAAAATTATATCCTTAGAATAAAATCCAGCTAAAAATGGTATTCCACATAAAG H l I l V I E L I M D K S Y F G A L F P M G C L A 15 0 0 CCAAATTAGAAACATTAAAACAAGCTGAAGTTAAAGGTATATGAATTCTCAACCCTCCTATTAACCGAATATCTT L N S V N F C A S T L P M H I s L G C M L R I D Q 157 5 CACAATTATTTATATTATGAATAATAGCTCCTGCACATATAAATAATAAAGCTT S N N M N H I I A G A C H F L L A -•-"URF5 2370 Nucleic Acids Research adjacent to the tRNA 9ly gene, as shown previously (Fig. 2; (2)). Alignment of the nucleotide and amino acid sequences of the URF3 genes of d_. yakuba and mouse indicate that three codons (nine nucleotides; 68-70; 104-106; 113-115, Fig. 2) of the D_. yakuba gene are absent from the mouse gene. The URF3 genes of both D_. yakuba and mouse have a TAA termination codon. In mouse this codon is separated by a single A from the 5' terminus of the tRNA ar 9 gene, but 1n 0_. yakuba 1t is separated by eleven nucleotides from the 51 terminus of the tRNA ala gene. The sequence of 811 nucleotides to the right of the tRNAP he gene contains only one open reading frame which proceeds 1n the same direction as the sense strand of the tRNAP he gene. The 270 amino acid sequence predicted from this open reading frame 1s 25.7% homologous to the 271 amino acid sequence which precedes the carboxyl terminus of the mouse URF5 gene (8). This alignment (not shown) Includes insertion/deletions of only three internal codons between the Drosophila and mouse genes, all within 108 nucleotides of the carboxyl terminus. Interpretation of this D_. yakuba sequence as the URF5 gene is strengthened by the finding that the amino a d d sequences corresponding to the first 67 codons (nucleotides 1427-1627, F1g. 2) of the d_. yakuba and mouse sequences have a homology of 55.2%. In mouse mtDNA the URF5 and URF6 genes are transcribed from opposite strands of the molecule, the carboxyl termini of the two genes overlap and each gene terminates with a standard TAA codon (8). In D_. yakuba mtDNA the last codon of the URF5 gene is separated by a single T from the 5' terminus of the tRNAP^ e gene. Presumably, as has been argued for other genes with similar arrangements in both mammalian (6,10) and DrosophUa (2-5) mtDNAs, a translation termination codon is created at the end of the D_. yakuba URF5 transcript by polyadenylation when this transcript is Figure 2. Nucleotide sequence of the segment of the 0_. yakuba mtDNA molecule Identified in Fig. 1. The sequence contains a cluster of six tRNA genes (boxed) the anticodons of which are underlined. From considerations of nucleotide and predicted amino a d d sequence homologies to mouse mtDNA (8) the sequence to the left of the tRNA genes 1s the entire URF3 gene, and the sequence to the right of the tRNA genes is the carboxyl terminal segment of the URF5 gene. The nucleotide sequence shown is the (5'-3') sense strand of all of the genes except tRNApfie and URF5, and the arrows Indicate the direction of transcription of each gene. Asterisks indicate partial or complete termination codons. In the amino a d d sequences a lower case letter s indicates the tentative assignment of serine to an AGA codon (see text). The line above nucleotides 463 and 464 indicates the lack of a TA dinucleotide 1n one of the cloned fragments of this region of the sequence. The sequence containing the amino terminal 88 nucleotides of the URF3 gene and the entire tRNA g l y gene are published 1n (2). 2371 Nucleic Acids Research 5' A • I G • T G - C G - C T - A r T G - C T A T A A C T A T T G A I T G A A G C G I T G C C C A T A G G T A T C C G C T C A G A G T G G C • T - A T « T - A • T T - A C-G t M A " A - T T G - C A - T A - 7 G - C T - A T- A [j! C I [ I C I I T G A A A C T T A A A A T T I A C I T C C G A G I T A A G A T T ' T - A • i I A - T C - G A - T A - I W* - T A - I A - T S- C Figure 3. Six tRNA genes of U_. yakuba mtDNA shown i n the presumed secondary s t r u c t u r e of the corresponding" tRNAs. Also I n d i c a t e d 1n the diagram of the tRNAJigY gene are possible secondary i n t e r a c t i o n s between nucleotides 1n the d1hydrouridine replacement loop and the v a r i a b l e loop (dotted l i n e s ) , and other possible t e r t i a r y I n t e r a c t i o n s ( s o l i d l i n e s ) proposed f o r tRNA^gy from bovine and other mammalian mitochondria (23; see t e x t ) . The numbering shown i s s p e c i f i c f o r t h i s tRNA gene. excised from a multicistronic primary transcript. Intervening nucleotides are not found between the central four tRNA genes (arg, asn, ser(AGY) and glu) of the six tRNA gene cluster (Fig. 2 ) , nor do any of these genes overlap. However, between the tRNA a l a and tRNA a s n genes are 27 nucleotides which include a twelve unit TA repeat (F1g. 2 ) . In a second sequence of this region which was derived from a different mtDNA molecule, the 2372 Nucleic Acids Research Figure 4. The arrangement of genes determined to date in the circular mtDNA molecule of £. yakuba, from the results of various studies (for references see 1-4) and the present data. Each tRNA gene is Identified by the one letter ami no acid code. Arrows within and outside the molecule indicate the direction of transcription of each gene. Wavy lines indicate uncertain gene boundaries. 0 and R indicate the origin and direction of replication respectively. sequence between tRNA a and tRNA a s n genes included only an eleven unit TA repeat. Whether this difference reflects heterogeneity 1n the size of the Intervening sequence between the tRNA a 1 a and tRNA a s n genes 1n different mtDNA molecules, either within or between flies, or 1s the product of a replication error or of recombination during clone amplification, remains unknown. Between tRNA9 l u and tRNAP n e there are 18 nucleotides, the only notable feature of which is a direct tetranucleotide repeat at each end (F1g. 2 ) . We have shown previously that the genes for URF2, COI, COII, URFA6L, ATPase6, CO III and URF3 are found in the same relative order in D. yakuba and mammalian mtDNAs, but that the tRNA genes which separate some of these polypeptide genes differ between the two mtDNAs. All of the six tRNA genes reported here as well as URF5 are in different relative locations in the d_. yakuba and mammalian mtDNA molecules. In mammalian mtDNA the tRNA ar 9 gene is adjacent to the URF3 gene; the tRNA a l a and tRNA a s n genes are adjacent to each other and located between the tRNA tr P and tRNA c > s genes; the tRNAJj^ gene lies between the tRNA n 1 s and tRNA^gJJ genes (which are located between the URF4 and URF5 genes); the tRNA p h e gene 1s located between the replication origincontaining region and the large rRNA gene; and the URF5 gene Is located between the tRNA^Jj and URF6 genes. Also, of these six tRNA genes and the URF5 gene, only the tRNA a r g and tRNAj^f genes are transcribed 1n the same direction in D_. yakuba and mammalian mtDNAs, relative to both the direction of replication and to the seven polypeptide genes listed above. The relative locations, and directions of transcription of all of the JD. yakuba mitochondrial genes which have been determined to date are shown in 2373 Nucleic Acids Research F1g. 4. The d i r e c t i o n s of t r a n s c r i p t i o n of the newly mapped s i x tRNA genes and of the URF5 gene add t o the trend which we have noted e a r l i e r ( 3 ) ; genes In the approximate h a l f of the n_. yakuba mtPNA molecule which l i e s to the left of the A+T-r1ch region (F1g. 4) are t r a n s c r i b e d from one strand of the molecule, w h i l e genes in the other h a l f of the molecule are t r a n s c r i b e d from the other strand (F1g. 4 ) . The only exceptions t o t h i s g e n e r a l i z a t i o n so f a r are three tRNA genes ( g i n , cys, t y r ) to the l e f t of the A+T-rich r e g i o n , and one tRNA gene ( s e r , UCN) t o the r i g h t . This c o n t r a s t s w i t h what i s found i n mammalian mtDNA, where a l l genes except the URF6 gene and e i g h t tRNA genes are t r a n s c r i b e d from one strand (the H strand) of the molecule ( 6 - 8 ) . Transfer RNA genes. The t R N A a l a , tRNA a r 9, tRNA a s n , tRNA9 l u and tRNAPne genes of D_. yakuba mtDNA are 62%, 32%, 56%, 54% and 56% homologous, r e s p e c t i v e l y , t o the corresponding tRNA genes of mouse mtDNA ( 8 ) . The value f o r tRNA ar 9 1s the lowest f o r any of the 18 JJ_. yakuba-mouse tRNA gene homology comparisons made so f a r . However, the £ . yakuba mt-tRNA a r 9 gene 1s 84% homologous t o the r e c e n t l y published sequence of tRNA ar 9 from mitochondria of mosquito, Aedes a l b o p i c t u s (11). The presently reported f i v e 0_. yakuba mt- tRNA genes are s i m i l a r t o the other twelve D_. yakuba mt-tRNA genes we have described p r e v i o u s l y (1-4) in regard t o t h e i r major secondary characteristics (Fig. 3). structural A l s o , each of these tRNA genes lacks some of the nucleotides which are constant i n p r o k a r y o t i c and non-organel le eukaryotic tRNAs. However, i n each of the t R N A a l a , tRNA a r 9, tRNA a s n and tRNAPne genes of _D_. yakuba mtDNA, as 1n a l l other D. yakuba mt-tRNA genes, the constant Pu£g, T33 and PU37 n u c l e o t i d e s are maintained (numbering system 1n 12). In the tRNA9' u gene T33 and PU37 are a l s o found, but n u c l e o t i d e 26 i s a p y r i m i d i n e (C) r a t h e r than a p u r i n e . The tRNA^ gene 1s again unusual i n t h a t I t s G+C content 1s only 9%, compared to a range of 24% t o 31% f o r the remaining f i v e newly described mt-tRNA genes. Among the other 0_. yakuba tRNA genes sequenced, only the tRNAasP gene has a s i m i l a r low (9%) G+C c o n t e n t . In both the tRNA9 1u and tRNA as P genes both the am1no-acyl and the T^C stems lack G-C pairs ( 3 ) . Among stems of other £ . yakuba mt-tRNA genes G-C p a i r s are absent only 1n the am1no-acyl stem of tRNA9 l n . Within the tRNA9 l u gene s t a b i l i t y of secondary s t r u c t u r e may be aided by the absence of unusual nucleotide pairs w i t h i n stems ( 2 ) , and by the stacking e f f e c t expected from runs of A's and T's ( F i g . 3 ) , as has been noted f o r the amino-acyl stems of the tRNAasP and tRNA91n genes ( 3 ) . As mentioned above, we have I n t e r p r e t e d the sequence between the tRNA asn and tRNA9' u genes as a tRNAj^y gene. 2374 This sequence can be folded i n t o a Nucleic Acids Research structure (F1g. 3) which has an am1no-acyl stem and an anticodon stem with the same number of nucleotide pairs (seven and f i v e , respectively), and an anticodon loop with the same number of nucleotides (seven) as are found in a l l other D_. yakuba mt-tRNA genes, and almost a l l other known tRNAs (12,13). The T'^C arm could comprise either a stem of five nucleotide pairs (other 0_. yakuba mt-tRNA genes contain four or five) and a loop of nine nucleotides (other V_. yakuba mt-tRNA genes contain between three and e i g h t ) , or a six nucleotide pair stem and a seven nucleotide loop. The variable loop of six nucleotides 1s larger than the variable loops (four or five nucleotides) of other d_. yakuba, other insect, and mammalian mt-tRNAs (2,5,11-14), but is quite small compared to the variable loops of some fungal mt-tRNAs (15,16), prokaryotic and non-organelle eukaryotic tRNAs (12,13). Eleven nucleotides occur between the last nucleotide of the amino-acyl stem (nucleotide 7, numbering system shown in Fig. 3) and the f i r s t nucleotide of the anticodon stem (nucleotide 19), but these nucleotides cannot be arranged to form a structure resembling the usual dihydrouridine arm of other tRNAs. Nucleotides T^g, GJJ and A ^ are complementary to T ^ . C ^ and Ajg, as might be expected i f they were the conserved dihydrouMdine stem; however, i f this is the case then two of the A j 6 , Gj7 and T^g nucleotides must have been added during the evolution of this gene from a gene coding a tRNA with a standard cloverleaf configuration. The three nucleotides 1n the anticodon position, 5'GCT, would be expected to recognize the t r i p l e t s AGU and AGC, which in a l l genetic codes known specify serine. Recently, the sequence of an mt-tRNA isolated from mosquito tissue culture cells has been obtained (D. T. Dubin, personal communcation) which is 91% homologous to the £ . yakuba t R N A ^ gene sequence, and can be folded Into a structure similar to that shown 1n Fig. 3 for the d_. yakuba tRNA^y gene sequence. In each of the mt-tRNAJjgy genes of nine different mammals (hamster (17), bovine (18,19), human (19), mouse (8,20), rat (21), g o r i l l a , chimpanzee, orangutan and gibbon (22)) the d1 hydrouridine arm 1s replaced by a loop of only five nucleotides. The variable loop has a maximum of five nucleotides, the IK stem comprises five nucleotide pairs and the TK loop 1s always eight or nine nucleotides. Based on information concerning t e r t i a r y structure obtained from experiments Involving chemical probing, de Bruijn and Klug (23) have derived a structural model for bovine tRNA^Qy. This model 1s similar 1n overall shape to the crystal structure of yeast tRNAPhe (24-26) but is smaller, and Involves a unique set of t e r t i a r y Interactions. The l a t t e r include interactions of a l l five nucleotides in the d1hydrour1d1ne arm 2375 Nucleic Acids Research replacement l o o p , two w i t h nucleotides i n the v a r i a b l e loop which e f f e c t i v e l y extends the anticodon stem, and t h r e e w i t h nucleotides in the T^C l o o p . The s t r u c t u r a l model proposed f o r bovine tRNAjjjjy 1s also compatible w i t h the tRNAs p r e d i c t e d from the other e i g h t described mammalian tRNA|gy genes, 1f some A-C base p a i r i n g i s permitted ( 2 3 ) . In the D_. yakuba tRNA p r e d i c t e d from the tRNAJjgy gene (F1g. 3 ) , A^g, A^g, and G ^ i n the di hydrouridi ne arm replacement loop could pair w i t h C33, l^g and T^Q i n the v a r i a b l e loop. A l s o , the three t e r t i a r y i n t e r a c t i o n s between nucleotides 1n the di hydrouridi ne arm replacement loop and the 1>C loop thought t o occur in the mammalian tRNA|gy (23) might be maintained (Gg-T 5 4 ; G g " 1 ^ ' i n t e r a c t i o n between T^g and Ag n . T 10'A50' F1 9- 3 ' as wel 1 as The s t r u c t u r a l model presented f o r mt- tRNA|gy from bovine and other mammals includes an I n t r a l o o p U-A p a i r found 1n yeast tRNAPne (24,25)) formed between the f i r s t and f i f t h of the T+C loop. (also nucleotides Such an I n t e r a c t i o n i n the n_. yakuba t R N A ^ 9 e n e ( F 1 9 - 3) seems more l i k e l y between the second and s i x t h nucleotides (T48-A52) of the T^C loop than between G47 and A g j . Codon usage and the genetic code. The frequency of codons ending i n A or T i n the URF3 gene (93.2%) and i n the carboxyl terminal segment of the URF5 gene (91.9%) are w i t h i n the range of values (91.6% to 98.1%) found f o r the other s i x polypeptide genes of 0_. yakuba mtDNA which have been analysed (3,4). The occurrence of the l e u d ne-sped f y i ng codon CTG (nucleotides 899- 9 0 1 , F1g. 2) 1n the URF5 gene leaves only four expected sense codons (CAG, GAG, CGC, and AGG) unrecorded among the seven completed polypeptide gene sequences and the p a r t i a l URF1 and URF5 sequences ( 1 - 4 ) . The G+C content of the URF3 gene (20.6%) and the p a r t i a l URF5 gene (21.9%) are s l i g h t l y higher than the G+C contents of the URF2 and URFA6L genes (18.6% and 17.2%, r e s p e c t i v e l y ) , but s t i l l d i s t i n c t l y lower than the range of G+C contents (24.2% to 30.1%) found f o r the D. yakuba COI, COM, COIII and ATPase6 genes ( 3 , 4 ) . As i n a l l other 0_. yakuba genes f o r which sequences have been o b t a i n e d , I n t e r n a l TGA and I n t e r n a l AGA codons are found 1n both the URF3 gene and the p a r t i a l URF5 gene (F1g. 2 ) . I t appears t h a t , as 1n fungal and mammalian mtDNAs ( 8 , 2 7 - 2 9 ) , TGA codons specify tryptophan 1n the Drosophila mltochondrial genetic code ( 1 , 3 - 5 ) . While two out of three of the TGAs of the JJ_. yakuba URF3 sequence correspond t o tryptophan-speci f y i n g codons (TGAs) i n the mouse URF3 sequence, none o f the three TGA codons of the £ . yakuba URF5 sequence correspond to t r y p t o p h a n - s p e d f y i n g codons i n the mouse URF5 sequence. 2376 The d i f f e r e n c e 1n conservation of TGA codons between the URF3 and Nucleic Acids Research URF5 genes of D_. yakuba and mouse correlates with differences 1n overall nucleotide sequence homology between these genes (52.4% and 39.7%, respectively), as noted previously for other D_. yakuba-mouse mitochondrial gene comparisons (4). I t has been argued from comparisons of mtDNA sequences of Drosophila and mammals (3-5), and from considerations of the characteristics of mitochondrial genetic codes (3), that AGA may specify seMne rather than arginine in the DrosophUa mitochondrial genetic code. The D. yakuba URF3 and URF5 genes contain one and seven AGA codons respectively, which correspond in position to codons specifying serine, asparagine (two), phenylalanine, alanine, threonine, histidine and cysteine in the mouse URF3 and URF5 genes. A similar low correspondence of serine-specifying codons to AGA codons has been found previously for other mammal -DrosophUa comparisons Involving mitochondrial genes with relatively low overall nucleotide sequence homologies (3-5). As J). yakuba mtDNA contains a gene for a tRNA with a 5'GCU anticodon which is expected to recognize only AGY codons, i t remains unclear as to how AGA codons are decoded. Two p o s s i b i l i t i e s seem worth considering: The D_. yakuba mtDNA molecule may contain one extra tRNA gene with a TCT anticodon (1) which is specific for AGA (and possibly AGG). I f this 1s the case then D_. yakuba mtDNA would contain three tRNAs to decode serine, a situation which is Inconsistent with the otherwise overall economy of structure and organization of this mtDNA molecule. The alternative possibility 1s that there is stable pairing between the GCU anticodon and each of the three AGC, AGU and AGA codons. This would necessitate either selective two-out-of-three nucleotide pair recognition (30,31), or that the G In the wobble position of the anticodon can effectively pair with C, U or A. As AGG codons have not been found in DrosophUa gene sequences so f a r , there Is no need to include the prediction of G-G pairing. Other cases of unusual base pairing of the nucleotide 1n the wobble position of the anticodon have been Indicated for mtDNAs. In mammalian mtDNAs the 5'CAU anticodon of tRNAf~met appears to be able to recognize as methionine-specifying codons a l l four AUN codons when they occur as I n i t i a t o r codons, and AUG and AUA when they occur Internally. A similar situation 1s Indicated in DrosophUa mtDNA, except that evidence for only AUG and AUU i n i t i a t i o n codons (and one possible special case of an AUA I n i t i a t i o n codon (4,5)) has been found so far (1-5). Also, 1n DrosophUa mtDNA, while genes contain both AAA and AAG codons, only a tRNA^ s gene with a 5'CTT anticodon has been located (3,5). I t seems reasonable, therefore, to give serious consideration to the view that in Drosophila mitochondria, the 2377 Nucleic Acids Research GCU anticodon can recognize AGA as w e l l as AGY codons. ACKNOWLEDGEMENTS We w i s h t o thank Donald T. Dubin f o r sending us data on mosquito tRNAs p r i o r t o p u b l i c a t i o n , and F e l i x Machuca f o r e x c e l l e n t t e c h n i c a l a s s i s t a n c e . T h i s work was supported by National I n s t i t u t e s of H e a l t h Grant No. GM 18375. REFERENCES 1 . C l a r y , D . O . , Goddard , J . M . , M a r t i n , S . C . , Fauron ,C.M.-R. and D.R. (1982) N u c l . A d d s Res. 10, 6619-6637. 2. C l a r y , D . O . , W a h l e i t h n e r ,J .A. and Wolstenholme,D.R. (1983) 1 1 , 2411-2425. 3. C l a r y , D . O . and Wolstenholme,D.R. (1983a) N u c l . Acids Res. 4 . C l a r y , D . O . and Wolstenholme.D.R. (1983b) N u c l . Acids Res. 5. de B r u i j n . M . H . L . (1983) Nature 304, 2 3 4 - 2 4 1 . Wolstenholme, N u c l . Acids Res. 1 1 , 4211-4227. 1 1 , 6859-6872. 6. Anderson,S., Bankier ,A.T., Barrell,B.G., de Bruijn .M.H.L., Coul son,A.R., Drouin.J., Eperon.I.C., Nierlich.D.P., Roe.B.A., Sanger.F., Schreier, P.H., Smith.A.J.H., Staden.R. and Young,I.G. (1981) Nature 290, 457-465. 7. Anderson,S., de Bru1 jn ,M.H.L., Coulson ,A.R., Eperon.I.C., Sanger.F. and Young,I.G. (1982) J. Mol. B1ol. 156, 683-717. 8. Bibb.M.J., Van Etten.R.A., Wright,C.T., Walberg.M.W. and Clayton,D.A. (1981) Cell 26, 167-180. 9. Sanger.F., N i c k l e n . S . and Coulson.A.R. (1977) Proc. Natl . Acad. S c i . USA 74, 5463-5467. 10. O j a l a . D . , Montoya , J . and A t t a r d i , G . (1981) Nature 290, 470-474. 1 1 . HsuChen.C.C, Cleaves,G.R. and Dubin,D.T. (1983) Plasmid 10, 55-65. 12. Gauss,D.H. and S p r i n z l . M . (1983a) N u c l . A d d s Res. 1 1 , r l - r 5 3 . 13. Gauss,D.H. and S p r i n z l . M . (1983b) N u c l . Acids Res. 1 1 , r 5 5 - r l O 3 . 14. HsuChen.C.C., Cleaves,G.R. and Dubin,D.T. (1984) N u c l . A d d s . Res. i n press. 15. K o c h e l . H . G . , Lazarus,C.M., Basak.N. and Kuntzel ,H. (1981) Cell 23, 625633. 16. N e t z k e r . R . , Kochel.H.G., Basak.N. and Kuntzel ,H. (1982) N u c l . Adds Res. 10, 4783-4794. 17. B a e r . R . J . and Dubin.D.T. (1980) N u c l . Acids Res. 8 , 3603-3610. 18. A r c a r i . P . and Brownlee.G.G. (1980) N u c l . Acids Res. 8 , 5207-5212. 19. de B r u i j n , M . H . L . , S c h r e i e r , P . H . , E p e r o n . I . C . , Barrel 1,B.G.. Chen.E.Y., Armstrong,P.W., Wong.J.F.H. and Roe.B.A. (1980) N u c l . Acids Res. 8, 52135222. 20. Van E t t e n . R . A . , Michael , N . L . , B i b b . M . J . , Brennicke.A. and Clayton,D.A. (1982) 1n M i t o c h o n d r i a l Genes, Slonimski , P . , B o r s t . P . and A t t a r d i . G . , Eds., pp. 73-88, Cold Spring Harbor Laboratory. 2 1 . Grosskopf.R. and Feldmann.H. (1981) C u r r . Gen. 4 , 191-196. 22. Brown,W.M., P r a g e r . E . M . , Wang,A. and Wilson.A.C. (1982) J . Mol. E v o l . 18, 225-239. 23. de B r u i j n , M . H . L . and Klug.A. (1983) EMBO J . 2, 1309-1321. 24. K1m,S.-H., S u d d a t h . F . L . , Q u i g l e y . G . J . , McPherson.A., Sussman.J.C., Wang,A., Seeman.N.C. and R1ch,A. (1974) Science 185, 435-440. 25. R o b e r t u s . J . D . , L a d n e r . J . E . , F i n c h , J . T . , Rhodes,D., Brown,R.S., C l a r k , B . F . C . and Klug.A. (1974) Nature 250, 546-551. 26. L a d n e r . J . E . , J a c k , A . , R o b e r t u s . J . D . , Brown,R.S., Rhodes,D., C l a r k , B . F . C . and Klug.A. (1975) Proc. N a t l . Acad. Sc1. USA 7 2 , 4414-4418. 2378 Nucleic Acids Research 27. Heckman.J.E., S a r n o f f . J . , Alzner-DeWeerd.B., Y1n,S. and RajBhandary,U.L. (1980) Proc. N a t l . Acad. S c i . USA 77, 3159-3163. 28. B o n i t z . S . G . , B e r l a n i . R . , C o r u z z i . G . , L1 , M . , M a d n o . G . , Nobrega.F.G., Nobrega.M.P., T h a l e n f e l d . B . E . and T z a g o l o f f , A . (1980) Proc. N a t l . Acad. S d . USA 7 7 , 3167-3170. 29. B a r r e l I . B . G . , B a n k i e r . A . T . , de Bru1jn . M . H . L . , Chen.E., Coulson , A . R . , D r o u i n . J . , Eperon,1.C., N i e r l i c h . D . P . , Roe.B.A., Sanger.F., S c h r e i e r , P . H . , S m i t h . A . J . H . , Staden.R. and Young,I.G. (1980) Proc. N a t l . Acad. S c i . USA 77, 3164-3166. 30. L a g e r v i s t , U. (1978) Proc. N a t l . Acad Sc1. USA 75, 1759-1762. 3 1 . L a g e r v i s t , U. (1981) Cell 2 3 , 305-306. 2379 Nucleic Acids Research
© Copyright 2024 Paperzz