as a PDF

Article No. jmbi.1999.3080 available online at http://www.idealibrary.com on
J. Mol. Biol. (1999) 292, 467±483
COMMUNICATION
Singly and Bifurcated Hydrogen-bonded Base-pairs in
tRNA Anticodon Hairpins and Ribozymes
Pascal Auffinger and Eric Westhof*
ModeÂlisations et Simulations
des Acides NucleÂiques, UPR
9002, Institut de Biologie
MoleÂculaire et Cellulaire du
CNRS, 15 rue Rene Descartes
67084 Strasbourg Cedex
France
The tRNA anticodon loops always comprise seven nucleotides and is
involved in many recognition processes with proteins and RNA fragments. We have investigated the nature and the possible interactions
between the ®rst (32) and last (38) residues of the loop on the basis of
the available sequences and crystal structures. The data demonstrate the
conservation of a bifurcated hydrogen bond interaction between residues
32 and 38, located at the stem/loop junction. This interaction leads to the
formation of a non-canonical base-pair which is preserved in the known
crystal structures of tRNA/synthetase complexes. Among the tRNA and
tDNA sequences, 93 % of the 32 38 oppositions can be assigned to two
families of isosteric base-pairs, one with a large (86 %) and the other with
a much smaller (7 %) population. The remainder (7 %) of the oppositions
have been assigned to a third family due to the lack of evidence for
assigning them into the ®rst two sets. In all families, the Y32 R38 basepairs are not isosteric upon reversal (like the sheared G A or wobble
G U pairs), explaining the strong conservation of a pyrimidine at position 32. Thus, the 32 38 interaction extends the sequence signature of the
anticodon loop beyond the conserved U-turn at position 33 and the
usually modi®ed purine at position 37. A comparison with other loops
containing both a singly hydrogen-bonded base-pair and a U-turn
suggests that the 32 38 pair could be involved in the formation of a base
triple with a residue in a ribosomal RNA component. It is also observed
that two crystal structures of ribozymes (hammerhead and leadzyme)
present similar base-pairs at the cleavage site.
# 1999 Academic Press
*Corresponding author
Keywords: Nucleic acid conformation; tRNA; anticodon; leadzyme;
hammerhead
Introduction
Hydrogen bonds between bases constitute the
cement of nucleic acid structures. Among nucleic
acid base-pairs, the well-known Watson-Crick GC
and A U pairs involve three and two hydrogen
bonds, respectively. Besides these regular WatsonCrick interactions, a large array of possible basepair interactions involving two hydrogen bonds
have been enumerated and many of them have
been observed in crystal structures (Tinoco, 1993;
Dirheimer et al., 1995; Leontis & Westhof, 1999a).
Such non-Watson-Crick interactions subtend the
great architectural diversity of nucleic acids, and
especially of RNA molecules where they are recurrently observed. Further, non-Watson-Crick pairs
E-mail address of the corresponding author:
[email protected]
0022-2836/99/380467±17 $30.00/0
contribute to the structural ¯exibility of these
molecules, since they are often strongly context
dependent. A less recognized class of base-pairs
comprises those in which the two bases are linked
by a single or a bifurcated hydrogen bond.
In transfer RNAs, the anticodon loop plays a
central role in processes associated with protein
synthesis. On the one hand, it must interact speci®cally with its cognate synthetase, and on the other
hand, it must ®t in the common tRNA binding site
of the ribosome for codon recognition on the messenger RNA. Over the years, rather extensive databases at the tRNA level (546 sequences) or at the
tDNA level (2726 sequences) have been compiled
(Sprinzl et al., 1998). Several years ago, it was
noticed that the ®rst (32) and last (38) residues of
the seven-membered anticodon loop are involved
in a bifurcated hydrogen bond contact (Quigley &
# 1999 Academic Press
468
Singly and Bifurcated Hydrogen-bonded Base-pairs
Figure 1. Secondary structures of tRNA anticodon hairpins for which crystal structures are accessible in the NDB.
The 32 38 base-pairs are shown in bold.
Rich, 1976; Westhof et al., 1985). Later, multiple
molecular dynamics simulations performed with
appropriate electrostatic treatment showed a surprising stability for this 32 38 contact (Auf®nger &
Westhof, 1996, 1998a; Auf®nger et al., 1999). Still
more surprising, in the crystal structure of the
speci®c complexes between tRNAs and their cognate synthetase (Rould et al., 1991; Ruff et al., 1991;
Cusack et al., 1996a, 1998), similar contacts between
residues 32 and 38 occur despite a severe unfolding of the rest of the anticodon loop. Therefore, we
decided to investigate the available crystal structures of tRNAs and their complexes with tRNAsynthetases, in the light of their base distribution in
the tRNA sequence database (Sprinzl et al., 1998),
in order to rationalize the possible interactions
between residues 32 and 38 (Figure 1).
Results
Crystallographic structures of free tRNAs
In all tRNA crystal structures present in the
NDB database (see Table 1), the ®rst (residue 32)
and last (residue 38) bases of the anticodon loop
(Figure 1), are linked through a bifurcated hydrogen bond and form a ``pseudo'' base-pair
(Figure 2). The only exception is the trna03 structure (Brown et al., 1985) obtained at low pH in the
presence of Pb2‡ (in trna03, a Pb2‡ is intercalated
between C32 and the hypermodi®ed Y37 base,
possibly preventing the formation of a 32 38 interaction). This particular anticodon loop conformation is probably not adopted by the tRNA
molecule under physiological conditions and will
not be further discussed.
In tRNAAsp, a 32C38 and, in tRNAPhe, a
C32 A38 base-pair are present ( stands for a
pseudouridine where the C5-H group is replaced
by a N1-H group and the C10 -N1 link between the
base and the sugar is replaced by a C10 -C5 bond;
see Figure 2). The 32 and 38 bases are linked by a
bifurcated hydrogen bond established between a
carbonyl oxygen atom and an amino group, i.e.
and
O4(32) . . . N4(C38)
in
tRNAAsp
Phe
O2(C32) . . . N6(A38) in tRNA . Interestingly, the
carbonyl and amino groups as well as the glycosidic bonds of both base-pairs superimpose very
well. Thus, although a signi®cant out-of-plane
deviation is observed (Figure 2), these 32 38 pairs
can be considered as isosteric (Figure 2). It is noteworthy that the observed out-of-plane deviation
does not affect the interaction scheme of the two
bases or the inter-base C10 (32) . . . C10 (38) distance,
Ê as in Watson-Crick pairs. Yet, the
close to 11 A
deviation of these isosteric pairs from a WatsonCrick arrangement is clearly illustrated by the
Ê ) between the C10 atoms of
large distance (7 A
residue 38 in the bifurcated hydrogen bonded
32 38 pairs and in the corresponding Watson-Crick
arrangement (Figure 2). Interestingly, a U C pair
identical with that observed in yeast tRNAAsp has
been observed in the crystal structure of the alphasarcin loop of 23 S rRNA (Correll et al., 1998). A
different situation is observed in the crystal structure of yeast tRNAMet
(Basavappa & Sigler, 1991),
i
where the C32 A38 pair (Figure 3) is not isosteric
to the 32 38 pairs described above. The authors
reported that the resolution of the anticodon hairpin is very low. Hence, this interaction scheme is
probably of poor structural relevance and will not
be further discussed. This view is also supported
by recent NMR studies of two transcripts of yeast
and Escherichia coli tRNAMet
anticodon
tRNAMet
i
m
hairpins (Schweisguth & Moore, 1997) from which
it was concluded that the observed non-canonical
C32 A38 NMR base-pairing are much closer to
those observed in yeast tRNAPhe crystal structures
structhan the one observed in the yeast tRNAMet
i
ture (Schweisguth & Moore, 1997).
A recent crystal structure of the complex
between E. coli tRNACys and the translation
elongation factor EF-Tu reveals the arrangement of
a 32 A38 base-pair (pr0004; Nissen et al., 1999).
Similarly to what is observed in the crystal structure of the complex of tRNAPhe with EF-Tu
(ptr012; Nissen et al., 1995), the anticodon loop of
tRNACys adopts a conformation close to that seen
in the crystal structures of uncomplexed tRNAs
with a non-Watson-Crick 32 A38 pair. Again the
A pair is isosteric to the C pair observed in
Singly and Bifurcated Hydrogen-bonded Base-pairs
469
Figure 2. The 32 38 base-pairs observed in crystal structures of uncomplexed tRNAs (see Table 1). (a) Average
structures of the 32 C38 and C32 A38 pairs extracted from the crystal structures of uncomplexed tRNA molecules,
including also the structure of the yeast tRNAPhe/RS complex (ptr012) where the anticodon loop does not interact
with the protein. The individual structures used for the averages are drawn with small lines. (b) Superposition (left)
of the 32 C38 (yellow) and C32 A38 (red) average structures showing the isostericity of these base-pairs in the
tRNA anticodon loop structural context. For comparitive purposes, a regular A U Watson-Crick pair is also drawn.
Orthogonal view (right) showing the observed out-of-plane deviation of these pairs. In all these Figures, the glycosidic bonds of nucleotides at position 32 have been superimposed.
yeast tRNAAsp and to the C A pair observed in
yeast tRNAPhe (Figure 2). Interestingly, in both
tRNACys and tRNAPhe (trna06) structures, a water
molecule is seen in the vicinity of the N7 and N6H groups of A38. In trna09, there is a comparable
water molecule which belongs, however, to the
coordination sphere of a Mg2‡.
Hence, from the available tRNA crystal structures where the anticodon hairpin keeps its folded
structure, it can be deduced that a set of non-canonical but isosteric base-pairs involving a bifurcated
hydrogen bond between a carbonyl oxygen atom
and an amino group is recurrently present at the
junction between the stem and the loop of the
anticodon hairpin. On a thermodynamic basis,
base-pairs involving a single or bifurcated hydrogen bond should be more labile compared to those
involving two or three hydrogen bonds. However,
as described below, formation of speci®c tRNA/
synthetase complexes, which involve the splaying
out toward the exterior of ®ve (33 to 37) of the
bases of the loop, do not disrupt the direct hydrogen bonding contacts established between bases 32
and 38.
Crystallographic structures of tRNA/
synthetase complexes
The coordinates of several crystal structures of
tRNAs complexed with their cognate synthetase
are available (Table 1). Upon formation of the
tRNA/synthetase complexes, slight conformational
rearrangements of the 3238 base-pair are
observed.
32 C38 pairs. Two structures of the complex of
the wild-type yeast tRNAAsp, including all the
modi®ed bases with its cognate synthetase
(tRNAAsp/RS) are available (Table 1). In these
structures, the 32 C38 pair adopts a conformation different from that observed in the uncomplexed tRNA structures (Figure 4). The C38 amino
group forms a single hydrogen bond rather than a
bifurcated hydrogen bond with the O4(32) atom.
Further, a signi®cant out-of-plane orientation of
base C38 is observed. These arrangements appear
less stable than those formed in uncomplexed
tRNAs and are, possibly, stabilized by additional
protein/tRNA interactions.
U32 U38 and Um32 38 pairs. No structure has
been solved of an uncomplexed tRNA having a
U32 U38 pair. However, in the structures of the
transcripts of E. coli tRNAGln/RS complex (Table 1),
a U32 U38 pair is observed (Figure 4). Again, a
single hydrogen bond links the two bases. Note
that this base-pair is approximately isosteric to a
Watson-Crick pair (see the bottom drawing in
right-hand column of Figure 6). Thus, its geometry
470
Singly and Bifurcated Hydrogen-bonded Base-pairs
Table 1. Description of the crystal structures of complexed and uncomplexed tRNA molecules referenced in the NDB
(Berman et al., 1992) before the 1st August 1999
NDB codea
Free tRNAs
trna01
trna02
trna03
trna04
trna05
trna06
trna07
trna08
trna09
trna10
trna12
tRNA/EF-TU complexes
pr0004
ptr012
tRNA/RS complexes
pte001
pte002
pte003
ptr001
ptr002
ptr003
ptr004
ptr005
ptr007
ptr008
ptr009
ptr010
ptr011
ptr014
ptr015
PDB code
Ê)
Res. (A
3238
Transcript
Coord.
/
/
1TN2
6TNA
/
1TRA
2TRA
3TRA
4TRA
4TNA
1YFG
yeast
yeast
yeast
yeast
yeast
yeast
yeast
yeast
yeast
yeast
yeast
tRNAPhe
tRNAPhe
tRNAPhe
tRNAPhe
tRNAAsp
tRNAPhe
tRNAAsp
tRNAAsp
tRNAPhe
tRNAPhe
tRNAMet
i
2.5
3.0
3.0
2.7
3.0
3.0
3.0
3.0
3.0
2.5
3.0
CmA
CmA
CmA
CmA
C
CmA
C
C
CmA
CmA
C A
N
N
N
N
N
N
N
N
N
N
N
N
N
Y
Y
Y
Y
Y
Y
Y
Y
Y
1B23
1TTT
E. coli tRNACys
yeast tRNAPhe
2.6
2.7
A
CmA
N
N
Y
Y
/
/
1QTQ
1GSG
1GTS
1GTR
1SER
1ASY
T. thermo. tRNAPro(GGG)
T. thermo. tRNAPro(CGG)
E. coli tRNAGln
E. coli tRNAGln
E. coli tRNAGln
E. coli tRNAGln
T. thermo. tRNASer
yeast tRNAPhe
T. thermo. tRNASer
yeast tRNAAsp
E. coli tRNAGln
E. coli tRNAGln
E. coli tRNAGln
T. thermo. tRNALys
T. thermo. tRNALys
3.6
3.5
2.2
2.8
2.8
2.5
2.9
2.9
2.7
3.0
2.6
2.7
3.0
2.7
2.9
UU
UA
UU
Um
UU
UU
C A
C
C A
C
UU
UU
UU
C A
C A
N
N
Y
N
Y
Y
N
N
N
N
Y
Y
Y
N
Y
N
N
Y
Nb
Y
Y
Yc
Y
N
Y
Y
Y
Y
N
N
1ASZ
1QRS
1QRT
1QRU
/
/
a
Crystal structure references are given next: (trna01, Quigley et al. (1978); trna02, trna03, Brown et al. (1985); trna04, Sussman et al.
(1978); trna05, Comarmond et al. (1986); trna06, Westhof & Sundaralingam (1986); trana07, trna08, trna09, Westhof et al. (1988);
trna10, Hingerty et al. (1978); trna12, Basavappa & Sigler (1991); pr0004, Nissen et al. (1999); ptr012, Nissen et al. (1995); pte001,
pte002, Cusack et al. (1998); pte003, Rath et al. (1998); ptr001, Rould et al. (1989), ptr002, Perona et al. (1993); ptr003, Rould et al.
(1991); ptr004, Biou et al. (1994); ptr005, Ruff et al. (1991); ptr007, Cusack et al. (1996a,b); ptr008, Cavarelli et al. (1994); ptr009, ptr010,
ptr011, Arnez & Steitz (1996); ptr014, ptr015, Cusack et al. (1996a,b).
b
Only backbone coordinates have been deposited in the NDB.
c
The coordinates for the atoms of the anticodon hairpin could not be derived from the crystal data.
is clearly distinct from that observed in the crystal
structures of uncomplexed tRNAs for C A, U A,
or U C pairs. The stabilization of such base-pairs
involves a network of water molecules, some linking one base to the other and some linking a base
to its backbone (see Figure 3 by Rould et al., 1991).
An additional contact between Asn370 and U38 is
present in these structures. The Um32 38 basepair of the E. coli tRNAGln/RS complex, which
includes all tRNA modi®ed residues, displays a
geometry identical with that observed for the
tRNA transcripts (Rould et al., 1991).
Other 32 38 pairs. In the structures of the two
complexes formed between Thermus thermophilus
tRNALys synthetase and transcripts of E. coli and
T. thermophilus tRNALys, the occurrence of a nonstandard C32 A38 base-pair has been reported
(Cusack et al., 1996a; S. Cusack, personal communication). Beside contacts made by A38 with the protein, a N3(C32) . . . N6(A38) is formed. This contact
is different from those observed in the structures of
uncomplexed tRNAPhe. Furthermore, in the struc-
tures of the two complexes of T. thermophilus
tRNAPro synthetase with two tRNAPro isoacceptors,
Ê resolution, the occurrence of a
determined at 3.5 A
non-standard U32 U38 and of a Watson-Crick
U32 A38 base-pair is mentioned (Cusack et al.,
1998; S. Cusack, personal communication).
Phylogenetic analysis
tDNA gene sequences. In order to detect any
sequence preferences for base-pairs at position
32 38, the tRNA gene database which contains up
to 2726 tDNA sequences (Sprinzl et al., 1998) was
analyzed. First, there is a very large proportion of
pyrimidines at position 32 (98 %, see Table 2) and
a signi®cant number of purines at position 38
(71 %). Secondly, sequences which could form
C G (six occurrences), G C (no occurrence) or A U
(ten occurrences) Watson-Crick pairs are very rare.
Nevertheless, there is a large proportion of
U32 A38 pairs (18 %). Note that the number of
U A pairs might be slightly lower since it has been
Singly and Bifurcated Hydrogen-bonded Base-pairs
Ê
Figure 3. The C32 A38 pair observed in the 3.0 A
(Basavappa & Sigler,
crystal structure of yeast tRNAMet
i
1991). This pair displays a unique conformation which
cannot be easily compared to the base pair arrangements shown in Figure 2. For comparison, a C G Watson-Crick pair and a C32 A38 pair extracted from yeast
tRNAPhe (see Figure 2) are also represented. The glycosidic bonds of 32 have been superimposed.
shown that U ! C editing can occur at position 32
(Beier et al., 1992).
The preferred base-pair sequence is C A
(48 %), which is followed by the T A (18 %),
T T (11 %), C C (8 %) and T C (8 %) pairs.
Most of these pairs, with the exception of the T A
pair, cannot form Watson-Crick arrangements.
Note that these proportions do not vary signi®cantly if one considers the two following subsets:
subset 1 (932 sequences of tDNA elongators) and
subset 2 (352 sequences of eukaryotic tDNA elongators) of the whole 2726 tDNA sequences.
471
tRNA sequences including modified bases. The
tRNA sequence database is approximately ®ve
times smaller (546 sequences) than the tDNA database (2726 sequences). Despite this reduced number of sequences, useful information can be
gathered due to the presence of modi®ed nucleotides (Table 3). It appears that, at least for C A or
U C pairs, only modi®ed bases compatible with
the formation of the non-canonical interaction
shown in Figure 2 are reported. For example, the
occurrence of C32 A38 pairs is close to 42 %; that
of C A pairs close to 11 % (a methylation of the
20 -hydroxyl group does not prevent the formation
of a bifurcated hydrogen-bonded pair as shown in
yeast tRNAPhe); that of m3C‡ close to 4 % (the
methylation of the N3 atom of a cytosine base is
compatible with the C A arrangement represented
Figure 2); and that of a s2C A pair close to 1 % (a
sulfur atom replacing the O2 atom of a cytosine
base could also form a S2(s2C32) . . . N6(A38) bifurcated hydrogen bond). Thus, 59 % of C A pairs are
observed at position 32 38. The majority of these
modi®ed C A pairs can adopt the conformation
shown in Figure 2. Similarly, all the modi®ed U U
pairs reported in Table 3 are compatible with the
Watson-Crick like arrangement shown in Figure 4.
Is the sequence at position 32 38 specific for a
given tRNA type? In order to detect a possible correlation between the sequence of the 32 38 pair
and either the amino acid type of the tRNA or the
phylogenetic group in which they occur, the family
of tDNAs containing a T32 T38 pair was analyzed
(Sprinzl et al., 1998). The T T pairs are found in
tDNAs coding for 14 different amino acid residues,
in eukaryotic, eubacterial, mitochondrial, and
chloroplastic tRNAs (in archaea, only one occurrence is reported). Further, no correlation between
the ``base type'' at positions 37 and 38 could be
noted (146 A37 T38 and 140 G37 T38 sequences).
As in the case of tDNAs containing a T32 T38 pair,
Figure 4. The 32 38 base-pairs observed in crystal structures of tRNAs complexed with synthetases (see Table 1).
The C base-pairs are shown on the left and U U base-pairs are shown on the right. For comparative purposes, a
regular U A Watson-Crick pair as well as the average C pair of free tRNAs shown Figure 1 are drawn. The glycosidic bonds of 32 have been superimposed.
472
Singly and Bifurcated Hydrogen-bonded Base-pairs
Table 2. Percentages of occurrence of the four A, G, C and T bases at the non-canonical 32 38 base-paired position
for all 2726 tDNAs contained in the tRNA database (Sprinzl et al., 1998), a subset of 932 elongator tDNAs, and a subset of 352 eukaryotic elongator tDNAs (percentages larger than 5 % are idicated in bold)
Base 32
A
G
A
G
C
T
Total:
R/Ya
1
0
48
18
66
69
0
0
3
3
Base 32
A
G
A
G
C
T
Total:
R/Ya
1
54
13
68
69
0
1
1
2
Base 32
A
G
A
G
C
T
Total:
R/Ya
1
58
2
89
61
0
0
0
1
a
C
Base 38
All tDNAs (2726)
T
0
0
0
1
8
1
8
11
16
13
30
All tDNA elongators (932)
C
T
0
0
12
1
7
10
18
11
30
All tDNA eukaryotic elongators (352)
C
T
14
8
22
39
2
15
17
Tot.
2
1
56
39
100
Tot.
1
68
30
100
Tot.
1
74
25
100
R/Ya
3
97
100
R/Ya
1
99
100
R/Ya
1
99
100
Indicates the percentages of purine (R) and pyrimidine (Y) bases at position 32 and 38.
no apparent correlation exists for tDNAs displaying other types of 32 38 pairs.
Model structures for the 32 38 interaction in tRNAs
C A, U A, U C, C C and U U base-pairs. From
the preceding structural and phylogenetic data, it
is possible to propose models for base-pairs located
at position 32 38 of tRNA anticodon hairpins
(Figure 5). As the vast majority of 32 38 pairs are
of the C A type, the arrangement observed in the
crystal structures of yeast tRNAPhe (see Figure 2)
seems the most plausible (family I). The same con®guration can be adopted by U A pairs which
may be linked by a bifurcated rather than by two
regular hydrogen bonds as in Watson-Crick pairs
(Figure 5). As shown before, the much rarer U C
and C C pairs could adopt conformations isosteric
to C A and U A pairs. Interestingly, the shallow
groove patterns of C A and U A pairs are similar,
as are the shallow groove patterns of U C and
C C pairs. Note that the U C pair shown in
Figure 5 is different from the U C pair observed in
the crystal structure of several RNA oligomers
(Holbrook et al., 1991; Cruse et al., 1994; Tanaka
et al., 1999) where the two bases are linked by a
O4(U) . . . N4(C) instead of an O2(U) . . . N4(C)
hydrogen bond with a water molecule bridging the
two N3 sites of both pyrimidines.
However U U pairs, which account for approximately 7 % of the total number of pairs, cannot
adopt a conformation isosteric to that observed for
C A pairs of family I. The arrangement shown in
Figure 5 is the one observed in the crystal struc-
tures of the wild-type and transcripts of E. coli
tRNAGln complexed with its cognate synthetase
(see Figure 4). It is proposed to be a representative
of family II. tRNAs with a U U pair at position
32 38 may also belong to a separate family of
tRNAs with speci®c and unknown features.
Rare 32 38 base-pairs. Besides the pairs in
families I and II shown in Figure 5, several rare
base oppositions are found at positions 32 38.
They account for 7 % of the total number of tDNA
(Table 2) and tRNA (Table 3) sequences and are
gathered in family III. Table 2 indicates that only
G C pairs are completely avoided in the 2726
sequences reported in the tRNA database (Sprinzl
et al., 1998). For several of these pairs, pairing
arrangements approximately isosteric to family I or
to family II can be suggested (Figure 6). In the following, only cis pairing types will be considered.
A A (1.1 %, see Table 2), GU (0.6 %), and G A
(0.1 %) pairs are roughly isosteric to C A pairs
(family I of Figure 5), while U G (2.7 %), C U
(1.3 %), and A U (0.4 %) pairs are approximately
isosteric to U U pairs (family II shown in Figure 5).
For the ®rst family, the A13 A22 pair, found in the
crystal structure of E. coli tRNAGln (pte003), is close
to the bifurcated C A pair. For G U pairs, no
structures close to that of the C A pair have been
observed and, thus, the G U pair shown in Figure 6
is suggested. The crystal structure of the 5 S rRNA
loop E (ur1064; Correll et al., 1997) contains a
sheared G98 A78 pair superposable to the singly
hydrogen bonded CA pair. For the second family,
the GU and A U pairs shown in Figure 6 have
473
Singly and Bifurcated Hydrogen-bonded Base-pairs
Table 3. Percentages of occurrence of the four A, G, C and U bases at the non-canonical 32 38 base-paired position
for all 546 tRNAs contained in the tRNA database (Sprinzl et al., 1998), a subset of 367 elongator tRNAs, and a subset
of 191 eukaryotic elongator tRNAs
Base 32
A
G
C
A
G
C
Cm
m3C‡
s2C
U
Um
m
Total:
R/Ya
1
0
42
11
4
1
5
1
3
68
70
0
2
2
7
0
4
5
0
16
30
Base 32
A
G
C
A
G
C
Cm
m3C‡
s2C
U
Um
m
Total:
R/Ya
35
14
5
1
7
1
2
66
68
1
1
5
2
7
0
4
1
16
32
Base 32
A
G
C
A
G
C
Cm
m3C‡
s2C
U
Um
m
Total:
R/Ya
27
23
10
2
64
64
-
5
1
1
9
1
18
36
a
Base 38
All tRNAs (546)
m5C
U
1
1
0
2
0
1
0
2
0
0
4
All tRNA elongators (367)
m5C
U
2
1
1
3
1
1
0
3
0
4
All tRNA eukaryotic elongators (191)
m5C
U
3
2
1
6
0
1
0
1
Tot.
0
2
2
1
1
7
1
0
53
12
4
1
16
2
2
0
100
Tot.
0
2
3
2
1
8
1
0
47
16
5
1
17
3
8
1
100
Tot.
1
4
2
2
2
10
0
1
40
26
10
4
3
13
1
100
R/Ya
1
99
100
R/Ya
1
100
R/Ya
99
100
Indicates the percentages of purine (R) and pyrimidine (Y) bases at position 32 and 38.
been modeled in order to be roughly isosteric to
the U U pair while interacting only through a
single hydrogen bond. The C U pair is extracted
from the same crystal structure of an RNA duplex
(ar0005; Tanaka et al., 1999). Note that NMR data
obtained for E. coli tRNAPhe revealed signals
associated to a rare A32 38 interaction (Hyde &
Reid, 1985a,b). For all these base-pairs, slight
changes in the positions of the bases may
``improve'' their isosteric character. For A G
(0.3 %), A C (0.2 %), CG (0.2 %), G C (0.04 %),
and G C (0 %) oppositions, no arrangement close
to either the C A or the U U pair shown in
Figure 5 can be proposed. Yet, for these rare basepairs, the possibility of base editing (Price & Gray,
1998) and the probable existence of a noise level in
the tRNA sequence database leading to a small
number of incorrect sequences should be taken
into account. Therefore, the sequences and model
structures of family III should be considered with
caution.
Discussion
A conserved non-canonical 32 38 base-pair in the
anticodon loop of tRNAs
The preceding structural and phylogenetic data
lead to the conclusion that particular types of pairing occur systematically between the ®rst (32) and
last (38) residues in the anticodon loop of tRNAs.
The largest family of base-pairs (family I) comprises the C A pair, where the two bases are linked
by a bifurcated hydrogen bond, and the isosteric
U A, U C and C C pairs (Figure 5). This category
represents 86 % of the total number of sequences.
A second family of base-pairs (family II) comprises
the U U pairs (7 %), which are not isosteric to the
474
Singly and Bifurcated Hydrogen-bonded Base-pairs
Figure 5. The possible single
hydrogen-bonded arrangements of
32 38 base-pairs in anticodon loop
of tRNAs organized in three
families. In family I, the structure
of the U A pair is inferred from
that of the C A pair extracted from
yeast tRNAPhe, and the structure of
the C C pair is inferred from that
of the U C pair extracted from
yeast tRNAAsp (see Figure 2). In
family II, the structure of the U U
pair is extracted from the E. coli
tRNAGln/RS complex (see Figure 4).
Family III gathers the rare oppositions. The glycosidic bonds of base
32 are similarly oriented in all the
base-pair oppositions.
pairs of family I. Finally, a third family (family III)
comprises a set of rare base-pairs (7 %) for which
few structural data are available (Figure 6), but
which could be roughly distributed among the ®rst
two families.
Importance of bifurcated hydrogen bonds
Up to now, pairs where the bases are linked by
a single or a bifurcated hydrogen-bonding interaction have encountered limited interest, probably
because of their apparent lability. Yet, such pairs
were observed in the crystal structures of yeast
tRNAPhe (Quigley & Rich, 1976) and yeast tRNAAsp
(Westhof et al., 1985) at position 32 38 (where two
chemical groups are involved) and at position
G18 55 (where three chemical groups are
involved). Furthermore, subsequent molecular
dynamics simulations have shown that they are
stable over nanosecond time scales in yeast
tRNAAsp (Auf®nger & Westhof, 1996; Auf®nger
et al., 1999). Since the resolution of the ®rst tRNA
crystal structures, the bestiary of non-canonical
base-pairs including bifurcated hydrogen bonds
has been extended and comprises, among others,
G G and G U pairs (Correll et al., 1997). Thus, the
consideration of singly or bifurcated hydrogen-
bonded pairs should improve the structures resulting from crystallographic, NMR, and modeling
re®nement processes. The fact that such pairs are
additionally stabilized by water-mediated interactions should also be taken into account (Rould
et al., 1991; Auf®nger & Westhof, 1998c).
Why a conserved pyrimidine at position 32
in tRNAs?
The non-canonical C32 A38 base-pair is characterized by a Watson-Crick-like distance between
the C10 atoms and an asymmetric disposition of
the angles at the C10 atoms (Figure 2). The latter
geometrical characteristic leads to a pronounced
non-isostericity upon reversal of the base-pair.
Thus, C A pairs are not isosteric to A C pairs
(Figure 7), U A pairs are not isosteric to A U
pairs, and U C pairs are not isosteric to C U pairs.
The C C and U U pairs shown in Figure 5 are
obviously self-isosteric. Thus, the anticodon loop
must begin with a pyrimidine at position 32 in
order to adopt its characteristic functional shape.
These structural considerations are in agreement
with phylogenetic data indicating that, in tRNAs, a
pyrimidine is present in 98 % of the sequences at
position 32 (Tables 2 and 3).
Singly and Bifurcated Hydrogen-bonded Base-pairs
475
Figure 6. Possible arrangements
of the rare 32 38 oppositions gathered in family III. Only those
arrangements which can approximately be consisdered as isosteric
to the base-pairs of families I and II
have been represented. The structure of the C A pair (family I),
extracted from yeast tRNAPhe, is
shown in blue and the structure of
the U U pair (family II), extracted
from the E. coli tRNAGln/RS complex, is shown in red (see Figure 5).
The glycosidic bonds of residues 32
are similarly oriented in all the
base-pair oppositions.
Is isostericity always required?
As demonstrated by the Watson-Crick pairs, isostericity is a central concept in nucleic acid structure. From crystal and phylogenetic data gathered
from a set of sequences of the 5 S rRNA loop E,
isostericity has also been proposed for several noncanonical base-pairs (Leontis & Westhof, 1998a,b,
1999b). In the present study, a certain number of
isosteric arrangements have been observed at position 32 38. Such isosteric pairs may result in a
similar structure of the anticodon loop for free
tRNAs. However, the occurrence of a non-negligible number of U U and related pairs (7 %) which
are not isosteric to C A pairs indicates that,
depending on the context, isosteric base-pairs may
not be systematically required. Thus, such sets of
non-isosteric base-pairs may be characteristic of
different tRNA families, for which adjustments for
interactions with proteins, other RNA motifs or
divalent ions are required. Signi®cantly, when
base-pairs are not strictly isosteric to the C A or
U U pairs, their percentage of occurrence drops
considerably (see Figure 6).
Several biochemical studies have evaluated the
effects of changing the 32 38 base-pair type. Yarus
and co-workers (1986b) have studied the suppression ef®ciency of a large number of mutants of the
476
Figure 7. Bifurcated C A pairs are not isosteric to
A C pairs as shown by the large distance separating the
glycosidic bonds. Thus, a A C pair cannot replace a
C A pair at position 32 38, indicating why a pyrimidine
at position 32 is required in tRNAs. The glycosidic
bonds of 32 have been superimposed.
amber suppressor Su7 and have shown that no
mutation at position 32 or 38 was neutral. For
example, Um and Um C pairs at position 32 38
were found to be less ef®cient than Um A or
Cm A pairs (Yarus et al., 1986a). Interestingly, the
measured suppression ef®ciency is Um '
Um C < Um A < Cm A and, thus, follows the proportion of 32 38 pairs derived from phylogenetic
analysis (Figure 5). The substitution of the Cm32
nucleotide by any purine residue reduces the suppression ef®ciency more than tenfold (Smith &
Yarus, 1989) in agreement with the fact that purines are strongly disfavored at position 32.
Mutations of base 38 show less dramatic effects
indicating that a larger mutational variability is
allowed at position 38 than at position 32 (see
Tables 2 and 3). In transcripts of E. coli tRNAGly,
with a U32 A38 pair the anticodon discriminates
the glycine codons according to the wobble rules,
but with a C A pair it loses its discriminating
power (Lustig et al., 1993). In transcripts of Mycoplasma myciodes tRNAGly, the substitution of the
wild-type C32 A38) by a U A pair (Claesson et al.,
1995) resulted in similar effects (but the transformation was the reverse of that performed in E. coli
tRNAGly). Since in both cases an isosteric substitution was performed (C A$U A, see Figure 5),
these data indicate that the nature of the contacts
between the tRNA and the ribosome may involve
the 32 38 base-pair and that such a substitution
affects the recognition pattern of the anticodon
hairpin more than its three-dimensional fold. It has
also been demonstrated that substitution of
U32 A38 by a C A pair in M. myciodes tRNAGly
results in a substantial increase in its frameshifting
ef®ciency (O'Connor, 1998). In E. coli tRNAGln transcripts, substitution of the U32 C38 opposition by
a C C or CA pair resulted in an increase of both
aminoacylation and ribosome performance, but
enhanced the latter function to a greater extent
(McClain et al., 1998). It is possible that the replace-
Singly and Bifurcated Hydrogen-bonded Base-pairs
ment of a rare U32 U38 by a non-isosteric C A
pair has as a consequence the formation of a more
``anticodon'' like hairpin. Besides the isostericity of
several of these non-canonical base-pair types, each
of these base-pairs may interact speci®cally with
proteins and ribosomal elements (see below). Isostericity, thus, is a generally valid concept indicating that it is possible to substitute, with minor
structural changes, base-pairs displaying different
shallow and deep groove recognition patterns.
However, each of these isosteric base-pairs has
speci®c functional roles, experimentally characterized by different translational ef®ciency. Furthermore,
in
some
occurrences
non-isosteric
substitutions have been observed like the U U pair
of the rare base-pairs found in Tables 2 and 3. For
example, in E. coli, a minor tRNAAla isoacceptor
differs from the other isoacceptors in possessing a
rare purine instead of a pyrimidine at position 32.
However, the structure and functional role of such
base-pairs is not yet known.
Importance of modified nucleotides at positions 32
and 38
Modi®ed nucleotides are found in many RNA
molecules and especially in tRNAs where they
account for 12 % of the total number of residues
(Grosjean & Benne, 1998). A large number of modi®ed nucleotides are located at the stem/loop junction of the anticodon hairpin in tRNAs (Auf®nger
& Westhof, 1998b). It has been proposed, in agreement with a large body of experimental evidence
and molecular dynamics simulations, that some of
these modi®cations, and especially pseudouridylation (Auf®nger & Westhof, 1998a), are required to
stabilize the three-dimensional structure of the
functionally important anticodon loop. At position
32, the stabilization mechanism involves the formation of a pseudouridine/water complex where
the water molecule mediates base to backbone
interactions, and thus helps to increase the stability
of the 3238 interaction pairs (Davis & Poulter,
1991; Arnez & Steitz, 1994; Auf®nger & Westhof,
1997, 1998a). The same nucleotide/water complexes can occur after pseudouridylation at positions 31, 38, and 39. Such modi®cations may thus
help to strengthen the structure of the tRNA anticodon hairpin in the ribosome and prevent excessive unfolding of the anticodon loop when
interacting with synthetase proteins.
Indeed, as noted above, in a few of the crystal
structures of tRNAs complexed with their cognate
synthetase, the 32 38 interaction is systematically
maintained even after partial unfolding of the
loop, and is stabilized by water mediated interactions involving modi®ed nucleotides (Roud et al.,
1989). Thus, one possible role of the 32 38 pair and
of associated modi®ed nucleotides is to limit the
``unfolding'' of the loop to the ®ve bases, 33 to 37.
The structure of base-pairs 31 39, 32 38, and the
stacking of U33 below base 32 being at least partially preserved, as shown in Figure 8(a) for the
Singly and Bifurcated Hydrogen-bonded Base-pairs
477
Figure 8. Turns in RNA motifs involving non-canonical base-pairs. (a) Views extracted from crystal structures of
yeast tRNAPhe (trna09) and tRNAAsp (trna07) as well as from the crystal structures of the complex of E. coli tRNAGln
with its cognate synthetase (pte003). For comparison, a fragment of a conventional Watson-Crick helix is drawn. The
position of the glycosidic bond, for a hypothetical 32-38 Watson-Crick base-pair in the tRNAPhe anticodon loop is
marked by a green arrow. (b) Fragments of the crystal structures of the TC loop of yeast tRNAPhe (trna09) and of
the GNRA tetraloop of the hammerhead ribozyme (uhx026). (c) Fragments of the crystal structure of the leadzyme
(ur0001) and the hammerhead ribozyme (uhx026) showing the C23 A45 and the C3 C17 base-pairs. In all these
Figures, the glycosidic bonds of 32 have been superimposed.
E. coli tRNAGln/RS complex, a subsequent ``refolding'' to the ribosomal active form, after decomplexation with the synthetase, may be facilitated.
Besides a structural role, the question of a possible
functional role of the non-canonical 32 38 base-
pair can be raised. It has been noted that in some
instances bases 32 and 38 can interact with the cognate synthetase and, thus, act as structural determinants in the recognition process. Biochemical data
fully support this view (for a review, see GiegeÂ
478
et al., 1998). Nevertheless, the largest number of
tRNA/synthetases do not use these bases as structural determinants and, thus, their structural role
seems to prevail over their direct implications in a
recognition process with a synthetase.
Is there a relationship between the 32 38
interaction and the U-turn?
The active three-dimensional structure of tRNA
anticodon loops is characterized by a U-turn
associated with the conserved base U33 and the Aform helical stack of loop residues 34 to 38 in continuity with the helical stem residues 39 to 44 (as
shown in Figure 8(a) for yeast tRNAAsp and yeast
tRNAPhe). While both residues 32 and 38 are
located in a helical track, the rotation of the glycosidic bonds on the side of residue 32 is much more
pronounced than in regular helical A-form stacks
(Figure 8(a)). Furthermore, between residues 34
and 38, the rotation angles are rather regular and
not far from the helical case. But, clearly, residue
38 has not rotated suf®ciently to form a WatsonCrick pair with residue 32. Hence, a substitution of
the non-canonical 3238 pair by a regular WatsonCrick pair would hinder the formation of the Uturn and the regularity of the loop. This explains
why no G C and very few C G pairs are reported
at these positions (Table 2). Therefore, the isosteric
C A, U A, U C, and C C pairs at position 32 38,
and possibly also U U pairs, appear as a crucial
transition element linking the stem to the loop of
the anticodon hairpin. Residues 32 and 38 should
Singly and Bifurcated Hydrogen-bonded Base-pairs
therefore be considered, along with the conserved
base U33 and likely a conserved purine at position
37, as a signature for tRNA anticodon loops
(Figure 9). The view that only speci®c base combinations occur at position 32 38 is in agreement
with the extended anticodon concept, indicating
that the coding performance of the triplet anticodon is enhanced by the appropriate loop and stem
sequence (Yarus, 1982; Yarus et al., 1986b; Yarus &
Smith, 1995).
In the seven base T-loops of tRNAs, a non-canonical two hydrogen-bonded trans Hoogsteen
T54 A58 pair precedes the U-turn. This base-pair
appears to play a similar role in T-loops as that
played by the 3238 pair in anticodon loops
(Figure 8(b)). Note that a T54 A58 pair is reported
in 78 % of tRNA gene sequences (Sprinzl et al.,
1998). Similarly, in the recent crystal structure of
the L11 binding region in LSU RNA, a trans Watson-Crick A U pair ¯anks the U-turn of a ®ve
nucleotide loop (Conn et al., 1999). GNRA tetraloops are known to form U-turns as well (Westhof
et al., 1989; Jucker & Pardi, 1995). In such motifs,
G A sheared pairs, which involve two hydrogen
bonds, are used to close the loop (Figure 8(b)).
Thus, sheared GA pairs, almost isosteric to C A
pairs (Figure 6), are used in short loops, presumably because of their intrinsic higher stability and
because they induce a sharper turn, while the
apparently more labile singly hydrogen bonded
base-pairs found in anticodon motifs are used in
longer loops.
Figure 9. tRNA anticodon loop signature showing the structure/sequence relationship at the 32 38 base-pair level.
The distribution of the four A, G, C, and U nucleotides is extracted from the set of 2726 tDNA gene sequences present in the tRNA database (Auf®nger & Westhof, 1998b; Sprinzl et al., 1998).
Singly and Bifurcated Hydrogen-bonded Base-pairs
Other examples of interfacial non-canonical basepairs
In hairpins. A recent NMR structure of the
tRNALys,3 anticodon hairpin obtained at low pH
revealed the formation of a C32 A‡38 base-pair
(Durant & Davis, 1999). The pKa of the adenine
base was estimated to be close to 6 for A38. Likewise, evidence for the formation of a non-canonical
bifurcated U C pair have been obtained by NMR
for the central hairpin of the HDV ribozyme containing a seven nucleotide loop (Kolk et al., 1997).
In this structure, the sharp turn which is associated
with the change of direction of the backbone is
shifted in the 30 direction by one nucleotide, and
characterized as a reversed U-turn. Interestingly,
an independent NMR determination of the structure of the same hepatitis delta virus hairpin proposed a different hydrogen-bonding scheme for
the U C pair of the nucleotides on the 50 and 30 ends of the loop (Lynch & Tinoco, 1998). In this latter structure, the cytosine amino group interacts
with O4 instead of O2 of U resulting in a water
inserted U C pair similar to that observed in the
crystal structure of RNA oligonucleotides
(Holbrook et al., 1991; Cruse et al., 1994; Tanaka
et al., 1999). In any case, in each example of structurally determined hairpins containing loops with
seven bases, a non-canonical base-pair is observed
at the junction between the stem and the loop.
Several examples of loops closed by non-canonical base-pairs can be found in crystal structures
(Pley et al., 1994; Scott et al., 1995; Cate et al., 1996;
Correll et al., 1998; Perbandt et al., 1998). A hairpin
closing wobble U U pair is found in the crystal
structure of the complex formed between a fragment of the U2 snRNA and a spliceosomal protein
(Price et al., 1998). Interestingly, in the crystal structure of the RNA-binding domain of the U1A
spliceosomal protein complexed with an RNA hairpin, no loop closing C A base-pair is observed
(Oubridge et al., 1994) indicating that, in this other
structural context, a possible non-canonical basepair is disrupted after complex formation.
In ribozymes. Interestingly, the crystal structure of
the leadzyme (Wedekind & McKay, 1999), a
C23 A45 pair, identical with the CA pair
observed in tRNAs, is present (Figure 8(c)). It has
been proposed, on the basis of crystallographic
data, that this non-canonical pair is involved in the
binding of a Pb2‡ in the RNA deep groove which
may be important for catalysis. In one motif of the
crystal structure of a Ba2‡ occupies this site (in
tRNA crystal structures, a related Mg2‡ binding
site is located in the deep groove of the 32 38
pair). Interestingly, the formation of a C A pair is
not mandatory for catalysis, since substitution
experiments have shown that the ribozyme is still
active after replacement of the adenine residue by
an abasic ribose (Chartrand et al., 1997).
In the hammerhead ribozyme, a U A non-canonical base-pair involving a single hydrogen bond
479
closes stem III (Pley et al., 1994; Scott et al., 1995).
This hydrogen bond, which is not a bifurcated one
as in tRNAs, is well maintained during an MD
simulation of the ribozyme (Hermann et al., 1998).
Also, and most interestingly, the base preceding
the U-turn in the hammerhead ribozyme is normally a C and it forms a wobble base-pair with the
cleavable C residue (C3 C17: see Figure 8(c)). This
base-pairs differ from the singly hydrogen-bonded
base-pair shown in Figure 5. Nevertheless, it seems
essential for the ribozyme as a substitution of this
C C pair by a G C pair results in a considerable
loss of activity (McKay, 1996).
Could the 32 38 base-pair participate in RNA/RNA
recognition in the ribosome?
As emphasized above, several biochemical studies have demonstrated that modi®cations of the
32 38 base-pair can alter both aminoacylation and
ribosome performance suggesting that this basepair is involved in direct or indirect recognition
features with their cognate synthetase and with the
ribosome (Lustig et al., 1993; Claesson et al., 1995;
Yarus & Smith, 1995; Giege et al., 1998; McClain
et al., 1998; O'Conner, 1998) or, as proposed by
Yarus and co-workers (Smith & Yarus, 1989; Yarus
& Smith, 1995), with a second tRNA inside the
ribosome. For RNA/RNA interactions within a
given organism, the Watson-Crick binding sites of
residue 38, located in the shallow groove of the
anticodon stem, are not blocked and could be
involved in the formation of a triple interaction
with a ribosomal residue (Figure 10). Thus, the
adenine base of C A and U A pairs and the cytosine base of U C and C C pairs could be involved
in Watson-Crick interactions with U or G residues,
respectively. Yet, other interaction types cannot be
excluded. For example, the formation of a trans
A38 U or C38 U pair could be proposed. This
interaction scheme presents the advantage of involving a uridine base in all four cases (Figure 10).
Further, a uridine base could also be placed in the
shallow groove of the U32 U38 pair. Interestingly,
GNRA tetraloops in which the Watson-Crick sites
of the guanine or the adenine residue of the closing
G A pair are available, are involved in RNA/RNA
recognition via the Watson-Crick sites of the A in
the shallow groove side of G C pairs in helices
(Michel & Westhof, 1990; Jaeger et al., 1994). Crystal structures have demonstrated that the G A pair
of a GNRA tetraloop can indeed form of a base triple with a helix (Pley et al., 1994).
However, RNA/RNA interactions could also
take place on the less accessible deep groove side,
especially at the border of a helix. Biochemical data
indicate that the substitution of a C A by an isosteric U A pair affects the discriminating ability of
tRNA transcripts (Lustig et al., 1993; Claesson et al.,
1995) and their frameshifting ef®ciency (O'Connor,
198). This seems only possible if recognition
phenomena involve the deep groove side which
presents different shapes. However, interaction
480
Singly and Bifurcated Hydrogen-bonded Base-pairs
Figure 10. Possible interaction of a ribosomal RNA residue with the shallow groove of the set of four isosteric
32 38 base-pairs in the largest Family. Left: Given the accessibility of the Watson-Crick sites of residue 38, formation
of Watson-Crick pairs involving either a U or a G residue can be proposed. Right: Base triples involving an uridine
residue in trans with respect to residue 38 are possible in all four cases.
patterns involving a third residue are less obvious.
Further, the deep groove side of the non-canonical
C A pair in yeast tRNAPhe ( C in yeast tRNAAsp)
has been described as forming an ion binding site
in tRNAs and in the leadzyme (Wedekind &
McKay, 1999). Several rare modi®cations were also
found to block the access to the deep groove side.
These modi®cations are m3C‡ at position 32 and
m5C at position 38. Note that m3C‡ has only been
detected at position 32 in tRNAs (Auf®nger &
Westhof, 1998b). In order to test the preceding
hypothesis, several biochemical assays can be proposed. For example, a methylation of the N1 site of
A38 or of the N3 site of C38 should block access to
the shallow groove, while a methylation of the N7
site of A38, the C5 site of C38, the N3 site of C32,
or the N3 site of U32 would at least partially hinder access to the deep groove. Interestingly, in the
recent crystal structure of a conserved ribosomal
RNA/protein domain, a U A pair adopts the conformation of the C A pair observed in yeast
tRNAPhe (Conn et al., 1999). Here, the hydroxyl
group of a distant adenine residue seems hydrogen
bonded to the uridine residue on the deep groove
Ê ). Although such a
side N3(U) . . . O20 (A) 2.6 A
hydrogen bond is not compatible with a methylation at the N3 site of the uridine base, it does
point to another potential binding site in the deep
groove. Furthermore, binding of ribosomal protein
residues cannot be excluded.
Conclusions
Biomolecular motifs with speci®c structure and
functions can be recognized by a set of sequence
characteristics constituting their molecular signatures. The knowledge of these signatures has many
implications such as (i) the ability to recognize
motifs with speci®c structure in sequence databases; and (ii) the possibility to derive a 3D model
from the sequence. Up to now, the signature of the
tRNA anticodon hairpins comprised the phylogenetically conserved U33 residue associated with a
conserved pyrimidine at position 32 and a conserved purine at position 37. A strong preference
for a purine at position 38 had also been noted
while an almost regular distribution of the four
bases is observed at the anticodon positions 35 to
36. At position 34 adenine bases are poorly represented (Grosjean et al., 1982; Auf®nger &
Westhof, 1998b). An analysis of available data led
to the re®nement of the signature of the tRNA
anticodon loop. tRNA anticodon loops are generally closed by a set of non-canonical isosteric basepairs involving a single inter-residue hydrogen
bond at position 32 38, which can be distributed in
three families. Family I (86 % of the sequences)
comprises the four C A, U A, U C, and CC pairs,
i.e. Y C/A, base-pairs (interestingly, the U A pair
is not of the Watson-Crick type). Family II (7 %)
gathers the set of U U base-pairs, not isosteric to
the preceding ones, also observed at position
32 38. Thus, sequences with U32 U38 pair may
comprise a tRNA family with distinct properties.
Family III (7 %) comprises a set of rare base oppositions for which only a limited number of approximately isosteric base-pairs could be tentatively
proposed. Besides the structural aspects, it is proposed that conserved patterns associated with the
set of the four isosteric 32 38 base-pairs of family I,
Singly and Bifurcated Hydrogen-bonded Base-pairs
could be involved in the formation of base triples
mediating tertiary interactions of the tRNA with a
ribosomal RNA residue.
Acknowledgements
The authors thank Stephen Cusack for providing coordinate sets for the tRNA anticodon hairpin of structures
pte002 and ptr014.
References
Arnez, J. G. & Steitz, T. A. (1994). Crystal structure of
unmodi®ed tRNAGln complexed with glutaminyltRNA synthetase and ATP suggests a possible role
for pseudo-uridines in stabilization of RNA structure. Biochemistry, 33, 7560-7567.
Arnez, J. G. & Steitz, T. A. (1996). Crystal structure of
three mysacylating mutants of Escherichia coli glutaminyl-tRNA synthetase complexed with tRNAGln
and ATP. Biochemistry, 35, 14725-14733.
Auf®nger, P. & Westhof, E. (1996). H-bond stability in
the tRNAAsp anticodon hairpin: 3 ns of multiple
molecular dynamics simulations. Biophys. J. 71, 940954.
Auf®nger, P. & Westhof, E. (1997). RNA hydration:
three nanoseconds of multiple molecular dynamics
simulations of the solvated tRNAAsp anticodon hairpin. J. Mol. Biol. 269, 326-341.
Auf®nger, P. & Westhof, E. (1998a). Effects of pseudouridylation on tRNA hydration and dynamics: a
theoretical approach. In Modi®cation and Editing of
RNA (Grosjean, H. & Benne, R., eds), pp. 103-112,
American Society for Microbiology, Washington,
DC.
Auf®nger, P. & Westhof, E. (1998b). Location and distribution of modi®ed nucleotides in tRNA. In Modi®cation and Editing of RNA (Grosjean, H. & Benne, R.,
eds),
pp.
569-576,
American
Society
for
Microbiology, Washington, DC.
Auf®nger, P. & Westhof, E. (1998c). RNA base pair
hydration. J. Biomol. Struct. Dynam. 16, 693-707.
Auf®nger, P., Louise-May, S. & Westhof, E. (1999). Molecular dynamics simulations of the solvated yeast
tRNAAsp. Biophys. J. 76, 50-64.
Ê crystal
Basavappa, R. & Sigler, P. B. (1991). The 3 A
structure of yeast initiator tRNA: functional implications in initiator/elongator discrimination. EMBO
J. 10, 3105-3111.
Beier, H., Lee, M. C., Sekiya, T., Kuchino, Y. &
Nishimura, S. (1992). Two nucleotides next to the
anticodon of cytoplasmic rat tRNA(Asp) are likely
generated by RNA editing. Nucl. Acids Res. 20,
2679-2683.
Berman, H. M., Olson, W. K., Beveridge, D. L.,
Westbrook, J., Gelbin, A., Demeny, T., Hsieh, S. H.
& Srinivasan, A. R. (1992). The nucleic acid database: a comprehensive relational database of threedimensional structures of nucleic acids. Biophys. J.
63, 751-759.
Biou, V., Yaremchuk, A., Tukalo, M. & Cusack, S.
Ê crystal structure of T. thermophilus
(1994). The 2.9 A
seryl-tRNA synthetase complexed with tRNASer.
Science, 263, 1404-1436.
Brown, R. S., Dewan, J. C. & Klug, A. (1985). Crystallographic and biochemical investigation of the lea-
481
d(II)-catalyzed hydrolisis of yeast phenylalanine
tRNA. Biochemistry, 24, 4785-4801.
Cate, J. H., Gooding, A. R., Podell, E., Zhou, K. H.,
Golden, B. L., Kundrot, C. E., Cech, T. R. &
Doudna, J. A. (1996). Crystal structure of a group I
ribozyme domain-Principles of RNA packing.
Science, 273, 1678-1685.
Cavarelli, J., Eriani, G., Rees, B., Ruff, M., Boeglin, M.,
Mitschler, A., Martin, F., Gangloff, J., Thierry, J.-C.
& Moras, D. (1994). The active site of yeast aspartyl-tRNA synthetase: structural and functional
aspects of the aminoacylation reaction. EMBO J. 13,
327-337.
Chartrand, P., Usman, N. & Cedergreen, R. (1997).
Effects of structural modi®cations on the activity of
the leadzyme. Biochemistry, 36, 3145-3150.
Claesson, C., Lustig, F., BoreÂn, T., Simonsson, C.,
Barciszewska, M. & Lagerkvist, U. (1995). Glycine
codon discrimination and the nucleotide in position
32 of the anticodon loop. J. Mol. Biol. 247, 191-196.
Comarmond, M. B., GiegeÂ, R., Thierry, J. C. & Moras, D.
(1986). Three-dimensional structure of yeast
tRNAAsp. I. Structure determination. Acta Crystallog.
sect. B, 42, 272-280.
Conn, G. L., Draper, D. E., Lattman, E. E. & Gittis, A. G.
(1999). Crystal structure of a conserved ribosomal
protein-RNA complex. Science, 284, 1171-1174.
Correll, C. C., Freeborn, B., Moore, P. B. & Steitz, T. A.
(1997). Metals, motifs and recognition in the crystal
structure of a 5S rRNA domain. Cell, 91, 705-712.
Correll, C. C., Munishkin, A., Chan, Y. L., Ren, Z.,
Wool, I. G. & Steitz, T. A. (1998). Crystal structure
of the ribosomal RNA domain essential for binding
elongation factors. Proc. Natl Acad. Sci. USA, 95,
13436-13441.
Cruse, W. B. T., Saludjian, P., Biala, E., Strazewski, P. &
PrangeÂ, T. (1994). Structure of a mispaired RNA
Ê resolution and implications
double helix at 1.6-A
for the prediction of RNA secondary structure. Proc.
Natl Acad. Sci. USA, 91, 4160-4164.
Cusack, S., Yaremchuk, A. & Tukalo, M. (1996a). The
crystal structure of T. thermophilus lysyl-tRNA
synthetase complexed with E. coli tRNALys and a
T. thermophilus tRNALys transcript: anticodon recognition and conformational changes upon binding of
a lysyl-adenylate analogue. EMBO J. 15, 6321-6334.
Cusack, S., Yaremchuk, A. & Tukalo, M. (1996b). The
crystal structure of the ternary complex of T. thermophilus seryl-tRNA synthetase with tRNASer and a
seryl-adenylate analogue reveals a conformational
switch in the active site. EMBO J. 15, 2834-2842.
Cusack, S., Yaremchuk, A., Krikliviy, I. & Tukalo, M.
(1998). tRNAPro anticodon recognition by Thermus
termophylus prolyl-tRNA synthetase. Structure, 6,
101-108.
Davis, D. R. & Poulter, C. D. (1991). 1H-15N NMR studies of escherichia coli tRNAPhe from hisT mutants:
a structural role for pseudouridine. Biochemistry, 30,
4223-4231.
Dirheimer, G., Keith, G., Dumas, P. & Westhof, E.
(1995). Primary, secondary, and tertiary structures
of tRNAs. In tRNA: Structure, Biosynthesis, and Function (SoÈll, D. & RajBhandary, U., eds), pp. 93-126,
American Society for Microbiology, Washington.
Durant, P. C. & Davis, D. R. (1999). Stabilization of the
anticodon stem-loop of tRNALys,3 by A‡ C basepair and by pseudouridine. J. Mol. Biol. 285, 115131.
482
GiegeÂ, R., Sissler, M. & Florentz, C. (1998). Universal
rules and idiosyncratic features in tRNA identity.
Nucl. Acids Res. 26, 5017-5035.
Grosjean, H. & Benne, R. (1998). Editors of Modi®cation
and Editing of RNA, American Society for Microbiology, Washington, DC.
Grosjean, H., Cedergreen, R. J. & McKay, W. (1982).
Structure in tRNA data. Biochimie, 64, 387-397.
Hermann, T., Auf®nger, P. & Westhof, E. (1998). Molecular dynamics investigations of the hammerhead
ribozyme RNA. Eur. J. Biophys. 27, 153-165.
Hingerty, B., Brown, R. S. & Jack, A. (1978). Further
re®nement of the structure of yeast tRNAPhe. J. Mol.
Biol. 124, 523-534.
Holbrook, S. R., Cheong, C., Tinoco, I. & Kim, S. H.
(1991). Crystal structure of an RNA double helix
incorporating a track of non-Watson-Crick base
pairs. Nature, 353, 579-581.
Hyde, E. I. & Reid, B. R. (1985a). Assignment of the
low-®eld 1H NMR spectrum of Escherichia coli
tRNAPhe using nuclear overhauser effects. Biochemistry, 24, 4307-4314.
Hyde, E. I. & Reid, B. R. (1985b). NMR studies of ion
binding to Escherichia coli tRNAPhe. Biochemistry, 24,
4315-4325.
Jaeger, L., Michel, F. & Westhof, E. (1994). Involvement
of a GNRA tetraloop in long-range RNA tertiary
interactions. J. Mol. Biol. 236, 1271-1276.
Jucker, F. M. & Pardi, A. (1995). GNRA tetraloops make
a U-turn. RNA, 1, 219-222.
Kolk, M. H., Heus, H. A. & Hilbers, C. W. (1997). The
structure of the isolated, central hairpin of the HDV
antigenomic ribozyme: novel structurl features and
similarity of the loop in the ribozyme and free in
solution. EMBO J. 16, 3685-3692.
Leontis, N. B. & Westhof, E. (1998a). The 5S rRNA loop
E: chemical probing and phylogenetic data versus
crystal structure. RNA, 4, 1134-1153.
Leontis, N. B. & Westhof, E. (1998b). A common motif
organizes the structure of multi-helix loops in 16S
and 23S ribosomal RNAs. J. Mol. Biol. 283, 571-583.
Leontis, N. B. & Westhof, E. (1999a). Conserved geometrical base pairing patterns in RNA. Quart. Rev.
Biophysics, in the press.
Leontis, N. B. & Westhof, E. (1999b). Recurrent RNA
motifs: analysis at the base pair level. In RNA Biochemistry and Biotechnology (Barciszewski, J. & Clark,
B. F. C., eds), Kluwer Academic Publishers, Boston.
Lustig, F., BoreÂn, T., Claesson, C., Simonsson, C.,
Barciszewska, M. & Lagerkvist, U. (1993). The
nucleotide at position 32 of the tRNA anticodon
loop determines ability of anticodon UCC to discriminate among glycine codons. Proc. Natl Acad.
Sci. USA, 90, 3343-3347.
Lynch, S. R. & Tinoco, I. (1998). The structure of the L3
loop from the hepatitis delta virus ribozyme: a syn
cytidine. Nucl. Acids Res. 26, 980-987.
McClain, W. H., Schneider, J., Bhattacharya, S. &
Gabriel, K. (1998). The importance of tRNA backbone-mediated interactions with synthetase for aminoacylation. Proc. Natl Acad. Sci. USA, 95, 460-465.
McKay, D. B. (1996). Structure and function of the hammerhead ribozyme: an un®nished story. RNA, 2,
395-403.
Michel, F. & Westhof, E. (1990). Modelling of the threedimensional architecture of group I catalytic introns
based on comparative sequence analysis. J. Mol.
Biol. 216, 585-610.
Singly and Bifurcated Hydrogen-bonded Base-pairs
Nissen, P., Kjeldgaard, M., Thirup, S., Polekhina, G.,
Reshetnikova, L., Clark, B. F. C. & Nyborg, J.
(1995). Crystal structure of the ternary complex of
Phe-tRNAPhe, EF-Tu and a GTP analog. Science, 270,
1464-1472.
Nissen, P., Thirup, S., Kjeldgaard, M. & Nyborg, J.
(1999). The crystal structure of Cys-tRNACys-EF-TiGDPNP reveals general and speci®c features in the
ternary complex and in tRNA. Structure, 7, 143-154.
O'Connor, M. (1998). tRNA imbalance promotes-1
frameshifting via near-cognate decoding. J. Mol.
Biol. 279, 727-736.
Oubridge, C., Nobutoshi, I., Evans, R. P., Teo, C. H. &
Ê resolNagai, K. (1994). Crystal structure at 1.92 A
ution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin.
Nature, 372, 432-438.
Perbandt, M., Nolte, A., Lorentz, S., Erdmann, V. A. &
Betzel, C. (1998). Crystal structure of domain E of
Thermus ¯avus rRNA: a helical RNA structure
including a hairpin loop. FEBS Letters, 429, 211-215.
Perona, J. J., Rould, M. A. & Steitz, T. A. (1993). Structural basis for transfer RNA aminoacylation by
Escherichia coli glutaminyl-tRNA synthetase. Biochemistry, 32, 8758-8771.
Pley, H. M., Flaherty, K. M. & McKay, D. B. (1994).
Three-dimensional structure of a hammerhead ribozyme. Nature, 372, 68-74.
Price, D. H. & Gay, M. W. (1998). Editing of tRNA. In
Modi®cation and Editing of RNA (Grosjean, H. &
Benne, R., eds), pp. 103-112, American Society for
Microbiology, Washington, DC.
Price, S. R., Evans, P. R. & Nagai, K. (1998). Crystal
structure of the spliceosomal U2B00 -U2A0 protein
complex bound to a fragment of U2 smal nuclear
RNA. Nature, 394, 645-650.
Quigley, G. J. & Rich, A. (1976). Structural domains of
transfer RNA molecules. Science, 194, 796-806.
Quigley, G. J., Teeter, M. M. & Rich, A. (1978). Structural analysis of spermine and magnesium ion binding to yeast phenylalanine transfer RNA. Proc. Natl
Acad. Sci. USA, 75, 64-68.
Rath, V. L., Silvian, L. F., Beijer, B., Sproat, B. S. &
Steitz, T. A. (1998). How glutaminyl-tRNA synthetase selects glutamine. Structure, 6, 439-449.
Rould, M. A., Perona, J. J., SoÈll, D. & Steitz, T. A. (1989).
Structure of E. coli glutaminyl-tRNA synthetase
Ê resolcomplexed with tRNA(Gln) and ATP at 2.8 A
ution. Science, 246, 1135-1142.
Rould, M. A., Perona, J. J. & Steitz, T. A. (1991). Structural basis of anticodon loop recognition by glutaminyl-tRNA synthetase. Nature, 352, 213-218.
Ruff, M., Krishnaswamy, S., Boeglin, M., Poterszman,
A., Mitschler, A., Podjarny, A., Rees, B., Thierry,
J. C. & Moras, D. (1991). Class I aminoacyl tRNA
synthetases: crystal structure of yeast aspartyltRNA synthetase complexed with tRNAAsp. Science,
25, 1682-1689.
Schweisguth, D. C. & Moore, P. B. (1997). On the conformation of the anticodon loops of initiator and
elongator methionine tRNAs. J. Mol. Biol. 267, 505519.
Scott, W. G., Finch, J. T. & Klug, A. (1995). The crystal
structure of an all-RNA hammerhead ribozyme: a
proposed mechanism for RNA catalytic cleavage.
Cell, 81, 991-1002.
Smith, D. & Yarus, M. (1989). tRNA-tRNA interactions
within cellular ribosomes. Proc. Natl Acad. Sci. USA,
86, 4397-4401.
Singly and Bifurcated Hydrogen-bonded Base-pairs
Sprinzl, M., Horn, C., Brown, M., Loudovitch, A. &
Steinberg, S. (1998). Compilation of tRNA sequences
and sequences of tRNA genes. Nucl. Acids Res. 26,
148-153.
Sussman, J. L., Holbrook, S. R., Warrant, R. W., Church,
G. M. & Kim, S.-H. (1978). Crystal structure of
yeast phanylalanine transfer RNA. I. Crystallographic re®nement. J. Mol. Biol. 123, 607-630.
Tanaka, Y., Fujii, S., Hiroaki, H., Sakata, T., Tanaka, T.,
Uesugi, S., Tomita, K. & Kyogoku, Y. (1999). A0 form RNA double helix in the single crystal structure of r(UGAGCUUCGGCUC). Nucl. Acids Res. 27,
949-955.
Tinoco, I. (1993). Structure of base pairs involving at
least two hydrogen bonds. In The RNA World
(Gesteland, R. F. & Atkins, J. F., eds), pp. 603-607,
Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY.
Wedekind, J. E. & McKay, D. B. (1999). Crystal structure
of a lead-dependent ribozyme revealing metal binding sites relevant to caalysis. Nature Struct. Biol. 6,
261-268.
Westhof, E. & Sundaralingam, M. (1986). Restrained
re®nement of the monoclinic form of yeast phenylalanine transfer RNA. Temperature factors and dyamics, coordinated waters, and base-pair propeller
twist angles. Biochemistry, 25, 4868-4878.
483
Westhof, E., Dumas, P. & Moras, D. (1985). Crystallographic re®nement of yeast aspartic acid transfer
RNA. J. Mol. Biol. 184, 119-145.
Westhof, E., Dumas, P. & Moras, D. (1988). Restrained
re®nement of two crystalline forms of yeast aspartic
acid and phenylalanine transfer RNA crystals. Acta
Crystallog. sect. A, 44, 112-123.
Westhof, E., Romby, P., Romaniuk, P., Ebel, J.-P.,
Ehresmann, C. & Ehresmann, B. (1989). Computer
modeling from solution data of spinach chloroplast
and of Xenopus laevis somatic and oocyte 5S rRNAs.
J. Mol. Biol. 207, 417-431.
Yarus, M. (1982). Translational ef®ciency of transfer
RNA's: uses of an extended anticodon. Science, 218,
646-652.
Yarus, M. & Smith, D. (1995). tRNA on the ribosome: a
waggle theory. In tRNA: Structure, Biosynthesis, and
Function (SoÈll, D. & RajBhandary, U., eds), pp. 443469,
American
Society
for
Microbiology,
Washington, DC.
Yarus, M., Cline, S., Raftery, L., Wier, P. & Bradley, D.
(1986a). The translational ef®ciency of tRNA is a
property of the anticodon arm. J. Biol. Chem. 261,
10496-10505.
Yarus, M., Cline, S. W., Wier, P., Breeden, L. &
Thompson, R. C. (1986b). Actions of the anticodon
arm in translation on the phenotypes of RNA
mutants. J. Mol. Biol. 192, 235-255.
Edited by D. E. Draper
(Received 7 June 1999; received in revised form 26 July 1999; accepted 30 July 1999)