The resonances of all the non-exchangeable protons (except 5`H

Volume 15 Number 14 1987
Nucleic Acids Research
Two-dimensional NMR investigation of a bent DNA fragment: assignment of the proton
resonances and preliminary structure analysis
Agustin Kintanar1, Rachel E.Klevit2 and Brian R.Reid 12
Departments of 'Chemistry and 2Biochemistry, University of Washington, Seattle, WA 98195, USA
Received March 9, 1987; Revised and Accepted June 10, 1987
ABSTRACT
The resonances of all the non-exchangeable protons (except 5'H and 5"H) of
d(CGAAAAATCGG) + d(CCGATTTTTCG), a putatively bent DNA duplex, have been
assigned using 'H two-dimensional nuclear magnetic resonance methods. The nuclear
Overhauser effect data indicate an overall B-form structure for this double-helical DNA
undecamer. However, several features of the NMR data such as some unusually weak
C8/C6 proton to Cl' proton NOE cross-peaks, the presence of relatively intense C2H to
Cl'H NOE cross-peaks, and unusual chemical shifts of some 2", 2', and 1' protons suggest
a substantial perturbation of the helix structure at the junctions and along the length of
the tract of A residues. These structural deviations are considered in terms of models of
DNA bending.
INTRODUCTION
In the last few years it has become apparent that the helix axis of certain doublehelical DNA sequences is bent in solution. The first evidence for DNA bending came from
the anomalously slow mobility of some restriction fragments from kinetoplast minicircles
of tropical parasites (1-3) and from bacteriophage A DNA (4) upon polyacrylamide gel
electrophoresis. A bent piece of DNA migrates slowly through a polyacrylamide gel due
to
its decreased ability to snake through the pores of the gel. In an elegant study, Wu and
Crothers (5) showed that the locus of the aberrant electrophoretic behavior in kinetoplast
DNA lay in a sequence containing recurring short stretches of 5 or 6 A residues with a
periodicity of about 10 base pairs. They proposed a model in which each oligo-A tract
induces a small kink or local helical distortion, and suggested that these defects are propagated in phase with the helical repeat (10-11 base pairs) resulting in a stable macroscopic
bend.
Recently, Hagerman (6) and Koo et al. (7) carried out a systematic study of synthetic
model DNA duplexes to determine which sequences exhibit bending. Both groups confirmed that continuous runs of A residues are required for the phenomenon; interrupted
runs (with a C, G or T in the middle) or runs of G residues in phase with the helical repeat
do not exhibit anomalously slow gel electrophoretic mobilities. Koo et al. found that a run
of 5 or 6 A residues was required for maximal bending, with 4 contiguous A's exhibiting a
—^——^—^__^^-____^_^_^___^__^^—^^—^_^-^^-^——_^__^_^_
5845
© IR L Press Limited, Oxford, England.
Nucleic Acids Research
smaller, but detectable, retardation. They concluded that accurate phasing of the 3' ends
•* •>
of the oligo-A stretches was more important than phasing of the 5' ends, suggesting that
bending is localized primarily at the 3' junction Hagerman's study indicated that three
_^
contiguous A residues was sufficient for virtually maximal bending; however. Hagerman's
A
sequences were of the type (GGAAATTTCC)n whereas the Koo et al. sequences contained
no internal palindromic symmetry within the 10 base pair repeat unit. Superimposing the
*"
^ .
additional two-fold symmetry creates 5'-AAAT-3' junctions in both strands. These workers
also investigated the role of bases flanking the A tracts and found that a T residue at the 3'
end produced maximal bending, but otherwise the identity of the flanking base mattered
little.
Numerous models have been proposed to account for the molecular basis of the observed sequence-directed bending. The models may be grouped into two classes as discussed by Koo et al. (7). The first class assumes that the runs of A residues retain a
general B-DNA structure and proposes that bending is induced by interactions of neighbor and next-nearest neighbor nucleotides. The second class of models assumes a different
right-handed helical structure for the A tract and proposes that bending occurs at the
junctions of the A tracts and adjacent B-DNA. Clearly, the understanding of DNA bending would be greatly enhanced if a detailed molecular structure of the junctions and of the
*<
v*
A
•*
y .,
oligo-A tract were available.
Two-dimensional ' H NMR has the potential for elucidating the structure of biological macromolecules in solution (8). The use of NOESY, COSY and RELAY 2D NMR
experiments facilitates the assignment of the cofolex 'H NMR spectra of proteins or
DNA molecules. Currently, it is possible to assign the spectrum of a non-symmetric DNA
duplex up to about 25 base pairs long. The NOESY experiment is based on the throughspace dipolar coupling between protons. With care, the resulting data can be converted
into distances between pairs of H-atoms <4.5 A apart. Provided there are enough measured distances, in theory one should be able to determine 06 initio the structure of the
molecule using distance geometry methods (9,10). Recently, the structures of several unusual DNA oligonucleotides were determined using two-dimensional NMR and distance
geometry methods. These include a hairpin (11), a duplex with a base pair mismatch (12),
and a duplex with an extrahelical residue (13).
We have thus undertaken a two-dimensional 'H NMR study of d(CGAAAAATCGG)
+ d(CCGATTTTTCG) with the aim of determining the detailed structure in solution in
order to elucidate the molecular basis of DNA bending. This sequence contains a tract
of 5 A residues followed by a T nucleotide: such a —GAAAAAT— sequence has been
shown to be the basis of the bending phenomenon. Here, we report the assignment of all
non-exchangeable proton resonances (except 5' and 5" protons) in this molecule. Several
interesting and unusual features of the 2D NMR spectra allow us to make some qualitative
5846
^
»_«
"*•
Nucleic Acids Research
comments about the structure in terms of the proposed models for the origin of DNA
bending. This work represents the first step towards determining the three-dimensional
structure of bent DNA by NMR and distance geometry methods.
MATERIALS ANT)
Sample Preparation. The DNA 11-mer d(CGAAAAATCGG) and its complement
d(CCGATTTTTCG) were synthesized using solid-phase phosphite triester techniques on
an Applied Biosystems 380A DNA synthesizer. Each strand was synthesized on a 10 /jmole
scale, yielding over 500 A260 units of crude material.
The individual strands of DNA in 0.5 M NaCl were precipitated with ethanol, redissolved in 1.5 ml H2O and chromatographed on a superfine Sephadex G-25 column (120
cm x 2 cm) in distilled H2O. Fractions of 1.5 ml were collected at a flow rate of 20 ml/hr.
The elution profile was monitored at 260 nm and aliquots of selected fractions (ca. 0.05
A26o) were 5' end-labelled with "7-32P-ATP (New England Nuclear) using polynucleotide
kinase (Sigma Chemical), then electrophoresed on a 20% polyacrylamide denaturing gel
(8M urea). Autoradiograms of these gels revealed good separation of the desired major
product, from the shorter failure sequences. Fractions containing only the full-length 11mer were pooled, resulting in the recovery of ~400 A260 units of each purified DNA strand
(-75% yield).
Equimolar amounts of each strand were mixed in 5 ml of 10 mM phosphate buffer
(pH 7) and annealed together by heating to ~75 °C and cooling slowly. The duplex
DNA was applied to a hydroxyapatite column (15 cm x 2.5 cm) and eluted at room
temperature with a gradient of 10mM-400mM phosphate buffer (pH 7) to remove singlestranded material. The duplex DNA (~20 mg) was then lyophilized, redissolved in 1.5
ml H2O and desalted on a superfine Sephadex G-15 column (6C cm x 2 cm) eluted with
distilled H2O. The desalted duplex DNA was lyophilized and redissolved in 0.4 ml buffer
containing 20 mM sodium phosphate (pH 7), 100 mM sodium chloride, 0.1 mM EDTA.
The sample was lyophilized, redissolved in D2O and lyophilized again. Finally, 0.4 ml of
99.99% D2O was added and the solution transferred into a 5 mm NMR tube. For NMR
studies in H2O the sample was lyophilized and taken up in 0.4 ml of 90% H2O/10% D2O.
The samples were >95% pure as judged by their 'H NMR spectra at 500 MHz.
NMR Spectroscopy. NMR spectra in D2O were acquired on a home-built spectrometer
operating at a field of 11.75T corresponding to a proton resonance frequency of 500 MHz.
Details of this spectrometer are described in Gladden and Drobny (14). All NMR spectra
in D2O solution were recorded at 30°C. One dimensional spectra of the exchangeable
protons in H2O solution were acquired on a Broker WM-500. These spectra were obtained
at several temperatures to investigate the exchange-broadening of the peaks due to fraying
of the helix ends (vide infra).
5847
Nucleic Acids Research
All 2D NMR spectra were collected with 512 complex points in t2 and 300-400 points
in t j . The dwell time was 200 //sec and the relaxation delay was 2 sec over and above the
acquisition time. Typically. 64 scans were obtained for each tj point and the 90° pulse
length was 12 /xsec. COSY and RELAY spectra were acquired in absolute magnitude
mode. Phase-sensitive COSY and NOESY spectra were recorded using the hypercomplex
methods of States et al. (15). The mixing times for the RELAY and NOESY experiments
were 40 and 300 msec, respectively. Spectra of exchangeable protons in H2O solution were
obtained with a Redfield 21412 soft pulse (16) to suppress the water peak, with the carrier
frequency set at 12.6 ppm from TMS. NOE difference spectra were acquired as described
previously (17).
The absolute magnitude COSY and RELAY data were apodized with a 512 point
sinebell, zero-filled to 1024 points and Fourier transformed in the t2 dimension. The ti
data were apodized with a sinebell of as many points as there were experiments, zero-filled
to 1024 points and Fourier transformed. Phase-sensitive data were processed similarly
except that 60° phase-shifted sinebell apodization and the appropriate phase corrections
were applied in both the tj and t2 dimensions. For purposes of comparing intensities,
the NOESY data were reprocessed with right-shifted 90° phase-shifted sinebell-squared
apodization functions having a value of one out to 384 points in the t2 dimension or a
value of one out to 225 points in the tj dimension. All two-dimensional data processing
was performed on a DEC MicroVax II using software developed by Dennis Hare.
RESULTS
The >H NMR spectrum of d(CGAAAAATCGG) + d(CCGATTTTTCG) in D2O
buffer at 30°C is shown in Figure 1. We expect a total of 193 resonance lines due to
non-exchangeable protons consisting of 154 sugar protons, 33 base protons and 6 methyl
groups. The peaks fall into six reasonably discrete regions according to chemical type, as
is typical for DNA, and these are labelled in the Figure. Also shown in the inset is the
numbering scheme for this 11-mer duplex. Close inspection of Figure 1 reveals a number of
minor impurity peaks. The majority of these are from by-products of the DNA synthesis
that co-purify with the DNA using our procedure. They are seen in all our samples and are
easily distinguished from DNA resonances. Some of the impurity peaks are due to short
failure sequences or oligonucleotides that do not form duplex. These are at a very low
level (<5%) as shown by gel electrophoresis, and are of no consequence in the 2D NMR
spectra.
The principles of the two-dimensional NMR assignment method for right-handed helical DNA have been described in detail elsewhere (18-20) and will be outlined only briefly
here. In Figure 2, an expanded portion of the 2D NOESY plot is shown, corresponding
to the cross-peaks between aromatic C6 or C8 base protons and Cl'/C5 protons. The five
5848
Nucleic Acids Research
S'-C-G-A-A-A-A-A-T-C-G-G
G-C-T-T-T-T-T-A-C-C-C-s'
•n Jl 20 19 18 17 16 IS H 13 1!
CMj
v
JH.2X
8R8H.2H
9.8
8.0
7.2
6.4
5.6
4.8
PPM
4.0
3.2
2.4
1 .6
0.8
0.0
Figure 1. 500 MHz 'H NMR spectrum of d(CGAAAAATCGG) ->- d(CCGATTTTTCG)
in D2O/20 mM sodium phosphate (pH 7|. 100 m.M sodium chloride and 0.1 mM EDTA,
at 30° C.
V
1
•
very intense cross-peaks correspond to the C6 and C5 proton resonances of thefivecytidine
residues in the duplex. These protons are separated by a fixed distance of 2.45A and are
thus expected to give rise to some of the most intense N'OE cross-peaks. The cytidine H5
and H6 resonances are J-coupled and these five peaks also appear in the corresponding
region of the COSY spectrum (Figure 3a).
In right-handed helical DNA. the purine C8H or pyrimidine C6H is not only close to
its intranucleotide Cl'H but also to the Cl'H of the preceding (5') nucleotide. Therefore,
each aromatic proton gives an NOE cross-peak to its own Cl'H as well as to the Cl'H
of the nucleotide on its 5' side. This forms the basis of one of the sequential assignment
connectivities in DNA since one can start at either end of the polynucleotide and assign
the C8H/C6H and l'H resonances to specific atoms in the macromolecule.
The complete C8H/C6H to l'H walk for both strands is outlined in Figure 2a,b. There
are a few chemical shift redundancies in this region, e.g. the A5 and A7 H8 resonances
are overlapped, as are the T18 and T19 C6H resonances; but the correct assignments are
easily verified in other regions of the spectrum. The assignment of the Cl' protons of
5849
Nucleic Acids Research
a
1 « : I • !•
i i i *
ICC
i C C A A A A 1
i
c
IT: I A S S
0
$
i
•
I*1
J
5
1
4
t9
a A
D
$r
0
••
8.0
•
I)
*w i
B.2
1
»
a 0
•
7.8
Figure 2. Expanded region of the 'H NMR NOESY spectrum of the DNA duplex in D2O
buffer at 30°C showing the cross-peaks between C6/C8 'C2 protons and Cl'/C5 protons,
(a) Trace of the aromatic/Cl' proton walk for strand 1 (Cl-Gll). The numbers label
the intranucleotide cross-peaks, (b) Same trace for strand 2 (C12-G22). (c) Cross-peaks
between the adenine C2H and the Cl' proton of the nucleoside on its 3' side and between
the adenine C2H and the Cl'H on the 3' side of its cross-strand partner are connected by
vertical dashed lines. A3 C2H has a single cross-peak to A4 Cl'H which is indicated by a
diamond ( • ). The cross-peaks between the cytidine C5H and its own C6H and the Cl'H
of its 5' neighbor are joined by horizontal bars.
5850
Nucleic Acids Research
3-2
8.0
7.8
7.6
7.4
7.2
T\0
1.2
Figurec 3. Expansion of the 'H NMR COSY spectrum of the DNA duplex in D2O buffer
at 30 C. (a) Region showing the cytidine C5H-C6H cross-peaks, (b) Region showing the
thymidine C6H-rnethyl cross-peaks.
5851
Nucleic Acids Research
2.8
1 .2
Figure 4. Expansion of the NOESY spectrum showing the cross-peaks between C6/C8
protons and 2"/2'/methyl protons. On the left is a trace of the aromatic/2"H walk for
strand 2 (C12-G22). On the right, the cross-peaks between a thymidine methyl group and
its own and 5' neighbor's C6H/C8H are joined by vertical bars. The diamond ( • ) denotes
the cross-peaks between the T19 methyl and the C6 protons of T18 and T19, which have
the same chemical shift.
A5 and A6 is potentially ambiguous but this ambiguity was resolved by comparison with
the NOESY spectrum of d(GCCAAAAATGCC) and its complement strand under similar
conditions (data not shown). In the latter sequence the l'H assignments are unambiguous
and the Cl'H resonance of the third A residue in the tract (corresponding to A5 in our
target sequence) is clearly upfield of the Cl'H resonance of the A residue on its 3' side.
The assignments in Figure 2 can be made with a high degree of confidence since
the spectrum contains additional information that provides self-consistency checks. For
example, the cytidine C5 protons give rise to an NOE cross-peak with the C8H/C6H
protons of the nucleotide on its 5' side. These are shown by horizontal bars in Figure 2c.
Notice that only three such contacts are seen since two of the cytidine residues of this
Nucleic Acids Research
•>
V
2.8
1.2
Figure 5. Expansion of the NOESY spectrum showing the cross-peaks between Cl'H and
2"/2'/methyl protons, (a) The cross-peaks between l'H and 2"H/2'H are connected by
horizontal bars. Note the weaker intensity of the 2'H cross-peaks, (b) The horizontal
dashed lines indicate the NOE connectivity of a cytidine C5H proton with the 2"H and
2'H of its 5' flanking residue.
r
5853
Nucleic Acids Research
duplex are at the 5' ends; the two cytidine resonances are thus independently identified as
either Cl or C12.
The thymidine C6H resonances can likewise be identified by their cross-peaks to
methyl groups in the upfield region of the spectrum (Figure 4). In right-handed helical
*
DNA, thymidine methyl groups are close in space to their own C6 proton and to the base
proton of the residue on the 5' side, and are thus expected to have two NOE cross-peaks
^
to aromatic resonances. The intranucleotide C6H and methyl resonances are identified
t
by their cross-peaks (through an easily detectable 4-bond scalar coupling) in the COSY
spectrum (Figure 3b). In the NOESY spectrum, four of the thymidine methyls have pairs
<
of cross-peaks in which both of the aromatic components are thymidine H6 resonances and
-«*
these must be T17, T18, T19 or T20. Of the other two methyl resonances at 1.07 ppm and
1.23 ppm, the former is easily identified as T8 by its cross-peak to the C8H of A7 and the
latter is T16 from its cross-peak to A15 C8H. Furthermore, it is also possible to carry out
a sequential 2"H to C8H/C6H walk in the region shown in Figure 4 in the same fashion as
*
the Cl'H to base proton walk shown in Figure 2, thereby confirming the aromatic proton
^„
'
assignments and simultaneously assigning the 2" protons.
An additional, and often more reliable, way to assign the 2" protons, as well as the
2' protons, is by the COSY and NOESY cross-peaks to their own Cl' protons, which
have already been assigned. This region of the NOESY spectrum is shown expanded in
Figure 5, with horizontal bars connecting the l'H-2'H and l'H-2"H cross-peaks of each
sugar. In addition to being a valuable assignment tool, the intensity (proximity) of the
*
1
-
l'-2' cross-peak compared to the l'-2"cross-peak is diagnostic of the sugar conformation
^"
in that these two distances change in the various allowed conformations of D-deoxyribose;
in most conformations the 2"H is closer to the l'H producing a more intense cross-peak.
These assignments can be made just as easily in the corresponding region of the COSY
spectrum (data not shown). For the most common C2'-endo sugar conformation in B-
«•
DNA, the 2"H is very weakly scalar coupled to 3'H allowing it to be distinguished from
the 2'H. In general, the 2'H of a given nucleotide is upfield of the 2"H. For this particular
oligonucleotide, the only exceptions occur in residue G2 where the 2'H and 2"H chemical
shifts are virtually identical, and in the 3' terminal residues (Gil and G22) where the 2'
<
A
*"
protons are downfield of the 2" protons.
The C3' and C4' protons were assigned by their NOE cross-peaks to Cl' protons. This
region is shown in Figure 6a. The C3'H/Cl'H cross-peak is generally less intense than the
corresponding C4'H/C1'H cross-peak, and the C3'H resonances are usually downfield of
the C4'H resonances. The l'H to (n +l)5'H/5"H cross-peaks are either extremely weak
or absent. The .assignments of the C3'H resonances from the Cl'H resonances can be
independently corroborated by C3'H cross peaks to the aromatic region, where we can
carry out a complete C3'H/base proton sequential walk (data not shown). Finally, the
i
y
'H
,
••
*
0
5854
Nucleic Acids Research
1
f
f
6
a
(
0
5
1
9
o
f
0
T
9
I
i
i
S
1)
P
*
<
i
g
13
*i
>
9
S3
is
f
©
Oc
6.2
6.0
5.8
@
5.6
PPM
0
0
5.4
5.2
5.0
Figure 6. (a) Expansion of the NOESY spectrum showing the cross-peaks between Cl'H
and the intranucleotide C3'H and C4'H, indicated by vertical bars, (b) Corresponding
region of the RELAY spectrum showing the intranucleotide Cl'H/C3'H cross-peaks. The
diamonds ( • ) denote impurity peaks or artifacts which are not present in either the COSY
or NOESY spectra.
5855
Nucleic Acids Research
I ! ]( S C ! M
1011
a
5' CGAAAAATCGG
GCTTTTTAGCC 5'
HDCBAEFIC
4
<
J.
14
—r~
12
10
Figure 7. (a) Downfield region of the 'H NMR spectrum of the duplex DNA in H2O buffer
at 32°C. Peaks A to I correspond to imino protons hydrogen-bonded between base pairs.
The peaks on the right correspond to amino protons (broad peaks) and carbon-bonded
aromatic protons of the bases. The sequence of base pairs numbered 1 to 11 is shown
in the inset together with the assignment of each imino proton, (b) One-dimensional
NOE difference spectrum showing the cross-relaxation between peak E (which has been
saturated) and imino peaks F. A and C2H peaks f and e. (c) Same as in (b) except
resonance F has been saturated.
5856
Nucleic Acids Research
ri"
C3'H assignments are made completely ironclad by observing coherence RELAY cross••«•
peaks from the C l ' proton as shown in Figure 6b. Unfortunately, coherence RELAY from
*
2'H or 2"H to the C4' proton is extremely weak and provides only partial verification of
the C4' assignments (data not shown). RELAY cross-peaks from 3'H to 5' and 5" protons
were not observed even after trying several mixing times, possibly indicating rather weak
'A
coupling between 3'H and 4'H. The C4'H assignments can still be confirmed by their crosspeaks to the C 3 ' protons in the COSY or RELAY spectrum near the diagonal (data not
shown).
%
r*+
•
The assignment of the adenine C2 protons was made from their NOE contacts to
imino protons in H2O solution. The one-dimensional ' H NMR spectrum of the DNA
sample in H2O at 32°C, showing the imino, amino and aromatic regions, is displayed
»..
each end of the helix have been exchange-broadened. Terminal fraying of the duplex was
confirmed by recording spectra at various temperatures down to 5°C (data not shown).
The imino protons were assigned using one-dimensional NOE difference techniques (21,22).
in Figure 7a. Only nine resolved imino peaks were observed: the imino resonances from
Representative difference spectra are shown in Figure 7b,c. For example, upon saturating
»
resonance E in the imino proton region, cross-saturation of imino peaks F and A and of
C2H resonances f and e was observed (Figure 7b). These protons must be separated by
•*
'*
one. Similarly, when resonance F was saturated, cross-saturation of peak E in the imino
region and of peaks f and e in the C2H region was observed (Figure 7c). Repeating the
experiment for peaks A to I establishes the linear relationship of the imino resonances in
less than 5A and therefore must belong to either the same base pair or to an adjacent
..
the DNA duplex as HDCBAEFIG. Resonances G, H, and I correspond to imino protons
in CG base pairs since they exhibit NOEs only to broad amino resonances in the 8-9 ppm
region, and not to narrow C2H lines.
^
The polarity of the assignments is determined by several means. One line of evidence is
^
that resonance G is the third residue to broaden as the temperature is raised. This coupled
to the fact that the guanine resonances H and I are adjacent to A-T base pairs establishes
^
the assignment of resonance G to the imino proton of residue GlO. Moreover, the imino
proton in an A-T base pair is expected to have NOE contact with the C2H of its own
t
adenosine and with the C2 of the A residue on the 3' side in B-DNA. Observation of these
types of contacts in the tract of A residues allows us to confirm the imino assignments.
These are shown in the inset of Figure 7. Furthermore, the C2H resonances of specific
*
adenosine residues can now be assigned since it is the stronger of the NOE contacts (crossstrand) upon irradiation of the complementary thymidine imino resonance. Going from
downfield to upfield, the C2H resonances are assigned as fedcab at 32°C where the small
••
letters have the same correspondence to nucleotides as the capital letters in the inset of
^
Figure 7. Assuming this order is not changed at 30°C, we can transfer these assignments
5857
Nucleic Acids Research
Table 1. Assignment of the Bent DNA Undecamer
1
11
C-G-A-A-A-A-A-T-C-G-G
G-C-T-T-T-T-T-A-G-C- C
22
12
Residue
C8H
Cl
G2
A3
A4
A5
A6
A7
C12
C13
G14
A15
T16
T17
T18
7.45
5.75
C2H
CH 3
7.53
6.94
6.84
6.90
7.49
6.92
7.25
5.42
1.07
7.60
7.38
5.79
5.54
7.70
7.66
7.79
8.10
7.68
7.04
7.33
7.36
7.36
7.29
7.39
T19
T20
C21
G22
C5H
7.77
7.99
7.91
7.85
7.81
7.85
T8
C9
GlO
Gil
C6H
7.81
1.23
1.42
1.50
1.53
1.55
5.61
Cl'H
C2"H
C2'H
C3'H
C4'H
5.56
5.08
5.68
5.63
5.71
5.80
5.97
5.77
5.54
5.55
6.00
2.13
2.48
2.67
2.66
2.74
2.79
2.77
2.31
2.19
2.56
2.22
1.57
2.51
2.51
2.43
2.40
2.41
2.32
1.81
1.78
2.49
2.40
4.50
4.80
4.91
4.89
4.88
4.90
4.85
4.71
4.70
4.84
4.52
3.90
4.11
4.25
4.28
4.29
4.32
4.34
4.07
3.98
4.19
4.07
5.84
5.35
5.48
6.17
5.83
6.07
6.07
6.02
5.95
5.60
6.01
2.37
2.23
2.69
2.85
2.48
2.54
2.57
2.56
2.38
2.24
2.25
1.90
1.90
2.59
2.55
1.92
2.12
2.12
2.06
1.99
1.90
2.50
4.53
4.70
4.89
4.92
4.73
4.80
4.82
4.81
4.78
4.72
4.56
3.98
3.97
4.23
4.38
4.09
4.11
4.13
4.12
4.06
4.02
4.04
The DNA duplex was in D2O solution at 30°C. Solution conditions are described in
the text.
to the 2D NOESY spectrum where we observe C2H to C2H cross-peaks near the diagonal
in the base proton region (data not shown) and C2H to Cl'H cross-peaks (Figure 2). The
chemical shifts of all non-exchangeable protons, except 5'H and 5"H, at 30°C are listed in
Table I.
DISCUSSION
In general terms, the NOESY spectrum of this double-helical DNA undecamer is
consistent with a solution structure within the broad family of B-DNA in that qualitatively
the intranucleotide and internucleotide connectivities characteristic of B-type DNA are all
5858
Nucleic Acids Research
observed. There are, however, some unusual and subtle features in the spectra that may
be correlated to structural anomalies in this putatively bent DNA duplex.
First, the G2 Cl' proton was found to have an exceptionally upfield-shifted resonance
position of 5.08 ppm, almost overlapping the 3' protons. To our knowledge, this is the
highest upfield Cl'H yet observed in unmodified DNA. Patel et al. (23) have observed
Cl'H resonances even further upfield but only in DNA duplexes containing a chemically
modified guanosine nucleotide. The most likely cause of the anomalous position of G2
Cl'H is a 0.3-0.4 ppm upfield ring-current shift from a heterocyclic base. Such upfield
shifts derive from stacking geometry, which eliminates the G2 purine as the cause of the
shift. The cytosine in position one has a much weaker ring current, is quite far away
from G2 l'H in right-handed DNA, and does not shift the G2 l'H resonance in several
other duplexes we have studied that begin with CG in the first and second positions.
This leaves the strong ring-current of the adenine moiety of A3 as the only remaining
candidate causing the upfield shift on G2 Cl'H. This effect is not seen on the analogous
G4 Cl'H in (CGCGAATTCGCG)2 which resonates at 5.42 ppm (20) in a sequence that
does not exhibit bending. However in a different bent sequence, GCCAAAAATGCC, we
have observed an analogous dramatic upfield shift of C3 Cl'H, suggesting that the effect is
peculiar to the residue proceeding the oligo-A tract in bent DNA sequences. In the present
CGAAAAATCGG sequence the most likely way for the A3 base to have this upfield effect
on G2 Cl'H is for it to markedly propeller-twist or to side-slip, or both.
The sugar of the G2 residue also appears to be in an unusual conformation. Evidence
that this sugar is structurally perturbed ib given by the unusual chemical shift positions of
the 2' and 2" protons of G2, in which 2'H resonates very slightly downfield of 2"H. This is
in contrast to all the other nucleoside residues in this macromolecule (except the terminal
Gil and G22) and in other DNA duplexes reported in the literature; in all these cases
the 2'H almost always resonates upfield of the 2"H. The only exceptions are at helix ends
(24,25) or at sites of chemical modification (23).
Another unusual and interesting feature of the NMR data is the presence of relatively
strong NOE cross-peaks between adenosine C2 protons and Cl' protons of the 3' neighbor on the same strand and across the strand to the 3' neighbor of the complementary
residue. Such cross-peaks have been observed in previous 2D NMR studies of B-form DNA
(23,26,27), and they are usually very weak. Interestingly, these cross-peaks are seen in the
NOESY spectrum of poly(dA)-poly(dT) which one might expect to have a similar structure to our oligo-A tract (27). In the NOESY spectrum of the present duplex (acquired
with a mixing time of 300 msec), we observe at least 10 C2H/Cl'H cross-peaks of which
two are quite strong (comparable in intensity to some C8H/C6H to l'H cross-peaks), six
are intermediate, and two are weak (Figure 2c). At least four of these cross-peaks are
still visible in a NOESY spectrum acquired with 150 msec mixing time (data not shown)
5859
Nucleic Acids Research
indicating reasonably short distances between these C2H and Cl'H atoms.
From the coordinates of classical B-form DNA determined by Arnott and co-workers
(28), one would not expect to see these cross-peaks since there are no C l ' protons within
5A of any adenine C2 protons. The work of Dickerson et al. (29) on a single crystal
of the self-complementary DNA d(CGCGAATTCGCG) however, reveals extensive local
deviation of the structure from idealized B-form DNA. For example, there is considerable
variation of the helix pitch and propeller twist of the base pairs at each position in the
sequence. Using the coordinates from Dickerson and co-workers' study, we find that an
adenosine C2H may be within ~4 A of the Cl'H of the 3'flanking residue and the Cl'H of
»*
..
4
*
the nucleotide to the 3'side of the cross-strand partner may also approach this close, thus
giving rise to NOE cross-peaks. Our observation of these cross-peaks suggests that there
is substantial deviation from classical B-DNA structure in the oligo-A tract of our DNA
duplex. Morever, we note that the intensities of the different C2H/C1'H cross-peaks vary
and are strongest for A6 C2H to T18 Cl'H and for A7 C2H to T17 Cl'H. This suggests
"*• -
that the deformation in the helix that allows these cross-peaks to appear is greatest near
the fourth and fifth adenosine residues from the 5'end of the tract. In the 2D NMR study of
poly(dA)-poly(dT) by Behling and Reams (27), the interstrand A 2H to T l'H cross-peak
^,
was also found to be more intense than the corresponding intrastrand cross-peak. This
•*•*
observation is consistent with the compression of the minor groove in B-form DNA.
Further unusual aspects of the NMR spectra are observed in the intensities of the NOE
cross-peaks between base protons and Cl'protons. In idealized B-DNA. the intensities of
all of these cross-peaks are expected to be quite similar. In the NOESY spectrum of the
undecamer, however, seven of these cross-peaks are exceptionally weak. These are G2
C8H/C1 Cl'H, A6 C8H/A6 Cl'H. C9 C6H/T8 Cl'H, C9 C6H/C9 Cl'H, G14 C8H/C13
Cl'H, C13 C6H/C13 Cl'H, Cl3 C6H/C12 Cl'H. We note that these anomalous NOE
intensities occur primarily for residues at the 3' end of the A tract, although anomalies
also occur at the 5' end and within the oligo-A tract at the fourth adenosine residue. It
is tempting, but somewhat dangerous, to draw conclusions about the molecular structure
from NOE intensities measured at only one mixing time, since numerous factors can affect
these values. For example, the observed weak NOE intensities may indicate that the
relevant protons are quite distant, or that there may be complex cross-relaxation pathways
between them (spin-diffusion), or that there is large-amplitude motion of one of these
moieties, or even some combination of these factors. Nevertheless, these preliminary results
suggest that there is a substantial perturbation of the helix structure at both ends of the
stretch of A residues with perhaps smaller perturbations at the fourth A nucleotide of the
oligo-A tract.
In summary, we have assigned all of the non-exchangeable protons (except 5'H and
5"H) in the 'H NMR spectrum of d(CGAAAAATCGG) + d(CCGATTTTTCG) in so5860
J<
«-
*
V
**•
m_,
"'
«
r
'
,,,
••
Nucleic Acids Research
lution. Qualitative analysis of the NMR parameters indicate that although the overall
structure of this duplex is B-form there are several significant local structural deviations.
In particular, there is a structural perturbation at the 5' end of the oligo-A tract and a
more extensive one at the 3' end. There is also a deviation of the helix parameters in the
A tract itself which is maximum for A6 and A7. These observations are consistent with
models of DNA bending but lack the molecular level detail to significantly increase our
understanding of the phenomenon. Therefore we are currently collecting a time-dependent
set of 2D NOESY experiments which will ultimately yield relatively precise proton-proton
distances. From this data set, we hope to obtain a detailed three-dimensional structure of
this macromolecule using distance geometry methods, and a clearer picture of the molecular basis of DNA bending.
ACKNOWLEDGEMENTS
We thank Susan Ribeiro, Jerilyn Beltman, Ponni Rajagopal, Leh-Jame Lin, and Kelly
Horng for their help in preparing the DNA sample, and Mary Coventry for her help in
preparing the manuscript.
REFERENCES
1. Simpson, L. (1979) Proc. Natl. Acad. Sci. USA 76, 1585-1588.
2. Marini, J.C., Levene, S.D., Crothers, D.M. and Englund, P.T. (1982) Proc. Natl.
Acad. Sci. USA 79, 7664-7668.
3. Hagerman. P.J. (1984) Proc. Natl. Acad. Sci. USA 81. 4632-4636.
4. Zahn, K. and Blattner, F.R. (1985) Nature 317, 451-453.
5. Wu. H.-M. and Crothers, D.M. (1984) Nature 308, 509-513.
6. Koo, H.-S., Wu, H.-M. and Crothers. D.M. (1986) Nature 320, 501-506.
7. Hagerman, P.J. (1985) Biochemistry 24, 7033-7037.
8. Wemmer, D.E. and Reid, B.R. (1985) Ann. Rev. Phys. Chem. 36, 105-137.
9. Havel, T.F., Crippen. G.M. and Kuntz. I.D. (1979) Biopolymers 18, 73-81.
10. Havel, T.F. and Wuthrich, K. (1985) J. Mol. Biol. 182, 281-294.
11. Hare, D.R. and Reid, B.R. (1986) Biochemistry 25, 5341-5350.
12. Hare, D.R., Shapiro, L. and Patel, D.J. (1986) Biochemistry 25, 7445-7456.
13. Hare, D.R., Shapiro, L. and Patel, D.J. (1986) Biochemistry 25, 7456-7464.
14. Gladden, J. and Drobny, G. (1987) (manuscript in preparation).
15. States, D.J., Haberkorn, R.A. and Ruben, D.J. (1982) J. Magn. Reson. 48, 286-292.
16. Redfield, A.G., Kunz, S.D. and Ralph, E.K. (1975) J. Magn. Reson. 19, 114-117.
17. Hare, D.R. and Reid, B.R.(1982) Biochemistry 21, 1835-1842.
18. Scheek, R.M., Russo, N., Boelens, R. and Kaptein, R. (1983) J. Amer. Chem. Soc
105, 2914-2916.
19. Feigon, J., Leupin. W., Denny, W.A. and Kearns, D.R.(1983) Biochemistry 22, 594320. Hare, D.R., Wemmer, D.E., Chou, S.-H., Drobny, G. and Reid, B.R. (1983) J. Mol.
Biol. 171, 319-336.
21. Patel, D.J., Kozlowski, S.A., Marky, L.A., Broka, C , Rice, J.A., Itakura, K. and
Breslauer, K.J. (1982) Biochemistry 21, 428-436.
22. Chou, S.-H., Hare, D.R., Wemmer, D.E. and Reid, B.R. (1983) Biochemistry 22,
3037-3041.
23. Patel, D.J., Shapiro, L., Kozlowski, S.A., Gaffney, B.L. and Jones, R.A. (1986) J.
Mol. Biol. 188, 677-692.
5861
Nucleic Acids Research
24. Wemmer, D.E., Chou, S.-H. and Reid, B.R. (1984) J. Mol. Biol. 180, 41-60.
25. Weiss, M.A., Patel, D.J.. Sauer, R.T. and Karplus, M. (1984) Proc. Natl. Acad. Sci.
USA 81, 130-134.
26. Weiss, M.A., Patel, D.J., Sauer, R.T. and Karplus, M. (1984) Nucleic Acids Res. 12.
4035-4047.
'
27. Behling, R.W. and Kearns, D.R. (1986) Biochemistry 25. 3335-3346.
28. Arnott, S. and Hukins, D.W.L. (1972) Biochem. Biophys. Res. Commun. 47, 15041509.
29. Dickerson, R.E. and Drew, H.P. (1981) J. Mol. Biol. 149 761-786.
5862