volume 15 Number 22 1987 Nucleic Acids Research Crystal structure analysis of an A-DNA fragment at 1.8 A resolution: d(GCCCGGGC) Udo Heinemann*, Hanspeter Lauble, Ronald Frank1 and Helmut Bldcker1 Abteilung Saenger, Institut fur Kristallographie, Freie Universitat Berlin, Takustrasse 6, D-1000 Berlin 33 and kjBF (Gesellschaft fur Biotechnologische Forschung mbH), Mascheroder Weg 1, D-33OO Braunschweig, FRG Received September 4, 1987; Revised October 20, 1987, Accepted October 27, 1987 ABSTRACT Single crystals of the self-complementary octadeoxyribonucleotide d(GCCCGGGC) have been analysed by X-ray diffraction methods at a resolution of 1.8 A. The tetragonal unit cell of space group P4,2.2 has dimensions of a=43.25 A and c=24.61 A and contains eight strands of the oligonucleotide. The structure was refined by standard crystallographic techniques to an R factor of 17.1% using 1359 3a structure factor observations. Two strands of the oligonucleotide are related by the crystallographic dyad axis to form a DNA helix in the A conformation. The d(GCCCGGGC) helix is characterized by a wide open major groove, a near perpendicular orientation of base pairs to the helix axis and an unusually small average helix twist angle of 31.3° indicating a slightly underwound helix with 11.5 base pairs per turn. Extensive cross-strand stacking between guanine bases at the central cytosine-guanine step is made possible by a number of local conforraational adjustments including a fully extended sugar-phosphate backbone of the central guanosine nucleotide. INTRODUCTION Single crystal X-ray diffraction and NMR spectroacopic studies of synthetic oligodeoxyribonucleotides have demonstrated the presence in DNA helices of a characteristic sequence-dependent structure (1-3). These investigations have not only led to the discovery of an entirely new form of DNA, left-handed Z-DNA (4-6), but have also widened our understanding of the classical A and B forms of DNA as previously defined by X-ray fiber diffraction methods (7-9). In all principal helix forms, the fine spatial structure depends on the base sequence in a partly predictable manner (10,11). Until this day, most individual DNA sequences studied by X-ray methods have been found to adopt the A conformation in the crystal, i.e. we have more information regarding the sequencedependent structure of A-DNA than for any other helix type. © IRL Press Limited, Oxford, England. 9531 Nucleic Acids Research Crystal structure analyses at resolutions between 1.7 and 3 A have been reported for d(GGTATACC) (12-14), d( I CCGG) (15.16), d(GGCCGGCC) (17,18), d(GGGGCCCC) (19), d(GGATGGGAG) (20), d(CCCCGGGG) (18), d(ACCGGCCGGT) (21) and d(GTGTACAC) (21a). In addition, the three-dimensional structure of a DNA/RNA hybrid is known (22) and several variants of the above sequences incorporating substituted bases (12-14,23) or non-Watson-Crick base pair mismatches (24,25) have been investigated. A comparison of these DNA fragments shows a considerable variation in both averaged and individual helical parameters. Here the crystal structure of the octadeoxyribonucleotide d(GCCCGGGC) is described. This octamer represents an extreme case of a slightly underwound A-DNA helix with a wide major groove, a very small base pair tilt and a concomitant large rise per base pair. EXPERIMENTAL DNA synthesis and crystallization d(GCCCGGGC) was synthesized in solution by a modified phosphotriester approach (26). Crystals were grown at room temperature from 10 mg/ml DNA in 10 mM Tris-HCl or 20 mM Na-cacodylate pH 7.5 in the presence of 10 mM MgCl- by microdialysis (27) against an identical buffer solution containing 2-methyl-2,4pentanediol (MPD) or isopropanol. Large crystals in the form of truncated tetragonal bipyramids appeared at concentrations of 28% isopropanol or 30% MPD. The crystals of d(GCCCGGGC) belong to the tetragonal space group P4.2..2 (no. 9 6 ) . The unit cell constants of a=43.25 A and c=24.61 A are similar to the crystal parameters of several octameric A-DNA fragments (15,17). One single strand of d(GCCCGGGC) (or four base pairs) constitutes the crystallographic asymmetric unit. Data acquisition and processing Diffraction data were collected from one crystal at room temperature on a Stoe four-circle diffractometer using Ni-filtered C u K a radiation from a sealed tube X-ray generator. Peak intensities were measured in 0.8* oj-acans over 12 to 60 sec per reflection, depending on intensity and peak shape. A full quadrant of reciprocal space with a limiting sphere of 1.8 A resolution 9532 Nucleic Acids Research was evaluated. The data set included 2599 reflection measurements with intensities exceeding one standard deviation. The crystal had suffered no observable radiation damage. The diffraction intensities were corrected for Lorentz and polarization factors and for the effect of absorption (28). Due to the plate-like shape of the crystals, absorption scale factors of up to 2.3 had to be applied. 2000 multiple measurements in 960 unique reflections yielded a symmetry R factor of 2.7% ( R = sym El<F>-F(i)| /Ef(i), over all i measurements). The final diffraction data set contained 1553 unique reflections with F ^ lcr(F) . Structure solution and refinement The similarity in sequence and crystal parameters of the DNA octamer under investigation with the oligonucleotides d( CCGG), d(GGCCGGCC) and d(CCCCGGGG) immediately suggested that, as with these DNA fragments, an A-DNA helix had formed where the complementary strands were related by the twofold axis of the unit cell. A double-helical model of d(GCCCGGGC) was therefore generated using idealized A-DNA coordinates based on fiber diffraction (79 ) . The four base pair asymmetric unit of the octaraer model was least-squares fitted to the structure of d( CCGG) (15,16) and placed into the unit cell accordingly. Owing to the differences in cell constants, the resulting octamer helix showed a break between the two halves leading to a C3'-O3' bond length of 4.5 A. The orientation of the four base pair fragment as a rigid body was least-squares refined, slowly increasing the resolution from 8 to 3 A. After this refinement the crystallographic residual R = LIF O -F c | /£F o was 44.4% and an intact helix with a C3'-O3' bond length of 1.42 A had been reconstituted. Constrained-restrained least squares refinement (29,30) was subsequently used to improve the fit between calculated and observed structure factor amplitudes and to extend the resolution from 3 to 2.5 A. After initial model idealization the octanucleotide double helix was divided into 8 nucleoside pairs with three internal degrees of freedom and 14 connecting rigid phosphate units and refined in space group P 4 3 (origin of P4,2 1 2). The refinement converged at R = 35.0%. Since no symmetry restraints had been applied the two equivalent strands adopted considerably different conformations. 9533 Nucleic Acids Research Table 1 Crystallographic and stereocheraical refinement parameters. Resolution range Number of reflections (Iff on F ) Off on F°) R factor (Iff reflections) (3ff reflections) F /F correlation coefficient o c Sugar-base bond distances Sugar-base bond angle distances Phosphate bond distances Phosphate angle and H-bond distances Planar groups Chlral volumes Single torsion contacts Multiple torsion contacts Isotropic thermal factors sugar-base bonds sugar-base angles phosphate bonds phosphate angles and H-bonda 6.0-1 .8 A 1475 1359 17.8% 17.1% 0.965 0.011 0.033 0.040 0.065 0.018 0.044 0.041 0.201 1.10 1.66 2.74 1.98 / / / / / / / / / / / / 0.025 0.050 0.050 0.075 0.030 0.100 0.250 0.250 2.50 3.50 3.50 3.50 A A A A A A3 A A AJ AJ A3 A* The R factor is defined as £ [F -F I/EF_.. the correlation coefficient is given by £[(F -<F 5)(F V » ] / [ j ; ( F -<F »*• £(F -<F > J 2 ] a / 2 . For the stereocfiemical parameters the left number givis tfie r.m.s. deviation from ideality and the right number is the sigma value used in the refinement. The weight applied on the corresponding restraint is the inverse square of the sigma value. At this stage the constrained refinement was replaced by stereochemically restrained least squares using the nucleic acids version NUCLSQ (31) of the Hendrickson-Konnert program (32,33) in conjunction with FRODO (34) model building on a vector graphics system. Restraints were placed on bond lengths and angles, base planes, deoryribose chiral centers, non-bonded contacts and thermal factors of linked atoms after these had been allowed to vary. In order to retain reasonable stereochemistry the entire eight base pair double helix rather than the asymmetric unit (one strand) was used in the refinement with additional restraints on hydrogen bonds in Watson-Crick base pairs. Initially, the two strands were forced towards identical stereochemistry by a symmetry restraint. This was found unnecessary in the later stages of refinement where the structure showed little tendency to deviate from ideal dyad symmetry. In the first cycles of NUCLSQ refinement the resolution was extended from 2.5 A to the limit of 1.8 A. The full 1CT data set 9534 Nucleic Acids Research including 1475 reflections was used subsequently. Individual temperature factors were introduced into the refinement at R = 31.2% and further progress was achieved by the incorporation of solvent molecules identified on the graphics system. Before each FRODO session perfect dyad symmetry of the helix was reconstituted by averaging over the equivalent strands. Double difference Fourier maps (2F -F ) and difference Fourier maps (F -F ) were o c o c used for the identification of solvent and to check the model. No manual revisions of the DNA helix model was necessary throughout refinement but much care was invested into the identification of solvent molecules. Only those difference Fourier peaks were assigned as water molecules that were nearly spherical in shape and were located in hydrogen bonding distance to polar DNA atoms or other waters. Despite the high resolution no cations bound to the sugar-phosphate backbone of d(GCCCGGGC) could be located. RESULTS AND DISCUSSION Refinement results The stereochemically restrained least-squares refinement of d(GCCCGGGC) converged at R = 17.1% for 1359 3a structure factor observations in the 6 to 1.8 A resolution range (R = 17.8% for 1475 la F o ) . The r.m.s. positional shift in the last cycle was 0.022 A and the r.m.s. temperature factor shift was 0.27 A 2 . In the final difference Fourier map the highest feature appeared with an electron density of 0.21 e/A3. Table 1 gives a summary of stereochemical and crys,tallographic parameters at the 'end of refinement. 34 solvent molecules were identified per strand of the octadeoxyribonucleotide, 8 of which refined to temperature factors between 50 and 60 A 2 . These water molecules were retained in the coordinate set since they reappeared in the difference electron density maps after having been omitted from refinement for five cycles. All other water molecules had thermal factors below 50 A 2 . The complete model for the d(GCCCGGGC) double helix thus consists of 322 DNA and 68 water oxygen atoms. Atomic coordinates have been deposited with the Brookhaven Protein Data Bank from which copies are available. ' 9535 Nucleic Acids Research Global conformation In the crystal, d(GCCCGGGC) forms an A-DNA double helix with perfect dyad symmetry. The bases on the first strand are numbered 1-8 and on the stereochemically equivalent complementary strand 9-16. Figure 1 presents several views of the structure of the octadeoxyribonucleotide helix. It is immediately apparent that the conformation of the octamer deviates significantly from A-DNA as defined by X-ray fiber diffraction (8,9). 9536 Nucleic Acids Research (c) Figure 1. Stereo representation of the structure of d(GCCCGGGC). Carbon, nitrogen, oxygen and phosphorus atoms are drawn with increasing radius, (a) View into the minor groove, helix axis vertical, (b) side-on view, dyad symmetry axis horizontally across the helix, (c) view into the major groove, (d) view down the helix axis. The average helix twist angle of the DNA fragment is 31.3" (Table 2 ) . Thus, there are 11.5 base pairs per complete turn of the helix putting this structure halfway between the classical A and A' helical conformations (38). The unusually small mean helix twist angle is caused predominantly by a partial unwinding of the helix between base pairs C4-G13 and G5-C12 which indicates a perturbation in the center of the helix structure where a poly(C) 9537 Nucleic Acids Research Table 2 Structural parameters for d(GCCCGGGC). Helix Rise per B.p. B..p. Propel lor twist b.p. roll tilt twist 5" 31 Gl - C16 C2 - G15 C3 - G14 C4 - G13 G5 - C12 G6 - Cll G7 - CIO C8 - G9 31 5' Average Fiber X-ray max. X-ray min. (°) 30.4 33.9 33.6 23.2 33.6 33.9 30.4 (A) 3.51 3.24 2.86 3.18 2.86 3.24 3.51 (•) C) C) (A) -6.13 -3.83 2.46 1.65 -1.65 -2.46 3.83 6.13 10.63 4.94 5.23 8.01 8.01 5.23 4.94 10.63 8.89 13.13 8.24 6.03 6.03 8.24 13.13 8.89 3.94 4.18 4.15 3.71 3.71 4.15 4.18 3.94 4.00 31.3 3.20 7.20 9.07 4.4 32.7 2.56 22.0 6.0 4.1(c) 14.0(a) 16.2(a) 34.0(a) 3.10(b) 3.5(a) 9.0(b) 8.6(d) 31.6(c) 2.84(a) The parameters are those defined by Dickerson and collegues (11,35-37) where the helix twist measures the angle at which the Cl'-Cl1 vectors of two successive base pairs intersect when viewed down the helix axis, base pair roll and tilt describe rotations of the best plane through a base pair about the long and the short axis of the base pair, respectively. The propellor twist is the angle of rotation about the C6(pyrimidine)-C8(purine) vector between the bases of a pair a'nd D measures the perpendicular displacement of this vector from the helix axis. The corresponding parameters for A-DNA as defined by fiber diffraction methods are taken from Arnott and colleages (9). X-ray max. and min. give the largest and smallest mean values for each parameter as observed by single crystal analysis of oligonucleotides (2): (a) d( i CCGG) (15,16), (b) d(CCCCGGGG) (18), (c) d(GGGGCCCC) (19), (d) d(GGCCGGCC) (17). stretch leads into a poly(G) stretch. At first glance, this reduction in local helix twist angle might be regarded as a proper Calladine strategy (10,11) for avoiding an inter-strand minor groove clash between the guanine bases at a pyrimidine-purine step. In addition to the mean helix twist angle other parameters have adopted values outside the range previously observed in crystalline A-DNA fragments (2) and different from the fiber diffraction model of A-DNA (9). The mean rise per base pair is 3.2 A and thus larger than expected for A-DNA, the average base pair tilt against the helix axis is only 7.2° and, most significantly, the major groove width measures 9.8 A (P-P separation minus 5.8 A 9538 Nucleic Acids Research 3. U 8 12 16 20 Base pair tilt Is) Figure 2. Dependence of major groove width and average rise per base pair on the mean base pair tilt in oligonucleotide crystal structures in the A and B-DNA conformation. Data are taken from (2). The solid symbols represent the fiber diffraction models of B-DNA (top left) and A-DNA (bottom right), symbols for a DNA/RNA hybrid (22) have been placed in parentheses and symbols for d(GCCCGGGC) are shown enlarged. The linear correlation coefficient with the mean base pair tilt is -0.983 for the major groove width and -0.976 for the mean rise per base pair. to account for the van der Waals radii of the two phosphate groups). Base pair tilt angle, rise per base pair and major groove separation are highly correlated as shown in Figure 2 which may also serve to place the global structure of d(GCCCGGGC) in the context of known DNA conformations. The polymorphism of A-DNA structure is most dramatically demonstrated by a comparison of the major groove widths of crystalline d(GCCCGGGC) and a model of the same sequence based on idealized coordinates from fiber diffraction studies of random-sequence DNA in the A conformation (7-9, see Figure 3 ) . There is much difference in the accessibility of bases in the major groove which may be important concerning a possible role of A-DNA in living cells (20,39). The low-resolution structure of a 9 base-pair DNA fragment covering part of the binding site for Xenopus laevis transcription factor IIIA (20) also shows A-form DNA with a very wide major groove. 9539 Nucleic Acids Research (a) (b) Figure 3. Space filling drawing of (a) the crystal structure of d(GCCCGGGC) and (b) an A-DNA model for the same sequence based on idealized coordinates from fiber diffraction (7-9) in an orientation which emphasizes the difference in major groove separation. Phosphorous atoms have been drawn in black to mark the sugarphosphate backbone. Base pair geometry and stacking Four individual base pairs and four base pair steps are found in the symmetric octamer double helix. All base pairs display the usual positive propellor twist. The roll angles (about the long axis Of a base pair) for individual base pair steps in d(GCCCGGGC) are 6.3" (G1-C2/C16-G15), 9.0° (C2-C3/G15-G14), 3.1* (C3-C4/G14-G13) and -0.1* (C4-G5/G13-C12) . This behaviour is in contrast to the Calladine/Dickerson predictions (10,11) which would call for a maximum roll angle at the central C-G base pair step to relieve a possible minor-groove purine-purine clash. The local helical conformation of DNA is dictated by the sequence-dependent stacking of base pairs. In the d(GCCCGGGC) helix stacking involving the guanine bases is strongly favoured over stacking with cytosines (Figure 4 ) . From the terminal base pair step into the center of the helix we observe increasing sliding out of the stack of base pairs, predominantly along their long axes, resulting in perfect inter-strand stacking between G5 and 9540 Nucleic Acids Research C16...G1 G15...C2 C16...G1 G15...C2 G15...C2 G14...C3 G15...C2 G14...C3 G14...C3 G13...C4 G14...C3 G13...C4 G13...C4 C12...G5 Figure 4. The four different base pair steps of d(GCCCGGGC) viewed down the helix axis. The upper nucleoside pair and the phosphate units are shown with solid bonds. Note the C3'-exo sugar pucker of C16 (top) and the pronounced interstrand stacking between G5 and G13 (bottom) which is made possible by a reduction in local helical twist and a fully extended sugar-phosphate backbone. 9541 Nucleic Acids Research Table 3 Stereochemistry of the sugar-phosphate backbone. a Gl C2 C3 C4 G5 G6 G7 C8 0 -84 .6 162.4 -60 .7 174.5 -82 .2 -173.7 -170 .5 -163.9 -78 .4 -178.0 -69 .1 173.6 -49 .7 166.8 7 6 e 47.6 69.2 42.0 54.0 142.8 57.0 61.6 55.8 78.5 71.3 80.9 81.0 67.6 80.1 78.8 145.8 -161.4 -159.6 168.3 -178.2 -154.9 -148.3 -161.9 r -55.7 -63.4 -46.0 -69.3 -72.6 -74.5 -66.9 X -158.7 -161.9 -159.0 -160.4 -177.3 -173.4 -165.5 -119.9 T M 39.1 43.9 45.0 35.1 47.8 41.8 43.2 42.8 P 8.6 21.9 12.8 22.3 16.0 18.9 15.3 186.5 pucker C3 '-endo C3 1-endo C3 '-endo C3 '-endo C3 '-endo C3 '-endo C3 '-endo C3 1 -exo Backbone torsion angles are defined as P a 05' 0 C5' 7 C4' 5 C3' £ 03' f P and the glycoayl angle is C2_tU X Cl '_O4"'~rpyrTmldTne«T~an3 C4 H9 X cl'~o"4' (purines). The p«eudorotation magnitude (TM> and phase (P) are gTven according to (41). G13 at the central C-G step. In order to maximize this stacking interaction the roll angle between the central base pairs has to be close to zero. A buckling of the central base pairs with an angle of -4.6* contributes to a parallel orientation of the guanine planes. Since, in addition, the propellor twist of the C4G13 base pair (and the symmetry-related G5-C12) is rather small (6.0°), the sliding of base pairs positions the six-membered rings of the guanine bases almost perfectly flat atop each other. The extended backbone at C4-G5, necessary to permit inter-strand purine-purine stacking, causes the reduction in local helix twist angle (see above). The single parameter that determines the local structure of d(GCCCGGGC) is the cross-strand stacking of guanine bases at the central C-G step. Analogous behaviour has been described for the helical structure of the related A-DNA octamers d(CCCCGGGG) (18) and d(GGGCGCCC) (40). Calladine and Drew (40a) have early on recognized the importance of pyriraidine-purine (and here especially C-G) steps for DNA helix structure. They have proposed two distinctly different stacking geometries: pyrimidine-purine stacking on the same strand in B-DNA and cross-strand stacking of purines, as observed in this structure, in A-DNA. Since preceding dinucleotide steps in d(GCCCGGGC) have no preference for either mode, the other base pairs follow the motion of the central guanines forced by "the passive, elastic stiffness of the backbone" (40a), as evidenced by the increase of the base pair slide (1.1 - 1.6 1.7 - 2.3 A) from end to center of the helix. In this sense the slide is in fact a cooperative parameter capable of being transmitted along the DNA helix. 9542 Nucleic Acids Research Sugar-phosphate backbone In the refinement of the octamer structure no restraints have been placed on sugar pucker and backbone torsion angles. Yet seven out of the eight deoxyribose units have adopted the usual C3'-endo conformation (Table 3 ) . The degree of puckering as measured by the pseudorotation magnitude 7"M (41) is almost uniform over the entire strand. Only the 3'-terminal sugar has adopted a pucker in the opposite domain (C3'-exo) which remains without major consequences for the helix structure. A similar observation has been made for d( CCGG) (15,16). It may be remarked once again that the A-DNA conformation appears to allow less variability in sugar pucker than the B form. With the exception of nucleotide unit G5 and the C3'-exo sugar pucker of C8 all backbone torsion angles fall into the range expected for A-DNA (2). To permit the above discussed cross-strand stacking of guanines in the center of the helix torsion angles a and 7 have to change from their standard values (-)gauche and (+)gauche to trans, i.e. the backbone has to be fully extended. As a consequence, the P-P separation between the nucleotide units 5 and 6 is increased to 6.8 A over an average of 6.0(3) A for the remaining intra-strand phosphate-phosphate steps. Again, this phenomenon has been noted before for C-G base pair steps in A-DNA (18,40). Inasmuch as it suggests a continuum of possible conformations, the term "elastic stiffness" then appears to be no adequate decription of the DNA backbone since the latter is capable of adopting distinct conformational states within one helix type. Concerning a possible biological function it should be pointed out that here sequence information is reflected in the conformation of the sugar-phosphate backbone. An information readout, thus, would not necessarily have to involve direct contact (of a protein, e.g.) with the bases but could occur by interaction with the phosphate groups. Thermal parameters Atomic temperature factors of linked atoms are restrained during least-squares refinement to prevent unreasonable differences of thermal motion (33). As often observed in oligonucleotide crystal structures, in d(GCCCGGGC) the mean isotropic 9543 Nucleic Acids Research o Figure 5. Hydration of the d(GCCCGGGC) double helix. The view is into the major groove as in Figure lc. Water molecules appear as open spheres. temperature factors are lowest for atoms in the bases (21.5 A 2 ) and highest in the phosphate groups (27.4 A 2 ) with sugars in between (24.8 A 2 ) . There is a tendency of atoms in central bases and sugars to adopt lower B values than at the ends of the helix but not so for the phosphate groups. For the solvent structure no partial occupancies were allowed. The temperature factors thus take up the effect of incomplete occupancy of water sites and do not strictly reflect thermal motion. Temperature factors of water oxygen atoms range from 27 to 60 A 2 . Hydration The interaction with water and dissolved ions plays an important role in the structure of biological raacromolecules (42). Mechanisms for the stabilization of B-DNA by the minor groove "spine of hydration" (43) and of A-DNA by "economics of hydration" (44) have been proposed. The influence of cations which have to be present to neutralize the phosphate charges of DNA, however, must not be overlooked. At the present resolution the identification of metal ions in the solvent sphere of d(GCCCGGGC) proved 9544 Nucleic Acids Research impossible. The discussion of hydration of the octamer will therefore be restricted to a few general comments. A detailed analysis of the solvent structure will have to await the analysis of higher-resolution X-ray diffraction data which have recently been collected at the DESY synchrotron source in Hamburg. Of the 68 water molecules identified per double helix all but 8 are in hydrogen-bonding distance (less than 3.5 A) to polar DNA atoms. The present crystal structure analysis has thus revealed little more than the primary hydration shell of the octamer. Since 10 water molecules can make hydrogen bonds to symmetryrelated helices in the crystal lattice and thus contribute twice to the hydration of one helix, the inner hydration shell contains at least 70 water molecules. The wide and deep major groove of d(GCCCGGGC) and the phosphate backbone are the primary hydration sites (Figure 5 ) . The phosphate groups are individually hydrated? no water molecules are found that bridge between adjacent phosphate groups by hydrogen-bonding as might have been predicted (44) although, with the mentioned exception of phosphate groups 5 and 6, all intra-strand phosphate-phosphate distances are in the normal A-DNA range. Binding of water molecules in the minor groove of the helix is partially hindered by lattice contacts. Therefore this crystal form is not a good model to study minorgroove hydration in A-DNA. Crystal packing The formation of the d(GCCCGGGC) double helix by a crystallographic dyad axis will not be considered here. Contacts in the crystal lattice between octamer helices involve van der Waals interactions and hydrogen bonds. Little can be learned about charge-charge interactions since dissolved cations could not be located. Inter-helix contacts include four possible hydrogen bonds: N2 of residue G5 to 04" of Gl (3.0 A ) , N2 of G6 to N3 of Gl (3.2 A) and the analogous contacts with the equivalent strand. Using a cutoff distance of 4.0 A for van der Waals interactions, there are 122 non-bonded contacts between octamer helices, 24 of which involve atom pairs closer than 3.5 A. In addition, there are 10 water molecules which might bridge between helices by hydrogen bonding. 9545 Nucleic Acids Research Figure 6. Packing of the terminal three base pairs of two symmetry-related molecules (narrow lines) against the flat minor groove of the helix in d(GCCCGGGC) crystals. Most intermolecular contacts arise from a packing of the terminal base pairs of two symmetry-related helices into the flat minor groove of the d(GCCCGGGC) double helix (Figure 6 ) . There is a very intimate inter-helical fit which undoubtedly greatly stabilizes the crystal lattice. A similar packing scheme has been observed in most A-DNA crystal structures analysed so far (12-25). Relevance to DNA solution structure and biological significance It is possible to determine the global conformation of oligonucleotides in solution by NMR methods (45,46). By this method and by Raman spectroscopy a number of oligonucleotides that form A-DNA in the crystal were shown to adopt the B conformation in solution (47-49) thus questioning the relevance of crystalline ADNA to the structure of DNA in aqueous solution. It should be kept in mind, however, that by NMR spectroscopy the overall conformation of a chain-like molecule like DNA has to be deduced from short-range inter-proton distances whose accurate determination is still a matter of debate (50) while other spectroscopic methods may not identify an oligonucleotide as d(GCCCGGGC), showing almost B-DNA like base pair tilt and rise, as A-DNA. Based on its sequence the octamer may be expected to display structural features reminiscent of the homopolyraer poly(dG). poly(dC). Fibers of this polymer have been reported to retain an 9546 Nucleic Acids Research A-DNA like conformation even at high relative humidity (51). Solution X-ray scattering experiments have yielded a radius of gyration closer to double-helical RNA than to DNA (52). The helical repeat of poly(dG).poly(dC) in solution is 10.7 base pairs per turn (53) and thus nearly identical with the repeat of mixed-sequence DNA (53,54) but markedly different from the 10 base pairs per turn of fiber diffraction B-DNA (7-9). These facts taken together suggest that in solution conformational differences between helix types are more subtle than in our models and intermediate forms between A and B-DNA exist of which poly(dG).poly(dc) may be one. It thus seems natural that d(GCCCGGGC) forms an A-DNA helix with certain structural characteristics such as major groove width, base pair tilt and rise resembling B-DNA. The biological significance of A-DNA has been discussed recently (20,39). With this respect the possibility of A-DNA to open its major groove as wide as in d(GCCCGGGC) may be important, e.g. for protein-DNA interaction and information readout. The characteristic cross-strand guanine-guanine stacking at the central C-G step of the octamer has been shown to be reflected in the conformation of the sugar-phosphate backbone. An information readout from DNA thus must not necessarily involve direct contact with the bases in the helical grooves but can be achieved by sensing backbone conformation in some cases. This may be illustrated by an admittedly speculative example. The enzyme DNase I has been shown to cleave phosphodiester bonds with rates depending on DNA sequence and local structure (55-58). While poly(dG). poly(dc) like sequences are generally cut less frequently than mixed-sequence DNA (58) cleavage rates at all three CCC V G sites in a 160 base pair DNA fragment were significantly above those of neighbouring dlnucleotides (57). In'd(GCCCGGGC) this cleavage site shows a unique all-trans backbone conformation and a phosphate-phosphate separation increased by nearly 1 A. If oligo(dG). oligo(dC) stretches in a mixed-sequence environment indeed adopt an A-DNA like conformation this may explain the generally reduced cleavage rates within these sequences. The enhanced cutting frequency at the C-G step may then be attributed to the higher exposure to nucleophilic attack of the guanine phosphate group in the extended backbone segment. 9547 Nucleic Acids Research ACKNOWLEDGEMENTS The authors thank Drs. E. Westhof, J.L. Sussman and R.E. Dickerson for computer programs and W. Saenger for reading the manuscript and helpful discussions. U.H. and H.L. acknowledge financial support from the Deutsche Forschungsgemeinschaft and the StSndige Koramission ftlr Forschung und wisaenschaftlichen Nachwuchs der Freien Universitat Berlin and M. Steifa for invaluable technical assistance. A stipend from the Studienstiftung des Deutschen Volkes to H.L. and help in DNA synthesis by W. Heikens and M. Becker are also gratefully acknowledged. We wish express our gratitude to a referee for valuable suggestions. *To whom correspondence should be addressed REFERENCES 1. Dickerson,R.E., Drew,H.R., Conner,B.N., Wing.R.M., Fratini, A.V. and Kopka,M.L. (1982) Science 216, 475-485. 2. Shakked.Z. and Rabinovich.D. (1986) Prog. Biophys. Mol. Biol. 47, 159-195. 3. Patel.D.J., Pardi.A. and Itakura,K. (1982) Science 216, 581-590. 4. Wang,A.H.-J., Quigley.G.J., Kolpak.F.J., Crawford,J.L., van Boom,J.H., van der Marel.G. and Rich,A. (1979) Nature 282, 680-686. 5. Drew,H.R., Takano,T., Tanaka.S., Itakura,K. and Dickerson, R.E. (1980) Nature 286, 567-573. 6. Rich,A., Nordheim,A. and Wang,A.H.-J. (1984) Ann. Rev. Biochem. 53, 791-846. 7. Arnott,S- and Hukins,D.W.L. (1972) Biochem. Biophys. Res. Commun. 47, 1504-1509. 8. Arnott.S. (1976) In Allfrey.V.G., Bautz,E.K.F., McCarthy, B.J., Schimke.R.T. and Tissieres,A. (eds) Organisation and Expression of Chromosomes, Dahlem Conference, Berlin, pp. 209-222. 9. Arnott.S., Chandrasekaran,R., Hall,I.H., Puigjaner.L.C., Walker,J.K. and Wang.M. (1982) Cold Spring Harbor Symp. Quant. Biol. 47, 53-65. 10. Calladine,C.R. (1982) J. Mol. Biol. 161, 343-352. 11. Dickerson,R.E. (1983) J. Mol. Biol. 166, 419-441. 12. Shakked,Z., Rabinovich.D., Cruse,W.B.T., Egert.E., Kennard, 0., Sala,G., Salisbury,S.A. and Viswamitra.M.A. (1981) Proc. R. Soc. London B213, 479-487. 13. Shakked,Z., Rabinovich.D., Kennard,O., Cruse,W.B.T., Salisbury,A. and Viswamitra,M.A. (1983) J. Mol. Biol. 166, 183-201. 14. Kennard.O., Cruse,W.B.T., Nachman,J., Prange.T., Shakked.Z. and Rabinovich,D. (1986) J. Bioraol. Struct. Dyn. 3, 623-647. 15. Conner,B.N., Takano,T., Tanaka,S., Itakura.K. and Dickerson, R.E. (1982) Nature 295, 294-299. 9548 Nucleic Acids Research 16. Conner,B.N., Y o o n , C , Dickerson,J.L. and Dickerson,R.E. (1984) J. Mol. Biol. 174, 663-695. 17. Wang,A.H.-J., Fujii.S., van Boom,J.H. and Rich,A. (1982) Proc. Natl. Acad. Sci. USA 79, 3968-3972. 18. Haran,T.E., Shakked,Z., Wang,A.H.-J. and Rich,A. (1987) J. Biomol. Struct. Dyn., in press 19. McCall.M., Brown,T. and Kennard.O. (1985) J. Mol. Biol. 183, 385-396. 20. McCall.M., Brown,T., Hunter,W. and Kennard,O. (1986) Nature 322, 661-664. 21. Frederick,C.A., Teng.M., Quigley.G.J., Wang,A.H.-J., Rich,A. van der Marel.G., van Boom,J. and Shakked,Z. (1987) Abstract 040, Fifth Conversation in Biomolecular Stereodynamics, State University of New York, Albany 21a. Jain,S., Zon,G. and Sundaralingam,M. (1987) J. Mol. Biol. 197, 141-145. 22. Wang,A.H.-J., Fujii.S., van Boora.J.H., van der Marel.G.A., van Boeckel,S.A.A. and Rich,A. (1982) Nature 299, 601-604. 23. Frederick,C.A., Saal.D., van der Marel.G.A., van Boom,J.H., Wang,A.H.-J. and Rich,A. (1987) Biopolymers 26, 145-160. 24. Brown,T., Kennard.O., Kneale.G. and Rabinovich.D. (1985) Nature 315, 604-606. 25. Kneale,G., Brown,T., Kennard,0. and Rabinovich,D. (1985) J. Mol. Biol. 186, 805-814. 26. Ohtsuka,E., Ikehara,M. and Sbll.D. (1982) Nucleic Acids Res. 10, 6553-6570. 27. Dattagupta,J.K., Fujiwara.T., Grishin.E.V., Lindner,K., Manor,P.C, Pieniazek,N.J., Saenger.W. and Suck.D. (1975) J. Mol. Biol. 97, 267-271. 28. North.A.C.T., Phillips,D.C. and Mathews.F.S. (1968) Acta Cryst. A24, 351-359. 29. Sussman.J.L., Holbrook,S.R., Church,G.M. and Kira,S.-H. (1977) Acta Cryst. A33, 800-804. 30. Sussman.J.L. (1985) Methods Enzymol. 115, 271-303. 31. Westhof,E., Dumas,P. and Moras,D. (1985) J. Mol. Biol. 184, 119-145. 32. Hendrickson,W.A. and Konnert,J.H. (1980) In Diamond,R., Ramaseshan,S. and Venkatesan.K. (eds) Computing in Crystallography, The Indian Academy of Sciences, Bangalore, pp. 13.01-13.23 33. Hendrickson.W.A. (1985) Methods Enzyraol. 115, 252-270. 34. Jones,T.A. (1978) J. Appl. Cryst. 11, 268-272. 35. Fratini.A.V., Kopka,M.L., Drew,H.R. and Dickerson,R.E. (1982) J. Biol. Chem. 257, 14686-14707. 36. McPherson,A. and Jurnak.F., eds. (1984) Structural Biology, Wylie, New York, pp. 471-494. 37. Kopka,M.L., Pjura,P., Y o o n . C , Goodsell.D. and Dickerson, R.E. (1985) In Clementi,E., Corongiu,G., Sarma,M.H. and Sarma.R.H. (eds) Structure and Motion: Membranes, Nucleic Acids and Proteins, Adenine Press, New York, pp. 461-483. 38. Saenger.W. (1984) Principles of Nucleic Acid Structure, Springer Verlag, New York. 39. Rhodes,G. and Klug.A. (1986) Cell 46, 123-132. 40. Rabinovich.D., Haran.T., Eisenstein.M and Shakked.Z. (1987) J.Mol.Biol., submitted 40a. Calladine,C.R. and Drew.H.R. (1984) J Mol. Biol. 178, 773-782. 9549 Nucleic Acids Research 41. 42. 43. 44. 45. 46. 47. 48. 49. 50. 51. 52. 53. 54. 55. 56. 57. 58. 9550 Altona.C. and Sundaralingam,M. (1972) J. Am. Chem. Soc. 94, 8205-8212. Saenger.W. (1987) Ann. Rev. Biophys. Biophys. Chem. 16, 93-114. Drew,H.R. and Dickerson.R.E. (1981) J. Mol. Biol. 151, 535-556. Saenger.W., Hunter,W.N. and Kennard.O. (1986) Nature 324, 385-388. Haasnoot.C.A.G., Westerink.H.P., van der Marel.G.A. and van Boom,J.H. (1984) J. Bioraol. Struct. Dyn. 2, 345-360. Nilsson.L., Clore.G.M., Gronenborn,A.M., Brlinger.A.T. and Karplus,M. (1986) J. Mol. Biol. 188, 455-475. Reid.D.G., Salisbury,S.A., Bellard.S., Shakked,Z. and Williams,D.H. (1983) Biochemistry 22, 2019-2025. Rinke'l.L.J. , Sanderson,M.A. , van der Marel.G.A., van Boom, J.H. and Altona.C. (1986) Eur. J. Biochem. 159, 85-9 3. Benevides,M.J., Wang,A.H.-J., Rich,A., Kyogoku.Y., van der Marel.G.A., van Boom,J.H. & Thomas,G.J. (1986) Biochemistry 25, 41-50. Lefevre,J.-F., Lane,A.N. and Jardetzky.O. (1987) Biochemistry 26, 5076-5090. Arnott,S. and Seising,E. (1974) J. Mol. Biol. 88, 551-552. Bram,S. (1971) Nature New Biology 233, 161-164. Peck.L.J. and Wang.J.C. (1981) Nature 292, 375-378. Rhodes,D. and Klug.A. (1980) Nature 286, 573-578. Lutter.L.C. (1978) J. Mol. Biol. 124, 391-420. Lomonossoff,G.P., Butler,P.J.G. and Klug.A. (1981) J. Mol. Biol. 149, 745-760. Drew.H.R. and Travers.A.A. (1984) Cell 37, 491-502. Drew.H.R. and Travers,A.A. (1985) Nucleic Acids Res. 13, 4445-4467.
© Copyright 2026 Paperzz