Volume 14 Number 1 1986 Nucleic Acids Research Computer graphics program to reveal the dependence of the gross three-dimensional structure of the B-DNA double helix on primary structure Chang-Shung Tung Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, and Stephen C.Harvey Department of Biochemistry, University of Alabama, Birmingham, AL 35294, USA Received 6 June 1985 ABSTRACT Programs are presented to plot the gross three-dimensional structure of the DNA double helix with the base sequence as input information. The rules that determine the overall structure of the double helix are those that predict the dependence of local helix parameters (specifically, helix twist angle and relative basepair roll angle) on sequence. For this purpose, the user can select either the Calladine-Dickerson parameters or the Tung-Harvey parameters. These programs can be used as tools to investigate the variation of DNA tertiary structure with sequence, which may play an important role in the sequence-specific recognition of DNA by proteins. INTRODUCTION The sequence-dependent DNA helix structure is «n interesting topic which has attracted much attention in recent years. This sequence-dependent nature of DNA tertiary structure may play an important role in DNA packaging and in the sequence-specific recognition of DNA by other molecules (1-3). In the past few years, several models (2,3,4-8) have been developed for the prediction of helix parameters [e.g., helix twist angles (t ), changes of roll angle (t f ), propeller twist angles (t ), basepair separations (d), etc.] of DNA double helices from primary sequences. These predictions are sets of numerical data pertaining to the three dimensional structure of DNA double helix. Even with all these helix parameters available through prediction models, it is still very difficult to visualize the overall shape of the DNA double helix in space. To solve this problem, computer programs are given here to plot pictures of DNA double helices when the primary structure* are specified. Possible applications are also suggested in this paper. Source programs are available in either of two formats. Requests for tapes in standard card image format should be directed to CST at Los Alamos, while VAX VMS format versions are available from SCH in Birmingham. There is no charge, and programs will be supplied upon receipt of a self-addressed mailing label and a blank tape. 381 Nucleic Acids Research Figure 1: A rectangular plate to represent a basepair. The rectagular plate BCDE is 10 S long, 5 8 wide, with the helix rotational center indicated as "+". A and F represent two points in the sugar phosphate backbone. METHOD The purpose of these programs is to display the overall shape of the DNA double helix with a specific sequence. Out of the helix parameters defined by (helix Dickerson et a^. (A,7), only three twist angle, change of roll angle, and basepair separation) are used in this algorithm for the following reasons: First, propeller twist angle is a property within a basepair which does affect not basepairs. the Second, relative basepair position and sliding will orientation of two consecutive not alter the orientation of the following bagepairs, and none of the existing prediction models can predict basepair sliding small in B-DNA well. Third, because basepair tilt angle is usually very (9, 1 0 ) , it has been assumed to be zero. To further simplify the pictorial presentation, each basepair is represented by a rectangular plate which lies on the mean plane of the basepair (Fig. 1 ) . The plate designated BCDE, is 10 X long, 5 X wide, and connected to two points (A and F) that represent points in the backbone. The center of helix rotation is indicated as "+". Each plate corresponds to a local coordinate system where the origin coincides with the center of helix rotation, the x-axis is parallel to AF, and the y-axis is parallel to CB. deduced The geometry of the n basepair can be easily in the local coordinate system corresponding to the (n-1) by moving the plate a distance d in the z direction (basepair separation is equal to d ) , rotating through an angle t with respect to the z-axis (helix twist angle is equal to t ), and then rotating through an angle t spect to the x-axis 2. nate with re- (change of roll angle is equal to t ) , as shown in Fig. Keeping track of the transformation matrices between each local coordisystem and the global coordinate system, the coordinates of all base- pairs with respect to the global coordinate 382 basepair system can be derived. The re- Nucleic Acids Research -2 X Figure 2: Geometry of two consecutive basepairs. Each basepair corresponds to a local coordinate system. The geometry of the top basepair can be easily derived in the local coordinate system of the bottom basepair by moving a distance d in z direction, rotating through an angle t with respect to the z-axis, and then rotating through an angle t with r & p e c t to the x-axis. suiting set of coordinates is then used for the plotting of the double helix. Figure 3 shows the flow chart of the algorithm. veloped. Two programs were de- PREPLT calculates the coordinates of all basepairs, while PLT plots the results from PREPLT. Both programs were first developed in FORTRAN on a CDC 7600 computer at the Los Alamos National Laboratory with DISPLA package (Display Integrated Software Systems 92121). Software Corporation, 4186 and Plotting Sorrento Language Valley from Blvd., San Integrated Diego, CA VAX versions of the programs were later developed at the University of Alabama at Birmingham; plotting System package these are also in FORTRAN, and they use the PLOT10 (Tektronix, Inc., P. 0. Box 500, Beaverton, OR 97077). SAMPLE PLOTS The programs can apply the helix parameters predicted from either of the two prediction models (2,3,7) to make the plot. tion model developed by Tung and Harvey model, with the predictions derived on all-atom models for the basepairs. are two. We chose to use the predic- (2,3) because it is a more detailed from confornational energy calculations The principal advantages of this model First, it is detailed enough to distinguish between adenosine and guanosine and between cytidine and thymidine; whereas the Calladine-Dickerson model only distinguishes between purines and pyimidines. Second, the only 383 Nucleic Acids Research START SET BP(1) OK ORIGIN OF CC LET LC(1) COINCIDE WITH CC CALCULATE BP(2) IN LC(1) SET LC(2). CALCUUTE TU(2) CALCULATE BP(N) IN LC(N-1) SET LC(NX CALCUUTE TM(N) TRANSFORM BP(N) TO CC Figure 3: Flowchart of the algorithm. BP(N) indicates coordinates of the n basepair. LC(N) represents local coordinate system corresponds to the n basepair. GC is the global coordinate system, while TM(N) represents transformation matrix between IX(N) and GC. Figure 4: Plot of d((G) I2 '(C) 12 ). This 12 basepair DNA double helix is straight with constant helix parameters for all basepairs except some small deviations for the end basepairs because of the lack of propagational effect from the neighboring steps. 384 Nucleic Acids Research Figure 5: Plot of d(CGCGAATTCGCG). As expected, this helix is not but with local variations from the ideal B-DNA structure. adjustable parameters correspond to simple physical quantities regular that can be compared with experimental quantities (3). The first plot is that of a homopolymer (d(G) U. This piece of DNA 'd(C)..) as shown in Fig. is straight with identical helix parameters within the (b) Figure 6: Plot of the 51-basepair bending locus of trypanosome kinetoplast DNA. The helix is bent with the bending nearly confined to a single plane. This piece of DNA was identified by Wu and Crothers (12) from gel electrophoresis raeasurements to be the bending locus of trypanosome kinetoplast minicircle DNA. b) shows the view of the helix with 90 degree rotation from a ) . Nucleic Acids Research ^•n**^ (b) (a) Figure 7 : Plot of d((A5T )..). This piece of DNA is 200 basepairs long. When compared to the structure shown in Fig. 7, this helix bends even more with a smaller radius of curvature. The bending of this helix is not planar but forms a superhelical structure. For the purpose of clarity, basepairs are not included in this plot. a) shows the side view, while b) shows the top view of the DNA double helix. helix (except some small deviations for the end basepairs due to the absence of the propagational effect fron the neighboring steps). The next d(CGCGAATTCGCG). plot (Fig. 5) is for the self-complementary dodecamer The interesting feature one notices first is that the helix is not straight. The heLix structure is not regular but depends on the sequence, as indicated in the crystal structure (7,8). Figure 6 is a plot of the 50 basepair bending locus of the trypanosome kinetoplast DNA whose anomalous gel mobility (11) is believed to be due to macroscopic bending (12,13). bent. One can see that this piece of DNA is indeed The bending is nearly confined to a plane; i.e., bending in one direc- tion is much more pronounced than in other directions. The last plot (Fig. 7) is a self-complementary DNA helix with an alternating adenosine-thymidine sequence (d(A^T ) ). This particular DNA is pre- dicted to bend even more than the bending locus of trypanosome kinetoplaBt DNA. The bending is not planar but forms a superhelical structure. SUHHARY All plots generated from PLT are stereoscopic views. The plotted DNA double helix can be seen in three-dimensional space through a pair of stereo 386 Nucleic Acids Research glasses with each lens focused on one plot. This representation is a very nice visual aid to the prediction models. These programs can be used to search for specific gross structural features in known DNA sequences. It can also be easily modified to look for the optimum sequence which comes closest to some specific predetermined three-dimensional structure. ACKNOWLEDGEMENTS This work was supported by the U. S. Department of Energy and * grant to S.C.H. from the National Science Foundation (PCM-8417001). REFERENCES 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. Windom, J. (1984) Nature 309, 312-313. Tung, C.S. (1984) Ph.D. dissertation, Univ. of Alabama, Birmingham. Tung, C.S. and Harvey, S.C. (1984) Nucl. Acids Res. 12, 3343-3356. Fratini, A.V., Kopka, M.L. , Drew, H.R. and Dickerson, R.E. (1982) J. Biol. Chem. 257, 14686-14707. Kabsch, W. , Sander, C. and Trifonov, E.N. (1982) Nucl. Acids Res. 10, 1097-1104. Calladine, C.R. (1982) J. Mol. Biol. 161, 343-352. Dickerson, R. E. (1983) J. Mol. Biol. 166, 411-419. Dickerson, R.E. (1983) Scientific American 249(2), 94-111. Mellema, J.R., van Kamper, P.N., Carlson, C.N., Bosshard, H.E. and Altona, C. (1983) Nucl. Acids Res. 11, 2893-2905. Wells, R.D., Goodman, T.C., Hillen, W. , Horn, G.T., Klein, R.D., Larson, J.E., Miiller, U.R., Neuendorf, S.K., Panayotatos, N. and Stirdlvant,' S.M. (1980) Proc. Nucl. Acid Res. Mol. Biol. 24, 167-267. Marini, J.C., Levene, S.D., Crothers, D.M. and Englund, P.T. (1982) Proc. Nat'l Acad. Scl. USA 79, 7664-7668. Wu, H.-M. and Crothers, D.M. (1984) Nature 308, 509-513. Hagerman, P.J. (1984) Proc. Nat'l. Acad. Sci. USA 81, 4632-4646. 387 Nucleic Acids Research
© Copyright 2026 Paperzz