Computer graphics program to reveal the dependence of the gross

Volume 14 Number 1 1986
Nucleic Acids Research
Computer graphics program to reveal the dependence of the gross three-dimensional structure of
the B-DNA double helix on primary structure
Chang-Shung Tung
Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, and
Stephen C.Harvey
Department of Biochemistry, University of Alabama, Birmingham, AL 35294, USA
Received 6 June 1985
ABSTRACT
Programs are presented to plot the gross three-dimensional structure of
the DNA double helix with the base sequence as input information. The rules
that determine the overall structure of the double helix are those that predict the dependence of local helix parameters (specifically, helix twist
angle and relative basepair roll angle) on sequence. For this purpose, the
user can select either the Calladine-Dickerson parameters or the Tung-Harvey
parameters. These programs can be used as tools to investigate the variation
of DNA tertiary structure with sequence, which may play an important role in
the sequence-specific recognition of DNA by proteins.
INTRODUCTION
The sequence-dependent DNA helix structure is «n interesting topic which
has attracted much attention in recent years.
This sequence-dependent nature
of DNA tertiary structure may play an important role in DNA packaging and in
the sequence-specific recognition of DNA by other molecules (1-3).
In the
past few years, several models (2,3,4-8) have been developed for the prediction of helix parameters
[e.g., helix twist angles (t ), changes of roll
angle (t f ), propeller twist angles (t ), basepair separations (d), etc.] of
DNA double helices from primary sequences.
These predictions are sets of
numerical data pertaining to the three dimensional structure of DNA double
helix.
Even with all these helix parameters available through prediction
models, it is still very difficult to visualize the overall shape of the DNA
double helix in space.
To solve this problem, computer programs are given
here to plot pictures of DNA double helices when the primary structure* are
specified.
Possible applications are also suggested in this paper.
Source programs are available in either of two formats. Requests for
tapes in standard card image format should be directed to CST at Los Alamos,
while VAX VMS format versions are available from SCH in Birmingham. There
is no charge, and programs will be supplied upon receipt of a self-addressed
mailing label and a blank tape.
381
Nucleic Acids Research
Figure 1: A rectangular plate to represent a basepair. The rectagular plate
BCDE is 10 S long, 5 8 wide, with the helix rotational center indicated as
"+". A and F represent two points in the sugar phosphate backbone.
METHOD
The purpose of these programs is to display the overall shape of the DNA
double helix with a specific sequence.
Out of the helix parameters defined
by
(helix
Dickerson
et
a^.
(A,7), only
three
twist
angle,
change
of
roll
angle, and basepair separation) are used in this algorithm for the following
reasons:
First, propeller twist angle is a property within a basepair which
does
affect
not
basepairs.
the
Second,
relative
basepair
position
and
sliding will
orientation
of
two
consecutive
not alter the orientation of the
following bagepairs, and none of the existing prediction models can predict
basepair
sliding
small in B-DNA
well.
Third, because
basepair tilt angle is usually very
(9, 1 0 ) , it has been assumed to be zero.
To further simplify the pictorial presentation, each basepair is represented
by
a rectangular plate which lies on the mean plane of the basepair
(Fig. 1 ) . The plate designated BCDE, is 10 X long, 5 X wide, and connected to
two points
(A and F) that represent points
in the backbone.
The center of
helix rotation is indicated as "+".
Each plate corresponds to a local coordinate system where the origin coincides with the center of helix rotation, the x-axis is parallel to AF, and
the y-axis is parallel to CB.
deduced
The geometry of the n
basepair can be easily
in the local coordinate system corresponding to the (n-1)
by moving
the plate a distance d in the z direction (basepair separation is
equal to d ) , rotating through an angle t
with respect to the z-axis (helix
twist angle is equal to t ), and then rotating through an angle t
spect to the x-axis
2.
nate
with re-
(change of roll angle is equal to t ) , as shown in Fig.
Keeping track of the transformation matrices between each local coordisystem
and
the
global coordinate
system, the coordinates of all base-
pairs with respect to the global coordinate
382
basepair
system can be derived.
The re-
Nucleic Acids Research
-2
X
Figure 2: Geometry of two consecutive basepairs. Each basepair corresponds
to a local coordinate system. The geometry of the top basepair can be easily
derived in the local coordinate system of the bottom basepair by moving a
distance d in z direction, rotating through an angle t with respect to the
z-axis, and then rotating through an angle t
with r & p e c t to the x-axis.
suiting set of coordinates is then used for the plotting of the double helix.
Figure 3 shows the flow chart of the algorithm.
veloped.
Two programs were de-
PREPLT calculates the coordinates of all basepairs, while PLT plots
the results
from PREPLT.
Both programs were first developed in FORTRAN on a
CDC 7600 computer at the Los Alamos National Laboratory with DISPLA package
(Display
Integrated
Software
Systems
92121).
Software
Corporation,
4186
and
Plotting
Sorrento
Language
Valley
from
Blvd.,
San
Integrated
Diego,
CA
VAX versions of the programs were later developed at the University
of Alabama at Birmingham;
plotting
System
package
these are also in FORTRAN, and they use the PLOT10
(Tektronix,
Inc.,
P.
0.
Box
500,
Beaverton,
OR
97077).
SAMPLE PLOTS
The programs can apply the helix parameters predicted from either of the
two prediction models
(2,3,7) to make the plot.
tion model developed by Tung and Harvey
model, with
the predictions derived
on all-atom models for the basepairs.
are two.
We chose to use the predic-
(2,3) because
it is a more detailed
from confornational energy
calculations
The principal advantages of this model
First, it is detailed enough to distinguish between adenosine and
guanosine and between cytidine and thymidine; whereas the Calladine-Dickerson
model
only
distinguishes
between purines
and
pyimidines.
Second, the only
383
Nucleic Acids Research
START
SET BP(1) OK ORIGIN OF CC
LET LC(1) COINCIDE WITH CC
CALCULATE BP(2) IN LC(1)
SET LC(2). CALCUUTE TU(2)
CALCULATE BP(N) IN LC(N-1)
SET LC(NX CALCUUTE TM(N)
TRANSFORM BP(N) TO CC
Figure 3: Flowchart of the algorithm. BP(N) indicates coordinates of the
n
basepair. LC(N) represents local coordinate system corresponds to the n
basepair. GC is the global coordinate system, while TM(N) represents transformation matrix between IX(N) and GC.
Figure 4: Plot of d((G) I2 '(C) 12 ).
This 12 basepair DNA double helix is
straight with constant helix parameters for all basepairs except some small
deviations for the end basepairs because of the lack of propagational effect
from the neighboring steps.
384
Nucleic Acids Research
Figure 5: Plot of d(CGCGAATTCGCG). As expected, this helix is not
but with local variations from the ideal B-DNA structure.
adjustable
parameters
correspond
to
simple
physical
quantities
regular
that can be
compared with experimental quantities (3).
The first plot is that of a homopolymer (d(G)
U.
This piece of DNA
'd(C)..) as shown in Fig.
is straight with identical helix parameters within the
(b)
Figure 6:
Plot of the 51-basepair bending locus of trypanosome kinetoplast
DNA.
The helix is bent with the bending nearly confined to a single plane.
This piece of DNA was identified by Wu and Crothers (12) from gel electrophoresis raeasurements to be the bending locus of trypanosome kinetoplast minicircle DNA.
b) shows the view of the helix with 90 degree rotation from a ) .
Nucleic Acids Research
^•n**^
(b)
(a)
Figure 7 : Plot of d((A5T )..). This piece of DNA is 200 basepairs long.
When compared to the structure shown in Fig. 7, this helix bends even more
with a smaller radius of curvature. The bending of this helix is not planar
but forms a superhelical structure. For the purpose of clarity, basepairs
are not included in this plot. a) shows the side view, while b) shows the
top view of the DNA double helix.
helix (except some small deviations for the end basepairs due to the absence
of the propagational effect fron the neighboring steps).
The
next
d(CGCGAATTCGCG).
plot
(Fig.
5)
is
for
the
self-complementary
dodecamer
The interesting feature one notices first is that the helix
is not straight. The heLix structure is not regular but depends on the sequence, as indicated in the crystal structure (7,8).
Figure 6 is a plot of the 50 basepair bending locus of the trypanosome
kinetoplast DNA whose anomalous gel mobility (11) is believed to be due to
macroscopic bending (12,13).
bent.
One can see that this piece of DNA is indeed
The bending is nearly confined to a plane; i.e., bending in one direc-
tion is much more pronounced than in other directions.
The last plot (Fig. 7) is a self-complementary DNA helix with an alternating adenosine-thymidine sequence (d(A^T )
). This particular DNA is pre-
dicted to bend even more than the bending locus of trypanosome kinetoplaBt
DNA. The bending is not planar but forms a superhelical structure.
SUHHARY
All plots generated from PLT are stereoscopic views.
The plotted DNA
double helix can be seen in three-dimensional space through a pair of stereo
386
Nucleic Acids Research
glasses with each lens focused on one plot.
This representation is a very
nice visual aid to the prediction models.
These programs
can be used
to search for specific gross structural
features in known DNA sequences.
It can also be easily modified to look for
the
optimum
sequence
which comes closest to some specific predetermined
three-dimensional structure.
ACKNOWLEDGEMENTS
This work was supported by the U. S. Department of Energy and * grant to
S.C.H. from the National Science Foundation (PCM-8417001).
REFERENCES
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
Windom, J. (1984) Nature 309, 312-313.
Tung, C.S. (1984) Ph.D. dissertation, Univ. of Alabama, Birmingham.
Tung, C.S. and Harvey, S.C. (1984) Nucl. Acids Res. 12, 3343-3356.
Fratini, A.V., Kopka, M.L. , Drew, H.R. and Dickerson, R.E. (1982) J.
Biol. Chem. 257, 14686-14707.
Kabsch, W. , Sander, C. and Trifonov, E.N. (1982) Nucl. Acids Res. 10,
1097-1104.
Calladine, C.R. (1982) J. Mol. Biol. 161, 343-352.
Dickerson, R. E. (1983) J. Mol. Biol. 166, 411-419.
Dickerson, R.E. (1983) Scientific American 249(2), 94-111.
Mellema, J.R., van Kamper, P.N., Carlson, C.N., Bosshard, H.E. and
Altona, C. (1983) Nucl. Acids Res. 11, 2893-2905.
Wells, R.D., Goodman, T.C., Hillen, W. , Horn, G.T., Klein, R.D., Larson,
J.E., Miiller, U.R., Neuendorf, S.K., Panayotatos, N. and Stirdlvant,'
S.M. (1980) Proc. Nucl. Acid Res. Mol. Biol. 24, 167-267.
Marini, J.C., Levene, S.D., Crothers, D.M. and Englund, P.T. (1982)
Proc. Nat'l Acad. Scl. USA 79, 7664-7668.
Wu, H.-M. and Crothers, D.M. (1984) Nature 308, 509-513.
Hagerman, P.J. (1984) Proc. Nat'l. Acad. Sci. USA 81, 4632-4646.
387
Nucleic Acids Research