Vol. 12 no. 1 1996 Pages 25-30 NAMOT2—a redesigned nucleic acid modeling tool: construction of non-canonical DNA structures Eugene S.Carter, II and Chang-Shung Tung Abstract Using a new set of reduced coordinates developed for describing regular and unusual nucleic acid structures, we have revised our nucleic acid modeling tool NAMOT2. NAMOT2 is general in terms of modeling different nucleic acid structures. A set of modifiable libraries allows users to customize their modeling environment. With this set of libraries, NAMOT2 can be used to model non-canonical structures such as parallel-stranded, triple-stranded and quadruple-stranded nucleic acid molecules. For modeling irregular structures (junctions, hairpin loops, etc.), we introduce a structural recipe approach. The complete procedure using NAMOT2 to construct the structure of a specific molecule is treated as the recipe for that structural motif. The existing recipes can be modified to generate new recipes for different structural motifs. Several examples of nucleic acids with non-canonical structures were modeled using NAMOT2. These examples include a DNA-drug complex, a DNA cube, a six-arm junction and a curved DNA molecule. Introduction Since the release of NAMOT (a nucleic acid modeling tool) about a year ago, we have developed a new set of reduced parameters (Tung et al., 1994) for describing base structures of nucleic acid molecules. One of the key features for this new set is that it satisfies the inversion rule. That is the parameters calculated from the two strands of the molecule are identical except sign reversals for tilt (T) and slide (Dx). Compared to the previous parameter set (Soumpasis and Tung, 1988), the new set is more versatile in terms of describing non-canonical structures such as triple-stranded, quadruple-stranded and parallel-stranded nucleic acid molecules. Using this new parameter set, we have developed NAM0T2, a redesigned version of our nucleic acid modeling tool. NAM0T2 has all the existing features of NAMOT plus new features that make the software suitable for designing and modeling both regular and unusual nucleic acid structures. A WWW (world wide web) site has been Theoretical Biology and Biophysics (T-10), Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA ) Oxford University Press established that hosts all information pertaining to NAMOT2. To demonstrate the versatility of this program, examples of modeling non-canonical nucleic acid structures including a DNA-drug complex, a DNA cube, a six-arm junction and a curved DNA molecule will be presented. The coordinates of all these modeled structures are available through our NAMOT WWW site. Methods and results NAM0T2 features To accommodate the added functions in the program, we have redesigned the user interface for NAMOT2. Figure 1 shows the layout of the NAMOT2 interface. The NAMOT2 window consists of five basic areas. The top bar contains a series of pull-down menus for executing a set of frequently executed tasks such as open/close files, alter images in the display (space filling or stick model), on-line help, etc. The main canvas is designed to display the molecules. According to their application, the size of this canvas can be changed by the users with a simple command. The command window accepts user-input commands. The results of user actions are displayed in the data window. The rightmost column has a series of controls for geometric manipulation of the molecules being modeled. NAMOT2 utilizes both Cartesian coordinates and reduced coordinates. For intramolecular structure manipulation, the set of reduced coordinates was used. The specifier for the reduced representation is H:U:B (helix: unit:base). The Cartesian coordinates are used for intermolecular structure manipulation including translations and rotations of different molecules. The specifier for the Cartesian representation is M:C:G:A (moleculexhain: group:atom). Expansibility and portability are two major goals in developing NAMOT2. To make the software easily portable, NAM0T2 has its own sphere (Porter, 1978) and line rendering (Bresenham, 1965) routines as seen in other molecular graphic softwares such as RASMOL (Sayle and Bissell, 1992). NAMOT2 features dynamical memory allocations and link list for both the molecular (M:C:G:A and H:U:B) and library data structures. This feature allows NAMOT2 to avoid setting an arbitrary 25 E.S.Carter, II and C.-S.Tung namotf file Image Pi mt ing Advanced Kelp | Hoi House Node Normal ..j Phosphate controls ParanSM . Pa i ..J j Start pj" NAMOT2 Eod ^1 I Chmgcr" act background white set f0105round black Base controls Parmeter Sx Start HT 1:2 1-2 _J j Viwj j Change f^" Fig. I. The layout of the NAMOT2 interface. maximum molecular size for modeling. Users are only limited by the capabilities of their computers. NAMOT2 has been compiled successfully on AIX, IRIX, LINUX, OSF1 and SunOS (4.1.3 and 5.4). Both MOTIF and XView toolkit versions exist, allowing NAM0T2 to be compiled on almost any Unix platform. NAM0T2 can be compiled with either ANSI (ANSI, 1989) or traditional (Kernighan and Ritchie, 1978) C compilers. To increase the capabilities for constructing and manipulating structures, NAMOT2 features a usermodifiable library for base, unit, helix, sugar and phosphates. Currently, this library features 29 DNA base pairs deduced from non-empirical calculations (Hobza and Sandorfy, 1987) plus one triplex (T-A-T; Radhakrishnan and Patel, 1994) and one quadruplex (G-G-G-G; Smith and Feigon, 1992). This library enables users to build their customized environment in modeling both regular and 26 unusual structures. Besides being more versatile in modeling nucleic acid structures, NAMOT2 also features added graphic capabilities. Using a specifier language, users can selectively display, color and/or space-fill the molecule. The exported graphic image can be in PostScript™, PPM or TIFF™ formats. Using NAM0T2 as a tool, complete procedures (structural recipes) for constructing different structural motifs for nucleic acids can be developed. One of the advantages for using structural recipes is that this method uses less information to store different structures. Motifs can also be assembled together to form new structures. Due to the fact that all parameters used by NAM0T2 to construct nucleic acid structures have intuitive physical meanings, it is likely that one can easily develop the recipe for a new structure by modifying a recipe in the existing pool. All the recipes for the non-canonical structures of NAMOT2—a redesigned nucleic acid modeling tool nucleic acid molecules being modeled in this paper are available through the NAMOT WWW site. DNA-ditercalinium complex In our earlier work (Tung and Carter, 1994), we demonstrated the capability of NAMOT to generate a cavity inside a DNA duplex for drug intercalation. As NAM0T2 is able to dock small ligands against the nucleic acid molecule, we modeled the structure of a D N A ditercalinium complex. Ditercalinium is an anticancer drug that binds to DNA by dis-intercalation. Instead of inhibition of DNA replication or transcription processes, ditercalinium induces malfunction of DNA repair systems (Lambert et al., 1990) that in turn causes cell death. The structure of the DNA-ditercalinium complex was solved to 1.7 A resolution using X-ray crystallography (Gao et al., 1991; Williams and Gao, 1992). We used the crystal structure of the drug in the complex (DDD030 in NDB; Berman et al., 1992) as the structure for ditercalinium in this study. DNA molecule for this study is an eightmer, [d(CGCGCGCG)] 2 , in a canonical B-conformation (Arnott and Hukins, 1972). Two intercalation sites were introduced to the oligomer at C 3 -G 4 Fig. 2. The modeled structure of the DNA-ditercalinium complex superimposed with the structure solved using X-ray crystallography. The two complexes were aligned using only the atoms in ditercalinium. The modeled structure is drawn in thick lines while the crystal structure is drawn in thin lines. The two DNA structures have a RMS difference of 1.0 A. and C 5 -G 6 . The intercalation sites were generated by simply extended the rise (Dz) of the base-pair step to 6.8 A and reduced the twist (tt) to 20°. In the docking procedure, both the drug and the DNA molecules were treated as rigid bodies. The ditercalinium was docked to DNA from the major groove side of the molecule. The objective of the procedure is to dock the drug molecule into the intercalation sites without introducing steric clashes. For the purpose of monitoring the surface contacts, during the docking process, both the DNA and the drug molecules were displayed using space-filling models. Surface matching between the two molecules were achieved by simply viewing the structure of the complex while rotating and translating the drug molecule with respect to the DNA molecule. The modeled structure of the complex (as shown in Figure 2), when compared to the crystal structure (using only the middle four base pairs of the DNA and the ditercalinium molecule), has a RMS difference of 0.7 A. DNA cube Because of the nature of the molecule that the complementary strands form a double helical structure, plus the fact that the chemistry for synthesis is straightforward and well developed, DNA is an ideal material for nanotechnology. The proposed applications of DNA as biomaterial (Seeman, 1991) include: the construction of macromolecular zeolites, the assembly of new catalysts, delivery of drugs and the assembly of molecular electronic devices. One such effort towards the goal of developing DNA into useful biomaterials is the design and synthesis of a DNA molecule with the connectivity of a cube (Chen and Seeman, 1991). Each side of this cube is a 20 base-pair duplex. The connectivity of the molecule was confirmed using both systematic digestion and gel electrophoresis analysis. Given the lack of experimental information available for the conformation, we choose to model this DNA molecule as a square cube to take advantages of the symmetry relationship existing in both the edges and the vertices of the cube. Each of the eight vertices is modeled as an orthogonal three-way junction, while each of the 12 sides is modeled as a regular double helix in the Bconformation. The template structure of an orthogonal three-way junction is constructed. For the purpose of taking advantage of the symmetry relationship, the junction is built with the vertex lying on the origin and the helical axes of the three arms coinciding with the x, y and z axes. To satisfy these constraints, each of the three base pairs forming the junction is allowed to move with only one translational and one rotational degree of freedom as shown in Figure 3(b). Our goal is to find the conformation 27 E.S.Carter, II and C.-S.Tung Recipe for constructing an orthogonal 3-way junction set WCAdfirsI on generate d d b c write pdh eg close generate (I (I I) I write pdh la close generate d d l> a write pdb at close load pdb rigid eg load pdh rigid ta load pdh rigid at rolorig I 2 ISO. rolorig I I 91). rolorig I 3 -"Ml. rotorig I 1 -32.2 trans I H.S4 I). 0. rotorig 2 2 I HO. rotorig 2 1 90. rolorig 2 2 67.8 trans 2 0.11.03 0. rotorig 3 3 -49.2 Irans300 12.75 write pdb 3-way-jc •.V, ~A' V; Fig. 3. The structure of the vertices in the cube. The conformation of the three base-pairs that form the three-arm junction of a vertice is shown in (a). The phosphate linkages between the pairs of sugar are shown as dashed lines. The arrangement of the eight vertices such that the 12 sides of cube are approximately equal is shown in (b). of the three base pairs such that the distances for the three phosphate linkages between the bases are optimal (3.5 A; Tung, 1993). With only six degrees of freedom, a short run of Metropolis Monte Carlo simulation (Metropolis el al., 1953) quickly finds the conformation of the three base pairs that optimize the distances for the phosphate linkages. The resultant translational and rotational parameters for the three base pairs are listed in Table I. The complete procedure for using NAMOT2 to construct an orthogonal three-way junction is treated as a structural recipe shown in Figure 3(a). This recipe can be used to construct all the vertices in the cube. One important feature of this three-way junction is that the three sides of the junction are not the same. Both the amount of translations and rotations for the three basepairs that form the junction are not identical. If a, b and c Fig. 4. The modeled structure of a orthogonal DNA cube in atomic detail. are the distances from the origin to the three base-pairs on the x, y and z axes respectively, we have c > b > a. Since a + c is approximately equal to b + b, we were able to arrange the eight vertices such that the 12 sides of the cube are approximately equal, as shown in Figure 3(c). After the structures of all the eight vertices are assembled, the structure of the 12 duplexes that connect the eight vertices can be constructed. Due to the differences in the rotation of the three base-pairs in the junction, the total twist angles for the 18 base-pairs that connect the base-pairs in the vertices to form edges of the cube are different. A small adjustment has to be introduced to the twist angles of these helices such that the connections between the helices and junctions are possible. The four duplexes that connect the b A- b arms of the junctions have a twist angle of 35.5 A, while the remaining eight duplexes have a twist angle of 37.5 A. The final structure of the modeled DNA cube is shown in Figure 4. Table I. Parameters for making an orthogonal three-way junction Base-pair a Base-pair b Base-pair c Six-arm junction Rr 180 R\ 90 R.-90 /?,. 180 Rx 90 Rr 67.8 R.-49.2 Rx -32.2 T\3.84, 0, 0 TO, 11.03,0 TO ,0, 12.75 The six-arm junction is yet another structural motif that can be used as a biomaterial in nanotechnology. This structural motif can serve as the connecting part in a cubic lattice. It was shown that the six-arm junction is not as stable as junctions containing three or four arms (Wang el al., 1991). Longer arms (16 bp) were required to make a stable six-arm junction. A six-stranded molecule (JYL6G, as shown in Figure 5) that forms a six-arm junction was constructed by Wang et al. (1991). Based on Ferguson Base-pair a, perpendicular to the .Y-axis; base-pair b, perpendicular to the j'-axis; base-pair c, perpendicular to the z-axis; Rx, rotation with respect to the .v-axis; Rv, rotation with respect to the v-axis: R:, rotation with respect to the z-axis; 7^ translation. All translations are in Angstroms and all rotations are in degrees. 28 NAMOT2—a redesigned nucleic acid modeling tool analysis, indicates that the backbone of this particular strand follows a continuous helical conformation at the junction. Without experimental data to indicate the exact arrangement of the six arms in the molecule, we chose to model the molecule as an orthogonal six-arm junction. The proposed structure of the six-arm junction (as shown in Figure 6a) has the four arms (I, II, III and IV) lying on a plane, while arms V and VI run perpendicular to the plane. Arms I, II, III and IV form an open, four-way junction with a structure similar to the one proposed by von Kitzing el al. (1990). Arms V and VI run through the cavity at the center of the four-way junction. The structure of the molecule in the vicinity of the junction was determined such that the stereochemistry is maintained (i.e. all bonds are intact and no steric clashes observed). The modeled structure of the six arm junction is shown in Figure 6b. Curved DNA Fig. 5. The base sequences of the six strands of DNA that form a six-arm junction as proposed by Wang el al. (1991). analysis (Ferguson, 1964), it was shown that the six arms of this molecule (JYL6G) form an extended structure that share a common central site. The lack of protection of the strand V at thejunction, as shown in the hydroxyl radical Whether intrinsic or induced, some DNA molecules do exist in a curved conformation (Klug el al., 1980; Wu and Crothers, 1984). Here, we show the modeling of a superhelical DNA molecule that mimics the nucleosome structure where the DNA wraps around a histone core in a left-handed superhelical conformation. The basic approach of modeling a superhelical molecule involves the following steps, (i) Generating a helical curve using a specified pitch and diameter. In the case of nucleosomal III II V a u Fig. 6. The structure of the modeled six-arm junction. The arrangement of the six strands of DNA in three-dimensional to form thejunction is shown in (a). The atomic modeled structure of the six-arm junction is shown in (b). 29 E.S.Carter, II and C.-S.Tung References Fig. 7. A superhelical DNA that mimics a 146 bp nucleosomal DNA wrapped around a histone core. DNA, we used a 28 A pitch and a 106 A diameter as observed in crystal data, (ii) Digitizing the curve into 3.4 A segments to represent the base-pair steps in the molecule, (iii) Mapping the segments into a set of base-pair parameters for generating the structure of all bases in the molecule, (iv) Completing the molecular structure by constructing the sugar-phosphate linkages. The details of the procedure for generating a curved DNA molecule that follows any smooth spatial curves (including the superhelices) will be published elsewhere. The complete modeled structure of a 146bp nucleosomal DNA is shown in Figure 7. Discussion Based on a new set of reduced coordinates, we have developed our nucleic acid modeling tool (NAMOT) into a new redesigned software: NAMOT2. NAMOT2 is a general tool capable of modeling both regular and noncanonical nucleic acid molecules. The four examples shown above have demonstrated the versatility of NAMOT2. NAMOT2 is written in C/Xlib. With the existence of both Motif™ and Xview™ interfaces, NAMOT2 is portable at most Unix workstations. A WWW site (http://namot.lanl.gov) has been established to host all the information (e.g. demos, update information, an official copy of the user manual, etc.) pertaining to NAM0T2. NAM0T2 resides on an anonymous ftp site, namol.lanl.gov. Acknowledgements We would like to thank Dr Angel Garcia for providing us with subroutines to generate sugar structures and to align molecules based on minimum RMS difference. Dr Dikeos M.Soumpasis of the Max Planck Institute and Dr Gerhard Hummer are acknowledged for the collaboration in developing the set of reduced coordinates used in NAMOT2. Ms Marilyn L.Kwan is acknowledged for her help in constructing the mismatch library. Mr Reinhard Klement of the Max Planck Institute and Dr Craig T.Martin of the University of Massachusetts provided us with useful suggestions and comments pertaining to NAM0T2 usages. This work is supported by the US Department of Energy. 30 ANSI (1989) Programing Language C, X3.159-1989. Arnott.S. and Hukins,W.L. (1972) Biochem. Biophvs. Res. Comnnm.. 47, 1504-1509. Berman,H.M., Olson,W.K., Beveridge,D.L, Westbrook.J., Gelbin.A., Demeny,T., Hsieh.S.-H., Srinivasan.A.R. and Schneider.B. (1992) Biophvs.J., 63, 751-759. Bresenham,J.E. (1965) IBM Syst. J., 4, 25-30. Chen,J. and Seeman,N.C. (1991) Nature, 350, 631-633. Ferguson,K..A. (1964) Metabolism, 13, 985-1002. Gao.Q., Williams,L.D., Egli,M., Rabinovich.D., Chen,S.-L., Quigley, G.J. and Rich, A. (1991) Proc. Nail. Acad. Sci. USA, 88, 2422-2426. Kernighan.B.W. and Ritchie.D.M. (1978) The C Programming Language, Prentice Hall, Englewood Cliffs, NJ. Klug,A., Rhodes.D., Smith,J., Finch.J.T. and Thomas.J.O. (1980) Nature, 287, 509-516. Lambert,B., Segal-Bendirdjian,E., Esnault.C, Le PecqJ.B., Roques, B.P., Jones.B. and Yeung,A.T. (1990) Anti-Cancer Drug Des., 5, 4 3 53. Metropolis.N., Rosenbluth.A.W., Rosenbluth,M.N., Teller.A.H. and Teller.E. (1953) J. Chem. Phvs., 21, 1087-1092. PorterJ.K. (1978) Comput. Graphics, 12, 282-285. Radhakrishnan,I. and Patel,D. (1994) J. Mol. Biol., 241, 600-619. Sayle.R. and Bissell,A. (1992) In Proc. 10th Eurographics UK 1992 Conference, University of Edinburgh, Scotland, April 1992. Seeman,N.C. (1991) NAMOTechnology, 2, 149-158. Smith.F.W. and Feigon,J. (1992) Nature, 356, 164-168. Soumpasis,D.M. and Tung,C.-S. (1988) J. Biomol. Struct. Dvn., 6, 397420. Tung,C.-S. and Carter.E.S., II (1994) Comput. Applic. Biosci.. 10, 427433. Tung,C.-S., Soumpasis,D.M. and Hummer,G. (1994) J. Biomol. Struct. Dyn., 11, 1327-1344. von Kitzing,E., Lilley.D.M.J. and Diekmann.S. (1990) Nucleic Acids Res., 18,2671-2683. Wang,Y., Mueller,J.E., Kemper,B. and Seeman.N.C. (1991) Biochemistry, 30, 5667-5674. Williams.L.D. and Gao,Q. (1992) Biochemistry, 31, 4315-4324. Wu,H.-M. and Crothers.D.M. (1984) Nature, 308, 509-513. Received on June 27. 1995; accepted on August 31. 1995
© Copyright 2026 Paperzz