volume 9 Number 31981 Nucleic A c i d s Research The nuclease specificity of the bacteriophage 0X174 A* protein S.A.Langeveld1'2, A.D.M.van Mansfeld1'3, A.van der Ende 1 , J.H.van de Pol 4 , G.A.van Arkel2 and P.J.Weisbeek2 ' Institute of Molecular Biology, 2Department of Molecular Cell Biology, •*Laboratory for Physiological Chemistry, and Academic Computer Centre Utrecht, State University of Utrecht, The Netherlands Received 24 November 1980 ABSTRACT The A protein of bacteriophage 0X174 is a single-stranded DNA specific nuclease. It can cleave 0X viral ss DNA in many different places. The position of these sites have been determined within the known 0X174 nucleotide sequence (1). From the sequences at these sites it is clear that the A protein recognizes and cleaves at sites that show only partial homology with the origin of RF DNA replication in the 0X DNA. Different parts of the origin sequence can be deduced that function as a signal for recognition and cleavage by the A protein. We conclude that different parts within the DNA recognition domain of the A protein are functional in the recognition of the origin sequence in single-stranded DNA. The existence of different DNA recognition domains in the A protein, and therefore also in the A protein, leads to a model that can explain how the A protein performs its multiple function in the 0X174 DNA replication process (2). INTRODUCTION Gene A of bacteriophage 0X174 codes for the only phage protein that is involved in the DNA replication process of the virus. Other proteins required for this process are provided by the host cell (3). The main product of gene A is a protein of 55 kD and this A protein has been investigated in great detail, both in vivo and in vitro (4-7). It has been shown that this protein fulfills several functions during the phage replication process. The A protein initiates the RF DNA replication by introducing a nick in the viral plus strand of the supertwisted RFI DNA molecule. At this site which is the origin of replication,a free 3' OH terminus is created which functions as a primer for DNA synthesis. The A protein becomes covalently attached to the 5' end of the nicked strand and is thought to interact with other proteins necessary for the initiation of the DNA synthesis. After initiation the DNA replication proceeds in a rolling circle mode (2). The A protein is also capable to linearize and circularize the displaced plus strand DNA after each round of replication. These steps require a re-cleavage at the origin site and ligation of the linear single-stranded DNA by the A protein. © IRL Press Limited. 1 Falconberg Court. London W 1 V 5FG. U.K. 545 Nucleic Acids Research The genetic organisation of the region in 0X DNA that contains gene A is rather complex. Three different proteins, the A protein (55 k D ) , the A pro- tein (37 kD) and the B protein (13.8 k D ) , are coded for by this part of the DNA molecule.The A and A proteins use the same reading frame of the DNA. The B protein is made in a different reading frame and is not involved in the phage DNA replication (8,9). Whereas the A protein is coded for by the entire gene A the A protein is synthesized from a translational start signal within gene A. The translation of the A and A proteins terminates at the same stop- codon (1, 10). The region also contains the origin of replication. The sequence at the origin of replication in 0X174 DNA is characterised by an AT-rich sequence of 30 nucleotides. This sequence is highly conserved in the bacteriophages 0X174, G4 and Stl and is surrounded by GC-rich regions (1, 11-14). These features allow the AT-rich sequence at the origin to adopt a partially denatured structure when the RF DNA molecule is supertwisted. Thus recognition and cleavage of the origin sequence in RFI DNA may resemble to some extent the recognition and cleavage of the origin sequence in single-stranded DNA. WLthin the AT-rich sequence of 30 nucleotides three different symmetrical sequences are found, -CAACTTG-, -TATTAATA- and -CTATAG-. These sequences may be. important for a specific recognition process between the RFI DNA or single-stranded DNA and the A protein, as many enzymes interacting with DNA require such symmetrical sequences. In contrast to the A protein which has well characterized functions in the viral DNA replication, the function of the A stood. The A protein is not clearly under- protein has been characterized as a single-stranded DNA specific endonuclease (lo), it cleaves 0X viral plus strand DNA at the origin of replication and at many other sites. It does not cleave supertwisted 0X RFI DNA under conditions where the A protein is active on RFI DNA (lS,lb,20 ) . This makes it less likely that the A protein is also involved in the initiation of the RF DNA replication. The in vitro properties of the A and A proteins are compared in Table 1 . Since the aminoacid sequence of the A protein is also present in the A pro- tein the nucleolytic activity of both proteins should be strongly related. This is confirmed by the similarities of the enzymatic activities of the two proteins as shown in Table I. Both proteins recognise the origin sequence, nick at this site and have ligation activity. We studied the properties of the A protein to elucidate some of the basic aspects of the nucleolytic activities that are found in the A and A 546 proteins. Nucleic Acids Research Table I . PROPERTIES OF THE A AND A* PROTEINS IN VITRO A protein cleaves RFI DNA (Mo**) • cleaves RFI DNA (Mn**) + A* protein ref. (2,5.6.11) + forms RFIII DNA (Mn**) + forms RFIV DNA (Mo**) (15) (15) (a) (15) forms RFIV DNA (Mn**) * cleaves ss DNA at origin • (15) + (16.17) cleaves ss DNA at other sites + (16) attaches covalently to 5' end of the nick • (16) + (b) binds to ds DNA binds to ss JNA + (b) (a) Highly purified A protein which was obtained after purification on a heparine-sepharose column in addition to the procedure described in (15), did not fora RFIII DNA upon incubation of it RFI DNA in the presence of Mn . (b) Results of DNA Dinding experiments are personal conmunications of H. v.d. Avoort. The extensive nuclease activity of the A protein on viral ss DNA provided a way to trace the specificity of this enzyme in the process of DNA recognition and cleavage. In this paper we present an analysis of the nucleotide sequences found at the different cleavage sites for the A protein in the 0X viral ss DNA. The results presented here show that a specific sequence at the origin of replication is essential for the recognition and cleavage by the A protein. This sequence can be divided into different parts each of which can be recognized separately by the A protein. The properties of the nuclease activity of the A protein are extrapolated to the A protein and a model based on these properties is presented for the multiple enzymatic activities of the A protein in the RF DNA replication process. MATERIALS AND METHODS Chemicals. |y-32P|ATP (specific activity 2000-3000 ci/nnnole) was purchased from the Radiochemical Centre (Amersham, U.K.); forraamide was from Merck and purified prior to use by stirring quantities of 100 ml with 5 g of the mixed bedresin AG 501-X8 (D) from Bio-Rad Laboratories. Acrylamide was from Serva and bisacrylamide (ultra pure) was from Eastman-Kodak Co; agarose was from Serva. Enzymes. Proteinase K was purchased from Merck; bacterial alkaline phospha- 547 Nucleic Acids Research tase (BAPF) from Worthington and T4 polynucleotide kinase from Boehringer (Mannheim). Haelll restriction enzyme was kindly provided by Dr. P. Baas. The A and A proteins were isolated as described in (15). Preparation of single-stranded DNA restriction fragments. Viral singlestranded DNA was isolated from virus particles by phenol extraction. 30 ng of viral ss DNA was digested with Haelll restriction enzyme in a 0.5 ml reaction mixture containing 50 mM Tris-HCl at pH 7.5, 10 mM MgCl.,, 5 mM dithiothreitol and 1 mM EDTA. Incubation proceeded for 16 hrs at 37 C. The reaction was terminated by the addition of EDTA to 50 mM and 500 yg proteinase K. Incubation was continued for 30 min at 37°C. The mixture was treated with phenol followed by precipitation of the DNA fragments with ethanol. Finally the DNA fragments were dissolved in 100 pi 50 mM Tris-HCl at pH 7.5, treated with alkaline phosphatase and labelled at their 5' termini with | Y - 3 2 P | A T P by T4 polynucleotide kinase as described by Maxam and Gilbert (18). The overall recovery of single-stranded DNA fragments after labelling was 32 estimated to be approx. 50%. p-labelled fragments were run on a neutral pre- parative 5% polyacrylamide slab gel in 40 mM Tris-acetate, 20 mM Na-acetate and 2 mM EDTA at pH 7.7. The gel was autoradiographed to visualize DNA fragments, which were eluted from the gel as in (18). Assay for nuclease activity on ss DNA fragments. 10 pi reaction mixtures contained 0.15 - 0.2 pinoles ss DNA fragments, 1 pinole A protein, 50 mM Tris- HCl at pH 7.5, 150 mM NaCl, 10 mM MgCl , 5 mM dithiothreitol and 1 mM EDTA unless otherwise stated and were incubated during 90 min at 37°C. Reactions were terminated by addition of EDTA to 50 mM and 10 yg of proteinase K. The incubation was continued for 30 min at 37 C. Finally 20 ul formamide containing 20% sucrose, 0.25% xylene cyanol, 0.25% bromophenol blue and 0.2% sarkosyl was added. The mixtures were then heated for 3 min at 100 C and layered on a denaturing slab gel. Agarose gel electrophoresis. Horizontal slab gels (1.4% agarose) were run in 40 mM Tris-acetate, 20 mM Na-acetate and 2 mM EDTA at pH 7.7 in the presence of ethidium bromide (5 ug/ml) and photographed under ultraviolet light (366 nm) . Denaturing gel electrophoresis. Denaturing gels (33 x 250 x 1.5 mm) were all 6% polyacrylamide gels made up in 98% formamide as described by Maniatis et al. (19). Sequence analysis. Sequence analysis was according to the chemical degradation method of Maxam and Gilbert (18) . Specific cleavage at guanine residues was achieved with dimethyl sulfate, at adenine residues by ringopening with 548 Nucleic Acids Research alkali, at cytosine and thymine residues by hydrazinolysis and at cytosine residues alone by hydrazinolysis in 2 M NaCl. Strand-scission was achieved by incubation with piperidine in all cases. For resolution of the cleavage products, 20% acrylamide gels were run in 7 M urea, 50 mM Tris-borate and 1 mM EDTA at pH 8.3. RESULTS A protein nuclease activity on 0X viral single-stranded DNA The purity of the A protein preparation used in the experiments was tes-: ted in two different ways. Firstly by electrophoresis on SDS polyacrylamide gels. In this way no additional protein bands were detected. Secondly by incubation of 0X RFI DNA with A* protein in the presence of Mg + + (see Table I ) . The A protein does not nick RFI DNA under these conditions (15, 20), so that contamination with A protein would be detected by conversion of RFI DNA into RFII DNA. The A protein itself is not inhibitory to the action of the A pro- tein on RFI DNA (data not shown). In this way no traces of A protein were found in our A protein preparation, not even at a protein/DNA molar ratio of 50. Incubation of circular single-stranded 0X DNA with increasing amounts of the pure A protein preparation and analysis of the reaction products on an aga- rose gel gives the result shown in Figure 1. At first two distinct subfragments of about 2100 and 3300 nucleotides appear. With higher protein concentrations the DNA is found as a broad band with higher mobility. This indicates that substantial degradation of the DNA has occurred. Circular and linear single-stranded DNA have the same mobility on the agarose gel used. Therefore the first nick in the circular DNA remains unnoticed. The conclusion is that two relatively strong cleavage sites for the A protein must be present in the single-stranded DNA, next to a large number of weaker sites. When the cir- Figure 1. Agarose gel electrophoresis of 0X circular single-stranded DNA incubated with increasing amounts of A protein. 1.5 pmoles ss DNA were incubated with (b) 6 pmoles, (c) 30 pmoles, (d) 60 pmoles and (e) 120 pmoles A protein, (a) is a control without A protein. Reaction conditions were as described in Materials and Methods. Arrows indicate the discrete subfragments. S4S Nucleic Acids Research cular single-stranded DNA is treated with A protein under similar experimental conditions only one nick at the origin of replication is introduced (16). Determination of the positions of the cleavage sites The position of the cleavage sites was determined by digesting specific ss DNA fragments with A protein. Single-stranded 0X DNA isolated from phage par- ticles was digested with the restriction enzyme Haelll which cleaves singlestranded DNA with the same sequence specificity as it does double-stranded DNA (16, 2 1 ) . The single-stranded DNA restriction fragments were labelled at 32 their 5' termini with P and purified as described in Materials and Methods. All restriction fragments were separately incubated with A protein, depro- teinzed with proteinase K and applied to formamide-polyacrylamide gels. On each gel a set of Haelll restriction fragments with known sizes was coelectrophoresed to serve as length markers. When the Haelll fragments are plotted semilogarithmically versus their migration distances a good linear relationship is found for the fragments Z4 (603 nucleotides) through Z10 (72 nucleotides) on a 6% polyacrylamide gel containing formamide. Therefore the sizes of the A protein produced subfragments can be deduced with reasonable accu- racy from their relative positions on the gel. Since only the 5' termini were labelled the size of each subfragment determines the position of a cleavage site in the 0X DNA sequence (1). An example of such an A protein digestion is given in Figure 2 for the fragments Z3, Z4 and the partial digestion product Z5-Z8. The cleavage sites determined in this way are listed in Table II. Efficiency of cleavage It is evident from Figure 1 that there is a large difference in cleavage efficiency for the various sites. Furthermore, upon incubation of individual restriction fragments with A* protein alsmost every fragment gives a certain number of subfragments (see Table II), but in varying fields. We determined the sites which are most efficiently cleaved by incubating mixtures of DNA fragments with limited amounts of A protein. An example of such an analysis is shown in Figure 3. The results show that restriction fragments Z5 through Z10 give two distinctive subfragments upon incubation with A protein, one of approx. 98 nucleotides and one smaller than 30 nucleotides. The 98 nucleotide long subfragment results from a cleavage at the origin site in Z6B (16). The small subfragment obviously results from a cleavage in the Z8 restriction fragment, as the Z8 band shows a significant decrease of intensity on the autoradiograph. Under similar conditions of incubation with a limited amount of A 550 protein, a mixture of the restriction fragments Zl through Z6B only Nucleic Acids Research Figure 2. Formamide polyacrylamide gel containing the individual restriction fragments Z3, Z4 and the partial digestion product Z5-Z8 incubated with A protein, (a): 0.3 pmoles of Z3, 0.3 pmoles of Z4 and 0.1 pmoles of the partial Z5-Z8 were incubated with 0.1 pmoles A protein, (b): control without A protein. Reaction conditions were as described in Materials and Methods. Marker positions are indicated. Z 6A gives the 98 nucleotide subfragment. We conclude from these data that there are two extremely efficient cleavage sites for the A protein in 0X viral single-stranded DNA, one at the origin of replication and another one in the Z8 restriction fragment. The positions of these sites are in good agreement with the sizes of the two discrete subfragments of 2100 and 3300 nucleotides found when intact single-stranded DNA is incubated with A protein (Fig. 1). The position of the site in Z8 was also determined by cleavage of a DNA fragment that contains Z5 and Z8. This partial digestion product is due to the stable hairpin that can be formed in the GC-rich region in which the Haelll cleavage site is located (22). Under our conditions approximately 50% of the single-stranded DNA remains uncleaved at this site. It was found that the cleavage site of the A protein in Z8 is about 10 nucleotides away from the Z5-Z8 junction. This position corresponds with the results obtained upon digestion of Z8 alone (data not shown). A comparison of the sequence at the cleavage site in Z8 and the sequence at the origin of replication in Z6B is given in Table III. Remarkable similari- 551 z CD o' > TABLE II. CLEAVAGE SITES OF THE A* PROTEIN IN 0X VIRAL SINGLE-STRANDED DNA Fragment Length of subfragment Position of cleavage site 21 235 - 245 190 - 200 2016 + 5 Z2 Fragment Z3 Length of subfragment 740 - 760 Position of cleavage site 313 + 153 + Fragment Lenght of subfragment 10 Z5 180 - 205 10 Z6A 165 - 170 lt>0 - 170 1971 + 5 1941 ± 5 500 - 510 68 + 5 100 - 120 1886 + 10 380 - 400 5339 t 10 90 - 95 1869 + 3 285 - 295 5239 + 5 84 - 86 1861 + 1 195 - 200 5147 + 3 670 - 690 3809 + 10 170 - 175 5122 + 3 150 - 160 3284 + 5 145 - 155 5099 95 - 105 3229 + 5 85 - 95 3219 + 5 50 - 60 3184 i 5 < 30 3159 - ? 580 - 600 90 - 100 7* Z4 130 - 135 i5 Z6B Z5-Z8 Z9 5044 + 5 Position of cleavage site 362 + 13 4656 *_ 3 130 - 135 4621 + 3 30 - 35 4521 + 3 98* 320 - 325 4305 992 + 3 29* 4826 4788 60* 4937 67* 4956 Z10 1306 + 3 ±3 80 - 85 1256 60 - 65 1236 + 3 40 - 45 1226 + 3 Restriction fragments were incubated with A* protein as described in Materials and Methods. The restriction fragment Z7 was not cleaved under these conditions. The lengths of the subfragments derived from the formamide polyacrylamide gels are expressed in nucleotides. The values determined in different experiments fall in the range that is indicated for each subfragment. Asterisks mark the subfragments of which the length has been established by sequence analysing techniques. Nucleic Acids Research Figure 3. A mixture of restriction fragments Z5 through Z10 was incubated with A protein, (a): 0.8 1.0 pmoles ss DNA fragments were incubated without A protein and (b): with 0.3 pmoles A protein. Arrows indicate the subfragments. ties exist between the two sequences: i) 13 nucleotides in a stretch of 18 nucleotides are identical, i i ) a pyrimidine-rich sequence is present at the left side of the cleavage s i t e , i i i ) an AT-rich region is present at the right side. The sequence -ACTTGA- that contains the point of cleavage at the origin site is also found in the Z8 cleavage s i t e . These are the only two sites in the 0X viral ss DNA where this sequence is present. I t suggests that the hexanucleotide -ACTTGA- is important for the A protein - ss DNA interaction. TABLE I I I . MUCLEOTIDE SEQUENCES AT THE CLEAVAGE SITES Fragment Sequence Z5B 4293 - c T C C C CC A A C T T G'A T A T Z5-Z8 978 - G G, C C C CT T A C T T G A IN Z6B AND Z5-Z8 T A A T A A C A - 4317 G G A T A A A T T A T - 1002 Haelll Nucleotides ideadeal in both sequences are represented in boxed areas. The AT-rich regions scare at positions ^306 and 995, GT-rich regions scare at positions 4299 and 983 and proceed leftward. The arrow indicates the poinc of cleavage ac che origin of replication. 553 Nucleic Acids Research The best cleaved sites in the indivivual restriction fragments Z2 and Z3 were also determined. The fragments were incubated with an amount of A protein tenfold less than normal (see Materials and Methods). Under these conditions both fragments give only one subfragment with a low yield per subfragment. The sequences at these cleavage sites together with the cleavage sites in Z6b and Z8, are listed in Table IV. The other weaker cleavage sites in Z2 and Z3 are listed in Table VI. From these tables it is clear that the best cleaved sites have the best homology with the origin sequence. These sites also have the sequence -TTGA- in common. The tetranucleotide -TTGA- is present at 40 different places in the 0X viral single-stranded DNA. Only seven cleavage sites, including the best cleaved sites listed in Table IV, have this sequence in it (see Table VI). This means that the tetranucleotide -TTGA- is not necessary nor sufficient for cleavage by the A protein. It can therefore be concluded that there is no sequence common to all cleavage sites. However almost all sites show homology with the sequence around the origin to some extent. The best cleaved sites have contiguous sequences that are homologous to the sequence -ACTTGATATT- at the origin. This sequence therefore is expected to have an important role in the recognition and cleavage process of the single-stranded DNA by the A* protein. Determination of exact cleavage sites The lengths of the subfragments as derived from the migration distances in the formamide gels give the approximate positions of the cleavage sites. For a limited number of sites the exact position of cleavage was determined by more accurate methods. Subfragments obtained from the individual restriction fragments Z3, Z6B, Z9 and Z10 after incubation with A protein were isolated and subjected to sequence analysis, following the chemical degradation method TABLE [V. MUCLEOTIOE SEQUENCES AT THE BEST CLEAVAGE SITES Fragment Sequence Z63 42y3 - C T C C C C C A A C T T G A Z8 978 - . . C C C C . . A C T T G A Z2 3804 - T T G A Z3 69 - A . C T T G A T A T T A A T A . . . T A A . C A - 4317 . . . - T A T T . . T . A . . T A A - 1002 3823 83 The sequences at the besc cleaved sices are compared with the origin sequence. Docs mark che positions oc the non-homologous nucleocides. The coasaon sequence is represented in a boxed area. The poinc of cleavage ac che origin in che restriction Srasnnenc Z6B is indicated by che arrow. 554 Nucleic Acids Research of Maxam and Gilbert (18). Subfragments were selected on appropriate length for sequence analysis. The subfragments were degraded directly without relabelling of their 5' ends and were analysed on sequencing gels. An example of such an analysis is shown for the 29 nucleotide subfragment of Z9 (Fig. 4 ) . The nucleotide sequence up to the nucleotide at the 3' OH end of the subfragments could be determined. The exact cleavage sites determined in this way are present in Table V. The sequences are arranged in such a way that optimal homology to the origin sequence in Z6B is achieved. It is clear that the A protein does not cleave in the homologous sequences at a fixed position. In Z3 and Z9 the sites with the best homology to the origin sequence are cleaved at positions other than those expected from the similarity with the origin sequence. Remarkably, the sequence at the cleavage sites in Z9 (4826) and Z10 (4937) differ considerably from the origin sequence, but they resemble each other strongly. The heptanucleotide -TTCTGGT- in Z10 was also found at other cleavage sites of the A protein that have no or very little resemblence to the sequence at the origin. From the exact cleavage sites given in Table V it becomes evident that the A protein does not cleave between two specific nucleo- C T A G Flgure 4- v* • " \a Autoradiograph of a sequencing gel containing the chemical degradation products of the smallest subfragment (29 nucleotides) of Z9 obtained after incubation with A protein. SSS: Nucleic Acids Research TABLE V. CLEAVAGE SITES DETERMINED BY SEQUENCE ANALYSIS Fraqment Sequence Z6B 4299 - C A Z3 4950 - C c t 9 Z9 4781 - g c Z9 g - 4835 Z10 g - 4946 A C T T G•A T A T T A A T - 4313 T T G A•T 9 c T A A a - 4964 c - 4795 4821 - a A C T T t A*T 9 c g 9 A A t t T TJG g T c g T c 9 4932 - C 9 t t c T^G g T t g g t t Points of cleavage are indicated by arrows. Nucleotides which are homologous with the origin sequence are represented by uppercase characters. tides. Furthermore the sequences in Table V in addition confirm the previous observation that there is no common sequence required for the cleavage activity of the A protein on ss DNA. Sequences at the cleavage sites The cleavage sites presented in Table II have sequences that can be divided into two classes, one class of sequences with homology to the origin sequence and another class of sequences resembling the sequence -TCTGGT- as found in Z10. The origin—like sequences are listed in Table VI and are aligned in such a way that the homology with the origin is maximal. The pattern of homologous sequences shows that the sequence -CAACTTGATATT- comprises almost all homologous sequences, but that there is a broad variation in homology between the two terminal sequences -CC.CAACTT- and- -GATATT-. Apparently only nucleotides in this part of the origin region are crucial for recognition and cleavage by the A protein. However, no unique sequence that is part of this sequence can be deduced from the cleavage sites. It appears that only a small degree of homology with the origin sequence suffices as a signal for cleavage and, as was found for Z2 and Z3, that the efficiency of cleavage depends on the degree of homology. The second class of sequences resemble the sequence -TTCTGGT-, a sequence also found at two other cleavage sites in the 0X viral ss DNA. This sequence does not correspond with any sequence at the origin of replication. However, when nucleotides in this sequence are represented as pyrimidines or purines a part of it can be written as -PyPyTGPuT- which is identical with the sequence -CTTGAT- at the origin. All sequences of cleavage sites resembling the sequen- 556 Nucleic Acids Research TABLE VI. HOMOLOGY OF ORIGIN-LIKE CLEAVAGE SITES Fragment Sequence Z6B 4293 Z3 5138 - . Z3 5090 - Zl 1960 - . T . . . . C A A C T T G Z3 - C T C C C C C A A C T T G A T A T T A A T A A C A - 4317 . . C C . C A ACTT T . A . A A . . - 5162 C CAA . . T G . . . . T A . T . . C . - 5114 A - 1984 144 - C . . C C . . . .C T T G A . A - 168 Zl 1850 - . T C T T G . . . T T . . T . . . . - 1874 Zl 1856 - . . . C C T T G . T . . T . . T . . . . - 1880 Z2 3171 - Z4 1246 - Z3 5037 - Z8 C . .C T T G A . . . T A - 3195 C . T G A - 1270 A. C T T G . T . . T A A - 5061 978 - . . C C C C . . A C T T G A . . . T A A - 1002 Z4 1214 - . T C C C . C A . C . T . A . . T T . A T . . . . - Z3 5228 - . . C . . . C A A C . . G A T A T T . A Z9 4775 - . . C Z2 3273 - C . T G A T . . T . . T . . . . - 3297 Z2 3219 - C T T G A T . T . . . T . . . . - 3243 Z3 59 - Z2 3804 - Z3 A CTT . AT . . . . A . A . . A. C T T G A T A 1238 - 5252 . - 4799 A - 83 . T T G A T A T T . . T . A . . - 3828 310 - . . . . C . C . . . T T G A . A T T . . . A A . . - 334 Z6A 4506 - C . T T G . T A T T . A - 4530 Z3 4944 - . T . . . . C . . . T T G A T . . T A A - 4968 Z6A 4644 - . . C Z6A 4611 - . . . . C . . . . . . T G A T A . T . . . A A . . - C . T G A T A T T . . T . . . A - 4668 4635 Zl 1874 - . T . C C . C . .. , T G . . A . T A A . . A . A - 1898 Z2 3207 - . . . . C C . . . . . T G A T A Zl 1925 - . T . . C . . A . . . . G A T A T T . . T . . C . - 1949 - 3231 Sequences at the cleavage sites listed in Table II which are homologous with the sequence around the origin in Z6B, were aligned with the origin sequence. Nucleotides which are homologous to the origin sequence are represented by upper-case characters, dots mark the positions of aberrant nucleotides. ce -TTCTGGT- are listed in Table VII. Cleavage at the sequences resembling the sequence -TTCTGGT- is at identical positions in the restriction fragments Z9 and Z10 as is shown by the alignment of sequences in Tables V and VII. 557 Nucleic Acids Research TABLE VII. CLEAVAGE SITES WITH SEQUENCES HOMOLOGOUS t'lTH -TTCTGGT- Fragment Sequence Z10 4929 - T C A C GT T C T*G G T T G G T T G T G - 4948 Zl 2003 - T t c C GT T C T G G T g a t T c G T c - 2022 Z2 3141 - T t t g t T T C T G G T g c t a T G g c - 3160 Z5 863 - a a A C GT T C T G G c g c t c g c c c - 882 Z4 1226 - c a c g t T T a T G G T g a a c a G T G - 1245 Z9 4818 - c C t a a 7 T t T*G G T c G t c g G g t - 4837 Z4 1295 - a a c a c T a C T G G T T a t a T t g a - 1314 Z3 5327 - g a c a a a T C T G c T c a a a T t T a - 5346 Z3 5114 - c a A g a a g C T G t T c a G a a t c a - 5133 Sequences of cleavage sites listed in Table II which are homologous to the sequence -TTCTGGT-, are compared with the sequence around the cleavage site in the restriction fragment Z10. Homologous nucleotides are represented by upper-case characters, non-homologous nucleotides are represented by lower-case characters. Arrows indicate the cleavage points in Z9 and Z10 which were determined by sequence analysis (see also Table V ) . The boxed area includes the -TTCTGGT- like sequences. DISCUSSION The a n a l y s i s of the nuclease a c t i v i t y of the A p r o t e i n on 0X v i r a l stranded DNA has revealed 35 d i f f e r e n t single- cleavage s i t e s . I t i s c l e a r t h a t not a l l p o t e n t i a l cleavage s i t e s of the A p r o t e i n on v i r a l ss DNA could be d e t e c ted with our method. E f f i c i e n t l y cleaved s i t e s near the 5 ' ends of the r e s - t r i c t i o n fragments can easily mask l e s s e f f i c i e n t l y cleaved s i t e s further away from the l a b e l l e d 5 ' terminus. The sequences of the majority of these c l e a vage s i t e s show s i g n i f i c a n t homology with the sequence at the r e p l i c a t i o n o r i gin of the phage DNA. Eight of these s i t e s however, have b e t t e r homology with the sequence -TTCTGGT-. I t was found t h a t the s i t e s which a r e most efficient- ly cleaved show good homology with the sequence a t the o r i g i n . This was demonstrated for the cleavage s i t e in Z8 and the b e s t cleavage s i t e s in Z2 and Z3. The sequences found a t the many cleavage s i t e s of the A p r o t e i n show c l e a r l y that the sequence of 30 nucleotides at the o r i g i n i s not required for s p e c i f i c recognition by t h e A p r o t e i n . All cleavage s i t e s , including the best ones, have sequences t h a t are homologous to p a r t s of the sequence -CAACTTGATATT- a t the origin of r e p l i c a t i o n . No common or unique sequence emerged from the sequences a t the cleavage s i t e s . This was already exprected 558 Nucleic Acids Research from the large difference in efficiency of cleavage at the various sites. Less efficient cleavage activity of the A protein at a certain site is re- flected in a lower degree of homology with the origin sequence. The symmetrical sequences around the origin are not required for recognition and cleavage of single-stranded DNA by the A protein. This is in agreement with the fin- dings of Van Mansfeld et al. (23), who recently demonstrated that the synthetic oligonucleotide -ACTTGATA- can be cleaved specifically by the A protein of 0X174. The actual cleavage at the cleavage sites occurs however at positions that differ from what is expected from the homology with the originsequence. Therefore it can be concluded that the A protein has a more relaxed sequence specificity than the larger A protein which nicks only in the originsequence and nowhere else in the 0X viral single—stranded DNA (16). We observed that entirely different parts of the sequence -CAACTTGATATT- can function as a signal for recognition and cleavage by the A protein. It is re- markable that the sequences -CAACTTG- and -GATATT- which represent the parts left-ward and right-ward from the point of cleavage at the origin of replication can be recognized and cleaved by the A sequence the A protein individually. As a con- protein must have DNA recognition domains which are able to recognise these parts of the origin sequence separately. This suggestion is supported by the ligation activity of the A protein. It has been demonstra- ted that relaxed RFIV DNA is formed upon incubation of 0X RFI DNA with A protein in the presence of Mn (15). In this reaction the A protein cleaves the RFI DNA at the origin of replication. While it is covalently bound to the 5' end of the DNA, it recognizes the 3' OH end of the nick after the release of supertwists from the RF DNA and it ligates both termini. Since the A and the A protein have very identical fundamental properties; cleavage of viral single-stranded DNA between two specific nucleotides at the origin of replication, covalent attachment to the 5' end of the cleaved DNA molecule and ligase activity, we expect that recognition of the specific sequence around the origin is basically the same for the A and the A protein. The independent recognition of the extreme parts of the sequence CAACTTGATATTat the origin by the A protein therefore provides an explanation for the multifunctional behaviour of the A protein in the phage DNA replication process as described by Eisenberg et al.(2). A model for the role of the different DNA recognition domains in the A protein which can recognise the different parts of the origin sequence during the bacteriophage RF DNA replication and viral strand DNA synthesis is presented in Figure 5. 559 Nucleic Acids Research Figure 5. See text for explanation The first function of the gene A protein in the phage DNA replication is the recognition and cleavage of the sequence at the origin in RFI DNA (Figure 5, A ) . In RFI DNA the duplex DNA structure of the AT-rich origin is thought to be destabilised (24). Recognition and cleavage of the origin sequence in RFI DNA and viral single-stranded DNA are therefore very similar. When the DNA is cleaved the protein becomes covalently bound to the 5' end of the cleavage site and a 3' OH terminus becomes available for the priming of the DNA synthesis (Figure 5, B) . After one round of replication the displaced plus strand DNA is split off into a genome length circular molecule by the A protein. In this reaction the origin sequence must specifically be recognized by the A protein that is covalently bound to the right part of the origin sequence (Figure 5, C ) . Therefore a free DNA recognition domain of the A protein is supposed to interact with the left part of the sequence at the origin. The displaced plus strand DNA is subsequently cleaved and the A protein switches from the 5' end of the old DNA strand to the 5' end of the new strand under simultaneous ligation of the 51 and 3 1 ends of the displaced strands. In this process a new circular DNA molecule is formed which is used as a template for the complementary 560 Nucleic Acids Research strand, or is encapsidated during phage maturation (2). The sequence that is recognized by the free part of the A protein actually functions as a signal for the termination of the plus strand DNA synthesis. If this model is also true for the filamentous phages it might offer an explanation for recent results obtained by Horiuchi (25). Ke found that when a second origin is cloned into f] the area between these origins is always deleted in progeny phage, suggesting that the origin sequence functions as a termination signal. From the model described above it is clear that the first origin sequence that is reached by the replication fork will act as a signal for the A protein or in the case of fl the gene II protein to split off and circularize the displaced strand. Consequently, the presence of additional sequences in the plus strand DNA that can function as a termination signal will abort the synthesis of intact plus strand DNA. This means that the sequences with the best homology to the origin sequence, listed in Tables IV and VI are not able to function as a termination signal during the in vivo DNA replication. Therefore the sequence -CAACTTG- alone is not enough for termination although it is recognized by the A protein. The minimal sequence to function as a termination signal therefore has to be larger that -CAACTTG-. The model presented in Figure 5 accounts for a processive action of a monomeric A protein molecule in the RF DNA replication and viral DNA synthesis (2, 17). Experiments are in progress to analyse in more detail DNA recognition domains in the A and A The sequences at the cleavage sites of the A proteins. protein have given information about sequences at the origin site that are essential for recognition and cleavage by the A and A proteins. Other sequences at the origin, however, may be required for a specific interaction with host DNA replication enzymes like the rep unwinding protein and DNA polymerase III. This might explain why a much longer sequence of 30 nucleotides is conserved in the phages 0X174, G4 and St-1. The RFI DNA of the phages G4 and St-1 is also cleaved by the 0X gene A protein (13). The activities of the A protein determined in vitro do not provide a clear in vivo function, but do point to an involvement of the A protein in the single-stranded DNA synthesis and/or maturation of the virus. It was suggested by Martin and Godson (26) that the A protein is also involved in the arrest of host cell DNA synthesis. The experiments presented here give a possible mechanism for such an interaction. If the A protein nicks in chromosomal single-stranded DNA e.g. in the replication fork, then the host DNA synthesis will be affected. 561 Nucleic Acids Research We conclude that the nuclease activity of the A protein is analogous to the A protein nuclease activity and that the higher specificity of the A protein comes from the extra NH_-terminal part of the A protein. REFERENCES 1. Sanger, F., Air, G.M., Barrell, B.G., Brown, N.L., Coulson, A.R., Fiddes, J.C., Hutchinson III, C.A. , Slocotnbe, P.M. and Smith, M.(1977) Nature 265, 687-695 2. Eisenberg, S., Griffith, J. and Kornberg, A. (1977) Proc. Natl. Acad. Sci. USA 74, 3198-3203 3. Tessman, E.S. (1966) J. Mol. Biol. 2 1 , 218-236 4. Francke, B. and Ray, D.S. (1971) J. Mol. Biol. 6±, 565-586 5. Henry, T.J. and Knippers, R. (1974) Proc. Natl. Acad. Sci. USA 7j_. 15491553 6. Ikeda, J., Yudelevich, A. and Hurwitz, J. (1976) Proc. Natl. Acad. Sci. USA J73_, 2669-2673 7. Eisenberg, S. and Kornberg, A. (1979) J. Biol. Chetn. j?54, 5328-5332 8. Smith, M. , Brown, N.L., Air, G.M., Barrell, B.G., Coulson, A.R., Hutchinson III, C.A. and Sanger, F. (1977) Nature 265, 702-705 9. Weisbeek, P.J., Borrias, W.E., Langeveld, S.A., Baas, P.D. and van Arkel, G.A. (1977) Proc. Natl. Acad. Sci. USA 74, 2504-2508 10. Linney, E. and Hayashi, M. (1973) Nature New Biol. ^4J>, 6-8 11. Langeveld, S.A., van Mansfeld, A.D.M., Baas, P.D., Jansz, H.S., van Arkel, G.A. and Weisbeek, P.J. (1978) Nature 2T\^, 417-420 12. Fiddes, J.C., Barrell, B.G. and Godson, G.N. (1978) Proc. Natl. Acad. Sci. USA _7_5, 1081-1085 13. Heidekamp, F., Langeveld, S.A., Baas, P.D. and Jansz, H.S. (1980) Nucl. Acids Res. £ , 2009-2021 14. Weisbeek, P.J., van Mansfeld, A.D.M. , Kuhlemeier, C , van Arkel, G.A. and Langeveld, S.A. (submitted) 15. Langeveld, S.A., van Arkel, G.A. and Weisbeek, P.J. (1980) FEBS Lett. 114, 269-272 16. Langeveld, S.A., van Mansfeld, A.D.M., de Winter, J. and Weisbeek, P.J. (1979) Nucl. Acids Res. ]_, 2177-2188 17. Eisenberg, S. (1980) J. Virology 3_5_, 409-413 18. Maxam, A.M. and Gilbert, W. (1977) Proc. Natl. Acad. Sci. USA 74_, 560564 19. Maniatis, T. , Jeffrey, A. and van de Sande, M. (1975) Biochemistry 14, 3783-3794 20. Ikeda, J., Yudelevich, A., Shimamoto, N. and Hurwitz, J. (1979) J. Biol. Chem. 2^4_, 9416-9428 21. Blakesly, R.W. and Wells, R.D. (1975) Nature 257., 421-422 22. Mi'iller, U.R. and Wells, R.D. (1980) J. Mol. Biol. Ut±, 25-41 23. Van Mansfeld, A.D.M., Langeveld, S.A., Baas, P.D., Jansz, H.S., Van de Marel, G.A. Veeneman, G.H. and van Boom, J.H. (!980) Nature (in press) 24. Jacob, R.J., Lebowitz, J. and Kleinschmidt, A.K. (1974) J. Virol. 13. 1176-1185 25. Horiuchi, K. (1980) Proc. Natl. Acad. Sci. USA 77.. 5226-5229 26. Martin, D.F. and Godson, G.N. (1975) Biochem. Biophys. Res. Commun. 65, 323-330 562
© Copyright 2026 Paperzz