Determination of the DNA-interacting region of the archaebacterial

Nucleic Acids Research, Vol. 19, No. 18
4937-4941
Determination of the DNA-interacting region of the
archaebacterial chromosomal protein MC1. Photocrosslinks
with 5-bromouracil-substituted DNA
Mehmaz Katouzian-Safadi, Bernard Laine1, Francois Chartier1, Jean-Yves Cremet,
Denise Belaiche1, Pierre Sautiere1 and Michel Charlier
Centre de Biophysique Mol§culaire, CNRS, 1A Avenue de la Recherche Scientifique, 45071
Orleans-Cedex 2 and 1URA 409 du CNRS, Institut de Recherche sur le Cancer de Lille,
Place de Verdun, 59045 Lille, France
Received June 11, 1991; Revised and Accepted August 28, 1991
ABSTRACT
Protein MC1 is the major chromosomal protein in
methanosarcinaceae. Using photochemical crosslinking on 5-bromouracil-substituted DNA, we identified
the region of the protein that interacts with it. This
region is located in the C-terminal part of the polypeptlde chain, and the crosslinked amino-aclds are in
the region 74 - 86. Tryptophan 74 is one of the aminoacids crosslinked to DNA.
This method has been successfully used for various proteins:
histones (12), lac repressor (13,14), £coRl (15) and cyclic AMP
receptor protein (16).
We have determined the region of the MCI protein which is
cross-linked to DNA by sequence analysis of the peptides
remaining bound to DNA after proteolytic cleavage.
MATERIAL AND METHODS
INTRODUCTION
In eubacteria, and in two groups of archaebacteria (thermophilic
sulfur-dependent, and methanogen bacteria), small and abundant
DNA-associated proteins have been shown to modify DNA
structure and to be involved in DNA replication or transcription
(see reviews I - 3 ) . Protein MCI, a basic polypeptide of 11 KDa,
is the major protein of the deoxyribonucleoprotein complex of
methanosarcinaceae, a methanogen family (4—6). Experiments
performed with protein MC 1, isolated from Methanosarcina sp
CHT155, show that protein MC 1 protects DNA against thermal
denaturation (5) and induces DNA bending and supercoiling (7).
Through comparative studies of the primary structures of the
protein MCI isolated from four species, and the use of predictive
methods of secondary structure determination, two regions can
be defined in this protein (8-11). The first region contains two
well-conserved sequences (residues 17—35 and 45—58) with high
probability of beta-sheet and alpha-helix conformations separated
by beta-turns. The second region, located in the carboxy-terminal
third of the protein, is highly variable but contains four strictly
conserved proline residues, three of them having a high
probability to form beta-turns.
Our goal in this paper was to identify the region of the protein
interacting with DNA, using a photochemical cross-linking
technique. We used DNA wherein the thymine residues have been
partially substituted by 5-bromouracil (BrUra). The advantage
of BrUra-substituted DNA is that it allows irradiation at
wavelengths long enough (X>300 nm) to reduce significantly
the direct photolysis of both DNA and proteins, with a crosslinking efficiency considerably greater than with normal DNA.
Endoproteinase Glu-C (EC 3.4.21.19) and endoproteinase LysC (EC 3.4.21.50) were purchased from Boehringer. Acetonitrile
for HPLC was from Carlo Erba. Reagents, solvents and
polyethylenimine used for gas-phase sequencing, were purchased
from Applied Biosystems.
Preparation of BrUra -substituted DNA
BrUra-substituted DNA was purified from an E. coli B/rT~
strain, as described previously (14). The percentage of substitution
was assayed by determination of the buoyant density of the DNA
in a CsCl gradient (17) and found to be 75%. Concentrations were
determined spectrophotometrically, assuming an extinction
coefficient of 13000 M~'cm~' per base pair.
Preparation of protein MCI
Methanosarcina sp CHTJ55 (strain 2902) was grown as described
previously (5).
Protein MCI was prepared as already described (4,7). The
protein concentration was determined spectrophotometrically
assuming an extinction coefficient of 11200 M~'cm~' per
protomer at 280 nm.
Preparation of MC1-DNA complexes
Samples (2.4 ml) were prepared by adding, under gentle stirring,
the protein solution to the DNA solution, both in 10 mM sodium
phosphate, pH 7.25. Final concentrations were 15 /tM protein
and 225 /tM DNA (as base pairs), leading to a ratio of one
molecule of protein per 15 base pairs. In these conditions, we
verified that all molecules of protein are bound to DNA.
4938 Nucleic Acids Research, Vol. 19, No. 18
When necessary, dissociation of the complex can be achieved
by increasing the salt concentration to 0.5 M KC1, 0.2 M
potassium phosphate, pH 7.25.
In this paper, low ionic strength buffer refers to 10 mM sodium
phosphate, pH 7.25, and high ionic strength buffer to 0.5 M KC1,
0.2 M potassium phosphate pH 7.25.
Irradiation of the complexes
Irradiations were performed using a high pressure mercury lamp
(Osram HbO 200 Watts) equipped with filters isolating radiations
of wavelengths ranging from 300 to 400 run (14). The fluence
rate in the front of the irradiation vessel was 480 Wm~2, and
the total incident fluence during the crosslinking reaction was
1.73 106 Jm- 2 .
Chromatographic separation of free and crosslinked proteins
or peptides
Free and crosslinked proteins or peptides were separated by
molecular sieving chromatography on AC A 34 gel. The column
(25x2 cm) was equilibrated and eluted with the high ionic
strength buffer.
Determination of the non-crosslinked peptides (Fig.2,
pathway 1)
The crosslinked DNA-protein complex, rid of free protein as
previously described, was dialysed against water and concentrated
by ultracentrifugation at 223000 Xg for 5 hours. After elimination
of the supernatant, the pellet was redissolved in 0.5 ml water
and adjusted to 0.1 M ammonium bicarbonate, 0.7 M NaCl, pH
7.8. The protein was then digested with endoproteinase Glu-C,
at 37°C for 18 hours, using an enzyme to substrate ratio of 1/50
(w/w).
The digest was centrifuged at 223000Xg for 5 hours. Free
peptides remaining in the supernatant were fractionated by
reversed phase HPLC on a C18 jiBondapack column (300x3.9
mm, Millipore) eluted by a gradient of acetonitrile in 0.05%
trifluoroacetic acid.
Amino-acid analyses of peptides were performed as previously
described (9).
Determination of the crosslinked peptides (Fig.2, pathway 2)
The irradiated complex solution (2.4 ml) was adjusted to 4.7 M
urea. Proteolysis by endoproteinase Lys-C was performed at
20°C for 10 hours at a final enzyme-to-substrate ratio of 1/25
(w/w).
The digest was then dialysed against the low ionic strength
buffer to remove urea, adjusted to high ionic strength, and
chromatographed on ACA 34. The fractions containing the
crosslinked peptides bearing DNA were pooled. The
nucleopeptide was precipitated with cold ethanol, centrifuged,
dissolved and dialysed against distilled water.
Ammonium bicarbonate was added to 0.2 M, before a second
extensive dialysis against bidistilled water. This step allows the
replacement of all contaminating ions and soluble compounds
by a volatile salt.
After concentration to 0.27 ml by evaporation, the sample was
directly submitted to sequencing.
Automated Edman degradation was carried out on a gas-phase
Applied Biosystems sequencer, using the 03RPE1 program, with
polyethylenimine (Applied Biosystems) as carrier (19).
Phenylthiohydantoin (PTH) derivatives of amino-acids were
identified by reversed phase HPLC with an on-line Applied
Biosystems 120 amino-acid analyser.
RESULTS
Cross-linking of protein MCI to DNA
After irradiation at low ionic strength, the complex solution was
adjusted to high ionic strength, to completely remove noncrosslinked proteins from the DNA. Fig. 1 shows the separation
DNA-MC1 complex
I
Irradiation
Cross-linked material
and frae proteins
PATHWAY 1
ACA 34 chromato.
Relative absorbance or fluorescence.
PATHWAY 2
Endo.Lys-C
digestion
Frit
prottln
MC1 crosalinked to DNA
I
ACA 34 chromato.
0.8 1Endo.Qlu-C
digestion
I
Ultracentrifugation
Pallet
DNA orost*
llnkad to
Supernatant
piptlda(t)
Free peptides
I
Peptide(s) crosslinked to ONA
Sequence
determination
HPLC
6
11
16
21
26
31
Fraction number.
Figure 1. Separation of non-crosslinked and crosslinked protein MCI by molecular
sieving chromatography. 2.4 m] of irradiated complex was loaded on the ACA34
column (20x2cm)andeftitedathigh ionic strength with a flow rate of 12 ml/hour.
Fractions of 1.5 ml were monitored by absorbance at 260 nm and by fluorescence
at 345 nm (excitation at 290 nm).
/
Amino-acids analysis
Identification of
free peptides
Identification of
the bound
peptide(s)
Figure 2. Strategies used to determine the DNA-crosslinked region of protein
MCI.
Nucleic Acids Research, Vol. 19, No. 18 4939
of crosslinked material from the free protein, on an ACA 34
column eluted at high ionic strength. Peak I, eluted in the void
volume, contains the DNA bearing the crosslinked proteins. Peak
II corresponds to free protein. From the relative area of the
fluorescence of each peak, an average value of 20% of
crosslinked proteins was determined.
To ascertain that crosslinks do not result from a diffusion
controlled reaction, a mixture of DNA and protein was irradiated
at high ionic strength which prevents complex formation. No
crosslinking was detected under these conditions.
Strategies used to determine the DNA binding domain of
protein MCI
At the beginning of our work, we intended to identify, after
proteolysis, the peptides crosslinked to DNA and the peptides
not bound to DNA (Fig. 2, padiway 1). For this purpose, the
non- crosslinked proteins were eliminated by molecular sieving
chromatography prior to digestion of the crosslinked material with
endoproteinase Glu-C. This enzyme was chosen to obtain a low
number of peptides (Fig. 3 A). Free peptides could be identified
after their fractionation by HPLC. We did not succeed in
identifying the peptide crosslinked to DNA, since low yields were
obtained at each step of purification.
This led us to use a simpler strategy (Fig. 2, pathway 2). Just
after irradiation, both crosslinked and non-crosslinked proteins
were digested with endoproteinase Lys-C. Free peptides were
eliminated by molecular sieving chromatography, and the
crosslinked peptides were directly identified by microsequencing.
Identification of the non-crosslinked peptides (Fig.2,
pathway 1)
The protein MCI crosslinked to DNA was cleaved with
endoproteinase Glu-C in 0.1 M ammonium bicarbonate (pH 7.8)
220 nm
containing 0.7 M NaCl. The high salt concentration, which
abolishes the electrostatic bonds between the protein and DNA,
was expected to increase the accessibility of potential sites of
cleavage. Moreover, to avoid eventual cleavage of the crosslinks
in acidic medium (20), neutral conditions were preferred for die
digestion by the endoproteinase.
Fig. 3A shows the HPLC elution pattern of the peptides
released from the crosslinked protein. The schematic
representation of these peptides in die protein sequence is shown
in Fig. 3B.
The diree peptides obtained from the crosslinked material were
identified by their amino-acid compositions. Peptides PE1
(residues 1 —63) and PEla (residues 1 -49) overlap the aminoterminal part of the protein. Peptide PE2 (residues 88-93)
overlaps the carboxy- terminus of the protein. No trace of a
peptide overlapping the sequence 64-87 was detected.
These results clearly show that there is no crosslinking site
in the region of die protein corresponding to peptides PE1 and
PE2, and strongly suggest diat the crosslinked residue(s) is(are)
located between amino-acids 64 and 87.
We can notice diat glutamyl bonds at positions 11,15 and 90
were not cleaved by the endoproteinase Glu-C in the crosslinked
protein. In die same conditions of digestion, the glutamyl bonds
at positions 11 and 90 were not cleaved eidier when protein MCI
is not crosslinked. We previously observed a lack of cleavage
of glutamyl bonds in the course of die determination of die
sequence of protein MCI variants from M.soehngenii (10). It
is not however clear at present if die lack of cleavage at position
15, and die partial cleavage at position 49 result from a particular
structure of die crosslinked protein, or from a steric hindrance
due to DNA crosslinking.
Identification of the peptides bound to DNA (Fig. 2,
pathway 2)
Free native MCI protein is cleaved wim die endoproteinase Lys-C
at every potential sites. In contrast, when complexed to DNA,
protein MCI remains resistant to attack by die protease.
Nevertheless, this resistance was completely removed in die
presence of 4.7 M urea. These conditions were used for die
crosslinked protein.
After proteolysis, die peptides crosslinked to DNA were
separated from free peptides by chromatography on die ACA
34 column. The crosslinked DNA-peptide(s), purified as
pMoles of PTH derivatives of amino-aclds
1000/
20
30
Elution volume (ml)
N
RP
•7 0>
-I-
B
100.
E
|lVDAPINBPAWUPEII5tPFVIIE|
64
87
Figure 3. A: Reversed phase HPLC elution pattern of the peptides generated
by endoproteinase Glu-C digestion of protein MCI crosslinked to DNA. Full
line, absorbance at 220 nm (left scale). Dashed line, acetonitrile concentration
(right scale). Flow rate: 1 rnl/min. B: Schematic drawing of the non-crosslinked
peptides generated by the cleavage of protein MCI crosslinked to DNA with
endoproteinase Glu-C. The one-letter sequence of the missing part of the protein
is shown.
1
2
3
4
S
6
7
8
9
10 11 12 13 U
15 14 17 18
Cycle n u m b e r .
Figure 4. Edman degradation of crosslinked peptides (700 pmoles) generated
by cleavage of crosslinked protein MCI with endoproteinase Lys-C. PTH
derivatives of amino-acids were identified and quantified by HPLC.
4940 Nucleic Acids Research, Vol. 19, No. 18
1000
Cumulated pMolet oJ PTH-amlno-aclds
100:
E I V D A P I N E P A W M P E t l S t P F V I I E
05
70
76
80
86
Amlno-acld
Figure 5. Same data as Fig. 4, but the two overlapping peptides have been phased
along the sequence of the protein and the amounts of PTH derivatives have been
cumulated. Arrows indicate the non-identified amino-acids.
described in the Material and Methods section, were submitted
to micro-sequencing.
By comparison with the sequence of protein MCI (8), the
results obtained over 18 cycles of automated Edman degradation
(Fig. 4) show that two peptides were simultaneously sequenced.
The major peptide began at Asp 70 and was sequenced as far
as Glu 87. The minor one began at Glu 63 and was sequenced
as far as Glu 77. The repetitive yield of the major peptide is lower
than that of the minor peptide; however, on the basis of the
amounts of the PTH derivatives obtained in the three first steps
of the Edman degradation, the major peptide is at least 2.5 times
more abundant than the minor one.
The diagram showing the cumulated picomoles of PTH
derivatives of amino-acids identified during the sequence analysis
is presented in Fig. 5.
DISCUSSION
Photochemical crosslinking between a protein and DNA can
occur only when bromouracil and a crosslinkable amino-acid are
present in a favourable conformation in the interaction site. The
presence of a crosslink reveals the proximity of residues, but its
absence gives no information. In other words, crosslinking is
a positive, but not an exclusive result. We used DNA where 75 %
of the thymines were substituted by BrUra, i.e. about one fifth
of the total bases are photoreactive residues. As the protein MCI
does not recognize a specific DNA sequence, the crosslinking
yield is limited by the percentage substitution of the DNA. This
explains the observed value of 20%.
Micro-sequencing results (Fig. 5), together with the
identification of the non-crosslinked peptides (Fig. 3B),
unambiguously show that the region interacting with DNA is
located within residues 6 9 - 8 7 . Sequencing of peptides linked
to DNA shows that two overlapping peptides remain bound: the
peptides 6 3 - 7 6 (minor peptide) and 70 - 87 (major peptide). The
good yields of PTH-amino-acids shows that no crosslink occurred
in the sequence 6 3 - 7 3 (Fig. 5). Therefore, we can infer that
recovery of the minor peptide results from a partial cleavage of
the lysine bond at position 69. This can be explained by its
restricted accessibility due to the steric hindrance of the
crosslinked DNA. Such a restricted accessibility also explains
lack of cleavage at lysine residues 78, 81, 85 and 86.
Lack of identification of Serine 80 could result from its
degradation after eleven cycles of automatic sequencing.
However, the low yield of lysine 78, and the lack of identification
of lysines 81, 85 and 86 cannot be explained at present. Lysine
residues can be crosslinked to DNA, but simultaneous
involvement of these residues in the crosslinking reaction seems
very unlikely, since it would need the presence of BrUra clusters.
There are probably several pathways to crosslink BrUrasubstituted DNA and proteins. Model systems using amino-acids
or peptides have been studied, including cysteine or cystine
(21,22), tryptophan, tyrosine and histidine (23-26), lysine (25)
and even the peptide bond (27). Among these putative
crosslinking sites, tryptophan residue is one of the more reactive,
and can be easily crosslinked to BrUra. In the case of protein
MCI, tryptophan 74 which occupies positions 5 and 12 in the
major and minor peptides respectively, was not identified during
sequencing of crosslinked peptides (Fig. 4). This strongly
suggests the involvement of tryptophan 74 in the crosslinking
reaction. Moreover, when protein MCI is complexed to DNA,
the fluorescence in the tryptophan band decreases by 20%,
without any change in the shape of the spectrum (Culard, personal
communication). Such a result has already been observed with
proteins or peptides, whose tryptophan residues interact directly
with DNA (28,29).
The interacting sequence 69-87 contains one arginine and five
lysine residues, which could give electrostatic interactions with
DNA. It does not correspond to the most basic region of the
protein, which is located in the middle part of the protein (residues
46—55), where half of the residues are arginine or lysine. The
interacting sequence contains 10 highly conserved residues found
in every MCI protein from Methanosarcinaceae studied up to
now (10). Among these residues are the three lysines at positions
81, 85 and 86 and three prolines at position 72, 76 and 82.
According to secondary structure predictions (10), prolines 72
and 76 may form adjacent beta-turns, and thus may play a role
in the conformation of this region (30). The high conservation
of these residues appears to be critical for the DNA binding ability
of protein MC1.
The DNA binding region of protein MCI differs from the
helix-turn-helix motif encountered in several bacterial DNAbinding proteins (31). On the other hand, this region is
reminiscent to the DNA-binding region of protein HU, an
abundant chromosomal protein in eubacteria (1,2). The
B.stearothermophilus protein HU interacts with DNA through
an external arm consisting of beta-sheet strands separated by betatums (32). The hydrophobic clusters analysis method (33) shows
that the interacting region of protein MC 1 displays hydrophobic
clusters with shape and orientation characteristic of beta-sheet
ribbons, flanked by the highly conserved proline residues cited
above (18). This suggests that as protein HU, protein MCI
interacts with DNA through beta-sheet structures separated by
beta-turns.
ACKNOWLEDGEMENTS
The authors are indebted to J.P.Touzel, from the Station de
Technologie Alimentaire (INRA, Villeneuve d'Ascq) for
providing the bacteria. We acknowledge the technical assistance
of A.Lemaire-Poitau. We thank J.Rossier, J-P.Le Caer, and
F.Culard for helpfull discussions, and communication of
preliminary results. This work was partially supported by grants
from the CNRS, the University de Lille II, and the Pdle des
Anairobies de la re'gion Nord-Pas de Calais.
Nucleic Acids Research, Vol. 19, No. 18 4941
REFERENCES
1. Driica, K. and Rouviere-Yaniv, J. (1987) Microbiol. Re\iews, 51, 301 -319.
2. Dijk, J. and Reinhardt, R. (1986) In Guakrzi, C O . and Pon, C.L. (eds.)
Bacterial Chromatin. Springer-Verlag. pp 186-218.
3. Schmid, M.B. (1990) Cell, 63, 461-451.
4. Chartier, F., Laine, B., Sautiere, P., Touzel, J-P. and Albagnac, G. (1985)
FEBS Lett., 183, 119-123.
5. Chartier, F., Laine, B. and Sautiere, P. (1988) Biochim. Biophys. Acta, 951,
149-156.
6. Imbert, M., Laine, B., Prensier, G., Touzel, J-P. and Sautiere, P. (1988)
Can. J. Microbiol.. 34, 931-937.
7. Laine, B., Culard, F., Maurizot, J-C. and Sautiere, P. Nucleic Acids Research,
in press.
8. Chartier, F., Laine, B., Belalche, D., Touzel, J-P. and Sautiere, P. (1989)
Biochim. Biophys. Acta, 1008, 309-314.
9. Laine, B., Chartier, F., Imbert, M., Lewis, R. and Sautiere, P. (1986) Eur.
7. Biochem., 161, 681-687.
10. Chartier, F., Laine, B., Belalche, D. and Sautiere, P. (1989)7. Biol. Chem.,
264, 17006-17015.
11. Laine, B., Chartier, F., Imbert, M. and Sautiere, P. (1990) In Belaich, J.P..
Bruschi, M. and Garcia, J.L. (eds.) Microbiology and Biochemistry of strict
anaerobes involved in interspecies hydrogen transfer. Plenum Press, N.York,
pp 291-301.
12. Weintraub, H. (1973) Cold Spring Harbor Symposium Quant. Biol., 38,
247-256.
13. Lin, S.Y. and Riggs, A.D. (1974)/>roc. Noll. Acad. Sci. USA, 71, 947-951.
14. Barbier, B., Charlier, M. and Maurizot, J-C. (1984) Biochemistry, 23.
2933-2939.
15. Wolfes, H., Fliess, A., Winkler. F. and Pingoud, A. (1986) Eur. 7. Biochem.,
159, 267-273.
16. Katouzian-Safadi, M., Blazy, B. and Charlier, M. (1991) Photochem.
Photobiol. 53, 611 -616.
17. Hackett, P., and Hanawalt, P. (1966) Biochim.Biophys. Ada, 123, 356-363.
18. Imbert, M., Laine, B., Helbecque, N., Mornon, J-P., Hinichart, J-P. and
Sautiere, P. (1990) Biochim. Biophys. Acta, 1038, 346-354.
19. Le Caer, J.P. and Rossrer, J. (1988) Anal.Biochem., 169, 246-252.
20. Pandey, V.H. and Modak, M.J. (1988)7. Chromalography, 444, 133-139.
21. Varghese, A.J. (1974) Photochem. Photobiol., 20, 461-464.
22. Dietz, T.M. and Koch, T.H. (1989) Photochem. Photobiol., 49, 121-129.
23. Ito, S., Saito, I. and Matsuura, T. (1980) 7. Amer. Chem. Soc., 102,
7535-7541.
24. Saito, I., Ito, S., Matsuura, T. and Helene, C. (1981) Photochem. Photobiol.,
33, 15-19.
25 Saito, I. and Matsuura, T. (1985) Ace. Chem. Res., 18, 134- 141.
26. Dietz, T.M. and Koch, T.H. (1987) Photochem. Photobiol., 46, 971 -978.
27. Dietz, T.M., von Trebra, R.J., Swanson, B.J. and Koch, T.H. (1987) 7.
Amer. Chem. Soc., 109, 1793-1797.
28. Helene, C , Brun, F. and Yaniv, M. (1971)7. Mol. Biol., 58, 349-365.
29. Brun, F., Toulme\ J-J. and Helene, C. (1975) Biochemistry, 14, 558-563.
30. Kolodrubetz, D. (1990) Nucleic Acids Research, 18, 5565.
31. Weber, I.T., McKay, D.B. and Steitz, T.A. (1982) Nucleic Acids Research,
10, 5085-5102.
32. White, S.W., Appelt, K., Wilson, K.S. and Tanaka, I. (1989) Proteins, 5,
281-288.
33. Gaboriaud, C . Bissery, V., Benchetrit, T. and Momon, J P. (1987) FEBS
Lett., 224, 149-155.