Molecular Mechanisms and Manipulation in

Independent Meeting held at the University of Nottingham, 30–31 January 2003. Edited by T. Allers and E. Bolt (Nottingham).
Initiation of archaeal DNA replication
E.R. Jenkinson and J.P.J. Chong1
Centre for Extremophile Research, Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath BA2 7AY, U.K.
Abstract
The identification of DNA as the genetic material and the elucidation of its structure by Watson and Crick
[Watson and Crick, (1953) Nature (London) 171, 737–738], which has its 50th anniversary this year, first
suggested the simple elegance with which the problem of passing on precise genetic information from
one generation to the next could be solved. Semi-conservative replication is perhaps one of the simplest
biological concepts to explain and understand. However, despite an enormous amount of effort in the
intervening years, details of the way in which this process is regulated and performed are still unclear in
many organisms. Recent work suggests that, due to their simplicity, the Archaea may make a good model
for understanding some of the aspects of eukaryotic replication that still elude us.
An overview of replication
Replication is an essential process in both prokaryotic and
eukaryotic organisms that must occur in order for cells to
divide and proliferate. To ensure the integrity of the genome
for future generations, the duplication of the genetic material
has to be tightly controlled, as alterations in chromosome
number result for the most part in decreased cell viability. In
all organisms, replication must occur from a defined point
on the chromosome: the origin sequence. In Escherichia
coli the circular chromosome consists of a single replicon
(a region of DNA replicated by the action of a single pair
of replication forks), and bi-directional replication proceeds
from a single origin (oriC). In eukaryotes, many origins of
replication (10 000–100 000) are required in order to make
each replicon a manageable size. This is due to the sheer
quantity of DNA to be copied and further necessitated
by the packaging of the DNA into multiple chromosomes.
Having multiple origins on multiple chromosomes massively
increases the complexity of the DNA-replication task, and
so eukaryotic cells ensure each replicon is replicated only
once by a process of replication licensing, which functions to
maintain the correct gene copy number in daughter cells.
All organisms use initiator proteins to recognize and
begin replication at origins of replication. In E. coli, the
Key words: DNA replication, initiation, mini-chromosome maintenance (MCM).
Abbreviations used: pre-RC, pre-replicative complex: ORC, origin-recognition complex; MCM,
mini-chromosome maintenance; SSB, single-stranded DNA-binding protein.
1
To whom correspondence should be addressed (e-mail [email protected]).
widely studied DnaA protein is responsible for origin recognition [1–3]. The subsequent loading of the DnaB helicase by interaction with DnaC [4], and activation of the
helicase to initiate unwinding of the duplex, allows entry
of the DNA polymerase machinery so that replication can
proceed. Initiator proteins are also required in eukaryotes
but the formation of a pre-replicative complex (pre-RC)
is essential before the firing of origins can occur [5].
The origin-recognition complex (ORC) is a multisubunit
protein complex responsible for binding to and marking
the origin sequences in all eukaryotes [6–8]. During the
G1 phase of the cell cycle, ORC also functions to
recruit Cdc6 and Cdt1 proteins. Controlling the location
and availability of these proteins has a significant role
in determining the once-per-cell-cycle rule of replication
[9–13]. The most important function of these proteins is the
recruitment of the mini-chromosome maintenance (MCM)
complex to the origin [14], suggesting that they play a
role equivalent to bacterial DnaC. Loading of the MCM
proteins is the final step in establishing the pre-RC and
subsequent signals from cyclin-dependent kinases and Dbfdependent kinases activate the licensed origin for initiation
of replication. DNA replication systems are illustrated in
Figure 1.
Molecular Mechanisms and Manipulation in Archaea Independent Meeting
Molecular Mechanisms and
Manipulation in Archaea
Archaea as a model system
The Archaea have been identified as the ‘third domain
of life’ [15]. They are divided into three main phyla, the
C 2003
Biochemical Society
669
670
Biochemical Society Transactions (2003) Volume 31, part 3
Figure 1 DNA replication in the three domains of life
In each example, replication proceeds bi-directionally from single (prokaryotes, Archaea) or multiple (eukaryotes)
origin sequences. Origins are recognized by specific sequences in prokaryotes. Less is known about the eukaryal origin
sequences although consensus elements are found in Saccharomyces cerevisiae and to a certain extent in Schizosaccharomyces pombe. In each case, initiation requires loading of a helicase protein to unwind the double helix: DnaB in
prokaryotes, MCM complexes in eukaryotes and Archaea. CDK, cyclin-dependent kinase; DDK, Dbf-dependent
kinase.
crenarchaeota, the euryarchaeota and the korarchaeota
(with the recent addition of a fourth group named the
nanoarchaeota on the basis of their size [16]). The Archaea
have a prokaryotic cell morphology showing an absence
of nuclear structures, cytoskeletons and organelles [17,18].
Evidence from studied species points to a prokaryote-like
genome arrangement and it has recently become evident
for some species that a single bi-directional origin of
replication is employed [19]. Sequence information from
several archaeal genomes suggests a degree of homology
between eukaryotic transcription, translation and replication
proteins while most metabolic processes are comparable
with prokaryotes [17]. Many euryarchaeal species also
contain homologues to the eukaryotic histone proteins,
present in eukaryotes to organize chromatin into higherorder structures but normally absent from prokaryotic cells
[20,21]. The Archaea thus pose an interesting evolutionary
question as regards their phylogenetic relationship with
the other two domains of life [15,17]. Archaea provide us
with a simplified model containing a more primitive protein
network that may aid in unravelling the complex replication
mechanisms observed in eukaryotes. This situation can
be exploited to try to understand the processes of DNA
replication in eukaryotes. In particular, studies on two
sequenced Archaea, the euryarchaeon Methanothermobacter
thermautotrophicus and the crenarchaeon Sulfolobus solfataricus, are helping to shed light on the involvement of
the MCM complex in pre-RC formation and replication
initiation.
C 2003
Biochemical Society
Pre-RC formation and replication licensing
in Archaea
The process of replication appears to be conserved between
Archaea and eukaryotes with some significant differences.
The lack of an obvious ORC is the first major difference
in Archaea. In every sequenced archaeal species to date,
homologues to the ORC proteins are absent. However, most
Archaea (except Methanococcus jannaschii and Methanopyrus
kandleri) have one or more Cdc6 homologues. The
precise number of putative Cdc6s differs between species;
for example M. thermautotrophicus contains two copies,
Pyrococcus species contain one, the Sulpholobales contain
three and Halobacterium sp. has up to 13 genes. This may
have evolutionary implications, with the Cdc6 protein being
responsible for multiple roles within the archaeal pre-RC
formation. Cdc6 and some ORC subunits do share a certain
degree of homology, including some structural homology
with a conserved winged helix domain in both Cdc6 and
Orc1 [22]. Thus Cdc6 may play a dual role in both origin
recognition and subsequent loading of the MCM proteins.
It should be noted that the archaeal Cdc6 has structural
similarity to the bacterial origin-recognizing protein, DnaA
[23].
The second fundamental difference between eukaryotes
and Archaea is the lack of any Cdt1 homologue, responsible
for loading eukaryotic MCMs at the pre-RC. However, the
sequence of Cdt1 in eukaryotes appears highly divergent,
hence the Saccharomyces cerevisiae homologue has only
Molecular Mechanisms and Manipulation in Archaea
Figure 2 Schematic diagram of motif arrangements in MCM proteins
The core catalytic complex is made up of MCMs 4/6/7. This sub-complex possesses helicase activity in vitro. The regulatory
subunits (Reg s.u and Reg Dimer) MCM2 and MCM3/5 abolish this activity in vitro. MCMs 2, 4, 6 and 7 possess zinc-finger
domains which can also be identified in some archaeal MCMs. NLS, nuclear localization sequence.
recently been identified [11,24,25]. Before Cdt1 was identified
in this simple eukaryote, it was speculated that the absence
of a Cdt1 implied that its function was only required
in those eukaryotes with more complex origins, such as
Schizosaccharomyces pombe and metazoans [11,24]. Since the
identification of Cdt1 in the simplest of eukaryotes, it may
only be a matter of time before an archaeal homologue of
Cdt1 is discovered.
The conservation of the MCM helicase proteins, which
are probably required for DNA unwinding and replicationfork progression is at present of most interest. Much of
the biochemical information we have relating to archaeal
replication concerns this protein; these data demonstrate
activities consistent with those found in eukaryotic MCM
complexes. This substantiates the use of the Archaea as a
model system for understanding the molecular mechanisms
of DNA unwinding in eukaryotes.
The MCM helicase
Helicases are an integral part of many cellular processes
and defects in helicase activity have been linked to various
cancer-prone conditions and aging disorders [26]. The precise
function of a helicase is the coupling of NTP hydrolysis to
nucleic-acid unwinding by using the chemical energy derived
from the hydrolysis activity to catalyse the separation of
the DNA and/or RNA duplex [26]. All helicase enzymes
contain a series of highly conserved protein sequences, the
‘helicase motifs’, which have been shown to be essential in
the functioning of this class of enzymes [27]. Helicases can
be divided into four distinct superfamilies. The only motifs
common to all superfamilies are motifs I and II, the Walker A
and B motifs that are involved in NTP binding and hydrolysis
respectively. Helicases have been described that differ in
both the numbers of subunits that make up an active protein
and the mechanism of action by which they work. Monomeric, dimeric and hexameric helicase complexes have all
been described [27]. Models described for mechanisms based
on varying subunit compositions include the ‘inchworm’
model for monomeric helicases and a ‘rolling’ method for
dimers [27]. Various models have been proposed in the case of
hexameric helicases due to the apparent topological problems
in loading a ring-shaped complex on to nucleic acid. These
include disassembly and reassembly of the hexamer around
the substrate, and a threading of the free substrate ends
into the central cavity of the helicase [26]. Experimental
evidence would suggest that the most likely model is of a
facilitated ‘ring opening’ step prior to loading. This is seen
for both the DnaB helicase and the T7gp4A helicase-primase
hexamer [26,28,29].
The MCMs were first described in yeast by a genetic
screen that identified genes containing mutations that resulted
in strains that were unable to maintain mini-chromosomes,
indicating a role in DNA replication [30]. There are six
MCMs in all eukaryotes, MCMs 2–7 [31,32] (see Figure 2).
It is believed that a heterohexameric MCM complex is
responsible for initiating unwinding at the origin and also for
elongation of the replication fork by processive unwinding
[33]. In addition, at least one MCM homologue has been
identified in all sequenced archaeal genomes. A single copy
C 2003
Biochemical Society
671
672
Biochemical Society Transactions (2003) Volume 31, part 3
is found in most Archaea, with notable exceptions being
M. jannaschii, which has four, and Methanosarcina acetivorans and M. kandleri, both of which have two annotated
genome copies, although only one gene in each case contains
functional Walker motifs (J.P.J. Chong, unpublished work).
All MCMs share a great deal of homology, particularly
in the central domain of the protein, which spans some
200 amino acids and includes the NTP-binding and hydrolysis domains [27].
MCM complex formation
In addition to the pre-dominant heterohexamer containing
MCMs 2–7, the eukaryal MCMs can form a series of smaller
sub-complexes [34,35]. The heterohexamer is probably the
species recruited to the origin. However, in vitro the only
helicase activity associated with eukaryotic MCMs has been
found in a double trimer of MCMs 4/6/7; the heterohexamer
itself exhibits no detectable activity [36–38]. The MCM4/6/7
double trimer has a 3 →5 helicase activity in the presence
of ATP and dATP, but in vitro processivity is limited to
unwinding some 30 bp of duplex [37]. Processivity is greatly
enhanced by the addition of single-stranded DNA-binding
protein (SSB) and the use of forked substrates where up
to 600 bp of duplex may be unwound [39]. Other subcomplexes that have been identified include a dimer of
MCMs 3 and 5, which appears to function as a regulatory
subunit, inhibiting the helicase activity of the MCM4/6/7
double trimer [36,40]. Similarly, formation of a MCM2/4/6/7
tetramer abolishes helicase activity, indicating a regulatory
role for the MCM2 monomer [37]. One model that would
explain this data is that an MCM4/6/7 catalytic core is
regulated by both the MCM2 monomer and the peripheral
MCM3/5 dimer, with helicase activity requiring the removal
of both MCM2 and MCM3/5 once the hexamer has loaded
on to DNA. On the other hand, data regarding degron
mutants of each of the MCMs in S. cerevisiae demonstrate
a requirement for all six MCMs before, during and after
initiation steps [41]. Whether the in vitro helicase activity
is representative of the in vivo MCM activity and the exact
in vivo composition of the functional complex is a subject for
further investigation.
All the archaeal MCM homologues are more closely related
to eukaryotic MCM4 than any of the other MCMs. Archaeal
MCMs form complexes consistent with homohexamers.
In the case of S. solfataricus, the active species has been
shown to consist of a single homohexamer [42]. The
M. thermautotrophicus MCM protein has been more widely
studied and apparently forms a homododecameric structure
[43–45], although recent electron microscopy evidence points
to the possible formation of a seven-membered ring [46].
Studies of this complex have led to the widely held belief
that despite the weak eukaryal helicase activity, the MCM
complex is in fact responsible for DNA unwinding during
replication. The M. thermautotrophicus MCM complex has
been shown to possess ATP hydrolysis activity stimulated
by the presence of DNA. Mutations of the conserved lysine
C 2003
Biochemical Society
residue in the Walker A motif abolish this activity [43,45]. The
M. thermautotrophicus MCM complex has also been shown
to display processive 3 → 5 helicase activity, displacing
up to 500 bp of DNA in vitro. This activity requires the
presence of ATP or dATP but no other additional co-factors
[43–45]. In contrast, studies of the S. solfataricus MCM resulted in an ATP-dependent 3 → 5 helicase activity that was
stimulated greatly by the addition of an archaeal SSB. Further
characterizations revealed a physical association between SSB
and the S. solfataricus MCM hexamer [42]. Furthermore,
ATPase activity of the S. solfataricus MCM complex was not
stimulated by the presence of DNA [42], an observation in
contrast with that seen for other helicase and MCM enzymes.
These differences may reflect a modulation of activity in
the crenarchaeota as opposed to eukaryotes and the euryarchaeotes.
A role for the zinc-finger motif in
MCM function
Zinc-finger motifs have been identified in many proteins
requiring DNA or protein contacts. They have been found
in both eukaryotic and archaeal MCM proteins. In particular,
the zinc-finger motif is highly conserved between yeast and
higher-eukaryotic MCMs 2, 4, 6 and 7, suggesting that
this motif may play an important role in the functioning
of these proteins. Mutation of the zinc-finger motif of
MCM4 resulted in disruption of the MCM4/6/7 trimer–
trimer interactions in the active MCM4/6/7 hexamer and
resulted in single trimeric complexes [47]. This suggests
that the zinc-finger motif in MCM4 is required for stable
hexameric subunit interactions. The MCM2 zinc finger has
been implicated in protein–DNA contacts. Phosphorylation
of this subunit in the pre-RC may cause conformational
changes and could be responsible for releasing the MCM
complex from the origin and allowing replication to proceed
[48]. The M. thermautotrophicus MCM protein also contains
the zinc-finger motif that is found within a non-conserved
region of the eukaryotic MCM sequence. This motif is also
seen in the Archaeoglobus fulgidus and the S. solfataricus
MCMs but in only one of the four M. jannaschii MCM
sequences [49]. The archaeal zinc finger is comparable with
that of eukaryotic MCM2 (Cys-Xaa2 -Cys-Xaan -Cys-Xaa2 Cys). This zinc finger is apparently dispensable for the
M. thermautotrophicus MCM protein–protein interactions
and complex formation, as mutations in this motif in
M. thermautotrophicus MCM do not affect multimerization
[50]. However, a defect in Zn2+ -binding ability affects DNAdependent ATPase activity, single-stranded-DNA-binding
ability and helicase activity, indicating a role for this motif
in MCM function [50].
Are the Archaea a good model system for
understanding replication in eukaryotes?
The molecular mechanisms of the proteins required for
initiation of DNA replication, in particular the biochemistry
of the MCM complex, appear to be conserved between
Molecular Mechanisms and Manipulation in Archaea
Archaea and eukaryotes. Both domains of life contain MCM
and Cdc6 homologues and strong similarities exist between
eukaryotic and archaeal MCM activities in vitro. In addition,
although not discussed here, many of the archaeal proteins
likely to be involved in replication-fork progression also share
homology with their eukaryotic equivalents. Regulation of
the timing and activation of DNA replication is likely
to be under an entirely different form of control in the
Archaea. In eukaryotes, an additional ‘layer’ of regulation
controls the firing of origins possessing assembled pre-RCs.
This regulation involves Dbf-dependent kinase and cyclindependent kinase activity, two classes of enzyme for which no
archaeal homologues have yet been identified. An interesting
question therefore is how the replication competency of
an archaeal cell is controlled. Modulation of proteins by
post-translational modifications such as phosphorylation
or adenylation may play a role. Many of these control
mechanisms are yet to be understood and the identification
of more proteins specific to archaeal DNA replication will no
doubt follow.
We thank Dr Alan Majernı́k and Dr Setareh Sepehri Chong for critical
reading of this review. J.P.J.C. is a Biotechnology and Biological
Sciences Research Council David Phillips Research Fellow.
References
1 Sutton, M.D. and Kaguni, J.M. (1997) J. Mol. Biol. 274, 546–561
2 Speck, C. and Messer, W. (2001) EMBO J. 20, 1469–1476
3 Sutton, M.D., Carr, K.M., Vicente, M. and Kaguni, J.M. (1998)
J. Biol. Chem. 273, 34255–34262
4 Davey, M.J., Fang, L., McInerney, P., Georgescu, R.E. and O’Donnell, M.
(2002) EMBO J. 21, 3148–3159
5 Diffley, J.F., Cocker, J.H., Dowell, S.J., Harwood, J. and Rowley, A. (1995)
J. Cell Sci. Suppl. 19, 67–72
6 Liang, C., Weinreich, M. and Stillman, B. (1995) Cell 81, 667–676
7 Bell, S.P. and Stillman, B. (1992) Nature (London) 357, 128–134
8 Natale, D.A., Li, C.J., Sun, W.H. and DePamphilis, M.L. (2000) EMBO J. 19,
2728–2738
9 Drury, L.S., Perkins, G. and Diffley, J.F. (1997) EMBO J. 16, 5966–5976
10 Wohlschlegel, J.A., Dwyer, B.T., Dhar, S.K., Cvetic, C., Walter, J.C. and
Dutta, A. (2000) Science 290, 2309–2312
11 Tanaka, S. and Diffley, J.F. (2002) Nat. Cell Biol. 4, 198–207
12 Hofmann, J.F. and Beach, D. (1994) EMBO J. 13, 425–434
13 Nishitani, H., Lygerou, Z., Nishimoto, T. and Nurse, P. (2000) Nature
(London) 404, 625–628
14 Donovan, S., Harwood, J., Drury, L.S. and Diffley, J.F. (1997) Proc. Natl.
Acad. Sci. U.S.A. 94, 5611–5616
15 Woese, C.R., Kandler, O. and Wheelis, M.L. (1990) Proc. Natl. Acad.
Sci. U.S.A. 87, 4576–4579
16 Huber, H., Hohn, M.J., Rachel, R., Fuchs, T., Wimmer, V.C. and Stetter, K.O.
(2002) Nature (London) 417, 63–67
17 Edgell, D.R. and Doolittle, W.F. (1997) Cell 89, 995–998
18 Olsen, G.J. and Woese, C.R. (1997) Cell 89, 991–994
19 Myllykallio, H., Lopez, P., Lopez-Garcia, P., Heilig, R., Saurin, W.,
Zivanovic, Y., Philippe, H. and Forterre, P. (2000) Science 288,
2212–2215
20 Reeve, J.N., Sandman, K. and Daniels, C.J. (1997) Cell 89, 999–1002
21 Pereira, S.L., Grayling, R.A., Lurz, R. and Reeve, J.N. (1997) Proc. Natl.
Acad. Sci. U.S.A. 94, 12633–12637
22 Liu, J., Smith, C.L., DeRyckere, D., DeAngelis, K., Martin, G.S. and
Berger, J.M. (2000) Mol. Cell 6, 637–648
23 Erzberger, J.P., Pirruccello, M.M. and Berger, J.M. (2002) EMBO J. 21,
4763–4773
24 Devault, A., Vallen, E.A., Yuan, T., Green, S., Bensimon, A. and Schwob, E.
(2002) Curr. Biol. 12, 689–694
25 Maiorano, D. and Mechali, M. (2002) Nat. Cell Biol. 4, E58–E59
26 Patel, S.S. and Picha, K.M. (2000) Annu. Rev. Biochem. 69, 651–697
27 Hall, M.C. and Matson, S.W. (1999) Mol. Microbiol. 34, 867–877
28 Jezewska, M.J., Rajendran, S., Bujalowska, D. and Bujalowski, W. (1998)
J. Biol. Chem. 273, 10515–10529
29 Ahnert, P., Picha, K.M. and Patel, S.S. (2000) EMBO J. 19, 3418–3427
30 Maine, G.T., Sinha, P. and Tye, B.K. (1984) Genetics 106, 365–385
31 Kearsey, S.E. and Labib, K. (1998) Biochim. Biophys. Acta 1398,
113–136
32 Chong, J.P., Thommes, P. and Blow, J.J. (1996) Trends Biochem. Sci. 21,
102–106
33 Aparicio, O.M., Weinstein, D.M. and Bell, S.P. (1997) Cell 91, 59–69
34 Prokhorova, T.A. and Blow, J.J. (2000) J. Biol. Chem. 275, 2491–2498
35 Tye, B.K. and Sawyer, S. (2000) J. Biol. Chem. 275, 34833–34836
36 Lee, J.K. and Hurwitz, J. (2000) J. Biol. Chem. 275, 18871–18878
37 Ishimi, Y. (1997) J. Biol. Chem. 272, 24508–24513
38 Schwacha, A. and Bell, S.P. (2001) Mol. Cell 8, 1093–1104
39 Lee, J.K. and Hurwitz, J. (2001) Proc. Natl. Acad. Sci. U.S.A. 98, 54–59
40 Sherman, D.A. and Forsburg, S.L. (1998) Nucleic Acids Res. 26,
3955–3960
41 Labib, K., Tercero, J.A. and Diffley, J.F. (2000) Science 288, 1643–1647
42 Carpentieri, F., De Felice, M., De Falco, M., Rossi, M. and Pisani, F.M.
(2002) J. Biol. Chem. 277, 12118–12127
43 Chong, J.P., Hayashi, M.K., Simon, M.N., Xu, R.M. and Stillman, B. (2000)
Proc. Natl. Acad. Sci. U.S.A. 97, 1530–1535
44 Kelman, Z., Lee, J.K. and Hurwitz, J. (1999) Proc. Natl. Acad. Sci. U.S.A.
96, 14783–14788
45 Shechter, D.F., Ying, C.Y. and Gautier, J. (2000) J. Biol. Chem. 275,
15049–15059
46 Yu, X., VanLoock, M.S., Poplawski, A., Kelman, Z., Xiang, T., Tye, B.K. and
Egelman, E.H. (2002) EMBO Rep. 3, 792–797
47 You, Z., Ishimi, Y., Masai, H. and Hanaoka, F. (2002) J. Biol. Chem. 277,
42471–42479
48 Ishimi, Y., Komamura-Kohno, Y., Arai, K. and Masai, H. (2001)
J. Biol. Chem. 276, 42744–42752
49 Tye, B.K. (1999) Annu. Rev. Biochem. 68, 649–686
50 Poplawski, A., Grabowski, B., Long, S.E. and Kelman, Z. (2001)
J. Biol. Chem. 276, 49371–49377
Received 21 January 2003
C 2003
Biochemical Society
673