ATrich region and repeated sequences the essential elements of

REVIEW ARTICLE
AT-rich region and repeated sequences – the essential elements
of replication origins of bacterial replicons
Magdalena Rajewska, Katarzyna Wegrzyn & Igor Konieczny
Department of Molecular and Cellular Biology, Intercollegiate Faculty of Biotechnology, University of Gdansk, Gdansk, Poland
Correspondence: Igor Konieczny,
Department of Molecular and Cellular
Biology, Intercollegiate Faculty of
Biotechnology, University of Gdansk, Kladki
24, 80-822 Gdansk, Poland. Tel.: +48
585 236 365; fax: +48 585 236 427;
e-mail: [email protected]
Received 15 March 2011; accepted 7 July
2011. Final version published online 25
August 2011.
DOI: 10.1111/j.1574-6976.2011.00300.x
Editor: Grzegorz Wegrzyn
MICROBIOLOGY REVIEWS
Keywords
replication origin; 13-mers; DUE; Rep; DnaA.
Abstract
Repeated sequences are commonly present in the sites for DNA replication initiation in bacterial, archaeal, and eukaryotic replicons. Those motifs are usually
the binding places for replication initiation proteins or replication regulatory
factors. In prokaryotic replication origins, the most abundant repeated
sequences are DnaA boxes which are the binding sites for chromosomal replication initiation protein DnaA, iterons which bind plasmid or phage DNA
replication initiators, defined motifs for site-specific DNA methylation, and
13-nucleotide-long motifs of a not too well-characterized function, which are
present within a specific region of replication origin containing higher than
average content of adenine and thymine residues. In this review, we specify
methods allowing identification of a replication origin, basing on the localization of an AT-rich region and the arrangement of the origin’s structural
elements. We describe the regularity of the position and structure of the
AT-rich regions in bacterial chromosomes and plasmids. The importance of
13-nucleotide-long repeats present at the AT-rich region, as well as other
motifs overlapping them, was pointed out to be essential for DNA replication
initiation including origin opening, helicase loading and replication complex
assembly. We also summarize the role of AT-rich region repeated sequences
for DNA replication regulation.
Introduction
Although it has been almost 50 years since the replicon
model was originally proposed by Jacob and Brenner
(1963), our understanding of the concept of the replication origin is still evolving. It is generally accepted that
the replication origin is considered to be a site within the
DNA particle where the replication is initiated; however,
the sequence specificity and structural requirements of
this important genetic element are not fully elucidated
and vary depending on the analyzed replicon. Although
the bacterial origin sequences are usually well defined, the
eukaryotic and in particular higher eukaryotic replication
initiation sites are not characterized so well. The identification of the replication origin is still difficult and
requires not only the analysis of the DNA sequences
in silico but also application of advanced experimental
approaches.
The replication origins of the bacterial chromosomes,
bacteriophages and plasmids, as well as certain eukaryotic
FEMS Microbiol Rev 36 (2012) 408–434
replicons, including the DNA viruses and Saccharomyces
cerevisiae, possess characteristic functional elements,
including specific binding sites for the appropriate replication initiation protein (Rep), binding sites for accessory
replication proteins, and regions with high content of adenine and thymine residues (AT-rich) (Bell & Dutta, 2002;
Konieczny, 2003; Breier et al., 2004; Mackiewicz et al.,
2004; Ozaki et al., 2006; Chew et al., 2007; Shaheen et al.,
2009). The distribution of DnaA-boxes which are the
binding sites for the bacterial replication initiation
protein DnaA and the dnaA gene location could be
applied for the identification of the bacterial chromosomal
replication origins (oriC) (Mackiewicz et al., 2004). Moreover, the asymmetry in nucleotide composition measured
as the normalized difference in the content of complementary nucleotides is considered to be the method of
identification of putative oriC in bacterial chromosomes
(Freeman et al., 1998; Grigoriev, 1998; McLean et al.,
1998; Salzberg et al., 1998; Mackiewicz et al., 1999a, b;
Rocha et al., 1999). It has been demonstrated that the
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
AT-rich region and the repeated sequences in DNA replication
DNA asymmetry is the most universal method but to
achieve a better prediction, it is proposed to apply it
together with the analysis of the distribution of DnaA
boxes and dnaA gene location (Mackiewicz et al., 2004).
Unfortunately, these characteristic features, i.e. a cluster
of DnaA boxes, a dnaA gene and a switch in the asymmetry, are neither typical of all bacterial chromosomes
(Mackiewicz et al., 2004) nor of other replicons such as
plasmids and bacteriophages. By contrast, the AT-rich
part of a replication origin is the most universally conserved structural element identified in both prokaryotic
and eukaryotic replicons (Bell & Dutta, 2002; Konieczny,
2003; Breier et al., 2004; Mackiewicz et al., 2004). This
distinct region forms the DNA unwinding element (DUE)
(Kowalski & Eddy, 1989) where during the process of replication initiation, the initial destabilization (opening) of
the DNA double helix takes place. The opening creates a
single-stranded DNA (ssDNA) which is indispensable for
the replication initiation of a double-stranded (dsDNA)
replicon. The AT-rich region is exactly the site where a
replication complex is formed and where the DNA synthesis is initiated. Surprisingly, our knowledge about the
AT-rich region is limited and very often the conventional
perception is that the only important feature of this part
of the replication origin is its higher than average number
of adenine and thymine residues. Recent data revealed,
however, that the sequence of the AT-rich region is of
great importance.
Localization of the AT-rich regions
within replication origin
The replication process starts in the AT-rich sites within
one, two or multiple origins, depending on the organism.
The vicinity of these specific regions is characteristic of
the particular replicon; however, there are several conserved motifs surrounding the AT-rich sequences that can
be recognized within different replication origins. To
determine the location of the individual regions at the
origin, identification of the replication origin itself is
required in the first place.
Methods for the identification of replication
origin
The identification of a replication origin requires application of a wide range of techniques from the prediction of
the origin sequence in silico to its in vivo and in vitro
confirmation. The simplest theoretical indicators of the
replication origin’s position in bacterial chromosomes
take the advantage of the differences in the GC and the
AT content of leading and lagging strands. The observed
deviation in nucleotides composition between two strands
FEMS Microbiol Rev 36 (2012) 408–434
409
is more likely the effect of different mutation rates connected with the architectural asymmetry at the replication
forks and the higher accuracy of leading strand replication (Mrazek & Karlin, 1998; Frank & Lobry, 2000; Tillier
& Collins, 2000). The relative nucleotides skews (e.g. CG
skew = (C
G)/(C + G)), which are normalized differences in the content of complementary nucleotides,
change the sign crossing replication origin (Lobry, 1996a,
b; Freeman et al., 1998; Grigoriev, 1998; McLean et al.,
1998; Salzberg et al., 1998). The observed asymmetry in
the nucleotides composition is used, for example, in the
Oriloc (Frank & Lobry, 2000) and the CG-software (Roten
et al., 2002) programs for origin prediction. These computational methods designed for bacterial origin identification can also be applied to predict the sites where the
replication starts in archaea, despite the fact that the
archaeal replication apparatus appears to be more closely
related to that in eukaryotes (Grigoriev, 1998; Lopez
et al., 1999; Myllykallio et al., 2000; Zhang & Zhang,
2002, 2005). However, the analysis of chromosome asymmetry only gives the approximate origin position within
the genome sequence, indicating the region with a higher
than average AT content. For a more efficient determination of the site where the replication starts, other origin
features, such as the dnaA gene and the position of
the DnaA-box motifs, should be taken into account
(Mackiewicz et al., 2004). Considering the location of the
gene for bacterial initiation replication protein DnaA is
even more justified because the dnaA was found in
almost all bacterial chromosomes (Mackiewicz et al.,
2004) and the origin for replication is usually located
upstream or downstream of the dnaA gene (Messer,
2002). Moreover, using the position of the sequences recognized by this protein, which were found within the replication origin in almost all analyzed prokaryotic genomes
(Mackiewicz et al., 2004), is equally justified. What is
interesting, as the dnaA gene in bacterial chromosomes is
located near the replication start point, similarly the
archaeal genes for replication initiation proteins Orc1/
Cdc6 are also found close to the origin and their position
could be used for origin prediction (Norais et al., 2007)
The parameters like the dnaA and DnaA-boxes’ position
or the location of indicator genes (hemE, gidA, dnaN,
hemB, repC, etc.) and dif sequence are applied in DORIC
database (Gao & Zhang, 2007) and ORI-FINDER web server
(Gao & Zhang, 2008). The DORIC database contains over a
thousand deposited sequences (http://tubic.tju.edu.cn/
doric/). However, it should be taken into account that
sometimes applying different parameters in the in silico
studies can generate different results. The analysis, presented by Mackiewicz and coauthors, of 112 complete
bacterial genome sequences, based on three parameters,
in case of 67 chromosomes indicates the same region as
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
410
replication origin; however, in case of 17 genomes each
parameter indicates a disparate region (Mackiewicz et al.,
2004). As the available programs usually base on the
Escherichia coli oriC features and in some bacteria, there
can occur substantial differences in DnaA-boxes’ sequences
like in Helicobacter pylori (Zawilak et al., 2003) and
Borrelia burgdorferi (Picardeau et al., 1999) or variations
in the dnaA gene position (Mushegian & Koonin, 1996;
Richter et al., 1998), therefore the prediction of the replication origin location cannot be approved indiscriminately
without experimental data. The conduction of experiments
is extremely important in case of organisms whose genomes contain more than one region where the replication
starts, as different methods could indicate various regions
as replication origins (Dijkwel & Hamlin, 1995; Pelizon
et al., 1996; Berquist & DasSarma, 2003; Lundgren et al.,
2004; Robinson et al., 2004; Zhang & Zhang, 2005; Coker
et al., 2009).
The first experiments leading to the determination of
bacterial origin relied on construction of minichromosomes and investigation of their ability to replicate in
bacterial cells (Meijer et al., 1979; Messer et al., 1979; Sugimoto et al., 1979; Oka et al., 1980). Then the in vivo
analysis was extended to in vitro experiments to confirm
the site where the replication starts using techniques such
as two-dimensional gel electrophoresis (Yee & Smith,
1990; Suhan et al., 1994) or electron microscopy (Inselburg, 1974; Tomizawa et al., 1974; Krause et al., 1997),
which allowed to observe the replication intermediates.
These methods of origin identification were also successfully applied in case of archaeal and eukaryotic ARS
(autonomously replicating sequence) regions (Stinchcomb
et al., 1979; Hamlin & Dijkwel, 1995; DePamphilis, 1999;
Matsunaga et al., 2001; Lundgren et al., 2004; Robinson
et al., 2004). To determine where the replication proteins
bind within the origin specific motifs, the EMSA (Electro
Mobility Shift Assay) (Margulies & Kaguni, 1996; Weigel
et al., 1997; Zawilak et al., 2003; Ozaki et al., 2006; Sibley
et al., 2006), footprinting (Hwang & Kornberg, 1992a;
Margulies & Kaguni, 1996; Speck and Messer, 2001; Shaheen et al., 2009) and SPR (surface plasmon resonance)
analysis (Messer et al., 2001; Speck and Messer, 2001;
Zawilak et al., 2003) were performed. The use of these
methods enabled not only to determine whether or not
the protein binds within the replication origin, but also
to identify the sequence recognized by the protein and to
calculate the stoichiometry and kinetics of the nucleoprotein complex formation. The in vitro helix opening assays
with KMnO4 probing and P1, S1 or mung bean nucleases
were also performed to find the site, where helix melting
occurs and where the replication complex is exactly
formed (Gille & Messer, 1991; Mukhopadhyay et al.,
1993; Krause et al., 1997; Speck and Messer, 2001; Ozaki
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
et al., 2006). Analogous biochemical analyses were applied
to determine the plasmids’ replication origins, for
instance the ColEI (Tomizawa et al., 1977), P1 (Abeles,
1986; Abeles et al., 1990; Wickner et al., 1990; Papp et al.,
1993; Park et al., 1998, 2001; Park & Chattoraj, 2001), F
(Eichenlaub et al., 1977; Zzaman et al., 2004), R6K (Kunnimalaiyaan et al., 2004), RK2 (Konieczny et al., 1997;
Doran et al., 1998) and other (Danbara et al., 1980; Diaz
& Staudenbauer, 1982; Kuzminov et al., 1997; Diaz-Lopez
et al., 2003; Schvartzman et al. 2010). For identification
of the archaeal and eukaryotic origins, where the recognition of the replication start point is much more difficult,
other methods, such as marker frequency analysis utilizing whole-genome DNA microarrays and replication initiation point analyses (Gerbi & Bielinsky, 1997; Bielinsky &
Gerbi, 1998, 1999, 2001; Abdurashidova et al., 2000; Gomez & Antequera, 1999; Romero & Lee, 2008), have been
applied. However, no matter how difficult the mapping
of the multiple replication start points is and how much
they differ from the bacterial origins, the common feature
for all of the origins is the AT richness within the region
where the replication process starts.
Localization of AT-rich region in respect to
other origin-specific motifs
Looking closer into the bacterial and plasmid origins with
reference to the position of the AT-rich region, there is a
clear noticeable regularity (Fig. 1). In the analyzed replicons, the binding sites for the replication initiating proteins
in different origins are located on the same site of the ATrich region. The best characterized binding sequences for
the bacterial replication initiators are DnaA-boxes, the
binding sites for the DnaA protein. Within the E. coli’s
245-bp oriC, there are five asymmetric 9-mer repeats
R1–R5 (Fuller et al., 1984; Matsui et al., 1985), bound with
different affinity by both active ATP-DnaA and inactive
ADP-DnaA proteins (Schaefer & Messer, 1991; Schaper &
Messer, 1995; Margulies & Kaguni, 1996; Weigel et al.,
1997; Blaesing et al., 2000; Hansen et al., 2006). There are
also, discovered recently, three I-sites preferentially bound
by the ATP-DnaA form (McGarry et al., 2004). Based on
the binding affinity, the sites recognized by DnaA were
grouped into strong affinity sites (R1, R2 and R4) and low
affinity sites (R3, R5 and I-sites) (Leonard & Grimwade,
2005). Both types are necessary for proper origin functioning as, for example, increasing the number of the I-sites by
replacing the R5 box with I2 inactivated the extrachromosomal oriC function (Grimwade et al., 2007). The origin
activity also depends on proper helical phasing of the binding sites for DnaA, as changes of one helical turn between
the R3 and R4 DnaA-boxes did not alter the replication
efficiency, whereas the insertion or deletion of a part of one
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
411
Fig. 1. Organization of selected chromosomal
and plasmid replication origins. AT-rich region
of each origin is depicted as a shaded red box,
repeated sequences within the region are
marked with black arrows. DnaA-box
sequences are marked as yellow triangles;
iterons – orange arrows. The origins are not
drawn to scale. Specific structural
arrangements characterize both chromosomal
and plasmid DNA replication origins. The
AT-rich regions in bacterial chromosomal
origins of replication are usually located on
one side of a cluster of DnaA-box sequences.
In case of the plasmid origins, this rule applies
to the AT-rich region and the iteron sequences
recognized by initiator proteins. DnaA-boxes,
if present at a plasmid origin, are usually
distributed on the opposite to the iterons side
of the AT-rich region or flank the motifs.
helical turn completely destroyed the oriC function (Woelker & Messer, 1993). All of the E. coli DnaA-boxes are
located in the vicinity of the AT-rich region and the distance between the DnaA-box region and the AT-rich part
of the origin is strictly critical for the origin replication
activity. Even a single nucleotide insertion disabled the oriC
function both in vivo and in vitro (Hsu et al., 1994). Four
and seven DnaA-boxes of a similar location with relation to
the AT-rich region were found in Pseudomonas aeruginosa
and Pseudomonas putida, respectively (Yee & Smith, 1990).
Most of these motifs share the E. coli DnaA-box consensus
sequence 5′-TTATNCACA-3′ and two of them in both
Pseudomonas origins have the same orientation and spacing
as the R1 and R4 boxes in E. coli oriC. Unfortunately, neither the pseudomonad origins can be functional in E. coli
cells nor the enteric origin can be active in Pseudomonas
cells (Yee & Smith, 1990). This phenomenon is not
observed in case of Vibrio cholerae chromosome I and II
origins because E. coli cells can be transformed with plasmids containing either oriCI or oriCII, but only when the
Dam methyltransferase is present in cells (Egan & Waldor,
2003; Koch et al., 2010). From the two V. cholerae origins,
oriCI is very similar to the E. coli oriC and shares its 58%
sequence identity (Egan & Waldor, 2003). Within this
FEMS Microbiol Rev 36 (2012) 408–434
307-bp origin, six sequences recognized by DnaA protein
were found, five of them (R1–R5) are common for all
Enterobacteriaceae and the sixth (RV) is characteristic for
the Vibrio genus. However, this unique RV motif is not
essential for the replication function of the minimal origin
region (Saha et al., 2004). All six DnaA-boxes were identified downstream of an AT-rich sequence (Fig. 1). Similarly
as for gammaproteobacteria, in the replication origin of alphaproteobacteria and other bacterial taxons, the sites
bound by chromosomal replication initiator are observed
along the region abounding with adenines and thymines
(Fig. 1). In Caulobacter crescentus’ origin, five DnaA binding sites were found (Marczynski & Shapiro, 1992); however, none of them shared the perfect oriC consensus
sequence (Schaefer & Messer, 1991). One of the identified
DnaA-boxes, differing just in one nucleotide position when
compared with E. coli DnaA-box, was found also in the
origins of other freshwater and marine Caulobacter sp.
(Shaheen et al., 2009). The introduction of a second mismatch into this sequence resulted in an impairment of origin function (Marczynski & Shapiro, 1992). An exact
match to this C. crescentus DnaA binding site was found in
Sinorhizobium meliloti origin, which additionally contains
four other DnaA-boxes (Sibley et al., 2006). As many as
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
412
ten, different from the E. coli DnaA-boxes, were also found
within a 559-bp origin of a thermophile bacterium
Thermotoga maritima. These 12-mer sequences designated
as tma DnaA-boxes, with the consensus 5′-AAACCTAC
CACC-3′ (Lopez et al., 2000), are bound by tmaDnaA
protein which shares 29% identity and 47% similarity with
the E. coli initiator protein homologue (Ozaki et al., 2006).
However, in contrast to the E. coli DnaA protein, the
tmaDnaA binds to the recognized sites with constant affinity. Even more DnaA-boxes than in T. maritima origin
were found in the origin of Gram-positive bacterium
Bacillus subtilis. Three clusters of DnaA-boxes – incA, incB,
and incC – that contain five, four, and six sequences recognized by DnaA protein, respectively, were identified.
Although the number and organization of the B. subtilis
DnaA-boxes are different when compared with those in
E. coli oriC, in an in vitro assay the E.coli DnaA protein
recognized sequences of Bacillus and vice versa, while the
origin regions were incompatible in vivo (Fukuoka et al.,
1990; Messer, 2002). Between the incB and incC clusters
and upstream of the incA region, a dnaA gene was located;
also there were detected 16-mer AT-rich sequences (Moriya
et al., 1992; Krause et al., 1997). However, the analysis of
the open complex formation after KMnO4 probing revealed
that the melting did not occur in these 16-mer repeats but
in a 27-mer AT-cluster downstream of incC DnaA-boxes
group (Moriya et al., 1994; Krause et al., 1997). Although
the position of the opened AT-rich region in Bacillus origin
is not located upstream of the sites bound by DnaA, it
should be emphasized that DnaA-boxes never outflank the
region where the melting of the DNA helix occurs. Even
when looking at the origin of 1.7-Mbp megaplasmid pSymb from Rhizobium meliloti, it is noticeable that the three
DnaA-boxes are located on one side of an AT-rich region,
which is especially similar to that of C. crescentus. Two sites
for DnaA binding possess the exact consensus sequences
and the third contains one mismatch (Margolin & Long,
1993). There are also potential binding sites for the megaplasmid-specific factor inducing the pSym-b replication,
but they have not been identified yet (Margolin & Long,
1993). As the bacterial chromosomal replication origins
contain at least four conserved DnaA-boxes and many
plasmids contain just two such sites and encode their own
additional initiator protein that binds within the origin,
this megaplasmid combines features of both chromosomal
and plasmid origin (Margolin & Long, 1993).
To conclude, the AT-rich regions in bacterial chromosomal origins of replication are usually located at one side
of a cluster of DnaA-box sequences. Such regular position
of those regions, crucial for the replication initiation, is
possibly connected with the function of initiator proteins
and the mechanism of replication initiation (see paragraphs below).
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
The organization of the position of the motifs within
plasmid origins is also not random. The presence of iterons that are tandem repeats bound by plasmid initiator
is characteristic of low-copy-number bacterial plasmids.
And as the DnaA-boxes are usually located upstream of
the AT-rich sequences within the plasmid origin, the iterons are found downstream of the region rich in adenines
and thymines (Fig. 1). Some of the best characterized
plasmids with this type of organization of the region
where replication starts are bacteriophage P1 and F plasmid. The P1 origin consists of two DnaA-boxes upstream
of 7-bp AT-rich repeats followed by five 19-mer iterons.
(Abeles et al., 1984, 1990; Abeles, 1986). To the right of
the iterons, there are also three additional sites bound by
the DnaA protein. However, the results obtained by Park
and Chattoraj showed that the origins with DnaA-boxes
situated closer to the origin-unwinding location opened
more efficiently and allowed plasmids to be maintained at
higher copy numbers (Park & Chattoraj, 2001). Within
the left and right clusters of the DnaA-boxes, there is one
perfect consensus sequence and the rest of the sites contain one or two mismatches. It was shown that the presence of one cluster of the DnaA-boxes is sufficient for
providing origin activity, moreover, just one site recognized by DnaA could support P1 replication, but the box
sequence must possess the exact consensus sequence (Abeles et al., 1990). Located downstream of the DnaA-boxes,
19-mer repeats of sequence 5′-GATGTGTGCTGGCGGG
ATA-3′ (Papp et al., 1994) are bound by RepA protein
and this interaction, as well as DnaA binding, is necessary
for proper origin functioning (Abeles, 1986; Wickner
et al., 1990). Similarly, in the F plasmid, binding a plasmid initiator protein RepE and the bacterial DnaA is
required for an efficient plasmid replication initiation
(Hansen & Yarmolinsky, 1986; Kline et al., 1986; Murakami et al., 1987; Masson & Ray, 1988; Ishiai et al., 1994;
Kawasaki et al., 1996). The sites where these proteins
bind, according to the observed rules, are located
upstream (DnaA-boxes) and downstream (iterons) of the
AT-rich region (Fig. 1). In the F plasmid origin, there are
two sites bound by DnaA and four 19-bp direct repeats
recognized by the monomers of RepE (Murotsu et al.,
1981; Matsunaga et al., 1995; Zzaman et al., 2004). Fewer
sites like this, located at two ends of the origin, were
identified in the origin of the pSC101 plasmid, which
contains just one site recognized by DnaA and three 21bp-long iterons bound by RepA (Churchward et al.,
1983), followed by two additional indirect repeats also
bound by initiator protein (Manen et al., 1992; Sugiura
et al., 1993; Ohkubo & Yamaguchi, 1995). The single
DnaA-box upstream of an AT-rich region was also identified in oric of plasmid R6K (MacAllister et al., 1991; Wu
et al., 1992). This E. coli plasmid contains, apart from
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
oric, two other origins a and b, but both of them require
the presence of oric in cis for functioning (Crosa et al.,
1978; Kolter & Helinski, 1978, 1982; Shon et al., 1982;
Stalker et al., 1982; Mukherjee et al., 1988). The oric
required binding of the DnaA-box by DnaA and the
seven 22-bp iterons by the plasmid initiator protein p for
proper activity (Kelley & Bastia, 1992; Kelley et al., 1992;
Lu et al., 1998; Kruger & Filutowicz, 2000). It was shown
that binding of minimum five of seven iterons is required
for the origin activity (Kolter & Helinski, 1982) and that
the iterons’ binding occurs in a cooperative manner
(Bowers et al., 2007). Nucleotide changes in the iteron
sequences prevent binding of the p protein in vitro and
the plasmid replication in vivo (McEachern et al., 1985).
Moreover, mutants were isolated, containing changes
in the iteron sequences that reduced binding of p monomers or both monomers and dimers of the protein
(Kunnimalaiyaan et al., 2004). The sequences recognized
by the p protein are located downstream of the AT-rich
repeats (Fig. 1). Between the iterons and the site where
the helix melting occurs (Kruger & Filutowicz, 2003),
there is an additional site bound by the bacterial
IHF protein (Filutowicz & Appelt, 1988; Dellis et al.,
1996).
An unusual situation when compared with other analyzed plasmid origins could be observed in pPS10
and RK2 plasmids, where the sites recognized by chromosomal and plasmid initiators are located on the same side
of the AT-rich region. In the pPS10 plasmid, four 22-bp
iterons and one DnaA-box follow the AT region (Nieto
et al., 1992) and in plasmid RK2 the AT-rich region is
followed by five 17-bp iterons and four DnaA-boxes
(Stalker et al., 1981; Fuller et al., 1984; Konieczny et al.,
1997). Mutations within the pPS10 DnaA-box hampered
the replication in vivo, indicating the requirement for
DnaA binding within the origin (Nieto et al., 1992). The
single motif bound by the DnaA protein is located just
5 bp from the nearest iteron and its sequence differs in
the first nucleotide from the E. coli consensus (Giraldo &
Fernandez-Tresguerres, 2004). The RK2 DnaA-boxes also
contain one or two mismatches when compared with the
consensus and their binding by DnaA is necessary for
proper origin functioning. However, the requirement
for various DnaA-boxes present in oriV is host specific.
For example, the RK2 DnaA-box 3 and 4 should be
present in the origin during replication in E. coli and
P. putida but can be missed in P. aeruginosa (Doran et al.,
1999b). The sequence, number, orientation, and spacing
of the binding sites for the plasmid initiator protein TrfA
are also crucial for RK2 plasmid replication (Perri & Helinski, 1993). It was shown that the binding of TrfA to
just one iteron is unstable and probably a cooperative
binding occurs between the protein particles at the iterons
FEMS Microbiol Rev 36 (2012) 408–434
413
(Perri & Helinski, 1993). Similarly as it was observed for
E. coli oriC (Hsu et al., 1994 and see above), the spacing
between the particular regions of plasmid replication origin, and thus the proper helical phasing, is crucial for
effective replication. The insertion of 6 bp (approximately
half of one helical turn) between the AT-rich region and
iterons completely inactivated the origin, whereas the
11-bp insertion in the same region restored 55–78% of
replication activity compared with the activity obtained
for the wild-type (Doran et al., 1998). Typical iterons are
not found in R1 plasmid, which represents dissimilar
organization of origin when compared with the plasmid
described above. In the R1 replication origin, one inessential DnaA-box (Ortega-Jimenez et al., 1992) and two partially palindromic 10-bp sequences recognized by the
plasmid’s RepA protein are followed by three AT-rich
9-mers (Fig. 1; del Solar et al., 1998). The sites bound by
RepA are contained within the 100-bp region and it was
proposed that the binding of RepA causes a 100-bp DNA
loop formation, which is then filled with more RepA molecules and results in the opening of the AT-rich region
(Giraldo & Diaz, 1992; del Solar et al., 1998; Messer,
2002). Although the DnaA is not required for the replication initiation, the efficiency of R1 replication is increased
in the presence of the DnaA protein (Bernander et al.,
1991; Bernander et al., 1992). What’s interesting, in vivo
and in vitro experiments demonstrated that mutational
disruption of the DnaA-box sequence within the oriR
allowed decreased replication of the R1 plasmid from the
origin, whereas immuno-inactivation of the DnaA protein
totally abolished the process in vitro in the absence of the
DnaA-box sequence (Ortega-Jimenez et al., 1992). This
indicates that even though the utilization of DnaA is
favored for the R1 replication, the DnaA-box present in
the oriR seems to be redundant and in its absence the
DnaA recruitment to the origin may take place due to
DnaA–RepA interactions (Ortega-Jimenez et al., 1992).
In vivo experiments showed that deletion of the DnaAbinding site found downstream of the replication origin
of pBR322 plasmid does not have a significant effect on
the replication of the plasmid (Chiang et al., 1991).
It could be summarized that the AT-rich regions of
plasmid origins are usually followed by one or two
DnaA-boxes and precede the binding sites for plasmid
initiator protein. As in case of the chromosomal origins,
proper organization of the motifs within the origin is
important for the efficient functioning of replication proteins. The binding sites for initiator proteins should not
only be present in this region, but also be located in correct orientation in respect to each other, as very often the
changes in number, orientation, and even spacing
between the DnaA-boxes and iterons disturb replication
initiation.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
414
Structure of the AT-rich region
Size and the internal stability of the AT-rich
region
The most distinguishable feature of an AT-rich region is
its high content of adenine and thymine residues when
compared with the overall average of an entire replication
origin. This depends on the base composition of a particular replicon and varies among chromosomal and plasmid origins. The region can be identified using in sillico
methods (see earlier paragraphs) based on sequence analysis and/or the internal thermodynamic stability of DNA.
However, defining the AT-rich region is sometimes rather
complicated since, as it was observed for example in
Streptomyces sp. (Jakimowicz et al., 1998), the region may
be dispersed and short AT stretches can be scattered
among other motifs. Nevertheless, typically there is no
bacterial origin and no DNA unwinding without an ATrich region.
In E. coli, the origin is composed in nearly 60% of
adenine and thymine residues, whereas its AT-rich
region consists of almost 77% of this type of bases.
However, the content may differ as much as in C. crescentus, where the general adenine and thymine content
in the origin is only about 34% and in the 40-bp-long
AT-rich region, it reaches 85% (Marczynski & Shapiro,
1992). Generally, the adenine–thymine richness of this
particular region varies from around 60% for P1 plasmid
origin, 71% in phage k (Schnos et al., 1988), almost
83% in R6K or 84% in pSC101 plasmids or R. meliloti’s
pSym-b megaplasmid origins, up to nearly 90% in case
of oriS of F plasmid. The length of an AT-rich region
within a replication origin can also be determined via
in sillico methods. It encloses a region of 40 bp, as for
example, in C. crescentus (Marczynski & Shapiro, 1992),
50–60 bp, as in E. coli, RK2 plasmid or Pseudomonas
origins (Bramhill & Kornberg, 1988b; Yee & Smith,
1990; Konieczny et al., 1997), to over 100 bp, as in the
case of R6K plasmid (Kruger & Filutowicz, 2003), within
the sequence of a replication origin. The pSC101 plasmid’s origin contains an asymmetric stretch of 80 bp,
rich in adenine and thymine residues (Churchward et al.,
1983; Manen & Caro, 1991). The high AT-content
results in the low thermodynamic stability of the region
which accounts for its role in the process of replication
initiation. At the AT-rich regions, the initial DNA helix
destabilization (opening) is induced by binding an initiator protein to its respective recognition sequences situated nearby. For chromosomal origins, these are DnaA
protein and DnaA-boxes and in case of most plasmids –
initiator proteins and iteron sequences. It is highly interesting that a protein interaction with specific DNA
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
sequences opens the DNA helix at a site distant from
the protein’s recognition and binding sites. The extent of
the initial opening of an AT-rich region can only be
determined through biochemical analysis involving
experiments utilizing chemical modifications of thymine
residues by KMnO4 or P1 or S1 nucleases cleavage of a
ssDNA during replication initiation. The experiments
analyzing the exact DNA strand opening are scarce and
clear data can be found only for few chromosomal origins and some plasmids. However, when determined, the
extent of the opening does not differ too much in the
analyzed replicons and usually oscillates around 50 bp in
length. In the best analyzed replicon, E. coli, the DNA
fragment melting during initiation replication was determined to be c. 45 bp of the oriC sequence (Bramhill &
Kornberg, 1988a), although by the KMnO4 modifications
the initial melting was shown to encompass 26 bp (Gille
& Messer, 1991). Experiments concerning the opening of
the R6K’s oric in the presence of the p initiator, DnaA
and IHF proteins showed KMnO4 modifications within a
c. 55-bp fragment (Lu et al., 1998; Kruger et al., 2001)
of the 112-bp AT-rich region (Kruger & Filutowicz,
2003). Similar KMnO4 modification footprinting carried
out with the addition of TrfA initiator protein showed
the opening of the RK2 plasmid oriV origin within the
range of 46 bp on the top strand of DNA and 13 bp on
the bottom, which includes almost the whole AT-rich
region of 51 bp identified for the RK2 origin (Konieczny
et al., 1997). This is also the case of P1 plasmid, where
the opening was shown to presumably take place at
almost the whole extent of over 70-bp-long AT-rich
region (Park et al., 1998); however, here the adjacent
DnaA-box sequences were also included in the AT-rich
sequence. KMnO4 modifications occurred at the left end
of the region near the adjoining DnaA-boxes when the
bottom strand of the P1 plasmid DNA was analyzed.
When the analysis concerned the top strand, the modifications were detected in the middle of the AT-rich
region (Park et al., 1998). This might have been
observed due to the different base composition of the
particular strands. The analysis of S1 nuclease sensitivity
of the k phage DNA upon binding the k O protein to
iterons within the k origin demonstrated the presence of
an unwound ssDNA region of at least 23 bp and at most
49 bp of an AT-rich region adjacent to the iteron
sequences (Schnos et al., 1988). From these data, it can
be concluded that the initial opening of DUE is rather
no shorter than c. 20 bp and rarely longer than c. 50 bp,
which seems to be sufficient for the assembly of the prereplication complexes at the unwound site.
The pattern of thermodynamic stability is different for
various origins (see Fig. 2), nevertheless the AT-rich
region in most of the cases can be distinguished as the
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
region of much lower free energy (DG) when compared
with the overall profile of the sequences surrounding it.
The regions contain some guanine and cytosine residues
and it is only natural that free energy will be accordingly
higher for those parts of the AT-rich sequences. In the
origins of E. coli (Fig. 2a), V. cholerae (Fig. 2d), C. crescentus (Fig. 2e) or F plasmid (Fig. 2j), the AT-rich
stretches form clear segments possessing noticeably lower
free energy when compared with the adjoining regions. In
other cases, as for example P1 or R6K plasmids (Fig. 2h
and k, respectively) the AT-rich region could not be
clearly distinguished among the origin sequences when
analyzing only the internal stability pattern. The fluctua-
415
tions and differences among the minima of free energy,
which can be observed when analyzing the thermodynamic stability patterns of particular origins, exist due to
the specific sequences of the regions and the existence of
direct repeats within the sequence. Their sequences determine the characteristic ‘intervals’ between points of the
local minima and maxima of free energy within some of
the repeats, as it can be observed in E. coli or V. cholerae
origins (Fig. 2a and d) or RK2 and pSC101 plasmids
(Fig. 2g and i). The internal stability pattern might also
be related to the helical periodicity of some of the repeats
within the AT-rich regions. This is the case in RK2 where
the local minima and maxima of the free energy could be
Fig. 2. Internal thermodynamic stability within selected replication origins’ fragments containing AT-rich regions. The stabilities were calculated
using WEBTHERMODYN software (http://gsa.buffalo.edu/dna/dk/WEBTHERMODYN/). The DNA sequence fragments taken for analysis were 200 bp
long. The red shading encloses the AT-rich regions with adjacent sequences. Black arrows depict repeats identified within the AT-rich regions of
the given origins. The sequence of the AT-rich regions and the repeats present within determine the characteristic ‘intervals’ between the points
of the local minima and maxima of the free energy of some of the replicons. Panels a-l present internal stabilities of selected chromosomal and
plasmid origins of replication.
FEMS Microbiol Rev 36 (2012) 408–434
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
416
ascribed to the distance of one helical turn. Most likely
the majority of the AT-rich regions of the prokaryotic
origins could contain repeats of various lengths. This
phenomenon is of great importance for the origin functioning and the formation of protein complexes during
replication initiation events (described in more detail in
the next paragraphs).
13-mers and other repeated sequences
Within the AT-rich regions of many prokaryotic origins
short repeated sequences have been identified; however,
their exact role was not explained up to date. Most of the
motifs in the AT-rich region are direct repeats, although
there are exceptions like RK2 plasmid’s oriV origin or
T. maritima’s where one of the motifs is inverted in relation to the other ones (Konieczny et al., 1997; Ozaki
et al., 2006) (Fig. 1). In E. coli, the AT-rich region is
present in the left part of the oriC origin and was shown
to contain three characteristic tandemly repeated
sequences, each 13-nucleotides long (Bramhill & Kornberg, 1988a). This length and the number of repeats seem
to be conserved among the chromosomal origins of replication, where the repeats were actually defined, as for
example in P. aeruginosa and P. putida (Yee & Smith,
1990), V. cholerae, Vibrio parahemolyticus and Vibrio vulnificus (Saha et al., 2004) (see Table 1). The E. coli
motifs, marked as L, M, and R (Bramhill & Kornberg,
1988a, b; Hwang & Kornberg, 1992a) are spaced with two
and three nucleotides, respectively. Upon the DnaA protein binding to the adjacent DnaA-box sequences the
region of the 13-mers unwinds, starting with the R motif
and successively proceeding toward the M and L ones
(Bramhill & Kornberg, 1988b). Initially it was assumed
that it is only the single-stranded structure and the helical
instability of the 13-mers that are important for the formation of the DUE and the origin activity. Escherichia coli
oriC deletion mutants lacking all three 13-mers were
unable to form an open complex and therefore were inactive (Bramhill & Kornberg, 1988b). Deleting the L 13-mer
from the oriC sequence led to an inhibition of origin
unwinding in the remaining two motifs, where no mung
bean nuclease hypersensitivity was detected in vitro and a
loss of the origin function in vivo was observed (Kowalski
& Eddy, 1989). However, the origin activity could be
restored by the insertion of a dissimilar DNA sequence in
the place of the deleted motif, suggesting that the L 13mer is not essential for oriC activity in vivo (Kowalski &
Eddy, 1989). Replacing this motif with a sequence containing higher GC content completely destroyed the origin activity (Asai et al., 1990). Recent studies
demonstrated that the deletion of the L 13-mer inactivates the heat-induced replication (HIR) and makes oriC
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
inactive for cyclic replication in conditions of increased
growth temperature (Gonzalez-Soltero et al., 2006). It
was suggested that the strand opening for HIR initiation
occurs due to heat-induced destabilization of the L 13mer (Gonzalez-Soltero et al., 2006). The 13-mer motifs
possess a consensus sequence 5′-GATCTnTTnnTTT-3′
(Bramhill & Kornberg, 1988a). Experiments utilizing
sequence mutants showed that beside the low internal stability of the AT-rich region, also this specific sequence of
the motifs is important for the functionality of the origin
both in vivo and in vitro (Hwang & Kornberg, 1992a). Of
the three 13-mer repeats, the sequence of the R one seems
to be the most essential for origin function, as even single
mutations or one base insertions are sufficient to disable
the origin activity (Bramhill & Kornberg, 1988b; Asai
et al., 1990; Hwang & Kornberg, 1992a, b). The E. coli
oriC AT-rich region also contains, apart from the
13-nucleotide motifs, an AT cluster situated at the leftmost part of the origin adjacent to the L 13-mer, that
determines this border of origin (Asai et al., 1990).
The 13-mers found in Pseudomonas origins are packed
less tightly than those of E. coli oriC and spaced with
sequences of 11 and 12 bp in length (Yee & Smith, 1990).
The sequence composition of the P. putida repeats
accounts for the formation of local minima of thermodynamic instability (Fig. 2c), where the lowest energy points
are at the terminal nucleotides of the M and R repeats
that contain more AT residues than the beginning of the
sequences (see Table 1). The sequence of the Pseudomonas repeats differs from that in E. coli consensus in
respect to the GATC motif found within all 13-mers in
oriC. Only the P. putida L 13-mer contains that motif at
the beginning of its sequence. When the whole sequence
is compared, the repeats of pseudomonads differ from
the E. coli consensus in as much as four to even seven
nucleotide positions. The AT-rich regions in V. cholerae,
V. parahemolyticus and V. vulnificus contain sets of three
13-mer repeats L, M, and R, bearing close sequence consensus with the respective motifs of E. coli oriC; however,
only the R repeats show the highest accordance with the
oriC 13-mer consensus as well as with its length (Saha
et al., 2004). This could be expected as earlier studies
concerning replication in E. coli showed that the R 13mer’s sequence and length are essential for the proper initiation of replication (see earlier paragraph). Besides the
13-mer motifs, the AT-rich region of Vibrio origins also
contains a short tract, AT cluster, preceding them and
composed of adenine and thymine residues only (Saha
et al., 2004), as it is observed in E. coli oriC. The 40-bplong AT-rich region of C. crescentus contains a sequence
that is very similar to the 13-mer motifs of E. coli (Marczynski & Shapiro, 1992). When the sequences spacing
the E. coli 13-mers are taken into account, the C. crescentus
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
417
Table 1. Repeated sequences within the AT-rich regions of selected replication origins
The table lists sequences of the repeated sequences present within AT-rich regions of selected chromosomal and plasmid origins of replication.
The distances between respective repeats are given in number of bp. The potential SeqA binding and methylation sites possessing GATC
sequence are enclosed in green boxes. In the bacterial origins there are typically up to three repeats within the AT-rich regions, usually of 13 nucleotides in length and a sequence similar to the consensus determined for Escherichia coli oriC 13-mers. In the chromosomal repeats, distinctive
core within the sequence (boxed in red), containing the highest content of adenine and thymine residues can be distinguished. For the plasmid
repeats, their length varies more and typically their entire sequence consists mostly of adenines and thymines.
FEMS Microbiol Rev 36 (2012) 408–434
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
418
13-mer matches 13 of 15 bp in the E. coli L, M, and R
13-mers (Marczynski & Shapiro, 1992). Therefore, the
motif may have similar functional significance as its
equivalents in E. coli. Concluding from the internal stability pattern of Cori fragment, containing the sequence of
the AT-rich region where the 13-mer is found in the
region of the lowest thermodynamic stability (Fig. 2e), it
can be assumed that it is the place where DNA helix
unwinding is initiated during replication initiation. At the
replication origin of B. subtilis, two AT-rich regions were
defined, one containing three repeats of 16 nucleotides in
length and the other consisting of a 27-bp cluster (Moriya
et al., 1985; Yoshikawa & Wake, 1993). The importance
of the 16-mers was not determined, as KMnO4 modifications upon binding of the DnaA protein to the origin
demonstrated, that it is the 27-mer that serves as the
DUE for the Bacillus origin (Moriya et al., 1994; Krause
et al., 1997). In T. maritima, three 9-nucleotide-long
sequences were found within one of its origin’s AT-rich
regions, marked AT-1, AT-2, and AT-3, sharing a consensus of 5′-TATTATTnA-3′ (Ozaki et al., 2006) and consisting almost in 93% of only adenine and thymine residues.
The comparative analysis of T. maritima AT sites pointed
out some sequence homology with the E. coli 13-mers
and the corresponding archaeal and eukaryotic sequences
(Ozaki et al., 2006). Only two of those repeats, AT-2 and
AT-3, are sufficient for the functionality of the T. maritima minimal origin. Binding an ATP-tmaDnaA (T. maritima homologue of the DnaA protein) at the tma-oriC
leads to duplex unwinding and the formation of an open
complex at the 9-mer region, depending on the temperature and superhelical tension of a plasmid containing the
tma-oriC. No 13-mers similar to those of E. coli were
found in oriC of Streptomyces; however, five short AT-rich
sequences are distributed among the sequences of DnaAboxes at the origins of those bacteria (Jakimowicz et al.,
1998). Based on the above finding, it can be observed that
in the bacterial origins the repeats within the AT-rich
regions are usually 13 nucleotides long and possess rather
high sequence similarity with consensus similar to that
determined for E. coli oriC 13-mers (see Table 1). A distinctive core consisting mostly of adenines and thymines
can be distinguished in the sequence of the chromosomal
repeats (see Table 1, boxed in red). Most often there are
three direct repeats in the region that play a direct role in
the open complex formation at the origin during the replication initiation.
Repeated sequences were also identified in the AT-rich
regions of plasmid origins and phage k, in up to five copies per origin. Those repeats seem to contain mostly adenine and thymine residues, which makes them c. 80 to
even 100% AT-rich. It is rather difficult to determine a
consensus sequence for the repeats found among different
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
plasmids as they are of various lengths but a consensus
can be established for the repeats within a particular origin. Some of the repeats are shorter than the 13-mer
motifs of oriC, just like the 7-mers of P1 plasmid (Chattoraj et al., 1985; Abeles et al., 1990; Brendler et al.,
1991a, b; Park et al., 1998), 8-mers at oriS of F plasmid
(Wei & Bernander, 1996), 9-mers in R1 (Wei & Bernander, 1996) and 10-mers in R6K plasmids (Stalker et al.,
1979; Bramhill & Kornberg, 1988a) or 11-mers in orik
(Schnos et al., 1988), others preserve the 13-nucleotide
length as in the case of pSC101 plasmid (Wei & Bernander, 1996; Miller et al., 2003) or oriV origin of RK2
(Stalker et al., 1981; Konieczny et al., 1997) (see Table 1).
In the latter, four 13-nucleotide repeats can be distinguished within the AT-rich region: L, M1, M2, and R,
bearing a consensus sequence 5′-TAAACnTTnTTTT-3′
(Konieczny et al., 1997) similar to that defined for oriC
(Bramhill & Kornberg, 1988a). In those repeats, as in E.
coli, the most stringently conserved are the nucleotides at
the beginning and the end of each motif: TAA and TTTT
(see Table 1). Those initial and terminal nucleotides
account for the segments of the lowest internal stability
within the AT-rich region and as the middle part of each
13-mer shows higher G/C content, characteristic fluctuations of the thermodynamic stability can be observed for
this part of the region (see Fig. 2g). The AT content in
the RK2 13-mers reaches almost 80%. They are tightly
packed and their sequences partially overlap; three of
them, L, M1, and R, are direct and situated on the top
strand of DNA, the fourth, M2, is reversed and placed
on the bottom strand (Konieczny et al., 1997). This
arrangement of the repeats is crucial for origin functioning as introducing insertions between them disabled
origin’s activity in vivo (Kowalczyk et al., 2005). The
in vivo and in vitro experiments utilizing a group of
mutants of the RK2 oriV AT-rich region also demonstrated that its role in replication initiation does not
ensue only from its low inner thermodynamic stability.
It was shown that the position of each 13-mer within
the AT-rich region is critical for the activity of the
plasmid’s origin, as swapping or reversing the motifs had
a negative effect on origin function (Kowalczyk et al.,
2005).
As it was observed for 13-mers in the chromosomal
origins, the number and length of the repeats within the
AT-rich region of an individual plasmid origin possibly
also plays an important role in the origin’s activity. For
the RK2 plasmid, it was demonstrated that the deletion
of one of the 13-mer repeats present in the AT-rich
region disabled oriV activity in vivo (Kowalczyk et al.,
2005). In R. meliloti pSym-b megaplasmid, only one copy
of 13-mer was defined, varying from the E. coli consensus
sequence in three positions (Margolin & Long, 1993).
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
The AT-rich region of the origin of pSC101 contains two
13-mer repeats (Manen & Caro, 1991) that are homologous to those found in oriC of E. coli. The c origin of
R6K plasmid contains three repeats of 10-nucleotide-long
sequences (Stalker et al., 1979; Bramhill & Kornberg,
1988a), nearly 100% AT-rich, whose content is higher
than that of the entire AT-rich region defined at the origin. The repeats, L and M 10-mers, are spaced with relatively long sequence of 29 bp and the distance between M
and R is 9 bp long. The lengths of the repeats and the
spacers in between them approximately multiply the
length of one helix turn (c. 11 bp) which might be significant for the DNA destabilization during replication initiation from the c origin or for nucleoprotein interactions
within the AT-rich region. It was demonstrated that the
R6K plasmid’s initiator protein p binds as a dimer to the
sequences within the AT-rich region at higher concentrations and possibly serves this way as a negative regulator
of primer formation at the site (Kruger & Filutowicz,
2000). In some cases, determining the repeats is more
complicated and different publications give different data
concerning the same replicon. For example, the work of
Wei and Bernander (Wei & Bernander, 1996) identifies
four highly AT-rich repeats of 8 bp in length (see
Table 1) within the origin of plasmid F. They account for
the region of the lowest thermodynamic stability when
compared with the surrounding sequences (see Fig. 2j).
However, Kawasaki et al. (1996) indicate a 13-nucleotidelong sequence 5′-GATCTTCTTTTTT-3′, overlapping two
of the four 8-mer motifs (M2 and R), and matching the
oriC 13-mer consensus at 11 positions, as the site of
localized DNA melting upon binding of the RepE, HU
and DnaA proteins to the origin, as it was shown by
experiments utilizing P1 nuclease cleavage of ssDNA. Further DNA unwinding at the F plasmid origin proceeds
toward the left side of the AT-rich region, where the
putative L and M1 motifs were suggested by the other
group. Differences also concern the origin of P1 plasmid,
where various works identify five tandem 7-nucleotide
repeats (Chattoraj et al., 1985; Abeles et al., 1990; Brendler et al., 1991a, b) within the central part of the origin,
others (Park et al., 1998) extend the repeats’ sequences to
10 bp. Although they are not designated as AT-rich (the
adenine and thymine content of each individual repeat
oscillates between 45% and 55%), the DNA strand melting takes place within the repeats containing region (Park
et al., 1998). Experiments utilizing mutants of the RK2
oriV AT-rich region demonstrated that the repeats are
also origin specific. Those mutants where the sequence of
a particular 13-mer was substituted with a respective
sequence of E. coli oriC showed a total lack of activity
in vivo and a significantly reduced replication activity
in vitro (Kowalczyk et al., 2005).
FEMS Microbiol Rev 36 (2012) 408–434
419
Mutagenesis at the AT-rich repeats was conducted
within different plasmid replicons. Single-base substitutions in the P1 plasmid’s AT-rich repeats demonstrated
that the sequence of the 7-mers is essential for the functionality of the P1 origin (Brendler et al., 1991a, b). Those
repeats contain GATC motifs (see description in the paragraph below) within their sequence. The negative effect of
mutations in other positions was also observed when
GATC motifs were left intact. For each of the repeats, at
least one single-base change that had a deleterious effect
on replication was found. The phenotype effects of mutations in the first 7-bp repeat (5′-AGATCCA-3′) showed
that 6 of 7 bp are important for origin function, whereas
changes in the last position of the motif had little or no
effect. The integrity of the sequence of the 7-nucleotide
motifs seems to be crucial for the origin activity but neither the sequence nor the length of the 3- to 6-bp spacing
between the motifs is very important, although the
requirement of minimal spacer length was suggested
(Brendler et al., 1991a, b). In the case of k origin’s ATrich region studies concerning the prepriming step of k
DNA replication initiation and the origin opening demonstrated that a point mutation in one of the 11-mers,
adjacent to the first iteron sequence of the origin, where
a G/C was substituted with a T/A pair, had an effect on
AT-rich region sensitivity to S1 cleavage which was found
remarkable as the mutation increases the A/T content of
the region (Schnos et al., 1988). Point mutations within
the AT-rich 13-mers of the RK2 plasmid also negatively
influenced the origin activity, decreasing it to 75 or even
to 0.01% when compared with the activity of the wildtype origin (Kowalczyk et al., 2005). What was also extremely interesting here, even the A/T or T/A substitutions
in the conserved initial or terminal sequences of each
13-mer, not affecting the pattern of the thermodynamic
stability of the region, caused a complete loss of activity
of the origin (Kowalczyk et al., 2005). This suggests that
the position of the adenine and thymine residues on each
DNA strand may be critical for the interactions of proteins involved in the plasmid’s replication initiation
within this region. Changes simultaneously affecting the
sequences of putative DnaA-box motifs within this region
showed that their sequence is also important to the origin’s activity (Kowalczyk et al., 2005); however, the
DnaA–DNA interaction within this region still remains
elusive. The activity of the oriV AT-rich region’s mutants
was similar in the in vivo experiments in E. coli and
P. aeruginosa, except for two of the mutants having
changes within the M1 13-mer, which being inactive in
E. coli could replicate in P. aeruginosa (Kowalczyk et al.,
2005). It is possible that this particular motif is important
for the plasmid’s replication in E. coli but not in
P. aeruginosa. This raises the possibility that host factors
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
420
can specifically interact within the AT-rich region of the
plasmid origin. Moreover, it could be that the subtle
changes of the internal stability resulting from the
sequence change could affect replication activity in one
host but not the other. All in all, the above data demonstrates that the repeats found in the plasmids’ AT-rich
regions play an essential role in the DNA replication initiation. Their length, number, and spacing are individual
for each origin and the sequence of each is substantial for
origin functioning.
Motifs overlapping the AT-rich region’s
repeated sequences
Apart from the direct repeated sequences, there are other
motifs that can be identified within the AT-rich regions
of prokaryotic replication origins that play a crucial role
in the process of replication initiation or are involved in
its control (see Table 2). In the AT-rich region of E. coli
oriC, 6-nucleotide-long sequences of ATP-DnaA-boxes
were identified (Speck & Messer, 2001), possessing a consensus sequence 5′-AGATCT-3′. Those motifs overlap the
13-mers L and M found in oriC and are the recognition
sites for the ATP form of the DnaA initiator protein
(Speck et al., 1999). The interaction of the ATP-DnaA
with these sequences after the formation of a singlestranded structure was postulated to be important during
the process of open complex formation within the oriC
(Speck & Messer, 2001). Possible ATP-DnaA-box
sequences can be found in the AT-rich regions of Vibrio
species origins which are similar in structure to those of
E. coli oriC (Saha et al., 2004). Moreover, a single AGAT
CT sequence was found in the origins of C. crescentus
and the F plasmid; however, no data determine the function of those sequences in DNA replication of those replicons. Other specific motifs, TTGT and TTATT, within
the E. coli ssDNA of M and R 13-mers were identified to
interact with the ATP-DnaA protein (Ozaki et al., 2008).
No ATP-DnaA-boxes that would be similar to those in
oriC are present within the RK2 13-mers (Speck &
Messer, 2001; Kowalczyk et al., 2005). However, on the
borders of the 13-mers M1 with M2 and M2 with R,
sequences of putative DnaA-boxes were identified
(Kowalczyk et al., 2005), where the DnaA protein may
Table 2. Proteins interacting or suspected to interact within the AT-rich regions
Protein
Replicon
DnaA
DnaA
DnaA
DnaB
E. coli
E. coli
V. harveyi
E. coli
DnaC
DnaG primase
Pol III holoenzyme
core
b-ring
SSB
P
SeqA
IciA
E. coli
E. coli
E. coli
Π
Fis
ArcA
IHF
HobH
DpiA
CspA; CspE
CspB; CspC; CspD
E. coli
Phage k
E. coli
E. coli; F;
R1
R6K
R6K
E. coli
R6K; pSC101
E.
E.
E.
E.
B.
coli
coli; pSC101
coli
coli;
subtilis
Recognized sequence/
binding specificity
References
AGATCT
TTGT; TTATT
AGATCG
ssDNA; M/R 13-mers;
bottom strand
Cryptic ssDNA binding
ssDNA
ssDNA
Speck & Messer (2001)
Ozaki et al. (2008)
Messer (2002)
LeBowitz & McMacken (1986); Jezewska et al. (1996); Weigel &Seitz (2002)
Learn et al. (1997)
Wickner (1977); Rowen & Kornberg (1978)
Kelman & O’Donnell (1995)
ssDNA
Cryptic ssDNA binding
GATC
dsDNA; 13-mers
Meyer & Laine (1990)
Learn et al. (1997)
Brendler et al. (1995); Slater et al. (1995)
Thony et al. (1991); Hwang & Kornberg (1992a, b); Wei & Bernander (1996)
dsDNA
dsDNA
dsDNA
TTTAAGTTGCTGATT;
CCACAACTCAAA
dsDNA
dsDNA; 13-mers
ssDNA
ssDNA
Kruger & Filutowicz (2000)
Wu et al. (1996)
Lee et al. (2001b)
Filutowicz & Appelt (1988); Levchenko & Filutowicz (1996); Datta et al.
(1999)
Herrick et al. (1994)
Miller et al. (2003)
Jiang et al. (1997); Phadtare & Inouye (1999); Yamanaka et al. (2001)
Graumann & Marahiel (1994); Graumann et al. (1997); Jiang et al. (1997);
Phadtare & Inouye (1999); Yamanaka et al. (2001)
In the table, proteins interacting or suspected to interact with the sequences of the AT-rich regions of replication origins are listed. If a specific
sequence was identified to be bound by a protein, it is given in the table (see details in the text). Otherwise only ssDNA or dsDNA interaction is
implied.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
interact with DNA, as it was suggested previously (Gaylo
et al., 1987; Doran et al., 1999a, b). The exact role of the
DnaA-box sequences located within DUE is being investigated. It is difficult to comprehend if the ATP-DnaAboxes are universal motifs that could be found in more
origins or if they are unique only for some. However,
most likely the interaction of replication initiators with
specific motifs within an AT-rich region of an origin
could be widespread.
The sites recognized by Dam methyltransferase (Geier
& Modrich, 1979; Slater et al., 1995) were found overlapping the repeated sequences but also present in the adjacent sequences of the AT-rich regions. Those sites are the
short GATC motifs, where methyl group is introduced at
the adenine residues (Geier & Modrich, 1979). The
methylated GATC sites become hemimethylated during
the DNA replication and as such are recognized by a
SeqA protein (Brendler et al., 1995; Slater et al., 1995)
that is involved in sequestration of the newly synthesized
DNA (Lu et al., 1994), preventing premature DNA rereplication in vivo. SeqA binds to an origin tandemly as a
dimer (Lee et al., 2001a; Guarne et al., 2002; Han et al.,
2003; Guarne et al., 2005; Kang et al., 2005) and was
found to negatively regulate DNA replication by blocking
the GATC sequences within the DnaA-boxes of E. coli
oriC and preventing DnaA binding to those sites (Taghbalout et al., 2000; Nievera et al., 2006). The GATC sites
that are found within the E. coli origin’s AT-rich region
at the beginning of each 13-nucleotide sequence are also
recognized by SeqA. The highest affinity of the SeqA protein was shown at the GATC motif located within the
oriC L 13-mer (Kang et al., 1999) but strong binding was
shown also for the GATC motifs within M and R 13mers. Here SeqA inhibits the formation of the open complex at the AT-rich region of the origin (Torheim &
Skarstad, 1999). The SeqA recognition sequences are
found in similar positions of the repeats identified within
AT-rich region of P. putida, V. cholerae, V. parahemolyticus, V. vulnificus, C. crescentus or R. meliloti, but also in
some plasmid AT-rich sequences as for example F,
pSC101 or P1 (see Table 1). In F plasmid, they overlap
the 7-nucleotide repeated motifs and both the GATC
sequence and its methylation are important for the functionality of the F origin in vivo (Brendler et al., 1991a, b,
1995; Abeles et al., 1993). No GATC sequences were
found within the AT-rich region of the RK2 oriV origin,
which excludes the 13-mers as sites for SeqA interaction.
As GATC motifs are noticeably more frequent in the ATrich regions of chromosomal origins of replication, their
role in the regulation of replication initiation and chromosome segregation is of great importance.
Specific interactions within the 13-mer sequences of
oriC were reported for the IciA protein, which is an
FEMS Microbiol Rev 36 (2012) 408–434
421
inhibitor of chromosome replication initiation (Hwang &
Kornberg, 1990, 1992b; Thony et al., 1991). Binding the
IciA dimer to DNA at the 13-mers causes an early
inhibition of the unwinding process at the AT-rich region
by DnaA and HU proteins, preventing the replication
initiation (Hwang & Kornberg, 1990, 1992a). The introduction of sequence changes within the L 13-mer may
lead to an increase or decrease of IciA affinity toward the
13-mers, therefore inhibiting the process at a different
level (Hwang & Kornberg, 1990). This also suggests that
there might exist specific sequences recognized by the
protein within the 13-mers. IciA does not inhibit the
binding of the DnaA and IHF proteins to their respective
sequences (Hwang & Kornberg, 1992a). However, once
the 13-mers are opened due to the DnaA protein activity,
IciA no longer affects the DNA replication from the oriC
origin (Hwang & Kornberg, 1990). EMSA and DNase I
footprint analyses showed that sites recognized by the
IciA protein are also found at the F and R1 plasmid origins in their repeat-containing AT-rich regions, as it was
observed for E. coli oriC (Wei & Bernander, 1996) and
possibly play a similar role in replication initiation. No
data were found for the presence of IciA-recognized
sequences within the AT-rich regions in other origins;
however, such possibility cannot be excluded without
proper biochemical analysis.
Sites recognized also by other proteins were shown to
exist within the oriC AT-rich region. For example, a site
for binding the ArcA protein that, among other
sequences, interacts with the 13-mers and blocks the
open complex formation and therefore the replication
initiation. The protein binding does not affect the process
once the ssDNA structure is formed (Lee et al., 2001b).
Sequences within the 13-mer region were also reported
to bind a putative E. coli membrane protein, HobH, that
seems to be involved in postreplication oriC binding to
the membrane at the DNA hemimethylated state, preventing re-initiation (Herrick et al., 1994). Binding the
13-mers at the AT-rich regions of the oriC and pSC101
plasmid origins by a DpiA protein results in perturbed
DNA replication and plasmid inheritance destabilization;
also, the DpiA overexpression and binding to oriC
induces bacterial SOS response (Miller et al., 2003). The
sequences at the unwound AT-rich region of oriC were
found to take part in the interaction with another factor –
the CspD protein, belonging to the cold shock proteins
family. Its binding to the ssDNA was shown to effectively
inhibit both the initiation and elongation stages of the
minichromosomal DNA replication in vitro (Yamanaka
et al., 2001); however, the binding occurred without
apparent sequence specificity. The feature of ssDNA
binding of CspD is shared by other Csp proteins in
E. coli: CspA, CspB, CspC, and CspE (Jiang et al., 1997;
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
422
Phadtare & Inouye, 1999) and B. subtilis: CspB, CspC,
and CspD (Graumann & Marahiel, 1994; Graumann
et al., 1997).
The binding sites for nucleoid-associated proteins
(NAPs) were also identified within AT-rich region of some
replication origins. For example, Fis (factor for inversion
stimulation) binding sites, which are present outside the
AT-rich region of E. coli oriC (Gille et al., 1991), were
found in the sequence of the AT-rich region of the R6K
plasmid. The plasmid replication depends on Fis when
copy-up variants of R6K p protein and the Penr marker
were simultaneously used (Wu et al., 1996). In case of two
plasmids, R6K and pSC101, sites for binding the hostencoded DNA bending protein IHF (integration host
factor) were localized within the AT-rich regions of their
respective origins (Filutowicz & Appelt, 1988; Levchenko
& Filutowicz, 1996; Datta et al., 1999). The IHF stimulates
the unwinding of plasmid origins and E. coli oriC
(Bramhill & Kornberg, 1988b; Hwang & Kornberg, 1992a;
Kruger et al., 2001). It has been demonstrated that Fis
protein suppresses IHF as well as DnaA interaction within
the E. coli oriC and negatively affects replication initiation
(Ryan et al., 2004). Another factor interacting with ssDNA
and functionally affecting its structure is the ssDNA-binding protein (SSB), which interacts with the open region of
an origin and is involved in replication complex assembly
(Meyer & Laine, 1990).
Apart from the proteins described above also other factors were identified to be crucial for the regulation of
replication activity of prokaryotic origins. Some of them
influence origin opening at the AT-rich region; however,
none of them was demonstrated to interact directly with
the motifs found in this region. The RIDA mechanism
(regulatory inactivation of DnaA) (Katayama et al.,
1998), stimulated by Hda protein and b-ring (Katayama
et al., 2010), prevents overinitiation of DNA replication
by converting the active ATP-DnaA form to its inactive
ADP-DnaA form (Camara et al., 2005). The DnaA inactivation occurs due to direct interaction of Hda with the
b-clamp through a specific motif at its N-terminus (Kurz
et al., 2004). This process, together with the dnaA gene
autoregulation, acts to ensure homeostatic maintenance
of the E. coli chromosome (Riber et al., 2006). Rob protein binds the right side of oriC, the R2, R3, and R4
DnaA-boxes, which may influence the formation of
nucleoprotein structure in this region (Skarstad et al.,
1993). Cnu (oriC-binding NAP) could possibly play a role
in proper activity of oriC (Kim et al., 2005). DiaA
(DnaA-binding protein) binds DnaA complexed with
DnaA-box and stimulates the formation of an open complex (Ishida et al., 2004) and directly the formation of
the ATP-DnaA multimers on oriC ensuring timely replication (Keyamura et al., 2009).
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
Relevance of repeated sequences within
AT-rich region for DNA replication
initiation
AT-rich region repeated sequences and origin
opening
The high content of adenine and thymine residues and
consequent low internal stability of the AT-rich region
facilitates DNA helix destabilization. It was observed
in vitro that to some extent, the DUEs have a tendency
toward spontaneous opening (breathing) without replication initiation factors (Polaczek et al., 1998). The E. coli
oriC unwinding can occur in either the presence or the
absence of initiation proteins, as detected by a singlestrand specific nuclease, P1 (Polaczek et al., 1998). This
suggests that oriC unwinding could be a spontaneous
event determined solely by the DNA sequence containing
repeated motifs contributing to the specific internal stability pattern with periodical local minima of free energy
(see paragraphs above). Therefore the replication proteins
may function only to stabilize the ssDNA structure.
Experiments conducted on other systems revealed that the
DNA helix unwinding could be detected only in the presence of the replication initiation proteins (Schnos et al.,
1988; Mukhopadhyay et al., 1993; Kawasaki et al., 1996;
Konieczny et al., 1997; Kruger et al., 2001; Kruger & Filutowicz, 2003). For example, in vitro tests with KMnO4
clearly demonstrate that during the initiation of RK2 plasmid’s replication, the opening of the AT-rich region is
induced by the plasmid replication initiation protein TrfA
and is only enhanced or/and stabilized by host DnaA
(Konieczny et al., 1997). Moreover, a strict requirement
of histone-like proteins indicates that the appropriate superhelicity of DNA within DUE is critical for its opening.
The necessity for HU or IHF proteins was reported for
E. coli oriC (Kano et al., 1991; Hwang & Kornberg, 1992a;
Ryan et al., 2002), phage k (Mendelson et al., 1991), plasmids F (Ogura et al., 1990; Kawasaki et al., 1996; Zzaman
et al., 2004), P1 (Ogura et al., 1990; Mukhopadhyay et al.,
1993; Park et al., 1998; Fekete et al., 2006), R6K (Dellis &
Filutowicz, 1991; Dellis et al., 1996; Lu et al., 1998;
Kruger et al., 2001; Abhyankar et al., 2003) and RK2
(Konieczny et al., 1997). Although the spontaneous destabilization of the AT-rich part of the origin is possible, no
functional origin opening leading to the formation of replication complex and the initiation of DNA replication
was observed without the initiator proteins. Therefore the
analysis of the nucleoprotein complexes formed at the
replication origin is fundamental to our understanding of
the mechanism of the initial origin opening.
It is thought that the initial complex at E. coli oriC
includes 5–20 DnaA molecules and it forms a wrapped
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
nucleoprotein structure (Kornberg & Baker, 1992; Carr &
Kaguni, 2001; Messer et al., 2001). A similar multimeric
form called O-some consisting of the kO protein and
iterons was identified during the bacteriophage lambda
DNA replication initiation (Dodson et al., 1989). The formation of the E. coli oriC initial complex requires sequential DnaA protein binding to the individual DnaA-box
sequences, located within the origin’s sequence (Margulies
& Kaguni, 1996; Weigel et al., 1997). While ATP-DnaA
and ADP-DnaA are both able to form nucleoprotein
complexes at oriC, the ATP-DnaA is required for the
DUE opening (Bramhill & Kornberg, 1988a, b; Sekimizu
et al., 1988; Ozaki et al., 2008). The DnaA-boxes of the
chromosomal replication origins, as well as the plasmids’
iterons, are usually located on one side of AT-rich
sequences (see paragraph above and Fig 1). This indicates
that the initial complex, consisting of Rep proteins and
the wrapped DNA, is formed in the proximity but outside of the DUE element. Experiments, demonstrating
that the distance and helical spacing between the Rep
binding sites and an AT-rich region is critical for origin
activity (see paragraphs above and Hsu et al., 1994;
Doran et al., 1998), raised the possibility that the DUE
can directly interact with a nucleoprotein structure
formed by Rep proteins.
The last 10 years brought new data on the mechanism
of the initial DNA unwinding and highlighted the contribution of the DNA sequence of the AT-rich region to the
open complex formation. Mutational analysis revealed
that the E. coli oriC activity requires DnaA interaction
with both strong affinity sites (R1, R2, and R4) and low
affinity sites (R3, R5, and I-sites) and also that both ATPDnaA and ADP-DnaA forms interact at those sites (Leonard & Grimwade, 2005; Grimwade et al., 2007). The discovery of the ATP-DnaA-boxes within the AT-rich region
of the oriC origin of E. coli (Speck & Messer, 2001)
pointed out that DnaA can interact within the specific
sequences that are present at the AT-rich region. SPR
analysis of the DnaA nucleoprotein complexes clearly
demonstrated that the ATP-DnaA binds ssDNA containing ATP-DnaA-box sequences (Speck & Messer, 2001;
Messer, 2002). It was proposed (Speck & Messer, 2001)
that initially DnaA protein binds with high affinity to
dsDNA oriC DnaA box R1 and that this complex serves as
an anchor for a cooperative binding of ATP-DnaA to the
ATP-DnaA-boxes. This would position the protein to the
region of unwinding. The ssDNA resulting from the DUE
unwinding would then be stabilized by cooperative binding of ATP-DnaA. Crystallographic data on DnaA proteins, isolated from Aquifex aeolicus and T. maritima, and
crystal structures of Rep proteins of plasmids F, pPS10,
P1, R6K as well as the structural data obtained via bioinformatics’ approaches (Sharma et al., 2004; Pierechod
FEMS Microbiol Rev 36 (2012) 408–434
423
et al., 2009) elucidated general mechanisms underlying the
DNA replication initiation in bacteria. It became clear that
both the chromosomal DNA replication initiators, DnaA
homologous proteins and the plasmid replication initiators, Rep proteins, form oligomers when interacting with
their binding sites localized at the replication origin.
Structural analysis of the F plasmid’s RepE (Komori et al.,
1999) and the A. aeolicus DnaA (Erzberger et al., 2002)
was consistent with previous observations showing that
the replication initiators induce DNA bending, when
interacting with a DNA particle (Mukherjee et al., 1985;
Stenzel et al., 1991; Murakami et al., 1997). The repeated
sequences, iterons, and DnaA-boxes contribute to that
specific nucleoprotein structure. It must be pointed out
that due to the structural variations among the DnaA
homologues and the Rep proteins, the nucleoprotein complexes they form with DNA differ. The structure analysis
of the A. aeolicus DnaA protein (Erzberger et al., 2002)
revealed that the ATP-binding domain, including two
Walker motifs for nucleotide binding and hydrolysis,
belongs to the AAA+ superfamily (Neuwald et al., 1999;
Maurizi & Li, 2001; Ogura & Wilkinson, 2001). The protein’s C-terminal domain consists of a classic helix-turnhelix motif and an extra loop that could be in contact
with DNA (Erzberger et al., 2002). Based on the obtained
structural data, it was proposed that the DnaA oligomer
could conceivably accommodate either a closed ring or a
helical filament arrangement of monomers. The DUE
opening may occur spontaneously through a local strain
induced by an assembly of the nucleoprotein complex in
the presence of ATP (Erzberger et al., 2002). In a recent
study, Berger’s group reported that DnaA utilizes at least
two different oligomeric conformations and that it plays
distinct roles in controlling the progression of the replication initiation (Duderstadt et al., 2010). The structural
basis for a link between the dsDNA binding, the ability to
form oligomers and the interaction with ssDNA in ATPdependent manner was established. Moreover, experiments on the T. maritima DnaA showed the multimer
ATP-tmaDnaA-dependent formation of an open complex
at the replication origin (Ozaki et al., 2006). ATP is not
required for the E. coli DnaA binding to the DnaA-boxes
and the formation of multimeric complex at the oriC;
however, it is essential for the protein interaction with the
ATP-DnaA-boxes localized within the AT-rich region. The
interaction of DnaA proteins with specific motifs located
within the ssDNA of the AT-rich region is suggested to be
a common mechanism for the origin opening of bacterial
chromosomes.
Understanding the open complex formation at plasmid
origins is more complex as two initiator proteins are
involved during plasmid replication initiation. In contrast
to the DnaA homologues, the Rep proteins do not contain
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
424
AAA+ domain and consist of two winged helix (WH)
domains, which are a large family of DBDs found also in
eukaryotic organisms (Giraldo, 2003). The WH domains
are composed of a bundle of three a-helices and an antiparallel b-sheet (Gajiwala & Burley, 2000) that facilitate
Rep binding to the direct repeated sequences at plasmid’s
replication origin. No ATP binding sites were identified in
the Rep structure and, in contrast to the chromosomal
DNA replication initiation, no strict ATP requirement was
observed for plasmids’ origin opening (Mukhopadhyay
et al., 1993; Kawasaki et al., 1996; Lu et al., 1998; Park
et al., 1998). Although the Rep DNA binding and the
DUE opening at plasmid origins are not ATP-dependent,
the specific binding of Rep to the iterons at plasmid origins establishes oligomeric nucleoprotein complex similar
to those formed by DnaA (Komori et al., 1999). The plasmid and chromosomal origins exhibit similar positioning
of the AT-rich region in respect to the binding sites of a
replication initiator. The iterons in plasmid origins are
clustered similarly as the DnaA-boxes in chromosomal
origins, on one side of an AT-rich DUE (Fig. 1 and text
above). The open complex formation at the plasmid origins is induced by Rep protein binding to the iterons;
however, some plasmids still require DnaA to stabilize or
enhance the origin unwinding (Kawasaki et al., 1996;
Konieczny et al., 1997; Park et al., 1998; Kruger et al.,
2001). The molecular basis for DnaA contribution to plasmid origin opening must be different comparing to mechanisms proposed for the chromosomal replication
initiation by DnaA (see above). The melting at a plasmid
AT-rich region is ATP-independent and no ATP-DnaAbox motifs are present there. It must be pointed out that
the DnaA interaction with putative DnaA-boxes within
the DUE element of RK2 plasmid was postulated (Gaylo
et al., 1987; Konieczny et al., 1997). Moreover, it could
not be excluded that during the opening reaction the Rep
proteins could interact with the 13-mer sequences within
the AT-rich region. Both in vivo and in vitro tests revealed
that specific alterations within the 13-mer sequences result
in inhibition of the RK2 oriV opening and consequently
in the lack of origin activity (Kowalczyk et al., 2005;
Rajewska et al., 2008). As the introduced alterations did
not change the internal stability pattern, these results indicate the possibility of specific protein–DNA interactions
within the AT-rich region of plasmid origins.
AT-rich region repeated sequences – helicase
loading and replication complex assembly
The AT-rich region provides structural scaffold for the
assembling of a replication complex. Regardless of a replicon analyzed, the formation of the complex involves helicase loading and subsequent assembly of primase and
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
replicative polymerase. Those enzymes specifically bind
the ssDNA (Wickner, 1977; Rowen & Kornberg, 1978;
LeBowitz & McMacken, 1986; Kelman & O’Donnell,
1995; Jezewska et al., 1996) that is created within the
opened DUE. It is not clear if and how the nucleotide
sequence of the opened region affects the formation of a
replication complex. The E. coli DnaB helicase loading at
oriC requires both the helicase interaction with DnaA
(Marszalek & Kaguni, 1994; Sutton et al., 1998; Seitz
et al., 2000) and the involvement of an accessory ATPase,
namely DnaC, protein (Wickner & Hurwitz, 1975; Lanka
& Schuster, 1983; Davey et al., 2002). Both DnaA and
DnaC bind ssDNA (Learn et al., 1997; Messer, 2002).
ATP stimulates ssDNA binding by E. coli DnaC, leading
to the expansion of the ssDNA bubble at the origin (Davey et al., 2002). Cryptic single-stranded-DNA binding
activities of the E. coli DnaC, as well as phage lambda P
replication initiation protein, facilitates the transfer of the
E. coli DnaB helicase onto the ssDNA (Learn et al.,
1997). The determination of the structure of the ATPase
region of the DnaC from A. aeolicus demonstrates that it
is a close paralogue of the DnaA and forms a helical
assembly similar to the quaternary state adopted by ATPbound DnaA (Mott et al., 2008). Those findings implicate
that DnaC is a molecular adaptor that uses ATP-activated
DnaA as a docking site for regulating the recruitment and
correct spatial deposition of the DnaB helicase onto the
ssDNA of the opened DUE element. Two E. coli DnaB
hexamers are loaded onto the opposing strands of the
open origin structure (Fang et al., 1999). After the helicase moves approximately 65 nucleotides, the primase
synthesizes primers and two molecules of the DNA polymerase III holoenzyme are assembled on the ssDNA templates (Fang et al., 1999). SSB protein was also indicated
as a factor important for the formation and functionality
of the replication complex at the ssDNA (Meyer & Laine,
1990; Kelman et al., 1998).
The helicase loading at plasmid origins requires DnaA,
DnaC and plasmid-encoded replication initiation protein
(Konieczny & Helinski, 1997; Lu et al., 1998; Datta et al.,
1999; Pacek et al., 2001; Weigel & Seitz, 2002; Jiang et al.,
2003); however, in vitro tests indicate that in some cases
the recruitment and loading of the bacterial helicase on a
plasmid origin does not require DnaC-like ATPase (Caspi
et al., 2001; Jiang et al., 2003; Konieczny, 2003). The
DnaA and DnaC interactions with the ssDNA within the
open AT-rich region are demonstrated for chromosomal
origins (see paragraph above). It is very likely that similar
complexes, including also the plasmid Rep proteins, could
be essential for the helicase loading at plasmid origins.
During the RK2 plasmid replication initiation, the E. coli
DnaB helicase complex consisting of DnaA and DnaC
proteins was shown at dsDNA DnaA-box repeats located
FEMS Microbiol Rev 36 (2012) 408–434
425
AT-rich region and the repeated sequences in DNA replication
outside the AT-rich region of the plasmid origin (Konieczny & Helinski, 1997; Pacek et al., 2001). Specific alterations within the 13-mer sequences, which do no affect
origin opening, significantly disturb the E. coli DnaB helicase translocation from the DnaA-boxes to the RK2
AT-rich region (Rajewska et al., 2008). In case of five
mutants, this process showed that the helicase loading at
the unwound single-stranded AT-rich region depends
strictly on the activity of the plasmid’s Rep protein and
the sequence of the 13-mer repeats (Rajewska et al., 2008).
This adds to the notion of the AT-rich region being
important to replication initiation not only due to its low
internal stability. However, whether it is the sequence itself
or the sequence-dependent formation of secondary structures at the unwound origin still remains ambiguous. A
selective activation by stretches of thymine residues of the
Mcm4/6/7 helicase complex has been reported and proposed to be a determinant for the selection of DNA replication initiation sites in mammalian genomes (You et al.,
2003). Experiments on the E. coli DnaB demonstrate that
DnaA can load helicase at a DnaA-box hairpin structure
(Carr & Kaguni, 2001). The potential formation of secondary structures within the open region of origin
(Pearson et al., 1996; Rampakakis et al., 2010) may affect
the nucleoprotein interactions.
AT-rich region repeated sequences and
replication regulation
The AT-rich part of a replication origin containing 13mer repeats and sequences overlapping them, including
GATC sites, ATP-DnaA boxes, and other more or less
specified motifs for binding various regulatory proteins
(see paragraphs above) could be considered as a major
region for controlling the DNA replication initiation. This
control is achieved mainly by managing the unwinding of
an AT-rich region. The binding of the regulatory proteins
changes the DNA architecture or prevents the formation
of specific nucleoprotein complexes at the DUE. That
results in the stimulation or inhibition of origin opening.
The interactions of ATP-DnaA with specific motifs (ATPDnaA-box-like sequences) localized within the AT-rich
part of origin is critical for its opening, whereas the binding of the SeqA, IciA, ArcA, HobH, or CspD proteins to
the AT-rich sequences inhibits the formation of the open
complex and therefore the DNA replication initiation. In
contrast, the DiaA protein directly stimulates the formation of ATP-DnaA multimers on oriC and ensures timely
replication. After the melting of DUE, the interactions of
Hda and Pol III sliding clamp with ATP-DnaA and
ssDNA within the open region inactivate DnaA protein.
This prevents an immediate ATP-dependent activity of
DnaA and as a result an uncontrolled origin opening.
FEMS Microbiol Rev 36 (2012) 408–434
The replication initiation could possibly be also regulated
by controlling the formation of a helicase complex at the
unwound AT-rich structure. It was recently proposed that
the DiaA dynamics is coupled with changes in the initial
complexes at the origin, leading to helicase loading (Keyamura et al., 2009). Moreover, the NAPs such as Fis,
HU, IHF affect the origin’s architecture and distribution
of the DnaA protein and, therefore, the activity of replication origin. In some replicons, the IHF-binding sites
are located within the DUE (see earlier paragraphs), indicating the relevance of the NAPs interactions within this
region. A model for the dynamic interplay between the
DNA architectural proteins and DnaA during replication
initiation has been proposed (Mott & Berger, 2007). The
AT-rich region of a replication origin accommodates
multiple protein interactions and conformations. The formation of oligomeric structures composed of replication
initiators and architectural proteins as well as specific
interactions of regulatory factors seem to be the key factors for origin activities. A specific nucleotide sequence of
DUE element is essential and provides specificity of
nucleoprotein interactions during the process of replication initiation and its control.
Summary
A common feature of many bacterial origins of replication is the presence of a region rich in adenine and thymine residues. These AT-rich regions are the sites where
the processes of DNA unwinding and replication start.
They are usually characterized by low thermodynamic stability in comparison with the overall origin stability and
contain repeated sequences of various lengths that are
crucial for the proper functioning of the replicons. The
presence of the repeated motifs causes fluctuations, characteristic ‘intervals’ and differences among the minima of
free energy, which can be observed when analyzing the
thermodynamic stability patterns of particular origins.
Patterns of the internal stability may also be related to
the helical periodicity of some of the repeats within the
AT-rich regions. In the bacterial chromosomal origins of
replication, the AT-rich regions are usually located at one
side of a cluster of DnaA-box sequences. In plasmids,
they are usually followed by one or two DnaA-boxes and
precede the binding sites for plasmid initiator protein.
Such arrangement of the motifs seems to be of great
importance for the efficient functioning of replication origins. It is possible that proper organization of the motifs
might be connected with the function of initiator proteins
and the mechanism of replication initiation at a particular origin. The AT-rich regions are of different sizes,
depending on the origin; however, what seems to be
common for them is the size of the initial DNA opening
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
426
at DUE, which is very rarely longer than 50 bp and no
shorter than 20 bp. This extent of origin opening seems
to be sufficient for the assembly of the prereplication
complex at the unwound site.The great majority of the
AT-rich regions of prokaryotic origins contain repeats of
various lengths. They appear to be absolutely critical for
the origin functioning and the formation of protein complexes during replication initiation events. In the bacterial
origins, the repeats within the AT-rich regions are usually
13 nucleotides long and possess high sequence similarity
with consensus determined for E. coli oriC 13-mers. They
also contain a distinctive core consisting mostly of adenines and thymines at the right-hand side of each repeat.
Most often, there are three direct repeats in the region
that play a direct role in the open complex formation at
the origin during the replication initiation. In plasmids,
the repeats vary both in length and sequence and establishing a general consensus for them is rather difficult.
However, the length, number, and spacing between the
repeats are individual for each plasmid origin and the
specific sequence of each is substantial for origin functioning. Biochemical analyses of replication initiation at
different origins revealed that the sequence or, possibly,
the secondary DNA structure resulting from it is absolutely critical for the replicating activity of a replicon.
However, whether it is the linear or secondary structure
of DNA in the AT-rich region, which is crucial for origin’s activity and multiprotein interactions in this region,
still requires investigation. Within the repeats of the chromosomal and plasmid AT-rich regions overalapping
motifs for binding proteins engaged in replication initiation and its regulation were also identified. However, the
exact role of the particular motifs is also being investigated. It is highly interesting whether the multiprotein
interactions within the AT-rich region are competitive or
maybe the accumulation of motifs for protein interactions
plays a role at different stages of replication and allows
separation of particular events of the process. The analysis
of the nucleoprotein complexes formed at the AT-rich
region of replication origin will be crucial for further
understanding of the regulation and molecular mechanisms involved in the initial origin opening and subsequent steps of replication initiation.
Acknowledgements
This work was supported by grant N N301 295837 from
Polish Ministry of Science and Higher Education. We
thank Foundation for Polish Science for support under
TEAM (for I.K.) and START (for M.R.) programs. M.R.
was also financed from European Social Fund in the
framework of the Program of implementing the elements
of modern education forms at the University of Gdansk.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
References
Abdurashidova G, Deganuto M, Klima R, Riva S, Biamonti G,
Giacca M & Falaschi A (2000) Start sites of bidirectional
DNA synthesis at the human lamin B2 origin. Science 287:
2023–2026.
Abeles AL (1986) P1 plasmid replication. Purification and
DNA-binding activity of the replication protein RepA. J Biol
Chem 261: 3548–3555.
Abeles AL, Snyder KM & Chattoraj DK (1984) P1 plasmid
replication: replicon structure. J Mol Biol 173: 307–324.
Abeles AL, Reaves LD & Austin SJ (1990) A single DnaA box
is sufficient for initiation from the P1 plasmid origin.
J Bacteriol 172: 4386–4391.
Abeles A, Brendler T & Austin S (1993) Evidence of two
levels of control of P1 oriR and host oriC replication
origins by DNA adenine methylation. J Bacteriol 175:
7801–7807.
Abhyankar MM, Zzaman S & Bastia D (2003) Reconstitution
of R6K DNA replication in vitro using 22 purified proteins.
J Biol Chem 278: 45476–45484.
Asai T, Takanami M & Imai M (1990) The AT richness and
gid transcription determine the left border of the replication
origin of the E. coli chromosome. EMBO J 9: 4065–4072.
Bell SP & Dutta A (2002) DNA replication in eukaryotic cells.
Annu Rev Biochem 71: 333–374.
Bernander R, Dasgupta S & Nordstrom K (1991) The E. coli
cell cycle and the plasmid R1 replication cycle in the
absence of the DnaA protein. Cell 64: 1145–1153.
Bernander R, Krabbe M & Nordstrom K (1992) Mapping of
the in vivo start site for leading strand DNA synthesis in
plasmid R1. EMBO J 11: 4481–4487.
Berquist BR & DasSarma S (2003) An archaeal chromosomal
autonomously replicating sequence element from an
extreme halophile, Halobacterium sp. strain NRC-1.
J Bacteriol 185: 5959–5966.
Bielinsky AK & Gerbi SA (1998) Discrete start sites for DNA
synthesis in the yeast ARS1 origin. Science 279: 95–98.
Bielinsky AK & Gerbi SA (1999) Chromosomal ARS1 has a
single leading strand start site. Mol Cell 3: 477–486.
Bielinsky AK & Gerbi SA (2001) Where it all starts: eukaryotic
origins of DNA replication. J Cell Sci 114: 643–651.
Blaesing F, Weigel C, Welzeck M & Messer W (2000) Analysis
of the DNA-binding domain of Escherichia coli DnaA
protein. Mol Microbiol 36: 557–569.
Bowers LM, Kruger R & Filutowicz M (2007) Mechanism of
origin activation by monomers of R6K-encoded pi protein.
J Mol Biol 368: 928–938.
Bramhill D & Kornberg A (1988a) A model for initiation at
origins of DNA replication. Cell 54: 915–918.
Bramhill D & Kornberg A (1988b) Duplex opening by DnaA
protein at novel sequences in initiation of replication at the
origin of the E. coli chromosome. Cell 52: 743–755.
Breier AM, Chatterji S & Cozzarelli NR (2004) Prediction of
Saccharomyces cerevisiae replication origins. Genome Biol 5:
R22.
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
Brendler T, Abeles A & Austin S (1991a) Critical sequences in
the core of the P1 plasmid replication origin. J Bacteriol
173: 3935–3942.
Brendler TG, Abeles AL, Reaves LD & Austin SJ (1991b)
Unique sequence requirements for the P1 plasmid
replication origin. Res Microbiol 142: 209–216.
Brendler T, Abeles A & Austin S (1995) A protein that binds
to the P1 origin core and the oriC 13mer region in a
methylation-specific fashion is the product of the host seqA
gene. EMBO J 14: 4083–4089.
Camara JE, Breier AM, Brendler T, Austin S, Cozzarelli NR &
Crooke E (2005) Hda inactivation of DnaA is the
predominant mechanism preventing hyperinitiation of
Escherichia coli DNA replication. EMBO Rep 6: 736–741.
Carr KM & Kaguni JM (2001) Stoichiometry of DnaA and
DnaB protein in initiation at the Escherichia coli
chromosomal origin. J Biol Chem 276: 44919–44925.
Caspi R, Pacek M, Consiglieri G, Helinski DR, Toukdarian AE
& Konieczny I (2001) A broad host range replicon with
different requirements for replication initiation in three
bacterial species. EMBO J 20: 3262–3271.
Chattoraj DK, Snyder KM & Abeles AL (1985) P1 plasmid
replication: multiple functions of RepA protein at the
origin. P Natl Acad Sci USA 82: 2588–2592.
Chew DS, Leung MY & Choi KP (2007) AT excursion: a new
approach to predict replication origins in viral genomes by
locating AT-rich regions. BMC Bioinformatics 8: 163.
Chiang CS, Xu YC & Bremer H (1991) Role of DnaA protein
during replication of plasmid pBR322 in Escherichia coli.
Mol Gen Genet 225: 435–442.
Churchward G, Linder P & Caro L (1983) The nucleotide
sequence of replication and maintenance functions encoded
by plasmid pSC101. Nucleic Acids Res 11: 5645–5659.
Coker JA, DasSarma P, Capes M et al. (2009) Multiple
replication origins of Halobacterium sp. strain NRC-1:
properties of the conserved orc7-dependent oriC1.
J Bacteriol 191: 5253–5261.
Crosa JH, Luttropp LK & Falkow S (1978) Molecular cloning
of replication and incompatibility regions from the
R-plasmid R6K. J Mol Biol 124: 443–468.
Danbara H, Timmis JK, Lurz R & Timmis KN (1980) Plasmid
replication functions: two distinct segments of plasmid R1,
RepA and RepD, express incompatibility and are capable of
autonomous replication. J Bacteriol 144: 1126–1138.
Datta HJ, Khatri GS & Bastia D (1999) Mechanism of
recruitment of DnaB helicase to the replication origin of the
plasmid pSC101. P Natl Acad Sci USA 96: 73–78.
Davey MJ, Fang L, McInerney P, Georgescu RE & O’Donnell
M (2002) The DnaC helicase loader is a dual ATP/ADP
switch protein. EMBO J 17: 3148–3159.
del Solar G, Giraldo R, Ruiz-Echevarria MJ, Espinosa M &
Diaz-Orejas R (1998) Replication and control of circular
bacterial plasmids. Microbiol Mol Biol Rev 62: 434–464.
Dellis S & Filutowicz M (1991) Integration host factor of
Escherichia coli reverses the inhibition of R6K plasmid
replication by pi initiator protein. J Bacteriol 173: 1279–1286.
FEMS Microbiol Rev 36 (2012) 408–434
427
Dellis S, Feng J & Filutowicz M (1996) Replication of plasmid
R6K gamma origin in vivo and in vitro: dependence on IHF
binding to the ihf1 site. J Mol Biol 257: 550–560.
DePamphilis ML (1999) Replication origins in metazoan
chromosomes: fact or fiction? Bioessays 21: 5–16.
Diaz R & Staudenbauer WL (1982) Origin and direction of
mini-R1 plasmid DNA replication in cell extracts of
Escherichia coli. J Bacteriol 150: 1077–1084.
Diaz-Lopez T, Lages-Gonzalo M, Serrano-Lopez A, Alfonso
C, Rivas G, Diaz-Orejas R & Giraldo R (2003)
Structural changes in RepA, a plasmid replication
initiator, upon binding to origin DNA. J Biol Chem 278:
18606–18616.
Dijkwel PA & Hamlin JL (1995) The Chinese hamster
dihydrofolate reductase origin consists of multiple potential
nascent-strand start sites. Mol Cell Biol 15: 3023–3031.
Dodson M, McMacken R & Echols H (1989) Specialized
nucleoprotein structures at the origin of replication of
bacteriophage lambda. Protein association and
disassociation reactions responsible for localized initiation of
replication. J Biol Chem 264: 10719–10725.
Doran KS, Konieczny I & Helinski DR (1998) Replication
origin of the broad host range plasmid RK2. Positioning of
various motifs is critical for initiation of replication. J Biol
Chem 273: 8447–8453.
Doran KS, Helinski DR & Konieczny I (1999a) A critical
DnaA box directs the cooperative binding of the Escherichia
coli DnaA protein to the plasmid RK2 replication origin.
J Biol Chem 274: 17918–17923.
Doran KS, Helinski DR & Konieczny I (1999b) Hostdependent requirement for specific DnaA boxes for plasmid
RK2 replication. Mol Microbiol 33: 490–498.
Duderstadt KE, Mott ML, Crisona NJ, Chuang K, Yang H &
Berger JM (2010) Origin remodeling and opening in
bacteria rely on distinct assembly states of the DnaA
initiator. J Biol Chem 285: 28229–28239.
Egan ES & Waldor MK (2003) Distinct replication
requirements for the two Vibrio cholerae chromosomes. Cell
114: 521–530.
Eichenlaub R, Figurski D & Helinski DR (1977) Bidirection
replication from a unique origin in a mini-F plasmid.
P Natl Acad Sci USA 74: 1138–1141.
Erzberger JP, Pirruccello MM & Berger JM (2002) The
structure of bacterial DnaA: implications for general
mechanisms underlying DNA replication initiation. EMBO J
21: 4763–4773.
Fang L, Davey MJ & O’Donnell M (1999) Replisome assembly at
oriC, the replication origin of E. coli, reveals an explanation
for initiation sites outside an origin. Mol Cell 4: 541–553.
Fekete RA, Venkova-Canova T, Park K & Chattoraj DK (2006)
IHF-dependent activation of P1 plasmid origin by dnaA.
Mol Microbiol 62: 1739–1751.
Filutowicz M & Appelt K (1988) The integration host factor of
Escherichia coli binds to multiple sites at plasmid R6K
gamma origin and is essential for replication. Nucleic Acids
Res 16: 3829–3843.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
428
Frank AC & Lobry JR (2000) Oriloc: prediction of replication
boundaries in unannotated bacterial chromosomes.
Bioinformatics 16: 560–561.
Freeman JM, Plasterer TN, Smith TF & Mohr SC (1998)
Patterns of genome organization in bacteria. Science 279:
1827.
Fukuoka T, Moriya S, Yoshikawa H & Ogasawara N (1990)
Purification and characterization of an initiation protein for
chromosomal replication, DnaA, in Bacillus subtilis.
J Biochem 107: 732–739.
Fuller RS, Funnell BE & Kornberg A (1984) The dnaA protein
complex with the E. coli chromosomal replication origin
(oriC) and other DNA sites. Cell 38: 889–900.
Gajiwala KS & Burley SK (2000) Winged helix proteins. Curr
Opin Struct Biol 10: 110–116.
Gao F & Zhang CT (2007) DoriC: a database of oriC regions
in bacterial genomes. Bioinformatics 23: 1866–1867.
Gao F & Zhang CT (2008) Ori-Finder: a web-based system for
finding oriCs in unannotated bacterial genomes. BMC
Bioinformatics 9: 79.
Gaylo PJ, Turjman N & Bastia D (1987) DnaA protein is
required for replication of the minimal replicon of the
broad-host-range plasmid RK2 in Escherichia coli. J Bacteriol
169: 4703–4709.
Geier GE & Modrich P (1979) Recognition sequence of the
dam methylase of Escherichia coli K12 and mode of cleavage
of Dpn I endonuclease. J Biol Chem 254: 1408–1413.
Gerbi SA & Bielinsky AK (1997) Replication initiation point
mapping. Methods 13: 271–280.
Gille H & Messer W (1991) Localized DNA melting and
structural pertubations in the origin of replication, oriC, of
Escherichia coli in vitro and in vivo. EMBO J 10: 1579–1584.
Gille H, Egan JB, Roth A & Messer W (1991) The FIS protein
binds and bends the origin of chromosomal DNA
replication, oriC, of Escherichia coli. Nucleic Acids Res 19:
4167–4172.
Giraldo R (2003) Common domains in the initiators of DNA
replication in Bacteria, Archaea and Eukarya: combined
structural, functional and phylogenetic perspectives. FEMS
Microbiol Rev 26: 533–554.
Giraldo R & Diaz R (1992) Differential binding of wild-type
and a mutant RepA protein to oriR sequence suggests a
model for the initiation of plasmid R1 replication. J Mol
Biol 228: 787–802.
Giraldo R & Fernandez-Tresguerres ME (2004) Twenty years
of the pPS10 replicon: insights on the molecular mechanism
for the activation of DNA replication in iteron-containing
bacterial plasmids. Plasmid 52: 69–83.
Gomez M & Antequera F (1999) Organization of DNA
replication origins in the fission yeast genome. EMBO J 18:
5683–5690.
Gonzalez-Soltero R, Botello E & Jimenez-Sanchez A (2006)
Initiation of heat-induced replication requires DnaA and the
L-13-mer of oriC. J Bacteriol 188: 8294–8298.
Graumann P & Marahiel MA (1994) The major cold shock
protein of Bacillus subtilis CspB binds with high affinity to
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
the ATTGG- and CCAAT sequences in single stranded
oligonucleotides. FEBS Lett 338: 157–160.
Graumann P, Wendrich TM, Weber MH, Schroder K &
Marahiel MA (1997) A family of cold shock proteins in
Bacillus subtilis is essential for cellular growth and for
efficient protein synthesis at optimal and low temperatures.
Mol Microbiol 25: 741–756.
Grigoriev A (1998) Analyzing genomes with cumulative skew
diagrams. Nucleic Acids Res 26: 2286–2290.
Grimwade JE, Torgue JJ, McGarry KC, Rozgaja T, Enloe ST &
Leonard AC (2007) Mutational analysis reveals Escherichia
coli oriC interacts with both DnaA-ATP and DnaA-ADP
during pre-RC assembly. Mol Microbiol 66: 428–439.
Guarne A, Zhao Q, Ghirlando R & Yang W (2002) Insights
into negative modulation of E. coli replication initiation
from the structure of SeqA-hemimethylated DNA complex.
Nat Struct Biol 9: 839–843.
Guarne A, Brendler T, Zhao Q, Ghirlando R, Austin S & Yang
W (2005) Crystal structure of a SeqA-N filament:
implications for DNA replication and chromosome
organization. EMBO J 24: 1502–1511.
Hamlin JL & Dijkwel PA (1995) On the nature of replication
origins in higher eukaryotes. Curr Opin Genet Dev 5: 153–
161.
Han JS, Kang S, Lee H, Kim HK & Hwang DS (2003)
Sequential binding of SeqA to paired hemi-methylated
GATC sequences mediates formation of higher order
complexes. J Biol Chem 278: 34983–34989.
Hansen EB & Yarmolinsky MB (1986) Host participation in
plasmid maintenance: dependence upon dnaA of replicons
derived from P1 and F. P Natl Acad Sci USA 83: 4423–4427.
Hansen FG, Christensen BB, Nielsen CB & Atlung T (2006)
Insights into the quality of DnaA boxes and their
cooperativity. J Mol Biol 355: 85–95.
Herrick J, Kern R, Guha S, Landoulsi A, Fayet O, Malki A &
Kohiyama M (1994) Parental strand recognition of the
DNA replication origin by the outer membrane in
Escherichia coli. EMBO J 13: 4695–4703.
Hsu J, Bramhill D & Thompson CM (1994) Open
complex formation by DnaA initiator protein at the
Escherichia coli chromosomal origin requires the 13-mer
precisely spaced relative to the 9-mers. Mol Microbiol 11:
903–911.
Hwang DS & Kornberg A (1990) A novel protein binds a key
origin sequence to block replication of an E. coli
minichromosome. Cell 63: 325–331.
Hwang DS & Kornberg A (1992a) Opening of the replication
origin of Escherichia coli by DnaA protein with protein HU
or IHF. J Biol Chem 267: 23083–23086.
Hwang DS & Kornberg A (1992b) Opposed actions of
regulatory proteins, DnaA and IciA, in opening the
replication origin of Escherichia coli. J Biol Chem 267:
23087–23091.
Inselburg J (1974) Replication of colicin E1 plasmid DNA in
minicells from a unique replication initiation site. P Natl
Acad Sci USA 71: 2256–2259.
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
Ishiai M, Wada C, Kawasaki Y & Yura T (1994) Replication
initiator protein RepE of mini-F plasmid: functional
differentiation between monomers (initiator) and dimers
(autogenous repressor). P Natl Acad Sci USA 91: 3839–3843.
Ishida T, Akimitsu N, Kashioka T et al. (2004) DiaA, a novel
DnaA-binding protein, ensures the timely initiation of
Escherichia coli chromosome replication. J Biol Chem 279:
45546–45555.
Jacob F & Brenner S (1963) [On the regulation of DNA
synthesis in bacteria: the hypothesis of the replicon].
CR Hebd Seances Acad Sci 256: 298–300.
Jakimowicz D, Majka J & Messer W et al. (1998) Structural
elements of the Streptomyces oriC region and their
interactions with the DnaA protein. Microbiology 144 (Pt 5):
1281–1290.
Jezewska MJ, Kim US & Bujalowski W (1996) Binding of
Escherichia coli primary replicative helicase DnaB protein to
single-stranded DNA. Long-range allosteric conformational
changes within the protein hexamer. Biochemistry 35: 2129–
2145.
Jiang W, Hou Y & Inouye M (1997) CspA, the major coldshock protein of Escherichia coli, is an RNA chaperone.
J Biol Chem 272: 196–202.
Jiang Y, Pacek M, Helinski DR, Konieczny I & Toukdarian A
(2003) A multifunctional plasmid-encoded replication
initiation protein both recruits and positions an active
helicase at the replication origin. P Natl Acad Sci USA 100:
8692–8697.
Kang S, Lee H, Han JS & Hwang DS (1999) Interaction of
SeqA and Dam methylase on the hemimethylated origin of
Escherichia coli chromosomal DNA replication. J Biol Chem
274: 11463–11468.
Kang S, Han JS, Kim KP, Yang HY, Lee KY, Hong CB &
Hwang DS (2005) Dimeric configuration of SeqA protein
bound to a pair of hemi-methylated GATC sequences.
Nucleic Acids Res 33: 1524–1531.
Kano Y, Ogawa T, Ogura T, Hiraga S, Okazaki T & Imamoto
F (1991) Participation of the histone-like protein HU and of
IHF in minichromosomal maintenance in Escherichia coli.
Gene 103: 25–30.
Katayama T, Kubota T, Kurokawa K, Crooke E & Sekimizu K
(1998) The initiator function of DnaA protein is negatively
regulated by the sliding clamp of the E. coli chromosomal
replicase. Cell 94: 61–71.
Katayama T, Ozaki S, Keyamura K & Fujimitsu K (2010)
Regulation of the replication cycle: conserved and diverse
regulatory systems for DnaA and oriC. Nat Rev Microbiol 8:
163–170.
Kawasaki Y, Matsunaga F, Kano Y, Yura T & Wada C (1996)
The localized melting of mini-F origin by the combined
action of the mini-F initiator protein (RepE) and HU and
DnaA of Escherichia coli. Mol Gen Genet 253: 42–49.
Kelley WL & Bastia D (1992) Activation in vivo of the
minimal replication origin beta of plasmid R6K requires a
small target sequence essential for DNA looping. New Biol
4: 569–580.
FEMS Microbiol Rev 36 (2012) 408–434
429
Kelley WL, Patel I & Bastia D (1992) Structural and functional
analysis of a replication enhancer: separation of the
enhancer activity from origin function by mutational
dissection of the replication origin gamma of plasmid R6K.
P Natl Acad Sci USA 89: 5078–5082.
Kelman Z & O’Donnell M (1995) DNA polymerase III
holoenzyme: structure and function of a chromosomal
replicating machine. Annu Rev Biochem 64: 171–200.
Kelman Z, Yuzhakov A, Andjelkovic J & O’Donnell M (1998)
Devoted to the lagging strand-the subunit of DNA
polymerase III holoenzyme contacts SSB to promote
processive elongation and sliding clamp assembly. EMBO J
17: 2436–2449.
Keyamura K, Abe Y, Higashi M, Ueda T & Katayama T (2009)
DiaA dynamics are coupled with changes in initial origin
complexes leading to helicase loading. J Biol Chem 284:
25038–25050.
Kim MS, Bae SH, Yun SH et al. (2005) Cnu, a novel oriCbinding protein of Escherichia coli. J Bacteriol 187: 6998–
7008.
Kline BC, Kogoma T, Tam JE & Shields MS (1986)
Requirement of the Escherichia coli dnaA gene product for
plasmid F maintenance. J Bacteriol 168: 440–443.
Koch B, Ma X & Lobner-Olesen A (2010) Replication of
Vibrio cholerae chromosome I in Escherichia coli:
dependence on dam methylation. J Bacteriol 192: 3903–
3914.
Kolter R & Helinski DR (1978) Construction of plasmid R6K
derivatives in vitro: characterization of the R6K replication
region. Plasmid 1: 571–580.
Kolter R & Helinski DR (1982) Plasmid R6K DNA replication.
II. Direct nucleotide sequence repeats are required for an
active gamma-origin. J Mol Biol 161: 45–56.
Komori H, Matsunaga F, Higuchi Y, Ishiai M, Wada C & Miki
K (1999) Crystal structure of a prokaryotic replication
initiator protein bound to DNA at 2.6 A resolution. EMBO
J 18: 4597–4607.
Konieczny I (2003) Strategies for helicase recruitment and
loading in bacteria. EMBO Rep 4: 37–41.
Konieczny I & Helinski DR (1997) Helicase delivery and
activation by DnaA and TrfA proteins during the initiation
of replication of the broad host range plasmid RK2. J Biol
Chem 272: 33312–33318.
Konieczny I, Doran KS, Helinski DR & Blasina A (1997) Role
of TrfA and DnaA proteins in origin opening during
initiation of DNA replication of the broad host range
plasmid RK2. J Biol Chem 272: 20173–20178.
Kornberg A & Baker T (1992) DNA Replication. W.H.
Freeman & Co., New York.
Kowalczyk L, Rajewska M & Konieczny I (2005) Positioning
and the specific sequence of each 13-mer motif are critical
for activity of the plasmid RK2 replication origin. Mol
Microbiol 57: 1439–1449.
Kowalski D & Eddy MJ (1989) The DNA unwinding element:
a novel, cis-acting component that facilitates opening of the
Escherichia coli replication origin. EMBO J 8: 4335–4344.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
430
Krause M, Ruckert B, Lurz R & Messer W (1997) Complexes
at the replication origin of Bacillus subtilis with homologous
and heterologous DnaA protein. J Mol Biol 274: 365–380.
Kruger R & Filutowicz M (2000) Dimers of pi protein bind
the A+T-rich region of the R6K gamma origin near the
leading-strand synthesis start sites: regulatory implications.
J Bacteriol 182: 2461–2467.
Kruger R & Filutowicz M (2003) pi protein- and ATPdependent transitions from ‘closed’ to ‘open’ complexes at
the gamma ori of plasmid R6K. Nucleic Acids Res 31: 5993–
6003.
Kruger R, Konieczny I & Filutowicz M (2001) Monomer/dimer
ratios of replication protein modulate the DNA strandopening in a replication origin. J Mol Biol 306: 945–955.
Kunnimalaiyaan S, Kruger R, Ross W, Rakowski SA &
Filutowicz M (2004) Binding modes of the initiator and
inhibitor forms of the replication protein pi to the gamma
ori iteron of plasmid R6K. J Biol Chem 279: 41058–41066.
Kurz M, Dalrymple B, Wijffels G & Kongsuwan K (2004)
Interaction of the sliding clamp beta-subunit and Hda, a
DnaA-related protein. J Bacteriol 186: 3508–3515.
Kuzminov A, Schabtach E & Stahl FW (1997) Study of
plasmid replication in Escherichia coli with a combination of
2D gel electrophoresis and electron microscopy. J Mol Biol
268: 1–7.
Lanka E & Schuster H (1983) The dnaC protein of Escherichia
coli. Purification, physical properties and interaction with
dnaB protein. Nucleic Acids Res 11: 987–997.
Learn BA, Um S-J, Li H & McMacken R (1997) Cryptic
single-stranded-DNA binding activities of the phage lambda
P and Escherichia coli DnaC replication initiation proteins
facilitate the transfer of E. coli DnaB helicase onto DNA.
P Natl Acad Sci USA 94: 1154–1159.
LeBowitz JH & McMacken R (1986) The Escherichia coli dnaB
replication protein is a DNA helicase. J Biol Chem 261:
4738–4748.
Lee H, Kang S, Bae SH, Choi BS & Hwang DS (2001a) SeqA
protein aggregation is necessary for SeqA function. J Biol
Chem 276: 34600–34606.
Lee YS, Han JS, Jeon Y & Hwang DS (2001b) The arc twocomponent signal transduction system inhibits in vitro
Escherichia coli chromosomal initiation. J Biol Chem 276:
9917–9923.
Leonard AC & Grimwade JE (2005) Building a bacterial
orisome: emergence of new regulatory features
for replication origin unwinding. Mol Microbiol 55: 978–
985.
Levchenko I & Filutowicz M (1996) Initiator protein pi can
bind independently to two domains of the gamma origin
core of plasmid R6K: the direct repeats and the A+T-rich
segment. Nucleic Acids Res 24: 1936–1942.
Lobry JR (1996a) Asymmetric substitution patterns in the two
DNA strands of bacteria. Mol Biol Evol 13: 660–665.
Lobry JR (1996b) A simple vectorial representation of DNA
sequences for the detection of replication origins in bacteria.
Biochimie 78: 323–326.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
Lopez P, Philippe H, Myllykallio H & Forterre P (1999)
Identification of putative chromosomal origins of
replication in Archaea. Mol Microbiol 32: 883–886.
Lopez P, Forterre P, le Guyader H & Philippe H (2000)
Origin of replication of Thermotoga maritima. Trends Genet
16: 59–60.
Lu M, Campbell JL, Boye E & Kleckner N (1994) SeqA: a
negative modulator of replication initiation in E. coli. Cell
77: 413–426.
Lu Y, Datta HJ & Bastia D (1998) Mechanistic studies of
initiator-initiator interaction and replication initiation.
EMBO J 17: 5192–5200.
Lundgren M, Andersson A, Chen L, Nilsson P & Bernander R
(2004) Three replication origins in Sulfolobus species:
synchronous initiation of chromosome replication and
asynchronous termination. P Natl Acad Sci USA 101: 7046–
7051.
MacAllister TW, Kelley WL, Miron A, Stenzel TT & Bastia D
(1991) Replication of plasmid R6K origin gamma in vitro.
Dependence on dual initiator proteins and inhibition by
transcription. J Biol Chem 266: 16056–16062.
Mackiewicz P, Gierlik A, Kowalczuk M, Dudek MR & Cebrat
S (1999a) How does replication-associated mutational
pressure influence amino acid composition of proteins?
Genome Res 9: 409–416.
Mackiewicz P, Gierlik A, Kowalczuk M, Dudek MR &
Cebrat S (1999b) Asymmetry of nucleotide composition
of prokaryotic chromosomes. J Appl Genet 40: 1–14.
Mackiewicz P, Zakrzewska-Czerwinska J, Zawilak A, Dudek
MR & Cebrat S (2004) Where does bacterial replication
start? Rules for predicting the oriC region. Nucleic Acids Res
32: 3781–3791.
Manen D & Caro L (1991) The replication of plasmid pSC101.
Mol Microbiol 5: 233–237.
Manen D, Upegui-Gonzalez LC & Caro L (1992) Monomers and
dimers of the RepA protein in plasmid pSC101 replication:
domains in RepA. P Natl Acad Sci USA 89: 8923–8927.
Marczynski GT & Shapiro L (1992) Cell-cycle control of a
cloned chromosomal origin of replication from Caulobacter
crescentus. J Mol Biol 226: 959–977.
Margolin W & Long SR (1993) Isolation and characterization
of a DNA replication origin from the 1,700-kilobase-pair
symbiotic megaplasmid pSym-b of Rhizobium meliloti.
J Bacteriol 175: 6553–6561.
Margulies C & Kaguni JM (1996) Ordered and sequential
binding of DnaA protein to oriC, the chromosomal origin
of Escherichia coli. J Biol Chem 271: 17035–17040.
Marszalek J & Kaguni JM (1994) DnaA protein directs the
binding of DnaB protein in initiation of DNA replication in
Escherichia coli. J Biol Chem 269: 4883–4890.
Masson L & Ray DS (1988) Mechanism of autonomous
control of the Escherichia coli F plasmid: purification and
characterization of the repE gene product. Nucleic Acids Res
16: 413–424.
Matsui M, Oka A, Takanami M, Yasuda S & Hirota Y (1985)
Sites of dnaA protein-binding in the replication origin of
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
the Escherichia coli K-12 chromosome. J Mol Biol 184: 529–
533.
Matsunaga F, Kawasaki Y, Ishiai M, Nishikawa K, Yura T &
Wada C (1995) DNA-binding domain of the RepE initiator
protein of mini-F plasmid: involvement of the carboxylterminal region. J Bacteriol 177: 1994–2001.
Matsunaga F, Forterre P, Ishino Y & Myllykallio H (2001)
In vivo interactions of archaeal Cdc6/Orc1 and
minichromosome maintenance proteins with the replication
origin. P Natl Acad Sci USA 98: 11152–11157.
Maurizi MR & Li CC (2001) AAA proteins: in search of a
common molecular basis. International Meeting on
Cellular Functions of AAA Proteins. EMBO Rep 2: 980–
985.
McEachern MJ, Filutowicz M & Helinski DR (1985) Mutations
in direct repeat sequences and in a conserved sequence
adjacent to the repeats result in a defective replication
origin in plasmid R6K. P Natl Acad Sci USA 82: 1480–1484.
McGarry KC, Ryan VT, Grimwade JE & Leonard AC (2004)
Two discriminatory binding sites in the Escherichia coli
replication origin are required for DNA strand opening by
initiator DnaA-ATP. P Natl Acad Sci USA 101: 2811–2816.
McLean MJ, Wolfe KH & Devine KM (1998) Base
composition skews, replication orientation, and gene
orientation in 12 prokaryote genomes. J Mol Evol 47: 691–
696.
Meijer M, Beck E, Hansen FG, Bergmans HE, Messer W,
von Meyenburg K & Schaller H (1979) Nucleotide
sequence of the origin of replication of the Escherichia
coli K-12 chromosome. P Natl Acad Sci USA 76: 580–584.
Mendelson I, Gottesman M & Oppenheim AB (1991) HU and
integration host factor function as auxiliary proteins in
cleavage of phage lambda cohesive ends by terminase.
J Bacteriol 173: 1670–1676.
Messer W (2002) The bacterial replication initiator DnaA.
DnaA and oriC, the bacterial mode to initiate DNA
replication. FEMS Microbiol Rev 26: 355–374.
Messer W, Meijer M, Bergmans HE, Hansen FG, von
Meyenburg K, Beck E & Schaller H (1979) Origin of
replication, oriC, of the Escherichia coli K12 chromosome:
nucleotide sequence. Cold Spring Harb Symp Quant Biol 43
(Pt 1): 139–145.
Messer W, Blaesing F, Jakimowicz D et al. (2001) Bacterial
replication initiator DnaA. Rules for DnaA binding and
roles of DnaA in origin unwinding and helicase loading.
Biochimie 83: 5–12.
Meyer RR & Laine PS (1990) The single-stranded DNA-binding
protein of Escherichia coli. Microbiol Rev 54: 342–380.
Miller C, Ingmer H, Thomsen LE, Skarstad K & Cohen SN
(2003) DpiA binding to the replication origin of Escherichia
coli plasmids and chromosomes destabilizes plasmid
inheritance and induces the bacterial SOS response.
J Bacteriol 185: 6025–6031.
Moriya S, Ogasawara N & Yoshikawa H (1985) Structure and
function of the region of the replication origin of the
Bacillus subtilis chromosome. III. Nucleotide sequence of
FEMS Microbiol Rev 36 (2012) 408–434
431
some 10,000 base pairs in the origin region. Nucleic Acids
Res 13: 2251–2265.
Moriya S, Atlung T, Hansen FG, Yoshikawa H & Ogasawara N
(1992) Cloning of an autonomously replicating sequence
(ars) from the Bacillus subtilis chromosome. Mol Microbiol
6: 309–315.
Moriya S, Firshein W, Yoshikawa H & Ogasawara N (1994)
Replication of a Bacillus subtilis oriC plasmid in vitro. Mol
Microbiol 12: 469–478.
Mott ML & Berger JM (2007) DNA replication initiation:
mechanisms and regulation in bacteria. Nat Rev Microbiol 5:
343–354.
Mott ML, Erzberger JP, Coons MM & Berger JM (2008)
Structural synergy and molecular crosstalk between bacterial
helicase loaders and replication initiators. Cell 135: 623–634.
Mrazek J & Karlin S (1998) Strand compositional asymmetry
in bacterial and large viral genomes. P Natl Acad Sci USA
95: 3720–3725.
Mukherjee S, Patel I & Bastia D (1985) Conformational
changes in a replication origin induced by an initiator
protein. Cell 43: 189–197.
Mukherjee S, Erickson H & Bastia D (1988) Enhancer-origin
interaction in plasmid R6K involves a DNA loop mediated
by initiator protein. Cell 52: 375–383.
Mukhopadhyay G, Carr KM, Kaguni JM & Chattoraj DK
(1993) Open-complex formation by the host initiator,
DnaA, at the origin of P1 plasmid replication. EMBO J 12:
4547–4554.
Murakami Y, Ohmori H, Yura T & Nagata T (1987)
Requirement of the Escherichia coli dnaA gene function for
ori-2-dependent mini-F plasmid replication. J Bacteriol 169:
1724–1730.
Murakami A, Sugiura S & Yamaguchi K (1997) DNA bending
of pSC101 ori induced by Rep, a replication initiator
protein and IHF. J Gen Appl Microbiol 43: 189–197.
Murotsu T, Matsubara K, Sugisaki H & Takanami M (1981)
Nine unique repeating sequences in a region essential for
replication and incompatibility of the mini-F plasmid. Gene
15: 257–271.
Mushegian AR & Koonin EV (1996) Gene order is not
conserved in bacterial evolution. Trends Genet 12: 289–290.
Myllykallio H, Lopez P, Lopez-Garcia P et al. (2000) Bacterial
mode of replication with eukaryotic-like machinery in a
hyperthermophilic archaeon. Science 288: 2212–2215.
Neuwald AF, Aravind L, Spouge JL & Koonin EV (1999)
AAA+: a class of chaperone-like ATPases associated with the
assembly, operation, and disassembly of protein complexes.
Genome Res 9: 27–43.
Nieto C, Giraldo R, Fernandez-Tresguerres E & Diaz R (1992)
Genetic and functional analysis of the basic replicon of
pPS10, a plasmid specific for Pseudomonas isolated from
Pseudomonas syringae patovar savastanoi. J Mol Biol 223:
415–426.
Nievera C, Torgue JJ, Grimwade JE & Leonard AC (2006)
SeqA blocking of DnaA-oriC interactions ensures staged
assembly of the E. coli pre-RC. Mol Cell 24: 581–592.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
432
Norais C, Hawkins M, Hartman AL, Eisen JA, Myllykallio H &
Allers T (2007) Genetic and physical mapping of DNA
replication origins in Haloferax volcanii. PLoS Genet 3: e77.
Ogura T & Wilkinson AJ (2001) AAA+ superfamily ATPases:
common structure–diverse function. Genes Cells 6: 575–597.
Ogura T, Niki H, Kano Y, Imamoto F & Hiraga S (1990)
Maintenance of plasmids in HU and IHF mutants of
Escherichia coli. Mol Gen Genet 220: 197–203.
Ohkubo S & Yamaguchi K (1995) Two enhancer elements for
DNA replication of pSC101, par and a palindromic binding
sequence of the Rep protein. J Bacteriol 177: 558–565.
Oka A, Sugimoto K, Takanami M & Hirota Y (1980)
Replication origin of the Escherichia coli K-12 chromosome:
the size and structure of the minimum DNA segment
carrying the information for autonomous replication. Mol
Gen Genet 178: 9–20.
Ortega-Jimenez S, Giraldo-Suarez R, Fernandez-Tresguerres
ME, Berzal-Herranz A & Diaz-Orejas R (1992) DnaA
dependent replication of plasmid R1 occurs in the presence
of point mutations that disrupt the dnaA box of oriR.
Nucleic Acids Res 20: 2547–2551.
Ozaki S, Fujimitsu K, Kurumizaka H & Katayama T (2006)
The DnaA homolog of the hyperthermophilic eubacterium
Thermotoga maritima forms an open complex with a
minimal 149-bp origin region in an ATP-dependent
manner. Genes Cells 11: 425–438.
Ozaki S, Kawakami H, Nakamura K et al. (2008) A common
mechanism for the ATP-DnaA-dependent formation of
open complexes at the replication origin. J Biol Chem 283:
8351–8362.
Pacek M, Konopa G & Konieczny I (2001) DnaA box
sequences as the site for helicase delivery during plasmid
RK2 replication initiation in Escherichia coli. J Biol Chem
276: 23639–23644.
Papp PP, Chattoraj DK & Schneider TD (1993) Information
analysis of sequences that bind the replication initiator
RepA. J Mol Biol 233: 219–230.
Papp PP, Mukhopadhyay G & Chattoraj DK (1994) Negative
control of plasmid DNA replication by iterons. Correlation
with initiator binding affinity. J Biol Chem 269: 23563–
23568.
Park K & Chattoraj DK (2001) DnaA boxes in the P1 plasmid
origin: the effect of their position on the directionality of
replication and plasmid copy number. J Mol Biol 310: 69–81.
Park K, Mukkopadhyay S & Chattoraj DK (1998)
Requirements for and regulation of origin opening of
plasmid P1. J Biol Chem 273: 24906–24911.
Park K, Han E, Paulsson J & Chattoraj DK (2001) Origin
pairing (‘handcuffing’) as a mode of negative control of P1
plasmid copy number. EMBO J 20: 7323–7332.
Pearson CE, Zorbas H, Price GB & Zannis-Hadjopoulos M
(1996) Inverted repeats, stem-loops, and cruciforms:
significance for initiation of DNA replication. J Cell Biochem
63: 1–22.
Pelizon C, Diviacco S, Falaschi A & Giacca M (1996) Highresolution mapping of the origin of DNA replication in the
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
hamster dihydrofolate reductase gene domain by
competitive PCR. Mol Cell Biol 16: 5358–5364.
Perri S & Helinski DR (1993) DNA sequence requirements for
interaction of the RK2 replication initiation protein with
plasmid origin repeats. J Biol Chem 268: 3662–3669.
Phadtare S & Inouye M (1999) Sequence-selective interactions
with RNA by CspB, CspC and CspE, members of the CspA
family of Escherichia coli. Mol Microbiol 33: 1004–1014.
Picardeau M, Lobry JR & Hinnebusch BJ (1999) Physical
mapping of an origin of bidirectional replication at the
centre of the Borrelia burgdorferi linear chromosome. Mol
Microbiol 32: 437–445.
Pierechod M, Nowak A, Saari A, Purta E, Bujnicki JM &
Konieczny I (2009) Conformation of a plasmid replication
initiator protein affects its proteolysis by ClpXP system.
Protein Sci 18: 637–649.
Polaczek P, Kwan K & Campbell JL (1998) Unwinding of the
Escherichia coli origin of replication (oriC) can occur in the
absence of initiation proteins but is stabilized by DnaA and
histone-like proteins IHF or HU. Plasmid 39: 77–83.
Rajewska M, Kowalczyk L, Konopa G & Konieczny I (2008)
Specific mutations within the AT-rich region of a plasmid
replication origin affect either origin opening or helicase
loading. P Natl Acad Sci USA 105: 11134–11139.
Rampakakis E, Gkogkas C, Di Paola D & Zannis-Hadjopoulos
M (2010) Replication initiation and DNA topology: the
twisted life of the origin. J Cell Biochem 110: 35–43.
Riber L, Olsson JA, Jensen RB, Skovgaard O, Dasgupta S,
Marinus MG & Lobner-Olesen A (2006) Hda-mediated
inactivation of the DnaA protein and dnaA gene
autoregulation act in concert to ensure homeostatic
maintenance of the Escherichia coli chromosome. Genes Dev
20: 2121–2134.
Richter S, Hess WR, Krause M & Messer W (1998) Unique
organization of the dnaA region from Prochlorococcus
marinus CCMP1375, a marine cyanobacterium. Mol Gen
Genet 257: 534–541.
Robinson NP, Dionne I, Lundgren M, Marsh VL, Bernander R
& Bell SD (2004) Identification of two origins of replication
in the single chromosome of the archaeon Sulfolobus
solfataricus. Cell 116: 25–38.
Rocha EP, Danchin A & Viari A (1999) Universal replication
biases in bacteria. Mol Microbiol 32: 11–16.
Romero J & Lee H (2008) One-way PCR-based mapping of a
replication initiation point (RIP). Nat Protoc 3: 1729–1735.
Roten CA, Gamba P, Barblan JL & Karamata D (2002)
Comparative Genometrics (CG): a database dedicated to
biometric comparisons of whole genomes. Nucleic Acids Res
30: 142–144.
Rowen L & Kornberg A (1978) Primase, the dnaG protein of
Escherichia coli. An enzyme which starts DNA chains. J Biol
Chem 253: 758–764.
Ryan VT, Grimwade JE, Nievera CJ & Leonard AC (2002) IHF
and HU stimulate assembly of pre-replication complexes at
Escherichia coli oriC by two different mechanisms. Mol
Microbiol 46: 113–124.
FEMS Microbiol Rev 36 (2012) 408–434
AT-rich region and the repeated sequences in DNA replication
Ryan VT, Grimwade JE, Camara JE, Crooke E & Leonard AC
(2004) Escherichia coli prereplication complex assembly is
regulated by dynamic interplay among Fis, IHF and DnaA.
Mol Microbiol 51: 1347–1359.
Saha A, Haralalka S & Bhadra RK (2004) A naturally
occurring point mutation in the 13-mer R repeat affects the
oriC function of the large chromosome of Vibrio cholerae
O1 classical biotype. Arch Microbiol 182: 421–427.
Salzberg SL, Salzberg AJ, Kerlavage AR & Tomb JF (1998)
Skewed oligomers and origins of replication. Gene 217: 57–67.
Schaefer C & Messer W (1991) DnaA protein/DNA
interaction. Modulation of the recognition sequence. Mol
Gen Genet 226: 34–40.
Schaper S & Messer W (1995) Interaction of the initiator
protein DnaA of Escherichia coli with its DNA target. J Biol
Chem 270: 17622–17626.
Schnos M, Zahn K, Inman RB & Blattner FR (1988) Initiation
protein induced helix destabilization at the lambda origin: a
prepriming step in DNA replication. Cell 52: 385–395.
Schvartzman JB, Martinez-Robles ML, Hernandez P & Krimer
DB (2010) Plasmid DNA replication and topology as
visualized by two-dimensional agarose gel electrophoresis.
Plasmid 63: 1–10.
Seitz H, Weigel C & Messer W (2000) The interaction
domains of the DnaA and DnaB replication proteins of
Escherichia coli. Mol Microbiol 37: 1270–1279.
Sekimizu K, Bramhill D & Kornberg A (1988) Sequential early
stages in the in vitro initiation of replication at the origin of
the Escherichia coli chromosome. J Biol Chem 263: 7124–7130.
Shaheen SM, Ouimet MC & Marczynski GT (2009)
Comparative analysis of Caulobacter chromosome
replication origins. Microbiology 155: 1215–1225.
Sharma S, Sathyanarayana BK, Bird JG, Hoskins JR, Lee B &
Wickner S (2004) Plasmid P1 RepA is homologous to the
F plasmid RepE class of initiators. J Biol Chem 279: 6027–
6034.
Shon M, Germino J & Bastia D (1982) The nucleotide
sequence of the replication origin beta of the plasmid R6K.
J Biol Chem 257: 13823–13827.
Sibley CD, MacLellan SR & Finan T (2006) The Sinorhizobium
meliloti chromosomal origin of replication. Microbiology
152: 443–455.
Skarstad K, Thony B, Hwang DS & Kornberg A (1993) A
novel binding protein of the origin of the Escherichia coli
chromosome. J Biol Chem 268: 5365–5370.
Slater S, Wold S, Lu M, Boye E, Skarstad K & Kleckner N
(1995) E. coli SeqA protein binds oriC in two different
methyl-modulated reactions appropriate to its roles in DNA
replication initiation and origin sequestration. Cell 82: 927–
936.
Speck C & Messer W (2001) Mechanism of origin unwinding:
sequential binding of DnaA to double- and single-stranded
DNA. EMBO J 20: 1469–1476.
Speck C, Weigel C & Messer W (1999) ATP- and ADP-dnaA
protein, a molecular switch in gene regulation. EMBO J 18:
6169–6176.
FEMS Microbiol Rev 36 (2012) 408–434
433
Stalker DM, Kolter R & Helinski DR (1979) Nucleotide
sequence of the region of an origin of replication of the
antibiotic resistance plasmid R6K. P Natl Acad Sci USA 76:
1150–1154.
Stalker DM, Thomas CM & Helinski DR (1981) Nucleotide
sequence of the region of the origin of replication of the
broad host range plasmid RK2. Mol Gen Genet 181: 8–12.
Stalker DM, Kolter R & Helinski DR (1982) Plasmid R6K
DNA replication. I. Complete nucleotide sequence of an
autonomously replicating segment. J Mol Biol 161: 33–43.
Stenzel TT, MacAllister T & Bastia D (1991) Cooperativity
at a distance promoted by the combined action of two
replication initiator proteins and a DNA bending protein
at the replication origin of pSC101. Genes Dev 5: 1453–
1463.
Stinchcomb DT, Struhl K & Davis RW (1979) Isolation and
characterisation of a yeast chromosomal replicator. Nature
282: 39–43.
Sugimoto K, Oka A, Sugisaki H, Takanami M, Nishimura A,
Yasuda Y & Hirota Y (1979) Nucleotide sequence of
Escherichia coli K-12 replication origin. P Natl Acad Sci USA
76: 575–579.
Sugiura S, Ohkubo S & Yamaguchi K (1993) Minimal essential
origin of plasmid pSC101 replication: requirement of a
region downstream of iterons. J Bacteriol 175: 5993–6001.
Suhan M, Chen SY, Thompson HA, Hoover TA, Hill A &
Williams JC (1994) Cloning and characterization of an
autonomous replication sequence from Coxiella burnetii.
J Bacteriol 176: 5233–5243.
Sutton MD, Carr KM, Vicente M & Kaguni JM (1998)
Escherichia coli DnaA protein. The N-terminal domain and
loading of DnaB helicase at the E. coli chromosomal origin.
J Biol Chem 273: 34255–34262.
Taghbalout A, Landoulsi A, Kern R et al. (2000) Competition
between the replication initiator DnaA and the sequestration
factor SeqA for binding to the hemimethylated
chromosomal origin of E. coli in vitro. Genes Cells 5: 873–
884.
Thony B, Hwang DS, Fradkin L & Kornberg A (1991) iciA, an
Escherichia coli gene encoding a specific inhibitor of
chromosomal initiation of replication in vitro. P Natl Acad
Sci USA 88: 4066–4070.
Tillier ER & Collins RA (2000) The contributions of
replication orientation, gene direction, and signal sequences
to base-composition asymmetries in bacterial genomes.
J Mol Evol 50: 249–257.
Tomizawa J, Sakakibara Y & Kakefuda T (1974) Replication of
colicin E1 plasmid DNA in cell extracts. Origin and
direction of replication. P Natl Acad Sci USA 71: 2260–
2264.
Tomizawa JI, Ohmori H & Bird RE (1977) Origin of
replication of colicin E1 plasmid DNA. P Natl Acad Sci USA
74: 1865–1869.
Torheim NK & Skarstad K (1999) Escherichia coli SeqA protein
affects DNA topology and inhibits open complex formation
at oriC. EMBO J 18: 4882–4888.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
434
Wei T & Bernander R (1996) Interaction of the IciA protein
with AT-rich regions in plasmid replication origins. Nucleic
Acids Res 24: 1865–1872.
Weigel C & Seitz H (2002) Strand-specific loading of DnaB
helicase by DnaA to a substrate mimicking unwound oriC.
Mol Microbiol 46: 1149–1156.
Weigel C, Schmidt A, Ruckert B, Lurz R & Messer W (1997)
DnaA protein binding to individual DnaA boxes in the
Escherichia coli replication origin, oriC. EMBO J 16: 6574–
6583.
Wickner S (1977) DNA or RNA priming of bacteriophage G4
DNA synthesis by Escherichia coli dnaG protein. P Natl
Acad Sci USA 74: 2815–2819.
Wickner S & Hurwitz J (1975) Interaction of Escherichia coli
dnaB and dnaC(D) gene products in vitro. P Natl Acad Sci
USA 72: 921–925.
Wickner S, Hoskins J, Chattoraj D & McKenney K (1990)
Deletion analysis of the mini-P1 plasmid origin of
replication and the role of Escherichia coli DnaA protein.
J Biol Chem 265: 11622–11627.
Woelker B & Messer W (1993) The structure of the initiation
complex at the replication origin, oriC, of Escherichia coli.
Nucleic Acids Res 21: 5025–5033.
Wu F, Goldberg I & Filutowicz M (1992) Roles of a 106-bp
origin enhancer and Escherichia coli DnaA protein in
replication of plasmid R6K. Nucleic Acids Res 20: 811–817.
Wu F, Wu J, Ehley J & Filutowicz M (1996) Preponderance of
Fis-binding sites in the R6K gamma origin and the curious
effect of the penicillin resistance marker on replication of
this origin in the absence of Fis. J Bacteriol 178: 4965–4974.
ª 2011 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
M. Rajewska et al.
Yamanaka K, Zheng W, Crooke E, Wang YH & Inouye M
(2001) CspD, a novel DNA replication inhibitor induced
during the stationary phase in Escherichia coli. Mol Microbiol
39: 1572–1584.
Yee TW & Smith DW (1990) Pseudomonas chromosomal
replication origins: a bacterial class distinct from
Escherichia coli-type origins. P Natl Acad Sci USA 87:
1278–1282.
Yoshikawa H & Wake RG (1993) 36. Initiation and
termination of chromosome replication. Bacillus subtilis and
Other Gram-Positive Bacteria: Biochemistry, Physiology, and
Molecular Genetics (Sonenshein AL, Hoch JA & Losick R,
eds), pp. 507–528. ASM, Washington, DC.
You Z, Ishimi Y, Mizuno T, Sugasawa K, Hanaoka F & Masai
H (2003) Thymine-rich single-stranded DNA activates
Mcm4/6/7 helicase on Y-fork and bubble-like substrates.
EMBO J 22: 6148–6160.
Zawilak A, Durrant MC, Jakimowicz P, Backert S &
Zakrzewska-Czerwinska J (2003) DNA binding specificity of
the replication initiator protein, DnaA from Helicobacter
pylori. J Mol Biol 334: 933–947.
Zhang CT & Zhang R (2002) Evaluation of gene-finding
algorithms by a content-balancing accuracy index. J Biomol
Struct Dyn 19: 1045–1052.
Zhang R & Zhang CT (2005) Identification of replication
origins in archaeal genomes based on the Z-curve method.
Archaea 1: 335–346.
Zzaman S, Abhyankar MM & Bastia D (2004) Reconstitution
of F factor DNA replication in vitro with purified proteins.
J Biol Chem 279: 17404–17410.
FEMS Microbiol Rev 36 (2012) 408–434