Standard PDF - Wiley Online Library

Bacterial group II introns: not just splicing
Nicolás Toro, José Ignacio Jiménez-Zurdo & Fernando Manuel Garcı́a-Rodrı́guez
Grupo de Ecologı́a Genética de la Rizosfera, Estación Experimental del Zaidı́n, Consejo Superior de Investigaciones Cientı́ficas, Granada, Spain
Correspondence: Nicolás Toro, Grupo de
Ecologı́a Genética de la Rizosfera, Estación
Experimental del Zaidı́n, Consejo Superior de
Investigaciones Cientı́ficas, Profesor Albareda
1, 18008 Granada, Spain. Tel.:
134 958181600; fax: 134 958129600;
e-mail: [email protected]
Received 30 November 2006; revised 12
January 2007; accepted 12 January 2007.
First published online March 2007.
DOI:10.1111/j.1574-6976.2007.00068.x
Abstract
Group II introns are both catalytic RNAs (ribozymes) and mobile retroelements
that were discovered almost 14 years ago. It has been suggested that eukaryotic
mRNA introns might have originated from the group II introns present in the
alphaproteobacterial progenitor of the mitochondria. Bacterial group II introns are
of considerable interest not only because of their evolutionary significance, but
also because they could potentially be used as tools for genetic manipulation in
biotechnology and for gene therapy. This review summarizes what is known about
the splicing mechanisms and mobility of bacterial group II introns, and describes
the recent development of group II intron-based gene-targetting methods.
Bacterial group II intron diversity, evolutionary relationships, and behaviour in
bacteria are also discussed.
Editor: Rafael Giraldo
Keywords
catalytic RNA; homing; splicing; retroelements;
Lactococcus lactis ; Sinorhizobium meliloti .
Introduction: group II introns
Group II introns are large catalytic RNAs (ribozymes) and
mobile retroelements [reviewed by Pyle & Lambowitz
(2006)] that splice by means of a lariat intermediate, in a
mechanism similar to that of spliceosomal introns. The
transfer of these introns either to intronless alleles (retrohoming) or to ectopic sites (retrotransposition) is mediated
by ribonucleoprotein (RNP) complexes consisting of the
intron-encoded protein (IEP) and the excised intron RNA
lariat. Group II introns were initially identified in the
mitochondrial and chloroplast genomes of lower eukaryotes
and plants (Michel et al., 1989). They were later identified in
bacteria (Ferat & Michel, 1993), and have recently been
found in the archaeal genus Methanosarcinales (Toro, 2003;
Dai & Zimmerly, 2003; Rest & Mindell, 2003). It is thought
that both nuclear spliceosomal introns and non-Long Terminal Repeat (LTR) retrotransposons evolved from mobile
group II introns [Sharp, 1985; Cech, 1986; Cavalier-Smith,
1991; Eickbush, 1994; Zimmerly et al., 1995; recently
reviewed by Koonin (2006)]. According to this hypothesis,
group II introns originated in bacteria and invaded the
nucleus of a primitive eukaryote, possibly from the alphaproteobacterial progenitor of the mitochondria; they were
then fragmented to form the spliceosome (Cavalier-Smith,
1991; Stoltzfus, 1999). It has recently been suggested that the
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
spread of group II introns and their decay into spliceosomal
introns created a strong selective pressure, necessitating
compartmentalization of the nucleus and cytoplasm (Martin & Koonin, 2006). In addition to their possible evolutionary significance in eukaryogenesis, mobile group II
introns are of interest because they have been used to
develop a new type of gene-targetting vector (Lambowitz &
Zimmerly, 2004; Lambowitz et al., 2005), with potential
applications in biotechnology and medicine.
A typical group II intron (Fig. 1) consists of a highly
structured RNA with six distinct double-helical domains (DI
to DVI) and an internally encoded (ORF within DIV)
reverse transcriptase (RT) maturase. The IEP is required for
in vivo folding of the intron RNA into a catalytically active
structure (Michel & Ferat, 1995). Unlike organellar introns,
most bacterial group II introns have an IEP (Lambowtiz &
Zimmerly, 2004). The IEPs of these introns (Fig. 1) have
four conserved domains: an N-terminal RT domain; domain X – a putative RNA-binding domain associated with
RNA splicing or maturase activity-, a C-terminal DNAbinding domain (D); and a DNA-endonuclease domain
(En). More than half of the bacterial group II IEPs annotated
to date lack the En domain, and many also lack the D domain
(Martı́nez-Abarca & Toro, 2000b; Zimmerly et al., 2001;
Belfort et al., 2002; Toro, 2003; Lambowitz & Zimmerly,
2004). It has been suggested that all current group II introns
FEMS Microbiol Rev 31 (2007) 342–358
343
Bacterial group II introns
Fig. 1. Generic secondary structure of group II intron RNA and domain structure of the intron-encoded proteins (IEP). The model shows the ribozyme
domains (DI–DVI), the EBS/IBS elements, the bulged adenosine within DVI, and the sequences involved in tertiary interactions (Greek letters) operating in
splicing as described in the text. Major differences between IIA and IIB introns are indicated in brackets. Note that IIC introns lack the EBS2 element.
Boxed are the schematics (drawn to scale) of the Ll.LtrB (IIA), RmInt1 (IIB) and GBSi1 (IIC) IEPs encoded within intron DIV, with indication of the predicted
reverse transcriptase (RT), maturase (X), variable DNA-binding region (D) and conserved DNA endonuclease (En) domains. Numbers above the RT
domain identify conserved amino acid sequence blocks characteristic of RTs. The question mark denotes a functionally uncharacterized C-terminal
extension of 20 amino acid residues in the RmInt1 IEP.
are descended from an RT-encoding group II intron in
bacteria (Toor et al., 2001). Three main phylogenetic subclasses (IIA, IIB and IIC) of group II introns have been
described on the basis of IEP and conserved intron RNA
structures (Michel et al., 1989; Toor et al., 2001; Zimmerly
et al., 2001; Ferat et al., 2003; Toro, 2003), with the IIC
subclass unique to bacteria. Bacterial group II introns differ
markedly from organellar introns in not generally being
located in conserved genes and in being mostly associated
with mobile DNAs or located within intergenic regions (Dai
& Zimmerly, 2002; Toro, 2003).
Group II introns, which may predate prokaryotes
(Koonin, 2006), are still present in many bacteria. So, what
role do group II introns play in the bacterial genome?
Although they are generally seen as ‘selfish’ elements,
studies of group II introns are continually discovering
FEMS Microbiol Rev 31 (2007) 342–358
surprising features such as the association of some lineages
with specific genetic signals or with the host replication
machinery for mobility.
Bacterial group II intron splicing
Group II introns splice by means of a lariat intermediate,
in a mechanism resembling that of spliceosomal introns.
Intron excision from eukaryotic mRNA requires a
set of host-cell snRNA molecules and proteins, whereas
the catalytic functions involved in group II intron splicing
reside within the RNA molecule itself. Nevertheless,
the folding of the intron RNA into the catalytically
active structure in vivo depends on the maturase activity
of the IEP encoded within intron DIV, outside the catalytic
core of the ribozyme.
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
344
Self-splicing mechanisms and products
Several bacterial group II introns have been found to selfsplice in the absence of the IEP when incubated under
nonphysiological high-salt conditions in vitro: Cal.x1 from
Calothrix (Ferat & Michel, 1993); IntB from Escherichia coli
(Ferat et al., 1994); Ll.LtrB from Lactococcus (Matsuura
et al., 1997); AV1 (Avi.groEL) and AV2 from Azotobacter
(Adamidi et al., 2003; Ferat et al., 2003; Kosaraju et al.,
2005); GBSi1 from Streptococcus (Granlund et al., 2001);
B.a.I1, B.a.I2 and B.hI1 from Bacillus (Robart et al., 2004;
Tourasse et al., 2005; Toor et al., 2006); Gk.Int1 from
Geobacillus (Chee & Takami, 2005); and RmInt1 from
Sinorhizobium (Costa et al., 2006a). However, bacterial
group II ribozymes are generally only moderately reactive
and display slow kinetics in vitro. The models chosen for
investigations of group II intron catalysis and splicing
mechanisms have therefore tended to be the more efficient
large mitochondrial group II ribozymes of the two major
group II intron subclasses, IIA and IIB, from eukaryotic
organisms (Costa et al., 1997, 2000; Su et al., 2001).
The results of these studies suggest that the pathway
generally used for group II intron splicing involves two
sequential transesterification reactions (Fig. 2a). The first
involves a nucleophilic attack on the 5 0 splice junction by the
2 0 -OH of a bulged nucleotide, typically an adenosine residue
(bulging A) located in DVI near the 3 0 end of the intron,
releasing the 5 0 exon and generating an intron-3 0 exon
branched intermediate. The second reaction involves a
nucleophilic attack on the 3 0 splice site by the free 3 0 -OH of
the last nucleotide of the 5 0 -exon, yielding the ligated exons
and the intron RNA lariat with a 2 0 –5 0 phosphodiester bond
as the major splicing products. Alternatively, the splicing
reaction may be initiated by hydrolysis (Fig. 2b), followed by
the second step, resulting in the excision of the intron as a
linear molecule rather than a lariat. Most of the self-splicing
Nicolás Toro et al.
bacterial group II introns identified to date display both
excision pathways in vitro. The exceptions are some bacterial
class C introns such as GBSi1, identified in some Streptococcus B isolates, which splice solely via a linear intermediate
molecule (Granlund et al., 2001). In addition, it has been
reported that yeast intron aI5g can be also excised as a true
circular form in vitro (Fig. 2c). The circle seems to result
from the formation of a 2 0 –5 0 phosphodiester bond between
the last and first residues of the intron, and it has been
proposed that this reaction requires the 3 0 exon to have been
released first from precursor molecules by a trans-splicing
reaction triggered by 5 0 exon molecules previously generated
by the so-called spliced exon reopening (SER) reaction.
Subsequent to trans splicing, the 2 0 -OH of the terminal
intron residue will attack the 5 0 splice site, releasing the
5 0 exon and an intron circle (Murray et al., 2001).
Group II ribozyme positioning on the precursor mRNA
substrate and cleavage site fidelity are determined principally by base-pairing interactions between short-sequence
elements located in loop structures of intron DI (exonbinding sites, EBS) and sequence stretches flanking the
splice site (intron-binding sites, IBS). EBS1 and EBS2, each
typically five to seven nucleotides long, recognize their
partner sequences – IBS1 and IBS2 – at the 3 0 end of the 5 0
exon. Interestingly, IIC introns are self-splicing-competent,
despite lacking the EBS2–IBS2 pairing (Granlund et al.,
2001; Toor et al., 2006). The recognition of the 3 0 intron–
exon junction involves two additional base-pair interactions. The first is a single tertiary base-pair interaction
between the last position of the intron (g 0 ) and another
intron nucleotide (g) between DII and DIII. The second
involves the first one or several specific exon and intron
nucleotides. The identity of this IBS for the 3 0 exon can be
used to differentiate between the two major subclasses of
group II introns. In IIA introns, the sequence immediately
upstream from EBS1 (d sequence) base-pairs with one to
Fig. 2. Mechanisms of group II intron splicing through (a) branching, (b) hydrolysis and (c) circles formation. 5 0 and 3 0 exons are indicated with black
and grey boxes, respectively, whereas the line represents the intron RNA. Double arrows indicate the reversibility of some reactions.
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
FEMS Microbiol Rev 31 (2007) 342–358
345
Bacterial group II introns
three nucleotide residues (d 0 position) of the 3 0 exon for
intron splicing (3 nt in the Lactococcus lactis Ll.LtrB intron).
In contrast, most IIB introns display no significant signs of
d–d 0 complementarity. These group II ribozymes recognize
the first nucleotide in the 3 0 exon, referred to as IBS3 rather
than d 0 , by canonical base-pairing with an intron nucleotide,
referred to as EBS3 rather than d, located in the so-called
‘coordination loop’ of DI (Costa et al., 2000; De Lencastre
et al., 2005). The d position is involved in a new tertiary
interaction with another residue (denoted d 0 ), also located
in the coordination loop (Costa et al. 2000). These nucleotides in the loop are involved in aligning the two exons for
the second step of splicing (d–d 0 and IBS3–EBS3 interactions). The three-dimensional arrangement of the active site
of group II ribozymes can be further modelled by a complex
network of tertiary interactions involving the various RNA
domains (Michel & Ferat, 1995; Costa et al., 2000; Swisher
et al., 2001 and references therein).
Thus, 5 0 exon binding through DI interactions and the
conserved DV are involved in the formation of the minimal
catalytic core of the ribozyme, with DII and DIII also
contributing to RNA folding and catalytic efficiency (Fedorova et al., 2003; for a review see Pyle & Lambowitz, 2006).
The branch site of DVI, both exons, the essential catalytic
regions of DV (bulge and AGC triad regions), conserved
nucleotides in the joining region (J) between DII and DIII
(J2/3), and the e–e 0 substructure are all in close proximity
before the first splicing step (De Lencastre et al., 2005). It has
recently been shown that the branch site binds, with specific
polarity, to the coordination loop substructure, thereby
placing the nucleophilic adenosine in a position in which it
can react with the 5 0 splice site (Hamill & Pyle, 2006). The
docking site for DV, a tetraloop receptor motif composed of
the z–z and k–k 0 interactions, is separated from the coordination loop by a short helical stem (reviewed by Lehmann &
Schmidt, 2003). The choice of branch site is strictly controlled
by a molecular caliper, which measures the distance between
the base of DV and the nucleophilic 2 0 -hydroxyl in DVI (Chu
et al., 2001). Thus, DV and the branch-site receptors function
together to position the nucleophile correctly, so as to
promote catalysis (Hamill & Pyle, 2006).
Alternative EBS-IBS interactions, shifted from the
expected EBS-IBS pairings, have also been described in the
bacterial group IIB introns B.a.I2 and RmInt1 (Robart et al.,
2004; Costa et al., 2006a, b). As shown in vitro, the 3 0 splice
site of B.a.I2 is flexible but depends on potential g–g 0 and
IBS3–EBS3 pairings downstream from the wild-type site.
Alternative splice-site usage either creates frame shifts within the exon-encoded ORF or joins the two exonic ORFs in
frame, ultimately affecting the output of host mRNA
translation in vivo (Robart et al., 2004). On the other hand,
it has been suggested that an RmInt1 ribozyme reaction at a
surrogate IBS sequence located 3 0 to the intron might be
FEMS Microbiol Rev 31 (2007) 342–358
responsible for the generation of unconventional self-splicing products along with the ligated exons and the excised
intron. These unconventional self-splicing products include
a 3 0 exon truncated at its 5 0 end, truncated variants of the
linear and lariat forms of the intron –3 0 exon intermediate,
and putative circular molecules derived from this intermediate (Costa et al., 2006a). The self-splicing profiles of
B.a.I2 and RmInt1 may therefore highlight mechanisms
underlying the control of host-gene expression by bacterial
group II introns.
Protein-assisted splicing in vivo
Only a minority of bacterial group II introns have been
unequivocally shown to be functional or to undergo splicing
in vivo: the L. lactis Ll.LtrB (Mills et al., 1996; Shearman
et al., 1996); the Sinorhizobium meliloti RmInt1 (Martı́nezAbarca et al., 1998); a Clostridium difficile intron (Roberts
et al., 2001); the Bacillus anthracis B.a.I2; and the Trichodesmium erythraeum RIR-3 (Robart et al., 2004; Meng et al.,
2005) introns. The best characterized of these introns to date
are Ll.LtrB and RmInt1, as representatives of subclasses IIA
and IIB, respectively. Initial sequencing and genetic analysis
have shown that the splicing of these group II introns in vivo
requires the IEP encoded within intron DIV (Mills et al.,
1996; Shearman et al., 1996; Martı́nez-Abarca et al., 1998;
Muñoz-Adelantado et al., 2003). Bacterial group II intronencoded maturases function more efficiently when
expressed in cis, within the same precursor transcript
encoding the ribozyme core. However, trans-encoded IEPs
can promote the splicing of genomic copies of nonfunctional ORF-less introns, a property that has evolutionary
and biotechnological implications (Cui et al., 2004; Meng
et al., 2005; Nisa-Martı́nez et al., 2007).
Conserved RT and X domains are common to both
Ll.LtrB and RmInt1 IEPs, being the X domains associated
with the splicing-promoting (maturase) activity of the
proteins. However, most of the available information on
the mechanisms involved in IEP-assisted bacterial group II
intron splicing is based on the lactoccocal intron (Matsuura
et al., 1997, 2001; Saldanha et al., 1999; Wank et al., 1999;
Singh et al., 2002; Cui et al., 2004). Ll.LtrB and its encoded
protein, LtrA, are efficiently expressed and functional in
E. coli, facilitating the detailed biochemical characterization
of both the intron RNA and the IEP (Matsuura et al., 1997).
The purified LtrA protein binds to the unspliced precursor
RNA via a rapid bimolecular reaction, followed by a slower
unimolecular step involving a change in RNA conformation
that ultimately promotes Ll.LtrB splicing when assayed
in vitro under low salt concentrations shown to inhibit the
self-splicing reaction (Saldanha et al., 1999). In certain
specific conditions, LtrA also binds nonspecifically to other
group II introns and RNA molecules. However, this
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
346
nonspecific binding is inefficient for RNA splicing, and LtrA
is therefore considered to be an intron-specific splicing
factor (Saldanha et al., 1999). This specificity is based on
protein recognition of a unique structural feature in the
intron subdomain DIVa as a primary high-affinity binding
site; this site contains the Shine–Dalgarno sequence and the
initiation codon of the LtrA ORF (Wank et al., 1999; Singh
et al., 2002). This binding, which also autoregulates LtrA
translation, requires domain X and parts of the RT domain
of the IEP, and extends further into the 5 0 and 3 0 conserved
regions of the catalytic core and DIV through weaker
secondary contacts that are sufficient to support residual,
but inefficient, splicing in vivo when the high-affinity
binding site in DIVa is deleted (Wank et al., 1999; Matsuura
et al., 2001; Singh et al., 2002; Cui et al., 2004). Further
chemical probing and RNA footprinting experiments have
demonstrated that this array of protein–RNA contacts
promotes the generation of a novel set of conserved longrange tertiary interactions in the Ll.LtrB RNA, which are not
stably formed in the absence of the maturase under low-salt
conditions, but which are required to model the active
conformation of the ribozyme core in vivo (Matsuura et al.,
2001). The major protein-assisted splicing products of Ll.LtrB
are the conventional linear and lariat forms of the intron,
along with exons ligated in frame (Saldanha et al., 1999).
The maturase activity of the RmInt1 IEP has not been
biochemically characterized. However, recent studies have
demonstrated that RmInt1 splices in vivo leading to the
formation of the intron lariat form, along with circular
molecules in which the first and last intron residues are
ligated (Molina-Sánchez et al., 2006). Genetic analysis of the
conserved X domain of the RmInt1 IEP not only confirmed
the requirement of the maturase for intron splicing in vivo
but also supported an additional role for the IEP in
controlling the balance between the two intron excision
pathways (Molina-Sánchez et al., 2006). Thus, in vivo the
presence of the IEP seems to ensure cleavage at the correct 3 0
splice site or at position 11 in the 3 0 exon before intron
RNA circle formation. Circular RNAs had previously only
been described as splicing products of group II introns in
eukaryotic cells, in which they were thought to be linked to
cellular senescence (Murray et al., 2001 and references
therein). These novel findings therefore open up new perspectives for investigation of the functional diversity of
bacterial group II ribozymes and their biological significance.
As described above, analyses of the splicing reaction,
using the purified Ll.LtrB-encoded maturase and RNA,
predict a remarkable self-sufficiency of bacterial group II
introns in the promotion of their own splicing. However, it
has also been shown that the efficiency of the in vivo splicing
reaction of the group II intron RmInt1 decreases when the
intron is expressed in a genetic background other than that
of its natural bacterial host, S. meliloti (Martı́nez-Abarca
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
Nicolás Toro et al.
et al., 1998, 1999). Several nuclear gene products have been
identified as RNA chaperones involved in the splicing of
group II introns from yeast, algae or higher plants (Lambowitz & Perlman, 1990; Lambowitz et al., 1999; Lehmann
& Schmidt, 2003; Ostheimer et al., 2003). It remains unclear
whether this is also the case for bacterial group II introns,
and to what extent splicing efficiency depends on host
functions.
Bacterial group II intron mobility
mechanisms
Group II introns act both as large catalytic RNAs and as sitespecific retroelements (more recently reviewed by Lambowitz & Zimmerly, 2004; Pyle & Lambowitz, 2006). After
splicing, the intron RNA lariat and the IEP remain associated, forming a ribonucleoprotein particle (RNP) promoting intron insertion into DNA target sites identical to the
splice site through retrohoming, which is the principal and
most efficient group II intron RNA-based mobility pathway
(Fig. 3). Retrohoming has been unequivocally demonstrated
for the lactococcal Ll.LtrB and rhizobial RmInt1 introns,
using engineered ‘twintron’ shuttle constructs in which the
bacterial group II intron was interrupted by a DNA sequence
encoding a splicing-competent group I intron (tdI). Consistent with a mobility pathway involving an RNA rather
than a DNA intermediate, the group I intron was found to
be absent from the homing products generated in plasmidborne target sites as a result of tdI splicing in the precursor
twintron RNA (Cousineau et al., 1998; Martı́nez-Abarca
et al., 2004). Group II introns can also insert, at a lower
frequency, into noncognate sequences resembling the homing site – a process known as ectopic transposition (retrotransposition) – which contributes to group II intron
dispersal in nature (Cousineau et al., 2000; Martı́nez-Abarca
& Toro, 2000a; Muñoz et al., 2001; Dai & Zimmerly, 2002;
Ichiyanagi et al., 2002).
Of all the bacterial group II introns characterized as
splicing-competent in vivo, only the L. lactis Ll.LtrB and
S. meliloti RmInt1 introns have been shown to be efficient
mobile genetic elements, leading to the use of these elements
as model experimental systems for deciphering the bacterial
group II intron mobility pathways and mechanisms (Lambowitz & Zimmerly, 2004 and references therein).
Retrohoming through target DNA-primed
reverse transcription (TPRT)
The retrohoming pathway of bacterial group II introns was
first investigated for Ll.LtrB, by plasmid-based genetic assays
in vivo, in E. coli or L. lactis. It was then further dissected by
characterization of the biochemical activities of the intronencoded RNPs reconstituted with a purified LtrA expressed
in E. coli and an Ll.LtrB RNA lariat excised from a precursor
FEMS Microbiol Rev 31 (2007) 342–358
347
Bacterial group II introns
Fig. 3. Retrohoming pathways of bacterial group II introns. RNP complexes recognize the target site on double- or single-stranded DNA primarily by the
EBS-IBS (d–d 0 in subgroup IIA introns) pairings, whereas the IEP also binds specifically to key nucleotide residues at distal 5 0 - and 3 0 -exon regions
indicated by dotted lines in the diagram (a). The En-dependent retrohoming mechanism (TPRT) used by the L. lactis Ll.LtrB intron involves DNA
unwinding assisted by the domain D of the IEP, reverse splicing of the intron RNA into the top strand of the target DNA, and antisense strand cleavage
catalyzed by the En domain. The cleaved antisense strand is then used by the IEP (RT activity) as primer to reverse transcribe the reverse-spliced intron
RNA (b). The major pathway used for retrohoming by the Sinorhizobium meliloti RmInt1 intron (c) is linked to DNA replication involving invasion of the
DNA strand that serves as a template for lagging strand synthesis. The reverse transcription of the inserted intron RNA could be primed by RNA primers
synthesised by the primase or partial polymerization of the Okazaki fragments by DNA polymerase III. A second possible minor retrohoming pathway (c)
may occur independently of the target-site strand, and involve alternative priming mechanisms. A cDNA copy of the intron is completely integrated in
the new location by the host repair functions.
transcript in vitro (Matsuura et al., 1997; Cousineau et al.,
1998; Saldanha et al., 1999; Aizawa et al., 2003). Neither the
Ll.LtrB RNA itself nor the purified LtrA protein displays
reverse splicing or endonuclease activities of the RNP
particle (Saldanha et al., 1999). Group II intron RNPs
initially bind DNA nonspecifically and then search for
DNA target sites (Aizawa et al., 2003). Mutagenesis experiments, DNA footprinting, and modification interference
FEMS Microbiol Rev 31 (2007) 342–358
mapping have shown that Ll.LtrB-encoded RNPs recognize
a relatively long DNA target site (Fig. 3a) extending from
position 25 to 19 relative to the intron insertion site
(Guo et al., 2000; Mohr et al., 2000; Karberg et al., 2001;
Singh & Lambowitz, 2001). Retrohoming then occurs by a
target DNA-primed reverse transcription mechanism
(TPRT) involving several sequential reactions (Fig. 3b).
The RNA component of the RNP cleaves the sense-strand
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
348
precisely at the exon junction in the double-stranded DNA
recipient allele, by a reverse splicing reaction that integrates
the intron RNA at the target site, whereas LtrA uses its
endonuclease domain (En) to cleave the antisense-strand at
position 19 relative to the insertion site. The 3 0 end of the
cleaved antisense strand is then targetted to the active site of
the reverse transcriptase, where it is used as a primer for
reverse transcription of the inserted intron RNA by the RT
domain of LtrA, generating a cDNA copy of the intron that
is subsequently integrated into its new location by homologous recombination-independent repair mechanisms
(Cousineau et al., 1998).
The target sequence is recognized principally by the RNA
component of the RNP complex, through EBS2-IBS1, EBS1EBS2 and d-d 0 base-pairing interactions, which, for Ll.LtrB,
involve positions 12 to 8, 6 to 1 and 11 to 13,
respectively. The nucleotide residues within the IBS and d 0
regions contribute to DNA target-site recognition to different extents, but mutations in these nucleotides have an
overall inhibitory effect on the reverse splicing activity of the
reconstituted RNPs and Ll.LtrB homing frequencies in vivo;
nonetheless, both can be restored to wild-type levels by
introducing the corresponding complementary mutations
into the intron RNA (Guo et al., 2000; Mohr et al., 2000).
The variable DNA-binding domain (D) is involved in the
recognition of a subset of key nucleotide residues (positions
T-23, G-21 and A-20) in the distal 5 0 exon region via major
groove interactions bolstered by phosphate-backbone contacts (Singh & Lambowitz, 2001). LtrA may also be involved
in the recognition of distal nucleotides of IBS2, probably on
the complementary strand of the target site, as the restoration of EBS2-IBS2 base-pairing at these positions does not
result in the recovery of wild-type reverse splicing levels or
homing frequencies. Mutations at these key positions in the
distal 5 0 exon region block reverse splicing of the intron
RNA into double-stranded, but not into single-stranded
DNA, substrates. This suggests that the interaction of these
positions with LtrA triggers local DNA unwinding, thereby
placing the intron RNA in a position to base-pair with the
IBS and d 0 sequences for reverse splicing into doublestranded DNA target sites (Mohr et al., 2000). Consistent
with LtrA interaction and base-pairing of the intron RNA
being concerted rather than sequential to promote the
efficient unwinding of DNA at cognate sites, KMnO4
modification experiments revealed that RNPs reconstituted
with intron RNAs with EBS/d mutations preventing basepairing with the DNA target site do not trigger DNA
unwinding. Recent data indicate that before the reverse
splicing of intron RNA, RNPs bend the target DNA by
maintaining initial contacts with the 5 0 exon while engaging
in 3 0 exon interactions, gradually bringing the scissile
phosphate into position for bottom-strand cleavage (Noah
et al., 2006). Second-strand cleavage, catalyzed by the
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
Nicolás Toro et al.
conserved En domain of LtrA, occurs after a time lag, and
requires additional interactions of the protein with fixed
positions in the 3 0 exon, the most critical of which being
position T15, which lies within a single-stranded region
after initial DNA unwinding (Mohr et al., 2000; Singh &
Lambowitz, 2001).
En-independent retrohoming is linked to DNA
replication in bacteria
TPRT-based retrohoming requires the generation of a
primer for reverse transcription of the intron RNA via
second-strand cleavage, catalyzed by the conserved En
domain of the IEP. Phylogenetic analysis based on the
sequence and domain structure of group II intron IEPs, of
both bacterial and eukaryotic origin, assigned a putative
mitochondrial lineage to the Ll.LtrB intron, with most
bacterial introns located elsewhere in the tree. Furthermore,
the C-terminal En domain is found in only c. 40% of the
bacterial group II IEPs currently annotated in databases
(Martı́nez-Abarca & Toro, 2000b; Zimmerly et al., 2001; Dai
& Zimmerly, 2002). The S. meliloti RmInt1 intron is the best
characterized of the group II introns clustering on the
bacterial branch. RmInt1 encodes a protein with conserved
RT and maturase domains and a functionally uncharacterized C-terminal extension of 20 amino acid residues that
appears to be unrelated to the DNA-binding (D) domain of
LtrA (Martı́nez-Abarca et al., 2000; San Filippo & Lambowitz, 2002). Despite the absence of recognizable D and En
domains, RmInt1 retrohomes very efficiently, with 100% of
the intron-containing strains undergoing insertion events at
plasmid-borne target sites (homing frequency), and 20–45%
of target copies within a single cell invaded by the intron
(homing efficiency) (Martı́nez-Abarca et al., 2000; JiménezZurdo et al., 2003). However, experiments with engineered
En mutant Ll.LtrB derivatives and target sites have shown
that Ll.LtrB can retrohome, albeit much less efficiently,
without second-strand cleavage (D’Souza & Zhong, 2002;
Zhong & Lambowitz, 2003). These findings suggest that
there are alternative pathways operating in the mobility of
En bacterial group II introns linked, as described below, to
DNA replication (Muñoz-Adelantado et al., 2003; Zhong &
Lambowitz, 2003; Martı́nez-Abarca et al., 2004).
RmInt1 mobility (Fig. 3a and c) depends on the intron
RNA and the IEP, with the RmInt1-encoded RNP complex
recognizing a DNA target site extending 20 nt into the 5 0
exon and 5 nt into the 3 0 exon, as inferred from genetic
characterization of the intron and target site coupled to
homing assays in vivo (Jiménez-Zurdo et al., 2003; MuñozAdelantado et al., 2003). Subsequent mutational analysis
demonstrated that this site is recognized primarily by the
intron RNA, through the characteristic pairing pattern of
IIB intron splicing: EBS2-IBS2, EBS1-IBS1 and EBS3-IBS3,
FEMS Microbiol Rev 31 (2007) 342–358
349
Bacterial group II introns
involving positions 13 to 9, 7 to 1 and 11 in the
RmInt1 target site, respectively (Jiménez-Zurdo et al., 2003).
Unlike Ll.LtrB, RmInt1 has less stringent requirements for
recognition of the distal 5 0 and 3 0 exon regions, with only
single nucleotide positions (T-15 and G14) required for
wild-type retrohoming (Jiménez-Zurdo et al., 2003). These
positions were initially thought to be recognized by the
RmInt1 IEP, but recent findings suggest that there may be an
alternative intron–exon pairing between an intron segment
overlapping with the EBS2 exon-binding site and a 5 0 exon
site located just distal of IBS2 relative to the splice junction
(IBS2; Costa et al., 2006a, b). Thus, pairing of the first A of
EBS2 with the last T of IBS2 at position 15 of the intron
target site might render the last T of IBS2 critical for
homing. The removal of nucleotides 20 to 16 from the
RmInt1 target site dramatically decreases homing efficiency
(Jiménez-Zurdo et al., 2003). The presumably small number
of IEP–target-site interactions raises questions about the
identity of the IEP domains involved and mechanistic
implications for DNA target-site recognition.
RmInt1-derived RNP particles have RT activity and
promote the reverse splicing of intron RNA into both
single-stranded and double-stranded DNA substrates,
through RNA-mediated cleavage at the intron insertion site.
However, they cannot carry out the En-dependent secondstrand cleavage in double-stranded DNA substrates
(Muñoz-Adelantado et al., 2003). The efficiency of the
reverse splicing reaction is markedly lower with long double-stranded DNA substrates. RmInt1-RNPs have higher
levels of RT activity on exogenous synthetic substrates than
on the endogenous intron RNA template, the reverse
transcription of which in vitro has no obvious priming bias,
suggesting that positioning of the reverse transcriptase on
the precursor RNA facilitates initiation of the RT reaction
various distances downstream from the intron (MuñozAdelantado et al., 2003). According to this biochemical
analysis, the RmInt1-encoded RNP complex should recognize the key nucleotide residues in both single-stranded and
double-stranded DNA target sites. However, the limited
contact with the IEP is insufficient for the promotion of
local DNA unwinding, suggesting that the preferred retrohoming pathway of RmInt1 involves reverse splicing into a
transiently single-stranded DNA target site and that there
may be alternative, uncharacterized priming mechanisms
for reverse transcription of the intron RNA (MuñozAdelantado et al., 2003).
Homing assays in vivo, using recipient plasmids in which
the RmInt1 target site was inserted in both orientations
relative to the origin of replication of the plasmid, have
provided evidence for two distinct retrohoming pathways
for RmInt1 (Fig. 3c). The main mechanism of mobility is
favoured by cell division and involves the reverse splicing of
the intron RNA into single-stranded DNA at DNA replicaFEMS Microbiol Rev 31 (2007) 342–358
tion forks, with a bias towards the DNA strand that serves as
a template for lagging strand synthesis (Martı́nez-Abarca
et al., 2004). This preference suggests that the intron should
be inserted once the replication fork has passed the DNA
target site, avoiding the potentially disruptive passage of the
DNA polymerase complex through the region occupied by
the RNP. Reverse transcription of the intron RNA is then
primed by the nascent lagging strand, using either the RNA
primer synthesized by the primase or partially polymerized
Okazaki fragments. There is a second possible but minor
retrohoming pathway (Fig. 3c), independent of DNA replication, that may involve reverse splicing into either doublestranded or transiently single-stranded DNA target sites and
alternative priming mechanisms, such as random nonspecific DNA nicks (Schäfer et al., 2003), nascent leading strand
(Zhong & Lambowitz, 2003) or de novo initiation priming
(Wang & Lambowitz, 1993). The blocking of Ll.LtrBmediated second-strand cleavage by mutations in the En
LtrA domain or substitutions of key nucleotide residues in
the 3 0 exon revealed a minor retrohoming pathway for the
lactococcal intron, dependent on DNA replication, with
preferential use of the nascent leading strand to prime
reverse transcription – a bias opposite to that for RmInt1
retrohoming (Zhong & Lambowitz, 2003).
Other naturally occurring En group II introns include
those belonging to the IIC subgroup of bacterial introns.
These introns are often found inserted at target sites with
recognizable IBS1 but not IBS2 sequences. The IBS2
sequence is replaced by a palindromic Rho-independent
transcription terminator motif or similar structures (Granlund et al., 2001; Centron & Roy, 2002; Dai & Zimmerly,
2002). Recent data suggest that exon recognition by IIC
introns rely on IBS1 and IBS3 pairings, and putatively on a
new nonpairing interaction with a stem-loop (Toor et al.,
2006).
Dependence of retrohoming on host functions
Group II introns are genetic elements with molecular
features allowing them to maintain themselves and to spread
in a genome. However, their mobility depends, to a variable
extent, on host genetic background, suggesting that retrohoming requires the recruitment of host functions for its
successful completion (Karberg et al., 2001; Toro et al.,
2003). In addition to requiring the replication machinery
during key initial stages, the retrohoming pathway of En
bacterial group II introns requires, at later stages, repair
functions poorly characterized for both endonucleasedependent and -independent retrohoming. A recent study
of Ll.LtrB homing efficiency in various E. coli mutants with
defective DNA/RNA repair functions led to the construction
of a retrohoming completion model including exonucleases
(Recj, MutD and PolI) and RNAses (RNase H) for DNA
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
350
resection and removal of the intron RNA template after
first-strand cDNA synthesis, together with complexes of
DNA polymerases and repair polymerases (PolII, PolIII,
PolIV and PolV) for synthesis and proofreading of the
second strand of the intron cDNA (Smith et al., 2005).
However, further functional characterization is required to
determine unambiguously the role of these genes and
proteins in the retrohoming pathway.
Ectopic transposition
Bacterial group II introns can also retrotranspose, at low
frequency (typically 104 to 105), to nonallelic sites by
means of an RNA intermediate (Cousineau et al., 2000;
Martı́nez-Abarca & Toro, 2000a; Ichiyanagi et al., 2002).
However, retrotransposition events were first demonstrated
for eukaryotic mitochondrial group II introns, the RNPs of
which were shown to promote reverse splicing of the intron
RNA into noncognate DNA target sites (Mueller et al., 1993;
Yang et al., 1998; Dickson et al., 2001).
RNA-based retrotransposition was first investigated in
bacteria using an Ll.LtrB twintron variant carrying the selfsplicing group I td intron and a kanamycin resistance gene
for the selection of mobility events (Cousineau et al., 2000).
In contrast to the situation described for mitochondrial
group II introns, these experiments suggested that there may
be a major En-independent and RecA-dependent Ll.LtrB
retrotransposition pathway in L. lactis, probably involving
reverse splicing of the intron RNA into cellular RNA rather
than into DNA targets (Cousineau et al., 2000). However,
the results of other studies are not consistent with this RNAtargetted mechanism as the preferred general pathway for
the retrotransposition of bacterial group II introns: (1) the
S. meliloti RmInt1 intron displays RecA-independent insertion into an ectopic site within the oxi1 gene (Martı́nezAbarca & Toro, 2000a); (2) many bacterial group II introns
are located in nontranscribed intergenic regions (Martı́nezAbarca & Toro, 2000a; Dai & Zimmerly, 2002); (3) for the
group II introns analysed to date, the IEP has been shown to
have an affinity for DNA, rather than for RNA.
Ll.LtrB retrotransposition was analysed further, using
improved twintron donor constructs incorporating the
kanamycin resistance gene as a retrotransposition indicator
gene (RIG) inserted into intron DIV and interrupted by the
td intron. Retrotransposition events can thus be reliably
detected on the basis of kanamycin resistance once the td
intron has been spliced out of the precursor RNA mobility
intermediate (Ichiyanagi et al., 2002). The pattern of Ll.LtrB
spread within the L. lactis genome, as shown by RIG
analysis, is consistent with intron retrotransposition into
double-stranded or single-stranded DNA targets through a
homologous recombination-independent mechanism, as
described for the mitochondrial and bacterial RmInt1
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
Nicolás Toro et al.
introns. Insertion events were biased toward the use of the
lagging strand as a template. This suggests a possible major
En-independent retrotransposition pathway involving
reverse splicing of the intron RNA into transiently singlestranded DNA target sites at replication forks, and the use of
the nascent lagging strand to prime reverse transcription of
the inserted intron RNA, a mechanism similar to the retrohoming mechanism of En bacterial group II introns
(Ichiyanagi et al., 2002). An analysis of ectopic insertion
sites for bacterial group II introns revealed that bona fide
IBS1 sequences were conserved, whereas the IBS2 element
and key nucleotide positions in the 5 0 and 3 0 exon sequences
recognized by the protein component of the RNP in TPRTbased retrohoming events were not (Cousineau et al., 2000;
Martı́nez-Abarca & Toro, 2000a, b; Dai & Zimmerly, 2002;
Ichiyanagi et al., 2002). However, these target sites retain
essential recognition elements for insertion events independent of second-strand cleavage. Nevertheless, recent studies
have reported a preferential En-dependent Ll.LtrB retrotransposition mechanism in E. coli, suggesting that the
retrotransposition strategies of bacterial group II introns
may be largely influenced by the host (Coros et al., 2005).
During the retrotransposition of the lactoccocal group II
intron Ll.LtrB in E. coli, insertion occurs preferentially
within the Ori and Ter macrodomains of the chromosome.
This bipolar distribution of retrotransposition events results
from the presence of the IEP (LtrA) at the poles of the cell
(Zhao & Lambowitz, 2005; Beauregard et al., 2006).
Bacterial group II intron retrotransposition has been
shown to occur in natural bacterial populations and is
favoured by targets in transmissible genetic elements, such
as plasmids, providing further support for this mobility
mechanism as the major dissemination strategy of bacterial
group II introns in nature (Muñoz et al., 2001; Ichiyanagi
et al., 2003).
Group II introns as biotechnological tools
Derivatives of group II intron Ll.LtrB have been used for
gene disruption in various gram-negative (E. coli, Shigella
flexneri and Salmonella typhimurium; Karberg et al., 2001)
and gram-positive (i.e. Lactococcus lactis, Clostridium
perfringens and Staphylococcus aureus; Chen et al., 2005;
Yao et al., 2006) bacteria.
A number of characteristics render group II introns
suitable for use as biotechnological tools: (1) they are mobile
elements, integrating with high efficiency into their DNA
targets by a homologous recombination-independent process; (2) they recognize the target DNA mainly by base
pairing, so target specificity can be changed by modifying
the EBS sequences within the intron RNA; (3) they can
mobilize foreign genetic information inserted within the
intron; and (4) minimal host functions are required to
FEMS Microbiol Rev 31 (2007) 342–358
351
Bacterial group II introns
support intron mobility and are probably provided by
conserved housekeeping proteins.
Group II introns were first engineered to increase the
frequency and efficiency of mobility (Guo et al., 2000;
Nisa-Martı́nez et al., 2007; Plante & Cousineau, 2006).
Initial modifications to Ll.LtrB included a deletion in the
domain IV loop removing most of the LtrA ORF (DORF
intron), with the intact LtrA ORF cloned and expressed
downstream from the 3 0 exon. This configuration increased
the frequency of mobility to almost 100% (Guo et al., 2000).
Smaller DORF Ll.LtrB intron derivatives were recently
constructed in which LtrA was inserted upstream from the
5 0 exon; these derivatives also had a higher homing efficiency (Plante & Cousineau, 2006). Similar engineering has
been applied to the RmInt1 intron, resulting in a higher
retrohoming efficiency than for the wild-type intron, with
constructs in which the IEP has been deleted from domain
IV and expressed either upstream from the 5 0 exon or
downstream from the 3 0 exon. The retrohoming efficiency
obtained was also higher (reaching 90%) if the constructs
carried short stretches of wild-type 5 0 and 3 0 exons (20/
15) flanking the DORF intron (Nisa-Martı́nez et al., 2007).
The greater mobility of the engineered introns seems to be
the result of greater stability of the intron RNA, probably
owing to a decrease in susceptibility to degradation by host
nucleases (Guo et al., 2000). In these constructs, the IEP
promotes splicing of the DORF intron. However, after
insertion into a new location, splicing of the DORF
intron cannot occur in the absence of the IEP, resulting in
disruption of the target gene. Introns targetted to the
antisense strand lead to unconditional disruption of
the gene. However, if DORF intron derivatives are retargetted to the sense strand, insertion of the intron generates
a conditional disruption, as splicing can take place if the IEP
is expressed in trans (Frazier et al., 2003; Nisa-Martı́nez
et al., 2007).
Another type of conditional gene disruption has also been
achieved with Ll.LtrB. This conditional mutagenesis is based
on the temperature-sensitive splicing displayed by this
group II intron. The essential gene hsa of S. aureus was
disrupted by an Ll.LtrB-DORF derivative in the sense
orientation with respect to gene transcription. It could
therefore be removed by RNA splicing, permitting the
translation of Hsa. As IEP-assisted splicing is temperaturesensitive, hsa mutants can grow at 32 1C, a temperature at
which splicing can take place, but cannot grow at 43 1C, a
temperature at which splicing cannot occur (Yao et al.,
2006).
Thus, it is currently possible to reprogram group II
introns to insert into any desired target sequence. Guo et al.
(2000) have developed a method based on a random intron
library, making it possible to select introns inserting into the
selected target DNA. Two plasmids are used in this system.
FEMS Microbiol Rev 31 (2007) 342–358
The donor plasmid expresses a set of intron molecules from
a DORF-IEP configuration with randomized EBS sequences
that also contain a T7 promoter, replacing the ORF within
DIV. The second plasmid (the recipient) harbours the
selected target sequence upstream from a promoterless tetR
gene. A sequence encoding an E. coli rrnB T1 transcription
terminator capable of terminating transcription by both the
E. coli and T7 RNA polymerases is inserted upstream from
the target site, and an rrnB T2 terminator, which terminates
transcription by the E. coli RNA polymerase, but not by the
T7 RNA polymerase, is inserted between the target site and
the tetR gene. A phage T7 TF terminator is inserted downstream from the tetR gene to act as a terminator for the T7
RNA polymerase. Insertion of the intron carrying the T7
promoter into the target site activates expression of the tetR
gene. This system can be used to select modified introns
from the randomized library that can insert into the chosen
gene. These introns can be further modified to improve
base-pairing with the target DNA. This system was designed
for use in E. coli, but genes from other organisms could be
cloned and used in this system to select the intron disrupting
the cloned gene most efficiently. This technique has been
used to select an intron disrupting the human CCR5 gene
(coreceptor for human HIV-1 virus; Guo et al., 2000). The
CCR5 gene, carried by a plasmid within embryonic kidney
cells, was then disrupted with the selected intron, which was
introduced into the human cells as RNPs.
The retargetting of bacterial group II introns can be
optimized with a computer algorithm, which scans the
target sequence, selects the best position for IEP recognition,
and then designs primers to modify the intron EBS1, EBS2
and d sequences accordingly (Perutka et al., 2004). The IBS1
and IBS2 sequences in the 5 0 exon of the donor plasmid are
also modified to make them complementary to the retargetted EBS sequences for efficient RNA splicing. The IEP
recognizes few enough positions with a sufficient flexibility
for the identification of several target sites in any gene.
Introns integrated into the target DNA can be selected by
colony PCR, or using a selectable marker, such as an
antibiotic resistance gene. The selectable marker strategy is
possible because group II introns can carry foreign DNA
sequences within their RNA core. This heterologous DNA is
carried in DIV of the group II intron, after removal of the
IEP sequence. The presence of this foreign DNA generally
has a negative effect on homing, but the modified introns
nonetheless display detectable homing (Guo et al., 2000;
Ichiyanagi et al., 2002; Frazier et al., 2003; Zhong et al.,
2003). The main concern with conventional antibiotic
resistance genes is that they are expressed from the donor
plasmid independently of homing, which can hinder the
selection of true homing events. The development of the
retrotransposition-activated selectable marker (RAM) system has overcome this problem. The RAM system is based
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
352
on RIG markers, interrupted by a splicing-competent tdI
group I intron (Zhong et al., 2003). The selectable marker is
thus expressed only if retrohoming occurs, facilitating the
monitoring of group II intron dispersal through retrohoming. Almost all resistant, RAM-containing colonies display
disruption of a single target. Subsequent modifications, in
which the TpR gene was flanked by Flp Recombinase Target
sequence (FRT), have adapted the system for multiple gene
disruptions (Broach et al., 1982). The inserted FRT-flanked
RAM marker can thus be excised by expressing the Flp
recombinase, leading to 100% excision. Once the marker has
been excised, a second mutagenesis with the same RAM is
possible. Engineered group II introns carrying both the
RAM-Tp marker and randomized EBSs have been used to
generate a library of E. coli mutants (Zhong et al., 2003).
Most of the intron insertions were found to be located close
to the origin of replication of the chromosome, indicating a
bias favouring this region. However, the library was complex
enough for the detection by PCR of insertions in less
favoured regions. The integrated introns can be isolated
and used to obtain single knockouts in any gene (Yao et al.,
2005).
Bacterial group II intron diversity and
evolution
Group II introns have been found in c. 25% of sequenced
bacterial genomes (Lambowitz & Zimmerly, 2004) and they
are present in the Proteobacteria, Cyanobacteria, Acidobacteria, Actinobacteria, Bacteroides and Firmicutes (gram-positive bacteria) groups (Zimmerly group II-intron database,
http://www.fp.ucalgary.ca/group2introns). Group II introns
were discovered more recently in the Archaea, in which they
Nicolás Toro et al.
have been found only in two closely related methanogenic
species, Methanosarcina mazei and Methanosarcina acetivorans, probably as a result of horizontal transmission from
Eubacteria (Dai & Zimmerly, 2003; Rest & Mindell, 2003;
Toro, 2003). Group II introns are currently classified into
three main phylogenetic subclasses: IIA, IIB and IIC (Michel
et al., 1989; Toor et al., 2001, 2002; Zimmerly et al., 2001;
Ferat et al., 2003; Toro, 2003). Organellar introns from
mitochondria and chloroplasts are most closely related to
bacterial IIA and IIB (IIB1, IIB2) introns, whereas IIC
introns are missing from Cyanobacteria and there are no
close relatives of these introns among organellar introns
(Fig. 4).
The RNA structures of the group II intron ribozyme and
the IEP seem to have coevolved, leading to the ‘retroelement
ancestor hypothesis’, which suggests that all extant group II
introns descended from an RT-encoding group II intron in
bacteria with RNA structural features not conforming to
canonical IIA or IIB structures (Robart & Zimmerly, 2005).
Two different lineages, the canonical IIA and IIB, then
became associated with organelles and the ORF was
repeatedly lost, resulting in the formation of numerous
ORF-less organellar introns (Toor et al., 2001; Lambowitz &
Zimmerly, 2004; Robart & Zimmerly, 2005). ORF-less
introns have also been found in Cyanobacteria (Nakamura
et al., 2002), Bacillus (van der Auwera et al., 2005; Tourasse
et al., 2006) and Archaea (Dai & Zimmerly, 2003),
but they seem to be derived from intron-carrying ORFs.
Interestingly, truncated group II introns are also present in
bacteria (Dai & Zimmerly, 2002; Fernández-López et al.,
2005). Some have a 5 0 truncation that could be accounted
for by incomplete reverse transcription during TPRT.
However, 5 0 truncations account for only a minority of
fragmented introns (Robart & Zimmerly, 2005). Introns
Fig. 4. Phylogenetic relationships between group II intron ORFs and correspondence with RNA structural class. The consensus phylogenetic unrooted
tree shown was obtained using the maximum-parsimony method, and values are for 1000 bootstrap replicates (shown as percentages). The IEP RTs used
for the alignment were the same as previously described (Zimmerly et al., 2001; Toro, 2003). The various mitochondrial group II intron-encoded ORFs
and their accession numbers are as follows: M.p. atpA,1, M.p. atpA,2, M.p. cob,3, M.p. coxI,1, M.p. ORF732, M.p. coxI,2 and M.p. 18S rRNA geneM68929 are from the liverwort Marchantia polymorpha; O.b.nad1,4-M63034 is from the green plant Oenothera berteriana; S.p. cob,1-X54421, K.l.
coxI,1-X57546, S.c. aI1 and S.c. aI2-AJ011856 are from the yeasts Schizosaccharomyces pombe, Kluyveromyces lactis and Saccharomyces cerevisiae,
respectively; N.c. coI,1-S07649 and P.a. coI,1-X55026 are from the fungi Neurospora crassa and Podospora anserina, respectively; P.l.LSU,2.-S58504
and P.l.LSU,1-S58503 are from the brown alga Pylaiella littoralis; P.p. ORF544 and P.p. ORF546-AF114794 are from the red alga Porphyra purpurea. The
chloroplast group II intron-encoded ORFs are S.o. petD,1-P19593 from the green alga Scenedesmus obliquus and B.m. rbcl,1-X55877 from Bryopsis
maxima. The bacterial IEPs are in bold and underlined and named according to Dai & Zimmerly (2002), except for those from the uncultured marine
bacterium (UMB), for which we used its accession number (AAL78688). M.a.I1 (gene name MA4626, AE011185) and M.a.I2 (gene name MA3645,
AE011073) are from Methanosarcina acetivorans str.C2A, and M.m.I1 is from Methanosarcina mazei Goe1 (gene name MM2683, AE013515). RmInt1
(S.me.I1) is from Sinorhizobium meliloti (Y11597), as are S.me.I3 (AL0603646) and S.me.I4 (AJ496462); E.c.I2 is from Escherichia coli (X77508), as are
E.c.I4 (AB024946) and E.c.I5 (AF074613); B.j.I2 (elsewhere B.j.F1) is from Bradyrhizobium japonicum (AF322013), as are B.j.F4/B.j.F5 (AF322012) and
B.j.I1 (AF322013); S.ma.I1 is from Serriata marcescens (AF027768); S.p.I1 is from Streptococcus pneumoniae (AF030367); B.h.I1 is from Bacillus
halodurans (AP01507); P.p.I1 and P.p.I2 are from Pseudomonas putida (AF101076 and Y18999, respectively); P.a.I1 is from Pseudomonas alcaligenes
(AF323437); R.e.I1 is from Ralstonia eutropha (AF261712); C.d.I1 is from Clostridium difficile (X98606); B.me.I1 is from Bacillus megaterium
(AB022308); B.a.I1 and B.a.I2 are from Bacillus anthracis (AF065404) and S.f.I1 is from Shigella flexneri (U97489); C.sp.I1 is from Calothrix sp.
(X71404); L.l.LtrB (L.l.I1) is from Lactococcus lactis; N.a.I1 and N.a.I2 are from Novosphingobium aromaticivorans (AF079317).
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
FEMS Microbiol Rev 31 (2007) 342–358
353
Bacterial group II introns
like RmInt1 tend to be inactivated by fragmentation, with
loss of the 3 0 terminus. In some cases, such inactivation
arises from the insertion of other mobile elements and
further genetic rearrangements (Dai & Zimmerly, 2002;
Fernández-López et al., 2005). The fragmentation of
group II introns, together with the accumulation of mutations and natural selection may well be responsible for
the current status of self-splicing group II introns in
bacteria, with the intrinsically large population size and
genome simplification (genomic streamlining) also playing
important roles (Lynch, 2006).
FEMS Microbiol Rev 31 (2007) 342–358
The IEP of full-length bacterial group II introns generally
contains both the RT and maturase domains, but different
groups of introns can be distinguished on the basis of the
presence or absence of the D and/or En domains. The
absence of the D and En domains may result from their
specific loss in some introns. However, whole intron groups
within the IIB class (Fig. 4), such as the IIB3 (also known as
the D class, Zimmerly et al., 2001) and IIB5 lineages (also
known as the E class, Toro et al., 2002; Lambowitz &
Zimmerly, 2004; Robart & Zimmerly, 2005), seem to lack
the D and En domains entirely (Toro, 2003). The IIC class of
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
354
Nicolás Toro et al.
require further investigation, it is also plausible that the
ancestor of these introns was actually a catalytic RNA intron
functioning in the absence of the RT protein (Fig. 5),
perhaps in extreme environments capable of supporting
such catalytic activity (Lambowitz & Zimmerly, 2004).
The acquisition of the RT-maturase protein may have
enabled group II introns to extend into bacterial populations, in which they survived and spread by inserting into
nonessential genes, such as other mobile elements.
Their mobility would have been linked to the replication
machinery of the host cell, with insertion into singlestranded DNA target sites at a replication fork, using
a nascent lagging DNA strand as the primer for reverse
transcription. Subsequent acquisition of the D and En
domains would then have enabled group II introns to insert
efficiently by reverse splicing into double-stranded DNA
target sites (Lambowitz & Zimmerly, 2004). This acquisition
may have provided group II introns present in the alphaproteobacterium ancestor of the mitochondria, which
invaded an archaeal host during eukaryogenesis (Fig. 5),
with the features required for the massive invasion of the
host-cell genome, leading to the creation of spliceosomal
introns (Koonin, 2006).
Conclusions and perspectives
Fig. 5. Hypothesis for the evolution of group II introns. The model
predicts a catalytic group II intron RNA functioning in the absence of an
RT and its evolution in bacteria by acquisition of the RT-maturase IEP
leading to the ancestor of the three current intron lineages IIA, IIB and IIC
(a). IIA and IIB introns present in the alphaproteobacterium ancestor of
mitochondria would lead to massive invasion of the archaeal host
genome during eukaryogenesis (Koonin, 2006), thus giving rise to
spliceosomal introns (b).
introns (Fig. 4), characterized by the absence of an EBS2
sequence and a shorter ribozyme DV (Toor et al., 2001;
Toro, 2003), also lacks the En domain, but still has a putative
D domain (San Filippo & Lambowitz, 2002). It is widely
accepted that the ancestral RT probably lacked the D and En
domains (Fig. 5) and that their acquisition and loss in some
cases may have contributed to the evolution of the various
bacterial group II intron lineages.
Phylogenetic analyses of IEP and the intron RNA structures (Martı́nez-Abarca & Toro, 2000b; Toor et al., 2001;
Zimmerly et al., 2001; Toro, 2003) have shown the IIB and
IIA classes to be sister lineages and the IIC class to be the
most divergent lineage (Fig. 4). So which of these lineages of
group II introns is the oldest? The data currently available
provide no final answer to this question, but, according to
the retroelement hypothesis, the earliest branching class of
introns is most likely to be class IIC.
Regardless of the evolutionary relationships between the
current subclasses of group II introns in bacteria, which
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
Since the discovery 14 years ago of group II introns in
bacteria, information has accumulated about the architectural and functional organization of group II introns as
catalytic RNAs, the mechanism of interaction between the
IEP and the intron RNA catalytic core leading to RNP
assembly, the mechanisms of intron mobility and DNA
target site recognition, and the development of these retroelements as new gene-targetting vectors. These advances
have provided new insight into the fundamental nature and
applications of these ribozymes.
Improvements in our understanding of the course of
evolution of group II introns in bacteria should also provide
clues to the origin of the spliceosome and the possible role of
these introns in eukaryogenesis. What role do group II
introns play in bacterial genomes? Although some group II
introns are inserted into nonessential regions of the genome,
others are found in essential genes. The current situation
may result from bacterial genome streamlining and
population size thus decreasing intron numbers or leading
to such genetic elements being entirely absent in some
bacterial species. There may also be unknown mechanisms
in bacteria controlling intron expression and mobility.
Despite extensive debate concerning the way in which
introns have played a critical role in eukaryote evolution
by inserting into protein-coding genes, very little is
known about group II intron evolution in bacteria. Progress
in bacterial genome sequencing programs may shed light on
FEMS Microbiol Rev 31 (2007) 342–358
355
Bacterial group II introns
the evolution of bacterial group II introns, revealing lineages
other than the current known subclasses IIA, IIB and IIC,
and possibly on group II intron RNAs functioning in the
absence of the RT protein. This work may involve sequencing metagenomes from extreme environments. Studies on
natural bacterial populations may also improve our understanding of the current contribution of group II introns to
bacterial evolution and the possible functional role of these
introns. The current view of group II introns as selfish
elements maintained throughout prokaryotic evolution
may turn out to be a simplistic assumption based on our
current lack of knowledge of bacterial group II intron
evolution.
Future studies on group II introns will focus not only on
fundamental evolutionary aspects, but also on the development of gene-targetting vectors. The development of such
tools based on other bacterial introns, with specific mechanistic intron mobility features different from those of Ll.LtrB,
should make this technology more broadly applicable. The
mobility mechanism of these bacterial introns does not
depend on recombination. This feature makes these
retroelements potential tools for genetic manipulation in
higher eukaryotes with low levels of recombination, such
as plants and animals. Like the splicing factors identified
for group II introns in organelles, bacterial group II
introns probably use certain host factors that might
contribute to intron RNA folding and mobility. In the next
few years, efforts should be made to identify such factors,
which might increase the functionality of these bacterial
group II introns in various prokaryotic and eukaryotic
hosts.
Studies on the architectural and functional organization
of group II introns will continue to expand our knowledge
about the splicing mechanism of these ribozymes, and will
also contribute to parallel investigations on related elements,
such as spliceosomes and retrotransposons. The excision of
group II introns as circles has been demonstrated to occur
in vivo for yeast introns such as aI2 and for some plant
mitochondrial introns. The recent detection of circular
intron molecules in bacteria indicates that this particular
mode of excision may be more widespread in nature than
initially thought. The splicing mechanism involved in group
II intron circle formation, and the biological role of such
circles in intron spread remain key issues that we can expect
to see resolved in the next few years.
Acknowledgements
Work at the authors’ laboratory was supported by the
Spanish Ministerio de Educación y Ciencia (BIO200502312) and Junta de Andalucı́a (CVI-01522). F.M.G.-R.
holds an I3P postdoctoral fellowship (CSIC), and J.I.J.-Z. is
a Ramón & Cajal hired scientist.
FEMS Microbiol Rev 31 (2007) 342–358
References
Adamidi C, Fedorova O & Pyle AM (2003) A group II intron
inserted into a bacterial heat-shock operon shows
autocatalytic activity and unusual thermostability.
Biochemistry 42: 3409–3418.
Aizawa Y, Xiang Q, Lambowitz A & Pyle AM (2003) The pathway
for DNA recognition and RNA integration by a group II intron
retrotransposon. Mol Cell 11: 795–805.
Beauregard A, Chalamcharla VR, Lyn Piazza C, Belfort M &
Coros CJ (2006) Bipolar localization of the group II intron
Ll.LtrB is maintained in Escherichia coli deficient in nucleoid
condensation, chromosome partitioning and DNA
replication. Mol Microbiol 62: 709–722.
Belfort M, Derbyshire V, Parker MM, Cousineau B & Lambowitz
AM (2002) Mobile introns. Mobile DNA. 2nd edn (Craig N,
Craigie R, Gellert M & Lambowitz AM, eds), pp. 761–783.
ASM Press, Washington, DC.
Broach JR, Guarascio VR & Jayaram M (1982) Recombination
within the yeast 2 mu circle is site-specific. Cell 29: 227–234.
Cavalier-Smith T (1991) Intron phylogeny: a new hypothesis.
Trends Genet 7: 145–148.
Cech TR (1986) The generality of self-splicing RNA: relationship
to nuclear mRNA splicing. Cell 44: 207–210.
Centron D & Roy PH (2002) Presence of a group II intron in a
multiresistant Serratia marcescens strain that harbors three
integrons and a novel gene fusion. Antimicrob Agents
Chemother 45: 1402–1409.
Chee GJ & Takami H (2005) Housekeeping recA gene interrupted
by group II intron in the thermophilic Geobacillus
kaustophilus. Gene 363: 211–220.
Chen Y, McClane BA, Fisher DJ, Rood JI & Gupta P (2005)
Construction of an alpha toxin gene knockout mutant of
Clostridium perfringens type A by use of a mobile group II
intron. Appl Environ Microbiol 71: 7542–7547.
Chu VT, Adamidi C, Liu Q, Perlman PS & Pyle AM (2001)
Control of branch-site choice by a group II intron. EMBO J
20: 6866–6876.
Coros CJ, Landthaler M, Piazza CL, Beauregard A, Esposito D,
Perutka J, Lambowitz AM & Belfort M (2005)
Retrotransposition strategies of the Lactococcus lactis Ll.LtrB
group II intron are dictated by host identity and cellular
environment. Mol Microbiol 56: 509–524.
Costa M, Deme E, Jacquier A & Michel F (1997) Multiple tertiary
interactions involving domain II of group II self-splicing
introns. J Mol Biol 267: 520–536.
Costa M, Michel F & Westhof E (2000) A three-dimensional
perspective on exon binding by a group II self-splicing intron.
EMBO J 19: 5007–5018.
Costa M, Michel F, Molina-Sánchez MD, Martı́nez-Abarca F &
Toro N (2006a) An alternative intron–exon pairing scheme
implied by unexpected in vitro activities of group II intron
RmInt1 from Sinorhizobium meliloti. Biochimie 88: 711–717.
Costa M, Michel F & Toro N (2006b) Potential for alternative
intron–exon pairings in group II intron RmInt1 from
Sinorhizobium meliloti and its relatives. ARN 12: 338–341.
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
356
Cousineau B, Smith D, Lawrence-Cavanagh S, Mueller JE, Yang J,
Mills D, Manias D, Dunny G, Lambowitz A & Belfort M (1998)
Retrohoming of a bacterial group II intron mobility via
complete reverse splicing, independent of homologous DNA
recombination. Cell 94: 451–462.
Cousineau B, Lawrence S, Smith D & Belfort M (2000)
Retrotransposition of a bacterial group II intron. Nature 404:
1018–1021. Erratum (2001) Nature 414: 84.
Cui X, Matsuura M, Wang Q, Ma H & Lambowitz AM (2004) A
group II intron-encoded maturase functions preferentially in
cis and requires both the reverse transcriptase and X domains
to promote RNA splicing. J Mol Biol 340: 211–231.
Dai L & Zimmerly S (2002) Compilation and analysis of group II
intron insertions in bacterial genomes: evidence for
retroelement behaviour. Nucleic Acids Res 30: 1091–1102.
Dai L & Zimmerly S (2003) ORF-less and reverse-transcriptaseencoding group II introns in archeobacteria, with a pattern of
homing into related group II introns ORFs. RNA 8:
1294–1307.
De Lencastre A, Hamill S & Pyle AM (2005) A single-active site
region for a group II intron. Nature Struct Mol Biol 12:
626–627.
Dickson L, Huang HR, Liu L, Matsuura M, Lambowitz AM &
Perlman PS (2001) Retrotransposition of a yeast group II
intron occurs by reverse splicing directly into ectopic DNA
sites. Proc Natl Acad Sci USA 98: 13207–13212.
D’Souza LM & Zhong J (2002) Mutations in the Lactococcus lactis
Ll.LtrB group II intron that retain mobility in vivo. BMC Mol
Biol 3: 17.
Eickbush TH (1994) Origins and evolutionary relationships of
retroelements. The Evolutionary Biology of Viruses (Morse SS,
ed), pp. 121–157. Raven Press, New York.
Fedorova O, Mitros T & Pyle AM (2003) Domains 2 and 3
interact to form critical elements of the group II intron active
site. J Mol Biol 330: 197–209.
Ferat JL & Michel F (1993) Group II self-splicing introns in
bacteria. Nature 364: 358–361.
Ferat JL, Le Gouar M & Michel F (1994) Multiple group II selfsplicing introns in mobile DNA from Escherichia coli. C R Acad
Sci III 317: 141–148.
Ferat JL, Le Gouar M & Michel F (2003) A group II intron has
invaded the genus Azotobacter and inserted within the
termination codon of the essential groEL gene. Mol Microbiol
49: 1407–1423.
Fernández-López M, Muñoz-Adelantado E, Gillis M, Willems A
& Toro N (2005) Dispersal and evolution of the Sinorhizobium
meliloti group II RmInt1 intron in bacteria that interact with
plants. Mol Biol Evol 22: 1518–1528.
Frazier CL, San Filipo J, Lambowitz AM & Mills DA (2003)
Genetic manipulation of Lactococcus lactis by using targeted
group II introns: generation of stable insertions without
selection. Appl Environ Microbiol 69: 1121–1128.
Granlund M, Michel F & Norgren M (2001) Mutually exclusive
distribution of IS1548 and GBSi1, an active group II intron
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
Nicolás Toro et al.
identified in human isolates of group B Streptococci.
J Bacteriol 183: 2560–2569.
Guo H, Karberg M, Long M, Jones JP III, Sullenger B &
Lambowitz AM (2000) Group II introns designed to insert
into therapeutically relevant DNA target sites in human cells.
Science 289: 452–457.
Hamill S & Pyle AM (2006) The receptor for branch-site docking
within a group II intron active site. Mol Cell 23: 831–840.
Ichiyanagi K, Beauregard A, Lawrence S, Smith D, Cousineau B &
Belfort M (2002) Retrotransposition of the Ll.LtrB group II
intron proceeds predominantly via reverse splicing into DNA
targets. Mol Microbiol 46: 1259–1272.
Ichiyanagi K, Beauregard A & Belfort M (2003) A bacterial group
II intron favors retrotransposition into plasmid targets. Proc
Natl Acad Sci USA 100: 15742–15747.
Jiménez-Zurdo JI, Garcı́a-Rodrı́guez FM, Barrientos-Durán A &
Toro N (2003) DNA target site requirements for homing in
vivo of a bacterial group II intron encoding a protein lacking
the DNA endonuclease domain. J Mol Biol 326: 413–423.
Karberg M, Guo H, Zhong J, Coon R, Perutka J & Lambowitz AM
(2001) Group II introns as controllable gene targeting vectors
for genetic manipulation of bacteria. Nat Biotechnol 19:
1162–1167.
Koonin EV (2006) The origins of introns and their role in
eukaryogenesis: a compromise solution to introns-early vs.
introns-late debate? Biol Direct 1: 39.
Kosaraju P, Pulakat L & Gavini N (2005) Analysis of the genome
of Azotobacter vinelandii revealed the presence of two
genetically distinct group II introns on the chromosome.
Genetica 124: 107–115.
Lambowitz AM & Perlman PS (1990) Involvement of aminoacyltRNA synthetases and other proteins in group I and group II
intron splicing. Trends Biochem Sci 15: 440–444.
Lambowitz AM & Zimmerly S (2004) Mobile group II introns.
Ann Rev Genet 38: 1–35.
Lambowitz AM, Caprara MG, Zimmerly S & Perlman PS (1999)
Group I and group II ribozymes as RNPs: clues to the past and
guides to the future. The RNA World. 2nd edn (Gesteland RF,
Cech TR, Atkins JF, eds), pp. 451–485. Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY.
Lambowitz AM, Mohr G & Zimmerly S (2005) Group II intron
homing endonucelases: ribonuceloprotein complexes with
programmable target specificity. Homing Endonucleases and
Inteins (Belfort M, ed), pp. 121–145. Springer-Verlag,
Heidelberg.
Lehmann K & Schmidt U (2003) Group II introns: structural and
catalytic versatility of large natural ribozymes. Crit Rev
Biochem Mol Biol 38: 249–303.
Lynch M (2006) Streamlining and simplification of microbial
genome architecture. Annu Rev Microbiol 60: 327–349.
Martin W & Koonin EV (2006) Introns and the origin of nucleuscytosol compartmentalization. Nature 440: 41–45.
Martı́nez-Abarca F & Toro N (2000a) RecA-independent ectopic
transposition in vivo of a bacterial group II intron. Nucleic
Acids Res 28: 4397–4402.
FEMS Microbiol Rev 31 (2007) 342–358
357
Bacterial group II introns
Martı́nez-Abarca F & Toro N (2000b) Group II introns in the
bacterial world. Mol Microbiol 38: 917–926.
Martı́nez-Abarca F, Zekri S & Toro N (1998) Characterization
and splicing in vivo of a Sinorhizobium meliloti group II intron
associated with particular insertion sequences of the IS630Tc1/IS3 retroposon superfamily. Mol Microbiol 28: 1295–1306.
Martı́nez-Abarca F, Garcı́a-Rodrı́guez F, Muñoz E & Toro N
(1999) Biochemical demonstration of reverse transcriptase
activity and splicing efficiency of bacterial group II intron
from Sinorhizobium meliloti. Nucleic Acids Symp Ser 41:
117–119.
Martı́nez-Abarca F, Garcı́a-Rodrı́guez FM & Toro N (2000)
Homing of a bacterial group II intron with an intron-encoded
protein lacking a recognizable endonuclease domain. Mol
Microbiol 35: 1405–1412.
Martı́nez-Abarca F, Barrientos-Durán A, Fernández-López M &
Toro N (2004) The RmInt1 group II intron has two different
retrohoming pathways for mobility using predominantly the
nascent lagging strand at DNA replication forks for priming.
Nucleic Acids Res 32: 2880–2888.
Matsuura M, Saldanha R, Ma H, Wank H, Yang J, Mohr G,
Cavanagh S, Dunny GM, Belfort M & Lambowitz AM (1997)
A bacterial group II intron encoding reverse transcriptase,
maturase and DNA endonuclease activities: biochemical
demonstration of maturase activity and insertion of new
genetic information within the intron. Genes Dev 11:
2910–2924.
Matsuura M, Noah JW & Lambowitz AM (2001) Mechanism of
maturase-promoted group II intron splicing. EMBO J 20:
7259–7270.
Meng Q, Wang Y & Liu XQ (2005) An intron-encoded protein
assists RNA splicing of multiple similar introns of different
bacterial genes. J Biol Chem 280: 35085–35088.
Michel F & Ferat JL (1995) Structure and activities of group II
introns. Annu Rev Biochem 64: 435–461.
Michel F, Umesomo K & Ozeki H (1989) Comparative and
functional anatomy of group II catalytic introns – a review.
Gene 82: 5–30.
Mills DA, McKay LL & Dunny GM (1996) Splicing of a group II
intron involved in the conjugative transfer of pRS01 in
lactococci. J Bacteriol 178: 3531–3538.
Mohr G, Smith D, Belfort M & Lambowitz AM (2000) Rules for
DNA target-site recognition by a lactococcal group II intron
enable retargeting of the intron to specific DNA sequences.
Genes Dev 14: 559–573.
Molina-Sánchez MD, Martı́nez-Abarca F & Toro N (2006)
Excision of the Sinorhizobium meliloti group II intron RmInt1
as circles in vivo. J Biol Chem 281: 28737–28744.
Mueller MW, Allmaier M, Eskes R & Schweyen RJ (1993)
Transposition of group II intron aI1 in yeast and invasion of
mitochondrial genes at new locations. Nature 366: 174–176.
Muñoz E, Villadas P & Toro N (2001) Ectopic transposition of a
group II intron in natural bacterial populations. Mol Microbiol
41: 645–652.
FEMS Microbiol Rev 31 (2007) 342–358
Muñoz-Adelantado E, San Filippo J, Martı́nez-Abarca F, Garcı́aRodrı́guez FM, Lambowitz AM & Toro N (2003) Mobility of
the Sinorhizobium meliloti group II intron RmInt1 occurs by
reverse splicing into DNA, but requires an unknown reverse
transcriptase priming mechanism. J Mol Biol 327: 931–943.
Murray HL, Mikheeva S, Coljee VW, Turczyk BM, Donahue WF,
Bar-Shalom A & Jarrel KA (2001) Excision of group II introns
as circles. Mol Cell 8: 201–211.
Nakamura Y, Kaneko T, Sato S, et al. (2002) Complete genome
structure of the thermophilic cyanobacterium
Thermosynechococcus elongatus BP-1. DNA Res 9: 123–130.
Nisa-Martı́nez R, Jiménez-Zurdo JI, Martı́nez-Abarca F, MuñozAdelantado E & Toro N (2007) Dispersion of the RmInt1
group II intron in the Sinorhizobium meliloti genome upon
acquisition by conjugative transfer. Nucleic Acids Res 35:
214–222.
Noah JW, Park S, Whitt JT, Perutka J, Frey W & Lambowitz AM
(2006) Atomic force microscopy reveals DNA bending during
group II intron ribonucleoprotein particle integration into
double-stranded DNA. Biochemistry 45: 12424–12435.
Ostheimer GJ, Williams-Carrier R, Belcher S, Osborne E, Gierke J
& Barkan A (2003) Group II intron splicing factors derived by
diversification of an ancient RNA-binding domain. EMBO J
22: 3919–3929.
Perutka J, Wang W, Goerlitz D & Lambowitz AM (2004) Use of
computer-designed group II introns to disrupt Escherichia coli
DexH/D-box protein and DNA helicase genes. J Mol Biol 336:
421–439.
Plante I & Cousineau B (2006) Restriction for gene insertion
within the Lactococcus lactis Ll.LtrB group II intron. RNA. 12:
1980–1992.
Pyle AM & Lambowitz AM (2006) Group II introns: ribozymes
that splice RNA and invade DNA. RNA World. 3rd edn
(Getsland RF, Ceh TR & Atkins JF, eds), pp. 469–505. Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
Rest JS & Mindell DP (2003) Retroids in archaea: phylogeny and
lateral origins. Mol Biol Evol 20: 1134–1142.
Robart AR & Zimmerly S (2005) Group II intron retroelements:
function and diversity. Cytogenet Genome Res 110: 589–597.
Robart AR, Montgomery NK, Smith KL & Zimmerly S (2004)
Principles of 3 0 splice site selection and alternative splicing for
an unusual group II intron from Bacillus anthracis. RNA 10:
854–862.
Roberts AP, Braun V, von Eichel-Streiber C & Mullany P (2001)
Demonstration that the group II intron from the clostridial
conjugative transposon Tn5397 undergoes splicing in vivo.
J Bacteriol 183: 1296–1299.
Saldanha R, Chen B, Wank H, Matsuura M, Edwards J &
Lambowitz AM (1999) RNA and protein catalysis in group II
intron splicing and mobility reactions using purified
components. Biochemistry 38: 9069–9083.
San Filippo J & Lambowitz AM (2002) Characterization of the Cterminal DNA-binding/DNA endonuclease region of a group
II intron-encoded protein. J Mol Biol 324: 933–951.
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
358
Schäfer B, Gan L & Perlman PS (2003) Reverse transcriptase and
reverse splicing activities encoded by the mobile group II
intron cob Il of fission yeast mitochondrial DNA. J Mol Biol
329: 191–206.
Sharp PA (1985) On the origin of RNA splicing and introns. Cell
42: 397–400.
Shearman C, Godon JJ & Gasson M (1996) Splicing of a group II
intron in a functional transfer gene of Lactococcus lactis. Mol
Microbiol 21: 45–53.
Singh NN & Lambowitz AM (2001) Interaction of a group II
intron ribonucleoprotein endonuclease with its DNA target
site investigated by DNA footprinting and modification
interference. J Mol Biol 309: 361–386.
Singh RN, Saldanha RJ, D’Souza LM & Lambowitz AM (2002)
Binding of a group II intron-encoded reverse transcriptase/
maturase to its high affinity intron RNA binding site involves
sequence-specific recognition and autoregulates translation.
J Mol Biol 318: 287–303.
Smith D, Zhong J, Matsuura M, Lambowitz AM & Belfort M
(2005) Recruitment of host functions suggests a repair
pathway for late steps in group II intron retrohoming. Genes
Dev 19: 2477–2487.
Stoltzfus A (1999) On the possibility of constructive neutral
evolution. J Mol Evol 49: 169–181.
Su LJ, Qin PZ, Michels WJ & Pyle AM (2001) Guiding ribozyme
cleavage through motif recognition: the mechanism of
cleavage site selection by a group II intron ribozyme. J Mol Biol
306: 655–668.
Swisher J, Duarte CM, Su LJ & Pyle AM (2001) Visualizing the
solvent-inaccessible core of a group II intron ribozyme. EMBO
J 20: 2051–2061.
Toor N, Hausner G & Zimmerly S (2001) Coevolution of group II
intron RNA structures with their intron-encoded reverse
transcriptases. RNA 7: 1142–1152.
Toor N, Robart AR, Christianson J & Zimmerly S (2006) Selfsplicing of a group IIC intron: 5 0 exon recognition and
alternative 5 0 splicing events implicate the stem-loop motif of
a transcriptional terminator. Nucleic Acids Res 34: 6561–6573.
Toro N (2003) Bacteria and archaea group II introns: additional
mobile elements in the environment. Environ Microbiol 5:
143–151.
Toro N, Molina-Sánchez MD & Fernández-López M (2002)
Identification and characterization of bacterial class E group II
introns. Gene 299: 245–250.
Toro N, Martı́nez-Abarca F, Fernández-López M & MuñozAdelantado E (2003) Diversity of group II introns in the
genome of Sinorhizobium meliloti strain 1021: splicing and
mobility of RmInt1. Mol Genet Genomics 268: 628–636.
2007 Federation of European Microbiological Societies
Published by Blackwell Publishing Ltd. All rights reserved
c
Nicolás Toro et al.
Tourasse NJ, Stabell FB, Reiter L & Kolst AB (2005) Unusual
group II introns in bacteria of the Bacillus cereus group.
J Bacteriol 187: 5437–5451.
Tourasse NJ, Helgason E, Økstad OA, Hegna IK & Kolst AB
(2006) The Bacillus cereus group: novel aspects of population
structure and genome dynamics. J Appl Microbiol 101:
579–593.
van der Auwera GA, Andrup L & Mahillon J (2005) Conjugative
plasmid pAW63 brings new insights into the genesis of
Bacillus anthracis virulence plasmid pX02 and of the
Bacillus thuringiensis plasmid pBT9727. BMC Genomics
6: 103.
Wang H & Lambowitz AM (1993) The Mauriceville plasmid
reverse transcriptase can initiate cDNA synthesis de novo and
may be related to reverse transcriptase and DNA polymerase
progenitor. Cell 75: 1071–1081.
Wank H, San Filippo J, Singh RN, Matsuura M & Lambowitz AM
(1999) A reverse transcriptase/maturase promotes splicing by
binding at its own coding segment in a group II intron RNA.
Mol Cell 4: 239–250.
Yang J, Mohr G, Perlman PS & Lambowitz AM (1998) Group II
intron mobility in yeast mitochondria: target DNA-primed
reverse transcription activity of aI1 and reverse splicing
into DNA transposition sites in vitro. J Mol Biol 282:
505–523.
Yao J, Zhong J & Lambowitz AM (2005) Gene targeting using
randomly inserted group II introns (targetrons) recovered
from Escherichia coli gene disruption library. Nucleic Acids Res
33: 3351–3362.
Yao J, Zhong J, Fang Y, Geisinger E, Novick RP & Lambowitz AM
(2006) Use of targetrons to disrupt essential and nonessential genes in Staphylococcus aureus reveals temperature
sensitivity of Ll.LtrB group II intron splicing. RNA 12:
1271–1281.
Zhao J & Lambowitz AM (2005) A bacterial group II intronencoded reverse transcriptase localizes to cellular poles. Proc
Natl Acad Sci USA 102: 16133–16140.
Zhong J & Lambowitz AM (2003) Group II intron mobility using
nascent strands at DNA replication forks to prime reverse
transcription. EMBO J 22: 4555–4565.
Zhong J, Karberg M & Lambowitz AM (2003) Targeted and
random bacterial gene disruption using a group II intron
(targetron) vector containing a retrotransposition-activated
selectable marker. Nucleic Acids Res 31: 1656–1664.
Zimmerly S, Guo H, Perlman PS & Lambowitz AM (1995) Group
II intron mobility occurs by target DNA-primed reverse
transcription. Cell 82: 545–554.
Zimmerly S, Hausner G & Wu X (2001) Phylogenetic
relationships among group II intron ORFs. Nucleic Acids Res
29: 1238–1250.
FEMS Microbiol Rev 31 (2007) 342–358