Article Minor Fitness Costs in an Experimental Model of Horizontal

Minor Fitness Costs in an Experimental Model of Horizontal
Gene Transfer in Bacteria
Anna Knöppel,y,1 Peter A. Lind,y,z,1 Ulrika Lustig,1 Joakim Näsvall,1 and Dan I. Andersson*,1
1
Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
Present address: New Zealand Institute for Advanced Study, Massey University, Auckland, New Zealand
y
These authors contributed equally to this work.
*Corresponding author: E-mail: [email protected].
Associate editor: Michael Nachman
z
Abstract
Genes introduced by horizontal gene transfer (HGT) from other species constitute a significant portion of many bacterial
genomes, and the evolutionary dynamics of HGTs are important for understanding the spread of antibiotic resistance and
the emergence of new pathogenic strains of bacteria. The fitness effects of the transferred genes largely determine the
fixation rates and the amount of neutral diversity of newly acquired genes in bacterial populations. Comparative analysis
of bacterial genomes provides insight into what genes are commonly transferred, but direct experimental tests of the
fitness constraints on HGT are scarce. Here, we address this paucity of experimental studies by introducing 98 random
DNA fragments varying in size from 0.45 to 5 kb from Bacteroides, Proteus, and human intestinal phage into a defined
position in the Salmonella chromosome and measuring the effects on fitness. Using highly sensitive competition assays,
we found that eight inserts were deleterious with selection coefficients (s) ranging from & 0.007 to 0.02 and 90 did
not have significant fitness effects. When inducing transcription from a PBAD promoter located at one end of the insert, 16
transfers were deleterious and 82 were not significantly different from the control. In conclusion, a major fraction of the
inserts had minor effects on fitness implying that extra DNA transferred by HGT, even though it does not confer an
immediate selective advantage, could be maintained at selection-transfer balance and serve as raw material for the
evolution of novel beneficial functions.
Key words: bacterial evolution, horizontal gene transfer, lateral gene transfer, fitness effects.
Introduction
Article
Horizontal gene transfer (HGT) is a major force in bacterial
evolution (Ochman et al. 2000; Creevey et al. 2011; Baltrus
2013). The percentage of HGT genes vary widely dependent
on lifestyle and genome size, ranging from close to 0% in
symbiotic intracellular organisms with reduced genomes,
such as Buchnera aphidicola, up to 25% in free-living bacteria
with large genomes (Snel et al. 1999; Ochman et al. 2000;
Koonin et al. 2001; Nakamura et al. 2004; Ge et al. 2005).
High levels of HGT have also been linked to a parasitic or
pathogenic lifestyle and are of major importance for the
spread of antibiotic resistance genes (Ambur et al. 2009;
Barlow 2009; Juhas et al. 2009).
Apart from ecological constraints (i.e., likelihood of sharing
growth niches), mechanistic constraints on HGT likely influence the transfer rate in natural populations. For example,
host range restrictions will limit the transfer potential of
conjugation and transduction (Thomas and Nielsen 2005),
and once inside the recipient cell, restriction systems can
destroy transferred fragments. Finally, the HGT genes must
be integrated into a replicon by homologous or illegitimate
recombination (Thomas and Nielsen 2005) to be stably maintained. Of the transfers that integrate into the genome, only a
subset will fix, and the rate of fixation will depend on the
fitness effects of the transferred gene(s), the potential for
compensatory mutations and the effective population size
(Lind et al. 2010). Most insertional HGT events are expected
to be deleterious given that most bacteria have a coding
density of 95% and that disruption of many genes or regulatory regions reduces fitness (Elena et al. 1998). The costs of
replication and transcription of the extra DNA are relatively
small, whereas costs of translation of the foreign genes are of
much larger importance (Ingraham et al. 1983; Lawrence and
Hendrickson 2005) In addition, the expressed protein could
be involved in a metabolic reaction that is costly (Koskiniemi
et al. 2012). Finally, increased levels of an RNA or protein
could lead to imbalances in gene dosage that results in, for
example, improper gene regulation or unwanted molecular
interactions, thereby disrupting native systems. Such effects
imply that successful transfers are probably on the edges of
functional networks (Kurland 1992; Woese 1998; Jain et al.
1999; Comas et al. 2007; Wellner et al. 2007; Stoebel et al.
2008). Thus, HGT is expected to be more common between
related species where the mechanisms of regulation of transcription and translation are conserved and the successful
integration of the alien genes into the functional networks
of the cell is more likely.
Experimental tests of the fitness effects of HGT are necessary to to identify the mechanistic causes of the dominant fitness constraints and to determine to what degree deleterious
effects will reduce the HGT diversity in bacterial populations.
Sorek et al. (2007) reported the first major investigation of
ß The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please
e-mail: [email protected]
1220
Mol. Biol. Evol. 31(5):1220–1227 doi:10.1093/molbev/msu076 Advance Access publication February 17, 2014
Minor Fitness Costs of Horizontal Gene Transfer . doi:10.1093/molbev/msu076
MBE
Fig. 1. Genetic system for experimental studies of HGT on the Salmonella chromosome. (A) The 0.45–5-kb random inserts from Bacteroides fragilis,
Proteus mirabilis, and human intestinal phage were cloned into a template plasmid (pBADcam). (B) The inserts were amplified together with an
arabinose-inducible promoter (PBAD), a Cam-resistance marker, and a terminator, using primers with homology extensions for the cbiA locus in
Salmonella. Through -red recombineering, the fragments were then transformed into the Salmonella cbiA-locus. (C and D) The inserts were transduced into strains expressing cyan fluorescent protein (CFP) or yellow fluorescent protein (YFP), and the fitness of the inserts was measured through
competition experiments followed by flow cytometry analysis.
fitness constraints on random HGTs, where they searched for
genes that could not be cloned in Escherichia coli and found
that most genes resistant to HGT are informational core
genes and that the fitness costs (which were not quantified)
were largely dependent on toxicity through gene dosage
effects (Sorek et al. 2007). Here, we examined genes that are
clonable in E. coli, excluding genes with lethal or very large
effects on fitness (s > 0.875), and introduced the foreign
DNA into a neutral position in the Salmonella typhimurium
chromosome. This allowed us to measure the fitness effects of
defined insertion mutants and examine what fraction of HGT
events are strongly deleterious, have minor effects, or are
highly advantageous in the absence of the cost of insertional
mutagenesis or abnormal copy number increases associated
with chromosomal genes carried on multicopy plasmids
(Sorek et al. 2007).
Results
Experimental Rationale
Using lambda red recombination, random fragments of DNA
from Bacteroides fragilis, Proteus mirabilis, and human intestinal phage (Reyes et al. 2010) were inserted into the cbiA gene
in the S. typhimurium chromosome, together with a chloramphenicol resistance marker, an arabinose-inducible promoter and a transcriptional terminator (fig. 1). Inactivation
of the cbiA gene has no fitness effects on S. typhimurium
under the growth conditions used (data not shown). By selecting one single position for insertion, we exclude the unpredictable fitness effects of random insertional mutagenesis and
focus on the fitness effects conferred only by the inserted
fragment. B. fragilis is a gram-negative obligate anaerobe
(GC content 43%) that is a commensal resident of the
lower human gastrointestinal tract (Cerdeno-Tarraga et al.
2005). Proteus mirabilis is a gram-negative enteric bacterium
with a genomic GC content of 39% (Pearson et al. 2008). Both
S. typimurium and P. mirabilis are Gammaproteobacteria,
whereas B. fragilis is classified in the distantly related
phylum Bacteroidetes. Based on previous experiments (Lind
et al. 2010), we expect the P. mirabilis gene products to be
functional in S. typhimurium and disrupt cellular function to a
lesser degree than the phylogenetically distant B. fragilis insert,
provided these are successfully expressed.
Bacteriophages are present in large numbers in most environments (Suttle 2005; Srinivasiah et al. 2008; Reyes et al.
2012) and are a major source of HGT, both by acting as agents
of gene transfer (transduction) and by lysogen formation by
temperate phages (Canchaya et al. 2003). Phage-related elements are very common in recently acquired HGTs (Daubin
et al. 2003), but it is not clear whether this is solely due to their
assumingly higher transfer rate or that they more commonly
encode genes that are less deleterious or advantageous. Very
little is known about the probability of random phage open
reading frames (ORFs) to be successfully expressed in a novel
host, and our experiments investigate whether these inserts
commonly have large effects on fitness. Human bacteriophage DNA was obtained from the gut (feces) of healthy
unrelated female adults (Reyes et al. 2010). In total, 52 inserts
were constructed from B. fragilis, 29 from P. mirabilis, and 17
from the intestinal phage. Because the human intestinal
phages used in this study were not previously sequenced
and annotated, stepwise sequencing followed by ORFprediction and BlastX (NCBI) searches were performed
to map putative genes and their expected functions (supplementary fig. S1 and table S1, Supplementary Material
online).
Fitness Measurements
To examine the fitness effects of the inserts, we performed
competition experiments in M9 glucose using strains marked
with fluorescent proteins (cfp and yfp, or bfp and syfp2) as
previously described (Lind et al. 2010). Overnight cultures
grown in M9 glucose of the insertion mutants and an isogenic
1221
Knöppel et al. . doi:10.1093/molbev/msu076
MBE
Fig. 2. Mean fitness of insert mutants. All data were generated through competition experiments and are reported relative to wild-type controls (wildtype strains DA22555 for cfp/yfp and DA27153 for bfp/syfp2; s = 0). * indicates deleterious inserts. (A) Fitness (mean ± SD) in M9 glucose. (B) Fitness
(mean ± SD) in M9 glucose supplemented with 0.2% arabinose. The mutants are sorted in the same order as in A. (C and D) Fitness distributions
showing the data presented in A and B, respectively. (E) Fitness in M9 glucose plotted against fitness in M9 glucose 0.2% arabinose.
wild type (marked with cfp or yfp, or bfp or syfp2) were mixed
at equal ratios and were cycled for 50 generations by diluting
the culture 1,000-fold each day (ten generations/day). Each
day, the ratio of insertion mutant to wild type was determined by counting 105 cells using a fluorescence-activated
cell sorter (BD FACS Aria), and the selection coefficients were
determined using the regression model s = [ln R(t)/R(0)]/t,
where R is the ratio of mutant to wild type, as previously
described (Dykhuizen 1990), and all mutants were assayed in
both the cfp and yfp background or bfp and syfp2 background. This assay allows detection of differences in fitness
1222
as small as j s j > 0.007 and measures composite fitness over
the whole growth cycle (lag phase, exponential growth, and
stationary phase survival) in contrast to typical growth rate
measurements that only assay the exponential growth phase.
Fitness data and information on the inserted genes are presented in supplementary tables S1–S3, Supplementary
Material online. Using this assay, 8/98 inserts were significantly deleterious and 90/98 were not significantly different
from the control ( j s j < 0.007; fig. 2A). When arabinose was
added to induce transcription from the PBAD promoter, 16/98
inserts were deleterious and 82/98 were not significantly
Minor Fitness Costs of Horizontal Gene Transfer . doi:10.1093/molbev/msu076
MBE
Fig. 3. Analysis of the mean fitness of insert mutants. Fitness measurements for M9 glucose are marked in turquoise, and M9 glucose supplemented
with 0.2% arabinose are marked in red. All data were generated through competition experiments and are reported relative to wild-type controls (s = 0).
(A) A box plot presenting fitness of insert mutants, separated by insert origin (B. fragilis, P. mirabilis, and phage). Boxes represent the interquartile range
(25th–75th percentile) and whiskers the range without outliers. Outliers are marked with circles. Wilcoxon Signed-Rank test, failed to show any
difference between any of the groups (P > 0.05). (B) Fitness in relation to GC content of the insert. (C) Fitness in relation to length of the insert. (D)
Fitness in relation to number of complete ORFs. Linear regressions show no correlation (P > 0.05).
different from the control (fig. 2B). When induced, overall
fitness was lower than when no arabinose was added to
the growth medium (Wilcoxon signed-rank test,
P = 4 107). Typically, the inserts with deleterious effects
without arabinose were increasingly deleterious under arabinose induction, whereas inserts with no significant fitness
effects remained at a similar fitness level as without arabinose
(fig. 2 and supplementary tables S2 and S3, Supplementary
Material online). The ORFs encoded by deleterious inserts are
listed in supplementary table S3, Supplementary Material
online.
Further analysis of the inserts revealed no correlation between fitness effect and insert origin (Wilcoxon signed-rank
test, P > 0.05; fig. 3A), neither was there any correlation
between fitness effect and GC content, size of insert, or
number of complete ORFs (linear regression P > 0.05;
fig. 3B–D). The GC content of the inserts ranged from
28.6% to 51.6%, where the mean GC content (±standard
deviation) for B. fragilis inserts was 43.9 (±4.6)%, P. mirabilis
40.5 (±2.7)%, and phage 42.5 (±4.2)%. The mean GC content
of the S. typhimurium genome is 52.2% (McClelland et al.
2001) and E. coli is 50.8% (Riley et al. 2006)). The lengths of
the inserts ranged from 0.45 to 5 kb (supplementary table S2,
Supplementary Material online) with the mean length
(±standard deviation) of 2.2 (±0.9) kb for B. fragilis, 2.9
(±1.1) kb for P. mirabilis, and 2.6 (±0.9) kb for the phages.
On average, each insert contained 3.1 ORFs (complete and
truncated) and 1.5 complete ORFs. The genes included in the
1223
Knöppel et al. . doi:10.1093/molbev/msu076
inserts are listed in supplementary table S2, Supplementary
Material online (B. fragilis and P. mirabilis) and supplementary
table S1, Supplementary Material online (phage). The genes
found in B. fragilis and P. mirabilis inserts were analyzed for
connectivity, using STRING 9.1 (Franceschini et al. 2013).
Twelve of 81 fragments in the Bacteroides and Proteus
groups contained full sequences of high connectivity genes
(10 predicted interaction partners with high confidence
[Franceschini et al. 2013]) and 21/81 fragments contained
genes that are likely to be expressed from PBAD (full genes
in forward direction from PBAD with no genes in reverse in
between). Further analyses revealed no differences in mean
fitness between inserts containing at least one full gene with
high connectivity (10 predicted interaction partners with
high confidence using STRING 9.1) and those not including
such (t-tests ara: P = 0.978, + ara: P = 0.806). Neither was
there any difference in mean fitness in the presence of arabinose of inserts, including at least one full gene in the forward
direction from PBAD and those that did not (t-test: P = 0.806).
Discussion
Among the 98 horizontally transferred DNA fragments, 90
had fitness effects not significantly different from the control
( j s j < 0.007). The high number of inserts with j s j < 0.007
suggests the possibility that a relatively large fraction of HGT
events could be nearly neutral, that is, their fate is determined
by genetic drift in populations with Ne j s j 1, where Ne is
the effective population size (Maruyama and Kimura 1980).
However, it should be noted that experimental methods to
study fitness effects are limited by the fact that transfers with
small negative and positive effects cannot reliably be distinguished from truly neutral events (fig. 4). Natural selection
can act on j s j < 1/Ne, and as Ne is >107 for many bacteria,
j s j values as small as ~107 could be seen by selection
(Kimura 1983; Charlesworth and Eyre-Walker 2006). In our
experimental setup, deleterious fitness effects that range from
s = 0.007 to 0.875 were detectable. The lower limit was set
by the detection limit in the competition assay, where j s jvalues <0.007 cannot be distinguished from wild-type fitness.
The higher limit was estimated from the first cloning step of
the inserts into plasmids in E. coli. To be seen by the naked
eye, a colony needs to contain ~106 cells, corresponding
to about 20 generations of growth from a single cell.
Transformants were incubated for 48 h, meaning that a cell
with a generation time of up to 2.4 h (48 h/20 generations) is
detectable as a colony. The wild-type cell, without an alien
fragment, has a generation time of 0.3 h. Thus, cells with a
relative growth rate of 0.3/2.4 = 0.125 of the wild-type form
visible colonies, meaning that very slow-growing mutants
(with an s value as low 0.875 where 1 is lethal) can still
be detected. Although this limitation clearly excludes inserts
that are very strongly deleterious and lethal from the distribution of fitness effects, there is reason to believe these would
be a minor fraction of randomly transferred fragments given
that >99% of genes from 79 genomes have previously been
shown to be clonable in E. coli using standard methods (Sorek
et al. 2007). Even if our experimental setup was designed to
reduce the influence of fitness during strain construction,
1224
MBE
Fig. 4. Comparison of analyzable s values between different experimental approaches to study fitness constraints of HGTs. In our study,
s values j s j > 0.007 and s > 0.875 were detectable. The lower
limit, j s j > 0.007, was set by the detection limit in the competition
assay and the higher limit, j s j < 0.875, was estimated from the first
cloning step of the inserts into plasmids into Escherichia coli. In the
Sorek et al. (2007) study, HGTs that are lethal when carried on a multicopy plasmid in E. coli were analyzed.
described in the methods section below, we acknowledge
that there is always a bias toward higher fitness variants in
any experiment where different mutants compete with each
other and that this will introduce a bias that will somewhat
reduce the frequency of low fitness inserts. Similarly, we
cannot exclude that some inserts that had too high fitness
cost to be found in the E. coli cloning strain would have had a
smaller effect on S. typhimurium physiology.
The fitness assays conducted in this study measures composite fitness over the entire growth cycle, including lag
phase, exponential phase, and stationary phase in a minimal
glucose medium. This is clearly an artificial environment that
is relatively benign, and the magnitude of the fitness effects
reported here might be very different from a natural environment or under more stressful conditions where fitness costs
could be either amplified or reduced (Elena and de Visser
2003). In our experimental setup, we also exclude the fitness
effects of insertional mutagenesis by inserting the fragments
into a defined position with no detectable effect on fitness.
This is done to separate the fitness effects of disrupting native
genes with the effects conferred by the content of the transferred DNA fragments. On the basis of experimental data in
E. coli (Elena et al. 1998) and the high coding density of bacterial genomes, we would expect that the large majority
(>90%) of random HGTs inserted into the genome would
be strongly deleterious regardless of gene content and be
rapidly lost from the population. It seems likely that insertional effects will dominate the distribution of fitness effects
of integrative HGT events, but we also require information of
the fitness effects that are dependent on gene content to
understand the potential for HGTs to reach high frequencies
in bacterial populations.
If a significant fraction of HGTs, when inserted into a neutral position, have selection coefficients <0.001 and each
import will produce a total of 1/ j s j descendants before
being lost from the population (Lind et al. 2010), these
nearly neutral HGTs could be relatively common in natural
populations if the mechanistic rate of HGT is high enough.
A neutral HGT will rarely fix in the population, but when it
does, this will take in the order of Ne generations (Kimura
1983) in an ideal population with constant population size.
For an effective population size for enteric bacteria of 107
(Charlesworth and Eyre-Walker 2006) and assuming a generation time of 1 day, this corresponds to >10,000 years, but
Minor Fitness Costs of Horizontal Gene Transfer . doi:10.1093/molbev/msu076
this time could be substantially shortened for more realistic
conditions where populations might experience bottlenecks
and hitchhiking with linked beneficial loci. Perhaps more importantly, the presence of a substantial pool of horizontally
acquired genetic diversity might provide raw material for selection to act upon when challenged by changing environmental conditions. So even if the early evolutionary dynamics
of the HGT genes is largely determined by genetic drift, once
under selection they can be important for the evolution of
novel beneficial functions and acquisition of novel traits.
The fragments were determined deleterious in this study if
the confidence interval for independent fitness measurements did not overlap with the standard deviation for the
wild type. In total, 8/98 inserts were deleterious, and when the
PBAD promoter induced, the number doubled (16/98; fig. 2A
and B). It should be noted that we have not demonstrated
that any of the inserts actually express a gene product, although we strongly believe this to be the cause of the deleterious fitness effects. This is supported by a finding that
seven of eight inserts that are significantly deleterious only
under arabinose induction contain at least one ORF in the
forward direction (supplementary tables S2 and S3,
Supplementary Material online). Likewise, the arabinose induction of the PBAD promoter does not necessarily mean
increased transcription of the alien ORFs, even though this
was demonstrated for an YFP control strain. As these inserts
contain ORFs in both directions relative to the PBAD promoter, we do not expect any increase in gene expression
for ORFs in the opposite direction, although this is possible
due to reduced binding of H-NS protein that have been
shown to silence foreign genes (Navarre et al. 2006). It is
also possible that differences in fitness effects are solely due
to the change in environment by adding arabinose that is not
related to increased transcription from the PBAD promoter.
Despite these reservations, we interpret the increases in fitness cost of many of the deleterious inserts under arabinose
induction as supporting the case that the fitness costs are
mainly dependent on the expression of gene products
encoded on the inserts. The inserts with deleterious effects
contained ORFs that encoded proteins of diverse functions
(supplementary table S3, Supplementary Material online) including BF0844 (putative transmembrane protein), upper
and lower collar connector proteins, kdpC (potassium-transporting ATPase subunit c), BF0969 (putative RNA polymerase
ECF-type sigma factor), dapB (dihydrodipicolinate reductase),
ispH (4-hydroxy-3-methylbut-2-enyl diphosphate reductase),
and menE (putative O-succinylbenzoate-CoA ligase). The inserts also encoded truncated genes such as: prfA (putative
peptide chain release factor 1), clpB (negative regulator of
genetic competence), and BF0970 (putative antisigma factor).
Deleterious effects caused by insertion of HGTs into a neutral chromosomal position can be conferred at different levels.
First, the extra DNA may cause a burden on replication.
Second, expression of the extra genes may be costly both at
the level of transcription and translation. However, as translation is by far more energy consuming than replication and
transcription, it is expected to dominate the cost of gene expression (Ingraham et al. 1983; Lawrence and Hendrickson
MBE
2005; Bragg and Wagner 2009). In addition, the expressed
protein could be involved in an enzymatic reaction that is
costly (Diaz Ricci and Hernandez 2000; Koskiniemi et al. 2012).
Finally, increased levels of a gene product could lead to imbalances in protein/RNA dosage that results in, for example,
improper gene regulation or unwanted molecular interactions, thereby disrupting regulatory networks or complexes
(Pal et al. 2005; Sorek et al. 2007; Park and Zhang 2012). From
the present experiments, we cannot identify the reason for
the cost of the eight deleterious inserts (s values ranging from
& 0.007 to 0.02) because no correlation was seen between fitness effect and size of the inserts, or number of genes
(fig. 3C and D), which is expected if costs of gene expression
predominated, and this suggests that the deleterious effect
predominantly is due to, largely unpredictable, toxic gene
dosage effects caused by specific gene products.
Our demonstration that most HGTs events have minor
effect on fitness when introduced into a neutral position in
the genome suggests that most horizontally acquired DNAs
are only temporary residents in the genome. However, this
does not exclude the presence of strongly selected advantageous transfers that are important under strong selective
pressures, including transfer of antibiotic resistance genes
and genes allowing invasion of new ecological niches.
Although we did not find any inserts with strong, statistically
significant positive effects on fitness, it is possible that some
inserts confer beneficial effects that are smaller than 0.7%.
The only other experimental study of random HGTs to
date, to our knowledge (Sorek et al. 2007), focused on genes
that are not clonable in E. coli (in pUC plasmids with ~200
copies per cell) using a purely qualitative classification:
unclonable (lethal; 0.57%) or clonable (viable; 99.43%), covering 246,045 genes from 79 bacterial and archaeal genomes.
Figure 4 describes the major differences in which types of
HGT events would be discovered in our and their study. In
the study of Sorek et al., unclonable genes were most frequently found among bacteria belonging to the phylum
Firmicutes and the class Gammaproteobacteria, groups that
also coincided with lower GC content than the average
among the 79 genomes. In our study, no correlation between
GC content and fitness was found (fig. 3B). In agreement with
our findings, GC content seemed to play no role in the fitness
effect of exchanged ribosomal genes (Lind et al. 2010).
Potential explanations for the difference between the results
of those of Sorek et al. and ours is that our material was too
small to detect a GC effect or that the GC effect is more
pronounced when the genes are expressed at very high
levels from a high-copy number plasmid (as in the study of
Sorek et al. [2007]). It is also possible that the GC effect is only
relevant for classes of fragments that are lethal or have large
effects on fitness, and these are not included in this study.
The relatively small fitness effects of HGTs in our experimental model suggest there could be a major hidden genetic
diversity in bacterial populations. Whether or not this is the
case will depend on if the rates of conjugation, transduction,
and transformation in natural populations are high enough to
reintroduce HGT variation lost by genetic drift.
1225
MBE
Knöppel et al. . doi:10.1093/molbev/msu076
Materials and Methods
Strains and Media
All experiments used S. enterica subsp. enterica serovar
Typhimurium str. LT2 (designated S. typhimurium in the
text) and derivatives thereof except for the cloning of
random DNA fragments that was performed using E. coli
cloning strains (TOP10 [Invitrogen] or NEB 5-alpha [New
England Biolabs]). Luria-Bertani agar (LA) was used as solid
media and supplemented with ampicillin (200 mg/l) or chloramphenicol (10/15 mg/l) where needed for selection and
plasmid maintenance. Luria-Bertani/Lysogeny broth (LB)
and M9 minimal medium with 0.2% glucose or SOC (New
England Biolabs) were used as liquid media.
Construction of Strains with Random Insertions
A chloramphenicol resistance cassette was PCR-amplified
with high-fidelity polymerase (Phusion high-fidelity DNA polymerase, Finnzymes) with plasmid pKD3 (Datsenko and
Wanner 2000) (GenBank AY048742) as template and using
primers with added 50 -end restriction sites (oligonucleotide
primers used in strain construction are listed in supplementary table S4, Supplementary Material online). The PCR product was ligated into plasmid pBAD30 (Guzman et al. 1995)
between the SphI and HindIII restriction sites to form plasmid
pBADcam. Genomic DNA was isolated from B. fragilis and
P. mirabilis using Genomic Tip 100G (Qiagen). The phage
DNA was obtained from human feces from two healthy
and unrelated female adults (Reyes et al. 2010). Briefly, after
separation of the phage from bacteria and large particles present in feces, the samples were run on a CsCl gradient, and
viral particles in the 1.2–1.5 g ml1 range were collected and
used for DNA extraction. The phage DNA was amplified with
Genome Phi V2 (GE Healthcare) according to the manufacturer’s instructions before cloning. The amplified DNA was
restriction digested with EcoRI and KpnI (Fermentas), subjected to agarose gel electrophoresis, and fragments (2–5 kb
in size) were purified using GeneJET Gel Extraction Kit
(Fermentas). Plasmid pBADcam was isolated using E.Z.N.A.
plasmid purification kit (Omega Bio-Tek) and restriction digested with EcoRI and KpnI and treated with Shrimp Alkaline
Phosphatase (Fermentas). The gel-purified DNA fragments
were then ligated into the pBADcam plasmid and transformed into E. coli by electroporation, and suitable dilutions
were plated after 20-min shaking incubation at 37 C on
nonselective LA plates (fig. 1). The LA plates were incubated
overnight at 37 C before replica printing to selective chloramphenicol plates. This protocol was used to reduce bias
toward fast-growing transformants during recovery and
reduce the probability of obtaining multiple clones with identical inserts. Transformants were scored after 24 and 48 h and
used as template in a PCR to amplify the insert, the PBAD
promoter, and the chloramphenicol resistance cassette using
primers with 50 -homology regions for insertion into cbiA in
the S. typhimurium chromosome using the using the Red
system as previously described (Datsenko and Wanner 2000;
Lind et al. 2010). Correct inserts were then confirmed by PCR
1226
using primers outside the insert region and sequenced. The
inserts, including four independently constructed controls
containing only the PBAD promoter, the chloramphenicol
marker and the terminator, without a foreign DNA insert,
were moved by phage P22 transduction into the wild-type
S. typhimurium LT2 background and the two cfp and yfp
containing preadapted strains used for the competition experiments. For five strains, bfp and syfp2 were instead used as
markers.
Competition Experiments
Selection coefficients (s) were estimated by competing the
mutants with insert against an isogenic wild-type control
without a foreign insert but containing the arabinose promoter, chloramphenicol resistance marker, and terminator.
The inserts and controls were moved by P22 transduction
into strains that had previously been serially passaged under
the experimental conditions for 1,000 generations and encoding CFP and YFP fluorescent proteins, as previously described
(Lind et al. 2010). HGT mutants and wild-type controls carrying either a cfp or yfp marker or a bfp or syfp2 marker were
mixed at equal volumes from an overnight culture and maintained by 1,000-fold serial dilution (ten generations per serial
passage) every 24 h. 105 cells were counted each cycle by flow
cytometry (BD FACS Aria) to determine the ratio of mutant
to wild type for five cycles (50 generations). Estimation of
selection coefficients (s) were performed according to the
regression model s = [ln R(t)/R(0)]/t, as previously described
(Dykhuizen 1990), where R is the ratio of mutant to wild type,
and all mutants were assayed in both the cfp and yfp background, or the bfp and syfp2 background. Pearson correlation
between the cfp/bfp and yfp/syfp2 strains was r = 0.83 without
arabinose and r = 0.91 with arabinose induction, showing that
fitness was dependent on the insert. Independently constructed wild-type controls were used to correct for the
small difference in fitness between the cfp/bfp and yfp/syfp2
alleles. Supplementary figure S3, Supplementary Material
online is a control experiment that demonstrates that arabinose induces the PBAD promoter under the conditions used.
Statistical Analysis
The statistical analysis was carried out using the software R
(version 3.0.2) and RStudio (version 0.97.551). P values < 0.05
are referred to as statistically different and >0.05 as not different. An insert was determined to be deleterious if the
confidence interval for independent fitness measurements
of the specific strain did not overlap with the SD for control
strain measured in + ara. If the confidence interval overlapped with the SD for control strain, the strain was instead
referred to as “not significantly different from the control”
(supplementary fig. S2, Supplementary Material online). For
connectivity analysis, STRING 9.1 (string-db.org, [Franceschini
et al. 2013]) was used to list the number of interaction partners for each gene in the B. fragilis and P. mirabilis inserts. The
cutoff for included interaction partners was set to 0.7 (combined score) that refers to “predicted functional partners”
with “high confidence.”
Minor Fitness Costs of Horizontal Gene Transfer . doi:10.1093/molbev/msu076
Supplementary Material
Supplementary figures S1–S3 and tables S1–S4 are available at
Molecular Biology and Evolution online (http://www.mbe.
oxfordjournals.org/).
Acknowledgments
This work was supported by grants from the Swedish
Research Council to D.I.A. The authors thank Hervé
Nicoloff for technical support and insightful discussions.
They also thank Forest Rowher for kindly providing us with
the phage DNA used in this study.
References
Ambur OH, Davidsen T, Frye SA, Balasingham SV, Lagesen K, Rognes T,
Tonjum T. 2009. Genome dynamics in major bacterial pathogens.
FEMS Microbiol Rev. 33:453–470.
Baltrus DA. 2013. Exploring the costs of horizontal gene transfer. Trends
Ecol Evol. 28:489–495.
Barlow M. 2009. What antimicrobial resistance has taught us about
horizontal gene transfer. Methods Mol Biol. 532:397–411.
Bragg JG, Wagner A. 2009. Protein material costs: single atoms can make
an evolutionary difference. Trends Genet. 25:5–8.
Canchaya C, Fournous G, Chibani-Chennoufi S, Dillmann ML, Brussow
H. 2003. Phage as agents of lateral gene transfer. Curr Opin Microbiol.
6:417–424.
Cerdeno-Tarraga AM, Patrick S, Crossman LC, Blakely G, Abratt V,
Lennard N, Poxton I, Duerden B, Harris B, Quail MA, et al. 2005.
Extensive DNA inversions in the B. fragilis genome control variable
gene expression. Science 307:1463–1465.
Charlesworth J, Eyre-Walker A. 2006. The rate of adaptive evolution in
enteric bacteria. Mol Biol Evol. 23:1348–1356.
Comas I, Moya A, Gonzalez-Candelas F. 2007. Phylogenetic signal and
functional categories in Proteobacteria genomes. BMC Evol Biol.
7(Suppl 1):S7.
Creevey CJ, Doerks T, Fitzpatrick DA, Raes J, Bork P. 2011. Universally
distributed single-copy genes indicate a constant rate of horizontal
transfer. PLoS One 6:e22099.
Datsenko KA, Wanner BL. 2000. One-step inactivation of chromosomal
genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci
U S A. 97:6640–6645.
Daubin V, Lerat E, Perriere G. 2003. The source of laterally transferred
genes in bacterial genomes. Genome Biol. 4:R57.
Diaz Ricci JC, Hernandez ME. 2000. Plasmid effects on Escherichia coli
metabolism. Crit Rev Biotechnol. 20:79–108.
Dykhuizen DE. 1990. Experimental studies of natural selection in bacteria. Annu Rev Ecol Syst. 21:373–398.
Elena SF, de Visser JA. 2003. Environmental stress and the effects of
mutation. J Biol. 2:12.
Elena SF, Ekunwe L, Hajela N, Oden SA, Lenski RE. 1998. Distribution of
fitness effects caused by random insertion mutations in Escherichia
coli. Genetica 102–103:349–358.
Franceschini A, Szklarczyk D, Frankild S, Kuhn M, Simonovic M, Roth A,
Lin J, Minguez P, Bork P, von Mering C, et al. 2013. STRING v9.1:
protein-protein interaction networks, with increased coverage and
integration. Nucleic Acids Res. 41:D808–D815.
Ge F, Wang LS, Kim J. 2005. The cobweb of life revealed by genome-scale
estimates of horizontal gene transfer. PLoS Biol. 3:e316.
Guzman LM, Belin D, Carson MJ, Beckwith J. 1995. Tight regulation,
modulation, and high-level expression by vectors containing the
arabinose PBAD promoter. J Bacteriol. 177:4121–4130.
Ingraham JL, Maaløe O, Neidhardt FC. 1983. Growth of the bacterial cell.
Sunderland (MA): Sinauer Associates.
Jain R, Rivera MC, Lake JA. 1999. Horizontal gene transfer among
genomes: the complexity hypothesis. Proc Natl Acad Sci U S A. 96:
3801–3806.
MBE
Juhas M, van der Meer JR, Gaillard M, Harding RM, Hood DW, Crook
DW. 2009. Genomic islands: tools of bacterial horizontal gene transfer and evolution. FEMS Microbiol Rev. 33:376–393.
Kimura M. 1983. The neutral theory of molecular evolution. Cambridge:
Cambridge University Press.
Koonin EV, Makarova KS, Aravind L. 2001. Horizontal gene transfer in
prokaryotes: quantification and classification. Annu Rev Microbiol.
55:709–742.
Koskiniemi S, Sun S, Berg OG, Andersson DI. 2012. Selection-driven gene
loss in bacteria. PLoS Genet. 8:e1002787.
Kurland CG. 1992. Translational accuracy and the fitness of bacteria.
Annu Rev Genet. 26:29–50.
Lawrence JG, Hendrickson H. 2005. Genome evolution in bacteria: order
beneath chaos. Curr Opin Microbiol. 8:572–578.
Lind PA, Tobin C, Berg OG, Kurland CG, Andersson DI. 2010.
Compensatory gene amplification restores fitness after inter-species
gene replacements. Mol Microbiol. 75:1078–1089.
Maruyama T, Kimura M. 1980. Genetic variability and effective population size when local extinction and recolonization of subpopulations
are frequent. Proc Natl Acad Sci U S A. 77:6710–6714.
McClelland M, Sanderson KE, Spieth J, Clifton SW, Latreille P, Courtney
L, Porwollik S, Ali J, Dante M, Du F, et al. 2001. Complete genome
sequence of Salmonella enterica serovar Typhimurium LT2. Nature
413:852–856.
Nakamura Y, Itoh T, Matsuda H, Gojobori T. 2004. Biased biological
functions of horizontally transferred genes in prokaryotic genomes.
Nat Genet. 36:760–766.
Navarre WW, Porwollik S, Wang Y, McClelland M, Rosen H, Libby SJ,
Fang FC. 2006. Selective silencing of foreign DNA with low GC
content by the H-NS protein in Salmonella. Science 313:236–238.
Ochman H, Lawrence JG, Groisman EA. 2000. Lateral gene transfer and
the nature of bacterial innovation. Nature 405:299–304.
Pal C, Papp B, Lercher MJ. 2005. Adaptive evolution of bacterial metabolic networks by horizontal gene transfer. Nat Genet. 37:1372–1375.
Park C, Zhang J. 2012. High expression hampers horizontal gene transfer.
Genome Biol Evol. 4:523–532.
Pearson MM, Sebaihia M, Churcher C, Quail MA, Seshasayee AS,
Luscombe NM, Abdellah Z, Arrosmith C, Atkin B, Chillingworth
T, et al. 2008. Complete genome sequence of uropathogenic
Proteus mirabilis, a master of both adherence and motility.
J Bacteriol. 190:4027–4037.
Reyes A, Haynes M, Hanson N, Angly FE, Heath AC, Rohwer F, Gordon JI.
2010. Viruses in the faecal microbiota of monozygotic twins and
their mothers. Nature 466:334–338.
Reyes A, Semenkovich NP, Whiteson K, Rohwer F, Gordon JI. 2012.
Going viral: next-generation sequencing applied to phage populations in the human gut. Nat Rev Microbiol. 10:607–617.
Riley M, Abe T, Arnaud MB, Berlyn MK, Blattner FR, Chaudhuri RR,
Glasner JD, Horiuchi T, Keseler IM, Kosuge T, et al. 2006. Escherichia
coli K-12: a cooperatively developed annotation snapshot–2005.
Nucleic Acids Res. 34:1–9.
Snel B, Bork P, Huynen MA. 1999. Genome phylogeny based on gene
content. Nat Genet. 21:108–110.
Sorek R, Zhu Y, Creevey CJ, Francino MP, Bork P, Rubin EM. 2007.
Genome-wide experimental determination of barriers to horizontal
gene transfer. Science 318:1449–1452.
Srinivasiah S, Bhavsar J, Thapar K, Liles M, Schoenfeld T, Wommack KE.
2008. Phages across the biosphere: contrasts of viruses in soil and
aquatic environments. Res Microbiol. 159:349–357.
Stoebel DM, Dean AM, Dykhuizen DE. 2008. The cost of expression of
Escherichia coli lac operon proteins is in the process, not in the
products. Genetics 178:1653–1660.
Suttle CA. 2005. Viruses in the sea. Nature 437:356–361.
Thomas CM, Nielsen KM. 2005. Mechanisms of, and barriers to, horizontal gene transfer between bacteria. Nat Rev Microbiol. 3:711–721.
Wellner A, Lurie MN, Gophna U. 2007. Complexity, connectivity, and
duplicability as barriers to lateral gene transfer. Genome Biol. 8:R156.
Woese C. 1998. The universal ancestor. Proc Natl Acad Sci U S A. 95:
6854–6859.
1227