Infrequent Transitions between Saline and Fresh Waters in One of

Infrequent Transitions between Saline and Fresh Waters in
One of the Most Abundant Microbial Lineages (SAR11)
Ramiro Logares,*,1 Jon Bråte,2 Friederike Heinrich,1 Kamran Shalchian-Tabrizi,2 and Stefan Bertilsson1
1
Limnology/Department of Ecology and Evolution, Uppsala University, Uppsala, Sweden
Microbial Evolution Research Group, Department of Biology, University of Oslo, Oslo, Norway
*Corresponding author: E-mail: [email protected].
Associate editor: Douglas Crawford
2
Abstract
Key words: environmental transitions, prokaryotes, saline, freshwater, SAR11, 16S rDNA.
Introduction
The SAR11 lineage within the alpha-Proteobacteria has received great attention from the scientific community since
it was discovered in the early 1990s (Giovannoni et al.
1990). This interest is certainly justified as they appear
to be the dominant bacterioplankton in the surface of
the oceans, where they can contribute to as much as
50% of the total microbial community (Morris et al.
2002). With an estimated global population size of
2.4 1028 cells, the SAR11 group is among the most successful organisms on the planet, but many questions still
remain unanswered concerning its ecology and physiology.
Several subgroups have been proposed within the SAR11
cluster, which in some cases seem to be correlated with
specific environments and therefore suggest the occurrence of adaptive evolution (Field et al. 1997; Brown and
Fuhrman 2005; Kan et al. 2008). The difficulties to cultivate
SAR11 strains have been one of the main obstacles for understanding their ecological roles in the ecosystem. The
first SAR11 culture (Candidatus Pelagibacter ubique) was
isolated off the coast of Oregon (Rappe et al. 2002). The
small vibroid cells formed by this strain were capable of
heterotrophic growth, but more recent studies point to
a rather complex metabolism and ecology. For instance,
it contains a functional gene coding for the light-dependent proton pump ‘‘Proteorhodopsin,’’ suggesting a capacity for phototrophic or at least photoheterotrophic energy
acquisition (Giovannoni, Bibbs, et al. 2005). Such complexities contrast with the fact that SAR11 members have one
of the smallest genomes known for any free-living microorganism (;1.3 Mb; Giovannoni, Tripp, et al. 2005). Recent
evidence also showed that the strain C. P. ubique is dependent on exogenous sources of reduced sulfur for growth
(Tripp et al. 2008), and this may explain to a certain extent
the low success in isolating members of this group. This
dependency also implies strong ecological links to other
planktonic organisms capable of synthesizing these
compounds.
In addition to marine environments, members of SAR11
have also been found in brackish (Kan et al. 2008) and fresh
waters (LD12; Zwart et al. 2002), but their numerical significance in communities from these environments is still
uncertain. Besides, there are so far no comprehensive and
synoptic studies on the evolutionary relationships between
marine, brackish, and freshwater SAR11 lineages. Nevertheless, previous studies have suggested that distinct evolutionary lineages may be present in marine, brackish, and
fresh waters (Zwart et al. 2002; Kan et al. 2008). This
may seem counterintuitive, as the high abundance of
© The Author 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please
e-mail: [email protected]
Mol. Biol. Evol. 27(2):347–357. 2010 doi:10.1093/molbev/msp239
Advance Access publication October 6, 2009
347
Research article
The aquatic bacterial group SAR11 is one of the most abundant organisms on Earth, with an estimated global population
size of 2.4 1028 cells in the oceans. Members of SAR11 have also been detected in brackish and fresh waters, but the
evolutionary relationships between the species present in the different environments have been ambiguous. In particular, it
was not clear how frequently this lineage has crossed the saline–freshwater boundary during its evolutionary
diversification. Due to the huge population size of SAR11 and the potential of microbes for long-distance dispersal, we
hypothesized that environmental transitions could have occurred repeatedly during the evolutionary diversification of this
group. Here, we have constructed extensive 16S rDNA–based molecular phylogenies and undertaken metagenomic data
analyses to assess the frequency of saline–freshwater transitions in SAR11 and to investigate the evolutionary implications
of this process. Our analyses indicated that very few saline–freshwater transitions occurred during the evolutionary
diversification of SAR11, generating genetically distinct saline and freshwater lineages that do not appear to exchange
genes extensively via horizontal gene transfer. In contrast to lineages from saline environments, extant freshwater taxa
from diverse, and sometimes distant, geographic locations were very closely related. This points to a rapid diversification
and dispersal in fresh waters or to slower evolutionary rates in fresh water SAR11 when compared with marine
counterparts. In addition, the colonization of both saline and fresh waters appears to have occurred early in the evolution
of SAR11. We conclude that the different biogeochemical conditions that prevail in saline and fresh waters have likely
prevented the environmental transitions in SAR11, promoting the evolution of clearly distinct lineages in each
environment.
Logares et al. · doi:10.1093/molbev/msp239
SAR11 in the oceans would have promoted repeated colonizations of brackish and fresh waters. The rationale is that
large populations can contribute many individuals and genotypes for colonizing new environments (many genotypes
increase the chances that at least one of them will succeed
in the new environment). However, it should also be considered that genome streamlining in SAR11 (Giovannoni,
Tripp, et al. 2005) may reduce the adaptability of the group
to diverse environmental conditions, although diverse
genes in hypervariable genome regions (Wilhelm et al.
2007; Gilbert et al. 2008) may counterbalance such effect.
Here, we investigate the transitions between marine,
brackish, and fresh waters during the evolution of SAR11
by using extensive 16S rDNA–based molecular phylogenies
containing publicly available sequences from various geographic locations. In addition, we have analyzed metagenomic libraries from environments distributed along
a wide salinity range in order to determine if SAR11 members in marine, brackish, and fresh waters are different at
the whole-genome level. Overall, our results indicate that
historical transitions between saline and fresh waters have
been uncommon events during the evolutionary diversification of SAR11, which promoted the emergence and
persistence of genetically distinct lineages in each
environment.
Materials and Methods
Phylogenetic Analyses
Preliminary Bayesian and maximum likelihood (ML) phylogenetic analyses using a reduced data set indicated the existence of strongly supported marine and freshwater SAR11
clusters, as well as a putative brackish one (results not
shown). Sequences from these clusters were subsequently
used for extensive Blast data-mining searches in Genbank
against the NCBI-nr database (searches finalized in December
2008). In total, about 700 sequences were retrieved, several
originating from the GOS (Global Ocean Sampling; Rusch
et al. 2007) data set (the GOS-associated sequences have
already been curated (i.e., checked for chimeras and errors)
and used in previous works, see Shaw et al. (2008); additionally, we have rechecked all used sequences for the presence of chimeras). The brackish and marine SAR11 query
sequences matched several GOS sequences from estuaries
(Delaware and Chesapeake bays, US and Bay of Fundy, Canada) and various marine environments, respectively. In
contrast, the freshwater sequences retrieved mostly GOS
sequences from the freshwater Lake Gatun in Panama
(the only freshwater lake included in the GOS data set
[GS20], which drains into the Panama Canal) as well as
some from estuaries. In all cases, the sequences retrieved
via Blast were highly similar to our query sequences. After
eliminating short and potentially erroneous sequences, the
retrieved sequences were used to construct an alignment
(using MAFFT [v6.240] with the algorithm E-INS-I; Katoh
and Toh 2008). This alignment consisted of 604 sequences
and 1,530 characters (AL1). The sequences in AL1 originated from several different geographical locations, some348
MBE
times separated by large distances. An additional more
stringent alignment was constructed by removing ambiguously aligned sites using GBLOCKS (Castresana 2000) as well
as visual examination. This alignment consisted in 604 sequences and 1,351 characters (AL2). A third alignment was
generated (AL3) using Nearest Alignment Space Termination (DeSantis, Hugenholtz, Keller, et al. 2006) in Greengenes (http://greengenes.lbl.gov/; DeSantis, Hugenholtz,
Larsen et al. 2006). AL3 consisted of 524 sequences and
1,268 characters and included 12 SAR11 sequences retrieved from metagenomic libraries from the Antarctic
Ace Lake. Several sites were removed from this alignment
using GBLOCKS as well as visual examination. All sequences
that became identical after the exclusion of sites were
removed from this alignment, which was used strictly for
investigating phylogenetic patterns and not for environment–phylogeny comparisons. An extra alignment (AL4)
based on AL3 was constructed by removing eight sequences
that branched at the base of the marine–brackish cluster
(EU800530, EU801112, EU800189, EU801347, EU800751,
EU801707, EU800662, and EU800327). This was carried
out in order to test for the effects that these deep branching
sequences could have had in the overall tree-topology reconstructions. AL4 consisted of 516 sequences and 1,268
characters. All alignments are summarized in supplementary table S1, Supplementary Material online.
Phylogenies were constructed using ML as implemented
in RaxML (v.7.0.4; Stamatakis 2006). The analyses in RaxML
were run with a rapid hill-climbing mode under the general
time reversible (GTR) þ MIX þ I option using 25 rate categories. The MIX þ I option in RaxML performs a tree
search under the GTR þ CAT approximation, and when
the analysis is finished, it changes the model to GTR þ
G þ I to evaluate the final topology of the tree and calculate likelihood values under this model. The GTR þ CAT is
an approximation for the GTR þ G model that accelerates
the computation of trees and is recommended for large
data sets (see RaxML user manual). We have carried out
additional analyses in RaxML under the GTR þ G þ I model
(with four rate categories; best topology and bootstrap), in
order to compare with the results given by GTR þ MIX þ I.
Overall, no significant differences (topology and bootstrap)
were found when using GTR þ MIX þ I and GTR þ G þ I;
hence, the two approaches will be considered as analogous
for the purpose of this work. In the analyses with RaxML,
the shape parameter (a) of the Gamma (G) distribution
and the proportion of invariable sites (I) were estimated
from the data set using default options. A total of 100 heuristic tree searches per alignment were carried out from
random starting trees, and the topology with the highest
likelihood was selected. Bootstrapping was carried out with
1,000 pseudoreplicates under the GTR þ MIX þ I/GTR þ
G þ I models, with one heuristic search per replicate. All
phylogenetic analyses were run at the University of Oslo
Bioportal (http://www.bioportal.uio.no/) computer cluster.
The trees generated were visualized and edited in FigTree
(v1.2.2; Rambaut 2009) and Interactive Tree Of Life (Letunic
and Bork 2007). Alignments and trees are available on request.
Saline–Freshwater Transitions in Prokaryotes · doi:10.1093/molbev/msp239
Unifrac Analyses
We have used the Unifrac and P value approach (Martin
2002; Lozupone and Knight 2005) to test statistically
whether there is an evolutionary separation between marine, brackish, and freshwater groups. The microbial communities in given environments are considered
evolutionary different if the Unifrac value for the real tree
is greater than a random distribution of the sequences
among the environments (see Martin 2002; Lozupone
and Knight 2005 for a detailed description of Unifrac
and the P value). To carry out the Unifrac analyses, the habitats of the sequences included in the analyses were determined according to the ‘‘Features’’ information in the
GenBank records or by referring to the publication of
the sequence. Each site was classified as 1 5 Marine
(30–40 PSU), 2 5 Brackish (0.5–30 PSU), and 3 5 Freshwater (,0.5 PSU). This classification is used in aquatic sciences and derives from ‘‘The Venice System for the
Classification of Marine Waters According to Salinity’’
(see Anonymous 1958; Hammer 1986). Note that in contrast to marine and fresh waters, brackish environments
may experience wide salinity changes in diverse spatiotemporal scales (see Dahl 1956; Lampert and Sommer 2007).
The analyses were carried out considering different combinations of environmental categories. In one analysis, 1 and
2 were considered as one single ‘‘marine–brackish’’ category, and in another analysis, 2 and 3 were considered
as the single ‘‘brackish–freshwater’’ category. We also carried out analyses considering marine, brackish, and freshwater as independent environmental categories. The used
input tree was the best tree (highest likelihood) obtained
after 100 heuristic searches in RaxML using the AL2 alignment. Unifrac tests were run with 100 permutations and P
values were corrected for multiple comparisons using the
Bonferroni correction. Phylogenetic outgroups were not
considered in the Unifrac tests.
Phylogenetic Diversity within Marine–Brackish
and Freshwater Clades
In order to estimate the amount of evolutionary diversification (i.e., amount of molecular evolution) that has occurred within both the marine–brackish and freshwater
clusters (phylogenetic diversity), we have added the branch
lengths within each of them (F 5 sum of branch lengths
within freshwater cluster; M 5 sum of branch lengths
within marine–brackish cluster). Subsequently, in order
to normalize these values, we divided F and M by the
number of sequences contained by the freshwater and
marine–brackish clusters correspondingly (F1 5 F/127
and M1 5 M/461). Afterward, we calculated the ratio
F1/M1, which estimates the proportion of evolutionary
diversification between the freshwater and marine–
brackish clusters.
Fragment Recruitment Analyses (FRAs)
FRA uses the genome of a given strain to detect homologous fragments (sequences) in metagenomic libraries (see
Rusch et al. 2007). Here, we have used the genome of the
MBE
marine SAR11 strain C. P. ubique HTCC1062 (accession
number CP000084) to check for the presence of evolutionary-related strains in metagenomic libraries from environments distributed along a wide salinity range (0–175 PSU).
This approach can shed light on marine–freshwater transitions. For example, if close relatives of the marine strain
HTCC1062 are present in fresh waters, then various reads
matching with high similarity different areas of the query
genome should be detected in freshwater metagenomic libraries. This analysis can also provide an indication of
whether or not the differences or similarities in the 16S between different groups are reflected in the rest of their genomes. This is particularly relevant, as substantial horizontal
gene transfer could still occur between genetically differentiated saline and freshwater lineages, specially in SAR11 (see
Vergin et al. 2007; Gilbert et al. 2008). The strain C. P. ubique
HTCC1062 was the first member of SAR11 to be isolated
(Rappe et al. 2002). This isolate has been collected from
a depth of 10 m, at a location 27.6 km off from the coast
of Oregon (44°39.1#N, 124°24.7#W). The salinity of the water from where HTCC1062 was isolated was about 33 PSU
(Grantham et al. 2004). FRA was carried out in both CAMERA (http://camera.calit2.net/; Seshadri et al. 2007) and
Meta Genome Rapid Annotation using Subsystem Technology (MG-RAST) (http://metagenomics.nmpdr.org/; Meyer
et al. 2008). FRA was performed in 13 publicly available
metagenomes from diverse aquatic environments (see table
1). These metagenomic data sets were selected in order to
encompass a wide salinity gradient and different environmental conditions (e.g., lakes vs. marine). Other metagenomic libraries have also been investigated, but they are
not included in the final analyses because they do not provide additional information (data not shown). All metagenomic libraries originated from water samples taken at one
depth (normally surface) at each sampled site. However,
metagenomic libraries from different depths were available
for Ace Lake, and we have included them in this study as
this lake presents strong vertical gradients in several physicochemical variables (Rankin et al. 1999) that can affect
microbial community composition.
16S Survey of Metagenomic Libraries
Large metagenomic libraries can potentially contain genome fragments from rare populations that are not normally detected in regular clone libraries of amplified 16S
rDNA genes. Thus, for example, if there were freshwater
SAR11 strains present in low abundances in marine environments, then those strains could be detected by Blastsearching known fresh waters sequences against marine
metagenomic data sets. Using this approach, we aimed
at detecting relatives of marine, brackish, and freshwater
SAR11 sequences in the 13 analyzed metagenomes. Representative 16S rDNA sequences belonging to freshwater
(Z99997, LD12), Brackish (U70686, putative group), and
Marine (AF510191, HTCC1062) groups were used as
queries for Blast searches in MG-RAST (Meyer et al.
2008) against the investigated metagenomic libraries. Only
those retrieved sequences with an E-value , 0.0001 and
349
MBE
350
Only recruited fragments with .50% similarity were considered in the analyses. Metagenomic data sets were obtained from CAMERA and follow the nomenclature given in CAMERA or GOS (see Rusch et al. 2007). The size fraction from
which each sample was obtained ranged between 0.1 and 0.8 lm (except GS00C, which was obtained from the fraction 0.22–0.8 lm).
Size Data set Number of Mean of
(Number of Recruited Fragment
Min Bit Score/Max
Sequences) Fragments Identity
SD E-value/Min align.
95,521
21,174
83.3
11.9
34/1.26E202/17
103,058
14,710
82.3
12.5
32/7.94E203/15
368,835
46,208
73
11.2
35/1.00E202/18
59,679
16,770
86.6
12.2
35/7.94E203/19
124,435
16,172
74.5
12.4
45/5.01E203/23
126,162
15,561
71.8
12.0
44/7.94E203/25
28,481
0
0
0
N/A
281,490
15,301
82.6
12.4
36/1.00E202/19
54,446
2
74
7.1 149/3.16E238/91
10,042
3
63
1.7 167/5.01E240/134
96,72
9
74
15.7 190/6.31E247/146
100,085
29
65.4
15.1
57/7.94E207/51
296,355
8,419
64.3
9.5
45/5.01E03/29
Salinity
Library
Sample Location
Environment
Latitude/Longitude
Depth (m)
(PSU)
1 (S9)
Open Ocean, Antarctica
Marine
63°53#28$S–112°4#23$E
2
34.0
2 (S8)
Newcomb Bay, Antarctica
Marine
66°16#12$S–110°31#59$E
2
34.0
3 (GS00C) Sargasso Sea, Atlantic Ocean
Marine
32°12#29$N–64°00#36$W
5
36.7
4 (GS006) Bay of Fundy, Canada
Estuary
45°6#42$N–64°56#48$W
1
25.0-31.0
5 (GS011) Delaware Bay, United States
Estuary
39°25#4$N–75°30#15$W
1
3.5
6 (GS012) Chesapeake Bay, United States Estuary
38°56#49$N–76°25#2$W
13.2
3.5
7 (S7)
Organic Lake, Antarctica
Hypersaline, marine derived 68°25#58$S–78°17#59$E
0.1
175.0
8 (S6)
Ace Lake, Antarctica
Saline, marine derived
68°24#0$S–78°10#58$E
5
22.0
9 (S4)
Ace Lake, Antarctica
Saline, marine derived
68°24#0$S–78°10#58$E
12.7
22.0
10 (S3)
Ace Lake, Antarctica
Saline, marine derived
68°24#0$S–78°10#58$E
14
32.0
11 (S2)
Ace Lake, Antarctica
Saline, marine derived
68°24#0$S–78°10#58$E
18
34.0
12 (S1)
Ace Lake, Antarctica
Saline, marine derived
68°24#0$S–78°10#58$E
23
N/D
13 (GS020) Lake Gatun, Panama
Freshwater
9°9#52$N–79°50#10$W
2
0.1
Table 1. Metagenomic Libraries Used for Fragment Recruitment Analysis Using the Genome of the Marine SAR11 Strain Candidatus Pelagibacter ubique HTCC1062 as Query and a General
Description of the Results.
Logares et al. · doi:10.1093/molbev/msp239
a Bit score . 1,000 were further analyzed, as they have reasonable chances of being SAR11 members. The percentage
of similarity between the query and the retrieved sequences
was analyzed.
Results
Deep Evolutionary Divergence between Marine–
Brackish and Freshwater SAR11 Lineages
Our phylogenetic analyses (based on AL2/4) gave significant support for the SAR11 cluster (100% bootstrap support [BS]), as well as for the marine–brackish (.70% BS)
and freshwater (.70% BS; so-called LD12, Zwart et al.
2002) subclusters (fig. 1, see summary of the phylogenetic
results in supplementary table S1, Supplementary Material
online). In the trees based on AL2, the marine–brackish
cluster contained 461 sequences, whereas the freshwater
cluster contained 127. Within the marine–brackish cluster,
there were 220 marine (48%) and 241 brackish (52%) sequences (fig. 2). No freshwater sequences fell into the
marine–brackish cluster. The freshwater cluster (LD12)
was composed by 119 freshwater (94%) and 8 brackish
(6%) sequences (fig. 2). Marine sequences were absent in
the freshwater cluster. Due to the low proportion of
brackish sequences falling into the freshwater cluster (that
probably do not represent indigenous brackish lineages
that have derived from freshwater ancestors), we have
named this cluster simply ‘‘freshwater’’ (instead of
‘‘freshwater–brackish’’).
Overall, the phylogenetic diversity (i.e., amount of molecular evolution) within the marine–brackish cluster was
at least 7.5 times higher than the diversity within the freshwater cluster (fig. 1; the normalized phylogenetic diversity
within the freshwater clade was 0.0014, whereas the diversity within the marine–brackish cluster was 0.0111; calculations based on AL2). Despite the significant statistical
support for the backbone of the SAR11 phylogeny, most
of the branching patterns within the marine–brackish
and freshwater subclusters were unsupported (fig. 2).
Within the marine–brackish SAR11, the general clustering
patterns obtained in our phylogenetic reconstructions
(marine–brackish phylogeny backbone; AL2) agreed to
a certain extent with previous work, and therefore, we will
follow the cluster nomenclature already proposed (see, e.g.,
Kan et al. 2008). Specifically, the subclusters SAR11-I (66%
BS) and SAR11-II (97% BS) were recovered in our phylogenetic reconstructions (fig. 2). However, the previously proposed cluster SAR11-III (normally sister to the freshwater
cluster) was not supported by our analyses (fig. 2). We constructed the more stringent alignment AL4, excluding sequences branching at the base of the marine–brackish
cluster (see fig. 2 ‘‘A’’) in order to test if these sequences
could have affected the tree-topology reconstruction
and if SAR11-III would be recovered under these new conditions. However, the removal of those sequences increased
the BS for the previously observed topology.
Overall, SAR11-I was recovered as a mosaic of marine
and brackish sequences (fig. 2) with no clear internal
Saline–Freshwater Transitions in Prokaryotes · doi:10.1093/molbev/msp239
FIG. 1. Distinct saline and freshwater lineages in SAR11. SAR11 ML
phylogenetic tree (phylogram) based on 604 sequences (AL2)
calculated under the GTR þ MIX þ I model in RaxML. The best
topology was obtained after 100 heuristic searches. BS values
presented over the nodes derive from 1,000 pseudoreplicates. (A)
phylogram showing the backbone of the phylogeny, with the
marine–brackish (red) and freshwater (blue) subgroups collapsed.
Note the reduced phylogenetic diversity (i.e., amount of molecular
evolution) within the freshwater clade when compared with the
marine–brackish. (B) Radial phylogram illustrating the amount of
genetic divergence within the marine–brackish and freshwater
clusters. The name of some major clusters is indicated over the
branches. The arrows indicate nodes where some subclusters start.
The scale bar indicates substitutions/site.
clustering pattern. Most of the brackish sequences originated from the Chesapeake and Delaware bays. In addition,
SAR11-I contained sequences from various locations, such
as the Arctic, Southern Ocean, deep sea, and the coast of
Oregon, United States. The strain C. P. ubique (HTCC1062),
which was genome sequenced and used in this work for
FRAs, fell within this cluster.
SAR11-II was predominantly composed of marine sequences (fig. 2). Within this group, we have recovered
the clusters SAR11-IIA (91% BS; only represented by two
sequences), SAR11-IIB (74% BS), and SAR11-IIC (79% BS)
MBE
(fig. 2). SAR11-IIB contained 154 sequences from diverse
geographic origins, such as the Arctic and Southern Ocean,
Newport Harbor (US), and the sea near Costa Rica and
Panama. SAR11-IIC was composed of sequences originating
from different locations in the Pacific Ocean, the Sargasso
Sea, Caribbean Sea, and Arctic Ocean. Overall, most of the
recovered branching patterns within SAR11-II subclusters
were unsupported.
Several sequences, most of them brackish, had a basal
position in relation to SAR11-I and II (fig. 2). Some of these
sequences were previously assigned to SAR11-III (e.g.,
U70686; Kan et al. 2008 and AF353226; Stingl et al.
2007), a group that, as mentioned before, was not
supported in our analyses. Many of the basal sequences
originated from Chesapeake and Delaware bays. Other
sequences from diverse locations such as Arctic Ocean,
vicinity of Panama and Cocos Island were also present.
The freshwater cluster (LD12) included sequences originating from various freshwater environments around the
world, sometimes separated by large geographical distances
(see supplementary file S1, Supplementary Material online).
In particular, several sequences within LD12 originate from
Lake Gatun in Panama and were obtained during the GOS
expedition (Lake Gatun was the only freshwater body included in GOS). There was low support for the internal
branching patterns within LD12. The eight brackish sequences within LD12 did not present any obvious clustering pattern among each other or with other freshwater
sequences.
Within SAR11-I and II, among the basal sequences of the
marine–brackish cluster and within LD12, a number of statistically supported subclusters were present. These were
usually near the tip of the branches and included very similar sequences. Such clusters may correspond to ecotypes
or species, but because their analysis is not comprehended
within the main focus of our study, they will not be considered further.
Test of Environment-Specific Evolution
The first Unifrac analysis considered marine, brackish, and
fresh waters as three distinct environments. When all the
three environments were analyzed together, our results
indicated that at least one of them contained environment-specific lineages (P , 0.01; Unifrac significant). In
the pairwise comparison of environments, we observed
a marginally significant separation between marine, brackish, and fresh waters (P , 0.03 in all combinations; Unifrac
marginally significant), suggesting different lineages in the
different environments. In the second analysis, we considered marine–brackish as one environment and freshwater
as another, and we obtained a significant difference between them (P , 0.01; Unifrac significant). In the third
analysis, we considered marine as one environment and
brackish–freshwater as another, and a significant difference
was also obtained (P , 0.01; Unifrac significant). Overall,
the fact that the last two comparisons were both statistically significant suggests that environment-specific
evolution has occurred within each environment (i.e., that
351
Logares et al. · doi:10.1093/molbev/msp239
MBE
FIG. 2. Distribution of marine, brackish, and freshwater sequences across SAR11 phylogeny. Cladogram of the same tree shown in figure 1
indicating the distribution of marine (red), brackish (green), and freshwater (blue) sequences across the phylogeny. Major clusters or groups of
sequences are delineated with solid black lines and subclusters with discontinuous lines. BS values . 50% are presented over the nodes (circles;
size proportional to the support) and derive from 1,000 pseudoreplicates. A 5 group of basal sequences that were removed in some analyses to
test the tree topology. B 5 marine subclade basal to SAR11-I and II. Note that the proportion of marine, brackish, and freshwater sequences in
the tree most likely does not represent the relative diversity of these groups in nature.
there seem to be endemic marine, brackish, and freshwater
lineages as the pairwise comparison between these three
environments previously indicated). Otherwise, if strains
in brackish waters were closely related to several marine
or freshwater ones, one of the previous combinations would
have been either not significant or marginally significant.
FRAs Support Phylogenetic Patterns
Overall, our FRAs indicated that saline and freshwater
SAR11 are clearly distinct at the whole-genome level. Thus,
352
the 16S rDNA divergence between saline and freshwater
lineages reflects genomewide divergence in SAR11. Using
the genome of the marine SAR11 strain C. P. ubique
(HTCC1062), we carried out FRA in metagenomic libraries
from environments distributed along a wide salinity spectrum (see Materials and Methods). The most similar fragments (average similarity . 80%) were recruited from the
Southern Ocean in Antarctica (S8/9), Bay of Fundy
(Canada, GS006), and the saline surface waters of the
Antarctic Ace Lake (S6, table 1; fig. 3). Slightly less similar
Saline–Freshwater Transitions in Prokaryotes · doi:10.1093/molbev/msp239
MBE
FIG. 3. FRA results using Candidatus Pelagibacter ubique HTCC1062 as query genome. The gray columns show the average percentage of
similarity of the recruited fragments and its standard deviation. Black columns indicate the percentage of the corresponding metagenomic
library that was recruited. Note that the gray and black columns should not be compared. The used metagenomic libraries are indicated in the
x-axis and described in table 1. Only recruited fragments with at least 50% similarity were considered in the analyses. Other FRA results are
indicated in table 1.
fragments (but still with relatively high average similarities;
70–80%) were recovered from the Sargasso Sea (GS00C)
and the brackish estuaries Chesapeake (GS012) and Delaware (GS011) bays. The most dissimilar fragments recovered originated from the freshwater Lake Gatun
(Panama), indicating that at least no close relatives of
the marine HTCC1062 strains are present in fresh waters
(still, the average similarity of these fragments fell between
60% and 70%, which suggests that a fraction of them belong to LD12). In other metagenomic libraries, the amount
of recruited fragments was very low or null, indicating that
SAR11 members are absent or in very low densities in those
environments. Such were the cases of the Antarctic marine-derived lakes Organic (hypersaline), as well as the deep
layers of Ace Lake (saline, 12- to 23-m depth; see table 1).
We have also calculated the percentage of the total
metagenomic library represented by the recruited fragments in each analysis (fig. 3), as this can provide an insight
into the relative abundance of SAR11 in different environments. The largest amounts of fragments (20–29%) were
recruited from the Southern Ocean near Antarctica (S9)
and from the Bay of Fundy (GS006) (fig. 3). The amount
of fragments recruited from coastal Antarctic areas (S8),
the Sargasso Sea (GS00C), and the brackish estuaries Delaware (GS011) and Chesapeake (GS012) bays represented
in all cases between 11% and 15% of the corresponding
metagenomic libraries. The lowest amounts of fragments
(,5%) were recruited from the surface waters of the saline
Antarctic Ace Lake (S6) and the freshwater Lake Gatun
(Panama, GS020) (see fig. 3).
The Blast searches against all the analyzed metagenomic
libraries using representative 16S rDNA sequences from
marine, brackish, and fresh waters supported the previously
observed patterns (table 2). The freshwater sequence
Z99997 retrieved very similar sequences only from the
freshwater Lake Gatun in Panama and from the brackish
Delaware Bay, United States (table 2). This supports that
the somewhat different fragments recruited by the marine
genome HTCC1062 in Lake Gatun originate from freshwater SAR11 members. The brackish–coastal sequence
U70686 (assigned to this category by Kan et al. 2008) retrieved very similar sequences from the brackish Delaware
Bay. The marine sequence AF510191 (strain HTCC1062;
C. P. ubique) recruited very similar sequences from Antarctic marine waters (S8/S9), Bay of Fundy, Delaware and
Chesapeake bays, and the surface of Ace Lake (table 2).
The sequences retrieved from Ace Lake were included in
our phylogenetic analyses (AL3/4), and all of them clustered within the marine–brackish group SAR11-I. As expected from the FRA, none of the queries retrieved
sequences from Organic Lake or from the deeper layers
(.12 m) of Ace Lake.
Discussion
Distinct Marine–Brackish and Freshwater
Phylogenetic Clusters within SAR11
Our phylogenetic and statistical analyses based on the 16
rDNA strongly supported an evolutionary separation between freshwater and saline SAR11 groups (see also Zwart
et al. 2002; Rappe and Giovannoni 2003; Atamna-Ismaeel
et al. 2008). Within the marine–brackish cluster, we have
recovered with moderate-high support the formerly recognized clusters SAR11-I and SAR11-II (Stingl et al. 2007; Kan
353
MBE
354
Only those sequences with E-value , 0.0001 and bit-score . 1000 were considered in the analysis. The values in the last three columns indicate average percentage of similarity of the recruited sequences to the reference, standard deviation
(SD), and number of sequences recruited using the format X % (X SD; X sequences).
a
This sequence belongs to a putative brackish cluster (see Kan et al. 2008).
b
In GS011, sequences belonging to apparently two groups were recruited by the three references, therefore resulting values are presented independently.
Marine (AF510191,
HTCC1062)
97.9 (1.9; 30)
97.9 (1.3; 18)
96.4 (1.4; 63)
98.9 (0.7; 18)
98.0 (0.3; 25)/94.2 (2.5; 6)
98.1 (4.5; 15)
0
98.7 (0.7; 19)
0
0
0
0
89.4 (0.8; 11)
Brackish/Coastal
(U70686)a
0
0
89.1 (0.5; 15)
0
89.7 (1.0; 10)/98.4 (1.1; 5)
95.5 (4.7; 6)
0
96.0 (0; 2)
0
0
0
0
91.3 (0.5; 17)
Freshwater
(Z99997, LD12)
89.3 (1.1; 3)
88.0 (0; 1)
89.2 (0.8; 15)
89.4 (0.5; 8)
89.8 (1.2; 11)/98.5 (0.7; 2)b
91.3 (1.1; 3)
0
89.3 (1.4; 6)
0
0
0
0
98.5 (1.1; 26)
Salinity
(PSU)
34.0
34.0
36.7
25.0–31.0
3.5
3.5
175.0
22.0
22.0
32.0
34.0
N/D
0.1
Depth (m)
2
2
5
1
1
13.2
0.1
5
12.7
14
18
23
2
Sample Location
Open Ocean, Antarctica
Newcomb Bay, Antarctica
Sargasso Sea, Atlantic Ocean
Bay of Fundy, Canada
Delaware Bay, United States
Chesapeake Bay, United States
Organic Lake, Antarctica
Ace Lake, Antarctica
Ace Lake, Antarctica
Ace Lake, Antarctica
Ace Lake, Antarctica
Ace Lake, Antarctica
Lake Gatun, Panama
Metagenomic
Libraries
1 (S9)
2 (S8)
3 (GS00C)
4 (GS006)
5 (GS011)
6 (GS012)
7 (S7)
8 (S6)
9 (S4)
10 (S3)
11 (S2)
12 (S1)
13 (GS020)
Table 2. Average Percentage of Similarity of the 16S rDNA Sequences Recruited from the Metagenomic Libraries to Marine, Brackish, and Freshwater SAR11 Reference Sequences.
Logares et al. · doi:10.1093/molbev/msp239
et al. 2008). In contrast, the previously defined cluster
SAR11-III was not recovered in our taxon richer trees. Usually, SAR11-III has been recovered in phylogenies including
fewer taxa (e.g., Stingl et al. 2007; Kan et al. 2008; Carlson
et al. 2009; unpublished results), and hence, its presence
could have been the outcome of reduced taxon sampling.
The variable position of the SAR11-III cluster in other phylogenetic trees (compare, e.g., Kan et al. 2008; Carlson et al.
2009) supports the latter hypothesis. Further work including more sequences and other genes is needed to determine whether SAR11-III is a natural group.
In contrast to earlier publications (see Stingl et al. 2007;
Kan et al. 2008), our analyses did not recover the subclusters IA and IB within SAR11-I. In our results, sequences previously assigned to cluster IB grouped adjacent to each
other but did not constitute a principal subcluster. Thus,
SAR11-IA and IB may have appeared as main subclusters in
previous phylogenies also as a product of reduced taxon
sampling.
SAR11-I encompassed a mix of intercalated brackish and
marine sequences, suggesting that members of this group
could have moved repeatedly (i.e., adapted back and forth)
between brackish and marine waters. Alternatively, this
pattern might reflect a wide salinity tolerance of an originally marine or brackish group. For example, in several of
the cases where marine and brackish sequences were
closely related (e.g. AF510192 and EU919824), the brackish
sequences originated from either Newport Harbor
(26.5 PSU) or Kongsfjorden in the Arctic (in the latter, salinity can be reduced to less than 28 PSU due to runoff of
freshwater from the glaciers; see Piwosz et al. 2009). Thus,
several of the brackish sequences that were closely related
to marine sequences could actually represent marine
strains living in waters with salinity slightly lower than
in marine environments.
Within SAR11-II, the previously defined clusters SAR11IIA/B/C (Kan et al. 2008) were recovered by our analyses
with significant statistical support. Overall, the cluster
SAR11-II seems to contain mostly marine sequences from
various locations around the world.
Several sequences were basal to the groups SAR11-I and
II (hereafter BASAL; fig. 2). Sequences assigned to SAR11-III
in other works (see Stingl et al. 2007; Kan et al. 2008;
Carlson et al. 2009; the latter work named this group
SAR11-IV) were included within this set of basal sequences
in our phylogenies. Basal sequences were mostly retrieved
from brackish waters, except a number of marine sequences that clustered together (fig. 2 ‘‘B’’). This marine basal
cluster could represent an early marine ancestor of the
saline SAR11.
Within LD12, we observed some subclusters featuring
moderate to significant BS that may correspond to different ecotypes, comparable with the ecotypes suggested for
marine SAR11 counterparts (Rappe and Giovannoni 2003;
Brown and Fuhrman 2005; Carlson et al. 2009). Notably, we
found that the phylogenetic diversity (i.e., amount of
molecular evolution) within the freshwater SAR11 group
appears to be around 7.5 times lower than within the
Saline–Freshwater Transitions in Prokaryotes · doi:10.1093/molbev/msp239
marine–brackish cluster. In particular, freshwater SAR11
strains obtained from distant geographic locations were
very closely related, a pattern also observed in phylogenies
based on SAR11 proteorhodopsins (Atamna-Ismaeel et al.
2008). Overall, this points to a rapid diversification and dispersal in fresh waters and/or to different evolutionary rates
in saline and freshwater SAR11 lineages. Slower evolutionary rates in diverse genetic markers have been found in
freshwater crustaceans when compared with halophilic
counterparts (Hebert et al. 2002). The negative effects of
salt in the fidelity of DNA replication seem to have accelerated the rate of molecular evolution in the halophilic
taxa, despite their osmoregulatory capabilities (Hebert
et al. 2002), and analogous effects could have also affected
the evolution of saline SAR11. However, similar evolutionary rates have been observed in diverse saline and freshwater prokaryotes and microeukaryotes (Logares et al. 2009),
suggesting that the potential effects of salinity in molecular
evolution are not generalized.
The moderate–low BS for the internal topology of the
marine–brackish and freshwater clusters may be a consequence of a relatively rapid diversification, where the divergence time has been too short for the 16S rDNA to reflect
the diversification process. A comparable lack of support
appears to be present even in trees based on the fastevolving internal transcribed spacer rDNA (Brown and
Fuhrman 2005). Nevertheless, other processes may account
for the blurry diversification patterns. For example, high intraspecific recombination rates have been found in SAR11
(Vergin et al. 2007), which could decrease the phylogenetic
signal in the 16S through the generation of mosaic molecules with diverse evolutionary origins (Dewhirst et al. 2005).
Even though most environmental sequences used in our
analyses have been used (and checked) in other works (see,
e.g., Shaw et al. 2008 regarding the curation of the used
GOS sequences) and even though we have reassessed their
quality, it could still be possible that some erroneous sequences could have remained in our analyses. The solitary
environmental sequences branching at the base of the
marine–brackish cluster (fig. 2 ‘‘A’’) could represent such
an example. As mentioned earlier, the removal of those
sequences did not generate changes in the general tree topology; on the contrary, it increased its BS. Overall, the consistency in the general topologies and BS values that we
have obtained across different phylogenetic reconstructions make us confident that the potential effect of undetected erroneous sequences in our conclusions is most
likely minimal.
Exclusive Brackish Groups?
Our phylogenetic and statistical analyses suggested that
there might be exclusive SAR11 brackish lineages, as already
suggested by Kan et al. (2008) based on more reduced data
sets. Here, we have analyzed a relatively large number of
brackish sequences, and almost all of them (94%) grouped
with marine sequences, constituting the marine–brackish
cluster. It is important to note that within the marine–
MBE
brackish cluster, marine and brackish sequences were
not closely related for the most part, pointing to environment-specific evolution. The basal position of several
brackish sequences suggests that at least some of them
diverged early in the evolution of SAR11. Despite several
of the analyzed brackish sequences came from environments with salinities close to freshwater, the transitions between brackish and freshwater environments seem to have
been uncommon events. Overall, only 6% of the included
brackish sequences grouped within the freshwater clade,
and these brackish sequences probably represent freshwater strains that ended up in brackish surface waters as
a product of riverine freshwater discharge or runoff. The
absence of freshwater SAR11 from environments that
present even a slight increase in salinity mirrors early
results from studies on macroorganisms by Remane
(1934), suggesting that the distributions of freshwater
microbes and multicellular organisms may be similarly
affected by salinity.
Metagenomic Analyses Support the Observed
Phylogenetic Patterns and Give Insight into
Distributions along the Salinity Gradient
Our metagenomic analyses indicated a significant amount
of divergence at the genome level between SAR11 lineages
present in saline and fresh waters, which agrees with our
16S-based phylogenies. This suggests that the rate of horizontal gene transfer between saline and freshwater SAR11
lineages is probably not very high. We have used the genome of the marine SAR11 strain C. P. ubique HTCC1062
to perform FRA against selected metagenomic libraries
obtained from environments distributed along a wide salinity gradient (0–175 PSU). As initially expected, the
most similar fragments were recruited from saline environments (marine samples or the saline Ace Lake) and the
most dissimilar ones from the freshwater Lake Gatun. From
brackish environments, very similar and somewhat more
dissimilar fragments were recruited, suggesting that
a mix of marine, brackish, and freshwater SAR11 lineages
may populate these environments. The results of the
16S recruitment analyses pointed in the same direction
as our FRA results.
The presence of 16S sequences belonging to the marine–
brackish group SAR11-I in the surface waters of the marinederived Antarctic Ace Lake indicates that marine SAR11
strains have the potential to colonize and survive in saline
continental waterbodies. This is also supported by other
SAR11 sequences (affiliating to a putative coastal-brackish
group) that have been retrieved from saline lakes in China
(Wu et al. 2006; Kan et al. 2008). Overall, it is surprising that
members of SAR11-I manage to survive in Antarctic lakes,
where environmental conditions are considered to be
extreme (Laybourn-Parry and Pearce 2007).
Infrequent Saline–Freshwater Transitions in
SAR11
If transitions between environments with different salinity
would have been common during the evolutionary
355
MBE
Logares et al. · doi:10.1093/molbev/msp239
diversification of SAR11, intercalated marine, brackish and
freshwater sequences across the phylogenies would have
been expected (see Logares et al. 2009). However, the presence of statistically supported and distinct saline and freshwater clusters clearly indicates that transitions between
these environments have been rare during the diversification of the group. Freshwater–saline transitions seem to
have been rare also in other microbial groups (Logares
et al. 2009). However, the huge population size of SAR11
led us to hypothesize that at least a few more transitions
could have occurred within this lineage, as has been observed in other microbial groups (e.g., Alverson et al.
2007; Shalchian-Tabrizi et al. 2008).
Environmental transitions between marine and brackish
waters seem to have occurred more often during the evolution of SAR11 than transitions between freshwater and
brackish environments, which mirrors patterns from multicellular organisms (Remane 1934). This is probably related
to intrinsic genetic, physiologic, or ecologic limitations
in SAR11. In particular, it has been shown that marine
SAR11 require exogenous reduced sulfur originating from
other plankton for growth (Tripp et al. 2008), and the absence of such accompanying taxa or specific compounds in
new environments may preclude transitions. In addition,
genome streamlining in SAR11 (Giovannoni, Tripp et al.
2005) may reduce the plasticity of the group to switch between environments. Overall, more research is needed to
determine what mechanisms prevent saline–freshwater
transitions.
Concluding Remarks
Our phylo-metagenomic analyses indicated that few
saline–freshwater transitions occurred during the evolutionary diversification of SAR11, generating genetically distinct lineages in each environment. In addition, the rate of
horizontal gene transfer between saline and freshwater
SAR11 does not appear to have been very high during
the evolution of the group. The divergence between saline
and freshwater SAR11 probably involved several adaptive
changes at the genome level. Future genomic studies are
needed to unveil the mechanisms of adaptation of
SAR11 to these environments and answer the question
‘‘what makes it a bacterium saline or freshwater’’?
Supplementary Material
Supplementary file S1 and supplementary table S1 are
available at Molecular Biology and Evolution online
(http://www.mbe.oxfordjournals.org/).
Acknowledgments
Financial support was provided by the Swedish Research
Council for Environment, Agricultural Sciences and Spatial
Planning (Formas) via Uppsala Microbiomics Centre
(UMC), and the Swedish Research Council (grants to
S.B.). Phylogenetic analyses were run at the University of
Oslo Bioportal (http://www.bioportal.uio.no/). We thank
N. Rodriguez Ezpeleta for the help provided with some phy356
logenetic analyses and the staff of CAMERA and MG-RAST
projects for support/advice with the analyses. We thank
the comments and suggestions of the four reviewers
and the editor that helped to improve this manuscript.
References
Alverson AJ, Jansen RK, Theriot EC. 2007. Bridging the Rubicon:
phylogenetic analysis reveals repeated colonizations of marine
and fresh waters by thalassiosiroid diatoms. Mol Phylogenet Evol.
45:193–210.
Anonymous. 1958. The Venice system for the classification of
marine waters according to salinity. Limnol Oceanogr. 3:346–347.
Atamna-Ismaeel N, Sabehi G, Sharon I, Witzel KP, Labrenz M,
Jurgens K, Barkay T, Stomp M, Huisman J, Beja O. 2008.
Widespread distribution of proteorhodopsins in freshwater and
brackish ecosystems. ISME J. 2:656–662.
Brown MV, Fuhrman JA. 2005. Marine bacterial microdiversity as
revealed by internal transcribed spacer analysis. Aquat Microb
Ecol. 41:15–23.
Carlson CA, Morris R, Parsons R, Treusch AH, Giovannoni SJ,
Vergin K. 2009. Seasonal dynamics of SAR11 populations in the
euphotic and mesopelagic zones of the northwestern Sargasso
Sea. ISME J. 3:283–295.
Castresana J. 2000. Selection of conserved blocks from multiple
alignments for their use in phylogenetic analysis. Mol Biol Evol.
17:540–552.
Dahl E. 1956. Ecological salinity boundaries in Poikilohaline waters.
Oikos 7:1–21.
DeSantis TZ Jr., Hugenholtz P, Keller K, Brodie EL, Larsen N,
Piceno YM, Phan R, Andersen GL. 2006. NAST: a multiple
sequence alignment server for comparative analysis of 16S rRNA
genes. Nucleic Acids Res. 34:W394–W399.
DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K,
Huber T, Dalevi D, Hu P, Andersen GL. 2006. Greengenes,
a chimera-checked 16S rRNA gene database and workbench
compatible with ARB. Appl Environ Microbiol. 72:5069–5072.
Dewhirst FE, Shen Z, Scimeca MS, Stokes LN, Boumenna T, Chen T,
Paster BJ, Fox JG. 2005. Discordant 16S and 23S rRNA gene
phylogenies for the genus Helicobacter: implications for
phylogenetic inference and systematics. J Bacteriol. 187:6106–6118.
Field KG, Gordon D, Wright T, Rappe M, Urback E, Vergin K,
Giovannoni SJ. 1997. Diversity and depth-specific distribution of
SAR11 cluster rRNA genes from marine planktonic bacteria.
Appl Environ Microbiol. 63:63–70.
Gilbert JA, Muhling M, Joint I. 2008. A rare SAR11 fosmid clone
confirming genetic variability in the ‘Candidatus Pelagibacter
ubique’ genome. ISME J. 2:790–793.
Giovannoni SJ, Bibbs L, Cho JC, et al. (12 co-authors). 2005.
Proteorhodopsin in the ubiquitous marine bacterium SAR11.
Nature. 438:82–85.
Giovannoni SJ, Britschgi TB, Moyer CL, Field KG. 1990. Genetic
diversity in Sargasso Sea bacterioplankton. Nature 345:60–63.
Giovannoni SJ, Tripp HJ, Givan S, et al. (14 co-authors). 2005.
Genome streamlining in a cosmopolitan oceanic bacterium.
Science 309:1242–1245.
Grantham BA, Chan F, Nielsen KJ, Fox DS, Barth JA, Huyer A,
Lubchenco J, Menge BA. 2004. Upwelling-driven nearshore
hypoxia signals ecosystem and oceanographic changes in the
northeast Pacific. Nature. 429:749–754.
Hammer T. 1986. Saline lake ecosystems of the world. Dordrecht
(The Netherlands): Dr. W. Junk Publishers
Hebert PD, Remigio EA, Colbourne JK, Taylor DJ, Wilson CC. 2002.
Accelerated molecular evolution in halophilic crustaceans.
Evolution 56:909–926.
Saline–Freshwater Transitions in Prokaryotes · doi:10.1093/molbev/msp239
Kan J, Evans SE, Chen F, Suzuki MT. 2008. Novel estuarine
bacterioplankton in rRNA operon libraries from the Chesapeake
Bay. Aquat Microb Ecol. 51:55–66.
Katoh K, Toh H. 2008. Recent developments in the MAFFT multiple
sequence alignment program. Brief Bioinform. 9:286–298.
Lampert W, Sommer U. 2007. Limnoecology. The ecology of lakes
and streams. Oxford: Oxford University Press.
Laybourn-Parry J, Pearce DA. 2007. The biodiversity and ecology of
Antarctic lakes: models for evolution. Philos Trans R Soc Lond B
Biol Sci. 362:2273–2289.
Letunic I, Bork P. 2007. Interactive Tree Of Life (iTOL): an online tool
for phylogenetic tree display and annotation. Bioinformatics
23:127–128.
Logares R, Brate J, Bertilsson S, Clasen JL, Shalchian-Tabrizi K,
Rengefors K. 2009. Infrequent marine-freshwater transitions in
the microbial world. Trends Microbiol. 17:414–422.
Lozupone C, Knight R. 2005. UniFrac: a new phylogenetic method
for comparing microbial communities. Appl Envir Microbiol.
71:8228–8235.
Martin AP. 2002. Phylogenetic approaches for describing and
comparing the diversity of microbial communities. Appl Envir
Microbiol. 68:3673–3682.
Meyer F, Paarmann D, D’Souza M, et al. (12 co-authors). 2008. The
metagenomics RAST server—a public resource for the automatic phylogenetic and functional analysis of metagenomes.
BMC Bioinform. 9:386.
Morris RM, Rappe MS, Connon SA, Vergin KL, Siebold WA,
Carlson CA, Giovannoni SJ. 2002. SAR11 clade dominates ocean
surface bacterioplankton communities. Nature 420:806–810.
Piwosz K, Walkusz W, Hapter R, Wieczorek P, Hop H, Wiktor J. 2009.
Comparison of productivity and phytoplankton in a warm
(Kongsfjorden) and a cold (Hornsund) Spitsbergen fjord in midsummer 2002. Polar Biol. 32:549–559.
Rambaut A. 2009. FigTree. Institute of Evolutionary Biology,
University of Edinburgh, United Kingdom. http://tree.bio.ed.
ac.uk/software/figtree/.
Rankin LM, Gibson JAE, Franzmann PD, Burton HR. 1999. The
chemical stratification and microbial communities of Ace Lake,
Antarctica: a review of the characteristics of a marine-derived
meromictic lake. Polarforschung. 66:33–52.
Rappe MS, Connon SA, Vergin KL, Giovannoni SJ. 2002. Cultivation
of the ubiquitous SAR11 marine bacterioplankton clade. Nature
418:630–633.
Rappe MS, Giovannoni SJ. 2003. The uncultured microbial majority.
Annu Rev Microbiol. 57:369–394.
MBE
Remane A. 1934. Die Brackwasserfauna. Verh Deutsch Zool Ges.
36:34–74.
Rusch DB, Halpern AL, Sutton G, et al. (40 co-authors). 2007. The
Sorcerer II Global Ocean Sampling expedition: northwest
Atlantic through eastern tropical Pacific. PLoS Biol. 5:e77.
Seshadri R, Kravitz SA, Smarr L, Gilna P, Frazier M. 2007.
CAMERA: a community resource for metagenomics. PLoS Biol.
5:e75.
Shalchian-Tabrizi K, Brate J, Logares R, Klaveness D, Berney C,
Jakobsen KS. 2008. Diversification of unicellular eukaryotes:
cryptomonad colonizations of marine and fresh waters inferred
from revised 18S rRNA phylogeny. Environ Microbiol. 10:
2635–2644.
Shaw AK, Halpern AL, Beeson K, Tran B, Venter JC,
2Martiny JB. 2008. It’s all relative: ranking the diversity of
aquatic bacterial communities. Environ Microbiol. 10:2200–
2210.
Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based
phylogenetic analyses with thousands of taxa and mixed models.
Bioinformatics 22:2688–2690.
Stingl U, Tripp HJ, Giovannoni SJ. 2007. Improvements of
high-throughput culturing yielded novel SAR11 strains and
other abundant marine bacteria from the Oregon coast
and the Bermuda Atlantic Time Series study site. ISME J. 1:
361–371.
Tripp HJ, Kitner JB, Schwalbach MS, Dacey JW, Wilhelm LJ,
Giovannoni SJ. 2008. SAR11 marine bacteria require exogenous
reduced sulphur for growth. Nature 452:741–744.
Vergin KL, Tripp HJ, Wilhelm LJ, Denver DR, Rappe MS,
Giovannoni SJ. 2007. High intraspecific recombination rate in
a native population of Candidatus pelagibacter ubique (SAR11).
Environ Microbiol. 9:2430–2440.
Wilhelm LJ, Tripp HJ, Givan SA, Smith DP, Giovannoni SJ. 2007.
Natural variation in SAR11 marine bacterioplankton
genomes inferred from metagenomic data. Biol Direct.
2:27.
Wu QL, Zwart G, Schauer M, Kamst-van Agterveld MP, Hahn MW.
2006. Bacterioplankton community composition along a
salinity gradient of sixteen high-mountain lakes located on
the Tibetan Plateau, China. Appl Environ Microbiol. 72:5478–
5485.
Zwart G, Crump BC, Agterveld MPKV, Hagen F, Han SK. 2002.
Typical freshwater bacteria: an analysis of available 16S rRNA
gene sequences from plankton of lakes and rivers. Aquat Microb
Ecol. 28:141–155.
357