Evolutionary resurrection of flagellar motility via rewiring of the nitrogen regulation system Article Accepted Version Last edited version for re-submission Taylor, T. B., Mulley, G., Dills, A. H., Alsohim, A. S., McGuffin, L. J., Studholme, D. J., Silby, M. W., Brockhurst, M. A., Johnson, L. J. and Jackson, R. W. (2015) Evolutionary resurrection of flagellar motility via rewiring of the nitrogen regulation system. Science, 347 (6225). pp. 1014-1017. ISSN 0036-8075 doi: 10.1126/science.1259145 Available at http://centaur.reading.ac.uk/39447/ It is advisable to refer to the publisher’s version if you intend to cite from the work. Published version at: http://dx.doi.org/10.1126/science.1259145 To link to this article DOI: http://dx.doi.org/10.1126/science.1259145 Publisher: AAAS All outputs in CentAUR are protected by Intellectual Property Rights law, including copyright law. Copyright and IPR is retained by the creators or other copyright holders. Terms and conditions for use of this material are defined in the End User Agreement . www.reading.ac.uk/centaur CentAUR Central Archive at the University of Reading Reading’s research outputs online Title: Evolutionary resurrection of flagellar motility via rewiring of the nitrogen regulation system Authors: Tiffany B. Taylor1†, Geraldine Mulley1†, Alexander H. Dills2, Abdullah S. Alsohim1,3, Liam J. McGuffin1, David J. Studholme4, Mark W. Silby2, Michael A. Brockhurst5, Louise J. Johnson1,*, Robert W. Jackson1,6 Affiliations: 1 School of Biological Sciences, University of Reading, Whiteknights, Reading, RG6 6AJ, UK 2 Department of Biology, University of Massachusetts Dartmouth, 285 Old Westport Road, North Dartmouth, MA 02747, USA 3 Department of Plant Production and Protection, Qassim University, Qassim, Saudi Arabia P.O. Box 6622 4 College of Life and Environmental Sciences, University of Exeter, Stocker Road, Exeter, EX4 4QD, UK 5 Department of Biology, University of York, Wentworth Way, York, YO10 5DD, UK 6 The University of Akureyri, Borgir vid Nordurslod, IS-600 Akureyri, Iceland *Corresponding author: [email protected] † These authors contributed equally to this work 2 1 One Sentence Summary: 2 Rapid repeatable rewiring of regulatory networks: a nitrogen regulatory gene evolves a 3 new function, restoring flagella to immotile bacteria. 4 5 Abstract: 6 A central process in evolution is the recruitment of genes to regulatory networks. We 7 engineered immotile strains of the bacterium Pseudomonas fluorescens that lack 8 flagella due to deletion of the regulatory gene fleQ. Under strong selection for motility, 9 these bacteria consistently regained flagella within 96 hours via a two-step evolutionary 10 pathway. Step 1 mutations increase intracellular levels of phosphorylated NtrC, a distant 11 homologue of FleQ, which begins to commandeer control of the fleQ regulon at the cost 12 of disrupting nitrogen uptake and assimilation. Step 2 is a switch-of-function mutation 13 that redirects NtrC away from nitrogen uptake and towards its novel function as a 14 flagellar regulator. Our results demonstrate that natural selection can rapidly rewire 15 regulatory networks in very few, repeatable mutational steps. 16 17 18 Main Text: A longstanding evolutionary question concerns how the duplication and 19 recruitment of genes to regulatory networks facilitates their expansion (1), and how 20 networks gain mutational robustness and evolvability (2). Bacteria respond to diverse 21 environments using a vast range of specialised regulatory pathways, predominantly 22 two-component systems (3), which are the result of adaptive radiations within gene 3 23 families. Due to past cycles of gene duplication, divergence and horizontal genetic 24 transfer, there is often extensive homology between the components of different 25 pathways (4), raising the possibility of cross-talk or redundancy between pathways (5). 26 Here we monitor the recovery of microbial populations from a catastrophic gene 27 deletion: bacteria engineered to lack a particular function are exposed to environments 28 that impose strong selection to re-evolve it, sometimes by recruitment of new genes to 29 regulatory networks (6, 7, 8, 9). 30 In the plant-associated soil bacterium P. fluorescens, the master regulator of 31 flagellar synthesis is FleQ (also called AdnA), a 154-dependent enhancer binding protein 32 (EBP) that activates transcription of genes required for flagellum biosynthesis (10, 11). 33 The starting P. fluorescens strain is AR2; this strain lacks flagella, due to deletion of 34 fleQ, and is unable to move by spreading motility due to mutation of viscosin synthase 35 (viscB), resulting in a distinctive, point-like colony morphology on spreading motility 36 medium (SMM) (12) (Figure 1A). We grew replicate populations of AR2 on SMM; when 37 local nutrients became depleted, starvation imposed strong selection to re-evolve 38 motility. To demonstrate that this finding was not strain-specific, these experiments were 39 replicated in a different strain of P. fluorescens, Pf0-2x. This strain is a 2fleQ variant of 40 Pf0-1, already viscosin-deficient, and is thus unable to move by spreading or swimming 41 motility. 42 After 96 hours incubation of AR2 and Pf0-2x at room temperature on SMM, two 43 breakout mutations were visible conferring first slow (AR2S and Pf0-2xS) and then fast 44 (AR2F and Pf0-2xF) spreading over the agar surface (Fig. 1A). The AR2F strain 45 produces flagella, but we could not detect flagella in EM samples for AR2S (Fig. 1B). 4 46 Genome resequencing revealed a single nucleotide point mutation in ntrB in strain 47 AR2S, causing an amino acid substitution within the PAS domain of the histidine kinase 48 (HK) sensor NtrB (T97P). The fast-spreading strain AR2F had acquired an additional 49 point mutation in the 154-dependent EBP gene ntrC, which alters an amino acid 50 (R442C) within the DNA-binding domain (Table 1 & S2). 51 NtrB and NtrC comprise a two-component system: under nitrogen limitation NtrB 52 phosphorylates NtrC, which activates transcription of genes required for nitrogen uptake 53 and metabolism. To determine how mutations in this separate regulatory pathway 54 restored motility in the absence of FleQ, we performed microarray and qRT-PCR 55 analyses of the ancestral and evolved strains (Fig. S1 & Table S1). The expression of 56 genes required for flagellum biosynthesis and chemotaxis was abolished in AR2 57 compared to wild-type SBW25 (Fig. 2A). The ntrB mutation in AR2S partially restores 58 the expression of flagellar genes, and over-activates the expression of genes involved 59 in nitrogen regulation, uptake and metabolism. The subsequent ntrC mutation in AR2F 60 reduces the expression of nitrogen uptake and metabolism genes, while further up- 61 regulating flagellar and chemotaxis gene expression to wild-type levels (Fig. 2B). While 62 AR2S and AR2F showed higher growth rates than the ancestor in LB medium (the 63 medium on which the mutants arose; Tukey-Kramer HSD test, growth in LB compared 64 to AR2: AR2S, P < 0.001; AR2F, P < 0.001) (Fig. 1C), both mutants grew poorly in 65 minimal medium with ammonium as the sole nitrogen source (Tukey-Kramer HSD test, 66 growth in M9 + ammonium compared to AR2: AR2S, P < 0.001; AR2F, P = 0.001). This 67 is likely to be the result of ammonium toxicity due to the strong up-regulation of genes 5 68 involved in ammonium uptake and assimilation, indicating a pleiotropic cost of this 69 adaptation. 70 Sequencing of the ntrBC locus from independently evolved replicate strains 71 showed that evolution often followed parallel trajectories in both AR2 and Pf0-2x: 72 mutation of ntrB gave a slow-spreading strain and this was followed by mutation of ntrC 73 yielding a fast-spreading strain (Table 1; Fig. 3A & C). While all Pf0-2xF replicates 74 carried mutations in ntrC, several Pf0-2xS strains were not mutated in ntrB, suggesting 75 an alternative evolutionary pathway. Genome resequencing of these strains revealed 76 mutations in glnA or glnK (Table 1; Fig. 3B) likely to result in loss of function leading to 77 abnormally high levels of phosphorylated NtrC: reduced ammonium assimilation by 78 glutamine synthetase (glnA) would impose severe nitrogen limitation in the cell 79 irrespective of nitrogen availability, whereas GlnK is a PII-protein that regulates both 80 NtrB and glutamine synthetase activities. 81 These data suggest a predictable two-step evolutionary process: Step 1 82 increases levels of phosphorylated NtrC, via either (a) a direct regulatory route with 83 mutations in NtrB or GlnK, or (b) a physiological route with loss-of-function mutations 84 reducing glutamine synthetase activity and causing NtrB activation, partially or 85 intermittently reactivating the flagellar cascade. In Step 2, NtrC adapts to enhance 86 activation of the flagellar genes and in doing so, becomes a less potent activator of 87 nitrogen uptake genes. This model explains the microarray data and is consistent with 88 the predicted structural effects of the mutations (Figs. 2 and 3). Specifically, for NtrB the 89 structural changes are likely to either increase kinase or reduce phosphatase activity. In 6 90 support of this, the mutated NtrB(D228A) repeatedly emerging in Pf0-2xS resembles 91 NtrB(D227A) in P. aeruginosa which constitutively activates the Ntr system (13). 92 NtrC is a distant homologue of FleQ, sharing 30% amino acid identity and the 93 same three structural domains (TM-score > 0.7; P < 0.001; Fig. 3D) (14): an N-terminal 94 receiver domain, a conserved central 154-interacting domain, and a C-terminal DNA- 95 binding domain containing a helix-turn-helix (HTH) motif flanked by highly disordered 96 regions. We posit that an overabundance of phosphorylated NtrC activates transcription 97 of flagellar genes through functional promiscuity (15). Consistent with this, the ntrC 98 mutations in fast-spreading strains are predominantly located within or adjacent to the 99 HTH domain and likely influence enhancer-binding specificity; one is a frameshift 100 abolishing the HTH altogether (Fig. 3C). The evolved NtrC’ must be constitutively 101 phosphorylated by overactive NtrB to enable its new multipurpose role, with the result 102 that flagellum biosynthesis and nitrogen regulation are probably no longer responsive to 103 environmental stimuli. Consequently, there is a trade-off between nitrogen utilization 104 and motility (Fig. 1A & C). 105 The flagellar regulatory network may have an unusually dynamic evolutionary 106 history. Flagella are expensive to make, and not always advantageous. Pathogens 107 expressing flagella can trigger strong immune responses in the host, so rapid transitions 108 are seen over short timescales between uniflagellate, multiflagellate and aflagellate 109 states (16). This volatility is reflected in the structure of regulatory networks: in close 110 relatives of Pseudomonas, fleQ appears not to be involved in flagellar gene expression 111 (17), and in Helicobacter pylori, a gene of unknown function can be used as a “spare 112 part” to permit motility in flhB mutants (18). Our results illustrate that trans-acting 7 113 mutations can contribute to gene network evolution (19), but that as predicted, such 114 mutations bear severe pleiotropic costs (20, 21). Genes can retain the potential to take 115 on the functions of long-diverged homologues, suggesting that some degree of 116 evolutionary resilience is a consequence of regulatory pathways that evolve via gene 117 duplication. While de novo origination of new functions in nature is likely to take longer 118 and involve more mutational steps, this system enables us to understand the adaptive 119 process in detail at the genetic and phenotypic level. Here we identified a novel, 120 tractable model for gene network evolution and observed, in real time, the rewiring of 121 gene networks to enable the incorporation of a modified component (NtrC’) creating a 122 novel regulatory function, by a highly repeatable two-step evolutionary pathway with the 123 same point mutations often recurring in independent lineages. 124 125 References: 126 1. evolution. Annu. Rev. Cell. Dev. Bi. 18, 53-80 (2002). 127 128 2. 3. 133 J. A. Hoch, Two-component and phosphorelay signal transduction. Curr. Opin. Microbiol. 3, 165-170 (2000). 131 132 M. Aldana, E. Balleza, S. Kauffman, O. Resendiz, Robustness and evolvability in genetic regulatory networks. J. Theor. Biol. 245, 433-448 (2007). 129 130 J. R. True, S. B. Carroll, Gene co-option in physiological and morphological 4. S. A. Teichmann, M. M. Babu, Gene regulatory network growth by duplication. Nat. Genet. 36, 492-496 (2004). 8 134 5. systems. Annu. Rev. Microbiol. 66, 325-347 (2012). 135 136 E. J. Capra, M. T. Laub, Evolution of two-component signal transduction 6. Z. D. Blount, C. Z. Borland, R. E. Lenski, Historical contingency and the evolution 137 of a key innovation in an experimental population of Escherichia coli. Proc. Natl. 138 Acad. Sci. USA 105, 7899-7906 (2008). 139 7. innovation in phage lambda. Science 335, 428-432 (2012). 140 141 8. B. Hall, The EBG system of E. coli: Origin and evolution of a novel 3galactosidase for the metabolism of lactose. Genetica 118, 143-156 (2003). 142 143 J. R. Meyer, et al., Repeatability and contingency in the evolution of a key 9. D. Blank, L. Wolf, M. Ackermann, O. K. Silander, The predictability of molecular 144 evolution during functional innovation. Proc. Natl. Acad. Sci. USA 111, 3044- 145 3049 (2014). 146 10. E. A. Robleto, I. López-Hernández, M. W. Silby, S. B. Levy, Genetic analysis of 147 the AdnA regulon in Pseudomonas fluorescens: Nonessential role of flagella in 148 adhesion to sand and biofilm formation. J. Bacteriol. 185, 453-460 (2003). 149 11. N. Dasgupta, et al., A four-tiered transcriptional regulatory circuit controls 150 flagellar biogenesis in Pseudomonas aeruginosa. Mol. Microbiol. 50, 809-824 151 (2003). 152 12. A. S. Alsohim, et al., The biosurfactant viscosin produced by Pseudomonas 153 fluorescens SBW25 aids spreading motility and plant growth promotion. Environ. 154 Microbiol. 16, 2267-2281 (2014). 9 155 13. W. Li, C.-D. Lu, Regulation of carbon and nitrogen utilization by CbrAB and 156 NtrBC two-component systems in Pseudomonas aeruginosa. J. Bacteriol. 189, 157 5413-5420 (2007). 158 14. activators. J. Bacteriol. 185, 1757-1767 (2003). 159 160 D. J. Studholme, R. Dixon, Domain architectures of 154-dependent transcriptional 15. R. Wassem, E. M. de Souza, M. G. Yates, F. D. Pedrosam, M. Buck, Two roles 161 for integration host factor at an enhancer-dependent nifA promoter, Mol. 162 Microbiol. 35, 756-764 (2000). 163 16. E. Amiel, R. R. Lovewell, G. A. O'Toole, D. A. Hogan, B. Berwin, Pseudomonas 164 aeruginosa evasion of phagocytosis is mediated by loss of swimming motility and 165 is independent of flagellum expression. Infect. Immun. 78, 2937-2945 (2010). 166 17. R. León, G. Espín, flhDC, but not fleQ, regulates flagella biogenesis in 167 Azotobacter vinelandii, and is under AlgU and CydR negative control. 168 Microbiology 154, 1719-1728 (2008). 169 18. M. E. Wand, et al., Helicobacter pylori FlhB function: The FlhB c-terminal 170 homologue HP1575 acts as a “spare part” to permit flagellar export when the 171 HP0770 FlhBCC domain is deleted. J. Bacteriol. 188, 7531-7541 (2006). 172 19. adaptation. Evolution 61, 995-1016 (2007). 173 174 175 H. E. Hoekstra, J. A. Coyne, The locus of evolution: evo devo and the genetics of 20. G. A. Wray, The evolutionary significance of cis-regulatory mutations. Nat. Rev. Genet. 8, 206-216 (2007). 10 176 21. theory of morphological evolution. Cell 134, 25-36 (2008). 177 178 22. 23. 24. K. Rutherford, et al., Artemis: sequence visualization and annotation. Bioinformatics 16, 944-945 (2000). 183 184 F. Van Immerseel, et al., Salmonella Gallinarum field isolates from laying hens are related to the vaccine strain SG9R. Vaccine 31, 4940-4945 (2013). 181 182 M. W. Silby, et al., Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens. Genome Biol. 10, R51 (2009). 179 180 S. B. Carroll, Evo-devo and an expanding evolutionary synthesis: a genetic 25. D. B. Roche, M. T. Buenavista, S. J. Tetchner, L. J. McGuffin, The IntFOLD 185 server: an integrated web resource for protein fold recognition, 3D model quality 186 assessment, intrinsic disorder prediction, domain prediction and ligand binding 187 site prediction. Nucleic Acids Res. 39, W171-W176 (2011). 188 26. L. J. McGuffin, M. T. Buenavista, D. B. Roche, The ModFOLD4 server for the 189 quality assessment of 3D protein models. Nucleic Acids Res. 41, W368-W372 190 (2013). 191 27. fold recognition models. Bioinformatics 24, 1798-1804 (2008). 192 193 L. J. McGuffin, Intrinsic disorder prediction from the analysis of multiple protein 28. D. B. Roche, M. T. Buenavista, L. J. McGuffin, The FunFOLD2 server for the 194 prediction of protein–ligand interactions. Nucleic Acids Res. 41, W303-W307 195 (2013). 196 197 29. Y. Zhang, J. Skolnick, TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 33, 2302-2309 (2005). 11 Acknowledgments: TBT, LJJ, RWJ and MAB conceived and designed the study. TBT, GM and AA performed experiments. MWS and AHD performed independent lines of enquiry on Pf02x. DS conducted bioinformatics analysis of genome resequencing data, identified mutated genes and handled sequencing data. LM conducted the protein structure prediction and analysis. This work was supported by a Leverhulme grant to LJJ, MAB and RWJ, BBSRC grant BB/J015350/1 to RWJ, start-up funding from the University of York to MAB, Qassim University to AA, and Agriculture and Food Research Initiative Competitive Grant 2010-65110-20392 from the USDA’s National Institute of Food and Agriculture, Microbial Functional Genomics Program to MWS. TBT, GM, LJJ, RWJ, MWS, DJS and MAB wrote the paper. We thank Graham Bell, Mark Pagel, Angus Buckling and James Moir for useful discussions; Peter Ashton for processing of microarray data; and Konrad Paszkiewicz and Exeter Sequencing Service facility and support from the following: Wellcome Trust Institutional Strategic Support Fund (WT097835MF), Wellcome Trust Multi User Equipment Award (WT101650MA) and BBSRC LOLA award (BB/K003240/1). Sequence data for genomic resequencing of AR2S and AR2F have been submitted to the SRA under accession numbers SRR1510202 and SRR1510203, respectively. The eArray design ID for the microarray is 045642. Microarray data have been submitted to the ArrayExpress database under accession number E-MTAB-2788 (www.ebi.ac.uk/arrayexpress). 12 Figures 1 – 3: Fig. 1. Phenotypic assays of motility variants. A Surface spreading motility assays of ancestral (AR2) and evolved slow spreading (AR2S) and fast spreading (AR2F) mutants, after 27 hours. B Electron microscopy confirms the presence of a flagellum in AR2F, but fails to confirm presence in AR2S. C Mean (N=4) cell doublings per hour (+/- 1 SEM). Strains were grown in differing nitrogen environments: 10 mM glutamine (Gln), glutamate (Glu) and ammonium (NH4) as the sole nitrogen source, or in high nutrient lysogeny broth (LB): AR2, F3, 12 = 13.460, P < 0.001; AR2S, F3, 12 = 72.674, P < 0.001; AR2F, F3, 12 = 52.538, P < 0.001. There were also differences between doubling rate of strains within each nitrogen medium (Glutamate (Glu), F2, 9 = 12.654, P = 0.002; Ammonium (NH4), F2, 9 = 40.529, P < 0.001), with the exception of Glutamine (Gln) (F2, 9 = 3.703, P = 0.067). Fig. 2. Heat map of microarray expression profiles for all evolved and ancestral motility variants. Heat maps show where there is significant (P 1 0.05) fold-change of 4 2 in genes selected based on GO-terms for A Bacterial-type flagellum (24 genes) and B Nitrogen compound transport (146 genes) for all strains. The gradation of colors reflects normalized raw signal values across the entire array. Genes are ordered according to chromosomal position to enable clearer visualization of coregulated gene clusters. Full transcriptome analysis is reported in Supplementary File “Microarray dataset.xlsx”. Fig. 3. Full chain multi-template 3D models of protein structures of slow and fast spreading motility variants. Slow spreading variants can either follow the direct regulatory route through mutation of NtrB or GlnK (A), or the physiological route through mutation of GlnA causing overactivation of NtrB (B); both routes are predicted to lead to hyperphosphorylation of NtrC. Fast spreading variants all show mutational changes to NtrC (C), which has a similar global structure to FleQ (D). The colour scheme represents the variation in models, 13 which correlates with local (per-residue) quality and disorder. Regions coloured in blue and green represent low local variability in structure, while those in red show high local variability (see Table 1 and Table S2 for mutation details). § = All mutations mapped onto SBW25 wildtype protein structures for illustrative purposes; * = Truncated domain. AR2 14 Slow Spreaders (AR2/Pf0-2xS) Fast Spreaders (AR2/Pf0-2xF) Hyperphosphorylation of NtrC Switch of NtrC-P specificity ntrB T97P* ntrC R442C* ntrB V185K ntrC K342E ntrB D179N ntrC G452R ntrB L184Q / V185I ntrC K342V / Frameshift: V342 ntrC N454S ntrB D228A§ ntrC R441S ntrC N454S ntrC P424L Pf0-2x ntrC N454S ntrC L418R ntrB D228A§ ntrC G414D ntrC N454S ntrC F426V glnK Frameshift: I86* ntrC R442H* glnA T237P* ntrC A445V* glnA Frameshift: T205* ntrC R442C* * = Identified by genome resequencing § = Independent ntrB mutant strains, parent to multiple ntrC mutant strains Table 1. Mutational trajectory towards slow and fast spreading phenotypes. Mutations confirmed in slow spreading motility variants are predicted to result in hyperphosphorylation of NtrC; mutations in fast spreading variants lead to predicted switched specificity of NtrC-P towards FleQ targets. Slow and fast spreading variants share the same ancestry. 15 Figure 1 16 Figure 2 17 Figure 3 18 Supplementary Materials: Title: Evolutionary resurrection of flagellar motility via rewiring of the nitrogen regulation system Authors: Tiffany B. Taylor1†, Geraldine Mulley1†, Alexander H. Dills2, Abdullah S. Alsohim1,3, Liam J. McGuffin1, David J. Studholme4, Mark W. Silby2, Michael A. Brockhurst5, Louise J. Johnson1,*, Robert W. Jackson1,6 Materials and Methods Fig. S1. Table S1. Table S2. 198 Supplementary Materials: 199 200 Materials and Methods: 201 202 Microbiological methods 203 P. fluorescens strains used in this study: SBW25, SBW252fleQ, AR2 (SBW252fleQ IS- 204 5Km-hah: PFLU2552), and Pf0-2x (Pf0-12fleQ). Motility assays are as described in 205 (12). All starting populations were from a single AR2 colony grown on LB agar (1.5%), 19 206 and stab inoculated into the center of a SMM plate using a sterile wire. The initial 207 observed motility mutants (AR2S and AR2F) were cultured immediately, cryopreserved 208 and used for subsequent genome resequencing and microarray analysis. Independently 209 evolved motility mutants were isolated from AR2 and Pf0-2x and cryopreserved. 210 Mutation rates of all lineages were checked by plating an overnight culture onto LB agar 211 supplemented with 100 µg ml-1 rifampicin. All strains were found to have a similar 212 mutation frequency: SBW25 = 6.98 x 10-9 cfu ml-1; 2fleQSBW25 = 3.72 x 10-8 cfu ml-1; 213 AR2 = 2.50 x 10-8 cfu ml-1; AR2S = 2.3 x 10-8 cfu ml-1; AR2F = 1.4 x 10-8 cfu ml-1. 214 M9 nitrogen modified medium (lacking ammonium) was supplemented with 10 mM 215 glutamate, glutamine or ammonium solutions. Strains were grown for 16 hours in LB at 216 27oC and diluted to OD 0.001 in M9 nitrogen modified media, and in LB. Optical density 217 595 nm was measured every 20 minutes, under continuous shaking and at an 218 incubation of 27oC, for 24 hours (Tecan, GENios). 219 220 Molecular methods 221 The mutations present in the original evolved mutants AR2S and AR2F, plus 3 Pf0-2xS 222 and 3 descended Pf0-2xF strains were identified by genome resequencing. 223 Genomic DNA was isolated using Puregene DNA extraction (Qiagen) or Wizard® 224 Genomic DNA Purification (Promega) kits. We resequenced the complete genomes of 225 mutants AR2S and AR2F (original) using the Illumina HiSeq and identified single- 226 nucleotide variants (SNV) by alignment to the SBW25 reference as described previously 227 (22, 23). Further mutants were analyzed by targeted PCR amplification and sequencing 20 228 of ntrBC to determine whether the same two-step process had occurred; primers were 229 designed to flank the region where SNVs had been identified by whole-genome 230 resequencing (NtrC, F: CTTCATCCCCAACTCCTTGA, R: 231 AAGCTGCTGAAAAGCGAGAC; NtrB, F: CTTGCGCCTTGAGTACATGA, R: 232 ATGCGGTCTACCAGGTTACG). 233 234 AR2S and AR2F Isolate Genome Resequencing Details 235 Whole-genome resequencing was performed using the Illumina GA2x. We generated 236 12,065,035 pairs of 36-bp reads for AR2S (SRA accession SRR1510202) and 237 24,075,130 pairs of 36-bp reads for AR2F (SRR1510203). To identify mutations, we 238 aligned the genomic sequence reads against the Pseudomonas fluorescens SBW25 239 genome sequence (NCBI RefSeq accession NC_012660) using BWA-mem version 240 0.7.5a-r405 (http://www.ncbi.nlm.nih.gov/pubmed/19451168). We discarded reads that 241 did not uniquely map to a single location on the SBW25 genome in order to avoid 242 artifacts due to misalignment of sequence reads arising from repeat sequences. The 243 resulting average depths of coverage over the SBW25 genome were 124x and 144x for 244 AR2S and AR2F, respectively. For 99.86% of the SBW25 reference genome sequence 245 (6,713,197 out of 6,722,539 bp) we were able to unambiguously determine the 246 nucleotide sequence in both mutant genomes (on the basis that at these genomic 247 positions there was at least 95% consensus among all aligned reads at those positions 248 for each of the two resequenced genomes and depth of at least 5x). Over the 6,713,197 249 bp of the genome for which we could unambiguously determine the DNA sequence for 250 both mutant genomes, we found only three SNVs with respect to the reference genome 21 251 sequence. Two of these three variants were present in both resequenced genomes: at 252 position 376,439 both resequenced strains had a G whereas in the SBW25 reference 253 genome the base is T. This corresponds to a non-silent change from codon acc to 254 codon Ccc in gene PFLU0344 (glnL/ntrB). The second variant was at position 1,786,536 255 where SBW25 has A but the two resequenced mutants have G; this variant falls in an 256 intergenic region and we have no reason to suspect it has an effect on the phenotypes 257 observed. The third SNV is private to AR2F and falls within the gene PFLU0343 258 (glnG/ntrC). In SBW25 and AR2S the base is G but in AR2F it is A; this results in a non- 259 silent change from codon cgt to codon Tgt. Aside from these three variant sites, the 260 remainder of the 6,713,197 bp of unambiguously resolved genome sequence contained 261 no variation from the SBW25 reference genome. 262 263 Pf0-2xS and Pf0-2xF Isolate Genome Resequencing Details 264 Library preparation was performed using Illumina Nextera XT kit with Nextera XT 265 Indexes and sequenced as 250PE reads from MiSeq. 266 Whole-genome resequencing of Pf0-2x strains was performed using the Illumina MiSeq. 267 We generated 1,145,122 forward and 1,141,017 reverse reads of 251-bp for Pf0-2xS_1, 268 1,279,360 forward and 1,272,990 reverse reads of 251-bp for Pf0-2xF_1.1, 1,962,339 269 forward and 1,941,676 reverse reads of 251-bp for Pf0-2xS_2, 907,328 forward and 270 906,083 reverse reads of 251-bp for Pf0-2xF_2.1, 935,862 forward and 932,415 reverse 271 reads of 251-bp for Pf0-2xS_3, and 1,194,665 forward and 1,190,975 reverse reads of 272 251-bp for Pf0-2xF_3.1. These genomic sequence reads were mapped against the 273 Pseudomonas fluorescens Pf0-1 genome sequence (NCBI RefSeq accession 22 274 NC_007492.2) using CLC Genomics Workbench 6.0; unmapped reads were not 275 included in further analysis. The average coverage depths for each library were: 44.6 for 276 Pf0-2xS_1, 49.8 for Pf0-2xF_1.1, 76.1 for Pf0-2xS_2, 35.3 for Pf0-2xF_2.1, 36.4 for Pf0- 277 2xS_3, and 46.5 for Pf0-2xF_3.1. Out of the 6,438,405 bp Pf0-1 genome, 97.06-99.35% 278 of nucleotides (depending on sample) had a minimum 95% consensus among all 279 mapped reads. Using probabilistic variant detection to investigate unambiguous 280 nucleotide positions in each sequenced strain, we identified two SNVs and a 3bp 281 deletion common to all mutants (Table S2). These three common changes were also 282 present in the parental strain (Pf0-2x) and were thus not considered further. A 15bp 283 deletion was identified in both the Pf0-2xS1 and F1.1 strains with a unique SNV 284 occurring in the latter. An additional SNV was common to Pf0-2xS2 and Pf0-2xF2.1, yet 285 another to both Pf0-2xS3 and Pf0-2xF3.1, and a single unique SNV was discovered in 286 Pf0-2xF2.1 and another in Pf0-2xF3.1 with respect to the reference genome sequence. 287 288 The presence of glnA, glnK and ntrC mutations in resequenced strains was confirmed 289 by PCR of candidate regions from parental and evolved strains followed by Sanger 290 sequencing of the amplicons. Mutations in ntrB were detected by targeted PCR 291 amplification and sequencing, and ntrC mutations in derivatives of ntrB mutants were 292 detected the same way. In some cases, multiple primers were used to cover the full 293 length of the gene. This enabled us to be certain about the mutations, and rule out any 294 others. Primers used for Pf0-1 were: ntrB, F: ATTGCGCCTCGAGTACATGA and 295 TCGACACGGTTTCACTACGG, R: ATGCGGTCGACCAGATTGCG and 296 TCGGAGCCTTGGTTTGGTTT; ntrC, F: AAGCTGCTGAAAAGCGAGAC and 23 297 GATTAAGGGTCACGGTGCCT, R: CTTCATGCCGAGTTCCTTGA and 298 CACTGGAACAAGGAGCCACA; glnA, F: GGAGGCCTTTCTTTGTCACG and 299 ACGCTTGTAGGAGTTGGTGG and CGGGAAGTAGCCACCTTTCA, R: 300 CCACCAACTCCTACAAGCGT and CAAGTCCGACATCTCCGGTT and 301 CTACCCGCCCTAATTCACCC; glnK, F: GCCGGGCATTACGATAGACA, R: 302 TGCGCTTAGACTTGAGTCGG. 303 304 Microarray design 305 For microarray analysis, RNA was extracted from SBW25, SBW252fleQ, AR2, AR2S 306 and AR2F using an RNeasy kit (Qiagen) and assayed for quality [2100 Bioanalyzer 307 (Agilent), and Nanodrop 1000 Spectrophotometer (Thermo Fisher Scientific)]. RNA was 308 labelled, hybridised to custom Agilent arrays and scanned according to the 309 manufacturer’s instructions (Technology Facility, Dept. of Biology, University of York, 310 UK). Differentially expressed genes were identified by ANOVA using the Benjamini- 311 Hochberg FDR correction. No commercially available microarrays were available for P. 312 fluorescens, so a custom design was created using the Agilent eArray system 313 (http://earray.chem.agilent.com/earray). The P. fluorescens SBW25 genome was 314 loaded into Artemis (24) (http://www.sanger.ac.uk/resources/software/artemis), and all 315 open reading frames marked as CDSs and these putative CDS written to a FASTA file 316 which was uploaded to eArray. A probeset was created containing five unique probes 317 per CDS, plus the appropriate Agilent control sequences. The probes were then laid out 318 onto a standard Agilent 8x60K format slide. Microarrays were validated using qRT-PCR 319 (MyiQTM, Bio-Rad). 24 320 321 RNA labelling and hybridisation 322 RNA isolated from each sample was labelled with Cy-3 using the Agilent Low Input 323 Quick Amp WT Labelling Kit (one color) according to the manufacturer’s instructions. 324 Briefly, the RNA is reverse transcribed using a primer mix containing oligo-dT and 325 random nucleotide primers containing T7 promoters, then the cRNA amplified using the 326 T7 polymerase in the presence of Cy-3 labelled dCTP. The labelled cDNA was then 327 quality controlled and only samples with a specific activity of > 6 were used hybridised 328 to the arrays. 329 Samples were hybridised to the arrays, incubated and washed in accordance with the 330 manufacturer’s instructions, and the slides scanned using an Agilent Microarray 331 Scanner and the fluorescence quantified with the Feature Extraction software. The 332 resulting files were then loaded into GeneSpring for further analysis. 333 334 Initial Data Analysis 335 GeneSpring software version 12.6 was used for all data analysis. Initial analysis was 336 performed to identify those genes that were differentially expressed between any 337 conditions in the experiment, followed by Gene Set Expression Analysis (GSEA) on the 338 basis of Gene Ontology terms. Briefly, the sample replicates were grouped, and then 339 ANOVA performed to identify differentially expressed genes, using the Benjamini- 340 Hochberg FDR p-value correction (with a FDR of 5%), and a Tukey HSD post-hoc test 341 used for each gene to identify which samples were significantly different from each 25 342 other for that gene. Those GO categories that were over-represented in the set of 343 differentially expressed genes were then identified. 344 345 3D protein models 346 The IntFOLD server (25) version 2 was used to build multi-template 3D models from the 347 wild type and mutant sequences of AdnA, NtrB, NtrC, FleQ, GlnA and GlnK. The 3D 348 models were quality checked using the ModFOLD4 protocol (26). Intrinsic protein 349 disorder was predicted using the DISOclust method (27). Binding sites were predicted 350 using the FunFOLD2 method (28). Models were structurally aligned and scored using 351 TM-align (29). 352 353 Further Discussion of Microarray Results 354 Full transcriptome analysis is reported separately in Supplementary File “Microarray 355 dataset.xlsx”. When genes were filtered for significant fold changes (P 1 0.05) of 4 2, 356 and sorted by descending fold changes, the genes that showed the greatest changes in 357 expression are those highlighted in this study. Specifically, when comparing AR2 and 358 AR2S, the large majority of genes with greatest up regulation are related to nitrogen 359 transport, whereas when comparing AR2S to AR2F, the large majority of genes are 360 related to flagellar motility and chemotaxis. 361 Our microarray data show that the average level of flagella gene expression in AR2S is 362 between 1.5-2-fold lower than in SBW25, but considerably higher than AR2 (between 3 363 and 75 fold). There are at least two possible models to explain why we were unable to 26 364 observe flagella in AR2S: (i) Phosphorylated NtrC (NtrC-P) only weakly interacts with 365 flagella gene promoters, but the over-abundance of NtrC-P in AR2S saturates NtrC- 366 dependent promoters and the excess NtrC-P drives expression of flagella genes. The 367 level of transcription is suboptimal, thus the flagella may be unstable and support limited 368 or transient motility; (ii) Promiscuous activity of over-abundant NtrC-P (e.g. interacting 369 with RpoN from solution) leads to expression from flagella promoters, but this is not 370 equivalent in all cells and only a proportion of the population express sufficient levels to 371 produce a functional flagellum at any one time. 27 372 Fig. S1: Validation of gene expression changes detected by microarray using qRT-PCR. A – E Box plots show gene expression, relative to wild-type SBW25, in genes related to nitrogen utilization and flagellar function. Note, data presented on log scales. 28 A. SBW251fleQ Bacterial-type Flagellum Regulation of Nitrogen Utilization GENE SYMBOL flgF Pflu4448 (Flagellin) glnA ntrB ntrC GENE CHIP ARRAY Fold Change p-value -79.06 1.52E-07 REAL TIME PCR Fold Change p-value -8.4 0.196 -158.33 2.09E-07 -22.59 0.002 1.01 -1.54 -1.74 2.24E-06 1.37E-07 1.58E-07 9.14 -4.5 -1.39 0.187 0.41 0.136 B. AR2 GENE SYMBOL Bacterial-type Flagellum Regulation of Nitrogen Utilization flgF Pflu4448 (Flagellin) glnA ntrB ntrC GENE CHIP ARRAY Fold p-value Change -76.48 1.52E-07 REAL TIME PCR Fold Change p-value -2.84 0.265 -128.22 2.09E-07 -6.61 0.004 1.03 -1 -1.41 2.24E-06 1.37E-07 1.58E-07 3.27 -2.24 -1.31 0.393 0.424 0.231 C. AR2S GENE SYMBOL Bacterial-type Flagellum Regulation of Nitrogen Utilization flgF Pflu4448 (Flagellin) glnA ntrB ntrC GENE CHIP ARRAY Fold p-value Change -2.53 1.52E-07 REAL TIME PCR Fold Change p-value 3.13 0.192 -1.71 2.09E-07 4.34 0.042 6.08 28.18 27.14 2.24E-06 1.37E-07 1.58E-07 25.49 26.45 49.14 0.008 0.017 0.059 D. AR2F Bacterial-type Flagellum Regulation of Nitrogen Utilization GENE SYMBOL flgF Pflu4448 (Flagellin) glnA ntrB ntrC GENE CHIP ARRAY Fold Change p-value 1.1 1.52E-07 REAL TIME PCR Fold Change p-value 4.49 0.1 3.6 2.09E-07 16.57 0.065 11.34 58.27 63.72 2.24E-06 1.37E-07 1.58E-07 175.63 96.09 49.73 0.005 0.006 0.03 Table S1: Mean fold change (relative to SBW25) from qRT-PCR and microarrays with p-values (A – D). Colours correspond to genes with significant ( 4 2 fold change) up regulated (red) and down regulated (blue) changes. 29 Strain AR2S_1 AR2F_1.1 AR2S_2 AR2F_2.1 AR2S_3 AR2F_3.1 AR2S_4 AR2F_4.1 Nucleotide change T376439G* G374322A* G376175A/ T376174A T374622C G376193A C374291G G376176A T376174A T374622C/ I A374625 66431771-3* AA change T97P R442C Gene ntrB ntrC Domain/Function PAS domain HTH domain V185K ntrB HK domain K342E D179N G452R L184Q V185I K342V FS from V342 2G85 ntrC ntrB ntrC HTH domain HK domain Miscellaneous function ntrB HK domain ntrC HTH domain gidB 66171472-87* C382879T* 66431771-3* FS from I86 R442H 2G85 glnK ntrC gidB T388861G* 6902225* G382870A* 66431771-3* T237P FS from S544 A445V 2G85 glnA carB ntrC gidB 6388958* G382880A* A384603C A382843G C382883A A382843G C382933T A382843G A384603C T382951G G382963A A382843G T382928G FS from T205 R442C D228A N454S R441S N454S P424L N454S D228A L418R G414D N454S F426V glnA ntrC ntrB ntrC ntrC ntrC ntrC ntrC ntrB ntrC ntrC ntrC ntrC Methyltransferase domain PII regulator HTH domain Methyltransferase domain Catalytic domain Miscellaneous function HTH domain Methyltransferase domain Catalytic domain HTH domain Miscellaneous function HTH domain HTH domain HTH domain HTH domain HTH domain Miscellaneous function Miscellaneous function Miscellaneous function HTH domain HTH domain Pf0-2xS_1 Pf0-2xF_1.1 Pf0-2xS_2 Pf0-2xF_2.1 Pf0-2xS_3 Pf0-2xF_3.1 Pf0-2xS_4 Pf0-2xF_4.1 Pf0-2xF_4.2 Pf0-2xF_4.3 Pf0-2xF_4.4 Pf0-2xF_4.5 Pf0-2xS_5 Pf0-2xF_5.1 Pf0-2xF_5.2 Pf0-2xF_5.3 Pf0-2xF_5.4 * = Identified by genome resequencing. Table S2: Full details of mutations identified in strains with parental AR2 and Pf02x origin. Numbers before decimal points represent independent lineages, and numbers after represent derived strains. Note fast spreading mutants contain all nucleotide changes within a lineage.
© Copyright 2026 Paperzz