Comparative Genomic Analysis of Fruiting Body Formation in Myxococcales Stuart Huntley,1 Nils Hamann,1 Sigrun Wegener-Feldbrügge,1 Anke Treuner-Lange,1 Michael Kube,2 Richard Reinhardt,2 Sven Klages,2 Rolf Müller,3 Catherine M. Ronning,4 William C. Nierman,4 and Lotte Søgaard-Andersen*,1 1 Department of Ecophysiology, Max Planck Institute for Terrestrial Microbiology, Marburg, Germany Max Planck Institute for Molecular Genetics, Berlin, Germany 3 Department Microbial Natural Products, Helmholtz Institute for Pharmaceutical Research Saarland, Saarland University, Saarbrücken, Germany 4 Infectious Diseases Program, J. Craig Venter Institute, Rockville, MD *Corresponding author: E-mail: [email protected]. Associate Editor: James McInerney 2 Genetic programs underlying multicellular morphogenesis and cellular differentiation are most often associated with eukaryotic organisms, but examples also exist in bacteria such as the formation of multicellular, spore-filled fruiting bodies in the order Myxococcales. Most members of the Myxococcales undergo a multicellular developmental program culminating in the formation of spore-filled fruiting bodies in response to starvation. To gain insight into the evolutionary history of fruiting body formation in Myxococcales, we performed a comparative analysis of the genomes and transcriptomes of five Myxococcales species, four of these undergo fruiting body formation (Myxococcus xanthus, Stigmatella aurantiaca, Sorangium cellulosum, and Haliangium ochraceum) and one does not (Anaeromyxobacter dehalogenans). Our analyses show that a set of 95 known M. xanthus development-specific genes—although suffering from a sampling bias—are overrepresented and occur more frequently than an average M. xanthus gene in S. aurantiaca, whereas they occur at the same frequency as an average M. xanthus gene in S. cellulosum and in H. ochraceum and are underrepresented in A. dehalogenans. Moreover, genes for entire signal transduction pathways important for fruiting body formation in M. xanthus are conserved in S. aurantiaca, whereas only a minority of these genes are conserved in A. dehalogenans, S. cellulosum, and H. ochraceum. Likewise, global gene expression profiling of developmentally regulated genes showed that genes that upregulated during development in M. xanthus are overrepresented in S. aurantiaca and slightly underrepresented in A. dehalogenans, S. cellulosum, and H. ochraceum. These comparative analyses strongly indicate that the genetic programs for fruiting body formation in M. xanthus and S. aurantiaca are highly similar and significantly different from the genetic program directing fruiting body formation in S. cellulosum and H. ochraceum. Thus, our analyses reveal an unexpected level of plasticity in the genetic programs for fruiting body formation in the Myxococcales and strongly suggest that the genetic program underlying fruiting body formation in different Myxococcales is not conserved. The evolutionary implications of this finding are discussed. Key words: Sporulation, fruiting body formation, bacterial development, bacterial differentiation, Myxococcus xanthus, comparative genomics. Introduction To optimize their chances of survival in an ever-changing environment, bacteria have several strategies to sense and respond to these changes (Perez and Groisman 2009). One strategy involves adaptive changes in gene expression without evident changes in cell morphology. In an alternative strategy, changes in gene expression result in cellular differentiation with the formation of cells with altered morphologies and properties. One of the best-studied examples of cell differentiation in bacteria in response to changes in the environment is the formation of a heat and desiccationresistant cell morphology, referred to as spores and cysts (from here on referred to collectively as spores). Spore formation is generally induced by nutrient depletion and spores are generally less metabolically active than vegetative cells, are more resistant to various types of chemical and physical stresses than vegetative cells, and can survive extended periods of starvation (Shimkets and Brun 2000). Bacterial species from widely separated taxonomic clades have evolved the ability to form spores in response to starvation (Shimkets and Brun 2000), suggesting that the ability to form spores has emerged independently several times in bacteria. Spore-forming bacteria include members of the low G þ C Gram-positive bacteria (e.g., Bacillus subtilis and Clostridium tetani), high G þ C Gram-positive bacteria (e.g., Streptomyces coelicolor), filamentous cyanobacteria (e.g., Nostoc punctiforme) as well as members of the Gramnegative Alphaproteobacteria (e.g., Methylosinus trichosporium and Rhodospirillum capsulatus), Gammaproteobacteria (e.g., Azotobacter vinelandii) and Deltaproteobacteria (e.g., © The Author 2010. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: [email protected] Mol. Biol. Evol. 28(2):1083–1097. 2011 doi:10.1093/molbev/msq292 Advance Access publication October 29, 2010 1083 Research article Abstract MBE Huntley et al. · doi:10.1093/molbev/msq292 Myxococcus xanthus) (Shimkets and Brun 2000). Spore formation has been most thoroughly studied in B. subtilis, S. coelicolor, and M. xanthus. Supporting the idea that sporulation has evolved several times independently in bacteria, the mechanisms underlying spore formation in these three phylogenetically widely separated species are very different (Shimkets and Brun 2000). In B. subtilis (Kroos 2007), spore formation is initiated by an asymmetric cell division in a rod-shaped cell resulting in the formation of a large mother cell and a small prespore. The prespore is subsequently engulfed by the mother cell before the mother cell lyses resulting in the release of the spherical mature spore. In S. coelicolor (Flärdh and Buttner 2009), chains of spherical spores are formed by synchronous cell divisions in aerial hyphae. In M. xanthus, spore formation is coupled to the formation of multicellular fruiting bodies, and individual rod-shaped cells that have accumulated inside a fruiting body directly differentiate into spherical spores without a prior cell division (Kroos 2007). The availability of complete genome sequences of related bacteria provide the opportunity to explore to what extent the genetic programs governing spore formation in a taxonomically well-defined group of bacteria are conserved. Here, we have focused on an evolutionary analysis of the genetic program underlying spore formation in the Myxococcales. Members of the Deltaproteobacteria order Myxococcales are typically found in topsoil where they form cooperatively feeding colonies and grow as saprophytes by decomposing degradable polymers or as predators by preying on other microorganisms (Shimkets et al. 2006). They share three traits that distinguish them from other Deltaproteobacteria. First, most members are strictly aerobic, whereas most Deltaproteobacteria are anaerobic (Shimkets et al. 2006). Second, in response to starvation, most Myxococcales initiate a developmental program that culminates in the formation of multicellular fruiting bodies inside which the rod-shaped cells differentiate into spores (Shimkets et al. 2006). Third, most members of the Myxococcales have large genome sizes around 10 Mb (Goldman et al. 2006; Schneiker et al. 2007; Thomas et al. 2008; Wu et al. 2009; Natalia et al. 2010), whereas other Deltaproteobacteria have smaller genome sizes ranging from 2.9 to 6.5 Mb (supplementary table S1, Supplementary Material online). Myxococcus xanthus has emerged as the model organism to understand the molecular mechanisms underlying fruiting body formation (Kroos 2007). Fruiting body formation is initiated by the RelA-dependent stringent response and proceeds in distinct morphological stages. The first signs of fruiting body formation are evident 4–6 h after starvation as cells aggregate to form small aggregation centers. As they accumulate more cells, the centers increase in size and eventually become mound shaped. By 24 h, the aggregation process is complete and each nascent fruiting body contains approximately 105 cells. Inside the nascent fruiting bodies, the rod-shaped cells differentiate into spherical spores, resulting in mature fruiting bodies. Of the total population present at the onset of starvation, only 10% 1084 undergo sporulation and normally only those cells that have accumulated inside the fruiting bodies. Up to 30% of the cells remain outside the fruiting bodies. These cells remain rod shaped and differentiate to a cell type termed peripheral rods (O’Connor and Zusman 1991a, 1991b). Finally, the remaining cells undergo lysis (Wireman and Dworkin 1977; Rosenbluh et al. 1989). Recently, this cell lysis was suggested to reflect programmed cell death (Nariya and Inouye 2008). Fruiting body formation depends on intercellular signaling (Kroos 2007) and highly coordinated changes in gene expression (Kroos 2007), as well as on the coordinated motility behavior of cells (Leonardy et al. 2008). Thus, fruiting body formation includes many of the characteristics associated with developmental programs in metazoans such as cell–cell signaling, cells adopting different cell fates, programmed cell death, morphogenetic cell movements, and coupling of cell differentiation to multicellular morphogenesis. Because spore formation occurs inside the fruiting bodies, these structures are essentially spore-filled assemblies. In the presence of nutrients, spores germinate to give rise to rod-shaped, motile, metabolically active vegetative cells. With each fruiting body consisting of approximately 105 spores, germination immediately gives rise to a spreading, cooperatively feeding colony. Thus, it has been argued that fruiting bodies are optimally designed to ensure that a new vegetative cycle is initiated by a community of cells rather than by a single cell (Dworkin 1972). All Myxococcales tested—with the exception of one (see below)—initiate fruiting body formation in response to starvation, suggesting that the last common ancestor of the Myxococcales harbored a genetic program for fruiting body formation and that fruiting Myxococcales would share in common a genetic program underlying fruiting body formation. Here, we examined this hypothesis using comparative and functional genomics on four complete genomes of fruiting Myxococcales and one genome of the only known nonfruiting Myxococcales species. Our analyses suggest that the genetic program underlying fruiting body formation in different Myxococcales is not conserved, that is, different species of Myxococcales harbor different programs for fruiting body formation. The evolutionary implications of this finding are discussed. Materials and Methods Phylogenetic Analyses Full-length 16S rRNA sequences of type strains representing diverse Myxococcales genera, as well as sequences from several non-Myxococcales Proteobacteria were collected from the Ribosomal Database Project (http://rdp.cme.msu.edu/). The sequences were aligned using ClustalW1.83 (Thompson et al. 1994), ends of the resulting alignment were trimmed so that all aligned sequences extended fully to both ends of the alignment. Gaps were removed and the sequences were realigned using ClustalW1.83 to ensure that the modifications from the first alignment did not alter the relationships between the remaining sequences. The Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 FIG. 1. Phylogeny within the order Myxococcales and fruiting body morphologies. Dendrogram of members of the order Myxococcales. Species included in this study are indicated by the asterisks (*). A consensus sequence generated from an alignment of several nonMyxococcales 16S rRNA genes was used as the root of the tree. Bootstrap values (percentages) shown at nodes are based on 1,000 replications. The vertical bar to the right indicates the three suborders within the Myxococcales: light gray, Cystobacterineae; dark gray, Nannocystineae; and black, Sorangineae. resulting alignment was used to generate a bootstrapped (1,000 iterations) neighbor joining tree file from which the image shown in Figure 1 was generated using the Treedyn software (Chevenet et al. 2006). The above analyses were repeated using the 16S rRNA sequences from all Deltaproteobacteria with a completed genome sequence to produce the tree figure shown in supplementary table S1 (Supplementary Material online). Sequencing and Annotation of Stigmatella aurantiaca DW4/3-1 Stigmatella aurantiaca DW4/3-1 DNA was isolated with the Genomic DNA kit (Qiagen, Hildesheim, Germany) according to the manufacturer’s instructions. The genomic sequences were determined by whole-genome shotgun sequencing using Sanger-based sequencing technology and pyrosequencing. The genome was covered by short-insert shotgun libraries and a fosmid library with 37-kb inserts (CopyControl Fosmid Library Production Kit; Epicentre, Madison). End sequencing of clone-derived inserts was performed using BigDye 3.1 chemistry and 3730XL capillary sequencers (ABI, Darmstadt, Germany) resulting in 90,783 reads (plasmid-derived reads 61,770 and 27,683 fosmid-derived reads) reaching a 7-fold sequencing coverage. In addition, pyrosequencing was performed using the GS20 sequencer (454 life science/Roche) resulting in 292,874 reads (241 bp average read length), which provided an additional 7-fold sequencing coverage. The assembly of Sanger and 454 derived reads was performed within two steps. GS20 data were initially assembled by Newbler (454 life science/Roche) and the resulting contigs were fragmented using PERL scripts resulting in overlapping sequence fragments, which were assigned as forward and reverse reads in fasta format and the corresponding fasta MBE quality files. These ‘‘faked reads’’ were assembled together with the Sanger-derived processed reads using PhredPhrap (http://www.phrap.org). GS FLX data and Sanger-derived reads were assembled using the Celera assembler v4.3 (Myers et al. 2000). Assembled data were imported into Consed (Gordon 2003), edited, and verified. Finishing experiments included primer walking on bridging fosmid clones and polymerase chain reaction products to improve sequence quality and gap closure to a single closed circular sequence. Stigmatella aurantiaca protein-coding sequences were predicted by Glimmer3 (Delcher et al. 1999), manually curated in Artemis (Rutherford et al. 2000), and annotated with HTGA (Rabus et al. 2002). Structural rRNAs and tRNAs were determined using RNAmmer (Lagesen et al. 2007) and tRNAscan-SE (Lowe and Eddy 1997). An unfinished version of the S. aurantiaca DW4/3-1 genome was previously reported (NZ_AAMD00000000). The genome sequence reported here represents the complete genome sequence of the DW4/3-1 strain. The genome sequence is deposited in GenBank with the accession number CP002271. Functional Annotation and Ortholog Identification Sequences and annotation for all publicly available bacterial species for which complete genome data available were obtained from the NCBI site ftp://ftp.ncbi.nih.gov/genomes/Bacteria/, files ‘‘All.faa.tar.gz and All.ptt.tar.gz (6 May 2009). Based on our information concerning known developmentally important genes, we included two M. xanthus loci (MXAN_3217, actD; and MXAN_5430, desS) to those included in the publicly available annotation. Functional annotations used in this study were generated by adaptation of available cluster of orthologous group (COG) functional assignments. First, locus descriptions from the four annotations were combined and sorted based on product description. All loci bearing the same description were assigned a single functional assignment code, chosen based on the code most prevalent among those provided in the COG assignments. Loci for which no COG assignment were available, as well as those loci for which the ambiguous COG categories R (general prediction only) and S (conserved) were originally assigned, were combined into a new category ‘‘X,’’ to indicate that no clear function has been identified for these proteins. Following the reciprocal Blast analyses (see below), final adjustments to the functional assignments were made manually such that orthologs shared the same functional category assignment. To identify orthologs and homologs for each Myxococcales protein, complete, independent protein sequence databases were generated for each of the Myxococcales genome, as well as each of 973 complete, non-Myxococcales bacterial genomes available at the time of the analyses (from NCBI data files listed above). Each Myxococcales protein was used as a query sequence in a BlastP analysis against each database to identify homologs of the protein, using an expect score cutoff of 1 105. The BlastPidentified matches were filtered, saving only those 1085 MBE Huntley et al. · doi:10.1093/molbev/msq292 Table 1. Characteristics of the Five Myxococcales Species Analyzed in This Study. Characteristic Suborder Genome size (bp)a Protein-coding sequencesa Genome GC contenta (%) Metabolisma Oxygen tolerance/requirementb Fruiting body morphologyb Spore morphologyb Myxococcus xanthus Cystobacterineae 9,139,732 7,333 Stigmatella aurantiaca Cystobacterineae 10,260,755 8,352 Anaeromyxobacter dehalogenans Cystobacterineae 5,013,478 4,346 Sorangium cellulosum Sorangineae 13,033,778 9,384 Haliangium ochraceum Nannocystineae 9,446,314 6,719 68.9 Amino acids; respiration 67.5 Amino acids; respiration 74.9 Short fatty acids; respiration 71.4 Cellulose; respiration 69.0 DNA, amino acids, starch; respiration Aerobic Hay-stack shaped Aerobic Sporangioles on stalk Microaerophilic Not fruiting Aerobic Spherical, optically refractile Short bent rods, optically refractile NA Aerobic Assembly of sporangioles, not on stalk Short rods, optically refractile Assembly of sporangioles, not on stalk Spherical, optically refractile NOTE.—NA, not applicable. a References for genome information: M. xanthus (Goldman et al. 2007), S. aurantiaca (this study), A. dehalogenans (Thomas et al. 2008), S. cellulosum (Schneiker et al. 2007), and H. ochraceum (Natalia et al. 2010). b References for physiology, fruiting body, and spore morphology: M. xanthus, S. aurantiaca, and S. cellulosum (Shimkets et al. 2006), A. dehalogenans (Sanford et al. 2002), and H. ochraceum (Fudou et al. 2002). matches that spanned at least one-third of the query sequence length. The expect scores used and the alignment length filter size were empirically determined to produce BlastP results with a maximum number of queries producing significant hits while limiting random noise in the results. For the purposes of this study, we considered all BlastP hits that passed these requirements to be homologs of the initial query protein. The protein corresponding to the highest scoring (bit score) match was then used as a query in a second BlastP analysis, this time against a complete protein database for the organism from which the original query protein was found. If the highest scoring match in this second BlastP analysis was to the original query protein, the two proteins were considered orthologs for the purposes of this study. To estimate the false-negative discovery rate of orthologs, we determined the frequency with which the original query protein was among the five best hits but not the best scoring hit in the second (reciprocal) BlastP analysis. Synteny Plots To visualize the conservation of genomic context of orthologs between species, a Perl script was written, which reads the genomic coordinates of each pair of orthologs from their respective genomes. Subsequently, the coordinates of all pairs of orthologs were plotted in a diagram having the coordinates of one species on the X axis and the coordinates of the other species on the Y axis giving rise to a dot plot-style image. Results Comparisons of Five Completely Sequenced Myxococcales Genomes At the time of this study, complete genome sequence information was available for M. xanthus strain DK1622 (Goldman et al. 2006), Sorangium cellulosum strain So 1086 ce56 (Schneiker et al. 2007), Haliangium ochraceum strain SMP-2 (Natalia et al. 2010), Anaeromyxobacter dehalogenans strains 2CP-C (Thomas et al. 2008) and 2CP-1, and Anaeromyxobacter species 109-5 and K. To decide which Anaerobacter genome to use in our analyses, we compared the four Anaerobacter genomes. The genomes of Anaeromyxobacter 2CP-C, 2CP-1, Fw109-5, and K contain 4,346, 4,473, 4,466, and 4,457 protein-coding sequences, respectively (supplementary fig. S1, Supplementary Material online). Possible orthologs for each Anaeromyxobacter gene were identified using a reciprocal best BlastP hit method (see below). About 3,041 genes are conserved in all four genomes and represent 70%, 68%, 68%, and 68% of the genes in 2CP-C, 2CP-1, Fw109-5, and K, respectively (supplementary fig. S1, Supplementary Material online). About 7%, 6%, 26%, and 5% of all genes in the genomes of 2CP-C, 2CP-1, Fw109-5, and K, respectively, have no orthologs in any of the other Anaeromyxobacter genomes. Because the 2CP-C genome is the best-characterized of these four genomes (Thomas et al. 2008) and appear to represent well all four genomes, we used the A. dehalogenans 2CP-C genome for our further analyses. To increase the resolving power of comparative genome analyses, we determined the complete nucleotide sequence of the S. aurantiaca DW4-3-1 genome (table 1). These five species cover all three Myxococcales suborders: M. xanthus, S. aurantiaca, and A. dehalogenans belong to the Cystobacterineae, S. cellulosum belong to the Sorangineae, and H. ochraceum belong to the Nannocystineae (Fig. 1 and table 1) (Shimkets et al. 2006). In all three suborders, species have been isolated, which initiate a developmental program resulting in fruiting body formation (Shimkets et al. 2006). Anaeromyxobacter dehalogenans is significantly different from the four other species in several ways (table 1). Although the others are strict aerobes, A. dehalogenans is MBE Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 Table 2. COG Categorization of Genes of the Five Myxococcales Species Analyzed in This Studya. Functional category RNA processing and modification Chromatin structure and dynamics Energy production and conversion Cell cycle control, cell division, chromosome partitioning Amino acid transport and metabolism Nucleotide transport and metabolism Carbohydrate transport and metabolism Coenzyme transport and metabolism Lipid transport and metabolism Translation and ribosome structure Transcription Replication, recombination, and DNA repair Cell wall, membrane, envelope biogenesis Cell motility Post-translation modification, protein turnover, chaperones Inorganic ion transport and metabolism Secondary metabolite biosynthesis, transport, catabolism Signal transduction Intracellular traffic, secretion, and vesicular transport Defense mechanisms Cytoskeleton Hypotheticalb a b Code A B C D E F G H I J K L M N O P Q T U V Z X Myxococcus xanthus 5(0.1) 5(0.1) 262(3.6) 41(0.6) 315(4.3) 94(1.3) 192(2.6) 156(2.1) 204(2.8) 225(3.1) 340(4.6) 290(4.0) 340(4.6) 57(0.8) 273(3.7) 268(3.7) 241(3.3) 566(7.7) 94(1.3) 96(1.3) 1(<0.1) 3268(44.5) Stigmatella aurantiaca 4(<0.1) 3(<0.1) 257(3.1) 49(0.6) 324(3.9) 102(1.2) 306(3.7) 168(2.0) 188(2.3) 228(2.7) 352(4.2) 326(3.9) 329(3.9) 58(0.7) 272(3.3) 254(3.0) 261(3.1) 674(8.1) 93(1.1) 89(1.1) 1(<0.1) 4014(48.1) Anaeromyxobacter dehalogenans 2(<0.1) 2(<0.1) 296(6.8) 38(0.9) 231(5.3) 70(1.6) 143(3.3) 120(2.8) 127(2.9) 198(4.6) 214(4.9) 169(3.9) 257(5.9) 59(1.4) 187(4.3) 178(4.1) 93(2.1) 339(7.8) 63(1.4) 70(1.6) 0(0) 1490(34.3) Sorangium cellulosum 3(<0.1) 2(<0.1) 364(3.9) 62(0.7) 365(3.9) 97(1.0) 343(3.7) 209(2.2) 198(2.1) 231(2.5) 475(5.1) 323(3.4) 335(3.6) 39(0.4) 313(3.3) 298(3.2) 288(3.1) 831(8.9) 69(0.7) 98(1.0) 0(0) 4441(47.3) Haliangium ochraceum 2(<0.1) 3(<0.1) 238(3.5) 45(0.7) 294(4.4) 72(1.1) 181(2.7) 139(2.1) 173(2.6) 186(2.8) 249(3.7) 237(3.5) 235(3.5) 24(0.4) 205(3.1) 180(2.6) 192(2.8) 556(8.3) 59(0.8) 78(1.2) 1(<0.1) 3370(50.2) Numbers indicate total number of genes in each COG in that species, with numbers in parentheses indicating percent of all proteins in that species. The category hypothetical combines COG categories ‘‘general prediction’’ and ‘‘conserved hypothetical’’ and proteins for which no functional assignment was made. microaerophilic and its genome size is roughly half that of the other species’. The most significant difference, for the purposes of this study, is that A. dehalogenans has not been demonstrated to initiate fruiting body formation and sporulation in response to starvation (Sanford et al. 2002; Thomas et al. 2008). The morphology of the fruiting bodies formed by the four fruiters varies significantly (table 1): Myxococcus xanthus fruiting bodies are haystack shaped, S. aurantiaca produces elaborate fruiting bodies consisting of a stalk carrying several spore-filled sporangioles, and S. cellulosum as well as H. ochraceum generate assemblies of sporangioles, which are not carried by a stalk. Although the vegetative cells in all four species are long and rod shaped, spore morphology varies from spherical in M. xanthus and H. ochraceum to short rods in S. aurantiaca and S. cellulosum (table 1). To compare the functional gene repertoires of the five species, functional category assignments were created for all genes in each of the five species based on COGs (Tatusov et al. 1997). For this analysis, the assignments were adjusted such that each gene was assigned only one functional category (Materials and Methods). By assigning one functional category to each gene and maintaining consistency of assignments among predicted orthologs, we were able to analyze the distribution and conservation of functions across species in a standardized manner. In agreement with previous observations for M. xanthus and S. cellulosum (Schneiker et al. 2007), all five organisms have similar relative COG distributions with the exception of genes specifying proteins of unknown function (category X), the percentage of which is significantly lower in A. dehalogenans than in the four other species (table 2). Gene Conservation Within the Myxococcales To be able to analyze the significance of the conservation of genes directly involved in fruiting body and sporulation in M. xanthus in the four other species or in other bacteria, we first established a baseline for these comparisons by systematically identifying possible orthologs for each Myxococcales gene (from here on predicted orthologs are referred to as orthologs). To this end, we used a reciprocal best BlastP hit method (Overbeek et al. 1999; Fang et al. 2010) in which each protein from each species was tested against the protein repertoires of each of the four other species as well as against the protein repertoires of 973 bacterial species for which the complete, annotated genome was available at the time of this study (Materials and Methods). From these analyses, Myxococcales proteins were sorted into two categories: Myxococcales specific and non-Myxococcales specific, based on whether or not an ortholog is found in a non-Myxococcales genome. The M. xanthus, S. aurantiaca, A. dehalogenans, S. cellulosum, and H. ochraceum genomes contain 1,240, 1,501, 707, 1,731, and 1,165 pairs of paralogous genes, respectively. Paralogs represent a potential problem in the identification of orthologs using a reciprocal best BlastP hit method and may result in an underestimation of orthologs (Overbeek et al. 1999; Fang et al. 2010). Using a less-stringent definition of orthologs (Materials and Methods), we found that the false-negative discovery rate for all pairwise genome comparisons varied between 2% and 4% (data not shown). We conclude that our method results in a valid estimation of orthologs. In agreement with the phylogenetic analysis, the protein repertoires of M. xanthus and S. aurantiaca appear more 1087 MBE Huntley et al. · doi:10.1093/molbev/msq292 Table 3. Percent of Orthologs in Pairs of Myxococcales Species. % of Orthologs Found In Gene Repertoire M. xanthus S. aurantiaca A. dehalogenans S. cellulosum H. ochraceum Myxococcus xanthus 100 56 52 28 34 Stigmatella aurantiaca 63 100 50 28 35 similar to each other than any other species pairing, sharing well over half of their protein repertoires (table 3). In contrast, less than half of either M. xanthus or S. aurantiaca proteins are conserved in A. dehalogenans. Given the small size of the A. dehalogenans genome, this was not surprising. In agreement with earlier comparisons of M. xanthus and S. cellulosum (Schneiker et al. 2007), only about 30% of M. xanthus and S. aurantiaca proteins are conserved in S. cellulosum and in H. ochraceum and vice versa. Finally, only approximately 30% of the protein repertoires of S. cellulosum and H. ochraceum are shared. Anaeromyxobacter dehalogenans 31 26 100 19 23 Sorangium cellulosum 36 31 41 100 35 Haliangium ochraceum 31 28 36 25 100 Pairwise comparisons of the position of orthologous genes on their respective genomes revealed a high degree of conserved synteny between M. xanthus and S. aurantiaca genomes and a lesser degree of synteny between these two genomes and that of A. dehalogenans (Fig. 2). The evident X-shaped pattern in the M. xanthus versus S. aurantiaca plot—which is also evident at a lower resolution in the plots comparing M. xanthus or S. aurantiaca with A. dehalogenans—likely reflects large chromosomal inversions that reverse gene order symmetrically around the origin of replication (Eisen et al. 2000). In all other pairwise FIG. 2. Synteny plot for orthologous genes in pairwise combinations of genomes of Myxococcales species. Each dot indicates the position of an orthologous gene pair on the indicated chromosomes. Chromosome orientations and sizes in megabases (Mb) are indicated on the appropriate axis for each species. The dnaA gene is defined as gene 1 in all five species. 1088 Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 MBE Table 4. Gene Conservation in Five Myxococcales Genomes. Genomeb Segmenta IV III II I M.x. S.a. A d. Gene repertoirec S.c. H.o. Five Myxococcales Genomes 1056j1046 273j262 197j186 425j403 40j39 469j376 286j226 94j88 400j357 70j70 1548j886 33j32 133j118 254j221 159j131 1896j809 30j30 39j39 19j19 103j100 89j79 212j162 290j249 2916j1368 74j73 196j179 122j110 1506j986 534j363 5506j2558 2290j1521 M.x. Development Genes 15j15 3j3 2j2 3j2 0 7j6 3j3 0 9j6 0 36j24 1j1 0 4j4 1j0 11j7 NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA Downregulated during M.x. Development 160j160 18j17 9j9 20j19 2j2 33j27 17j13 4j3 16j14 1j1 65j39 0 2j2 8j7 8j7 61j30 NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA Upregulated during M.x. Development 40j38 17j16 5j4 26j24 2j2 32j23 24j20 6j6 20j17 2j2 117j62 0 7j7 12j11 13j11 87j37 NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NOTE.—M.x., Myxococcus xanthus; S.a., Stigmatella aurantiaca; A.d., Anaeromyxobacter dehalogenans; S.c., Sorangium cellulosum; and H.o., Haliangium ochraceum. a The four segments labeled I, II, III, and IV highlight M. xanthus–specific conservation, M. xanthus/S. aurantiaca–specific conservation, fruiter-specific conservation, and conservation in all five Myxococcales species, respectively. b Gray shading indicates the presence of a gene in the genome of that species. c The left number indicates the number of genes shared between the species marked with grey shading for four different gene repertoires. The right number indicates the subset of those proteins for which an ortholog was found in at least one non-Myxococcales species. comparisons, no discernable synteny was observed (Fig. 2). Thus, in agreement with the phylogenetic relatedness of the five Myxococcales, gene organization is significantly more conserved between M. xanthus, S. aurantiaca, and A. dehalogenans compared with S. cellulosum and H. ochraceum. Moreover, gene organization in S. cellulosum and H. ochraceum does not show a high degree of conservation. To analyze the divergence in the gene repertoires of the five Myxococcales since their last common ancestor, we identified genes conserved in all five species and genes having lineage specificity within the order. About 1,056 genes are conserved between all five Myxococcales genomes, and most of these genes (1,046; 99%) have orthologs outside the Myxococcales (Segment IV in table 4 and supplementary table S2, Supplementary Material online). The 1,056 genes correspond to the Myxococcales core genome. The core genome was found to represent from a high of 24% of all genes in a species (A. dehalogenans) to a low of 11% (S. cellulosum). In agreement with the idea that the core genome is expected to represent functions required for these bacteria to thrive despite differences in their respective ecological niches, 85% of the core genes have inferred functions with the largest functional categories including housekeeping proteins involved in translation and ribosome structure (J); amino acid transport and metabolism (E); energy production and conversion (C); replication, recombination, and DNA repair (L); and cell wall, membrane, and envelope biogenesis (M). Interestingly, the largest COG category in the core genome is that of proteins of unknown function (X) (15%). About 53 genes (5%) for signal transduction (T) are also part of the core genome. The 10 genes in segment IV without orthologs outside the Myxococcales include six for proteins of unknown function, and the rest are involved in general cellular processes such as energy production, metabolism, and transcription. The Myxococcales core genome concurs reasonably well with other core genomes previously defined at different taxonomic levels. For example, in the species Streptococcus agalactiae, the core genome is estimated to consist of 1,806 genes, corresponding to 80% of any genome, and with 67% of the core genes having an inferred function (Tettelin et al. 2005). In 1089 MBE Huntley et al. · doi:10.1093/molbev/msq292 Table 5. Summary of Orthologs in Pairs of Myxococcales Species. % Found In Myxococcus xanthus Repertoire All coding sequences Development-specific genes Genes downregulated during development Genes upregulated during development Myxococcus xanthus 100 100 100 100 the genus Prochlorococcus, the core genome includes approximately 1,250 genes corresponding to 40–67% of the genes of the 12 fully sequenced Prochlorococcus genomes, and 81% of the genes have an inferred function (Kettler et al. 2007). Finally, in the order Lactobacillales, the core genome is made up of 567 genes corresponding to 20–33% of the genes in fully sequenced Lactobacillales genomes, and 84% of these genes have an inferred function (Makarova et al. 2006). About 425 genes are conserved in M. xanthus, S. aurantiaca, S. cellulosum, and H. ochraceum (Segment III in table 4 and supplementary table S2, Supplementary Material online). These genes could, in principle, represent the signature genes encoding the functions needed to be a member of the fruiting body-forming Myxococcales. Among the 425 genes, those involved in ion transport and metabolism (P) (8%) as well as proteins of unknown function (X) (30%) represent the two largest groups. In addition, these putative fruiting body–specific genes are enriched for signal transduction (T), amino acid transport and metabolism (E), as well as energy production and conversion (C), functional categories. Interestingly, most of the 425 genes (403; 95%) have orthologs outside the Myxococcales. The 22 Myxococcales-specific genes are enriched for those involved in signal transduction (T) as well as proteins of unknown function (X). Between the five Myxococcales species, 1,548 genes are shared specifically by M. xanthus and S. aurantiaca (Segment II in table 4 and supplementary table S2, Supplementary Material online), and 57% of this group (886) have orthologs outside the Myxococcales. Regardless of the presence or absence of non-Myxococcales orthologs, genes involved in signal transduction (T) as well as those of unknown function (X) are enriched in this group. Finally, 26%, 35%, 35%, 59%, and 34% of all genes from M. xanthus, S. aurantiaca, A. dehalogenans, S. cellulosum, and H. ochraceum, respectively, have no ortholog in any of the other Myxococcales. Approximately, 50% of these genes have an ortholog in a non-Myxococcales species. Like the genes shared between M. xanthus and S. aurantiaca, M. xanthus– specific genes with and without orthologs in non-Myxococcales species (Segment I in table 4 and supplementary table S2, Supplementary Material online) are enriched for those involved in signal transduction (T) as well as proteins of unknown function (X). Several patterns emerge from these analyses. First, M. xanthus genes at the four different levels of conservation (Segments I–IV in table 4) are differentially enriched for 1090 Stigmatella aurantiaca 63 82 80 69 Anaeromyxobacter dehalogenans 31 29 53 26 Sorangium cellulosum 36 36 54 30 Haliangium ochraceum 31 26 52 28 genes specifying different functional categories. Those conserved in all five species or by the four fruiters are dominated by genes for general cellular processes, whereas those conserved in M. xanthus and M. xanthus plus S. aurantiaca are dominated by signal transduction proteins and proteins of unknown function. Second, there is a gradient of conservation with respect to orthologs outside the Myxococcales with most genes (99%) shared by all five species having orthologs in non-Myxococcales but only 43% of the genes specific to M. xanthus having orthologs in non-Myxococcales. Conservation of M. xanthus Development Genes The previous analyses established a baseline from which to analyze the extent of conservation of M. xanthus genes directly involved in fruiting body and sporulation in the three other species. We identified 95 genes in M. xanthus that are directly involved in fruiting body formation and/or sporulation but do not appear to influence cells growing vegetatively (from here on this gene set is referred to as M. xanthus development genes) (supplementary table S3, Supplementary Material online). It should be emphasized that 60 (64%) of these genes belong to the signal transduction (T) category (supplementary table S3, Supplementary Material online), whereas only 7.7% of the total M. xanthus proteins belong to this COG. Thus, the M. xanthus development genes are subject to a sampling bias. Nevertheless, as a first approximation, we used this set of genes to analyze the possible conservation of the M. xanthus genetic program for fruiting body formation and sporulation in other Myxococcales. About 78 (82%) of the 95 M. xanthus development genes are conserved in S. aurantiaca, 28 (29%) in A. dehalogenans, 34 (36%) in S. cellulosum, and 25 (26%) in H. ochraceum (table 4). The M. xanthus development genes shared with S. aurantiaca are overrepresented, compared with the overall frequency of M. xanthus genes having orthologs in S. aurantiaca (82% vs. 63%) (tables 3 and 5 for summary). The frequency of M. xanthus development genes shared with A. dehalogenans (29% vs. 31%) or with S. cellulosum (36% vs. 36%) equals the frequency of all M. xanthus genes shared with either of these two organisms. Myxococcus xanthus development genes shared with H. ochraceum (31% vs. 26%) are slightly underrepresented. With the caveat that the M. xanthus development-specific genes are subject to a sampling bias, these comparisons suggest that M. xanthus development genes are overrepresented in S. aurantiaca and not more likely to be conserved than Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 any other gene or slightly underrepresented in S. cellulosum and H. ochraceum, respectively. Interestingly, only 22 of the 95 M. xanthus developmentspecific genes are specific to the Myxococcales with the remaining 77 having orthologs in non-Myxococcales species (table 4 and supplementary table S3, Supplementary Material online). About 15 development-specific genes are conserved in all five Myxococcales and they all have orthologs outside Myxococcales. Only three genes are shared by the four fruiters, two have orthologs outside the Myxococcales. Three (M. xanthus, S. aurantiaca, H. ochraceum), zero (M. xanthus, S. cellulosum, H. ochraceum), and nine genes (M. xanthus, S. aurantiaca, S. cellulosum) are specifically shared by the three different combinations of three fruiters. About 36 genes are conserved only in M. xanthus and S. aurantiaca, and of these, 24 are found outside the Myxococcales. Finally, 11 are specific to M. xanthus, and of these, seven are found in a non-Myxococcales. By parsimony, the M. xanthus development-specific genes shared by all five Myxococcales likely existed in their common ancestor. These 15 genes are all conserved outside Myxococcales, suggesting that they could be part of an ancestral response to starvation rather than being specifically involved in fruiting body formation and/or sporulation. In agreement with this notion, the RelA protein is found in this group. Notably, RelA is also important for fruiting body formation in S. cellulosum (Knauber et al. 2008). RelA is required for induction of the stringent response in M. xanthus and in S. cellulosum and is a universally conserved protein in bacteria. The stringent response is a ubiquitously conserved mechanism in bacteria that links nutrient availability to accumulation of the second messenger molecular guanosine-5#-(tri)di-3#-diphosphate ((p)ppGpp) by means of the RelA and/or SpoT proteins (Potrykus and Cashel 2008). Similarly, genes shared by the four fruiters could be relics of an ancient mechanism for fruiting body formation and sporulation and which was lost in A. dehalogenans. In contrast, genes specific to M. xanthus and S. aurantiaca may be part of a development mechanism specific to those species’ lineage and might reflect an adaptation to their particular lifestyle or environment. The above interpretation is based on numbers. We thought that a more biological approach to determine the relevance of the differential conservation of M. xanthus development-specific genes would be to determine to which extent genes for specific signal transduction pathways important for development in M. xanthus are conserved in the four other species. Therefore, we analyzed the conservation of the genes in two of the best-studied developmental pathways in M. xanthus, the MrpC/FruA pathway, and the C-signal transduction pathway (Fig. 3). Briefly, in the MrpC/ FruA pathway, expression of the mrpAB genes encoding a two-component system is induced by starvation (Sun and Shi 2001a, 2001b). Phosphorylated MrpB induces expression of mrpC that encodes a transcriptional regulator of the CRP family (Sun and Shi 2001a, 2001b). In vegetative cells, MrpC is phosphorylated by the Pkn8/Pkn14 cascade both of which are Ser/Thr protein kinases (Nariya and In- MBE ouye 2005). During starvation, phosphorylation of MrpC is thought to cease and instead MrpC is thought to be cleaved by the LonD protease to MrpC2 (Ueki and Inouye 2003). The EspA histidine protein kinase by an unknown mechanism inhibits accumulation of MrpC (Higgs et al. 2008). MrpC2 activates transcription of fruA (Ueki and Inouye 2003) encoding a DNA-binding response regulator (Ogawa et al. 1996; Ellehauge et al. 1998) and mazF encoding a toxin protein involved in programmed cell death during development (Nariya and Inouye 2008). Intriguingly, all the genes for the proteins in this pathway except Pkn8 are conserved in S. aurantiaca (Fig. 3A), whereas these genes are largely absent in A. dehalogenans, S. cellulosum, and H. ochraceum. The same picture emerges from an analysis of the conservation of proteins in the C-signal transduction pathway (Fig. 3B): Transcription of the csgA gene is induced in response to starvation in a RelA-dependent manner (Crawford and Shimkets 2000). The full-length CsgA protein (p25) is exported and anchored in the outer membrane (Lobedanz and Søgaard-Andersen 2003). Here, p25 is cleaved by the PopC protease to generate p17, the actual C-signal (Kim and Kaiser 1990; Lobedanz and SøgaardAndersen 2003; Rolbetzki et al. 2008). Transcription of csgA is also regulated by the four ActABCD proteins (Gronewold and Kaiser 2001). C-signal transmission likely induces phosphorylation of FruA (Ellehauge et al. 1998). Phosphorylated FruA inhibits the activity of the Frz chemosensory system (Søgaard-Andersen and Kaiser 1996). Subsequently, the activity of the Ras-like GTPase MglA decreases and cells stop reversing (Jelsbak and Søgaard-Andersen 2002) and eventually aggregate into nascent fruiting bodies. FruA in combination with LadA (Viswanathan et al. 2007b) stimulates transcription of the devTRS operon (Viswanathan et al. 2007b), which encodes three clustered, regularly interspaced short palindromic repeat proteins-associated proteins (Viswanathan et al. 2007a), and alone induces transcription of fdgA (Ueki and Inouye 2005). Several other pathways converge on the C-signal transduction pathway at the level of FruA phosphorylation. These pathways are defined by proteins of two-components systems including SdeK (Pollack and Singer 2001), TodK (Rasmussen and Sogaard-Andersen 2003), RodK (Rasmussen et al. 2005), and RokA (Wegener-Feldbrugge and Sogaard-Andersen 2009) as well as the transcription factor MXAN4899 (Jelsbak et al. 2005). Intriguingly, with the exception of the genes for DevTRS, all these genes are conserved in S. aurantiaca. On the other hand, only a few of the genes in the C-signal transduction pathway are present in A. dehalogenans, S. cellulosum, and H. ochraceum. The conservation of essentially all the genes for the proteins in these two pathways in S. aurantiaca suggests that these pathways may also exist in S. aurantiaca and could be important for development in this organism. On the other hand, the almost complete absence of the genes for the proteins of these two pathways in S. cellulosum and H. ochraceum strongly suggests that these two pathways do not exist to regulate development in these two organisms. 1091 MBE Huntley et al. · doi:10.1093/molbev/msq292 A MrpB -P FruA MazF fruA mazF Colour code: M.x. S.a. A.d. S.c. H.o. MrpC2 MrpA -P MrpC LonD Signal? -P Pkn14 -P MrpB Pkn14 MrpC MrpA Pkn8 EspA mrpA mrpB -P Pkn8 mrpC Signal? Starvation B FrzF FrzA FrzCD FrzE MglA FrzZ Reversal frequency Aggregation FrzB FrzG FruA -P LadA p17 DevT DevR FdgA DevS Sporulation PopC ActA ActB ActC ActD actA actB actC actD devT devR devS fdgA p25 csgA FruA fruA 4899 SdeK TodK sdeK todK RokA RodK (p)ppGpp RelA Starvation FIG. 3. Two signal transduction pathways important for development in Myxococcus xanthus. (A) The MrpC/FruA pathway. See the text for details. Genes and proteins are color coded according to conservation as shown in the box. Proteins not circled by a black line are also important for vegetative growth and are, therefore, not included in the list of M. xanthus–specific developmental genes. (B) The C-signal transduction pathway. See the text for details. The color code is as in (A). The stippled line indicates the cell envelopes of two neighboring cells. The gray box around FruA;P indicate that the mechanisms by which the MXAN_4899, SdeK, TodK, and RodK/RokA pathways converge with the C-signal transduction pathway are unknown. Because several of the M. xanthus development-specific genes present only in M. xanthus and S. aurantiaca are also conserved in non-Myxococcales species (supplementary table S3, Supplementary Material online), we sought to determine whether these genes were lost from A. dehalogenans, S. cellulosum, and H. ochraceum or transferred by horizontal gene transfer (HGT) to a common ancestor of M. xanthus and S. aurantiaca. Therefore, we analyzed the phylogenetic distribution of orthologs and homologs of each M. xanthus development-specific gene. As a criterion for HGT, we used disagreements between the phylogenetic trees of a gene and that of the species (Beiko et al. 2005). As shown in supplementary figure S2 (Supplementary 1092 Material online), most M. xanthus development-specific genes have orthologs or homologs in the four other Myxococcales as well as in non-Myxococcales making conclusions about HGT difficult. Only in the case of hthA (MXAN_6889) does the distribution of orthologs and homologs support that this gene was acquired by HGT by a common ancestor of M. xanthus, S. aurantiaca, and A. dehalogenans. mspA (MXAN_2269), nfsH (MXAN_3378), and fruE (MXAN_4486) represent genes which are only conserved in M. xanthus and S. aurantiaca. Interestingly, most of these genes are specifically important for sporulation in M. xanthus and not for aggregation. Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 Conservation of Developmentally Regulated Genes in M. xanthus Because there is a sampling bias in our set of M. xanthus development-specific genes, we sought an alternative method to evaluate the extent of conservation of genes important for M. xanthus development in other species. Many genes with a function in M. xanthus development are transcriptionally regulated during fruiting body formation (e.g., see Fig. 3). Therefore, to obtain an unbiased set of genes potentially required for development in M. xanthus, we identified at a global scale genes that are transcriptionally regulated in response to starvation, using data from a previously published M. xanthus DNA microarray experiment (Shi et al. 2008). In this experiment, total RNA was isolated from M. xanthus cells after 0, 2, 4, 6, 9, 12, 15, 18, and 24 h of starvation on a solid surface. Using as a criterion for developmental regulation, a 1.5-fold up- or downregulation in at least one time point during development, we found that, over the course of development, 424 genes were downregulated and 410 upregulated (supplementary tables S5 and S6, Supplementary Material online) and that the COG categories represented by up- and downregulated genes are significantly different (supplementary table S7 and S8, Supplementary Material online). In total, 338 (80%) of the 424 downregulated genes are conserved in S. aurantiaca, 225 (53%) in A. dehalogenans, 229 (54%) in S. cellulosum, and 220 (52%) H. ochraceum (table 4). Compared with the overall percentages of M. xanthus genes having orthologs in the three other Myxococcales (table 3; summary table 5), downregulated genes are overrepresented in all species, suggesting the existence of a common transcriptional response to starvation in these five species. Moreover, a total of 350 (83%) of all downregulated genes have orthologs outside the Myxococcales (table 4). Nearly half (160) of the downregulated genes are conserved in all five Myxococcales and also conserved outside the order. Three of the largest groups of downregulated genes shared by all five Myxococcales include the categories translation and ribosome structure (J), energy production and conversion (C), and posttranslation modification, protein turnover, chaperones (O), and, thus, likely represent the stringent response in M. xanthus (supplementary table S7, Supplementary Material online). Moreover, a significant fraction of these genes represent proteins of unknown function. There are relatively few (20) downregulated genes conserved in the four fruiting body forming bacteria and, of those, most are involved in posttranslation modification, protein turnover, chaperones (O). Conversely, of the M. xanthus–specific and M. xanthus and S. aurantiaca–specific genes downregulated during development, more than 50% represent proteins of unknown function (X). In all, 280 (68%) of the 410 M. xanthus genes upregulated during development have orthologs outside the Myxococcales; 281 (69%) of the 410 upregulated genes are found in S. aurantiaca, 105 (26%) in A. dehalogenans, 125 (30%) in S. cellulosum, and 116 (28%) in H. ochraceum (table 4). MBE Compared with the overall percentages of M. xanthus genes having orthologs in the four other Myxococcales (table 5 for summary), M. xanthus upregulated genes shared with S. aurantiaca are overrepresented (69% vs. 63%). As expected, M. xanthus upregulated genes shared with the non-developer A. dehalogenans are slightly underrepresented (26% vs. 31%). Strikingly, the percentage of M. xanthus upregulated genes shared with S. cellulosum (30% vs. 36%) or with H. ochraceum (28% vs. 31%) are also slightly underrepresented. In agreement with these observations, the conservation of upregulated genes between the four Myxococcales is very different compared with that of downregulated genes: Half of the upregulated genes are found either only in M. xanthus (87; 21%) or are shared with S. aurantiaca (117; 29%) (table 4). Most of these genes are classified as signal transduction proteins or proteins of unknown function (supplementary table S8, Supplementary Material online). Only 26 (6%) of the upregulated genes are conserved in all four fruiters and 40 (10%) genes are conserved in all five species (table 4). Nearly all these more broadly conserved genes within the Myxococcales are also found outside the order (table 4), suggesting that these may be genes involved in starvation-induced stress response mechanisms common to many bacteria. Discussion The aim of the work presented here was to investigate the hypothesis that a common a genetic program underlies fruiting body formation in Myxococcales. We used three comparative genomics approaches to address this hypothesis. All approaches strongly indicate that the genetic programs for fruiting body formation in M. xanthus and S. aurantiaca may share many components and perhaps even entire signal transduction pathways in common. All approaches also suggest that these programs in S. cellulosum and H. ochraceum are significantly different from that in M. xanthus. Thus, the overall conclusion from these analyses is that fruiting Myxococcales do not share a common genetic program directing fruiting body formation. This conclusion is based on the following observations: First, a set of 95 M. xanthus development-specific genes—although suffering from a sampling bias—are overrepresented in S. aurantiaca, whereas they occur at the same frequency as an average M. xanthus gene in A. dehalogenans and in S. cellulosum and are underrepresented in H. ochraceum. Second, essentially all proteins in the MrpC/ FruA and C-signal signal transduction pathways in M. xanthus are conserved in S. aurantiaca, whereas only a minority of the proteins in these pathways are present in S. cellulosum, A. dehalogenans, or in H. ochraceum. Third, genes upregulated during development in M. xanthus are overrepresented in S. aurantiaca and slightly underrepresented in S. cellulosum, A. dehalogenans, and in H. ochraceum. In contrast, genes downregulated during development in M. xanthus are overrepresented in S. aurantiaca as well as in S. cellulosum, A. dehalogenans, and H. ochraceum. These 1093 Huntley et al. · doi:10.1093/molbev/msq292 differences among up and downregulated genes suggest that the five species studied could share in common a transcriptional response to starvation involving down-regulated genes. The COG classification of the corresponding genes and their high frequency of conservation outside the Myxococcales strongly suggest that this response represents the stringent response. The overall conclusion that fruiting Myxococcales do not share a common genetic program directing fruiting body formation has two interpretations. One would be that the genetic programs governing fruiting body formation evolved more than once in the Myxococcales. In other words, the last common ancestor of the Myxococcales was not a fruiting bacterium, fruiting bodies are not homologous structures, and fruiting body formation in the Myxococcales is an example of convergent evolution. All five Myxococcales share M. xanthus development-specific proteins and upregulated genes in common. If fruiting body formation did in fact evolve more than once, these shared genes could be part of a general response to starvation. A likely precedent for multicellularity evolving more than once within an order are provided by the evolution of multicellularity in the green algae Volvocales, which form planar or spherical colonies. In this group of organisms, it has been suggested that multicellularity has likely evolved several times during evolution (Kirk 1999, 2005). An alternative explanation for the overall conclusion that fruiting Myxococcales do not share a common genetic program directing fruiting body formation would be that the last common ancestor of the Myxococcales was a fruiting bacterium. Accordingly, this program would have been lost in A. dehalogenans probably in parallel with a reduction in genome size. A precedent for an apparently similar loss is provided by the loss of the sporulation program in the order Lactobacillales after the divergence from the last common ancestor of Bacilli (Makarova et al. 2006). In M. xanthus and S. aurantiaca, the extant version of this program would largely be shared because their separation into separate species occurred relatively late. In particular, the overrepresentation of M. xanthus development-specific proteins and genes upregulated during development genes in S. aurantiaca and the possible conservation of entire signal transduction pathways in S. aurantiaca support this evolutionary scheme. Moreover, the overrepresentation of M. xanthus development-specific genes and upregulated genes in S. aurantiaca suggest that selection acts to maintain these genes in M. xanthus and S. aurantiaca. In addition, M. xanthus may have evolved development-specific functions as suggested by the existence of development-specific proteins, which are not conserved in S. aurantiaca. We speculate that the same is true for S. aurantiaca. This idea is supported by the observation that fruiting body formation in S. aurantiaca depends on the intercellular signaling molecule stigmolone whereas this molecule has no effect on M. xanthus development (Plaga et al. 1998). On the other hand, the S. cellulosum and the H. ochraceum lineages separated much earlier from the M. xanthus and S. aurantiaca lineage and during the course of evolution the initially shared 1094 MBE program may have diverged to such an extent that the extant overlap with the programs in M. xanthus and S. aurantiaca is minor. According to this scheme, fruiting bodies are homologous structures generated from nonhomologous proteins. On the basis of the existing data, we cannot distinguish between the two scenarios, that is, convergent evolution or divergent evolution. Likewise, our data do not allow conclusions about the extent to which the genetic programs for fruiting body formation in S. cellulosum and H. ochraceum may overlap. Regardless, our findings suggest considerable plasticity in and innovation of developmental pathways in the Myxococcales. The last common ancestor of the Myxococcales is thought to have existed 2.0 Gyr ago (Hedges et al. 2006). The lack of conservation of a genetic program for fruiting body formation and sporulation in Myxococcales is in stark contrast to the conservation of the genetic programs for sporulation in bacilli and clostridia (both of which are low GþC Gram-positive bacteria) and cell cycle control in Alphaproteobacteria. Sporulation in bacilli, which are generally facultative anaerobes, and clostridia, which are obligate anaerobes, has been mostly studied in B. subtilis. Except for the components of the phosphorelay that regulates initiation of sporulation in B. subtilis, most other regulatory proteins in this program are conserved in sporulating Bacilli and Clostridia (Paredes et al. 2005). Clostridia are thought to have diverged from other low G þ C Gram-positive bacteria 2.7 Gyr ago and bacilli 2.3 Gyr ago (Paredes et al. 2005), suggesting that the genetic program for sporulation in these bacteria has been largely conserved for 2.7 Gyr. Cell cycle control in Alphaproteobacteria has been mostly studied in Caulobacter crescentus. A recent study of 13 proteins involved in cell cycle control in C. crescentus showed that many of these are conserved in other Alphaproteobacteria (Brilli et al. 2010), which had their last common ancestor 2.0 Gyr ago (Hedges et al. 2006), suggesting that the genetic program for cell cycle control in the Alphaproteobacteria was largely in place 2.0 Gyr ago. Cell cycle control is an essential process, and therefore, the possibility of significant evolutionary changes in this program is minor. On the other hand, sporulation is not absolutely essential for survival, thus possibly allowing for a larger degree of change. However, this does not explain the differences in level of conservation of proteins required for sporulation in the Myxococcales and low G þ C Gram-positive bacteria. The evolution of morphological variations in metazoans has been suggested to have mainly involved cis-regulatory mutations affecting gene expression patterns rather than protein function (Stern and Orgogozo 2008; Tirosh et al. 2009) and gene duplication followed by neo- or subfunctionalization (Lynch and Conery 2000). Nevertheless, there are examples in which the formation of homologous structures are regulated by nonhomologous pathways (De Robertis 2008; Sommer 2009). Likewise, orphan genes have been suggested to have important functions in the generation of morphological diversity (Khalturin et al. 2009). Bacterial genomes are thought to evolve mainly Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 by incorporation of novel genes by HGT, by intragenomic duplications and by gene loss (Ochman et al. 2000; Lerat et al. 2005; Dagan et al. 2008). It has previously been suggested that several genes important for development in M. xanthus were acquired by HGT and ‘‘fashioned into a unique developmental cycle and presumably played a different role in their previous host’’ (Goldman et al. 2007). Our analyses do not support HGT as a significant source for genes in developmental pathways in M. xanthus. These different conclusions are likely based on the much larger set of bacterial genomes available for our analyses. In summary, our analyses reveal an unexpected level of plasticity in the genetic programs for fruiting body formation and sporulation in the Myxococcales. Future work will be directed at elucidating whether this plasticity is the result of divergent evolution of a homologous program or convergent evolution of similar structures. Supplementary Material Supplementary tables S1–S8 and Figures S1–S2 are available at Molecular Biology and Evolution online (http:// mbe.oxfordjournals.org/). Acknowledgments We thank Penelope Higgs and Frank Müller for helpful discussions. US National Science Foundation grant number 0236595 (to W.C.N.) and the Max Planck Society supported this work. References Beiko RG, Harlow TJ, Ragan MA. 2005. Highways of gene sharing in prokaryotes. Proc Natl Acad Sci U S A. 102:14332–14337. Brilli M, Fondi M, Fani R, Mengoni A, Ferri L, Bazzicalupo M, Biondi E. 2010. The diversity and evolution of cell cycle regulation in alpha-proteobacteria: a comparative genomic analysis. BMC Syst Biol. 4:52. Chevenet F, Brun C, Banuls AL, Jacq B, Christen R. 2006. TreeDyn: towards dynamic graphics and annotations for analyses of trees. BMC Bioinformatics. 7:439. Crawford EW, Shimkets LJ. 2000. The Myxococcus xanthus socE and csgA genes are regulated by the stringent response. Mol Microbiol. 37:788–799. Dagan T, Artzy-Randrup Y, Martin W. 2008. Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution. Proc Natl Acad Sci U S A. 105:10039–10044. De Robertis EM. 2008. Evo-Devo: variations on ancestral themes. Cell 132:185–195. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 27:4636–4641. Dworkin M. 1972. The myxobacteria: new directions in studies of prokaryotic development. Crit Rev Microbiol. 1:435–452. Eisen JA, Heidelberg JF, White O, Salzberg SL. 2000. Evidence for symmetric chromosomal inversions around the replication origin in bacteria. Genome Biol. 1:RESEARCH0011. Ellehauge E, Norregaard-Madsen M, Søgaard-Andersen L. 1998. The FruA signal transduction protein provides a checkpoint for the temporal co-ordination of intercellular signals in Myxococcus xanthus development. Mol Microbiol. 30:807–817. MBE Fang G, Bhardwaj N, Robilotto R, Gerstein MB. 2010. Getting started in gene orthology and functional analysis. PLoS Comput Biol. 6:e1000703. Flärdh K, Buttner MJ. 2009. Streptomyces morphogenetics: dissecting differentiation in a filamentous bacterium. Nat Rev Microbiol. 7:36–49. Fudou R, Jojima Y, Iizuka T, Yamanaka S. 2002. Haliangium ochraceum gen. nov., sp. nov. and Haliangium tepidum sp. nov.: novel moderately halophilic myxobacteria isolated from coastal saline environments. J Gen Appl Microbiol. 48:109–115. Goldman B, Bhat S, Shimkets LJ. 2007. Genome evolution and the emergence of fruiting body development in Myxococcus xanthus. PLoS One. 2:e1329. Goldman BS, Nierman WC, Kaiser D, et al. (22 co-authors). 2006. Evolution of sensory complexity recorded in a myxobacterial genome. Proc Natl Acad Sci U S A. 103:15200–15205. Gordon D. 2003. Viewing and editing assembled sequences using Consed. Curr Protoc Bioinformatics. 11:2.1–2.43. Gronewold TM, Kaiser D. 2001. The act operon controls the level and time of C-signal production for Myxococcus xanthus development. Mol Microbiol. 40:744–756. Hedges SB, Dudley J, Kumar S. 2006. TimeTree: a public knowledgebase of divergence times among organisms. Bioinformatics 22:2971–2972. Higgs PI, Jagadeesan S, Mann P, Zusman DR. 2008. EspA, an orphan hybrid histidine protein kinase, regulates the timing of expression of key developmental proteins of Myxococcus xanthus. J Bacteriol. 190:4416–4426. Jelsbak L, Givskov M, Kaiser D. 2005. Enhancer-binding proteins with a forkhead-associated domain and the sigma54 regulon in Myxococcus xanthus fruiting body development. Proc Natl Acad Sci U S A. 102:3010–3015. Jelsbak L, Søgaard-Andersen L. 2002. Pattern formation by a cell surface-associated morphogen in Myxococcus xanthus. Proc Natl Acad Sci U S A. 99:2032–2037. Kettler GC, Martiny AC, Huang K, et al. (14 co-authors). 2007. Patterns and implications of gene gain and loss in the evolution of Prochlorococcus. PLoS Genet. 3:e231. Khalturin K, Hemmrich G, Fraune S, Augustin R, Bosch TCG. 2009. More than just orphans: are taxonomically-restricted genes important in evolution? Trends Genetics. 25:404–413. Kim SK, Kaiser D. 1990. C-factor: a cell-cell signaling protein required for fruiting body morphogenesis of M. xanthus. Cell 61:19–26. Kirk DL. 1999. Evolution of multicellularity in the volvocine algae. Curr Opin Plant Biol. 2:496–501. Kirk DL. 2005. A twelve-step program for evolving multicellularity and a division of labor. BioEssays 27:299–310. Knauber T, Doss SD, Gerth K, Perlova O, Müller R, Treuner-Lange A. 2008. Mutation in the rel gene of Sorangium cellulosum affects morphological and physiological differentiation. Mol Microbiol. 69:254–266. Kroos L. 2007. The Bacillus and Myxococcus developmental networks and their transcriptional regulators. Annu Rev Genet. 41:13–39. Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108. Leonardy S, Bulyha I, Søgaard-Andersen L. 2008. Reversing cells and oscillating proteins. Mol BioSystems. 4:1009–1014. Lerat E, Daubin V, Ochman H, Moran NA. 2005. Evolutionary origins of genomic repertoires in bacteria. PLoS Biol. 3:e130. Lobedanz S, Søgaard-Andersen L. 2003. Identification of the C-signal, a contact dependent morphogen coordinating multiple developmental responses in Myxococcus xanthus. Genes Dev. 17:2151–2161. 1095 Huntley et al. · doi:10.1093/molbev/msq292 Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964. Lynch M, Conery JS. 2000. The evolutionary fate and consequences of duplicate genes. Science 290:1151–1155. Makarova K, Slesarev A, Wolf Y, et al. (50 co-authors). 2006. Comparative genomics of the lactic acid bacteria. Proc Natl Acad Sci U S A. 103:15611–15616. Myers EW, Sutton GG, Delcher AL, et al. (29 co-authors). 2000. A whole-genome assembly of Drosophila. Science 287:2196–2204. Nariya H, Inouye M. 2008. MazF, an mRNA interferase, mediates programmed cell death during multicellular Myxococcus development. Cell 132:55–66. Nariya H, Inouye S. 2005. Identification of a protein Ser/Thr kinase cascade that regulates essential transcriptional activators in Myxococcus xanthus development. Mol Microbiol. 58:367–379. Natalia I, Chris D, Elke L, et al. (33 co-authors). 2010. Complete genome sequence of Haliangium ochraceum type strain (SMP2T). Stand Genomic Sci. 2:96–106. O’Connor KA, Zusman DR. 1991a. Development in Myxococcus xanthus involves differentiation into two cell types, peripheral rods and spores. J Bacteriol. 173:3318–3333. O’Connor KA, Zusman DR. 1991b. Behaviour of peripheral rods and their role in the life cycle of Myxococcus xanthus. J Bacteriol. 173:3342–3355. Ochman H, Lawrence JG, Groisman EA. 2000. Lateral gene transfer and the nature of bacterial innovation. Nature 405:299–304. Ogawa M, Fujitani S, Mao X, Inouye S, Komano T. 1996. FruA, a putative transcription factor essential for the development of Myxococcus xanthus. Mol Microbiol. 22:757–767. Overbeek R, Fonstein M, D’Souza M, Pusch GD, Maltsev N. 1999. The use of gene clusters to infer functional coupling. Proc Natl Acad Sci U S A. 96:2896–2901. Paredes CJ, Alsaker KV, Papoutsakis ET. 2005. A comparative genomic view of clostridial sporulation and physiology. Nat Rev Microbiol. 3:969–978. Perez JC, Groisman EA. 2009. Evolution of transcriptional regulatory circuits in bacteria. Cell 138:233–244. Plaga W, Stamm I, Schairer HU. 1998. Intercellular signaling in Stigmatella aurantiaca: purification and characterization of stigmolone, a myxobacterial pheromone. Proc Natl Acad Sci U S A. 95:11263–11267. Pollack JS, Singer M. 2001. SdeK, a histidine kinase required for Myxococcus xanthus development. J Bacteriol. 183:3589–3596. Potrykus K, Cashel M. 2008. (p)ppGpp: still magical? Annu Rev Microbiol. 62:35–51. Rabus R, Kube M, Beck A, Widdel F, Reinhardt R. 2002. Genes involved in the anaerobic degradation of ethylbenzene in a denitrifying bacterium, strain EbN1. Arch Microbiol. 178:506–516. Rasmussen AA, Porter SL, Armitage JP, Sogaard-Andersen L. 2005. Coupling of multicellular morphogenesis and cellular differentiation by an unusual hybrid histidine protein kinase in. Myxococcus xanthus. Mol Microbiol. 56:1358–1372. Rasmussen AA, Sogaard-Andersen L. 2003. TodK, a putative histidine protein kinase, regulates timing of fruiting body morphogenesis in Myxococcus xanthus. J Bacteriol. 185:5452–5464. Rolbetzki A, Ammon M, Jakovljevic V, Konovalova A, SogaardAndersen L. 2008. Regulated secretion of a protease activates intercellular signaling during fruiting body formation in M. xanthus. Dev Cell. 15:627–634. Rosenbluh A, Nir R, Sahar E, Rosenberg E. 1989. Cell-densitydependent lysis and sporulation of Myxococcus xanthus in agarose beads. J Bacteriol. 171:4923–4929. 1096 MBE Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B. 2000. Artemis: sequence visualization and annotation. Bioinformatics 16:944–945. Sanford RA, Cole JR, Tiedje JM. 2002. Characterization and description of Anaeromyxobacter dehalogenans gen. nov., sp. nov., an aryl-halorespiring facultative anaerobic Myxobacterium. Appl Environ Microbiol. 68:893–900. Schneiker S, Perlova O, Kaiser O, et al. (58 co-authors). 2007. Complete genome sequence of the myxobacterium Sorangium cellulosum. Nat Biotechnol. 25:1281–1289. Shi X, Wegener-Feldbrugge S, Huntley S, Hamann N, Hedderich R, Sogaard-Andersen L. 2008. Bioinformatics and experimental analysis of proteins of two-component systems in Myxococcus xanthus. J Bacteriol. 190:613–624. Shimkets L, Dworkin M, Reichenbach H. 2006. The myxobacteria. In: Dworkin M, Falkow S, Rosenberg E, Schleifer K-H, Stackenbrandt E, editors. The prokaryotes. New York: Springer. p. 31–115. Shimkets LJ, Brun YV. 2000. Prokaryotic development: strategies to enhance survival. In: Brun YV, Shimkets LJ, editors. Prokaryotic development. Washington DC: American Society for Microbiology. p. 1–7. Søgaard-Andersen L, Kaiser D. 1996. C factor, a cell-surfaceassociated intercellular signaling protein, stimulates the cytoplasmic Frz signal transduction system in Myxococcus xanthus. Proc Natl Acad Sci U S A. 93:2675–2679. Sommer RJ. 2009. The future of evo-devo: model systems and evolutionary theory. Nat Rev Genet. 10:416–422. Stern DL, Orgogozo V. 2008. The loci of evolution: how predictable is genetic evolution? Evolution 62:2155–2177. Sun H, Shi W. 2001a. Genetic studies of mrp, a locus essential for cellular aggregation and sporulation of Myxococcus xanthus. J Bacteriol. 183:4786–4795. Sun H, Shi W. 2001b. Analyses of mrp genes during Myxococcus xanthus development. J Bacteriol. 183:6733–6739. Tatusov RL, Koonin EV, Lipman DJ. 1997. A genomic perspective on protein families. Science. 278:631–637. Tettelin H, Masignani V, Cieslewicz MJ, et al. (46 co-authors). 2005. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial ‘‘pan-genome’’. Proc Natl Acad Sci U S A. 102:13950–13955. Thomas SH, Wagner RD, Arakaki AK, Skolnick J, Kirby JR, Shimkets LJ, Sanford RA, Loffler FE. 2008. The mosaic genome of Anaeromyxobacter dehalogenans strain 2CP-C suggests an aerobic common ancestor to the delta-proteobacteria. PLoS One. 3:e2103. Thompson JD, Higgins DG, Gibson TJ. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673–4680. Tirosh I, Reikhav S, Levy AA, Barkai N. 2009. A yeast hybrid provides insight into the evolution of gene expression regulation. Science 324:659–662. Ueki T, Inouye S. 2003. Identification of an activator protein required for the induction of fruA, a gene essential for fruiting body development in Myxococcus xanthus. Proc Natl Acad Sci U S A. 100:8782–8787. Ueki T, Inouye S. 2005. Identification of a gene involved in polysaccharide export as a transcription target of FruA, an essential factor for Myxococcus xanthus development. J Biol Chem. 280:32279–32284. Viswanathan P, Murphy K, Julien B, Garza AG, Kroos L. 2007a. Regulation of dev, an operon that includes genes essential for Myxococcus xanthus development and CRISPR-associated genes and repeats. J Bacteriol. 189:3738–3750. Viswanathan P, Ueki T, Inouye S, Kroos L. 2007b. Combinatorial regulation of genes essential for Myxococcus xanthus Evolution of fruiting Fruiting body Body formation Formation · doi:10.1093/molbev/msq292 development involves a response regulator and a LysR-type regulator. Proc Natl Acad Sci U S A. 104:7969–7974. Wegener-Feldbrugge S, Sogaard-Andersen L. 2009. The atypical hybrid histidine protein kinase RodK in Myxococcus xanthus: spatial proximity supersedes kinetic preference in phosphotransfer reactions. J Bacteriol. 191:1765–1776. MBE Wireman JW, Dworkin M. 1977. Developmentally induced autolysis during fruiting body formation by Myxococcus xanthus. J Bacteriol. 129:798–802. Wu D, Hugenholtz P, Mavromatis K, et al. (34 co-authors). 2009. A phylogeny-driven genomic encyclopaedia of bacteria and archaea. Nature 462:1056–1060. 1097
© Copyright 2026 Paperzz