Evolution of Key Cell Signaling and Adhesion Protein Families Predates Animal Origins Nicole King, Christopher T. Hittinger, Sean B. Carroll* The evolution of animals from a unicellular ancestor involved many innovations. Choanoflagellates, unicellular and colonial protozoa closely related to Metazoa, provide a potential window into early animal evolution. We have found that choanoflagellates express representatives of a surprising number of cell signaling and adhesion protein families that have not previously been isolated from nonmetazoans, including cadherins, C-type lectins, several tyrosine kinases, and tyrosine kinase signaling pathway components. Choanoflagellates have a complex and dynamic tyrosine phosphoprotein profile, and cell proliferation is selectively affected by tyrosine kinase inhibitors. The expression in choanoflagellates of proteins involved in cell interactions in Metazoa demonstrates that these proteins evolved before the origin of animals and were later co-opted for development. A central question in animal evolution is how multicellular animals evolved from a protozoan ancestor. One approach to animal origins is to determine which developmental proteins predated the origin of animals and were subsequently co-opted for animal development. Comparative genomics can identify the minimal set of genes in place at the outset of animal evolution by revealing those shared by all animals and their nearest relatives (1). The choanoflagellates, a group of unicellular and colonial flagellates that resemble cells found only in Metazoa, have emerged as an important model for studies of early animal evolution (2, 3). Analyses of nuclear and mitochondrial genes consistently support a close phylogenetic relationship between choanoflagellates and Metazoa (4–7). Specifically, the presence in a choanoflagellate mitochondrial genome of multiple genes lost from a conserved portion of animal (including sponge and cnidarian) mitochondrial genomes suggests that choanoflagellates are an outgroup of animals (8–11) (fig. S1 and tables S1 and S2). Therefore, genes expressed by choanoflagellates and animals to the exclusion of other eukaryotes define the minimal complexity of the eukaryotic genome before the emergence of multicellular animals. To sample the diversity of genes expressed by choanoflagellates, we collected more than 5000 expressed sequence tags (ESTs) from two choanoflagellate species, Monosiga brevicollis and Proterospongia-like sp. ATCC50818 (11, 12). Under laboratory conditions, Monosiga is strictly unicellular, whereas ATCC50818 occasionally forms small colonies of apparently unHoward Hughes Medical Institute (HHMI), University of Wisconsin, 1525 Linden Drive, Madison, WI 53706, USA. *To whom correspondence should be addressed. Email: [email protected] differentiated cells; both may exhibit additional behaviors in the natural environment. In marked contrast to their simple lifestyle, choanoflagellates express members of a wide variety of protein families involved in animal cell interactions, including cadherins, C-type lectins, tyrosine kinases (TKs), and a G protein–coupled receptor (GPCR), as well as several multidomain polypeptides that contain protein-protein interaction domains involved in signaling and adhesion in animals [such as the epidermal growth factor (EGF) motif, Src homology 2 (SH2) domain, tumor necrosis factor receptor (TNFR) domain, and sushi or complement control protein (CCP) domain (Fig. 1)]. The phylogenetic distribution of each protein or domain of interest was determined by a combination of strategies. First, the occurrence of each domain in eukaryotes, Bacteria, Archaea, and viruses was examined with two protein domain annotation programs, SMART and PFAM (13, 14). Second, we queried all available sequences from representative nonanimals, including Fungi, Plantae, Dictyostelium, and diverse bacteria, using BlastX and keyword searches. Third, we examined certain sequences more closely for adherence to the conserved traits of the protein domain family (fig. S2). Proteins containing cadherin, fibrinogen, GPCR proteolytic site (GPS), somatomedin, and CCP domains were not detected in Fungi, Plantae, or other eukaryotes (11). Although putative TK and C-type lectin domains were detected in plant genomes by the SMART and PFAM annotation sites, further scrutiny revealed these sequences to have been classified incorrectly (11). We conclude that choanoflagellates and animals share, to the exclusion of other eukaryotes whose genomes have been analyzed, proteins that contain secretin-like GPCR, GPS, fibrinogen, somatomedin, and CCP domains, as well as members of the cadherin, C-type lectin, and TK protein families. In animals, cadherins mediate cell-cell adhesion and signaling through homophilic interactions (15). Both Monosiga and ATCC50818 were found to express a cadherin-encoding gene (Fig. 1). MBCDH1 from Monosiga encodes at least two cadherin repeats and PRCDH1 from ATCC50818 encodes at least four. Comparisons of cadherin repeat sequences and the domain architectures of cadherin-containing proteins have revealed at least six subfamilies within the cadherin family (16). Phylogenetic analysis revealed the choanoflagellate cadherins to be most similar to protocadherins and to the Flamingo class of cadherins (11) (fig. S3). Fig. 1. Choanoflagellates express a variety of multidomain proteins that contain animal-type protein motifs (11). Cadherin domains are found as tandem repeats in predicted proteins from ATCC50818 (PRCDH1) and Monosiga(MBCDH1). Choanoflagellates also express members of the C-type lectin (MBCTL1 and MBCTL2) and TK (MBSRC1, MBRTK2, and MBRTK1) protein families. Additional multidomain proteins from choanoflagellates (such as MB3515) contain previously uncharacterized combinations of protein domains that are commonly found in animal proteins that mediate cell adhesion and signaling. MB7TM1 resembles secretin-like GPCRs, and MBSPEC1 resembles spectrin-containing proteins, both families of which are used in animals for signaling and cytoskeletal structure respectively. The source species is indicated by the first two letters of each gene name; “PR” indicates predicted proteins from ATCC50818 and “MB” indicates predicted proteins from Monosiga. www.sciencemag.org SCIENCE VOL 301 18 JULY 2003 Downloaded from www.sciencemag.org on December 6, 2012 REPORTS 361 C-type lectins mediate cell adhesion and signaling through calcium-dependent recognition and binding of specific sugars. Within the superfamily that contains C-type lectin–like domains, those containing carbohydrate recognition domains (CRDs) have previously been identified only within animals (17). We found two CRDcontaining C-type lectin genes in Monosiga, one containing three C-type lectin domains (MBCTL1) and a second containing a C-type lectin domain fused to three CCP domains (MBCTL2) (Fig. 1 and fig. S2). We have also isolated five homologs of animal TKs, four from Monosiga and one from ATCC50818, including MBRTK1, which we have previously reported (5) (fig. S2). Two choanoflagellate TKs, MBRTK1 and MBRTK2, conform to the general structure of receptor TKs, with a C-terminal TK enzymatic core connected through a transmembrane span to an N-terminal extracellular domain (Fig. 1). The remaining three choanoflagellate TKs (MBSRC1, MBTK3, and PRTK1) lack transmembrane domains and are predicted to operate downstream in TK signaling pathways. For instance, MBSRC1 contains an N-terminal SH2 domain linked to a TK domain and resembles Src TKs from animals (Fig. 1 and fig. S3). Because components of TK signaling pathways appear highly conserved from sponges to vertebrates, we sought to determine whether TK signaling, although absent from fungi and plants, evolved before the origin of animals. We identified ESTs for many components of TK signaling pathways in Monosiga and ATCC50818, including phosphoinositide 3-kinase, growth factor receptor– bound protein 2, Vav proteins, PDZ-containing sequences, tyrosine phosphatases, Raf kinase, and Ras, in addition to mitogen-activated protein kinase and Src kinase (18–20). The expression of these signal transduction components suggests that choanoflagellates may use tyrosine phosphorylation for signaling in a manner similar to animals. To investigate the extent of TK signaling in choanoflagellates, we probed Monosiga cell lysates with antibodies specific to phosphotyrosine. In cells maintained under conditions of nutrient depletion, approximately 15 to 18 specific phosphotyrosine-containing proteins were detected (Fig. 2, A and B), a similar number to that seen in unstimulated animal cells (21, 22). In animal cells, TK signaling is rapidly stimulated by the addition of serum to cultures of serumstarved cells (21, 23). We compared the tyrosine phosphoprotein profile of nutrient-depleted cells cultured in seawater with the profile in cells exposed to seawater enriched with either a cereal grass infusion (1525) or 1525 supplemented with Enterobacter (1525/Ea). No change was seen in the relative abundance of tyrosine-phosphorylated proteins from starved cells treated with fresh seawater compared to untreated starved cells (data not shown). In contrast, treatment of starved cells with either 1525 or 1525/Ea resulted in rapid modifications to the tyrosine phosphorylation status of at least four proteins (Fig. Fig. 2. The spectrum of tyrosine phosphoproteins in Monosiga. Monosiga cell lysates were electrophoresed, blotted onto membranes, and probed with antibodies specific to phosphotyrosine (11). (A) The profile of tyrosine-phosphorylated proteins in nutrient-deprived cells. About 15 to 18 tyrosine-phosphorylated protein species, ranging in size from 40 kD to ⬎200 kD, were detected in lysates from Monosiga cultured in seawater. (B) All but one of the detected tyrosine phosphoproteins in (A) derive from Monosiga. We compared the tyrosine phosphoproteins in protein extracts from Monosiga cultured with Enterobacter (M.b. ⫹ E.a.) with those from Enterobacter (E.a.) protein extracts (left panel). Two bacterial protein species reacted with the antibodies to phosphotyrosine, but only the larger molecular weight band (approximately 78 kD) is specific as determined by inhibition with the hapten p-nitrophenyl phosphate (PNPP) (right panel). (C) Protein tyrosine phosphorylation status is affected by environmental conditions. Cells were incubated in seawater as in (A) and (B), then treated for 5, 10, or 30 min with either seawater (S.W.), a cereal grass infusion of seawater (1525), or 1525 supplemented with Enterobacter at 108 cells/mL (1525/Ea). Within 5 min, a 95-kD protein species (P95) was detected in lysates treated with cereal grass infusion either with or without Enterobacter, but not in lysates treated with seawater. In contrast, the antibodies to phosphotyrosine showed diminished reactivity to protein species of 38, 80, and 83 kD (P38 and P80/83) in the presence of cereal grass infusion. 362 2C). An apparently phosphotyrosine-free 95-kD protein (P95) became tyrosine-phosphorylated within 5 min of treatment, indicating the probable activation of one or more tyrosine kinases, and the tyrosine phosphorylation levels of three protein species (P38, P80, and P83) diminished in response to stimulation with enriched media (Fig. 2C). To examine the functional significance of TK signaling in choanoflagellate biology, we measured the proliferation and maximal cell density of Monosiga cultures treated with specific inhibitors of TKs at concentrations previously shown to be effective in cultured animal cells (11, 20, 24). Control cultures of Monosiga grown in 1525/Ea conformed to the typical growth characteristics of cultured microbes, exhibiting lag, log, and stationary phases (Fig. 3). The doubling time of untreated cells during log phase was approximately 6 hours and cells reached a maximum density of ⬃107 cells/mL. In contrast, cultures treated with genistein, a general inhibitor of TKs, were delayed in their entry into log phase and reached a maximum density of 2 ⫻ 106 cells/mL, one-fifth that of untreated cells. In addition, treatment of Monosiga cultures with PP2, an inhibitor of Src kinases, entirely blocked cell proliferation. Treatment of Monosiga cultures with inactive analogs of genistein and PP2, the control compounds genistin and PP3, had no significant effect on cell density (not shown). Importantly, genistein- and PP2-treated cells displayed normal levels of motility and feeding behavior. The dependence of Monosiga cells on TK activity indicates a functional link between TK signaling and the cell cycle or cell survival. The discovery of multiple signaling and adhesion gene family members in choanoflagellates demonstrates that key proteins required for animal development evolved before the origin of animals. Other multicellular eukaryotes, and even those with complex modes of development Fig. 3. TK activity is required for proliferation of Monosiga cultures. In contrast with cultures grown in the presence of a control solvent [dimethyl sulfoxide (DMSO), diamonds], the increase in cell density of cultures treated with TK inhibitors [25 M genistein (circles) or 500 nM PP2 (squares) in DMSO] is significantly reduced (11). 18 JULY 2003 VOL 301 SCIENCE www.sciencemag.org Downloaded from www.sciencemag.org on December 6, 2012 REPORTS REPORTS References and Notes 1. Comparisons of yeast, Caenorhabditis, Drosophila, and human genomes have revealed a set of protein domains conserved among Bilateria and absent from yeast (25, 26). However, because the Bilateria represent a recent derivation within the Metazoa and the Fungi diverged from the animal lineage long before the transition to multicellularity, comparisons limited to fungal and bilaterian animal genomes do not reveal the complexity of the ancestral animal genome. 2. W. Saville Kent, A Manual of the Infusoria (David Bogue, London, 1880 –1882). 3. D. J. Hibberd, J. Cell Sci. 17, 191 (1975). 4. P. O. Wainright, G. Hinkle, M. L. Sogin, S. K. Stickel, Science 260, 340 (1993). 5. N. King, S. B. Carroll, Proc. Natl. Acad. Sci. U.S.A. 98, 15032 (2001). 6. E. A. Snell, R. F. Furlong, P. W. H. Holland, Curr. Biol. 11, 967 (2001). 7. B. F. Lang, C. O’Kelly, T. Nerad, M. W. Gray, G. Burger, Curr. Biol. 12, 1773 (2002). 8. G. Burger, L. Forget, Y. Zhu, M. W. Gray, B. F. Lang, Proc. Natl. Acad. Sci. U.S.A. 100, 892 (2003). 9. R. F. Watkins, A. T. Beckenbach, J. Mol. Evol. 48, 542 (1999). 10. J. L. Boore, Nucleic Acids Res. 27, 1767 (1999). 11. Materials and methods are available as supporting material on Science Online. 12. Searchable choanoflagellate EST databases are available at http://projects.bocklabs.wisc.edu/ carroll/choano/. 13. I. Letunic et al., Nucleic Acids Res. 30, 242 (2002). 14. A. Bateman et al., Nucleic Acids Res. 30, 276 (2002). 15. L. Shapiro, P. D. Kwong, A. M. Fannon, D. R. Colman, W. A. Hendrickson, Proc. Natl. Acad. Sci. U.S.A. 92, 6793 (1995). 16. F. Nollet, P. Kools, F. van Roy, J. Mol. Biol. 299, 551 (2000). 17. R. B. Dodd, K. Drickamer, Glycobiology 11, 71R (2001). 18. X. R. Bustelo, Mol. Cell. Biol. 20, 1461 (2000). 19. A. C. Porter, R. R. Vaillancourt, Oncogene 17, 1343 (1998). 20. A. Levitzki, A. Gazit, Science 267, 1782 (1995). 21. E. B. Pasquale, P. A. Maher, S. J. Singer, J. Cell. Physiol. 137, 146 (1988). 22. K. Miyazawa et al., Blood 80, 1685 (1992). 23. J. A. Cooper, B. M. Sefton, T. Hunter, Mol. Cell. Biol. 4, 30 (1984). 24. P. Kumar, S. Hosaka, A. E. Koch, J. Biol. Chem. 276, 21039 (2001). 25. G. M. Rubin et al., Science 287, 2204 (2000). 26. V. Wood et al., Nature 415, 871 (2002). 27. We thank P. Bertics, A. Guadarrama, B. Leadbeater, and T. Hunter for advice and technical assistance; L. Olds for assistance with graphics; J. Holt for computing support; and B. Hersh and C. Malone for critical reading of the manuscript. N.K. is supported by an NIH postdoctoral fellowship (GM-20734) and C.T.H. is an HHMI predoctoral fellow. This work was supported by the HHMI (S.B.C.). Supporting Online Material www.sciencemag.org/cgi/content/full/301/5631/361/ DC1 Materials and Methods SOM Text Figs. S1 to S4 Tables S1 and S2 References and Notes 25 February 2003; accepted 28 May 2003 Hox10 and Hox11 Genes Are Required to Globally Pattern the Mammalian Skeleton Deneen M. Wellik and Mario R. Capecchi* Mice in which all members of the Hox10 or Hox11 paralogous group are disrupted provide evidence that these Hox genes are involved in global patterning of the axial and appendicular skeleton. In the absence of Hox10 function, no lumbar vertebrae are formed. Instead, ribs project from all posterior vertebrae, extending caudally from the last thoracic vertebrae to beyond the sacral region. In the absence of Hox11 function, sacral vertebrae are not formed and instead these vertebrae assume a lumbar identity. The redundancy among these paralogous family members is so great that this global aspect of Hox patterning is not apparent in mice that are mutant for five of the six paralogous alleles. Hox genes have long been recognized as important transcriptional regulators of embryonic development. In mammals, this complex of 39 genes resides on four separate chromosomal linkage groups designated A, B, C, and D, which arose early in the evolution of vertebrates from successive duplications of a single ancestral complex. Homologous members within the separate linkage groups are divided into 13 sets of paralogous genes, each having two to four members. During development, paralogous sets of genes are activated sequentially, with Hox1 and Hox2 paralogous genes being expressed earlier and more anteriorly in the embryo and successive genes through paralogous group Hox13 appearing later and more posteriorly. Howard Hughes Medical Institute and University of Utah, Salt Lake City, UT 84112, USA. *To whom correspondence should be addressed. Email: [email protected] The spectrum of perturbations of the mammalian skeleton resulting from either gain- or loss-of-function mutations in individual Hox genes has been difficult to interpret in terms of a coherent model of how these genes participate in the patterning of the axial skeleton. Loss-of-function Hox mutations have yielded changes in vertebral morphology along the anteroposterior (AP) axis that have been interpreted as anterior homeotic transformations as well as posterior homeotic transformations. Typically, these morphological changes involve perturbations in one or a small number of vertebrae. Among different vertebrate species, axial skeletal patterns have diverged considerably. A comparative survey of Hox gene expression in mice and chicks showed that Hox gene expression boundaries along the rostrocaudal axis shift in accordance with changes in the class of vertebrae produced at a given axial level (1). This observation suggests that Hox genes play a crit- www.sciencemag.org SCIENCE VOL 301 18 JULY 2003 Downloaded from www.sciencemag.org on December 6, 2012 (e.g., Dictyostelium, fungi, and plants), apparently lack many proteins used by animals for signaling and adhesion. Therefore, choanoflagellates, with their closer evolutionary relationship to animals (4–7) and their expression of signaling and cell adhesion protein homologs, are more informative for studies of animal origins. The existence in unicellular choanoflagellates of proteins used for cell adhesion and signal transduction in animals prompts the question of their ancestral function in the progenitor of animals and choanoflagellates. Despite the apparent simplicity of the choanoflagellate lifestyle, it is possible that choanoflagellate homologs of animal proteins perform similar biochemical functions within a unicellular context. For instance, TKs may act in choanoflagellates to detect changes in the extracellular environment, as we have demonstrated through their response to nutrient availability. In addition, animal cell adhesion proteins, such as the cadherins, may derive from ancestral proteins that stabilized the interactions between protozoan cells during conjugation or colony formation. Proteins that mediate cell attachment or defense against pathogens in animals may have evolved from proteins required for prey recognition and capture. C-type lectins might allow choanoflagellates to distinguish between and capture different bacterial species by binding specific sugar groups displayed on bacterial cell walls. Targeted manipulations of gene function in choanoflagellates will be necessary to test hypotheses about the ancestral roles of these conserved molecules. We have sampled just a fraction of the choanoflagellate proteome. The diversity of choanoflagellate proteins predicted to function in cell interactions suggests that additional proteins shared exclusively with animals will be discovered through sequencing the entire choanoflagellate genome. Of particular interest will be the repertoire of transcription factors and the potential representation of families of proteins that regulate cell differentiation and development in animals. It may then be possible to determine whether entire regulatory pathways linking receptorbased signaling inputs to gene regulation and cell behavior predate the origin of animals. 363
© Copyright 2026 Paperzz