Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Phil. Trans. R. Soc. B (2012) 367, 2556–2573 doi:10.1098/rstb.2012.0107 Review Evolution of SH2 domains and phosphotyrosine signalling networks Bernard A. Liu1 and Piers D. Nash2, * 1 2 Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada Ben May Department for Cancer Research and Committee on Cancer Biology, The University of Chicago, Chicago, IL 60637, USA Src homology 2 (SH2) domains mediate selective protein–protein interactions with tyrosine phosphorylated proteins, and in doing so define specificity of phosphotyrosine (pTyr) signalling networks. SH2 domains and protein-tyrosine phosphatases expand alongside protein-tyrosine kinases (PTKs) to coordinate cellular and organismal complexity in the evolution of the unikont branch of the eukaryotes. Examination of conserved families of PTKs and SH2 domain proteins provides fiduciary marks that trace the evolutionary landscape for the development of complex cellular systems in the proto-metazoan and metazoan lineages. The evolutionary provenance of conserved SH2 and PTK families reveals the mechanisms by which diversity is achieved through adaptations in tissue-specific gene transcription, altered ligand binding, insertions of linear motifs and the gain or loss of domains following gene duplication. We discuss mechanisms by which pTyr-mediated signalling networks evolve through the development of novel and expanded families of SH2 domain proteins and the elaboration of connections between pTyr-signalling proteins. These changes underlie the variety of general and specific signalling networks that give rise to tissue-specific functions and increasingly complex developmental programmes. Examination of SH2 domains from an evolutionary perspective provides insight into the process by which evolutionary expansion and modification of molecular protein interaction domain proteins permits the development of novel protein-interaction networks and accommodates adaptation of signalling networks. Keywords: SH2; protein-tyrosine kinase; eukaryotes; phosphotyrosine 1. INTRODUCTION Phosphotyrosine (pTyr)-mediated signalling is a relatively recently evolved cellular communication system in comparison to more primordial post-translational modifications (PTMs) such as Ser/Thr phosphorylation or lysine ubiquitination that are used across all branches of the eukaryota [1]. Some prokaryotes possess unique tyrosine kinases that are involved in pTyr-mediated cellular activities but these are distinct from those found in metazoans (see Grangeasse et al. [2]). Signalling mediated through the use of pTyr is essential in metazoans, regulating many key cellular and developmental processes, including cell growth, proliferation, differentiation and migration. In metazoans, pTyr signalling depends upon an essential triad of signalling molecules: the protein-tyrosine kinases (PTKs) that add a phosphate onto substrate tyrosines (conceptually these may be viewed as the ‘writers’); the protein-tyrosine phosphatases (PTPs) that remove or dephosphorylate substrates (the ‘erasers’); and the modular protein interaction domains that recognize the phosphorylated ligand (the ‘readers’) and hence recruit the proteins containing these * Author for correspondence ([email protected]). One contribution of 13 to a Theme Issue ‘The evolution of protein phosphorylation’. domains to specify downstream signalling events [1]. Several modular interaction domains have the capability to bind to tyrosine-phosphorylated ligands. These include most Src homology 2 (SH2) domains [3,4], a subset of phosphotyrosine-binding (PTB) domains [5], at least one C2 domain [6] and the Hakai pTyr-binding domain [7]. The SH2 domain is the largest domain family dedicated to pTyr recognition, with 111 proteins containing at least one SH2 domain encoded in the human genome [8]. With the recent sequencing of the genomes of multiple organisms, the identification of the PTKs, PTPs and SH2 domains has provided insights into the emergent evolution of this PTM signalling system [1,9 – 11]. Recent examination of the evolutionary history of SH2 domains indicates that PTKs coevolved along with SH2 domain proteins to bring about organismal and tissue complexity in metazoans [8]. The diversification of proteins that contain SH2 domains allows this singularly remarkable domain to coordinate specificity in a diverse range of signalling networks. This review explores the origins of pTyr-mediated signalling through the expansion and diversification of SH2-domain-containing proteins. Examination of the origins of SH2 domains and their respective protein families reveals that SH2 domains are integrated into a diverse set of protein interaction modules. In this way, SH2 domains coordinate 2556 This journal is q 2012 The Royal Society Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling (a) B. A. Liu and P. D. Nash 54 PTB 90 PTKs 2557 111 SH2 P P x–3-x–2-x–1-Y-x+1-x+2-x+3 x–3-x–2-x–1-Y-x+1-x+2-x+3 >10 000 pTyr sites P 107 PTPs number of genes (b) 180 360 450 751–993 540–650 970 990 650 H. sapiens 90 M. musculus M. domestica X. tropicalis D. rerio C. intestinalis S. purpuratus A. aegypti D. melanogaster C. elegans N. vectensis M. brevicollis 250 761–957 1400 1500 1000 900 800 700 600 500 400 300 200 100 PTK 90 93 90 79 122 49 53 SH2 111 111 115 101 133 54 53 PTP 38 (107) 38 32 28 51 32 36 31 32 90 31 128 33 36 66 29 112 15 15 67 36 38 S. cerevisiae 0 1 3 D. dictyostelium 0 15 4 Ma Figure 1. The phosphotyrosine signalling circuitry expands in unikonts to become a large and complex signalling system in mammals. (a) The number of writers (protein-tyrosine kinases, PTKs), erasers (protein-tyrosine phosphatases, PTP) and readers (Src homology 2, SH2; phosphotyrosine-binding domain, PTB) in humans. PTB (green) and SH2 (light blue) domain both contain a pTyr binding pocket (yellow circle), yet specificity is determined by a secondary binding site adjacent to the pTyr binding pocket that recognize sequences surrounding the pTyr. (b) The total number of PTK, classical PTPs (total number of tyrosine phosphatases) and SH2-encoded genes across eukaryotes. The branched divergence times and lengths in millions of years (Ma) are indicated. specificity across a multitude of signalling pathways. As we trace the evolutionary history of the different protein families, we discuss the various evolutionary paths by which these proteins change and diverge. In detail, we will highlight the key underlying genetic mechanisms of diversification through gene duplication, transcriptional regulation, domain loss and gain, and insertion of linear motifs. Lastly, we will explore both the conserved and divergent nature of pTyr-mediated signalling networks to provide a better appreciation for the complexity of mammalian pTyr networks. This overview will introduce the mechanisms by which adaptive changes within pTyr– SH2 interaction networks act to drive cellular complexity and participate in the development of tissue specific signalling networks. Understanding the molecular changes that drive expansion and complexity of pTyr signalling networks provides an early glimpse of some aspects of the process of molecular evolution that underlies the adaptive changes in organisms as they adapt and develop more elaborate functions and increasingly complex tissue types. Phil. Trans. R. Soc. B (2012) 2. SH2 DOMAINS HAVE A BROAD ROLE IN PHOSPHOTYROSINE SIGNALLING AMONG MAMMALS Much of our knowledge and understanding of pTyr signalling and SH2 domains comes from studies in mammals, particularly in mouse and human. The human genome encodes roughly 90 PTKs, 107 PTPs, 54 PTB domain-containing proteins (only about one-fifth of PTB domains can bind pTyr) and 111 SH2-domain-containing proteins. This creates a complex network of regulators in pTyr signalling (figure 1a) [3,12,13]. Since the discovery of pTyr nearly three decades ago [14], pTyr signalling has been implicated in almost every aspect of normal cellular function and found to be a major driver in human malignancies such as cancer [15,16]. Tyrosine phosphorylation encompasses a small percentage (up to 2 per cent) of the total protein phosphorylation within a mammalian cell, although a list of over 10 000 putative pTyr sites have been identified so far across various tissues and species [17,18]. The discovery of the SH2 domain [19– 21] led to numerous Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2558 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling studies mapping interactions and SH2 domain selectivity that have led to a growing understanding of specificity of pTyr signalling (reviewed in earlier studies [3,4,22]). SH2 and PTB domains determine selectivity for pTyr ligands by recognizing pTyr and residues surrounding the pTyr (figure 1a). The preference for þ1 to þ5 residues C-terminus of the pTyr in a sequence-dependent manner is dictated by the surface residues on the SH2 domain that lie adjacent to the pTyr-binding pocket. In contrast, PTB domains recognize sequences N-terminus to the pTyr with a core consensus of N-P-x-pY, yet only a small fraction of PTB domains are capable of binding pTyr peptides. An overwhelming preponderance of what we know about pTyr signalling is based on studies in human and mouse systems, with very little known about pTyr signalling in simpler organisms. Mammalian genomes and protein interaction maps provide one window into the role of pTyr signalling in the development of complex organisms. Indeed, a study of human PTK circuits using prediction networks to identify PTKs, SH2 and PTPs suggests an evolving paradigm of pTyr signalling and evolution [23]. In this review, we describe the evolutionary provenance and trajectory of aspects of the cellular pTyr circuitry. The available pTyr machinery in various extant organisms suggests that pTyr use transitioned from a nuclear and cytoplasmic system in early unicellular eukaryotes to signalling at the cellular membrane through use of receptor tyrosine kinases (RTKs). This evolutionary analysis of pTyr signalling has implications for the involvement of PTKs and pTyr signalling in human disease such as cancer [23]. The spatio-temporal regulation of tyrosine phosphorylation is necessarily complex in mammalian cells. The interplay between the substrate tyrosines and the array of available PTKs and PTPs governs the timing, localization and longevity of phosphorylation events. PTKs and PTPs commonly have somewhat limited substrate specificity for peptide ligands. They are, however, tightly controlled in both their activation and spatial localization. The result is that, as a general paradigm, PTKs and PTPs control the spatio-temporal localization of phosphorylation events but have limited deterministic control over downstream events. Specificity in pTyr signalling is thus largely determined by recognition modules, such as SH2 domains, that distinguish specific tyrosine phosphorylated motifs and act to nucleate complex formation to propagate signalling. Further complexity arises from tissue- and cell-specific expression profiles and dynamic sub-cellular localization of reader proteins that may play a role in the cellular context of protein phosphorylation. Therefore, dissecting the temporal and spatial organization of dynamic protein interaction networks is an essential element in understanding information flow through the signalling system [24]. Transgenic animal models give a first approximation of the role of a protein in a signalling network. Over 80 of the 111 SH2 proteins have been genetically disrupted in mice, revealing a broad range of cellular functions for SH2 domains in various tissues and cell types [3]. However, in order to extrapolate the multiple biological functions of these proteins and understand the cellular context by which these proteins regulate cellular Phil. Trans. R. Soc. B (2012) activity, understanding of the PTKs, PTPs and potential interaction networks within specific cell types or tissues is required. This already challenging problem is compounded by the copious variety of tissues and cell states. Further complexity arises when multiple members of a closely related gene family are present where some extent of overlapping or redundant functions coexist with specialized roles. For instance, in the GRB2 family (Grb2, Gads and Grap), Grb2 is ubiquitously expressed and plays an essential role in growth factor signalling downstream of RTKs. Gads and Grap, by contrast, are highly restricted in their expression to a subset of immune cells and appear to function primarily in signalling downstream of the T-cell and B-cell receptors. Loss of Grb2 results in embryonic lethality [25], whereas loss of Gads or Grap leads to rather specific immune phenotypes [26,27]. While the three encoded genes are closely related, they possess distinct roles in their specific cellular context. This is not always the case for all families, suggesting that some members have redundant roles while some individual genes within a family have a ubiquitous and essential role in development [3]. We discuss some of the evolutionary patterns that drive divergence and complexity in SH2-domain-containing proteins in §6. 3. IDENTIFICATION AND CLASSIFICATION OF SH2 DOMAINS Large numbers of whole genome sequences, combined with emerging bioinformatics tools, present the opportunity to examine signalling systems across a broad range of organisms. Modular domain families are identified from sequence datasets using hidden Markov models and multiple sequence alignments tools such as those developed by the protein families database (Pfam) [28], simple modular architecture tool (SMART) [29] and conserved domains database (CDD) [30]. Soon after the availability of their annotated genomes, the human and mouse complements of all the kinases, including the PTKs, as well as the PTPs, were reported [12,13,31,32]. This was followed by the identification of 111 SH2 domain proteins in the human and mouse genomes [3] and later by the total complement of PTKs and SH2 encoding genes across multiple eukaryotic organisms (figure 1b) [8,10,11]. The 111 SH2-containing proteins contain a diverse array of additional protein interaction domains (e.g. SH3, PTB, PH) and catalytic domains (e.g. TK, phosphatase) that are gained and lost over time to generate diversity in signalling functions [3,33]. With over 150 eukaryotic genomes sequenced till date, bioinformatic approaches are required to identify protein domains and trace paralogues and orthologues across these diverse organisms. By examination of SH2 domains across 21 extant species in the Eukaryota we identified 38 conserved families of SH2 domains using a hierarchical method combining sequence homology, domain organization and positions of splice-site patterns [3,8]. Such a hierarchical method was necessary to avoid improper classification of a domain family based on sequence homology alone. Analysis of splice-site position within the open reading frame provides an easily accessible hallmark Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling B. A. Liu and P. D. Nash 2559 Table 1. SH2 domain families. family human gene names domains (in order) pseudogenes (yes/no)a ABL Abl1, Abl2 no CBL Cbl, CblB, CblC CHN CRK CSK DAPP1 FPS FRK GRB2 GRB7 JAK NCK PI3KR PLCg Chn1, Chn2 Crk, CrkL Csk, Matk Dapp1 Fes, Fer Frk, Brk, Srms Grb2, Gads, Grap Grb7, Grb10, Grb14 Tyk2, Jak1, Jak2, Jak3 Nck1, Nck2 Pik3r1, Pik3r2, Pik3r3 Plcg1, Plcg2 PTPN RASA1 RIN SH2B SH2D1 SH2D2 SH2D3 SH2D4 SH2D5 SH3BP2 SHB SHC SHIP SLAP SLP76 SOCS Ptpn6, Ptpn11 Rasa1 Rin1, Rin2, Rin3 Aps, Lnk, Sh2b Sh2d1a, Sh2d1b Sh2d2a, Hsh2, Sh2d7 Sh2d3a, Sh2d3c, Bcar3 Sh2d4a, Sh2d4b Sh2d5 Sh3bp2 Shb, Shd, She, Shf Shc1, Shc2, Shc3, Shc4 Ship1, Ship2 Slap, Slap2 Slnk, Blnk, Slp76, Mist Socs1, Socs2, Socs3, Socs4, Socs5, Socs6, Socs7, Cish Src, Fyn, Lck, Fgr, Yes, Lyn, Hck, Blk Bks, Brdg1 Stat1, Stat2, Stat3, Stat4, Stat5a, Stat5b, Stat6 Supt6h Zap70, Syk Bmx, Tec, Btk, Itk, Txk Tns1, Tns3, Tenc1, Tns4 Vav1, Vav2, Vav3 SH3, SH2, TK , F-actin binding domain EF-Hand, SH2, 4-helix bundle, RING, UBAb SH2, C1, RhoGAP SH2, SH3, SH3 SH3, SH2, TK SH2, PH F-Bar, SH2, TK SH3, SH2, TK SH3, SH2, SH3 RA, PH, SH2 FERM, SH2, TK SH3, SH3, SH3, SH2 SH3b, RhoGAPb, SH2, SH2 PH, PLCXc, PH, SH2, SH2, SH3, PLCYc, C2 SH2, SH2, phosphatase SH2, SH3, SH2, PH, C2, RasGAP SH2, VPS9, RA PH, SH2 SH2 SH2 SH2, RasGEF SH2 PTB, SH2 PH, SH2 SH2 PTB, SH2 SH2, phosphatase SH3, SH2 SAMb, SH2 SH2, Socs SRC STAP STAT SPT6 SYK TEC TNS VAV no no no no no no no no no no no no no yes no no no yes no no no no no no yes no no no yes SH3, SH2, TK yes PH, SH2 DNA BD, SH2, TA no no CSZ, S1, SH2 SH2, SH2, TK PH, BTK, SH3, SH2, TK PTPc, SH2, PTB CH, RhoGEF, PH, C1, SH3, SH2, SH3 no no no no no a Human pseudogenes identified using the Pseudofam database (http://pseudofam.pseudogene.org/) [35]; bdomains that are not shared among all family members. For complete domain descriptions visit http://pawsonlab.mshri.on.ca or http://www.sh2domain.org. of conserved genomic structure that can assist in accurate classification of a family [8,34]. This provided a methodology that could trace families as well as individual proteins across lineages and species. Domain families assist in defining orthologous and paralogous proteins across the 21 different eukaryotes. This in turn provides insights into the evolutionary mechanisms that drive diversification within these families that create new proteins with both shared and novel functions. The 38 families of SH2 domains in humans encode a heterogeneous range of proteins that connect pTyr signalling to a diverse array of cellular systems. Many families have distinctive domain architectures that place the SH2 domain in the context of other modular protein domains such as SH3, PH, kinase, phosphatase, RING, PTB, Phil. Trans. R. Soc. B (2012) RasGAP, RasGEF, FERM and others (table 1) [3,33]. The various mechanisms by which diversity appears to have arisen within individual families will be examined in detail in later sections. Examination of these families across a broad range of eukaryotes suggests points of origin for each SH2 family at common ancestors representing branch points that separate specific lineages. The role of each SH2 protein and selective advantage conferred upon first emergence is rarely clear and presents a major challenge for understanding the core cellular functions of primordial SH2 domain proteins. Sequence analysis and functional studies provide a means to trace the evolutionary history of SH2-mediated pTyr signalling in metazoans. Examination of the coevolution of PTKs and SH2 domain proteins may provide insights into their role in the development of Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2560 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling (a) C-SH2 N-SH2 C-SH2 Dictyostelium N-SH2 mammals (b) Cbl DIF-1 TK? JAK STATc STATc STATc PTP-3 STAT1 STAT1 STAT1 TCPTP DIF-1 CblA Figure 2. Primordial function of SH2 domains and pTyr signalling. (a) The ribbon structure of the yeast Spt6 tandem SH2 (left panel, PBD: 3PJP). The charged surface of the tandem SH2 (right) indicates the positions of negative (red) and positive charge (blue) of the tandem SH2 domain. Highlighted in the yellow circle is the phospho-binding pocket for pSer. (b) The SH2 domain transcriptional activator STAT (red) is present as early as Dictyostelium and found in humans. Analogous to mammalian STAT signalling, STATc is regulated by tyrosine phosphorylation (yellow dot) through PTKs (green) and PTPs (blue). In Dictyostelium, the SH2/RING domain containing protein CblA (orange) antagonizes the phosphastase, while in mammals Cbl inhibits the PTK Jak. multicellularity and subsequent leaps in complexity of tissues and development of complex systems such as the immune system and the mammalian brain (see §5). 4. SH2 DOMAINS PREDATE DEDICATED PROTEIN-TYROSINE KINASES Within Eukaryota, there exist two major divisions: the bikonts (eukaryotic organisms derived from cells with two flagella) and unikonts (organisms derived from cells with one flagella). SH2 domains were previously described in unikont organisms such as humans, yeast, amoeba and choanoflagellates [10,11,36]. Recent analysis of SH2 domains across 21 eukaryotic genomes revealed the presence of SH2 domains in both divisions [8], including the unicellular organisms Naegleria gruberi and Phytophthora capsici, and thus SH2 domains probably predate the emergence of PTKs [37]. A common SH2 domain protein, Spt6, is found in both bikonts and unikonts and is universally present in metazoans. A handful of other SH2 domains are observed in certain bikont lineages, but it remains unclear whether any of these are functional and capable of binding pTyr [8]. Spt6 was first described in the budding yeast Saccharomyces cerevisiae Phil. Trans. R. Soc. B (2012) [36] as an elongation factor, a histone H3 – H4 chaperone that mediates nucleosome assembly and interacts with RNA polymerase II (RNAPII) [38,39]. The SH2 domain of Spt6 is reportedly capable of recognizing the phosphorylated serines (pSer) of RNAPII C-terminal tail (CTD) [40], with no detectable binding to pTyr peptides. Structural studies indicate that Spt6 possesses a tandem SH2 fold [41,42] (figure 2a). Only the first SH2 domain of Spt6 is recognizable as a canonical SH2 domain by sequence analysis. The second SH2 fold is highly atypical, lacking the FLVR (Phe-Leu-Val-Arg) motif for phosphobinding, and is not detected using standard SH2 domain models, such as SMART or Pfam. Both SH2 domains are necessary for this region to recognize pSer residues on the CTD of RNAPII [40]. The structure of the yeast Spt6 N-SH2 domain possesses many sequence and structural features that clearly resemble pTyr-binding SH2 domains in mammals [41,43 – 45]. Binding to pSer/pThr peptides uses the same canonical binding pocket as canonical SH2 domains use to bind pTyr peptides [45]. Based on sequence and structure, it appears likely that the SH2 domain of Spt6 represents an ancestral pSer/pThr-binding domain that may have given rise to pTyr-binding SH2 domains Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling somewhere around the divergence between Unikonta and Bikonta. The Spt6 SH2 remains well conserved from yeast to humans. Most branches of Unikonta, including Amoebozoa, Choanozoa and Animalia, contain diverse sets of SH2 domains, suggesting that pTyr-binding SH2 domains developed very early in the unikont lineage. In members of the Amoebozoa lineage, such as the Entamoebae and Mycetozoa, SH2 domains expand into an independent set of proteins with relatively few obvious homologues in Metazoa (e.g. Shk, LrrB) [46,47]. This is despite the absence of dedicated PTKs that existed within these species. It is not immediately clear how pTyr signalling networks are constructed within these organisms. Some Ser/Thr kinases, such as dual specificity MAPKs [48], as well as other unique PTKs such as DPYK [49], are capable of phosphorylating tyrosine residues. Such non-traditional tyrosine kinases may explain the expansion and maintenance of SH2 domain proteins in Amoebozoa such as Dictyostelium discoideum. It is also in Amoebozoa that we first encounter SH2 domains linked to a dual-specificity kinase, the Ser/ Thr/Tyr kinase Shk [50]. Despite the absence of PTKs, tyrosine phosphorylation has been reported in D. discoideum and can be observed in the phosphorylation of the C-terminus of STATc [51,52] (figure 2b). When STATc is tyrosine phosphorylated, it forms a homodimer through reciprocal SH2-domain– pTyr interactions and then accumulates in the nucleus, functioning in a manner analogous to the mammalian STAT orthologues. Amoebozoa, Choanozoa and Metazoa all encode SH2-domain-containing Stat proteins. In addition, D. discoideum contains a Cbl-like pTyr-binding region, comprising an EF-hand and an SH2 domain, linked to an E3 ubiquitin-ligase RING domain. D. discoideum CblA functions as a positive regulator of STATc by downregulating a tyrosine phosphatase, whereas metazoan CBL proteins negatively regulate tyrosine kinases [52]. Because D. discoideum diverged from the lineage that gave rise to Metazoa before Fungi but after the Bikonta/Unikonta split, it remains unclear why STATs, CBL and other SH2 domain proteins were apparently lost in Fungi [53]. In addition to the SH2 domain proteins, 24 kinase subfamilies shared between D. discoideum and Metazoa are absent in yeast, suggesting that yeast developed a specialized biological programme that does not require many of the kinase signalling networks used in other unikont branches [54]. In fact, the progenitor to the yeasts may have used pTyr signalling, but, as Gerard Manning’s analysis suggests, the PTKs and some other kinase families were lost as the yeasts evolved leaving a few PTP/DSPs (dual specificity phosphatases). Fungi, which together with Metazoa are in the opisthokont branch of Unikonta, lack any apparent pTyr-binding SH2 domains other than the pSer-binding SPT6. pTyr-binding SH2 domains were lost from Fungi, perhaps as a result of deleterious outcomes from the occurrence of pTyr. This is supported by lethal effects of expression of the PTK v-Src in S. cerevisiae that are associated with loss of cell cycle control [55,56]. The ability of v-Src to kill yeast cells is dependent upon its SH2 domain Phil. Trans. R. Soc. B (2012) B. A. Liu and P. D. Nash 2561 as an active kinase alone has little effect on yeast growth [57,58]. This suggests that pTyr-mediated signalling may have been negatively selected out of the Fungi common ancestor around 1200 Ma [59,60] and that functional use of pTyr and SH2 domains may predate dedicated PTKs [47]. Tyrosine phosphorylation and tyrosine kinase signalling is thought to occur primarily in the cytoplasm, particularly at the plasma membrane, in metazoans. Upon ligand engagement or cell contact, activated RTKs phosphorylate their own cytoplasmic tails as well as membrane-localized substrate proteins. SH2 domain proteins are then recruited to activated signalling complexes at the plasma membrane or on signalling endosomes to propagate downstream signalling. In this paradigm, much of the PTK–SH2 signalling is cytoplasmic in nature with only limited examples of nuclear involvement (however, more and more nuclear pTyr proteins are being reported, although admittedly this does not necessarily tell us where in the cells these proteins were phosphorylated or whether they have nuclear SH2-domain-binding partners). In contrast, primordial SH2 domain-containing proteins, SPT6 and STAT, function in the nuclear compartment of the cell. There was probably significant nuclear pTyr signalling in early unicellular eukaryotes that remains a feature of various lineages. This is supported by several lines of evidence that tyrosine phosphorylation plays a role in the nucleus. Tyrosine phosphorylation of nuclear cell cycle regulators such as Cdc28 and Cdc2 is observed in mammalian and yeast cells [61–63]. Staining for pTyr in the unicellular organism Giardia lamblia indicates enrichment in the nuclei that is distinct in comparison to pSer/Thr staining [64]. This suggests that in early unicellular eukaryotes, pTyr signalling may have been more central to the nuclei with concurrent evolution of SH2 domains that may mediate nuclear functions such as DNA transcription (e.g. Stat proteins). As cytoplasmic tyrosine kinases (CTKs) emerged, pTyr signalling may have become more extensively used for cytoplasmic functions, eventually becoming focused at the plasma membrane with the emergence of RTKs. Extensive RTK signalling mediated by SH2 domains is itself a hallmark of multicellularity in Metazoa and represents a key stepping stone in the emergence of multicellularity in the animal lineage [23]. Early in the unikont lineage, SH2 domains appear that have the ability to recognize pTyr-based motifs. This is a key feature allowing rapid evolution of pTyr signalling by shuffling SH2 domains into various proteins [1]. In pre-metazoans, specifically the choanoflagellate lineage, diversification of SH2 domains and PTKs is tightly linked, suggesting that they coevolved to create diverse functional signalling systems. Within the unikont lineages, SH2 domain expansion may be partially driven by the kinases that generate the binding partners for SH2 domains through phosphorylation. True PTKs first appeared in the unicellular choanoflagellate Monosiga brevicollis [65], but are absent in amoeba and slime moulds [54]. The choanoflagellate branch, which includes M. brevicollis, encodes the core set of PTKs, PTPs and SH2 domains [10,11] and thus exhibits a functional pTyr signalling Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2562 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling phylum 38 +1 +5 +9 –2 –5 –6 +20 Opisthokonta +1 –2 Unikonta +3 Homo sapiens Chordata +2 –1 37 –3 35 Urochordata +1 –4 35 Echinodermata SH2 families Amoebozoa –3 genus species Danio rerio Ciona intestinalis Strongylocentrotus purpuratus 27 Arthropoda 25 Nematoda Drosophila melanogaster Caenorhabditis elegans 24 Cnidaria Nematostella vectensis 21 Choanozoa 1 Ascomycota 3 Mycetozoa Monosiga brevicollis 0 Saccharomyces cerevisiae Dictyostelium dictyostelium Entamoebae Entamoeba histolytica phylum genus species other eukaryotes 29 Homo sapiens Chordata tyrosine kinase families +1 –2 +3 –1 +5 +12 +6 Opisthokonta Unikonta Amoebozoa –1 –5 29 Danio rerio 27 Ciona intestinalis Urochordata 25 Echinodermata 22 Strongylocentrotus purpuratus Arthropoda 18 Nematoda 18 Cnidaria Drosophila melanogaster Caenorhabditis elegans 6 Choanozoa 0 Ascomycota 0 Mycetozoa 0 Entamoebae Monosiga brevicollis Nematostella vectensis Saccharomyces cerevisiae Dictyostelium dictyostelium Entamoeba histolytica other eukaryotes Figure 3. The gain and loss of SH2 and PTK families in Eukaryota. The eukaryotic tree is shown with the phylum and genus species listed on the right. The number of SH2/PTK families are listed in black text with the gain and loss indicated in grey text. The 38 SH2 and 29 PTK families represent those that are found in humans and are present in other lower eukaryotes [8,10,34]. network reminiscent of metazoans. A total of 19 conserved SH2 families and six conserved PTK families are found in M. brevicollis, several of which contain both an SH2 and a TK domain (figure 3). The heterogeneity of the M. brevicollis pTyr system suggests that SH2 proteins may have cellular roles that diverge somewhat from their mammalian counterparts. For instance, the CTK MbCsk is co-expressed with and can phosphorylate MbSrc1 at tyrosine near the C-terminus in a manner similar to that of the mammalian Src and Csk pair. However, the negative regulation of Src through phosphorylation of this residue is absent from the M. brevicollis pair, suggesting that allosteric regulation of Src developed more recently in the metazoan lineage [66,67]. The presence of an extensive and varied collection of pTyr signalling molecules suggests that M. brevicollis depends upon pTyr to mediate a multitude of cellular functions, most of Phil. Trans. R. Soc. B (2012) which we have yet to understand. It will be fascinating to see what roles unique SH2 proteins in M. brevicollis serve in coordinating organism-specific pTyr signalling. 5. EMERGENCE OF PROTEIN-TYROSINE KINASE AND SH2 DOMAIN FAMILIES FOR CELLULAR COMPLEXITY Throughout the evolution of Metazoa, new SH2 and PTK families appear while others are occasionally lost (figure 4). The timing of the appearance of new families may indicate the evolution of specific signalling networks that support an adaptive advantage. Phenotypic complexity and complex developmental programmes are hallmarks of metazoan evolution. Numerous SH2 families emerge in relatively simple eukaryotes such as Choanozoa and Cnidaria. It remains unclear what the specific cellular functions Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling SH2D5 SLAP SH2D1 AXL AATYK TIE TRK STAP JAK SLP76 MUSK SHB ALK EGFR MET SOCS SH2D3 SH2B NCK GRB7 DAPP1 CHN ACK1 FAK NOK1 PTK7 ROR RYK VEGFR DDR INSR FPS FRK SYK VAV TNS SHIP SHC SH3BP2 SH2D4 SH2D2 RIN RASA1 PTPN PLCG PIK3R GRB2 CRK TEC SRC CSK ABL EPHR FGFR CBL STAT SPT6 PTB SH2 2563 PTB SH2 SH3 SH2 IG IG FN3 T M T M FN3 SH3 Y kinase Y kinase E G F E G F IG Cys-Rich PH T M Y kinase T M Y kinase FN3 IG Cys-Rich IG IG E G F SH2 FERM STY kinase SH2 SAM Y kinase SH2 IG IG IG SH2 T M kringle Cys-Rich Y kinase SH2 Y kinase LDLa FU FU FU SEMA IG PSI IG IG T M Y kinase T M Y kinase T M Y kinase SH2 SOCS phosphatase RasGEF Y kinase SH2 PH SH3 SH3 FN3 RasGAP SH2 SH3 RA RasGEF SH2 PH IG SH2 PH SH2 C1 SH2 T M RhoGAP CDC42 BD Y kinase Y kinase IG IG IG IG Y kinase IG IG Cys-Rich IG IG T M T M Y kinase T M Y kinase T M Y kinase IG IG Kringle T M IG kringle FAT T M FERM Y kinase IG IG Y Kinase PH FN3 Cys-Rich Y kinase Y kinase RhoGAP Y kinase SH2 SH2 T M FN3 SH2 F-Bar SH3 Cys-Rich F-Bar Y kinase SH2 CH RhoGEF C1 PTPc DSPs C1 PH SH2 SH3 SH3 C1 SH2 phosphatase SH2 PTB C2 SH2 PH SH2 SH2 LDLa SH2 VPS9 SH2 SH2 SH2 SH3 SH2 SH3 C2 PTPc DSPs RasGAP phosphatase SOCS PLCXc PH SH2 PH SH2 SH2 RhoGAP SH3 RA PH SH2 SH3 PH PLCYc SH2 CDC42 BD SH3 SH2 SH2 SH3 PH BTK SH3 FU SH3 Y kinase SH2 SH3 SH2 Y kinase SH3 SH2 Y kinase DNA BD SH3 Y kinase SH2 IG IG 4H IG 4H FN3 Cys-rich IG T M FN3 T M Y kinase Y kinase EF EF SH2 RING DNA BD TEX SH2 RING TA S1 SH2 ns H. sapie ulus M. musc nstica M. dome alis X. tropic D. rerio nalis C. intesti s ratu S. purpu r nogaste D. mela ti A. aegyp ns C. elega nsis N. vecte ollis M. brevic isiae S. cerev ostelium D. dicty um ure D. purp TEX S1 TA CH Figure 4. Emergence and expansion of SH2 and PTK families. The presence and absence of conserved families of SH2 and PTK proteins is indicated for representative species covering the unikont branch from social amoeba/slime mould Dictyostelium species to H. sapiens. The families of SH2 and PTK were identified previously [8,10,34]. Text colour indicates SH2 families that lack a TK domain (black), RTK families (orange), CTKs containing an SH2 domain (blue) and CTKs that lack an SH2 domain (green). The general domain organization of each family is shown on the right. these proteins serve in primordial cellular signalling. Certain SH2 families arise at distinct stages coincident with the development of multicellularity. For example, Phil. Trans. R. Soc. B (2012) establishment of cell polarity in cnidarians probably depended upon the appearance of specific signalling pathways such as the canonical Wnt and b-catenin Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2564 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling pathway [68,69]. Alongside appeared several SH2 domain families, including NCK and SOCS. Both Nck and Socs7 play an important role in regulating cell polarity through Septins in mammals [70], which suggests that they may have coevolved with the polarity complexes that emerge in cnidarians [71]. Further hints of the emerging programmes guiding multicellularity can been seen in the families of PTKs that are identified in the cnidarian Nematostella vectensis. Only two families of RTKs are present in M. brevicollis, whereas nine additional families arise in N. vectensis. This correlates with studies by Li et al. [23] suggesting that early pTyr signalling was primarily mediated by CTK. In cnidarians, additional RTK families (RYK, ROR, VEGR and others) promote inter-cellular communication for cell – cell contact and cell adhesion (figure 4). The coincident emergence of FAK, a PTK crucially involved in regulating focal adhesions critical for maintaining cell contact [72,73], during this lineage further supports the development of this complex cellular process. In arthropods, complex tissues and organs begin to emerge, such as the muscle, heart and vascular system as well as the innate immune system [74]. The emergence of the muscle-specific tyrosine kinase in arthropods is coincident with the development of a muscular system within this lineage (figure 4). In addition to the emergence of new proteins and protein families, protein interaction networks can be seen to be coevolving in parallel. For example, Shc and Grb2 are both observed in N. vectensis (figure 4). An interaction between Shc and Grb2 is first seen in arthropods with the insertion of a pTyr site within the CH1 region of Shc [8]. Phosphorylation of this site allows for interaction with the SH2 domain of Grb2. In mammals, this interaction is supported by multiple pTyr sites in the products of four distinct Shc genes (Shc1, 2, 3, 4) expressing various isoforms in a wide variety of tissues [75]. Furthermore, elimination of these pTyr sites in the Shc1 gene in mice leads to a muscle motor stretch reflex defect, specifically a decreased number of muscle spindles of the skeletal muscle sensory organs that regulate motor behaviour [76]. This highlights the various processes for emerging novel protein circuits through evolving not just novel protein families but also bringing together the pTyr circuit through evolving pTyr motifs and SH2 domains in a manner that may have promoted the development of tissue-specialized signalling. Additional analysis regarding evolution of pTyr signalling networks will be discussed in a later section. Although many proteins emerge at distinct points in evolution, mutation, adaptation and selection underlie a process capable of developing new interactions that drive the creation of novel signalling networks. Functional interactions in one species do not, therefore, necessarily represent primordial protein functions. Studies in mammalian systems suggest that Grb2 associates with Shc and epidermal growth factor receptor (EGFR), whereas Crk interacts with the CTK c-Abl and thus plays essential roles in these signalling networks [77– 79]. Yet, these functions may be later adaptations and signify the role that these proteins served in common ancestor species or the role that they serve in other lineages. Both Grb2 Phil. Trans. R. Soc. B (2012) and Shc appear in M. brevicollis and are found across all metazoans, suggesting that they emerged in a common pre-metazoan ancestor species (figure 4). Grb2 associates with Shc through a pTyr-dependent interaction dependent upon the SH2 domain of Grb2 and the pY-x-N motifs present in the CH1 region of Shc. However, these pTyr-binding sites do not appear until arthropods such as Drosophila melanogaster, suggesting that the ancestral role for Grb2 and Shc were independent of this function, and that this is a later adaptation. Phenotypic evidence supports the assertion that the Grb2 – Shc – pTyr-mediated interaction is indeed functionally important in signalling events in more complex multicellular metazoans and in particular for signalling networks that underlie the development of neuromuscular systems. In a similar manner, the Jak-Stat signalling network is extensively used in adaptive immune signalling in mammals [80] and gene disruptions in this network often exhibit specific defects in immunological signalling. Yet, both Jak and Stat proteins predate the adaptive immune system and appear to have arisen at different points in evolution. This is consistent with evolution proceeding by rewiring existing components when adaptive pressure drives the development of more complex systems. Thus, the ancestral function of proteins can either change or become obscured by observed functions in wellstudied mammalian organisms. Shc1, for instance, serves essential functions in mammals distinct from the pTyr-mediated interaction with Grb2. Targeted disruption of the Shc1 gene is embryonic lethal [81], while disruption of the pTyr sites that allow Grb2 binding has a relatively specific function later in development [76]. The essential function of Shc1 may well relate to the ancestral function of the protein in singlecelled pre-metazoan species. Evolution in this case has resulted in the use of existing components to build new circuits and connections that underlie novel signalling pathways and add additional levels of complexity. 6. DIVERSIFICATION OF SH2 DOMAIN PROTEINS THROUGH GENE DUPLICATION Over 30 years ago, Ohno [82] elucidated the potential for gene duplication to drive diversification, but it was not until recently that this concept has become widespread and pivotal in understanding the evolution of gene families and functional diversification. Approximately 65 per cent of human genes are duplicated, with 33 of 38 SH2 and 25 of 29 PTK families possessing duplicate copies. Gene duplication provides the opportunity for the duplicate gene to explore an otherwise prohibited mutational space. Freed from the pressure to maintain essential function the newly duplicate gene is free to wander, on condition that the duplicate genes collectively continue to contribute their original function. In the case of SH2 domain proteins, very few new families arise at the split between arthropods and vertebrates (figure 3). Instead, gene duplication appears to function by providing an opportunity to build complexity out of existing components. The expansion and divergence of SH2 domains and pTyr signalling components Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling (a) B. A. Liu and P. D. Nash 2565 duplication duplication eliminated fixation subfunctionalization (eg. VAV, NCK) neofunctionalization (eg. SLAP, SAP, GRB2) degradation (pseudogenes eg Grb2-ps1) (b) loss or gain of domains tissue specialization transcriptional regulation evolving domain specificity loss or gain of linear motifs Figure 5. Fates of duplicate genes. (a) When a new duplicate arises it is eliminated in a majority of cases as fixation in a population is rare event. After fixation, the gene can undergo three general fates that include: neofunctionalization, where a duplicate copy acquires a novel function; non-functionalization of a gene through mutation of its promoter and/or mutation of its function creating a pseudogene by inactivation; and subfunctionalization that divides the primary ancestral function between the duplicate copies possibly across distinct tissue types. (b) Duplicate copies of genes can diverge through means of evolving novel protein domain specificity for ligands, gain or loss of a protein domain, insertion or deletion of short linear motifs and lastly an alteration in the promoter of the gene allowing for specialized tissue expression and transcription control. provides a rich case study for how organisms and genomes gain complexity by gene duplication followed by functional divergence [83]. To understand how gene duplicates give rise to diversification, let us consider the general mechanisms and consequences. When a gene is duplicated, it faces a number of hurdles that must be overcome for it to be fixed into the genome of a species. If a duplicate gene confers no selective advantage, it is likely to be lost or mutated. In order for a gene duplicate to be observed in an extant species, it must be first fixed within the population, and second it must be preserved over time [83,84] (figure 5a). Three fates are typically envisaged for a gene duplicate. One fate, called nonfunctionalization, refers to the loss of all functionality of a gene. This typically leaves behind a pseudogene wherein the duplicate gene accumulates mutations that destroy its function to the point at which it can no longer be transcribed or translated. The SH2-homology pseudogene Grb2-ps1 is present within the mouse genome but is absent in humans, Phil. Trans. R. Soc. B (2012) yet no transcript can be identified and the sequence is recognizably that of a pseudogene. At least 16 human SH2 pseudogenes can be identified using the Pseudofam database (http://pseudofam.pseudogene. org/) [35] (table 1). A second fate results when an advantageous allele arises as a result of one of the gene copies gaining a new function. This is referred to as neofunctionalization. For example, the SLAP family appears to have originated from a duplication of a SRC family gene. The loss of the TK domain in SLAP allowed this duplicate to acquire a new function distinct from its ancestor. Similarly, SAP (SH2D1A), arose from a duplication of SHIP, and lacks the phosphatase domain. Neofunctionalization is more likely to occur in genes that are necessarily rapid evolving such as those involved in immunity and host defence [85]. Lastly, the original functions of the single-copy gene may be partitioned between the duplicates. This is known as subfunctionalization. Biological function suggests that many gene products perform a multiplicity of subtly distinct functions and thus selective Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2566 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling pressures may result in a compromise between optimal sequences for each role. Partitioning these functions between the duplicates may increase the fitness of the organism by removing the conflict between two or more functions. For example, the families of NCK and VAV each demonstrate subfunctionalization. Loss of either Nck1 or Nck2 in mice result in only subtle phenotypic changes [86 –88], whereas loss of both alleles results in embryonic lethality. This is indicative of overlapping and mutually compensatory functions [89]. In a similar vein, loss of individual genes within the VAV family causes no obvious phenotypes, whereas loss of all members of the family results in a severe defect in lymphocyte development [90]. Subfunctionalization can also enable temporal and spatial partitioning of expression, protein localization and assembly of signalling complexes. The driving force during evolution for preserving a duplicate of an existing gene is to gain a selective advantage. Gene duplicates must confer an immediate selective advantage (or at least not a selective disadvantage) and this is clearly the case if elevating the level of protein expression (dosage) is advantageous. The dosage balance involves changes in protein concentrations relative to those of potential binding partners, where an imbalance in expression levels may potentially result in deleterious effects such as improper protein complex assembly. In order to provide balance in the system, duplicate genes may undergo various paths for divergence from its ancestor such as point mutations, gene fusions, truncations, deletions and gene conversions. Mutation of a duplicate gene can arise through multiple avenues that allow for divergence in a manner that includes alterations in gene expression patterns through promoter mutations, changes in ligand specificity, gain/loss of protein domains and insertion/ deletions of short linear motifs (figure 5b). Many factors influence the number and divergence patterns for duplicate genes. For example, protein–protein interactions of the ancestral gene can influence the number of duplicates. The more highly connected and more centralized proteins found in humans are usually encoded by duplicated genes, which favour the functional diversification of paralogues [91]. This feature is unique to vertebrates following whole genome duplication (WGD), as gene dosage, in these cases, is commonly controlled through tissue-selective expression and miRNA regulation. High connectivity may favour the functional diversification of paralogues through tissue specialization or other mechanisms [92,93]. A strong correlation (r ¼ 0.93) of the number of cell types with the number of SH2 domains suggests that duplication may have driven tissue-specific expression enabling specialized tissues and developmental programmes to evolve [94]. Dosage balance may act as an intermediate step to neo- and subfunctionalization, prolonging the retention of the duplicates before one of the other mechanisms determines the ultimate fate of the duplicates. This highlights the important role of functional divergence of duplicate genes in optimizing pTyr networks for specialized tissues and creating novel signalling pathways. Examination of the 38 conserved SH2 families reveals 33 families with at least one gene duplicate in Phil. Trans. R. Soc. B (2012) the human genome. Within these families, there are notable examples of some of the mechanisms driving divergence between duplicate copies. Duplication can lead to loss or gain of domains, often at the ends (N-terminus or C-terminus) of the protein [95]. In addition, duplicate genes can diverge in sequence to acquire novel binding motifs either within the SH2 domain or elsewhere within the protein (discussed in detail in §7). Five generalized classes have been described for the divergence of SH2 families as a result of duplication events combined with domain loss or gain [8]. Gene duplication can simply result in multiple members of a family, though the size of the family can vary substantially with the SRC family having eight functional genes in humans. The heterogeneity in family size likely relates to heterogeneity in both the gain and loss of duplicate genes between families, though it is not fully understood what evolutionary forces determine the pattern of gene retention. In some cases, a novel domain may be inserted or lost such as a SAM domain in SLP76 or the UBA domain in Cbl-C, respectively. Some families arise from neofunctionalization, where the gene family undergoes a change such as a loss of catalytic domain, as in the case of SH2D1 and SLAP, followed by an additional level of duplication for subfunctionalization. Lastly, several SH2 families such as RASA1 and SH3BP2 remain single copy, suggesting that any duplications that arose did not undergo fixation or that the genes were inactivated through nonfunctionalization. These classes, while general in definition, encompass the mechanism for divergence by gene duplication. 7. PHOSPHOTYROSINE NETWORKS EVOLVE THROUGH SH2-DOMAIN-MEDIATED INTERACTIONS The pTyr signalling networks are important contributors to the evolution of organismal complexity in the unikont branch of Eukaryota, and particularly in Metazoa. Genome-wide loss of tyrosines in proteins across metazoans correlates with the expansion and utilization of pTyr as a novel PTM that carries significant functional importance in signal transduction [59]. Evolutionary forces have driven the assembly of novel regulatory pathways and networks in pTyr signalling over the course of metazoan evolution. The pTyr protein interaction network involves the physical interaction between a pTyr ligand and the recognition by domains such as SH2 and PTB. Existing interactions that provide a selective advantage must either coevolve or remain conserved through evolution. Novel interactions may evolve rapidly by integrating pTyr motifs recognized by SH2 domains into scaffolds and other SH2 domain proteins. The identification of both conserved and rapidly evolving phosphorylation sites may have important implications for understanding human diseases [96]. Understanding of the mechanisms driving the elaboration of pTyr networks by modification of SH2 domain specificity and the integration of pTyr signalling into diverse signalling pathways also have implications for synthetic biology approaches such as cellular rewiring and creating customized Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling circuits [97]. Within the SH2 domain, an invariant arginine residue within the second b-strand is necessary for coordinating the negatively charged phosphate of pTyr. Specificity is then determined through the interactions with pockets on the surface of the SH2 domain that recognize the residues primarily C-terminus to the pTyr, but in various cases extending from the 22 position up to the þ5 position in respect to the pTyr [98–100]. Loop regions of the SH2 fold play additional roles in ligand selectivity through their ability to block certain contact regions and limit ligand access to the binding surface [98]. Selectivity is not limited to direct contact of residues that impart binding energy, as longer range charge repulsion, steric clashes and entropic effects also play a role in selectivity. Interestingly, SH2 domains explore only a very limited sequence space in their potential ligands. Two large classes of SH2 domains exhibit a strong preference for either an asparagine residue at þ2 (e.g. pY-X-N-X) or a proline or aliphatic residue at þ3 (e.g. pY-X-X-P/L). Despite this limited apparent selectivity for permissive residues, necessary to impart the minimal energies required for binding, SH2 domains exhibit remarkable selectivity for physiological peptide ligands. The basis for this is the extensive use of non-permissive residues that distinguish the binding of SH2 domains that otherwise have very similar primary motifs for binding [101]. In this way, the Crk and Brk SH2 domains are able to distinguish ligands using non-permissive residues and sequence context [101]. In evolutionary terms, this implies remarkable flexibility in adaptive mechanisms. A newly arising SH2 domain may preserve the essential binding motif of its immediate ancestral paralogue, with adaptation allowing fine-tuning of ligands without major disruption to the primary binding surface. The use of non-permissive residues is inherently more flexible than permissive residues that require direct contacts and constrained geometry of binding pockets. Non-permissive residues can be accommodated in various ways, ranging from steric clashes within or adjacent to the binding groove, longer range charge-repulsion, or even entropic penalties associated with altering the conformation of the peptide upon binding [101]. The fidelity of pTyr signalling is in large part determined by the ability of a given SH2 domain to selectively recognize a specific set of pTyr ligands. This involves the interplay between the PTK/PTP, the SH2 domain and the pTyr ligand (figure 6a). In many cases, this means that over evolutionary distances, SH2 domains display conservation of ligand specificity, thereby maintaining core-binding partners whose ligand sequences also do not diverge significantly. Novel binding partners may still arise by insertion of pTyr motifs or mutations of sequences surrounding the pTyr to permit or abolish binding by specific SH2 domains. A complementary mechanism is for specificity to evolve through changes in the binding surface of an SH2 domain such that it recognizes different ligands than its ancestral relatives, allowing for novel interactions and re-wiring of the network. The obvious issue with altering the primary binding interface is that existing interactions will be lost, potentially with deleterious consequences for organismal fitness. Thus, large changes within the Phil. Trans. R. Soc. B (2012) B. A. Liu and P. D. Nash 2567 primary binding pocket are uncommon and the peptide binding groove is generally highly conserved among orthologous SH2 domains across various species [8]. Duplication events provide the possibility for more significant alterations, particularly if deleterious dosage effects provide the selection pressure to adopt alternative binding partners. More subtle alterations to the binding partner repertoire may be achieved with changes outside the primary contact region that impart restrictions on the binding partners, such as prohibiting a specific residue at a specific position. The refinement of non-permissive residues that can arise from changes outside the primary pocket can extend to distinguishing physicochemically related amino acid residues. Thus, the Brk SH2 domain treats a glutamic acid at þ1 as a permissive residue, whereas an aspartic acid is nonpermissive, the Crk SH2 binds to ligands with a þ3 proline or leucine residues but not to ligands containing the closely related isoleucine or valine residues, and Grb2 SH2 binds to ligands containing pY-EN-E but not pY-D-N-D, etc. [101] (figure 6b). Such subtlety in distinguishing binding by related SH2 domains is consistent with the notion that specificity of SH2 domains coevolves with its pool of available ligands to fine-tune specificity over time as the size and complexity of pTyr signalling networks increases. For SH2 families, such as GRB2 (consisting of Grb2, Gads and Grap), NCK (Nck1 and Nck2) and CRK (Crk and CrkL), the specificity pocket appears highly conserved from fly to humans [8] and these therefore share a common core set of binding peptides from fly to humans [101] (Nash laboratory 2012, unpublished data). In addition, insertion of novel binding sites within SH2 domain proteins is prevalent, particularly between arthropod and mammal, in a manner that serves to create more highly connected and potentially robust pTyr interaction networks. The Grb2 family encodes two SH3 domains flanking the central SH2 domain (table 1 and figure 6c). Examination of the conservation of the specificity pockets of the two SH3 domains reveals high conservation of the N-terminal SH3 domain pocket but much lower conservation of the C-terminal SH3 domain ligand binding surface, suggesting that this domain may have evolved to recognize novel binding partners that distinguish its function between arthropod and mammals. Indeed, the C-SH3 of the GRB2 family recognizes a very specific and atypical PxxxRxxKxP motif in which the invariant and essential residues are the RxxK [102 – 104]. This differs substantially from the canonical PxxP motif recognized by most SH3 domains, and suggests that mutations in the C-SH3 mutated to accommodate a novel binding modality that became fixed owing to selective interactions with a handful of proteins harbouring the extended RxxK motif. While these examples demonstrate both conservation and adaptations in specificity, certain SH2 domains appear to have lost the ability to recognize pTyr ligands at all. For example, individual members of the RIN (Rin2) and JAK (Tyk2) family harbour a mutation of the essential bB-Arg to a His residue, presumably abolishing pTyr ligand binding [3]. In Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2568 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling (a) ty ici cif ng d lvi ve vo ser e n ds co an lig spe evolving specificity conserved ligands ev ol ev ving ol s vi ng peci lig fici t an ds y loss of ligand binding (b) Brk P Y-E-x-P-E +1 +2 +3 +4 (c) Crk Grb2 P Y-D-x-P-R P Y-x-x-P-x +1 +2 +3 +4 P Y-E-N-E +1 +2 +3 +4 +1 +2 +3 P Y-D-N-D +1 +2 +3 GRB2 C P-x-x-P R-x-x-K pY-x-N P C-SH3 N-SH3 SH2 P-x-x-P N N-SH3 SH2 C-SH3 0 20 40 60 80 100 percentage conservation (%) Figure 6. Evolution of protein domain specificity. (a) The specificity of a protein interaction domain, such as an SH2 domain, can either change or remain conserved over time. Conservation of peptide ligand specificity pocket allows interactions to be conserved throughout evolution. Insertion of linear motifs and evolving novel binding sites can allow for an SH2 domain to recognize multiple binding partners. Specificity can also evolve for a domain, yet this can result in acquiring a completely new set of binding partners while the ancient partners are lost. Lastly, a domain can coevolve over time to maintain ancient binding partners but also evolve novel binding partners by evolving its peptide-binding pocket. A protein domain can become inactivated through mutations of its binding pocket such that it no longer can recognize any of its original ligands. (b) SH2 domains use sequence context and non-permissive residues to discriminate pTyr peptide ligands. This underlies a wider channel for information flow between the ligand and the SH2 domain. For example, the SH2 domains of Brk and Crk can each engage a subset of peptides containing a Pro or Leu at þ3 (three residues C-terminus to pTyr; left panel): Brk and Crk distinguish between peptides containing a þ3 Pro through distinct recognition of permissive and non-permissive residues at þ1 and þ4. Brk SH2-domain binding is favoured with permissive factors for Brk (þ1 Glu and þ4 Glu), which are also non-permissive factors for Crk favour Brk binding (middle panel). A þ1 Asp and þ4 Arg favours Crk binding over Brk (right panel). This is suggestive of coevolution between SH2 domains and peptide ligands to maximize specific recognition events using available peptide sequence information. (c) The adaptor Grb2 protein contains two SH3 domains (N-SH3, C-SH3) and a central SH2 domain. The sequence alignment of the GRB2 family from C. elegans to H. sapiens was mapped onto the structure of each individual domain. The percentage sequence conservation of the three domains is represented as a heat map on the surface of the solved ligand-bound structures (see legend for percentage of conservation). Structures used to highlight conservation are PDB: 1BMB; PDB: 2W0Z; PDB: 1GBR. Phil. Trans. R. Soc. B (2012) Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling (a) B. A. Liu and P. D. Nash 2569 (b) Dm Sp Ci Abl SH3 binding DE loop Dr Xt (c) Abl SH3 binding DE loop Gg Mm Rt Bt Hs Figure 7. Evolution of a short linear motif within the SH2 domain of Crk to accommodate the interaction with the SH3 domain of Abl kinase. (a) Overlay of Crk SH2 domain structures from multiple organisms. The structures are coloured according to the different organisms in (b). The intramolecular pTyr peptide ligand of Crk (Y221) is shown in grey. (b) The SH2 domain structures from D. melanogaster (Dm), S. purpuratas (Sp), C. intestinalis (Ci), D. rerio (Dr), X. tropicalis (Xt), G. gallus (Gg), M. musculus (Mm), R. norvegicus (Rt), M. domestica (Md), B. taurus (Bt) were modelled using Swiss-model and UCSF-Chimera of the H. sapiens (Hs) Crk SH2 domain, PDB: 1JU5 (see Liu et al. [8] for detailed methods). (c) Sequence alignment of the DE loop of Crk and CrkL orthologues is shown. The presence of a splice site within the DE loop (circled in red) in Crk and absent in CrkL orthologues provides an opportunity for evolving an extended SH3 binding loop. addition to evolving SH2 domain specificity through its ligand-binding pocket, domains may evolve secondary binding sites with their ligands to improve affinity, specificity or alter the biophysical nature of the interaction (e.g. by reducing off-rate to enhance downstream signalling). For example, the N-SH2 domain of phospholipase Cg uses secondary contacts through its BC and DE loops for interaction with the C-lobe of the fibroblast growth factor receptor (FGFR) catalytic domain structure, which are important for binding to pY766 in FGFR1 [105]. SH2 domains may also exert allosteric regulation over other aspects of their host protein. For example, intramolecular contacts between the SH2 domain and tyrosine kinase domain directly regulate the enzymatic substrate activity for SH2-domain-containing kinases such as Fes and Abl [106]. Thus, SH2 domain proteins can evolve in multiple fashions to achieve selectivity in pTyr-mediated signalling. Following a WGD event, many duplicate copies of genes are lost, whereas a fraction of the duplicates survive. Immediately following the WGD event, regulatory Phil. Trans. R. Soc. B (2012) networks may be rapidly rewired to integrate the newly duplicate genes and prevent chaotic loss of control over cellular processes. The resulting rapid evolution and functional divergence can occur, in part, at the level of transcription as mentioned earlier [92]. However, adaption may also occur at the level of post-translational regulation. Sites of PTMs are typically short motifs that commonly occur within intrinsically disordered protein regions that are inherently under less selective pressure to maintain sequence and thus are capable of rapid evolution. Changes in PTM sites thus represents one mechanism of rapidly effecting necessary rewiring. A study of phosphosites across paralogous yeast genomes following WGD identified duplicate genes as having on average more phosphosites than genes that were not duplicated [107]. This post-translational level of regulation might be a means of rapidly rewiring duplicate copies, allowing these genes to adapt and to be retained. SH2 proteins typically contain an assortment of other modular domains and short linear motifs that prescribe a range of additional interactions. The product of a gene duplication may give rise to a new gene that preserves Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2570 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling certain functions of the parent while acquiring specialized functions to coordinate novel interactions [108]. For example, the Socs proteins have evolved to fine-tune their primary binding interface in order to recognize specific sets of targets. Thus, Socs2 binds specifically to the growth hormone receptor, whereas Socs4 recognizes phosphorylated EGFR [109]. Both Crk and Sh2d1a each evolved as duplications of existing family members and then acquired new interactions through insertion of extended loop regions within the SH2 domain that act to promote the recruitment of SH3 domains via a nonpTyr-dependent, secondary binding surface within the SH2 domain. The gene family CRK encodes two paralogues, Crk and CrkL, and in mammals the Crk SH2 domain evolved an extended proline rich loop between the bD and bE strands of the SH2 domain that specifies an interaction with the SH3 domain of the CTK c-Abl [78,110] (figure 7). The insertion of a splice site within the bD–bE loop in vertebrate Crk but absent in CrkL allowed for evolutionary divergence through extension of this loop [8]. Splice site slippage is one way that such sequence fragments may be inserted [111], allowing for rapid divergence between paralogues [8]. Evolving new binding sites is both a mechanism for diversifying duplicate genes as well as enabling the development of more complex signalling networks within specific tissues. Collectively, these mechanisms suggest that gene duplication may have served in part to promote SH2 domain proteins to coevolve and diverge within tissue types to create specialized cellular signalling networks such as those involved in signalling in lymphocytes. 8. CONCLUSIONS The evolution of pTyr signalling involves the complex interplay of signalling components for control and activation of functional downstream signalling events through phosphorylation of tyrosine residues by PTKs and the recognition of specific pTyr sites by SH2 or other pTyr-binding domains. The site of phosphorylation and the sequence context surrounding the pTyr is under evolutionary pressure both to optimize kinase/ phosphatase specificity as well as binding specificity for SH2 or other domains. Therefore, SH2 domains must coevolve with pTyr sites to optimize the appropriate level of specificity in signalling. Clearly, PTPs also play a role in the evolution of this system and present a further opportunity for study. There are still many unanswered questions, such as what are the primordial functions of SH2 domain proteins in their earliest forms? What were the driving forces for diversifying and expanding these classes of proteins at specific times? And what role did pTyr signalling play in the emergence and elaboration of multicellularity in the metazoan lineage. It is likely that important insights into the role of SH2 domains will come from future dissection of biological networks and pathways through genetics in lower organisms. Additional sequencing of genomes representing early branches of the unikont lineage and early steps in the development of metazoan multicellularity as well as biochemical analysis of pTyr signalling pathways in these organisms may provide deeper insight into the origins and evolution of pTyr signalling networks. Phil. Trans. R. Soc. B (2012) Addendum: During the publication process, two relevant articles were published that provide further insight into phosphotyrosine evolution that we would be remiss to ignore. Jeffrey G. Williams group identified Pyk2 as the kinase responsible for phosphorylation and activation of STATc in Dictyostelium (Proc. Nat. Acad. Sci. PMID: 22699506). Patrick Cramer and Dirk Eick’s groups showed the tandem SH2 domains of SPT6 bound well to pTyr1, pTyr1/pSer2, pTyr1/pSer5 but not to unphosphorylated CTD. This indicates that Tyr1 phosphorylation stimulates CTD binding of the bona fide elongation factor SPT6 (Science, PMID: 22745433). We thank Tony Pawson, Chris Tan and the members of the Pawson laboratory for helpful discussions. B.A.L is supported by a Canadian Institutes of Health Postdoctoral Fellowship. P.D.N. is supported by funds from the National Science Foundation grant (no. MCB-0819125), the University of Chicago Cancer Research Center and the Cancer Research Foundation. REFERENCES 1 Lim, W. A. & Pawson, T. 2000 Phosphotyrosine signaling: evolving a new cellular communication system. Cell 142, 661–667. (doi:10.1016/j.cell.2010.08.023) 2 Grangeasse, C., Nessler, S. & Mijakovic, I. 2012 Bacterial tyrosine-kinases: evolution, biological function and structural insights. Phil. Trans. R. Soc. B 367, 2640–2655. (doi:10.1098/rstb.2012.0424) 3 Liu, B. A., Jablonowski, K., Raina, M., Arce, M., Pawson, T. & Nash, P. D. 2006 The human and mouse complement of SH2 domain proteins—establishing the boundaries of phosphotyrosine signaling. Mol. Cell. 22, 851–868. (doi:10.1016/j.molcel.2006.06.001) 4 Pawson, T. 2004 Specificity in signal transduction: from phosphotyrosine –SH2 domain interactions to complex cellular systems. Cell 116, 191 –203. (doi:10.1016/ S0092-8674(03)01077-8) 5 Smith, M. J., Hardy, W. R., Murphy, J. M., Jones, N. & Pawson, T. 2006 Screening for PTB domain binding partners and ligand specificity using proteome-derived NPXY peptide arrays. Mol. Cell Biol. 26, 8461 –8474. (doi:10.1128/MCB.01491-06) 6 Benes, C. H., Wu, N., Elia, A. E., Dharia, T., Cantley, L. C. & Soltoff, S. P. 2005 The C2 domain of PKCdelta is a phosphotyrosine binding domain. Cell 121, 271 –280. (doi:10.1016/j.cell.2005.02.019) 7 Mukherjee, M. et al. 2012 Structure of a novel phosphotyrosine-binding domain in Hakai that targets E-cadherin. EMBO J. 31, 1308 –1319. (doi:10.1038/ emboj.2011.496) 8 Liu, B. A., Shah, E., Jablonowski, K., Stergachis, A., Engelmann, B. & Nash, P. D. 2011 The SH2 domaincontaining proteins in 21 species establish the provenance and scope of phosphotyrosine signaling in eukaryotes. Sci. Signal 4, ra83. (doi:10.1126/scisignal.2002105) 9 King, N. et al. 2008 The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature 451, 783 –788. (doi:10.1038/nature06617) 10 Manning, G., Young, S. L., Miller, W. T. & Zhai, Y. 2008 The protist, Monosiga brevicollis, has a tyrosine kinase signaling network more elaborate and diverse than found in any known metazoan. Proc. Natl Acad. Sci. USA 105, 9674–9679. (doi:10.1073/pnas.0801314105) 11 Pincus, D., Letunic, I., Bork, P. & Lim, W. A. 2008 Evolution of the phospho-tyrosine signaling machinery in premetazoan lineages. Proc. Natl Acad. Sci. USA 105, 9680– 9684. (doi:10.1073/pnas.0803161105) 12 Manning, G., Whyte, D. B., Martinez, R., Hunter, T. & Sudarsanam, S. 2002 The protein kinase complement Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 of the human genome. Science 298, 1912–1934. (doi:10.1126/science.1075762) Alonso, A. et al. 2004 Protein tyrosine phosphatases in the human genome. Cell 117, 699 –711. (doi:10.1016/j. cell.2004.05.018) Eckhart, W., Hutchinson, M. A. & Hunter, T. 1979 An activity phosphorylating tyrosine in polyoma T antigen immunoprecipitates. Cell 18, 925–933. (doi:10.1016/ 0092-8674(79)90205-8) Hunter, T. 1998 The role of tyrosine phosphorylation in cell growth and disease. Harvey Lect. 94, 81–119. Hunter, T. 2009 Tyrosine phosphorylation: thirty years and counting. Curr. Opin. Cell Biol. 21, 140–146. (doi:10.1016/j.ceb.2009.01.028) Olsen, J. V., Blagoev, B., Gnad, F., Macek, B., Kumar, C., Mortensen, P. & Mann, M. 2006 Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell 127, 635 –648. (doi:10.1016/j.cell. 2006.09.026) Huttlin, E. L. et al. 2010 A tissue-specific atlas of mouse protein phosphorylation and expression. Cell 143, 1174 –1189. (doi:10.1016/j.cell.2010.12.001) Sadowski, I., Stone, J. C. & Pawson, T. 1986 A noncatalytic domain conserved among cytoplasmic protein-tyrosine kinases modifies the kinase function and transforming activity of Fujinami sarcoma virus P130gag-fps. Mol. Cell Biol. 6, 4396–4408. Sadowski, I. & Pawson, T. 1987 Catalytic and noncatalytic domains of the Fujinami sarcoma virus P130gag-fps protein-tyrosine kinase distinguished by the expression of v-fps polypeptides in Escherichia coli. Oncogene 1, 181–191. Mayer, B. J., Hamaguchi, M. & Hanafusa, H. 1988 A novel viral oncogene with structural similarity to phospholipase C. Nature 332, 272–275. (doi:10.1038/332272a0) Songyang, Z. & Cantley, L. C. 2004 ZIP codes for delivering SH2 domains. Cell 116(Suppl. 2), S41 –S43. (doi:10.1016/S0092-8674(04)00041-8) Li, L., Tibiche, C., Fu, C., Kaneko, T., Moran, M., Schiller, M., Li, S. S. C. & Wang, E. In press. The human phosphotyrosine signaling network: evolution and hotspots of hijacking in cancer. Genome Res. (doi:10.1101/gr.128819.111) Scott, J. D. & Pawson, T. 2009 Cell signaling in space and time: where proteins come together and when they’re apart. Science 326, 1220–1224. (doi:10.1126/ science.1175668) Cheng, A. M. et al. 1998 Mammalian Grb2 regulates multiple steps in embryonic development and malignant transformation. Cell 95, 793–803. (doi:10.1016/ S0092-8674(00)81702-X) Yoder, J., Pham, C., Iizuka, Y. M., Kanagawa, O., Liu, S. K., McGlade, J. & Cheng, A. M. 2001 Requirement for the SLP-76 adaptor GADS in T cell development. Science 291, 1987–1991. (doi:10.1126/science.1057176) Shen, R., Ouyang, Y. B., Qu, C. K., Alonso, A., Sperzel, L., Mustelin, T., Kaplan, M. H. & Feng, G.-S. 2002 Grap negatively regulates T-cell receptorelicited lymphocyte proliferation and interleukin-2 induction. Mol. Cell Biol. 22, 3230– 3236. (doi:10. 1128/MCB.22.10.3230-3236.2002) Sonnhammer, E. L., Eddy, S. R. & Durbin, R. 1997 Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins 28, 405 –420. (doi:10.1002/(SICI)1097-0134(199707)28: 3,405::AID-PROT10.3.0.CO;2-L) Schultz, J., Milpetz, F., Bork, P. & Ponting, C. P. 1998 SMART, a simple modular architecture research tool: identification of signaling domains. Proc. Natl Acad. Sci. USA 95, 5857–5864. (doi:10.1073/pnas.95.11.5857) Phil. Trans. R. Soc. B (2012) B. A. Liu and P. D. Nash 2571 30 Marchler-Bauer, A. et al. 2011 CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 39, D225–D229. (doi:10. 1093/nar/gkq1189) 31 Andersen, J. N., Jansen, P. G., Echwald, S. M., Mortensen, O. H., Fukada, T., Del Vecchio, R., Tonks, N. K. & Møller, N. P. H. 2004 A genomic perspective on protein tyrosine phosphatases: gene structure, pseudogenes, and genetic disease linkage. FASEB J. 18, 8 –30. (doi:10.1096/fj.02-1212rev) 32 Braconi Quintaje, S. & Orchard, S. 2008 The annotation of both human and mouse kinomes in UniProtKB/ Swiss-Prot: one small step in manual annotation, one giant leap for full comprehension of genomes. Mol. Cell Proteomics 7, 1409–1419. (doi:10.1074/mcp. R700001-MCP200) 33 Jin, J. et al. 2009 Eukaryotic protein domains as functional units of cellular evolution. Sci. Signal 2, ra76. (doi:10.1126/scisignal.2000546) 34 D’Aniello, S., Irimia, M., Maeso, I., Pascual-Anaya, J., Jimenez-Delgado, S., Bertrand, S. & Garcia-Fernandez, J. 2008 Gene expansion and retention leads to a diverse tyrosine kinase superfamily in amphioxus. Mol. Biol. Evol. 25, 1841–1854. (doi:10.1093/molbev/msn132) 35 Lam, H. Y., Khurana, E., Fang, G., Cayting, P., Carriero, N., Cheung, K. H. & Gerstein, M. B. 2009 Pseudofam: the pseudogene families database. Nucleic Acids Res. 37, D738–D743. (doi:10.1093/nar/gkn758) 36 Maclennan, A. J. & Shaw, G. 1993 A yeast SH2 domain. Trends Biochem. Sci. 18, 464 –465. (doi:10. 1016/0968-0004(93)90006-9) 37 Williams, J. G. & Zvelebil, M. 2004 SH2 domains in plants imply new signalling scenarios. Trends Plant Sci. 9, 161 –163. (doi:10.1016/j.tplants.2004.02.001) 38 Ardehali, M. B., Yao, J., Adelman, K., Fuda, N. J., Petesch, S. J., Webb, W. W. & Lis, J. T. 2009 Spt6 enhances the elongation rate of RNA polymerase II in vivo. EMBO J. 28, 1067–1077. (doi:10.1038/emboj.2009.56) 39 Adkins, M. W. & Tyler, J. K. 2006 Transcriptional activators are dispensable for transcription in the absence of Spt6-mediated chromatin reassembly of promoter regions. Mol. Cell 21, 405–416. (doi:10.1016/j.molcel. 2005.12.010) 40 Yoh, S. M., Cho, H., Pickle, L., Evans, R. M. & Jones, K. A. 2007 The Spt6 SH2 domain binds Ser2-P RNAPII to direct Iws1-dependent mRNA splicing and export. Genes Dev. 21, 160–174. (doi:10.1101/gad.1503107) 41 Diebold, M. L., Loeliger, E., Koch, M., Winston, F., Cavarelli, J. & Romier, C. 2010 Noncanonical tandem SH2 enables interaction of elongation factor Spt6 with RNA polymerase II. J. Biol. Chem. 285, 38 389– 38 398. (doi:10.1074/jbc.M110.146696) 42 Sun, M., Lariviere, L., Dengl, S., Mayer, A. & Cramer, P. 2010 A tandem SH2 domain in transcription elongation factor Spt6 binds the phosphorylated RNA polymerase II C-terminal repeat domain (CTD). J. Biol. Chem. 285, 41 597–41 603. (doi:10.1074/jbc.M110.144568) 43 Dengl, S., Mayer, A., Sun, M. & Cramer, P. 2009 Structure and in vivo requirement of the yeast Spt6 SH2 domain. J. Mol. Biol. 389, 211 –225. (doi:10. 1016/j.jmb.2009.04.016) 44 Gao, Q., Hua, J., Kimura, R., Headd, J. J., Fu, X. Y. & Chin, Y. E. 2004 Identification of the linker-SH2 domain of STAT as the origin of the SH2 domain using two-dimensional structural alignment. Mol. Cell Proteomics 3, 704–714. (doi:10.1074/mcp.M300131-MCP200) 45 Liu, J., Zhang, J., Gong, Q., Xiong, P., Huang, H., Wu, B., Lu, G., Wu, J. & Shi, Y. 2011 Solution structure of tandem SH2 domains from Spt6 protein and their binding to the phosphorylated RNA polymerase II Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 2572 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 B. A. Liu and P. D. Nash Review. Evolution of pTyr signalling C-terminal domain. J. Biol. Chem. 286, 29 218 – 29 226. (doi:10.1074/jbc.M111.252130) Sugden, C., Ross, S., Bloomfield, G., Ivens, A., Skelton, J., Mueller-Taubenberger, A. & Williams, J. G. 2010 Two novel Src homology 2 domain proteins interact to regulate Dictyostelium gene expression during growth and early development. J. Biol. Chem. 285, 22 927–22 935. (doi:10.1074/jbc.M110.139733) Williams, J. G., Noegel, A. A. & Eichinger, L. 2005 Manifestations of multicellularity: Dictyostelium reports in. Trends Genet. 21, 392–398. (doi:10.1016/j.tig.2005.05. 002) Schieven, G., Thorner, J. & Martin, G. S. 1986 Proteintyrosine kinase activity in Saccharomyces cerevisiae. Science 231, 390–393. (doi:10.1126/science.2417318) Tan, J. L. & Spudich, J. A. 1990 Developmentally regulated protein-tyrosine kinase genes in Dictyostelium discoideum. Mol. Cell Biol. 10, 3578–3583. Moniakis, J. et al. 2001 An SH2-domain-containing kinase negatively regulates the phosphatidylinositol-3 kinase pathway. Genes Dev. 15, 687 –698. (doi:10. 1101/gad.871001) Fukuzawa, M., Araki, T., Adrian, I. & Williams, J. G. 2001 Tyrosine phosphorylation-independent nuclear translocation of a Dictyostelium STAT in response to DIF signaling. Mol. Cell. 7, 779–788. (doi:10.1016/ S1097-2765(01)00222-2) Langenick, J., Araki, T., Yamada, Y. & Williams, J. G. 2008 A Dictyostelium homologue of the metazoan Cbl proteins regulates STAT signalling. J. Cell Sci. 121, 3524–3530. (doi:10.1242/jcs.036798) Manning, G., Plowman, G. D., Hunter, T. & Sudarsanam, S. 2002 Evolution of protein kinase signaling from yeast to man. Trends Biochem. Sci. 27, 514 –520. (doi:10.1016/S0968-0004(02)02179-5) Goldberg, J. M., Manning, G., Liu, A., Fey, P., Pilcher, K. E., Xu, Y. & Smith, J. L. 2006 The Dictyostelium kinome –analysis of the protein kinases from a simple model organism. PLoS Genet. 2, e38. (doi:10.1371/ journal.pgen.0020038) Brugge, J. S., Jarosik, G., Andersen, J., Queral-Lustig, A., Fedor-Chaiken, M. & Broach, J. R. 1987 Expression of Rous sarcoma virus transforming protein pp60v-src in Saccharomyces cerevisiae cells. Mol. Cell Biol. 7, 2180–2187. Kornbluth, S., Jove, R. & Hanafusa, H. 1987 Characterization of avian and viral p60src proteins expressed in yeast. Proc. Natl Acad. Sci. USA 84, 4455–4459. (doi:10.1073/pnas.84.13.4455) Boschelli, F., Uptain, S. M. & Lightbody, J. J. 1993 The lethality of p60v-src in Saccharomyces cerevisiae and the activation of p34CDC28 kinase are dependent on the integrity of the SH2 domain. J. Cell Sci. 105, 519–528. Trager, J. B. & Martin, G. S. 1997 The role of the Src homology-2 domain in the lethal effect of Src expression in the yeast Saccharomyces cerevisiae. Int. J. Biochem. Cell Biol. 29, 635 –648. (doi:10.1016/ S1357-2725(96)00162-8) Tan, C. S., Pasculescu, A., Lim, W. A., Pawson, T., Bader, G. D. & Linding, R. 2009 Positive selection of tyrosine loss in metazoan evolution. Science 325, 1686–1688. (doi:10.1126/science.1174301) Tan, C. S. 2011 Sequence, structure, and network evolution of protein phosphorylation. Sci. Signal. 4, mr6. (doi:10.1126/scisignal.2002093) Amon, A., Surana, U., Muroff, I. & Nasmyth, K. 1992 Regulation of p34CDC28 tyrosine phosphorylation is not required for entry into mitosis in S. cerevisiae. Nature 355, 368 –371. (doi:10.1038/355368a0) Sorger, P. K. & Murray, A. W. 1992 S-phase feedback control in budding yeast independent of tyrosine Phil. Trans. R. Soc. B (2012) 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 phosphorylation of p34cdc28. Nature 355, 365 –368. (doi:10.1038/355365a0) Gu, Y., Rosenblatt, J. & Morgan, D. O. 1992 Cell cycle regulation of CDK2 activity by phosphorylation of Thr160 and Tyr15. EMBO J. 11, 3995– 4005. Manning, G., Reiner, D. S., Lauwaet, T., Dacre, M., Smith, A., Zhai, Y., Svard, S. & Gillin, F. D. 2011 The minimal kinome of Giardia lamblia illuminates early kinase evolution and unique parasite biology. Genome Biol. 12, R66. (doi:10.1186/gb-201112-7-r66) King, N. & Carroll, S. B. 2011 A receptor tyrosine kinase from choanoflagellates: molecular insights into early animal evolution. Proc. Natl Acad. Sci. USA 98, 15 032 –15 037. (doi:10.1073/pnas.261477698) Li, W., Young, S. L., King, N. & Miller, W. T. 2008 Signaling properties of a non-metazoan Src kinase and the evolutionary history of Src negative regulation. J. Biol. Chem. 283, 15 491–15 501. (doi:10.1074/jbc.M800002200) Segawa, Y., Suga, H., Iwabe, N., Oneyama, C., Akagi, T., Miyata, T. & Okada, M. 2006 Functional development of Src tyrosine kinases during evolution from a unicellular ancestor to multicellular animals. Proc. Natl Acad. Sci. USA 103, 12 021–12 026. (doi:10.1073/pnas.0600021103) Lee, P. N., Pang, K., Matus, D. Q. & Martindale, M. Q. 2006 A WNT of things to come: evolution of Wnt signaling and polarity in cnidarians. Semin. Cell Dev. Biol. 17, 157–167. (doi:10.1016/j.semcdb.2006.05.002) Wikramanayake, A. H., Hong, M., Lee, P. N., Pang, K., Byrum, C. A., Bince, J. M., Xu, R. & Martindale, M. Q. 2003 An ancient role for nuclear beta-catenin in the evolution of axial polarity and germ layer segregation. Nature 426, 446–450. (doi:10.1038/nature02113) Kremer, B. E., Adang, L. A. & Macara, I. G. 2007 Septins regulate actin organization and cell-cycle arrest through nuclear accumulation of NCK mediated by SOCS7. Cell 130, 837–850. (doi:10.1016/j.cell.2007.06.053) Magie, C. R. & Martindale, M. Q. 2008 Cell-cell adhesion in the cnidaria: insights into the evolution of tissue morphogenesis. Biol. Bull. 214, 218–232. (doi:10.2307/25470665) Mitra, S. K., Hanson, D. A. & Schlaepfer, D. D. 2005 Focal adhesion kinase: in command and control of cell motility. Nat. Rev. Mol. Cell Biol. 6, 56–68. (doi:10.1038/nrm1549) Frame, M. C., Patel, H., Serrels, B., Lietha, D. & Eck, M. J. 2010 The FERM domain: organizing the structure and function of FAK. Nat. Rev. Mol. Cell Biol. 11, 802–814. (doi:10.1038/nrm2996) Hartenstein, V. & Mandal, L. 2006 The blood/vascular system in a phylogenetic perspective. Bioessays 28, 1203– 1210. (doi:10.1002/bies.20497) Pelicci, G. et al. 1996 A family of Shc related proteins with conserved PTB, CH1 and SH2 regions. Oncogene 13, 633–641. Hardy, W. R., Li, L., Wang, Z., Sedy, J., Fawcett, J., Frank, E., Kucera, J. & Pawson, T. 2007 Combinatorial ShcA docking interactions support diversity in tissue morphogenesis. Science 317, 251 –256. (doi:10.1126/ science.1140114) Feller, S. M., Knudsen, B. & Hanafusa, H. 1994 c-Abl kinase regulates the protein binding activity of c-Crk. EMBO J. 13, 2341–2351. Anafi, M., Rosen, M. K., Gish, G. D., Kay, L. E. & Pawson, T. 1996 A potential SH3 domain-binding site in the Crk SH2 domain. J. Biol. Chem. 271, 21 365 – 21 374. (doi:10.1074/jbc.271.35.21365) Rozakis-Adcock, M., Fernley, R., Wade, J., Pawson, T. & Bowtell, D. 1993 The SH2 and SH3 domains of Downloaded from http://rstb.royalsocietypublishing.org/ on June 18, 2017 Review. Evolution of pTyr signalling 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 mammalian Grb2 couple the EGF receptor to the Ras activator mSos1. Nature 363, 83–85. (doi:10.1038/ 363083a0) Shuai, K. & Liu, B. 2003 Regulation of JAK-STAT signalling in the immune system. Nat. Rev. Immunol. 3, 900 –911. (doi:10.1038/nri1226) Lai, K. M. & Pawson, T. 2000 The ShcA phosphotyrosine docking protein sensitizes cardiovascular signaling in the mouse embryo. Genes Dev. 14, 1132 –1145. (doi:10.1101/gad.14.9.1132) Ohno, S. 1970 Evolution by gene duplication. London, UK: George Allen and Unwin. Hurles, M. 2004 Gene duplication: the genomic trade in spare parts. PLoS Biol. 2, E206. (doi:10.1371/ journal.pbio.0020206) Lynch, M. & Conery, J. S. 2000 The evolutionary fate and consequences of duplicate genes. Science 290, 1151 –1155. (doi:10.1126/science.290.5494.1151) Emes, R. D., Goodstadt, L., Winter, E. E. & Ponting, C. P. 2003 Comparison of the genomes of human and mouse lays the foundation of genome zoology. Hum. Mol. Genet. 12, 701–709. (doi:10.1093/hmg/ ddg078) Fawcett, J. P. et al. 2007 Nck adaptor proteins control the organization of neuronal circuits important for walking. Proc. Natl Acad. Sci. USA 104, 20 973– 20 978. (doi:10.1073/pnas.0710316105) Jones, N. et al. 2009 Nck proteins maintain the adult glomerular filtration barrier. J. Am. Soc. Nephrol. 20, 1533 –1543. (doi:10.1681/ASN.2009010056) Roy, E. et al. 2010 Nck adaptors are positive regulators of the size and sensitivity of the T-cell repertoire. Proc. Natl Acad. Sci. USA 107, 15 529 –15 534. (doi:10. 1073/pnas.1009743107) Bladt, F., Aippersbach, E., Gelkop, S., Strasser, G. A., Nash, P., Tafuri, A., Gertler, F. B. & Pawson, T. 2003 The murine Nck SH2/SH3 adaptors are important for the development of mesoderm-derived embryonic structures and for regulating the cellular actin network. Mol. Cell Biol. 23, 4586–4597. (doi:10.1128/MCB.23. 13.4586-4597.2003) Fujikawa, K. et al. 2003 Vav1/2/3-null mice define an essential role for Vav family proteins in lymphocyte development and activation but a differential requirement in MAPK signaling in T and B cells. J. Exp. Med. 198, 1595–1608. (doi:10.1084/jem. 20030874) D’Antonio, M. & Ciccarelli, F. D. 2011 Modification of gene duplicability during the evolution of protein interaction network. PLoS Comput. Biol. 7, e1002029. (doi:10.1371/journal.pcbi.1002029) Li, W. H., Yang, J. & Gu, X. 2005 Expression divergence between duplicate genes. Trends Genet. 21, 602 –607. (doi:10.1016/j.tig.2005.08.006) Liang, H. & Li, W. H. 2007 Gene essentiality, gene duplicability and protein connectivity in human and mouse. Trends Genet. 23, 375 –378. (doi:10.1016/j.tig. 2007.04.005) Vogel, C. & Chothia, C. 2006 Protein family expansions and biological complexity. PLoS Comput. Biol. 2, e48. (doi:10.1371/journal.pcbi.0020048) Buljan, M. & Bateman, A. 2009 The evolution of protein domain families. Biochem. Soc. Trans. 37, 751 –755. (doi:10.1042/BST0370751) Tan, C. S. et al. 2009 Comparative analysis reveals conserved protein phosphorylation networks implicated in multiple diseases. Sci. Signal 2, ra39. (doi:10.1126/ scisignal.2000316) Phil. Trans. R. Soc. B (2012) B. A. Liu and P. D. Nash 2573 97 Lim, W. A. 2010 Designing customized cell signalling circuits. Nat. Rev. Mol. Cell Biol. 11, 393 –403. (doi:10.1038/nrm2904) 98 Kaneko, T., Huang, H., Zhao, B., Li, L., Liu, H., Voss, C. K., Wu, C., Schiller, M. R. & Li, S. S. C. 2010 Loops govern SH2 domain specificity by controlling access to binding pockets. Sci. Signal 3, ra34. (doi:10. 1126/scisignal.2000796) 99 Songyang, Z. et al. 1993 SH2 domains recognize specific phosphopeptide sequences. Cell 72, 767– 778. (doi:10.1016/0092-8674(93)90404-E) 100 Waksman, G. et al. 1992 Crystal structure of the phosphotyrosine recognition domain SH2 of v-src complexed with tyrosine-phosphorylated peptides. Nature 358, 646 –653. (doi:10.1038/358646a0) 101 Liu, B. A., Jablonowski, K., Shah, E. E., Engelmann, B. W., Jones, R. B. & Nash, P. D. 2010 SH2 domains recognize contextual peptide sequence information to determine selectivity. Mol. Cell Proteomics 9, 2391–2404. (doi:10.1074/mcp.M110.001586) 102 Lewitzky, M. et al. 2001 The C-terminal SH3 domain of the adapter protein Grb2 binds with high affinity to sequences in Gab1 and SLP-76 which lack the SH3typical P-x-x-P core motif. Oncogene 20, 1052–1062. (doi:10.1038/sj.onc.1204202) 103 Berry, D. M., Nash, P., Liu, S. K., Pawson, T. & McGlade, C. J. 2002 A high-affinity Arg-X-X-Lys SH3 binding motif confers specificity for the interaction between Gads and SLP-76 in T cell signaling. Curr. Biol. 12, 1336–1341. (doi:10.1016/S0960-9822 (02)01038-2) 104 Liu, Q., Berry, D., Nash, P., Pawson, T., McGlade, C. J. & Li, S. S. 2003 Structural basis for specific binding of the Gads SH3 domain to an RxxK motifcontaining SLP-76 peptide: a novel mode of peptide recognition. Mol. Cell 11, 471 –481. (doi:10.1016/ S1097-2765(03)00046-7) 105 Bae, J. H., Lew, E. D., Yuzawa, S., Tome, F., Lax, I. & Schlessinger, J. 2009 The selectivity of receptor tyrosine kinase signaling is controlled by a secondary SH2 domain binding site. Cell 138, 514– 524. (doi:10.1016/ j.cell.2009.05.028) 106 Filippakopoulos, P. et al. 2008 Structural coupling of SH2-kinase domains links Fes and Abl substrate recognition and kinase activation. Cell 134, 793 –803. (doi:10.1016/j.cell.2008.07.047) 107 Amoutzias, G. D., He, Y., Gordon, J., Mossialos, D., Oliver, S. G. & Van de Peer, Y. 2010 Posttranslational regulation impacts the fate of duplicated genes. Proc. Natl Acad. Sci. USA 107, 2967–2971. (doi:10.1073/ pnas.0911603107) 108 Hughes, A. L. 2002 Adaptive evolution after gene duplication. Trends Genet. 18, 433– 434. (doi:10.1016/ S0168-9525(02)02755-5) 109 Bullock, A. N., Rodriguez, M. C., Debreczeni, J. E., Songyang, Z. & Knapp, S. 2007 Structure of the SOCS4-ElonginB/C complex reveals a distinct SOCS box interface and the molecular basis for SOCS-dependent EGFR degradation. Structure 15, 1493–1504. (doi:10.1016/j.str.2007.09.016) 110 Donaldson, L. W., Gish, G., Pawson, T., Kay, L. E. & Forman-Kay, J. D. 2002 Structure of a regulatory complex involving the Abl SH3 domain, the Crk SH2 domain, and a Crk-derived phosphopeptide. Proc. Natl Acad. Sci. USA 99, 14 053–14 058. (doi:10.1073/pnas.212518799) 111 Hiller, M., Huse, K., Platzer, M. & Backofen, R. 2005 Creation and disruption of protein features by alternative splicing: a novel mechanism to modulate function. Genome Biol. 6, R58. (doi:10.1186/gb-2005-6-7-r58)
© Copyright 2026 Paperzz