Evolution of protein phosphatases in plants and animals

Biochem. J. (2009) 417, 401–409 (Printed in Great Britain)
401
doi:10.1042/BJ20081986
REVIEW ARTICLE
Evolution of protein phosphatases in plants and animals
Greg B. G. MOORHEAD1 , Veerle DE WEVER, George TEMPLETON and David KERK
Department of Biological Sciences, University of Calgary, 2500 University Dr., N.W. Calgary, Alberta, Canada T2N 1N4
Protein phosphorylation appears to be a universal mechanism
of protein regulation. Genomics has provided the means to compile inventories of protein phosphatases across a wide selection of
organisms and this has supplied insights into the evolution of this
group of enzymes. Protein phosphatases evolved independently
several times yielding the groups we observe today. Starting
from a core catalytic domain, phosphatases evolved by a series
of gene duplication events and by adopting the use of regulatory
subunits and/or fusion with novel functional modules or domains.
Recent analyses also suggest that the serine/threonine specific
enzymes are more ancient than the PTPs (protein tyrosine
phosphatases). It is likely that the latter played a key role at the
onset of metazoan evolution in conjunction with the tremendous
expansion of tyrosine kinases and PTPs at this point. In the
present review, we discuss the evolution of the PTPs, the serine/
threonine specific PPP (phosphoprotein phosphatase) and PPM
(metallo-dependent protein phosphatase) families and the more
recently discovered phosphatases that utilize an aspartate-based
catalytic mechanism. We will also highlight examples of
convergent evolution and several phosphatases which are unique
to plants.
INTRODUCTION
and through biochemistry, molecular biology and most recently
genomics, we can now catalogue the phosphatase complement
of organisms and begin to examine evolutionary relationships
between the protein phosphatases. This review will discuss the
spectrum of protein phosphatases, evolutionary relationships,
convergent evolution and the role of phosphatases in the evolution
of metazoans.
There are few individuals whose contribution to science has had a
more profound effect on the study of biology than that of Charles
Darwin. It is an honour to write an article on protein phosphatase
evolution to commemorate his 200th birthday and 150 years since
the 1859 publication of “On the Origin of Species”. We live
in a time when we are beginning to understand evolution at a
molecular level and, having passed into the post-genomic era,
we are currently seeing a transition to an age of evolutionary
and comparative genomics, which ideally will culminate into
evolutionary proteomics. These emerging fields will undoubtedly
have a profound effect on not only evolutionary and genomic
biology, but on biological research in general.
Recent studies have highlighted the vast number of modified
proteins within proteomes and the often large number of
functional groups found on any given protein. Many studies
have also reinforced the view that protein phosphorylation is a
common means to the regulation of most cellular processes, is
ancient in origin, and is probably a universal covalent modification
among organisms [1–3]. This is highlighted by the fact that
protein kinases and phosphatases, which respectively add and
remove phosphate on proteins, constitute 2– 4 % of the genes
in a typical eukaryotic genome [4,5]. Early studies on protein
phosphorylation focused on its role in modifying enzyme activity,
while more recently it has been found to play a role in protein
localization, protein turnover and to provide a specific interaction
site for other proteins [6]. Like protein kinases, the study of
protein phosphatases began in the field of glycogen metabolism
Key words: genomics, kelch phosphatase, laforin, phylogenetics,
protein tyrosine phosphatase (PTP), starch excess 4 (SEX4),
TFIIF associating component of CTD phosphatase/small CTD
phosphatase (FCP1/SCP).
PROTEIN PHOSPHATASE CLASSIFICATION
Proteins can be phosphorylated on nine amino acids (tyrosine,
serine, threonine, cysteine, arginine, lysine, aspartate, glutamate
and histidine) with serine, threonine and tyrosine phosphorylation
being predominant in eukaryotic cells and playing key regulatory
roles [5]. Although phosphorylation on aspartate and histidine
residues in bacterial two-component systems have been well
characterized, several recent phospho-proteomic studies have
shown bacterial protein phosphorylation to be more predominant
than originally thought [7–9]. In three bacterial species, 63–
79 proteins were found to be phosphorylated, with a surprising
distribution on serine, threonine and tyrosine of approx. 69, 22
and 9 % respectively. In humans, phosphorylation on serine,
threonine and tyrosine is approx. 86.4, 11.8 and 1.8 % respectively
[10]. The enzymes that dephosphorylate these amino acids
(serine, threonine and tyrosine) form four groups based on
unique catalytic signatures/domain sequences and substrate preference. The majority of phospho-serine and phospho-threonine
dephosphorylation is accounted for by the PPP (phosphoprotein
Abbreviations used: CDC, cell division cycle; CTD, C-terminal domain; DSP (or DUSP), dual-specificity phosphatase; EPM2A, epilepsy of progressive
myoclonus type 2 gene A; EYA, eyes absent; FCP/SCP, TFIIF (transcription initiation factor IIF)-associating component of CTD phosphatase/small CTD
phosphatase; HAD, haloacid dehalogenase; LMPTP, low-molecular-mass PTP; MAPK, mitogen-activated protein kinase; MAPKP, MAPK phosphatase;
MTM, myotubularin; PAS2, Pasticcino 2; PP1 etc., protein phosphatase 1 etc.; PPKL, protein phosphatase 1 and kelch-like; PPM, metallo-dependent
protein phosphatase; PPP, phosphoprotein phosphatase; PRL, phosphatase of regenerating liver; PTB domain, phosphotyrosine-binding domain; PTEN,
phosphatase and tensin homologue deleted on chromosome 10; PTP, protein tyrosine phosphatase; PTPLA/B, PTP-like (proline instead of catalytic
arginine), member A/B; RLK, receptor-like kinase; RPTP, receptor PTP; SEX4, starch excess 4; SH2 domain, Src homology 2 domain; SIX, Sine oculis ;
TFIIF, transcription initiation factor IIF; UBLCP1, ubiquitin-like domain containing CTD phosphatase 1.
1
To whom correspondence should be addressed (email [email protected]).
c The Authors Journal compilation c 2009 Biochemical Society
402
G. B. G. Moorhead and others
phosphatase) [PP (protein phosphatase) 1, PP2A, PP2B (or
PP3), PP4, PP5, PP6 and PP7] and PPM (metallo-dependent
protein phosphatase; PP2C) families and includes the enzymes
shown in parentheses. Remarkably, the PPM and PPP families
are unrelated in sequence and are probably evolved from two
unique ancestral genes, but converged to have highly related
structures at the catalytic centre [11]. The PTP (protein tyrosine
phosphatase) family is defined by the catalytic signature CX5 R
and is very diverse in domain structure and substrate preference
with some family enzymes now recognized to dephosphorylate
complex carbohydrates, mRNA and phosphoinositides [12,13].
Finally, for the most recently classified group, the aspartatebased phosphatases, catalysis is driven by an aspartic acid
signature (DXDXT/V) and includes the FCP/SCP [TFIIF
(transcription initiation factor IIF)-associating component of CTD
(C-terminal domain) phosphatase/small CTD phosphatase] and
HAD (haloacid dehalogenase) family enzymes. We refer readers
to several excellent reviews on phosphatase classification and
functions [12–19].
SPECIFICITY THROUGH MODULAR DOMAINS AND REGULATORY
SUBUNITS
Having a complete inventory of phosphatases for several
organisms reveals many aspects of this group of enzymes, with one
striking feature being immediately apparent: ancient phosphatase
domains functionally evolved by docking to novel regulatory
subunits (most PPP members), or through additional domains
that were acquired by fusing with these catalytic domains at the
gene level (PTP, PPM and aspartate-based families) (see Figure 1).
Although the PTP core domain has also evolved to accommodate
specific substrates, other phosphatase catalytic subunits typically
display broad substrate specificity in vitro. As a result, specificity
of function is brought about by an array of modules or domains
and regulatory subunits. Domains and subunits provide a means of
subcellular targeting, allow docking of other proteins and inositol
lipids and often permit regulation by covalent modifications [20].
The evolutionary strategy of adding domains on to a core unit was
first noted for the PTPs, but is also apparent for the PPM (PP2C)
family (especially in plants) and a number of other phosphatases
(e.g. PP5). The PPP family members PP1 and PP2A are some
of the most highly conserved enzymes known. Conversely, they
obtain specificity for their catalytic engines via interactions with
many novel regulatory subunits. The best example of this is PP1,
with nearly 100 recognized regulatory subunits (Figure 1). Two of
the more recently identified members of the PPP family, PP4 and
PP6, followed a similar evolutionary route as PP1 and PP2A,
with a remarkable conservation of regulatory subunits across
species [21,22]. For instance, the PP4-interacting proteins, R2
and R3α and β of humans, have readily apparent orthologues in
the Saccharomyces cerevisiae genome that have been shown to
complex with the yeast PP4 catalytic subunit [23].
PTPs
The PTPs are defined through their common signature motif
(CX5 R) that drives catalysis [13]. It is thought that this motif and
catalytic mechanism evolved independently three times to yield
three groups or classes of PTPs [12]. The human genome encodes
more than 100 PTPs with orthologues for all but one in mouse
[12]. Of the more than 100 PTPs, 11 are catalytically inactive
and 16 dephosphorylate either glycogen (1), mRNA (2) or
phosphoinositides (13), and not proteins. It is thought that the
remainder of these PTPs dephosphorylate phosphotyrosine and,
c The Authors Journal compilation c 2009 Biochemical Society
Figure 1
Distinct evolutionary routes for protein phosphatases
Catalytic and regulatory modules have evolved by two means to accommodate specific
substrates and/or functions. (A) represents evolution by domain fusion on a genome level,
exemplified by PTP evolution. This evolutionary mode is also followed by some PPP members
(e.g. kelch-like phosphatases and aspartate-based phosphatases). The ancestral catalytic
domain (blue rectangle) fused with a regulatory module (red oval). Subsequent evolution of
regulatory modules (pentagon, oval, triangle) confer unique properties to the catalytic domain.
Modifications in the catalytic domain throughout evolution render further substrate specificity
(depicted by the shading of the catalytic domain). Some catalytic domains have become inactive
(red cross). (B) reflects evolution via complex formation on a proteome level, conceptualized
using PP1 as an example. The catalytic subunit (green oval) accommodates novel substrates via
interaction with numerous regulatory subunits (rounded triangle, pentagon, star). The RVxF/W
motif (black bar) is the primary PP1-docking motif.
in some cases, also phosphoserine and phosphothreonine. Class I,
which include the classic and dual-specificity enzymes, have a
common PTP domain structural fold and are by far the largest
group of PTPs and are further divided into subfamilies. The
classic enzymes (receptor and non-receptor) are given this name
as they defined the PTPs and they all dephosphorylate tyrosine
residues. The DSPs (dual-specificity phosphatases; also known
as DUSPs; this group includes all non-protein phosphatase PTPs)
are further divided into the MAPKP [MAPK (mitogen-activated
protein kinase) phosphatase], slingshot, PRL (phosphatase of
regenerating liver), atypical DSP, CDC14 (cell division cycle
14), PTEN (phosphatase and tensin homologue deleted on
chromosome 10) and MTMs (myotubularins). Class II and
III are represented by the LMPTP (low-molecular-mass PTP)
and Cdc25 isoforms and are thought to have evolved from a
bacterial rhodanese-like enzyme and a bacterial arsenate reductase
respectively (see [12,13,24] for further details).
In plants, no true protein tyrosine kinases exist, even though in
Arabidopsis there are at least 1055 protein kinases (compared with
518 in humans) of which ∼ 630 are RLKs (receptor-like kinases).
These RLKs are most like human Pelle kinases that display
serine/threonine kinase activity, and as such it is believed that
all plant RLKs are serine/threonine kinases [25,26]. Consistent
with the absence of tyrosine kinases is the near total lack of
plant phosphotyrosine-specific phosphatases in the PTP family
(i.e. classic enzymes) [27–29]. As with plants, other genomes
examined that were found to lack tyrosine kinases also have very
few PTPs [30]. For example, most species that belong to the
apicomplexa (protozoans) have no class II, class III or classic
PTPs and only a very few DSPs (in this case, PRL and MAPKP).
Apicomplexa genomes also have no obvious protein tyrosine
kinases [30]. As will be discussed below, it is now thought that
the PTPs that dephosphorylate phosphotyrosine evolved before
Protein phosphatase evolution
tyrosine kinases due to the action of promiscuous serine/threonine
kinases that phosphorylate tyrosine residues. The appearance of
true tyrosine kinases coincides with a dramatic expansion of the
PTP superfamily just before the evolution of metazoans.
Sequence and structural analysis of the class I PTPs reveals a
universal structural fold supporting the concept of evolution from
a common ancestral gene [12]. Consistent with this is evidence
of gene duplication. The highly sequence-related pairs TCPTP
(T-cell PTP)/PTP1B, PTPα/PTPε and PTPD1/PTPD2 each share
very similar exon structures in the genomes of human, mouse
and chimpanzee [31]. Human DUSP24 is localized adjacent to
DUSPs 13a and 13b indicating two ancestral gene duplications.
Humans have 21 RPTPs (receptor PTPs) and 12 of them have
a tandem arrangement of intracellular catalytic domains (D1 is
membrane proximal and D2 is membrane distal). With only one
exception, all D1 are active and all D2 are inactive, yet all D2s are
still well conserved compared with other PTP domains [13,32].
Phylogenetic analysis of PTP domains shows that D1 and D2
are from separate clades (i.e. all D1s are most like each other),
but group together when compared with all other PTP domains,
indicating that the PTP domain duplicated within the ancestral
PTP gene before the whole gene duplicated and ultimately gave
rise to the 12 RPTPs [32]. Recent analysis has shown that
the catalytic cysteine residue of the CX5 R motif is susceptible
to oxidation due to the same properties that make it a good
nucleophile [33]. Nature has evolved two mechanisms to protect
this group from irreversible oxidation [34]. PTP1B has been
shown to form a reversible cyclic sulfenamide with the catalytic
cysteine residue forming a bond with the amide nitrogen of the
neighbouring serine residue. PTPs from the DSP, LMPTP and
CDC25 groups have been demonstrated to form disulfide bonds
between the catalytic cysteine residue and another cysteine residue
of the same protein [34]. Although the D2 domains of RPTPs are
inactive, they intriguingly maintain their CX5 R signature and
are inactivated by loss of other key catalytic amino acids. The
remarkable conservation of D2 PTP domains and specifically
the CX5 R motifs has been postulated to be due to the potential
role of these domains as cellular redox sensors [34].
Certain members of the PTP family exemplify how sequence
and function conservation can go hand in hand. The class I dualspecificity enzyme Cdc14 controls mitotic exit in metazoans and
yeast. Searching for this enzyme in plants and eukaryotic (green)
algae revealed a single orthologue in Chlamydomonas reinhardtii
with sequence coverage over both the N-terminal region and PTP
domain [27]. Cdc14-like protein(s) are also present in higherplant genomes, yet phylogenetic analysis showed that they form a
distinct clade, separate from characterized Cdc14 proteins. These
Cdc14-like sequences lack key catalytic and substrate targeting
residues and thus could not function as active phosphatases,
but seem to have been co-opted into protein–protein interaction
domains [27]. Although putative Cdc14 orthologues have been
noted in many protozoans, there is an absence of this PTP in all
apicomplexans, except Cryptosporidium [30].
The single class II enzyme, also known as LMPTP, is tyrosinespecific and widespread, being found in all kingdoms including
Archea and most bacterial species, and is one of only three PTPs
in yeast [12,35]. A survey of several bacterial LMPTPs shows that
the enzyme is 30– 40 % identical with the human enzyme. This
astonishing conservation suggests an ancient conserved function,
but to date no clear role has been defined for LMPTP.
The sole class III PTP, also known as Cdc25, dephosphorylates the cell-cycle protein kinase Cdc2 [36]. The Cdc25s display
sequence similarity to rhodaneses and are thought to have evolved
from a bacterial, rhodanese-like protein [12,27]. Our genomic
analysis of several plant and algal species for phosphatases related
403
to that of human Cdc25 revealed that, like Cdc14, there appears to be no Cdc25 in plants [27]. Several, but not all, unicellular
parasites have Cdc25 orthologues. A notable absence is again in
the apicomplexa [30]. Candidate plant and algal Cdc25s actually
form a clade with several arsenate reductases and not the wellcharacterized human and yeast Cdc25s suggesting that these
candidates are really arsenate reductases. These sequences also
lack the N-terminal regulatory domain found in true Cdc25s. This
insight, along with the lack of Cdc14 in plants, indicates that
the role that these phosphatases play in cell-cycle control was
established after ancestral plant and animals/yeast diverged, or
was lost early in the plant lineage.
Inactive PTPs
A number of enzymes are inactive, yet conserved, in many
organisms (e.g. the pseudokinases), which opens discussion
towards their biological function [37–39]. The phosphatases are
no exception to this, particularly the PTPs since the core catalytic
motif is well defined and inactive mutations are readily identified
[12,13]. Inactive PTPs display PTP-like domain structures, but
possess mutations in the cysteine or arginine residues of the
CX5 R motif, in an upstream conserved aspartate residue or in
combinations of the three, all exemplified by the STYX domain
proteins and the D2 domain of RPTPs [13,40]. In the case of
STYX, such alterations are thought to disrupt the enzymatic
activity of the phosphatase, yet potentially maintain interaction
with phosphorylated substrates ([12,40] and references therein),
not unlike how a 14-3-3 protein docks a phosphoserine or
threonine motif. Mutations must have occurred early in evolution
since various inactive PTPs {e.g. tensins, PTPLA/Bs [PTPlike (proline instead of catalytic arginine), member A/B]} are
conserved within eukaryotes [41,42].
The MTMs, a well-defined class I PTP subfamily, are phosphoinositide phosphatases specific for the 3-position of PtdIns3P and
PtdIns(3,5)P2 . These phosphoinositides provide docking sites for
proteins involved in endosomal–lysosomal membrane trafficking.
Only a few MTMs have been found in a number of apicomplexan
species, while we identified two presumably active MTMs in
Arabidopsis [27]. In humans, seven of the 16 MTMs are predicted
to be inactive [12]. Their characterization indicates that they play
essential roles, some of which are linked to human disease.
For example, loss of either MTMR2 or MTMR13, an active
and inactive MTM respectively, both cause a form of Charcot–
Marie tooth disease [43]. Hence, the inactive protein is equally
essential to prevent disease onset, and both proteins are thought to
interact.
Another family of potentially inactive PTPs are the tensins.
The tensin family has four members in humans (Tensins 1–
3 and Cten), each containing an SH2 (Src homology 2) and
PTB (phosphotyrosine-binding) domain at their C-termini [44].
Primary sequence analyses predict either an inactive PTP domain
[45] or a PTEN-C2-like domain in the N-terminal region of
Tensins 1–3 [46]. Cten lacks the N-terminal region and therefore
the putative PTP domain. Recently, Tensins 1 and 2 and Cten
expression were shown to stimulate cell migration whereas Tensin
3 expression had an inhibitory effect. Interestingly, our sequence
alignment shows that Tensin 3 maintains the CX5 R motif, whereas
Tensins 1 and 2 display mutations in these residues (cysteine to
asparagine and arginine to lysine respectively; results not shown),
whereas Cten obviously lacks the CX5 R motif.
Finally, the PTP-like proteins PTPLA/B are classified as
inactive PTPs, although they have no sequence homology with the
PTPs except for the CX5 R motif, where in their case the arginine
c The Authors Journal compilation c 2009 Biochemical Society
404
G. B. G. Moorhead and others
residue is mutated into a proline [47]. Significant progress towards
the elucidation of their biological function has been made with the
PTPLA homologues in plants and baker’s yeast, PAS2 (Pasticcino
2) and Phs1 respectively. Reciprocal complementation studies
identified them as functional orthologues [48], while PTPLA/B
expression also rescues Phs1-impaired cells [49]. Both PAS2 and
Phs1 function as dehydratases in the elongase complex required
for the production of very-long-chain fatty acids. Further studies
are required to clarify the role of the CX5 R motif in these proteins
and the function of PTPLA in metazoan organisms.
Overall, these examples show that loss of key catalytic
residues, but overall domain conservation, led these proteins to
evolve from their initial function as phosphate-removing enzymes
towards different functions, e.g. phosphotyrosine-recognition
with potential scaffolding and localization capacities.
TYROSINE PHOSPHATASES AND METAZOAN EVOLUTION
Until recently, phosphotyrosine signalling was considered the
hallmark of intercellular communication of multicellular animals
as other unicellular organisms (including yeast) lack true tyrosine
kinases. Previously, a moderate selection of SH2 and PTP domains
were noted in the genomes of plants, fungi and dictyostelium
and in a few cases shown to be functional, as predicted [50 –
52]. Baker’s yeast, for example, has three PTPs [12,52,53] and
phospho-proteomics of several yeast species have shown a handful
of tyrosine phosphorylated proteins, even in the absence of true
tyrosine kinases [54–56]. This is explained by what has been
described as promiscuous serine/threonine kinase activity and
the action of the well-characterized MAPKK (MAPK kinase)
enzymes that phosphorylate MAPK activation loops in a TXY
motif [57].
The choanoflagellates are unicellular protists and, through
comparative genomics of mitochondrial DNA, they are currently
considered the closest living relatives of metazoans [58]. Recent
completion of the first choanoflagellate (Monosiga brevicollis)
genome has revealed an abundance of tyrosine kinases (∼ 128),
phosphotyrosine-specific PTPs (∼ 39) and proteins with the
phosphotyrosine-docking domains SH2 (∼ 123) and PTB (∼ 20)
[39,57,58]. Surprisingly, these numbers exceed that of any other
metazoan and support the concept that the phosphotyrosinesignalling machinery was present prior to the evolution of
multicellular animals.
These landmark genomic analyses tell us that although phosphorylation on tyrosine was present prior to metazoan evolution, it
had a limited role until tyrosine kinases evolved and with the tools
of kinase, phosphatase and phosphotyrosine-docking domain at
hand, a rapid expansion of all components occurred in some premetazoan ancestor that ultimately led to choanoflagellates and
multicellular animals. A comparison of the 38 classic (tyrosinespecific) PTPs of humans to the 39 M. brevicollis PTPs starkly
shows that only four are probably true orthologues, which presumably evolved in a pre-metazoan ancestor, while the remaining
independently evolved in each lineage with unique combinations
of signalling domains. This rapid expansion of the PTPs and
the phosphotyrosine-signalling system certainly played, to some
extent, a role in metazoan evolution. In contrast, as discussed
in [39,57], other well-recognized signalling domains/systems
(excluding small GTPases, phosphoserine/threonine kinases,
phosphatases and docking domains) are much more ancient and
appear in combinations that are common to both M. brevicollis,
metazoans and other eukaryotes, like plants and fungi, serving
as a reminder of how ancient other (non-PTP) phosphatases
are.
c The Authors Journal compilation c 2009 Biochemical Society
THE PPM FAMILY
The PPM family phosphatases, which include the PP2C
and pyruvate dehydrogenase phosphatase, are Mn2+ /Mg2+ dependent serine/threonine-specific enzymes that are resistant to
microcystin, okadaic acid and other classic toxin inhibitors of
the PPP family. Unlike most PPP members, the PP2C enzymes
do not have additional subunits but, like the PTPs, do display a
wide variety of additional domains that confer unique functions.
All eukaryotic PPM enzymes have 11 conserved motifs with nine
highly conserved amino acids, four of which are aspartate residues
that co-ordinate metal ions necessary for catalysis. Analysis of 16
archaeal genomes shows the presence of one PPM gene, while
bacteria display putative PPM genes in 27 out of 121 genomes
examined [59], grouped into two subfamilies. Interestingly,
subfamily II enzymes usually have additional domains with most
being GAF and PAS domains, while subfamily I genes are nearly
always lone catalytic domains [59]. While most bacterial species
with PPMs have one or a few PPMs, two Streptomyces
species have as many as 49 PPMs. Phylogenetic analysis of
these enzymes with eukaryotic PPMs reveals that the archaeal and
bacterial enzymes are rooted in the eukaryotic PPMs, suggesting
that the Mn2+ /Mg2+ -dependent enzymes originated in eukaryotes
and radiated into bacteria and archaea by horizontal gene transfer
[59]. Because PPM genes are spread widely throughout the bacterial kingdom, it is thought PPMs originated very early in the
eukaryotic lineage, then spread horizontally into bacteria early in
that lineage, and probably transferred several times. Support for
an early eukaryotic origin is upheld by the observation that PPM
phosphatases are found throughout eukaryotes, including plants,
and have no sequence homology with the ancient PPP enzymes.
The PPM enzymes are the largest phosphatase family in
plants, with 76 members in Arabidopsis thaliana. Phylogenetics
assembles them into ten subfamilies with six non-clustered genes
[60]. What is apparent from analysis of this group is the large
number of N- and C-terminal additions to these genes. They
are thought to confer specificity and/or localization to enzyme
function. These domains include MAPK-docking regions, FHA
(forkhead associated) domains, MORN (membrane occupation
and recognition nexus) repeats and putative transmembranespanning domains [60].
The PP2C members of humans have orthologues in most other
vertebrates [61]. Each vertebrate PP2C member (for instance
all PP2Cεs) clusters with the others, supporting the idea that
the family arose by a series of duplication events. A further
phylogenetic reconstruction of the metazoan PP2Cs reveals that
all enzymes cluster into two groups with one group having
unique sequences near the active site, as previously noted in
mammalian PP2Cs [61]. Analysis also indicates that two series
of rapid duplications took place, one before the emergence of
bilaterians and a second at the emergence of vertebrates. The
vertebrate-specific duplications have been linked to tissue-specific
expression patterns and may contribute to tissue development
in vertebrates. Interestingly, many other (non-phosphatase) gene
families, including the protein kinase subfamilies, evolved via a
series of duplication events [4], reminiscent of PP2C, supporting
the concept that expansion of signalling protein families was a
major driving force during key events of metazoan evolution.
THE PPP FAMILY
The PPP family is defined by three signature motifs (-GDXHG-,
-GDXVDRG- and -GNHE-) within a ∼ 280 amino acid catalytic
domain. PPP enzymes are regarded as very ancient, with most
members widely distributed in all eukaryotes and most bacterial
Protein phosphatase evolution
and archeal genomes having at least one PPP-like member. A
comparison of the PPP complements of humans and Arabidopsis
shows that plants have a complete absence of PP2B, but an
expansion of the number of PP1, PP2A, PP4 and PP6 genes and the
addition of a number of novel enzymes. Remarkably, the catalytic
domains of mammalian PP1s are 76 –88 % and 90 % identical
with plant and fungi PP1s respectively. Perhaps more remarkable
is that human PP1β is 100 % identical with rat, mouse, rabbit
and chicken and 97 % identical with the zebra fish, goldfish and
Atlantic salmon enzymes. Other noteworthy PPP complements
are found in the protozoan parasites Encephalitozoon cuniculi,
with only five PPP enzymes, and Entamoeba histolytica and
Trichomonas vaginalis both having large expansions with 81
and 169 predicted PPPs respectively [62], compared with 13 and
26 in humans and plants [27]. In addition there is the notable
absence of PP7 in yeast, but the presence of the unique PPP
members PPQ, PPZ1, PPZ2 and PPG1.
We have generated an unrooted tree for the PPP members
in humans and Arabidopsis thaliana (Figure 2), and the result
is consistent with similar analyses using a broader range of
eukaryotic PPP members [19,62,63]. PP5, PP7 and two plantspecific enzymes form a clade separate from other PPP members,
suggesting two distinct branches that probably separated early in
eukaryotic evolution. It has been proposed that the N-terminal
TPR (tetratricopeptide repeat) domains of PP5 and the Cterminal EF hand extensions of PP7 were originally separate
regulatory subunits and gene fusion yielded the products we
see today [62]. Because these extra domains are conserved
across eukaryotes, this too must have been an early event in
PP5 and PP7 evolution. With the presumed exception of the
Kelch-domain phosphatases, which were derived from a PP1
ancestor (Figure 1 and [62], discussed below), all other PPP
enzymes of this branch have regulatory subunits that define the
function of these proteins. As indicated previously, because a
majority of these additional regulatory subunits are present in
genomes of other eukaryotes and are often functionally conserved,
it is likely that most originated early in eukaryotic evolution
(Figure 1). This may explain why these PPP enzymes are so
highly conserved. Having these interaction partners that define the
function of the catalytic subunit certainly would hinder additional
evolution of the catalytic engine. For instance, the majority of the
PP1 regulatory subunits have a primary docking site called
the RVXF/W motif that allows interaction with PP1 in a
hydrophobic cleft on the surface of the enzyme [64–66].
Undoubtedly this docking site was exploited in the history of
PP1 allowing it to be recruited again and again to specifically
dephosphorylate new target proteins by incorporation of this short
docking motif in the regulatory subunit (Figure 1). Consistent
with this proposal is the knowledge that nearly 100 PP1-binding
proteins have now been identified and it is expected that there are
at least that many more several times over [14,24].
A unique PP1-related enzyme with extra domains is the PPKLs
(PP1 and kelch-like). The PPKL enzyme family is defined by
their large, kelch repeat containing N-terminal extensions and Cterminal PP1-like phosphatase domain. The kelch domain is a
widespread and ancient motif, with limited conserved residues
[67]. The founding members of the PPKL family are the
Plasmodium falciparium protein PfPPα [68] and four proteins
identified in A. thaliana [28,69]. BLAST searches with the kelchrepeat domain of these five proteins against translated genomes
identified further family members in the kingdom plantae (green
algae, vascular plants, mosses) and the superphylum alveolata,
with candidates in both the obligate parasitic apicomplexa and
the free-living aquatic ciliophora (Supplementary Figure S1
at http://www.BiochemJ.org/bj/417/bj4170401add.htm). We did
405
uncover PPKL members in Solanum lycopersicum (tomato) and
Vinis vinifera (grape) yet despite extensive searches, which
included newly sequenced genomes, we could not identify
orthologues in red algae, fungi or animals, nor in any noneukaryotic species, in keeping with previous reports [69,70].
Comparisons of the conserved residues in the PPP family
with the PPKL enzymes [71,72] indicate that most, but not
all, conserved PPP residues are maintained in this family (see
Supplementary Figure S2 at http://www.BiochemJ.org/bj/417/
bj4170401add.htm). This analysis further underscores how
(i) PPKLs contain five inserts in their phosphatase domain [68,69];
(ii) PPKL enzymes are more closely related to PP1 than to PP2A;
(iii) a significant number of PP1 family residues, particularly
those involved in small-molecule-inhibitor binding, either diverge
from the consensus or are preceded by peptide sequences which
may alter inhibitor binding [73]; and (iv) PPKL clades can be
defined by certain residues. For further details please refer to
Supplementary Figure S2 and [68–70].
The origin of the PPKL family is still under debate, as is
the origin of the apicomplexa, to which a large number of the
PPKLs are annotated (Supplementary Figure S1 and [74,75]).
We built a phylogenetic tree with the PPKL family members
using either domain separately (results not shown) or whole
protein sequences (Supplementary Figure S1; this Figure is
representative for each approach). We, and others, have found
PPKLs only in plants, green algae and several Alveolata
species [70]. This shared genotype between otherwise dissimilar
phyla support the current alveolate origin hypothesis whereby
a progenitor of (green and red) algae and plants yielded an
ancestral enzyme (i.e. the kelch domain/PP1-like enzyme), which
was retained in these organisms via secondary endosymbiosis
of the algae and subsequent nuclear gene transfer [76,77].
Further evolution of the alveolates and green plants could
thus produce the plant-related kelch-domain phosphatases. This
hypothesis could also explain why both apicomplexa and
green plants lack most PTP members. Finally, the current
PPKL distribution, which lacks members in the chromista, also
supports recent studies that question the strength of the current
grouping that yields the chromalveolate phylum [74,75]. We refer
readers to the following websites (http://tolweb.org/Eukaryotes/3
or www.discoverlife.org/mp/20m?tree = Life&flags = all) for
current classification schemes.
ASPARTATE-BASED CATALYSIS
While the other families of protein phosphatases have been
studied for more than 20 years, those utilizing aspartate-based
catalysis have only been noted in the last decade. The first
identified member of this family was FCP1, a TFIIF-associated
RNA polymerase II CTD phosphatase [78]. The aspartate-based
catalysis group is often divided into FCP/SCP-like and HADfamily phosphatases, but both actually share the same catalytic
motif and belong to different groups within the HAD superfamily
(http://www.ebi.ac.uk/interpro/). The HAD-family phosphatases
are ancient proteins, with FCP1 conserved throughout Eukaryota,
like members of the PPP family. However, unlike most members
of the PPP family, specificity of function has been gained through
the addition of other domains to the catalytic module, not
through association with other proteins (see Figure 1A for a
modular representation). One common additional domain is the
BRCT domain, which may function to direct these phosphatases
to phosphorylated substrates.
The evolutionary conservation of FCP1, as with other
phosphatases mentioned above, is completely logical where we
understand biochemical function. FCP1 conservation throughout
c The Authors Journal compilation c 2009 Biochemical Society
406
Figure 2
G. B. G. Moorhead and others
Phylogenetic analysis of the PPP family enzymes from Homo sapiens and A. thaliana
An unrooted tree was generated by comparison of catalytic domains from members of the PPP family from human and A. thaliana . Catalytic domains were aligned using ClustalX, hand-optimized
using GeneDoc, and the bootstrap tree was generated using the neighbour-joining function of ClustalX [27]. Proteins with an At suffix are from A. thaliana , while an Hs suffix denotes human
sequences. The Arabidopsis sequences are: TOPP1 (At2g29400), TOPP2 (At5g59160), TOPP3 (At1g64040), TOPP4 (At2g39840), TOPP5 (At3g46820), TOPP6 (At5g43380), TOPP7 (At4g11240),
TOPP8 (At5g27840), TOPP9 (not generally referred to as TOPP9, but listed as this here for clarity; At3g05580), AtPP2A-1 (At1g59830), AtPP2A-2 (At1g10430), AtPP2A-3 (At3g58500), AtPP2A-4
(At2g42500), AtPP2A-5 (At1g69960), AtPP4-1 (At4g26720), AtPP4-2 (At5g55260), AtPP5 (At2g42810), AtPP6 (At1g50370), AtFYPP3 (At3g19980) and AtPP7 (At5g63870). The human sequences
are (GenBank® accession numbers): HsPP1α (NP_002699), HsPP1β (NP_002700), HsPP1γ (NP_002701), HsPP2A-α (NP_002706), HsPP2A-β (NP_004147), HsPP2B-α (NP_000935),
HsPP2B-β (NP_066955) and HsPP2B-γ (NP_005596), HsPP4 (NP_002711), HsPP5 (NP_006238), HsPP6 (NP_002712), HsPP7-1 (PPEF1; NP_006231) and HsPP7-2 (PPEF2; NP_006230).
eukaryota is probably due to its role in the dephosphorylation of
the CTD of RNA polymerase II, a process that is itself ancient
and conserved. This is in contrast with the EYA (eyes absent)
c The Authors Journal compilation c 2009 Biochemical Society
protein, another aspartate-based phosphatase, which has
been identified only in multicellular eukaryotes (one to four
members per organism), and plays a role in development.
Protein phosphatase evolution
Mutations of these proteins lead to severe developmental
abnormalities, such as Branchio-oto-renal syndrome, caused
by mutation to the human EYA1 protein [79]. Most EYA
homologues contain an N-terminal transactivation domain that
forms an active transcription factor along with orthologues of
the SIX (Sine oculis) protein of Drosophila. The exception to
this rule is in plants, where Arabidopsis and Oryza express EYA
orthologues that lack transactivation domains [27]. The absence
of the transactivation domain itself is not surprising, as plants lack
a homologue to SIX. However, since plants also lack all of the
systems currently known to be affected by mutation of the EYA
proteins, the function of these proteins in plants is still unknown.
The aspartate-based phosphatases provide two excellent
examples of how protein sequence conservation can infer functional conservation. The first is a ubiquitin-like domain containing
CTD phosphatase, UBLCP1, which, like the EYA proteins, is
conserved throughout multicellular eukaryotes. Only a single
study exists on the human protein, demonstrating both phosphatase activity and nuclear localization of UBLCP1 [80].
The second is a 50 kDa translocase of the inner mitochondrial
membrane (termed TIM50 or TIMM50), an essential component
of the mitochondrial inner membrane in both human and
yeast cells [81]. This protein has been studied in some detail;
however, very little attention has been given to the presence
of the phosphatase domain, save a single study demonstrating
protein phosphatase activity [81]. Simply due to their respective
conservation patterns, we infer that UBLCP1 will be involved, as
are the EYA proteins, with multicellular development, and that
the phosphatase domain of TIM50 plays an important role in the
regulation of mitochondrial translocation.
Conversely, where species have ‘unique’ versions of these
proteins, it suggests a species-specific role. Arabidopsis, rice and
poplar genomes indicate a dramatic expansion in the total number
of FCP-like phosphatases, with more than double the number of
proteins compared with humans (19, 19 and 23 respectively,
compared with eight in humans [27]). Hence these proteins may
have a number of unique roles in plants, indeed demonstrated by
recent studies implicating two of the unique members, CPL1 and
CPL2, in jasmonic acid biosynthesis, as well as stress and auxin
responses [82,83].
CONVERGENCE ON FUNCTION: EVOLUTION OF
CARBOHYDRATE-BINDING PHOSPHATASES
Two of the best examples of convergent evolution at the molecular level come from the phosphatase field. We have already
noted the structural convergence of the PPP and PPM active
sites with little or no sequence relatedness in their respective
genes. More recently the human and plant PTPs, laforin and
SEX4 (starch excess 4), have been demonstrated to be functional
equivalents [84]. Lafora disease is an autosomal recessive
disorder characterized by the accumulation of long-stranded,
poorly branched glycogen (polyglucosan), known as Lafora
bodies. Nearly 50 % of Lafora disease patients have a recessive
mutation in EPM2A (epilepsy of progressive myoclonus type 2
gene A) [85] that encodes laforin, a DSP with a carbohydratebinding module (CBM 20) N-terminal to the PTP domain
(Figure 3). Laforin is the only human phosphatase with a
carbohydrate-docking module (PP1 associates with glycogen
through separate glycogen-binding subunits) and displays the
currently unique ability among PTPs to dephosphorylate complex
carbohydrates [84,86]. Consistent with results from a mouse
knockout of EPM2A, laforin is now recognized as a glycogen
phosphatase and mutation of the laforin gene leads to aberrant
Figure 3
407
Domain structure and gene organization of SEX4 and laforin
The plant starch phosphatase SEX4 is shown with its chloroplast transit peptide (cTP),
dual-specificity (or PTP) phosphatase domain (DSP) and carbohydrate-binding module 20
(CBM20). Human laforin has dual-specificity (or PTP) phosphatase domain (DSP) and
carbohydrate-binding module 20 (CBM20) domains, but they are reversed in order indicating
convergent evolution to a common target substrate.
accumulation of glycogen and disease development. Both starch
and glycogen are phosphorylated molecules and only recently
has the significance of this modification been appreciated. In
plants the phosphorylation and dephosphorylation of starch is critical to its accumulation and appropriate mobilization in leaves
[87–89]. The power of genomics combined with phylogenetics
(and biochemistry) was nicely demonstrated when laforin was
further investigated [84]. A number of protists (unicellular eukaryotes) synthesize and store floridean starch, another complex
carbohydrate related to amylopectin. Genomics of several of
these protists (Toxoplasma gondi, Eimeria tenella, Tetrahymena
thermophila, Paramecium tetraurelia and Cyanidioschyzon merolae) has revealed putative laforin orthologues. Characterization
of the least-conserved laforin (from C. merolae) showed it has
biochemical properties similar to human laforin and localizes to
floridean starch in vivo [84]. Each of these five protists identified
with a laforin orthologue are of red algal descent, contain a true
mitochondrion and produce floridean starch. Other organisms of
red algal descent lack laforin orthologues [88] and in each case
they were noted to not make floridean starch and/or not have
true mitochondria. Further analysis of genomic data revealed
that laforin orthologues only exist in these five protists and all
vertebrates, but not in any invertebrate animals or other eukaryote,
bacteria or archeal genomes. In the same study it was suggested
that laforin originated in an ancestral red algae early in eukaryotic
evolution and was maintained only in vertebrates and those
organisms that produce floridean starch.
A genetic screen for plants that produce excess starch (SEX
mutants) yielded several novel mutants including one designated
SEX4 whose locus was mapped to At3g52180 in Arabidopsis.
This gene, like laforin, encodes a DSP with a carbohydratebinding module (CBM 20), but in this case the domain order
is reversed (Figure 3) indicating that SEX4 and laforin are
not orthologues. SEX4 was predicted and shown to localize to
the chloroplast (the site of starch synthesis/degradation) and to associate with and dephosphorylate starch [87–89]. Being the only
genes in animals and plants whose mutation causes inappropriate
glycogen/starch accumulation in combination with both having
CBM and DSP domains, led Gentry et al. [84] to postulate that
SEX4 and laforin were functional equivalents. In an elegant study
they generated stable cell lines of the Arabidopsis SEX4 mutant
cell line sex4-3 transformed with either wild-type SEX4 or human
laforin (preceded by a SEX4 chloroplast transit peptide). Both of
these reverted the sex4-3 mutant starch-excess phenotype. This is
an excellent example of convergent evolution showing that plants
and animals evolved functionally equivalent phosphatases.
CONCLUSION
The age of comparative genomics will have a major impact on
evolutionary biology and biological research in general. Already
c The Authors Journal compilation c 2009 Biochemical Society
408
G. B. G. Moorhead and others
this field has played a role in resolving key issues in the
phylogenetic relationships between organisms (particularly those
that sit at evolutionary nodes) and shedding light on the emergence
of basic cellular processes [90] and evolution of species-specific
traits [91].
Information gleaned from genomics studies will also allow new
large-scale or systems biology approaches to answer research
questions in all fields of biology. For example, knockdown and
knockout studies can be initiated that take a complement of
phosphatases and asks what phenotype is generated when the
expression of an individual enzyme is dramatically modified or
eliminated. We can also do experiments where individual enzymes
from the phosphatase catalogue can be overexpressed in cells
and then examine their influence on the phosphorylation state of
particular proteins. Indeed, the age of comparative genomics has
meant that the protein phosphatases have come of age.
FUNDING
This work was supported by the Natural Sciences and Engineering Research Council of
Canada [grant number 216895]; the Alberta Ingenuity Center for Carbohydrate Science
[grant number RT707047]; and the Alberta Cancer Board [grant number 23161].
REFERENCES
1 Cohen, P. (2002) The origins of protein phosphorylation. Nat. Cell Biol. 4, E127–E130
2 Kennelly, P. J. (2003) Archaeal protein kinases and protein phosphatases: insights from
genomics and biochemistry. Biochem. J. 370, 373–389
3 Hunter, T. (2007) The age of crosstalk: phosphorylation, ubiquitination, and beyond.
Mol. Cell 28, 730–738
4 Manning, G., Plowman, G. D., Hunter, T. and Sudarsanam, S. (2002) Evolution of protein
kinase signaling from yeast to man. Trends Biochem. Sci. 27, 514–520
5 Hunter, T. (2004) Protein phosphorylation: what does the future hold? In Life Sciences for
the 21st Century (Ehud Keinan, I.S. and Sela, M., ed.), pp. 191–223, Wiley, Hoboken, NJ
6 Cohen, P. (2000) The regulation of protein function by multisite phosphorylation: a
25 year update. Trends Biochem. Sci. 25, 596–601
7 Macek, B., Gnad, F., Soufi, B., Kumar, C., Olsen, J. V., Mijakovic, I. and Mann, M. (2008)
Phosphoproteome analysis of E. coli reveals evolutionary conservation of bacterial
Ser/Thr/Tyr phosphorylation. Mol. Cell. Proteomics 7, 299–307
8 Macek, B., Mijakovic, I., Olsen, J. V., Gnad, F., Kumar, C., Jensen, P. R. and Mann, M.
(2007) The serine/threonine/tyrosine phosphoproteome of the model bacterium Bacillus
subtilis . Mol. Cell. Proteomics 6, 697–707
9 Soufi, B., Gnad, F., Jensen, P. R., Petranovic, D., Mann, M., Mijakovic, I. and Macek, B.
(2008) The Ser/Thr/Tyr phosphoproteome of Lactococcus lactis IL1403 reveals multiple
phosphorylated proteins. Proteomics 8, 3486–3493
10 Olsen, J. V., Blagoev, B., Gnad, F., Macek, B., Kumar, C., Mortensen, P. and Mann, M.
(2006) Global, in vivo , and site-specific phosphorylation dynamics in signaling networks.
Cell 127, 635–648
11 Das, A. K., Helps, N. R., Cohen, P. T. and Barford, D. (1996) Crystal structure of the
protein serine/threonine phosphatase 2C at 2.0 Å resolution. EMBO J. 15, 6798–6809
12 Alonso, A., Sasin, J., Bottini, N., Friedberg, I., Friedberg, I., Osterman, A., Godzik, A.,
Hunter, T., Dixon, J. and Mustelin, T. (2004) Protein tyrosine phosphatases in the human
genome. Cell 117, 699–711
13 Tonks, N. K. (2006) Protein tyrosine phosphatases: from genes, to function, to disease.
Nat. Rev. Mol. Cell Biol. 7, 833–846
14 Cohen, P. T. (2002) Protein phosphatase 1: targeted in many directions. J. Cell Sci. 115,
241–256
15 Moorhead, G. B., Trinkle-Mulcahy, L. and Ulke-Lemee, A. (2007) Emerging roles of
nuclear protein phosphatases. Nat. Rev. Mol. Cell Biol. 8, 234–244
16 Bollen, M. (2001) Combinatorial control of protein phosphatase-1. Trends Biochem. Sci.
26, 426–431
17 Kamenski, T., Heilmeier, S., Meinhart, A. and Cramer, P. (2004) Structure and mechanism
of RNA polymerase II CTD phosphatases. Mol. Cell 15, 399–407
18 Janssens, V., Longin, S. and Goris, J. (2008) PP2A holoenzyme assembly: in cauda
venenum (the sting is in the tail). Trends Biochem. Sci. 33, 113–121
19 Cohen, P. T. (2004) Overview of protein serine/threonine phosphatases. In Protein
Phosphatases (Arino, J., ed.), pp. 1–20, Springer, Berlin
20 Seet, B. T., Dikic, I., Zhou, M. M. and Pawson, T. (2006) Reading protein modifications
with interaction domains. Nat. Rev. Mol. Cell Biol. 7, 473–483
c The Authors Journal compilation c 2009 Biochemical Society
21 Stefansson, B. and Brautigan, D. L. (2006) Protein phosphatase 6 subunit with conserved
Sit4-associated protein domain targets IκBε. J. Biol. Chem. 281, 22624–22634
22 Cohen, P. T., Philp, A. and Vazquez-Martin, C. (2005) Protein phosphatase 4: from
obscurity to vital functions. FEBS Lett. 579, 3278–3286
23 Gingras, A. C., Caballero, M., Zarske, M., Sanchez, A., Hazbun, T. R., Fields, S.,
Sonenberg, N., Hafen, E., Raught, B. and Aebersold, R. (2005) A novel, evolutionarily
conserved protein phosphatase complex involved in cisplatin sensitivity.
Mol. Cell. Proteomics 4, 1725–1740
24 Moorhead, G. B., Trinkle-Mulcahy, L. and Ulke-Lemée, A. (2007) Emerging roles of
nuclear protein phosphatases. Nat. Rev. Mol. Cell Biol. 8, 234–244
25 Shiu, S. H. and Bleecker, A. B. (2001) Receptor-like kinases from Arabidopsis form a
monophyletic gene family related to animal receptor kinases. Proc. Natl. Acad. Sci. U.S.A.
98, 10763–10768
26 Shiu, S. H. and Bleecker, A. B. (2003) Expansion of the receptor-like kinase/Pelle gene
family and receptor-like proteins in Arabidopsis . Plant Physiol. 132, 530–543
27 Kerk, D., Templeton, G. and Moorhead, G. B. (2008) Evolutionary radiation pattern of novel
protein phosphatases revealed by analysis of protein data from the completely sequenced
genomes of humans, green algae, and higher plants. Plant Physiol. 146, 351–367
28 Kerk, D., Bulgrien, J., Smith, D. W., Barsam, B., Veretnik, S. and Gribskov, M. (2002) The
complement of protein phosphatase catalytic subunits encoded in the genome of
Arabidopsis . Plant Physiol. 129, 908–925
29 Luan, S. (2003) Protein phosphatases in plants. Annu. Rev. Plant Biol. 54, 63–92
30 Andreeva, A. V. and Kutuzov, M. A. (2008) Protozoan protein tyrosine phosphatases.
Int. J. Parasitol 38, 1279–1295
31 Andersen, J. N., Jansen, P. G., Echwald, S. M., Mortensen, O. H., Fukada, T.,
Del Vecchio, R., Tonks, N. K. and Moller, N. P. (2004) A genomic perspective on protein
tyrosine phosphatases: gene structure, pseudogenes, and genetic disease linkage.
FASEB J. 18, 8–30
32 Andersen, J. N., Mortensen, O. H., Peters, G. H., Drake, P. G., Iversen, L. F., Olsen, O. H.,
Jansen, P. G., Andersen, H. S., Tonks, N. K. and Moller, N. P. (2001) Structural and
evolutionary relationships among protein tyrosine phosphatase domains. Mol. Cell. Biol.
21, 7117–7136
33 Salmeen, A., Andersen, J. N., Myers, M. P., Meng, T. C., Hinks, J. A., Tonks, N. K. and
Barford, D. (2003) Redox regulation of protein tyrosine phosphatase 1B involves a
sulphenyl-amide intermediate. Nature 423, 769–773
34 Barford, D. (2004) The role of cysteine residues as redox-sensitive regulatory switches.
Curr. Opin. Struct. Biol. 14, 679–686
35 Musumeci, L., Bongiorni, C., Tautz, L., Edwards, R. A., Osterman, A., Perego, M.,
Mustelin, T. and Bottini, N. (2005) Low-molecular-weight protein tyrosine phosphatases
of Bacillus subtilis . J. Bacteriol. 187, 4945–4956
36 Trinkle-Mulcahy, L. and Lamond, A. I. (2006) Mitotic phosphatases: no longer silent
partners. Curr. Opin. Cell Biol. 18, 623–631
37 Kannan, N. and Taylor, S. S. (2008) Rethinking pseudokinases. Cell 133, 204–205
38 Boudeau, J., Miranda-Saavedra, D., Barton, G. J. and Alessi, D. R. (2006) Emerging roles
of pseudokinases. Trends Cell Biol. 16, 443–452
39 Manning, G., Young, S. L., Miller, W. T. and Zhai, Y. (2008) The protist, Monosiga
brevicollis , has a tyrosine kinase signaling network more elaborate and diverse than
found in any known metazoan. Proc. Natl. Acad. Sci. U.S.A. 105, 9674–9679
40 Wishart, M. J. and Dixon, J. E. (1998) Gathering STYX: phosphatase-like form predicts
functions for unique protein-interaction domains. Trends Biochem. Sci. 23, 301–306
41 Torgler, C. N., Narasimha, M., Knox, A. L., Zervas, C. G., Vernon, M. C. and Brown, N. H.
(2004) Tensin stabilizes integrin adhesive contacts in Drosophila . Dev. Cell 6, 357–369
42 Schuldiner, M., Collins, S. R., Thompson, N. J., Denic, V., Bhamidipati, A., Punna, T.,
Ihmels, J., Andrews, B., Boone, C., Greenblatt, J. F. et al. (2005) Exploration of the
function and organization of the yeast early secretory pathway through an epistatic
miniarray profile. Cell 123, 507–519
43 Begley, M. J. and Dixon, J. E. (2005) The structure and regulation of myotubularin
phosphatases. Curr. Opin. Struct. Biol. 15, 614–620
44 Mouneimne, G. and Brugge, J. S. (2007) Tensins: a new switch in cell migration.
Dev. Cell 13, 317–319
45 Haynie, D. T. and Ponting, C. P. (1996) The N-terminal domains of tensin and auxilin are
phosphatase homologues. Protein Sci. 5, 2643–2646
46 Chen, H., Duncan, I. C., Bozorgchami, H. and Lo, S. H. (2002) Tensin1 and a previously
undocumented family member, tensin2, positively regulate cell migration. Proc. Natl.
Acad. Sci. U.S.A. 99, 733–738
47 Uwanogho, D. A., Hardcastle, Z., Balogh, P., Mirza, G., Thornburg, K. L., Ragoussis, J.
and Sharpe, P. T. (1999) Molecular cloning, chromosomal mapping, and developmental
expression of a novel protein tyrosine phosphatase-like gene. Genomics 62, 406–416
48 Bach, L., Michaelson, L. V., Haslam, R., Bellec, Y., Gissot, L., Marion, J., Da Costa, M.,
Boutin, J. P., Miquel, M., Tellier, F. et al. (2008) The very-long-chain hydroxy fatty
acyl-CoA dehydratase PASTICCINO2 is essential and limiting for plant development.
Proc. Natl. Acad. Sci. U.S.A. 105, 14727–14731
Protein phosphatase evolution
49 Kihara, A., Sakuraba, H., Ikeda, M., Denpoh, A. and Igarashi, Y. (2008) Membrane
topology and essential amino acid residues of Phs1, a 3-hydroxyacyl-CoA dehydratase
involved in very long-chain fatty acid elongation. J. Biol. Chem. 283, 11199–11209
50 Williams, J. G. and Zvelebil, M. (2004) SH2 domains in plants imply new signalling
scenarios. Trends Plant Sci. 9, 161–163
51 Zhukovskaya, N. V., Fukuzawa, M., Tsujioka, M., Jermyn, K. A., Kawata, T., Abe, T.,
Zvelebil, M. and Williams, J. G. (2004) Dd-STATb, a Dictyostelium STAT protein with a
highly aberrant SH2 domain, functions as a regulator of gene expression during growth
and early development. Development 131, 447–458
52 Neel, B. G. and Tonks, N. K. (1997) Protein tyrosine phosphatases in signal transduction.
Curr. Opin. Cell Biol. 9, 193–204
53 Zhan, X. L., Deschenes, R. J. and Guan, K. L. (1997) Differential regulation of FUS3 MAP
kinase by tyrosine-specific phosphatases PTP2/PTP3 and dual-specificity phosphatase
MSG5 in Saccharomyces cerevisiae . Genes Dev. 11, 1690–1702
54 Wilson-Grady, J. T., Villén, J. and Gygi, S. P. (2008) Phosphoproteome analysis of fission
yeast. J. Proteome Res. 7, 1088–1097
55 Chi, A., Huttenhower, C., Geer, L. Y., Coon, J. J., Syka, J. E., Bai, D. L., Shabanowitz, J.,
Burke, D. J., Troyanskaya, O. G. and Hunt, D. F. (2007) Analysis of phosphorylation sites
on proteins from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass
spectrometry. Proc. Natl. Acad. Sci. U.S.A. 104, 2193–2198
56 Gruhler, A., Olsen, J. V., Mohammed, S., Mortensen, P., Faergeman, N. J., Mann, M. and
Jensen, O. N. (2005) Quantitative phosphoproteomics applied to the yeast pheromone
signaling pathway. Mol. Cell. Proteomics 4, 310–327
57 Pincus, D., Letunic, I., Bork, P. and Lim, W. A. (2008) Evolution of the phospho-tyrosine
signaling machinery in premetazoan lineages. Proc. Natl. Acad. Sci. U.S.A. 105,
9680–9684
58 King, N., Westbrook, M. J., Young, S. L., Kuo, A., Abedin, M., Chapman, J.,
Fairclough, S., Hellsten, U., Isogai, Y., Letunic, I. et al. (2008) The genome of the
choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature 451, 783–788
59 Zhang, W. and Shi, L. (2004) Evolution of the PPM-family protein phosphatases in
Streptomyces : duplication of catalytic domain and lateral recruitment of additional
sensory domains. Microbiology 150, 4189–4197
60 Schweighofer, A., Hirt, H. and Meskiene, I. (2004) Plant PP2C phosphatases: emerging
functions in stress signaling. Trends Plant Sci. 9, 236–243
61 Stern, A., Privman, E., Rasis, M., Lavi, S. and Pupko, T. (2007) Evolution of the metazoan
protein phosphatase 2C superfamily. J. Mol. Evol. 64, 61–70
62 Kutuzov, M. A. and Andreeva, A. V. (2008) Protein Ser/Thr phosphatases of parasitic
protozoa. Mol. Biochem. Parasitol. 161, 81–90
63 Byrum, C. A., Walton, K. D., Robertson, A. J., Carbonneau, S., Thomason, R. T., Coffman,
J. A. and McClay, D. R. (2006) Protein tyrosine and serine-threonine phosphatases in the
sea urchin, Strongylocentrotus purpuratus : identification and potential functions.
Dev. Biol. 300, 194–218
64 Egloff, M. P., Johnson, D. F., Moorhead, G., Cohen, P. T., Cohen, P. and Barford, D. (1997)
Structural basis for the recognition of regulatory subunits by the catalytic subunit of
protein phosphatase 1. EMBO J. 16, 1876–1887
65 Wakula, P., Beullens, M., Ceulemans, H., Stalmans, W. and Bollen, M. (2003) Degeneracy
and function of the ubiquitous RVXF motif that mediates binding to protein
phosphatase-1. J. Biol. Chem. 278, 18817–18823
66 Johnson, D. F., Moorhead, G., Caudwell, F. B., Cohen, P., Chen, Y. H., Chen, M. X. and
Cohen, P. T. (1996) Identification of protein-phosphatase-1-binding domains on the
glycogen and myofibrillar targetting subunits. Eur. J. Biochem. 239, 317–325
67 Prag, S. and Adams, J. C. (2003) Molecular phylogeny of the kelch-repeat superfamily
reveals an expansion of BTB/kelch proteins in animals. BMC Bioinformatics 4, 42
68 Li, J. L. and Baker, D. A. (1998) A putative protein serine/threonine phosphatase from
Plasmodium falciparum contains a large N-terminal extension and five unique inserts in
the catalytic domain. Mol. Biochem. Parasitol. 95, 287–295
69 Kutuzov, M. A. and Andreeva, A. V. (2002) Protein Ser/Thr phosphatases with kelch-like
repeat domains. Cell. Signalling 14, 745–750
70 Mora-Garcia, S., Vert, G., Yin, Y., Cano-Delgado, A., Cheong, H. and Chory, J. (2004)
Nuclear protein phosphatases with Kelch-repeat domains modulate the response to
brassinosteroids in Arabidopsis . Genes Dev. 18, 448–460
71 Goldberg, J., Huang, H. B., Kwon, Y. G., Greengard, P., Nairn, A. C. and Kuriyan, J. (1995)
Three-dimensional structure of the catalytic subunit of protein serine/threonine
phosphatase-1. Nature 376, 745–753
72 Barton, G. J., Cohen, P. T. and Barford, D. (1994) Conservation analysis and structure
prediction of the protein serine/threonine phosphatases. Sequence similarity with
diadenosine tetraphosphatase from Escherichia coli suggests homology to the protein
phosphatases. Eur. J. Biochem. 220, 225–237
409
73 Maynes, J. T., Luu, H. A., Cherney, M. M., Andersen, R. J., Williams, D., Holmes, C. F.
and James, M. N. (2006) Crystal structures of protein phosphatase-1 bound to
motuporin and dihydromicrocystin-LA: elucidation of the mechanism of enzyme
inhibition by cyanobacterial toxins. J. Mol. Biol. 356, 111–120
74 Burki, F., Shalchian-Tabrizi, K., Minge, M., Skjaeveland, A., Nikolaev, S. I., Jakobsen,
K. S. and Pawlowski, J. (2007) Phylogenomics reshuffles the eukaryotic supergroups.
PLoS ONE 2, e790
75 Parfrey, L. W., Barbero, E., Lasser, E., Dunthorn, M., Bhattacharya, D., Patterson, D. J. and
Katz, L. A. (2006) Evaluating support for the current classification of eukaryotic diversity.
PLoS Genet. 2, e220
76 Cavalier-Smith, T. (1999) Principles of protein and lipid targeting in secondary
symbiogenesis: euglenoid, dinoflagellate, and sporozoan plastid origins and the
eukaryote family tree. J. Eukaryotic Microbiol. 46, 347–366
77 Weber, A. P., Linka, M. and Bhattacharya, D. (2006) Single, ancient origin of a plastid
metabolite translocator family in Plantae from an endomembrane-derived ancestor.
Eukaryotic Cell 5, 609–612
78 Archambault, J., Pan, G., Dahmus, G. K., Cartier, M., Marshall, N., Zhang, S., Dahmus,
M. E. and Greenblatt, J. (1998) FCP1, the RAP74-interacting subunit of a human protein
phosphatase that dephosphorylates the carboxyl-terminal domain of RNA polymerase IIO.
J. Biol. Chem. 273, 27593–27601
79 Abdelhak, S., Kalatzis, V., Heilig, R., Compain, S., Samson, D., Vincent, C., Weil, D.,
Cruaud, C., Sahly, I., Leibovici, M. et al. (1997) A human homologue of the Drosophila
eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel
gene family. Nat. Genet. 15, 157–164
80 Zheng, H., Ji, C., Gu, S., Shi, B., Wang, J., Xie, Y. and Mao, Y. (2005) Cloning and
characterization of a novel RNA polymerase II C-terminal domain phosphatase.
Biochem. Biophys. Res. Commun. 331, 1401–1407
81 Guo, Y., Cheong, N., Zhang, Z., De Rose, R., Deng, Y., Farber, S. A.,
Fernandes-Alnemri, T. and Alnemri, E. S. (2004) Tim50, a component of the
mitochondrial translocator, regulates mitochondrial integrity and cell death.
J. Biol. Chem. 279, 24813–24825
82 Matsuda, O., Sakamoto, H., Nakao, Y., Oda, K. and Iba, K. (2008) CTD phosphatases in
the attenuation of wound-induced transcription of jasmonic acid-biosynthetic genes
in Arabidopsis . Plant J., doi: 10.1111/j.1365-313X.2008.03663.x
83 Ueda, A., Li, P., Feng, Y., Vikram, M., Kim, S., Kang, C. H., Kang, J. S., Bahk, J. D., Lee,
S. Y., Fukuhara, T. et al. (2008) The Arabidopsis thaliana carboxyl-terminal domain
phosphatase-like 2 regulates plant growth, stress and auxin responses. Plant Mol. Biol.
67, 683–697
84 Gentry, M. S., Dowen, 3rd, R. H., Worby, C. A., Mattoo, S., Ecker, J. R. and Dixon, J. E.
(2007) The phosphatase laforin crosses evolutionary boundaries and links carbohydrate
metabolism to neuronal disease. J. Cell Biol. 178, 477–488
85 Minassian, B. A., Lee, J. R., Herbrick, J. A., Huizenga, J., Soder, S., Mungall, A. J.,
Dunham, I., Gardner, R., Fong, C. Y., Carpenter, S. et al. (1998) Mutations in a gene
encoding a novel protein tyrosine phosphatase cause progressive myoclonus epilepsy.
Nat. Genet. 20, 171–174
86 Tagliabracci, V. S., Turnbull, J., Wang, W., Girard, J. M., Zhao, X., Skurat, A. V.,
Delgado-Escueta, A. V., Minassian, B. A., Depaoli-Roach, A. A. and Roach, P. J. (2007)
Laforin is a glycogen phosphatase, deficiency of which leads to elevated phosphorylation
of glycogen in vivo . Proc. Natl. Acad. Sci. U.S.A. 104, 19262–19266
87 Niittyla, T., Comparot-Moss, S., Lue, W. L., Messerli, G., Trevisan, M., Seymour, M. D.,
Gatehouse, J. A., Villadsen, D., Smith, S. M., Chen, J. et al. (2006) Similar protein
phosphatases control starch metabolism in plants and glycogen metabolism in mammals.
J. Biol. Chem. 281, 11815–11818
88 Sokolov, L. N., Dominguez-Solis, J. R., Allary, A. L., Buchanan, B. B. and Luan, S. (2006)
A redox-regulated chloroplast protein phosphatase binds to starch diurnally and functions
in its accumulation. Proc. Natl. Acad. Sci. U.S.A. 103, 9732–9737
89 Kerk, D., Conley, T. R., Rodriguez, F. A., Tran, H. T., Nimick, M., Muench, D. G. and
Moorhead, G. B. (2006) A chloroplast-localized dual-specificity protein phosphatase in
Arabidopsis contains a phylogenetically dispersed and ancient carbohydrate-binding
domain, which binds the polysaccharide starch. Plant J. 46, 400–413
90 Merchant, S. S., Prochnik, S. E., Vallon, O., Harris, E. H., Karpowicz, S. J., Witman, G. B.,
Terry, A., Salamov, A., Fritz-Laylin, L. K., Marechal-Drouard, L. et al. (2007) The
Chlamydomonas genome reveals the evolution of key animal and plant functions. Science
318, 245–250
91 Hanikenne, M., Talke, I. N., Haydon, M. J., Lanz, C., Nolte, A., Motte, P., Kroymann, J.,
Weigel, D. and Kramer, U. (2008) Evolution of metal hyperaccumulation required
cis-regulatory changes and triplication of HMA4. Nature 453, 391–395
Received 3 October 2008/18 November; accepted 19 November 2008
Published on the Internet 23 December 2008, doi:10.1042/BJ20081986
c The Authors Journal compilation c 2009 Biochemical Society
Biochem. J. (2009) 417, 401–409 (Printed in Great Britain)
doi:10.1042/BJ20081986
SUPPLEMENTARY ONLINE DATA
Evolution of protein phosphatases in plants and animals
Greg B. G. MOORHEAD1 , Veerle DE WEVER, George TEMPLETON and David KERK
Department of Biological Sciences, University of Calgary, 2500 University Dr., N.W. Calgary, Alberta, Canada T2N 1N4
Figure S1
Phylogenetic analysis of the PPP family kelch-domain containing phosphatases
A rooted tree was generated by comparison of PPKL sequences derived from tBLASTn searches throughout eukaryotic genomes. Sequences were aligned using ClustalX, hand-optimized using
GeneDoc, and a bootstrap tree was generated with the neighbour-joining function of ClustalX [1]. Sequences are derived from Uniprot/TrEMBL (http://www.expasy.org) unless stated otherwise:
Ostreococcus lucimarinus , A4S3K6; Chlamydomonas reinhardtii , A8HNE3; Physcomitrella patens a , A9S2C2; Physcomitrella patens b , A9STN4; Physcomitrella patens c , A9SF24; Physcomitrella
patens d , A9TQY8; Arabidopsis thaliana , At1g03445, Q9LR78; At1g08420, Q9SJF0; At2g27210, Q9SHS7; At4g03080, Q9ZTA4; Vinis Vitifera a , A7QVR2; Vinis Vitifera b , A7PU12; Oryza sativa
indica a , A2XK43; Oryza sativa indica b , A2ZMT0; Oryza sativa japonica a , A3AKU8; Oryza sativa japonica b , A3BCU1; Oryza sativa japonica c , Q2QM47; Oryza sativa japonica d , Q10G19;
Oryza sativa japonica e , Q60EX6; Solanum lycopersicum (NCBI nucleotide database: BT013516); Babesia bovis , A7AT59; Plasmodium falciparium , Q8IKH5 identical with O96914; Plasmodium
berghei , Q4Z2M2; Plasmodium knowlesi , B3L999; Plasmodium chabaudi , Q4XN17; Plasmodium vivax ; A5K396; Plasmodium yoelii yoelii , Q7RA97; Theileria parva , Q4N2C8; Theileria annulata ,
Q4U9P3; Tetrahymena thermophila a (NCBI protein XP_001031447); Tetrahymena thermophila b , Q240X8; Tetrahymena thermophila c , Q22BC4; Neospora caninum (GeneDB NC_LIV_113060);
Cryptosporidium hominis , Q5CHL7; Cryptosporidium parvum , A3FQS0; Paramecium tetraurelia a , A0BV09; Paramecium tetraurelia b , A0BZY8; Paramecium tetraurelia c , A0CLB3; Paramecium
tetraurelia d , A0CV78; Paramecium tetraurelia e , A0DWN0; Paramecium tetraurelia f , A0D9U9; and Paramecium tetraurelia g , A0E8D8. The human PP1α sequence P62136 was used as an outgroup.
1
To whom correspondence should be addressed (email [email protected]).
c The Authors Journal compilation c 2009 Biochemical Society
G. B. G. Moorhead and others
Figure S2
For legend see last page.
c The Authors Journal compilation c 2009 Biochemical Society
Protein phosphatase evolution
Figure S2
For legend see last page.
c The Authors Journal compilation c 2009 Biochemical Society
G. B. G. Moorhead and others
Figure S2
For legend see last page.
c The Authors Journal compilation c 2009 Biochemical Society
Protein phosphatase evolution
Figure S2
For legend see last page.
c The Authors Journal compilation c 2009 Biochemical Society
G. B. G. Moorhead and others
Figure S2
Multiple sequence alignment of protein phosphatases with kelch domains
For sequence annotations and alignment method, see the legend of Figure 2 in the main text. Kelch domains are indicated above according to [2]. Conserved residues within are marked with $.
Consensus secondary structure predictions obtained from (http://npsa-pbil.ibcp.fr/) with default parameters were indicated: α-helices are shown in bold and β-strands are underlined. Indecisive
predictions are bold and underlined. Inserts in the PPKL phosphatase domains, as compared with the PPP catalytic domains, are indicated above according to [2]. Conserved residues within
the phosphatase domain, compared with human PP1α (!) or PPP family members (*) [3] are indicated with symbols shown between parentheses while residues involved in inhibitor (#) ([4] and
V. DeWever and G. Moorhead, unpublished work) and RVXF-motif binding (%) [4,5] defined based on PP1 studies are marked as indicated between parentheses.
c The Authors Journal compilation c 2009 Biochemical Society
Protein phosphatase evolution
REFERENCES
1 Kerk, D., Templeton, G. and Moorhead, G. B. (2008) Evolutionary radiation pattern of novel
protein phosphatases revealed by analysis of protein data from the completely sequenced
genomes of humans, green algae, and higher plants. Plant Physiol. 146, 351–367
2 Kutuzov, M. A. and Andreeva, A. V. (2002) Protein Ser/Thr phosphatases with kelch-like
repeat domains. Cell. Signalling 14, 745–750
3 Barton, G. J., Cohen, P. T. and Barford, D. (1994) Conservation analysis and structure
prediction of the protein serine/threonine phosphatases. Sequence similarity with
diadenosine tetraphosphatase from Escherichia coli suggests homology to the protein
phosphatases. Eur. J. Biochem. 220, 225–237
4 Mora-Garcia, S., Vert, G., Yin, Y., Cano-Delgado, A., Cheong, H. and Chory, J. (2004)
Nuclear protein phosphatases with Kelch-repeat domains modulate the response to
brassinosteroids in Arabidopsis . Genes Dev. 18, 448–460
5 Wu, X. and Tatchell, K. (2001) Mutations in yeast protein phosphatase type 1 that affect
targeting subunit binding. Biochemistry 40, 7410–7420
Received 3 October 2008/18 November; accepted 19 November 2008
Published on the Internet 23 December 2008, doi:10.1042/BJ20081986
c The Authors Journal compilation c 2009 Biochemical Society