Biochem. J. (2008) 409, 333–348 (Printed in Great Britain) 333 doi:10.1042/BJ20071097 REVIEW ARTICLE The histidine phosphatase superfamily: structure and function Daniel J. RIGDEN1 School of Biological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB, U.K. The histidine phosphatase superfamily is a large functionally diverse group of proteins. They share a conserved catalytic core centred on a histidine which becomes phosphorylated during the course of the reaction. Although the superfamily is overwhelmingly composed of phosphatases, the earliest known and arguably best-studied member is dPGM (cofactor-dependent phosphoglycerate mutase). The superfamily contains two branches sharing very limited sequence similarity: the first containing dPGM, fructose-2,6-bisphosphatase, PhoE, SixA, TIGAR [TP53 (tumour protein 53)-induced glycolysis and apoptosis regulator], Sts-1 and many other activities, and the second, smaller, branch composed mainly of acid phosphatases and phytases. Human representatives of both branches are of considerable medical interest, and various parasites contain superfamily members whose inhibition might have therapeutic value. Additionally, several phosphatases, notably the phytases, have current or potential applications in agriculture. The present review aims to draw together what is known about structure and function in the superfamily. With the benefit of an expanding set of histidine phosphatase superfamily structures, a clearer picture of the conserved elements is obtained, along with, conversely, a view of the sometimes surprising variation in substrate-binding and proton donor residues across the superfamily. This analysis should contribute to correcting a history of over- and mis-annotation in the superfamily, but also suggests that structural knowledge, from models or experimental structures, in conjunction with experimental assays, will prove vital for the future description of function in the superfamily. INTRODUCTION in the family is essential to inform future function annotation and is one of the objectives of the present review. Another is to bring together, for the first time, the literature on activities of histidine phosphatases and thereby illustrate how impressively diverse they are, involving many proteins of medical or applied interest. Research into the histidine phosphatase superfamily dates back to 1935 when an activity was found in yeast capable of interconverting 2PGA and 3PGA (2- and 3-phosphoglycerate) [1]. Of course, it was only later that the sequence of the enzyme responsible, dPGM (cofactor-dependent phosphoglycerate mutase), was determined [2], and later still when it was realized that very different activities, the first being F26BPase (fructose-2,6-bisphosphatase), could be catalysed by homologous enzymes [3]. In the 20 years since, a great many different functions have been associated with the superfamily and structures have been determined for some of the respective proteins. All of these new discoveries have been phosphatase activities of one kind or another, illustrating that, ironically enough, the earliest and arguably best studied member of the superfamily is anomalous in catalysing primarily a mutase reaction. Although the superfamily has undoubtedly benefited from containing the key glycolytic enzyme dPGM, the subject of many early studies [4], a reading of the literature also illustrates a disadvantage arising from the mutase-dominated superfamily history; many superfamily members discovered in genomes have been suggested to be mutases when a phosphatase activity is in fact much more likely. Indeed, like other superfamilies [5], the histidine phosphatases present considerable challenges to automated genome annotation tools and have a history of widespread mis- and over-annotation. Even human interpretation of sequence relationships has been regularly biased towards mutase activity rather than the actual predominant phosphatase activity [6–8]. An analysis of the structure–function relationship Key words: acid phosphatase, cofactor-dependent phosphoglycerate mutase, fructose-2,6-bisphosphatase, histidine phosphatase, structure–function relationship. SEQUENCES Iterative database searches in current databases using well-known characterized proteins such as dPGMs or F26BPases provide clear evidence for a deep split of the superfamily into two branches. PSI-BLAST searches using queries from branch 1 fail to produced branch 2 members among hit lists and vice versa. As discussed in detail below, the functions associated with branch 1 are much more diverse than those in branch 2, the latter containing mainly enzymes characterized as APs (acid phosphatases) or phytases (Table 1). Nevertheless, modern profile–profile comparison methods readily detect the distant evolutionary relationship between the two branches, a relationship revealed unambiguously by the first crystal structure of a branch 2 enzyme in 1993 [9]. In fact, even early sequence studies predicted a relationship between dPGMs and F26BPases (in branch 1) and APs (in branch 2) on the basis of their sharing a small RHG motif, near the beginning of the sequence, whose histidine residue was already known to be phosphorylated during catalysis [10]. Sequences in the histidine phosphatase superfamily are very diverse, both within and between the two principal branches. Supplementary Table S1 (http://www.BiochemJ.org/bj/409/ bj4090333add.htm) shows percentage sequence identities Abbreviations used: AP, acid phosphatase; 1,3-BPG, 1,3-bisphosphoglycerate; 2,3-BPG, 2,3-bisphosphoglycerate; BPGM, bisphosphoglycerate mutase; dPGM, cofactor-dependent phosphoglycerate mutase; EPP, ecdysteroid phosphate phosphatase; ER, endoplasmic reticulum; F26BP, fructose 2,6bisphosphate; F26BPase, fructose-2,6-bisphosphatase; HPt, histidine-containing phosphotransfer; Minpp1, multiple inositol-polyphosphate phosphatase 1; PAP, prostatic acid phosphatase; PFK-2, phosphofructokinase 2; 2PGA, 2-phosphoglycerate; 3PGA, 3-phosphoglycerate; PGAM5, phosphoglycerate mutase 5; SH3, Src homology 3; TFIIC, transcription factor IIC; TIGAR, TP53 (tumour protein 53)-induced glycolysis and apoptosis regulator; TRAP, tartrate-resistant acid phosphatase; UBASH, ubiquitin-associated and SH3 domain-containing. 1 email [email protected] c The Authors Journal compilation c 2008 Biochemical Society 334 Table 1 D. J. Rigden The principal known or reliably inferred molecular functions of histidine phosphatase domains grouped into broad functional categories Detail on reactions catalysed, their broader significance and further references can be found in Supplementary Table S2 (http://www.BiochemJ.org/bj/409/bj4090333add.htm) and in the text. Branch 1 Catalytic Broad functional category Protein and key molecular function reference Example species EC number Intermediary metabolism dPGM [4] BPGM [120] α-Ribazole phosphatase (CobC) [16] Mannitol-1-phosphatase [79] Neo13 [18] PFK2/F26BPase [121] TIGAR [28] SixA [15] E. coli , S. cerevisiae , human Human Salmonella Typhimurium Eimeria tenella Streptomyces fradiae Eukaryotes Eukaryotes E. coli 5.4.2.1 (also 3.1.3.13 and 5.4.2.4) 5.4.2.4 (also 5.4.2.1 and 3.1.3.13) 3.1.3.73 3.1.3.22 3.1.3.3.1.3.46 3.1.3.46 3.1.3.- EPP [19] 3.1.3.- Sts-1, Sts-2 (also known as UBASH3A, TULA) [20] PMU1 [8] PhoE [116] TFIIIC τ 55 kDa subunit [39] Bombyx mori , Drosophila melanogaster Human S. cerevisiae G. stearothermophilus S. cerevisiae PGAM5 [38] Pho5p [22] Human S. cerevisiae May include 3.6.1.3 Glucose-1-phosphatase [21] Phytase [23] Minpp1 [122] E. coli Aspergilli and E. coli Human 3.1.3.10 3.1.3.8, 3.1.3.26 3.1.3.62 Lysophosphatidic acid phosphatase [30] PAP [123] Human Human 3.1.3.2 3.1.3.48 Testicular acid phosphatase [34] Lysosomal acid phosphatase (LAP) and Acp5/TRAP [36] Peripheral vacuolar acid phosphatase [37] Human Human 3.1.3.48 ? Giardia lamblia ? Metabolic regulation Other regulatory or developmental process Unknown Non-catalytic (?) 2 Catalytic Extracellular metabolism/scavenging Regulation of signalling molecule concentration Other regulatory or developmental process determined for pairwise comparisons between members of known structure. In branch 1, SixA, which is considerably smaller than the other members (see below), shares only 15–19 % sequence identity with them. The similarly sized dPGM, PhoE and F26BPase share just 19–28 % sequence identity between different catalytic activities. The comparison of dPGMs and BPGMs (bisphosphoglycerate mutases) shows how simple sequenceidentity-based function annotation can easily go astray: human dPGM and BPGM, with different principal catalytic activities, share 53 % sequence identity, whereas the value for the bacterial dPGMs from Escherichia coli and Mycobacterium tuberculosis, catalysing the same reaction, is only 42 %. Between branches 1 and 2 the highest observed percentage sequence identity is just 18 %. Low values, down to 9 %, are also recorded for pairwise comparisons within branch 2 where less well conserved N- and C-terminal extensions can contribute to diversity. The division of the superfamily is reflected in its twin entries in the Pfam domain database [11] (http://pfam.sanger.ac.uk/). The ‘phosphoglycerate mutase family’ entry (PF00300) corresponds to branch 1 and the ‘histidine acid phosphatase’ entry (PF00328) represents branch 2. Their relationship is acknowledged by their union in the ‘phosphoglycerate mutase-like superfamily’ clan (CL0071). In the present version of the Pfam database (version 22, July 2007), branch 1 with just over 3000 members outnumbers branch 2 by four to one. Interestingly, archaea contain no members of branch 2 and just 37 representatives of branch 1. These numbers, and the diversity of the archaeal sequences, suggest that the common ancestor of archaea lacked a histidine phosphatase and a small number of organisms later acquired branch 1 members by lateral gene transfer. Interestingly, branch 2 proteins are largely found in eukaryotes (634 sequences) with fewer in c The Authors Journal compilation c 2008 Biochemical Society 3.1.3.3.1.3.3.1.3.- bacteria (108 sequences). In branch 1, the reverse is true, with bacterial sequences (2192) outnumbering eukaryotic proteins (781). The COG (Clusters of Orthologous Groups) database [12] (http://www.ncbi.nlm.nih.gov/COG/), designed to group orthologous proteins from genome sequences, covers a small portion of the histidine phosphatase superfamily with three entries. However, there are a number of problems with these entries, problems that automated annotation pipelines have unfortunately propagated into large numbers of individual sequence annotations. For example, examination of COG0588, ‘phosphoglycerate mutase 1’, shows that many members do not share substrate-binding residues (discussed later) with the well-characterized E. coli dPGM and therefore are probably not genuine dPGM enzymes. Worse, COG0406, containing predominantly bacterial sequences, is labelled ‘fructose-2,6-bisphosphatase’ on the basis of the inclusion of a Saccharomyces cerevisiae sequence with that activity, yet F26BP has not been detected outside eukaryotes. {A putatively active PFK-2 (phosphofructokinase 2)/F26BPase has been found in two Desulfovibrio species [13,14], neither present in COG0406.} The remaining entry is COG2062, ‘phosphohistidine phosphatase SixA’. Importantly, many members of the superfamily cannot be reliably annotated with any of the known functions (see below) since they bear very limited sequence similarity to characterized members. Clustering experiments (D. J. Rigden, unpublished work) show many well-defined groups, presumably sharing the same function, for which no functional data are available. Given sufficient knowledge of the structure–function relationship in the superfamily, it should be possible to apply bioinformatics tools to predict their functions. Structure and function in the histidine phosphatase superfamily FUNCTIONS A census Supplementary Table S2 (http://www.BiochemJ.org/bj/409/ bj4090333add.htm) presents a compilation of functions characterized for members of the histidine phosphatase superfamily. It contains only activities, demonstrated or strongly inferred, with potential physiological relevance: many enzymes will hydrolyse unnatural model substrates, for example. A condensed version with activities placed in broad categories is shown as Table 1. The most impressive feature of the superfamily is the range of activities catalysed. This is manifest most obviously in the presence of the dPGMs in a list otherwise dominated by phosphatases. As discussed later, a common catalytic machinery is shared by mutases and phosphatases. The list also shows that specificities among enzymes vary dramatically from small molecules such as phosphoglycerate (M r = 187), the substrate of dPGM, to large phosphorylated proteins such as the HPt (histidine-containing phosphotransfer) domain of ArcB [15], the substrate of SixA (M r = 9050). dPGM, as a participant in both glycolysis and gluconeogenesis, contributes to both catabolic and anabolic pathways. Other anabolic enzymes include CobC, involved in cobalamin synthesis [16], mannitol-1-phosphatase, which forms the Eimeria parasite storage compound mannitol [17], and Neo13, characterized from Streptomyces fradiae, which contributes to synthesis of the antibiotic neomycin [18]. Catabolic enzymes include EPP (ecdysteroid phosphate phosphatase), which releases free ecdysteroids from conjugated reserves in insects [19,20], and glucose-1-phosphatase, required for growth of E. coli on a glucose 1-phosphate carbon source [21]. Fungi have two superfamily members involved in scavenging phosphate from extracellular sources: Pho5p, which degrades extracellular nucleotides [22], and phytase, which releases the phosphate groups from phytic acid [23,24]. As well as metabolic functions, a surprisingly large proportion of the family is involved in signalling or regulation of one kind or another. EPP, mentioned above, is likely to be important for insect development, whereas two distinct members of the family, F26BPase and TIGAR [TP53 (tumour protein 53)-induced glycolysis and apoptosis regulator], have roles, demonstrated or strongly suggested respectively, in controlling the concentration of intracellular F26BP (fructose 2,6-bisphosphate) and thereby influencing the rates of glycolysis and gluconeogenesis in eukaryotic cells [25–27]. For a fuller discussion of F26BPases, and their N-terminal PFK-2 domains, the reader is referred to other reviews [13,14]. Interestingly, the well-characterized F26BP-binding residues of F26BPase are not conserved in TIGAR, indicating that the two proteins have converged on the same substrate and share more distant ancestry than their common activity would suggest [28]. The primary in vivo role of PMU1 (whose name derives from mis-annotation as a potential phosphomutase) in yeast is likely to involve regulation of the concentration of another small molecule, AICAR (5-amino-4imidazolecarboxamide riboside), and which has an influence over transcription of adenine synthesis genes [8]. Eukaryotic Minpp1 (multiple inositol-polyphosphate phosphatase 1) regulates the cellular concentrations of other signalling molecules, namely inositol pentakisphosphate and inositol hexakisphosphate [29]. Also in eukaryotes, the signalling molecule lysophosphatidic acid is a known substrate of both lysophosphatidic acid phosphatase [30] and PAP (prostatic acid phosphatase) [31]. Protein phosphatases from both branches have regulatory roles. In branch 1, SixA acts on a phosphohistidine residue in ArcB with a well-characterized role as a regulator of two-component 335 signalling in bacteria [15,32]. In branch 2, superfamily members target phosphotyrosine residues in important regulatory proteins such as ErbB2 [33], ErbB4 [34] and the EGF (epidermal growth factor) receptor [35]. Other protein phosphatase activities in branch 2 have interestingly different roles, in vertebrate bone development [36] and in excystation of the parasite Giardia lamblia [37]. In two cases, eukaryotic members of the superfamily have had wholly non-catalytic roles proposed for them: protein– protein interactions in the case of human PGAM5 [38] and stabilization/assembly in the case of the TFIIIC (transcription factor IIIC) τ 55 kDa subunit in yeast [39]. However, in both cases, the prerequisites for catalysis (see below) are apparently present so that catalytic roles remain possible. This situation is graphically illustrated by the story of Sts-1, a key T-cell regulator [6,40,41] with a multidomain structure. The presence of a histidine phosphatase domain led investigators to test for dPGM activity and find none [6]. Since loss of that domain destroyed the dimeric quaternary structure, it was proposed that it functioned merely to assemble the dimer [6]. However, after its close kinship with EPP was recognized, it was tested for and was found to have activity against ecdysteroid and steroid phosphates [20], although its true in vivo substrate may be neither. There are several cases where indirect evidence provides broad clues as to the function of histidine phosphatase superfamily members. For example, the cell-surface Pho4 protein of Schizosaccharomyces pombe can be repressed by thiamine, suggesting a role in thiamine metabolism, possibly cleavage of a thiamine phosphate [42]. In the case of BluF from Rhodobacter capsulatus, it is the genomic location of its gene in an operon containing exclusively cobalamin synthesis genes which argues strongly for a role (distinct from CobC above, also present in R. capsulatus) in that process [43]. Similarly, ORF2 (open reading frame 2) of the plasmid-borne avrPphF gene in Pseudomonas syringae pathovar phaseolicola, a histidine phosphatase, was implicated in bacterial virulence, a role later confirmed, although its substrate remains unknown [44]. For a homologue from Arabidopsis thaliana, At74, induction by glucose and an increase in glucose uptake by overexpressing plants point to a role in carbohydrate metabolism [45]. Domain fusions are among the most potent sources of functional clues in the superfamily. Figure 1 illustrates the domain content of at least partly characterized members of the histidine phosphatase superfamily. Genome searches suggest that further functionally suggestive domain architectures exist in other parts of the superfamily (D. J. Rigden, unpublished work). First among these domain combinations is the pairing of F26BPase with the kinase domain (placed N-terminally) that catalyses the nearreverse reaction: synthesis of F26BP. The relative activity of these two domains is regulated by phosphorylation state in a way not yet fully understood [14], and differs between mammalian isoenzymes due to intrinsic sequence features [46]. Very recently, a potentially similar situation has been discovered involving a branch 2 histidine phosphatase domain [47]. The N-terminal domain of yeast Vip1 has inositol-hexakisphosphate kinase activity and is followed by a phosphatase domain of as yet unknown specificity, but which is very likely to act on multiply phosphorylated inositol species. The SH3 (Src homology 3) and UBASH (ubiquitinassociated and SH3 domain-containing) domains found in EPP and Sts-1, suggestive of protein- and ubiquitin-binding roles respectively, have been investigated experimentally [6], confirming predictions. The same proteins also contain 2H phosphoesterase domains [48], as yet uncharacterized, but likely to share ligands in some way with the histidine phosphatase domain. Additional domains have been noted in some PFK-2/F26BPase sequences [13,49]. Thus, in some trypanosomatid sequences, multiple c The Authors Journal compilation c 2008 Biochemical Society 336 Figure 1 D. J. Rigden Domain architectures of selected histidine phosphatase superfamily members All domain combinations that are at least partially characterized are shown, as well as some designed to illustrate the range of sizes of phosphatase domains. Domains are drawn approximately to scale. Catalytic domains are coloured blue (histidine phosphatase domains; dark for branch 1, light for branch 2), red (PFK-2 domains), purple (the 2H phosphoesterase domain seen in Sts-1 and relatives [20]), pink (the reductase domain of enzymes involved in opine synthesis [50–52]) and brown (the Vip1 kinase domain [47]). Other domains are coloured mauve (the carbohydrate-binding module of plant PFK-2/F26BPases [13]), green (the ankyrin repeats seen in some trypanosomatid PFK-2/F26BPases [49]; darker shades indicating more reliably assigned repeats), and grey [UBA (ubiquitin-associated)] or yellow (SH3), both found in Sts-1 and relatives [20]. ankyrin repeat motifs are present [49], indicating a possible site of interaction of the PFK-2/F26BPase protein with others, while plant homologues contain a carbohydrate-binding module at their N-terminus, suggesting a possible means to control PFK-2/ F26BPase activity [13]. Finally, histidine phosphatase domains are found N-terminally fused to dehydrogenase domains in proteins involved in opine synthesis [50–52]. With the C-terminal domains assigned functions such as agropine synthesis reductase [50], it is clear that the histidine phosphatase domains will be involved in some way in the opine synthetic pathway. Subcellular localization When the known subcellular localizations of histidine phosphatase superfamily members are listed, an interesting dichotomy is observed. In branch 1, cytoplasmic location is the norm with other members found partially or wholly in the nucleus (Table 2). The only known exceptions to this rule are the predicted periplasmic location of the Ais/AfrS family [53], and the presence in yeast cell wall of dPGM [54]. In contrast, all branch 2 family members appear to enter the secretory pathway. Some remain in the ER (endoplasmic reticulum), others are found at the cell surface, periplasm or cell wall, and others are simply secreted (Table 2). Lysosomal AP, unusually, is transported as a type I membrane protein from the ER, via the plasma membrane, to the c The Authors Journal compilation c 2008 Biochemical Society lysosome [55] where its luminal portion is slowly proteolytically cleaved. There seems to be no obvious intrinsic reason for the largely distinct subcellular localizations of the two branches, but the fact that the division is visible across bacteria and eukaryotes perhaps suggests that each branch became specialized for an intra- or extra-cellular use soon after an early divergence. In accordance with their subcellular localization, branch 2 enzymes commonly, although not invariably, contain disulfide bonds, with Swiss-Prot [56] entries cataloguing anything from zero to five of them per protein. This range is confirmed by available crystal structures, since an enzyme from Francisella tularensis (PDB code 2GLC) has none, whereas Aspergillus fumigatus phytase (PDB code 1QWO) has five. The disulfide bonds, where present, are found throughout the structure, but not near the catalytic site, indicating further a general stabilizing function rather than any regulatory role. Swiss-Prot entries indicate no known disulfide bonds in branch 1, but conservation and proximity in a molecular model suggest strongly that the unusual periplasmic branch 1 enzymes of the Ais/AfrS family contain two [53]. Medical and applied importance As expected from their diverse important roles, deficiencies or mutations in many human members of the histidine phosphatase superfamily, from both branches 1 and 2, lead to diseases. Structure and function in the histidine phosphatase superfamily Table 2 337 Subcellular localization of some histidine phosphatase superfamily members Branch Localization Protein Representative species Evidence for localization 1 Cytosol dPGM BPGM F26BPase TFIIIC τ 55 kDa subunit UBASH3A/TULA/SRS-2 PGAM5 dPGM YOR283W YDR051C dPGM Ais Human Rat Rat S. cerevisiae Human Human Rat S. cerevisiae S. cerevisiae S. cerevisiae E. coli [4] [124] [125] [39] [6,126] [38] [127,128] [129] [129] [54] Predicted in [53] Lysophosphatidic acid phosphatase Multiple inositol-polyphosphate phosphatase AP AP Testicular acid phosphatase Thiamine-repressible acid phosphatase precursor Phytase Glucose-1-phosphatase Pho5p Phytase Human Human Leishmania mexicana Leishmania donovani Human S. pombe E. coli E. coli S. cerevisiae Aspergillus ficuum [130] [29,122] [131] [132] Swiss-Prot [56] [133] [134] [21] [135] [136] Nuclear Cytosol and nuclear Cytosol and cell wall Periplasm 2 Lysosome ER/plasma membrane Endosomal/lysosomal compartment Cell membrane (type I, catalytic domain outside cell) Cell membrane (type I, catalytic domain outside cell) Cell wall Periplasm Secreted Deficiency conditions of varying degrees of severity result from loss of activity of dPGM [57], BPGM [58], lysosomal AP [59] or macrophage AP [60]. For some branch 1 human enzymes, there is potential therapeutic interest in inhibition of activity. One exciting emerging medical angle concerns dPGM as a sensitive node of the glycolytic pathway, which is up-regulated in cancer cells of various kinds [61]. For example, when phosphorylation of dPGM was prevented in a tumour cell line, a reduction in glycolysis and arrest of tumour growth was observed [62]. Later, a compound MJE3 isolated as having an anti-proliferative effect on breast tumour cells [63] was found to target dPGM by covalent attachment of a spiroepoxide group to a lysine residue near the catalytic site [64]. Modulation of erythrocyte BPGM activity is also of potential medical interest. BPGM is the sole enzyme responsible for controlling the concentration of 2,3-BPG (2,3-bisphosphoglycerate) in erythrocytes. Its synthase activity produces 2,3-BPG, while the phosphatase side reaction degrades it (Table 1). 2,3-BPG lowers the apparent affinity of haemoglobin for oxygen by binding selectively to the deoxygenated form. A rise in 2,3-BPG concentration is consistently seen in conditions of hypoxia [65], suggesting an adaptive response while lowering 2,3BPG concentrations has an anti-sickling effect [66]. Since BPGM is responsible for the loss of 2,3-BPG in stored blood, there is also an applied interest in discovery of an agent that would inhibit its phosphatase activity [67]. Although it is fair to say that we still lack a complete understanding of AP activities and in vivo substrates, these branch 2 enzymes have important clinical applications (reviewed in [68]). In particular, PAP has long been used as a serum marker for prostate cancer [69], and its overexpression on cancer cells is being exploited for immunotherapy [70]. More speculatively, both well-characterized activities of PAP lead to down-regulation of cancer growth [31,33], suggesting that the enzyme may have therapeutic value. Conversely, should inhibition of the enzyme be desired, good inhibitors are already available [71]. TRAP (tartrateresistant acid phosphatase) is a marker for bone resorption and can be monitored in patients with metabolic bone disorders [72]. Again in humans, the suggestion that multiple inositol- polyphosphate phosphatase could contribute to the pathogenesis of some thyroid carcinomas [73] means that specific inhibitors could be of therapeutic interest. In other medically relevant species, histidine phosphatases are also of interest. For example, a peripheral vacuolar AP is required for excystation of the parasite Giardia lamblia [37], suggesting that a selective inhibitor of that enzyme might have value as an anti-parasitic agent. In another group of parasites, the Leishmania species, three APs have been described and, although conclusive evidence is lacking, their conservation among Leishmania species is interpreted as suggestive of an important role in the growth or development of the parasite [74]. Finally, a better understanding of the molecular functions of the histidine phosphatases implicated in synthesis of different classes of antibiotics [18,50–52,75] may help efforts to produce novel antibiotics. The most well-known applied use of a histidine phosphatase is the addition of phytase to animal feed, first in order to increase the availability of the phosphate component of phytate, obviating the need to supply phosphorus supplements, and secondly to liberate iron and other trace elements from their phytate chelation [76]. To this end, phytases with improved molecular properties have been both designed through mutations of known enzymes [77] and sought by screening (e.g. [78]). Another potential agricultural application could involve EPP [19,20] as a novel insecticide target. Although formal validation is still required, the role of EPP in liberating inactive stores of ecdysteroids during insect development suggests that its inhibition should have serious consequences for the insect. Finally, parasitic protozoa of the genus Eimeria are a significant problem in the poultry industry. These organisms use mannitol-1-phosphatase during the synthesis of mannitol, an essential storage compound, suggesting that inhibitors of this enzyme, absent from the host, could be useful for parasite control [79]. STRUCTURES Table 3 presents a compilation of all histidine phosphatase superfamily structures available at the time of writing. As the c The Authors Journal compilation c 2008 Biochemical Society 338 D. J. Rigden c The Authors Journal compilation c 2008 Biochemical Society Table 3 Available structures of members of the histidine phosphatase superfamily 1 Å = 0.1 nm Major branch Enzyme activity Source Year of deposition in the PDB PDB code(s) Resolution (Å) Reference(s) Notes 1 dPGM S. cerevisiae 1975 1PGM* 3.5 [137,138] 1982 1997 3PGM† 4PGM 2.8 2.3 [139] [81] The first superfamily crystal structure, published in 1974 [138] following an earlier low-resolution structure [137]. No sequence available. The first sequence-traced structure. Corrects an error in the conformation of the active-site histidine residues of earlier structures. S. pombe E. coli 1998–1999 2000 2000 5PGM, 1BQ3, 1BQ4, 1QHF 1FZT 1E58, 1E59 1.7–2.7 n/a 1.25–1.3 [107,108,140] [141] [89,106] M. tuberculosis Plasmodium falciparum Human (B-type) Human erythrocyte 2003 2004 2005 2004–2005 1RII 1XQ9 1YJX, 1YFK 1T8P, 2A9J, 2HHJ, 2F90, 2H52, 2H4X, 2H4Z 1.7 2.6 2.7–2.8 1.5–2.5 [142] Unpublished [112] [90,109] G. stearothermophilus Rat testis Human liver E. coli M. tuberculosis T. thermophilus Rat Human Aspergillus ficuum Aspergillus niger Aspergillus fumigatus E. coli E. coli Francisella tularensis 2001–2002 1996–2000 2001 2003 2005 2003–2006 1993 1998–2002 1997 1999 2003–2004 1999 2003 2006 1EBB, 1H2E, 1H2F 1BIF, 2BIF, 3BIF, 1FBT, 1TIP, 1C80, 1C7Z, 1C81 1K6M 1UJB, 1UJC 2A6P 1V37, 2HIA, 1V7Q 1RPA, 1RPT 2HPA, 1CVI, 1ND5, 1ND6 1IHP 1QFX 1QWO, 1SKA, 1SKB, 1SK9 1DKL, 1DKM, 1DKN, 1DKO, 1DKP, 1DKQ 1NT4 2GLA, 2GLB, 2GLC 1.69–2.3 2.0–2.6 2.4 1.9–2.06 2.2 1.4–1.6 3.0 2.4–3.2 2.5 2.4 1.5–1.69 2.05–2.4 2.4 1.65–1.85 [92,114] [100,143–146] [46] [85] [86] Unpublished [9,82,148] [71,149,150] [151] [152] [93,115] [103] [91] Unpublished BPGM PhoE F26BPase 2 SixA Phosphatase of unknown specificity Unknown PAP Phytase Glucose-1-phosphatase Unknown *Obsolete, no longer available in the regular PDB. †Obsolete, but still present in the PDB. The only NMR structure of a superfamily member. Includes the highest resolution structure in the superfamily, phosphorylated enzyme [89]. Includes a remarkable set of structures providing snapshots along the process of enzyme phosphorylation by 2,3-BPG [90]. Includes the first structure of a phosphoenzyme in the superfamily [144]. Representative of the minimal core fold of the superfamily. Unpublished Structural Genomics outputs [147]. The first, chloride-bound structure is not apparently deposited in the PDB. Includes complexes relevant to rational inhibitor design. Include a high-resolution structure of the phosphorylated enzyme. Structure and function in the histidine phosphatase superfamily Figure 2 339 Consensus tree derived from analysis of all histidine phosphatase superfamily structures The Figure was generated at the PROCKSI server (http://www.procksi.net). References for the protein structures are shown in Table 3. deposition dates show, structural studies date back to the early days of protein crystallography. The first dPGM structure was deposited in 1975, 1 year after publication of the first Protein Data Bank [80] newsletter listed just 12 proteins for which co-ordinates were available. Unfortunately, erroneous positioning of catalytic histidine residues misled efforts to understand the mechanism [4] until the availability of a corrected structure in 1997 [81]. The first branch 2 crystal structure was that of rat PAP in 1993 [82]. In recent years, structure determinations have accelerated, so that at least nine distinct molecular functions are now covered. In order to provide a structural context for discussion of catalysis and substrate diversity, all known structures of the superfamily were collected together and superimposed with MUSTANG [83] with post-processing of the results with STACCATO [84]. A dendrogram representation of the structures, on the basis of a consensus calculation on the results of six different structural superposition methods, was created using the PROCKSI server (http://www.procksi.net) and is shown in Figure 2. This Figure confirms the broad division of the superfamily into two branches, the first containing dPGMs, F26BPases and other activities, and the second containing APs and phytases. A selection of 11 structures was made in order to represent the broad diversity of known structures in the superfamily. Comparison of these, in order of increasing size, with the nearminimal core α/β domain present in SixA [85] reveals how insertions into the fold differentiate between different lineages and define the substrate-binding site, thereby conferring the observed activity. These insertions, sometimes as long as the core domain itself, together with the sometimes substantial N- and C-terminal extensions, explain the fact that the largest histidine phosphatase c The Authors Journal compilation c 2008 Biochemical Society 340 D. J. Rigden structure yet determined is almost three times the size of the smallest. PDB accession codes and references can be found in Table 3. A structure-based sequence alignment of representative histidine phosphatases is shown in Figure 3. The principal structural difference between SixA and the Thermus thermophilus protein of unknown function lies in the 35residue insertion of the latter between β3 and α3. An insertion, of at least this size, is present here in all structures apart from SixA. In addition, the β1–α1 loop of the T. thermophilus protein is slightly larger and adopts a different conformation, packing against the β3–α3 insertion. The characterized M. tuberculosis phosphatase [86] resembles the T. thermophilus protein, but has a slightly larger α4–β5 loop which also packs against the β3–α3 insertion. The monomeric S. pombe dPGM and Geobacillus stearothermophilus PhoE (Figure 4b) follow this same general scheme, although, of course, with detailed sequence differences defining different specificities (see below). In F26BPase, a substantial 25-residue C-terminal tail is added to this scheme (Figure 4c). The tail twice traverses the entrance to the catalytic site, essentially blocking substrate access, and its removal increases F26BPase activity [87]. Although a tail is absent from S. pombe dPGM, the E. coli dPGM structures show how the dPGM C-terminal tails, where present, adopt a completely different conformation from those in F26BPases (Figure 4d). dPGM tails are shorter and lie along the rim of the entrance to the catalytic site. Again with the exception of the S. pombe enzymes, dPGMs contain an additional ∼ 25-residue section in the large β3–α3 insertion (Figure 4d), which, in the tetrameric enzymes such as that from S. cerevisiae, is involved in forming the dimer–dimer interface. Insertions from dimeric dPGMs exhibit sequence and conformational similarity to those of tetrameric enzymes, illustrating that relatively small sequence differences must define quaternary state. Indeed a single mutation at this interface in S. cerevisiae dPGM dramatically destabilized the tetrameric state without affecting kinetic properties [88]. Sequence differences between dimeric E. coli dPGM and the tetrameric S. cerevisiae enzyme in this region have been analysed [89]. In branch 2, the comparison of the E. coli glucose-1phosphatase structure with SixA again shows insertions after β1 and β3, the former once again packing against the latter (Figure 4e). Importantly, however, the β3–α3 insertion is very large with approx. 100 residues. This insertion bears no obvious sequence similarity to the branch 1 insertions at the same position, and is composed almost entirely of α-helices. The E. coli glucose1-phosphatase structure contains other insertions far from the catalytic site and a long C-terminal region, again apparently unrelated to comparable regions in branch 1 of the superfamily. The C-terminal extension forms a short region of antiparallel βsheet with β6. In E. coli phytase, the β3–α3 insertion is even longer, at 140 residues almost as long as SixA itself (Figure 4f). Finally, and remarkably, the A. fumigatus phytase structure differs from the two branch 2 structures mentioned above in having a large N-terminal extension, of largely irregular secondary structure, which inserts into the back of the substrate-binding site, opposite the entrance (Figure 4g), resulting in the availability of Gln27 and Tyr28 for substrate interaction (see below). The superimposed structures reveal that, quite surprisingly, the entrances to the catalytic sites lie on opposite faces of the protein in the two branches. Thus Figures 4(b)–4(d) show three branch 1 enzymes where substrate enters from the right past the C-terminal regions (red) with important roles in allowing and responding to substrate binding. The three branch 2 structures shown in Figures 4(e)–4(g) contain catalytic sites accessed from the left. This unusual situation has presumably been made possible by the positioning of most of the catalytic apparatus after the central two c The Authors Journal compilation c 2008 Biochemical Society strands, β1 and β4, of the core β-sheet. In this way, additions to the fold can form a substrate-binding site facing in either direction. It is tempting to consider the simple SixA structure, with few short loops between the regular secondary structure elements of the core (Figure 4a), as resembling the ancestor of all present-day members of the superfamily. By this hypothesis, evolution added insertions to the basic scaffold to produce structures containing cavities suitable for binding of small-molecule substrates. The absolute lack of sequence or structural similarity between the respective insertions in branch 1 and branch 2 structures (Figure 4) strongly suggests that this decoration of a basic fold happened twice independently. Furthermore, the parallel processes of elaboration on the basic fold began early, since both branches are widely represented in both bacteria and eukaryotes. CATALYSIS Structural determinants Catalytic activity in the superfamily centres on phosphorylation and dephosphorylation of a histidine residue that follows the first β-strand of the fold (His8 in E. coli SixA) (see Figures 5 and 6). A scheme for this catalytic cycle is shown in Figure 5. The in-line transfer [90] of the phospho group from substrate to enzyme occurs with the aid of several residues, forming a ‘phosphate pocket’, that hydrogen bond to the phospho group before, during and after transfer. These include a pair of flanking arginine residues, positions 7 and 55 in E. coli SixA numbering, and the other histidine, His108 , which are completely conserved in known active members of the superfamily (see, e.g., Figure 3). Interestingly, the alignment also reveals a few other residues that are highly conserved between otherwise very diverse proteins (see also Figure 6). The most obvious of these is Gly9 which forms part of the well-known and characteristic RHG motif. Gly9 packs closely against the protein core and is well-conserved as a glycine or occasionally an alanine residue. The structure of E. coli glucose-1-phosphatase (PDB code 1NT4 [91]) captures a very rare example of an asparagine residue at this position and shows how this larger side chain leads to structural displacements elsewhere in the fold. The carbonyl of Gly9 makes an important hydrogen bond with the phosphorylable histidine side chain holding it in an appropriate conformation for catalysis (Figure 6). The backbone nitrogen of Gly9 makes a further hydrogen bond with the side chain of Thr59 (SixA numbering) which is > 95 % conserved as threonine or serine in both branches of the superfamily. The alignment also reveals a L[S/T]XXG motif in the region between β1 and β2. As shown in Figure 6, the glycine residue (numbered 27 in SixA) is conserved for steric reasons; a larger side chain here would disturb the catalytic apparatus. The leucine side chain contacts several key residues (His8 , Gly9 and Arg55 ), while the threonine/serine residue stabilizes the local structure through hydrogen bonds to both the backbone of the neighbour of Gly9 (not sequenceconserved) and the backbone of Gly27 . Taken as a whole, this hotspot of conservation in the second layer behind the catalytic site illustrates how important maintenance of catalytic histidine conformation is for activity. Aside from the two histidine and two arginine residues mentioned above, other interactions with the phospho group (the PP residue in Figure 5) vary to a surprising extent (see Figure 3). In most of branch 1, Asn16 (E. coli dPGM numbering) provides a hydrogen bond. In the M. tuberculosis phosphatase [86], a serine residue is present at this position and may substitute functionally, although with the available crystal structure, and its nonphysiological ligands, it is hard to say. Any Asn16 -like interaction is clearly absent in SixA, however, due to a local conformational Structure and function in the histidine phosphatase superfamily Figure 3 341 Structure-based sequence alignment of selected histidine phosphatase superfamily members The Figure was generated by post-processing of a MUSTANG [83] structure alignment with STACCATO [84]. The structures shown are PDB codes 1UJC, E. coli SixA [85]; 1H2E, G. stearothermophilus PhoE [92]; 1K6M, human liver F26BPase [46]; 1E58, E. coli dPGM [89]; 1NT4, E. coli glucose-1-phosphatase [91]; 1DKQ, E. coli phytase [103]; 1QWO, A. fumigatus phytase [93]. The secondary structures of the smallest (1UJC) and largest (1QWO) proteins are shown below the alignment with β-strands of the core β-sheet (six in 1UJC, seven in 1QWO) and selected helices numbered. The alignment is shaded blue according to sequence conservation with certain sets of residues (see text for details) picked out as follows: white on red, conserved catalytic core (see also Figure 6); black on green, proton donors; black on yellow, additional members of the ‘phosphate pocket’; black on purple, substrate binding residues. Boxed blue residues are non-binding, but conserved positions lying near the catalytic histidine (see Figure 6 and text). Note that the alanine mutations present at position 17 of 1DKQ and position 18 of 1NT4 have been replaced by the naturally occurring histidine. The Figure was produced using JALVIEW [153]. c The Authors Journal compilation c 2008 Biochemical Society 342 Figure 4 D. J. Rigden Cross-eyed stereo cartoons of structures of representative histidine phosphatase superfamily members Structures are shown as cartoons and are coloured grey except for orange, N-terminal region; red, C-terminal tail; magenta, insertion in the β3–α3 region; blue, insertion in the β1–α1 region; and turquoise, oligomeric dPGM-specific insertion (d). The structures are: (a) PDB code 1UJC, E. coli SixA [85]; (b) PDB code 1H2E, G. stearothermophilus PhoE [92]; (c) PDB code 1K6M, human liver F26BPase [46]; (d) PDB code 1E58, E. coli dPGM [89]; (e) PDB code 1NT4, E. coli glucose-1-phosphatase [91]; (f) PDB code 1DKQ, E. coli phytase [103]; (g) PDB code 1QWO, A. fumigatus phytase [93]. The phosphorylable histidine residue of SixA is shown as green spheres to localize the catalytic site. difference [85]. An additional interaction with the phospho group is provided by Gln22 in PhoE [92]. In SixA, allowing for a small conformational change on substrate binding, Arg21 is well positioned to interact with a phospho group [85], as predicted [53]. The same modelling predicted that a different residue, Arg64 would fulfil the corresponding role in the Ais family [53]. Recent modelling suggests that a further lineage in the superfamily, the c The Authors Journal compilation c 2008 Biochemical Society EPP/Sts-1 family, has independently evolved an arginine residue at this position [20]. In branch 2 of the family, the phosphorylated structure of A. fumigatus phytase (PDB code 1QWO; [93]) reveals that the pair of arginine residues and second histidine residue are supplemented by Arg62 which is positioned similarly to Asn16 in E. coli dPGM [93]. The same interactions are predicted to hold for all branch 2 enzymes of presently known structure. Structure and function in the histidine phosphatase superfamily Figure 5 343 Catalytic mechanism of the histidine phosphatase superfamily The essentially invariant four residues of the catalytic core (see also Figures 3 and 6) are shown numbered as in E. coli SixA. His8 is phosphorylated during the course of the reaction. The other three residues interact electrostatically with the phospho group before, during and after its transfer and form most or all of the ‘phosphate pocket’. Additional neutral or positive residues, represented as PP in the diagram, may also contribute to the ‘phosphate pocket’ by hydrogen-bonding to the phospho group (see also Figure 3). The proton donor, an aspartate or glutamate residue whose position varies in different families (Figure 3), is shown as PD. Figure 6 The near-superimposable conserved catalytic cores of E. coli SixA and A. fumigatus phytase The cores are shown in green and nearby conserved residues in purple for E. coli SixA (left; 156 residues; PDB code 1UJC [85]) and A. fumigatus phytase (right; 442 residues; PDB code 1QWO [93]) which share 12 % sequence identity overall. Broken lines represent hydrogen bonds. Selected β-strands are drawn and labelled. The tungstate ion binding to the ‘phosphate pocket’ of SixA is drawn as ball-and-stick in both panels. The Figure was generated using PyMOL (DeLano Scientific; http://www.pymol.org). A three-dimensional interactive version of this Figure can be seen at http://www.BiochemJ.org/bj/409/0333/bj4090333add.htm. For the most studied members of the superfamily, dPGM, F26BPase and AP, there have been many mutagenesis studies supporting the roles of the key catalytic residues (see, e.g., [88,94–98]). Typically, loss of these residues results in mutant proteins with no or much reduced catalytic activity, but there is an important exception that should be mentioned. When the phosphorylable His256 of rat testis F26BPase was replaced with an alanine residue, a highly unexpected 17 % of catalytic activity remained [99], probably due to water taking over the nucleophilic role [100]. This result argues that caution should be exercised in inferring lack of catalytic activity for homologous sequences lacking elements of the otherwise conserved catalytic machinery. An important component of the catalytic scheme (Figure 5) is the proton donor, required to donate a proton to the leaving group as the substrate transfers its phospho group to the enzyme. The only exception, not requiring proton donation, would be priming of dPGM with 1,3-BPG, involving transfer of the acyl 1phosphate [4]. In most branch 1 members, a conserved glutamate residue, Glu88 (E. coli dPGM numbering), functions as proton donor [101]. It was therefore a surprise when it was realized that the lineages having a minimal fold lack this residue [92] and are predicted to use differently placed aspartate residues for this function. The crystal structure of SixA [85] suggests that the predicted conserved Asp18 is indeed likely to act as proton donor. In branch 2, early mutagenesis data clearly pointed to Asp304 (E. coli phytase numbering) as the proton donor [102], and crystal structures show it to be well positioned with respect to the rest of the catalytic apparatus. The side chain of Asp304 , c The Authors Journal compilation c 2008 Biochemical Society 344 D. J. Rigden although contributed by a different part of the sequence, occupies approximately the same position in superimposed structures as Glu88 in E. coli dPGM. Asp304 as proton donor remains the consensus view, but it has been argued that the equivalent to the preceding histidine residue may, under certain circumstances, act as a secondary proton donor, at least in glucose-1-phosphatase [91]. This would require movement of the histidine side chain from the crystallographically observed position [91]. It is worth noting that such movement would be harder in branch 1 enzymes since proton donor Glu88 , or non-catalytic Glu74 in SixA, packs against the histidine side chain. A further suggestion is that Asp304 and equivalents simply facilitate an intramolecular proton movement on the substrate [71]. During the hydrolysis of the phosphoenzyme structure, the same acidic residues activate bound water molecules for nucleophilic attack. Crystal structures have revealed suitably positioned water molecules in, for example, E. coli dPGM [89] and E. coli phytase [103] complex structures. When proton donors are highlighted on a sequence alignment (Figure 3), it becomes clear that, within the superfamily, they can be placed either shortly after strands β1 (SixA), β3 (dPGMs etc.) or β4 (branch 2). The anomalous mutases Most of the superfamily are simple phosphatases carrying out the scheme in Figure 5. However, there are anomalous members in the form of the mutases: dPGMs and BPGMs. Each of these catalyses three reactions (Table 1), but at very different relative rates [4]. Of these, the phosphatase reaction conforms to the scheme in Figure 5. The ‘priming’ of dPGMs, i.e. their phosphorylation by 2,3-BPG or 1,3-BPG on the phosphorylable histidine residue, can be considered as an incomplete phosphatase reaction. The ‘synthase’ reaction, the production of 2,3-BPG from 1,3-BPG, the predominant activity of BPGM, would first involve the transfer of the 1-phosphate of 1,3-BPG to enzyme. After reorientation of the resulting 3PGA in the catalytic site, the phospho group attaches to the 2-position and product 2,3-BPG is released from the enzyme. The mutase reaction, catalysed at the highest rate by dPGMs, involves binding of one phosphoglycerate form, either 3PGA or 2PGA, to the already phosphorylated (‘primed’) enzyme and transfer of phospho group to the empty 2- or 3-position. Intermediate 2,3-BPG then reorients within the catalytic site, transferring the other phospho group to the enzyme. The primed form of the enzyme is thereby restored and phosphoglycerate is released. The distinguishing characteristic of mutases compared with phosphatases is thus the ability to maintain intermediates bound while allowing their reorientation with the active site. In the case of dPGMs, a plausible mechanism for this reorientation has been proposed [104]. Differences between dPGMs and BPGMs relate first to the necessity of BPGMs to release 2,3-BPG, whereas for dPGMs, it is the obligatory intermediate, and secondly to the requirement of dPGMs to stabilize the phosphorylated form of the enzyme and protect it from hydrolysis. The C-terminal tail has long been implicated in stabilizing the phosphoenzyme form of dPGM since the naturally tailfree dPGM from S. pombe and a proteolysed form of S. cerevisiae dPGM lacking seven C-terminal residues, both have lower mutase and higher phosphatase reaction rates when compared with dPGMs having the tails [105]. Interpretation of these data was complicated by the lack of interpretable density for the tail in all S. cerevisiae structures. The position of the tail was only seen on the determination of a structure of E. coli dPGM in its phosphorylated form, strongly linking ordering of the tail and phosphorylation state [89]. The earlier hypothesis that the tail helped to exclude water from the catalytic site, and thereby avoid c The Authors Journal compilation c 2008 Biochemical Society hydrolysis of the phosphoenzyme [4], was rendered unlikely by the presence of ordered water molecules in the catalytic site of the phosphoenzyme structure [89]. Instead, the C-terminal tail appears to provide additional stabilizing interactions with the phosphorylated state, through interactions of tail residues with Asn19 . This residue, and Asn16 , which forms two hydrogen bonds with the phosphohistidine residue, lie on a loop whose conformation varies according to phosphorylation state [106]. Interestingly, PhoE complex structures indicate that, although Cterminal tail structuring similarly accompanies modification of the phosphorylable histidine residue, a different mechanism involving Arg9 and the C-terminal carboxy group applies [92]. For dPGMs, there are no reliable structures of complexes with substrates since there are reasons to doubt the validity of the reported 3PGA complex [107], as discussed in [106]. Nevertheless, bound sulfate and vanadate seen in other structures have offered clues as to likely substrate-binding residues [104,106]. The binding of 2PGA and 3PGA (to phosphorylated enzyme) and 2,3-BPG in two different orientations (as retained intermediate in the mutase reaction, bound to unphosphorylated enzyme) was modelled in E. coli dPGM [106]. The prediction was that backbone amide groups of Thr22 and Gly23 bind the carboxy group in all cases, and that side chains of Arg89 , Tyr91 , Arg115 , Arg116 and Asn185 bind the distal phospho group. For BPGM, a remarkable set of complex structures, after various periods of soaking in ligands, have recently provided a detailed molecular picture of 2,3-BPG binding and transfer of its phospho group to the enzyme [90]. The dPGM predictions are broadly confirmed, suggesting that subtle differences are responsible for the different spectra of activities of dPGM and BPGM. Different explanations have been suggested, frequently involving Gly14 (human BPGM numbering; serine in dPGMs) and Ser24 (glycine in dPGMs) [106,109]. Misleadingly, the first BPGM structure [109] showed structural differences from dPGMs that turned out not to be present in its ligated form [90]. Now that a direct comparison of phosphorylated dPGM and BPGM can be made, it is clear that neither residue obviously influences activity directly. However, a noted restriction of catalytic size volume [109], in part relating to differences in the C-terminal tail region, is still apparent in BPGM compared with dPGM. This could easily impede the reorientation of 2,3-BPG within the catalytic site that lies at the heart of the mutase reaction. Additionally, although the sets of substrate-binding residues (seen in BPGM, modelled in dPGM) are very similar, a structural superposition shows many conformational differences in their side chains that could relate to different affinities for 1,3-BPG or 2,3-BPG between BPGMs and dPGMs. One particularly interesting position is residue 100 (BPGM numbering), occupied by a conserved arginine residue in BPGMs and by a conserved lysine residue in dPGMs. Deciphering subtle differences in ligand affinities between the two families will probably require the determination of a crystal structure of one in complex with 1,3-BPG. In passing, it is intriguing to note that the present superfamily is not the only example of evolutionarily related phosphatases and mutases, other examples being the cofactor-independent phosphoglycerate mutases, one of whose domains is homologous with alkaline phosphatases [110] and β-phosphoglucomutase, closely related to the phosphatase branch of the HAD (haloalkanoic acid dehalogenase) enzyme superfamily [111]. In all cases, transient phosphorylation of the enzyme is involved in catalysis. Substrate binding in phosphatases Information regarding which residues are responsible for substrate binding in other members of the superfamily is not available for Structure and function in the histidine phosphatase superfamily all structures; indeed the substrate(s) of some enzymes of known structure remain unknown [112]. In the case of F26BPase, the crystal structure of the complex with fructose 6-phosphate and phosphate [100], as well as mutagenesis data [113], implicate the following set of residues in substrate binding: Ile267 , Tyr336 , Arg350 , Lys354 , Tyr365 , Gln391 and Arg395 (numbering for rat testis enzyme). For PhoE, no complex is available with natural ligand, but its preferred substrate, α-naphthyl phosphate, can be convincingly docked into its notably hydrophobic active site, suggesting that Met21 , Trp109 , Val153 , Leu86 and Phe108 are all substrate-binding residues [114]. The crystal structure of E. coli SixA alone allowed for the prediction of a tentative model of the complex with substrate ArcB Hpt domain [85]. The model implicated SixA residues Asn26 , Glu30 and Lys156 in substrate binding. In branch 2 of the superfamily, complexes with inhibitors (vanadate, tartrate, tungstate) are again more common than substrate complexes. Nevertheless, substrate complex crystal structures are available for both E. coli enzymes, the phytase and the glucose-1-phosphatase [91]. The principal phytate-binding residues in the former are Thr305 , Arg267 and Lys24 (there are other interactions via water). In glucose-1-phosphatase, Glu196 and Tyr247 (via water) bind glucose-1-phosphate, whereas the specificity of the enzyme for the 3-position of its alternative substrate phytate is defined by the gating residue Leu24 [91]. The mode of interaction of phytate with A. fumigatus phytase can be predicted based on the E. coli phytase complex [115]. In this position, phytate interacts with the catalytic core plus Gln27 , Tyr28 , Lys68 , His189 , Lys278 , Tyr282 and Asn340 . For future efforts in genome annotation in the superfamily, it is important to know which regions of the enzymes are likely to contribute to core catalysis and/or the determination of substrate specificity. As mentioned above, the conserved catalytic core is contributed by residues lying in the regions after strands β1, β2 and β4 (see Figures 3 and 6), whereas proton donors map to the portions following β1, β3 and β4 (Figure 3). These are the basic components for catalysis (although the surprising activity of the H256A F26BPase mutant must be remembered). To these may be added additional ‘phosphate pocket’ interactions, varying between lineages, but, so far, always contributed by residues lying in the β1–β2 loop (Figure 3). The pattern for specificitydetermining residues, binding the remainder of the substrate, is very different. These are to be found scattered throughout the sequence alignment, clustered in the loops following β1, β3 and β4, but also at various points in the large and very diverse insertions between β3 and β4, in the N-terminus of A. fumigatus phytase [93] and at the very C-terminus of SixA [85]. CONCLUSIONS Although large-scale sequencing projects continue to discover novel protein families awaiting function elucidation, function predictions within superfamilies are equally valuable. The histidine phosphatase superfamily, although possessing an everlengthening list of known functions, undoubtedly harbours many novel activities awaiting discovery. Sequence and structure analyses paint a clear picture of evolution of two deep-rooted branches, the first of them, two thirds comprising bacterial proteins, usually intracellular and of widely varied functions. The second branch, predominantly eukaryotic and extracellular, contains some well-known enzymes such as phytase and other members, the APs, of partial or inconclusive functional annotation. Humans contain several proteins from both branches, many with known or potential medical importance. Phosphatases in parasites may also have therapeutic value, whereas others have (potentially) important agricultural applications. 345 After years of sporadic progress, the number of structures of histidine phosphatases is rising rapidly, allowing for a review of the level of our understanding of the structure–function relationship of the superfamily. All histidine phosphatase structures contain a conserved catalytic core, and conserved ancillary residues involved in maintaining the correct orientation of the core, but vary dramatically in their proton donor and in substrate-binding residues. This structural variability of specificity determinants renders harder still function annotation in the superfamily and suggests that structural knowledge will be important, both for reliable annotation of new sequences with known activities and for prediction of novel activities, prior to experimental demonstration. Potentially, both modelling (see, e.g., [20,53,116]) and further crystal structures could be used for structure-based function predictions. In summary, with many families and even structures of unknown function, the histidine phosphatase superfamily illustrates well the problems that have led experimentalists to develop large-scale function screens [117], crystallographers to suggest a structural genomics approach with superfamilies [118] and bioinformaticians to seek novel structure-based function assignment methods (see, e.g. [119]). An investigation of the histidine phosphatase superfamily by any or all of these methods would certainly yield many new insights. REFERENCES 1 Meyerhof, O. and Kiessling, W. (1935) Über die isolierung der isomeren phosphoglycerinsäuren (glycerinsäure-2-phosphorsäure und glycerinsäure-3-phosphorsäure) aus gäransätzen und ihr enzymatisches gleichgewicht. Biochem. Z. 276, 239–253 2 Fothergill, L. A. and Harkins, R. N. (1982) The amino acid sequence of yeast phosphoglycerate mutase. Proc. R. Soc. London Ser B. 215, 19–44 3 Pilkis, S. J., Lively, M. O. and el-Maghrabi, M. R. (1987) Active site sequence of hepatic fructose-2,6-bisphosphatase: homology in primary structure with phosphoglycerate mutase. J. Biol. Chem. 262, 12672–12675 4 Fothergill-Gilmore, L. A. and Watson, H. C. (1989) The phosphoglycerate mutases. Adv. Enzymol. Relat. Areas Mol. Biol. 62, 227–313 5 Friedberg, I. (2006) Automated protein function prediction: the genomic challenge. Brief. Bioinform. 7, 225–242 6 Kowanetz, K., Crosetto, N., Haglund, K., Schmidt, M. H., Heldin, C. H. and Dikic, I. (2004) Suppressors of T-cell receptor signaling sts-1 and sts-2 bind to cbl and inhibit endocytosis of receptor tyrosine kinases. J. Biol. Chem. 279, 32786–32795 7 Mazumder, R., Iyer, L. M., Vasudevan, S. and Aravind, L. (2002) Detection of novel members, structure–function analysis and evolutionary classification of the 2H phosphoesterase superfamily. Nucleic Acids Res. 30, 5229–5243 8 Rebora, K., Laloo, B. and Daignan-Fornier, B. (2005) Revisiting purine-histidine cross-pathway regulation in Saccharomyces cerevisiae : a central role for a small molecule. Genetics 170, 61–70 9 Schneider, G., Lindqvist, Y. and Vihko, P. (1993) Three-dimensional structure of rat acid phosphatase. EMBO J. 12, 2609–2615 10 Bazan, J. F., Fletterick, R. J. and Pilkis, S. J. (1989) Evolution of a bifunctional enzyme: 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase. Proc. Natl. Acad. Sci. U.S.A. 86, 9642–9646 11 Finn, R. D., Mistry, J., Schuster-Bockler, B., Griffiths-Jones, S., Hollich, V., Lassmann, T., Moxon, S., Marshall, M., Khanna, A., Durbin, R. et al. (2006) Pfam: clans, web tools and services. Nucleic Acids Res. 34, D247–D251 12 Tatusov, R. L., Fedorova, N. D., Jackson, J. D., Jacobs, A. R., Kiryutin, B., Koonin, E. V., Krylov, D. M., Mazumder, R., Mekhedov, S. L., Nikolskaya, A. N. et al. (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4, 41 13 Michels, P. A. and Rigden, D. J. (2006) Evolutionary analysis of fructose 2,6-bisphosphate metabolism. IUBMB Life 58, 133–141 14 Rider, M. H., Bertrand, L., Vertommen, D., Michels, P. A., Rousseau, G. G. and Hue, L. (2004) 6-Phosphofructo-2-kinase/fructose-2,6-bisphosphatase: head-to-head with a bifunctional enzyme that controls glycolysis. Biochem. J. 381, 561–579 15 Ogino, T., Matsubara, M., Kato, N., Nakamura, Y. and Mizuno, T. (1998) An Escherichia coli protein that exhibits phosphohistidine phosphatase activity towards the HPt domain of the ArcB sensor involved in the multistep His-Asp phosphorelay. Mol. Microbiol. 27, 573–585 c The Authors Journal compilation c 2008 Biochemical Society 346 D. J. Rigden 16 O’Toole, G. A., Trzebiatowski, J. R. and Escalante-Semerena, J. C. (1994) The cobC gene of Salmonella typhimurium codes for a novel phosphatase involved in the assembly of the nucleotide loop of cobalamin. J. Biol. Chem. 269, 26503–26511 17 Levine, N. D. (1982) Taxonomy and life cycles of coccidia. In The Biology of Coccidia (Long, P. L., ed.), pp. 1–30, University Park Press, Baltimore 18 Huang, F., Haydock, S. F., Mironenko, T., Spiteller, D., Li, Y. and Spencer, J. B. (2005) The neomycin biosynthetic gene cluster of Streptomyces fradiae NCIMB 8233: characterisation of an aminotransferase involved in the formation of 2-deoxystreptamine. Org. Biomol. Chem. 3, 1410–1418 19 Sonobe, H., Ohira, T., Ieki, K., Maeda, S., Ito, Y., Ajimura, M., Mita, K., Matsumoto, H. and Wilder, M. N. (2006) Purification, kinetic characterization, and molecular cloning of a novel enzyme, ecdysteroid 22-kinase. J. Biol. Chem. 281, 29513–29524 20 Davies, L., Anderson, I. P., Turner, P. C., Shirras, A. D., Rees, H. H. and Rigden, D. J. (2007) An unsuspected ecdysteroid/steroid phosphatase activity in the key T-cell regulator, sts-1: surprising relationship to insect ecdysteroid phosphate phosphatase. Proteins 67, 720–731 21 Pradel, E. and Boquet, P. L. (1988) Acid phosphatases of Escherichia coli : molecular cloning and analysis of agp, the structural gene for a periplasmic acid glucose phosphatase. J. Bacteriol. 170, 4916–4923 22 Kennedy, E. J., Pillus, L. and Ghosh, G. (2005) Pho5p and newly identified nucleotide pyrophosphatases/phosphodiesterases regulate extracellular nucleotide phosphate metabolism in Saccharomyces cerevisiae . Eukaryotic Cell 4, 1892–1901 23 Irving, G. C. and Cosgrove, D. J. (1972) Inositol phosphate phosphatases of microbiological origin: the inositol pentaphosphate products of Aspergillus ficuum phytases. J. Bacteriol. 112, 434–438 24 Greiner, R., Carlsson, N. and Alminger, M. L. (2001) Stereospecificity of myo -inositol hexakisphosphate dephosphorylation by a phytate-degrading enzyme of Escherichia coli . J. Biotechnol. 84, 53–62 25 Pilkis, S. J., El-Maghrabi, M. R., Pilkis, J., Claus, T. H. and Cumming, D. A. (1981) Fructose 2,6-bisphosphate: a new activator of phosphofructokinase. J. Biol. Chem. 256, 3171–3174 26 Van Schaftingen, E., Hue, L. and Hers, H.-G. (1980) Control of the fructose 6-phosphate/fructose 1,6-bisphosphate cycle in isolated hepatocytes by glucose and glucagon: role of a low-molecular-weight stimulator of phosphofructokinase. Biochem. J. 192, 887–895 27 Van Schaftingen, E., Hue, L. and Hers, H.-G. (1980) Fructose 2,6-bisphosphate, the probable structure of the glucose- and glucagon-sensitive stimulator of phosphofructokinase. Biochem. J. 192, 897–901 28 Bensaad, K., Tsuruta, A., Selak, M. A., Vidal, M. N., Nakano, K., Bartrons, R., Gottlieb, E. and Vousden, K. H. (2006) TIGAR, a p53-inducible regulator of glycolysis and apoptosis. Cell 126, 107–120 29 Chi, H., Yang, X., Kingsley, P. D., O’Keefe, R. J., Puzas, J. E., Rosier, R. N., Shears, S. B. and Reynolds, P. R. (2000) Targeted deletion of Minpp1 provides new insight into the activity of multiple inositol polyphosphate phosphatase in vivo . Mol. Cell. Biol. 20, 6496–6507 30 Hiroyama, M. and Takenawa, T. (1999) Isolation of a cDNA encoding human lysophosphatidic acid phosphatase that is involved in the regulation of mitochondrial lipid biosynthesis. J. Biol. Chem. 274, 29172–29180 31 Tanaka, M., Kishi, Y., Takanezawa, Y., Kakehi, Y., Aoki, J. and Arai, H. (2004) Prostatic acid phosphatase degrades lysophosphatidic acid in seminal plasma. FEBS Lett. 571, 197–204 32 Matsubara, M. and Mizuno, T. (2000) The SixA phospho-histidine phosphatase modulates the ArcB phosphorelay signal transduction in Escherichia coli . FEBS Lett. 470, 118–124 33 Meng, T. C. and Lin, M. F. (1998) Tyrosine phosphorylation of c-ErbB-2 is regulated by the cellular form of prostatic acid phosphatase in human prostate cancer cells. J. Biol. Chem. 273, 22096–22104 34 Fleisig, H., El-Din El-Husseini, A. and Vincent, S. R. (2004) Regulation of ErbB4 phosphorylation and cleavage by a novel histidine acid phosphatase. Neuroscience 127, 91–100 35 Lin, M. F. and Clinton, G. M. (1988) The epidermal growth factor receptor from prostate cells is dephosphorylated by a prostate-specific phosphotyrosyl phosphatase. Mol. Cell. Biol. 8, 5477–5485 36 Suter, A., Everts, V., Boyde, A., Jones, S. J., Lullmann-Rauch, R., Hartmann, D., Hayman, A. R., Cox, T. M., Evans, M. J., Meister, T. et al. (2001) Overlapping functions of lysosomal acid phosphatase (LAP) and tartrate-resistant acid phosphatase (Acp5) revealed by doubly deficient mice. Development 128, 4899–4910 37 Slavin, I., Saura, A., Carranza, P. G., Touz, M. C., Nores, M. J. and Lujan, H. D. (2002) Dephosphorylation of cyst wall proteins by a secreted lysosomal acid phosphatase is essential for excystation of Giardia lamblia . Mol. Biochem. Parasitol. 122, 95–98 c The Authors Journal compilation c 2008 Biochemical Society 38 Lo, S. C. and Hannink, M. (2006) PGAM5, a bcl-XL -interacting protein, is a novel substrate for the redox-regulated Keap1-dependent ubiquitin ligase complex. J. Biol. Chem. 281, 37893–37903 39 Manaud, N., Arrebola, R., Buffin-Meyer, B., Lefebvre, O., Voss, H., Riva, M., Conesa, C. and Sentenac, A. (1998) A chimeric subunit of yeast transcription factor IIIC forms a subcomplex with τ 95. Mol. Cell. Biol. 18, 3191–3200 40 Carpino, N., Turner, S., Mekala, D., Takahashi, Y., Zang, H., Geiger, T. L., Doherty, P. and Ihle, J. N. (2004) Regulation of ZAP-70 activation and TCR signaling by two related proteins, sts-1 and sts-2. Immunity 20, 37–46 41 Carpino, N., Kobayashi, R., Zang, H., Takahashi, Y., Jou, S. T., Feng, J., Nakajima, H. and Ihle, J. N. (2002) Identification, cDNA cloning, and targeted deletion of p70, a novel, ubiquitously expressed SH3 domain-containing protein. Mol. Cell. Biol. 22, 7491–7500 42 Schweingruber, M. E., Fluri, R., Maundrell, K., Schweingruber, A. M. and Dumermuth, E. (1986) Identification and characterization of thiamin repressible acid phosphatase in yeast. J. Biol. Chem. 261, 15877–15882 43 Pollich, M. and Klug, G. (1995) Identification and sequence analysis of genes involved in late steps in cobalamin (vitamin B12 ) synthesis in Rhodobacter capsulatus . J. Bacteriol. 177, 4481–4487 44 Tsiamis, G., Mansfield, J. W., Hockenhull, R., Jackson, R. W., Sesma, A., Athanassopoulos, E., Bennett, M. A., Stevens, C., Vivian, A., Taylor, J. D. and Murillo, J. (2000) Cultivar-specific avirulence and virulence functions assigned to avrPphF in Pseudomonas syringae pv. phaseolicola , the cause of bean halo-blight disease. EMBO J. 19, 3204–3214 45 Bourgis, F., Botha, F. C., Mani, S., Hiten, F. N., Rigden, D. J. and Verbruggen, N. (2005) Characterization and functional investigation of an Arabidopsis cDNA encoding a homologue to the d-PGMase superfamily. J. Exp. Bot. 56, 1129–1142 46 Lee, Y. H., Li, Y., Uyeda, K. and Hasemann, C. A. (2003) Tissue-specific structure/function differentiation of the liver isoform of 6-phosphofructo-2kinase/fructose-2,6-bisphosphatase. J. Biol. Chem. 278, 523–530 47 Mulugu, S., Bai, W., Fridy, P. C., Bastidas, R. J., Otto, J. C., Dollins, D. E., Haystead, T. A., Ribeiro, A. A. and York, J. D. (2007) A conserved family of enzymes that phosphorylate inositol hexakisphosphate. Science 316, 106–109 48 Davies, L., Anderson, I. P., Turner, P. C., Shirras, A. D., Rees, H. H. and Rigden, D. J. (2007) An unsuspected ecdysteroid/steroid phosphatase activity in the key T-cell regulator, sts-1: surprising relationship to insect ecdysteroid phosphate phosphatase. Proteins 67, 720–731 49 Chevalier, N., Bertrand, L., Rider, M. H., Opperdoes, F. R., Rigden, D. J. and Michels, P. A. (2005) 6-phosphofructo-2-kinase and fructose-2,6-bisphosphatase in trypanosomatidae: molecular characterization, database searches, modelling studies and evolutionary analysis. FEBS J. 272, 3542–3560 50 Bouchez, D. and Tourneur, J. (1991) Organization of the agropine synthesis region of the T-DNA of the ri plasmid from Agrobacterium rhizogenes . Plasmid 25, 27–39 51 Hansen, G., Larribe, M., Vaubert, D., Tempe, J., Biermann, B. J., Montoya, A. L., Chilton, M. D. and Brevet, J. (1991) Agrobacterium rhizogenes pRi8196 T-DNA: mapping and DNA sequence of functions involved in mannopine synthesis and hairy root differentiation. Proc. Natl. Acad. Sci. U.S.A. 88, 7763–7767 52 Palanichelvam, K., Oger, P., Clough, S. J., Cha, C., Bent, A. F. and Farrand, S. K. (2000) A second T-region of the soybean-supervirulent chrysopine-type ti plasmid pTiChry5, and construction of a fully disarmed vir helper plasmid. Mol. Plant Microbe Interact. 13, 1081–1091 53 Rigden, D. J. (2003) Unexpected catalytic site variation in phosphoprotein phosphatase homologues of cofactor-dependent phosphoglycerate mutase. FEBS Lett. 536, 77–84 54 Motshwene, P., Brandt, W. and Lindsey, G. (2003) Significant quantities of the glycolytic enzyme phosphoglycerate mutase are present in the cell wall of yeast Saccharomyces cerevisiae . Biochem. J. 369, 357–362 55 Prill, V., Lehmann, L., von Figura, K. and Peters, C. (1993) The cytoplasmic tail of lysosomal acid phosphatase contains overlapping but distinct signals for basolateral sorting and rapid internalization in polarized MDCK cells. EMBO J. 12, 2181–2193 56 Bairoch, A., Boeckmann, B., Ferro, S. and Gasteiger, E. (2004) Swiss-Prot: juggling between evolution and stability. Brief. Bioinform. 5, 39–55 57 Hadjigeorgiou, G. M., Kawashima, N., Bruno, C., Andreu, A. L., Sue, C. M., Rigden, D. J., Kawashima, A., Shanske, S. and DiMauro, S. (1999) Manifesting heterozygotes in a Japanese family with a novel mutation in the muscle-specific phosphoglycerate mutase (PGAM-M) gene. Neuromuscul. Disord. 9, 399–402 58 Lemarchandel, V., Joulin, V., Valentin, C., Rosa, R., Galacteros, F., Rosa, J. and Cohen-Solal, M. (1992) Compound heterozygosity in a complete erythrocyte bisphosphoglycerate mutase deficiency. Blood 80, 2643–2649 59 Nadler, H. L. and Egan, T. J. (1970) Deficiency of lysosomal acid phosphatase: a new familial metabolic disorder. N. Engl. J. Med. 282, 302–307 60 Robinson, D. B. and Glew, R. H. (1980) Acid phosphatase in Gaucher’s disease. Clin. Chem. 26, 371–382 Structure and function in the histidine phosphatase superfamily 61 Gatenby, R. A. and Gillies, R. J. (2007) Glycolysis in cancer: a potential target for therapy. Int. J. Biochem. Cell Biol. 39, 1358–1366 62 Engel, M., Mazurek, S., Eigenbrodt, E. and Welter, C. (2004) Phosphoglycerate mutase-derived polypeptide inhibits glycolytic flux and induces cell growth arrest in tumor cell lines. J. Biol. Chem. 279, 35803–35812 63 Evans, M. J., Saghatelian, A., Sorensen, E. J. and Cravatt, B. F. (2005) Target discovery in small-molecule cell-based screens by in situ proteome reactivity profiling. Nat. Biotechnol. 23, 1303–1307 64 Evans, M. J., Morris, G. M., Wu, J., Olson, A. J., Sorensen, E. J. and Cravatt, B. F. (2007) Mechanistic and structural requirements for active site labeling of phosphoglycerate mutase by spiroepoxides. Mol. Biosyst. 3, 495–506 65 Oski, F. A., Cannon, E. and Gottlieb, A. J. (1970) Red-cell 2,3-DPG level. N. Engl. J. Med. 283, 1169–1170 66 Poillon, W. N., Kim, B. C., Labotka, R. J., Hicks, C. U. and Kark, J. A. (1995) Antisickling effects of 2,3-diphosphoglycerate depletion. Blood 85, 3289–3296 67 Hamasaki, N. and Yamamoto, M. (2000) Red blood cell function and blood storage. Vox Sang. 79, 191–197 68 Bull, H., Murray, P. G., Thomas, D., Fraser, A. M. and Nelson, P. N. (2002) Acid phosphatases. Mol. Pathol. 55, 65–72 69 Dattoli, M., Wallner, K., True, L., Sorace, R., Koval, J., Cash, J., Acosta, R., Biswas, M., Binder, M., Sullivan, B. et al. (1999) Prognostic role of serum prostatic acid phosphatase for 103 Pd-based radiation for prostatic carcinoma. Int. J. Radiat. Oncol. Biol. Phys. 45, 853–856 70 Fong, L. and Small, E. J. (2006) Immunotherapy for prostate cancer. Curr. Urol. Rep. 7, 239–246 71 Ortlund, E., LaCount, M. W. and Lebioda, L. (2003) Crystal structures of human prostatic acid phosphatase in complex with a phosphate ion and α-benzylaminobenzylphosphonic acid update the mechanistic picture and offer new insights into inhibitor design. Biochemistry 42, 383–389 72 Rico, H. and Villa, L. F. (1993) Serum tartrate-resistant acid phosphatase (TRAP) as a biochemical marker of bone remodeling. Calcif. Tissue Int. 52, 149–150 73 Gimm, O., Chi, H., Dahia, P. L., Perren, A., Hinze, R., Komminoth, P., Dralle, H., Reynolds, P. R. and Eng, C. (2001) Somatic mutation and germline variants of MINPP1, a phosphatase gene located in proximity to PTEN on 10q23.3, in follicular thyroid carcinomas. J. Clin. Endocrinol. Metab. 86, 1801–1805 74 Shakarian, A. M., Joshi, M. B., Yamage, M., Ellis, S. L., Debrabant, A. and Dwyer, D. M. (2003) Members of a unique histidine acid phosphatase family are conserved amongst a group of primitive eukaryotic human pathogens. Mol. Cell. Biochem. 245, 31–41 75 Ota, Y., Tamegai, H., Kudo, F., Kuriki, H., Koike-Takeshita, A., Eguchi, T. and Kakinuma, K. (2000) Butirosin-biosynthetic gene cluster from Bacillus circulans . J. Antibiot. (Tokyo). 53, 1158–1167 76 Maga, J. A. (1982) Phytate: its chemistry, occurrence, food interactions, nutritional significance, and methods of analysis. J. Agric. Food Chem. 30, 1–9 77 Kim, T., Mullaney, E. J., Porres, J. M., Roneker, K. R., Crowe, S., Rice, S., Ko, T., Ullah, A. H., Daly, C. B., Welch, R. and Lei, X. G. (2006) Shifting the pH profile of Aspergillus niger PhyA phytase to match the stomach pH enhances its effectiveness as an animal feed additive. Appl. Environ. Microbiol. 72, 4397–4403 78 Luo, H., Huang, H., Yang, P., Wang, Y., Yuan, T., Wu, N., Yao, B. and Fan, Y. (2007) A novel phytase appA from Citrobacter amalonaticus CGMCC 1696: gene cloning and overexpression in Pichia pastoris . Curr. Microbiol. 55, 185–192 79 Liberator, P., Anderson, J., Feiglin, M., Sardana, M., Griffin, P., Schmatz, D. and Myers, R. W. (1998) Molecular cloning and functional expression of mannitol-1-phosphatase from the apicomplexan parasite Eimeria tenella . J. Biol. Chem. 273, 4237–4244 80 Bernstein, F. C., Koetzle, T. F., Williams, G. J., Meyer, Jr, E. F., Brice, M. D., Rodgers, J. R., Kennard, O., Shimanouchi, T. and Tasumi, M. (1977) The protein data bank: a computer-based archival file for macromolecular structures. J. Mol. Biol. 112, 535–542 81 Rigden, D. J., Alexeev, D., Phillips, S. E. and Fothergill-Gilmore, L. A. (1998) The 2.3 Å X-ray crystal structure of S. cerevisiae phosphoglycerate mutase. J. Mol. Biol. 276, 449–459 82 Lindqvist, Y., Schneider, G. and Vihko, P. (1993) Three-dimensional structure of rat acid phosphatase in complex with L(+)-tartrate. J. Biol. Chem. 268, 20744–20746 83 Konagurthu, A. S., Whisstock, J. C., Stuckey, P. J. and Lesk, A. M. (2006) MUSTANG: a multiple structural alignment algorithm. Proteins 64, 559–574 84 Shatsky, M., Nussinov, R. and Wolfson, H. J. (2006) Optimization of multiple-sequence alignment based on multiple-structure alignment. Proteins 62, 209–217 85 Hamada, K., Kato, M., Shimizu, T., Ihara, K., Mizuno, T. and Hakoshima, T. (2005) Crystal structure of the protein histidine phosphatase SixA in the multistep His-Asp phosphorelay. Genes Cells 10, 1–11 347 86 Watkins, H. A. and Baker, E. N. (2006) Structural and functional analysis of Rv3214 from Mycobacterium tuberculosis , a protein with conflicting functional annotations, leads to its characterization as a phosphatase. J. Bacteriol. 188, 3589–3599 87 Li, L., Ling, S., Wu, C., Yao, W. and Xu, G. (1997) Separate bisphosphatase domain of chicken liver 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase: the role of the C-terminal tail in modulating enzyme activity. Biochem. J. 328, 751–756 88 White, M. F., Fothergill-Gilmore, L. A., Kelly, S. M. and Price, N. C. (1993) Dissociation of the tetrameric phosphoglycerate mutase from yeast by a mutation in the subunit contact region. Biochem. J. 295, 743–748 89 Bond, C. S., White, M. F. and Hunter, W. N. (2001) High resolution structure of the phosphohistidine-activated form of Escherichia coli cofactor-dependent phosphoglycerate mutase. J. Biol. Chem. 276, 3247–3253 90 Wang, Y., Liu, L., Wei, Z., Cheng, Z., Lin, Y. and Gong, W. (2006) Seeing the process of histidine phosphorylation in human bisphosphoglycerate mutase. J. Biol. Chem. 281, 39642–39648 91 Lee, D. C., Cottrill, M. A., Forsberg, C. W. and Jia, Z. (2003) Functional insights revealed by the crystal structures of Escherichia coli glucose-1-phosphatase. J. Biol. Chem. 278, 31412–31418 92 Rigden, D. J., Littlejohn, J. E., Henderson, K. and Jedrzejas, M. J. (2003) Structures of phosphate and trivanadate complexes of Bacillus stearothermophilus phosphatase PhoE: structural and functional analysis in the cofactor-dependent phosphoglycerate mutase superfamily. J. Mol. Biol. 325, 411–420 93 Xiang, T., Liu, Q., Deacon, A. M., Koshy, M., Kriksunov, I. A., Lei, X. G., Hao, Q. and Thiel, D. J. (2004) Crystal structure of a heat-resilient phytase from Aspergillus fumigatus , carrying a phosphorylated histidine. J. Mol. Biol. 339, 437–445 94 Lin, K., Li, L., Correia, J. J. and Pilkis, S. J. (1992) Arg-257 and Arg-307 of 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase bind the C-2 phospho group of fructose-2,6-bisphosphate in the fructose-2,6-bisphosphatase domain. J. Biol. Chem. 267, 19163–19171 95 Tauler, A., Lin, K. and Pilkis, S. J. (1990) Hepatic 6-phosphofructo-2kinase/fructose-2,6-bisphosphatase: use of site-directed mutagenesis to evaluate the roles of His-258 and His-392 in catalysis. J. Biol. Chem. 265, 15617–15622 96 Ostanin, K., Saeed, A. and Van Etten, R. L. (1994) Heterologous expression of human prostatic acid phosphatase and site-directed mutagenesis of the enzyme active site. J. Biol. Chem. 269, 8971–8978 97 Nairn, J., Duncan, D., Price, N. E., Kelly, S. M., Fothergill-Gilmore, L. A., Uhrinova, S., Barlow, P. N., Rigden, D. J. and Price, N. C. (2000) Characterization of active-site mutants of Schizosaccharomyces pombe phosphoglycerate mutase: elucidation of the roles of amino acids involved in substrate binding and catalysis. Eur. J. Biochem. 267, 7065–7074 98 Zhang, X. Q., Lee, M. S., Zelivianski, S. and Lin, M. F. (2001) Characterization of a prostate-specific tyrosine phosphatase by mutagenesis and expression in human prostate cancer cells. J. Biol. Chem. 276, 2544–2550 99 Mizuguchi, H., Cook, P. F., Tai, C. H., Hasemann, C. A. and Uyeda, K. (1999) Reaction mechanism of fructose-2,6-bisphosphatase: a mutation of nucleophilic catalyst, histidine 256, induces an alteration in the reaction pathway. J. Biol. Chem. 274, 2166–2175 100 Yuen, M. H., Mizuguchi, H., Lee, Y. H., Cook, P. F., Uyeda, K. and Hasemann, C. A. (1999) Crystal structure of the H256A mutant of rat testis fructose-6-phosphate,2kinase/fructose-2,6-bisphosphatase: fructose 6-phosphate in the active site leads to mechanisms for both mutant and wild type bisphosphatase activities. J. Biol. Chem. 274, 2176–2184 101 Lin, K., Li, L., Correia, J. J. and Pilkis, S. J. (1992) Glu327 is part of a catalytic triad in rat liver fructose-2,6-bisphosphatase. J. Biol. Chem. 267, 6556–6562 102 Ostanin, K. and Van Etten, R. L. (1993) Asp304 of Escherichia coli acid phosphatase is involved in leaving group protonation. J. Biol. Chem. 268, 20778–20784 103 Lim, D., Golovan, S., Forsberg, C. W. and Jia, Z. (2000) Crystal structures of Escherichia coli phytase and its complex with phytate. Nat. Struct. Biol. 7, 108–113 104 Rigden, D. J., Walter, R. A., Phillips, S. E. and Fothergill-Gilmore, L. A. (1999) Sulphate ions observed in the 2.12 Å structure of a new crystal form of S. cerevisiae phosphoglycerate mutase provide insights into understanding the catalytic mechanism. J. Mol. Biol. 286, 1507–1517 105 Walter, R. A., Nairn, J., Duncan, D., Price, N. C., Kelly, S. M., Rigden, D. J. and Fothergill-Gilmore, L. A. (1999) The role of the C-terminal region in phosphoglycerate mutase. Biochem. J. 337, 89–95 106 Bond, C. S., White, M. F. and Hunter, W. N. (2002) Mechanistic implications for Escherichia coli cofactor-dependent phosphoglycerate mutase based on the high-resolution crystal structure of a vanadate complex. J. Mol. Biol. 316, 1071–1081 107 Crowhurst, G. S., Dalby, A. R., Isupov, M. N., Campbell, J. W. and Littlechild, J. A. (1999) Structure of a phosphoglycerate mutase:3-phosphoglyceric acid complex at 1.7 Å. Acta Crystallogr. Sect. D Biol. Crystallogr. 55, 1822–1826 108 Reference deleted c The Authors Journal compilation c 2008 Biochemical Society 348 D. J. Rigden 109 Wang, Y., Wei, Z., Bian, Q., Cheng, Z., Wan, M., Liu, L. and Gong, W. (2004) Crystal structure of human bisphosphoglycerate mutase. J. Biol. Chem. 279, 39132– 39138 110 Jedrzejas, M. J., Chander, M., Setlow, P. and Krishnasamy, G. (2000) Structure and mechanism of action of a novel phosphoglycerate mutase from Bacillus stearothermophilus . EMBO J. 19, 1419–1431 111 Zhang, G., Dai, J., Wang, L., Dunaway-Mariano, D., Tremblay, L. W. and Allen, K. N. (2005) Catalytic cycling in β-phosphoglucomutase: a kinetic and structural analysis. Biochemistry 44, 9404–9416 112 Wang, Y., Wei, Z., Liu, L., Cheng, Z., Lin, Y., Ji, F. and Gong, W. (2005) Crystal structure of human B-type phosphoglycerate mutase bound with citrate. Biochem. Biophys. Res. Commun. 331, 1207–1215 113 Li, L., Lin, K., Correia, J. J. and Pilkis, S. J. (1992) Lysine 356 is a critical residue for binding the C-6 phospho group of fructose 2,6-bisphosphate to the fructose-2,6bisphosphatase domain of rat liver 6-phosphofructo-2-kinase/fructose-2,6bisphosphatase. J. Biol. Chem. 267, 16669–16675 114 Rigden, D. J., Mello, L. V., Setlow, P. and Jedrzejas, M. J. (2002) Structure and mechanism of action of a cofactor-dependent phosphoglycerate mutase homolog from Bacillus stearothermophilus with broad specificity phosphatase activity. J. Mol. Biol. 315, 1129–1143 115 Liu, Q., Huang, Q., Lei, X. G. and Hao, Q. (2004) Crystallographic snapshots of Aspergillus fumigatus phytase, revealing its enzymatic dynamics. Structure 12, 1575–1583 116 Rigden, D. J., Bagyan, I., Lamani, E., Setlow, P. and Jedrzejas, M. J. (2001) A cofactor-dependent phosphoglycerate mutase homolog from Bacillus stearothermophilus is actually a broad specificity phosphatase. Protein Sci. 10, 1835–1846 117 Bertone, P. and Snyder, M. (2005) Advances in functional protein microarray technology. FEBS J. 272, 5400–5411 118 Friedberg, I. and Godzik, A. (2007) Functional differentiation of proteins: implications for structural genomics. Structure 15, 405–415 119 Hermann, J. C., Marti-Arbona, R., Fedorov, A. A., Fedorov, E., Almo, S. C., Shoichet, B. K. and Raushel, F. M. (2007) Structure-based activity prediction for an enzyme of unknown function. Nature 448, 775–779 120 Haggarty, N. W., Dunbar, B. and Fothergill, L. A. (1983) The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase. EMBO J. 2, 1213–1220 121 Pilkis, S. J., Walderhaug, M., Murray, K., Beth, A., Venkataramu, S. D., Pilkis, J. and El-Maghrabi, M. R. (1983) 6-Phosphofructo-2-kinase/fructose 2,6-bisphosphatase from rat liver. J. Biol. Chem. 258, 6135–6141 122 Ali, N., Craxton, A. and Shears, S. B. (1993) Hepatic Ins(1,3,4,5)P 4 3-phosphatase is compartmentalized inside endoplasmic reticulum. J. Biol. Chem. 268, 6161–6167 123 Lin, M. F. and Clinton, G. M. (1986) Human prostatic acid phosphatase has phosphotyrosyl protein phosphatase activity. Biochem. J. 235, 351–357 124 Rose, Z. B. (1980) The enzymology of 2,3-bisphosphoglycerate. Adv. Enzymol. Relat. Areas Mol. Biol. 51, 211–253 125 El-Maghrabi, M. R., Claus, T. H., Pilkis, J., Fox, E. and Pilkis, S. J. (1982) Regulation of rat liver fructose 2,6-bisphosphatase. J. Biol. Chem. 257, 7603–7607 126 Feshchenko, E. A., Smirnova, E. V., Swaminathan, G., Teckchandani, A. M., Agrawal, R., Band, H., Zhang, X., Annan, R. S., Carr, S. A. and Tsygankov, A. Y. (2004) TULA: an SH3- and UBA-containing protein that binds to c-cbl and ubiquitin. Oncogene 23, 4690–4706 127 Urena, J. M., Egea, G., Grana, X., Castella, J., Marsal, J., Carreras, J. and Climent, F. (1990) Location of phosphoglycerate mutase in rat skeletal muscle: an immunocytochemical and biochemical study. Eur. J. Cell Biol. 51, 151–156 128 Egea, G., Urena, J. M., Grana, X., Marsal, J., Carreras, J. and Climent, F. (1992) Nuclear location of phosphoglycerate mutase BB isozyme in rat tissues. Histochemistry 97, 269–275 129 Huh, W. K., Falvo, J. V., Gerke, L. C., Carroll, A. S., Howson, R. W., Weissman, J. S. and O’Shea, E. K. (2003) Global analysis of protein localization in budding yeast. Nature 425, 686–691 130 Pohlmann, R., Krentler, C., Schmidt, B., Schroder, W., Lorkowski, G., Culley, J., Mersmann, G., Geier, C., Waheed, A. and Gottschalk, S. (1988) Human lysosomal acid phosphatase: cloning, expression and chromosomal assignment. EMBO J. 7, 2343–2350 Received 13 August 2007/18 September 2007; accepted 21 September 2007 Published on the Internet 21 December 2007, doi:10.1042/BJ20071097 c The Authors Journal compilation c 2008 Biochemical Society 131 Wiese, M., Berger, O., Stierhof, Y. D., Wolfram, M., Fuchs, M. and Overath, P. (1996) Gene cloning and cellular localization of a membrane-bound acid phosphatase of Leishmania mexicana . Mol. Biochem. Parasitol. 82, 153–165 132 Gottlieb, M. and Dwyer, D. M. (1981) Protozoan parasite of humans: surface membrane with externally disposed acid phosphatase. Science 212, 939–941 133 Ding, D. Q., Tomita, Y., Yamamoto, A., Chikashige, Y., Haraguchi, T. and Hiraoka, Y. (2000) Large-scale screening of intracellular protein localization in living fission yeast cells by the use of a GFP-fusion genomic DNA library. Genes Cells 5, 169–190 134 Hafkenscheid, J. C. (1968) Properties of an acid phosphatase in Escherichia coli . Biochim. Biophys. Acta 167, 582–589 135 Shnyreva, M. G., Petrova, E. V., Egorov, S. N. and Hinnen, A. (1996) Biochemical properties and excretion behavior of repressible acid phosphatases with altered subunit composition. Microbiol. Res. 151, 291–300 136 Ullah, A. H. and Gibson, D. M. (1987) Extracellular phytase (E.C. 3.1.3.8) from Aspergillus ficuum NRRL 3135: purification and characterization. Prep. Biochem. 17, 63–91 137 Campbell, J. W., Hodgson, G. I. and Watson, H. C. (1972) Low resolution structure of yeast phosphoglycerate mutase. Nat. New Biol. 240, 137–139 138 Campbell, J. W., Watson, H. C. and Hodgson, G. I. (1974) Structure of yeast phosphoglycerate mutase. Nature 250, 301–303 139 Winn, S. I., Watson, H. C., Harkins, R. N. and Fothergill, L. A. (1981) Structure and activity of phosphoglycerate mutase. Philos. Trans. R. Soc. London Ser. B. 293, 121–130 140 Rigden, D. J., Walter, R. A., Phillips, S. E. and Fothergill-Gilmore, L. A. (1999) Polyanionic inhibitors of phosphoglycerate mutase: combined structural and biochemical analysis. J. Mol. Biol. 289, 691–699 141 Uhrinova, S., Uhrin, D., Nairn, J., Price, N. C., Fothergill-Gilmore, L. A. and Barlow, P. N. (2001) Solution structure and dynamics of an open β-sheet, glycolytic enzyme, monomeric 23.7 kDa phosphoglycerate mutase from Schizosaccharomyces pombe . J. Mol. Biol. 306, 275–290 142 Muller, P., Sawaya, M. R., Pashkov, I., Chan, S., Nguyen, C., Wu, Y., Perry, L. J. and Eisenberg, D. (2005) The 1.70 angstroms X-ray crystal structure of Mycobacterium tuberculosis phosphoglycerate mutase. Acta Crystallogr. Sect. D Biol. Crystallogr. 61, 309–315 143 Yuen, M. H., Wang, X. L., Mizuguchi, H., Uyeda, K. and Hasemann, C. A. (1999) A switch in the kinase domain of rat testis 6-phosphofructo-2kinase/fructose-2,6-bisphosphatase. Biochemistry 38, 12333–12342 144 Lee, Y. H., Olson, T. W., Ogata, C. M., Levitt, D. G., Banaszak, L. J. and Lange, A. J. (1997) Crystal structure of a trapped phosphoenzyme during a catalytic reaction. Nat. Struct. Biol. 4, 615–618 145 Lee, Y. H., Ogata, C., Pflugrath, J. W., Levitt, D. G., Sarma, R., Banaszak, L. J. and Pilkis, S. J. (1996) Crystal structure of the rat liver fructose-2,6-bisphosphatase based on selenomethionine multiwavelength anomalous dispersion phases. Biochemistry 35, 6010–6019 146 Hasemann, C. A., Istvan, E. S., Uyeda, K. and Deisenhofer, J. (1996) The crystal structure of the bifunctional enzyme 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase reveals distinct domain homologies. Structure 4, 1017–1029 147 Rigden, D. J. (2006) Understanding the cell in terms of structure and function: insights from structural genomics. Curr. Opin. Biotechnol. 17, 457–464 148 Lindqvist, Y., Schneider, G. and Vihko, P. (1994) Crystal structures of rat acid phosphatase complexed with the transition-state analogs vanadate and molybdate. implications for the reaction mechanism. Eur. J. Biochem. 221, 139–142 149 Jakob, C. G., Lewinski, K., Kuciel, R., Ostrowski, W. and Lebioda, L. (2000) Crystal structure of human prostatic acid phosphatase. Prostate 42, 211–218 150 LaCount, M. W., Handy, G. and Lebioda, L. (1998) Structural origins of L(+)-tartrate inhibition of human prostatic acid phosphatase. J. Biol. Chem. 273, 30406–30409 151 Kostrewa, D., Gruninger-Leitch, F., D’Arcy, A., Broger, C., Mitchell, D. and van Loon, A. P. (1997) Crystal structure of phytase from Aspergillus ficuum at 2.5 Å resolution. Nat. Struct. Biol. 4, 185–190 152 Kostrewa, D., Wyss, M., D’Arcy, A. and van Loon, A. P. (1999) Crystal structure of Aspergillus niger pH 2.5 acid phosphatase at 2.4 Å resolution. J. Mol. Biol. 288, 965–974 153 Clamp, M., Cuff, J., Searle, S. M. and Barton, G. J. (2004) The Jalview Java alignment editor. Bioinformatics 20, 426–427
© Copyright 2026 Paperzz