A Comprehensive Overview of the Vertebrate p24 Family: Identification of a Novel Tissue-Specifically Expressed Member Jeroen R.P.M. Strating,*1 Nick H.M. van Bakel,* Jack A.M. Leunissen, and Gerard J.M. Martens* *Department of Molecular Animal Physiology, Donders Centre for Neuroscience and Nijmegen Centre for Molecular Life Sciences (NCMLS), Faculty of Science, Radboud University, Nijmegen, The Netherlands; and Laboratory of Bioinformatics, Wageningen University, Wageningen, The Netherlands The members of the p24 protein family have an important but unclear role in transport processes in the early secretory pathway. The p24 family consists of four subfamilies (a, b, c, and d), whereby the exact composition of the family varies among species. Despite more than 15 years of p24 research, the vertebrate p24 family is still surprisingly ill characterized. Here, we describe the human, mouse, Xenopus, and zebrafish orthologues of 10 p24 family members and a new member that we term p24c5. Of these eleven p24 family members, nine are conserved throughout the vertebrate lineage, whereas two (p24c4 and p24d2) occur in some but not all vertebrates. We further show that all p24 proteins are widely expressed in mouse, except for p24a1 and p24c5 that display restricted expression patterns. Thus, we present for the first time a comprehensive overview of the phylogeny and expression of the vertebrate p24 protein family. Introduction The secretory pathway comprises a number of distinct membrane-bounded organelles involved in the transport, maturation, and sorting of biologically active proteins destined for the plasma membrane or the extracellular space. Proteins enter the secretory pathway in the endoplasmic reticulum (ER) where they are properly folded. They are then transported to the Golgi from where they are targeted to their final intra- or extracellular destination (Palade 1975). The members of the p24 protein family play an important role in vesicular transport processes at the ER–Golgi interface, although their exact function is still unclear (roles suggested for p24 have been reviewed in Emery et al. 1999; Kaiser 2000; Kuiper and Martens 2000; Carney and Bowen 2004). The p24 proteins constitute a family of type-I transmembrane (TM) proteins of ;24 kDa. The family can be subdivided into four subfamilies termed a, b, c, and d (Dominguez et al. 1998). The p24 proteins share a similar architecture that includes a single TM helix. At the lumenal side, the p24 proteins have an N-terminal Golgi dynamics (GOLD) domain that may play a role in the incorporation of cargo into transport vesicles (Anantharaman and Aravind 2002). In the structure of some family members, this domain is followed by a putative coiled-coil region that may interact with other p24 proteins (Ciufo and Boyd 2000). At the cytoplasmic side, the p24 proteins have a short C-terminal tail of 12–22 amino acids (aas) that contains binding motifs for the vesicle coat complexes COPI and COPII (Fiedler et al. 1996; Sohn et al. 1996; Dominguez et al. 1998; Harter and Wieland 1998; Belden and Barlowe 2001; Béthune et al. 2006). Despite its crucial role in the cell, a comprehensive overview of the vertebrate p24 family is not available and the occurrence of orthologues of the various p24 family members among vertebrate species is surprisingly ill documented, as are their tissue distributions. In contrast, the 1 Present address: Heidelberg University Biochemistry Center (BZH), Im Neuenheimer Feld 328, Heidelberg, Germany. Key words: early secretory pathway, phylogeny, evolutionary tree, tissue distribution, p23, p25, p26, p27, tp24, gp25L, TMED. E-mail: [email protected]. Mol. Biol. Evol. 26(8):1707–1714. 2009 doi:10.1093/molbev/msp099 Advance Access publication May 8, 2009 Ó The Author 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: [email protected] compositions of the p24 families in yeast and fruit fly, and their expression patterns in fly developmental stages and tissues have been extensively studied (Schimmöller et al. 1995; Belden and Barlowe 1996; Marzioch et al. 1999; Carney and Taylor 2003; Bartoszewski et al. 2004; Boltz et al. 2007). In this paper, we describe the composition in vertebrates and evolutionary aspects of the p24 family as well as the tissue distributions of the mouse p24 family members. Materials and Methods Bioinformatic Methods Searches against the nonredundant protein database at the National Center for Biotechnology Information (NCBI) were performed using the BlastP algorithm. Genomic databases at the NCBI (March 2008 releases) were queried using the TBlastN algorithm. Sequences were aligned using ClustalW 1.83 (Thompson et al. 1994) and MUSCLE (Edgar 2004), and edited using JalView 2.3 (Clamp et al. 2004; Waterhouse et al. 2009). The alignments were colored for conservation (threshold 25%) using the ClustalX coloring algorithm (Thompson et al. 1997). A detailed description of this coloring algorithm is available through the ClustalX documentation (available on www.clustal.org). In short, this algorithm assigns colors to aligned positions based on the type of aa residue in the consensus and the conservation of its physicochemical properties. The intensity of the coloring (as set by the threshold) is determined by the degree of conservation of the consensus residue type. Phylogenetic trees were built using PHYLIP 3.67, PAUP 4.0b10, and MrBayes 3.1.1, and plotted with DRAWTREE (PHYLIP package; fig. 1) and SVGTree (Leunissen JAM, unpublished data; supplementary figs. 2 and 3, Supplementary Material online). The phylogenetic trees calculated by the various programs were in general agreement with each other, apart from minor variations in branch lengths and bootstrap values. Signal peptides were predicted using the public SignalP 3.0 server (Bendtsen et al. 2004). Generally, the Neural networks and the Hidden Markov models algorithms yielded the same result. If not, the predicted signal peptide cleavage position was taken that was most consistent with that of the most closely related p24 proteins. TM helices were predicted using the public TMHMM 1708 Strating et al. FIG. 1.—Unrooted tree of the p24 protein family. A multiple alignment of p24 protein sequences was performed (supplementary fig. 1, Supplementary Material online) and an unrooted cladogram tree was generated. The four subfamilies are indicated by gray background shading. Hs, Homo sapiens; Pt, Pan troglodytes; Mm, Mus musculus; Xt, Xenopus tropicalis; Dr, Danio rerio; Tn, Tetraodon nigroviridis; Dm, Drosophila melanogaster; Sc, Saccharomyces cerevisiae; and At, Arabidopsis thaliana. 2.0 server (Krogh et al. 2001). In general, the predicted TM helices within different p24 proteins aligned well. RNA Isolation and Reverse Transcriptase (RT)-PCR Female BALB/c surplus mice (12 months old) were sacrificed by cervical dislocation in accordance with institutional guidelines, and tissues were dissected and snap frozen in liquid nitrogen. Total RNA was extracted from the tissues using Trizol reagent (Invitrogen) according to the manufacturer’s instructions. RNA concentrations were determined spectrophotometrically and checked for integrity of the rRNAs by agarose gel electrophoresis. The Trizol method yielded high-quality RNA from all tissues, except for pancreas. In order to isolate high-quality RNA from pancreas suitable for RT-PCR, we used the RNAlater ice solution (Ambion) but again we were not successful, probably be- cause of the abundant presence of pancreatic RNases. We therefore could not examine the mRNA expression of the mouse p24 proteins in pancreas. The RNA samples were treated with DNaseI (Fermentas) and cDNA was generated using a RevertAid H-minus first strand cDNA synthesis kit (Fermentas) according to the manufacturer’s protocol. Standard PCR reactions were performed using intron-spanning primer pairs (supplementary table 1, Supplementary Material online). A PCR for b-actin was performed as a control for equal input. Results p24 Nomenclature In the past, various nomenclatures for vertebrate p24 proteins have been introduced and are still being used in The Vertebrate p24 Family 1709 Table 1 Nomenclature and Occurrence of the Vertebrate p24 Proteins a b c d Systematic Name Systematic Database Namea Alternative Names Mammals Amphibians Fish p24a1 p24a2 p24a3 p24b1 p24c1 p24c2 p24c3 p24c4 p24c5 p24d1 p24d2 TMED11 TMED9 TMED4 TMED2 TMED1 TMED5 TMED7 TMED3 TMED6 TMED10 gp25L p25, GMP25, gp25L2, p24d, p24D GMP25iso p24, p24a tp24, T1/ST2 (receptor-binding protein)c p28, T1/ST2 isoc (g)p27 p26, p24b þb þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þ þd þ þ þ þe þ p23, tmp21(I), p24c p23iso a In the protein database, most p24 proteins are annotated as ‘‘TMED’’ (TM emp24 protein transport domain containing; emp24 is the first yeast p24 protein identified; Schimmöller et al. 1995). The nonredundant NCBI protein database contains one protein (TMED8) that is erroneously annotated as being related to the p24 proteins, because of its size (;300 aas) and the fact that it lacks a TM domain and the two absolutely conserved cysteine residues (see text). The TMED8 protein contains a GOLD domain in its C-terminal region. A Blast search for TMED8-related proteins retrieved several GOLD domain-containing proteins, indicating that TMED8 is a GOLD domaincontaining protein but not a member of the p24 family. b The human p24a1 gene contains an in-frame stop codon and therefore the p24a1 protein does not appear to occur in humans. However, other primates have an intact p24a1 (see text). c p24c1 is sometimes erroneously termed T1/ST2, instead of the original name T1/ST2 receptor binding protein (Gayle et al. 1996), and because of its close relationship p24c2 is sometimes—also erroneously—named T1/ST2 iso. d Zebrafish contains two p24c1 inparalogues (see text). e The present assembly of the zebrafish genome does not contain a full-length p24c5 sequence, but the Tetraodon nigroviridis genome does (see text). parallel (table 1). Work on mammalian p24 proteins has mostly focused on six family members, usually termed p23, p24, p25, p26, p27, and tp24. A more systematic approach, involving the use of a Greek letter (to identify a subfamily of structurally related p24 proteins) followed by a number (starting with the first-discovered member), has been proposed by Dominguez et al. (1998). Here, we use this systematic nomenclature because it provides direct information as to which subfamily the p24 protein belongs and the subfamily can be expanded relatively easy with a newly identified family member (based on its structure). A careful literature search revealed the occurrence of 10 p24 proteins in vertebrate species, namely, gp25L/p24a1 (Wada et al. 1991), p25/p24a2 (Dominguez et al. 1998), GMP25iso/p24a3 (Rötter et al. 2002), p24/p24b1 (Stamnes et al. 1995), tp24/ p24c1 (Gayle et al. 1996), p28/p24c2 (Rötter et al. 2002), p27/p24c3 (Dominguez et al. 1998), p26/p24c4 (Dominguez et al. 1998), p23/p24d1 (Sohn et al. 1996), and p23iso/p24d2 (Holthuis et al. 1995; Kuiper et al. 2000) (table 1). In invertebrates, yet other nomenclatures exist (e.g., see Marzioch et al. 1999 for an overview of the p24 proteins in yeast and Boltz et al. 2007 for the Drosophila p24 proteins). The Vertebrate p24 Proteins To provide a comprehensive overview of the p24 family in vertebrates, we chose to compare the p24 protein sequences from human and vertebrate model organisms, namely, mouse (Mus musculus), amphibian (Xenopus tropicalis), and zebrafish (Danio rerio). The genome-sequencing projects for these species have either been completed or are in a far-advanced stage, which enables the identification of any novel, unannotated members of the p24 family. We retrieved the known p24 proteins from the protein database using a text-based search and the unannotated family members with a Blast search against the nonredundant protein database or the respective genome and expressed sequence tag (EST) databases. All p24 proteins have orthologues in all vertebrate species (table 1), except for the few cases discussed below. In addition, we identified a novel member of the p24 family that we termed p24c5 (see below). From table 1, it is clear that most vertebrate genomes encode 10 p24 family members. The Human p24a1 Open Reading Frame (ORF) Contains a Premature Stop Codon The gp25L/p24a1 protein, originally found in dog pancreatic microsomes, was the first member of the p24 family to be identified (Wada et al. 1991). Interestingly, we found during our searches that the human sequence for p24a1 contains an in-frame stop codon (codon 111; TAA). This stop codon was found in both the reference and the alternate human genome assembly and in human ESTs for p24a1. A stop codon in the ORF of p24a1 was found only in human, but not in other vertebrates. In most other vertebrates (e.g., orangutan, chimp, macaque, bushbaby, mouse lemur, cow, cat, mouse, and rat) codon 111 is TCA (encoding a serine), whereas in some other species (e.g., horse, dog, opossum, and microbat), it is TCC or TCG (also encoding a serine). The chimpanzee p24a1 ORF is identical to the human p24a1 sequence, except for 3 nt changes, one of which corresponds to the A/C difference in codon 111 (the two other differences are silent). On the basis of the available data, we conclude that humans lack an intact p24a1 protein. Therefore, we decided to include chimpanzee p24a1 in our phylogenetic analysis as a representative of primate p24a1. Zebrafish Has Two p24c1 Copies In the database, we found two zebrafish protein sequences corresponding to p24c1, which we termed p24c1A and p24c1B. The p24c1A protein is encoded by a gene on 1710 Strating et al. chromosome 3, whereas the p24c1B protein is derived from a gene on a 29.90-kb scaffold (Scaffold Zv7_NA1556) that has yet to be assigned to a genomic location. The lengths of the zebrafish p24c1A and p24c1B proteins are identical (226 aa residues), whereby the two protein sequences are 61.5% identical and 77.4% similar (data not shown). A comparative aa sequence analysis with other p24 family members from various vertebrate species clearly grouped both sequences with human, mouse, and Xenopus p24c1 (fig. 1 and supplementary fig. 2, Supplementary Material online). This indicates that the two proteins are derived from two different p24c1 genes rather than from splicing isoforms or different alleles of the same gene. Thus, the two zebrafish p24c1 proteins most likely are inparalogues (Sonnhammer and Koonin 2002) resulting from a genome duplication event early in the teleost lineage (Jaillon et al. 2004). Amphibians Lack a p24c4 Orthologue but Have an Additional Amphibian-Specific p24d2 Despite extensive searches in genome and EST databases of both X. tropicalis and Xenopus laevis, we did not identify a Xenopus orthologue of p24c4. Because all other vertebrates included in our analyses have p24c4 orthologues, Xenopus likely lost this family member during evolution. With respect to the p24d subfamily, it is interesting to note that we previously found two subfamily members in X. laevis (p24d1 and p24d2; Holthuis et al. 1995; Kuiper et al. 2000). Because we did not identify an orthologue of Xenopus p24d2 in any other vertebrate branch, we conclude that p24d2 is an amphibian-specific member of the p24 family. Interestingly, a second expressed p24d gene (Tmp21-II) has been identified in human (Blum et al. 1996), but this gene does not encode a complete p24 protein due to a change in reading frame and a nonsense mutation (Horer et al. 1999). At the protein sequence level, human Tmp21-II grouped with human and mouse p24d1 (data not shown). Therefore, it is not an orthologue of Xenopus p24d2 but rather a nonfunctional inparalogue of p24d1. An orthologue of Tmp21-II occurs in chimp (with a stop codon located slightly further upstream than in the human sequence) but not in mouse or rat, suggesting that the duplication event took place recently in the primate radiation. A Novel Member of the Vertebrate p24 Family: p24c5 During our Blast searches for orthologues of known p24 proteins, we serendipitously found a novel protein (TMED6) that based on sequence similarity appeared to be a member of the p24 family. Because in our phylogenetic analysis this protein belongs to the p24c subfamily, we termed this novel family member p24c5. We next identified p24c5 orthologues in a large number of evolutionarily distant vertebrate species (human, mouse, cow, dog, platypus, opossum, Xenopus, and pufferfish). The zebrafish genome assembly used in our searches (Zv7) contained only two fragments of the p24c5 sequence (located on different scaffolds that both mapped to chromosome 25), but not a fulllength sequence. Because we identified the complete protein sequence of a p24c5 orthologue in another teleost fish, the green-spotted pufferfish Tetraodon nigroviridis, we chose to include this sequence as a representative of fish p24c5 in our phylogenetic analyses. Phylogenetic Relationships between the p24 Proteins After having retrieved the orthologues of all known p24 proteins and the novel family member p24c5, we used all mouse p24 protein sequences to query the mouse genome for any other novel family member, but we did not identify any additional p24 sequences. In addition to the protein sequences of human, mouse, Xenopus, and zebrafish, we chose to include in our phylogenetic analyses the model invertebrate Drosophila melanogaster (fruit fly), the model yeast Saccharomyces cerevisiae (budding yeast), and the model plant Arabidopsis thaliana. Fruit fly and yeast have representatives of all four p24 subfamilies (Marzioch et al. 1999; Boltz et al. 2007), whereas plants contain only members of the p24b and p24d subfamilies (Carney and Bowen 2004). Alignment of the various p24 protein sequences showed that overall these proteins display a low degree of aa sequence identity (supplementary fig. 1, Supplementary Material online). In all p24 proteins, only the two cysteine residues of the GOLD domain are fully conserved (Anantharaman and Aravind 2002). A number of other aa residues show a high degree of sequence identity or similarity, in particular in the TM region, for example, a glutamine residue that has been shown to be involved in the trafficking of p24 proteins (Fiedler and Rothman 1997). Next, we predicted the positions of the TM helices in all p24 proteins included in our analysis. In general, the predicted TM helices could be easily aligned (positions 250– 272 in supplementary fig. 1, Supplementary Material online; TM length of 23 residues). Of particular interest is that the predicted TM domains in all p24a proteins are three aa residues shorter (positions 250–269; a length of 20 residues) than those of the p24b, -c and -d proteins. The cytoplasmic tails of most p24 proteins contain an FF-motif involved in COPII binding (Dominguez et al. 1998) or an alternative motif consisting of other aromatic or large hydrophobic residues (supplementary fig. 1, Supplementary Material online). In addition, the majority of the p24 proteins has a doublet of basic aas located two positions C-terminal of the FF, together yielding the FFXXBBXn motif (n 2; B indicates any basic residue) involved in COPI binding (Béthune et al. 2006). Within each of the four vertebrate p24 subfamilies, the degree of aa sequence identity between the various members of that particular subfamily is high (supplementary fig. 3, Supplementary Material online). Furthermore, with the exception of the c-subfamily, all vertebrate p24 subfamilies have extremely wellconserved cytoplasmic tails that provide ‘‘signatures’’ to identify a particular subfamily. Figure 1 (an unrooted cladogram tree) and supplementary figure 2, Supplementary Material online (a bootstrapped midpoint-rooted tree) show the relationships between the various p24 proteins and subfamilies. The phylogenetic relationships among the orthologuous p24 proteins correspond to the evolutionary relationships of the species from which they are derived, with the exception of the p24d branch. In the phylogenetic trees, zebrafish p24d1 appears The Vertebrate p24 Family 1711 more closely related to mouse and human p24d1 than Xenopus p24d1 (fig. 1 and supplementary figs. 2 and 3B, Supplementary Material online). However, the bootstrap values for the branching of Xenopus and zebrafish p24d1 are relatively low (supplementary figs. 2 and 3B, Supplementary Material online), indicating that the p24d1 proteins are too closely related to reliably calculate their phylogenetic descent. Alternatively, zebrafish p24d1 has undergone a number of reverting mutations that cause it to resemble the mammalian p24d1 proteins more than Xenopus p24d1. It appears that the p24a and p24d subfamilies share a common ancestor, as do the p24b and p24c subfamilies. The four p24 subfamilies branch off very early and are present in yeast and all animals. We therefore conclude that the common ancestor to fungi and animals likely contained members of all p24 subfamilies. Interestingly, although the plant p24 proteins group in only two subfamilies (p24b and p24d), plants contain descendants of both the p24a/d and the p24b/c ancestors. Assuming that the root of the tree (fig. 1 and supplementary fig. 2, Supplementary Material online) lies in between the a/d and the b/c stems implicates that the a/d and b/c ancestors were already present in the common ancestor to plants and animals. This again implies that a loss of two genes in plants is more likely than the gain of two genes in animals/fungi. Furthermore, subfamily expansion appears to have taken place relatively recently in evolution and independently in vertebrates and other species (fig. 1 and supplementary fig. 2, Supplementary Material online). For example, the yeast p24a subfamily forms a separate group from the vertebrate a subfamily, although both seem to have a single common p24a ancestor. An even clearer example may be provided by the plant p24d subfamily that seems to have greatly expanded independently from the fungi/animals from a single p24d ancestor. Interestingly, all vertebrate p24 orthologues are evolutionarily well conserved. In particular, the p24b1 protein is extremely well conserved (fig. 1, supplementary figs. 1—3, Supplementary Material online), for example, the p24b1 proteins from the distantly related species man and zebrafish show 94.5% aa sequence identity (excluding the signal peptide; supplementary fig. 3A, Supplementary Material online). In other taxa (e.g., plants, fungi, invertebrates), the p24b1 orthologues are also well conserved (Carney and Bowen 2004). Moreover, the database queries revealed that the various members of each of the vertebrate p24 subfamilies do not only display a high degree of sequence conservation, but also that their genes have similar intron–exon architectures that correlate well with evolutionary relationships. The genes encoding members of the related p24a and p24d subfamilies each consist of five exons, all genes of the related p24b subfamily and the p24c1/2/5 branch are composed of four exons, whereas the p24c3/4 genes have only three exons (supplementary table 2, Supplementary Material online). FIG. 2.—Tissue distributions of p24 family members. The presence of transcripts of the p24 family members in mouse tissues was investigated by RT-PCR. Using two independent primer sets (supplementary table 1, Supplementary Material online), p24a1 was not detected in any of the tissues examined here. The arrow indicates the position where the PCR product for p24a1 is expected; the asterisk indicates the position of the primers used (see also in some of the negative control lanes). b-Actin served as an input control. proteins or a limited number of tissues (Denzel et al. 2000; Kuiper et al. 2000; Rötter et al. 2002; Hosaka et al. 2007). Therefore, we decided to investigate by RT-PCR the mRNA expression patterns of all p24 family members in the mouse. We found that all but two p24 proteins were widely expressed. Using two different, independent primer sets, no expression of p24a1 mRNA was found in the 14 tissues examined. For p24c5 mRNA, strong expression was observed in lung, liver, kidney, small intestine, and colon, and weak expression in spleen, whereas no expression was found in the other tissues examined (fig. 2). Because no p24a1 mRNA expression was found in any of the tissues examined, we searched the database for p24a1 ESTs. Most p24a1 ESTs were from pancreas (mouse, cow, pig, chicken, and Xenopus) and a few from liver (dog), placenta (cow), or embryonic stages (Xenopus), suggesting that the expression of p24a1 is indeed very limited. Discussion Tissue Distributions of the p24 Family Members in the Mouse Surprisingly little is known concerning the tissue distributions of the vertebrate p24 proteins. Previous expression studies have examined only a limited number of p24 In this study, we have investigated the composition, phylogenetic relationship, and expression patterns of the vertebrate p24 family. For most p24 family members, multiple names are used in parallel, which hampers communication in the field. Although presently the p23–p27 nomenclature is 1712 Strating et al. mostly used, it does not offer the flexibility that is needed to properly describe the full set of p24 proteins that we present in this paper. Therefore, we suggest employing the systematic nomenclature proposed by Dominguez et al. (1998) and used in this paper, or at least to provide both names (e.g., p23/p24d1). In general, the vertebrate species studied here contain orthologues of all vertebrate p24 family members. Our phylogenetic analyses of the complete sets of p24 proteins from man, mouse, zebrafish, Xenopus, fruit fly, yeast, and Arabidopsis showed that the expansion of the p24 subfamilies has taken place rather recently in evolution, because it has occurred independently in plants, fungi, and animals, and partly even independently in vertebrates and invertebrates. Thus, nonvertebrate (plant, fungus, and invertebrate) p24 proteins are not orthologuous to specific vertebrate family members. Although only a few residues have been conserved throughout the entire p24 family, all p24 proteins have a similar overall structural organization. Interestingly, in silico studies predict that the TM domains of the p24a proteins are 3 aa residues shorter than those of the p24b, -c, and -d subfamily members. The consistency in the positions of the predicted p24a TM domains suggests that the prediction is correct. In this connection, it is of interest to note that p24a proteins localize primarily to the ER, whereas the other p24s are mainly present in Golgi membranes (Wada et al. 1991; Sohn et al. 1996; Rojo et al. 1997; Dominguez et al. 1998; Blum et al. 1999; Füllekrug et al. 1999; Gommel et al. 1999; Emery et al. 2000). Furthermore, the thickness of secretory pathway membranes increases gradually from the ER to the plasma membrane, possibly due to an increasing cholesterol content (Bretscher and Munro 1993). Because a TM protein may localize correctly by residing in membranes of which the thickness is compatible with the length of its TM domain (the lipid bilayer thickness model, for review see, Munro 1998), the relatively short TM domains of the p24a proteins may accommodate their localization in the ER. However, p24a subfamily members cycle through the Golgi, and when their retrieval motif is mutated, they can even reach the late secretory pathway where they form cholesterol-rich membrane microdomains (Dominguez et al. 1998; Emery et al. 2003). Hence, the TM domain is surely not the only determinant for the ER localization of p24a proteins. Interestingly, in most p24a subfamily members, the TM domain is directly followed by two basic and one hydrophobic residue. Possibly, the positive charges can be masked thus allowing an extension of the p24a TM helix and the presenceofp24a inthickermembranes.Ifandhowthecapabilityof p25/p24a2 to form cholesterol-rich microdomains relates to the properties of its TM domain (length, composition) remains to be established. We furthermore showed that, with the exception of p24a1 and p24c5, the mouse p24 proteins are widely expressed, in line with the wide expression of the majority of p24s in flies (Boltz et al. 2007). We furthermore found that each tissue examined expressed a representative of each subfamily and even multiple members of the same subfamily. The broad coexpression of p24 subfamily members in vertebrate organs has not been noticed before and may point to a functional nonredundancy, even among the closest subfamily members. Knowledge of the tissue distributions of the p24 proteins is also relevant for the de- sign of functional p24 studies in knockout mice. Because a homozygous knockout of the widely expressed p23/ p24d1 was lethal in an early embryonic stage (Denzel et al. 2000), the generation of knockouts of less broadly expressed p24 proteins (such as of p24a1 or p24c5) should be considered because this approach will increase the chances that functional p24 studies are feasible. The restricted expression patterns of p24a1 and p24c5 are of special interest because it may well reflect specialized functions for these p24 proteins in a restricted number of tissues with ER-toGolgi transport of a specific set of cargo proteins. Collectively, we present the known vertebrate p24 proteins (all in the systematic nomenclature proposed by Dominguez et al. 1998), reveal the evolutionary conservation of the entire p24 family in line with previous reports on smaller subsets of the p24 family (Stamnes et al. 1995; Fiedler and Rothman 1997; Dominguez et al. 1998; Emery et al. 1999; Marzioch et al. 1999; Kuiper and Martens 2000; Kuiper et al. 2000; Rötter et al. 2002; Carney and Bowen 2004), show the tissue distributions of all 10 mouse p24 family members, and report the discovery of a novel family member that displays a restricted expression pattern. Together, we provide for the first time a comprehensive overview of the vertebrate p24 family. As such, our findings will help in the design and interpretation of future studies aimed at elucidating p24 functioning. Supplementary Material Supplementary figures 1–3 and supplementary tables 1 and 2 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/). Acknowledgments This work was supported by grant 811.38.002 from the Netherlands Organisation for Scientific Research—Earth and Life Sciences (NWO-ALW). We would like to thank the members of the Martens laboratory for helpful discussions and encouragements, and Frank Oerlemans, Eric Jansen, and Jessica van Schijndel for help with the dissection of mouse tissues. Literature Cited Anantharaman V, Aravind L. 2002. The GOLD domain, a novel protein module involved in Golgi function and secretion. Genome Biol. 3(5):research0023.1–research0023.7. Bartoszewski S, Luschnig S, Desjeux I, Grosshans J, NussleinVolhard C. 2004. Drosophila p24 homologues eclair and baiser are necessary for the activity of the maternally expressed Tkv receptor during early embryogenesis. Mech Dev. 121(10):1259–1273. Belden WJ, Barlowe C. 1996. Erv25p, a component of COPIIcoated vesicles, forms a complex with Emp24p that is required for efficient endoplasmic reticulum to Golgi transport. J Biol Chem. 271(43):26939–26946. Belden WJ, Barlowe C. 2001. Distinct roles for the cytoplasmic tail sequences of Emp24p and Erv25p in transport between the endoplasmic reticulum and Golgi complex. J Biol Chem. 276(46):43040–43048. The Vertebrate p24 Family 1713 Bendtsen JD, Nielsen H, von Heijne G, Brunak S. 2004. Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 340(4):783–795. Béthune J, Kol M, Hoffmann J, Reckmann I, Brügger B, Wieland F. 2006. Coatomer, the coat protein of COPI transport vesicles, discriminates endoplasmic reticulum residents from p24 proteins. Mol Cell Biol. 26(21):8011–8021. Blum R, Feick P, Puype M, Vandekerckhove J, Klengel R, Nastainczyk W, Schulz I. 1996. Tmp21 and p24A, two type I proteins enriched in pancreatic microsomal membranes, are members of a protein family involved in vesicular trafficking. J Biol Chem. 271(29):17183–17189. Blum R, Pfeiffer F, Feick P, Nastainczyk W, Kohler B, Schafer KH, Schulz I. 1999. Intracellular localization and in vivo trafficking of p24A and p23. J Cell Sci. 112(Pt 4):537–548. Boltz KA, Ellis LL, Carney GE. 2007. Drosophila melanogaster p24 genes have developmental, tissue-specific, and sex-specific expression patterns and functions. Dev Dyn. 236(2):544–555. Bretscher MS, Munro S. 1993. Cholesterol and the Golgi apparatus. Science. 261(5126):1280–1281. Carney GE, Bowen NJ. 2004. p24 proteins, intracellular trafficking, and behavior: Drosophila melanogaster provides insights and opportunities. Biol Cell. 96(4):271–278. Carney GE, Taylor BJ. 2003. Logjam encodes a predicted EMP24/GP25 protein that is required for Drosophila oviposition behavior. Genetics. 164(1):173–186. Ciufo LF, Boyd A. 2000. Identification of a lumenal sequence specifying the assembly of Emp24p into p24 complexes in the yeast secretory pathway. J Biol Chem. 275(12):8382–8388. Clamp M, Cuff J, Searle SM, Barton GJ. 2004. The Jalview Java alignment editor. Bioinformatics. 20(3):426–427. Denzel A, Otto F, Girod A, Pepperkok R, Watson R, Rosewell I, Bergeron JJ, Solari RC, Owen MJ. 2000. The p24 family member p23 is required for early embryonic development. Curr Biol. 10(1):55–58. Dominguez M, Dejgaard K, Füllekrug J, Dahan S, Fazel A, Paccaud JP, Thomas DY, Bergeron JJ, Nilsson T. 1998. gp25L/emp24/p24 protein family members of the cis-Golgi network bind both COP I and II coatomer. J Cell Biol. 140(4):751–765. Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32(5):1792–1797. Emery G, Gruenberg J, Rojo M. 1999. The p24 family of transmembrane proteins at the interface between endoplasmic reticulum and Golgi apparatus. Protoplasma. 207(1):24–30. Emery G, Parton RG, Rojo M, Gruenberg J. 2003. The transmembrane protein p25 forms highly specialized domains that regulate membrane composition and dynamics. J Cell Sci. 116(Pt 23):4821–4832. Emery G, Rojo M, Gruenberg J. 2000. Coupled transport of p24 family members. J Cell Sci. 113(Pt 13):2507–2516. Fiedler K, Rothman JE. 1997. Sorting determinants in the transmembrane domain of p24 proteins. J Biol Chem. 272(40):24739–24742. Fiedler K, Veit M, Stamnes MA, Rothman JE. 1996. Bimodal interaction of coatomer with the p24 family of putative cargo receptors. Science. 273(5280):1396–1399. Füllekrug J, Suganuma T, Tang BL, Hong W, Storrie B, Nilsson T. 1999. Localization and recycling of gp27 (hp24gamma3): complex formation with other p24 family members. Mol Biol Cell. 10(6):1939–1955. Gayle MA, Slack JL, Bonnert TP, Renshaw BR, Sonoda G, Taguchi T, Testa JR, Dower SK, Sims JE. 1996. Cloning of a putative ligand for the T1/ST2 receptor. J Biol Chem. 271(10):5784–5789. Gommel D, Orci L, Emig EM, Hannah MJ, Ravazzola M, Nickel W, Helms JB, Wieland FT, Sohn K. 1999. p24 and p23, the major transmembrane proteins of COPI-coated transport vesicles, form hetero-oligomeric complexes and cycle between the organelles of the early secretory pathway. FEBS Lett. 447(2–3):179–185. Harter C, Wieland FT. 1998. A single binding site for dilysine retrieval motifs and p23 within the gamma subunit of coatomer. Proc Natl Acad Sci USA. 95(20):11649–11654. Holthuis JC, Jansen EJ, van Riel MC, Martens GJ. 1995. Molecular probing of the secretory pathway in peptide hormone-producing cells. J Cell Sci. 108(Pt 10):3295–3305. Holthuis JC, van Riel MC, Martens GJ. 1995. Translocon-associated protein TRAP delta and a novel TRAP-like protein are coordinately expressed with pro-opiomelanocortin in Xenopus intermediate pituitary. Biochem J. 312(Pt 1):205–213. Horer J, Blum R, Feick P, Nastainczyk W, Schulz I. 1999. A comparative study of rat and human Tmp21 (p23) reveals the pseudogene-like features of human Tmp21-II. DNA Seq. 10(2):121–126. Hosaka M, Watanabe T, Yamauchi Y, Sakai Y, Suda M, Mizutani S, Takeuchi T, Isobe T, Izumi T. 2007. A subset of p23 localized on secretory granules in pancreatic b-cells. J Histochem Cytochem. 55(3):235–245. Jaillon O, Aury J-M, Brunet F, et al. 2004. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature. 431(7011):946–957 (61 co-authors). Kaiser C. 2000. Thinking about p24 proteins and how transport vesicles select their cargo. Proc Natl Acad Sci USA. 97(8):3783–3785. Krogh A, Larsson B, von Heijne G, Sonnhammer EL. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 305(3):567–580. Kuiper RP, Martens GJ. 2000. Prohormone transport through the secretory pathway of neuroendocrine cells. Biochem Cell Biol. 78(3):289–298. Kuiper RP, Waterham HR, Rötter J, Bouw G, Martens GJ. 2000. Differential induction of two p24delta putative cargo receptors upon activation of a prohormone-producing cell. Mol Biol Cell. 11(1):131–140. Marzioch M, Henthorn DC, Herrmann JM, Wilson R, Thomas DY, Bergeron JJ, Solari RC, Rowley A. 1999. Erp1p and Erp2p, partners for Emp24p and Erv25p in a yeast p24 complex. Mol Biol Cell. 10(6):1923–1938. Munro S. 1998. Localization of proteins to the Golgi apparatus. Trends Cell Biol. 8(1):11–15. Palade G. 1975. Intracellular aspects of the process of protein synthesis. Science. 189(4200):347–358. Rojo M, Pepperkok R, Emery G, Kellner R, Stang E, Parton RG, Gruenberg J. 1997. Involvement of the transmembrane protein p23 in biosynthetic protein transport. J Cell Biol. 139(5):1119–1135. Rötter J, Kuiper RP, Bouw G, Martens GJ. 2002. Cell-type-specific and selectively induced expression of members of the p24 family of putative cargo receptors. J Cell Sci. 115(Pt 5):1049–1058. Schimmöller F, Singer-Krüger B, Schröder S, Krüger U, Barlowe C, Riezman H. 1995. The absence of Emp24p, a component of ER-derived COPII-coated vesicles, causes a defect in transport of selected proteins to the Golgi. EMBO J. 14(7):1329–1339. Sohn K, Orci L, Ravazzola M, Amherdt M, Bremser M, Lottspeich F, Fiedler K, Helms JB, Wieland FT. 1996. A major transmembrane protein of Golgi-derived COPI-coated vesicles involved in coatomer binding. J Cell Biol. 135(5):1239–1248. 1714 Strating et al. Sonnhammer ELL, Koonin EV. 2002. Orthology, paralogy and proposed classification for paralog subtypes. Trends Genet. 18(12):619–620. Stamnes MA, Craighead MW, Hoe MH, Lampen N, Geromanos S, Tempst P, Rothman JE. 1995. An integral membrane component of coatomer-coated transport vesicles defines a family of proteins involved in budding. Proc Natl Acad Sci USA. 92(17):8011–8015. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. 1997. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25(24):4876–4882. Thompson JD, Higgins DG, Gibson TJ. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22(22):4673–4680. Wada I, Rindress D, Cameron P, Ou W, Doherty J 2nd, Louvard D, Bell A, Dignard D, Thomas D, Bergeron J. 1991. SSR alpha and associated calnexin are major calcium binding proteins of the endoplasmic reticulum membrane. J Biol Chem. 266(29):19599–19610. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. 2009. Jalview Version 2-a multiple sequence alignment editor and analysis workbench. Bioinformatics. doi:10.1093/bioinformatics/btp033 Claudia Kappen, Associate Editor Accepted April 9, 2009
© Copyright 2026 Paperzz