J O U RN A L OF P ROTE O M IC S 7 9 ( 2 01 3 ) 1 1 4 –12 2 Available online at www.sciencedirect.com www.elsevier.com/locate/jprot In-depth proteomic analysis of the human sperm reveals complex protein compositions Gaigai Wang1 , Yueshuai Guo1 , Tao Zhou1 , Xiaodan Shi1 , Jun Yu, Ye Yang, Yibo Wu, Jing Wang, Mingxi Liu, Xin Chen, Wenjiao Tu, Yan Zeng, Min Jiang, Suying Li, Pan Zhang, Quan Zhou, Bo Zheng, Chunmei Yu, Zuomin Zhou, Xuejiang Guo⁎, Jiahao Sha⁎ State Key Laboratory of Reproductive Medicine, Department of Histology and Embryology, Nanjing Medical University, Nanjing 210029, China AR TIC LE I N FO ABS TR ACT Article history: The male gamete (sperm) can fertilize an egg, and pass the male genetic information to the Received 2 October 2012 offspring. It has long been thought that sperm had a simple protein composition. Efforts Accepted 7 December 2012 have been made to identify the sperm proteome in different species, and only about 1000 Available online 23 December 2012 proteins were reported. However, with advanced mass spectrometry and an optimized proteomics platform, we successfully identified 4675 human sperm proteins, of which Keywords: 227 were testis-specific. This large number of identified proteins indicates the complex Sperm composition and function of human sperm. Comparison with the sperm transcriptome Proteome reveals little overlap, which shows the importance of future studies of sperm at the protein Mass spectrometry level. Interestingly, many signaling pathways, such as the IL-6, insulin and TGF-beta Signaling pathway receptor signaling pathways, were found to be overrepresented. In addition, we found that Drug 500 proteins were annotated as targets of known drugs. Three of four drugs studied were Human found to affect sperm movement. This in-depth human sperm proteome will be a rich resource for further studies of sperm function, and will provide candidate targets for the development of male contraceptive drugs. Crown Copyright © 2012 Published by Elsevier B.V. All rights reserved. 1. Introduction Approximately one in six couples experience difficulty in conceiving a child. Male infertility accounts for about half the cases in which assisted reproductive techniques are recommended [1]. Over 85% of infertile male can actually produce sperm [2]; however, for some reason, those sperm are often unable to fertilize an egg. As the only cell performing its function outside the male human body, sperm is a highly specialized cell with distinct morphological and compositional differences compared with other somatic and germ cells [3]. It was long believed that the function of sperm was only to deliver the paternal genomes to the egg. However, recent studies have shown that sperm can deliver a complex set of RNAs to the egg [4]. In addition, the entire cell, including the midpiece and tail, enters the egg in most species [5]. Current studies have suggested that sperm defects can disrupt embryo development, even if the genome carried by the cells is perfectly normal [6]. Thus, characterization of the protein composition of sperm can help better understand sperm function. The recently proposed Chromosome-Centric Human Proteome Project (C-HPP) aims to define the full set of proteins encoded in each chromosome. The initial goal of the C-HPP is to identify at least one representative protein encoded by each of the approximately 20,300 human genes [7]. The genes and ⁎ Corresponding authors. Tel.: +86 25 86862038; fax: +86 25 86862908. E-mail addresses: [email protected] (X. Guo), [email protected] (J. Sha). 1 These authors contributed equally to the work. 1874-3919/$ – see front matter. Crown Copyright © 2012 Published by Elsevier B.V. All rights reserved. http://dx.doi.org/10.1016/j.jprot.2012.12.008 J O U RN A L OF P ROT EO M IC S 7 9 ( 2 01 3 ) 1 1 4 –1 22 proteins had tissue-dependent expression. According to previous analyses, the human testis, the male gonad producing sperm, contains the largest number of tissue-specific genes across the 31 human tissues [8]. Thus, in order to characterize all the proteins, proteins in the testis and/or testicular cells should be carefully studied. The in-depth proteomic analysis of human sperm will produce data important for C-HPP. To date, some efforts have been made to identify the human sperm proteome, and in fact, a list of 1056 proteins has been reported [9]. Additionally, Johnston et al. [10] claimed identification of 1760 proteins in human sperm, but this protein list is not available. Efforts to identify the sperm proteome in other species, including drosophila and mammals, such as rat and mouse [11,12], have also been made, and the numbers of proteins identified were all around or below one thousand. In the present study, using the advanced mass spectrometry and an optimized proteomics platform, we successfully identified 4675 unique proteins from human sperm, which showed the complex composition of human sperm. 115 Mobile phase A = 95:5 H2O:ACN, 5 mM ammonium formate buffer pH= 2.7, mobile phase B = mobile phase A + 800 mM ammonium formate, pH= 2.7. The gradient used was 0–56% B for 20 min, 56% to 100% B for 1 min, 100% B for 5 min, 100% to 0% B for 1 min, and 0% B for 20 min before the next run. In each series of experiments, 100 μl fractions were collected every 2 min, and 20 fractions were obtained. The experiments were repeated for 3 times. 2.3. Mass spectrometric analysis and database search This human study was ratified by the Ethics Committee of Nanjing Medical University, and was in accordance with National and International guidelines. Before initiating the study, consent was obtained from all participants. The sperm subjected to proteomics analysis were from 32 healthy male volunteers with a mean age of 30 ± 4 years old (mean ± standard deviation). These men had proven fertility and normal semen quality, as assessed by World Health Organization criteria (1999). The semen samples were obtained by masturbation after at least 3 days of abstinence. The samples were ejaculated into sterile containers and allowed to liquefy for at least 30 min before being processed by centrifugation in a 60% Percoll gradient (GE Healthcare, Waukesha, WI, USA) to remove seminal plasma, immature germ cells, and non-sperm cells (mainly epithelial cells), as described by Loredana-Gandini et al. [13,14]. The purified sperms were then washed in PBS three times before subsequent proteomics analysis. For purity evaluation, the sperm were resuspended in PBS and stained with Hoechst H33342 (Sigma, St Louis, MO) for 30 s; 1000 cells were counted by light microscopy. For capillary reverse-phase liquid chromatography (LC) and mass spectrometric analysis, each fraction was directly loaded onto a μ-precolumn™ cartridge (0.3× 5 mm, 5 μm, 100 Å; Dionex) at a flow rate of 20 μl/min. The trap column effluent was then transferred to a reverse-phase microcapillary column (0.075 × 150 mm, Acclaim® PepMap100 C18 column, 3 μm, 100 Å; Dionex). The reverse-phase separation of peptides was performed using the following buffers: 2% ACN, 0.5% acetic acid (buffer A) and 80% ACN, 0.5% acetic acid (buffer B); a 122 or 82-min ACN gradient (4% to 7% buffer B for 3 min, 7% to 33% buffer B for 102 min or 62 min, 33% to 50% buffer B for 10 min, 50% to 100% buffer B for 3 min, 100% buffer B for 3 min, 100% to 4% buffer B for 1 min) was used. Peptide analysis was performed using a LTQ Orbitrap Velos (ThermoFisher Scientific, San Jose, CA) coupled directly to an LC column. An MS survey scan was obtained for the m/z range 350–1800, and MS/MS spectra were acquired from the survey scan for the 20 most intense ions (as determined by Xcalibur mass spectrometer software in real time). Dynamic mass exclusion windows of 60 s were used, and siloxane (m/z 445.120025) was used as an internal standard. RAW files for LC–MS/MS identification were processed by MaxQuant (v1.2.2.5), and identified with the Andromeda search engine according to standard workflow [15]. The peak lists were searched against the UniProtKB human proteome sequence database (2012/04/18), which contains 86,770 entries. Carbamidomethylation of cysteine (+57 Da) was set as a fixed modification, and oxidized methionine (+ 16 Da) was set as a variable modification. The initial mass tolerances for protein identification on MS and MS/MS peaks were 20 ppm and 0.5 Da, respectively. Two missed cleavages were permitted, and full cleavage by trypsin was used. The false discovery rate (FDR) of the identified peptides and proteins was estimated by searching against the database with the reversed amino acid sequence [15]. Only peptides that were a minimum of six amino acids in length and had a FDR of 1% were considered for identification. 2.2. 2.4. 2. Materials and methods 2.1. Sperm collection Sample preparation for mass spectrometry Human sperm were dissolved in 7 M urea, 2 M thiourea, 65 mM DTT, and 1% (v/v) protease inhibitor cocktail, and the extracted proteins from different men were mixed for subsequent proteomic studies. Proteins of 240 μg were reduced, alkylated and sequentially digested with modified trypsin (sequencing grade, Promega, Madison, WI). These in-solution digests were loaded onto a strong-cation exchange column (1 mm ID× 10 cm, packed with Poros 10S, Dionex, Sunnyvale, CA) for fractionation. A linear salt gradient ammonium formate in 5% acetonitrile (ACN) was applied at a flow-rate of 50 μl/min. Human sperm proteome annotation For bioinformatics analysis, the international protein index (IPI) accession number was converted to an Entrez Gene ID or Ensembl Gene ID. All Ensembl Gene IDs were loaded onto the Database for Annotation, Visualization and Integrated Discovery (DAVID) [16] to identify the enriched biological themes, including Gene Ontology. A FDR of less than 0.05 was considered statistically enriched. The Entrez Gene IDs were loaded onto a Web-based Gene Set Analysis Toolkit (http:// bioinfo.vanderbilt.edu/webgestalt/) [17] to identify the hyper-represented WikiPathways; a FDR of less than 0.05 116 J O U RN A L OF P ROTE O M IC S 7 9 ( 2 01 3 ) 1 1 4 –12 2 was considered hyper-represented. Human cilia proteins were from the Ciliaproteome Database V3 (http://www. ciliaproteome.org/) [18]. The testis-specific genes used were from those assembled by Dezső et al. based on mRNA expression across 31 human tissues [8]. Drosophila othologs of human genes were batch downloaded from Ensembl 65 using BioMart (http://www.ensembl.org/biomart/martview/). Known drug targets in the identified human sperm proteome was annotated using the DrugBank database (http://www. drugbank.ca) [19], which is a database that contains more than 1500 non-redundant proteins or drug target sequences. In addition, DrugBank is a richly annotated resource that combines detailed drug data with comprehensive drug target and drug action information. 2.5. Indirect immunofluorescence Sperm were washed three times by centrifugation for 5 min at 300 ×g, resuspended in PBS, and air-dried onto polyLysine-coated coverslips. The sperm cells were fixed in 4% formaldehyde/phosphate buffered saline (PBS) for 30 min, washed three times with PBS for 5 min, and then blocked with horse serum (Beijing ZhongShan Biotechnology Co., Beijing, China) for 2 h at room temperature. Following incubation with primary antibodies overnight at 4 °C, the cells were incubated with secondary antibody labeled with fluorescein isothiocyanate (FITC; Beijing ZhongShan Biotechnology Co.) at a dilution of 1:200 for 1 h at room temperature. Negative controls were incubated without the primary antibodies, but otherwise the same. The primary antibodies used are provided in Supplementary Data 1. 2.6. Evaluation of the effects of drugs on human sperm motility in vitro The drugs studied were disulfiram (Antabuse) (Selleck Chemicals LLC, Houston, TX), propofol (Sigma), leflunomide (Selleck Chemicals LLC) and sorafenib (Nexavar) (Selleck Chemicals LLC). Sperm were incubated in a capacitating medium with 10 mg/ml BSA [20], with or without drugs at gradient concentrations. During the experiments, the drugs to be tested with sperm were dissolved in distilled water or dimethylsulphoxide (DMSO) (Sigma). The concentration of DMSO in the incubation media never exceeded 1% (v/v), a condition that does not affect sperm capacitation or acrosome reaction [21]. After incubation for 0, 15, 30, 60 or 120 min at 37 °C in a 5% CO2 incubator, the sperm motility was detected using Computer-Aided Sperm Analysis (IVOS, Hamilton Thorne Biosciences, Beverly, MA). 3. Results 3.1. Identification of total proteins expressed in human sperm We used a 60% percoll gradient to purify sperm from human semen, as this gradient has been verified to be able to remove seminal plasma, non-sperm cells and even immature germ cells [13]. We counted 1000 purified sperm with nuclear staining, and none were contaminated with non-sperm or immature germ cells (see Supplementary Fig. 1). From the purified sperm, we successfully identified 30,903 unique peptides, that corresponded to 4675 unique proteins, using an advanced LTQ Orbitrap Velos mass spectrometer. Since such a large amount of data can be subject to high false positives, we controlled the FDR to 1% at both the peptide and protein levels using a reversed sequence database by MaxQuant [15]. Of the 4675 proteins, 4401 proteins (94%) were identified in two independent experiments (Fig. 1A). Their identification information is shown in Supplementary Data 2 and 3, and the detailed single peptide-based identification data including annotated mass spectra are presented in Supplementary Data 4 and 5. The human sperm proteome has been studied using different approaches, including two-dimensional gel electrophoresis (2D-PAGE) separation [3] and liquid chromatography separation [9]. Additionally, the human sperm nucleus has been purified and profiled [22]. Comparison with these published proteomes showed that 93% (206/221) of the proteins identified by two independent studies were successfully identified in our proteome (Fig. 1B), and 3777 proteins were newly identified in our proteome. For example, we identified 21 different phosphodiesterases in human sperm, which have been refractory to previous studies. Whereas, in all three published studies, only Baker et al. [9] identified one phosphodiesterase. Phospholipase C, zeta 1 (PLCZ1), phosphodiesterase 1A (PDE1A) and phospholipase C, delta 4 (PLCD4), which are well-studied phosphodiesterase that are important for sperm function [23–25], were only identified in our proteome. The Ciliary proteome database annotates 2688 human cilia proteins, of which more than half (1510 proteins) were identified in human sperm (Fig. 1C; Supplementary Data 2). Such a large number of cilia proteins in sperm suggest that the human sperm tail may have a similar mechanism to the cilia in somatic cells. Thus, it may be possible to use the knowledge of cilia structure and function to study the sperm tail. 3.2. Immunofluorescence studies To verify the identified human sperm proteome, 29 proteins with commercial antibodies were randomly selected for immunofluorescence studies. The results showed diverse localizations in sperm (Fig. 2). Our immunolocalization revealed proteins from different parts of sperm, including the acrosome, equatorial region, post-equatorial region, neck, midpiece, principal piece, and end piece, which confirmed the validity of our proteome identification, and also showed that our approach can indeed perform in-depth profiling of the entire sperm proteome. In the human sperm proteome, there are proteins identified only by one unique peptide, which is of relatively lower confidence. Ras homolog family member A (RHOA), inner membrane protein, mitochondrial (IMMT), and histone deacetylase 1 (HDAC1) are such identified proteins. Our immunofluorescence studies confirmed their expression in human sperm, and also showed that our proteome was reliable. 3.3. Over-represented pathways in the human sperm proteome Assigning proteins to the WikiPathways revealed a considerable number of overrepresented pathways, including those involved J O U RN A L OF P ROT EO M IC S 7 9 ( 2 01 3 ) 1 1 4 –1 22 117 Fig. 1 – Comparison with the published proteomes or transcriptome. (A). Overlap of proteins between three replicates. (B). The previously published sperm proteome, as determined by 2D-PAGE and LC–MS/MS, and sperm nucleus proteome were compared with our proteome. The proteins were converted to Ensembl Genes, and the overlap among the protein lists is shown in the Venn diagram. The indicated numbers are the numbers of Ensembl Genes. (C). The overlap between our sperm proteome with the cilia proteins from the Ciliaproteome Database V3 (http://www.ciliaproteome.org/). (D). The overlap of genes between our sperm proteome and the published sperm transcriptome [1]. in energy metabolism, signal transduction, cytoskeleton, and so on (Supplementary Data 6). the existence of so many different signaling pathways in sperm. 3.3.1. 3.3.3. Energy metabolism After maturation in the epididymis, sperm acquire motility which consumes a lot of energy. The proteins in many different energy metabolism pathways, including glycolysis and gluconeogenesis, tricarboxylic acid cycle (TCA) cycle (Supplementary Fig. 2A), oxidative phosphorylation and electron transport chain, and glycogen metabolism pathways, were found to be overrepresented. High coverage of these pathways was achieved. For example, 27 of the total 32 proteins in TCA cycle were identified in the human sperm proteome (Supplementary Fig. 2A; Supplementary Data 6). 3.3.2. Signal transduction Calcium is an important messenger for signal transduction during fertilization, and its influx can be induced by progesterone [26]. We observed over-representation of a pathway of calcium regulation in the cardiac cell. During capacitation, MAPK was activated. Suppression of MAPK inhibits sperm capacitation, making them refractory to the progesteroneactivated acrosome reaction [27]. In the human sperm proteome, several MAPKs were identified, including MAPK14, MAPK1, MAPK3, MAPK9 and MAPK13. In addition, the MAPK signaling pathway was found to be overrepresented (Supplementary Fig. 2B). The proteins identified in this pathway will help elucidate the signal cascade of MAPK that is regulated during capacitation. We found that many other signaling pathways were overrepresented in sperm as well, such as the TNF-alpha/NF-κB, EGFR1, insulin, TGF-beta, IL-6, IL-2, and IL-5 signaling pathways (Supplementary Data 6). It was interesting to discover Cytoskeleton The pathway of regulation of actin cytoskeleton was also found to be over-represented in sperm (Supplementary Fig. 2C). Remodeling of the actin-cytoskeleton occurs during sperm capacitation and acrosome reaction [28]. Our previous studies have shown that proteins in the pathway of regulation of actin cytoskeleton, such as RhoA and RhoGDI, play important functions in sperm capacitation [20]. 3.4. Testis-specific genes Based on the mRNA expression results from 31 human tissues [8], 227 proteins corresponding to 223 Entrez genes were found to be testis-specific (Supplementary Data 2), and were significantly enriched in sperm proteome compared with genome-wide distribution (p value =2.2E−16, fold enrichment=2.1 by Fisher's exact test). Gene ontology analysis annotated only 78 genes with biological process terms, of which 41 (53%) were annotated to sexual reproduction (Supplementary Data 7). The remaining 147 testis-specific genes (65%) had no functional annotation (Supplementary Data 7). Therefore, it is clear that the function of these testis-specific proteins is largely unknown, although they are supposedly important for fertility in men and thus deserve further investigation. 3.5. Comparison with human sperm mRNA Complex mRNA profiles have been identified in human sperm [1]. The overlap between sperm mRNAs and proteins is not well known. With our in-depth human sperm proteome, we 118 J O U RN A L OF P ROTE O M IC S 7 9 ( 2 01 3 ) 1 1 4 –12 2 compared the sperm proteome profile and sperm RNA profile. Of the 1915 genes with RNA detected in human sperm, only 553 (29%) have corresponding identified proteins (Fig. 1D). The low percentage of identified proteins in human sperm indicates that the RNA in sperm may not only be residual RNA for the translated sperm proteins, but may also function in sperm capacitation or fertilization. Recently, scientists found translation by mitochondrial ribosome during sperm capacitation [29], and after entering the oocyte, the RNA may be translated in the zygote. 3.6. Comparison with drosophila sperm proteome Dorus et al. characterized drosophila sperm proteome, and identified 381 proteins [30]. Recently, further proteomic analysis of drosophila sperm identified 766 additional sperm proteins, thereby expanding the sperm proteome to 1108 proteins [31]. Because analysis of spermatogenesis and the involvement of paternal products during and after fertilization in drosophila is directly relevant to analogous processes in mammals. With the available in-depth human sperm proteome, it is possible to perform comparative proteomic analyses of sperm between human and drosophila. This will allow the establishment of parallel genetic models for the study of spermatogenesis and sperm function, and will allow the identification of sperm factors responsible for male infertility in humans. Of the 744 genes encoding drosophila sperm proteome with orthologs in human, 441 (59%) have orthologs in our human sperm proteome (see Supplementary Data 8), and these genes can be studied in advance to better understand human fertility. For example, oxen (FBgn0011227) was found to be involved in spermatid development in drosophila [32]. It will be interesting to study the function of its ortholog UQCR10 in human spermatogenesis. 3.7. Drug target annotation DrugBank has been widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction and drug interaction prediction. We Fig. 2 – Immunofluorenscence studies of the identified human sperm proteins.The proteins from our human sperm proteome with commercial antibodies were randomly selected for immunofluorenscence studies. The nucleus was stained with Hoechst (blue). Differential localization (green) of the proteins in the sperm was observed (green). (A) acrosome: GNA11; (B) equatorial region: DDX4; (C) equatorial region and neck: alpha/beta SNAP; (D) post-equatorial region: SUGT1; (E–O) neck: α Tubulin, RBBP7, DCTN1, Flotillin 2, HSPA5, KIF3A, M6PRBP1, MAP2K1, VAPA, Calnexin and PRPS2; (P–V) midpiece: ACY1, IMMT, UCHL1, PRPF19, SOD2, YKT6 and VDAC1; (W–X) principal piece: beta actin, ENO2; (Y) neck and principal piece: HDAC1; (Z) neck, midpiece and principal piece: ENO1; (AA) end piece: RBBP4; and (AB–AC) whole spermatozoon: β Tubulin and RhoA. The negative control is shown in (AD). The fluorescent images were overlaid on the DIC images. J O U RN A L OF P ROT EO M IC S 7 9 ( 2 01 3 ) 1 1 4 –1 22 119 motility after adding drug to the sperm cell culture. Disulfiram (DrugBank: DB00822) and propofol (DrugBank: DB00818) inhibited sperm motility (Fig. 4A–B), but leflunomide (DrugBank: DB01097) had no effect on sperm motility (Fig. 4C). Sperm motility was generally unaffected with sorafenib (DrugBank: DB00398), except after a 60 min incubation at a 100 μM concentration (Fig. 4D). 4. Fig. 3 – Subcellular distribution of sperm proteins targeted by known drugs in DrugBank. Proteins targeted by drugs according to the DrugBank database were analyzed, and the subcellular distribution of sperm proteins was annotated using gene ontology. annotated the human sperm proteome using DrugBank data [19]. We found that 500 human sperm proteins could be targeted by known drugs (Supplementary Data 9). Of these proteins, 150 proteins were mitochondrial proteins according to gene ontology annotation, which is significantly enriched (Fig. 3; Supplementary Data 9 and 10). 162 proteins were annotated as cilia proteins according to Ciliary proteome database (Supplementary Data 9). The mitochondrial sheath provides energy for sperm motility, and the sperm tail is similar to the cilia. Inhibition of these proteins is expected to suppress sperm motility and can thus be future methods for contraception. 3.8. Sperm motility assessment after drug treatment Four drugs with sperm protein targets annotated by DrugBank were used for functional assessment. Because cilia proteins may play important roles in sperm motility, we evaluated the sperm Discussion It has been estimated that sperm contains only 400 to 1300+ proteins [33], and until now, the published sperm proteome contained the number of proteins within this estimated range [9,11,12,30,31]. Thus, sperm was assumed to be relatively “simple” in protein composition [9]. However, using advanced mass spectrometry and a proteomics platform, we identified 4675 proteins from human sperm, which is about 4-fold greater than the previously estimated number. The large number of identified sperm proteins demonstrates the unexpected complexity of the human sperm protein composition. In the human sperm proteome, we identified 223 testisspecific genes, which comprise 46% of the total 484 reported testis-specific genes by Dezső et al. [8]. Sperm has been known to use specific proteins to regulate its function. For example, although glycolysis is highly conserved, this central metabolic pathway is modified in germ cells using specifically-expressed enzymes, such as testis-specific glyceraldehyde-3-phosphate dehydrogenase (GAPDHS), lactate dehydrogenase C (LDHC) and phosphoglycerate kinase type 2 (PGK2) [34]. We successfully identified unique peptides of three germ cell-specific enzymes, which are different from the somatic counterparts of the enzymes, lactate glyceraldehyde-3-phosphate dehydrogenase (GAPDH), lactate dehydrogenase (LDH) and phosphoglycerate kinase type 1 (PGK1). With the identification of 233 testis-specific genes with unique peptides, this will help further our understanding of sperm function, and also contribute to the initial goal of C-HPP, which is to characterize at least one representative protein encoded by each of the approximately 20,300 human genes [7]. Fig. 4 – The effect of drugs on sperm movement. Purified human sperm were incubated under capacitating conditions for 0, 15, 30, 60 or 120 min, and motility was measured in the presence of disulfirum (A), propofol (B), leflunomide (C), or sorafenib (D). The standard deviation is shown as bars. Statistical differences by Student's t-test compared with control are annotated as “*” for p < 0.05 or “**” for p < 0.01. 120 J O U RN A L OF P ROTE O M IC S 7 9 ( 2 01 3 ) 1 1 4 –12 2 With such a large-scale human sperm proteome, it is possible to analyze functional pathways in sperm. In such pathway analyses, we found enrichment of a series of energy metabolic pathways including glycolysis and gluconeogenesis pathway, which supplies the energy required for the movement and fertilization of spermatozoa [35]. In addition, we also found hyperrepresentation of many signaling pathways. The IL-6 signaling pathway is one such pathway. Previous studies have shown that IL-6 can enhance the fertilizing capacity of human sperm by increasing capacitation and the acrosome reaction [36]; however, the detailed pathway and mechanism by which IL-6 does this are still not known. We identified ERK1/2 and p38, which are annotated to downstream of the IL-6 signaling pathway. ERK1/2 and p38 MAPK are primarily localized to the tail of mature human spermatozoa, and are both involved in the acrosome reaction [37]. In addition to the IL-6 signaling pathway, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-9, insulin, EGFR1, TGF beta, and TNF-alpha/NF-κB signaling pathways were all hyperrepresented. Some of these pathways have already been studied and verified to function in sperm. For example, TNF alpha can affect human sperm function by elevating nitric oxide production, and regulate sperm motility [38,39]. Insulin can enhance human sperm motility, acrosome reaction and nitric oxide production [40]. Activation of the EGFR at the end of capacitation is enhanced [Ca2+], leaded to F-actin breakdown and facilitating the acrosome reaction [41]. TGF beta 1 can regulate protein changes in human sperm [42]. Thus, sperm can be regulated by and respond to various external signals through signaling pathways. In seminal plasma, different factors have been detected, including IL-6, IL-8, VEGF, TNFalpha, IL-1beta and TGFbeta1 [43]. When sperm leave the male body to meet the egg, it goes through the cervical canal and fallopian tube. IL-6, IL-5 and IL-2 have been found in cervical mucus [44], and TGF-beta was also found to exist in the human fallopian tube [45]. These factors may regulate sperm function, and help completion of the fertilization process. The availability of the human sperm proteome will not only help in the study of sperm function, but will also help in the development of contraceptive drugs. Annotation of drug targets showed that 500 human sperm proteins are known targets. The list of sperm proteins can be a rich resource for the development contraceptive drugs. For example, the identified Angiotensin-converting enzyme (ACE) can be inhibited by 13 different drugs, including enalapril and quinapril, according to the DrugBank database annotation. ACE has been shown to be important for male fertility in mice. The fertility of homozygous male mutants of ACE was greatly reduced [46]. Although available inhibitors can only inhibit its function in sperm weakly, if the drug can be optimized to inhibit its GPIase function in sperm, it may serve as a promising contraceptive drug [47]. Of the annotated drug targeted proteins, 154 are mitochondrial proteins and 162 are cilia proteins. Mitochondria provide energy for sperm motility; cilia are structurally similar to sperm flagella, which provide force for motility. The functional assessment of 4 drugs targeting sperm proteins revealed that three could influence the motility of human sperm. Disulfiram targets the mitochondrial protein, aldehyde dehydrogenase 2 family (mitochondrial) (ALDH2), and may affect mitochondrial function and affect sperm motility. Propofol targets fatty acid amide hydrolase (FAAH), a cilia protein, and is also expected to affect sperm motility. Sorafenib targets RAF1. RAF1 is located in the acrosome and flagella of sperm, and is an important protein in the Ras/Raf/MAPK pathway, which reportedly regulates sperm motility [37,48]. Leflunomide targets DHODH. DHODH is annotated as a mitochondrial protein, but its well known function is to regulate transcriptional elongation [49]. Mature sperms are known to be transcriptionally dormant [1]; therefore, it is reasonable that DHODH did not affect sperm motility. Thus, these drug target data can be used to evaluate the reproductive toxicity of drugs in use, and these proteins may be candidate targets for the development of spermicides and treatments for male infertility. 5. Conclusions The results of the present study reveal that the proteomic composition of human sperm is more complex than we previously thought. In addition, the sperm proteome characterization in this study is currently the largest sperm proteome thus far. Annotation of the sperm proteome revealed many signaling pathways and targets of known drugs, and 227 of the identified sperm proteins were testis-specific. Our human sperm proteome will help further our understanding of sperm function, and will provide candidate targets for male contraceptives. Additionally, the identification of these sperm proteins, especially the testis-specific ones, will contribute to the initial goal of C-HPP. Acknowledgments This study was supported by grants from the 973 program (2011CB944304, 2009CB941703), and the Chinese Natural Science Funds (81222006, 31000637, 31271245). And it was sponsored by Qing Lan Project. Appendix A. Supplementary data Supplementary data to this article can be found online: Supplementary Fig. 1 is purified human sperm stained by Hoechst. Supplementary Fig. 2 is hyperrepresented pathways in our human sperm proteome. Supplementary Data 1 is a table that shows the primary antibodies used in immunofluorescence studies. Supplementary Data 2 is a table that shows the proteins identified in the human sperm proteome with a FDR of 1%. Supplementary Data 3 is all the peptides Identified with FPR of 1%. Supplementary Data 4 is a table that shows the single peptide-based protein identifications. Supplementary Data 5 corresponds to MS/MS spectra and fragment assignments of the single peptide-based identifications. Supplementary Data 6 is a table that shows the WikiPathway annotation of the human sperm proteome. Supplementary Data 7 contains tables that show the gene ontology annotation of testis-specific proteins in the human sperm proteome. Supplementary Data 8 is a table of the orthologous relationship between genes from Drosophila sperm proteome and J O U RN A L OF P ROT EO M IC S 7 9 ( 2 01 3 ) 1 1 4 –1 22 human sperm proteome. Supplementary Data 9 is a table of proteins targeted by drugs according to DrugBank database in the human sperm proteome. Supplementary Data 10 is a table of gene ontology annotation of drug-targeted proteins in the human sperm proteome. Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.jprot.2012.12.008. REFERENCES [1] Ostermeier GC, Dix DJ, Miller D, Khatri P, Krawetz SA. Spermatozoal RNA profiles of normal fertile men. Lancet 2002;360:772–7. [2] Evaluation of the azoospermic male. Fertil Steril 2008;90: S74–7. [3] Martinez-Heredia J, Estanyol JM, Ballesca JL, Oliva R. Proteomic identification of human sperm proteins. Proteomics 2006;6:4356–69. [4] Ostermeier GC, Miller D, Huntriss JD, Diamond MP, Krawetz SA. Reproductive biology: delivering spermatozoan RNA to the oocyte. Nature 2004;429:154. [5] Ankel-Simons F, Cummins JM. Misconceptions about mitochondria and mammalian fertilization: implications for theories on human evolution. Proc Natl Acad Sci U S A 1996;93:13859–63. [6] Loppin B, Lepetit D, Dorus S, Couble P, Karr TL. Origin and neofunctionalization of a Drosophila paternal effect gene essential for zygote viability. Curr Biol 2005;15:87–93. [7] Paik YK, Jeong SK, Omenn GS, Uhlen M, Hanash S, Cho SY, et al. The Chromosome-Centric Human Proteome Project for cataloging proteins encoded in the genome. Nat Biotechnol 2012;30:221–3. [8] Dezso Z, Nikolsky Y, Sviridov E, Shi W, Serebriyskaya T, Dosymbekov D, et al. A comprehensive functional analysis of tissue specificity of human gene expression. BMC Biol 2008;6: 49. [9] Baker MA, Reeves G, Hetherington L, Muller J, Baur I, Aitken RJ. Identification of gene products present in Triton X-100 soluble and insoluble fractions of human spermatozoa lysates using LC–MS/MS analysis. Proteomics Clin Appl 2007;1:524–32. [10] Johnston DS, Wooters J, Kopf GS, Qiu Y, Roberts KP. Analysis of the human sperm proteome. Ann N Y Acad Sci 2005;1061: 190–202. [11] Baker MA, Hetherington L, Reeves GM, Aitken RJ. The mouse sperm proteome characterized via IPG strip prefractionation and LC–MS/MS identification. Proteomics 2008;8:1720–30. [12] Baker MA, Hetherington L, Reeves G, Muller J, Aitken RJ. The rat sperm proteome characterized via IPG strip prefractionation and LC–MS/MS identification. Proteomics 2008;8:2312–21. [13] Gandini L, Lenzi A, Lombardo F, Pacifici R, Dondero F. Immature germ cell separation using a modified discontinuous Percoll gradient technique in human semen. Hum Reprod 1999;14:1022–7. [14] Zhao C, Huo R, Wang FQ, Lin M, Zhou ZM, Sha JH. Identification of several proteins involved in regulation of sperm motility by proteomic analysis. Fertil Steril 2007;87: 436–8. [15] Cox J, Neuhauser N, Michalski A, Scheltema RA, Olsen JV, Mann M. Andromeda: a peptide search engine integrated into the MaxQuant environment. J Proteome Res 2011;10:1794–805. [16] Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009;4:44–57. 121 [17] Zhang B, Kirov S, Snoddy J. WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucleic Acids Res 2005;33:W741–8. [18] Gherman A, Davis EE, Katsanis N. The ciliary proteome database: an integrated community resource for the genetic and functional dissection of cilia. Nat Genet 2006;38:961–2. [19] Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, et al. DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 2008;36:D901–6. [20] Wang L, Chen W, Zhao C, Huo R, Guo XJ, Lin M, et al. The role of ezrin-associated protein network in human sperm capacitation. Asian J Androl 2010;12:667–76. [21] de Lamirande E, Gagnon C. The extracellular signal-regulated kinase (ERK) pathway is involved in human sperm function and modulated by the superoxide anion. Mol Hum Reprod 2002;8:124–35. [22] de Mateo S, Castillo J, Estanyol JM, Ballesca JL, Oliva R. Proteomic characterization of the human sperm nucleus. Proteomics 2011;11:2714–26. [23] Yoon SY, Jellerette T, Salicioni AM, Lee HC, Yoo MS, Coward K, et al. Human sperm devoid of PLC, zeta 1 fail to induce Ca(2 +) release and are unable to initiate the first step of embryo development. J Clin Invest 2008;118:3671–81. [24] Lefievre L, de Lamirande E, Gagnon C. Presence of cyclic nucleotide phosphodiesterases PDE1A, existing as a stable complex with calmodulin, and PDE3A in human spermatozoa. Biol Reprod 2002;67:423–30. [25] Fukami K, Nakao K, Inoue T, Kataoka Y, Kurokawa M, Fissore RA, et al. Requirement of phospholipase Cdelta4 for the zona pellucida-induced acrosome reaction. Science 2001;292: 920–3. [26] Strunker T, Goodwin N, Brenker C, Kashikar ND, Weyand I, Seifert R, et al. The CatSper channel mediates progesterone-induced Ca2+ influx in human sperm. Nature 2011;471:382–6. [27] Luconi M, Barni T, Vannelli GB, Krausz C, Marra F, Benedetti PA, et al. Extracellular signal-regulated kinases modulate capacitation of human spermatozoa. Biol Reprod 1998;58: 1476–89. [28] Brener E, Rubinstein S, Cohen G, Shternall K, Rivlin J, Breitbart H. Remodeling of the actin cytoskeleton during mammalian sperm capacitation and acrosome reaction. Biol Reprod 2003;68:837–45. [29] Gur Y, Breitbart H. Mammalian sperm translate nuclear-encoded proteins by mitochondrial-type ribosomes. Genes Dev 2006;20: 411–6. [30] Dorus S, Busby SA, Gerike U, Shabanowitz J, Hunt DF, Karr TL. Genomic and functional evolution of the Drosophila melanogaster sperm proteome. Nat Genet 2006;38:1440–5. [31] Wasbrough ER, Dorus S, Hester S, Howard-Murkin J, Lilley K, Wilkin E, et al. The Drosophila melanogaster sperm proteome-II (DmSP-II). J Proteomics 2010;73:2171–85. [32] Castrillon DH, Gonczy P, Alexander S, Rawson R, Eberhart CG, Viswanathan S, et al. Toward a molecular genetic analysis of spermatogenesis in Drosophila melanogaster: characterization of male-sterile mutants generated by single P element mutagenesis. Genetics 1993;135:489–505. [33] Karr TL. Fruit flies and the sperm proteome. Hum Mol Genet 2007;16 (Spec No. 2:R124-33). [34] Goldberg E, Eddy EM, Duan C, Odet F. LDHC: the ultimate testis-specific gene. J Androl 2010;31:86–94. [35] Kuravsky ML, Aleshin VV, Frishman D, Muronetz VI. Testis-specific glyceraldehyde-3-phosphate dehydrogenase: origin and evolution. BMC Evol Biol 2011;11:160. [36] Naz RK, Kaplan P. Interleukin-6 enhances the fertilizing capacity of human sperm by increasing capacitation and acrosome reaction. J Androl 1994;15:228–33. [37] Almog T, Lazar S, Reiss N, Etkovitz N, Milch E, Rahamim N, et al. Identification of extracellular signal-regulated kinase 122 [38] [39] [40] [41] [42] [43] [44] J O U RN A L OF P ROTE O M IC S 7 9 ( 2 01 3 ) 1 1 4 –12 2 1/2 and p38 MAPK as regulators of human sperm motility and acrosome reaction and as predictors of poor spermatozoan quality. J Biol Chem 2008;283:14479–89. Lampiao F, du Plessis SS. TNF-alpha and IL-6 affect human sperm function by elevating nitric oxide production. Reprod Biomed Online 2008;17:628–31. Perdichizzi A, Nicoletti F, La Vignera S, Barone N, D'Agata R, Vicari E, et al. Effects of tumour necrosis factor-alpha on human sperm motility and apoptosis. J Clin Immunol 2007;27:152–62. Lampiao F, du Plessis SS. Insulin and leptin enhance human sperm motility, acrosome reaction and nitric oxide production. Asian J Androl 2008;10:799–807. Breitbart H, Etkovitz N. Role and regulation of EGFR in actin remodeling in sperm capacitation and the acrosome reaction. Asian J Androl 2010;13:106–10. Naz RK, Kumar R. Transforming growth factor beta 1 enhances expression of 50 kDa protein related to 2′–5′ oligoadenylate synthetase in human sperm cells. J Cell Physiol 1991;146:156–63. von Wolff M, Nowak O, Pinheiro RM, Strowitzki T. Seminal plasma-immunomodulatory potential in men with normal and abnormal sperm count. Eur J Obstet Gynecol Reprod Biol 2007;134:73–8. Lieberman JA, Moscicki AB, Sumerel JL, Ma Y, Scott ME. Determination of cytokine protein levels in cervical mucus [45] [46] [47] [48] [49] samples from young women by a multiplex immunoassay method and assessment of correlates. Clin Vaccine Immunol 2008;15:49–54. Zhao Y, Chegini N, Flanders KC. Human fallopian tube expresses transforming growth factor (TGF beta) isoforms, TGF beta type I–III receptor messenger ribonucleic acid and protein, and contains [125I]TGF beta-binding sites. J Clin Endocrinol Metab 1994;79:1177–84. Krege JH, John SW, Langenbach LL, Hodgin JB, Hagaman JR, Bachman ES, et al. Male–female differences in fertility and blood pressure in ACE-deficient mice. Nature 1995;375:146–8. Kondoh G, Tojo H, Nakatani Y, Komazawa N, Murata C, Yamagata K, et al. Angiotensin-converting enzyme is a GPI-anchored protein releasing factor crucial for fertilization. Nat Med 2005;11:160–6. Nixon B, Bielanowicz A, Anderson AL, Walsh A, Hall T, McCloghry A, et al. Elucidation of the signaling pathways that underpin capacitation-associated surface phosphotyrosine expression in mouse spermatozoa. J Cell Physiol 2010;224: 71–83. White RM, Cech J, Ratanasirintrawoot S, Lin CY, Rahl PB, Burke CJ, et al. DHODH modulates transcriptional elongation in the neural crest and melanoma. Nature 2011;471:518–22.
© Copyright 2025 Paperzz