Protein Phosphorylation and Expression Profiling by Yin

Protein Phosphorylation and Expression Profiling by Yin-Yang
Multidimensional Liquid Chromatography (Yin-Yang MDLC) Mass
Spectrometry
Jie Dai, Wen-Hai Jin, Quan-Hu Sheng, Chia-Hui Shieh, Jia-Rui Wu, and Rong Zeng*
Research Center for Proteome Analysis, Institute of Biochemistry and Cell Biology, Shanghai Institutes for
Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
Received August 16, 2006
A system which consisted of multidimensional liquid chromatography (Yin-yang MDLC) coupled with
mass spectrometry was used for the identification of peptides and phosphopeptides. The multidimensional liquid chromatography combines the strong-cation exchange (SCX), strong-anion exchange
(SAX), and reverse-phase methods for the separation. Protein digests were first loaded on an SCX
column. The flow-through peptides from SCX were collected and further loaded on an SAX column.
Both columns were eluted by offline pH steps, and the collected fractions were identified by reversephase liquid chromatography tandem mass spectrometry. Comprehensive peptide identification was
achieved by the Yin-yang MDLC-MS/MS for a 1 mg mouse liver. In total, 14 105 unique peptides were
identified with high confidence, including 13 256 unmodified peptides and 849 phosphopeptides with
809 phosphorylated sites. The SCX and SAX in the Yin-Yang system displayed complementary features
of binding and separation for peptides. When coupled with reverse-phase liquid chromatography mass
spectrometry, the SAX-based method can detect more extremely acidic (pI < 4.0) and phosphorylated
peptides, while the SCX-based method detects more relatively basic peptides (pI > 4.0). In total, 134
groups of phosphorylated peptide isoforms were obtained, with common peptide sequences but
different phosphorylated states. This unbiased profiling of protein expression and phosphorylation
provides a powerful approach to probe protein dynamics, without using any prefractionation and
chemical derivation.
Keywords: Protein phosphorylation • Protein expression • Strong-cation exchange • Strong-anion exchange • YinYang multidimensional liquid chromatography • pH elution • Mass spectrometry
Introduction
The proteins in cells behave in a very dynamic manner,
which includes the alteration of protein expression, protein
modification, protein distribution, and interaction. When a cell
receives a stimulus from the environment, one of the ubiquitous cellular responses includes protein phosphorylation/
dephosphorylation. The phosphorylation behavior frequently
triggers signal transduction, followed by gene regulation and
protein expression.1-4 Therefore, the protein phosphorylation
and expression appear to be the most important targets
analyzed by current proteomics. Many large-scale protein
identification methods were recently developed to probe the
protein expression profile.5-8 On the other hand, the efforts to
elucidate the phosphorylation seem more difficult due to the
low sensitivity to phosphorylated peptides in mass spectrometry.4
Current global analysis of protein phosphorylation requires
an enrichment method to increase detection of low-abundance
* To whom correspondence should be addressed at Research Center for
Proteome Analysis, Institute of Biochemistry and Cell Biology, Shanghai
Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai
200031,China.Tel.: 86-21-54920170.Fax: 86-21-54920171;E-mail: [email protected].
250
Journal of Proteome Research 2007, 6, 250-262
Published on Web 12/14/2006
phosphoproteins. Some high-affinity chemical derivation
methods15-18 have emerged, but suffer from side reactions,
marginal yields, and low specificities. Immunopurification19-22
and affinity chromatography including immobilized metal
affinity chromatography (IMAC) and titanium dioxide (TiO2)
are frequently used techniques for the enrichment of phosphorylated proteins and peptides,23-28 but also suffer from
unsatisfactory specificity or sensitivity. Recently, strong-cation
exchange (SCX) chromatography using salt gradient has been
applied for phosphopeptide enrichment by collecting fractions
from an SCX column followed by reverse-phase chromatography and mass spectrometry.29,30
Combination of different separation or HPLC methods has
also been used to decrease the complexity of proteins and
increase the concentration of phosphopeptides. First, proteins
were subjected to a prefractionation, such as gel electrophoresis, before the protein digestion.29,30 Some groups combined
two affinity methods for phosphopeptides such as antibody
purification combined with IMAC,19 or ion-exchange chromatography and IMAC,31,32 to increase the capacity of identified
phosphopeptides. Subcellular fractionation was also frequently
contributed to decrease the protein complexity.29,31,32
10.1021/pr0604155 CCC: $37.00
 2007 American Chemical Society
Protein Phosphorylation/Expression Profiling by Yin-Yang MDLC
In our recent work, we used the pH gradient in the SCX to
replace the traditional salt elution, which demonstrated the
distribution of peptides roughly according to their pI values.33
In this report, we designed an effective way to obtain the total
information of peptides, using the newly developed Yin-Yang
multidimensional liquid chromatography tandem mass spectrometry (Yin-yang-MDLC-MS/MS) method. This method
combines SCX, strong-anion exchange (SAX), and reverse-phase
separation for the comprehensive protein identification. The
SCX and SAX in this system displayed complementary acquisition of peptides. The SAX-based method can obtain more
extremely acidic peptides (pI < 4.0) and phosphopeptides,
while SCX-based method identifies relatively basic peptides (pI
> 4.0). In total, 134 groups of phosphorylated peptide isoforms
were observed in this data set, which have common peptide
sequences but different phosphorylated states. This method
provided an unbiased profiling of protein expression and
phosphorylation, without introducing any prefractionation and
chemical derivation.
Experimental Procedures
Chemicals. All the water used in this experiment was
prepared using a Milli-Q system (Millipore, Bedford, MA).
Dithiothreitol (DTT), ammonium bicarbonate, and iodoacetamide (IAA) were all purchased from Bio-Rad (Hercules, CA).
Urea was obtained from Sigma (St. Louis, MO). Trypsin was
purchased from Promega (Madison, WI). Formic acid (FA) was
obtained from Aldrich (Milwaukee, WI). Acetonitrile (HPLC
gradient grade) was obtained from Merck (Darmstadt, Germany). Sodium orthovanadate (Na3VO4) and sodium fluoride
(NaF) were obtained from Sigma (St. Louis, MO); ethylene
diamine tetraacetic acid (EDTA), ethylene glycol-bis-[2-aminoethyl ether]-N,N,N′,N′ tetraacetic acid (EGTA), and phenyl
methyl sulfonyl fluoride (PMSF) were purchased from Amresco
(Solon, OH). All the chemicals are of analytical grade except
acetonitrile, which is of HPLC grade. The SCX column (0.32 ×
100 mm), SAX column (0.32 × 100 mm), and pH buffers kit
were bought from Column Technology, Inc., (Fremont, CA).
Mouse Liver Sample Preparation. Mice were sacrificed, and
their livers were promptly removed and placed in an ice-cold
homogenization buffer consisting of 8 M urea, 4% CHAPS (w/
v), 65 mM DTT, 1 mM EDTA, 0.5 mM EGTA, and a mixture of
protease inhibitor (1 mM PMSF), phosphatase inhibitors (0.2
mM Na3VO4 and 1 mM NaF), and 40 mM Tris-HCl at pH 7.4.
After they were minced with scissors and washed to remove
blood, the livers were homogenized in a Potter-Elvejhem
homogenizer with a Teflon piston, using 10 mL of the homogenization buffer per 2 g of tissue. The suspension was homogenized for approximately 1 min, sonicated for 100 W × 30 s
and centrifuged at 25 000g for 1 h. The supernatants were
colleted.
Tryptic Digestion of Protein Mixture. The tryptic digestion
was processed according to a previously published method.33
Briefly, 1 mg of proteins was redissolved in reducing solution,
then mixed with 5 µL of 1 M DTT. The mixture was incubated
at 56 °C for 1 h. Forty microliters of 1 M IAA was then added,
and the mixture was incubated for an additional 40 min at
room temperature in darkness. The protein mixtures were spun
and exchanged into 100 mM ammonium bicarbonate buffer,
and then incubated with trypsin (25:1) at 37 °C for 20 h.
Yin-Yang-MDLC-MS/MS System. The SCX and SAX columns were rinsed before sample loading. Finally, the SCX
column was filled with a pH 2.5 buffer, and the SAX column
research articles
was filled with a pH 10.0 buffer. In this study, a series of elution
buffers with different pH (pH 10.0, 8.5, 8.0, 7.0, 6.0, 5.5, 5.0,
4.5, 4.0, 3.5, 3.0, and 2.5,) were used for pH step gradient
elution. First, a total of 1 mg of peptide mixture was lyophilized,
and then dissolved in 100 µL of pH 2.5 buffer. Then, the sample
was injected into the SCX column by a syringe pump, at a flow
rate of 3 µL/min. Then, the column was flushed with another
100 µL of elution buffer at pH 2.5. The flow-through fraction
was collected and lyophilized for SAX column fractionation.
Thereafter, the SCX column was eluted by 200 µL each of
buffers at pH 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 7.0, 8.0, 8.5, and
10.0 sequentially, and the corresponding eluted fraction was
collected individually and lyophilized for nanoLC-MS. The
unbound fraction at pH 2.5 of the SCX column was lyophilized
and dissolved in 100 µL of pH 10.0 buffer, then loaded into the
SAX column. Then, another 100 µL of pH 10.0 buffer was used
to elute the unbound peptide mixture in the SAX column. The
flow-through peptide mixture was collected. The SAX column
was eluted by 200 µL each of buffers at pH 8.5, 8.0, 7.0, 6.0,
5.5, 5.0, 4.5, 4.0, 3.5, 3.0, and 2.5 sequentially, and the corresponding eluted fraction was collected individually and lyophilized for nanoLC-MS.
A Surveyor liquid chromatography system (Thermo Finnigan,
San Jose, CA), consisting of a degasser, MS Pump, and autosampler, and equipped with a C18 trap column (RP, 320 µm
× 20 mm, Column Technology, Inc., Fremont, CA) and an
analytical C18 column (RP, 75 µm × 150 mm, Column
Technology, Inc., Fremont, CA) was used. The HPLC solvents
used were 0.1% formic acid (v/v) aqueous (A) and 0.1% formic
acid (v/v) acetonitrile (B). The samples were loaded into the
trap column first at a 3 µL/min flow rate after the split, and
then the reverse-phase gradient was from 2 to 40% mobile
phase B in 165 min at 100 µL/min flow rate before the split
and 250 nL/min after the split. A Finnigan LTQ linear ion trap
mass spectrometer equipped with a nanospray source was used
to for the MS/MS experiment with ion transfer capillary at 160
°C and NSI voltage of 1.8 kV. Normalized collision energy was
35.0. The mass spectrometer was set so that one full MS scan
was followed by 10 MS/MS scans with the following Dynamic
Exclusion settings: repeat count 2, repeat duration at 30 s,
exclusion duration at 90 s.
Data Analysis and Validation. All .dta files were created
using Bioworks 3.2, with precursor mass tolerance of 1.4 Da,
threshold of 100, and minimum ion count of 15. The acquired
MS/MS spectra were searched against the Mouse International
Protein Index protein sequence database (version 3.07, www.ebi.ac.uk/IPI) combined with real protein and reverse sequences of proteins, by using the TurboSEQUEST program in
the BioWorks 3.2 software suite, with a mass tolerance of 3.0
Da. One missed cleavage site of trypsin was allowed. Dynamic
modifications were allowed for the detection of oxidized Met
(+16), and phosphorylated Ser, Thr, and Tyr (+80).
All output results were combined using an in-house software
named BuildSummary. When no modifications were included,
the filter was set to Xcorr g1.9 with charge state of 1+, Xcorr
g2.2 with charge state of 2+, and Xcorr g3.75 with charge state
of 3+; ∆Cn g 0.1. For phosphopeptides, the filter was set to
Xcorr g2.0 with charge state of 1+, Xcorr g2.5 with charge state
of 2+, and Xcorr g3.3 for charge state of 3+. Both of the filters
used ∆Cn above 0.1. The unmodified peptide and phosphopeptide were combined to assign proteins. Finally, only proteins
with at least two unique peptides were kept. For proteins that
have single phosphopeptides identified, only phosphopeptides
Journal of Proteome Research • Vol. 6, No. 1, 2007 251
research articles
Dai et al.
Figure 1. The flowchart of the Yin-yang-MDLC-MS/MS. Protein digests were loaded on the SCX column first, and the flow-through
fraction was collected. The bound peptides in SCX were eluted by 11-pH steps from pH 3.0 to pH 10.0. The flow-through fraction was
loaded on the offline SAX column and eluted from pH 10.0 to pH 2.5. Each fraction from SCX and SAX was identified by reverse-phase
liquid chromatography coupled with mass spectrometry.
with at least two MS/MS spectra were kept. The false-positive
rate was calculated based on the following formula: % fal )
2[nrev/(nrev + nreal)]. The nrev is the number of peptide hits
matched to “reverse” protein, and nreal is the number of peptide
hits matched to “real” protein.8 The false-positive rate was
finally 1.3%. Further manual check removed the suspicious
phosphopeptides with unclear MS/MS spectra. Therefore, the
final false-positive rate of this data set should be <1.3%.
For phosphorylated site determination, when an exact phosphorylated site could not be determined in a phosphorylated
peptide, the site was defined as ambiguous. If ambiguous sites
in one MS spectrum were confirmed by exact sites in another
MS spectrum, the ambiguous sites would not be counted. If
peptides have a common sequence, but different numbers of
ambiguous phosphorylated sites, the least ambiguous sites were
counted.
Results and Discussion
Peptide and Phosphopeptide Identification by Yin-Yang
MDLC Method. A Yin-yang-MDLC-MS/MS system was constructed based on the pH elution. Figure 1 shows the flowchart
of the Yin-yang-LC system. One milligram of mouse liver
protein digest was loaded onto the SCX column first, and the
flow-through fraction was collected. The bound peptides in the
SCX were eluted by 11-pH steps from pH 3.0 to pH 10.0. The
flow-through fraction was loaded on the offline SAX column
and eluted by 12 steps from pH 10.0 to pH 2.5. Each fraction
from the SCX and SAX was loaded on reverse-phase coupled
with mass spectrometry. In this method, the basic peptides
were retained and separated by strong-cation exchange column. The acidic peptides which did not bind to the SCX
column was collected and separated by anion exchange
252
Journal of Proteome Research • Vol. 6, No. 1, 2007
column. This is particularly important for the recovery of
phosphopeptides which are more acidic than other peptides
in general. All the fractions from both columns then undergo
reverse-phase separation followed by tandem mass spectrometry. This is the most comprehensive identification of the
peptides and phosphopeptides in the MDLC-MS/MS. All the
peptides were either retained or separated by the strong-cation
exchange, or by the anion exchange and then followed by
reverse-phase chromatography. This design can be used to
identify both pospho- and nonphosphopeptides in one analysis.
Table 1 shows the mouse liver proteome identified in Yinyang MDLC mass spectrometry, including the unmodified and
phosphorylated peptide identification in each SCX and SAX
fraction, respectively. Totally, 9971 unique unmodified peptides
were identified in SCX-RP-MS, while 6403 were identified in
SAX-RP-MS, resulting in 13 256 unique unmodified peptides.
On the other hand, 659 unique phosphopeptides were detected
in SAX-RP-MS, and 210 phosphopeptides were identified in
SCX-RP-MS. Twenty phosphopeptides were both identified
by SAX-RP-MS and SCX-RP-MS. The combined 849 unique
phosphopeptides were identified including 809 phosphorylated
sites by the Yin-yang-MDLC-MS/MS system, under the low
false-positive rate of 1.3%. Of the 809 phosphorylated sites,
there are 607 unique sites, of which 425 (70.02%) are on Ser,
130 (21.42%) on Thr, and 52 (8.57%) on Tyr. The other 202
phosphorylated sites belong to 458 ambiguous candidates. In
SAX-RP-MS, 2.95 spectra were obtained on average for each
phosphopeptides, while in SCX-RP-MS, the ratio was only
1.59. The ratios for unmodified peptides are very similar in SAX
(9.80) and SCX (8.18). The higher number of MS/MS spectra
indicates the higher intensity of the phosphopeptides in SAX.
research articles
Protein Phosphorylation/Expression Profiling by Yin-Yang MDLC
Table 1. The Unmodified Peptides and Phosphorylated Peptides Identified in Each Fraction by SCX-RP-MS (A and B) and
SAX-RP-MS (C and D)
fraction
pH 3.0
pH 3.5
unique peptides
peptide hits
ratio (hits/unique)
3677
15448
4.201
3778
17552
4.646
unique peptides
peptide hits
ratio (hits/unique)
70
99
1.414
62
94
1.516
fraction
pH 2.5
pH 3.0
unique peptides
peptide hits
ratio (hits/unique)
698
2010
2.880
471
2092
4.442
unique peptides
peptide hits
ratio (hits/unique)
78
249
3.192
110
393
3.573
pH 4.0
pH 4.5
pH 5.0
pH 5.5
pH 6.0
(A) Unmodified Peptides Identified in SCX
2342
1829
1539
1270
801
10552
9876
8472
5889
3226
4.506
5.400
5.505
4.637
4.027
(B) Phosphopeptides Identified in SCX
35
22
10
16
5
38
26
19
26
6
1.086
1.182
1.900
1.625
1.200
pH 3.5
pH 4.0
pH 4.5
pH 5.0
pH 5.5
(C) Unmodified Peptides Identified in SAX
588
648
862
699
955
3256
5132
3921
3490
4226
5.537
7.920
4.549
4.993
4.425
(D) Phosphopeptides Identified in SAX
186
91
54
28
59
407
174
94
75
110
2.188
1.912
1.741
2.679
1.864
The Yin-yang MDLC system used the SCX as the first
dimension, and the unbound materials were further loaded and
separated using SAX. The coupling of SCX and SAX captured
peptides more comprehensively than SCX alone. As our results
show, the flow-though fraction still contained thousands of
peptides, especially many phosphopeptides. In reported work,
fractions from SCX column with salt gradient in pH 2.729,30 were
collected and followed by mass spectrometry identification, and
6 mg of protein fractionated by SDS-PAGE were needed before
the SCX separation of peptides. Other work combined the ion
exchange with IMAC to increase the sample capacity.31,32 Our
method is much simpler and easier to handle based only on
ion exchange chromatography, consuming only 1 mg of
proteins. Unlike other enrichment methods of phosphopeptides, this method obtained many phosphopeptides without
loss of unmodified peptides. The identified unmodified peptides and phosphopeptides were listed in Supplementary Tables
1 and 2 of Supporting Information.
SCX and SAX Coupled with Mass Spectrometry Can Identify
Different Peptides. Figure 2 shows the base peak of fractions
in SAX-RP-MS. The clear peaks in the acidic regions indicated
that many acidic peptides were identified by the SAX-RP-MS.
Figure 3 presents the distribution of the theoretical pI of the
identified unmodified peptides. It is very interesting that the
SCX-RP-MS and SAX-RP-MS identification are extremely
complementary. In SCX-RP-MS, many more peptides with pI
g 4.0 were identified than in SAX-RP-MS. However, for the
unmodified peptides with theoretical pI 3.5-4.0, 6944 spectra
were obtained for 796 unique peptides identified by SCX-RPMS, while up to 35 884 spectra were identified for 2174 unique
peptides in SAX-RP-MS. For peptides with pI 3.0-3.5, only 25
spectra for 9 unique peptides were obtained in SCX-RP-MS,
but SAX-RP-MS still identified 1360 spectra belonging to 96
unique peptides. Thus, the SAX-RP-MS identified 5.34-fold
more of the spectra and 2.82-fold more of the unique peptides
than SCX-RP-MS, for the unmodified peptides with theoretical pI < 4.0.
Figure 4 illustrates the theoretical pI distribution of the
nonphosphorylated precursors of the identified singly phosphorylated peptides in SCX and SAX. Most of the peptide
backbones of the phosphopeptides were originally acidic with
the pI below 4.5. For peptides with pI 4.0-4.5, 144 singly
phosphorylated products were detected, in which 40 unique
pH 7.0
pH 8.0
pH 8.5
pH 10.0
1098
3970
3.616
553
1643
2.971
556
1674
3.011
562
3284
5.843
6
10
1.667
0
0
/
1
1
1.000
11
12
1.091
pH 6.0
pH 7.0
pH 8.0
pH 8.5
pH 10.0
972
5428
5.584
1544
7257
4.700
1615
8811
5.456
3485
10804
3.100
2330
6351
2.726
50
104
2.080
68
118
1.735
55
95
1.727
50
65
1.300
22
27
1.227
peptides (55 hits) were detected in SCX-RP-MS, and 104
unique peptides (181 hits) were found in SAX-RP-MS. For
peptides with pI 3.5-4.0, 344 unique peptides (with 1222 hits)
were detected with their singly phosphorylated peptides by
SAX-RP-MS, while only 10 of them (with 19 hits) were
identified in SCX-RP-MS. It is interesting that SCX-RP-MS
lost most of the unmodified peptides with pI below 4.0 (as
Figure 3). SCX-RP-MS also cannot identify most of the
phosphorylated peptides if their precursor pI values were below
4.5. It seems that the single-phosphorylation statistically decreased the pI of the peptides 0.5 unit. Figure 4A also indicates
that most of the nonphosphorylated precursors of the phosphopeptides are originally acidic with pI below 4.5. Thus, the
phosphorylation will make the peptides more acidic, which
further increases their loss in the SCX-RP-MS, and these were
then identified in the secondary SAX-RP-MS. Most of the
multiple phosphorylated peptides (82% of the peptide hits)
were detected in SAX-RP-MS. The movement of the pI to more
acidic regions by multiphosphorylation leads to further loss in
SCX-RP-MS. It is also notable that there are still some
phosphopeptides obtained by SCX-RP-MS, including 69
phosphorylated peptides that their nonphosphorylated precursors were originally basic, such as >8.0. Therefore, the phosphopeptide distribution is more comprehensive than expected.
The SCX-RP-MS preferentially identified some phosphopeptides if their nonphosphorylated precursors are very basic. The
phosphorylation did not make them too acidic to be retained
by SCX-RP-MS. The 849 unique phosphorylated peptides
were listed in Supplementary Table 2 of Supporting Information.
Different Distribution Tendency of Peptides in SCX and
SAX. Figure 5 illustrates the phosphopeptide and unmodified
peptide distribution profiles in the SCX-RP-MS and SAX-RPMS. In SCX-RP-MS, the phosphopeptides and unmodified
peptides display the similar distribution tendency along with
the pH increase. The number of these two kinds of peptides
both declined when pH increased. In contrast, in SAX-RP-MS,
the phosphopeptides and unmodified peptides had opposite
focusing tendency. When pH decreased, the level of unmodified
peptides declined significantly, while the phosphopeptides
increased. In the SCX, even in the acidic region, the phosphorylated peptides still mixed with many unmodified peptides.
This indicates, though SCX might retain some phosphopepJournal of Proteome Research • Vol. 6, No. 1, 2007 253
research articles
Dai et al.
Figure 2. The base peak of SAX-RP-MS in the Yin-yang MDLC-MS/MS system. More intensive signals were obtained in acidic
regions.
254
Journal of Proteome Research • Vol. 6, No. 1, 2007
Protein Phosphorylation/Expression Profiling by Yin-Yang MDLC
research articles
Figure 3. The distribution of unique peptides (A) and spectra hits (B) from unmodified peptides, in given theoretical pI regions. The
gray bars show the identified numbers in SCX-RP-MS, and black bars represent the identified numbers in SAX-RP-MS. The X-axis
represents theoretical pI range calculated from unmodified peptides, and the Y-axis shows the numbers of peptides and spectra hits
in each pI range.
tides, the coelution of unmodified peptides with phosphopeptides in acidic or low-salt regions still suppressed the identification of phosphopeptides by mass spectrometry. Therefore,
the secondary IMAC was used to further enrich the phosphopeptides in previous work.31 Otherwise, prefractionation of
proteins before SCX was used to decrease the coeluted,
unmodified peptides in low-salt region.29,30 In this work, the
SAX-RP-MS seems to be more suitable to enrich and identify
phosphopeptides than SCX-RP-MS. The only concern of the
SAX is the limited loading capacity. Therefore, the use of SCX
prior to SAX bound most unmodified peptides and significantly
decreased the complexity of the peptide mixture. The phosphopeptides were further focused to acidic regions in SAX by
pH elution, which decreased the suppression of nonphosphopeptides to phosphopeptides as much as possible. This SCXSAX mode can identify more phosphopeptides than SAX-only
mode, or SAX-SCX mode (data not shown).
In the offline mode, the traditional salt elution needs
desalting, resulting in sample loss. Because of the strong
binding of the phosphopeptides with the SAX, the phosphopeptides need to be eluted by high concentration of salt if the
traditional mode is used. This may be problematic for the
analysis of phosphopeptides, because phosphopeptides are
more likely to be lost during the desalting process, as reported
by several groups.32,34-37 For a pH gradient-based ion exchange
chromatography, the basic peptides will elute at higher pHs
and the acidic peptides will elute at lower pHs, with the static
low concentration (5-10 mM) of salt as we reported.33 The pH
elution had excellent peptide resolution according to the
peptide pI.33 The pH step elution method used in this study
not only enriched the phosphopeptides to acidic regions, but
also reduced the risk of phosphopeptide losses to a minimum,
since no further desalting was needed.
Comprehensive Coverage of Peptides and Proteins of the
Yin-Yang MDLC-MS/MS. This unbiased method based on the
Yin-Yang system can achieve more complete resolution of
phosphopeptides, as well as unmodified peptides, than traditional methods. Among the 13 256 unique and unmodified
peptides, only 3118 were identified both by SCX-RPLC-MS
and SAX-RPLC-MS, demonstrating 23.5% overlapping. Of the
849 unique phosphopeptides, only 20 were identified both by
SCX and SAX. When the unmodified and phosphopeptides are
combined, 14 105 unique peptides were identified, corresponding to 2804 high-confidence identifications of proteins. There
are 72.2% of proteins that were identified both by SCX and SAX
(2024 overlapped protein vs 2804 total proteins), which indicated more peptide coverage of the proteins was obtained from
multiple peptide matching, due to the complementary features
of SCX-RP-MS and SAX-RP-MS. SCX-RP-MS occupied
75.2% unique unmodified peptides and SAX-RP-MS identified
Journal of Proteome Research • Vol. 6, No. 1, 2007 255
research articles
Dai et al.
Figure 4. The distribution of unique peptides (A) and spectra hits (B) from nonphosphorylated precursors of the phosphopeptides, in
given theoretical pI regions. The gray bars show the identified numbers in SCX-RP-MS, and black bars represent the identified numbers
in SAX-RP-MS. The X-axis shows the theoretical pI range calculated from nonphosphorylated precursors of the phosphopeptides,
and the Y-axis demonstrates the numbers of peptides and spectra hits in each pI range.
77.6% of the unique phosphopeptides. The Yin-Yang-MDLCMS/MS system shows complementarity and high resolution for
proteome analysis. For example, protein alpha-1 catenin was
identified with 13 unmodified peptides and 2 phosphopeptides,
in which 11 unmodified peptides were only found in SCXRP-MS, 1 found only in SAX-RP-MS, and 1 detected both in
SAX and SCX. The two phosphopeptides were only detected
by SAX-RP-MS. Table 2 shows the distribution of peptides
from alpha-1 catenin. The Yin-Yang-MDLC-MS/MS system
provides an unbiased method to obtain phosphopeptides, as
well as unmodified peptides. Certainly, when only phosphopeptides were focused, this method can be combined with
other affinity methods to further enrich the phosphopeptides.
In this work, extensive manual check was needed, especially
for the phosphopeptides, due to the low-resolution of the linear
ion trap mass spectrometry. Further work using high-resolution
LTQ-Orbitrap mass spectrometry has shown to be more
powerful for phosphopeptide identification (Dai, J. et al.,
manuscript in preparation).
Peptide Isoforms Obtained in Yin-Yang-MDLC-MS/MS.
It is interesting that phosphopeptide isoforms with common
peptide sequence but different phosphorylated states can be
obtained by this method. There are 134 groups of phosphopeptide isoforms acquired by this method. It is noted that most
of the isoform groups are identified by SAX-RP-MS. Among
the 134 phosphopeptide isoforms, 101 groups with all compo256
Journal of Proteome Research • Vol. 6, No. 1, 2007
nents were only found in SAX-RP-MS, 3 groups only found
in SCX-RP-MS, and 30 groups were found complementary
in SCX-RP-MS and SAX-RP-MS.
Panels A and B of Figure 6 show the spectra of peptide
isoforms of DQWGSEEEEEAGGYR and DQWGS*EEEEEAGGYR,
respectively. These two groups of peptides both belong to the
Cdc42 effector protein 4, which was not found with phosphorylation before in any reference or annotation (www.expasy.ch).
The nonphosphopeptide shows the dominant y13 ion, while
the phosphopeptide has the intensive peak form neutral loss.
However, the common y3 to y10 confirmed the phosphopeptide identification.
Figure 6C,D shows the mass spectra of peptide isoforms of
VPS(*)S(*)DEEVVEEPQSR, with single but ambiguous phosphorylation, and VPS*S*DEEVVEEPQSR, with double distinct
phosphorylated sites, both found in SAX-RP-MS. In the
monophosphopeptide spectra, the y4 and y7 are still dominant.
In the doubly phosphopeptide spectra, the y4 and y7 are
suppressed by the neutral loss peak. The serial y3-y11 confidently identifies the doubly phosphorylated peptide. These
phosphopeptides belong to the mKIAA1741 protein, which is
annotated as 182 kDa tankyrase 1-binding protein (http://
harvester.embl.de). The same phosphorylated sites were also
found most recently.31
Figure 6E,F presents the spectra of peptide isoforms of
GDKS(*)S(*)EPTEDVETK, with a monophosphorylated site
research articles
Protein Phosphorylation/Expression Profiling by Yin-Yang MDLC
Figure 5. The distributions of phosphopeptides and unmodified peptides in fractions of SCX-RP-MS (A) and SAX-RP-MS (B). The
X-axis shows the pH fractions in each method, and the Y-axis presents the percentage of identified spectra hits in each fraction vs all
spectra hits in each method.
Table 2. The Distribution of Unmodified Peptides and Phosphpeptides in SCX-RP-MS and SAX-RP-MS, from Protein Alpha-1
Catenina
IPI:IPI00112963.1| Alpha-1 catenin
fraction
in SCX
(pH)
charge
pI
K.AHVLAASVEQATENFLEK.G
K.AVMDHVSDSFLETNVPLLVLIEAAK.N
K.HVNPVQALSEFK.A
K.IAEQVASFQEEK.S
K.NVPILYTASQACLQHPDVAAYK.A
K.QIIVDPLSFSEER.F
K.SAAGEFADDPCSSVK.R
K.SKLDAEVSK.W
K.SQGMASLNLPAVSWK.M
R.MSASQLEALCPQVINAALALAAKPQSK.L
R.QALQDLLSEYMGNAGR.K
R.TpSpVQTpEDDQLIAGQSAR.A
2
2 and 3
2
2
2
2
2
2
2
3
2
2
4.75
4.31
6.75
4.25
6.73
4.14
4.03
5.79
8.47
7.95
4.37
4.03
5
4.5
5.5
3
4
3.5
3
5
3
4.5
3.5
N.F
R.TPEELDDS*DFETEDFDVR.S
R.TPEELDDSDFETEDFDVR.S
R.VLTDAVDDITSIDDFLAVSENHILEDVNK.C
2
2
3
3.51
3.51
3.76
N.F
N.F
3.5, 4.0, 4.5,
6.0, 7.0
fraction
in SAX
(pH)
N.F
N.F
N.F
8.5
N.F
N.F
N.F
N.F
N.F
N.F
N.F
4.5, 5.0,
5.5, 6.0
2.5
3
N.F
modified
site
N.F
N.F
N.F
N.F
N.F
N.F
N.F
N.F
N.F
N.F
N.F
2
1
N.F
N.F
N.F
a
(*) Indicates the phosphorylated site is unique; p indicates the phosphorylated site is ambiguous; N.F., not found.
ambiguously on serine, and GDKS*S*EPTEDVETK, with phosphorylation on both serine residues. The doubly phosphorylated spectrum has a more complicated spectra and is not easily
identified. The series of y3-y5 ions and the obvious y8 ions
that appeared in both peptide isoforms display similar patterns
of fragmentations, confirming the identification of the multiphosphorylated peptide. These peptides were from the TransGolgi network integral membrane protein 1. This protein is a
membrane protein located in the golgi apparatus and has been
thought of as a glycoprotein. The phosphorylation of this
protein was first found in this work.
Supplementary Table 3 in Supporting Information shows the
134 peptide groups with nonphosphopeptides, monophosphopeptides, and multiphosphopeptides. Almost all of the phosphorylated peptides are eluted in more acidic fractions than
their nonphosphorylated counterparts. Also, more phosphopeptide isoforms were identified by SAX-RP-MS. It is reasonable to expect that, if a peptide has multiple potential phosJournal of Proteome Research • Vol. 6, No. 1, 2007 257
research articles
Figure 6 (continued)
258 Journal of Proteome Research • Vol. 6, No. 1, 2007
Dai et al.
Protein Phosphorylation/Expression Profiling by Yin-Yang MDLC
research articles
Figure 6 (continued)
Journal of Proteome Research • Vol. 6, No. 1, 2007 259
research articles
Dai et al.
Figure 6. The MS/MS spectra of representive peptide isoforms: peptide DQWGSEEEEEAGGYR (A) and its monophosphorylated peptide
DQWGS*EEEEEAGGYR (B); monophosphopeptide VPS*SDEEVVEEPQSR (C) and its doubly phosphorylated peptide VPS*S*DEEVVEEPQSR(D); monophosphopeptide GDKS(*)S(*)EPTEDVETK (E) and corresponding doubly phosphorylated peptide GDKS*S*EPTEDVETK (F).
The (*) means ambiguous phosphorylated site, and the * means distinct phosphorylated site.
260
Journal of Proteome Research • Vol. 6, No. 1, 2007
Protein Phosphorylation/Expression Profiling by Yin-Yang MDLC
phorylated sites, more serines and threonines will be contained.
The phosphorylation will make them more acidic, with makes
it much easier to be lost in SCX-RP-MS. Supplementary Table
4 in Supporting Information shows the identification details
of all unmodified peptides and phosphopeptides.
Conclusion
In conclusion, this methodology significantly enhances the
ability to routinely discover protein expression and phosphorylation within very complex protein mixtures by exploiting pH
step-based Yin-yang-MDLC-MS/MS. The pH elution enables
the sensitive identification and minimum sample loss of
phosphopeptides. The separation and identification of phosphopeptides using SAX-RP-MS also presents a novel strategy
for phosphoproteomics. The combination of the SCX-RP-MS
and SAX-RP-MS achieved comprehensive identification of
phosphorylated as well as unmodified peptides.
Phosphoproteins can exist with varying degrees of phosphorylation, reflecting different functional states of cells. Also, the
phosphorylation will trigger gene expression leading to alteration of protein expression. Further work is ongoing that
combines this Yin-yang-MDLC-MS/MS with quantitative
methods, such as iTRAQ and SILAC,10,11,39 to probe the dynamics of the protein expression as well as phosphorylation.
Abbreviations: SAX, strong-anion exchange; SCX, strongcation exchange; Yin-yang MDLC-MS/MS, Yin-Yang multidimensional liquid chromatography tandem mass spectrometry;
IMAC, immobilized metal affinity chromatography; TiO2, titanium dioxide; LC, liquid chromatography; SCX-RP-MS, strongcation exchange reverse-phase liquid chromatography mass
spectrometry; SAX-RP-MS, strong-anion exchange reversephase liquid chromatography mass spectrometry; ESI, electrospray ionization; MS/MS, tandem mass spectrometry; EDTA,
ethylene diamine tetraacetic acid; EGTA, ethylene glycol-bis[2-aminoethyl ether]-N,N,N′,N′ tetraacetic acid.
Acknowledgment. We thank Mr. Lei Zhang of Shanghai
Institute of Biochemistry and Cell Biology for his maintenance
of the PC cluster system. This work was supported by the
National Natural Science Foundation (30425021, 0637S12442),
Basic Research Foundation (2002CB713807), and Shanghai Key
Project of Basic Science Research (04DZ14005).
Supporting Information Available: Tables listing the
identified unmodified peptides and phosphopeptides; the 134
peptides groups with nonphosphopeptides, monophosphopeptides, and multiphosphopeptides; and the identification
details of all unmodified peptides and phophopeptides. This
material is available free of charge via the Internet at http://
pubs.acs.org.
References
(1) Pawson, T.; Scott J. D. Protein phosphorylation in signalings50
years and counting. Trends Biochem. Sci. 2005, 30, 286-290.
(2) Cohen, P. The origins of protein phosphorylation. Nat. Cell Biol.
2002, 4, E127-130.
(3) Mukherji, M. Phosphoproteomics in analyzing signaling pathways. Expert Rev. Proteomics 2005, 2, 117-128.
(4) Mann, M.; et al. Analysis of protein phosphorylation using mass
spectrometry: deciphering the phosphoproteome. Trends Biotechnol. 2002, 20, 261-268.
(5) Link, A. J.; Eng, J.; Schieltz, D. M.; Carmack, E.; Mize, G. J.; Morris,
D. R.; Garvik, B. M.; Yates, J. R., III. Direct analysis of protein
complexes using mass spectrometry. Nat. Biotechnol. 1999, 17,
676-682.
research articles
(6) Washburn, M. P.; Wolters, D.; Yates, J. R., III. Large-scale analysis
of the yeast proteome by multidimensional protein identification
technology. Nat. Biotechnol. 2001, 19, 242-247.
(7) Chelius, D.; Huhmer, A. F.; Shieh, C. H.; Lehmberg, E.; Traina, J.
A.; Slattery, T. K.; Pungor, E., Jr. Analysis of the adenovirus type
5 proteome by liquid chromatography and tandem mass spectrometry methods. J. Proteome Res. 2002, 1, 501-513.
(8) Peng, J.; Elias, J. E.; Thoreen, C. C.; Licklider, L. J.; Gygi, S. P.
Evaluation of multidimensional chromatography coupled with
tandem mass spectrometry (LC/LC-MS/MS) for large-scale
protein analysis: the yeast proteome. J. Proteome Res. 2003, 2,
43-50.
(9) Ross, P. L.; et al. Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents.
Mol. Cell. Proteomics 2004, 3, 1154-1169.
(10) Ong, S. E.; et al. Stable isotope labeling by amino acids in cell
culture, SILAC, as a simple and accurate approach to expression
proteomics. Mol. Cell. Proteomics 2002, 1, 376-386.
(11) Andersen, J. S.; et al. Nucleolar proteome dynamics. Nature 2005,
433, 77-83.
(12) Kratchmarova, I.; Blagoev, B.; Haack-Sorensen, M.; Kassem, M.;
Mann, M. Mechanism of divergent growth factor effects in
mesenchymal stem cell differentiation. Science 2005, 308, 14721477.
(13) Cantin, G. T.; Venable, J. D.; Cociorva, D.; Yates, J. R., III.
Quantitative phosphoproteomic analysis of the tumor necrosis
factor pathway. J. Proteome Res. 2006, 5, 127-134.
(14) Amanchy, R.; Kalume, D. E.; Iwahori, A.; Zhong, J.; Pandey, A.
Phosphoproteome analysis of HeLa cells using stable isotope
labeling with amino acids in cell culture SILAC. J. Proteome Res.
2005, 4, 1661-1671.
(15) Oda, Y.; Nagasu, T.; Chait, B. T. Enrichment analysis of phosphorylated proteins as a tool for probing the phosphoproteome.
Nat. Biotechnol. 2001, 19, 379-382.
(16) Goshe, M. B.; et al. Phosphoprotein isotope-coded affinity tags:
application to the enrichment and identification of low-abundance phosphoproteins. Anal. Chem. 2002, 74, 607-616.
(17) Goshe, M. B.; et al. Phosphoprotein isotope-coded affinity tag
approach for isolating and quantitating phosphopeptides in
proteome-wide analyses. Anal. Chem. 2001, 73, 2578-2586.
(18) Zhou, H.; Watts, J. D.; Aebersold, R. A systematic approach to
the analysis of protein phosphorylation. Nat. Biotechnol. 2001,
19, 375-378.
(19) Zhang, Y.; et al. Time-resolved mass spectrometry of tyrosine
phosphorylatiopn sites in the EGF receptor signaling network
reveals dynamic modules. Mol. Cell. Proteomics 2005, 4, 12401250.
(20) Hinsby, A. M.; Olsen, J. V.; Mann, M. Tyrosine phosphoproteomics
of fibroblast growth factor signaling: a role for insulin receptor
substrate-4. J. Biol. Chem. 2004, 279, 46438-46447.
(21) Blagoev, B.; Ong, S. E.; Kratchmarova, I.; Mann, M. Temporal
analysis of phosphotyrosine-dependent signaling networks by
quantitative proteomics. Nat. Biotechnol. 2004, 22, 1139-1145.
(22) Wang, Y.; Li, R.; Du, D.; Zhang, C.; Yuan, H.; Zeng, R.; Chen, Z.
Proteomic analysis reveals novel molecules involved in insulin
signaling pathway. J. Proteome Res. 2006, 5, 846-855.
(23) Ficarro, S. B.; et al. Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiae. Nat.
Biotechnol. 2002, 20, 301-305.
(24) Posewitz, M. C.; Tempst, P. Immobilized gallium(III) affinity
chromatography of phosphopeptides. Anal. Chem. 1999, 71,
2883-2892.
(25) Garcia, B. A.; Joshi, S.; Thomas, C. E.; Chitta, R. K.; Diaz, R. L.;
Busby, S. A.; Andrews, P. C.; Ogorzalek Loo, R. R.; Shabanowitz,
J.; Kelleher, N. L.; Mizzen, C. A.; Allis, C. D.; Hunt, D. F.
Comprehensive phosphoprotein analysis of linker histone H1
from Tetrahymena thermophila. Mol. Cell. Proteomics 2006, 5,
1593-1609.
(26) Hart, S. R.; Waterfield, M. D.; Burlingame, A.L; Cramer, R. Factors
governing the solubilization of phosphopeptides retained on
ferric NTA IMAC beads and their analysis by MALDI TOFMS. J.
Am. Soc. Mass Spectrom. 2002, 13, 1042-1051.
(27) Larsen, M. R.; Thingholm, T. E.; Jensen, O. N.; Roepstorff, P.;
Jorgensen, T. J. Highly selective enrichment of phosphorylated
peptides from peptide mixtures using titanium dioxide microcolumns. Mol. Cell. Proteomics 2005, 4, 873-886.
(28) Pinkse, M. W.; Uitto, P. M.; Hilhorst, M. J.; Ooms, B.; Heck, A. J.
Selective isolation at the femtomole level of phosphopeptides
from proteolytic digests using 2D-nanoLC-ESI-MS/MS and titanium oxide precolumns. Anal. Chem. 2004, 76, 3935-3943.
Journal of Proteome Research • Vol. 6, No. 1, 2007 261
research articles
(29) Beausoleil, S. A.; et al. Large-scale characterization of HeLa cell
nuclear phosphoproteins. Proc. Natl. Acad. Sci. U.S.A. 2004, 101,
12130-12135.
(30) Ballif, B. A.; Villen, J.; Beausoleil, S. A.; Schwartz, D.; Gygi, S. P.
Phosphoproteomic analysis of the developing mouse brain. Mol.
Cell. Proteomics 2004, 3, 1093-1101.
(31) Trinidad, J. C.; Specht, C. G.; Thalhammer, A.; Schoepfer, R.;
Burlingame, A. L. Comprehensive identification of phosphorylation sites in postsynaptic density preparations. Mol. Cell.
Proteomics 2006, 5, 914-922.
(32) Nuhse, T. S.; Stensballe, A.; Jensen, O. N.; Peck, S. C. Large-scale
analysis of in vivo phosphorylated membrane proteins by immobilized metal ion affinity chromatography and mass spectrometry. Mol. Cell. Proteomics 2003, 2, 1234-1243.
(33) Dai, J.; Shieh, C. H.; Sheng, Q. H.; Zhou, H.; Zeng, R. Proteomic
analysis with integrated multiple dimensional liquid chromatography/mass spectrometry based on elution of ion exchange
column using pH steps. Anal. Chem. 2005, 77, 5793-5799.
(34) Neubauer, G.; Mann, M. Mapping of phosphorylation sites of gelisolated proteins by nanoelectrospray tandem mass spectrometry: potentials and limitations. Anal. Chem. 1999, 71, 235242.
262
Journal of Proteome Research • Vol. 6, No. 1, 2007
Dai et al.
(35) Chin, E. T.; Papac, D. I. The use of a porous graphitic carbon
column for desalting hydrophilic peptides prior to matrix-assisted
laser desorption/ionization time-of-flight mass spectrometry.
Anal. Biochem. 1999, 273, 179-185.
(36) Vacratsis, P. O.; Phinney, B. S.; Gage, D. A.; Gallo, K. A. Identification of in vivo phosphorylation sites of MLK3 by mass spectrometry and phosphopeptide mapping. Biochemistry. 2002, 41, 56135624.
(37) Larsen, M. R.; Cordwell, S. J.; Roepstorff, P. Graphite powder as
an alternative or supplement to reversed-phase material for
desalting and concentration of peptide mixtures prior to matrixassisted laser desorption/ionization-mass spectrometry. Proteomics 2002, 2, 1277-1287.
(38) Ross, P. L.; et al. Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents.
Mol. Cell. Proteomics 2004, 3, 1154-1169.
(39) Venable, J. D.; Dong, M. Q.; Wohlschlegel, J.; Dillin, A.; Yates, J.
R. Automated approach for quantitative analysis of complex
peptide mixtures from tandem mass spectra. Nat. Methods 2004,
1, 39-45.
PR0604155