"Mass Defect" Labels

IMLS™- A Method for Sequencing Intact Proteins by In-Source
Fragmentation Using Novel "Mass Defect" Labels
M.P. Hall, S. Ashrafi, R. Petesch, and L.V. Schneider
Abstract
We present a new class of "mass defect" tags with broad utility in biomolecular mass
spectrometry. Elements with atomic numbers between 17 (Cl) and 77 (Ir) have a substantially
different nuclear binding energy (mass defect) from the elements common to biomolecules. This
mass defect yields a readily resolvable mass difference between tagged and untagged species in
high-resolution mass spectrometers. Examples of the use of these tags in inverted mass ladder
sequencing (IMLS™) and as isotope differentiated binding energy shift tags (IDBEST™) for
differential peptide display on protein chips are presented. IMLS™ has substantial speed and
cost advantages over traditional tandem MS sequencing methods. IDBEST™ may yield the
precision advantages of isotope-coded affinity tags (ICAT), without the need for affinity cleanup
of the peptides. Mass defect tags also have potential for sequencing other biopolymers (e.g.,
DNA, oligosaccharides, and lipids) and in combinatorial chemistry and high throughput
screening applications.
Introduction
Applications of biomolecular mass spectrometry can be drawn from the areas of
identification and sequencing of proteins,(1) polynucleic acids,(2) and polysaccharides.(3) Mass
spectrometry has been successfully applied to probing biomolecular structure/function
relationships such as protein-ligand and protein-protein interactions.(4) The ability to resolve
stable isotopes has also made mass spectrometry useful for differential display applications.(5)
However, chemical noise in mass spectra arising from matrix impurities, fragmentation products,
or unidentified constituents can compromise spectral analysis. Incorporation of one or more
elements having atomic numbers between 17 (Cl) and 77 (Ir), and more effectively between 35
(Br) and 63 (Eu), into the biomolecules of interest produces a discernible mass shift of these
tagged biomolecules in the mass spectrum away from the rest of the chemical noise, improving
the ability to analyze the spectrum.(6)
The Mass Defect
The mass defect is related to the nuclear binding energy released upon formation and
stabilization of the nucleus of a given isotope.(7) By convention, the mass defect of C-12 is
defined as zero atomic mass units (amu), and the mass defect of any other stable elemental
isotope is calculated as: (8)
Mass Defect (amu) = Monoisotopic Mass - Σ(# Protons and Neutrons)
The mass defects of other elements commonly found in biomolecules (H, N, and O)
differ negligibly from that of carbon and the lower abundance heteroatoms, S and P, exhibit only
slightly larger mass defects. An analysis of the mass defects for the most abundant stable nuclei
of all of the elements shows a maximal mass defect value of approximately -0.1 amu for
Target Discovery Inc. • 4015 Fabian Way • Palo Alto • California • 94303
Tel: (650) 812-8120 • Fax: (650) 812-8130
elements with atomic numbers between 35 (Br) and 63 (Eu). Br and I are the easiest of these to
incorporate into organic tags and Br occurs in a natural 50:50 isotope abundance.
Inverted Mass Ladder Sequencing (IMLS™)
IMLS™ is a method for determining an N- or C-terminal protein sequence tag (PST) of
an intact protein by in-source fragmentation in a simple ESI-TOF mass spectrometer (ABI
Mariner™) in less than 2 min.(10) By incorporating a mass defect element into the tag attached
to the N-terminus of myoglobin, the resulting mass defect peptide fragment ladder can be
algorithmically resolved (blue) from the raw fragmentation spectrum (black). The mass defect
tag (shown in inset) incorporates Br, which allows for peak pairing for further resolution of the
tagged b-ion ladder (red) from the residual chemical noise. The first 6 residues are resolvable
with a sequencing algorithm.(11)
Differential Display
Because the spectral deconvolution algorithm preserves the relative abundances of the
two Br isotopes, it is also possible to use isotopically-pure versions of the mass defect tags for
high precision differential display applications. Myoglobin was labeled with isotopedifferentiated binding energy shift tags (IDBEST™), mixed with a whole cell lysate of E. coli.
The mixture was subjected to trypic digestion, desalted with by ZipTip, and spotted on a
Ciphergen SELDI chip. A mass spectrum of this sample was obtained using an MDS Sciex QStar tandem MS equipped with a Ciphergen ionization head. The IDBEST™ tagged myoglobin
peptides were algorithmically recovered (red) from the raw data (white). The sequence of the
tagged N-terminal tryptic peptide was confirmed from the CID fragmentation spectrum.
Conclusions
Mass defect tags are useful for a wide range of biomolecular mass spectrometry
applications because they differentiate the tagged species from other biological molecules. The
examples shown incorporate just a single mass defect element into the label, but up to 4 mass
defect elements incorporated into a tag can be distinguished algorithmically in the mass
spectrum. This allows other biomolecular applications such as sequencing all four bases of DNA
simultaneously and applications in combinatorial chemistry and high throughput screening. The
spectific inverted mass ladder sequencing (IMLS™) application of mass defect tags provides a
100-fold speed and cost advantage over tandem MS sequencing and can be conducted in a
simple ESI-TOF. Isotope-differentiated binding energy shift tags (IDBEST™) can be used for
peptide differential display applications, potentially providing ICAT-like precision without the
need for an affinity clean up step.
References
1.
Li, J., Tremblay, T.L., Wang, C., Attiya, S., Harrison, D.J. & Thibault, P. Integrated system
for high-throughput protein identification using a microfabricated device coupled to
capillary electrophoresis/nanoelectrospray mass spectrometry. Proteomics 1:975–86
(2001); Belghazi, M. et al. Analysis of protein sequences and protein complexes by matrixassisted laser desorption/ionization mass spectrometry. Proteomics 1:946–54 (2001);
Ablonczy, Z., Kono, M., Crouch, R.K. & Knapp, D.R. Mass spectrometric analysis of
integral membrane proteins at the subnanomolar level: application to recombinant
photopigments. Anal. Chem. 73:4774–9 (2001); Ogorzalek Loo, R.R. et al. Virtual 2-D gel
electrophoresis: visualization and analysis of the E. coli proteome by mass spectrometry.
Anal. Chem. 73:4063–70 (2001).
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
Oberacher, H., Wellenzohn, B. & Huber, C.G. Comparative sequencing of nucleic acids by
liquid chromatography-tandem mass spectrometry. Anal. Chem. 74:211–8 (2001);
Edwards, J.R., Itagaki, Y. & Ju, J. DNA sequencing using biotinylated dideoxynucleotides
and mass spectrometry. Nucleic Acids Res. 29:E104–4 (2001); Walters, J.J. et al.
Genotyping single nucleotide polymorphisms using intact polymerase chain reaction
products by electrospray quadrupole mass spectrometry. Rapid Commun. Mass Spectrom.
15:1752–9 (2001).
Creaser, C.S., Reynolds, J.C. & Harvey, D.J. Structural analysis of oligosaccharides by
atmospheric pressure matrix-assisted laser desorption/ionisation quadrupole ion trap mass
spectrometry. Rapid Commun. Mass. Spectrom. 16:176–84 (2002): Deery, M.J., Stimson,
E. & Chappell, C.G. Size exclusion chromatography/mass spectrometry applied to the
analysis of polysaccharides. Rapid Commun. Mass. Spectrom. 15:2273–83 (2001).
Kim, M.Y., Maier, C.S., Reed, D.J. & Deinzer, M.L. Site-specific amide
hydrogen/deuterium exchange in E. coli thioredoxins measured by electrospray ionization
mass spectrometry. J. Am. Chem. Soc. 123:9860–6 (2001); Elviri, L., Zagnoni, I., Careri,
M., Cavazzini, D. & Rossi, G.L. Non-covalent binding of endogenous ligands to
recombinant cellular retinol-binding proteins studied by mass spectrometric techniques.
Rapid Comm. Mass Spectrom. 15:2186–92 (2001).
Han, D.K., Eng, J., Zhou, H. & Aebersold, R. Quantitative profiling of differentiationinduced microsomal proteins using isotope-coded affinity tags and mass spectrometry. Nat.
Biotechnol. 19:946–51 (2001); Jimenez, C.R. et al. Pattern changes of pituitary peptides in
rat after salt-loading as detected by means of direct, semiquantitative mass spectrometric
profiling. Proc. Natl. Acad. Sci. (USA). 94:9481–6 (1997).
Hall, M.P. et al. Inverted Mass Ladder Sequencing (IMLS™): A method for sequencing
intact proteins by in-source fragmentation using novel "mass defect" labels. Paper
presented at: Amer. Soc. Mass Spectrom. 50th Ann. Mtg. (Orlando, FL, June, 2002).
Bueche, F. Principles of Physics, 3rd edition (McGraw-Hill, New York, 1977).
Weast, R., ed., CRC Handbook of Chemistry and Physics, 60th edition, B236–320 (CRC
Press, Inc., Boca Raton, 1980).
Felinger, A. and Pietrogrande, M.C. Decoding complex multicomponent chromatograms.
Anal. Chem. 73:619A-626A (2001); Besset, D. H. Object-oriented implementation of
numerical methods (Morgan Kaufmann, San Francisco, 2001).
Schneider, L.V., Hall, M. P., and Peterson, J.N. Methods for sequencing proteins. US
Patent #6379971 (Apr. 30, 2002).
Schneider, L.V., Petesch, R., & Hall, M.P. Methods for determining protein and peptide
terminal sequences. PCT Patent Appl. WO 01/49491 (in press).