IMLS™- A Method for Sequencing Intact Proteins by In-Source Fragmentation Using Novel "Mass Defect" Labels M.P. Hall, S. Ashrafi, R. Petesch, and L.V. Schneider Abstract We present a new class of "mass defect" tags with broad utility in biomolecular mass spectrometry. Elements with atomic numbers between 17 (Cl) and 77 (Ir) have a substantially different nuclear binding energy (mass defect) from the elements common to biomolecules. This mass defect yields a readily resolvable mass difference between tagged and untagged species in high-resolution mass spectrometers. Examples of the use of these tags in inverted mass ladder sequencing (IMLS™) and as isotope differentiated binding energy shift tags (IDBEST™) for differential peptide display on protein chips are presented. IMLS™ has substantial speed and cost advantages over traditional tandem MS sequencing methods. IDBEST™ may yield the precision advantages of isotope-coded affinity tags (ICAT), without the need for affinity cleanup of the peptides. Mass defect tags also have potential for sequencing other biopolymers (e.g., DNA, oligosaccharides, and lipids) and in combinatorial chemistry and high throughput screening applications. Introduction Applications of biomolecular mass spectrometry can be drawn from the areas of identification and sequencing of proteins,(1) polynucleic acids,(2) and polysaccharides.(3) Mass spectrometry has been successfully applied to probing biomolecular structure/function relationships such as protein-ligand and protein-protein interactions.(4) The ability to resolve stable isotopes has also made mass spectrometry useful for differential display applications.(5) However, chemical noise in mass spectra arising from matrix impurities, fragmentation products, or unidentified constituents can compromise spectral analysis. Incorporation of one or more elements having atomic numbers between 17 (Cl) and 77 (Ir), and more effectively between 35 (Br) and 63 (Eu), into the biomolecules of interest produces a discernible mass shift of these tagged biomolecules in the mass spectrum away from the rest of the chemical noise, improving the ability to analyze the spectrum.(6) The Mass Defect The mass defect is related to the nuclear binding energy released upon formation and stabilization of the nucleus of a given isotope.(7) By convention, the mass defect of C-12 is defined as zero atomic mass units (amu), and the mass defect of any other stable elemental isotope is calculated as: (8) Mass Defect (amu) = Monoisotopic Mass - Σ(# Protons and Neutrons) The mass defects of other elements commonly found in biomolecules (H, N, and O) differ negligibly from that of carbon and the lower abundance heteroatoms, S and P, exhibit only slightly larger mass defects. An analysis of the mass defects for the most abundant stable nuclei of all of the elements shows a maximal mass defect value of approximately -0.1 amu for Target Discovery Inc. • 4015 Fabian Way • Palo Alto • California • 94303 Tel: (650) 812-8120 • Fax: (650) 812-8130 elements with atomic numbers between 35 (Br) and 63 (Eu). Br and I are the easiest of these to incorporate into organic tags and Br occurs in a natural 50:50 isotope abundance. Inverted Mass Ladder Sequencing (IMLS™) IMLS™ is a method for determining an N- or C-terminal protein sequence tag (PST) of an intact protein by in-source fragmentation in a simple ESI-TOF mass spectrometer (ABI Mariner™) in less than 2 min.(10) By incorporating a mass defect element into the tag attached to the N-terminus of myoglobin, the resulting mass defect peptide fragment ladder can be algorithmically resolved (blue) from the raw fragmentation spectrum (black). The mass defect tag (shown in inset) incorporates Br, which allows for peak pairing for further resolution of the tagged b-ion ladder (red) from the residual chemical noise. The first 6 residues are resolvable with a sequencing algorithm.(11) Differential Display Because the spectral deconvolution algorithm preserves the relative abundances of the two Br isotopes, it is also possible to use isotopically-pure versions of the mass defect tags for high precision differential display applications. Myoglobin was labeled with isotopedifferentiated binding energy shift tags (IDBEST™), mixed with a whole cell lysate of E. coli. The mixture was subjected to trypic digestion, desalted with by ZipTip, and spotted on a Ciphergen SELDI chip. A mass spectrum of this sample was obtained using an MDS Sciex QStar tandem MS equipped with a Ciphergen ionization head. The IDBEST™ tagged myoglobin peptides were algorithmically recovered (red) from the raw data (white). The sequence of the tagged N-terminal tryptic peptide was confirmed from the CID fragmentation spectrum. Conclusions Mass defect tags are useful for a wide range of biomolecular mass spectrometry applications because they differentiate the tagged species from other biological molecules. The examples shown incorporate just a single mass defect element into the label, but up to 4 mass defect elements incorporated into a tag can be distinguished algorithmically in the mass spectrum. This allows other biomolecular applications such as sequencing all four bases of DNA simultaneously and applications in combinatorial chemistry and high throughput screening. The spectific inverted mass ladder sequencing (IMLS™) application of mass defect tags provides a 100-fold speed and cost advantage over tandem MS sequencing and can be conducted in a simple ESI-TOF. Isotope-differentiated binding energy shift tags (IDBEST™) can be used for peptide differential display applications, potentially providing ICAT-like precision without the need for an affinity clean up step. References 1. Li, J., Tremblay, T.L., Wang, C., Attiya, S., Harrison, D.J. & Thibault, P. Integrated system for high-throughput protein identification using a microfabricated device coupled to capillary electrophoresis/nanoelectrospray mass spectrometry. Proteomics 1:975–86 (2001); Belghazi, M. et al. Analysis of protein sequences and protein complexes by matrixassisted laser desorption/ionization mass spectrometry. Proteomics 1:946–54 (2001); Ablonczy, Z., Kono, M., Crouch, R.K. & Knapp, D.R. Mass spectrometric analysis of integral membrane proteins at the subnanomolar level: application to recombinant photopigments. Anal. Chem. 73:4774–9 (2001); Ogorzalek Loo, R.R. et al. Virtual 2-D gel electrophoresis: visualization and analysis of the E. coli proteome by mass spectrometry. Anal. Chem. 73:4063–70 (2001). 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. Oberacher, H., Wellenzohn, B. & Huber, C.G. Comparative sequencing of nucleic acids by liquid chromatography-tandem mass spectrometry. Anal. Chem. 74:211–8 (2001); Edwards, J.R., Itagaki, Y. & Ju, J. DNA sequencing using biotinylated dideoxynucleotides and mass spectrometry. Nucleic Acids Res. 29:E104–4 (2001); Walters, J.J. et al. Genotyping single nucleotide polymorphisms using intact polymerase chain reaction products by electrospray quadrupole mass spectrometry. Rapid Commun. Mass Spectrom. 15:1752–9 (2001). Creaser, C.S., Reynolds, J.C. & Harvey, D.J. Structural analysis of oligosaccharides by atmospheric pressure matrix-assisted laser desorption/ionisation quadrupole ion trap mass spectrometry. Rapid Commun. Mass. Spectrom. 16:176–84 (2002): Deery, M.J., Stimson, E. & Chappell, C.G. Size exclusion chromatography/mass spectrometry applied to the analysis of polysaccharides. Rapid Commun. Mass. Spectrom. 15:2273–83 (2001). Kim, M.Y., Maier, C.S., Reed, D.J. & Deinzer, M.L. Site-specific amide hydrogen/deuterium exchange in E. coli thioredoxins measured by electrospray ionization mass spectrometry. J. Am. Chem. Soc. 123:9860–6 (2001); Elviri, L., Zagnoni, I., Careri, M., Cavazzini, D. & Rossi, G.L. Non-covalent binding of endogenous ligands to recombinant cellular retinol-binding proteins studied by mass spectrometric techniques. Rapid Comm. Mass Spectrom. 15:2186–92 (2001). Han, D.K., Eng, J., Zhou, H. & Aebersold, R. Quantitative profiling of differentiationinduced microsomal proteins using isotope-coded affinity tags and mass spectrometry. Nat. Biotechnol. 19:946–51 (2001); Jimenez, C.R. et al. Pattern changes of pituitary peptides in rat after salt-loading as detected by means of direct, semiquantitative mass spectrometric profiling. Proc. Natl. Acad. Sci. (USA). 94:9481–6 (1997). Hall, M.P. et al. Inverted Mass Ladder Sequencing (IMLS™): A method for sequencing intact proteins by in-source fragmentation using novel "mass defect" labels. Paper presented at: Amer. Soc. Mass Spectrom. 50th Ann. Mtg. (Orlando, FL, June, 2002). Bueche, F. Principles of Physics, 3rd edition (McGraw-Hill, New York, 1977). Weast, R., ed., CRC Handbook of Chemistry and Physics, 60th edition, B236–320 (CRC Press, Inc., Boca Raton, 1980). Felinger, A. and Pietrogrande, M.C. Decoding complex multicomponent chromatograms. Anal. Chem. 73:619A-626A (2001); Besset, D. H. Object-oriented implementation of numerical methods (Morgan Kaufmann, San Francisco, 2001). Schneider, L.V., Hall, M. P., and Peterson, J.N. Methods for sequencing proteins. US Patent #6379971 (Apr. 30, 2002). Schneider, L.V., Petesch, R., & Hall, M.P. Methods for determining protein and peptide terminal sequences. PCT Patent Appl. WO 01/49491 (in press).
© Copyright 2026 Paperzz