Abstract
Biochemistry has many examples of linear chain polymers, i.e., molecules formed from a sequence of units from a finite set of possibilities; examples include proteins, RNA, single-stranded DNA, and paired DNA. In the field of mass spectrometry, it is useful to consider the idea of weighted alphabets, with a word inheriting weight from its letters. We describe the distribution of the mass of these words in terms of a simple recurrence relation, the general solution to that relation, and a canonical form that explicitly describes both the exponential form of this distribution and its periodic features, thus explaining a wave pattern that has been observed in protein mass databases. Further, we show that a pure exponential term dominates the distribution and that there is exactly one such purely exponential term. Finally, we illustrate the use of this theorem by describing a formula for the integer mass distribution of peptides and we compare our theoretical results with mass distributions of human and yeast peptides.
Similar content being viewed by others
References
Hubler S.L., Jue A., Keith J., McAlister G.C., Craciun G. et al.: Valence parity renders z(center dot)-type ions chemically distinct. J. Am. Chem. Soc. 130, 6388–6394 (2008)
Scigelova M., Makarov A.: Orbitrap mass analyzer—overview and applications in proteomics. Proteomics 6, 16–21 (2006)
Coon JJ: Collisions or electrons? Protein sequence analysis in the 21st century. Anal. Chem. 81, 3208–3215 (2009)
J.D.M.Y. Tipton, C.L. Hendrickson, A.G. Marshall, Utility of the valence parity rule for ECD/AIECD FT-ICR MS: H-dot atom transfer, isobars, and isomers for peptide analysis. Madison, 7–10 December 2008
S.L. Hubler, Mathematical analysis of mass spectra data, PhD Thesis, University of Wisconsin-Madison, Madison, 2010
V.K. Balakrishnan, Introductory Discrete Mathematics (Dover Publications, New York, 1996), vol. xiv, p. 236
Cormen T.H.: Introduction to Algorithms. The MIT Press, Cambridge (2009)
W.G. Kelley, A.C. Peterson, Difference Equations: An Introduction with Applications (Academic Press, Boston, 1991), vol. xi, p. 455
S.L. Hubler, G. Craciun, Periodic Patterns in Distributions of Peptide Masses (2011)
Bergelson V.: Minimal idempotents and ergodic Ramsey theory. In: Bezuglyi, S., Kolyada, S. (eds) Topics in Dynamics and Ergodic Theory, pp. 8–39. Cambridge University Press, Cambridge (2003)
Hardy G., Littlewood J.: Some problems of diophantine approximation (part I). Acta Mathematica 37, 155–191 (1914)
MATLAB. 7.4 ed. (The Mathworks Inc., Nattick, 2007)
G. Alves, A.Y. Ogurtsov, W.W. Wu, G. Wang, R.F. Shen et al., Calibrating e-values for MS2 database search methods. Biol. Direct 2 (2007)
Kim S., Bandeira N., Pevzner P.A.: Spectral profiles, a novel representation of tandem mass spectra and their applications for de novo peptide sequencing and identification. Mol. Cell. Proteom. 8, 1391–1400 (2009)
Geer L.Y., Markey S.P., Kowalak J.A., Wagner L., Xu M. et al.: Open mass spectrometry search algorithm. J. Proteome Res. 3, 958–964 (2004)
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Hubler, S.L., Craciun, G. Mass distributions of linear chain polymers. J Math Chem 50, 1458–1483 (2012). https://doi.org/10.1007/s10910-012-9983-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10910-012-9983-z