Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation

  1. Nitin Gupta1,7,8,
  2. Stephen Tanner1,7,
  3. Navdeep Jaitly2,
  4. Joshua N. Adkins2,
  5. Mary Lipton2,
  6. Robert Edwards3,4,5,
  7. Margaret Romine2,
  8. Andrei Osterman3,4,
  9. Vineet Bafna1,6,
  10. Richard D. Smith2, and
  11. Pavel A. Pevzner1,6
  1. 1 Bioinformatics Program, University of California San Diego, La Jolla, California 92093, USA;
  2. 2 Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA;
  3. 3 Burnham Institute for Medical Research, La Jolla, California 92037, USA;
  4. 4 Fellowship for Interpretation of Genomes, Burr Ridge, Illinois 60527, USA;
  5. 5 San Diego State University, San Diego, California 92182, USA;
  6. 6 Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, USA
  1. 7 These authors contributed equally to this work.

Abstract

While bacterial genome annotations have significantly improved in recent years, techniques for bacterial proteome annotation (including post-translational chemical modifications, signal peptides, proteolytic events, etc.) are still in their infancy. At the same time, the number of sequenced bacterial genomes is rising sharply, far outpacing our ability to validate the predicted genes, let alone annotate bacterial proteomes. In this study, we use tandem mass spectrometry (MS/MS) to annotate the proteome of Shewanella oneidensis MR-1, an important microbe for bioremediation. In particular, we provide the first comprehensive map of post-translational modifications in a bacterial genome, including a large number of chemical modifications, signal peptide cleavages, and cleavages of N-terminal methionine residues. We also detect multiple genes that were missed or assigned incorrect start positions by gene prediction programs, and suggest corrections to improve the gene annotation. This study demonstrates that complementing every genome sequencing project by an MS/MS project would significantly improve both genome and proteome annotations for a reasonable cost.

Footnotes

  • 8 Corresponding author.

    8 E-mail ngupta{at}ucsd.edu; fax (858) 534-7029.

  • [Supplemental material is available online at www.genome.org.]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6427907

    • Received February 22, 2007.
    • Accepted June 12, 2007.
| Table of Contents

Preprint Server