A Proteomic View on Genome-Based Signal Peptide Predictions

  1. Haike Antelmann1,
  2. Harold Tjalsma2,
  3. Birgit Voigt1,
  4. Steffen Ohlmeier1,
  5. Sierd Bron2,
  6. Jan Maarten van Dijl3,4, and
  7. Michael Hecker1
  1. 1Institut für Mikrobiologie und Molekularbiologie, Ernst-Moritz-Arndt-Universiät Greifswald, D-17487 Greifswald, Germany; 2Department of Genetics, Groningen Biomolecular Sciences and Biotechnology Institute, 9751 NN Haren, The Netherlands; 3Department of Pharmaceutical Biology, University of Groningen, Groningen, The Netherlands

Abstract

The availability of complete genome sequences has allowed the prediction of all exported proteins of the corresponding organisms with dedicated algorithms. Even though numerous studies report on genome-based predictions of signal peptides and cell retention signals, they lack a proteomic verification. For example, 180 secretory and 114 lipoprotein signal peptides were predicted recently for the Gram-positive eubacterium Bacillus subtilis. In the present studies, proteomic approaches were used to define the extracellular complement of the B. subtilis secretome. Using different growth conditions and a hyper-secreting mutant, ∼200 extracellular proteins were visualized by two-dimensional (2D) gel electrophoresis, of which 82 were identified by mass spectrometry. These include 41 proteins that have a potential signal peptide with a type I signal peptidase (SPase) cleavage site, and lack a retention signal. Strikingly, the remaining 41 proteins were predicted previously to be cell associated because of the apparent absence of a signal peptide (22), or the presence of specific cell retention signals in addition to an export signal (19). To test the importance of the five type I SPases and the unique lipoprotein-specific SPase of B. subtilis, the extracellular proteome of (multiple) SPase mutants was analyzed. Surprisingly, only the processing of the polytopic membrane protein YfnI was strongly inhibited in Spase I mutants, showing for the first time that a native eubacterial membrane protein is a genuine Spase I substrate. Furthermore, a mutation affecting lipoprotein modification and processing resulted in the shedding of at least 23 (lipo-)proteins into the medium. In conclusion, our observations show that genome-based predictions reflect the actual composition of the extracellular proteome for ∼50%. Major problems are currently encountered with the prediction of extracellular proteins lacking signal peptides (including cytoplasmic proteins) and lipoproteins.

Footnotes

  • 4 Corresponding author.

  • E-MAIL J.M.van.Dijl{at}farm.rug.nl; FAX 31-50-3633000.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.182801.

    • Received February 1, 2001.
    • Accepted June 14, 2001.
| Table of Contents

Preprint Server