Ψ-Φ: Exploring the outer limits of bacterial pseudogenes

  1. Emmanuelle Lerat1,3 and
  2. Howard Ochman1,2
  1. 1 Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 87521, USA
  2. 2 Department of Biochemistry and Molecular Biophysics, University of Arizona, Tucson, Arizona 87521, USA

Abstract

Because bacterial chromosomes are tightly packed with genes and were traditionally viewed as being optimized for size and replication speed, it was not surprising that the early annotations of sequenced bacterial genomes reported few, if any, pseudogenes. But because pseudogenes are generally recognized by comparisons with their functional counterparts, as more genome sequences accumulated, many bacterial pathogens were found to harbor large numbers of truncated, inactivated, and degraded genes. Because the mutational events that inactivate genes occur continuously in all genomes, we investigated whether the rarity of pseudogenes in some bacteria was attributable to properties inherent to the organism or to the failure to recognize pseudogenes. By developing a program suite (called Ψ-Φ, for Ψ-gene Finder) that applies a comparative method to identify pseudogenes (attributable both to misannotation and to nonrecognition), we analyzed the pseudogene inventories in the sequenced members of the Escherichia coli/Shigella clade. This approach recovered hundreds of previously unrecognized pseudogenes and showed that pseudogenes are a regular feature of bacterial genomes, even in those whose original annotations registered no truncated or otherwise inactivated genes. In Shigella flexneri 2a, large proportions of pseudogenes are generated by nonsense mutations and IS element insertions, events that seldom produce the pseudogenes present in the other genomes examined. Almost all (>95%) pseudogenes are restricted to only one of the genomes and are of relatively recent origin, suggesting that these bacteria possess active mechanisms to eliminate nonfunctional genes.

Footnotes

  • [Supplemental material is available online at www.genome.org.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2925604. Article published online before print in October 2004.

  • 3 Corresponding author. E-mail lerat{at}email.arizona.edu; fax (520) 621-3709.

    • Accepted August 9, 2004.
    • Received June 22, 2004.
| Table of Contents

Preprint Server