Genomes in Flux: The Evolution of Archaeal and Proteobacterial Gene Content

Berend Snel¹,³, Peer Bork¹,², and Martijn A. Huynen¹,²

¹European Molecular Biology Laboratory, 69117 Heidelberg, Germany; ²Max-Delbrück-Centrum for Molecular Medicine, 13122 Berlin-Buch, Germany

Abstract

In the course of evolution, genomes are shaped by processes like gene loss, gene duplication, horizontal gene transfer, and gene genesis (the de novo origin of genes). Here we reconstruct the gene content of ancestral Archaea and Proteobacteria and quantify the processes connecting them to their present-day representatives based on the distribution of genes in completely sequenced genomes. We estimate that the ancestor of the Proteobacteria contained around 2500 genes, and the ancestor of the Archaea around 2050 genes. Although it is necessary to invoke horizontal gene transfer to explain the content of present-day genomes, gene loss, gene genesis, and simple vertical inheritance are quantitatively the most dominant processes in shaping the genome. Together they result in a turnover of gene content such that even the lineage leading from the ancestor of the Proteobacteria to the relatively large genome of Escherichia coli has lost at least 950 genes. Gene loss, unlike the other processes, correlates fairly well with time. This clock-like behavior suggests that gene loss is under negative selection, while the processes that add genes are under positive selection.

Footnotes

  • 3 Corresponding author.

  • E-MAIL snel{at}EMBL-heidelberg.de; FAX 49 6221 387 517.

  • Article published on-line before print in December 2001: Genome Res., 10.1101/gr.176501.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.176501.

    • Received December 18, 2000.
    • Accepted October 16, 2001.