Genomic Gene Clustering Analysis of Pathways in Eukaryotes

  1. Jennifer M. Lee and
  2. Erik L.L. Sonnhammer1
  1. Center for Genomics and Bioinformatics, Karolinska Institutet, S171 77 Stockholm, Sweden

Abstract

Genomic clustering of genes in a pathway is commonly found in prokaryotes due to transcriptional operons, but these are not present in most eukaryotes. Yet, there might be clustering to a lesser extent of pathway members in eukaryotic genomes, that assist coregulation of a set of functionally cooperating genes. We analyzed five sequenced eukaryotic genomes for clustering of genes assigned to the same pathway in the KEGG database. Between 98% and 30% of the analyzed pathways in a genome were found to exhibit significantly higher clustering levels than expected by chance. In descending order by the level of clustering, the genomes studied were Saccharomyces cerevisiae,Homo sapiens, Caenorhabditis elegans,Arabidopsis thaliana, and Drosophila melanogaster. Surprisingly, there is not much agreement between genomes in terms of which pathways are most clustered. Only seven of 69 pathways found in all species were significantly clustered in all five of them. This species-specific pattern of pathway clustering may reflect adaptations or evolutionary events unique to a particular lineage. We note that although operons are common in C. elegans, only 58% of the pathways showed significant clustering, which is less than in human. Virtually all pathways in S. cerevisiae showed significant clustering.

Footnotes

  • 1 Corresponding author.

  • E-MAIL erik.sonnhammer{at}cgb.ki.se; FAX +46 8 337983.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.737703. Article published online before print in April 2003.

    • Received August 28, 2002.
    • Accepted January 28, 2003.
| Table of Contents

Preprint Server