Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes

  1. Brian C. Thomas1,
  2. Brent Pedersen2, and
  3. Michael Freeling3,4
  1. 1 College of Natural Resources, University of California–Berkeley, Berkeley, California 94720, USA;
  2. 2 Department of Environmental Science, Policy & Management, University of California–Berkeley, Berkeley, California 94720, USA;
  3. 3 Department of Plant & Microbial Biology, University of California–Berkeley, Berkeley, California 94720, USA

Abstract

Approximately 90% of Arabidopsis’ unique gene content is found in syntenic blocks that were formed during the most recent whole-genome duplication. Within these blocks, 28.6% of the genes have a retained pair; the remaining genes have been lost from one of the homeologs. We create a minimized genome by condensing local duplications to one gene, removing transposons, and including only genes within blocks defined by retained pairs. We use a moving average of retained and non-retained genes to find clusters of retention and then identify the types of genes that appear in clusters at frequencies above expectations. Significant clusters of retention exist for almost all chromosomal segments. Detailed alignments show that, for 85% of the genome, one homeolog was preferentially (1.6×) targeted for fractionation. This homeolog fractionation bias suggests an epigenetic mechanism. We find that islands of retention contain “connected genes,” those genes predicted—by the gene balance hypothesis—to be resistant to removal because the products they encode interact with other products in a dose-sensitive manner, creating a web of dependency. Gene families that are overrepresented in clusters include those encoding components of the proteasome/protein modification complexes, signal transduction machinery, ribosomes, and transcription factor complexes. Gene pair fractionation following polyploidy or segmental duplication leaves a genome enriched for “connected” genes. These clusters of duplicate genes may help explain the evolutionary origin of coregulated chromosomal regions and new functional modules.

Footnotes

  • 4 Corresponding author.

    4 E-mail freeling{at}nature.berkeley.edu; fax (510) 642-4995.

  • [Supplemental material is available online at www.genome.org.]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4708406

    • Received September 28, 2005.
    • Accepted May 9, 2006.
| Table of Contents

Preprint Server