Analysis of Primate Genomic Variation Reveals a Repeat-Driven Expansion of the Human Genome

  1. Ge Liu1,
  2. NISC Comparative Sequencing Program2,
  3. Shaying Zhao3,
  4. Jeffrey A. Bailey1,
  5. S. Cenk Sahinalp1,
  6. Can Alkan1,
  7. Eray Tuzun1,
  8. Eric D. Green2, and
  9. Evan E. Eichler1,4
  1. 1Department of Genetics, Center for Human Genetics and Center for Computational Genomics, Case Western Reserve University School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106, USA; 2Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, Bethesda, Maryland 20892, USA; 3The Institute for Genome Research, Rockville, Maryland 20850, USA

Abstract

We performed a detailed analysis of both single-nucleotide and large insertion/deletion events based on large-scale comparison of 10.6 Mb of genomic sequence from lemur, baboon, and chimpanzee to human. Using a human genomic reference, optimal global alignments were constructed from large (>50-kb) genomic sequence clones. These alignments were examined for the pattern, frequency, and nature of mutational events. Whereas rates of single-nucleotide substitution remain relatively constant (1–2 × 10−9 substitutions/site/year), rates of retrotransposition vary radically among different primate lineages. These differences have lead to a 15%–20% expansion of human genome size over the last 50 million years of primate evolution, 90% of it due to new retroposon insertions. Orthologous comparisons with the chimpanzee suggest that the human genome continues to significantly expand due to shifts in retrotransposition activity. Assuming that the primate genome sequence we have sampled is representative, we estimate that human euchromatin has expanded 30 Mb and 550 Mb compared to the primate genomes of chimpanzee and lemur, respectively.

[Supplemental material is available online at www.genome.org.]

Footnotes

  • 4 Corresponding author.

  • E-MAIL eee{at}po.cwru.edu, FAX (216) 368-3432.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.923303.

    • Received October 22, 2002.
    • Accepted January 2, 2003.
| Table of Contents

Preprint Server