Patterns of Evolutionary Constraints in Intronic and Intergenic DNA of Drosophila

  1. Daniel L. Halligan1,
  2. Adam Eyre-Walker2,
  3. Peter Andolfatto1,3, and
  4. Peter D. Keightley1,4
  1. 1 University of Edinburgh, School of Biological Sciences, Edinburgh EH9 3JT, UK
  2. 2 University of Sussex, Centre for the Study of Evolution and School of Biological Sciences, Brighton BN1 9QG, UK

Abstract

We develop methods to infer levels of evolutionary constraints in the genome by comparing rates of nucleotide substitution in noncoding DNA with rates predicted from rates of synonymous site evolution in adjacent genes or other putatively neutrally evolving sites, while accounting for differences in base composition. We apply the methods to estimate levels of constraint in noncoding DNA of Drosophila. In introns, constraint (the estimated fraction of mutations that are selectively eliminated) is absolute at the 5′ and 3′ splice junction dinucleotides, and averages 72% in base pairs 3-6 at the 5′-end. Constraint at the 5′ base pairs 3-6 is significantly lower in the lineage leading to Drosophila melanogaster than in Drosophila simulans, a finding that agrees with other features of genome evolution in Drosophila and indicates that the effect of selection on intron function has been weaker in the melanogaster lineage. Elsewhere in intron sequences, the rate of nucleotide substitution is significantly higher than at synonymous sites. By using intronic sites outside splice control regions as a putative neutrally evolving standard, constraint in the 500 bp of intergenic DNA upstream and downstream regions of protein-coding genes averages ∼44%. Although the estimated level of constraint in intergenic regions close to genes is only about one-half of that of amino acid sites, selection against single-nucleotide mutations in intergenic DNA makes a substantial contribution to the mutation load in Drosophila.

Footnotes

  • [Supplemental material is available online at www.genome.org. The sequence data from this study have been submitted to GenBank under accession nos.AY459538-AY459582.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1329204.

  • 3 Present address: Department of Zoology, University of Toronto, Toronto, ON M5S 3G5, Canada.

  • 4 Corresponding author. E-MAIL Peter.Keightley{at}ed.ac.uk; FAX 44 131 650 5443.

    • Accepted December 1, 2003.
    • Received March 11, 2003.
| Table of Contents

Preprint Server