Computational definition of sequence motifs governing constitutive exon splicing

  1. Xiang H-F. Zhang and
  2. Lawrence A. Chasin1
  1. Department of Biological Sciences, MC2433, Columbia University, New York, New York 10027, USA

Abstract

We have searched for sequence motifs that contribute to the recognition of human pre-mRNA splice sites by comparing the frequency of 8-mers in internal noncoding exons versus unspliced pseudo exons and 5′ untranslated regions (5′ untranslated regions [UTRs]) of transcripts of intronless genes. This type of comparison avoids the isolation of sequences that are distinguished by their protein-coding information. We classified sequence families comprising 2069 putative exonic enhancers and 974 putative exonic silencers. Representatives of each class functioned as enhancers or silencers when inserted into a test exon and assayed in transfected mammalian cells. As a class, the enhancer sequencers were more prevalent and the silencer elements less prevalent in all exons compared with introns. A survey of 58 reported exonic splicing mutations showed good agreement between the splicing phenotype and the effect of the mutation on the motifs defined here. The large number of effective sequences implied by these results suggests that sequences that influence splicing may be very abundant in pre-mRNA.

Keywords

Footnotes

  • Article published online ahead of print. Article and publication date are at http://www.genesdev.org/cgi/doi/10.1101/gad.1195304.

  • Supplemental material is available at http://www.genesdev.org.

  • 1 Corresponding author. E-MAIL lac2{at}columbia.edu; FAX (212) 865-8246.

    • Accepted April 9, 2004.
    • Received February 17, 2004.
| Table of Contents

Life Science Alliance