Splice Variation in Mouse Full-Length cDNAs Identified by Mapping to the Mouse Genome

  1. Mihaela Zavolan1,3,
  2. Erik van Nimwegen2, and
  3. Terry Gaasterland1
  1. 1Laboratory for Computational Genomics and 2Center for Studies in Physics and Biology, The Rockefeller University, New York, New York 10021, USA

Abstract

We mapped the collection of The Institute of Physical and Chemical Research (Japan) (RIKEN) 21,076 full-length mouse cDNA clone sequences and the mouse RefSeq sequences to the recently completed draft of the mouse genome. Using this mapping, we identified 3674 mouse genes with multiple transcripts, of which 1098 have splice variants. All but 532 of 21,076 clones (97.5%) mapped to the genome assembly. Alignments of cDNA clone sequences with proteins show that much of the detected splice variation alters coding regions and affects the translated protein. We developed novel analytical techniques to classify observed splice variation and to assess the relation between splice variation and alternative transcription. This analysis indicates that an alternative choice of transcription start or polyadenylation signal frequently induces splice variation.

Footnotes

  • 3 Corresponding author.

  • E-MAIL mihaela{at}genomes.rockefeller.edu; FAX (212) 327-7765.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.191702.

    • Received February 15, 2002.
    • Accepted July 18, 2002.
| Table of Contents

Preprint Server