Assessment of SAGE in Transcript Identification

  1. Erin D. Pleasance,
  2. Marco A. Marra, and
  3. Steven J.M. Jones1
  1. Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver V5Z 4E6, Canada

Abstract

An essential step in Serial Analysis of Gene Expression (SAGE) is tag mapping, which refers to the unambiguous determination of the gene represented by a SAGE tag. Current resources for tag mapping are incomplete, and thus do not allow assessment of the efficacy of SAGE in transcript identification. A method of tag mapping is described here and applied to the Drosophila melanogaster and Caenorhabditis elegans genomes, which permits detailed SAGE assessment and provides tag-mapping resources that were unavailable previously for these organisms. In our method, a conceptual transcriptome is constructed using genomic sequence and annotation by extending predicted coding regions to include UTRs on the basis of EST and cDNA alignments, UTR length distributions, and polyadenylation signals. Analysis of extracted tags suggests that, using the standard SAGE procedure, expression of 8% of D. melanogaster and 15% of C. elegans genes cannot be detected unambiguously by SAGE due to shared sequence or lack of NlaIII-anchoring enzyme sites. Both increasing tag length by 2–3 bp and using Sau3A instead of NlaIII as the anchoring enzyme increases potential for transcript detection. This work identifies and quantifies genes not amenable to SAGE analysis, in addition to providing tag-to-gene mappings for two model organisms.

Footnotes

  • [The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: S. Gorski, G. Vatcher, and S. Zuyderduyn.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.873003. Article published online before print in May 2003.

  • 1 Corresponding author. E-MAIL sjones{at}bcgsc.bc.ca; FAX (604) 877-6085.

    • Accepted March 17, 2003.
    • Received October 7, 2002.
| Table of Contents

Preprint Server