RazerS—fast read mapping with sensitivity control

  David Weese (1,3), Anne-Katrin Emde (1), Tobias Rausch (2), Andreas Döring (1) and Knut Reinert (1)

  1 Department of Computer Science, Free University of Berlin, 14195 Berlin, Germany
  2 International Max Planck Research School for Computational Biology and Scientific Computing, 14195 Berlin, Germany

    Abstract

    Second-generation sequencing technologies deliver DNA sequence data at unprecedentedly high throughput. Common to most biological applications is the mapping of reads to an almost identical or highly similar reference genome. Because of the large amounts of data, efficient algorithms and implementations are crucial for this task. We present an efficient read mapping tool called RazerS. It allows the user to align sequencing reads of arbitrary length using either the Hamming distance or the edit distance. Our tool can operate either losslessly or, at higher speed, with a user-defined loss rate. Given the loss rate, we present an approach that guarantees not to lose more reads than specified. This enables the user to adapt the tool to the problem at hand and provides a seamless tradeoff between sensitivity and running time.
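
To make the two distance measures named in the abstract concrete, the following is a minimal Python sketch of their textbook definitions. It is not taken from RazerS and says nothing about how the tool computes or filters matches; the function names `hamming_distance` and `edit_distance` and the example sequences are illustrative assumptions.

```python
def hamming_distance(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires strings of equal length")
    return sum(x != y for x, y in zip(a, b))


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of substitutions, insertions
    and deletions needed to turn a into b (standard dynamic programming,
    keeping only the previous row of the table)."""
    prev = list(range(len(b) + 1))  # distance from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x
                curr[j - 1] + 1,         # insert y
                prev[j - 1] + (x != y),  # match or substitute
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    read = "ACGTACGT"        # hypothetical read
    shifted = "CGTACGTA"     # same read shifted by one base
    print(hamming_distance(read, shifted))
    print(edit_distance(read, shifted))
```

The Hamming distance only counts mismatches, so a single-base shift makes every position of the example disagree, whereas the edit distance also allows insertions and deletions and explains the same pair with one deletion and one insertion.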
