Sequence characteristics of functional siRNAs

  1. BERND JAGLA1,2,
  2. NATHALIE AULNER1,2,
  3. PETER D. KELLY1,
  4. DA SONG1,
  5. ALLEN VOLCHUK1,3,
  6. ANDRZEJ ZATORSKI1,4,
  7. DAVID SHUM1,
  8. THOMAS MAYER1,2,
  9. DINO A. DE ANGELIS1,
  10. OUATHEK OUERFELLI1,
  11. URS RUTISHAUSER1, and
  12. JAMES E. ROTHMAN1,2
  1. 1Functional Proteomics Project, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA

Abstract

RNA interference in mammalian cells is actively used to conduct genetic screens, to identify and to validate targets, and to elucidate regulators and modifiers of cellular pathways. To ensure the specificity and efficacy of the active 21mer siRNA molecules, it is pertinent to develop a strategy for their rational design. Here we show that most functional siRNAs have characteristic sequence features. We tested 601 siRNAs targeting one exogenous and three endogenous genes. The efficacy of the siRNAs was determined at the protein level. Using a decision tree algorithm in combination with information analysis, our analyses revealed four sets of rules with a mean knockdown efficacy ranging from 60% to 73%. (To distinguish between percentages used to describe the quality of an siRNA and the percentages used to describe parts of data sets we underlined the former throughout this paper.) The best rule comprises an A/U at positions 10 and 19, a G/C at position 1, and more than three A/Us between positions 13 and 19, in the sense strand of the siRNA sequence. Using these rules, there is a 99.9% chance of designing an effective siRNA in a set of three with more than 50% knockdown efficiency in a biological readout.

Keywords

Footnotes

  • 5 We counted 2688 different sequences with at least two A/U in a string of length 6 (region 13–18). At three positions only 2 nt are allowed (1, 10, 19). The remaining 10 positions allow for all four possible nucleotides. Then 2688 × 23 × 410 is the theoretical number of sequences having this motif.

  • 2 Present addresses: Columbia University, 1150 St. Nicolas Avenue, Russ Berrie Medical Pavilion, New York, NY, 10032, USA;

  • 3 Toronto General Research Institute, 200 Elizabeth Street, MBRC 4R402, Toronto, Ontario, M5G 2C4, Canada;

  • 4 Department of Chemistry, Hunter College, City University of New York, 695 Park Avenue, Room 1410 North, New York, NY 10021, USA.

  • Article and publication are at http://www.rnajournal.org/cgi/doi/10.1261/rna.7275905.

    • Accepted March 2, 2005.
    • Received December 17, 2004.
| Table of Contents