Nucleosome positioning signals in genomic DNA

  1. Heather E. Peckham1,2,
  2. Robert E. Thurman3,
  3. Yutao Fu1,
  4. John A. Stamatoyannopoulos4,
  5. William Stafford Noble4,5,
  6. Kevin Struhl6, and
  7. Zhiping Weng1,2,7
  1. 1 Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA;
  2. 2 Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215, USA;
  3. 3 Division of Medical Genetics, University of Washington, Seattle, Washington 98195, USA;
  4. 4 Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA;
  5. 5 Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA;
  6. 6 Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA

Abstract

Although histones can form nucleosomes on virtually any genomic sequence, DNA sequences show considerable variability in their binding affinity. We have used DNA sequences of Saccharomyces cerevisiae whose nucleosome binding affinities have been experimentally determined (Yuan et al. 2005) to train a support vector machine to identify the nucleosome formation potential of any given sequence of DNA. The DNA sequences whose nucleosome formation potential are most accurately predicted are those that contain strong nucleosome forming or inhibiting signals and are found within nucleosome length stretches of genomic DNA with continuous nucleosome formation or inhibition signals. We have accurately predicted the experimentally determined nucleosome positions across a well-characterized promoter region of S. cerevisiae and identified strong periodicity within 199 center-aligned mononucleosomes studied recently (Segal et al. 2006) despite there being no periodicity information used to train the support vector machine. Our analysis suggests that only a subset of nucleosomes are likely to be positioned by intrinsic sequence signals. This observation is consistent with the available experimental data and is inconsistent with the proposal of a nucleosome positioning code. Finally, we show that intrinsic nucleosome positioning signals are both more inhibitory and more variable in promoter regions than in open reading frames in S. cerevisiae.

Footnotes

| Table of Contents

Preprint Server