Computational analysis of genome-wide DNA methylation during the differentiation of human embryonic stem cells along the endodermal lineage
- Lukas Chavez1,5,
- Justyna Jozefczuk1,
- Christina Grimm1,
- Jörn Dietrich1,
- Bernd Timmermann2,
- Hans Lehrach1,
- Ralf Herwig1,4 and
- James Adjaye1,3,4,5
- 1 Department of Vertebrate Genomics, Max-Planck-Institute for Molecular Genetics, D-14195 Berlin, Germany;
- 2 Next Generation Sequencing Group, Max-Planck-Institute for Molecular Genetics, D-14195 Berlin, Germany;
- 3 The Stem Cell Unit, Department of Anatomy, College of Medicine, King Saud University, Riyadh 11461, Saudi Arabia
-
↵4 These authors contributed equally to this work.
Abstract
The generation of genome-wide data derived from methylated DNA immunoprecipitation followed by sequencing (MeDIP-seq) has become a major tool for epigenetic studies in health and disease. The computational analysis of such data, however, still falls short on accuracy, sensitivity, and speed. We propose a time-efficient statistical method that is able to cope with the inherent complexity of MeDIP-seq data with similar performance compared with existing methods. In order to demonstrate the computational approach, we have analyzed alterations in DNA methylation during the differentiation of human embryonic stem cells (hESCs) to definitive endoderm. We show improved correlation of normalized MeDIP-seq data in comparison to available whole-genome bisulfite sequencing data, and investigated the effect of differential methylation on gene expression. Furthermore, we analyzed the interplay between DNA methylation, histone modifications, and transcription factor binding and show that in contrast to de novo methylation, demethylation is mainly associated with regions of low CpG densities.
Footnotes
-
↵5 Corresponding authors.
E-mail chavez{at}molgen.mpg.de.
E-mail adjaye{at}molgen.mpg.de.
-
[Supplemental material is available at http://www.genome.org. The MeDIP-seq data from this study have been submitted to NCBI Sequence Read Archive (http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi) under accession no. SRA012665. The bead array gene expression data from this study have been submitted to the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) under accession no. GSE21715. The MEDIPS software package, manual, data, and example data are available online at http://medips.molgen.mpg.de.]
-
Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.110114.110.
- Received May 7, 2010.
- Accepted July 16, 2010.
- Copyright © 2010 by Cold Spring Harbor Laboratory Press