A Predictive Model for Regulatory Sequences Directing Liver-Specific Transcription

William Krivan; Wyeth W. Wasserman

doi:10.1101/gr.180601

A Predictive Model for Regulatory Sequences Directing Liver-Specific Transcription

William Krivan1 and
Wyeth W. Wasserman2

Bioinformatics Unit, Center for Genomics and Bioinformatics, Karolinska Institutet, 17177 Stockholm, Sweden

Abstract

The identification and interpretation of the regulatory signals within the human genome remain among the greatest goals and most difficult challenges in genome analysis. The ability to predict the temporal and spatial control of transcription is likely to require a combination of methods to address the contribution of sequence-specific signals, protein–protein interactions and chromatin structure. We present here a new procedure to identify clusters of transcription factor binding sites characteristic of sequence modules experimentally verified to direct transcription selectively to liver cells. This algorithm is sufficiently specific to identify known regulatory sequences in genes selectively expressed in liver, promising acceleration of experimental promoter analysis. In combination with phylogenetic footprinting, this improvement in the specificity of predictions is sufficient to motivate a scan of the human genome. Potential regulatory modules were identified in orthologous human and rodent genomic sequences containing both known and uncharacterized genes.

[Supplementary data and the submission of sequences for analysis are available athttp://www.cgb.ki.se/krivan/liver/liver.html.]

Footnotes

↵1 Present address: ZymoGenetics Inc., 1201 Eastlake Avenue East, Seattle, WA 98102.
↵2 Corresponding author.
E-MAIL wyeth.wasserman{at}cgb.ki.se; FAX 46 8 337412.
Article published on-line before print: Genome Res., 10.1101/gr. 180601.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.180601.
- Received January 18, 2001.
- Accepted June 13, 2001.
Cold Spring Harbor Laboratory Press