Department of Biochemistry and Molecular Pharmacology; Program in Bioinformatics and Integrative Biology
Chromatin; Genome, Human; Models, Genetic; Models, Statistical; Transcription, Genetic
Biochemistry, Biophysics, and Structural Biology | Bioinformatics | Computational Biology | Genetics and Genomics
BACKGROUND: Previous work has demonstrated that chromatin feature levels correlate with gene expression. The ENCODE project enables us to further explore this relationship using an unprecedented volume of data. Expression levels from more than 100,000 promoters were measured using a variety of high-throughput techniques applied to RNA extracted by different protocols from different cellular compartments of several human cell lines. ENCODE also generated the genome-wide mapping of eleven histone marks, one histone variant, and DNase I hypersensitivity sites in seven cell lines.
RESULTS: We built a novel quantitative model to study the relationship between chromatin features and expression levels. Our study not only confirms that the general relationships found in previous studies hold across various cell lines, but also makes new suggestions about the relationship between chromatin features and gene expression levels. We found that expression status and expression levels can be predicted by different groups of chromatin features, both with high accuracy. We also found that expression levels measured by CAGE are better predicted than by RNA-PET or RNA-Seq, and different categories of chromatin features are the most predictive of expression for different RNA measurement methods. Additionally, PolyA+ RNA is overall more predictable than PolyA- RNA among different cell compartments, and PolyA+ cytosolic RNA measured with RNA-Seq is more predictable than PolyA+ nuclear RNA, while the opposite is true for PolyA- RNA.
CONCLUSIONS: Our study provides new insights into transcriptional regulation by analyzing chromatin features in different cellular contexts.
Rights and Permissions
© 2012 Dong et al.; licensee BioMed Central Ltd.
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited
DOI of Published Version
Genome Biol. 2012 Jun 13;13(9):R53. doi: 10.1186/gb-2012-13-9-r53. Link to article on publisher's site
Dong, Xianjun; Greven, Melissa C.; Kundaje, Anshul; Djebali, Sarah; Brown, James B.; Cheng, Chao; Gingeras, Thomas R.; Gerstein, Mark B.; Guigo, Roderic; Birney, Ewan; and Weng, Zhiping, "Modeling gene expression using chromatin features in various cellular contexts" (2012). Program in Bioinformatics and Integrative Biology Publications and Presentations. 12.