Genome-wide discovery of human heart enhancers

Bibliographic details
Title: Genome-wide discovery of human heart enhancers
Authors: Marcelo A. Nobrega, Noboru J. Sakabe, Leelavati Narlikar, Alexander A. Blanski, Ivan Ovcharenko, Fabio Eiji Arimura, John M. Westlund
Source: Genome Research. 20:381-392
Publication info: Cold Spring Harbor Laboratory, 2010.
Publication year: 2010
Subject terms: Mef2, Amino Acid Motifs, Computational biology, Regulatory Sequences, Nucleic Acid, Biology, Genome, Mice, Pregnancy, Methods, Genetics, Animals, Humans, Enhancer, Transcription factor, Gene, Zebrafish, Genetics (clinical), Mammals, Regulation of gene expression, Base Sequence, Reproducibility of Results, Heart, biology.organism_classification, Female, Classifier (UML), Protein Binding
Description: The various organogenic programs deployed during embryonic development rely on the precise expression of a multitude of genes in time and space. Identifying the cis-regulatory elements responsible for this tightly orchestrated regulation of gene expression is an essential step in understanding the genetic pathways involved in development. We describe a strategy to systematically identify tissue-specific cis-regulatory elements that share combinations of sequence motifs. Using heart development as an experimental framework, we employed a combination of Gibbs sampling and linear regression to build a classifier that identifies heart enhancers based on the presence and/or absence of various sequence features, including known and putative transcription factor (TF) binding specificities. In distinguishing heart enhancers from a large pool of random noncoding sequences, the performance of our classifier is vastly superior to four commonly used methods, with an accuracy reaching 92% in cross-validation. Furthermore, most of the binding specificities learned by our method resemble the specificities of TFs widely recognized as key players in heart development and differentiation, such as SRF, MEF2, ETS1, SMAD, and GATA. Using our classifier as a predictor, a genome-wide scan identified over 40,000 novel human heart enhancers. Although the classifier used no gene expression information, these novel enhancers are strongly associated with genes expressed in the heart. Finally, in vivo tests of our predictions in mouse and zebrafish achieved a validation rate of 62%, significantly higher than what is expected by chance. These results support the existence of underlying cis-regulatory codes dictating tissue-specific transcription in mammalian genomes and validate our enhancer classifier strategy as a method to uncover these regulatory codes.
ISSN: 1088-9051
Open access: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1af2d52160f1ce4d3fdd05d29c968c54
https://doi.org/10.1101/gr.098657.109
Rights: OPEN
Accession number: edsair.doi.dedup.....1af2d52160f1ce4d3fdd05d29c968c54
Database: OpenAIRE
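
Illustrative sketch (not part of the catalog record): the description above mentions a classifier that scores sequences by the presence or absence of motif features using linear regression. The toy Python below shows that general idea only. It is not the authors' implementation: the motif strings, training sequences, learning rate, and 0.5 threshold are all hypothetical, fixed consensus strings stand in for motifs that the paper learns by Gibbs sampling, and plain least squares fitted by gradient descent stands in for whatever regression variant the paper actually uses.

```python
# Toy sketch: every motif, sequence, and hyperparameter here is an
# illustrative assumption, not a value taken from the paper.

# Hypothetical consensus strings standing in for learned binding specificities.
MOTIFS = {
    "GATA-like": "AGATAA",
    "MEF2-like": "CTAAAAATAG",
    "SRF-like": "CCATATATGG",
}


def count_motif(seq, motif):
    """Count exact occurrences of motif in seq, allowing overlaps."""
    k = len(motif)
    return sum(1 for i in range(len(seq) - k + 1) if seq[i:i + k] == motif)


def featurize(seq):
    """Return [bias, count per motif] for one sequence."""
    seq = seq.upper()
    return [1.0] + [float(count_motif(seq, m)) for m in MOTIFS.values()]


def fit_linear(X, y, lr=0.2, epochs=2000):
    """Fit least-squares weights by batch gradient descent."""
    w = [0.0] * len(X[0])
    n = len(X)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = sum(wj * xj for wj, xj in zip(w, xi)) - yi
            for j, xj in enumerate(xi):
                grad[j] += err * xj / n
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def score(w, seq):
    """Linear score of a sequence under weights w."""
    return sum(wj * xj for wj, xj in zip(w, featurize(seq)))


def classify(w, seq, threshold=0.5):
    """Call a sequence enhancer-like if its score reaches the threshold."""
    return score(w, seq) >= threshold


# Made-up training set: label 1 = enhancer-like, label 0 = background.
POSITIVES = [
    "TTAGATAACCGG",
    "GGCTAAAAATAGCC",
    "AACCATATATGGTT",
    "AGATAACTAAAAATAG",
]
NEGATIVES = [
    "GGGGCCCCGGGGCCCC",
    "ACGTACGTACGTACGT",
    "TTTTTTTTGGGGGGGG",
    "CACACACACACACACA",
]

X = [featurize(s) for s in POSITIVES + NEGATIVES]
y = [1.0] * len(POSITIVES) + [0.0] * len(NEGATIVES)
weights = fit_linear(X, y)

if __name__ == "__main__":
    for s in ["CCAGATAAGG", "CCCCCCCCCC"]:
        print(s, round(score(weights, s), 3), classify(weights, s))
```

A genome-wide scan in this spirit would slide a window along noncoding sequence and apply `classify` to each window. A realistic version would need position weight matrices rather than exact string matches, a much larger training set, and cross-validation.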