دورية أكاديمية

CEMIG: prediction of the cis-regulatory motif using the de Bruijn graph from ATAC-seq.

التفاصيل البيبلوغرافية
العنوان: CEMIG: prediction of the cis-regulatory motif using the de Bruijn graph from ATAC-seq.
المؤلفون: Wang, Yizhong, Li, Yang, Wang, Cankun, Lio, Chan-Wang Jerry, Ma, Qin, Liu, Bingqiang
المصدر: Briefings in Bioinformatics; Jan2024, Vol. 25 Issue 1, p1-8, 8p
مصطلحات موضوعية: DE Bruijn graph, DNA, HAMMING distance, TRANSCRIPTION factors, GRAPH theory, CHROMATIN, LINUX operating systems
مستخلص: Sequence motif discovery algorithms enhance the identification of novel deoxyribonucleic acid sequences with pivotal biological significance, especially transcription factor (TF)-binding motifs. The advent of assay for transposase-accessible chromatin using sequencing (ATAC-seq) has broadened the toolkit for motif characterization. Nonetheless, prevailing computational approaches have focused on delineating TF-binding footprints, with motif discovery receiving less attention. Herein, we present Cis rEgulatory Motif Influence using de Bruijn Graph (CEMIG), an algorithm leveraging de Bruijn and Hamming distance graph paradigms to predict and map motif sites. Assessment on 129 ATAC-seq datasets from the Cistrome Data Browser demonstrates CEMIG's exceptional performance, surpassing three established methodologies on four evaluative metrics. CEMIG accurately identifies both cell-type-specific and common TF motifs within GM12878 and K562 cell lines, demonstrating its comparative genomic capabilities in the identification of evolutionary conservation and cell-type specificity. In-depth transcriptional and functional genomic studies have validated the functional relevance of CEMIG-identified motifs across various cell types. CEMIG is available at https://github.com/OSU-BMBL/CEMIGTest , developed in C++ to ensure cross-platform compatibility with Linux, macOS and Windows operating systems. [ABSTRACT FROM AUTHOR]
Copyright of Briefings in Bioinformatics is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:14675463
DOI:10.1093/bib/bbad505