WACS: improving ChIP-seq peak calling by optimally weighting controls

التفاصيل البيبلوغرافية
العنوان: WACS: improving ChIP-seq peak calling by optimally weighting controls
المؤلفون: Marcel Turcotte, Aseel Awdeh, Theodore J. Perkins
المصدر: BMC Bioinformatics, Vol 22, Iss 1, Pp 1-21 (2021)
BMC Bioinformatics
بيانات النشر: BMC, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Chromatin Immunoprecipitation, Computer science, Reliability (computer networking), genetic processes, lcsh:Computer applications to medicine. Medical informatics, Biochemistry, Genome, Signal, DNA sequencing, chemistry.chemical_compound, 03 medical and health sciences, 0302 clinical medicine, Bias, Structural Biology, natural sciences, Molecular Biology, lcsh:QH301-705.5, 030304 developmental biology, Controls, 0303 health sciences, Reproducibility, biology, Noise (signal processing), business.industry, Applied Mathematics, High-Throughput Nucleotide Sequencing, Reproducibility of Results, Contrast (statistics), Pattern recognition, Sequence Analysis, DNA, Chip, Computer Science Applications, Weighting, ChIP-seq, Histone, chemistry, lcsh:Biology (General), biology.protein, Chromatin Immunoprecipitation Sequencing, lcsh:R858-859.7, Artificial intelligence, business, Chromatin immunoprecipitation, Peak calling, Algorithms, DNA, 030217 neurology & neurosurgery, Research Article
الوصف: Background Chromatin immunoprecipitation followed by high throughput sequencing (ChIP-seq), initially introduced more than a decade ago, is widely used by the scientific community to detect protein/DNA binding and histone modifications across the genome. Every experiment is prone to noise and bias, and ChIP-seq experiments are no exception. To alleviate bias, the incorporation of control datasets in ChIP-seq analysis is an essential step. The controls are used to account for the background signal, while the remainder of the ChIP-seq signal captures true binding or histone modification. However, a recurrent issue is different types of bias in different ChIP-seq experiments. Depending on which controls are used, different aspects of ChIP-seq bias are better or worse accounted for, and peak calling can produce different results for the same ChIP-seq experiment. Consequently, generating “smart” controls, which model the non-signal effect for a specific ChIP-seq experiment, could enhance contrast and increase the reliability and reproducibility of the results. Result We propose a peak calling algorithm, Weighted Analysis of ChIP-seq (WACS), which is an extension of the well-known peak caller MACS2. There are two main steps in WACS: First, weights are estimated for each control using non-negative least squares regression. The goal is to customize controls to model the noise distribution for each ChIP-seq experiment. This is then followed by peak calling. We demonstrate that WACS significantly outperforms MACS2 and AIControl, another recent algorithm for generating smart controls, in the detection of enriched regions along the genome, in terms of motif enrichment and reproducibility analyses. Conclusions This ultimately improves our understanding of ChIP-seq controls and their biases, and shows that WACS results in a better approximation of the noise distribution in controls.
اللغة: English
تدمد: 1471-2105
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2708fcb3af349f5ded151af2aa0bd2a0Test
https://doaj.org/article/3d0f3ca77ba84cfa914356385e9ad0e2Test
حقوق: OPEN
رقم الانضمام: edsair.doi.dedup.....2708fcb3af349f5ded151af2aa0bd2a0
قاعدة البيانات: OpenAIRE