دورية أكاديمية

scATAC-seq preprocessing and imputation evaluation system for visualization, clustering and digital footprinting.

التفاصيل البيبلوغرافية
العنوان: scATAC-seq preprocessing and imputation evaluation system for visualization, clustering and digital footprinting.
المؤلفون: Akhtyamov, Pavel, Shaheen, Layal, Raevskiy, Mikhail, Stupnikov, Alexey, Medvedeva, Yulia A
المصدر: Briefings in Bioinformatics; Jan2024, Vol. 25 Issue 1, p1-9, 9p
مصطلحات موضوعية: DATA visualization, TRANSCRIPTION factors, GENETIC transcription regulation, DATABASES, CHROMATIN, LANDSCAPE assessment, FOOTPRINTS
مستخلص: Single-cell ATAC-seq (scATAC-seq) is a recently developed approach that provides means to investigate open chromatin at single cell level, to assess epigenetic regulation and transcription factors binding landscapes. The sparsity of the scATAC-seq data calls for imputation. Similarly, preprocessing (filtering) may be required to reduce computational load due to the large number of open regions. However, optimal strategies for both imputation and preprocessing have not been yet evaluated together. We present SAPIEnS (scATAC-seq Preprocessing and Imputation Evaluation System), a benchmark for scATAC-seq imputation frameworks, a combination of state-of-the-art imputation methods with commonly used preprocessing techniques. We assess different types of scATAC-seq analysis, i.e. clustering, visualization and digital genomic footprinting, and attain optimal preprocessing-imputation strategies. We discuss the benefits of the imputation framework depending on the task and the number of the dataset features (peaks). We conclude that the preprocessing with the Boruta method is beneficial for the majority of tasks, while imputation is helpful mostly for small datasets. We also implement a SAPIEnS database with pre-computed transcription factor footprints based on imputed data with their activity scores in a specific cell type. SAPIEnS is published at: https://github.com/lab-medvedeva/SAPIEnSTest. SAPIEnS database is available at: https://sapiensdb.comTest [ABSTRACT FROM AUTHOR]
Copyright of Briefings in Bioinformatics is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:14675463
DOI:10.1093/bib/bbad447