دورية أكاديمية

eSVD-DE: cohort-wide differential expression in single-cell RNA-seq data using exponential-family embeddings.

التفاصيل البيبلوغرافية
العنوان: eSVD-DE: cohort-wide differential expression in single-cell RNA-seq data using exponential-family embeddings.
المؤلفون: Lin, Kevin Z., Qiu, Yixuan, Roeder, Kathryn
المصدر: BMC Bioinformatics; 3/15/2024, Vol. 25 Issue 1, p1-30, 30p
مصطلحات موضوعية: EXPONENTIAL families (Statistics), BIOLOGICAL systems, MATRIX decomposition, RNA sequencing, FUZZY neural networks, GENES
مستخلص: Background: Single-cell RNA-sequencing (scRNA) datasets are becoming increasingly popular in clinical and cohort studies, but there is a lack of methods to investigate differentially expressed (DE) genes among such datasets with numerous individuals. While numerous methods exist to find DE genes for scRNA data from limited individuals, differential-expression testing for large cohorts of case and control individuals using scRNA data poses unique challenges due to substantial effects of human variation, i.e., individual-level confounding covariates that are difficult to account for in the presence of sparsely-observed genes. Results: We develop the eSVD-DE, a matrix factorization that pools information across genes and removes confounding covariate effects, followed by a novel two-sample test in mean expression between case and control individuals. In general, differential testing after dimension reduction yields an inflation of Type-1 errors. However, we overcome this by testing for differences between the case and control individuals' posterior mean distributions via a hierarchical model. In previously published datasets of various biological systems, eSVD-DE has more accuracy and power compared to other DE methods typically repurposed for analyzing cohort-wide differential expression. Conclusions: eSVD-DE proposes a novel and powerful way to test for DE genes among cohorts after performing a dimension reduction. Accurate identification of differential expression on the individual level, instead of the cell level, is important for linking scRNA-seq studies to our understanding of the human population. [ABSTRACT FROM AUTHOR]
Copyright of BMC Bioinformatics is the property of BioMed Central and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:14712105
DOI:10.1186/s12859-024-05724-7