دورية أكاديمية

DeepAutoGlioma: a deep learning autoencoder-based multi-omics data integration and classification tools for glioma subtyping

التفاصيل البيبلوغرافية
العنوان: DeepAutoGlioma: a deep learning autoencoder-based multi-omics data integration and classification tools for glioma subtyping
المؤلفون: Sana Munquad, Asim Bikas Das
المصدر: BioData Mining, Vol 16, Iss 1, Pp 1-18 (2023)
بيانات النشر: BMC, 2023.
سنة النشر: 2023
المجموعة: LCC:Computer applications to medicine. Medical informatics
LCC:Analysis
مصطلحات موضوعية: Multi-omics, Autoencoder, Lower-grade glioma (LGG), Glioblastoma multiforme (GBM), Convolutional neural network (CNN), Computer applications to medicine. Medical informatics, R858-859.7, Analysis, QA299.6-433
الوصف: Abstract Background and objective The classification of glioma subtypes is essential for precision therapy. Due to the heterogeneity of gliomas, the subtype-specific molecular pattern can be captured by integrating and analyzing high-throughput omics data from different genomic layers. The development of a deep-learning framework enables the integration of multi-omics data to classify the glioma subtypes to support the clinical diagnosis. Results Transcriptome and methylome data of glioma patients were preprocessed, and differentially expressed features from both datasets were identified. Subsequently, a Cox regression analysis determined genes and CpGs associated with survival. Gene set enrichment analysis was carried out to examine the biological significance of the features. Further, we identified CpG and gene pairs by mapping them in the promoter region of corresponding genes. The methylation and gene expression levels of these CpGs and genes were embedded in a lower-dimensional space with an autoencoder. Next, ANN and CNN were used to classify subtypes using the latent features from embedding space. CNN performs better than ANN for subtyping lower-grade gliomas (LGG) and glioblastoma multiforme (GBM). The subtyping accuracy of CNN was 98.03% (± 0.06) and 94.07% (± 0.01) in LGG and GBM, respectively. The precision of the models was 97.67% in LGG and 90.40% in GBM. The model sensitivity was 96.96% in LGG and 91.18% in GBM. Additionally, we observed the superior performance of CNN with external datasets. The genes and CpGs pairs used to develop the model showed better performance than the random CpGs-gene pairs, preprocessed data, and single omics data. Conclusions The current study showed that a novel feature selection and data integration strategy led to the development of DeepAutoGlioma, an effective framework for diagnosing glioma subtypes.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1756-0381
العلاقة: https://doaj.org/toc/1756-0381Test
DOI: 10.1186/s13040-023-00349-7
الوصول الحر: https://doaj.org/article/1058e325ebba4e8bb3647977518a700eTest
رقم الانضمام: edsdoj.1058e325ebba4e8bb3647977518a700e
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:17560381
DOI:10.1186/s13040-023-00349-7