دورية أكاديمية

Application of unsupervised deep learning algorithms for identification of specific clusters of chronic cough patients from EMR data

التفاصيل البيبلوغرافية
العنوان: Application of unsupervised deep learning algorithms for identification of specific clusters of chronic cough patients from EMR data
المؤلفون: Shao, Wei, Luo, Xiao, Zhang, Zuoyi, Han, Zhi, Chandrasekaran, Vasu, Turzhitsky, Vladimir, Bali, Vishal, Roberts, Anna R., Metzger, Megan, Baker, Jarod, La Rosa, Carmen, Weaver, Jessica, Dexter, Paul, Huang, Kun
المساهمون: Biostatistics and Health Data Science, School of Medicine
المصدر: PMC
بيانات النشر: BMC
سنة النشر: 2022
المجموعة: Indiana University - Purdue University Indianapolis: IUPUI Scholar Works
مصطلحات موضوعية: Chronic cough, Deep clustering, EMR data, Unsupervised learning
الوصف: Background: Chronic cough affects approximately 10% of adults. The lack of ICD codes for chronic cough makes it challenging to apply supervised learning methods to predict the characteristics of chronic cough patients, thereby requiring the identification of chronic cough patients by other mechanisms. We developed a deep clustering algorithm with auto-encoder embedding (DCAE) to identify clusters of chronic cough patients based on data from a large cohort of 264,146 patients from the Electronic Medical Records (EMR) system. We constructed features using the diagnosis within the EMR, then built a clustering-oriented loss function directly on embedded features of the deep autoencoder to jointly perform feature refinement and cluster assignment. Lastly, we performed statistical analysis on the identified clusters to characterize the chronic cough patients compared to the non-chronic cough patients. Results: The experimental results show that the DCAE model generated three chronic cough clusters and one non-chronic cough patient cluster. We found various diagnoses, medications, and lab tests highly associated with chronic cough patients by comparing the chronic cough cluster with the non-chronic cough cluster. Comparison of chronic cough clusters demonstrated that certain combinations of medications and diagnoses characterize some chronic cough clusters. Conclusions: To the best of our knowledge, this study is the first to test the potential of unsupervised deep learning methods for chronic cough investigation, which also shows a great advantage over existing algorithms for patient data clustering.
نوع الوثيقة: article in journal/newspaper
وصف الملف: application/pdf
اللغة: English
العلاقة: BMC Bioinformatics; Shao W, Luo X, Zhang Z, et al. Application of unsupervised deep learning algorithms for identification of specific clusters of chronic cough patients from EMR data. BMC Bioinformatics. 2022;23(Suppl 3):140. Published 2022 Apr 19. doi:10.1186/s12859-022-04680-4; https://hdl.handle.net/1805/33419Test
الإتاحة: https://doi.org/10.1186/s12859-022-04680-4Test
https://hdl.handle.net/1805/33419Test
حقوق: Attribution 4.0 International ; http://creativecommons.org/licenses/by/4.0Test/
رقم الانضمام: edsbas.38CBE20D
قاعدة البيانات: BASE