دورية أكاديمية

Enhancing random forest classification with NLP in DAMEH: A system for DAta Management in eHealth Domain.

التفاصيل البيبلوغرافية
العنوان: Enhancing random forest classification with NLP in DAMEH: A system for DAta Management in eHealth Domain.
المؤلفون: Amato, Flora1 (AUTHOR) flora.amato@unina.it, Coppolino, Luigi1,2 (AUTHOR) luigi.coppolino@uniparthenope.it, Cozzolino, Giovanni1 (AUTHOR) giovanni.cozzolino@unina.it, Mazzeo, Giovanni1 (AUTHOR) giovanni.mazzeo@uniparthenope.it, Moscato, Francesco1,3 (AUTHOR) fmoscato@unisa.it, Nardone, Roberto1,4 (AUTHOR) roberto.nardone@unirc.it
المصدر: Neurocomputing. Jul2021, Vol. 444, p79-91. 13p.
مصطلحات موضوعية: *NATURAL language processing, *RANDOM forest algorithms, *TELEMEDICINE, *DATA management, *SMART devices, *SMART cities
مصطلحات جغرافية: ITALY
مستخلص: • We apply NLP approaches in features choice for enhancing Classifier based on Random Forests approach. • We analyze medical records to retrieve features for clinical records classifications in smart environments. • We use a Machine Learning approach to build a fast multi-classification schema. • We apply the methodology to real case studies from health-care organizations in Italy. • We show accuracy of presented approach in terms of Accuracy-Rejection curves. The use of pervasive IoT devices in Smart Cities, have increased the Volume of data produced in many and many field. Interesting and very useful applications grow up in number in E-health domain, where smart devices are used in order to manage huge amount of data, in highly distributed environments, in order to provide smart services able to collect data to fill medical records of patients. The problem here is to gather data, to produce records and to analyze medical records depending on their contents. Since data gathering involve very different devices (not only wearable medical sensors, but also environmental smart devices, like weather, pollution and other sensors) it is very difficult to classify data depending their contents, in order to enable better management of patients. Data from smart devices couple with medical records written in natural language: we describe here an architecture that is able to determine best features for classification, depending on existent medical records. The architecture is based on pre-filtering phase based on Natural Language Processing, that is able to enhance Machine learning classification based on Random Forests. We carried on experiments on about 5000 medical records from real (anonymized) case studies from various health-care organizations in Italy. We show accuracy of the presented approach in terms of Accuracy-Rejection curves. [ABSTRACT FROM AUTHOR]
قاعدة البيانات: Academic Search Index
الوصف
تدمد:09252312
DOI:10.1016/j.neucom.2020.08.091