Ensemble method for Text Classification in medicine with multiple rare classes

التفاصيل البيبلوغرافية
العنوان: Ensemble method for Text Classification in medicine with multiple rare classes
المؤلفون: Alessandro Albano, Mariangela Sciandra, Antonella Plaia
المساهمون: Alessandro Albano, Mariangela Sciandra, Antonella Plaia
سنة النشر: 2023
المجموعة: IRIS Università degli Studi di Palermo
مصطلحات موضوعية: text classification, ensemble method, machine learning, clinical coding, Settore SECS-S/01 - Statistica
الوصف: The paper presents an ensemble method for text classification in the presence of multiple rare classes in the context of medical record data. Specifically, our study aims to classify clinical notes into multiple disease categories, including rare diseases. The Ensemble method involves combining the predictions of multiple machine learning models to predict the patient's diagnosis more accurately. We used three different machine learning algorithms, namely Support Vector Machine, Random Forest, and Naive Bayes, to generate three distinct models and combine their predictions through an ensemble method. The results demonstrate that the ensemble method improves the classification performance compared to individual models. We evaluated this approach on a dataset of 50,000 clinical notes with multiple rare classes.
نوع الوثيقة: book part
اللغة: English
العلاقة: info:eu-repo/semantics/altIdentifier/isbn/9788891935632; ispartofbook:Book of abstracts and short papers 14th Scientific Meeting of the Classification and Data Analysis Group; CLADAG 2023; numberofpages:4; https://hdl.handle.net/10447/611173Test
الإتاحة: https://hdl.handle.net/10447/611173Test
حقوق: info:eu-repo/semantics/closedAccess
رقم الانضمام: edsbas.5857CA48
قاعدة البيانات: BASE