دورية أكاديمية

Data-Driven Solution to Identify Sentiments from Online Drug Reviews

التفاصيل البيبلوغرافية
العنوان: Data-Driven Solution to Identify Sentiments from Online Drug Reviews
المؤلفون: Rezaul Haque, Saddam Hossain Laskar, Katura Gania Khushbu, Md Junayed Hasan, Jia Uddin
المصدر: Computers, Vol 12, Iss 4, p 87 (2023)
بيانات النشر: MDPI AG, 2023.
سنة النشر: 2023
المجموعة: LCC:Electronic computers. Computer science
مصطلحات موضوعية: deep learning, word embedding, Bi-LSTM, GloVe, drug sentiment analysis, drug discovery, Electronic computers. Computer science, QA75.5-76.95
الوصف: With the proliferation of the internet, social networking sites have become a primary source of user-generated content, including vast amounts of information about medications, diagnoses, treatments, and disorders. Comments on previously used medicines, contained within these data, can be leveraged to identify crucial adverse drug reactions, and machine learning (ML) approaches such as sentiment analysis (SA) can be employed to derive valuable insights. However, given the sheer volume of comments, it is often impractical for consumers to manually review all of them before determining a purchase decision. Therefore, drug assessments can serve as a valuable source of medical information for both healthcare professionals and the general public, aiding in decision making and improving public monitoring systems by revealing collective experiences. Nonetheless, the unstructured and linguistic nature of the comments poses a significant challenge for effective categorization, with previous studies having utilized machine and deep learning (DL) algorithms to address this challenge. Despite both approaches showing promising results, DL classifiers outperformed ML classifiers in previous studies. Therefore, the objective of our study was to improve upon earlier research by applying SA to medication reviews and training five ML algorithms on two distinct feature extractions and four DL classifiers on two different word-embedding approaches to obtain higher categorization scores. Our findings indicated that the random forest trained on the count vectorizer outperformed all other ML algorithms, achieving an accuracy and F1 score of 96.65% and 96.42%, respectively. Furthermore, the bidirectional LSTM (Bi-LSTM) model trained on GloVe embedding resulted in an even better accuracy and F1 score, reaching 97.40% and 97.42%, respectively. Hence, by utilizing appropriate natural language processing and ML algorithms, we were able to achieve superior results compared to earlier studies.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2073-431X
العلاقة: https://www.mdpi.com/2073-431X/12/4/87Test; https://doaj.org/toc/2073-431XTest
DOI: 10.3390/computers12040087
الوصول الحر: https://doaj.org/article/8bd6d487826c41ef848fe04608fa921fTest
رقم الانضمام: edsdoj.8bd6d487826c41ef848fe04608fa921f
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:2073431X
DOI:10.3390/computers12040087