دورية أكاديمية

Natural language processing for structuring clinical text data on depression using UK-CRIS

التفاصيل البيبلوغرافية
العنوان: Natural language processing for structuring clinical text data on depression using UK-CRIS
المؤلفون: Vaci, Nemanja, Liu, Qiang, Kormilitzin, Andrey, De Crescenzo, Franco, Kurtulmus, Ayse, Harvey, Jade, O'Dell, Bessie, Innocent, Simeon, Tomlinson, Anneka, Cipriani, Andrea, Nevado-Holgado, Alejo
بيانات النشر: BMJ Publishing Group Ltd
سنة النشر: 2020
المجموعة: HighWire Press (Stanford University)
مصطلحات موضوعية: Original research
الوصف: Background Utilisation of routinely collected electronic health records from secondary care offers unprecedented possibilities for medical science research but can also present difficulties. One key issue is that medical information is presented as free-form text and, therefore, requires time commitment from clinicians to manually extract salient information. Natural language processing (NLP) methods can be used to automatically extract clinically relevant information. Objective Our aim is to use natural language processing (NLP) to capture real-world data on individuals with depression from the Clinical Record Interactive Search (CRIS) clinical text to foster the use of electronic healthcare data in mental health research. Methods We used a combination of methods to extract salient information from electronic health records. First, clinical experts define the information of interest and subsequently build the training and testing corpora for statistical models. Second, we built and fine-tuned the statistical models using active learning procedures. Findings Results show a high degree of accuracy in the extraction of drug-related information. Contrastingly, a much lower degree of accuracy is demonstrated in relation to auxiliary variables. In combination with state-of-the-art active learning paradigms, the performance of the model increases considerably. Conclusions This study illustrates the feasibility of using the natural language processing models and proposes a research pipeline to be used for accurately extracting information from electronic health records. Clinical implications Real-world, individual patient data are an invaluable source of information, which can be used to better personalise treatment.
نوع الوثيقة: text
وصف الملف: text/html
اللغة: English
العلاقة: http://ebmh.bmj.com/cgi/content/short/23/1/21Test; http://dx.doi.org/10.1136/ebmental-2019-300134Test
DOI: 10.1136/ebmental-2019-300134
الإتاحة: https://doi.org/10.1136/ebmental-2019-300134Test
http://ebmh.bmj.com/cgi/content/short/23/1/21Test
حقوق: Copyright (C) 2020, Royal College of Psychiatrists
رقم الانضمام: edsbas.B30AD296
قاعدة البيانات: BASE