دورية أكاديمية

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development

التفاصيل البيبلوغرافية
العنوان: A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development
المؤلفون: Li, Linfeng, Wang, Peng, Wang, Yao, Wang, Shenghui, Yan, Jun, Jiang, Jinpeng, Tang, Buzhou, Wang, Chengliang, Liu, Yuting
المصدر: JMIR Medical Informatics, Vol 8, Iss 5, p e17645 (2020)
بيانات النشر: JMIR Publications, 2020.
سنة النشر: 2020
المجموعة: LCC:Computer applications to medicine. Medical informatics
مصطلحات موضوعية: Computer applications to medicine. Medical informatics, R858-859.7
الوصف: BackgroundKnowledge graph embedding is an effective semantic representation method for entities and relations in knowledge graphs. Several translation-based algorithms, including TransE, TransH, TransR, TransD, and TranSparse, have been proposed to learn effective embedding vectors from typical knowledge graphs in which the relations between head and tail entities are deterministic. However, in medical knowledge graphs, the relations between head and tail entities are inherently probabilistic. This difference introduces a challenge in embedding medical knowledge graphs. ObjectiveWe aimed to address the challenge of how to learn the probability values of triplets into representation vectors by making enhancements to existing TransX (where X is E, H, R, D, or Sparse) algorithms, including the following: (1) constructing a mapping function between the score value and the probability, and (2) introducing probability-based loss of triplets into the original margin-based loss function. MethodsWe performed the proposed PrTransX algorithm on a medical knowledge graph that we built from large-scale real-world electronic medical records data. We evaluated the embeddings using link prediction task. ResultsCompared with the corresponding TransX algorithms, the proposed PrTransX performed better than the TransX model in all evaluation indicators, achieving a higher proportion of corrected entities ranked in the top 10 and normalized discounted cumulative gain of the top 10 predicted tail entities, and lower mean rank. ConclusionsThe proposed PrTransX successfully incorporated the uncertainty of the knowledge triplets into the embedding vectors.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2291-9694
العلاقة: https://medinform.jmir.org/2020/5/e17645Test; https://doaj.org/toc/2291-9694Test
DOI: 10.2196/17645
الوصول الحر: https://doaj.org/article/89828163ab1741de9494c702a29c7413Test
رقم الانضمام: edsdoj.89828163ab1741de9494c702a29c7413
قاعدة البيانات: Directory of Open Access Journals