دورية أكاديمية

Referring expression comprehension model with matching detection and linguistic feedback

التفاصيل البيبلوغرافية
العنوان: Referring expression comprehension model with matching detection and linguistic feedback
المؤلفون: Jianming Wang, Enjie Cui, Kunliang Liu, Yukuan Sun, Jiayu Liang, Chunmiao Yuan, Xiaojie Duan, Guanghao Jin, Tae‐Sun Chung
المصدر: IET Computer Vision, Vol 14, Iss 8, Pp 625-633 (2020)
بيانات النشر: Wiley, 2020.
سنة النشر: 2020
المجموعة: LCC:Computer applications to medicine. Medical informatics
LCC:Computer software
مصطلحات موضوعية: matching detection module, NP‐RefCOCO+, expression‐image pairs, relationship detection module, entity detection module, expression parsing module, Computer applications to medicine. Medical informatics, R858-859.7, Computer software, QA76.75-76.765
الوصف: The task of referring expression comprehension (REC) is to localise an image region of a specific object described by a natural language expression, and all existing REC methods assume that the object described by the referring expression must be located in the given image. However, this assumption is not correct in some real applications. For example, a visually impaired user might tell his robot ‘please take the laptop on the table to me’. In fact, the laptop is not on the table anymore. To address this problem, the authors propose a novel REC model to deal with the situation where expression‐image mismatching occurs and explain the mismatching by linguistic feedback. The authors' REC model consists of four modules: the expression parsing module, the entity detection module, the relationship detection module, and the matching detection module. They built a data set called NP‐RefCOCO+ from RefCOCO+ including both positive samples and negative samples. The positive samples are original expression‐image pairs in RefCOCO+. The negative samples are the expression‐image pairs in RefCOCO+, whose expressions are replaced. They evaluate the model on NP‐RefCOCO+ and the experimental results show the advantages of their method for dealing with the problem of expression‐image mismatching.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 1751-9640
1751-9632
العلاقة: https://doaj.org/toc/1751-9632Test; https://doaj.org/toc/1751-9640Test
DOI: 10.1049/iet-cvi.2019.0483
الوصول الحر: https://doaj.org/article/b27732c720fc473ab2d43d3259ea3d1aTest
رقم الانضمام: edsdoj.b27732c720fc473ab2d43d3259ea3d1a
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:17519640
17519632
DOI:10.1049/iet-cvi.2019.0483