Short Answer Detection for Open Questions: A Sequence Labeling Approach with Deep Learning Models

التفاصيل البيبلوغرافية
العنوان:	Short Answer Detection for Open Questions: A Sequence Labeling Approach with Deep Learning Models
المؤلفون:	Samuel González-López, Zeltzyn Guadalupe Montes-Rosales, Adrián Pastor López-Monroy, Aurelio López-López, Jesús Miguel García-Gorrostieta
المصدر:	Mathematics; Volume 10; Issue 13; Pages: 2259
بيانات النشر:	Multidisciplinary Digital Publishing Institute
سنة النشر:	2022
المجموعة:	MDPI Open Access Publishing
مصطلحات موضوعية:	question answering, open questions, academic document analysis, sequence labeling, deep learning
الوصف:	Evaluating the response to open questions is a complex process since it requires prior knowledge of a specific topic and language. The computational challenge is to analyze the text by learning from a set of correct examples to train a model and then predict unseen cases. Thus, we will be able to capture patterns that characterize answers to open questions. In this work, we used a sequence labeling and deep learning approach to detect if a text segment corresponds to the answer to an open question. We focused our efforts on analyzing the general objective of a thesis according to three methodological questions: Q1: What will be done? Q2: Why is it going to be done? Q3: How is it going to be done? First, we use the Beginning-Inside-Outside (BIO) format to label a corpus of targets with the help of two annotators. Subsequently, we adapted four state-of-the-art architectures to analyze the objective: Bidirectional Encoder Representations from Transformers (BERT-BETO) for Spanish, Code Switching Embeddings from Language Model (CS-ELMo), Multitask Neural Network (MTNN), and Bidirectional Long Short-Term Memory (Bi-LSTM). The results of the F-measure for detection of the answers to the three questions indicate that the BERT-BETO and CS-ELMo architecture obtained the best effectivity. The architecture that obtained the best results was BERT-BETO. BERT was the architecture that obtained more accurate results. The result of a detection analysis for Q1, Q2 and Q3 on a non-annotated corpus at the graduate and undergraduate levels is also reported. We found that for detecting the three questions, only the doctoral academic level reached 100%; that is, the doctoral objectives did contain the answer to the three questions.
نوع الوثيقة:	text
وصف الملف:	application/pdf
اللغة:	English
العلاقة:	Mathematics and Computer Science; https://dx.doi.org/10.3390/math10132259Test
DOI:	10.3390/math10132259
الإتاحة:	https://doi.org/10.3390/math10132259Test
حقوق:	https://creativecommons.org/licenses/by/4.0Test/
رقم الانضمام:	edsbas.AEA33FB6
قاعدة البيانات:	BASE

View record in BASE

الوصف
DOI:	10.3390/math10132259