A transformation-driven approach for recognizing textual entailment

التفاصيل البيبلوغرافية
العنوان: A transformation-driven approach for recognizing textual entailment
المؤلفون: Roberto Zanoli, Silvia Colombo
المصدر: Natural Language Engineering. 23:507-534
بيانات النشر: Cambridge University Press (CUP), 2016.
سنة النشر: 2016
مصطلحات موضوعية: Linguistics and Language, Relation (database), Computer science, business.industry, Supervised learning, Inference, 02 engineering and technology, computer.software_genre, Language and Linguistics, Data set, Set (abstract data type), Fragment (logic), Artificial Intelligence, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Textual entailment, Classifier (UML), computer, Software, Natural language processing
الوصف: Textual Entailment is a directional relation between two text fragments. The relation holds whenever the truth of one text fragment, called Hypothesis (H), follows from another text fragment, called Text (T). Up until now, using machine learning approaches for recognizing textual entailment has been hampered by the limited availability of data. We present an approach based on syntactic transformations and machine learning techniques which is designed to fit well with a new type of available data sets that are larger but less complex than data sets used in the past. The transformations are not predefined, but calculated from the data sets, and then used as features in a supervised learning classifier. The method has been evaluated using two data sets: the SICK data set and the EXCITEMENT English data set. While both data sets are of a larger order of magnitude than data sets such as RTE-3, they are also of lower levels of complexity, each in its own way. SICK consists of pairs created by applying a predefined set of syntactic and lexical rules to its T and H pairs, which can be accurately captured by our transformations. The EXCITEMENT English data contains short pieces of text that do not require a high degree of text understanding to be annotated. The resulting AdArte system is simple to understand and implement, but also effective when compared with other existing systems. AdArte has been made freely available with the EXCITEMENT Open Platform, an open source platform for textual inference.
تدمد: 1469-8110
1351-3249
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_________::153834a51ca858f583874bd42d36cf24Test
https://doi.org/10.1017/s1351324916000176Test
حقوق: OPEN
رقم الانضمام: edsair.doi...........153834a51ca858f583874bd42d36cf24
قاعدة البيانات: OpenAIRE