Oracle-Checker Scheme for Evaluating a Generative Large Language Model

Bibliographic Details
Title: Oracle-Checker Scheme for Evaluating a Generative Large Language Model
Authors: Zeng, Yueling Jenny; Wang, Li-C.; Ibbetson, Thomas
Publication Year: 2024
Collection: Computer Science
Subject Terms: Computer Science - Computation and Language
Description: This work presents a novel approach, called the oracle-checker scheme, for evaluating the answer given by a generative large language model (LLM). Two types of checkers are presented. The first type of checker follows the idea of property testing. The second type of checker follows the idea of program checking. Their applications are demonstrated in two separate contexts, entity extraction and paraphrase decision, respectively.
Document Type: Working Paper
Open Access: http://arxiv.org/abs/2405.03170
Accession Number: edsarx.2405.03170
Database: arXiv