FinTextQA: A Dataset for Long-form Financial Question Answering

التفاصيل البيبلوغرافية
العنوان:	FinTextQA: A Dataset for Long-form Financial Question Answering
المؤلفون:	Chen, Jian, Zhou, Peilin, Hua, Yining, Loh, Yingxin, Chen, Kehui, Li, Ziyuan, Zhu, Bing, Liang, Junwei
سنة النشر:	2024
المجموعة:	Computer Science
مصطلحات موضوعية:	Computer Science - Computation and Language, Computer Science - Artificial Intelligence
الوصف:	Accurate evaluation of financial question answering (QA) systems necessitates a comprehensive dataset encompassing diverse question types and contexts. However, current financial QA datasets lack scope diversity and question complexity. This work introduces FinTextQA, a novel dataset for long-form question answering (LFQA) in finance. FinTextQA comprises 1,262 high-quality, source-attributed QA pairs extracted and selected from finance textbooks and government agency websites.Moreover, we developed a Retrieval-Augmented Generation (RAG)-based LFQA system, comprising an embedder, retriever, reranker, and generator. A multi-faceted evaluation approach, including human ranking, automatic metrics, and GPT-4 scoring, was employed to benchmark the performance of different LFQA system configurations under heightened noisy conditions. The results indicate that: (1) Among all compared generators, Baichuan2-7B competes closely with GPT-3.5-turbo in accuracy score; (2) The most effective system configuration on our dataset involved setting the embedder, retriever, reranker, and generator as Ada2, Automated Merged Retrieval, Bge-Reranker-Base, and Baichuan2-7B, respectively; (3) models are less susceptible to noise after the length of contexts reaching a specific threshold.
نوع الوثيقة:	Working Paper
الوصول الحر:	http://arxiv.org/abs/2405.09980Test
رقم الانضمام:	edsarx.2405.09980
قاعدة البيانات:	arXiv

View record in Arxiv

الوصف
الوصف غير متاح.