Academic Journal

MF-Saudi: A multimodal framework for bridging the gap between audio and textual data for Saudi dialect detection

Bibliographic Details
Title: MF-Saudi: A multimodal framework for bridging the gap between audio and textual data for Saudi dialect detection
Authors: Raed Alharbi
Source: Journal of King Saud University: Computer and Information Sciences, Vol 36, Iss 6, Pp 102084- (2024)
Publisher Information: Elsevier, 2024.
Publication Year: 2024
Collection: LCC:Electronic computers. Computer science
Subject Terms: Dialectal detection, Arabic dialects, Multimodal framework, Information fusion, Electronic computers. Computer science, QA75.5-76.95
Description: Detecting variations in dialects within a language can be challenging, particularly in regions with rich linguistic diversity like Saudi Arabia. To our knowledge, no prior attempts have been made to develop a multimodal, audio–textual framework for Saudi dialect detection. The current approaches often concentrate on detecting dialects only based on audio or textual data, which fails to capture the complex relationship between both modalities. In this paper, we propose a novel Multimodal Framework, called MF-Saudi, for Saudi dialect detection. The framework consists of three main components: (1) a pretrained BERT encoder for extracting and encoding textual information; (2) an acoustic model for representing audio signals and fusing them with textual information via the fusion layer; and (3) an alignment learning module to develop meaningful representations that capture the complexities of audio–text relationships, resulting in improved dialect detection. We conduct empirical evaluations on a real-world dataset, demonstrating that our solution outperforms some of the state-of-the-art baseline methods. The experiment’s code can be found here: https://github.com/raed19/MF-Saudi.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 1319-1578
Relation: http://www.sciencedirect.com/science/article/pii/S1319157824001733; https://doaj.org/toc/1319-1578
DOI: 10.1016/j.jksuci.2024.102084
Open Access: https://doaj.org/article/ba12643c53a0486abc368c9fb88380b2
Accession Number: edsdoj.ba12643c53a0486abc368c9fb88380b2
Database: Directory of Open Access Journals
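
Illustrative note: the description names three components (a text encoder, an acoustic model with a fusion layer, and an alignment learning module) but gives no formulas. The sketch below is only one plausible reading, not the paper's implementation. It assumes concatenation as the fusion step and a symmetric contrastive (InfoNCE-style) loss for alignment. The function names `fuse` and `alignment_loss`, the `temperature` parameter, and the toy vectors standing in for BERT and acoustic embeddings are all hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse(text_vec, audio_vec):
    """Assumed fusion step: plain concatenation of the two embeddings."""
    return list(text_vec) + list(audio_vec)


def alignment_loss(text_batch, audio_batch, temperature=0.1):
    """Assumed alignment objective: symmetric contrastive loss.

    Item i of text_batch and item i of audio_batch are treated as a matching
    pair; every other combination in the batch is treated as a mismatch.
    Both batches must use the same embedding dimension.
    """
    t = [l2_normalize(v) for v in text_batch]
    a = [l2_normalize(v) for v in audio_batch]
    n = len(t)
    # Temperature-scaled cosine similarity between every text/audio pair.
    sims = [
        [sum(x * y for x, y in zip(t[i], a[j])) / temperature for j in range(n)]
        for i in range(n)
    ]

    def mean_cross_entropy(rows):
        # Cross-entropy with the diagonal entry as the correct class.
        total = 0.0
        for i, row in enumerate(rows):
            peak = max(row)
            log_z = peak + math.log(sum(math.exp(s - peak) for s in row))
            total += log_z - row[i]
        return total / n

    audio_to_text = [list(col) for col in zip(*sims)]
    return 0.5 * (mean_cross_entropy(sims) + mean_cross_entropy(audio_to_text))


# Toy embeddings standing in for encoder outputs (hypothetical values).
text = [[1.0, 0.0], [0.0, 1.0]]
audio_aligned = [[1.0, 0.0], [0.0, 1.0]]
audio_swapped = [[0.0, 1.0], [1.0, 0.0]]

fused = fuse(text[0], audio_aligned[0])
loss_aligned = alignment_loss(text, audio_aligned)
loss_swapped = alignment_loss(text, audio_swapped)
```

Under these assumptions, correctly paired text and audio embeddings give a lower loss than swapped pairs, which is the behaviour an alignment objective is meant to reward. A real system would obtain the embeddings from trained encoders and would probably project them to a shared dimension first.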