Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

التفاصيل البيبلوغرافية
العنوان: Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms
المؤلفون: Mathew, Jonat John, Ahsan, Rakin, Furukawa, Sae, Kumar, Jagdish Gautham Krishna, Pallan, Huzaifa, Padda, Agamjeet Singh, Adamski, Sara, Reddiboina, Madhu, Pankajakshan, Arjun
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Sound, Computer Science - Cryptography and Security, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
الوصف: Deepfake audio poses a rising threat in communication platforms, necessitating real-time detection for audio stream integrity. Unlike traditional non-real-time approaches, this study assesses the viability of employing static deepfake audio detection models in real-time communication platforms. An executable software is developed for cross-platform compatibility, enabling real-time execution. Two deepfake audio detection models based on Resnet and LCNN architectures are implemented using the ASVspoof 2019 dataset, achieving benchmark performances compared to ASVspoof 2019 challenge baselines. The study proposes strategies and frameworks for enhancing these models, paving the way for real-time deepfake audio detection in communication platforms. This work contributes to the advancement of audio stream security, ensuring robust detection capabilities in dynamic, real-time communication scenarios.
نوع الوثيقة: Working Paper
الوصول الحر: http://arxiv.org/abs/2403.11778Test
رقم الانضمام: edsarx.2403.11778
قاعدة البيانات: arXiv