دورية أكاديمية

ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech

التفاصيل البيبلوغرافية
العنوان: ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech
المؤلفون: Wang, Xin, Yamagishi, Junichi, Todisco, Massimiliano, Delgado, Hector, Nautsch, Andreas, Evans, Nicholas, Sahidullah, Md, Vestman, Ville, Kinnunen, Tomi, Aik Lee, Kong, Juvela, Lauri, Alku, Paavo, Peng, Yu-Huai, Hwang, Hsin-Te, Tsao, Yu, Wang, Hsin-Min, Le Maguer, Sebastien, Becker, Markus, Henderson, Fergus, Clark, Rob, Zhang, Yu, Wang, Quan, Jia, Ye, Onuma, Kai, Mushika, Koji, Kaneda, Takashi, Jiang, Yuan, Liu, Li-Juan, Wu, Yi-Chiao, Huang, Wen-Chin, Toda, Tomoki, Tanaka, Kou, Kameoka, Hirokazu, Steiner, Ingmar, Matrouf, Driss, Bonastre, Jean-Francois, Govender, Avashna, Ronanki, Srikanth, Zhang, Jing-Xuan, Ling, Zhen-Hua
المساهمون: Dept Signal Process and Acoust, Speech Communication Technology, National Institute of Informatics, EURECOM, Université de Lorraine, University of Eastern Finland, NEC Corporation, Academia Sinica, Trinity College Dublin, Google, USA, HOYA Corporation, IFLYTEK Co., Ltd., Nagoya University, NTT Communication Science Laboratories, AudEERING GmbH, Avignon Université, University of Edinburgh, University of Science and Technology of China, Aalto-yliopisto, Aalto University
بيانات النشر: Academic Press Inc.
سنة النشر: 2020
المجموعة: Aalto University Publication Archive (Aaltodoc) / Aalto-yliopiston julkaisuarkistoa
مصطلحات موضوعية: automatic speaker verification, countermeasure, anti-spoofing, presentation attack, presentation attack detection, text-to-speech synthesis, voice conversion, replay, ASVspoof challenge, biometrics, media forensics
الوصف: Automatic speaker verification (ASV) is one of the most natural and convenient means of biometric person recognition. Unfortunately, just like all other biometric systems, ASV is vulnerable to spoofing, also referred to as “presentation attacks.” These vulnerabilities are generally unacceptable and call for spoofing countermeasures or “presentation attack detection” systems. In addition to impersonation, ASV systems are vulnerable to replay, speech synthesis, and voice conversion attacks. The ASVspoof challenge initiative was created to foster research on anti-spoofing and to provide common platforms for the assessment and comparison of spoofing countermeasures. The first edition, ASVspoof 2015, focused upon the study of countermeasures for detecting of text-to-speech synthesis (TTS) and voice conversion (VC) attacks. The second edition, ASVspoof 2017, focused instead upon replay spoofing attacks and countermeasures. The ASVspoof 2019 edition is the first to consider all three spoofing attack types within a single challenge. While they originate from the same source database and same underlying protocol, they are explored in two specific use case scenarios. Spoofing attacks within a logical access (LA) scenario are generated with the latest speech synthesis and voice conversion technologies, including state-of-the-art neural acoustic and waveform model techniques. Replay spoofing attacks within a physical access (PA) scenario are generated through carefully controlled simulations that support much more revealing analysis than possible previously. Also new to the 2019 edition is the use of the tandem detection cost function metric, which reflects the impact of spoofing and countermeasures on the reliability of a fixed ASV system. This paper describes the database design, protocol, spoofing attack implementations, and baseline ASV and countermeasure results. It also describes a human assessment on spoofed data in logical access. It was demonstrated that the spoofing data in the ASVspoof 2019 database have varied ...
نوع الوثيقة: article in journal/newspaper
وصف الملف: application/pdf
اللغة: English
تدمد: 0885-2308
1095-8363
العلاقة: Computer Speech and Language; Volume 64; Wang, X, Yamagishi, J, Todisco, M, Delgado, H, Nautsch, A, Evans, N, Sahidullah, M, Vestman, V, Kinnunen, T, Aik Lee, K, Juvela, L, Alku, P, Peng, Y-H, Hwang, H-T, Tsao, Y, Wang, H-M, Le Maguer, S, Becker, M, Henderson, F, Clark, R, Zhang, Y, Wang, Q, Jia, Y, Onuma, K, Mushika, K, Kaneda, T, Jiang, Y, Liu, L-J, Wu, Y-C, Huang, W-C, Toda, T, Tanaka, K, Kameoka, H, Steiner, I, Matrouf, D, Bonastre, J-F, Govender, A, Ronanki, S, Zhang, J-X & Ling, Z-H 2020, ' ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech ', Computer Speech and Language, vol. 64, 101114 . https://doi.org/10.1016/j.csl.2020.101114Test; PURE UUID: ef07b412-410e-41e4-bcce-e51a348c2d75; PURE ITEMURL: https://research.aalto.fi/en/publications/ef07b412-410e-41e4-bcce-e51a348c2d75Test; PURE LINK: http://www.scopus.com/inward/record.url?scp=85085554705&partnerID=8YFLogxKTest; PURE LINK: https://arxiv.org/abs/1911.01601Test; PURE FILEURL: https://research.aalto.fi/files/42966245/Wang_ASVspoof2019.pdfTest; https://aaltodoc.aalto.fi/handle/123456789/114567Test; URN:NBN:fi:aalto-202205243414
DOI: 10.1016/j.csl.2020.101114
الإتاحة: https://doi.org/10.1016/j.csl.2020.101114Test
https://aaltodoc.aalto.fi/handle/123456789/114567Test
حقوق: openAccess
رقم الانضمام: edsbas.6581EA7
قاعدة البيانات: BASE
الوصف
تدمد:08852308
10958363
DOI:10.1016/j.csl.2020.101114