The POTUS Corpus, a database of weekly addresses for the study of stance in politics and virtual agents

التفاصيل البيبلوغرافية
العنوان: The POTUS Corpus, a database of weekly addresses for the study of stance in politics and virtual agents
المؤلفون: Janssoone, Thomas, Bailly, Kevin, Richard, Gael, Clavel, Chloé
المساهمون: Université Pierre et Marie Curie - Paris 6 (UPMC), Interaction, Institut des Systèmes Intelligents et de Robotique (ISIR), Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS), Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, Télécom ParisTech-Institut Mines-Télécom Paris (IMT)-Centre National de la Recherche Scientifique (CNRS)
المصدر: Conference on Language Resources and Evaluation (LREC 2020) ; https://telecom-paris.hal.science/hal-02873020Test ; Conference on Language Resources and Evaluation (LREC 2020), 2020, Marseille, France. pp.11 - 16
بيانات النشر: HAL CCSD
سنة النشر: 2020
مصطلحات موضوعية: Multi-modal Social Signal, Signal Processing, Embodied Conversational Agent, Audio Video Corpus, POTUS, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
جغرافية الموضوع: Marseille, France
الوصف: International audience ; One of the main challenges in the field of Embodied Conversational Agent (ECA) is to generate socially believable agents. The common strategy for agent behaviour synthesis is to rely on dedicated corpus analysis. Such a corpus is composed of multimedia files of socio-emotional behaviors which have been annotated by external observers. The underlying idea is to identify interaction information for the agent's socio-emotional behavior by checking whether the intended socio-emotional behavior is actually perceived by humans. Then, the annotations can be used as learning classes for machine learning algorithms applied to the social signals. This paper introduces the POTUS Corpus composed of high-quality audio-video files of political addresses to the American people. Two protagonists are present in this database. First, it includes speeches of former president Barack Obama to the American people. Secondly, it provides videos of these same speeches given by a virtual agent named Rodrigue. The ECA reproduces the original address as closely as possible using social signals automatically extracted from the original one. Both are annotated for social attitudes, providing information about the stance observed in each file. It also provides the social signals automatically extracted from Obama's addresses used to generate Rodrigue's ones.
نوع الوثيقة: conference object
اللغة: English
العلاقة: hal-02873020; https://telecom-paris.hal.science/hal-02873020Test; https://telecom-paris.hal.science/hal-02873020/documentTest; https://telecom-paris.hal.science/hal-02873020/file/2020.lrec-1.193.pdfTest
الإتاحة: https://telecom-paris.hal.science/hal-02873020Test
https://telecom-paris.hal.science/hal-02873020/documentTest
https://telecom-paris.hal.science/hal-02873020/file/2020.lrec-1.193.pdfTest
حقوق: http://creativecommons.org/licenses/by-ncTest/ ; info:eu-repo/semantics/OpenAccess
رقم الانضمام: edsbas.A7BAE137
قاعدة البيانات: BASE