Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning

التفاصيل البيبلوغرافية
العنوان: Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning
المؤلفون: Brian R. Baucom, Shrikanth S. Narayanan, Panayiotis G. Georgiou, Nasir, Craig J. Bryan
المصدر: IEEE Transactions on Affective Computing. 13:1651-1663
بيانات النشر: Institute of Electrical and Electronics Engineers (IEEE), 2022.
سنة النشر: 2022
مصطلحات موضوعية: Artificial neural network, Computer science, business.industry, Speech recognition, media_common.quotation_subject, Deep learning, Feature extraction, Interpersonal communication, Distance measures, Human-Computer Interaction, Unsupervised learning, Conversation, Artificial intelligence, business, Entrainment (chronobiology), Software, media_common
الوصف: In interpersonal spoken interactions, individuals tend to adapt to their conversation partner's vocal characteristics to become similar, a phenomenon known as entrainment. A majority of the previous computational approaches are often knowledge driven and linear and fail to capture the inherent nonlinearity of entrainment. In this work, we present an unsupervised deep learning framework to derive a representation from speech features containing information relevant for vocal entrainment. We investigate both an encoding based approach and a more robust triplet network based approach within the proposed framework. We also propose a number of distance measures in the representation space and use them for quantification of entrainment. We first validate the proposed distances by using them to distinguish real conversations from fake ones. Then we also demonstrate their applications in relation to modeling several entrainment-relevant behaviors in observational psychotherapy, namely agreement, blame and emotional bond.
تدمد: 2371-9850
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_________::608646ea384560063ddb9ce80da36315Test
https://doi.org/10.1109/taffc.2020.3024972Test
حقوق: CLOSED
رقم الانضمام: edsair.doi...........608646ea384560063ddb9ce80da36315
قاعدة البيانات: OpenAIRE