{"title":"Exploring Emotion Expression Recognition in Older Adults Interacting With a Virtual Coach","authors":"Cristina Palmero;Mikel deVelasco;Mohamed Amine Hmani;Aymen Mtibaa;Leila Ben Letaifa;Pau Buch-Cardona;Raquel Justo;Terry Amorese;Eduardo González-Fraile;Begoña Fernández-Ruanova;Jofre Tenorio-Laranga;Anna Torp Johansen;Micaela Rodrigues da Silva;Liva Jenny Martinussen;Maria Stylianou Korsnes;Gennaro Cordasco;Anna Esposito;Mounim A. El-Yacoubi;Dijana Petrovska-Delacrétaz;M. Inés Torres;Sergio Escalera","doi":"10.1109/TAFFC.2025.3558141","DOIUrl":null,"url":null,"abstract":"The EMPATHIC project aimed to design an emotionally expressive virtual coach capable of engaging healthy seniors to improve well-being and promote independent aging. In particular, the system’s human sensing capabilities allow for the perception of emotional states to provide a personalized experience. This paper outlines the development of the emotion expression recognition module of the virtual coach, encompassing data collection, annotation design, and a first methodological approach, all tailored to the project requirements. With the latter, we investigate the role of various modalities, individually and combined, for discrete emotion expression recognition in this context: speech from audio, and facial expressions, gaze, and head dynamics from video. The collected corpus includes users from Spain, France, and Norway, and was annotated separately for the audio and video channels with distinct emotional labels, allowing for a performance comparison across cultures and label types. Results confirm the informative power of the modalities studied for the emotional categories considered, with multimodal methods generally outperforming others (around 68% accuracy with audio labels and 72-74% with video labels). The findings are expected to contribute to the limited literature on emotion recognition applied to older adults in conversational human-machine interaction, and guide the development of future systems.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 3","pages":"2303-2320"},"PeriodicalIF":9.8000,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10953784","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10953784/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0
Abstract
The EMPATHIC project aimed to design an emotionally expressive virtual coach capable of engaging healthy seniors to improve well-being and promote independent aging. In particular, the system’s human sensing capabilities allow for the perception of emotional states to provide a personalized experience. This paper outlines the development of the emotion expression recognition module of the virtual coach, encompassing data collection, annotation design, and a first methodological approach, all tailored to the project requirements. Using this methodological approach, we investigate the role of various modalities, individually and combined, for discrete emotion expression recognition in this context: speech from audio, and facial expressions, gaze, and head dynamics from video. The collected corpus includes users from Spain, France, and Norway, and was annotated separately for the audio and video channels with distinct emotional labels, allowing for a performance comparison across cultures and label types. Results confirm the informative power of the modalities studied for the emotional categories considered, with multimodal methods generally outperforming unimodal ones (around 68% accuracy with audio labels and 72-74% with video labels). The findings are expected to contribute to the limited literature on emotion recognition applied to older adults in conversational human-machine interaction, and to guide the development of future systems.
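The abstract reports that multimodal combinations outperform the individual modalities, but it does not specify the fusion strategy here. As a rough, purely illustrative sketch (not the paper's actual pipeline), the Python code below shows one common late-fusion baseline: a separate classifier is trained per modality (speech, face, gaze/head), and class probabilities are averaged across modalities at prediction time. All feature dimensions, the number of emotion classes, and the data itself are hypothetical placeholders.

# Illustrative late-fusion baseline; NOT the paper's actual method.
# Feature dimensions, class count, and data are hypothetical placeholders.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 200  # hypothetical number of annotated segments

# Hypothetical per-segment feature vectors for each modality.
X_speech = rng.normal(size=(n, 40))    # e.g., audio descriptors
X_face = rng.normal(size=(n, 17))      # e.g., facial expression features
X_gaze_head = rng.normal(size=(n, 8))  # e.g., gaze + head dynamics
modalities = (X_speech, X_face, X_gaze_head)
y = rng.integers(0, 3, size=n)         # discrete emotion labels (3 classes assumed)

# Train one classifier per modality.
models = [LogisticRegression(max_iter=1000).fit(X, y) for X in modalities]

# Late fusion: average predicted class probabilities across modalities.
probs = np.mean([m.predict_proba(X) for m, X in zip(models, modalities)], axis=0)
pred = probs.argmax(axis=1)
print("fused accuracy (synthetic data):", (pred == y).mean())

Probability averaging is only one of several plausible fusion schemes; feature-level (early) fusion or learned fusion weights would work equally well as baselines, and the abstract does not indicate which variant the authors used.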
About the Journal:
The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems in which affect is a key consideration for usability. Surveys of existing work that provide new perspectives on the historical and future directions of this field are also welcome.