Xuyao Dai , Zhen Liu , Tingting Liu , Guokun Zuo , Jialin Xu , Changcheng Shi , Yuanyi Wang
{"title":"Modelling conversational agent with empathy mechanism","authors":"Xuyao Dai , Zhen Liu , Tingting Liu , Guokun Zuo , Jialin Xu , Changcheng Shi , Yuanyi Wang","doi":"10.1016/j.cogsys.2023.101206","DOIUrl":null,"url":null,"abstract":"<div><p>Empathy mechanism in communication is the cornerstone for effective and meaningful interaction. Establishing an empathy mechanism in conversation agent (CA) requires accurate recognition of users’ emotions to facilitate generates appropriate empathetic responses. Therefore, we proposed a Multimodal Emotion Recognition Model (MERM) to recognizes a user’s emotional state from multimodal data (audio, facial expressions, and conversation text) during conversation, and an Interactive Empathetic Conversation Model (IECM) to generate empathetic responses based on the MERM. Comparative and ablation study results indicated that the proposed models outperform existing methods in recognizing the user’s emotions and generating appropriate empathetic responses. We also conducted an experiment study, the results indicated that the CA significantly enhances the user’s emotional experience.</p></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389041723001407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
Empathy mechanism in communication is the cornerstone for effective and meaningful interaction. Establishing an empathy mechanism in conversation agent (CA) requires accurate recognition of users’ emotions to facilitate generates appropriate empathetic responses. Therefore, we proposed a Multimodal Emotion Recognition Model (MERM) to recognizes a user’s emotional state from multimodal data (audio, facial expressions, and conversation text) during conversation, and an Interactive Empathetic Conversation Model (IECM) to generate empathetic responses based on the MERM. Comparative and ablation study results indicated that the proposed models outperform existing methods in recognizing the user’s emotions and generating appropriate empathetic responses. We also conducted an experiment study, the results indicated that the CA significantly enhances the user’s emotional experience.
对话代理(CA)在人机交互研究中发挥着举足轻重的作用。以往的研究主要集中在会话生成、非语言行为表现和共情能力等方面。然而,由于用户生成的多模态信息中存在各种关系,CA 的行为与上下文信息之间可能会出现不一致。为了应对这一挑战,我们开展了跨学科研究,旨在克服以往研究中观察到的有关 CA 中移情机制和情感交互的局限性。在这方面,我们开发了一个多模态人机情感交互综合框架,使 CA 能够识别人类情感并做出适当回应。该框架包括一个虚拟现实环境中的人形 CA,以及一个交互式多模态情感识别-移情对话生成循环架构。CA通过利用多模态信号(包括音频、面部表情和对话文本)来推断用户的情绪。随后,它通过交互式移情对话模型展示语言行为。已进行的几项与情感相关的实验表明,CA 的多模态识别和表达能力以及行为一致性提高了多模态人机交互的自然度和可信度。总之,这项研究有助于开发多模态人机情感交互的综合框架,提高 CA 的质量、可信度和移情能力。