Speech emotion recognition in conversations using artificial intelligence: a systematic review and meta-analysis

IF 10.7 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence Review Pub Date : 2025-04-11 DOI:10.1007/s10462-025-11197-8

Ghada Alhussein, Ioannis Ziogas, Shiza Saleem, Leontios J. Hadjileontiadis

{"title":"Speech emotion recognition in conversations using artificial intelligence: a systematic review and meta-analysis","authors":"Ghada Alhussein, Ioannis Ziogas, Shiza Saleem, Leontios J. Hadjileontiadis","doi":"10.1007/s10462-025-11197-8","DOIUrl":null,"url":null,"abstract":"<div><p> Manifestations of emotion in social conversational interactions stand at a focal point in the rapidly growing affective computing area, with applications in healthcare, education and human-computer interaction. Artificial intelligence (AI) holds great potential in modeling the challenging dynamic nature of affect in speech conversation. In this paper, we analyze and criticize latest trends and open problems through a systematic review and multi-subgroup meta-analysis of AI approaches for emotion recognition in conversation (ERC). We adopt the PRISMA-DTA guidelines toward analysis of AI-driven speech ERC. A comprehensive database search through predefined query strings and selection criteria allowed for data extraction of essential diagnostic performance parameters. We analyze salient patterns related to methodological quality and risk of bias. Univariate random-effects models are then designed with a multi-subgroup perspective, centered around affective annotations models, while encompassing the ERC parameters of modalities, feature extraction and conversation style. 51 studies were systematically reviewed for qualitative analysis, whereas 27 articles were included in the meta-analysis. Diagnostic test performance manifested with high heterogeneity, with intriguing insights regarding affective state annotation, input modality, feature extraction methods, and dataset conversation style. Our analysis raised concerns regarding bias, reporting quality and inter-rater reliability in annotations. Our research contributes fine-grained insights as recommendations that tackle open-problems in ERC. While providing valuable information on diagnostic performance of AI in speech ERC, we underscore the imperative need for further advancements in annotations and models capable of handling diverse emotional expressions. </p><p>Trial Registration: PROSPERO identifier - CRD42023416879.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 7","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11197-8.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11197-8","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Manifestations of emotion in social conversational interactions stand at a focal point in the rapidly growing affective computing area, with applications in healthcare, education and human-computer interaction. Artificial intelligence (AI) holds great potential in modeling the challenging dynamic nature of affect in speech conversation. In this paper, we analyze and criticize latest trends and open problems through a systematic review and multi-subgroup meta-analysis of AI approaches for emotion recognition in conversation (ERC). We adopt the PRISMA-DTA guidelines toward analysis of AI-driven speech ERC. A comprehensive database search through predefined query strings and selection criteria allowed for data extraction of essential diagnostic performance parameters. We analyze salient patterns related to methodological quality and risk of bias. Univariate random-effects models are then designed with a multi-subgroup perspective, centered around affective annotations models, while encompassing the ERC parameters of modalities, feature extraction and conversation style. 51 studies were systematically reviewed for qualitative analysis, whereas 27 articles were included in the meta-analysis. Diagnostic test performance manifested with high heterogeneity, with intriguing insights regarding affective state annotation, input modality, feature extraction methods, and dataset conversation style. Our analysis raised concerns regarding bias, reporting quality and inter-rater reliability in annotations. Our research contributes fine-grained insights as recommendations that tackle open-problems in ERC. While providing valuable information on diagnostic performance of AI in speech ERC, we underscore the imperative need for further advancements in annotations and models capable of handling diverse emotional expressions.

Trial Registration: PROSPERO identifier - CRD42023416879.

查看原文本刊更多论文

人工智能对话中的语音情感识别：系统回顾与元分析

在快速发展的情感计算领域中，社交会话交互中的情感表现是一个焦点，在医疗保健、教育和人机交互中都有应用。人工智能（AI）在模拟语音会话中具有挑战性的情感动态特性方面具有巨大的潜力。在本文中，我们通过对对话中情感识别（ERC）的人工智能方法的系统回顾和多亚群元分析，分析和批评了最新趋势和开放问题。我们采用PRISMA-DTA指南来分析人工智能驱动的语音ERC。通过预定义的查询字符串和选择标准进行全面的数据库搜索，可以提取基本诊断性能参数的数据。我们分析了与方法学质量和偏倚风险相关的显著模式。单变量随机效应模型以多子组视角设计，以情感注释模型为中心，同时包含模态、特征提取和会话风格的ERC参数。51项研究进行了系统的定性分析，27篇文章纳入了meta分析。诊断测试性能表现出高异质性，在情感状态注释、输入模式、特征提取方法和数据集会话风格方面具有有趣的见解。我们的分析引起了对注释的偏倚、报告质量和评分者之间可靠性的关注。我们的研究提供了细粒度的见解，作为解决ERC中开放问题的建议。在提供有关人工智能在语音ERC中的诊断性能的有价值信息的同时，我们强调了在能够处理各种情绪表达的注释和模型方面取得进一步进展的迫切需要。试验注册：PROSPERO标识符- CRD42023416879。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial Intelligence Review 工程技术-计算机：人工智能

CiteScore

22.00

自引率

3.30%

发文量

194

审稿时长

5.3 months

期刊介绍： Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.