Speech Emotion Recognition in Mental Health: Systematic Review of Voice-Based Applications.

IF 5.8 2区 医学 Q1 PSYCHIATRY
Jmir Mental Health Pub Date : 2025-09-30 DOI:10.2196/74260
Eric Jordan, Raphaël Terrisse, Valeria Lucarini, Motasem Alrahabi, Marie-Odile Krebs, Julien Desclés, Christophe Lemey
{"title":"Speech Emotion Recognition in Mental Health: Systematic Review of Voice-Based Applications.","authors":"Eric Jordan, Raphaël Terrisse, Valeria Lucarini, Motasem Alrahabi, Marie-Odile Krebs, Julien Desclés, Christophe Lemey","doi":"10.2196/74260","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The field of speech emotion recognition (SER) encompasses a wide variety of approaches, with artificial intelligence technologies providing improvements in recent years. In the domain of mental health, the links between individuals' emotional states and pathological diagnoses are of particular interest.</p><p><strong>Objective: </strong>This study aimed to investigate the performance of tools combining SER and artificial intelligence approaches with a view to their use within clinical contexts and to determine the extent to which SER technologies have already been applied within clinical contexts.</p><p><strong>Methods: </strong>The review includes studies applied to speech (audio) signals for a select set of pathologies or disorders and only includes those studies that evaluate diagnostic performance using machine learning performance metrics or statistical correlation measures. The PubMed, IEEE Xplore, arXiv, and ScienceDirect databases were queried as recently as February 2025. The Quality Assessment of Diagnostic Accuracy Studies tool was used to measure the risk of bias.</p><p><strong>Results: </strong>A total of 14 articles were included in the final review. The included papers addressed suicide risk (3/14, 21%), depression (8/14, 57%), and psychotic disorders (3/14, 21%).</p><p><strong>Conclusions: </strong>SER technologies are mostly used indirectly in mental health research and in a wide variety of ways, including different architectures, datasets, and pathologies. This diversity makes a direct assessment of the technology challenging. Nonetheless, promising results are obtained in various studies that attempt to diagnose patients based on either indirect or direct results from SER models. These results highlight the potential for this technology to be used within a clinical setting. Future work should focus on how clinicians can use these technologies collaboratively.</p><p><strong>Trial registration: </strong>PROSPERO CRD420251006669; https://www.crd.york.ac.uk/PROSPERO/view/CRD420251006669.</p>","PeriodicalId":48616,"journal":{"name":"Jmir Mental Health","volume":"12 ","pages":"e74260"},"PeriodicalIF":5.8000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jmir Mental Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/74260","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The field of speech emotion recognition (SER) encompasses a wide variety of approaches, with artificial intelligence technologies providing improvements in recent years. In the domain of mental health, the links between individuals' emotional states and pathological diagnoses are of particular interest.

Objective: This study aimed to investigate the performance of tools combining SER and artificial intelligence approaches with a view to their use within clinical contexts and to determine the extent to which SER technologies have already been applied within clinical contexts.

Methods: The review includes studies applied to speech (audio) signals for a select set of pathologies or disorders and only includes those studies that evaluate diagnostic performance using machine learning performance metrics or statistical correlation measures. The PubMed, IEEE Xplore, arXiv, and ScienceDirect databases were queried as recently as February 2025. The Quality Assessment of Diagnostic Accuracy Studies tool was used to measure the risk of bias.

Results: A total of 14 articles were included in the final review. The included papers addressed suicide risk (3/14, 21%), depression (8/14, 57%), and psychotic disorders (3/14, 21%).

Conclusions: SER technologies are mostly used indirectly in mental health research and in a wide variety of ways, including different architectures, datasets, and pathologies. This diversity makes a direct assessment of the technology challenging. Nonetheless, promising results are obtained in various studies that attempt to diagnose patients based on either indirect or direct results from SER models. These results highlight the potential for this technology to be used within a clinical setting. Future work should focus on how clinicians can use these technologies collaboratively.

Trial registration: PROSPERO CRD420251006669; https://www.crd.york.ac.uk/PROSPERO/view/CRD420251006669.

心理健康中的语音情感识别:基于语音的应用系统综述。
背景:语音情感识别(SER)领域包含各种各样的方法,近年来人工智能技术提供了改进。在心理健康领域,个人情绪状态和病理诊断之间的联系是特别有趣的。目的:本研究旨在调查结合SER和人工智能方法的工具的性能,以确定SER技术在临床环境中的应用程度。方法:该综述包括应用于语音(音频)信号的研究,用于一组选定的病理或疾病,仅包括那些使用机器学习性能指标或统计相关措施评估诊断性能的研究。直到2025年2月,PubMed、IEEE explore、arXiv和ScienceDirect数据库都被查询过。使用诊断准确性研究质量评估工具来测量偏倚风险。结果:最终评审共纳入14篇文章。纳入的论文涉及自杀风险(3/14,21%)、抑郁症(8/14,57%)和精神障碍(3/14,21%)。结论:SER技术大多间接用于心理健康研究,并以多种方式使用,包括不同的架构、数据集和病理。这种多样性使得对该技术的直接评估具有挑战性。尽管如此,在试图根据SER模型的间接或直接结果诊断患者的各种研究中,都获得了令人鼓舞的结果。这些结果突出了该技术在临床环境中应用的潜力。未来的工作应侧重于临床医生如何协同使用这些技术。试验注册:PROSPERO CRD420251006669;https://www.crd.york.ac.uk/PROSPERO/view/CRD420251006669。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Jmir Mental Health
Jmir Mental Health Medicine-Psychiatry and Mental Health
CiteScore
10.80
自引率
3.80%
发文量
104
审稿时长
16 weeks
期刊介绍: JMIR Mental Health (JMH, ISSN 2368-7959) is a PubMed-indexed, peer-reviewed sister journal of JMIR, the leading eHealth journal (Impact Factor 2016: 5.175). JMIR Mental Health focusses on digital health and Internet interventions, technologies and electronic innovations (software and hardware) for mental health, addictions, online counselling and behaviour change. This includes formative evaluation and system descriptions, theoretical papers, review papers, viewpoint/vision papers, and rigorous evaluations.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信