语音情感识别的深度神经模糊框架。

IF 1.6 4区医学 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computer Methods in Biomechanics and Biomedical Engineering Pub Date : 2025-09-17 DOI:10.1080/10255842.2025.2559060

Qingqing Zhang

{"title":"语音情感识别的深度神经模糊框架。","authors":"Qingqing Zhang","doi":"10.1080/10255842.2025.2559060","DOIUrl":null,"url":null,"abstract":"Speech Emotion Recognition (SER) is crucial in fields like healthcare and education, requiring robust techniques for accurate emotion detection. This paper proposes a deep neuro-fuzzy framework combining Deep Neural Networks (DNN) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS). It includes fuzzification, deep feature extraction, and defuzzification units, enhancing SER accuracy while addressing ANFIS limitations with high-dimensional data and DNN's lack of interpretability. The scheme's productivity on three standard speech databases is appraised: RML, SAVEE, and RAVDESS. The results indicate that our framework outperforms ANFIS, DNN, and pre-trained models, achieving up to 97.95% accuracy and showing strong potential for future SER research.","PeriodicalId":50640,"journal":{"name":"Computer Methods in Biomechanics and Biomedical Engineering","volume":" ","pages":"1-12"},"PeriodicalIF":1.6000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A deep neuro-fuzzy framework for speech emotion recognition.\",\"authors\":\"Qingqing Zhang\",\"doi\":\"10.1080/10255842.2025.2559060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech Emotion Recognition (SER) is crucial in fields like healthcare and education, requiring robust techniques for accurate emotion detection. This paper proposes a deep neuro-fuzzy framework combining Deep Neural Networks (DNN) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS). It includes fuzzification, deep feature extraction, and defuzzification units, enhancing SER accuracy while addressing ANFIS limitations with high-dimensional data and DNN's lack of interpretability. The scheme's productivity on three standard speech databases is appraised: RML, SAVEE, and RAVDESS. The results indicate that our framework outperforms ANFIS, DNN, and pre-trained models, achieving up to 97.95% accuracy and showing strong potential for future SER research.\",\"PeriodicalId\":50640,\"journal\":{\"name\":\"Computer Methods in Biomechanics and Biomedical Engineering\",\"volume\":\" \",\"pages\":\"1-12\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Methods in Biomechanics and Biomedical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1080/10255842.2025.2559060\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Biomechanics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/10255842.2025.2559060","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

语音情感识别（SER）在医疗保健和教育等领域至关重要，需要强大的技术来准确检测情感。本文提出了一种结合深度神经网络（DNN）和自适应神经模糊推理系统（ANFIS）的深度神经模糊框架。它包括模糊化、深度特征提取和去模糊化单元，提高了SER的准确性，同时解决了高维数据和DNN缺乏可解释性的ANFIS限制。在RML、SAVEE和RAVDESS三种标准语音数据库上评价了该方案的生产率。结果表明，我们的框架优于ANFIS、DNN和预训练模型，准确率高达97.95%，在未来的SER研究中显示出强大的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A deep neuro-fuzzy framework for speech emotion recognition.

Speech Emotion Recognition (SER) is crucial in fields like healthcare and education, requiring robust techniques for accurate emotion detection. This paper proposes a deep neuro-fuzzy framework combining Deep Neural Networks (DNN) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS). It includes fuzzification, deep feature extraction, and defuzzification units, enhancing SER accuracy while addressing ANFIS limitations with high-dimensional data and DNN's lack of interpretability. The scheme's productivity on three standard speech databases is appraised: RML, SAVEE, and RAVDESS. The results indicate that our framework outperforms ANFIS, DNN, and pre-trained models, achieving up to 97.95% accuracy and showing strong potential for future SER research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Methods in Biomechanics and Biomedical Engineering 工程技术-工程：生物医学

CiteScore

4.10

自引率

6.20%

发文量

179

审稿时长

4-8 weeks

期刊介绍： The primary aims of Computer Methods in Biomechanics and Biomedical Engineering are to provide a means of communicating the advances being made in the areas of biomechanics and biomedical engineering and to stimulate interest in the continually emerging computer based technologies which are being applied in these multidisciplinary subjects. Computer Methods in Biomechanics and Biomedical Engineering will also provide a focus for the importance of integrating the disciplines of engineering with medical technology and clinical expertise. Such integration will have a major impact on health care in the future.