Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine

Chengfu Yang, X. Pu, Xiaobin Wang
{"title":"Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine","authors":"Chengfu Yang, X. Pu, Xiaobin Wang","doi":"10.1109/RAMECH.2008.4681444","DOIUrl":null,"url":null,"abstract":"An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.","PeriodicalId":320560,"journal":{"name":"2008 IEEE Conference on Robotics, Automation and Mechatronics","volume":"447 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Conference on Robotics, Automation and Mechatronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAMECH.2008.4681444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.
基于多面近端支持向量机的高效语音情感识别
提出了一种基于多面近端支持向量机(MPSVM)的语音情感识别方法。七种主要的人类情绪,包括愤怒、无聊、厌恶、恐惧/焦虑、快乐、中性、悲伤,使用倒谱和谱特征进行了调查。这些新颖的鲁棒声学特征和基于高斯混合模型(GMM)的多面近端支持向量机分类器可以得到更准确的结果。为了获得语音情感空间的正常特征,使用柏林情感语音数据库的语料库对系统进行训练,并使用2名非专业说话者记录的英语、法语、斯洛文尼亚语和西班牙语的简单语音情感语料库对分类器进行测试。将MPSVM的分类结果与标准支持向量机(SSVM)分类器的分类结果进行比较。获得了更高效、更准确的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信