基于多面近端支持向量机的高效语音情感识别

2008 IEEE Conference on Robotics, Automation and Mechatronics Pub Date : 2008-11-18 DOI:10.1109/RAMECH.2008.4681444

Chengfu Yang, X. Pu, Xiaobin Wang

{"title":"基于多面近端支持向量机的高效语音情感识别","authors":"Chengfu Yang, X. Pu, Xiaobin Wang","doi":"10.1109/RAMECH.2008.4681444","DOIUrl":null,"url":null,"abstract":"An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.","PeriodicalId":320560,"journal":{"name":"2008 IEEE Conference on Robotics, Automation and Mechatronics","volume":"447 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine\",\"authors\":\"Chengfu Yang, X. Pu, Xiaobin Wang\",\"doi\":\"10.1109/RAMECH.2008.4681444\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.\",\"PeriodicalId\":320560,\"journal\":{\"name\":\"2008 IEEE Conference on Robotics, Automation and Mechatronics\",\"volume\":\"447 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Conference on Robotics, Automation and Mechatronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAMECH.2008.4681444\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Conference on Robotics, Automation and Mechatronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAMECH.2008.4681444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

提出了一种基于多面近端支持向量机(MPSVM)的语音情感识别方法。七种主要的人类情绪，包括愤怒、无聊、厌恶、恐惧/焦虑、快乐、中性、悲伤，使用倒谱和谱特征进行了调查。这些新颖的鲁棒声学特征和基于高斯混合模型(GMM)的多面近端支持向量机分类器可以得到更准确的结果。为了获得语音情感空间的正常特征，使用柏林情感语音数据库的语料库对系统进行训练，并使用2名非专业说话者记录的英语、法语、斯洛文尼亚语和西班牙语的简单语音情感语料库对分类器进行测试。将MPSVM的分类结果与标准支持向量机(SSVM)分类器的分类结果进行比较。获得了更高效、更准确的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine

An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 IEEE Conference on Robotics, Automation and Mechatronics

自引率

0.00%

发文量