Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine

2008 IEEE Conference on Robotics, Automation and Mechatronics Pub Date : 2008-11-18 DOI:10.1109/RAMECH.2008.4681444

Chengfu Yang, X. Pu, Xiaobin Wang

引用次数: 2

Abstract

An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.

查看原文本刊更多论文

基于多面近端支持向量机的高效语音情感识别

提出了一种基于多面近端支持向量机(MPSVM)的语音情感识别方法。七种主要的人类情绪，包括愤怒、无聊、厌恶、恐惧/焦虑、快乐、中性、悲伤，使用倒谱和谱特征进行了调查。这些新颖的鲁棒声学特征和基于高斯混合模型(GMM)的多面近端支持向量机分类器可以得到更准确的结果。为了获得语音情感空间的正常特征，使用柏林情感语音数据库的语料库对系统进行训练，并使用2名非专业说话者记录的英语、法语、斯洛文尼亚语和西班牙语的简单语音情感语料库对分类器进行测试。将MPSVM的分类结果与标准支持向量机(SSVM)分类器的分类结果进行比较。获得了更高效、更准确的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 IEEE Conference on Robotics, Automation and Mechatronics

自引率

0.00%

发文量