{"title":"基于多面近端支持向量机的高效语音情感识别","authors":"Chengfu Yang, X. Pu, Xiaobin Wang","doi":"10.1109/RAMECH.2008.4681444","DOIUrl":null,"url":null,"abstract":"An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.","PeriodicalId":320560,"journal":{"name":"2008 IEEE Conference on Robotics, Automation and Mechatronics","volume":"447 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine\",\"authors\":\"Chengfu Yang, X. Pu, Xiaobin Wang\",\"doi\":\"10.1109/RAMECH.2008.4681444\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.\",\"PeriodicalId\":320560,\"journal\":{\"name\":\"2008 IEEE Conference on Robotics, Automation and Mechatronics\",\"volume\":\"447 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Conference on Robotics, Automation and Mechatronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAMECH.2008.4681444\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Conference on Robotics, Automation and Mechatronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAMECH.2008.4681444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Efficient Speech Emotion Recognition Based on Multisurface Proximal Support Vector Machine
An efficient speech emotion recognition method based on Multisurface Proximal Support Vector Machine (MPSVM) is presented in this paper. Seven primary human emotions including anger, boredom, disgust, fear/anxiety, happiness, neutral, sadness are investigated using cepstral and spectral features. These novel and robust acoustic features and the multisurface proximal support vector machine classifier based on the Gaussian Mixture Models (GMM) are proposed to yield more correct result. In order to get the normal features in speech emotion space, the corpus of Berlin database of emotional speech is used to train the system, and a simple speech emotion corpus in English, French, Slovenian and Spanish recorded by 2 non-professional speakers are used to test the classifiers. The results achieved by MPSVM are compared by that of the standard support vector machine (SSVM) classifier. The more efficient and more accurate results are achieved.