Abdulbasit K. Al-Talabani, H. Sellahewa, S. Jassim
{"title":"Excitation source and low level descriptor features fusion for emotion recognition using SVM and ANN","authors":"Abdulbasit K. Al-Talabani, H. Sellahewa, S. Jassim","doi":"10.1109/CEEC.2013.6659464","DOIUrl":null,"url":null,"abstract":"Emotion recognition is a challenging task with many applications in healthcare and human-machine interaction. In this study we propose to fuse two sets of features for emotion recognition at the classification level. A set of features that includes LPCC and MFCC extracted from LP-residual samples and Wavelet Octave Coefficient Of Residual (WOCOR) is proposed in this study as excitation source features. The proposed set of features is fused with 6552 LLDs using SVM and ANN classifiers. The experiments are tested on a newly acquired emotional speech database in Kurdish language, the Berlin emotional speech database, and the Aibo database. The experiments demonstrate that the fusion of the proposed excitation source features with the common LLDs can achieve better recognition accuracies than what is reported in the state-of-the-art studies.","PeriodicalId":309053,"journal":{"name":"2013 5th Computer Science and Electronic Engineering Conference (CEEC)","volume":"109 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 5th Computer Science and Electronic Engineering Conference (CEEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEEC.2013.6659464","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Emotion recognition is a challenging task with many applications in healthcare and human-machine interaction. In this study we propose to fuse two sets of features for emotion recognition at the classification level. A set of features that includes LPCC and MFCC extracted from LP-residual samples and Wavelet Octave Coefficient Of Residual (WOCOR) is proposed in this study as excitation source features. The proposed set of features is fused with 6552 LLDs using SVM and ANN classifiers. The experiments are tested on a newly acquired emotional speech database in Kurdish language, the Berlin emotional speech database, and the Aibo database. The experiments demonstrate that the fusion of the proposed excitation source features with the common LLDs can achieve better recognition accuracies than what is reported in the state-of-the-art studies.