Raseeda Hamzah, N. Jamil, K. A. Samah, Nur Nabilah Abu Mangshor, Nurbaity Sabri, Rosniza Roslan
{"title":"Comparing statistical classifiers for emotion classification","authors":"Raseeda Hamzah, N. Jamil, K. A. Samah, Nur Nabilah Abu Mangshor, Nurbaity Sabri, Rosniza Roslan","doi":"10.1109/ICSENGT.2017.8123443","DOIUrl":null,"url":null,"abstract":"Speech emotion recognition has been widely used in human computer interaction and applications. This paper has classified emotion into two classes: happy and angry. All the speech signal is preprocessed from Malay spoken speech database. Emotional information is obtained by applying two well-established acoustical features that are Mel Frequency Cepstral Coefficients (MFCC) and Short Time Energy (STE). The performance of the classification is done by comparing four types of classifiers which are Naïve Bayes, Multi-Layer Perceptron (MLP), C4.5 and Random Forest. Result shows that Random Forest has achieved the highest accuracy of ∼90% exceeding C4.5, Multilayer Perceptron (MLP) and Naïve Bayes. Naïve Bayes shows the lowest score of ∼76% accuracy.","PeriodicalId":350572,"journal":{"name":"2017 7th IEEE International Conference on System Engineering and Technology (ICSET)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 7th IEEE International Conference on System Engineering and Technology (ICSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSENGT.2017.8123443","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Speech emotion recognition has been widely used in human computer interaction and applications. This paper has classified emotion into two classes: happy and angry. All the speech signal is preprocessed from Malay spoken speech database. Emotional information is obtained by applying two well-established acoustical features that are Mel Frequency Cepstral Coefficients (MFCC) and Short Time Energy (STE). The performance of the classification is done by comparing four types of classifiers which are Naïve Bayes, Multi-Layer Perceptron (MLP), C4.5 and Random Forest. Result shows that Random Forest has achieved the highest accuracy of ∼90% exceeding C4.5, Multilayer Perceptron (MLP) and Naïve Bayes. Naïve Bayes shows the lowest score of ∼76% accuracy.