Comparison of Artificial Neural Network and Gaussian Mixture Model Based Machine Learning Techniques Using DDMFCC Vectors for Emotion Recognition in Kannada
{"title":"Comparison of Artificial Neural Network and Gaussian Mixture Model Based Machine Learning Techniques Using DDMFCC Vectors for Emotion Recognition in Kannada","authors":"Prashanth Kannadaguli, Vidya Bhat","doi":"10.1109/IEMENTech48150.2019.8981386","DOIUrl":null,"url":null,"abstract":"We build an emotion recognition system based on Artificial Neural Network (ANN) and compare the same with the one based upon the Gaussian Mixture Modeling (GMM) scheme. Both the systems were built upon probabilistic pattern recognition and acoustic phonetic modelling approaches. Since our native language is Kannada, one of the very rich Indian language, we have used words uttered in Kannada to train and test the schemes. Since Mel Frequency Cepstral Coefficients (MFCC) are well known acoustic features of speech [1] [2] [4], we have used the Delta MFCC and the Double Delta MFCC vectors in speech feature extraction. Finally, performance analysis of these models in terms of Emotion Error Rate (EER) justifies the fact that modeling using the ANN yields better results over other modeling schemes and can be used in developing Automatic Emotion Recognition systems.","PeriodicalId":243805,"journal":{"name":"2019 3rd International Conference on Electronics, Materials Engineering & Nano-Technology (IEMENTech)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Conference on Electronics, Materials Engineering & Nano-Technology (IEMENTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMENTech48150.2019.8981386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
We build an emotion recognition system based on Artificial Neural Network (ANN) and compare the same with the one based upon the Gaussian Mixture Modeling (GMM) scheme. Both the systems were built upon probabilistic pattern recognition and acoustic phonetic modelling approaches. Since our native language is Kannada, one of the very rich Indian language, we have used words uttered in Kannada to train and test the schemes. Since Mel Frequency Cepstral Coefficients (MFCC) are well known acoustic features of speech [1] [2] [4], we have used the Delta MFCC and the Double Delta MFCC vectors in speech feature extraction. Finally, performance analysis of these models in terms of Emotion Error Rate (EER) justifies the fact that modeling using the ANN yields better results over other modeling schemes and can be used in developing Automatic Emotion Recognition systems.