{"title":"基于梯度增强的语音实时情感识别","authors":"A. Iqbal, K. Barua","doi":"10.1109/ECACE.2019.8679271","DOIUrl":null,"url":null,"abstract":"Emotion recognition from speech is one of the research fields for emotional human-computer interaction. In this contribution, a real-time emotion recognition system is presented which recognizes emotions from live recorded speech by analyzing tonal properties. 34 audio features are extracted including MFCCs, energy, spectral entropy etc. Basically this system classifies emotions using models trained by Gradient Boosting. Other two classifiers such as Support Vector Machine (SVM) and K-Nearest Neighbor (KNN) are also applied to observe the accuracies of them on test audio files. Two databases are employed for training the system like RA VDESS and SA VEE database. This system examines four basic emotions - anger, happiness, sadness and neutral.","PeriodicalId":226060,"journal":{"name":"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"A Real-time Emotion Recognition from Speech using Gradient Boosting\",\"authors\":\"A. Iqbal, K. Barua\",\"doi\":\"10.1109/ECACE.2019.8679271\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Emotion recognition from speech is one of the research fields for emotional human-computer interaction. In this contribution, a real-time emotion recognition system is presented which recognizes emotions from live recorded speech by analyzing tonal properties. 34 audio features are extracted including MFCCs, energy, spectral entropy etc. Basically this system classifies emotions using models trained by Gradient Boosting. Other two classifiers such as Support Vector Machine (SVM) and K-Nearest Neighbor (KNN) are also applied to observe the accuracies of them on test audio files. Two databases are employed for training the system like RA VDESS and SA VEE database. This system examines four basic emotions - anger, happiness, sadness and neutral.\",\"PeriodicalId\":226060,\"journal\":{\"name\":\"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ECACE.2019.8679271\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECACE.2019.8679271","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Real-time Emotion Recognition from Speech using Gradient Boosting
Emotion recognition from speech is one of the research fields for emotional human-computer interaction. In this contribution, a real-time emotion recognition system is presented which recognizes emotions from live recorded speech by analyzing tonal properties. 34 audio features are extracted including MFCCs, energy, spectral entropy etc. Basically this system classifies emotions using models trained by Gradient Boosting. Other two classifiers such as Support Vector Machine (SVM) and K-Nearest Neighbor (KNN) are also applied to observe the accuracies of them on test audio files. Two databases are employed for training the system like RA VDESS and SA VEE database. This system examines four basic emotions - anger, happiness, sadness and neutral.