{"title":"基于深度学习和核非线性PSVM的语音情感识别","authors":"Zhiyan Han, Jian Wang","doi":"10.1109/CCDC.2019.8832414","DOIUrl":null,"url":null,"abstract":"For the sake of ameliorating the precision of speech emotion recognition, this paper put forward a new emotion recognition technique based on Deep Learning and Kernel Nonlinear PSVM (Proximal Support Vector Machine) to discern four fundamental human emotion (angry, joy, sadness, surprise). First of all, preprocess speech signal. And then use DBN (Deep Belief Networks) to extract emotional features in speech signal automatically. After that, integrate the DBN automatic features and traditional features (prosody features and quality features) as the total features. Finally, use six Nonlinear Proximal Support Vector Machines to recognize the emotion and use majority voting principle to obtain the final identification result. To assess the new method, we compare the total features, DBN automatic features and traditional features. The experimental results indicate that the total features are better than the other two methods.","PeriodicalId":254705,"journal":{"name":"2019 Chinese Control And Decision Conference (CCDC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Speech Emotion Recognition Based on Deep Learning and Kernel Nonlinear PSVM\",\"authors\":\"Zhiyan Han, Jian Wang\",\"doi\":\"10.1109/CCDC.2019.8832414\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For the sake of ameliorating the precision of speech emotion recognition, this paper put forward a new emotion recognition technique based on Deep Learning and Kernel Nonlinear PSVM (Proximal Support Vector Machine) to discern four fundamental human emotion (angry, joy, sadness, surprise). First of all, preprocess speech signal. And then use DBN (Deep Belief Networks) to extract emotional features in speech signal automatically. After that, integrate the DBN automatic features and traditional features (prosody features and quality features) as the total features. Finally, use six Nonlinear Proximal Support Vector Machines to recognize the emotion and use majority voting principle to obtain the final identification result. To assess the new method, we compare the total features, DBN automatic features and traditional features. The experimental results indicate that the total features are better than the other two methods.\",\"PeriodicalId\":254705,\"journal\":{\"name\":\"2019 Chinese Control And Decision Conference (CCDC)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Chinese Control And Decision Conference (CCDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCDC.2019.8832414\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Chinese Control And Decision Conference (CCDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCDC.2019.8832414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Emotion Recognition Based on Deep Learning and Kernel Nonlinear PSVM
For the sake of ameliorating the precision of speech emotion recognition, this paper put forward a new emotion recognition technique based on Deep Learning and Kernel Nonlinear PSVM (Proximal Support Vector Machine) to discern four fundamental human emotion (angry, joy, sadness, surprise). First of all, preprocess speech signal. And then use DBN (Deep Belief Networks) to extract emotional features in speech signal automatically. After that, integrate the DBN automatic features and traditional features (prosody features and quality features) as the total features. Finally, use six Nonlinear Proximal Support Vector Machines to recognize the emotion and use majority voting principle to obtain the final identification result. To assess the new method, we compare the total features, DBN automatic features and traditional features. The experimental results indicate that the total features are better than the other two methods.