Budi Triandi, H. Mawengkang, S. Efendi, Syawaluddin
{"title":"深度学习背景下语音情感识别技术的发展","authors":"Budi Triandi, H. Mawengkang, S. Efendi, Syawaluddin","doi":"10.1109/ICORIS56080.2022.10031385","DOIUrl":null,"url":null,"abstract":"Understanding emotions from voice signals is a crucial yet difficult aspect of human-computer interaction (HCI). Many methods, including many well-known speech analysis and classification methods, have been used to extract emotions from signals in the literature on speech emotion recognition (SER). Deep learning methodologies have recently been presented as a replacement to traditional SER procedures. This paper presents an overview of deep learning algorithms for speech-based emotion recognition and evaluates some recent work that makes use of these methods. The review discusses the databases used, the emotions retrieved, the advancements made in voice emotion recognition, and its limits.","PeriodicalId":138054,"journal":{"name":"2022 4th International Conference on Cybernetics and Intelligent System (ICORIS)","volume":"33 7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of Techniques for Speech Emotion Recognition (SER) In the Context of Deep Learning\",\"authors\":\"Budi Triandi, H. Mawengkang, S. Efendi, Syawaluddin\",\"doi\":\"10.1109/ICORIS56080.2022.10031385\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Understanding emotions from voice signals is a crucial yet difficult aspect of human-computer interaction (HCI). Many methods, including many well-known speech analysis and classification methods, have been used to extract emotions from signals in the literature on speech emotion recognition (SER). Deep learning methodologies have recently been presented as a replacement to traditional SER procedures. This paper presents an overview of deep learning algorithms for speech-based emotion recognition and evaluates some recent work that makes use of these methods. The review discusses the databases used, the emotions retrieved, the advancements made in voice emotion recognition, and its limits.\",\"PeriodicalId\":138054,\"journal\":{\"name\":\"2022 4th International Conference on Cybernetics and Intelligent System (ICORIS)\",\"volume\":\"33 7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 4th International Conference on Cybernetics and Intelligent System (ICORIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICORIS56080.2022.10031385\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Cybernetics and Intelligent System (ICORIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICORIS56080.2022.10031385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Development of Techniques for Speech Emotion Recognition (SER) In the Context of Deep Learning
Understanding emotions from voice signals is a crucial yet difficult aspect of human-computer interaction (HCI). Many methods, including many well-known speech analysis and classification methods, have been used to extract emotions from signals in the literature on speech emotion recognition (SER). Deep learning methodologies have recently been presented as a replacement to traditional SER procedures. This paper presents an overview of deep learning algorithms for speech-based emotion recognition and evaluates some recent work that makes use of these methods. The review discusses the databases used, the emotions retrieved, the advancements made in voice emotion recognition, and its limits.