{"title":"New Trends in Speech Emotion Recognition","authors":"Yesim Ulgen Sonmez, A. Varol","doi":"10.1109/ISDFS.2019.8757528","DOIUrl":null,"url":null,"abstract":"In this study, sound energy and characteristics of sound were investigated. Then, emotion recognition models built upon sound data in the literature were reviewed. Speech emotion recognition studies which adopt the most suitable machine-learning algorithms making feature extraction using both acoustic analysis methods and spectrogram analysis methods were investigated. In light of these studies, implementation has been carried out using EMO-DB data. Speech emotion recognition is a difficult problem for machine learning. The analysis of a sound signal is difficult to make as it includes various frequencies and features. Speech is digitized using signal processing methods and then sound characteristics are obtained through acoustic analysis. However, the overall success rate changes as the changes in these characteristics differ according to the emotions (sadness, fear, anger, happiness, neutral, displeasure, etc.). Although different methods are utilized in both feature extraction and emotion recognition, the success rate varies according to emotions and databases.","PeriodicalId":247412,"journal":{"name":"2019 7th International Symposium on Digital Forensics and Security (ISDFS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 7th International Symposium on Digital Forensics and Security (ISDFS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDFS.2019.8757528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
In this study, sound energy and characteristics of sound were investigated. Then, emotion recognition models built upon sound data in the literature were reviewed. Speech emotion recognition studies which adopt the most suitable machine-learning algorithms making feature extraction using both acoustic analysis methods and spectrogram analysis methods were investigated. In light of these studies, implementation has been carried out using EMO-DB data. Speech emotion recognition is a difficult problem for machine learning. The analysis of a sound signal is difficult to make as it includes various frequencies and features. Speech is digitized using signal processing methods and then sound characteristics are obtained through acoustic analysis. However, the overall success rate changes as the changes in these characteristics differ according to the emotions (sadness, fear, anger, happiness, neutral, displeasure, etc.). Although different methods are utilized in both feature extraction and emotion recognition, the success rate varies according to emotions and databases.