{"title":"Affective Computing for Social Companion Robots Using Fine-grained Speech Emotion Recognition","authors":"Saransh Ahuja, Amir Shabani","doi":"10.1109/cai54212.2023.00146","DOIUrl":null,"url":null,"abstract":"The increasing demand and diverse applications for social companion robots necessitate the development of more engaging and meaningful human-robot interactions and hence affective computing or emotion Al. In this paper, we propose a fine-grained speech emotion recognition using a state-of-the-art Deep Convolutional Neural Network trained on three-channel representations of speech signals to classify each emotion and also their intensity level. Experimental results on a publicly available dataset with intensity level (RAVEDESS) show that our method can effectively predict the users emotion and their intensity with 95.85±1.38% accuracy, a promising results towards empowering companion robots to be more affective and potentially be helpful in emotion regulations of their users.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Conference on Artificial Intelligence (CAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cai54212.2023.00146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The increasing demand and diverse applications for social companion robots necessitate the development of more engaging and meaningful human-robot interactions and hence affective computing or emotion Al. In this paper, we propose a fine-grained speech emotion recognition using a state-of-the-art Deep Convolutional Neural Network trained on three-channel representations of speech signals to classify each emotion and also their intensity level. Experimental results on a publicly available dataset with intensity level (RAVEDESS) show that our method can effectively predict the users emotion and their intensity with 95.85±1.38% accuracy, a promising results towards empowering companion robots to be more affective and potentially be helpful in emotion regulations of their users.