{"title":"Emotion Recognition In Emergency Call Centers: The challenge of real-life emotions","authors":"Théo Deschamps-Berger","doi":"10.1109/aciiw52867.2021.9666308","DOIUrl":null,"url":null,"abstract":"Detected emotional states of speakers are a key component of constructive social relationships but also of efficiency for capturing the degree of emergency. This paper provides an overview of my doctoral project that focuses on bimodal emotion recognition in an emergency call center with deep end-to-end learning techniques using the most advanced approaches such as transformer and zero-shot learning. In this work, we will first propose a supervised classification system for bimodal emotion recognition (paralinguistic and linguistic). Then, we will investigate an unsupervised system as a complement to the previous one in order to deal with “unseen” emotions and mixtures of real-life emotions. Our previous studies mainly explored the acoustic modality of speech emotion recognition (SER), we achieved close to the state-of-the-art results on the improvised part of the well-known database IEMOCAP and we applied our approach to a French emergency database CEMO collected in a previous project. In my thesis, new real recordings in an emergency call center will be collected. The main research topics of my thesis are: Emotional representation and annotation; Speech emotion recognition and ethical implications; Evaluation and real-life trials.","PeriodicalId":105376,"journal":{"name":"2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/aciiw52867.2021.9666308","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Detected emotional states of speakers are a key component of constructive social relationships but also of efficiency for capturing the degree of emergency. This paper provides an overview of my doctoral project that focuses on bimodal emotion recognition in an emergency call center with deep end-to-end learning techniques using the most advanced approaches such as transformer and zero-shot learning. In this work, we will first propose a supervised classification system for bimodal emotion recognition (paralinguistic and linguistic). Then, we will investigate an unsupervised system as a complement to the previous one in order to deal with “unseen” emotions and mixtures of real-life emotions. Our previous studies mainly explored the acoustic modality of speech emotion recognition (SER), we achieved close to the state-of-the-art results on the improvised part of the well-known database IEMOCAP and we applied our approach to a French emergency database CEMO collected in a previous project. In my thesis, new real recordings in an emergency call center will be collected. The main research topics of my thesis are: Emotional representation and annotation; Speech emotion recognition and ethical implications; Evaluation and real-life trials.