{"title":"基于多尺度滑动窗口的情绪变化自动检测","authors":"Yuchao Fan, Mingxing Xu, Zhiyong Wu, Lianhong Cai","doi":"10.1109/ICOT.2014.6956642","DOIUrl":null,"url":null,"abstract":"Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotional salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window-shift by fusion decisions of all the sliding windows containing the shift. Firstly, sliding window with fixed-length is introduced as the basic procedure, in which several different fusion methods are investigated. Then multi-scaled sliding window is employed to support multi-classifiers with different timescale features, in which another two fusion strategies are provided. Finally, a postprocessing is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automatic emotion variation detection using multi-scaled sliding window\",\"authors\":\"Yuchao Fan, Mingxing Xu, Zhiyong Wu, Lianhong Cai\",\"doi\":\"10.1109/ICOT.2014.6956642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotional salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window-shift by fusion decisions of all the sliding windows containing the shift. Firstly, sliding window with fixed-length is introduced as the basic procedure, in which several different fusion methods are investigated. Then multi-scaled sliding window is employed to support multi-classifiers with different timescale features, in which another two fusion strategies are provided. Finally, a postprocessing is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.\",\"PeriodicalId\":343641,\"journal\":{\"name\":\"2014 International Conference on Orange Technologies\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Orange Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOT.2014.6956642\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Orange Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOT.2014.6956642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic emotion variation detection using multi-scaled sliding window
Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotional salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window-shift by fusion decisions of all the sliding windows containing the shift. Firstly, sliding window with fixed-length is introduced as the basic procedure, in which several different fusion methods are investigated. Then multi-scaled sliding window is employed to support multi-classifiers with different timescale features, in which another two fusion strategies are provided. Finally, a postprocessing is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.