{"title":"语音情感识别的高级特征分析","authors":"H. Atassi, A. Esposito, Z. Smékal","doi":"10.1109/TSP.2011.6043708","DOIUrl":null,"url":null,"abstract":"The paper deals with the vocal emotion recognition which is a very important task for several applications in the field of human-machine interaction. There is a plenty of algorithms proposed up to date for this purpose that exploit different types of features and classifiers. Our previous work showed that high-level features perform very well in terms of emotion classification from speech. However, little attention has been paid so far to the statistical analysis of these features. For this reason the presented paper mainly focuses on the emotion recognition by using only high-level features. Two different emotional speech corpora were exploited in our experiments, namely the Berlin Database of Emotional Speech and the COST2102 Italian Database of Emotional Speech. Results showed that the best high-level features in terms of high discriminative power strongly differ among the databases considered on the first hand and among the emotions within each database on the second hand.","PeriodicalId":341695,"journal":{"name":"2011 34th International Conference on Telecommunications and Signal Processing (TSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Analysis of high-level features for vocal emotion recognition\",\"authors\":\"H. Atassi, A. Esposito, Z. Smékal\",\"doi\":\"10.1109/TSP.2011.6043708\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper deals with the vocal emotion recognition which is a very important task for several applications in the field of human-machine interaction. There is a plenty of algorithms proposed up to date for this purpose that exploit different types of features and classifiers. Our previous work showed that high-level features perform very well in terms of emotion classification from speech. However, little attention has been paid so far to the statistical analysis of these features. For this reason the presented paper mainly focuses on the emotion recognition by using only high-level features. Two different emotional speech corpora were exploited in our experiments, namely the Berlin Database of Emotional Speech and the COST2102 Italian Database of Emotional Speech. Results showed that the best high-level features in terms of high discriminative power strongly differ among the databases considered on the first hand and among the emotions within each database on the second hand.\",\"PeriodicalId\":341695,\"journal\":{\"name\":\"2011 34th International Conference on Telecommunications and Signal Processing (TSP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 34th International Conference on Telecommunications and Signal Processing (TSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSP.2011.6043708\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 34th International Conference on Telecommunications and Signal Processing (TSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSP.2011.6043708","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of high-level features for vocal emotion recognition
The paper deals with the vocal emotion recognition which is a very important task for several applications in the field of human-machine interaction. There is a plenty of algorithms proposed up to date for this purpose that exploit different types of features and classifiers. Our previous work showed that high-level features perform very well in terms of emotion classification from speech. However, little attention has been paid so far to the statistical analysis of these features. For this reason the presented paper mainly focuses on the emotion recognition by using only high-level features. Two different emotional speech corpora were exploited in our experiments, namely the Berlin Database of Emotional Speech and the COST2102 Italian Database of Emotional Speech. Results showed that the best high-level features in terms of high discriminative power strongly differ among the databases considered on the first hand and among the emotions within each database on the second hand.