语音编码对情绪感知影响的主观评价

Felix Labelle, R. Lefebvre, P. Gournay
{"title":"语音编码对情绪感知影响的主观评价","authors":"Felix Labelle, R. Lefebvre, P. Gournay","doi":"10.1109/ISPACS.2016.7824685","DOIUrl":null,"url":null,"abstract":"The accuracy of the reproduction of emotions by speech coders has only recently been identified as a relevant issue. Several published studies have shown that speech compression reduces the accuracy of emotions classification. These studies, however, were all conducted using objective evaluation methods that involve an automatic classifier. The only definitive way to prove or disprove that the emotional content of a speech signal is degraded by compression operations is by testing it with human subjects. This paper proposes a subjective evaluation method and applies it to emotional speech coded by the AMR-WB speech coder at 6.6 and 12.65 kbps. The results confirm that there is a significant degradation in the perception of emotions by human listeners at both bit rates. The proposed evaluation method, and the insight provided by the results, could be useful in developing new speech coders that better preserve the emotional content of speech signals.","PeriodicalId":131543,"journal":{"name":"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A subjective evaluation of the effects of speech coding on the perception of emotions\",\"authors\":\"Felix Labelle, R. Lefebvre, P. Gournay\",\"doi\":\"10.1109/ISPACS.2016.7824685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The accuracy of the reproduction of emotions by speech coders has only recently been identified as a relevant issue. Several published studies have shown that speech compression reduces the accuracy of emotions classification. These studies, however, were all conducted using objective evaluation methods that involve an automatic classifier. The only definitive way to prove or disprove that the emotional content of a speech signal is degraded by compression operations is by testing it with human subjects. This paper proposes a subjective evaluation method and applies it to emotional speech coded by the AMR-WB speech coder at 6.6 and 12.65 kbps. The results confirm that there is a significant degradation in the perception of emotions by human listeners at both bit rates. The proposed evaluation method, and the insight provided by the results, could be useful in developing new speech coders that better preserve the emotional content of speech signals.\",\"PeriodicalId\":131543,\"journal\":{\"name\":\"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPACS.2016.7824685\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS.2016.7824685","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

语音编码器复制情绪的准确性直到最近才被确定为一个相关问题。一些已发表的研究表明,语音压缩降低了情绪分类的准确性。然而,这些研究都是使用涉及自动分类器的客观评估方法进行的。证明或反驳语音信号的情感内容被压缩操作降低的唯一明确方法是用人类受试者进行测试。本文提出了一种主观评价方法,并将其应用于AMR-WB语音编码器以6.6和12.65 kbps编码的情绪语音。结果证实,在两种比特率下,人类听众对情绪的感知都有明显的下降。所提出的评估方法和结果提供的见解可能有助于开发新的语音编码器,更好地保留语音信号的情感内容。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A subjective evaluation of the effects of speech coding on the perception of emotions
The accuracy of the reproduction of emotions by speech coders has only recently been identified as a relevant issue. Several published studies have shown that speech compression reduces the accuracy of emotions classification. These studies, however, were all conducted using objective evaluation methods that involve an automatic classifier. The only definitive way to prove or disprove that the emotional content of a speech signal is degraded by compression operations is by testing it with human subjects. This paper proposes a subjective evaluation method and applies it to emotional speech coded by the AMR-WB speech coder at 6.6 and 12.65 kbps. The results confirm that there is a significant degradation in the perception of emotions by human listeners at both bit rates. The proposed evaluation method, and the insight provided by the results, could be useful in developing new speech coders that better preserve the emotional content of speech signals.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信