基于谱熵距离的含噪语音质量估计

Gabriel Mittag, S. Möller
{"title":"基于谱熵距离的含噪语音质量估计","authors":"Gabriel Mittag, S. Möller","doi":"10.1109/ICT.2019.8798783","DOIUrl":null,"url":null,"abstract":"In this paper, we propose to use spectral entropy distance as a new measure for objective quality estimations of noisy speech. While the perceived quality estimation of a transmitted speech signal under background noise is fairly straight forward, the estimation of noise on active speech is more complex. For example, an increase in loudness can be confused as noise by common quality measures. Also, other distortions, such as interruptions due to packet loss, can decrease the energy in the degraded signal and thus lead to an underestimation of the noisiness. This is especially critical when the noise is only present during active speech segments, as it is the case for quantization noise caused by low bitrate codecs or voice activity detections at the receiver side. The spectral entropy, however, only considers the frequency composition of a signal and does not depend on the signal energy. Therefore, it gives a robust measure of how noisy a signal is in the presence of active speech. In our experiments, we trained a prediction model based on the spectral entropy and obtained excellent prediction results that show that the spectral entropy distance is indeed a useful tool for the quality estimation of noisy speech.","PeriodicalId":127412,"journal":{"name":"2019 26th International Conference on Telecommunications (ICT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Quality Estimation of Noisy Speech Using Spectral Entropy Distance\",\"authors\":\"Gabriel Mittag, S. Möller\",\"doi\":\"10.1109/ICT.2019.8798783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose to use spectral entropy distance as a new measure for objective quality estimations of noisy speech. While the perceived quality estimation of a transmitted speech signal under background noise is fairly straight forward, the estimation of noise on active speech is more complex. For example, an increase in loudness can be confused as noise by common quality measures. Also, other distortions, such as interruptions due to packet loss, can decrease the energy in the degraded signal and thus lead to an underestimation of the noisiness. This is especially critical when the noise is only present during active speech segments, as it is the case for quantization noise caused by low bitrate codecs or voice activity detections at the receiver side. The spectral entropy, however, only considers the frequency composition of a signal and does not depend on the signal energy. Therefore, it gives a robust measure of how noisy a signal is in the presence of active speech. In our experiments, we trained a prediction model based on the spectral entropy and obtained excellent prediction results that show that the spectral entropy distance is indeed a useful tool for the quality estimation of noisy speech.\",\"PeriodicalId\":127412,\"journal\":{\"name\":\"2019 26th International Conference on Telecommunications (ICT)\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 26th International Conference on Telecommunications (ICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICT.2019.8798783\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 26th International Conference on Telecommunications (ICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICT.2019.8798783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文提出将谱熵距离作为噪声语音客观质量估计的一种新测度。背景噪声下传输语音信号的感知质量估计比较简单,而主动语音信号的感知质量估计则比较复杂。例如,通过普通的质量测量,声音的增加可能会被混淆为噪音。此外,其他失真,如由于丢包而导致的中断,可以减少降级信号中的能量,从而导致对噪声的低估。当噪声仅在活动语音段中存在时,这一点尤其重要,因为这是由低比特率编解码器或接收端语音活动检测引起的量化噪声的情况。而谱熵只考虑信号的频率组成,不依赖于信号的能量。因此,它提供了一种鲁棒的方法来衡量在主动语音存在的情况下信号的噪声。在我们的实验中,我们训练了一个基于谱熵的预测模型,并获得了良好的预测结果,表明谱熵距离确实是一个有用的工具,用于噪声语音的质量估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Quality Estimation of Noisy Speech Using Spectral Entropy Distance
In this paper, we propose to use spectral entropy distance as a new measure for objective quality estimations of noisy speech. While the perceived quality estimation of a transmitted speech signal under background noise is fairly straight forward, the estimation of noise on active speech is more complex. For example, an increase in loudness can be confused as noise by common quality measures. Also, other distortions, such as interruptions due to packet loss, can decrease the energy in the degraded signal and thus lead to an underestimation of the noisiness. This is especially critical when the noise is only present during active speech segments, as it is the case for quantization noise caused by low bitrate codecs or voice activity detections at the receiver side. The spectral entropy, however, only considers the frequency composition of a signal and does not depend on the signal energy. Therefore, it gives a robust measure of how noisy a signal is in the presence of active speech. In our experiments, we trained a prediction model based on the spectral entropy and obtained excellent prediction results that show that the spectral entropy distance is indeed a useful tool for the quality estimation of noisy speech.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信