客观语音质量评价中的分词影响评估

Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen
{"title":"客观语音质量评价中的分词影响评估","authors":"Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen","doi":"10.1109/WASPAA52581.2021.9632785","DOIUrl":null,"url":null,"abstract":"Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.","PeriodicalId":429900,"journal":{"name":"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing Segmental Impact for Objective Speech Quality Evaluation\",\"authors\":\"Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen\",\"doi\":\"10.1109/WASPAA52581.2021.9632785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.\",\"PeriodicalId\":429900,\"journal\":{\"name\":\"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WASPAA52581.2021.9632785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WASPAA52581.2021.9632785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

准确预测语音质量对于设计新的语音编码和处理算法以改善语音通信具有重要意义。现有的语音质量度量是用所有语音段计算的,没有考虑不同语音段对质量评估的贡献。本研究利用基于语音水平的分割方法,将语音信号划分为高、中、低水平区域,并仅对选定的语音片段进行质量度量计算。主观语音质量评级数据来自120种噪声屏蔽/抑制条件(经过14种单通道噪声抑制算法处理),与客观语音质量指标相关。结果表明,与传统的全语音段实现相比,使用中级语音段计算语音质量指数在预测大多数质量度量的主观质量评级方面,尤其是在输出信噪比度量方面,可以获得更高的相关系数。本文的研究结果可以为基于语音信号的分段贡献来提高客观语音质量评估的性能提供一种新的方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Assessing Segmental Impact for Objective Speech Quality Evaluation
Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信