客观语音质量评价中的分词影响评估

2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) Pub Date : 2021-10-17 DOI:10.1109/WASPAA52581.2021.9632785

Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen

{"title":"客观语音质量评价中的分词影响评估","authors":"Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen","doi":"10.1109/WASPAA52581.2021.9632785","DOIUrl":null,"url":null,"abstract":"Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.","PeriodicalId":429900,"journal":{"name":"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing Segmental Impact for Objective Speech Quality Evaluation\",\"authors\":\"Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen\",\"doi\":\"10.1109/WASPAA52581.2021.9632785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.\",\"PeriodicalId\":429900,\"journal\":{\"name\":\"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WASPAA52581.2021.9632785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WASPAA52581.2021.9632785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

准确预测语音质量对于设计新的语音编码和处理算法以改善语音通信具有重要意义。现有的语音质量度量是用所有语音段计算的，没有考虑不同语音段对质量评估的贡献。本研究利用基于语音水平的分割方法，将语音信号划分为高、中、低水平区域，并仅对选定的语音片段进行质量度量计算。主观语音质量评级数据来自120种噪声屏蔽/抑制条件(经过14种单通道噪声抑制算法处理)，与客观语音质量指标相关。结果表明，与传统的全语音段实现相比，使用中级语音段计算语音质量指数在预测大多数质量度量的主观质量评级方面，尤其是在输出信噪比度量方面，可以获得更高的相关系数。本文的研究结果可以为基于语音信号的分段贡献来提高客观语音质量评估的性能提供一种新的方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Assessing Segmental Impact for Objective Speech Quality Evaluation

Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

自引率

0.00%

发文量