Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen
{"title":"客观语音质量评价中的分词影响评估","authors":"Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen","doi":"10.1109/WASPAA52581.2021.9632785","DOIUrl":null,"url":null,"abstract":"Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.","PeriodicalId":429900,"journal":{"name":"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing Segmental Impact for Objective Speech Quality Evaluation\",\"authors\":\"Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen\",\"doi\":\"10.1109/WASPAA52581.2021.9632785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.\",\"PeriodicalId\":429900,\"journal\":{\"name\":\"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WASPAA52581.2021.9632785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WASPAA52581.2021.9632785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Assessing Segmental Impact for Objective Speech Quality Evaluation
Accurately predicting speech quality is important for the design of new speech coding and processing algorithms to improve speech communication. Existing speech quality metrics are computed with all speech segments, and do not consider the contributions of various speech segments for quality evaluation. The present work utilized a speech-level based segmentation method to separate a speech signal into high-, middle- and low-level regions, and computed the quality measures only with selected speech segments. Subjective speech quality rating data from 120 noise-masked/suppressed conditions (processed by 14 single-channel noise-suppression algorithms) were correlated with the objective speech quality indices. Results showed that compared with the conventional implementation with all speech segments, using middle-level speech segments to compute speech quality index could yield an improved correlation coefficient in predicting subjective quality ratings for most quality measures, particularly for the measure of output signal-to-noise ratio. The findings of the present work may provide a new scheme to improve the performance of objective speech quality assessment based on the segmental contributions of speech signals.