{"title":"Influence of Silence and Noise Filtering on Speech Quality Monitoring","authors":"R. Jaiswal","doi":"10.1109/sped53181.2021.9587364","DOIUrl":null,"url":null,"abstract":"With the exponential increase of mobile users and internet subscribers, the utilization of voice over internet protocol (VoIP) application is increasing dramatically. People exploit different VoIP applications for effective communication, for example, Google Meet, Microsoft Skype, Zoom video conferencing applications, etc. The single-ended speech quality metrics are employed for measuring and monitoring the quality of speech. However, different types of degradations present in the surroundings distort the quality of speech. In order to meet the desired quality of experience (QoE) level of end-user while using VoIP applications, it is necessary to reduce VoIP degradations and obtain the optimized speech quality. Along that line, this paper investigates the conjunction of filtering of silence and noise as a pre-processing block with the single-ended speech quality metric under various common occurring degradations encountered during VoIP communication. This can help the internet service providers in understanding the potential root cause of decrement in quality of speech and then applying the QoE management service to fulfill desired human QoE level. Results demonstrate that the deployment of joint pre-processing on speech samples under various VoIP degradations improves the quality of speech to a great extent.","PeriodicalId":193702,"journal":{"name":"2021 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/sped53181.2021.9587364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
With the exponential increase of mobile users and internet subscribers, the utilization of voice over internet protocol (VoIP) application is increasing dramatically. People exploit different VoIP applications for effective communication, for example, Google Meet, Microsoft Skype, Zoom video conferencing applications, etc. The single-ended speech quality metrics are employed for measuring and monitoring the quality of speech. However, different types of degradations present in the surroundings distort the quality of speech. In order to meet the desired quality of experience (QoE) level of end-user while using VoIP applications, it is necessary to reduce VoIP degradations and obtain the optimized speech quality. Along that line, this paper investigates the conjunction of filtering of silence and noise as a pre-processing block with the single-ended speech quality metric under various common occurring degradations encountered during VoIP communication. This can help the internet service providers in understanding the potential root cause of decrement in quality of speech and then applying the QoE management service to fulfill desired human QoE level. Results demonstrate that the deployment of joint pre-processing on speech samples under various VoIP degradations improves the quality of speech to a great extent.
随着移动用户和互联网用户的指数级增长,VoIP (voice over internet protocol)应用的使用率急剧上升。人们利用不同的VoIP应用程序进行有效的通信,例如,Google Meet, Microsoft Skype, Zoom视频会议应用程序等。采用单端语音质量指标对语音质量进行测量和监控。然而,周围环境中存在的不同类型的退化会扭曲语音质量。为了满足终端用户在使用VoIP应用时所期望的QoE (quality of experience)水平,有必要减少VoIP的降级,从而获得最优的语音质量。沿着这条线,本文研究了在VoIP通信中遇到的各种常见退化情况下,沉默和噪声滤波作为预处理块与单端语音质量度量的结合。这可以帮助互联网服务提供商了解语音质量下降的潜在根本原因,然后应用质量质量管理服务来满足期望的人类质量质量水平。结果表明,在各种VoIP降级情况下对语音样本进行联合预处理,在很大程度上提高了语音质量。