A Speech Distortion Weighted Single-Channel Wiener Filter Based STFT-Domain Noise Reduction

2023 IEEE Statistical Signal Processing Workshop (SSP), Pub Date: 2023-07-02, DOI: 10.1109/SSP53291.2023.10208040

Jie Zhang, Rui Tao, Lirong Dai


Citations: 0

Abstract



In this work, we address single-channel noise reduction (NR) in the short-time Fourier transform (STFT) domain from a traditional signal processing perspective. Because conventional single-channel NR methods suffer from severe speech distortion (SD), we propose an SD-weighted single-channel Wiener filter (SDW-SWF), in which an auxiliary parameter µ trades off SD against residual noise variance. In the subspace, the resulting SDW-SWF can be expressed as a function of µ and a set of generalized eigenvectors of the correlation matrices. We also analyze theoretically how the trade-off factor and the rank affect the SD, the residual noise power, and the output signal-to-noise ratio (SNR). Numerical results validate the effectiveness of the proposed method and are consistent with the theoretical findings. We conclude that the SDW-SWF approach offers more degrees of freedom to improve speech intelligibility at the expense of SNR.
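The abstract does not state the filter in closed form. As a rough illustration only, the sketch below assumes the commonly used speech-distortion-weighted Wiener form h = (R_x + µ R_v)^{-1} R_x e_1, where R_x and R_v are speech and noise correlation matrices of a stacked vector of STFT coefficients and e_1 selects the desired coefficient. It then rewrites h as a rank-limited sum over generalized eigenvectors of (R_x, R_v). This may differ from the paper's exact formulation, and all names (`joint_diagonalize`, `sdw_filter`, `speech_distortion`, `residual_noise`) and the toy matrices are hypothetical.

```python
import numpy as np

def joint_diagonalize(Rx, Rv):
    """Return (lam, B) with B^H Rv B = I and B^H Rx B = diag(lam), lam descending."""
    L = np.linalg.cholesky(Rv)             # Rv = L L^H (Rv must be positive definite)
    Linv = np.linalg.inv(L)
    C = Linv @ Rx @ Linv.conj().T          # noise-whitened speech correlation matrix
    C = 0.5 * (C + C.conj().T)             # remove numerical asymmetry
    lam, U = np.linalg.eigh(C)             # eigenvalues in ascending order
    order = np.argsort(lam)[::-1]          # reorder to descending
    lam, U = lam[order], U[:, order]
    B = Linv.conj().T @ U                  # generalized eigenvectors of (Rx, Rv)
    return lam, B

def sdw_filter(Rx, Rv, mu=1.0, rank=None):
    """Assumed form: h = sum_{q<=Q} lam_q / (lam_q + mu) * b_q b_q^H Rv e_1."""
    lam, B = joint_diagonalize(Rx, Rv)
    Q = len(lam) if rank is None else rank
    e1 = np.zeros(Rx.shape[0])
    e1[0] = 1.0
    gains = lam[:Q] / (lam[:Q] + mu)       # per-component Wiener-like gains
    Bq = B[:, :Q]
    return Bq @ (gains * (Bq.conj().T @ Rv @ e1))

def speech_distortion(h, Rx):
    """Speech distortion (h - e_1)^H Rx (h - e_1)."""
    d = h.copy()
    d[0] -= 1.0
    return float(np.real(d.conj() @ Rx @ d))

def residual_noise(h, Rv):
    """Residual noise power h^H Rv h."""
    return float(np.real(h.conj() @ Rv @ h))

# Toy data (not from the paper): rank-2 speech matrix, full-rank noise matrix.
rng = np.random.default_rng(0)
N = 4
A = rng.standard_normal((N, 2)) + 1j * rng.standard_normal((N, 2))
Rx = A @ A.conj().T
G = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
Rv = G @ G.conj().T + 0.1 * np.eye(N)

for mu in (0.5, 1.0, 5.0):
    h = sdw_filter(Rx, Rv, mu=mu, rank=2)
    print(mu, speech_distortion(h, Rx), residual_noise(h, Rv))
```

Under these assumptions, raising `mu` shrinks every gain, so the residual noise power falls while the speech distortion grows, which matches the trade-off role of µ described in the abstract. The `rank` argument limits the sum to the leading generalized eigenvectors.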

