利用改进的频谱过减算法增强非平稳环境下语音的感知驱动平稳小波包滤波器组

2012 4th International Conference on Intelligent Human Computer Interaction (IHCI) Pub Date : 2012-12-01 DOI:10.1109/IHCI.2012.6481840

Navneet Upadhyay, A. Karmakar

{"title":"利用改进的频谱过减算法增强非平稳环境下语音的感知驱动平稳小波包滤波器组","authors":"Navneet Upadhyay, A. Karmakar","doi":"10.1109/IHCI.2012.6481840","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel speech enhancement approach for a single-microphone system to meet the demand of quality noise reduction algorithms. The proposed system incorporates a perceptually motivated stationary wavelet packet filter-bank (PM-SWPFB) and improved spectral over-subtraction (I-SOS) algorithm together to enhance the speech degraded by non-stationary or colored noise environment. The PM-SWPFB is obtained by adjusting the uniformly spaced stationary wavelet packet tree in order to most closely mimic the critical-bands of the psycho-acoustic model. The PM-SWPFB is, firstly, used to decompose the input noisy speech signal into nonuniform sub-bands. Then, I-SOS algorithm is used to estimate of speech from each sub-band. The I-SOS algorithm uses a new noise estimation approach, to estimate noise power from each sub-band without the need of explicit speech silence detection. The sub-band noise estimate is updated by adaptively smoothing the noisy signal power. The smoothing parameter is controlled by a function of the estimated signal-to-noise ratio (SNR). The performance of the proposed speech enhancement system is evaluated objectively by SNR, Itakura-Saito distortion measure and subjectively by informal listening test. The results confirm that the proposed speech enhancement system is capable of reducing noise with little speech degradation remains acceptable in real-world environments, and the overall performance is superior to several competitive methods.","PeriodicalId":107245,"journal":{"name":"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A perceptually motivated stationary wavelet packet filter-bank utilizing improved spectral over-subtraction algorithm for enhancing speech in non-stationary environments\",\"authors\":\"Navneet Upadhyay, A. Karmakar\",\"doi\":\"10.1109/IHCI.2012.6481840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a novel speech enhancement approach for a single-microphone system to meet the demand of quality noise reduction algorithms. The proposed system incorporates a perceptually motivated stationary wavelet packet filter-bank (PM-SWPFB) and improved spectral over-subtraction (I-SOS) algorithm together to enhance the speech degraded by non-stationary or colored noise environment. The PM-SWPFB is obtained by adjusting the uniformly spaced stationary wavelet packet tree in order to most closely mimic the critical-bands of the psycho-acoustic model. The PM-SWPFB is, firstly, used to decompose the input noisy speech signal into nonuniform sub-bands. Then, I-SOS algorithm is used to estimate of speech from each sub-band. The I-SOS algorithm uses a new noise estimation approach, to estimate noise power from each sub-band without the need of explicit speech silence detection. The sub-band noise estimate is updated by adaptively smoothing the noisy signal power. The smoothing parameter is controlled by a function of the estimated signal-to-noise ratio (SNR). The performance of the proposed speech enhancement system is evaluated objectively by SNR, Itakura-Saito distortion measure and subjectively by informal listening test. The results confirm that the proposed speech enhancement system is capable of reducing noise with little speech degradation remains acceptable in real-world environments, and the overall performance is superior to several competitive methods.\",\"PeriodicalId\":107245,\"journal\":{\"name\":\"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IHCI.2012.6481840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IHCI.2012.6481840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

为了满足高质量降噪算法的要求，本文提出了一种新的单麦克风系统语音增强方法。该系统采用感知驱动的平稳小波包滤波组(PM-SWPFB)和改进的频谱过减(I-SOS)算法来增强非平稳或有色噪声环境下的语音退化。通过调整均匀间隔平稳小波包树，以最接近地模拟心理声学模型的临界带，得到PM-SWPFB。首先，利用PM-SWPFB将输入噪声语音信号分解成非均匀子带;然后，使用I-SOS算法对每个子带的语音进行估计。I-SOS算法采用了一种新的噪声估计方法，在不需要明确的语音沉默检测的情况下估计每个子带的噪声功率。通过自适应平滑噪声信号功率来更新子带噪声估计。平滑参数由估计的信噪比(SNR)函数控制。通过信噪比、Itakura-Saito失真测量和非正式听力测试来客观评价语音增强系统的性能。结果表明，所提出的语音增强系统能够有效地降低噪声，并且在实际环境中语音退化很小，总体性能优于几种竞争方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A perceptually motivated stationary wavelet packet filter-bank utilizing improved spectral over-subtraction algorithm for enhancing speech in non-stationary environments

This paper proposes a novel speech enhancement approach for a single-microphone system to meet the demand of quality noise reduction algorithms. The proposed system incorporates a perceptually motivated stationary wavelet packet filter-bank (PM-SWPFB) and improved spectral over-subtraction (I-SOS) algorithm together to enhance the speech degraded by non-stationary or colored noise environment. The PM-SWPFB is obtained by adjusting the uniformly spaced stationary wavelet packet tree in order to most closely mimic the critical-bands of the psycho-acoustic model. The PM-SWPFB is, firstly, used to decompose the input noisy speech signal into nonuniform sub-bands. Then, I-SOS algorithm is used to estimate of speech from each sub-band. The I-SOS algorithm uses a new noise estimation approach, to estimate noise power from each sub-band without the need of explicit speech silence detection. The sub-band noise estimate is updated by adaptively smoothing the noisy signal power. The smoothing parameter is controlled by a function of the estimated signal-to-noise ratio (SNR). The performance of the proposed speech enhancement system is evaluated objectively by SNR, Itakura-Saito distortion measure and subjectively by informal listening test. The results confirm that the proposed speech enhancement system is capable of reducing noise with little speech degradation remains acceptable in real-world environments, and the overall performance is superior to several competitive methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)

自引率

0.00%

发文量