{"title":"利用改进的频谱过减算法增强非平稳环境下语音的感知驱动平稳小波包滤波器组","authors":"Navneet Upadhyay, A. Karmakar","doi":"10.1109/IHCI.2012.6481840","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel speech enhancement approach for a single-microphone system to meet the demand of quality noise reduction algorithms. The proposed system incorporates a perceptually motivated stationary wavelet packet filter-bank (PM-SWPFB) and improved spectral over-subtraction (I-SOS) algorithm together to enhance the speech degraded by non-stationary or colored noise environment. The PM-SWPFB is obtained by adjusting the uniformly spaced stationary wavelet packet tree in order to most closely mimic the critical-bands of the psycho-acoustic model. The PM-SWPFB is, firstly, used to decompose the input noisy speech signal into nonuniform sub-bands. Then, I-SOS algorithm is used to estimate of speech from each sub-band. The I-SOS algorithm uses a new noise estimation approach, to estimate noise power from each sub-band without the need of explicit speech silence detection. The sub-band noise estimate is updated by adaptively smoothing the noisy signal power. The smoothing parameter is controlled by a function of the estimated signal-to-noise ratio (SNR). The performance of the proposed speech enhancement system is evaluated objectively by SNR, Itakura-Saito distortion measure and subjectively by informal listening test. The results confirm that the proposed speech enhancement system is capable of reducing noise with little speech degradation remains acceptable in real-world environments, and the overall performance is superior to several competitive methods.","PeriodicalId":107245,"journal":{"name":"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A perceptually motivated stationary wavelet packet filter-bank utilizing improved spectral over-subtraction algorithm for enhancing speech in non-stationary environments\",\"authors\":\"Navneet Upadhyay, A. Karmakar\",\"doi\":\"10.1109/IHCI.2012.6481840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a novel speech enhancement approach for a single-microphone system to meet the demand of quality noise reduction algorithms. The proposed system incorporates a perceptually motivated stationary wavelet packet filter-bank (PM-SWPFB) and improved spectral over-subtraction (I-SOS) algorithm together to enhance the speech degraded by non-stationary or colored noise environment. The PM-SWPFB is obtained by adjusting the uniformly spaced stationary wavelet packet tree in order to most closely mimic the critical-bands of the psycho-acoustic model. The PM-SWPFB is, firstly, used to decompose the input noisy speech signal into nonuniform sub-bands. Then, I-SOS algorithm is used to estimate of speech from each sub-band. The I-SOS algorithm uses a new noise estimation approach, to estimate noise power from each sub-band without the need of explicit speech silence detection. The sub-band noise estimate is updated by adaptively smoothing the noisy signal power. The smoothing parameter is controlled by a function of the estimated signal-to-noise ratio (SNR). The performance of the proposed speech enhancement system is evaluated objectively by SNR, Itakura-Saito distortion measure and subjectively by informal listening test. The results confirm that the proposed speech enhancement system is capable of reducing noise with little speech degradation remains acceptable in real-world environments, and the overall performance is superior to several competitive methods.\",\"PeriodicalId\":107245,\"journal\":{\"name\":\"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IHCI.2012.6481840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IHCI.2012.6481840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A perceptually motivated stationary wavelet packet filter-bank utilizing improved spectral over-subtraction algorithm for enhancing speech in non-stationary environments
This paper proposes a novel speech enhancement approach for a single-microphone system to meet the demand of quality noise reduction algorithms. The proposed system incorporates a perceptually motivated stationary wavelet packet filter-bank (PM-SWPFB) and improved spectral over-subtraction (I-SOS) algorithm together to enhance the speech degraded by non-stationary or colored noise environment. The PM-SWPFB is obtained by adjusting the uniformly spaced stationary wavelet packet tree in order to most closely mimic the critical-bands of the psycho-acoustic model. The PM-SWPFB is, firstly, used to decompose the input noisy speech signal into nonuniform sub-bands. Then, I-SOS algorithm is used to estimate of speech from each sub-band. The I-SOS algorithm uses a new noise estimation approach, to estimate noise power from each sub-band without the need of explicit speech silence detection. The sub-band noise estimate is updated by adaptively smoothing the noisy signal power. The smoothing parameter is controlled by a function of the estimated signal-to-noise ratio (SNR). The performance of the proposed speech enhancement system is evaluated objectively by SNR, Itakura-Saito distortion measure and subjectively by informal listening test. The results confirm that the proposed speech enhancement system is capable of reducing noise with little speech degradation remains acceptable in real-world environments, and the overall performance is superior to several competitive methods.