An auditory perception based improved multi-band spectral subtraction algorithm for enhancement of speech degraded by non-stationary noises
Authors: Navneet Upadhyay, A. Karmakar
Venue: 2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)
Publication date: 2012-12-01
DOI: 10.1109/IHCI.2012.6481854 (https://doi.org/10.1109/IHCI.2012.6481854)
Citations: 1
Abstract
This paper proposes an improved multi-band spectral subtraction algorithm, based on auditory perception, for enhancing speech degraded by non-stationary or colored noise. The speech spectrum is divided into six non-uniform bands (N = 6) according to the critical-band rate scale, and spectral subtraction is applied separately in each band. The algorithm estimates the noise power in each band without explicit speech-silence detection: the noise estimate is updated by adaptively smoothing the noisy signal power, with a smoothing parameter controlled by a linear function of the a-posteriori signal-to-noise ratio (SNR). This noise estimation approach gives accurate results at low SNR and works continuously in the presence of speech. Objective measures and informal subjective tests show that the proposed algorithm reduces remnant noise efficiently and that the enhanced speech has improved SNR with minimal speech distortion.
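The abstract does not give the actual equations, so the following is only a minimal sketch of how a per-band noise update and subtraction of this kind might look. Everything specific in it is an assumption made for illustration: the function names, the linear coefficients (`slope`, `offset`), the clamping range of the smoothing parameter, the over-subtraction factor, and the spectral floor are hypothetical and are not taken from the paper. The input is assumed to be one frame of noisy power already summed into six critical-band values.

```python
# Illustrative sketch only: all constants and names below are assumptions,
# not values or identifiers from the paper.

def smoothing_factor(snr_post, slope=0.1, offset=0.7, lo=0.7, hi=1.0):
    """Smoothing parameter as a clamped linear function of a-posteriori SNR.

    High SNR (speech likely present) pushes the factor toward 1, which
    freezes the noise estimate; low SNR lets the estimate track the input.
    """
    return min(hi, max(lo, slope * snr_post + offset))


def update_noise(noise_prev, noisy_power, eps=1e-12):
    """Recursive noise update for one band, with no speech/silence detector."""
    snr_post = noisy_power / max(noise_prev, eps)  # a-posteriori SNR
    alpha = smoothing_factor(snr_post)
    return alpha * noise_prev + (1.0 - alpha) * noisy_power


def subtract_band(noisy_power, noise_est, over_sub=1.0, floor=0.01):
    """Power spectral subtraction in one band with a spectral floor."""
    return max(noisy_power - over_sub * noise_est, floor * noisy_power)


def enhance_frame(band_powers, noise_est):
    """Process one frame of per-band noisy power values.

    Returns the enhanced band powers and the updated noise estimates.
    """
    new_noise = [update_noise(n, p) for n, p in zip(noise_est, band_powers)]
    clean = [subtract_band(p, n) for p, n in zip(band_powers, new_noise)]
    return clean, new_noise


if __name__ == "__main__":
    noise = [1.0] * 6                         # initial per-band noise power
    frame = [1.0, 1.0, 1.0, 1.0, 1.0, 50.0]   # band 6 carries strong speech
    clean, noise = enhance_frame(frame, noise)
    print(clean)
    print(noise)
```

In this sketch the strong band leaves its noise estimate unchanged because the smoothing factor saturates at 1, while the noise-only bands are reduced to the spectral floor. A real implementation would compute the band powers from a short-time Fourier transform and apply the resulting gains to the noisy spectrum before resynthesis.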