{"title":"A modified spectral subtraction method for speech enhancement based on masking property of human auditory system","authors":"Bingyin Xia, Yan Liang, C. Bao","doi":"10.1109/WCSP.2009.5371466","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of musical noise introduced by conventional spectral subtraction method for speech enhancement. A modified spectral subtraction algorithm based on the masking properties of human auditory system is proposed. In comparison with Virag's algorithm, the modification of proposed method is made from four aspects. Firstly, VAD(Voice Activity Detection) is substituted by MCRA(Minima-Controlled Recursive Averaging) algorithm to estimate the background noise; Secondly, the masking threshold is calculated based on enhanced speech by multi-band spectral subtraction method; Thirdly, the adaptive parameters of spectral subtraction method is adjusted; Finally, a modified form of parametric spectral subtraction is employed. The performance of the proposed method is evaluated under ITU-T G.160 standard. The results shows that, comparing with the reference algorithms, the proposed method provides acceptable amount of signal-to-noise ratio(SNR) improvement and noise reduction with a little impact on the level of speech. The objective speech quality is improved evidently at the same time.","PeriodicalId":244652,"journal":{"name":"2009 International Conference on Wireless Communications & Signal Processing","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Wireless Communications & Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP.2009.5371466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
This paper addresses the problem of musical noise introduced by conventional spectral subtraction method for speech enhancement. A modified spectral subtraction algorithm based on the masking properties of human auditory system is proposed. In comparison with Virag's algorithm, the modification of proposed method is made from four aspects. Firstly, VAD(Voice Activity Detection) is substituted by MCRA(Minima-Controlled Recursive Averaging) algorithm to estimate the background noise; Secondly, the masking threshold is calculated based on enhanced speech by multi-band spectral subtraction method; Thirdly, the adaptive parameters of spectral subtraction method is adjusted; Finally, a modified form of parametric spectral subtraction is employed. The performance of the proposed method is evaluated under ITU-T G.160 standard. The results shows that, comparing with the reference algorithms, the proposed method provides acceptable amount of signal-to-noise ratio(SNR) improvement and noise reduction with a little impact on the level of speech. The objective speech quality is improved evidently at the same time.