A modified spectral subtraction method for speech enhancement based on masking property of human auditory system

2009 International Conference on Wireless Communications & Signal Processing Pub Date : 2009-12-31 DOI:10.1109/WCSP.2009.5371466

Bingyin Xia, Yan Liang, C. Bao

{"title":"A modified spectral subtraction method for speech enhancement based on masking property of human auditory system","authors":"Bingyin Xia, Yan Liang, C. Bao","doi":"10.1109/WCSP.2009.5371466","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of musical noise introduced by conventional spectral subtraction method for speech enhancement. A modified spectral subtraction algorithm based on the masking properties of human auditory system is proposed. In comparison with Virag's algorithm, the modification of proposed method is made from four aspects. Firstly, VAD(Voice Activity Detection) is substituted by MCRA(Minima-Controlled Recursive Averaging) algorithm to estimate the background noise; Secondly, the masking threshold is calculated based on enhanced speech by multi-band spectral subtraction method; Thirdly, the adaptive parameters of spectral subtraction method is adjusted; Finally, a modified form of parametric spectral subtraction is employed. The performance of the proposed method is evaluated under ITU-T G.160 standard. The results shows that, comparing with the reference algorithms, the proposed method provides acceptable amount of signal-to-noise ratio(SNR) improvement and noise reduction with a little impact on the level of speech. The objective speech quality is improved evidently at the same time.","PeriodicalId":244652,"journal":{"name":"2009 International Conference on Wireless Communications & Signal Processing","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Wireless Communications & Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP.2009.5371466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

This paper addresses the problem of musical noise introduced by conventional spectral subtraction method for speech enhancement. A modified spectral subtraction algorithm based on the masking properties of human auditory system is proposed. In comparison with Virag's algorithm, the modification of proposed method is made from four aspects. Firstly, VAD(Voice Activity Detection) is substituted by MCRA(Minima-Controlled Recursive Averaging) algorithm to estimate the background noise; Secondly, the masking threshold is calculated based on enhanced speech by multi-band spectral subtraction method; Thirdly, the adaptive parameters of spectral subtraction method is adjusted; Finally, a modified form of parametric spectral subtraction is employed. The performance of the proposed method is evaluated under ITU-T G.160 standard. The results shows that, comparing with the reference algorithms, the proposed method provides acceptable amount of signal-to-noise ratio(SNR) improvement and noise reduction with a little impact on the level of speech. The objective speech quality is improved evidently at the same time.

查看原文本刊更多论文

一种基于人听觉掩蔽特性的改进频谱减法语音增强方法

本文解决了传统频谱减法在语音增强中引入音乐噪声的问题。提出了一种基于人类听觉系统掩蔽特性的改进谱减法算法。通过与Virag算法的比较，从四个方面对本文方法进行了改进。首先，用最小控制递归平均(MCRA)算法代替VAD(Voice Activity Detection)算法来估计背景噪声;其次，采用多频带谱减法计算基于增强语音的掩蔽阈值;第三，调整谱减法的自适应参数;最后，采用了一种改进的参数谱减法。根据ITU-T G.160标准对该方法的性能进行了评估。结果表明，与参考算法相比，本文提出的方法在对语音水平影响不大的情况下，提供了可接受的信噪比提升和降噪效果。同时，客观语音质量明显提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 International Conference on Wireless Communications & Signal Processing

自引率

0.00%

发文量