A Perceptually Motivated Multi-Band Spectral Subtraction Algorithm for Enhancement of Degraded Speech

2012 Third International Conference on Computer and Communication Technology Pub Date : 2012-11-23 DOI:10.1109/ICCCT.2012.75

Navneet Upadhyay, A. Karmakar

{"title":"A Perceptually Motivated Multi-Band Spectral Subtraction Algorithm for Enhancement of Degraded Speech","authors":"Navneet Upadhyay, A. Karmakar","doi":"10.1109/ICCCT.2012.75","DOIUrl":null,"url":null,"abstract":"The spectral subtraction method is a classical approach for enhancement of degraded speech. The basic principle of this technique is to estimate the short-time spectral magnitude of speech by subtracting estimated noise from the noisy speech spectrum and to combine it with the phase of the noisy speech. Besides reducing the noise, this method generates an unnatural and unpleasant noise, called remnant noise. The other drawback of this method is that it can work only for white Gaussian noise which has a flat spectrum and is distributed uniformly over the frequency spectrum. But real-world noise is mostly colored and has a non-uniform spectrum. To take care of this kind of noises, spectral subtraction algorithm has been extended to a multi-band case with uniformly spaced frequency bands. In this paper, a perceptually motivated multi-band spectral subtraction algorithm is proposed to enhance the speech signal degraded by colored noise. In the proposed scheme, the whole speech spectrum is divided in different non-uniform bands (N = 6) in accordance to the critical-band rate scale and spectral subtraction is executed independently in each band. The simulation results as well as informal subjective evaluations show that the proposed algorithm reduces remnant noise efficiently and the enhanced speech contains minimal speech distortions with improved signal-to-noise ratio.","PeriodicalId":235770,"journal":{"name":"2012 Third International Conference on Computer and Communication Technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Third International Conference on Computer and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCT.2012.75","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

The spectral subtraction method is a classical approach for enhancement of degraded speech. The basic principle of this technique is to estimate the short-time spectral magnitude of speech by subtracting estimated noise from the noisy speech spectrum and to combine it with the phase of the noisy speech. Besides reducing the noise, this method generates an unnatural and unpleasant noise, called remnant noise. The other drawback of this method is that it can work only for white Gaussian noise which has a flat spectrum and is distributed uniformly over the frequency spectrum. But real-world noise is mostly colored and has a non-uniform spectrum. To take care of this kind of noises, spectral subtraction algorithm has been extended to a multi-band case with uniformly spaced frequency bands. In this paper, a perceptually motivated multi-band spectral subtraction algorithm is proposed to enhance the speech signal degraded by colored noise. In the proposed scheme, the whole speech spectrum is divided in different non-uniform bands (N = 6) in accordance to the critical-band rate scale and spectral subtraction is executed independently in each band. The simulation results as well as informal subjective evaluations show that the proposed algorithm reduces remnant noise efficiently and the enhanced speech contains minimal speech distortions with improved signal-to-noise ratio.

查看原文本刊更多论文

基于感知动机的多频带频谱减法增强退化语音

谱减法是增强退化语音的一种经典方法。该技术的基本原理是通过从有噪声的语音频谱中减去估计的噪声并将其与有噪声语音的相位相结合来估计语音的短时谱幅值。除了减少噪声，这种方法还会产生一种不自然的、令人不快的噪声，称为残余噪声。这种方法的另一个缺点是它只能适用于频谱平坦且在频谱上均匀分布的高斯白噪声。但现实世界的噪声大多是有色的，光谱也不均匀。为了处理这类噪声，将谱减算法扩展到均匀频带间隔的多频带情况。本文提出了一种感知激发的多波段频谱减法算法，用于增强被彩色噪声退化的语音信号。在该方案中，将整个语音频谱按照临界频带速率尺度划分为不同的非均匀频带(N = 6)，并在每个频带中独立执行频谱减法。仿真结果和非正式的主观评价表明，该算法有效地降低了残余噪声，增强语音具有最小的语音失真，提高了信噪比。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 Third International Conference on Computer and Communication Technology

自引率

0.00%

发文量