Speech enhancement using both spectral and spectral modulation domains

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE) Pub Date : 2017-04-01 DOI:10.1109/CCECE.2017.7946647

Julien Bosco, É. Plourde

引用次数: 2

Abstract

This paper proposes a speech enhancement approach that uses both the spectral and spectral modulation domains. In this approach, the noisy speech signal is enhanced simultaneously in the spectral domain, using a minimum mean square error (MMSE) short time spectral amplitude estimator, and in the spectral modulation domain, using a MMSE spectral modulation magnitude estimator. The results of both estimators are then weighted and combined together, using a function based on the a posteriori SNR, to produce the desired enhanced signal. Comparative results using both the segmental SNR and PESQ objective measures are presented for both stationary and non-stationary noises. It is observed that the proposed approach suppresses more noise than the compared approaches, but at the usual compromise of introducing speech distortions.

查看原文本刊更多论文

语音增强使用频谱和频谱调制域

本文提出了一种同时使用频谱域和频谱调制域的语音增强方法。该方法在频谱域采用最小均方误差(MMSE)短时频谱幅度估计器，在频谱调制域采用MMSE频谱调制幅度估计器，同时对含噪语音信号进行增强。然后，使用基于后验信噪比的函数对两个估计器的结果进行加权并组合在一起，以产生所需的增强信号。在平稳噪声和非平稳噪声的情况下，给出了分段信噪比和PESQ客观测量的比较结果。我们观察到，所提出的方法比所比较的方法抑制了更多的噪声，但通常是以引入语音失真为代价的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE)

自引率

0.00%

发文量