Speech enhancement by minimum mean-square error spectral amplitude estimation assuming weibull speech priors

2017 Artificial Intelligence and Signal Processing Conference (AISP) Pub Date : 2017-10-01 DOI:10.1109/AISP.2017.8324079

M. Bahrami, N. Faraji

引用次数: 4

Abstract

In this paper, we present a novel single-channel speech enhancement method in the Discrete Fourier Transform (DFT) domain. Here, the amplitude of DFT coefficients of a clean speech signal is modeled by a Weibull probability density function. Measuring the Jensen-Shannon divergence (JSD), Weibull distribution showed a better fit to clean speech signal compared to the previously fitted distributions such as gamma and Rayleigh. Therefore, we modify the Minimum Mean Square Error (MMSE) estimation algorithm for speech enhancement considering Weibull speech priors and Gaussian additive noise signals. The enhanced speech signals are assessed based on the perceptual evaluation of speech quality (PESQ) and segmental signal-to-noise ratio (SEG-SNR) criteria. Extensive simulation experiments on speech signals degraded by various additive non-stationary noise sources demonstrate that performance improvements are possible employing Weibull speech priors in the MMSE-based speech enhancement algorithm compared to the Rayleigh and Gamma PDFs.

查看原文本刊更多论文

假设威布尔语音先验的最小均方误差谱幅估计语音增强

本文提出了一种新的离散傅立叶变换(DFT)域单通道语音增强方法。在这里，一个干净的语音信号的DFT系数的幅度是由威布尔概率密度函数建模。测量Jensen-Shannon散度(JSD)，与gamma和Rayleigh等先前的拟合分布相比，Weibull分布显示出更好的拟合干净语音信号。因此，我们对最小均方误差(MMSE)估计算法进行了改进，考虑了威布尔语音先验和高斯加性噪声信号。基于语音质量感知评价(PESQ)和分段信噪比(SEG-SNR)标准对增强语音信号进行评估。对各种加性非平稳噪声源退化的语音信号进行的大量仿真实验表明，与瑞利和伽玛pdf相比，在基于mmse的语音增强算法中使用威布尔语音先验可以提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Artificial Intelligence and Signal Processing Conference (AISP)

自引率

0.00%

发文量