Speech enhancement by minimum mean-square error spectral amplitude estimation assuming weibull speech priors

M. Bahrami, N. Faraji
{"title":"Speech enhancement by minimum mean-square error spectral amplitude estimation assuming weibull speech priors","authors":"M. Bahrami, N. Faraji","doi":"10.1109/AISP.2017.8324079","DOIUrl":null,"url":null,"abstract":"In this paper, we present a novel single-channel speech enhancement method in the Discrete Fourier Transform (DFT) domain. Here, the amplitude of DFT coefficients of a clean speech signal is modeled by a Weibull probability density function. Measuring the Jensen-Shannon divergence (JSD), Weibull distribution showed a better fit to clean speech signal compared to the previously fitted distributions such as gamma and Rayleigh. Therefore, we modify the Minimum Mean Square Error (MMSE) estimation algorithm for speech enhancement considering Weibull speech priors and Gaussian additive noise signals. The enhanced speech signals are assessed based on the perceptual evaluation of speech quality (PESQ) and segmental signal-to-noise ratio (SEG-SNR) criteria. Extensive simulation experiments on speech signals degraded by various additive non-stationary noise sources demonstrate that performance improvements are possible employing Weibull speech priors in the MMSE-based speech enhancement algorithm compared to the Rayleigh and Gamma PDFs.","PeriodicalId":386952,"journal":{"name":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP.2017.8324079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

In this paper, we present a novel single-channel speech enhancement method in the Discrete Fourier Transform (DFT) domain. Here, the amplitude of DFT coefficients of a clean speech signal is modeled by a Weibull probability density function. Measuring the Jensen-Shannon divergence (JSD), Weibull distribution showed a better fit to clean speech signal compared to the previously fitted distributions such as gamma and Rayleigh. Therefore, we modify the Minimum Mean Square Error (MMSE) estimation algorithm for speech enhancement considering Weibull speech priors and Gaussian additive noise signals. The enhanced speech signals are assessed based on the perceptual evaluation of speech quality (PESQ) and segmental signal-to-noise ratio (SEG-SNR) criteria. Extensive simulation experiments on speech signals degraded by various additive non-stationary noise sources demonstrate that performance improvements are possible employing Weibull speech priors in the MMSE-based speech enhancement algorithm compared to the Rayleigh and Gamma PDFs.
假设威布尔语音先验的最小均方误差谱幅估计语音增强
本文提出了一种新的离散傅立叶变换(DFT)域单通道语音增强方法。在这里,一个干净的语音信号的DFT系数的幅度是由威布尔概率密度函数建模。测量Jensen-Shannon散度(JSD),与gamma和Rayleigh等先前的拟合分布相比,Weibull分布显示出更好的拟合干净语音信号。因此,我们对最小均方误差(MMSE)估计算法进行了改进,考虑了威布尔语音先验和高斯加性噪声信号。基于语音质量感知评价(PESQ)和分段信噪比(SEG-SNR)标准对增强语音信号进行评估。对各种加性非平稳噪声源退化的语音信号进行的大量仿真实验表明,与瑞利和伽玛pdf相比,在基于mmse的语音增强算法中使用威布尔语音先验可以提高性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信