Modular dynamic deep denoising autoencoder for speech enhancement

2017 7th International Conference on Computer and Knowledge Engineering (ICCKE) Pub Date : 2017-10-01 DOI:10.1109/ICCKE.2017.8167886

Razieh Safari, S. Ahadi, Sanaz Seyedin

Citations: 5

Abstract

The deep denoising autoencoder (DDAE) is an effective method for noise reduction and speech enhancement. However, a single DDAE with a fixed number of input frames cannot extract sufficient contextual information. It also generalizes less well at unknown SNRs (signal-to-noise ratios), and its enhanced output retains some residual noise. In this paper, we use a modular model in which three DDAEs with different window lengths are stacked. Experimental results show that the proposed architecture, the modular dynamic deep denoising autoencoder (MD-DDAE), provides superior performance compared with traditional DDAE models under different noise conditions.
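To make the idea of stacking modules with different window lengths concrete, the following is a minimal NumPy sketch of the data flow only. It is not the authors' implementation: the window lengths (3, 5, 7), the feature dimension (40), the hidden size (128), the single hidden layer, the untrained random weights, and the choice to feed each module's output into the next are all assumptions made for illustration, since the abstract does not specify them. The names `stack_context`, `TinyDDAE`, and `md_ddae_forward` are hypothetical.

```python
import numpy as np


def stack_context(frames, window):
    """Concatenate each frame with its neighbours: (T, D) -> (T, window * D)."""
    if window % 2 != 1:
        raise ValueError("window must be odd so the frame has a centre")
    half = window // 2
    # Repeat the edge frames so every frame has a full context window.
    padded = np.pad(frames, ((half, half), (0, 0)), mode="edge")
    return np.concatenate(
        [padded[i:i + len(frames)] for i in range(window)], axis=1
    )


class TinyDDAE:
    """Untrained one-hidden-layer stand-in for one DDAE module (assumption)."""

    def __init__(self, feat_dim, window, hidden, rng):
        self.window = window
        self.w1 = rng.standard_normal((window * feat_dim, hidden)) * 0.01
        self.b1 = np.zeros(hidden)
        self.w2 = rng.standard_normal((hidden, feat_dim)) * 0.01
        self.b2 = np.zeros(feat_dim)

    def __call__(self, frames):
        x = stack_context(frames, self.window)       # (T, window * D)
        h = np.maximum(x @ self.w1 + self.b1, 0.0)   # ReLU hidden layer
        return h @ self.w2 + self.b2                 # one output frame per input frame


def md_ddae_forward(noisy, modules):
    """Pass features through the modules in sequence (assumed stacking order)."""
    out = noisy
    for module in modules:
        out = module(out)
    return out


rng = np.random.default_rng(0)
feat_dim = 40                                        # hypothetical feature size
modules = [TinyDDAE(feat_dim, w, 128, rng) for w in (3, 5, 7)]
noisy = rng.standard_normal((100, feat_dim))         # 100 placeholder frames
enhanced = md_ddae_forward(noisy, modules)
```

Each module sees a different amount of temporal context but emits one frame per input frame, so the modules can be chained without changing the sequence length. In practice each module would be trained on noisy and clean feature pairs before being stacked.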
