Investigation of Network Architecture for Single-Channel End-to-End Denoising

2020 28th European Signal Processing Conference (EUSIPCO) Pub Date : 2021-01-24 DOI:10.23919/Eusipco47968.2020.9287753

Takuya Hasumi, Tetsunori Kobayashi, Tetsuji Ogawa

引用次数: 0

Abstract

This paper examines the effectiveness of a fully convolutional time-domain audio separation network (Conv-TasNet) on single-channel denoising. Conv-TasNet, which has a structure to explicitly estimate a mask for encoded features, has shown to be effective in single-channel sound source separation in noise-free environments, but it has not been applied to denoising. Therefore, the present study investigates a method of learning Conv-TasNet for denoising and clarifies the optimal structure for single-channel end-to-end modeling. Experimental comparisons conducted using the CHiME-3 dataset demonstrate that Conv-TasNet performs well in denoising and yields improvements in single-channel end-to-end denoising over existing denoising autoencoder-based modeling.

查看原文本刊更多论文

单通道端到端去噪网络体系结构研究

本文研究了全卷积时域音频分离网络在单通道去噪中的有效性。卷积tasnet具有明确估计编码特征掩模的结构，已被证明在无噪声环境下的单通道声源分离中有效，但尚未应用于去噪。因此，本研究研究了一种学习卷积tasnet去噪的方法，并阐明了单通道端到端建模的最佳结构。使用CHiME-3数据集进行的实验比较表明，与现有的基于自编码器的去噪模型相比，卷积tasnet在单通道端到端去噪方面表现良好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 28th European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量