Sparse Reverberant Audio Source Separation via Reweighted Analysis

IEEE Transactions on Audio Speech and Language Processing Pub Date : 2013-07-01 DOI:10.1109/TASL.2013.2250962

S. Arberet, P. Vandergheynst, R. Carrillo, J. Thiran, Y. Wiaux

引用次数: 21

Abstract

We propose a novel algorithm for source signals estimation from an underdetermined convolutive mixture assuming known mixing filters. Most of the state-of-the-art methods are dealing with anechoic or short reverberant mixture, assuming a synthesis sparse prior in the time-frequency domain and a narrowband approximation of the convolutive mixing process. In this paper, we address the source estimation of convolutive mixtures with a new algorithm based on i) an analysis sparse prior, ii) a reweighting scheme so as to increase the sparsity, iii) a wideband data-fidelity term in a constrained form. We show, through theoretical discussions and simulations, that this algorithm is particularly well suited for source separation of realistic reverberation mixtures. Particularly, the proposed algorithm outperforms state-of-the-art methods on reverberant mixtures of audio sources by more than 2 dB of signal-to-distortion ratio on the BSS Oracle dataset.

查看原文本刊更多论文

基于重加权分析的稀疏混响音源分离

我们提出了一种基于已知混合滤波器的欠定卷积混合源信号估计算法。大多数最先进的方法都是处理消声或短混响混合，假设时频域的合成稀疏先验和卷积混合过程的窄带近似。在本文中，我们用一种新的算法来解决卷积混合的源估计问题，该算法基于i)分析稀疏先验，ii)重新加权以增加稀疏性，iii)约束形式的宽带数据保真度项。我们通过理论讨论和模拟表明，该算法特别适合于现实混响混合的源分离。特别是，在BSS Oracle数据集上，所提出的算法在混响音源混合上优于最先进的方法，信号失真比超过2db。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Audio Speech and Language Processing 工程技术-工程：电子与电气

自引率

0.00%

发文量

审稿时长

24.0 months

期刊介绍： The IEEE Transactions on Audio, Speech and Language Processing covers the sciences, technologies and applications relating to the analysis, coding, enhancement, recognition and synthesis of audio, music, speech and language. In particular, audio processing also covers auditory modeling, acoustic modeling and source separation. Speech processing also covers speech production and perception, adaptation, lexical modeling and speaker recognition. Language processing also covers spoken language understanding, translation, summarization, mining, general language modeling, as well as spoken dialog systems.