Multichannel music separation with deep neural networks

2016 24th European Signal Processing Conference (EUSIPCO) Pub Date : 2016-08-29 DOI:10.1109/EUSIPCO.2016.7760548

Aditya Arie Nugraha, A. Liutkus, E. Vincent

引用次数: 77

Abstract

This article addresses the problem of multichannel music separation. We propose a framework where the source spectra are estimated using deep neural networks and combined with spatial covariance matrices to encode the source spatial characteristics. The parameters are estimated in an iterative expectation-maximization fashion and used to derive a multichannel Wiener filter. We evaluate the proposed framework for the task of music separation on a large dataset. Experimental results show that the method we describe performs consistently well in separating singing voice and other instruments from realistic musical mixtures.

查看原文本刊更多论文

基于深度神经网络的多声道音乐分离

本文解决了多声道音乐分离的问题。我们提出了一种利用深度神经网络估计源光谱并结合空间协方差矩阵对源空间特征进行编码的框架。以迭代期望最大化的方式估计参数，并用于推导多通道维纳滤波器。我们在一个大型数据集上评估了提出的音乐分离任务框架。实验结果表明，我们所描述的方法可以很好地从现实音乐混合中分离出歌唱声音和其他乐器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 24th European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量