{"title":"基于深度神经网络的多声道音乐分离","authors":"Aditya Arie Nugraha, A. Liutkus, E. Vincent","doi":"10.1109/EUSIPCO.2016.7760548","DOIUrl":null,"url":null,"abstract":"This article addresses the problem of multichannel music separation. We propose a framework where the source spectra are estimated using deep neural networks and combined with spatial covariance matrices to encode the source spatial characteristics. The parameters are estimated in an iterative expectation-maximization fashion and used to derive a multichannel Wiener filter. We evaluate the proposed framework for the task of music separation on a large dataset. Experimental results show that the method we describe performs consistently well in separating singing voice and other instruments from realistic musical mixtures.","PeriodicalId":127068,"journal":{"name":"2016 24th European Signal Processing Conference (EUSIPCO)","volume":"307 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"77","resultStr":"{\"title\":\"Multichannel music separation with deep neural networks\",\"authors\":\"Aditya Arie Nugraha, A. Liutkus, E. Vincent\",\"doi\":\"10.1109/EUSIPCO.2016.7760548\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article addresses the problem of multichannel music separation. We propose a framework where the source spectra are estimated using deep neural networks and combined with spatial covariance matrices to encode the source spatial characteristics. The parameters are estimated in an iterative expectation-maximization fashion and used to derive a multichannel Wiener filter. We evaluate the proposed framework for the task of music separation on a large dataset. Experimental results show that the method we describe performs consistently well in separating singing voice and other instruments from realistic musical mixtures.\",\"PeriodicalId\":127068,\"journal\":{\"name\":\"2016 24th European Signal Processing Conference (EUSIPCO)\",\"volume\":\"307 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"77\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 24th European Signal Processing Conference (EUSIPCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EUSIPCO.2016.7760548\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 24th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUSIPCO.2016.7760548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multichannel music separation with deep neural networks
This article addresses the problem of multichannel music separation. We propose a framework where the source spectra are estimated using deep neural networks and combined with spatial covariance matrices that encode the source spatial characteristics. The parameters are estimated in an iterative expectation-maximization fashion and used to derive a multichannel Wiener filter. We evaluate the proposed framework for the task of music separation on a large dataset. Experimental results show that the method we describe performs consistently well in separating the singing voice and other instruments from realistic musical mixtures.
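To make the filtering step described in the abstract concrete, the sketch below shows how per-source power spectra (here assumed to come from the deep neural networks) and spatial covariance matrices can be combined into a multichannel Wiener filter applied in each time-frequency bin. This is a minimal illustration, not the authors' implementation: the DNN spectral estimation and the expectation-maximization updates are assumed to have been run already, and all function names, variable names, and array shapes are illustrative.

```python
# Minimal sketch of multichannel Wiener filtering from per-source power
# spectra v_j(f, n) and spatial covariance matrices R_j(f). Assumes these
# quantities are already available (e.g., from the DNN and EM iterations).
import numpy as np


def multichannel_wiener_filter(x, v, R, eps=1e-10):
    """Estimate source images with a multichannel Wiener filter.

    x : complex mixture STFT, shape (F, N, I)   -- I channels
    v : non-negative source power spectra, shape (J, F, N)
    R : Hermitian spatial covariance matrices, shape (J, F, I, I)
    Returns estimated source image STFTs, shape (J, F, N, I).
    """
    J, F, N = v.shape
    I = x.shape[-1]
    estimates = np.zeros((J, F, N, I), dtype=complex)

    for f in range(F):
        for n in range(N):
            # Mixture covariance: sum over sources of v_j(f, n) * R_j(f).
            Cx = sum(v[j, f, n] * R[j, f] for j in range(J))
            Cx = Cx + eps * np.eye(I)          # regularize before inversion
            Cx_inv = np.linalg.inv(Cx)
            for j in range(J):
                # Wiener gain W_j = v_j R_j Cx^{-1}; source image = W_j x.
                W = v[j, f, n] * R[j, f] @ Cx_inv
                estimates[j, f, n] = W @ x[f, n]
    return estimates
```

In an EM-style pipeline such as the one outlined in the abstract, these filtered source images would in turn feed the update of the spatial covariance matrices and the targets for refining the spectral estimates, before a final pass produces the separated signals.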