基于麦克风坐标下降的独立深度学习矩阵分析鲁棒去混滤波器更新算法

2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) Pub Date : 2019-11-01 DOI:10.1109/APSIPAASC47483.2019.9023032

Naoki Makishima, Norihiro Takamune, H. Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo

{"title":"基于麦克风坐标下降的独立深度学习矩阵分析鲁棒去混滤波器更新算法","authors":"Naoki Makishima, Norihiro Takamune, H. Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo","doi":"10.1109/APSIPAASC47483.2019.9023032","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a robust demixing filter update algorithm for audio source separation, which is the task of recovering source signals from multichannel mixtures observed in a microphone array. Recently, independent deeply learned matrix analysis (IDLMA) has been proposed as a state-of-the-art separation method. IDLMA utilizes the deep neural network (DNN) inference of source models and the blind estimation of demixing filters based on sources' independence. In conventional IDLMA, iterative projection (IP) is exploited to estimate the demixing filters. Although IP is a fast algorithm, when a specific source model is not accurate owing to an unfavorable SNR condition, the subsequent update of filters will fail. This is because IP updates the demixing filters in a sourcewise manner, where only one source model is used for each update. In this paper, we derive a new microphone-wise update algorithm that exploits all information of the source models simultaneously for each update. The microphone-wise update problem cannot be solved by IP, but instead, a new type of vectorwise coordinate descent algorithm is introduced into the proposed algorithm to realize convergence-guaranteed parameter estimation. Experimental results show that the proposed update algorithm achieves better separation performance than IP.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Robust Demixing Filter Update Algorithm Based on Microphone-wise Coordinate Descent for Independent Deeply Learned Matrix Analysis\",\"authors\":\"Naoki Makishima, Norihiro Takamune, H. Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo\",\"doi\":\"10.1109/APSIPAASC47483.2019.9023032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a robust demixing filter update algorithm for audio source separation, which is the task of recovering source signals from multichannel mixtures observed in a microphone array. Recently, independent deeply learned matrix analysis (IDLMA) has been proposed as a state-of-the-art separation method. IDLMA utilizes the deep neural network (DNN) inference of source models and the blind estimation of demixing filters based on sources' independence. In conventional IDLMA, iterative projection (IP) is exploited to estimate the demixing filters. Although IP is a fast algorithm, when a specific source model is not accurate owing to an unfavorable SNR condition, the subsequent update of filters will fail. This is because IP updates the demixing filters in a sourcewise manner, where only one source model is used for each update. In this paper, we derive a new microphone-wise update algorithm that exploits all information of the source models simultaneously for each update. The microphone-wise update problem cannot be solved by IP, but instead, a new type of vectorwise coordinate descent algorithm is introduced into the proposed algorithm to realize convergence-guaranteed parameter estimation. Experimental results show that the proposed update algorithm achieves better separation performance than IP.\",\"PeriodicalId\":145222,\"journal\":{\"name\":\"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSIPAASC47483.2019.9023032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPAASC47483.2019.9023032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在本文中，我们提出了一种用于音频源分离的鲁棒除混滤波器更新算法，该算法的任务是从麦克风阵列中观察到的多通道混合中恢复源信号。近年来，独立深度学习矩阵分析(IDLMA)作为一种最先进的分离方法被提出。IDLMA利用源模型的深度神经网络(DNN)推理和基于源独立性的去混滤波器的盲估计。在传统的IDLMA中，使用迭代投影(IP)来估计除混滤波器。虽然IP是一种快速的算法，但当特定的源模型由于不利的信噪比条件而不准确时，后续的滤波器更新将失败。这是因为IP以源方式更新除混过滤器，其中每次更新仅使用一个源模型。在本文中，我们推导了一种新的智能麦克风更新算法，该算法在每次更新时同时利用源模型的所有信息。针对IP无法解决的麦克风智能更新问题，在该算法中引入了一种新的矢量坐标下降算法，实现了保证收敛的参数估计。实验结果表明，该算法比IP算法具有更好的分离性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Robust Demixing Filter Update Algorithm Based on Microphone-wise Coordinate Descent for Independent Deeply Learned Matrix Analysis

In this paper, we propose a robust demixing filter update algorithm for audio source separation, which is the task of recovering source signals from multichannel mixtures observed in a microphone array. Recently, independent deeply learned matrix analysis (IDLMA) has been proposed as a state-of-the-art separation method. IDLMA utilizes the deep neural network (DNN) inference of source models and the blind estimation of demixing filters based on sources' independence. In conventional IDLMA, iterative projection (IP) is exploited to estimate the demixing filters. Although IP is a fast algorithm, when a specific source model is not accurate owing to an unfavorable SNR condition, the subsequent update of filters will fail. This is because IP updates the demixing filters in a sourcewise manner, where only one source model is used for each update. In this paper, we derive a new microphone-wise update algorithm that exploits all information of the source models simultaneously for each update. The microphone-wise update problem cannot be solved by IP, but instead, a new type of vectorwise coordinate descent algorithm is introduced into the proposed algorithm to realize convergence-guaranteed parameter estimation. Experimental results show that the proposed update algorithm achieves better separation performance than IP.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

自引率

0.00%

发文量