基于多层kullback - leibler的LPC误差聚类复NMF盲源分离

A. Amiri, Sanaz Seyedin, S. Ahadi
{"title":"基于多层kullback - leibler的LPC误差聚类复NMF盲源分离","authors":"A. Amiri, Sanaz Seyedin, S. Ahadi","doi":"10.1109/ICCKE.2017.8167890","DOIUrl":null,"url":null,"abstract":"In many applications such as music transcription, audio forensics, and speech source separation, it is needed to decompose a mono recording into its respective sources. These techniques are usually referred to as blind source separation (BSS). One of the methods recently used in BSS is non-negative matrix factorization (NMF) both in supervised and unsupervised learning cases. In this paper, we propose a novel NMF-based algorithm namely, multi-layer KL-CNMF (Kullback-Leibler-Complex NMF) using fuzzy initial clustering to improve the performance of BSS in the unsupervised mode. In addition, we use LPC error clustering as a powerful criterion especially for separating harmonic signals such as certain speech sources from their multi-layer KL-CNMF components. The results on speech mixtures of the TIMIT database based on signal to distortion ratio (SDR) and signal to interference ratio (SIR) show that the proposed system significantly outperforms the baseline system which is an NMF-based BSS with LPC error clustering.","PeriodicalId":151934,"journal":{"name":"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Multi-layer Kullback-Leibler-based Complex NMF with LPC error clustering for blind source separation\",\"authors\":\"A. Amiri, Sanaz Seyedin, S. Ahadi\",\"doi\":\"10.1109/ICCKE.2017.8167890\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In many applications such as music transcription, audio forensics, and speech source separation, it is needed to decompose a mono recording into its respective sources. These techniques are usually referred to as blind source separation (BSS). One of the methods recently used in BSS is non-negative matrix factorization (NMF) both in supervised and unsupervised learning cases. In this paper, we propose a novel NMF-based algorithm namely, multi-layer KL-CNMF (Kullback-Leibler-Complex NMF) using fuzzy initial clustering to improve the performance of BSS in the unsupervised mode. In addition, we use LPC error clustering as a powerful criterion especially for separating harmonic signals such as certain speech sources from their multi-layer KL-CNMF components. The results on speech mixtures of the TIMIT database based on signal to distortion ratio (SDR) and signal to interference ratio (SIR) show that the proposed system significantly outperforms the baseline system which is an NMF-based BSS with LPC error clustering.\",\"PeriodicalId\":151934,\"journal\":{\"name\":\"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2017.8167890\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 7th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2017.8167890","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在许多应用中,如音乐转录、音频取证和语音源分离,需要将单声道录音分解为其各自的源。这些技术通常被称为盲源分离(BSS)。非负矩阵分解(NMF)是目前在有监督学习和无监督学习中应用的一种方法。在本文中,我们提出了一种新的基于NMF的算法,即多层KL-CNMF (Kullback-Leibler-Complex NMF),利用模糊初始聚类来提高BSS在无监督模式下的性能。此外,我们使用LPC误差聚类作为一个强大的准则,特别是在将谐波信号(如某些语音源)与其多层KL-CNMF分量分离时。基于信失真比(SDR)和信干扰比(SIR)的TIMIT数据库语音混合结果表明,该系统明显优于基于nmf的LPC错误聚类的BSS基线系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-layer Kullback-Leibler-based Complex NMF with LPC error clustering for blind source separation
In many applications such as music transcription, audio forensics, and speech source separation, it is needed to decompose a mono recording into its respective sources. These techniques are usually referred to as blind source separation (BSS). One of the methods recently used in BSS is non-negative matrix factorization (NMF) both in supervised and unsupervised learning cases. In this paper, we propose a novel NMF-based algorithm namely, multi-layer KL-CNMF (Kullback-Leibler-Complex NMF) using fuzzy initial clustering to improve the performance of BSS in the unsupervised mode. In addition, we use LPC error clustering as a powerful criterion especially for separating harmonic signals such as certain speech sources from their multi-layer KL-CNMF components. The results on speech mixtures of the TIMIT database based on signal to distortion ratio (SDR) and signal to interference ratio (SIR) show that the proposed system significantly outperforms the baseline system which is an NMF-based BSS with LPC error clustering.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信