Spectro-temporal Filtering based on The Beta-divergence for Speech Separation using Nonnegative Matrix Factorization

M. Fakhry
{"title":"Spectro-temporal Filtering based on The Beta-divergence for Speech Separation using Nonnegative Matrix Factorization","authors":"M. Fakhry","doi":"10.1109/ISRITI54043.2021.9702880","DOIUrl":null,"url":null,"abstract":"Nonnegative matrix factorization (NMF) has shown high effectiveness to perform supervised speech separation. In this context, nonnegative spectral basis matrices representing sources in an observed mixture, are trained independently. The trained matrices are used later to compute the corresponding nonnegative temporal activation matrices. Estimations of the source signals in the mixture are obtained through Wiener gains by minimizing the Euclidean distance between true and estimated source signals. In this paper, we propose to quantify such a distance using the Beta-divergence ($\\beta$-divergence), which has been successfully used to accomplish NMF. The proposed gains are derived by minimizing the distance measured by the divergence, and it is involved afterward in the context of supervised NMF for speech separation. The experimental evaluation concludes that the gain computed by the Beta-divergence with $\\beta=1.5$, provides better performance compared to the conventional Wiener gain.","PeriodicalId":156265,"journal":{"name":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI54043.2021.9702880","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Nonnegative matrix factorization (NMF) has shown high effectiveness to perform supervised speech separation. In this context, nonnegative spectral basis matrices representing sources in an observed mixture, are trained independently. The trained matrices are used later to compute the corresponding nonnegative temporal activation matrices. Estimations of the source signals in the mixture are obtained through Wiener gains by minimizing the Euclidean distance between true and estimated source signals. In this paper, we propose to quantify such a distance using the Beta-divergence ($\beta$-divergence), which has been successfully used to accomplish NMF. The proposed gains are derived by minimizing the distance measured by the divergence, and it is involved afterward in the context of supervised NMF for speech separation. The experimental evaluation concludes that the gain computed by the Beta-divergence with $\beta=1.5$, provides better performance compared to the conventional Wiener gain.
基于β散度的非负矩阵分解语音分离频谱时间滤波
非负矩阵分解(NMF)在监督语音分离中表现出很高的有效性。在这种情况下,非负谱基矩阵表示源观察混合物,是独立训练。训练后的矩阵用于计算相应的非负时间激活矩阵。通过最小化真源信号和估计源信号之间的欧几里得距离,通过维纳增益获得混合源信号的估计。在本文中,我们建议使用beta散度($\beta$-散度)来量化这样的距离,该散度已成功地用于实现NMF。所提出的增益是通过最小化发散所测量的距离而得出的,并且随后涉及到用于语音分离的监督NMF的背景。实验结果表明,与传统的维纳增益相比,由β散度计算的增益在$\ β =1.5$时具有更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信