Estimation of the spatial information in Gaussian model based audio source separation using weighted spectral bases

2016 24th European Signal Processing Conference (EUSIPCO) Pub Date : 2016-08-01 DOI:10.1109/EUSIPCO.2016.7760436

M. Fakhry, P. Svaizer, M. Omologo

{"title":"Estimation of the spatial information in Gaussian model based audio source separation using weighted spectral bases","authors":"M. Fakhry, P. Svaizer, M. Omologo","doi":"10.1109/EUSIPCO.2016.7760436","DOIUrl":null,"url":null,"abstract":"In Gaussian model based audio source separation, source spatial images are modeled by Gaussian distributions. The covariance matrices of the distributions are represented by source variances and spatial covariance matrices. Accordingly, the likelihood of observed mixtures of independent source signals is parametrized by the variances and the covariance matrices. The separation is performed by estimating the parameters and applying multichannel Wiener filtering. Assuming that spectral basis matrices trained on source power spectra are available, this work proposes a method to estimate the parameters by maximizing the likelihood using Expectation-Maximization. In terms of normalization, the variances are estimated applying singular value decomposition. Furthermore, by building weighted matrices from vectors of the trained matrices, semi-supervised nonnegative matrix factorization is applied to estimate the spatial covariance matrices. The experimental results prove the efficiency of the proposed algorithm in reverberant environments.","PeriodicalId":127068,"journal":{"name":"2016 24th European Signal Processing Conference (EUSIPCO)","volume":"166 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 24th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUSIPCO.2016.7760436","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In Gaussian model based audio source separation, source spatial images are modeled by Gaussian distributions. The covariance matrices of the distributions are represented by source variances and spatial covariance matrices. Accordingly, the likelihood of observed mixtures of independent source signals is parametrized by the variances and the covariance matrices. The separation is performed by estimating the parameters and applying multichannel Wiener filtering. Assuming that spectral basis matrices trained on source power spectra are available, this work proposes a method to estimate the parameters by maximizing the likelihood using Expectation-Maximization. In terms of normalization, the variances are estimated applying singular value decomposition. Furthermore, by building weighted matrices from vectors of the trained matrices, semi-supervised nonnegative matrix factorization is applied to estimate the spatial covariance matrices. The experimental results prove the efficiency of the proposed algorithm in reverberant environments.

查看原文本刊更多论文

基于高斯模型的音频源分离中空间信息的加权谱基估计

在基于高斯模型的音源分离中，音源空间图像采用高斯分布建模。分布的协方差矩阵由源方差和空间协方差矩阵表示。因此，观测到的独立源信号混合的似然由方差和协方差矩阵参数化。通过估计参数和应用多通道维纳滤波实现分离。假设在源功率谱上训练的谱基矩阵是可用的，本文提出了一种利用期望最大化最大化似然来估计参数的方法。在归一化方面，采用奇异值分解估计方差。在此基础上，利用训练矩阵的向量构造加权矩阵，利用半监督非负矩阵分解法估计空间协方差矩阵。实验结果证明了该算法在混响环境下的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 24th European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量