Joint Dereverberation and Separation of Reverberant Speech Mixtures

2022 19th International Multi-Conference on Systems, Signals & Devices (SSD) Pub Date : 2022-05-06 DOI:10.1109/SSD54932.2022.9955829

M. Kemiha, A. Kacha, L. Chouikhi

{"title":"Joint Dereverberation and Separation of Reverberant Speech Mixtures","authors":"M. Kemiha, A. Kacha, L. Chouikhi","doi":"10.1109/SSD54932.2022.9955829","DOIUrl":null,"url":null,"abstract":"This paper proposes a joint dereverberation and separation method for speech mixtures. In the literature, several studies consider the problems of blind separation and blind dereverberation as two separate problems and propose a solution to each problem. The dereverberation method consists to estimate the log of maximum likelihood parameter which depends on three parameters. So the reverberation algorithm returns to jointly maximize the likelihood function. The bivariate empirical mode decomposition (BEMD) algorithm combined with complex independent component analysis by entropy bound minimization (ICA-EBM) technique is proposed as an alternative to separate convolutive mixtures of speech signals reverberant environnement. The proposed algorithm is considered as a connection in tandem of the dereverberation network and separation network to estimate the source signals. The performance of the proposed approach is tested on real speech sounds chosen from available databases. Two sets of room impulse responses with reverberation times of 0.3 and 0.5 s, with a total of 410 test samples for each reverberation condition are simulated. The proposed method is compared to the conditional separation and dereverberation (CSD) method and Frequency domain blind source separation (FDBSS) method using objective measures which are segmental signal-to-reverberation ratio (SRRseg), signal-to-interference ratio (SIR) and direct-to-reverberationr (DRR).","PeriodicalId":253898,"journal":{"name":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD54932.2022.9955829","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This paper proposes a joint dereverberation and separation method for speech mixtures. In the literature, several studies consider the problems of blind separation and blind dereverberation as two separate problems and propose a solution to each problem. The dereverberation method consists to estimate the log of maximum likelihood parameter which depends on three parameters. So the reverberation algorithm returns to jointly maximize the likelihood function. The bivariate empirical mode decomposition (BEMD) algorithm combined with complex independent component analysis by entropy bound minimization (ICA-EBM) technique is proposed as an alternative to separate convolutive mixtures of speech signals reverberant environnement. The proposed algorithm is considered as a connection in tandem of the dereverberation network and separation network to estimate the source signals. The performance of the proposed approach is tested on real speech sounds chosen from available databases. Two sets of room impulse responses with reverberation times of 0.3 and 0.5 s, with a total of 410 test samples for each reverberation condition are simulated. The proposed method is compared to the conditional separation and dereverberation (CSD) method and Frequency domain blind source separation (FDBSS) method using objective measures which are segmental signal-to-reverberation ratio (SRRseg), signal-to-interference ratio (SIR) and direct-to-reverberationr (DRR).

查看原文本刊更多论文

混响混合语音的联合消噪与分离

提出了一种混合语音的联合去噪和分离方法。在文献中，一些研究将盲分离和盲去噪问题视为两个独立的问题，并针对每个问题提出了解决方案。去噪方法包括估计依赖于三个参数的最大似然参数的对数。因此混响算法回归到联合最大化似然函数。将二元经验模态分解(BEMD)算法与熵界最小化的复独立分量分析(ICA-EBM)技术相结合，提出了一种用于分离卷积混合语音信号混响环境的方法。该算法被认为是去噪网络和分离网络的串联连接，用于估计源信号。本文对从数据库中选择的真实语音进行了性能测试。模拟了混响时间分别为0.3和0.5 s的两组室内脉冲响应，每种混响条件下共410个测试样品。采用分段信混响比(SRRseg)、信干涉比(SIR)和直接混响比(DRR)作为客观指标，将该方法与条件分离与去噪(CSD)方法和频域盲源分离(FDBSS)方法进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)

自引率

0.00%

发文量