Joint Dereverberation and Separation of Reverberant Speech Mixtures

M. Kemiha, A. Kacha, L. Chouikhi
{"title":"Joint Dereverberation and Separation of Reverberant Speech Mixtures","authors":"M. Kemiha, A. Kacha, L. Chouikhi","doi":"10.1109/SSD54932.2022.9955829","DOIUrl":null,"url":null,"abstract":"This paper proposes a joint dereverberation and separation method for speech mixtures. In the literature, several studies consider the problems of blind separation and blind dereverberation as two separate problems and propose a solution to each problem. The dereverberation method consists to estimate the log of maximum likelihood parameter which depends on three parameters. So the reverberation algorithm returns to jointly maximize the likelihood function. The bivariate empirical mode decomposition (BEMD) algorithm combined with complex independent component analysis by entropy bound minimization (ICA-EBM) technique is proposed as an alternative to separate convolutive mixtures of speech signals reverberant environnement. The proposed algorithm is considered as a connection in tandem of the dereverberation network and separation network to estimate the source signals. The performance of the proposed approach is tested on real speech sounds chosen from available databases. Two sets of room impulse responses with reverberation times of 0.3 and 0.5 s, with a total of 410 test samples for each reverberation condition are simulated. The proposed method is compared to the conditional separation and dereverberation (CSD) method and Frequency domain blind source separation (FDBSS) method using objective measures which are segmental signal-to-reverberation ratio (SRRseg), signal-to-interference ratio (SIR) and direct-to-reverberationr (DRR).","PeriodicalId":253898,"journal":{"name":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD54932.2022.9955829","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper proposes a joint dereverberation and separation method for speech mixtures. In the literature, several studies consider the problems of blind separation and blind dereverberation as two separate problems and propose a solution to each problem. The dereverberation method consists to estimate the log of maximum likelihood parameter which depends on three parameters. So the reverberation algorithm returns to jointly maximize the likelihood function. The bivariate empirical mode decomposition (BEMD) algorithm combined with complex independent component analysis by entropy bound minimization (ICA-EBM) technique is proposed as an alternative to separate convolutive mixtures of speech signals reverberant environnement. The proposed algorithm is considered as a connection in tandem of the dereverberation network and separation network to estimate the source signals. The performance of the proposed approach is tested on real speech sounds chosen from available databases. Two sets of room impulse responses with reverberation times of 0.3 and 0.5 s, with a total of 410 test samples for each reverberation condition are simulated. The proposed method is compared to the conditional separation and dereverberation (CSD) method and Frequency domain blind source separation (FDBSS) method using objective measures which are segmental signal-to-reverberation ratio (SRRseg), signal-to-interference ratio (SIR) and direct-to-reverberationr (DRR).
混响混合语音的联合消噪与分离
提出了一种混合语音的联合去噪和分离方法。在文献中,一些研究将盲分离和盲去噪问题视为两个独立的问题,并针对每个问题提出了解决方案。去噪方法包括估计依赖于三个参数的最大似然参数的对数。因此混响算法回归到联合最大化似然函数。将二元经验模态分解(BEMD)算法与熵界最小化的复独立分量分析(ICA-EBM)技术相结合,提出了一种用于分离卷积混合语音信号混响环境的方法。该算法被认为是去噪网络和分离网络的串联连接,用于估计源信号。本文对从数据库中选择的真实语音进行了性能测试。模拟了混响时间分别为0.3和0.5 s的两组室内脉冲响应,每种混响条件下共410个测试样品。采用分段信混响比(SRRseg)、信干涉比(SIR)和直接混响比(DRR)作为客观指标,将该方法与条件分离与去噪(CSD)方法和频域盲源分离(FDBSS)方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信