A Dual-Channel Beamformer Based on a Time-Delay Compensation Estimator and Shifted PCA for Speech Enhancement

Zhang Jie, Hong Liu
Published in: 2015 23rd International Conference on Software, Telecommunications and Computer Networks (SoftCOM)
Publication date: 2015-11-02
DOI: 10.1109/SOFTCOM.2015.7314060 (https://doi.org/10.1109/SOFTCOM.2015.7314060)
Citations: 1

Abstract

Speech enhancement is an essential technique for processing degraded audio in a wide range of applications. Beamforming with sensor arrays to suppress interference is the best-known approach to this problem. However, traditional beamformers often suffer from magnitude incoherence across the received signals because of directional weighting. This work therefore presents a novel dual-channel beamformer based on time-delay compensation (TDC) and shifted principal component analysis (PCA). First, the enhancement algorithm uses a TDC estimator to preserve binaural cues, namely the interaural time delay and the interaural intensity difference. The estimated cues are then incorporated into the shifted PCA, which reduces noise by extracting the principal components. Finally, the pre-processed audio is fed to a beamformer with a post-filter to obtain the enhanced speech. Experiments show that the proposed algorithm can achieve some improvement in speech intelligibility over state-of-the-art methods in real scenarios.
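The abstract gives no equations, so the following is only a minimal sketch of the general idea behind the first two stages (align the two channels in time, then keep the dominant principal component), not the authors' method. It assumes a plain cross-correlation search in place of the paper's TDC estimator and ordinary two-channel PCA in place of the shifted PCA. The function names `estimate_delay`, `compensate`, and `principal_component` are hypothetical, and the beamformer and post-filter stages are not covered.

```python
import math

def estimate_delay(x, y, max_lag):
    """Return the lag d (in samples) that maximizes sum_n x[n] * y[n + d].

    Assumption: a brute-force cross-correlation search stands in for the
    paper's TDC estimator, whose details the abstract does not give.
    """
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n in range(len(x)):
            m = n + lag
            if 0 <= m < len(y):
                score += x[n] * y[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag

def compensate(y, lag):
    """Shift y by `lag` samples so it lines up with the reference channel.

    Samples shifted in from outside the signal are zero-padded.
    """
    n = len(y)
    return [y[i + lag] if 0 <= i + lag < n else 0.0 for i in range(n)]

def principal_component(a, b):
    """Project two aligned channels onto the dominant eigenvector of their
    2x2 covariance matrix (ordinary PCA, not the paper's shifted PCA).

    Returns the projected signal and the unit weight vector (w_a, w_b).
    """
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    c_aa = sum((u - mean_a) ** 2 for u in a) / n
    c_bb = sum((v - mean_b) ** 2 for v in b) / n
    c_ab = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b)) / n
    # Major-axis angle of a symmetric 2x2 matrix [[c_aa, c_ab], [c_ab, c_bb]].
    theta = 0.5 * math.atan2(2.0 * c_ab, c_aa - c_bb)
    w = (math.cos(theta), math.sin(theta))
    projected = [w[0] * (u - mean_a) + w[1] * (v - mean_b) for u, v in zip(a, b)]
    return projected, w
```

A typical call sequence under these assumptions would be `lag = estimate_delay(left, right, max_lag)`, then `aligned = compensate(right, lag)`, then `principal_component(left, aligned)`. When the two aligned channels carry the same source at similar levels, the weight vector comes out close to equal weights on both channels, which behaves like a delay-and-sum combination.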