基于模型的后置滤波器用于麦克风阵列语音增强

2018 7th International Conference on Digital Home (ICDH) Pub Date : 2018-11-01 DOI:10.1109/ICDH.2018.00023

Yan Xiong, Qiang Chen, S. Deng, Sheng Liang, Kai Wang, Jun Zhang, Jie Wang

{"title":"基于模型的后置滤波器用于麦克风阵列语音增强","authors":"Yan Xiong, Qiang Chen, S. Deng, Sheng Liang, Kai Wang, Jun Zhang, Jie Wang","doi":"10.1109/ICDH.2018.00023","DOIUrl":null,"url":null,"abstract":"Generalized sidelobe canceller (GSC) is wildly used in speech enhancement due to its efficient implementation. However, the conventional GSC has some drawbacks when applied to speech enhancement system. First, it is focused on improving the signal-to-noise ratio (SNR) without considering the characteristics of speech so that is not optimal for speech enhancement applications. Second, the adaptive branch in the GSC does not always estimate the noise in the fixed branch output accurately, especially when the SNR is high, the noise is spatially incoherent, or the spatial incoherent noises and spatial coherent interferences coexist. In this paper, we propose a model-based post filter for the sub-band GSC which is a typical form of the microphone array beamformer. An improved noise estimation method is developed to estimate the noise in the fixed branch output of each sub-band GSC from its adaptive branch output. Then the fixed branch output is filtered by an optimal filter which is constructed according to a GMM model trained by clean speeches and an online-estimated noise model. Experimental results show that the proposed method achieves significant improvement over the conventional sub-band GSC and outperforms several speech enhancement methods in different noisy environments.","PeriodicalId":117854,"journal":{"name":"2018 7th International Conference on Digital Home (ICDH)","volume":"248 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Model-Based Post Filter for Microphone Array Speech Enhancement\",\"authors\":\"Yan Xiong, Qiang Chen, S. Deng, Sheng Liang, Kai Wang, Jun Zhang, Jie Wang\",\"doi\":\"10.1109/ICDH.2018.00023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generalized sidelobe canceller (GSC) is wildly used in speech enhancement due to its efficient implementation. However, the conventional GSC has some drawbacks when applied to speech enhancement system. First, it is focused on improving the signal-to-noise ratio (SNR) without considering the characteristics of speech so that is not optimal for speech enhancement applications. Second, the adaptive branch in the GSC does not always estimate the noise in the fixed branch output accurately, especially when the SNR is high, the noise is spatially incoherent, or the spatial incoherent noises and spatial coherent interferences coexist. In this paper, we propose a model-based post filter for the sub-band GSC which is a typical form of the microphone array beamformer. An improved noise estimation method is developed to estimate the noise in the fixed branch output of each sub-band GSC from its adaptive branch output. Then the fixed branch output is filtered by an optimal filter which is constructed according to a GMM model trained by clean speeches and an online-estimated noise model. Experimental results show that the proposed method achieves significant improvement over the conventional sub-band GSC and outperforms several speech enhancement methods in different noisy environments.\",\"PeriodicalId\":117854,\"journal\":{\"name\":\"2018 7th International Conference on Digital Home (ICDH)\",\"volume\":\"248 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th International Conference on Digital Home (ICDH)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDH.2018.00023\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th International Conference on Digital Home (ICDH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDH.2018.00023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

广义旁瓣对消器(GSC)由于其高效的实现，在语音增强中得到了广泛的应用。然而，传统的GSC在语音增强系统中的应用存在一定的缺陷。首先，它的重点是提高信噪比(SNR)，而没有考虑语音的特性，因此不是语音增强应用的最佳选择。其次，GSC中的自适应支路并不总是能准确估计固定支路输出中的噪声，特别是在信噪比较高、噪声空间不相干或空间不相干噪声与空间相干干扰共存的情况下。在本文中，我们提出了一种基于模型的后置滤波器用于子带GSC，这是传声器阵列波束形成器的一种典型形式。提出了一种改进的噪声估计方法，从各子带GSC的自适应支路输出中估计其固定支路输出中的噪声。然后，根据干净语音训练的GMM模型和在线估计的噪声模型构建最优滤波器，对固定支路输出进行滤波。实验结果表明，该方法在不同噪声环境下的语音增强性能优于传统子带GSC方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Model-Based Post Filter for Microphone Array Speech Enhancement

Generalized sidelobe canceller (GSC) is wildly used in speech enhancement due to its efficient implementation. However, the conventional GSC has some drawbacks when applied to speech enhancement system. First, it is focused on improving the signal-to-noise ratio (SNR) without considering the characteristics of speech so that is not optimal for speech enhancement applications. Second, the adaptive branch in the GSC does not always estimate the noise in the fixed branch output accurately, especially when the SNR is high, the noise is spatially incoherent, or the spatial incoherent noises and spatial coherent interferences coexist. In this paper, we propose a model-based post filter for the sub-band GSC which is a typical form of the microphone array beamformer. An improved noise estimation method is developed to estimate the noise in the fixed branch output of each sub-band GSC from its adaptive branch output. Then the fixed branch output is filtered by an optimal filter which is constructed according to a GMM model trained by clean speeches and an online-estimated noise model. Experimental results show that the proposed method achieves significant improvement over the conventional sub-band GSC and outperforms several speech enhancement methods in different noisy environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 7th International Conference on Digital Home (ICDH)

自引率

0.00%

发文量