Microphone array speech enhancement based on optimized IMCRA

IF 0.3 4区工程技术 Q4 ACOUSTICS

Noise Control Engineering Journal Pub Date : 2021-11-01 DOI:10.3397/1/376944

Qiuying Li, Tao Zhang, Yanzhang Geng, Zhenke Gao

{"title":"Microphone array speech enhancement based on optimized IMCRA","authors":"Qiuying Li, Tao Zhang, Yanzhang Geng, Zhenke Gao","doi":"10.3397/1/376944","DOIUrl":null,"url":null,"abstract":"Microphone array speech enhancement algorithm uses temporal and spatial informa- tion to improve the performance of speech noise reduction significantly. By combining noise estimation algorithm with microphone array speech enhancement, the accuracy of noise estimation is improved, and\n the computation is reduced. In traditional noise es- timation algorithms, the noise power spectrum is not updated in the presence of speech, which leads to the delay and deviation of noise spectrum estimation. An optimized im- proved minimum controlled recursion average speech enhancement\n algorithm, based on a microphone matrix is proposed in this paper. It consists of three parts. The first part is the preprocessing, divided into two branches: the upper branch enhances the speech signal, and the lower branch gets the noise. The second part is the optimized improved minimum\n controlled recursive averaging. The noise power spectrum is updated not only in the non-speech segments but also in the speech segments. Fi- nally, according to the estimated noise power spectrum, the minimum mean-square error log-spectral amplitude algorithm is used to enhance speech. Testing\n data are from TIMIT and Noisex-92 databases. Short-time objective intelligibility and seg- mental signal-to-noise ratio are chosen as evaluation metrics. Experimental results show that the proposed speech enhancement algorithm can improve the segmental signal-to-noise ratio and short-time\n objective intelligibility for various noise types at different signal-to-noise ratio levels.","PeriodicalId":49748,"journal":{"name":"Noise Control Engineering Journal","volume":" ","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Noise Control Engineering Journal","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3397/1/376944","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Microphone array speech enhancement algorithm uses temporal and spatial informa- tion to improve the performance of speech noise reduction significantly. By combining noise estimation algorithm with microphone array speech enhancement, the accuracy of noise estimation is improved, and the computation is reduced. In traditional noise es- timation algorithms, the noise power spectrum is not updated in the presence of speech, which leads to the delay and deviation of noise spectrum estimation. An optimized im- proved minimum controlled recursion average speech enhancement algorithm, based on a microphone matrix is proposed in this paper. It consists of three parts. The first part is the preprocessing, divided into two branches: the upper branch enhances the speech signal, and the lower branch gets the noise. The second part is the optimized improved minimum controlled recursive averaging. The noise power spectrum is updated not only in the non-speech segments but also in the speech segments. Fi- nally, according to the estimated noise power spectrum, the minimum mean-square error log-spectral amplitude algorithm is used to enhance speech. Testing data are from TIMIT and Noisex-92 databases. Short-time objective intelligibility and seg- mental signal-to-noise ratio are chosen as evaluation metrics. Experimental results show that the proposed speech enhancement algorithm can improve the segmental signal-to-noise ratio and short-time objective intelligibility for various noise types at different signal-to-noise ratio levels.

查看原文本刊更多论文

基于优化IMCRA的麦克风阵列语音增强

麦克风阵列语音增强算法利用时间和空间信息，显著提高了语音降噪性能。通过将噪声估计算法与麦克风阵列语音增强相结合，提高了噪声估计的精度，减少了计算量。传统的噪声估计算法在语音存在的情况下不更新噪声功率谱，导致噪声谱估计的延迟和偏差。提出了一种优化改进的基于麦克风矩阵的最小控制递归平均语音增强算法。它由三部分组成。第一部分是预处理，分为两个支路:上支路对语音信号进行增强，下支路对噪声进行处理。第二部分是优化改进的最小控制递归平均。噪声功率谱不仅在非语音段更新，而且在语音段更新。最后，根据估计的噪声功率谱，采用最小均方误差对数谱幅值算法对语音进行增强。测试数据来自TIMIT和Noisex-92数据库。选取短时客观可理解度和区段信噪比作为评价指标。实验结果表明，本文提出的语音增强算法能够提高不同信噪比水平下不同噪声类型的分段信噪比和短时客观可理解度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Noise Control Engineering Journal 工程技术-工程：综合

CiteScore

0.90

自引率

25.00%

发文量

审稿时长

3 months

期刊介绍： NCEJ is the pre-eminent academic journal of noise control. It is the International Journal of the Institute of Noise Control Engineering of the USA. It is also produced with the participation and assistance of the Korean Society of Noise and Vibration Engineering (KSNVE). NCEJ reaches noise control professionals around the world, covering over 50 national noise control societies and institutes. INCE encourages you to submit your next paper to NCEJ. Choosing NCEJ: Provides the opportunity to reach a global audience of NCE professionals, academics, and students; Enhances the prestige of your work; Validates your work by formal peer review.