用于双声道场景中声学回声消除的时序卷积网络

IF 0.9 4区物理与天体物理 Q4 ACOUSTICS

Acoustical Physics Pub Date : 2024-02-28 DOI:10.1134/S1063771023600195

Jinfang Zeng, Chao Li, Jiamei Huang, Wei Li

{"title":"用于双声道场景中声学回声消除的时序卷积网络","authors":"Jinfang Zeng, Chao Li, Jiamei Huang, Wei Li","doi":"10.1134/S1063771023600195","DOIUrl":null,"url":null,"abstract":"In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference.","PeriodicalId":455,"journal":{"name":"Acoustical Physics","volume":"69 6","pages":"897 - 906"},"PeriodicalIF":0.9000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios\",\"authors\":\"Jinfang Zeng, Chao Li, Jiamei Huang, Wei Li\",\"doi\":\"10.1134/S1063771023600195\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference.\",\"PeriodicalId\":455,\"journal\":{\"name\":\"Acoustical Physics\",\"volume\":\"69 6\",\"pages\":\"897 - 906\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acoustical Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1134/S1063771023600195\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustical Physics","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1134/S1063771023600195","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

摘要

摘要在通信系统中，当扬声器和麦克风耦合在一起时，会产生声学回声。随着移动通信和在线会议需求的不断增长，解决通信系统中的声学回声消除（AEC）问题迫在眉睫。由于非线性失真、背景噪声等原因的存在，传统的声学回声消除方法已不能很好地解决回声消除问题。虽然一些传统方法考虑了非线性失真问题，但回声抑制效果仍不理想。本文提出了一种基于频域掩码的回声消除方法，将其定义为有监督的语音分离问题。通过使用时序卷积网络和最优比掩码来获得预测掩码，并使用 SISNR 作为损失函数，结果表明可以有效地减少复音、非线性失真和背景噪声中的回声。这种方法是 AEC 领域的一大进步，可用于移动通信和在线会议。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios

查看原文本刊更多论文

Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios

In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Acoustical Physics 物理-声学

CiteScore

1.60

自引率

50.00%

发文量

审稿时长

3.5 months

期刊介绍： Acoustical Physics is an international peer reviewed journal published with the participation of the Russian Academy of Sciences. It covers theoretical and experimental aspects of basic and applied acoustics: classical problems of linear acoustics and wave theory; nonlinear acoustics; physical acoustics; ocean acoustics and hydroacoustics; atmospheric and aeroacoustics; acoustics of structurally inhomogeneous solids; geological acoustics; acoustical ecology, noise and vibration; chamber acoustics, musical acoustics; acoustic signals processing, computer simulations; acoustics of living systems, biomedical acoustics; physical principles of engineering acoustics. The journal publishes critical reviews, original articles, short communications, and letters to the editor. It covers theoretical and experimental aspects of basic and applied acoustics. The journal welcomes manuscripts from all countries in the English or Russian language.