用于双声道场景中声学回声消除的时序卷积网络

IF 0.9 4区 物理与天体物理 Q4 ACOUSTICS
Jinfang Zeng, Chao Li, Jiamei Huang, Wei Li
{"title":"用于双声道场景中声学回声消除的时序卷积网络","authors":"Jinfang Zeng,&nbsp;Chao Li,&nbsp;Jiamei Huang,&nbsp;Wei Li","doi":"10.1134/S1063771023600195","DOIUrl":null,"url":null,"abstract":"<p>In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference<i>.</i></p>","PeriodicalId":455,"journal":{"name":"Acoustical Physics","volume":"69 6","pages":"897 - 906"},"PeriodicalIF":0.9000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios\",\"authors\":\"Jinfang Zeng,&nbsp;Chao Li,&nbsp;Jiamei Huang,&nbsp;Wei Li\",\"doi\":\"10.1134/S1063771023600195\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference<i>.</i></p>\",\"PeriodicalId\":455,\"journal\":{\"name\":\"Acoustical Physics\",\"volume\":\"69 6\",\"pages\":\"897 - 906\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acoustical Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1134/S1063771023600195\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustical Physics","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1134/S1063771023600195","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

摘要

摘要 在通信系统中,当扬声器和麦克风耦合在一起时,会产生声学回声。随着移动通信和在线会议需求的不断增长,解决通信系统中的声学回声消除(AEC)问题迫在眉睫。由于非线性失真、背景噪声等原因的存在,传统的声学回声消除方法已不能很好地解决回声消除问题。虽然一些传统方法考虑了非线性失真问题,但回声抑制效果仍不理想。本文提出了一种基于频域掩码的回声消除方法,将其定义为有监督的语音分离问题。通过使用时序卷积网络和最优比掩码来获得预测掩码,并使用 SISNR 作为损失函数,结果表明可以有效地减少复音、非线性失真和背景噪声中的回声。这种方法是 AEC 领域的一大进步,可用于移动通信和在线会议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios

Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios

In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Acoustical Physics
Acoustical Physics 物理-声学
CiteScore
1.60
自引率
50.00%
发文量
58
审稿时长
3.5 months
期刊介绍: Acoustical Physics is an international peer reviewed journal published with the participation of the Russian Academy of Sciences. It covers theoretical and experimental aspects of basic and applied acoustics: classical problems of linear acoustics and wave theory; nonlinear acoustics; physical acoustics; ocean acoustics and hydroacoustics; atmospheric and aeroacoustics; acoustics of structurally inhomogeneous solids; geological acoustics; acoustical ecology, noise and vibration; chamber acoustics, musical acoustics; acoustic signals processing, computer simulations; acoustics of living systems, biomedical acoustics; physical principles of engineering acoustics. The journal publishes critical reviews, original articles, short communications, and letters to the editor. It covers theoretical and experimental aspects of basic and applied acoustics. The journal welcomes manuscripts from all countries in the English or Russian language.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信