{"title":"用于双声道场景中声学回声消除的时序卷积网络","authors":"Jinfang Zeng, Chao Li, Jiamei Huang, Wei Li","doi":"10.1134/S1063771023600195","DOIUrl":null,"url":null,"abstract":"<p>In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference<i>.</i></p>","PeriodicalId":455,"journal":{"name":"Acoustical Physics","volume":"69 6","pages":"897 - 906"},"PeriodicalIF":0.9000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios\",\"authors\":\"Jinfang Zeng, Chao Li, Jiamei Huang, Wei Li\",\"doi\":\"10.1134/S1063771023600195\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference<i>.</i></p>\",\"PeriodicalId\":455,\"journal\":{\"name\":\"Acoustical Physics\",\"volume\":\"69 6\",\"pages\":\"897 - 906\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acoustical Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1134/S1063771023600195\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustical Physics","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1134/S1063771023600195","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ACOUSTICS","Score":null,"Total":0}
Temporal Convolutional Network for Acoustic Echo Cancellation in Double-Talk Scenarios
In communication systems, when the loudspeaker and the microphone are coupled together, it creates acoustic echoes. With the increasing demand for mobile communication and online conference, it is urgent to solve the problem of acoustic echo cancellation (AEC) in communication systems. Due to the existence of nonlinear distortion, background noise and other reasons, traditional AEC methods can no longer solve the problem of echo cancellation well. Although some traditional methods consider the problem of nonlinear distortion, the effect of echo suppression is still not ideal. In this paper, we propose an echo cancellation method based on frequency domain mask, which is defined as a supervised speech separation problem. The use of the temporal convolutional network and optimal ratio mask to obtain the predicted mask, as well as the use of SISNR as the loss function, have been shown to effectively reduce echo in double-talk, nonlinear distortion, and background noise. This method is a significant advancement in the field of AEC and can be used in for mobile communication and online conference.
期刊介绍:
Acoustical Physics is an international peer reviewed journal published with the participation of the Russian Academy of Sciences. It covers theoretical and experimental aspects of basic and applied acoustics: classical problems of linear acoustics and wave theory; nonlinear acoustics; physical acoustics; ocean acoustics and hydroacoustics; atmospheric and aeroacoustics; acoustics of structurally inhomogeneous solids; geological acoustics; acoustical ecology, noise and vibration; chamber acoustics, musical acoustics; acoustic signals processing, computer simulations; acoustics of living systems, biomedical acoustics; physical principles of engineering acoustics. The journal publishes critical reviews, original articles, short communications, and letters to the editor. It covers theoretical and experimental aspects of basic and applied acoustics. The journal welcomes manuscripts from all countries in the English or Russian language.