VoiceFind

Proceedings of the 1st ACM International Workshop on Intelligent Acoustic Systems and Applications Pub Date : 2022-06-25 DOI:10.1145/3539490.3539600

Irtaza Shahid, Y. Bai, Nakul Garg, Nirupam Roy

引用次数: 6

Abstract

Robust speech enhancement is a key requirement for many emerging applications. It is challenging to recover clear speech in commodity devices, especially in noisy real-world scenarios. In this paper, we propose VoiceFind, which uses only two microphones to spatial filter the desired speech from all interference. Furthermore, to improve the intelligibility of the speech after filtering, we design a Conditional Generative Adversarial Network (cGAN) to reconstruct the desired speech from environmental noises and interference speeches. This is an early attempt to explore this direction. Results from simulation and real-world experiments show promise.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 1st ACM International Workshop on Intelligent Acoustic Systems and Applications

自引率

0.00%

发文量