Foreground-Background Audio Separation using Spectral Peaks based Time-Frequency Masks

2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI:10.1109/SPCOM55316.2022.9840850

Mrinmoy Bhattacharjee, S. Prasanna, P. Guha

引用次数: 0

Abstract

The separation of foreground and background sounds can serve as a useful preprocessing step when dealing with real-world audio signals. This work proposes a foreground-background audio separation (FBAS) algorithm that uses spectral peak information for generating time-frequency masks. The proposed algorithm can work without training, is relatively fast, and provides decent audio separation. As a specific use case, the proposed algorithm is used to extract clean foreground signals from noisy speech signals. The quality of foreground speech separated with FBAS is compared with the output of a state-of-the-art deep-learning-based speech enhancement system. Various subjective and objective evaluation measures are computed, which indicate that the proposed FBAS algorithm is effective.

查看原文本刊更多论文

基于谱峰时频掩模的前背景音频分离

在处理真实音频信号时，前景和背景声音的分离可以作为有用的预处理步骤。本工作提出了一种前景背景音频分离(FBAS)算法，该算法使用频谱峰值信息生成时频掩模。该算法无需训练即可工作，速度相对较快，并提供了良好的音频分离。作为具体用例，本文提出的算法用于从噪声语音信号中提取干净的前景信号。用FBAS分离的前景语音质量与最先进的基于深度学习的语音增强系统的输出进行了比较。计算了各种主观和客观评价指标，表明了所提出的FBAS算法是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Conference on Signal Processing and Communications (SPCOM)

自引率

0.00%

发文量