Sound Event Detection with Adaptive Frequency Selection

2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) Pub Date : 2021-05-17 DOI:10.1109/WASPAA52581.2021.9632798

Zhepei Wang, Jonah Casebeer, Adam Clemmitt, Efthymios Tzinis, P. Smaragdis

引用次数: 1

Abstract

In this work, we present HIDACT, a novel network architecture for adaptive computation for efficiently recognizing acoustic events. We evaluate the model on a sound event detection task where we train it to adaptively process frequency bands. The model learns to adapt to the input without requesting all frequency sub-bands provided. It can make confident predictions within fewer processing steps, hence reducing the amount of computation. Experimental results show that HIDACT has comparable performance to baseline models with more parameters and higher computational complexity. Furthermore, the model can adjust the amount of computation based on the data and computational budget.

查看原文本刊更多论文

声音事件检测与自适应频率选择

在这项工作中，我们提出了一种新的网络结构HIDACT，用于自适应计算，以有效地识别声事件。我们在声音事件检测任务中评估模型，我们训练它自适应处理频带。该模型学习适应输入，而不要求提供所有的频率子带。它可以在更少的处理步骤内做出可靠的预测，从而减少了计算量。实验结果表明，HIDACT具有与参数较多、计算复杂度较高的基线模型相当的性能。此外，该模型还可以根据数据和计算预算来调整计算量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

自引率

0.00%

发文量