I-SED: An Interactive Sound Event Detector

Proceedings of the 22nd International Conference on Intelligent User Interfaces Pub Date : 2017-03-07 DOI:10.1145/3025171.3025231

B. Kim, Bryan Pardo

引用次数: 18

Abstract

Tagging of sound events is essential in many research areas. However, finding sound events and labeling them within a long audio file is tedious and time-consuming. Building an automatic recognition system using machine learning techniques is often not feasible because it requires a large number of human-labeled training examples and fine tuning the model for a specific application. Fully automated labeling is also not reliable enough for all uses. We present I-SED, an interactive sound detection interface using a human-in-the-loop approach that lets a user reduce the time required to label audio that is tediously long (e.g. 20 hours) to do manually and has too few prior labeled examples (e.g. one) to train a state-of-the-art machine audio labeling system. We performed a human-subject study to validate its effectiveness and the results showed that our tool helped participants label all target sound events within a recording twice as fast as labeling them manually.

查看原文本刊更多论文

I-SED:一个交互式声音事件检测器

声音事件的标注在许多研究领域都是必不可少的。然而，寻找声音事件并在长音频文件中标记它们是乏味且耗时的。使用机器学习技术构建自动识别系统通常是不可行的，因为它需要大量人工标记的训练示例，并对特定应用的模型进行微调。全自动标签也不是所有用途都足够可靠。我们提出了I-SED，一个使用人在循环方法的交互式声音检测界面，可以让用户减少标记音频所需的时间，这些音频手动标记的时间非常长(例如20小时)，并且之前标记的示例太少(例如一个)，无法训练最先进的机器音频标记系统。我们进行了一项人类受试者研究来验证其有效性，结果表明，我们的工具帮助参与者标记录音中所有目标声音事件的速度是手动标记速度的两倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 22nd International Conference on Intelligent User Interfaces

自引率

0.00%

发文量