A Deep Learning Based Sound Event Location and Detection Algorithm Using Convolutional Recurrent Neural Network

Hongxia Zhu, Jun Yan
{"title":"A Deep Learning Based Sound Event Location and Detection Algorithm Using Convolutional Recurrent Neural Network","authors":"Hongxia Zhu, Jun Yan","doi":"10.1109/cits55221.2022.9832991","DOIUrl":null,"url":null,"abstract":"With the application of sound event detection in more and more fields, an accurate sound event location and detection system has attracted wide attention. In this paper, we propose a sound event location and detection algorithm based on convolutional recurrent neural network (CRNN). In the offline phase, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm is used to remove the noise of unknown distribution of the collected data set. Then, we extract filter banks (FBANK) features and cross correlation (GCC) features of each channel and fuse them. Finally, the features are input to CRNN which combined with soft attention mechanism to train the model. The CRNN is a multi-task learning framework. For sound category and sound location, it is realized by classification task and regression task respectively. Experimental results show that the algorithm is effective and can provide accurate category estimation and location estimation.","PeriodicalId":136239,"journal":{"name":"2022 International Conference on Computer, Information and Telecommunication Systems (CITS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Computer, Information and Telecommunication Systems (CITS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cits55221.2022.9832991","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the application of sound event detection in more and more fields, an accurate sound event location and detection system has attracted wide attention. In this paper, we propose a sound event location and detection algorithm based on convolutional recurrent neural network (CRNN). In the offline phase, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm is used to remove the noise of unknown distribution of the collected data set. Then, we extract filter banks (FBANK) features and cross correlation (GCC) features of each channel and fuse them. Finally, the features are input to CRNN which combined with soft attention mechanism to train the model. The CRNN is a multi-task learning framework. For sound category and sound location, it is realized by classification task and regression task respectively. Experimental results show that the algorithm is effective and can provide accurate category estimation and location estimation.
基于深度学习的卷积递归神经网络声音事件定位与检测算法
随着声事件检测在越来越多领域的应用,准确的声事件定位与检测系统引起了人们的广泛关注。本文提出了一种基于卷积递归神经网络(CRNN)的声音事件定位与检测算法。在离线阶段,采用自适应噪声完全集成经验模态分解(CEEMDAN)算法去除采集数据集未知分布的噪声。然后,提取各信道的滤波器组(FBANK)特征和互相关(GCC)特征并进行融合。最后将特征输入到CRNN中,结合软注意机制对模型进行训练。CRNN是一个多任务学习框架。对于声音类别和声音定位,分别通过分类任务和回归任务实现。实验结果表明,该算法是有效的,可以提供准确的类别估计和位置估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信