Sound event classification using neural networks and feature selection based methods

Ammar Ahmed, Y. Serrestou, K. Raoof, J. Diouris
{"title":"Sound event classification using neural networks and feature selection based methods","authors":"Ammar Ahmed, Y. Serrestou, K. Raoof, J. Diouris","doi":"10.1109/EIT51626.2021.9491869","DOIUrl":null,"url":null,"abstract":"Sound events, emanate from several sources, are ubiquitous and manifest them selves with different characteristics in different environments. With the advancement of deep learning models and existence of ever increasing training data, the automatic recognition and classification task of these events has improved significantly over the years . Traditionally, environmental sound event recognition systems are developed by keeping the generic database that is readily available, while sound events generated in a particular environment are not focused. Another issue of training of large neural networks requires huge amount of parameters and training them costs computational resources. To tackle this issue, we firstly built a custom database consisting of events occurring outside and around smart homes or building. The sound events such as rain, wind, human gait, and passing of vehicles. We propose the use of a sequential feature selection technique for for reduction of dimension of features extracted with MFCC. Selected features are used for training recurrent neural network (RNN) on aforementioned sound events. We compared the results of our proposed method with the same RNN trained with MFCC features and convolutional neural networks (CNN) trained with mel frequency band (MFB) features. Our proposed system performed with high accuracy in former case but slightly better compared to CNN in achieving higher classification accuracy and a significant reduction of parameters during training with the proposed system.","PeriodicalId":162816,"journal":{"name":"2021 IEEE International Conference on Electro Information Technology (EIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Electro Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT51626.2021.9491869","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Sound events, emanate from several sources, are ubiquitous and manifest them selves with different characteristics in different environments. With the advancement of deep learning models and existence of ever increasing training data, the automatic recognition and classification task of these events has improved significantly over the years . Traditionally, environmental sound event recognition systems are developed by keeping the generic database that is readily available, while sound events generated in a particular environment are not focused. Another issue of training of large neural networks requires huge amount of parameters and training them costs computational resources. To tackle this issue, we firstly built a custom database consisting of events occurring outside and around smart homes or building. The sound events such as rain, wind, human gait, and passing of vehicles. We propose the use of a sequential feature selection technique for for reduction of dimension of features extracted with MFCC. Selected features are used for training recurrent neural network (RNN) on aforementioned sound events. We compared the results of our proposed method with the same RNN trained with MFCC features and convolutional neural networks (CNN) trained with mel frequency band (MFB) features. Our proposed system performed with high accuracy in former case but slightly better compared to CNN in achieving higher classification accuracy and a significant reduction of parameters during training with the proposed system.
基于神经网络和特征选择的声音事件分类方法
声音事件有多种来源,无处不在,在不同的环境中表现出不同的特征。随着深度学习模型的进步和训练数据的不断增加,这些事件的自动识别和分类任务在过去几年里有了显著的提高。传统上,环境声音事件识别系统是通过保持随时可用的通用数据库来开发的,而在特定环境中产生的声音事件则没有重点。大型神经网络训练的另一个问题是需要大量的参数,并且需要耗费大量的计算资源。为了解决这个问题,我们首先建立了一个自定义数据库,由智能家居或建筑外部和周围发生的事件组成。声音事件,如雨、风、人的步态和车辆的通过。我们提出了使用序列特征选择技术来降低用MFCC提取的特征的维数。选择的特征用于训练循环神经网络(RNN)对上述声音事件。我们将该方法的结果与使用MFCC特征训练的RNN和使用mel频带(MFB)特征训练的卷积神经网络(CNN)进行了比较。在前一种情况下,我们提出的系统具有很高的准确率,但与CNN相比,在使用所提出的系统进行训练时,在实现更高的分类准确率和显著减少参数方面略好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信