DCH-Net: Densely Connected Highway Convolution Neural Network for Environmental Sound Classification

Xiaohu Zhang, Yuexian Zou
{"title":"DCH-Net: Densely Connected Highway Convolution Neural Network for Environmental Sound Classification","authors":"Xiaohu Zhang, Yuexian Zou","doi":"10.1109/ICDSP.2018.8631632","DOIUrl":null,"url":null,"abstract":"Environmental Sound Classification (ESC) plays a vital role in the field of machine auditory scene. Recently, the Highway Network CNN model has achieved the state-of-art results via solving the vanishing-gradient problem of much deeper CNN. However, carefully analyzing the Highway Network model shows that the Highway Network model lacks ability to maximize information flow between layers, which is essentially benefits the discriminative representation of acoustic events. Besides, the Highway Network model size is larger than 20MB for ESC task, which is still large for mobile applications. Regarding to these two issues, in this study, we propose a novel Densely Connected Highway Convolutional Network (DCH-Net) model for ESC task. Specifically, a densely highway module is developed which is able to ensure the maximum information flow between layers by connecting all layers directly with each other. Besides, to reduce the model size, a global average pooling layer is designed which replaces the traditional fully connection layers and the parameters of the model is greatly reduced. Experimental results show that our DCH-Net ESC model achieves accuracy of 69% and 90% on ESC50 and ESCIO dataset respectively, which is 2% and 10% higher than that of Highway Network based Highway networks ESC model. Meanwhile our model size is only 2MB.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2018.8631632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Environmental Sound Classification (ESC) plays a vital role in the field of machine auditory scene. Recently, the Highway Network CNN model has achieved the state-of-art results via solving the vanishing-gradient problem of much deeper CNN. However, carefully analyzing the Highway Network model shows that the Highway Network model lacks ability to maximize information flow between layers, which is essentially benefits the discriminative representation of acoustic events. Besides, the Highway Network model size is larger than 20MB for ESC task, which is still large for mobile applications. Regarding to these two issues, in this study, we propose a novel Densely Connected Highway Convolutional Network (DCH-Net) model for ESC task. Specifically, a densely highway module is developed which is able to ensure the maximum information flow between layers by connecting all layers directly with each other. Besides, to reduce the model size, a global average pooling layer is designed which replaces the traditional fully connection layers and the parameters of the model is greatly reduced. Experimental results show that our DCH-Net ESC model achieves accuracy of 69% and 90% on ESC50 and ESCIO dataset respectively, which is 2% and 10% higher than that of Highway Network based Highway networks ESC model. Meanwhile our model size is only 2MB.
DCH-Net:用于环境声音分类的密集连接公路卷积神经网络
环境声分类(ESC)在机器听觉场景领域中起着至关重要的作用。最近,高速公路网CNN模型通过解决更深层的CNN的梯度消失问题,取得了最先进的结果。然而,仔细分析公路网模型表明,公路网模型缺乏最大化层间信息流的能力,而这本质上有利于声学事件的判别表示。此外,高速公路网络模型大小大于20MB的ESC任务,这仍然是大的移动应用程序。针对这两个问题,在本研究中,我们提出了一种用于ESC任务的新型密集连接公路卷积网络(DCH-Net)模型。具体来说,开发了一个密集的高速公路模块,通过各层之间的直接连接,保证了各层之间最大程度的信息流。此外,为了减小模型尺寸,设计了一个全局平均池化层,取代了传统的全连接层,大大减少了模型的参数。实验结果表明,DCH-Net ESC模型在ESC50和ESCIO数据集上的准确率分别达到69%和90%,比基于公路网的ESC模型分别提高了2%和10%。同时我们的模型大小只有2MB。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信