1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type

A. Kurowski, Szymon Zaporowski, A. Czyżewski
{"title":"1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type","authors":"A. Kurowski, Szymon Zaporowski, A. Czyżewski","doi":"10.23919/spa50552.2020.9241256","DOIUrl":null,"url":null,"abstract":"A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence. The neural network architecture employed for these tasks is a 1D convolutional network. Two types of classifiers are tested: one analyzing only the current audio frame and one analyzing three consecutive audio frames that allow us to take into account the context of the middle frame occurrence. The neural network is trained on datasets derived for four frame lengths, namely 50 ms, 100 ms, 200 ms, and 400 ms. Results of statistical analysis of both network classification accuracy are presented. The context-aware variant of a neural network performed better in a statistically significant manner for three out of four investigated frame lengths.","PeriodicalId":157578,"journal":{"name":"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/spa50552.2020.9241256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence. The neural network architecture employed for these tasks is a 1D convolutional network. Two types of classifiers are tested: one analyzing only the current audio frame and one analyzing three consecutive audio frames that allow us to take into account the context of the middle frame occurrence. The neural network is trained on datasets derived for four frame lengths, namely 50 ms, 100 ms, 200 ms, and 400 ms. Results of statistical analysis of both network classification accuracy are presented. The context-aware variant of a neural network performed better in a statistically significant manner for three out of four investigated frame lengths.
一维卷积上下文感知架构用于过往车辆类型的声感知和识别
本文提出了一种基于在道路附近录制的音频来感知和识别某类车辆的网络架构。所分析的道路交通包括乘用车和重型车辆。不包含车辆经过声音的录音摘录也被考虑在内,并标记为包含沉默的录音。用于这些任务的神经网络架构是一维卷积网络。测试了两种类型的分类器:一种只分析当前音频帧,另一种分析三个连续的音频帧,允许我们考虑中间帧发生的上下文。神经网络在四种帧长度的数据集上进行训练,即50ms, 100ms, 200ms和400ms。给出了两种网络分类准确率的统计分析结果。神经网络的上下文感知变体在四分之三的调查帧长度中以统计显著的方式表现得更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信