1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type

2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA) Pub Date : 2020-09-23 DOI:10.23919/spa50552.2020.9241256

A. Kurowski, Szymon Zaporowski, A. Czyżewski

{"title":"1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type","authors":"A. Kurowski, Szymon Zaporowski, A. Czyżewski","doi":"10.23919/spa50552.2020.9241256","DOIUrl":null,"url":null,"abstract":"A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence. The neural network architecture employed for these tasks is a 1D convolutional network. Two types of classifiers are tested: one analyzing only the current audio frame and one analyzing three consecutive audio frames that allow us to take into account the context of the middle frame occurrence. The neural network is trained on datasets derived for four frame lengths, namely 50 ms, 100 ms, 200 ms, and 400 ms. Results of statistical analysis of both network classification accuracy are presented. The context-aware variant of a neural network performed better in a statistically significant manner for three out of four investigated frame lengths.","PeriodicalId":157578,"journal":{"name":"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/spa50552.2020.9241256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence. The neural network architecture employed for these tasks is a 1D convolutional network. Two types of classifiers are tested: one analyzing only the current audio frame and one analyzing three consecutive audio frames that allow us to take into account the context of the middle frame occurrence. The neural network is trained on datasets derived for four frame lengths, namely 50 ms, 100 ms, 200 ms, and 400 ms. Results of statistical analysis of both network classification accuracy are presented. The context-aware variant of a neural network performed better in a statistically significant manner for three out of four investigated frame lengths.

查看原文本刊更多论文

一维卷积上下文感知架构用于过往车辆类型的声感知和识别

本文提出了一种基于在道路附近录制的音频来感知和识别某类车辆的网络架构。所分析的道路交通包括乘用车和重型车辆。不包含车辆经过声音的录音摘录也被考虑在内，并标记为包含沉默的录音。用于这些任务的神经网络架构是一维卷积网络。测试了两种类型的分类器:一种只分析当前音频帧，另一种分析三个连续的音频帧，允许我们考虑中间帧发生的上下文。神经网络在四种帧长度的数据集上进行训练，即50ms, 100ms, 200ms和400ms。给出了两种网络分类准确率的统计分析结果。神经网络的上下文感知变体在四分之三的调查帧长度中以统计显著的方式表现得更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)

自引率

0.00%

发文量