Audio surveillance of roads using deep learning and autoencoder-based sample weight initialization

Z. Mnasri, S. Rovetta, F. Masulli
{"title":"Audio surveillance of roads using deep learning and autoencoder-based sample weight initialization","authors":"Z. Mnasri, S. Rovetta, F. Masulli","doi":"10.1109/MELECON48756.2020.9140594","DOIUrl":null,"url":null,"abstract":"Road safety has always been a major concern, where a variety of competences is involved, ranging from government and local authorities, medical caregivers and other service provides. Prompt intervention in emergency cases is one of the key factors to minimize damages. Therefore, real-time surveillance is proposed as an efficient means to detect problems on roads. Video surveillance alone is not enough to detect serious accidents, since any hazardous behavior on the road may be confused with an accident, which may lead to many wrong alarms. Instead, audio processing has the potential to recognize sounds coming from different sources, such as crashes, tire skidding, harsh braking, etc. Since a few years, deep learning has become the state of the art for audio events detection. However, the usual dominance of absence of events in road surveillance would make a bias in the training process. Therefore, a novel method to initialize the neural network's weights using an autoencoder trained only on event-related data is used to balance the data distribution.","PeriodicalId":268311,"journal":{"name":"2020 IEEE 20th Mediterranean Electrotechnical Conference ( MELECON)","volume":"237 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 20th Mediterranean Electrotechnical Conference ( MELECON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MELECON48756.2020.9140594","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Road safety has always been a major concern, where a variety of competences is involved, ranging from government and local authorities, medical caregivers and other service provides. Prompt intervention in emergency cases is one of the key factors to minimize damages. Therefore, real-time surveillance is proposed as an efficient means to detect problems on roads. Video surveillance alone is not enough to detect serious accidents, since any hazardous behavior on the road may be confused with an accident, which may lead to many wrong alarms. Instead, audio processing has the potential to recognize sounds coming from different sources, such as crashes, tire skidding, harsh braking, etc. Since a few years, deep learning has become the state of the art for audio events detection. However, the usual dominance of absence of events in road surveillance would make a bias in the training process. Therefore, a novel method to initialize the neural network's weights using an autoencoder trained only on event-related data is used to balance the data distribution.
使用深度学习和基于自动编码器的样本权重初始化的道路音频监控
道路安全一直是一个主要问题,涉及政府和地方当局、医疗护理人员和其他服务提供者等各种能力。在紧急情况下及时干预是减少损害的关键因素之一。因此,实时监控作为一种检测道路问题的有效手段被提出。视频监控本身不足以检测严重事故,因为道路上的任何危险行为都可能与事故混淆,这可能导致许多错误的警报。相反,音频处理具有识别来自不同来源的声音的潜力,例如碰撞,轮胎打滑,猛烈刹车等。几年来,深度学习已经成为音频事件检测的最新技术。然而,在道路监控中,事件缺失通常占主导地位,这将在训练过程中产生偏见。因此,采用一种新的方法来初始化神经网络的权值,该方法使用一个只训练事件相关数据的自编码器来平衡数据分布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信