Audio surveillance of roads using deep learning and autoencoder-based sample weight initialization

2020 IEEE 20th Mediterranean Electrotechnical Conference ( MELECON) Pub Date : 2020-06-01 DOI:10.1109/MELECON48756.2020.9140594

Z. Mnasri, S. Rovetta, F. Masulli

{"title":"Audio surveillance of roads using deep learning and autoencoder-based sample weight initialization","authors":"Z. Mnasri, S. Rovetta, F. Masulli","doi":"10.1109/MELECON48756.2020.9140594","DOIUrl":null,"url":null,"abstract":"Road safety has always been a major concern, where a variety of competences is involved, ranging from government and local authorities, medical caregivers and other service provides. Prompt intervention in emergency cases is one of the key factors to minimize damages. Therefore, real-time surveillance is proposed as an efficient means to detect problems on roads. Video surveillance alone is not enough to detect serious accidents, since any hazardous behavior on the road may be confused with an accident, which may lead to many wrong alarms. Instead, audio processing has the potential to recognize sounds coming from different sources, such as crashes, tire skidding, harsh braking, etc. Since a few years, deep learning has become the state of the art for audio events detection. However, the usual dominance of absence of events in road surveillance would make a bias in the training process. Therefore, a novel method to initialize the neural network's weights using an autoencoder trained only on event-related data is used to balance the data distribution.","PeriodicalId":268311,"journal":{"name":"2020 IEEE 20th Mediterranean Electrotechnical Conference ( MELECON)","volume":"237 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 20th Mediterranean Electrotechnical Conference ( MELECON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MELECON48756.2020.9140594","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Road safety has always been a major concern, where a variety of competences is involved, ranging from government and local authorities, medical caregivers and other service provides. Prompt intervention in emergency cases is one of the key factors to minimize damages. Therefore, real-time surveillance is proposed as an efficient means to detect problems on roads. Video surveillance alone is not enough to detect serious accidents, since any hazardous behavior on the road may be confused with an accident, which may lead to many wrong alarms. Instead, audio processing has the potential to recognize sounds coming from different sources, such as crashes, tire skidding, harsh braking, etc. Since a few years, deep learning has become the state of the art for audio events detection. However, the usual dominance of absence of events in road surveillance would make a bias in the training process. Therefore, a novel method to initialize the neural network's weights using an autoencoder trained only on event-related data is used to balance the data distribution.

查看原文本刊更多论文

使用深度学习和基于自动编码器的样本权重初始化的道路音频监控

道路安全一直是一个主要问题，涉及政府和地方当局、医疗护理人员和其他服务提供者等各种能力。在紧急情况下及时干预是减少损害的关键因素之一。因此，实时监控作为一种检测道路问题的有效手段被提出。视频监控本身不足以检测严重事故，因为道路上的任何危险行为都可能与事故混淆，这可能导致许多错误的警报。相反，音频处理具有识别来自不同来源的声音的潜力，例如碰撞，轮胎打滑，猛烈刹车等。几年来，深度学习已经成为音频事件检测的最新技术。然而，在道路监控中，事件缺失通常占主导地位，这将在训练过程中产生偏见。因此，采用一种新的方法来初始化神经网络的权值，该方法使用一个只训练事件相关数据的自编码器来平衡数据分布。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE 20th Mediterranean Electrotechnical Conference ( MELECON)

自引率

0.00%

发文量