CCTV Latent Representations for Reducing Accident Response Times

Proceedings of the 6th International Conference on Digital Signal Processing Pub Date : 2022-02-25 DOI:10.1145/3529570.3529582

Shafinul Haque

{"title":"CCTV Latent Representations for Reducing Accident Response Times","authors":"Shafinul Haque","doi":"10.1145/3529570.3529582","DOIUrl":null,"url":null,"abstract":"Emergency Medical Services’ response times to accidents are crucial to saving lives in vehicle accidents. Using deep learning to instantly detect accidents in public cameras and automatically alerting authorities could help this issue. However, this would require a large set of data on public cameras to train on, but this type of data hardly exists in a usable form. Current deep learning approaches to vehicle accidents typically use first-person cameras, which are not helpful for reducing response time as we do not have access to these cameras at all times. Also, public cameras such as closed-circuit television (CCTV) pick up a much larger amount of street activity than private cameras. Thus, we create a video dataset from live closed-circuit television, so we have access to the cameras at all times. We annotate the videos with metadata to help with future trend prediction as well as give further information for each video, as they are unlabeled. We create an unsupervised learning model to train on this video dataset, and visualize latent space representations of this data in order to cluster different types of street activity and pinpoint vehicle accidents. 1","PeriodicalId":430367,"journal":{"name":"Proceedings of the 6th International Conference on Digital Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Digital Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3529570.3529582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Emergency Medical Services’ response times to accidents are crucial to saving lives in vehicle accidents. Using deep learning to instantly detect accidents in public cameras and automatically alerting authorities could help this issue. However, this would require a large set of data on public cameras to train on, but this type of data hardly exists in a usable form. Current deep learning approaches to vehicle accidents typically use first-person cameras, which are not helpful for reducing response time as we do not have access to these cameras at all times. Also, public cameras such as closed-circuit television (CCTV) pick up a much larger amount of street activity than private cameras. Thus, we create a video dataset from live closed-circuit television, so we have access to the cameras at all times. We annotate the videos with metadata to help with future trend prediction as well as give further information for each video, as they are unlabeled. We create an unsupervised learning model to train on this video dataset, and visualize latent space representations of this data in order to cluster different types of street activity and pinpoint vehicle accidents. 1

查看原文本刊更多论文

减少事故响应时间的CCTV潜在表征

紧急医疗服务对事故的反应时间对于挽救交通事故中的生命至关重要。利用深度学习技术即时检测公共摄像头中的事故，并自动向当局发出警报，可能有助于解决这一问题。然而，这将需要公共摄像机上的大量数据来进行训练，但这类数据几乎不存在可用的形式。目前处理交通事故的深度学习方法通常使用第一人称摄像头，这对减少响应时间没有帮助，因为我们并不是随时都能使用这些摄像头。此外，闭路电视(CCTV)等公共摄像头比私人摄像头捕捉到更多的街头活动。因此，我们从现场闭路电视中创建了一个视频数据集，这样我们就可以随时访问摄像机。我们用元数据注释视频，以帮助预测未来的趋势，并为每个视频提供进一步的信息，因为它们是未标记的。我们创建了一个无监督学习模型来训练这个视频数据集，并将这些数据的潜在空间表示可视化，以便聚类不同类型的街道活动并查明车辆事故。1

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 6th International Conference on Digital Signal Processing

自引率

0.00%

发文量