基于对抗性训练的双编码器-解码器-编码器在监控视频中的无监督交通事故检测。

IF 1.2 4区综合性期刊 Q3 MULTIDISCIPLINARY SCIENCES

Jove-Journal of Visualized Experiments Pub Date : 2025-09-05 DOI:10.3791/68731

Sneha Kandacharam, B Rajathilagam, Shriram K Vasudevan

{"title":"基于对抗性训练的双编码器-解码器-编码器在监控视频中的无监督交通事故检测。","authors":"Sneha Kandacharam, B Rajathilagam, Shriram K Vasudevan","doi":"10.3791/68731","DOIUrl":null,"url":null,"abstract":"To enhance road safety and improve emergency response, traffic incidents should be detected in real-world surveillance footage as quickly as possible. Existing systems largely depend on manual monitoring, which is time-consuming and prone to error. Automated accident detection remains challenging due to the substantial class imbalance: normal driving situations are overrepresented, whereas accidents are rare and diverse. In such cases, traditional computer vision systems often cannot reliably differentiate between normal and abnormal events. This study addresses the problem by developing a deep learning architecture based on a dual encoder-decoder-encoder (EDE) framework. The model uses two shared encoder-decoder pipelines to map image distributions to specified latent distributions in both directions. This framework enables the system to model common traffic behavior patterns and become more sensitive to changes that may indicate dangerous or unusual events. A two-phase training technique is proposed to further improve anomaly detection. In the first phase, the model learns to reconstruct images of normal driving, using reconstruction loss to characterize normal behavior. In the second phase, a generative adversarial mechanism is introduced: reconstructed latent vectors from one EDE are passed to the other, generating synthetic images and latent spaces. This process amplifies differences between real and synthetic outputs, making the system more responsive to subtle signs of potential anomalies. The dual-EDE architecture and adversarial training methodology represent a substantial advance over current methods by modeling both normal and pathological behavior. Experimental results on real-world traffic surveillance datasets demonstrate that the proposed method significantly improves the detection of accidents and unsafe driving behaviors, both in terms of accuracy and robustness.","PeriodicalId":48787,"journal":{"name":"Jove-Journal of Visualized Experiments","volume":" 223","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual Encoder-Decoder-Encoder with Adversarial Training for Unsupervised Traffic Accident Detection in Surveillance Videos.\",\"authors\":\"Sneha Kandacharam, B Rajathilagam, Shriram K Vasudevan\",\"doi\":\"10.3791/68731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To enhance road safety and improve emergency response, traffic incidents should be detected in real-world surveillance footage as quickly as possible. Existing systems largely depend on manual monitoring, which is time-consuming and prone to error. Automated accident detection remains challenging due to the substantial class imbalance: normal driving situations are overrepresented, whereas accidents are rare and diverse. In such cases, traditional computer vision systems often cannot reliably differentiate between normal and abnormal events. This study addresses the problem by developing a deep learning architecture based on a dual encoder-decoder-encoder (EDE) framework. The model uses two shared encoder-decoder pipelines to map image distributions to specified latent distributions in both directions. This framework enables the system to model common traffic behavior patterns and become more sensitive to changes that may indicate dangerous or unusual events. A two-phase training technique is proposed to further improve anomaly detection. In the first phase, the model learns to reconstruct images of normal driving, using reconstruction loss to characterize normal behavior. In the second phase, a generative adversarial mechanism is introduced: reconstructed latent vectors from one EDE are passed to the other, generating synthetic images and latent spaces. This process amplifies differences between real and synthetic outputs, making the system more responsive to subtle signs of potential anomalies. The dual-EDE architecture and adversarial training methodology represent a substantial advance over current methods by modeling both normal and pathological behavior. Experimental results on real-world traffic surveillance datasets demonstrate that the proposed method significantly improves the detection of accidents and unsafe driving behaviors, both in terms of accuracy and robustness.\",\"PeriodicalId\":48787,\"journal\":{\"name\":\"Jove-Journal of Visualized Experiments\",\"volume\":\" 223\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2025-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jove-Journal of Visualized Experiments\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.3791/68731\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jove-Journal of Visualized Experiments","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.3791/68731","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

为了加强道路安全和改善应急反应，应尽快在现实世界的监控录像中发现交通事故。现有的系统很大程度上依赖于人工监控，这既耗时又容易出错。由于严重的类别不平衡，自动事故检测仍然具有挑战性：正常驾驶情况被过度代表，而事故则罕见且多样化。在这种情况下，传统的计算机视觉系统往往不能可靠地区分正常和异常事件。本研究通过开发基于双编码器-解码器-编码器（EDE）框架的深度学习架构来解决这个问题。该模型使用两个共享的编码器-解码器管道在两个方向上将图像分布映射到指定的潜在分布。该框架使系统能够对常见的交通行为模式进行建模，并对可能指示危险或异常事件的变化变得更加敏感。为了进一步改进异常检测，提出了一种两阶段训练技术。在第一阶段，模型学习重建正常驾驶的图像，使用重建损失来表征正常行为。在第二阶段，引入了生成对抗机制：将一个EDE的重建潜在向量传递到另一个EDE，生成合成图像和潜在空间。这个过程放大了真实输出和合成输出之间的差异，使系统对潜在异常的细微迹象更敏感。双ede架构和对抗性训练方法通过对正常和病理行为进行建模，代表了当前方法的实质性进步。在真实交通监控数据集上的实验结果表明，该方法在检测事故和不安全驾驶行为的准确性和鲁棒性方面都有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Dual Encoder-Decoder-Encoder with Adversarial Training for Unsupervised Traffic Accident Detection in Surveillance Videos.

To enhance road safety and improve emergency response, traffic incidents should be detected in real-world surveillance footage as quickly as possible. Existing systems largely depend on manual monitoring, which is time-consuming and prone to error. Automated accident detection remains challenging due to the substantial class imbalance: normal driving situations are overrepresented, whereas accidents are rare and diverse. In such cases, traditional computer vision systems often cannot reliably differentiate between normal and abnormal events. This study addresses the problem by developing a deep learning architecture based on a dual encoder-decoder-encoder (EDE) framework. The model uses two shared encoder-decoder pipelines to map image distributions to specified latent distributions in both directions. This framework enables the system to model common traffic behavior patterns and become more sensitive to changes that may indicate dangerous or unusual events. A two-phase training technique is proposed to further improve anomaly detection. In the first phase, the model learns to reconstruct images of normal driving, using reconstruction loss to characterize normal behavior. In the second phase, a generative adversarial mechanism is introduced: reconstructed latent vectors from one EDE are passed to the other, generating synthetic images and latent spaces. This process amplifies differences between real and synthetic outputs, making the system more responsive to subtle signs of potential anomalies. The dual-EDE architecture and adversarial training methodology represent a substantial advance over current methods by modeling both normal and pathological behavior. Experimental results on real-world traffic surveillance datasets demonstrate that the proposed method significantly improves the detection of accidents and unsafe driving behaviors, both in terms of accuracy and robustness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Jove-Journal of Visualized Experiments MULTIDISCIPLINARY SCIENCES-

CiteScore

2.10

自引率

0.00%

发文量

992

期刊介绍： JoVE, the Journal of Visualized Experiments, is the world''s first peer reviewed scientific video journal. Established in 2006, JoVE is devoted to publishing scientific research in a visual format to help researchers overcome two of the biggest challenges facing the scientific research community today; poor reproducibility and the time and labor intensive nature of learning new experimental techniques.