Sneha Kandacharam, B Rajathilagam, Shriram K Vasudevan
{"title":"基于对抗性训练的双编码器-解码器-编码器在监控视频中的无监督交通事故检测。","authors":"Sneha Kandacharam, B Rajathilagam, Shriram K Vasudevan","doi":"10.3791/68731","DOIUrl":null,"url":null,"abstract":"<p><p>To enhance road safety and improve emergency response, traffic incidents should be detected in real-world surveillance footage as quickly as possible. Existing systems largely depend on manual monitoring, which is time-consuming and prone to error. Automated accident detection remains challenging due to the substantial class imbalance: normal driving situations are overrepresented, whereas accidents are rare and diverse. In such cases, traditional computer vision systems often cannot reliably differentiate between normal and abnormal events. This study addresses the problem by developing a deep learning architecture based on a dual encoder-decoder-encoder (EDE) framework. The model uses two shared encoder-decoder pipelines to map image distributions to specified latent distributions in both directions. This framework enables the system to model common traffic behavior patterns and become more sensitive to changes that may indicate dangerous or unusual events. A two-phase training technique is proposed to further improve anomaly detection. In the first phase, the model learns to reconstruct images of normal driving, using reconstruction loss to characterize normal behavior. In the second phase, a generative adversarial mechanism is introduced: reconstructed latent vectors from one EDE are passed to the other, generating synthetic images and latent spaces. This process amplifies differences between real and synthetic outputs, making the system more responsive to subtle signs of potential anomalies. The dual-EDE architecture and adversarial training methodology represent a substantial advance over current methods by modeling both normal and pathological behavior. Experimental results on real-world traffic surveillance datasets demonstrate that the proposed method significantly improves the detection of accidents and unsafe driving behaviors, both in terms of accuracy and robustness.</p>","PeriodicalId":48787,"journal":{"name":"Jove-Journal of Visualized Experiments","volume":" 223","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual Encoder-Decoder-Encoder with Adversarial Training for Unsupervised Traffic Accident Detection in Surveillance Videos.\",\"authors\":\"Sneha Kandacharam, B Rajathilagam, Shriram K Vasudevan\",\"doi\":\"10.3791/68731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>To enhance road safety and improve emergency response, traffic incidents should be detected in real-world surveillance footage as quickly as possible. Existing systems largely depend on manual monitoring, which is time-consuming and prone to error. Automated accident detection remains challenging due to the substantial class imbalance: normal driving situations are overrepresented, whereas accidents are rare and diverse. In such cases, traditional computer vision systems often cannot reliably differentiate between normal and abnormal events. This study addresses the problem by developing a deep learning architecture based on a dual encoder-decoder-encoder (EDE) framework. The model uses two shared encoder-decoder pipelines to map image distributions to specified latent distributions in both directions. This framework enables the system to model common traffic behavior patterns and become more sensitive to changes that may indicate dangerous or unusual events. A two-phase training technique is proposed to further improve anomaly detection. In the first phase, the model learns to reconstruct images of normal driving, using reconstruction loss to characterize normal behavior. In the second phase, a generative adversarial mechanism is introduced: reconstructed latent vectors from one EDE are passed to the other, generating synthetic images and latent spaces. This process amplifies differences between real and synthetic outputs, making the system more responsive to subtle signs of potential anomalies. The dual-EDE architecture and adversarial training methodology represent a substantial advance over current methods by modeling both normal and pathological behavior. Experimental results on real-world traffic surveillance datasets demonstrate that the proposed method significantly improves the detection of accidents and unsafe driving behaviors, both in terms of accuracy and robustness.</p>\",\"PeriodicalId\":48787,\"journal\":{\"name\":\"Jove-Journal of Visualized Experiments\",\"volume\":\" 223\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2025-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jove-Journal of Visualized Experiments\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.3791/68731\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jove-Journal of Visualized Experiments","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.3791/68731","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Dual Encoder-Decoder-Encoder with Adversarial Training for Unsupervised Traffic Accident Detection in Surveillance Videos.
To enhance road safety and improve emergency response, traffic incidents should be detected in real-world surveillance footage as quickly as possible. Existing systems largely depend on manual monitoring, which is time-consuming and prone to error. Automated accident detection remains challenging due to the substantial class imbalance: normal driving situations are overrepresented, whereas accidents are rare and diverse. In such cases, traditional computer vision systems often cannot reliably differentiate between normal and abnormal events. This study addresses the problem by developing a deep learning architecture based on a dual encoder-decoder-encoder (EDE) framework. The model uses two shared encoder-decoder pipelines to map image distributions to specified latent distributions in both directions. This framework enables the system to model common traffic behavior patterns and become more sensitive to changes that may indicate dangerous or unusual events. A two-phase training technique is proposed to further improve anomaly detection. In the first phase, the model learns to reconstruct images of normal driving, using reconstruction loss to characterize normal behavior. In the second phase, a generative adversarial mechanism is introduced: reconstructed latent vectors from one EDE are passed to the other, generating synthetic images and latent spaces. This process amplifies differences between real and synthetic outputs, making the system more responsive to subtle signs of potential anomalies. The dual-EDE architecture and adversarial training methodology represent a substantial advance over current methods by modeling both normal and pathological behavior. Experimental results on real-world traffic surveillance datasets demonstrate that the proposed method significantly improves the detection of accidents and unsafe driving behaviors, both in terms of accuracy and robustness.
期刊介绍:
JoVE, the Journal of Visualized Experiments, is the world''s first peer reviewed scientific video journal. Established in 2006, JoVE is devoted to publishing scientific research in a visual format to help researchers overcome two of the biggest challenges facing the scientific research community today; poor reproducibility and the time and labor intensive nature of learning new experimental techniques.