Authors: Wenmin Dong, Lifeng Zhang, Wenjuan Shi, Xiangwei Zheng, Yuang Zhang
Journal: Alexandria Engineering Journal, volume 124, pages 513-525
Publication date: 2025-04-11
DOI: 10.1016/j.aej.2025.03.106
Impact factor: 6.2 (JCR Q1, Engineering, Multidisciplinary)
URL: https://www.sciencedirect.com/science/article/pii/S1110016825004144
A novel reconstruction-based video anomaly detection with idempotent generative network
Video anomaly detection (VAD) is vital to intelligent security systems for public safety. Reconstruction-based VAD has received increasing research attention but faces several challenges: using reconstruction error as the criterion can miss anomalies, suppressing anomalous data causes information loss, and existing methods struggle to detect unseen anomalies. We propose a novel reconstruction-based video anomaly detection method with an idempotent generative network (RVADIGN), which is composed of a novel reconstruction module, PSVAE, and an idempotent loss term (IGN). Specifically, video frames are reconstructed within PSVAE. During this process, skip connections between the encoder and decoder enhance contextual understanding. A Finite Scalar Quantization (FSQ) layer discretizes the encoder's output, preserving key discriminative features. Meanwhile, the Pyramid Deformation Module (PDM), an integral part of PSVAE, computes offset maps of the original video frames to supplement anomaly detection. Alongside PSVAE, idempotence is introduced as a regularization term that projects anomalous information back onto the estimated manifold of the target distribution, improving the adaptability and stability of the reconstruction method across different videos. Extensive experimental results demonstrate that our method outperforms other state-of-the-art VAD methods, achieving 99.03%, 92.40%, and 77.20% AUC on UCSD Ped2, CUHK Avenue, and ShanghaiTech, respectively.
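The abstract names two mechanisms that are easier to picture with a toy example: scalar quantization of encoder outputs and an idempotence penalty. The following is a minimal sketch under stated assumptions, not the authors' implementation. The function names (`fsq_quantize`, `idempotence_residual`, `clamp_unit`, `halve`) are hypothetical, the quantizer handles only odd level counts, and the residual is a plain mean squared difference between f(f(x)) and f(x), which may differ from the loss actually used in RVADIGN.

```python
import math


def fsq_quantize(z, levels):
    """Simplified finite scalar quantization (assumption: odd level counts only).

    Each channel is bounded with tanh and then rounded to the nearest
    integer, so channel i can take exactly levels[i] distinct values.
    """
    codes = []
    for value, n_levels in zip(z, levels):
        if n_levels % 2 == 0:
            raise ValueError("this sketch only handles odd level counts")
        half = (n_levels - 1) // 2
        bounded = half * math.tanh(value)  # lies strictly inside (-half, half)
        codes.append(int(round(bounded)))
    return codes


def idempotence_residual(f, x):
    """Mean squared difference between f(f(x)) and f(x).

    The residual is zero when applying f a second time changes nothing,
    which is the property an idempotence term encourages.
    """
    once = f(x)
    twice = f(once)
    return sum((a - b) ** 2 for a, b in zip(twice, once)) / len(once)


def clamp_unit(x):
    """Toy idempotent map: clip every value to the interval [0, 1]."""
    return [min(1.0, max(0.0, v)) for v in x]


def halve(x):
    """Toy non-idempotent map: scale every value by 0.5."""
    return [0.5 * v for v in x]


if __name__ == "__main__":
    print(fsq_quantize([0.0, 10.0, -10.0], [5, 5, 5]))
    print(idempotence_residual(clamp_unit, [-0.5, 0.3, 1.7]))
    print(idempotence_residual(halve, [2.0, 4.0]))
```

In this toy setting, clipping behaves like a projection onto a fixed set, so its residual is zero, while halving keeps moving its input and yields a positive residual. A trained network would use differentiable tensor operations and a gradient-friendly rounding step in place of these list-based helpers.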
About the journal:
Alexandria Engineering Journal is an international journal devoted to publishing high-quality papers in the field of engineering and applied science. It is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). Papers published in the journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering