Setegn Asnakew Kasegn, Ronald Waweru Mwangi, Michael Kimwele, Surafel Lemma Abebe
{"title":"基于GRU模型的深度卷积生成对抗网络增强监控视频异常行为检测","authors":"Setegn Asnakew Kasegn, Ronald Waweru Mwangi, Michael Kimwele, Surafel Lemma Abebe","doi":"10.1002/eng2.70177","DOIUrl":null,"url":null,"abstract":"<p>Automatic detection of unusual behavior in videos is a challenging task. This challenge comes from its complexity and the wide range of applications it covers. Several deep learning approaches have been proposed to address this challenge. This includes recent generative methods that use deep convolutional generative adversarial networks (DCGAN). The DCGAN model has gained high research attention recently due to its performs well in extracting spatial features and solve class imbalance issue to detect abnormalities. However, a DCGAN is unstable during training and has low performance owing to its inability to capture the long-term temporal dependency between sequences of video frames. In this study, we propose a novel gated recurrent unit (GRU)-based DCGAN architecture to improve the training stability and performance of a DCGAN model for abnormal video behavior detection. The proposed model was trained using UCSD Ped1, UCSD Ped2, CUHK Avenue, and ShanghaiTech benchmark anomaly dataset. Compared to the DCGAN model, the proposed GRU-based DCGAN model improved the detection accuracy and area under the curve (AUC) by an average of 19.91% and 8.57%, respectively. Compared with the 3D-DCGAN model, the GRU-based DCGAN model improved the detection accuracy and AUC by an average of 7.67% and 3.73%, respectively. Furthermore, the GRU-based DCGAN model stabilized from epoch 10 and converged at epoch 38, whereas the other models remained unstable and did not converge at epoch 50. The findings highlight that the combination of GRU to enhance temporal modeling within a DCGAN framework is a logical extension to improve training stability and performance for abnormal video behavior detection.</p>","PeriodicalId":72922,"journal":{"name":"Engineering reports : open access","volume":"7 5","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70177","citationCount":"0","resultStr":"{\"title\":\"Enhancing Surveillance Video Abnormal Behavior Detection Using Deep Convolutional Generative Adversarial Network With GRU Model\",\"authors\":\"Setegn Asnakew Kasegn, Ronald Waweru Mwangi, Michael Kimwele, Surafel Lemma Abebe\",\"doi\":\"10.1002/eng2.70177\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Automatic detection of unusual behavior in videos is a challenging task. This challenge comes from its complexity and the wide range of applications it covers. Several deep learning approaches have been proposed to address this challenge. This includes recent generative methods that use deep convolutional generative adversarial networks (DCGAN). The DCGAN model has gained high research attention recently due to its performs well in extracting spatial features and solve class imbalance issue to detect abnormalities. However, a DCGAN is unstable during training and has low performance owing to its inability to capture the long-term temporal dependency between sequences of video frames. In this study, we propose a novel gated recurrent unit (GRU)-based DCGAN architecture to improve the training stability and performance of a DCGAN model for abnormal video behavior detection. The proposed model was trained using UCSD Ped1, UCSD Ped2, CUHK Avenue, and ShanghaiTech benchmark anomaly dataset. Compared to the DCGAN model, the proposed GRU-based DCGAN model improved the detection accuracy and area under the curve (AUC) by an average of 19.91% and 8.57%, respectively. Compared with the 3D-DCGAN model, the GRU-based DCGAN model improved the detection accuracy and AUC by an average of 7.67% and 3.73%, respectively. Furthermore, the GRU-based DCGAN model stabilized from epoch 10 and converged at epoch 38, whereas the other models remained unstable and did not converge at epoch 50. The findings highlight that the combination of GRU to enhance temporal modeling within a DCGAN framework is a logical extension to improve training stability and performance for abnormal video behavior detection.</p>\",\"PeriodicalId\":72922,\"journal\":{\"name\":\"Engineering reports : open access\",\"volume\":\"7 5\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70177\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering reports : open access\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70177\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering reports : open access","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70177","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Enhancing Surveillance Video Abnormal Behavior Detection Using Deep Convolutional Generative Adversarial Network With GRU Model
Automatic detection of unusual behavior in videos is a challenging task. This challenge comes from its complexity and the wide range of applications it covers. Several deep learning approaches have been proposed to address this challenge. This includes recent generative methods that use deep convolutional generative adversarial networks (DCGAN). The DCGAN model has gained high research attention recently due to its performs well in extracting spatial features and solve class imbalance issue to detect abnormalities. However, a DCGAN is unstable during training and has low performance owing to its inability to capture the long-term temporal dependency between sequences of video frames. In this study, we propose a novel gated recurrent unit (GRU)-based DCGAN architecture to improve the training stability and performance of a DCGAN model for abnormal video behavior detection. The proposed model was trained using UCSD Ped1, UCSD Ped2, CUHK Avenue, and ShanghaiTech benchmark anomaly dataset. Compared to the DCGAN model, the proposed GRU-based DCGAN model improved the detection accuracy and area under the curve (AUC) by an average of 19.91% and 8.57%, respectively. Compared with the 3D-DCGAN model, the GRU-based DCGAN model improved the detection accuracy and AUC by an average of 7.67% and 3.73%, respectively. Furthermore, the GRU-based DCGAN model stabilized from epoch 10 and converged at epoch 38, whereas the other models remained unstable and did not converge at epoch 50. The findings highlight that the combination of GRU to enhance temporal modeling within a DCGAN framework is a logical extension to improve training stability and performance for abnormal video behavior detection.