{"title":"基于时空自编码器的监控视频异常事件检测","authors":"","doi":"10.30534/ijatcse/2022/011132022","DOIUrl":null,"url":null,"abstract":"Surveillance cameras are proliferating, and millions of devices are being used to capture endless footage of surveillance videos. With the advancements in computer vision and deep learning, we can now contemplate these videos to detect anomalies. In this paper, we propose to identify anomalies by using Spatiotemporal autoencoders. The autoencoders based on 3-D convolutional networks will identify anomalies in surveillance footage based on spatiotemporal features. Our architecture consists of two components, an encoder for spatial feature extraction, and a decoder for the reconstruction of frames. Then, abnormal events are identified based on reconstruction loss. We used the Avenue dataset and UCSD dataset for training and evaluation.","PeriodicalId":129636,"journal":{"name":"International Journal of Advanced Trends in Computer Science and Engineering","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Anomalous Event Detection in Surveillance Videos using Spatio-temporal Autoencoders\",\"authors\":\"\",\"doi\":\"10.30534/ijatcse/2022/011132022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Surveillance cameras are proliferating, and millions of devices are being used to capture endless footage of surveillance videos. With the advancements in computer vision and deep learning, we can now contemplate these videos to detect anomalies. In this paper, we propose to identify anomalies by using Spatiotemporal autoencoders. The autoencoders based on 3-D convolutional networks will identify anomalies in surveillance footage based on spatiotemporal features. Our architecture consists of two components, an encoder for spatial feature extraction, and a decoder for the reconstruction of frames. Then, abnormal events are identified based on reconstruction loss. We used the Avenue dataset and UCSD dataset for training and evaluation.\",\"PeriodicalId\":129636,\"journal\":{\"name\":\"International Journal of Advanced Trends in Computer Science and Engineering\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Advanced Trends in Computer Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30534/ijatcse/2022/011132022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advanced Trends in Computer Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30534/ijatcse/2022/011132022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Anomalous Event Detection in Surveillance Videos using Spatio-temporal Autoencoders
Surveillance cameras are proliferating, and millions of devices are being used to capture endless footage of surveillance videos. With the advancements in computer vision and deep learning, we can now contemplate these videos to detect anomalies. In this paper, we propose to identify anomalies by using Spatiotemporal autoencoders. The autoencoders based on 3-D convolutional networks will identify anomalies in surveillance footage based on spatiotemporal features. Our architecture consists of two components, an encoder for spatial feature extraction, and a decoder for the reconstruction of frames. Then, abnormal events are identified based on reconstruction loss. We used the Avenue dataset and UCSD dataset for training and evaluation.