{"title":"Learning a multi-cluster memory prototype for unsupervised video anomaly detection","authors":"","doi":"10.1016/j.ins.2024.121385","DOIUrl":null,"url":null,"abstract":"<div><p>In recent years, there has been rapid development in video anomaly detection (VAD). The previous methods ignored the differences between normal videos and only emphasized learning the commonalities of normal videos. In order to improve the performance of anomaly detection, we delve into the spatial distribution of normal video features and utilize their differences for clustering, leading to <em>more minor</em> reconstruction errors for normal videos and <em>more significant</em> reconstruction errors for abnormal videos. To achieve this goal, we introduce a Multi-Cluster Memory Prototype framework (MCMP) for VAD, which explores the coarse-grained and fine-grained information from video snippet features simultaneously to learn a memory prototype, thereby significantly improving the ability to discriminate abnormal events in complex scenes. First, a video feature clustering method that employs contrastive learning is introduced to group samples sharing similar fine-grained features. Second, the memory mechanism is <em>used</em> to capture the feature distribution of normal samples. Lastly, <em>the</em> Gaussian filter feature transformation method <em>is introduced to make</em> normal and abnormal features more distinguishable. The frame level AUC of MCMP on ShanghaiTech and UCF-Crime benchmark datasets has increased by 1.26% and 0.45% compared to state-of-the-art methods. <em>Our code is publicly available at</em> <span><span><em>https://github.com/WuIkun5658/MCMP</em></span><svg><path></path></svg></span><em>.</em></p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524012994","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, there has been rapid development in video anomaly detection (VAD). The previous methods ignored the differences between normal videos and only emphasized learning the commonalities of normal videos. In order to improve the performance of anomaly detection, we delve into the spatial distribution of normal video features and utilize their differences for clustering, leading to more minor reconstruction errors for normal videos and more significant reconstruction errors for abnormal videos. To achieve this goal, we introduce a Multi-Cluster Memory Prototype framework (MCMP) for VAD, which explores the coarse-grained and fine-grained information from video snippet features simultaneously to learn a memory prototype, thereby significantly improving the ability to discriminate abnormal events in complex scenes. First, a video feature clustering method that employs contrastive learning is introduced to group samples sharing similar fine-grained features. Second, the memory mechanism is used to capture the feature distribution of normal samples. Lastly, the Gaussian filter feature transformation method is introduced to make normal and abnormal features more distinguishable. The frame level AUC of MCMP on ShanghaiTech and UCF-Crime benchmark datasets has increased by 1.26% and 0.45% compared to state-of-the-art methods. Our code is publicly available athttps://github.com/WuIkun5658/MCMP.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.