{"title":"Real Life Violence Detection in Surveillance Videos using Spatiotemporal Features","authors":"Anugrah Srivastava, Tapas Badal, Rishav Singh","doi":"10.1145/3474124.3474161","DOIUrl":null,"url":null,"abstract":"Automatic violence detection has remarkable importance from practical and academic point of view. Generally speaking, detecting violence in a crowded locality, via computational approaches, is challenging owing to rapid movements, overlapping characteristics, obstructed scenery, and scattered backgrounds. Fortunately, Deep Learning techniques can detect anomalies to a certain extent. Furthermore, their popularity, as a paradigm to detect violence, is growing at a tremendous pace. The aim of such approaches is to develop a method that recognizes violence and evokes an alarm so that immediate assistance can be provided. This paper is aong the same line of thought. This article presents a Convolution Neural Network (CNN) and Recurrent Neural Network (RNN) based approach for violence detection by learning the detailed features in videos. The spatio-temporal features extracted from the combination of InceptonV3 pre-trained model and late LSTM architecture yielded a 97.5% accuracy thereby, proving its superiority over existing methods in literature.","PeriodicalId":144611,"journal":{"name":"2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Thirteenth International Conference on Contemporary Computing (IC3-2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3474124.3474161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Automatic violence detection has remarkable importance from practical and academic point of view. Generally speaking, detecting violence in a crowded locality, via computational approaches, is challenging owing to rapid movements, overlapping characteristics, obstructed scenery, and scattered backgrounds. Fortunately, Deep Learning techniques can detect anomalies to a certain extent. Furthermore, their popularity, as a paradigm to detect violence, is growing at a tremendous pace. The aim of such approaches is to develop a method that recognizes violence and evokes an alarm so that immediate assistance can be provided. This paper is aong the same line of thought. This article presents a Convolution Neural Network (CNN) and Recurrent Neural Network (RNN) based approach for violence detection by learning the detailed features in videos. The spatio-temporal features extracted from the combination of InceptonV3 pre-trained model and late LSTM architecture yielded a 97.5% accuracy thereby, proving its superiority over existing methods in literature.