Machine Cognition of Violence in Videos Using Novel Outlier-Resistant VLAD

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA) Pub Date : 2018-12-01 DOI:10.1109/ICMLA.2018.00161

Tonmoay Deb, Aziz Arman, A. Firoze

{"title":"Machine Cognition of Violence in Videos Using Novel Outlier-Resistant VLAD","authors":"Tonmoay Deb, Aziz Arman, A. Firoze","doi":"10.1109/ICMLA.2018.00161","DOIUrl":null,"url":null,"abstract":"Understanding highly accurate and real-time violent actions from surveillance videos is a demanding challenge. Our primary contribution of this work is divided into two parts. Firstly, we propose a computationally efficient Bag-of-Words (BoW) pipeline along with improved accuracy of violent videos classification. The novel pipeline's feature extraction stage is implemented with densely sampled Histogram of Oriented Gradients (HOG) and Histogram of Optical Flow (HOF) descriptors rather than Space-Time Interest Point (STIP) based extraction. Secondly, in encoding stage, we propose Outlier-Resistant VLAD (OR-VLAD), a novel higher order statistics-based feature encoding, to improve the original VLAD performance. In classification, efficient Linear Support Vector Machine (LSVM) is employed. The performance of the proposed pipeline is evaluated with three popular violent action datasets. On comparison, our pipeline achieved near perfect classification accuracies over three standard video datasets, outperforming most state-of-the-art approaches and having very low number of vocabulary size compared to previous BoW Models.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"24 1","pages":"989-994"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2018.00161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

Understanding highly accurate and real-time violent actions from surveillance videos is a demanding challenge. Our primary contribution of this work is divided into two parts. Firstly, we propose a computationally efficient Bag-of-Words (BoW) pipeline along with improved accuracy of violent videos classification. The novel pipeline's feature extraction stage is implemented with densely sampled Histogram of Oriented Gradients (HOG) and Histogram of Optical Flow (HOF) descriptors rather than Space-Time Interest Point (STIP) based extraction. Secondly, in encoding stage, we propose Outlier-Resistant VLAD (OR-VLAD), a novel higher order statistics-based feature encoding, to improve the original VLAD performance. In classification, efficient Linear Support Vector Machine (LSVM) is employed. The performance of the proposed pipeline is evaluated with three popular violent action datasets. On comparison, our pipeline achieved near perfect classification accuracies over three standard video datasets, outperforming most state-of-the-art approaches and having very low number of vocabulary size compared to previous BoW Models.

查看原文本刊更多论文

基于新型抗离群值VLAD的视频暴力机器认知

从监控视频中了解高度准确和实时的暴力行为是一项艰巨的挑战。我们对这项工作的主要贡献分为两部分。首先，我们提出了一个计算效率高的词袋(BoW)管道，并提高了暴力视频分类的准确性。新型管道的特征提取阶段采用密集采样的定向梯度直方图和光流直方图描述符来实现，而不是基于时空兴趣点的提取。其次，在编码阶段，我们提出了一种新的基于高阶统计量的特征编码，即抗离群值VLAD (OR-VLAD)，以提高原有VLAD的性能。在分类方面，采用了高效的线性支持向量机(LSVM)。用三个流行的暴力行为数据集评估了所提出的管道的性能。相比之下，我们的管道在三个标准视频数据集上实现了近乎完美的分类精度，优于大多数最先进的方法，并且与以前的BoW模型相比，词汇量非常小。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

自引率

0.00%

发文量