Vu Lam, Sang Phan Le, Tien Do, T. Ngo, Duy-Dinh Le, D. Duong
Title: Computational optimization for violent scenes detection
Published in: 2016 International Conference on Computer, Control, Informatics and its Applications (IC3INA), October 2016
DOI: https://doi.org/10.1109/IC3INA.2016.7863039
Citations: 4
Abstract
Violent scenes detection (VSD) can be considered a specific instance of multimedia event detection. One popular approach to this problem is to employ multiple modalities for representation; combining complementary modalities has been shown to yield remarkable improvements in accuracy. However, such an approach also incurs a high computational cost, since it must process features extracted globally and locally from static frames, video sequences, and audio streams, as well as deep visual features. In this paper, we address the problem of modality selection (i.e., feature selection) when computing resources (both CPU and GPU) are limited. We evaluated possible combinations of features under different specifications of the computing resource. The evaluation results can be used to choose the set of features that achieves the highest accuracy within a pre-selected resource budget. We conducted experiments on the benchmark dataset MediaEval VSD 2014 (60 hours in total).
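The selection problem the abstract describes can be sketched as a small budgeted subset search. The sketch below is illustrative only: the feature names, per-feature costs, standalone accuracies, and the late-fusion proxy are all hypothetical assumptions, not values or methods from the paper, which measures accuracy and cost empirically.

```python
from itertools import combinations

# Hypothetical per-feature compute costs and standalone accuracies.
# These numbers are illustrative placeholders, NOT results from the paper.
FEATURES = {
    "static_frame": {"cost": 10, "acc": 0.55},
    "motion":       {"cost": 25, "acc": 0.58},
    "audio":        {"cost": 5,  "acc": 0.50},
    "deep_visual":  {"cost": 40, "acc": 0.65},
}

def fused_score(subset):
    """Crude late-fusion proxy: treat each modality's errors as
    independent, so a scene is missed only if every modality misses it."""
    p_miss = 1.0
    for name in subset:
        p_miss *= 1.0 - FEATURES[name]["acc"]
    return 1.0 - p_miss

def best_combination(budget):
    """Exhaustively pick the feature subset with the highest fused
    score whose total cost fits within the compute budget."""
    best, best_score = (), 0.0
    names = list(FEATURES)
    for r in range(1, len(names) + 1):
        for subset in combinations(names, r):
            cost = sum(FEATURES[n]["cost"] for n in subset)
            score = fused_score(subset)
            if cost <= budget and score > best_score:
                best, best_score = subset, score
    return best, best_score
```

With only four modalities the exhaustive search over all 2^4 - 1 subsets is trivial; for larger feature pools the same budgeted-selection idea would call for a knapsack-style or greedy approximation instead.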