Parallel two-class 3D-CNN classifiers for video classification

2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS). Pub Date: 2017-11-01. DOI: 10.1109/ISPACS.2017.8265636

Jing Li

Citations: 11

Abstract

The computation and training data required to train a 3D-CNN, especially for complex video classification tasks, hinder the wide application of 3D-CNNs. In this paper, inspired by the exclusion method used in human judgement, a parallel 3D-CNN architecture is proposed that decomposes a multi-class classification task handled by a single 3D-CNN into a combination of multiple two-class classification tasks. A 3D-CNN is used for each two-class classification task, and the difficulty and data requirements of training such a 3D-CNN are greatly reduced compared with a 3D-CNN for multi-class classification. In addition, combining two-class classifiers gives the proposed 3D-CNN model the ability to recognize unknown classes. The feasibility of the proposed 3D-CNN model is verified by applying it to video copy detection on the CC_WEB_VIDEO dataset, which shows the potential of the proposed parallel two-class 3D-CNN model for video classification.
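The abstract does not state how the outputs of the two-class classifiers are combined, so the following is only a minimal sketch of one plausible rule, not the paper's method. It assumes each two-class 3D-CNN is a one-vs-rest classifier that emits a score in [0, 1] for "the clip belongs to my class", and that a clip rejected by every classifier is treated as an unknown class. The function name `combine_two_class_scores` and the `threshold` parameter are hypothetical.

```python
from typing import Optional, Sequence


def combine_two_class_scores(
    scores: Sequence[float], threshold: float = 0.5
) -> Optional[int]:
    """Combine per-class two-class scores into one decision.

    scores[i] is the (assumed) output of the i-th two-class classifier:
    its confidence that the clip belongs to class i.
    Returns the index of the accepted class with the highest score,
    or None when no classifier accepts the clip (unknown class).
    """
    # Keep only the classifiers that accept the clip.
    accepted = [(score, index) for index, score in enumerate(scores)
                if score >= threshold]
    if not accepted:
        # Every two-class classifier rejected the clip: unknown class.
        return None
    # Among accepting classifiers, pick the most confident one
    # (the first one wins on an exact tie).
    best_score, best_index = max(accepted, key=lambda pair: pair[0])
    return best_index


if __name__ == "__main__":
    print(combine_two_class_scores([0.1, 0.9, 0.3]))  # class 1 accepted
    print(combine_two_class_scores([0.1, 0.2, 0.3]))  # all reject
```

Under these assumptions, adding a new class only requires training one more two-class classifier and appending its score to the list; the existing classifiers are left untouched.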



