并行两类3D-CNN视频分类器

2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) Pub Date : 2017-11-01 DOI:10.1109/ISPACS.2017.8265636

Jing Li

{"title":"并行两类3D-CNN视频分类器","authors":"Jing Li","doi":"10.1109/ISPACS.2017.8265636","DOIUrl":null,"url":null,"abstract":"The required amount of computation and training data for training 3D-CNN, especially for complex classification tasks with videos, hinders the wide application of 3D-CNN. In this paper, inspired by the exclusion method in human's judgement, a parallel 3D-CNN architecture is proposed to decompose the multi-class classification task using one 3D-CNN into the combination of multiple two-class classification tasks. 3D-CNN is used for each of the two-class classification tasks, and the difficulty and the data requirement on training such a 3D-CNN is reduced greatly comparing with the 3D-CNN for multi-class classification. In addition, the combination of two-class classifiers provides the ability of recognizing unknown class to the proposed 3D-CNN model. The feasibility of this proposed 3D-CNN model is verified via its application on video copy detection on the CC_WEB_VIDEO dataset, which shows the potentiality of the proposed parallel two-class 3D-CNN model in video classification.","PeriodicalId":166414,"journal":{"name":"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Parallel two-class 3D-CNN classifiers for video classification\",\"authors\":\"Jing Li\",\"doi\":\"10.1109/ISPACS.2017.8265636\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The required amount of computation and training data for training 3D-CNN, especially for complex classification tasks with videos, hinders the wide application of 3D-CNN. In this paper, inspired by the exclusion method in human's judgement, a parallel 3D-CNN architecture is proposed to decompose the multi-class classification task using one 3D-CNN into the combination of multiple two-class classification tasks. 3D-CNN is used for each of the two-class classification tasks, and the difficulty and the data requirement on training such a 3D-CNN is reduced greatly comparing with the 3D-CNN for multi-class classification. In addition, the combination of two-class classifiers provides the ability of recognizing unknown class to the proposed 3D-CNN model. The feasibility of this proposed 3D-CNN model is verified via its application on video copy detection on the CC_WEB_VIDEO dataset, which shows the potentiality of the proposed parallel two-class 3D-CNN model in video classification.\",\"PeriodicalId\":166414,\"journal\":{\"name\":\"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPACS.2017.8265636\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS.2017.8265636","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

训练3D-CNN所需的计算量和训练数据量，特别是复杂的视频分类任务，阻碍了3D-CNN的广泛应用。本文受人类判断中的排除方法的启发，提出了一种并行3D-CNN架构，将使用一个3D-CNN的多类分类任务分解为多个两类分类任务的组合。两类分类任务均使用3D-CNN，与多类分类的3D-CNN相比，训练3D-CNN的难度和对数据的要求大大降低。此外，两类分类器的组合为所提出的3D-CNN模型提供了识别未知类的能力。通过对CC_WEB_VIDEO数据集的视频拷贝检测，验证了所提3D-CNN模型的可行性，显示了所提并行两类3D-CNN模型在视频分类中的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Parallel two-class 3D-CNN classifiers for video classification

The required amount of computation and training data for training 3D-CNN, especially for complex classification tasks with videos, hinders the wide application of 3D-CNN. In this paper, inspired by the exclusion method in human's judgement, a parallel 3D-CNN architecture is proposed to decompose the multi-class classification task using one 3D-CNN into the combination of multiple two-class classification tasks. 3D-CNN is used for each of the two-class classification tasks, and the difficulty and the data requirement on training such a 3D-CNN is reduced greatly comparing with the 3D-CNN for multi-class classification. In addition, the combination of two-class classifiers provides the ability of recognizing unknown class to the proposed 3D-CNN model. The feasibility of this proposed 3D-CNN model is verified via its application on video copy detection on the CC_WEB_VIDEO dataset, which shows the potentiality of the proposed parallel two-class 3D-CNN model in video classification.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)

自引率

0.00%

发文量