Reducing structure of deep Convolutional Neural Networks for Huawei Accurate and Fast Mobile Video Annotation Challenge

2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2014-07-14 DOI:10.1109/ICMEW.2014.6890608

Yunlong Bian, Yuan Dong, Hongliang Bai, Bo Liu, Kai Wang, Yinan Liu

引用次数: 4

Abstract

Big structure of deep Convolutional Neural Networks (CNN) has staggeringly impressive improvement in the Imagenet Large Scale Visual Recognition Challenge (ILSVRC) 2012 and 2013. But only tens of classes are required to be trained in the most real applications. After the deep CNNs are trained in the ILSVRC dataset, efficiently transferring the big and deep structure to a new dataset is a tough problem. In this paper, three algorithms are proposed to implement the transfer, namely fine-tunning of the big structure, normalized Google distance and Wordnet lexical semantic similarity. After experiments are conducted in the Huawei accurate and fast Mobile Video Annotation Challenge (MoVAC) dataset, the fine-tuning algorithm has achieved the best performance in the accuracy and training time.

查看原文本刊更多论文

面向华为移动视频准确快速标注挑战的深度卷积神经网络降维结构

大结构深度卷积神经网络(CNN)在2012年和2013年Imagenet大规模视觉识别挑战赛(ILSVRC)中取得了惊人的进步。但是，在大多数实际应用中，只需要几十个课程就可以进行培训。在ILSVRC数据集上训练深度cnn后，如何有效地将大而深的结构转移到新的数据集上是一个难题。本文提出了三种实现迁移的算法，即大结构微调、归一化Google距离和Wordnet词汇语义相似度。在华为精准快速移动视频标注挑战赛(MoVAC)数据集上进行了实验，该微调算法在准确率和训练时间上都取得了最好的成绩。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

自引率

0.00%

发文量