Yunlong Bian, Yuan Dong, Hongliang Bai, Bo Liu, Kai Wang, Yinan Liu
{"title":"面向华为移动视频准确快速标注挑战的深度卷积神经网络降维结构","authors":"Yunlong Bian, Yuan Dong, Hongliang Bai, Bo Liu, Kai Wang, Yinan Liu","doi":"10.1109/ICMEW.2014.6890608","DOIUrl":null,"url":null,"abstract":"Big structure of deep Convolutional Neural Networks (CNN) has staggeringly impressive improvement in the Imagenet Large Scale Visual Recognition Challenge (ILSVRC) 2012 and 2013. But only tens of classes are required to be trained in the most real applications. After the deep CNNs are trained in the ILSVRC dataset, efficiently transferring the big and deep structure to a new dataset is a tough problem. In this paper, three algorithms are proposed to implement the transfer, namely fine-tunning of the big structure, normalized Google distance and Wordnet lexical semantic similarity. After experiments are conducted in the Huawei accurate and fast Mobile Video Annotation Challenge (MoVAC) dataset, the fine-tuning algorithm has achieved the best performance in the accuracy and training time.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Reducing structure of deep Convolutional Neural Networks for Huawei Accurate and Fast Mobile Video Annotation Challenge\",\"authors\":\"Yunlong Bian, Yuan Dong, Hongliang Bai, Bo Liu, Kai Wang, Yinan Liu\",\"doi\":\"10.1109/ICMEW.2014.6890608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Big structure of deep Convolutional Neural Networks (CNN) has staggeringly impressive improvement in the Imagenet Large Scale Visual Recognition Challenge (ILSVRC) 2012 and 2013. But only tens of classes are required to be trained in the most real applications. After the deep CNNs are trained in the ILSVRC dataset, efficiently transferring the big and deep structure to a new dataset is a tough problem. In this paper, three algorithms are proposed to implement the transfer, namely fine-tunning of the big structure, normalized Google distance and Wordnet lexical semantic similarity. After experiments are conducted in the Huawei accurate and fast Mobile Video Annotation Challenge (MoVAC) dataset, the fine-tuning algorithm has achieved the best performance in the accuracy and training time.\",\"PeriodicalId\":178700,\"journal\":{\"name\":\"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW.2014.6890608\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW.2014.6890608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reducing structure of deep Convolutional Neural Networks for Huawei Accurate and Fast Mobile Video Annotation Challenge
Big structure of deep Convolutional Neural Networks (CNN) has staggeringly impressive improvement in the Imagenet Large Scale Visual Recognition Challenge (ILSVRC) 2012 and 2013. But only tens of classes are required to be trained in the most real applications. After the deep CNNs are trained in the ILSVRC dataset, efficiently transferring the big and deep structure to a new dataset is a tough problem. In this paper, three algorithms are proposed to implement the transfer, namely fine-tunning of the big structure, normalized Google distance and Wordnet lexical semantic similarity. After experiments are conducted in the Huawei accurate and fast Mobile Video Annotation Challenge (MoVAC) dataset, the fine-tuning algorithm has achieved the best performance in the accuracy and training time.