Qian Wang, Jianqing Zhu, Wei Shao, Lei Wang, Xiaobin Zhu
{"title":"基于深度局部特征编码的图像分类","authors":"Qian Wang, Jianqing Zhu, Wei Shao, Lei Wang, Xiaobin Zhu","doi":"10.1109/ISPACS.2017.8266526","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an improved locally aggregated descriptor (VLAD) algorithm coded on deep local features for image classification. Firstly, convolutional neural network (CNN) is adopted to extract the dense local features of images. Secondly, a subset of feature, chosen by the criterion of normal distribution, is selected for high quality codebook generation. Finally, the local features are assigned to multi-neighbor visual words instead of the nearest one with different weights, simultaneously, the statistical distribution information about local features is taken into account during VLAD coding process. Extensive experiments on public available datasets demonstrate the promising performance of the proposed method against state-of-the-art methods.","PeriodicalId":166414,"journal":{"name":"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image classification based on deep local feature coding\",\"authors\":\"Qian Wang, Jianqing Zhu, Wei Shao, Lei Wang, Xiaobin Zhu\",\"doi\":\"10.1109/ISPACS.2017.8266526\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an improved locally aggregated descriptor (VLAD) algorithm coded on deep local features for image classification. Firstly, convolutional neural network (CNN) is adopted to extract the dense local features of images. Secondly, a subset of feature, chosen by the criterion of normal distribution, is selected for high quality codebook generation. Finally, the local features are assigned to multi-neighbor visual words instead of the nearest one with different weights, simultaneously, the statistical distribution information about local features is taken into account during VLAD coding process. Extensive experiments on public available datasets demonstrate the promising performance of the proposed method against state-of-the-art methods.\",\"PeriodicalId\":166414,\"journal\":{\"name\":\"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPACS.2017.8266526\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS.2017.8266526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Image classification based on deep local feature coding
In this paper, we propose an improved locally aggregated descriptor (VLAD) algorithm coded on deep local features for image classification. Firstly, convolutional neural network (CNN) is adopted to extract the dense local features of images. Secondly, a subset of feature, chosen by the criterion of normal distribution, is selected for high quality codebook generation. Finally, the local features are assigned to multi-neighbor visual words instead of the nearest one with different weights, simultaneously, the statistical distribution information about local features is taken into account during VLAD coding process. Extensive experiments on public available datasets demonstrate the promising performance of the proposed method against state-of-the-art methods.