{"title":"基于哈希层残差网络的大规模多标签图像检索","authors":"Baohua Qiang, Peiyao Wang, Shui-ping Guo, Zhi Xu, Wu Xie, Jinlong Chen, Xianjun Chen","doi":"10.1109/ICACI.2019.8778549","DOIUrl":null,"url":null,"abstract":"In recent years, increasing deep hashing methods have been applied in large-scale multi-label image retrieval. However, in the existing deep network models, the extracted low-level features cannot effectively integrate the multi-level semantic information and the similarity ranking information of pairwise multi-label images into one hash coding learning scheme. Therefore, we cannot obtain an efficient and accurate index method. Motivated by this, in this paper, we proposed a novel approach adopting the cosine distance of pairwise multi-label images semantic vector to quantify existing multi-level similarity in a multi-label image. Meanwhile, we utilized the residual network to learn the final representation of multi-label images features. Finally, we constructed a deep hashing framework to extract features and generate binary codes simultaneously. On the one hand, the improved model uses a deeper network and more complex network structures to enhance the ability of low-level features extraction. On the other hand, the improved model was trained by a fine-tuning strategy, which can accelerate the convergence speed. Extensive experiments on two popular multi-label datasets demonstrate that the improved model outperforms the reference models regarding accuracy. The mean average precision is improved by 1.0432 and 1.1114 times on two datasets, respectively.","PeriodicalId":213368,"journal":{"name":"2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Large-scale Multi-label Image Retrieval Using Residual Network with Hash Layer\",\"authors\":\"Baohua Qiang, Peiyao Wang, Shui-ping Guo, Zhi Xu, Wu Xie, Jinlong Chen, Xianjun Chen\",\"doi\":\"10.1109/ICACI.2019.8778549\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, increasing deep hashing methods have been applied in large-scale multi-label image retrieval. However, in the existing deep network models, the extracted low-level features cannot effectively integrate the multi-level semantic information and the similarity ranking information of pairwise multi-label images into one hash coding learning scheme. Therefore, we cannot obtain an efficient and accurate index method. Motivated by this, in this paper, we proposed a novel approach adopting the cosine distance of pairwise multi-label images semantic vector to quantify existing multi-level similarity in a multi-label image. Meanwhile, we utilized the residual network to learn the final representation of multi-label images features. Finally, we constructed a deep hashing framework to extract features and generate binary codes simultaneously. On the one hand, the improved model uses a deeper network and more complex network structures to enhance the ability of low-level features extraction. On the other hand, the improved model was trained by a fine-tuning strategy, which can accelerate the convergence speed. Extensive experiments on two popular multi-label datasets demonstrate that the improved model outperforms the reference models regarding accuracy. The mean average precision is improved by 1.0432 and 1.1114 times on two datasets, respectively.\",\"PeriodicalId\":213368,\"journal\":{\"name\":\"2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI)\",\"volume\":\"2010 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACI.2019.8778549\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI.2019.8778549","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Large-scale Multi-label Image Retrieval Using Residual Network with Hash Layer
In recent years, increasing deep hashing methods have been applied in large-scale multi-label image retrieval. However, in the existing deep network models, the extracted low-level features cannot effectively integrate the multi-level semantic information and the similarity ranking information of pairwise multi-label images into one hash coding learning scheme. Therefore, we cannot obtain an efficient and accurate index method. Motivated by this, in this paper, we proposed a novel approach adopting the cosine distance of pairwise multi-label images semantic vector to quantify existing multi-level similarity in a multi-label image. Meanwhile, we utilized the residual network to learn the final representation of multi-label images features. Finally, we constructed a deep hashing framework to extract features and generate binary codes simultaneously. On the one hand, the improved model uses a deeper network and more complex network structures to enhance the ability of low-level features extraction. On the other hand, the improved model was trained by a fine-tuning strategy, which can accelerate the convergence speed. Extensive experiments on two popular multi-label datasets demonstrate that the improved model outperforms the reference models regarding accuracy. The mean average precision is improved by 1.0432 and 1.1114 times on two datasets, respectively.