{"title":"Commodity Image Retrieval Method Based on CNN and Score Fusion","authors":"Zihao Liu, Xiaoyu Wu, Jiayao Qian, Zhiyi Zhu","doi":"10.1109/ICCST53801.2021.00101","DOIUrl":null,"url":null,"abstract":"Content-based image retrieval (CBIR) is an important research direction of computer vision and has been well studied for a long period. The important aim in CBIR is to reduce the semantic gap issue that improves the performance of image retrieval. It plays an important role in the field of e-commerce. This paper proposes a CBIR method based on ResNet and EfficientNet, and conducts experiments on the dataset of eProduct Visual Search Challenge 2021. First, because the dataset lacks direct query-index tags and the categories are not balanced, transfer learning is used to fine-tune the model with a balanced sub-dataset constructed by data augmentation. Second, the semantic features are extracted by the ResNet and EfficientNet and similarity comparison is calculated between the query and index image. Finally, the similarity result is decided through fusing the scores obtained by the two CNNs through a strategy. Experiments on the eProduct dataset demonstrate our algorithm can achieve a good performance.","PeriodicalId":222463,"journal":{"name":"2021 International Conference on Culture-oriented Science & Technology (ICCST)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Culture-oriented Science & Technology (ICCST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCST53801.2021.00101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Content-based image retrieval (CBIR) is an important research direction of computer vision and has been well studied for a long period. The important aim in CBIR is to reduce the semantic gap issue that improves the performance of image retrieval. It plays an important role in the field of e-commerce. This paper proposes a CBIR method based on ResNet and EfficientNet, and conducts experiments on the dataset of eProduct Visual Search Challenge 2021. First, because the dataset lacks direct query-index tags and the categories are not balanced, transfer learning is used to fine-tune the model with a balanced sub-dataset constructed by data augmentation. Second, the semantic features are extracted by the ResNet and EfficientNet and similarity comparison is calculated between the query and index image. Finally, the similarity result is decided through fusing the scores obtained by the two CNNs through a strategy. Experiments on the eProduct dataset demonstrate our algorithm can achieve a good performance.