{"title":"Contrastive learning-guided hashing model for artwork image retrieval","authors":"Zhenyu Wang, Yingdong Yang, Fucheng Wu, Wenjia Li","doi":"10.1016/j.asoc.2025.113486","DOIUrl":null,"url":null,"abstract":"<div><div>With the growing demand for spiritual enrichment, artwork retrieval has gained increasing popularity in computer vision. While current quick response (QR) code-based and deep learning-based methods enable real-time artwork recognition, their dependence on manual annotations limits adaptability and scalability. In this paper, we propose an unsupervised hashing-based artwork retrieval framework that requires no manual labeling and adapts to new classes. The system comprises two key components: a detection module and a retrieval module. The detection module processes user-captured artwork images by predicting polygonal bounding boxes and rectifying them through perspective transformation algorithms. The retrieval module is the focus of our research. We present a novel contrastive learning framework for artwork retrieval that integrates a convolutional neural network (CNN) based feature extractor and a hashing encoder–decoder structure. This architecture processes input images into both floating-point and hashing representations while maintaining a binary memory bank for efficient similarity matching. Two specialized loss functions facilitate adaptive hashing encoding, aligning floating-point and hashing features through an unsupervised learning process. We evaluate our framework on a self-built dataset containing over 24k images spanning traditional Chinese paintings, oil paintings, and Chinese chops. The samples are unlabeled and there is only one image of each artwork. Comparative experiments with state-of-the-art methods demonstrate our system’s superior effectiveness and strong potential for practical artwork retrieval applications.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"181 ","pages":"Article 113486"},"PeriodicalIF":7.2000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625007975","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
With the growing demand for spiritual enrichment, artwork retrieval has gained increasing popularity in computer vision. While current quick response (QR) code-based and deep learning-based methods enable real-time artwork recognition, their dependence on manual annotations limits adaptability and scalability. In this paper, we propose an unsupervised hashing-based artwork retrieval framework that requires no manual labeling and adapts to new classes. The system comprises two key components: a detection module and a retrieval module. The detection module processes user-captured artwork images by predicting polygonal bounding boxes and rectifying them through perspective transformation algorithms. The retrieval module is the focus of our research. We present a novel contrastive learning framework for artwork retrieval that integrates a convolutional neural network (CNN) based feature extractor and a hashing encoder–decoder structure. This architecture processes input images into both floating-point and hashing representations while maintaining a binary memory bank for efficient similarity matching. Two specialized loss functions facilitate adaptive hashing encoding, aligning floating-point and hashing features through an unsupervised learning process. We evaluate our framework on a self-built dataset containing over 24k images spanning traditional Chinese paintings, oil paintings, and Chinese chops. The samples are unlabeled and there is only one image of each artwork. Comparative experiments with state-of-the-art methods demonstrate our system’s superior effectiveness and strong potential for practical artwork retrieval applications.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.