Graph hashing network for image retrieval

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Image and Vision Computing Pub Date : 2025-07-30 DOI:10.1016/j.imavis.2025.105677

Xudong Zhou , Jun Tang , Ke Wang , Nian Wang , Han Chen

{"title":"Graph hashing network for image retrieval","authors":"Xudong Zhou , Jun Tang , Ke Wang , Nian Wang , Han Chen","doi":"10.1016/j.imavis.2025.105677","DOIUrl":null,"url":null,"abstract":"<div><div>Deep supervised hashing is more popular among researchers due to its satisfactory computational efficiency and retrieval performance. Most existing models learn hash codes for data by constructing inter-sample pair-wise or triplet losses, allowing for consideration of the topological information from the label space. However, the topological relationships among samples in the feature space are not fully explored, which may result in less discriminative hash codes. To address this issue, we propose a novel graph hashing network (GHash) for image retrieval. Our GHash explores positional relationships among samples under a large receptive field through alternating updates of graph nodes and edges, generating high-quality image descriptors based on optimized positional relationships and neighborhood information. Subsequently, graph-level descriptors are mapped into highly discriminative hash codes. Additionally, we introduce an extra classification loss to enhance the accuracy of the topological relationships among samples in the graph by supervising the learning of edge features. Finally, we conduct extensive comparison and ablation experiments on three benchmark datasets, with results demonstrating that our method achieves superior retrieval performance compared to state-of-the-art deep hashing methods.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"162 ","pages":"Article 105677"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0262885625002653","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Deep supervised hashing is more popular among researchers due to its satisfactory computational efficiency and retrieval performance. Most existing models learn hash codes for data by constructing inter-sample pair-wise or triplet losses, allowing for consideration of the topological information from the label space. However, the topological relationships among samples in the feature space are not fully explored, which may result in less discriminative hash codes. To address this issue, we propose a novel graph hashing network (GHash) for image retrieval. Our GHash explores positional relationships among samples under a large receptive field through alternating updates of graph nodes and edges, generating high-quality image descriptors based on optimized positional relationships and neighborhood information. Subsequently, graph-level descriptors are mapped into highly discriminative hash codes. Additionally, we introduce an extra classification loss to enhance the accuracy of the topological relationships among samples in the graph by supervising the learning of edge features. Finally, we conduct extensive comparison and ablation experiments on three benchmark datasets, with results demonstrating that our method achieves superior retrieval performance compared to state-of-the-art deep hashing methods.

查看原文本刊更多论文

图哈希网络图像检索

深度监督哈希因其令人满意的计算效率和检索性能而受到研究人员的青睐。大多数现有模型通过构造样本间成对或三元组损失来学习数据的哈希码，允许考虑来自标签空间的拓扑信息。然而，特征空间中样本之间的拓扑关系没有得到充分的探索，这可能导致判别哈希码较少。为了解决这个问题，我们提出了一种新的图像检索图哈希网络（GHash）。我们的GHash通过交替更新图节点和边来探索大接受域下样本之间的位置关系，基于优化的位置关系和邻域信息生成高质量的图像描述符。随后，将图级描述符映射为高度判别的哈希码。此外，我们引入了一个额外的分类损失，通过监督边缘特征的学习来提高图中样本之间拓扑关系的准确性。最后，我们在三个基准数据集上进行了广泛的比较和消融实验，结果表明，与最先进的深度哈希方法相比，我们的方法取得了更好的检索性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Image and Vision Computing 工程技术-工程：电子与电气

CiteScore

8.50

自引率

8.50%

发文量

143

审稿时长

7.8 months

期刊介绍： Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.