Deep Consistent Penalizing Hashing with noise-robust representation for large-scale image retrieval

IF 6.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-03-17 DOI:10.1016/j.neucom.2025.130014

Qibing Qin , Hong Wang , Wenfeng Zhang , Lei Huang , Jie Nie

{"title":"Deep Consistent Penalizing Hashing with noise-robust representation for large-scale image retrieval","authors":"Qibing Qin , Hong Wang , Wenfeng Zhang , Lei Huang , Jie Nie","doi":"10.1016/j.neucom.2025.130014","DOIUrl":null,"url":null,"abstract":"<div><div>Benefiting from the powerful representational capacity of deep learning and the attractive computational efficiency of binary codes, deep hashing frameworks have made large progress for large-scale image retrieval applications. By calculating the priori labels, most existing deep supervised hashing usually introduces the effective margin-based objective loss to generate label-level penalizing boundaries for training samples during the model optimization. However, the decision boundaries from label-level penalizing may be inconsistent with semantic relations hidden in raw samples, compromising the performance. Besides, for classes with low intra-class variances or inter-class correlations, the force field of the margin-based methods might be too weak to learn the discriminant embedding space. In this paper, we solve this dilemma with a novel unified deep hashing framework, termed Deep Consistent Penalizing Hashing with noise-robust representation (DCPH), to generate compact yet discriminative binary codes for efficient and accurate image retrieval. Specifically, by learning the unbalanced correlations of training samples, the semantic consistency penalizing loss, consisting of pulling penalizing elements and pushing penalizing elements, is proposed to generate the semantic decision boundaries across classes. For parameter optimization, the dice-like optimization strategy is introduced to balance the pulling and pushing field, facilitating the generation of highly separable Hamming space. Besides, to mitigate the negative influence caused by objective-unrelated information or noise, by introducing patch-wise attention strategy and depth-wise convolution operation, the noise-robust representation module is developed to capture the robust feature descriptor with abundant fine-grained information. Comprehensive evaluations are performed on several benchmark datasets, and the experimental results consistently validate the effectiveness of our proposed DCPH framework, which significantly outperforms the state-of-the-art deep hashing methods.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"635 ","pages":"Article 130014"},"PeriodicalIF":6.5000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225006861","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Benefiting from the powerful representational capacity of deep learning and the attractive computational efficiency of binary codes, deep hashing frameworks have made large progress for large-scale image retrieval applications. By calculating the priori labels, most existing deep supervised hashing usually introduces the effective margin-based objective loss to generate label-level penalizing boundaries for training samples during the model optimization. However, the decision boundaries from label-level penalizing may be inconsistent with semantic relations hidden in raw samples, compromising the performance. Besides, for classes with low intra-class variances or inter-class correlations, the force field of the margin-based methods might be too weak to learn the discriminant embedding space. In this paper, we solve this dilemma with a novel unified deep hashing framework, termed Deep Consistent Penalizing Hashing with noise-robust representation (DCPH), to generate compact yet discriminative binary codes for efficient and accurate image retrieval. Specifically, by learning the unbalanced correlations of training samples, the semantic consistency penalizing loss, consisting of pulling penalizing elements and pushing penalizing elements, is proposed to generate the semantic decision boundaries across classes. For parameter optimization, the dice-like optimization strategy is introduced to balance the pulling and pushing field, facilitating the generation of highly separable Hamming space. Besides, to mitigate the negative influence caused by objective-unrelated information or noise, by introducing patch-wise attention strategy and depth-wise convolution operation, the noise-robust representation module is developed to capture the robust feature descriptor with abundant fine-grained information. Comprehensive evaluations are performed on several benchmark datasets, and the experimental results consistently validate the effectiveness of our proposed DCPH framework, which significantly outperforms the state-of-the-art deep hashing methods.

查看原文本刊更多论文

基于噪声鲁棒表示的深度一致惩罚哈希大规模图像检索

得益于深度学习强大的表示能力和二进制码诱人的计算效率，深度哈希框架在大规模图像检索应用中取得了很大进展。现有的深度监督哈希算法通常通过计算先验标签，在模型优化过程中引入有效的基于边缘的目标损失来生成训练样本的标签级惩罚边界。然而，标签级惩罚的决策边界可能与原始样本中隐藏的语义关系不一致，从而影响性能。此外，对于类内方差或类间相关性较低的类，基于边缘的方法的力场可能太弱，无法学习到判别嵌入空间。在本文中，我们用一种新的统一深度哈希框架解决了这一困境，称为具有噪声鲁棒表示的深度一致惩罚哈希（DCPH），以生成紧凑但具有区别性的二进制代码，用于高效准确的图像检索。具体而言，通过学习训练样本的不平衡相关性，提出了由拉罚元素和推罚元素组成的语义一致性惩罚损失，以生成跨类的语义决策边界。在参数优化方面，引入类骰子优化策略平衡拉推场，促进生成高度可分的Hamming空间。此外，为了减轻客观无关信息或噪声对图像的负面影响，通过引入局部关注策略和深度卷积运算，开发了噪声鲁棒表示模块，以捕获具有丰富细粒度信息的鲁棒特征描述子。在几个基准数据集上进行了全面的评估，实验结果一致地验证了我们提出的DCPH框架的有效性，该框架明显优于最先进的深度哈希方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.