Contrastive Learning-Driven Image Dehazing with Multi-Scale Feature Fusion and Hybrid Attention Mechanism.

IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

Journal of Imaging Pub Date : 2025-08-26 DOI:10.3390/jimaging11090290

Huazhong Zhang, Jiaozhuo Wang, Xiaoguang Tu, Zhiyi Niu, Yu Wang

{"title":"Contrastive Learning-Driven Image Dehazing with Multi-Scale Feature Fusion and Hybrid Attention Mechanism.","authors":"Huazhong Zhang, Jiaozhuo Wang, Xiaoguang Tu, Zhiyi Niu, Yu Wang","doi":"10.3390/jimaging11090290","DOIUrl":null,"url":null,"abstract":"<p><p>Image dehazing is critical for visual enhancement and a wide range of computer vision applications. Despite significant advancements, challenges remain in preserving fine details and adapting to diverse, non-uniformly degraded scenes. To address these issues, we propose a novel image dehazing method that introduces a contrastive learning framework, enhanced by the InfoNCE loss, to improve model robustness. In this framework, hazy images are treated as negative samples and their clear counterparts as positive samples. By optimizing the InfoNCE loss, the model is trained to maximize the similarity between positive pairs and minimize that between negative pairs, thereby improving its ability to distinguish haze artifacts from intrinsic scene features and better preserving the structural integrity of images. In addition to contrastive learning, our method integrates a multi-scale dynamic feature fusion with a hybrid attention mechanism. Specifically, we introduce dynamically adjustable frequency band filters and refine the hybrid attention module to more effectively capture fine-grained, cross-scale image details. Extensive experiments on the RESIDE-6K and RS-Haze datasets demonstrate that our approach outperforms most existing methods, offering a promising solution for practical image dehazing applications.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"11 9","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12470370/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/jimaging11090290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Image dehazing is critical for visual enhancement and a wide range of computer vision applications. Despite significant advancements, challenges remain in preserving fine details and adapting to diverse, non-uniformly degraded scenes. To address these issues, we propose a novel image dehazing method that introduces a contrastive learning framework, enhanced by the InfoNCE loss, to improve model robustness. In this framework, hazy images are treated as negative samples and their clear counterparts as positive samples. By optimizing the InfoNCE loss, the model is trained to maximize the similarity between positive pairs and minimize that between negative pairs, thereby improving its ability to distinguish haze artifacts from intrinsic scene features and better preserving the structural integrity of images. In addition to contrastive learning, our method integrates a multi-scale dynamic feature fusion with a hybrid attention mechanism. Specifically, we introduce dynamically adjustable frequency band filters and refine the hybrid attention module to more effectively capture fine-grained, cross-scale image details. Extensive experiments on the RESIDE-6K and RS-Haze datasets demonstrate that our approach outperforms most existing methods, offering a promising solution for practical image dehazing applications.

查看原文本刊更多论文

基于多尺度特征融合和混合注意机制的对比学习驱动图像去雾。

图像去雾对于视觉增强和广泛的计算机视觉应用至关重要。尽管取得了重大进展，但在保留细节和适应多样化、非均匀退化场景方面仍然存在挑战。为了解决这些问题，我们提出了一种新的图像去雾方法，该方法引入了一个对比学习框架，通过InfoNCE损失增强，以提高模型的鲁棒性。在这个框架中，模糊图像被视为负样本，清晰图像被视为正样本。通过优化InfoNCE损失，训练模型最大化正对之间的相似度，最小化负对之间的相似度，从而提高模型区分雾霾伪影与场景固有特征的能力，更好地保持图像的结构完整性。除了对比学习之外，我们的方法还将多尺度动态特征融合与混合注意机制相结合。具体来说，我们引入了动态可调频带滤波器，并改进了混合注意力模块，以更有效地捕获细粒度、跨尺度的图像细节。在RESIDE-6K和RS-Haze数据集上进行的大量实验表明，我们的方法优于大多数现有方法，为实际图像去雾应用提供了一个有前途的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊