Weichao Yi , Liquan Dong , Ming Liu , Lingqin Kong , Yue Yang , Xuhong Chu , Yuejin Zhao
{"title":"Towards haze removal with derived pseudo-label supervision from real-world non-aligned training data","authors":"Weichao Yi , Liquan Dong , Ming Liu , Lingqin Kong , Yue Yang , Xuhong Chu , Yuejin Zhao","doi":"10.1016/j.inffus.2025.103104","DOIUrl":null,"url":null,"abstract":"<div><div>Single-image dehazing seeks to restore clear images by addressing degradation issues caused by hazy conditions, such as detail loss and color distortion. However, since collecting large-scale and precisely aligned hazy/clear image pairs is unrealistic in real-world scenarios, existing data-driven learning-based dehazing algorithms are often affected by data authenticity and domain gaps between synthetic and real-world scenes, resulting in unsatisfactory performance. To this end, we propose a novel haze removal framework derived from real-world captured non-aligned training data. Specifically, our framework can be divided into two components: a pseudo-label supervision generation stage and an image dehazing stage. For one thing, the former stage explores clean-related style information from the haze-free image and transfers it to its corresponding hazy counterpart, thus generating fine-aligned training image pairs. More clearly, we relieve domain divergence and pixel misalignment through a well-designed Cross-Modulation Align Network (CMA-Net), which includes the Domain Transfer Module (DTM) and Feature Alignment Module (FAM). For another, the later stage focuses on constructing an effective dehazing architecture with a pseudo-label pixel-wise supervision training paradigm. Therefore, we propose a standard U-shape dehazing network with Physics-related Feature Unit (PFU) and Gate Attentive Fusion (GAF). Furthermore, we establish a new non-aligned hazy/clear dataset named Hazy-JXBIT by our camera devices, to further evaluate the proposed dehazing framework. Extensive experimental results demonstrate that these fine-aligned pseudo-label training pairs generated by CMA-Net can be beneficial for building a steady dehazing network USD-Net, and prompt us to obtain superior performance over existing state-of-the-art methods.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"120 ","pages":"Article 103104"},"PeriodicalIF":14.7000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525001770","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Single-image dehazing seeks to restore clear images by addressing degradation issues caused by hazy conditions, such as detail loss and color distortion. However, since collecting large-scale and precisely aligned hazy/clear image pairs is unrealistic in real-world scenarios, existing data-driven learning-based dehazing algorithms are often affected by data authenticity and domain gaps between synthetic and real-world scenes, resulting in unsatisfactory performance. To this end, we propose a novel haze removal framework derived from real-world captured non-aligned training data. Specifically, our framework can be divided into two components: a pseudo-label supervision generation stage and an image dehazing stage. For one thing, the former stage explores clean-related style information from the haze-free image and transfers it to its corresponding hazy counterpart, thus generating fine-aligned training image pairs. More clearly, we relieve domain divergence and pixel misalignment through a well-designed Cross-Modulation Align Network (CMA-Net), which includes the Domain Transfer Module (DTM) and Feature Alignment Module (FAM). For another, the later stage focuses on constructing an effective dehazing architecture with a pseudo-label pixel-wise supervision training paradigm. Therefore, we propose a standard U-shape dehazing network with Physics-related Feature Unit (PFU) and Gate Attentive Fusion (GAF). Furthermore, we establish a new non-aligned hazy/clear dataset named Hazy-JXBIT by our camera devices, to further evaluate the proposed dehazing framework. Extensive experimental results demonstrate that these fine-aligned pseudo-label training pairs generated by CMA-Net can be beneficial for building a steady dehazing network USD-Net, and prompt us to obtain superior performance over existing state-of-the-art methods.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.