Huangqimei Zheng , Chengyi Pan , Xin Jin , Michal Wozniak , Puming Wang , Shin-Jye Lee , Qian Jiang
{"title":"A pan-sharpening model using dual-branch attention-guided diffusion networks","authors":"Huangqimei Zheng , Chengyi Pan , Xin Jin , Michal Wozniak , Puming Wang , Shin-Jye Lee , Qian Jiang","doi":"10.1016/j.inffus.2025.103076","DOIUrl":null,"url":null,"abstract":"<div><div>Pan-sharpening has significant applications in remote sensing image processing. By fusing high-resolution panchromatic (PAN) images with low-resolution multispectral (MS) images, it generates high-spatial-resolution images with multispectral information (HRMS). Most deep learning methods, while excelling at extracting single-modality image features, face limitations in capturing the global joint distribution of cross-modality images. To address these issues, this paper introduces a novel pan-sharpening model, the Dual-Branch Attention-Guided Diffusion Network (DADiff), which incorporates diffusion models capable of effectively reconstructing the latent distribution of images. DADiff consists of a Diffusion Branch and an Attention-Guided Branch. The diffusion branch captures the global joint features of PAN and MS images during the denoising process, constructing a cross-modal distribution for HRMS images. Meanwhile, the attention-guided branch enhances the high-frequency details and local features of PAN and MS images through a multi-scale convolutional dense connection module and an improved attention mechanism. Leveraging the diffusion model’s global modeling capability and the multi-scale attention mechanism’s detail-capturing advantage, this network significantly enhances the spatial and spectral fidelity of generated HRMS images. Extensive experimental results validate the effectiveness of the proposed modules. We conducted experiments on the WorldViewII, QuickBird, and Maryland datasets, and the results confirm the superiority of DADiff over state-of-the-art methods.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"120 ","pages":"Article 103076"},"PeriodicalIF":14.7000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525001496","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Pan-sharpening has significant applications in remote sensing image processing. By fusing high-resolution panchromatic (PAN) images with low-resolution multispectral (MS) images, it generates high-spatial-resolution images with multispectral information (HRMS). Most deep learning methods, while excelling at extracting single-modality image features, face limitations in capturing the global joint distribution of cross-modality images. To address these issues, this paper introduces a novel pan-sharpening model, the Dual-Branch Attention-Guided Diffusion Network (DADiff), which incorporates diffusion models capable of effectively reconstructing the latent distribution of images. DADiff consists of a Diffusion Branch and an Attention-Guided Branch. The diffusion branch captures the global joint features of PAN and MS images during the denoising process, constructing a cross-modal distribution for HRMS images. Meanwhile, the attention-guided branch enhances the high-frequency details and local features of PAN and MS images through a multi-scale convolutional dense connection module and an improved attention mechanism. Leveraging the diffusion model’s global modeling capability and the multi-scale attention mechanism’s detail-capturing advantage, this network significantly enhances the spatial and spectral fidelity of generated HRMS images. Extensive experimental results validate the effectiveness of the proposed modules. We conducted experiments on the WorldViewII, QuickBird, and Maryland datasets, and the results confirm the superiority of DADiff over state-of-the-art methods.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.