Xiaoyang Wu , Jiahui Qu , Wenqian Dong , Hongxiang Li , Song Xiao , Yunsong Li
{"title":"Registration-fusion binocular diffusion model: Exploring continuous fusion of unregistered hyperspectral and multispectral images","authors":"Xiaoyang Wu , Jiahui Qu , Wenqian Dong , Hongxiang Li , Song Xiao , Yunsong Li","doi":"10.1016/j.knosys.2025.114007","DOIUrl":null,"url":null,"abstract":"<div><div>The fusion of hyperspectral image (HSI) and multispectral image (MSI) constitutes an effective approach to obtain high spatial resolution HSI (HR-HSI). However, in practice, multi-source images obtained under different imaging conditions are difficult to be perfectly registered. The vast majority of existing fusion methods crudely assume that HSI and MSI are registered, which is challenging to generalize to practical applications. To this end, we propose a unified registration-fusion binocular diffusion model (RF-BDiff) to achieve continuous fusion of unregistered HSI and MSI. RF-BDiff alternately optimizes image registration and fusion tasks to gradually recover HR-HSI by repeated refinement over multi time steps. The refinement operation at each time step is performed by the specially designed implicit neural registration module (INRM) and invertible spatial–spectral fusion module (IS<sup>2</sup>FM). INRM parameterizes the mapping between spatial coordinates of MSI and spectral values of reconstructed image and HSI as continuous implicit functions to provide the registered spatial coordinates for subsequent refinement. IS<sup>2</sup>FM designs an invertible bijective transformation for modeling fusion problem of registered images to reduce the information loss in fusion process and enriches the spatial–spectral information of reconstructed image. INRM and IS<sup>2</sup>FM are executed alternately to continuous fuse unregistered HSI and MSI. Such a design is conducive to the generalization of RF-BDiff in real applications, and we especially provide a real dataset to verify its effectiveness in real scenarios. Systematic experiments on simulated and real datasets demonstrate the state-of-the-art performance of RF-BDiff.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"326 ","pages":"Article 114007"},"PeriodicalIF":7.6000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125010524","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The fusion of hyperspectral image (HSI) and multispectral image (MSI) constitutes an effective approach to obtain high spatial resolution HSI (HR-HSI). However, in practice, multi-source images obtained under different imaging conditions are difficult to be perfectly registered. The vast majority of existing fusion methods crudely assume that HSI and MSI are registered, which is challenging to generalize to practical applications. To this end, we propose a unified registration-fusion binocular diffusion model (RF-BDiff) to achieve continuous fusion of unregistered HSI and MSI. RF-BDiff alternately optimizes image registration and fusion tasks to gradually recover HR-HSI by repeated refinement over multi time steps. The refinement operation at each time step is performed by the specially designed implicit neural registration module (INRM) and invertible spatial–spectral fusion module (IS2FM). INRM parameterizes the mapping between spatial coordinates of MSI and spectral values of reconstructed image and HSI as continuous implicit functions to provide the registered spatial coordinates for subsequent refinement. IS2FM designs an invertible bijective transformation for modeling fusion problem of registered images to reduce the information loss in fusion process and enriches the spatial–spectral information of reconstructed image. INRM and IS2FM are executed alternately to continuous fuse unregistered HSI and MSI. Such a design is conducive to the generalization of RF-BDiff in real applications, and we especially provide a real dataset to verify its effectiveness in real scenarios. Systematic experiments on simulated and real datasets demonstrate the state-of-the-art performance of RF-BDiff.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.