{"title":"基于空频域交叉学习的红外与可见光图像融合网络","authors":"Haode Shi, Hongyu Chu, Yanhua Shao, XiaoQiang Zhang","doi":"10.1016/j.infrared.2025.105854","DOIUrl":null,"url":null,"abstract":"<div><div>The goal of infrared and visible image fusion is to combine complementary information from source images to generate fused images with high contrast, which can highlight salient targets while preserving rich texture details. Most deep learning-based fusion methods focus solely on the spatial domain, neglecting valuable frequency domain information. Furthermore, existing spatial-frequency fusion networks fail to fully exploit the advantages of both domains, resulting in limited fusion performance. To address this challenge, we propose a Spatial-Frequency Domain Cross-Learning Network (SFCFusion) for infrared and visible image fusion. Specifically, we have designed a frequency-domain learning branch that captures global feature information in the Fourier space, thereby more effectively preserving the global consistency of the source images. Additionally, we develop a spatial branch to extract local detail features and propose a Multi-scale Selective Enhancement Module (MSEM). Finally, when bridging the frequency and spatial branches, we observe that the feature information extracted from these two branches are complementary. To leverage this property, we design a Frequency-Spatial Cross-Guidance Module (FSCGM). This module employs a bidirectional guidance learning strategy to integrate critical information from both domains into each branch, thereby enhancing the quality of fused images. Extensive experiments on public datasets demonstrate that our method achieves significant advantages in terms of key fusion performance metrics and visual quality. It also exhibits robust performance in downstream detection tasks. Our code is available at <span><span>https://github.com/HaodeShi/SFCFusion</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"148 ","pages":"Article 105854"},"PeriodicalIF":3.1000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Infrared and visible image fusion network based on spatial-frequency domain cross-learning\",\"authors\":\"Haode Shi, Hongyu Chu, Yanhua Shao, XiaoQiang Zhang\",\"doi\":\"10.1016/j.infrared.2025.105854\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The goal of infrared and visible image fusion is to combine complementary information from source images to generate fused images with high contrast, which can highlight salient targets while preserving rich texture details. Most deep learning-based fusion methods focus solely on the spatial domain, neglecting valuable frequency domain information. Furthermore, existing spatial-frequency fusion networks fail to fully exploit the advantages of both domains, resulting in limited fusion performance. To address this challenge, we propose a Spatial-Frequency Domain Cross-Learning Network (SFCFusion) for infrared and visible image fusion. Specifically, we have designed a frequency-domain learning branch that captures global feature information in the Fourier space, thereby more effectively preserving the global consistency of the source images. Additionally, we develop a spatial branch to extract local detail features and propose a Multi-scale Selective Enhancement Module (MSEM). Finally, when bridging the frequency and spatial branches, we observe that the feature information extracted from these two branches are complementary. To leverage this property, we design a Frequency-Spatial Cross-Guidance Module (FSCGM). This module employs a bidirectional guidance learning strategy to integrate critical information from both domains into each branch, thereby enhancing the quality of fused images. Extensive experiments on public datasets demonstrate that our method achieves significant advantages in terms of key fusion performance metrics and visual quality. It also exhibits robust performance in downstream detection tasks. Our code is available at <span><span>https://github.com/HaodeShi/SFCFusion</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":13549,\"journal\":{\"name\":\"Infrared Physics & Technology\",\"volume\":\"148 \",\"pages\":\"Article 105854\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Infrared Physics & Technology\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1350449525001471\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525001471","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
Infrared and visible image fusion network based on spatial-frequency domain cross-learning
The goal of infrared and visible image fusion is to combine complementary information from source images to generate fused images with high contrast, which can highlight salient targets while preserving rich texture details. Most deep learning-based fusion methods focus solely on the spatial domain, neglecting valuable frequency domain information. Furthermore, existing spatial-frequency fusion networks fail to fully exploit the advantages of both domains, resulting in limited fusion performance. To address this challenge, we propose a Spatial-Frequency Domain Cross-Learning Network (SFCFusion) for infrared and visible image fusion. Specifically, we have designed a frequency-domain learning branch that captures global feature information in the Fourier space, thereby more effectively preserving the global consistency of the source images. Additionally, we develop a spatial branch to extract local detail features and propose a Multi-scale Selective Enhancement Module (MSEM). Finally, when bridging the frequency and spatial branches, we observe that the feature information extracted from these two branches are complementary. To leverage this property, we design a Frequency-Spatial Cross-Guidance Module (FSCGM). This module employs a bidirectional guidance learning strategy to integrate critical information from both domains into each branch, thereby enhancing the quality of fused images. Extensive experiments on public datasets demonstrate that our method achieves significant advantages in terms of key fusion performance metrics and visual quality. It also exhibits robust performance in downstream detection tasks. Our code is available at https://github.com/HaodeShi/SFCFusion.
期刊介绍:
The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region.
Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine.
Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.