Jiulong Peng, Wei Zhang, Yingli Hou, Hai Yu, Zhiliang Zhu
{"title":"ECAFusion:红外和可见光图像融合,通过边缘保持和跨模态注意机制","authors":"Jiulong Peng, Wei Zhang, Yingli Hou, Hai Yu, Zhiliang Zhu","doi":"10.1016/j.infrared.2025.106085","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared and visible image fusion (IVIF) strives to integrate detailed textures with clear thermal objects, thus obtaining an information-rich image to enhance the performance of downstream tasks. However, existing image fusion networks generally overlook critical information, which includes edge details, and fail to facilitate effective cross-modal information interaction when extracting features across modalities. As a result, the fused outputs often fall short of expectations. To overcome this challenge, we propose ECAFusion, a dual-branch fusion model for IVIF that incorporates an edge-preserving and cross-modal attention mechanism. Specifically, we design a Sobel convolution block (SCB) by applying the Sobel operator to preserve edge details. Additionally, a cross-modal interaction module (CMIM) is proposed to capture complementary information from different modalities. Finally, we employ a well-designed feature fusion module to combine these features from different modalities. Comprehensive experiments show that ECAFusion surpasses the most advanced fusion methods. Moreover, our model significantly improves the performance of high-level vision tasks.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"151 ","pages":"Article 106085"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ECAFusion: Infrared and visible image fusion via edge-preserving and cross-modal attention mechanism\",\"authors\":\"Jiulong Peng, Wei Zhang, Yingli Hou, Hai Yu, Zhiliang Zhu\",\"doi\":\"10.1016/j.infrared.2025.106085\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Infrared and visible image fusion (IVIF) strives to integrate detailed textures with clear thermal objects, thus obtaining an information-rich image to enhance the performance of downstream tasks. However, existing image fusion networks generally overlook critical information, which includes edge details, and fail to facilitate effective cross-modal information interaction when extracting features across modalities. As a result, the fused outputs often fall short of expectations. To overcome this challenge, we propose ECAFusion, a dual-branch fusion model for IVIF that incorporates an edge-preserving and cross-modal attention mechanism. Specifically, we design a Sobel convolution block (SCB) by applying the Sobel operator to preserve edge details. Additionally, a cross-modal interaction module (CMIM) is proposed to capture complementary information from different modalities. Finally, we employ a well-designed feature fusion module to combine these features from different modalities. Comprehensive experiments show that ECAFusion surpasses the most advanced fusion methods. Moreover, our model significantly improves the performance of high-level vision tasks.</div></div>\",\"PeriodicalId\":13549,\"journal\":{\"name\":\"Infrared Physics & Technology\",\"volume\":\"151 \",\"pages\":\"Article 106085\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Infrared Physics & Technology\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1350449525003780\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525003780","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
ECAFusion: Infrared and visible image fusion via edge-preserving and cross-modal attention mechanism
Infrared and visible image fusion (IVIF) strives to integrate detailed textures with clear thermal objects, thus obtaining an information-rich image to enhance the performance of downstream tasks. However, existing image fusion networks generally overlook critical information, which includes edge details, and fail to facilitate effective cross-modal information interaction when extracting features across modalities. As a result, the fused outputs often fall short of expectations. To overcome this challenge, we propose ECAFusion, a dual-branch fusion model for IVIF that incorporates an edge-preserving and cross-modal attention mechanism. Specifically, we design a Sobel convolution block (SCB) by applying the Sobel operator to preserve edge details. Additionally, a cross-modal interaction module (CMIM) is proposed to capture complementary information from different modalities. Finally, we employ a well-designed feature fusion module to combine these features from different modalities. Comprehensive experiments show that ECAFusion surpasses the most advanced fusion methods. Moreover, our model significantly improves the performance of high-level vision tasks.
期刊介绍:
The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region.
Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine.
Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.