{"title":"TCIGFusion:一种用于红外和可见光图像融合的两阶段相关特征交互引导网络","authors":"Jiawei Liu, Guiling Sun, Bowen Zheng, Liang Dong","doi":"10.1016/j.optlaseng.2025.109265","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared and visible image fusion is aimed at generating images with prominent targets and texture details, providing support for downstream applications such as object detection. However, most existing deep learning-based fusion methods involve single-stage training and manually designed fusion rules, which cannot effectively extract and fuse features. Therefore, in this paper, we propose a two-stage correlated feature interactive guided network termed TCIGFusion. In the first stage, a Unet-like dual-branch Transformer module and dynamic large kernel convolution block (DLKB) are used to extract global features from the two source images, while the convolution blocks extract local features from the source images. In the second phase, we designed a cross attention guide module (CAGM) to interactively fuse the heterogeneously related features of the two modalities, avoiding the complexity associated with manually designing fusion rules. Furthermore, to optimize the efficacy of the fusion network, we employ a combination of image reconstruction, decomposition, and gradient loss functions for unsupervised training of the model. The superiority of our TCIGFusion is evidenced by extensive experimentation conducted on multiple public datasets. These experiments demonstrate that our method outperforms other state-of-the-art deep learning approaches, as evaluated through both subjective and objective metrics.</div></div>","PeriodicalId":49719,"journal":{"name":"Optics and Lasers in Engineering","volume":"195 ","pages":"Article 109265"},"PeriodicalIF":3.7000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TCIGFusion: A two-stage correlated feature interactive guided network for infrared and visible image fusion\",\"authors\":\"Jiawei Liu, Guiling Sun, Bowen Zheng, Liang Dong\",\"doi\":\"10.1016/j.optlaseng.2025.109265\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Infrared and visible image fusion is aimed at generating images with prominent targets and texture details, providing support for downstream applications such as object detection. However, most existing deep learning-based fusion methods involve single-stage training and manually designed fusion rules, which cannot effectively extract and fuse features. Therefore, in this paper, we propose a two-stage correlated feature interactive guided network termed TCIGFusion. In the first stage, a Unet-like dual-branch Transformer module and dynamic large kernel convolution block (DLKB) are used to extract global features from the two source images, while the convolution blocks extract local features from the source images. In the second phase, we designed a cross attention guide module (CAGM) to interactively fuse the heterogeneously related features of the two modalities, avoiding the complexity associated with manually designing fusion rules. Furthermore, to optimize the efficacy of the fusion network, we employ a combination of image reconstruction, decomposition, and gradient loss functions for unsupervised training of the model. The superiority of our TCIGFusion is evidenced by extensive experimentation conducted on multiple public datasets. 
These experiments demonstrate that our method outperforms other state-of-the-art deep learning approaches, as evaluated through both subjective and objective metrics.</div></div>\",\"PeriodicalId\":49719,\"journal\":{\"name\":\"Optics and Lasers in Engineering\",\"volume\":\"195 \",\"pages\":\"Article 109265\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optics and Lasers in Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0143816625004506\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Lasers in Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0143816625004506","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPTICS","Score":null,"Total":0}
TCIGFusion: A two-stage correlated feature interactive guided network for infrared and visible image fusion
Infrared and visible image fusion aims to generate images with prominent targets and rich texture details, providing support for downstream applications such as object detection. However, most existing deep learning-based fusion methods rely on single-stage training and manually designed fusion rules, which cannot effectively extract and fuse features. Therefore, in this paper, we propose a two-stage correlated feature interactive guided network, termed TCIGFusion. In the first stage, a UNet-like dual-branch Transformer module and a dynamic large kernel convolution block (DLKB) extract global features from the two source images, while convolution blocks extract local features from the source images. In the second stage, we design a cross attention guide module (CAGM) to interactively fuse the correlated features of the two heterogeneous modalities, avoiding the complexity associated with manually designing fusion rules. Furthermore, to optimize the efficacy of the fusion network, we employ a combination of image reconstruction, decomposition, and gradient loss functions for unsupervised training of the model. The superiority of TCIGFusion is evidenced by extensive experiments conducted on multiple public datasets, which demonstrate that our method outperforms other state-of-the-art deep learning approaches under both subjective and objective evaluation metrics.
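For readers who want a concrete picture of the two ideas the abstract highlights, cross-attention-guided fusion and a gradient-based fusion loss, the PyTorch sketch below illustrates them in minimal form. It is not the paper's implementation: the class and function names (CrossAttentionGuide, gradient_loss), channel and head counts, and the max-of-gradients loss formulation are assumptions made for illustration only; the actual CAGM and loss design in TCIGFusion may differ.

```python
# Minimal sketch of cross-attention-guided fusion and a gradient loss,
# loosely inspired by the CAGM and gradient loss described in the abstract.
# All hyperparameters and design details here are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class CrossAttentionGuide(nn.Module):
    """Fuse infrared and visible feature maps by letting each modality
    attend to the other (hypothetical stand-in for the paper's CAGM)."""

    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        self.ir_queries_vis = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.vis_queries_ir = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.proj = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, feat_ir: torch.Tensor, feat_vis: torch.Tensor) -> torch.Tensor:
        b, c, h, w = feat_ir.shape
        # Flatten spatial dimensions into token sequences: (B, H*W, C).
        ir = feat_ir.flatten(2).transpose(1, 2)
        vis = feat_vis.flatten(2).transpose(1, 2)
        # Each modality queries the other, so correlated features guide the fusion.
        ir_guided, _ = self.ir_queries_vis(query=ir, key=vis, value=vis)
        vis_guided, _ = self.vis_queries_ir(query=vis, key=ir, value=ir)
        fused = torch.cat([ir_guided, vis_guided], dim=-1)       # (B, H*W, 2C)
        fused = fused.transpose(1, 2).reshape(b, 2 * c, h, w)    # back to feature maps
        return self.proj(fused)                                  # (B, C, H, W)


def gradient_loss(fused: torch.Tensor, ir: torch.Tensor, vis: torch.Tensor) -> torch.Tensor:
    """Illustrative gradient loss: push the fused image's gradients toward the
    element-wise maximum of the source gradients (a common fusion heuristic;
    not necessarily the exact formulation used in the paper)."""
    def grads(x):
        dx = (x[..., :, 1:] - x[..., :, :-1]).abs()  # horizontal differences
        dy = (x[..., 1:, :] - x[..., :-1, :]).abs()  # vertical differences
        return dx, dy

    fx, fy = grads(fused)
    ix, iy = grads(ir)
    vx, vy = grads(vis)
    return F.l1_loss(fx, torch.maximum(ix, vx)) + F.l1_loss(fy, torch.maximum(iy, vy))


if __name__ == "__main__":
    cagm = CrossAttentionGuide(channels=64)
    f_ir, f_vis = torch.randn(1, 64, 32, 32), torch.randn(1, 64, 32, 32)
    print(cagm(f_ir, f_vis).shape)  # torch.Size([1, 64, 32, 32])
```

In this reading, the cross-attention step replaces a hand-crafted fusion rule: the weighting between modalities is learned from the attention between their features, and the gradient term keeps texture detail from whichever source image is sharper at each location.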
Journal Introduction:
Optics and Lasers in Engineering aims at providing an international forum for the interchange of information on the development of optical techniques and laser technology in engineering. Emphasis is placed on contributions targeted at the practical use of methods and devices, the development and enhancement of solutions and new theoretical concepts for experimental methods.
Optics and Lasers in Engineering reflects the main areas in which optical methods are being used and developed for an engineering environment. Manuscripts should offer clear evidence of novelty and significance. Papers focusing on parameter optimization or computational issues are not suitable. Similarly, papers focused on an application rather than the optical method fall outside the journal's scope. The scope of the journal is defined to include the following:
- Optical Metrology
- Optical Methods for 3D visualization and virtual engineering
- Optical Techniques for Microsystems
- Imaging, Microscopy and Adaptive Optics
- Computational Imaging
- Laser methods in manufacturing
- Integrated optical and photonic sensors
- Optics and Photonics in Life Science
- Hyperspectral and spectroscopic methods
- Infrared and Terahertz techniques