轻量化多分支异构图像融合框架的再思考:基于并行Mamba-KAN框架的红外与可见光图像融合

IF 5 2区 物理与天体物理 Q1 OPTICS
Yichen Sun , Mingli Dong , Lianqing Zhu
{"title":"轻量化多分支异构图像融合框架的再思考:基于并行Mamba-KAN框架的红外与可见光图像融合","authors":"Yichen Sun ,&nbsp;Mingli Dong ,&nbsp;Lianqing Zhu","doi":"10.1016/j.optlastec.2025.112612","DOIUrl":null,"url":null,"abstract":"<div><div>The infrared and visible image fusion (IVIF) technique, as an important branch of image processing, has garnered significant attention due to its ability to capture thermal radiation features in low-light conditions and combine them with the rich detail and color information of visible images. However, challenges remain in this field, including the difficulty of balancing information from both modalities using homogenous fusion strategies and the increasing model complexity, which affects computational efficiency. To address these issues, we present an innovative lightweight IVIF framework based on the parallel Mamba-KAN (PMKFuse) model, featuring a multi-branch heterogeneous model design. By integrating our proposed multi-channel parallel cross-vision Mamba (PCVM) modules with parallel KAGtention (PKAGN) modules, our approach effectively extracts features at both global and local levels. This not only ensures high performance in image fusion tasks but also significantly reduces the number of model parameters. Additionally, a composite loss function is developed, integrating intensity, gradient, and feature decomposition losses to optimize the training process. The experimental results demonstrate that PMKFuse not only minimizes the number of model parameters but also outperforms the current state-of-the-art (SOTA) methods in both subjective visual evaluation and objective performance metrics for IVIF. These findings highlight the model’s effectiveness in improving fusion quality and its potential for wide-ranging practical applications, advancing the field of image processing. The codes are available at <span><span>https://github.com/sunyichen1994/PMKFuse.</span><svg><path></path></svg></span></div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"185 ","pages":"Article 112612"},"PeriodicalIF":5.0000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rethinking the approach to lightweight multi-branch heterogeneous image fusion frameworks: Infrared and visible image fusion via the parallel Mamba-KAN framework\",\"authors\":\"Yichen Sun ,&nbsp;Mingli Dong ,&nbsp;Lianqing Zhu\",\"doi\":\"10.1016/j.optlastec.2025.112612\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The infrared and visible image fusion (IVIF) technique, as an important branch of image processing, has garnered significant attention due to its ability to capture thermal radiation features in low-light conditions and combine them with the rich detail and color information of visible images. However, challenges remain in this field, including the difficulty of balancing information from both modalities using homogenous fusion strategies and the increasing model complexity, which affects computational efficiency. To address these issues, we present an innovative lightweight IVIF framework based on the parallel Mamba-KAN (PMKFuse) model, featuring a multi-branch heterogeneous model design. By integrating our proposed multi-channel parallel cross-vision Mamba (PCVM) modules with parallel KAGtention (PKAGN) modules, our approach effectively extracts features at both global and local levels. This not only ensures high performance in image fusion tasks but also significantly reduces the number of model parameters. Additionally, a composite loss function is developed, integrating intensity, gradient, and feature decomposition losses to optimize the training process. The experimental results demonstrate that PMKFuse not only minimizes the number of model parameters but also outperforms the current state-of-the-art (SOTA) methods in both subjective visual evaluation and objective performance metrics for IVIF. These findings highlight the model’s effectiveness in improving fusion quality and its potential for wide-ranging practical applications, advancing the field of image processing. The codes are available at <span><span>https://github.com/sunyichen1994/PMKFuse.</span><svg><path></path></svg></span></div></div>\",\"PeriodicalId\":19511,\"journal\":{\"name\":\"Optics and Laser Technology\",\"volume\":\"185 \",\"pages\":\"Article 112612\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2025-02-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optics and Laser Technology\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0030399225002002\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030399225002002","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

摘要

红外与可见光图像融合(IVIF)技术作为图像处理的一个重要分支,因其能够捕获弱光条件下的热辐射特征,并将其与可见光图像丰富的细节和色彩信息相结合而备受关注。然而,该领域仍然存在挑战,包括使用同质融合策略平衡两种模式的信息的困难以及模型复杂性的增加,这影响了计算效率。为了解决这些问题,我们提出了一个基于并行Mamba-KAN (PMKFuse)模型的创新轻量级IVIF框架,具有多分支异构模型设计。通过将我们提出的多通道并行交叉视觉曼巴(PCVM)模块与并行kagattention (PKAGN)模块相结合,我们的方法有效地提取了全局和局部水平的特征。这不仅保证了图像融合任务的高性能,而且大大减少了模型参数的数量。此外,开发了一种复合损失函数,集成强度、梯度和特征分解损失来优化训练过程。实验结果表明,PMKFuse不仅可以最大限度地减少模型参数的数量,而且在IVIF的主观视觉评价和客观性能指标方面都优于当前最先进的SOTA方法。这些发现突出了该模型在提高融合质量方面的有效性及其广泛的实际应用潜力,推动了图像处理领域的发展。代码可在https://github.com/sunyichen1994/PMKFuse上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Rethinking the approach to lightweight multi-branch heterogeneous image fusion frameworks: Infrared and visible image fusion via the parallel Mamba-KAN framework
The infrared and visible image fusion (IVIF) technique, as an important branch of image processing, has garnered significant attention due to its ability to capture thermal radiation features in low-light conditions and combine them with the rich detail and color information of visible images. However, challenges remain in this field, including the difficulty of balancing information from both modalities using homogenous fusion strategies and the increasing model complexity, which affects computational efficiency. To address these issues, we present an innovative lightweight IVIF framework based on the parallel Mamba-KAN (PMKFuse) model, featuring a multi-branch heterogeneous model design. By integrating our proposed multi-channel parallel cross-vision Mamba (PCVM) modules with parallel KAGtention (PKAGN) modules, our approach effectively extracts features at both global and local levels. This not only ensures high performance in image fusion tasks but also significantly reduces the number of model parameters. Additionally, a composite loss function is developed, integrating intensity, gradient, and feature decomposition losses to optimize the training process. The experimental results demonstrate that PMKFuse not only minimizes the number of model parameters but also outperforms the current state-of-the-art (SOTA) methods in both subjective visual evaluation and objective performance metrics for IVIF. These findings highlight the model’s effectiveness in improving fusion quality and its potential for wide-ranging practical applications, advancing the field of image processing. The codes are available at https://github.com/sunyichen1994/PMKFuse.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.50
自引率
10.00%
发文量
1060
审稿时长
3.4 months
期刊介绍: Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication. The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas: •development in all types of lasers •developments in optoelectronic devices and photonics •developments in new photonics and optical concepts •developments in conventional optics, optical instruments and components •techniques of optical metrology, including interferometry and optical fibre sensors •LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow •applications of lasers to materials processing, optical NDT display (including holography) and optical communication •research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume) •developments in optical computing and optical information processing •developments in new optical materials •developments in new optical characterization methods and techniques •developments in quantum optics •developments in light assisted micro and nanofabrication methods and techniques •developments in nanophotonics and biophotonics •developments in imaging processing and systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信