W-EICMFusion: A fusion network for infrared and visible images utilising WOA hyperparameter optimisation

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Xiaocong Wu , Xin Feng , Xuanlong Lu , Yi Yuan , Meina Huang , Yu Shen , Jiacheng Li
{"title":"W-EICMFusion: A fusion network for infrared and visible images utilising WOA hyperparameter optimisation","authors":"Xiaocong Wu ,&nbsp;Xin Feng ,&nbsp;Xuanlong Lu ,&nbsp;Yi Yuan ,&nbsp;Meina Huang ,&nbsp;Yu Shen ,&nbsp;Jiacheng Li","doi":"10.1016/j.patcog.2025.112531","DOIUrl":null,"url":null,"abstract":"<div><div>Currently, most innovations in image fusion methods focus on designing network architectures for source image feature extraction and formulating loss functions for fusion networks, while often neglecting the necessary adjustments and optimisations of hyperparameters within these loss functions. However, the selection of hyperparameters for the loss function of the fusion network is crucial because it determines the iterative direction of the network and significantly influences the final results. This study proposes an image fusion network with hyperparameter adaptive optimisation adjustment, termed whale optimisation algorithm-edge invertible CMamba fusion (W-EICMFusion). First, we introduce a CMamba module that effectively extracts common information features within the local and broad receptive fields. We designed an edge-extraction invertible neural network (EE-INN) module that captures edge detail information from two modalities, and a fusion layer known as a residual dense efficient channel attention network (RDENet) to enhance the extraction of complementary information. Unlike other fusion networks that depend on manual parameter tuning, this study employs the whale optimisation algorithm (WOA) to determine the optimal hyperparameters adaptively. Our experiments compared our designed fusion network with 11 recently developed advanced methods related to it. The proposed method demonstrates the best overall performance and achieves the highest comprehensive score in comparative experiments conducted on the MSRS, LLVIP, Road-Scene, and M3FD datasets. Furthermore, it attained the highest detection accuracy in subsequent tasks, such as object detection. In the final design of the optimisation algorithm generalisation experiments, the proposed approach to hyperparameter adaptive optimisation produced superior classification outcomes. The code can be found at <span><span>https://github.com/ljcuestc/W-EICMFusion.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"172 ","pages":"Article 112531"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003132032501194X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Currently, most innovations in image fusion methods focus on designing network architectures for source image feature extraction and formulating loss functions for fusion networks, while often neglecting the necessary adjustments and optimisations of hyperparameters within these loss functions. However, the selection of hyperparameters for the loss function of the fusion network is crucial because it determines the iterative direction of the network and significantly influences the final results. This study proposes an image fusion network with hyperparameter adaptive optimisation adjustment, termed whale optimisation algorithm-edge invertible CMamba fusion (W-EICMFusion). First, we introduce a CMamba module that effectively extracts common information features within the local and broad receptive fields. We designed an edge-extraction invertible neural network (EE-INN) module that captures edge detail information from two modalities, and a fusion layer known as a residual dense efficient channel attention network (RDENet) to enhance the extraction of complementary information. Unlike other fusion networks that depend on manual parameter tuning, this study employs the whale optimisation algorithm (WOA) to determine the optimal hyperparameters adaptively. Our experiments compared our designed fusion network with 11 recently developed advanced methods related to it. The proposed method demonstrates the best overall performance and achieves the highest comprehensive score in comparative experiments conducted on the MSRS, LLVIP, Road-Scene, and M3FD datasets. Furthermore, it attained the highest detection accuracy in subsequent tasks, such as object detection. In the final design of the optimisation algorithm generalisation experiments, the proposed approach to hyperparameter adaptive optimisation produced superior classification outcomes. The code can be found at https://github.com/ljcuestc/W-EICMFusion.git.
W-EICMFusion:利用WOA超参数优化的红外和可见光图像融合网络
目前,大多数图像融合方法的创新都集中在设计用于源图像特征提取的网络架构和制定融合网络的损失函数上,而往往忽略了这些损失函数中超参数的必要调整和优化。然而,融合网络损失函数的超参数选择至关重要,因为它决定了网络的迭代方向,并对最终结果产生重大影响。本研究提出了一种具有超参数自适应优化调整的图像融合网络,称为鲸鱼优化算法-边缘可逆CMamba融合(W-EICMFusion)。首先,我们引入了一个CMamba模块,该模块可以有效地提取局部和广泛接受域中的共同信息特征。我们设计了一个边缘提取可逆神经网络(EE-INN)模块,从两个模态中捕获边缘细节信息,并设计了一个被称为残差密集有效通道注意网络(RDENet)的融合层来增强互补信息的提取。与其他依赖手动参数调整的融合网络不同,本研究采用鲸鱼优化算法(WOA)自适应确定最优超参数。我们的实验将我们设计的融合网络与最近开发的11种与之相关的先进方法进行了比较。在MSRS、LLVIP、Road-Scene和M3FD数据集上进行的对比实验中,该方法综合性能最好,综合得分最高。此外,在后续的任务中,如目标检测,它达到了最高的检测精度。在优化算法推广实验的最终设计中,提出的超参数自适应优化方法产生了较好的分类结果。代码可以在https://github.com/ljcuestc/W-EICMFusion.git上找到。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信