EEF: Energy score-guided feature enhancement fusion method for RGB and thermal infrared images object detection

IF 3.6 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Tianhao Hao , Jinfu Yang , Shaochen Zhang , Shuwen Wu
{"title":"EEF: Energy score-guided feature enhancement fusion method for RGB and thermal infrared images object detection","authors":"Tianhao Hao ,&nbsp;Jinfu Yang ,&nbsp;Shaochen Zhang ,&nbsp;Shuwen Wu","doi":"10.1016/j.sigpro.2025.110231","DOIUrl":null,"url":null,"abstract":"<div><div>The full exploitation of the complementarity between different modalities is crucial for RGB and Thermal infrared images (RGB-T) object detection. However, most existing methods utilizing a traditional backbone to extract features often struggle to enhance the discriminability of features from different modalities, thereby restricting the representational capacity of fused features. We propose an energy score-guided feature enhancement fusion method (EEF) for RGB-T object detection. Firstly, we design an energy-based feature enhancement module (EFEM) that leverages the proposed channel energy score to assess the importance and reliability of feature channels to enhance the discriminability of features and make them more focused on the region of the object. Then, we introduce an Efficient Cross-modal Fusion Module (ECFM) to capture complementary information between modalities by utilizing the global feature interaction capability of attention mechanisms. Finally, we incorporate an adaptive feedback module (AFM), which utilizes the fused features as guidance information to obtain the corresponding learning weights for different modalities to enhance the representational capacity of original features. We thoroughly evaluate our approach on the LLVIP and FLIR datasets, achieving preferable results of 64.9% and 41.1% mAP. The promising results adequately demonstrate the effectiveness of EEF in RGB-T object detection tasks.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"239 ","pages":"Article 110231"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425003457","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The full exploitation of the complementarity between different modalities is crucial for RGB and Thermal infrared images (RGB-T) object detection. However, most existing methods utilizing a traditional backbone to extract features often struggle to enhance the discriminability of features from different modalities, thereby restricting the representational capacity of fused features. We propose an energy score-guided feature enhancement fusion method (EEF) for RGB-T object detection. Firstly, we design an energy-based feature enhancement module (EFEM) that leverages the proposed channel energy score to assess the importance and reliability of feature channels to enhance the discriminability of features and make them more focused on the region of the object. Then, we introduce an Efficient Cross-modal Fusion Module (ECFM) to capture complementary information between modalities by utilizing the global feature interaction capability of attention mechanisms. Finally, we incorporate an adaptive feedback module (AFM), which utilizes the fused features as guidance information to obtain the corresponding learning weights for different modalities to enhance the representational capacity of original features. We thoroughly evaluate our approach on the LLVIP and FLIR datasets, achieving preferable results of 64.9% and 41.1% mAP. The promising results adequately demonstrate the effectiveness of EEF in RGB-T object detection tasks.
EEF:能量分值引导下的RGB和热红外图像目标检测特征增强融合方法
充分利用不同模态之间的互补性对RGB和热红外图像(RGB- t)目标检测至关重要。然而,大多数利用传统主干提取特征的方法往往难以提高不同模态特征的可分辨性,从而限制了融合特征的表示能力。提出了一种能量分数导向的RGB-T目标检测特征增强融合方法(EEF)。首先,我们设计了一个基于能量的特征增强模块(EFEM),利用提出的通道能量分数来评估特征通道的重要性和可靠性,以增强特征的可分辨性,使其更加集中在目标区域;然后,我们引入了一个高效跨模态融合模块(ECFM),利用注意机制的全局特征交互能力捕获模态之间的互补信息。最后,引入自适应反馈模块(AFM),利用融合后的特征作为引导信息获取不同模态的学习权值,增强原始特征的表征能力。我们在LLVIP和FLIR数据集上对我们的方法进行了全面的评估,获得了64.9%和41.1% mAP的较好结果。这些令人鼓舞的结果充分证明了EEF在RGB-T目标检测任务中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Signal Processing
Signal Processing 工程技术-工程:电子与电气
CiteScore
9.20
自引率
9.10%
发文量
309
审稿时长
41 days
期刊介绍: Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing. Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信