IR-DETR: An efficient detection transformer with multi-layer feature fusion for infrared small targets

IF 3.1 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION
Xinbo Yue, Liwei Liu, Yue Du
{"title":"IR-DETR: An efficient detection transformer with multi-layer feature fusion for infrared small targets","authors":"Xinbo Yue,&nbsp;Liwei Liu,&nbsp;Yue Du","doi":"10.1016/j.infrared.2025.105926","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic target recognition is critical in infrared imaging guidance. Detecting small targets is challenging due to poorly defined silhouettes, low signal-to-noise ratio (SNR) and complex backgrounds. Traditional methods and deep learning approaches often struggle with SNR, target size, and insufficient feature extraction. To address these issues, we propose the IR-DETR network family based on DETR. Specifically, we design a multilayer efficient backbone (MEB) for efficient feature extraction of infrared small targets using a simple and efficient network structure. We then fuse local and global spatial features through the local window attention-based intrascale feature interaction (LWAIFI). Our proposed TripleRepC3 (TRC3) expands the model’s receptive field, which improves the detection accuracy of infrared targets while also reducing the model’s overall size. Additionally, we introduce the Adaptive Max-Sigmoid activation function to address the shortcomings of previous activation functions in small target detection. Finally, by incorporating the Normalized Wasserstein Distance (NWD) loss function, we further improve the detection performance of IR-DETR for small infrared targets. Compared to SOTA models on ATR dataset, NUDT-SIRST dataset and IRSTD-1k dataset, the IR-DETR network family achieved the best performance. The code will be released at <span><span>https://github.com/Eason215xB/IR-DETR</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"150 ","pages":"Article 105926"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525002191","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0

Abstract

Automatic target recognition is critical in infrared imaging guidance. Detecting small targets is challenging due to poorly defined silhouettes, low signal-to-noise ratio (SNR) and complex backgrounds. Traditional methods and deep learning approaches often struggle with SNR, target size, and insufficient feature extraction. To address these issues, we propose the IR-DETR network family based on DETR. Specifically, we design a multilayer efficient backbone (MEB) for efficient feature extraction of infrared small targets using a simple and efficient network structure. We then fuse local and global spatial features through the local window attention-based intrascale feature interaction (LWAIFI). Our proposed TripleRepC3 (TRC3) expands the model’s receptive field, which improves the detection accuracy of infrared targets while also reducing the model’s overall size. Additionally, we introduce the Adaptive Max-Sigmoid activation function to address the shortcomings of previous activation functions in small target detection. Finally, by incorporating the Normalized Wasserstein Distance (NWD) loss function, we further improve the detection performance of IR-DETR for small infrared targets. Compared to SOTA models on ATR dataset, NUDT-SIRST dataset and IRSTD-1k dataset, the IR-DETR network family achieved the best performance. The code will be released at https://github.com/Eason215xB/IR-DETR.
IR-DETR:一种高效的红外小目标多层特征融合检测变压器
自动目标识别是红外成像制导的关键。由于轮廓不清晰、信噪比低和背景复杂,小目标的检测具有挑战性。传统的方法和深度学习方法经常受到信噪比、目标大小和特征提取不足的困扰。为了解决这些问题,我们提出了基于DETR的IR-DETR网络家族。具体来说,我们设计了一种多层高效主干网(MEB),利用简单高效的网络结构实现红外小目标的高效特征提取。然后,我们通过基于局部窗口注意力的尺度内特征交互(LWAIFI)融合局部和全局空间特征。我们提出的TripleRepC3 (TRC3)扩展了模型的接受野,提高了红外目标的检测精度,同时减小了模型的整体尺寸。此外,我们引入了自适应Max-Sigmoid激活函数,以解决以往激活函数在小目标检测中的不足。最后,通过加入归一化Wasserstein距离(NWD)损失函数,进一步提高IR-DETR对红外小目标的检测性能。与ATR数据集、ndt - sirst数据集和IRSTD-1k数据集上的SOTA模型相比,IR-DETR网络家族取得了最好的性能。代码将在https://github.com/Eason215xB/IR-DETR上发布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.70
自引率
12.10%
发文量
400
审稿时长
67 days
期刊介绍: The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region. Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine. Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信