MSCA-Net: Multi-scale context aggregation network for infrared small target detection

IF 5 2区 物理与天体物理 Q1 OPTICS
Xiaojin Lu, Taoran Yue, Jiaxi Cai, Yuanping Chen, Cuihong Lv, Shibing Chu
{"title":"MSCA-Net: Multi-scale context aggregation network for infrared small target detection","authors":"Xiaojin Lu,&nbsp;Taoran Yue,&nbsp;Jiaxi Cai,&nbsp;Yuanping Chen,&nbsp;Cuihong Lv,&nbsp;Shibing Chu","doi":"10.1016/j.optlastec.2025.113894","DOIUrl":null,"url":null,"abstract":"<div><div>In complex environments, detecting tiny infrared targets has always been challenging because of the low contrast and high noise levels inherent in infrared images. These factors often lead to the loss of crucial details during feature extraction. Moreover, existing detection methods have limitations in adequately integrating global and local information, which constrains the efficiency and accuracy of infrared small target detection. To address these challenges, we propose the MSCA-Net network architecture, which enhances feature preservation and global-local fusion through three synergistic modules: the Multi-Scale Enhanced Dilated Attention Module (MSEDA), the Positional Convolutional Block Attention Module (PCBAM), and the Channel Aggregation Module (CAB). Specifically, the MSEDA captures contextual information across different receptive fields, preserving fine-grained features. The PCBAM improves spatial understanding by modeling position-aware dependencies, while the CAB adaptively aggregates multi-level features across channels to highlight key information. Through the synergistic effect of these modules, MSCA-Net effectively retains key discriminative features and achieves robust detection performance in complex infrared scenes. The experimental results demonstrate that MSCA-Net achieves strong small target detection performance in complex backgrounds. Specifically, it attains mIoU scores of 78.43 %, 94.56 %, and 67.08 % on the NUAA-SIRST, NUDT-SIRST, and IRTSD-1K datasets, respectively, underscoring its effectiveness and suggesting potential applicability in real-world scenarios.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113894"},"PeriodicalIF":5.0000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030399225014859","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

Abstract

In complex environments, detecting tiny infrared targets has always been challenging because of the low contrast and high noise levels inherent in infrared images. These factors often lead to the loss of crucial details during feature extraction. Moreover, existing detection methods have limitations in adequately integrating global and local information, which constrains the efficiency and accuracy of infrared small target detection. To address these challenges, we propose the MSCA-Net network architecture, which enhances feature preservation and global-local fusion through three synergistic modules: the Multi-Scale Enhanced Dilated Attention Module (MSEDA), the Positional Convolutional Block Attention Module (PCBAM), and the Channel Aggregation Module (CAB). Specifically, the MSEDA captures contextual information across different receptive fields, preserving fine-grained features. The PCBAM improves spatial understanding by modeling position-aware dependencies, while the CAB adaptively aggregates multi-level features across channels to highlight key information. Through the synergistic effect of these modules, MSCA-Net effectively retains key discriminative features and achieves robust detection performance in complex infrared scenes. The experimental results demonstrate that MSCA-Net achieves strong small target detection performance in complex backgrounds. Specifically, it attains mIoU scores of 78.43 %, 94.56 %, and 67.08 % on the NUAA-SIRST, NUDT-SIRST, and IRTSD-1K datasets, respectively, underscoring its effectiveness and suggesting potential applicability in real-world scenarios.
MSCA-Net:用于红外小目标检测的多尺度上下文聚合网络
由于红外图像固有的低对比度和高噪声水平,在复杂环境中检测微小红外目标一直是一个挑战。这些因素通常会导致特征提取过程中关键细节的丢失。此外,现有的检测方法在充分整合全局和局部信息方面存在局限性,制约了红外小目标检测的效率和精度。为了解决这些挑战,我们提出了MSCA-Net网络架构,该架构通过三个协同模块:多尺度增强扩张注意模块(MSEDA)、位置卷积块注意模块(PCBAM)和信道聚合模块(CAB)来增强特征保存和全局局部融合。具体来说,MSEDA捕获跨不同接受域的上下文信息,保留细粒度特征。PCBAM通过建模位置感知依赖关系来提高空间理解,而CAB自适应地聚合跨通道的多级特征以突出关键信息。通过这些模块的协同作用,MSCA-Net有效地保留了关键的判别特征,并在复杂的红外场景中实现了鲁棒的检测性能。实验结果表明,MSCA-Net在复杂背景下具有较强的小目标检测性能。具体而言,该方法在NUAA-SIRST、NUDT-SIRST和IRTSD-1K数据集上的mIoU得分分别为78.43%、94.56%和67.08%,表明了其有效性,并表明其在现实场景中的潜在适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.50
自引率
10.00%
发文量
1060
审稿时长
3.4 months
期刊介绍: Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication. The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas: •development in all types of lasers •developments in optoelectronic devices and photonics •developments in new photonics and optical concepts •developments in conventional optics, optical instruments and components •techniques of optical metrology, including interferometry and optical fibre sensors •LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow •applications of lasers to materials processing, optical NDT display (including holography) and optical communication •research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume) •developments in optical computing and optical information processing •developments in new optical materials •developments in new optical characterization methods and techniques •developments in quantum optics •developments in light assisted micro and nanofabrication methods and techniques •developments in nanophotonics and biophotonics •developments in imaging processing and systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信