MSCA-Net: Multi-scale context aggregation network for infrared small target detection

IF 5 2区物理与天体物理 Q1 OPTICS

Optics and Laser Technology Pub Date : 2025-09-19 DOI:10.1016/j.optlastec.2025.113894

Xiaojin Lu, Taoran Yue, Jiaxi Cai, Yuanping Chen, Cuihong Lv, Shibing Chu

{"title":"MSCA-Net: Multi-scale context aggregation network for infrared small target detection","authors":"Xiaojin Lu, Taoran Yue, Jiaxi Cai, Yuanping Chen, Cuihong Lv, Shibing Chu","doi":"10.1016/j.optlastec.2025.113894","DOIUrl":null,"url":null,"abstract":"<div><div>In complex environments, detecting tiny infrared targets has always been challenging because of the low contrast and high noise levels inherent in infrared images. These factors often lead to the loss of crucial details during feature extraction. Moreover, existing detection methods have limitations in adequately integrating global and local information, which constrains the efficiency and accuracy of infrared small target detection. To address these challenges, we propose the MSCA-Net network architecture, which enhances feature preservation and global-local fusion through three synergistic modules: the Multi-Scale Enhanced Dilated Attention Module (MSEDA), the Positional Convolutional Block Attention Module (PCBAM), and the Channel Aggregation Module (CAB). Specifically, the MSEDA captures contextual information across different receptive fields, preserving fine-grained features. The PCBAM improves spatial understanding by modeling position-aware dependencies, while the CAB adaptively aggregates multi-level features across channels to highlight key information. Through the synergistic effect of these modules, MSCA-Net effectively retains key discriminative features and achieves robust detection performance in complex infrared scenes. The experimental results demonstrate that MSCA-Net achieves strong small target detection performance in complex backgrounds. Specifically, it attains mIoU scores of 78.43 %, 94.56 %, and 67.08 % on the NUAA-SIRST, NUDT-SIRST, and IRTSD-1K datasets, respectively, underscoring its effectiveness and suggesting potential applicability in real-world scenarios.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113894"},"PeriodicalIF":5.0000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030399225014859","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}

引用次数: 0

Abstract

In complex environments, detecting tiny infrared targets has always been challenging because of the low contrast and high noise levels inherent in infrared images. These factors often lead to the loss of crucial details during feature extraction. Moreover, existing detection methods have limitations in adequately integrating global and local information, which constrains the efficiency and accuracy of infrared small target detection. To address these challenges, we propose the MSCA-Net network architecture, which enhances feature preservation and global-local fusion through three synergistic modules: the Multi-Scale Enhanced Dilated Attention Module (MSEDA), the Positional Convolutional Block Attention Module (PCBAM), and the Channel Aggregation Module (CAB). Specifically, the MSEDA captures contextual information across different receptive fields, preserving fine-grained features. The PCBAM improves spatial understanding by modeling position-aware dependencies, while the CAB adaptively aggregates multi-level features across channels to highlight key information. Through the synergistic effect of these modules, MSCA-Net effectively retains key discriminative features and achieves robust detection performance in complex infrared scenes. The experimental results demonstrate that MSCA-Net achieves strong small target detection performance in complex backgrounds. Specifically, it attains mIoU scores of 78.43 %, 94.56 %, and 67.08 % on the NUAA-SIRST, NUDT-SIRST, and IRTSD-1K datasets, respectively, underscoring its effectiveness and suggesting potential applicability in real-world scenarios.

查看原文本刊更多论文

MSCA-Net：用于红外小目标检测的多尺度上下文聚合网络

由于红外图像固有的低对比度和高噪声水平，在复杂环境中检测微小红外目标一直是一个挑战。这些因素通常会导致特征提取过程中关键细节的丢失。此外，现有的检测方法在充分整合全局和局部信息方面存在局限性，制约了红外小目标检测的效率和精度。为了解决这些挑战，我们提出了MSCA-Net网络架构，该架构通过三个协同模块：多尺度增强扩张注意模块（MSEDA）、位置卷积块注意模块（PCBAM）和信道聚合模块（CAB）来增强特征保存和全局局部融合。具体来说，MSEDA捕获跨不同接受域的上下文信息，保留细粒度特征。PCBAM通过建模位置感知依赖关系来提高空间理解，而CAB自适应地聚合跨通道的多级特征以突出关键信息。通过这些模块的协同作用，MSCA-Net有效地保留了关键的判别特征，并在复杂的红外场景中实现了鲁棒的检测性能。实验结果表明，MSCA-Net在复杂背景下具有较强的小目标检测性能。具体而言，该方法在NUAA-SIRST、NUDT-SIRST和IRTSD-1K数据集上的mIoU得分分别为78.43%、94.56%和67.08%，表明了其有效性，并表明其在现实场景中的潜在适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Optics and Laser Technology 物理-光学

CiteScore

8.50

自引率

10.00%

发文量

1060

审稿时长

3.4 months

期刊介绍： Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication. The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas: •development in all types of lasers •developments in optoelectronic devices and photonics •developments in new photonics and optical concepts •developments in conventional optics, optical instruments and components •techniques of optical metrology, including interferometry and optical fibre sensors •LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow •applications of lasers to materials processing, optical NDT display (including holography) and optical communication •research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume) •developments in optical computing and optical information processing •developments in new optical materials •developments in new optical characterization methods and techniques •developments in quantum optics •developments in light assisted micro and nanofabrication methods and techniques •developments in nanophotonics and biophotonics •developments in imaging processing and systems