{"title":"MSCA-Net: Multi-scale context aggregation network for infrared small target detection","authors":"Xiaojin Lu, Taoran Yue, Jiaxi Cai, Yuanping Chen, Cuihong Lv, Shibing Chu","doi":"10.1016/j.optlastec.2025.113894","DOIUrl":null,"url":null,"abstract":"<div><div>In complex environments, detecting tiny infrared targets has always been challenging because of the low contrast and high noise levels inherent in infrared images. These factors often lead to the loss of crucial details during feature extraction. Moreover, existing detection methods have limitations in adequately integrating global and local information, which constrains the efficiency and accuracy of infrared small target detection. To address these challenges, we propose the MSCA-Net network architecture, which enhances feature preservation and global-local fusion through three synergistic modules: the Multi-Scale Enhanced Dilated Attention Module (MSEDA), the Positional Convolutional Block Attention Module (PCBAM), and the Channel Aggregation Module (CAB). Specifically, the MSEDA captures contextual information across different receptive fields, preserving fine-grained features. The PCBAM improves spatial understanding by modeling position-aware dependencies, while the CAB adaptively aggregates multi-level features across channels to highlight key information. Through the synergistic effect of these modules, MSCA-Net effectively retains key discriminative features and achieves robust detection performance in complex infrared scenes. The experimental results demonstrate that MSCA-Net achieves strong small target detection performance in complex backgrounds. Specifically, it attains mIoU scores of 78.43 %, 94.56 %, and 67.08 % on the NUAA-SIRST, NUDT-SIRST, and IRTSD-1K datasets, respectively, underscoring its effectiveness and suggesting potential applicability in real-world scenarios.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113894"},"PeriodicalIF":5.0000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030399225014859","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
In complex environments, detecting tiny infrared targets has always been challenging because of the low contrast and high noise levels inherent in infrared images. These factors often lead to the loss of crucial details during feature extraction. Moreover, existing detection methods have limitations in adequately integrating global and local information, which constrains the efficiency and accuracy of infrared small target detection. To address these challenges, we propose the MSCA-Net network architecture, which enhances feature preservation and global-local fusion through three synergistic modules: the Multi-Scale Enhanced Dilated Attention Module (MSEDA), the Positional Convolutional Block Attention Module (PCBAM), and the Channel Aggregation Module (CAB). Specifically, the MSEDA captures contextual information across different receptive fields, preserving fine-grained features. The PCBAM improves spatial understanding by modeling position-aware dependencies, while the CAB adaptively aggregates multi-level features across channels to highlight key information. Through the synergistic effect of these modules, MSCA-Net effectively retains key discriminative features and achieves robust detection performance in complex infrared scenes. The experimental results demonstrate that MSCA-Net achieves strong small target detection performance in complex backgrounds. Specifically, it attains mIoU scores of 78.43 %, 94.56 %, and 67.08 % on the NUAA-SIRST, NUDT-SIRST, and IRTSD-1K datasets, respectively, underscoring its effectiveness and suggesting potential applicability in real-world scenarios.
期刊介绍:
Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication.
The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas:
•development in all types of lasers
•developments in optoelectronic devices and photonics
•developments in new photonics and optical concepts
•developments in conventional optics, optical instruments and components
•techniques of optical metrology, including interferometry and optical fibre sensors
•LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow
•applications of lasers to materials processing, optical NDT display (including holography) and optical communication
•research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume)
•developments in optical computing and optical information processing
•developments in new optical materials
•developments in new optical characterization methods and techniques
•developments in quantum optics
•developments in light assisted micro and nanofabrication methods and techniques
•developments in nanophotonics and biophotonics
•developments in imaging processing and systems