Biaohua Liu;Qian Jiang;Puming Wang;Shaowen Yao;Wei Zhou;Xin Jin
{"title":"IRMSD-YOLO:具有倒残差的红外小目标检测多尺度扩张网络","authors":"Biaohua Liu;Qian Jiang;Puming Wang;Shaowen Yao;Wei Zhou;Xin Jin","doi":"10.1109/JSEN.2025.3546966","DOIUrl":null,"url":null,"abstract":"Infrared small target detection (IRSTD) has seen growing applications in practical scenarios. In recent years, deep learning technologies have achieved significant progress in this field. However, existing algorithms fail to fully account for the distinct characteristics of infrared small targets and inadequately utilize features at different depths. Additionally, challenges stem from the weak distinction between the targets and the background, as well as the interference of complex noise. To address these issues, we designed an inverted residual multiscale dilation (IRMSD-YOLO) network. During the feature extraction stage, we designed an inverted residual feature extraction (IRFE) module to more effectively capture the characteristic information of infrared small targets under complex noise conditions. In the feature fusion stage, we conducted a detailed analysis of the importance of features from different layers. By fusing features from various layers at different frequencies, we effectively achieved frequency-weighted feature fusion (FWFF), balancing target localization information and noise suppression. Moreover, to overcome the interference caused by high-brightness noise in the background and the limitations of local contrast enhancement methods, we developed an inverted residual multiscale dilated (IRMSD) attention module. This module employs efficient feature transformations and multiscale dilated attention (MSDA) to reduce the impact of background noise, expand the receptive field, and improve the perception of small targets, thereby addressing the shortcomings of local contrast methods in complex backgrounds. Finally, recognizing the scarcity of infrared small target datasets, we constructed a real-world infrared small target dataset for complex scene (RISTCS). Through extensive experiments on multiple datasets, the results demonstrate that our proposed method exhibits advanced performance.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 9","pages":"16006-16019"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IRMSD-YOLO: Multiscale Dilated Network With Inverted Residuals for Infrared Small Target Detection\",\"authors\":\"Biaohua Liu;Qian Jiang;Puming Wang;Shaowen Yao;Wei Zhou;Xin Jin\",\"doi\":\"10.1109/JSEN.2025.3546966\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Infrared small target detection (IRSTD) has seen growing applications in practical scenarios. In recent years, deep learning technologies have achieved significant progress in this field. However, existing algorithms fail to fully account for the distinct characteristics of infrared small targets and inadequately utilize features at different depths. Additionally, challenges stem from the weak distinction between the targets and the background, as well as the interference of complex noise. To address these issues, we designed an inverted residual multiscale dilation (IRMSD-YOLO) network. During the feature extraction stage, we designed an inverted residual feature extraction (IRFE) module to more effectively capture the characteristic information of infrared small targets under complex noise conditions. In the feature fusion stage, we conducted a detailed analysis of the importance of features from different layers. By fusing features from various layers at different frequencies, we effectively achieved frequency-weighted feature fusion (FWFF), balancing target localization information and noise suppression. Moreover, to overcome the interference caused by high-brightness noise in the background and the limitations of local contrast enhancement methods, we developed an inverted residual multiscale dilated (IRMSD) attention module. This module employs efficient feature transformations and multiscale dilated attention (MSDA) to reduce the impact of background noise, expand the receptive field, and improve the perception of small targets, thereby addressing the shortcomings of local contrast methods in complex backgrounds. Finally, recognizing the scarcity of infrared small target datasets, we constructed a real-world infrared small target dataset for complex scene (RISTCS). Through extensive experiments on multiple datasets, the results demonstrate that our proposed method exhibits advanced performance.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 9\",\"pages\":\"16006-16019\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-03-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10918595/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10918595/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
IRMSD-YOLO: Multiscale Dilated Network With Inverted Residuals for Infrared Small Target Detection
Infrared small target detection (IRSTD) has seen growing applications in practical scenarios. In recent years, deep learning technologies have achieved significant progress in this field. However, existing algorithms fail to fully account for the distinct characteristics of infrared small targets and inadequately utilize features at different depths. Additionally, challenges stem from the weak distinction between the targets and the background, as well as the interference of complex noise. To address these issues, we designed an inverted residual multiscale dilation (IRMSD-YOLO) network. During the feature extraction stage, we designed an inverted residual feature extraction (IRFE) module to more effectively capture the characteristic information of infrared small targets under complex noise conditions. In the feature fusion stage, we conducted a detailed analysis of the importance of features from different layers. By fusing features from various layers at different frequencies, we effectively achieved frequency-weighted feature fusion (FWFF), balancing target localization information and noise suppression. Moreover, to overcome the interference caused by high-brightness noise in the background and the limitations of local contrast enhancement methods, we developed an inverted residual multiscale dilated (IRMSD) attention module. This module employs efficient feature transformations and multiscale dilated attention (MSDA) to reduce the impact of background noise, expand the receptive field, and improve the perception of small targets, thereby addressing the shortcomings of local contrast methods in complex backgrounds. Finally, recognizing the scarcity of infrared small target datasets, we constructed a real-world infrared small target dataset for complex scene (RISTCS). Through extensive experiments on multiple datasets, the results demonstrate that our proposed method exhibits advanced performance.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice