Yingmei Zhang;Wangtao Bao;Weiguo Wan;Qin Xiao;Yingjun Tang;Xueting Zou;Liu Huang;Kaichen Zhong;Yuhao Lan
{"title":"APTNet:用于红外小目标检测的自适应局部变压器网络","authors":"Yingmei Zhang;Wangtao Bao;Weiguo Wan;Qin Xiao;Yingjun Tang;Xueting Zou;Liu Huang;Kaichen Zhong;Yuhao Lan","doi":"10.1109/JSEN.2025.3559093","DOIUrl":null,"url":null,"abstract":"Infrared small target detection (IRSTD) plays a vital role in various applications such as surveillance and rescue missions. However, existing methods struggle to effectively integrate contextual information for small targets, with background noise and limited long-distance dependency learning further constraining detection performance. To address these issues, this article proposes a novel adaptive partial transformer network based on asymmetric U-Net architecture, namely APTNet, which enhances contextual information integration and accurately detects small objects in complex scenes, improving detection performance. Specifically, a double residual attention block (DRAB) is designed to enhance the contrast of dim targets within complex backgrounds through channel and spatial attention mechanisms, thereby improving discrimination capability for small targets. Furthermore, leveraging the context pooling connections and DRAB output features, an adaptive partial transformer block (APTB) is proposed. This block first proposes an adaptive channel splitting factor, achieving contextual information integration and long-distance dependency learning with low computational overhead. Extensive experiments on two public datasets demonstrate that the proposed method significantly outperforms the existing state-of-the-art IRSTD methods in terms of intersection over union (IoU), probability of detection (<inline-formula> <tex-math>${P}_{d}$ </tex-math></inline-formula>), and false alarm rate (<inline-formula> <tex-math>${F}_{a}$ </tex-math></inline-formula>). The implementation of our code can be accessed at <uri>https://github.com/Wangtao-Bao/APTNet</uri>.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 10","pages":"17960-17974"},"PeriodicalIF":4.3000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"APTNet: Adaptive Partial Transformer Network for Infrared Small Target Detection\",\"authors\":\"Yingmei Zhang;Wangtao Bao;Weiguo Wan;Qin Xiao;Yingjun Tang;Xueting Zou;Liu Huang;Kaichen Zhong;Yuhao Lan\",\"doi\":\"10.1109/JSEN.2025.3559093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Infrared small target detection (IRSTD) plays a vital role in various applications such as surveillance and rescue missions. However, existing methods struggle to effectively integrate contextual information for small targets, with background noise and limited long-distance dependency learning further constraining detection performance. To address these issues, this article proposes a novel adaptive partial transformer network based on asymmetric U-Net architecture, namely APTNet, which enhances contextual information integration and accurately detects small objects in complex scenes, improving detection performance. Specifically, a double residual attention block (DRAB) is designed to enhance the contrast of dim targets within complex backgrounds through channel and spatial attention mechanisms, thereby improving discrimination capability for small targets. Furthermore, leveraging the context pooling connections and DRAB output features, an adaptive partial transformer block (APTB) is proposed. This block first proposes an adaptive channel splitting factor, achieving contextual information integration and long-distance dependency learning with low computational overhead. Extensive experiments on two public datasets demonstrate that the proposed method significantly outperforms the existing state-of-the-art IRSTD methods in terms of intersection over union (IoU), probability of detection (<inline-formula> <tex-math>${P}_{d}$ </tex-math></inline-formula>), and false alarm rate (<inline-formula> <tex-math>${F}_{a}$ </tex-math></inline-formula>). The implementation of our code can be accessed at <uri>https://github.com/Wangtao-Bao/APTNet</uri>.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 10\",\"pages\":\"17960-17974\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10965861/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10965861/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
APTNet: Adaptive Partial Transformer Network for Infrared Small Target Detection
Infrared small target detection (IRSTD) plays a vital role in various applications such as surveillance and rescue missions. However, existing methods struggle to effectively integrate contextual information for small targets, with background noise and limited long-distance dependency learning further constraining detection performance. To address these issues, this article proposes a novel adaptive partial transformer network based on asymmetric U-Net architecture, namely APTNet, which enhances contextual information integration and accurately detects small objects in complex scenes, improving detection performance. Specifically, a double residual attention block (DRAB) is designed to enhance the contrast of dim targets within complex backgrounds through channel and spatial attention mechanisms, thereby improving discrimination capability for small targets. Furthermore, leveraging the context pooling connections and DRAB output features, an adaptive partial transformer block (APTB) is proposed. This block first proposes an adaptive channel splitting factor, achieving contextual information integration and long-distance dependency learning with low computational overhead. Extensive experiments on two public datasets demonstrate that the proposed method significantly outperforms the existing state-of-the-art IRSTD methods in terms of intersection over union (IoU), probability of detection (${P}_{d}$ ), and false alarm rate (${F}_{a}$ ). The implementation of our code can be accessed at https://github.com/Wangtao-Bao/APTNet.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice