Xin Wang , Guangmei Xu , Chen Hong , Ning He , Runjie Li , Fengxi Sun , Wenjing Han
{"title":"A real-time detector for small-object remote sensing","authors":"Xin Wang , Guangmei Xu , Chen Hong , Ning He , Runjie Li , Fengxi Sun , Wenjing Han","doi":"10.1016/j.compeleceng.2025.110413","DOIUrl":null,"url":null,"abstract":"<div><div>In the field of object detection, small-object detection has always been a difficult task. Remote sensing images have complex backgrounds and small objects can be densely distributed. Moreover, remote sensing detection must meet real-time requirements. To address these challenges, this paper proposes a detector called NanoDet-Drone for the real-time detection of small objects in remote sensing scenes. The baseline model lacks a sufficient receptive field to capture both local and long-distance information, and cannot achieve satisfactory detection results when directly applied to remote sensing detection. Our project improves the baseline network. First, the receptive field module is proposed, which uses dilated convolution at different dilation rates to expand the model’s receptive field while fully exploiting the contextual information of the small objects, incorporating the coordinate attention mechanism to highlight the features of small objects. Then, the adaptive fusion feature pyramid network (AF-FPN) is proposed to reasonably fuse the features of different branches; this efficiently uses multi-scale features and provides the network with more detailed information about small objects. Finally, the improved training auxiliary module, called the assign guidance module, is used to guide the detection head training and help the network learn richer feature representations to improve the accuracy and robustness of the model. In this study, we conducted extensive experiments on two challenging remote sensing datasets, VisDrone and AI-TOD, to demonstrate the effectiveness and robustness of NanoDet-Drone. Results show that NanoDet-Drone is capable of running at 56.8 frames per second on a CPU, outperforming other advanced detectors (YOLOv9-T and YOLOv10-N) at the same scale. Our model achieves a better trade-off between accuracy and inference speed. The proposed AF-FPN can be easily embedded into a one-stage detector, which effectively improves detection performance while significantly reducing the number of model parameters and computations. Compared with the baseline, NanoDet-Drone increased average precision (AP) and <span><math><mrow><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>0</mn><mo>.</mo><mn>5</mn></mrow></msub></mrow></math></span> by 5.2% and 8.6%, respectively, on VisDrone, and increased AP and <span><math><mrow><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>0</mn><mo>.</mo><mn>5</mn></mrow></msub></mrow></math></span> by 4.8% and 10.9%, respectively, on AI-TOD.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"125 ","pages":"Article 110413"},"PeriodicalIF":4.0000,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625003568","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of object detection, small-object detection has always been a difficult task. Remote sensing images have complex backgrounds and small objects can be densely distributed. Moreover, remote sensing detection must meet real-time requirements. To address these challenges, this paper proposes a detector called NanoDet-Drone for the real-time detection of small objects in remote sensing scenes. The baseline model lacks a sufficient receptive field to capture both local and long-distance information, and cannot achieve satisfactory detection results when directly applied to remote sensing detection. Our project improves the baseline network. First, the receptive field module is proposed, which uses dilated convolution at different dilation rates to expand the model’s receptive field while fully exploiting the contextual information of the small objects, incorporating the coordinate attention mechanism to highlight the features of small objects. Then, the adaptive fusion feature pyramid network (AF-FPN) is proposed to reasonably fuse the features of different branches; this efficiently uses multi-scale features and provides the network with more detailed information about small objects. Finally, the improved training auxiliary module, called the assign guidance module, is used to guide the detection head training and help the network learn richer feature representations to improve the accuracy and robustness of the model. In this study, we conducted extensive experiments on two challenging remote sensing datasets, VisDrone and AI-TOD, to demonstrate the effectiveness and robustness of NanoDet-Drone. Results show that NanoDet-Drone is capable of running at 56.8 frames per second on a CPU, outperforming other advanced detectors (YOLOv9-T and YOLOv10-N) at the same scale. Our model achieves a better trade-off between accuracy and inference speed. The proposed AF-FPN can be easily embedded into a one-stage detector, which effectively improves detection performance while significantly reducing the number of model parameters and computations. Compared with the baseline, NanoDet-Drone increased average precision (AP) and by 5.2% and 8.6%, respectively, on VisDrone, and increased AP and by 4.8% and 10.9%, respectively, on AI-TOD.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.