Fan Liu;Liang Yao;Chuanyi Zhang;Ting Wu;Xinlei Zhang;Xiruo Jiang;Jun Zhou
{"title":"Boost UAV-Based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning","authors":"Fan Liu;Liang Yao;Chuanyi Zhang;Ting Wu;Xinlei Zhang;Xiruo Jiang;Jun Zhou","doi":"10.1109/TGRS.2025.3564261","DOIUrl":null,"url":null,"abstract":"Detecting objects from uncrewed aerial vehicles (UAVs) are often hindered by a large number of small objects, resulting in low detection accuracy. To address this issue, mainstream approaches typically utilize multistage inferences. Despite their remarkable detecting accuracies, the real-time efficiency is sacrificed, making them less practical to handle real applications. To this end, we propose to improve the single-stage inference accuracy through learning scale-invariant features. Specifically, a scale-invariant feature disentangling (SIFD) module is designed to disentangle scale-related and scale-invariant features. Then, an adversarial feature learning (AFL) scheme is employed to enhance disentanglement. Finally, scale-invariant features are leveraged for robust UAV-based object detection (UAV-OD). Furthermore, we construct a multimodal UAV object detection dataset, State-Air, which incorporates annotated UAV state parameters. We apply our approach to three lightweight detection frameworks on two benchmark datasets. Extensive experiments demonstrate that our approach can effectively improve model accuracy and achieve state-of-the-art (SoTA) performance on three datasets. Our code and dataset are publicly available at: <uri>https://github.com/1e12Leon/SIFDAL</uri>","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-13"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10976665/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting objects from uncrewed aerial vehicles (UAVs) are often hindered by a large number of small objects, resulting in low detection accuracy. To address this issue, mainstream approaches typically utilize multistage inferences. Despite their remarkable detecting accuracies, the real-time efficiency is sacrificed, making them less practical to handle real applications. To this end, we propose to improve the single-stage inference accuracy through learning scale-invariant features. Specifically, a scale-invariant feature disentangling (SIFD) module is designed to disentangle scale-related and scale-invariant features. Then, an adversarial feature learning (AFL) scheme is employed to enhance disentanglement. Finally, scale-invariant features are leveraged for robust UAV-based object detection (UAV-OD). Furthermore, we construct a multimodal UAV object detection dataset, State-Air, which incorporates annotated UAV state parameters. We apply our approach to three lightweight detection frameworks on two benchmark datasets. Extensive experiments demonstrate that our approach can effectively improve model accuracy and achieve state-of-the-art (SoTA) performance on three datasets. Our code and dataset are publicly available at: https://github.com/1e12Leon/SIFDAL
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.