Boost UAV-Based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning

IF 7.5 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Fan Liu;Liang Yao;Chuanyi Zhang;Ting Wu;Xinlei Zhang;Xiruo Jiang;Jun Zhou
{"title":"Boost UAV-Based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning","authors":"Fan Liu;Liang Yao;Chuanyi Zhang;Ting Wu;Xinlei Zhang;Xiruo Jiang;Jun Zhou","doi":"10.1109/TGRS.2025.3564261","DOIUrl":null,"url":null,"abstract":"Detecting objects from uncrewed aerial vehicles (UAVs) are often hindered by a large number of small objects, resulting in low detection accuracy. To address this issue, mainstream approaches typically utilize multistage inferences. Despite their remarkable detecting accuracies, the real-time efficiency is sacrificed, making them less practical to handle real applications. To this end, we propose to improve the single-stage inference accuracy through learning scale-invariant features. Specifically, a scale-invariant feature disentangling (SIFD) module is designed to disentangle scale-related and scale-invariant features. Then, an adversarial feature learning (AFL) scheme is employed to enhance disentanglement. Finally, scale-invariant features are leveraged for robust UAV-based object detection (UAV-OD). Furthermore, we construct a multimodal UAV object detection dataset, State-Air, which incorporates annotated UAV state parameters. We apply our approach to three lightweight detection frameworks on two benchmark datasets. Extensive experiments demonstrate that our approach can effectively improve model accuracy and achieve state-of-the-art (SoTA) performance on three datasets. Our code and dataset are publicly available at: <uri>https://github.com/1e12Leon/SIFDAL</uri>","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-13"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10976665/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting objects from uncrewed aerial vehicles (UAVs) are often hindered by a large number of small objects, resulting in low detection accuracy. To address this issue, mainstream approaches typically utilize multistage inferences. Despite their remarkable detecting accuracies, the real-time efficiency is sacrificed, making them less practical to handle real applications. To this end, we propose to improve the single-stage inference accuracy through learning scale-invariant features. Specifically, a scale-invariant feature disentangling (SIFD) module is designed to disentangle scale-related and scale-invariant features. Then, an adversarial feature learning (AFL) scheme is employed to enhance disentanglement. Finally, scale-invariant features are leveraged for robust UAV-based object detection (UAV-OD). Furthermore, we construct a multimodal UAV object detection dataset, State-Air, which incorporates annotated UAV state parameters. We apply our approach to three lightweight detection frameworks on two benchmark datasets. Extensive experiments demonstrate that our approach can effectively improve model accuracy and achieve state-of-the-art (SoTA) performance on three datasets. Our code and dataset are publicly available at: https://github.com/1e12Leon/SIFDAL
通过尺度不变特征解纠缠和对抗学习增强基于无人机的目标检测
无人机对目标的探测常常受到大量小目标的阻碍,导致探测精度低。为了解决这个问题,主流方法通常使用多阶段推理。尽管它们的检测精度很高,但却牺牲了实时效率,使得它们在处理实际应用时不太实用。为此,我们提出通过学习尺度不变特征来提高单阶段推理的精度。具体来说,设计了一个尺度不变特征解缠(SIFD)模块来解缠尺度相关特征和尺度不变特征。然后,采用对抗特征学习(AFL)方案来增强解纠缠。最后,利用尺度不变特征实现鲁棒的基于无人机的目标检测(UAV-OD)。此外,我们构建了一个多模态无人机目标检测数据集state - air,该数据集包含了注释的无人机状态参数。我们将我们的方法应用于两个基准数据集上的三个轻量级检测框架。大量的实验表明,我们的方法可以有效地提高模型精度,并在三个数据集上实现最先进的(SoTA)性能。我们的代码和数据集可以在:https://github.com/1e12Leon/SIFDAL上公开获取
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信