Few-Shot Object Detection in Remote Sensing: Mitigating Label Inconsistencies and Navigating Category Variations

IF 3.4 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Tiancheng Si;Shenyu Kong
{"title":"Few-Shot Object Detection in Remote Sensing: Mitigating Label Inconsistencies and Navigating Category Variations","authors":"Tiancheng Si;Shenyu Kong","doi":"10.1109/ACCESS.2025.3527881","DOIUrl":null,"url":null,"abstract":"Over recent years, the increasing expansion of remote sensing image (RSI) datasets has made annotation tasks more challenging and labor-intensive, drawing considerable attention toward few-shot object detection (FSOD). Nevertheless, current mainstream FSOD models are primarily designed for natural images and encounter two substantial challenges when applied to RSIs. 1) Inconsistent label assignment for novel instances between pre-training and fine-tuning confuses detectors, leading to diminished generalization performance. 2) Complex scenes within RSIs result in significant category variations, comprising high inter-class similarity and large intra-class variance, which impairs classification accuracy. Against the aforementioned challenges, we propose a novel FSOD approach in RSIs, termed EC-FSOD. Specifically, our approach introduces two key modules: Ensemble Class-free RPN (ECF-RPN) and Contrastive Prototype ETF Classifier (CPEC). The preceding module, ECF-RPN, generates proposals by integrating multiple dissimilar yet cooperative Class-free RPNs that perceive the shape and location of target objects, mitigating the confusion caused by label inconsistencies. Furthermore, the subsequent CPEC module combines two submodules, namely Contrastive Prototype Learning Network (CPLN) and Simplex ETF Classifier (SEC), to obtain a set of representative class prototypes and robust discriminative feature representations, which are employed to overcome the category variations and enhance the generalization performance of novel instances. Extensive experiments have revealed that our approach achieves top-2 results on the DIOR dataset and optimal performance on the NWPU VHR-10.v2 dataset.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"8169-8186"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10835074","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10835074/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Over recent years, the increasing expansion of remote sensing image (RSI) datasets has made annotation tasks more challenging and labor-intensive, drawing considerable attention toward few-shot object detection (FSOD). Nevertheless, current mainstream FSOD models are primarily designed for natural images and encounter two substantial challenges when applied to RSIs. 1) Inconsistent label assignment for novel instances between pre-training and fine-tuning confuses detectors, leading to diminished generalization performance. 2) Complex scenes within RSIs result in significant category variations, comprising high inter-class similarity and large intra-class variance, which impairs classification accuracy. Against the aforementioned challenges, we propose a novel FSOD approach in RSIs, termed EC-FSOD. Specifically, our approach introduces two key modules: Ensemble Class-free RPN (ECF-RPN) and Contrastive Prototype ETF Classifier (CPEC). The preceding module, ECF-RPN, generates proposals by integrating multiple dissimilar yet cooperative Class-free RPNs that perceive the shape and location of target objects, mitigating the confusion caused by label inconsistencies. Furthermore, the subsequent CPEC module combines two submodules, namely Contrastive Prototype Learning Network (CPLN) and Simplex ETF Classifier (SEC), to obtain a set of representative class prototypes and robust discriminative feature representations, which are employed to overcome the category variations and enhance the generalization performance of novel instances. Extensive experiments have revealed that our approach achieves top-2 results on the DIOR dataset and optimal performance on the NWPU VHR-10.v2 dataset.
遥感中的少镜头目标检测:减轻标签不一致和导航类别变化
近年来,遥感图像(RSI)数据集的不断扩展使得标注任务更具挑战性和劳动强度,引起了人们对少射目标检测(FSOD)的广泛关注。然而,目前主流的FSOD模型主要是为自然图像设计的,在应用于rsi时遇到了两个重大挑战。1)在预训练和微调之间对新实例不一致的标签分配会混淆检测器,导致泛化性能下降。2) rsi内复杂场景导致类别差异显著,类间相似度高,类内方差大,影响分类精度。针对上述挑战,我们提出了一种新的rsi FSOD方法,称为EC-FSOD。具体来说,我们的方法引入了两个关键模块:集成无类RPN (ECF-RPN)和对比原型ETF分类器(CPEC)。前面的模块ECF-RPN通过整合多个不同但具有协作性的无类rpn来生成提案,这些rpn可以感知目标物体的形状和位置,从而减轻标签不一致造成的混淆。随后的CPEC模块结合对比原型学习网络(CPLN)和单纯形ETF分类器(SEC)两个子模块,获得一组具有代表性的类原型和鲁棒判别特征表示,用于克服类别变化,提高新实例的泛化性能。大量的实验表明,我们的方法在DIOR数据集上取得了前2名的结果,在NWPU VHR-10上取得了最佳性能。v2数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Access
IEEE Access COMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍: IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest. IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on: Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals. Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering. Development of new or improved fabrication or manufacturing techniques. Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信