{"title":"Domain Adaptation With Contrastive Learning for Object Detection in Satellite Imagery","authors":"Debojyoti Biswas;Jelena Tešić","doi":"10.1109/TGRS.2024.3391621","DOIUrl":null,"url":null,"abstract":"State-of-the-art (SOTA) object detection methods applied to satellite and drone imagery largely fail to identify cross-domain small and dense objects. The high content variability in the overhead imagery is due to different sensors, terrestrial regions, lighting conditions, and the image acquisition time of the day. Moreover, the number and size of objects in aerial imagery are very different than in the consumer data. We propose a small object detection pipeline that improves the feature extraction process by spatial pyramid pooling, cross-stage partial networks, and heatmap-based region proposal networks (RPNs). Next, we propose the instance-aware image difficulty score (DS) that adapts the overall focal loss to improve object localization and identification. Finally, we add the two progressive domain adaptation (DA) blocks using contrastive learning in the pipeline. The blocks align the local and global features extracted from the customized CSP Darknet backbone, as different levels of feature alignment alleviate the degradation of object identification in previously unseen datasets. We create a first-ever DA benchmark using contrastive learning for the object detection task in highly imbalanced satellite datasets with significant domain gaps and dominant small objects from existing satellite benchmarks—the proposed method results in up to a 7.4% and 4.6% increase in mean average precision (mAP) over the best SOTA method for the DOTA and NWPU-VHR10 datasets, respectively.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"62 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10505321","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10505321/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
State-of-the-art (SOTA) object detection methods applied to satellite and drone imagery largely fail to identify cross-domain small and dense objects. The high content variability in the overhead imagery is due to different sensors, terrestrial regions, lighting conditions, and the image acquisition time of the day. Moreover, the number and size of objects in aerial imagery are very different than in the consumer data. We propose a small object detection pipeline that improves the feature extraction process by spatial pyramid pooling, cross-stage partial networks, and heatmap-based region proposal networks (RPNs). Next, we propose the instance-aware image difficulty score (DS) that adapts the overall focal loss to improve object localization and identification. Finally, we add the two progressive domain adaptation (DA) blocks using contrastive learning in the pipeline. The blocks align the local and global features extracted from the customized CSP Darknet backbone, as different levels of feature alignment alleviate the degradation of object identification in previously unseen datasets. We create a first-ever DA benchmark using contrastive learning for the object detection task in highly imbalanced satellite datasets with significant domain gaps and dominant small objects from existing satellite benchmarks—the proposed method results in up to a 7.4% and 4.6% increase in mean average precision (mAP) over the best SOTA method for the DOTA and NWPU-VHR10 datasets, respectively.
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.