{"title":"Robust Augmentations for Small Object Detection of Aerial Images","authors":"Weiyu Xiong, Zhanyu Ma, Yi-Zhe Song","doi":"10.1109/IC-NIDC54101.2021.9660458","DOIUrl":null,"url":null,"abstract":"Object detection is one of the most fundamental but important computer vision tasks. However, small object detection remains an unsolved challenge due to insufficient detailed appearances and additional noises. Meanwhile, aerial images and intelligent transportation systems are under the restriction of difficulties such as dense object arrangement, a large number of small objects, and different perspectives, compared with natural images. To deal with these problems, an adversarial-like data augmentation training is proposed in this paper to narrow the data gap between remote sensing images and natural ones. The difficulty of the remote sensing object detection is verified and analyzed firstly by the classic single-stage anchor-based detector RetinaNet. Then, the multi-scale and data augmentations are introduced to alleviate the mismatch between general detector training and aerial images based on the anchor-free state-of-the-art (SOTA) model FCOS. Experiments on the remote sensing dataset, NWPU VHR10, demonstrate the quasi-antagonism data augmentation method improves the both anchor-based and anchor-free SOTA detectors for natural images with significant margins and shows the effectiveness on aerial images, especially small objects.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Object detection is one of the most fundamental but important computer vision tasks. However, small object detection remains an unsolved challenge due to insufficient detailed appearances and additional noises. Meanwhile, aerial images and intelligent transportation systems are under the restriction of difficulties such as dense object arrangement, a large number of small objects, and different perspectives, compared with natural images. To deal with these problems, an adversarial-like data augmentation training is proposed in this paper to narrow the data gap between remote sensing images and natural ones. The difficulty of the remote sensing object detection is verified and analyzed firstly by the classic single-stage anchor-based detector RetinaNet. Then, the multi-scale and data augmentations are introduced to alleviate the mismatch between general detector training and aerial images based on the anchor-free state-of-the-art (SOTA) model FCOS. Experiments on the remote sensing dataset, NWPU VHR10, demonstrate the quasi-antagonism data augmentation method improves the both anchor-based and anchor-free SOTA detectors for natural images with significant margins and shows the effectiveness on aerial images, especially small objects.