Rui Wang, Joseph A. Camilo, L. Collins, Kyle Bradbury, Jordan M. Malof
{"title":"The poor generalization of deep convolutional networks to aerial imagery from new geographic locations: an empirical study with solar array detection","authors":"Rui Wang, Joseph A. Camilo, L. Collins, Kyle Bradbury, Jordan M. Malof","doi":"10.1109/AIPR.2017.8457965","DOIUrl":null,"url":null,"abstract":"Convolutional neural networks (CNNs) have recently achieved unprecedented performance for the automatic recognition of objects (e.g., buildings, roads, or vehicles) in color aerial imagery. Although these results are promising, questions remain about their practical applicability. This is because there is a wide variability in the visual characteristics of remote sensing imagery across different geographic locations, and CNNs are often trained and tested on imagery from nearby (or the same) geographic locations. It is therefore unclear whether trained CNNs will perform well on new, previously unseen, geographic locations, which is an important practical consideration. In this work we investigate this problem when applying CNNs for solar array detection on a large aerial imagery dataset comprised of two nearby US cities. We compare the performance of CNNs under two conditions: training and testing on the same city vs training on one city and testing on another city. We discuss several subtle difficulties with these experiments and make recommendations. We show that there can be substantial performance loss in second case, when compared to the first. We also investigate how much training data is required from the unseen city in order to fine-tune the CNN so that it performs well. 
We investigate several different fine-tuning strategies, yielding a clear winner.","PeriodicalId":128779,"journal":{"name":"2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2017.8457965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 2
Abstract
Convolutional neural networks (CNNs) have recently achieved unprecedented performance for the automatic recognition of objects (e.g., buildings, roads, or vehicles) in color aerial imagery. Although these results are promising, questions remain about their practical applicability. This is because there is wide variability in the visual characteristics of remote sensing imagery across different geographic locations, and CNNs are often trained and tested on imagery from nearby (or the same) geographic locations. It is therefore unclear whether trained CNNs will perform well on new, previously unseen geographic locations, which is an important practical consideration. In this work we investigate this problem when applying CNNs for solar array detection on a large aerial imagery dataset composed of two nearby US cities. We compare the performance of CNNs under two conditions: training and testing on the same city vs. training on one city and testing on another city. We discuss several subtle difficulties with these experiments and make recommendations. We show that there can be a substantial performance loss in the second case, compared to the first. We also investigate how much training data is required from the unseen city in order to fine-tune the CNN so that it performs well. We investigate several different fine-tuning strategies, yielding a clear winner.
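The cross-city evaluation and fine-tuning protocol the abstract describes can be sketched with a toy stand-in for the CNN. Everything below is an illustrative assumption, not the paper's actual setup: a two-feature logistic classifier replaces the CNN, the two "cities" are synthetic Gaussian clusters whose `shift` mimics city-specific appearance differences, and the sample sizes and learning rates are arbitrary. The sketch only shows the shape of the experiment: train on city A, observe the accuracy drop on unseen city B, then fine-tune on a small labeled sample from B.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_city(n, shift):
    # Two-class toy data; `shift` plays the role of a city-specific
    # change in visual statistics (illumination, rooftop styles, etc.).
    X0 = rng.normal(loc=0.0 + shift, scale=1.0, size=(n, 2))
    X1 = rng.normal(loc=2.0 + shift, scale=1.0, size=(n, 2))
    X = np.vstack([X0, X1])
    y = np.array([0] * n + [1] * n)
    return X, y

def train_logreg(X, y, w=None, lr=0.1, epochs=200):
    # Plain gradient descent on logistic loss; passing a starting `w`
    # corresponds to fine-tuning a pretrained model rather than
    # training from scratch.
    Xb = np.hstack([X, np.ones((len(X), 1))])  # append bias column
    if w is None:
        w = np.zeros(Xb.shape[1])
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-Xb @ w))
        w -= lr * Xb.T @ (p - y) / len(y)
    return w

def accuracy(w, X, y):
    Xb = np.hstack([X, np.ones((len(X), 1))])
    return float(np.mean(((Xb @ w) > 0) == y))

# "City A": the training city.  "City B": an unseen city whose
# feature distribution is shifted.
X_a, y_a = make_city(200, shift=0.0)
X_b, y_b = make_city(200, shift=3.0)

w = train_logreg(X_a, y_a)
acc_a = accuracy(w, X_a, y_a)   # same-city performance
acc_b = accuracy(w, X_b, y_b)   # cross-city performance: expect a drop

# Fine-tune on a small labeled sample from city B, starting from the
# city-A weights (one of several possible fine-tuning strategies).
idx = rng.choice(len(X_b), size=20, replace=False)
w_ft = train_logreg(X_b[idx], y_b[idx], w=w.copy(), lr=0.1, epochs=300)
acc_b_ft = accuracy(w_ft, X_b, y_b)

print(f"city A: {acc_a:.2f}  city B: {acc_b:.2f}  city B fine-tuned: {acc_b_ft:.2f}")
```

With the seed fixed, the same-city accuracy is high, the cross-city accuracy drops sharply, and fine-tuning on just 20 target-city samples recovers most of the loss, mirroring the qualitative pattern the paper reports for CNNs on real imagery.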