Treepedia 2.0: Applying Deep Learning for Large-Scale Quantification of Urban Tree Cover

2018 IEEE International Congress on Big Data (BigData Congress) Pub Date : 2018-07-01 DOI:10.1109/bigdatacongress.2018.00014

B. Cai, Xiaojiang Li, Ian Seiferling, C. Ratti

{"title":"Treepedia 2.0: Applying Deep Learning for Large-Scale Quantification of Urban Tree Cover","authors":"B. Cai, Xiaojiang Li, Ian Seiferling, C. Ratti","doi":"10.1109/bigdatacongress.2018.00014","DOIUrl":null,"url":null,"abstract":"Recent advances in deep learning have made it possible to quantify urban metrics at fine resolution, and over large extents using street-level images. Here, we focus on measuring urban tree cover using Google Street View (GSV) images. First, we provide a small-scale labelled validation dataset and propose standard metrics to compare the performance of automated estimations of street tree cover using GSV. We apply state-of-the-art deep learning models, and compare their performance to a previously established benchmark of an unsupervised method. Our training procedure for deep learning models is novel; we utilize the abundance of openly available and similarly labelled street-level image datasets to pre-train our model. We then perform additional training on a small training dataset consisting of GSV images. We find that deep learning models significantly outperform the unsupervised benchmark method. Our semantic segmentation model increased mean intersection-over-union (IoU) from 44.10% to 60.42% relative to the unsupervised method and our end-to-end model decreased Mean Absolute Error from 10.04% to 4.67%. We also employ a recently developed method called gradient-weighted class activation map (Grad-CAM) to interpret the features learned by the end-to-end model. This technique confirms that the end-to-end model has accurately learned to identify tree cover area as key features for predicting percentage tree cover. Our paper provides an example of applying advanced deep learning techniques on a large-scale, geo-tagged and image-based dataset to efficiently estimate important urban metrics. The results demonstrate that deep learning models are highly accurate, can be interpretable, and can also be efficient in terms of data-labelling effort and computational resources.","PeriodicalId":177250,"journal":{"name":"2018 IEEE International Congress on Big Data (BigData Congress)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Congress on Big Data (BigData Congress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/bigdatacongress.2018.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 34

Abstract

Recent advances in deep learning have made it possible to quantify urban metrics at fine resolution, and over large extents using street-level images. Here, we focus on measuring urban tree cover using Google Street View (GSV) images. First, we provide a small-scale labelled validation dataset and propose standard metrics to compare the performance of automated estimations of street tree cover using GSV. We apply state-of-the-art deep learning models, and compare their performance to a previously established benchmark of an unsupervised method. Our training procedure for deep learning models is novel; we utilize the abundance of openly available and similarly labelled street-level image datasets to pre-train our model. We then perform additional training on a small training dataset consisting of GSV images. We find that deep learning models significantly outperform the unsupervised benchmark method. Our semantic segmentation model increased mean intersection-over-union (IoU) from 44.10% to 60.42% relative to the unsupervised method and our end-to-end model decreased Mean Absolute Error from 10.04% to 4.67%. We also employ a recently developed method called gradient-weighted class activation map (Grad-CAM) to interpret the features learned by the end-to-end model. This technique confirms that the end-to-end model has accurately learned to identify tree cover area as key features for predicting percentage tree cover. Our paper provides an example of applying advanced deep learning techniques on a large-scale, geo-tagged and image-based dataset to efficiently estimate important urban metrics. The results demonstrate that deep learning models are highly accurate, can be interpretable, and can also be efficient in terms of data-labelling effort and computational resources.

查看原文本刊更多论文

Treepedia 2.0:应用深度学习进行城市树木覆盖的大规模量化

深度学习的最新进展使得以精细分辨率量化城市指标成为可能，并且在很大程度上使用街道级图像。在这里，我们的重点是使用谷歌街景(GSV)图像测量城市树木覆盖。首先，我们提供了一个小规模的标记验证数据集，并提出了标准指标来比较使用GSV自动估计街道树木覆盖的性能。我们应用最先进的深度学习模型，并将其性能与先前建立的无监督方法的基准进行比较。我们对深度学习模型的训练过程是新颖的;我们利用大量公开可用的和类似标记的街道级图像数据集来预训练我们的模型。然后，我们在一个由GSV图像组成的小训练数据集上进行额外的训练。我们发现深度学习模型明显优于无监督基准方法。相对于无监督方法，我们的语义分割模型将平均交叉超并度(IoU)从44.10%提高到60.42%，我们的端到端模型将平均绝对误差(mean Absolute Error)从10.04%降低到4.67%。我们还采用了最近开发的一种称为梯度加权类激活图(Grad-CAM)的方法来解释端到端模型学习到的特征。这项技术证实了端到端模型已经准确地学会了识别树木覆盖面积作为预测树木覆盖率百分比的关键特征。我们的论文提供了一个在大规模、地理标记和基于图像的数据集上应用先进深度学习技术的例子，以有效地估计重要的城市指标。结果表明，深度学习模型非常准确，可以解释，并且在数据标记工作和计算资源方面也很有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Congress on Big Data (BigData Congress)

自引率

0.00%

发文量