Comparing U-Net convolutional network with mask R-CNN in the performances of pomegranate tree canopy segmentation

Asia-Pacific Remote Sensing Pub Date : 2018-10-23 DOI:10.1117/12.2325570

Tiebiao Zhao, Yonghuan Yang, Haoyu Niu, Dong Wang, Y. Chen

{"title":"Comparing U-Net convolutional network with mask R-CNN in the performances of pomegranate tree canopy segmentation","authors":"Tiebiao Zhao, Yonghuan Yang, Haoyu Niu, Dong Wang, Y. Chen","doi":"10.1117/12.2325570","DOIUrl":null,"url":null,"abstract":"In the last decade, technologies of unmanned aerial vehicles (UAVs) and small imaging sensors have achieved a significant improvement in terms of equipment cost, operation cost and image quality. These low-cost platforms provide flexible access to high resolution visible and multispectral images. As a result, many studies have been conducted regarding the applications in precision agriculture, such as water stress detection, nutrient status detection, yield prediction, etc. Different from traditional satellite low-resolution images, high-resolution UAVbased images allow much more freedom in image post-processing. For example, the very first procedure in post-processing is pixel classification, or image segmentation for extracting region of interest(ROI). With the very high resolution, it becomes possible to classify pixels from a UAV-based image, yet it is still a challenge to conduct pixel classification using traditional remote sensing features such as vegetation indices (VIs), especially considering various changes during the growing season such as light intensity, crop size, crop color etc. Thanks to the development of deep learning technologies, it provides a general framework to solve this problem. In this study, we proposed to use deep learning methods to conduct image segmentation. We created our data set of pomegranate trees by flying an off-shelf commercial camera at 30 meters above the ground around noon, during the whole growing season from the beginning of April to the middle of October 2017. We then trained and tested two convolutional network based methods U-Net and Mask R-CNN using this data set. Finally, we compared their performances with our dataset aerial images of pomegranate trees. [Tiebiao- add a sentence to summarize the findings and their implications to precision agriculture]","PeriodicalId":370971,"journal":{"name":"Asia-Pacific Remote Sensing","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"60","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Remote Sensing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2325570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 60

Abstract

In the last decade, technologies of unmanned aerial vehicles (UAVs) and small imaging sensors have achieved a significant improvement in terms of equipment cost, operation cost and image quality. These low-cost platforms provide flexible access to high resolution visible and multispectral images. As a result, many studies have been conducted regarding the applications in precision agriculture, such as water stress detection, nutrient status detection, yield prediction, etc. Different from traditional satellite low-resolution images, high-resolution UAVbased images allow much more freedom in image post-processing. For example, the very first procedure in post-processing is pixel classification, or image segmentation for extracting region of interest(ROI). With the very high resolution, it becomes possible to classify pixels from a UAV-based image, yet it is still a challenge to conduct pixel classification using traditional remote sensing features such as vegetation indices (VIs), especially considering various changes during the growing season such as light intensity, crop size, crop color etc. Thanks to the development of deep learning technologies, it provides a general framework to solve this problem. In this study, we proposed to use deep learning methods to conduct image segmentation. We created our data set of pomegranate trees by flying an off-shelf commercial camera at 30 meters above the ground around noon, during the whole growing season from the beginning of April to the middle of October 2017. We then trained and tested two convolutional network based methods U-Net and Mask R-CNN using this data set. Finally, we compared their performances with our dataset aerial images of pomegranate trees. [Tiebiao- add a sentence to summarize the findings and their implications to precision agriculture]

查看原文本刊更多论文

比较U-Net卷积网络与掩模R-CNN在石石榴树冠分割中的性能

近十年来，无人机和小型成像传感器技术在设备成本、运行成本和图像质量等方面都取得了显著进步。这些低成本平台提供了灵活的高分辨率可见光和多光谱图像访问。因此，人们对其在精准农业中的应用进行了许多研究，如水分胁迫检测、营养状况检测、产量预测等。与传统的卫星低分辨率图像不同，基于无人机的高分辨率图像在图像后处理方面具有更大的自由度。例如，后处理的第一个步骤是像素分类，或提取感兴趣区域(ROI)的图像分割。随着分辨率的提高，从无人机图像中对像元进行分类成为可能，但利用植被指数(VIs)等传统遥感特征进行像元分类仍然是一个挑战，特别是考虑到生长季节的各种变化，如光照强度、作物大小、作物颜色等。由于深度学习技术的发展，它提供了一个通用的框架来解决这个问题。在本研究中，我们提出使用深度学习方法进行图像分割。我们在2017年4月初到10月中旬的整个生长季节，通过一台现成的商用相机在中午时分在离地面30米的高空飞行，创建了我们的石榴树数据集。然后，我们使用该数据集训练和测试了两种基于卷积网络的方法U-Net和Mask R-CNN。最后，我们将它们的性能与我们的数据集石榴树航拍图像进行了比较。[铁标-加一句话总结研究结果及其对精准农业的启示]

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Asia-Pacific Remote Sensing

自引率

0.00%

发文量