High-throughput 3D shape completion of potato tubers on a harvester

IF 8.9 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2024-11-28 DOI:10.1016/j.compag.2024.109673

Pieter M. Blok , Federico Magistri , Cyrill Stachniss , Haozhou Wang , James Burridge , Wei Guo

{"title":"High-throughput 3D shape completion of potato tubers on a harvester","authors":"Pieter M. Blok , Federico Magistri , Cyrill Stachniss , Haozhou Wang , James Burridge , Wei Guo","doi":"10.1016/j.compag.2024.109673","DOIUrl":null,"url":null,"abstract":"<div><div>Potato yield is an important metric for farmers to further optimize their cultivation practices. Potato yield can be estimated on a harvester using an RGB-D camera that can estimate the three-dimensional (3D) volume of individual potato tubers. A challenge, however, is that the 3D shape derived from RGB-D images is only partially completed, underestimating the actual volume. To address this issue, we developed a 3D shape completion network, called CoRe++, which can complete the 3D shape from RGB-D images. CoRe++ is a deep learning network that consists of a convolutional encoder and a decoder. The encoder compresses RGB-D images into latent vectors that are used by the decoder to complete the 3D shape using the deep signed distance field network (DeepSDF). To evaluate our CoRe++ network, we collected partial and complete 3D point clouds of 339 potato tubers on an operational harvester in Japan. On the 1425 RGB-D images in the test set (representing 51 unique potato tubers), our network achieved a completion accuracy of 2.8 mm on average. For volumetric estimation, the root mean squared error (RMSE) was 22.6 ml, and this was better than the RMSE of the linear regression (31.1 ml) and the base model (36.9 ml). We found that the RMSE can be further reduced to 18.2 ml when performing the 3D shape completion in the center of the RGB-D image. With an average 3D shape completion time of 10 ms per tuber, we can conclude that CoRe++ is both fast and accurate enough to be implemented on an operational harvester for high-throughput potato yield estimation. CoRe++’s high-throughput and accurate processing allows it to be applied to other tuber, fruit and vegetable crops, thereby enabling versatile, accurate and real-time yield monitoring in precision agriculture. Our code, network weights and dataset are publicly available at <span><span>https://github.com/UTokyo-FieldPhenomics-Lab/corepp.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"228 ","pages":"Article 109673"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924010640","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Potato yield is an important metric for farmers to further optimize their cultivation practices. Potato yield can be estimated on a harvester using an RGB-D camera that can estimate the three-dimensional (3D) volume of individual potato tubers. A challenge, however, is that the 3D shape derived from RGB-D images is only partially completed, underestimating the actual volume. To address this issue, we developed a 3D shape completion network, called CoRe++, which can complete the 3D shape from RGB-D images. CoRe++ is a deep learning network that consists of a convolutional encoder and a decoder. The encoder compresses RGB-D images into latent vectors that are used by the decoder to complete the 3D shape using the deep signed distance field network (DeepSDF). To evaluate our CoRe++ network, we collected partial and complete 3D point clouds of 339 potato tubers on an operational harvester in Japan. On the 1425 RGB-D images in the test set (representing 51 unique potato tubers), our network achieved a completion accuracy of 2.8 mm on average. For volumetric estimation, the root mean squared error (RMSE) was 22.6 ml, and this was better than the RMSE of the linear regression (31.1 ml) and the base model (36.9 ml). We found that the RMSE can be further reduced to 18.2 ml when performing the 3D shape completion in the center of the RGB-D image. With an average 3D shape completion time of 10 ms per tuber, we can conclude that CoRe++ is both fast and accurate enough to be implemented on an operational harvester for high-throughput potato yield estimation. CoRe++’s high-throughput and accurate processing allows it to be applied to other tuber, fruit and vegetable crops, thereby enabling versatile, accurate and real-time yield monitoring in precision agriculture. Our code, network weights and dataset are publicly available at https://github.com/UTokyo-FieldPhenomics-Lab/corepp.git.

查看原文本刊更多论文

马铃薯块茎在收割机上的高通量3D形状完成

马铃薯产量是农民进一步优化种植方式的重要指标。马铃薯产量可以在收割机上估计，使用RGB-D相机可以估计单个马铃薯块茎的三维（3D）体积。然而，一个挑战是，由RGB-D图像导出的3D形状仅部分完成，低估了实际体积。为了解决这个问题，我们开发了一个3D形状补全网络，称为CoRe++，它可以从RGB-D图像中完成3D形状。core++是一个深度学习网络，由一个卷积编码器和一个解码器组成。编码器将RGB-D图像压缩为潜在向量，解码器使用这些潜在向量使用深度签名距离场网络（DeepSDF）完成3D形状。为了评估我们的CoRe++网络，我们在日本的一台操作收割机上收集了339个马铃薯块茎的部分和完整的3D点云。在测试集中的1425张RGB-D图像（代表51个独特的马铃薯块茎）上，我们的网络实现了平均2.8 mm的补全精度。对于容量估计，均方根误差（RMSE）为22.6 ml，优于线性回归的RMSE （31.1 ml）和基础模型的RMSE （36.9 ml）。我们发现，在RGB-D图像的中心进行三维形状补全时，RMSE可以进一步降低到18.2 ml。每个块茎的平均3D形状完成时间为10毫秒，我们可以得出结论，CoRe++既快速又准确，足以在操作收割机上实现高通量马铃薯产量估计。core++的高通量和精确加工使其能够应用于其他块茎，水果和蔬菜作物，从而实现精准农业中多功能，准确和实时的产量监测。我们的代码、网络权重和数据集可以在https://github.com/UTokyo-FieldPhenomics-Lab/corepp.git上公开获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.