Unsupervised semantic label generation in agricultural fields.

IF 2.9 Q2 ROBOTICS

Frontiers in Robotics and AI Pub Date : 2025-02-25 eCollection Date: 2025-01-01 DOI:10.3389/frobt.2025.1548143

Gianmarco Roggiolani, Julius Rückin, Marija Popović, Jens Behley, Cyrill Stachniss

{"title":"Unsupervised semantic label generation in agricultural fields.","authors":"Gianmarco Roggiolani, Julius Rückin, Marija Popović, Jens Behley, Cyrill Stachniss","doi":"10.3389/frobt.2025.1548143","DOIUrl":null,"url":null,"abstract":"<p><p>Robust perception systems allow farm robots to recognize weeds and vegetation, enabling the selective application of fertilizers and herbicides to mitigate the environmental impact of traditional agricultural practices. Today's perception systems typically rely on deep learning to interpret sensor data for tasks such as distinguishing soil, crops, and weeds. These approaches usually require substantial amounts of manually labeled training data, which is often time-consuming and requires domain expertise. This paper aims to reduce this limitation and propose an automated labeling pipeline for crop-weed semantic image segmentation in managed agricultural fields. It allows the training of deep learning models without or with only limited manual labeling of images. Our system uses RGB images recorded with unmanned aerial or ground robots operating in the field to produce semantic labels exploiting the field row structure for spatially consistent labeling. We use the rows previously detected to identify multiple crop rows, reducing labeling errors and improving consistency. We further reduce labeling errors by assigning an \"unknown\" class to challenging-to-segment vegetation. We use evidential deep learning because it provides predictions uncertainty estimates that we use to refine and improve our predictions. In this way, the evidential deep learning assigns high uncertainty to the weed class, as it is often less represented in the training data, allowing us to use the uncertainty to correct the semantic predictions. Experimental results suggest that our approach outperforms general-purpose labeling methods applied to crop fields by a large margin and domain-specific approaches on multiple fields and crop species. Using our generated labels to train deep learning models boosts our prediction performance on previously unseen fields with respect to unseen crop species, growth stages, or different lighting conditions. We obtain an IoU of 88.6% on crops, and 22.7% on weeds for a managed field of sugarbeets, where fully supervised methods have 83.4% on crops and 33.5% on weeds and other unsupervised domain-specific methods get 54.6% on crops and 11.2% on weeds. Finally, our method allows fine-tuning models trained in a fully supervised fashion to improve their performance in unseen field conditions up to +17.6% in mean IoU without additional manual labeling.</p>","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"12 ","pages":"1548143"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11893429/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Robotics and AI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frobt.2025.1548143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Robust perception systems allow farm robots to recognize weeds and vegetation, enabling the selective application of fertilizers and herbicides to mitigate the environmental impact of traditional agricultural practices. Today's perception systems typically rely on deep learning to interpret sensor data for tasks such as distinguishing soil, crops, and weeds. These approaches usually require substantial amounts of manually labeled training data, which is often time-consuming and requires domain expertise. This paper aims to reduce this limitation and propose an automated labeling pipeline for crop-weed semantic image segmentation in managed agricultural fields. It allows the training of deep learning models without or with only limited manual labeling of images. Our system uses RGB images recorded with unmanned aerial or ground robots operating in the field to produce semantic labels exploiting the field row structure for spatially consistent labeling. We use the rows previously detected to identify multiple crop rows, reducing labeling errors and improving consistency. We further reduce labeling errors by assigning an "unknown" class to challenging-to-segment vegetation. We use evidential deep learning because it provides predictions uncertainty estimates that we use to refine and improve our predictions. In this way, the evidential deep learning assigns high uncertainty to the weed class, as it is often less represented in the training data, allowing us to use the uncertainty to correct the semantic predictions. Experimental results suggest that our approach outperforms general-purpose labeling methods applied to crop fields by a large margin and domain-specific approaches on multiple fields and crop species. Using our generated labels to train deep learning models boosts our prediction performance on previously unseen fields with respect to unseen crop species, growth stages, or different lighting conditions. We obtain an IoU of 88.6% on crops, and 22.7% on weeds for a managed field of sugarbeets, where fully supervised methods have 83.4% on crops and 33.5% on weeds and other unsupervised domain-specific methods get 54.6% on crops and 11.2% on weeds. Finally, our method allows fine-tuning models trained in a fully supervised fashion to improve their performance in unseen field conditions up to +17.6% in mean IoU without additional manual labeling.

查看原文本刊更多论文

农业领域的无监督语义标签生成。

强大的感知系统使农场机器人能够识别杂草和植被，从而能够选择性地使用肥料和除草剂，以减轻传统农业实践对环境的影响。今天的感知系统通常依靠深度学习来解释传感器数据，以完成诸如区分土壤、作物和杂草等任务。这些方法通常需要大量手工标记的训练数据，这通常是耗时的，并且需要领域的专业知识。本文旨在减少这一限制，并提出了一种用于管理农田作物杂草语义图像分割的自动标记管道。它允许深度学习模型的训练，而不需要或只需要有限的图像手动标记。我们的系统使用在现场操作的无人驾驶空中或地面机器人记录的RGB图像来产生语义标签，利用现场行结构进行空间一致的标记。我们使用先前检测到的行来识别多个作物行，减少标签错误并提高一致性。我们通过为具有挑战性的植被分配一个“未知”类来进一步减少标记错误。我们使用证据深度学习，因为它提供了预测的不确定性估计，我们用它来完善和改进我们的预测。通过这种方式，证据深度学习为杂草类分配了高不确定性，因为它在训练数据中通常较少代表，允许我们使用不确定性来纠正语义预测。实验结果表明，我们的方法在很大程度上优于应用于作物领域的通用标记方法和应用于多领域和作物物种的领域特定方法。使用我们生成的标签来训练深度学习模型，可以提高我们对未知作物品种、生长阶段或不同光照条件的未知领域的预测性能。在一个管理甜菜田，我们得到作物的IoU为88.6%，杂草的IoU为22.7%，其中完全监督方法对作物的IoU为83.4%，对杂草的IoU为33.5%，其他非监督领域特定方法对作物的IoU为54.6%，对杂草的IoU为11.2%。最后，我们的方法允许以完全监督的方式训练的微调模型在未见过的现场条件下提高其性能，平均IoU可提高17.6%，而无需额外的手动标记。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers in Robotics and AI ROBOTICS-

CiteScore

6.50

自引率

5.90%

发文量

355

审稿时长

14 weeks

期刊介绍： Frontiers in Robotics and AI publishes rigorously peer-reviewed research covering all theory and applications of robotics, technology, and artificial intelligence, from biomedical to space robotics.