Picking point localization method based on semantic reasoning for complex picking scenarios in vineyards

IF 12.4 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture Pub Date : 2025-05-26 DOI:10.1016/j.aiia.2025.05.004

Xuemin Lin , Jinhai Wang , Jinshuan Wang , Huiling Wei , Mingyou Chen , Lufeng Luo

{"title":"Picking point localization method based on semantic reasoning for complex picking scenarios in vineyards","authors":"Xuemin Lin , Jinhai Wang , Jinshuan Wang , Huiling Wei , Mingyou Chen , Lufeng Luo","doi":"10.1016/j.aiia.2025.05.004","DOIUrl":null,"url":null,"abstract":"<div><div>In the complex orchard environment, precise picking point localization is crucial for the automation of fruit picking robots. However, existing methods are prone to positioning errors when dealing with complex scenarios such as short peduncles, partial occlusion, or complete misidentification, which can affect the actual work efficiency of the fruit picking robot. This study proposes an enhanced picking point localization method based on semantic reasoning for complex picking scenarios in vineyard. It innovatively designs three modules: the semantic reasoning module (SRM), the ROI threshold adjustment strategy (RTAS), and the picking point location optimization module (PPOM). The SRM is applied to handle the scenarios of grape peduncles being obstructed by obstacles, partial misidentification of peduncles, and complete misidentification of peduncles. The RTAS addresses the issue of low and short peduncles during the picking process. Finally, the PPOM optimizes the final position of the picking point, allowing the robotic arm to perform the picking operation with greater flexibility. Experimental results show that SegFormer achieves an mIoU (mean Intersection over Union) of 84.54 %, with B_IoU and P_IoU reaching 73.90 % and 75.63 %, respectively. Additionally, the success rate of the improved fruit picking point localization algorithm reached 94.96 %, surpassing the baseline algorithm by 8.12 %. The algorithm's average processing time is 0.5428 ± 0.0063 s, meeting the practical requirements for real-time picking.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 4","pages":"Pages 744-756"},"PeriodicalIF":12.4000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Agriculture","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589721725000595","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

In the complex orchard environment, precise picking point localization is crucial for the automation of fruit picking robots. However, existing methods are prone to positioning errors when dealing with complex scenarios such as short peduncles, partial occlusion, or complete misidentification, which can affect the actual work efficiency of the fruit picking robot. This study proposes an enhanced picking point localization method based on semantic reasoning for complex picking scenarios in vineyard. It innovatively designs three modules: the semantic reasoning module (SRM), the ROI threshold adjustment strategy (RTAS), and the picking point location optimization module (PPOM). The SRM is applied to handle the scenarios of grape peduncles being obstructed by obstacles, partial misidentification of peduncles, and complete misidentification of peduncles. The RTAS addresses the issue of low and short peduncles during the picking process. Finally, the PPOM optimizes the final position of the picking point, allowing the robotic arm to perform the picking operation with greater flexibility. Experimental results show that SegFormer achieves an mIoU (mean Intersection over Union) of 84.54 %, with B_IoU and P_IoU reaching 73.90 % and 75.63 %, respectively. Additionally, the success rate of the improved fruit picking point localization algorithm reached 94.96 %, surpassing the baseline algorithm by 8.12 %. The algorithm's average processing time is 0.5428 ± 0.0063 s, meeting the practical requirements for real-time picking.

查看原文本刊更多论文

基于语义推理的葡萄园复杂采摘场景采摘点定位方法

在复杂的果园环境中，采摘点的精确定位是实现采摘机器人自动化的关键。然而，现有方法在处理短柄、部分遮挡或完全错认等复杂场景时，容易出现定位误差，影响摘果机器人的实际工作效率。针对复杂的葡萄园采摘场景，提出了一种基于语义推理的增强采摘点定位方法。创新设计了语义推理模块（SRM）、ROI阈值调整策略模块（RTAS）和拾取点位置优化模块（PPOM）三个模块。SRM用于处理葡萄梗被障碍物遮挡、部分错认和完全错认的情况。RTAS解决了采摘过程中花梗低而短的问题。最后，PPOM优化了拾取点的最终位置，使机械臂能够以更大的灵活性进行拾取操作。实验结果表明，SegFormer实现了84.54%的mIoU (average Intersection over Union)，其中B_IoU和P_IoU分别达到73.90%和75.63%。改进的果实采摘点定位算法的成功率达到94.96%，比基线算法高出8.12%。算法的平均处理时间为0.5428±0.0063 s，满足实时采摘的实际要求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊