{"title":"基于yolo的多任务链接网络(YL-Net)增强运动胡萝卜几何特性预测","authors":"Yi-Liang Wu , Sze-Teng Liong , Gen-Bing Liong , Jun-Hui Liang , Y.S. Gan","doi":"10.1016/j.compag.2025.110583","DOIUrl":null,"url":null,"abstract":"<div><div>With a growing shortage of manual labor in agriculture, there is an urgent need for efficient and automated solutions to address the complex challenges of inspecting and classifying agricultural products, such as carrots, which exhibit irregular shapes, occlusions, and high variability in geometric properties. This study introduces a novel Multi-task YOLO-based Linked Network (YL-Net) designed to estimate the geometric properties of carrots in motion, including width, length, volume, and mass. The proposed network integrates RGB-D input with decoupled multi-task learning to simultaneously perform instance segmentation and regression. Building upon our previous work, the enhanced framework presented herein achieves outstanding performance, with MAPE values below 2.5% for all estimated properties. When aggregating multi-view data from a rolling conveyor system, the accuracy further improves, yielding MAPE values below 2%. In terms of detection, the model demonstrates excellent performance, achieving a mean F1-score of 98.78% and an instance segmentation IoU of 89.25%. To evaluate its scalability, the system was deployed on an NVIDIA Jetson Orin Nano, where it achieved a real-time processing speed of 80 FPS. Beyond carrots, the proposed approach can be extended to inspect other agricultural products, such as potatoes and sweet potatoes, where geometric properties are essential for sorting and grading. This work provides a scalable and transferable solution for automated agricultural inspection, laying a robust foundation for broader applications in smart farming, industrial automation, and food quality control.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"237 ","pages":"Article 110583"},"PeriodicalIF":8.9000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced geometric properties prediction for carrots in motion using a Multi-task YOLO-based Linked Network (YL-Net)\",\"authors\":\"Yi-Liang Wu , Sze-Teng Liong , Gen-Bing Liong , Jun-Hui Liang , Y.S. Gan\",\"doi\":\"10.1016/j.compag.2025.110583\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With a growing shortage of manual labor in agriculture, there is an urgent need for efficient and automated solutions to address the complex challenges of inspecting and classifying agricultural products, such as carrots, which exhibit irregular shapes, occlusions, and high variability in geometric properties. This study introduces a novel Multi-task YOLO-based Linked Network (YL-Net) designed to estimate the geometric properties of carrots in motion, including width, length, volume, and mass. The proposed network integrates RGB-D input with decoupled multi-task learning to simultaneously perform instance segmentation and regression. Building upon our previous work, the enhanced framework presented herein achieves outstanding performance, with MAPE values below 2.5% for all estimated properties. When aggregating multi-view data from a rolling conveyor system, the accuracy further improves, yielding MAPE values below 2%. In terms of detection, the model demonstrates excellent performance, achieving a mean F1-score of 98.78% and an instance segmentation IoU of 89.25%. To evaluate its scalability, the system was deployed on an NVIDIA Jetson Orin Nano, where it achieved a real-time processing speed of 80 FPS. Beyond carrots, the proposed approach can be extended to inspect other agricultural products, such as potatoes and sweet potatoes, where geometric properties are essential for sorting and grading. This work provides a scalable and transferable solution for automated agricultural inspection, laying a robust foundation for broader applications in smart farming, industrial automation, and food quality control.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"237 \",\"pages\":\"Article 110583\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-06-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925006891\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925006891","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
随着农业中体力劳动的日益短缺,迫切需要高效和自动化的解决方案来解决检查和分类农产品的复杂挑战,例如胡萝卜,它们具有不规则的形状,闭塞性和几何特性的高度可变性。本研究介绍了一种新颖的基于yolo的多任务链接网络(YL-Net),旨在估计运动中的胡萝卜的几何特性,包括宽度、长度、体积和质量。该网络将RGB-D输入与解耦的多任务学习相结合,同时进行实例分割和回归。在我们之前工作的基础上,本文提出的增强框架取得了出色的性能,所有估计属性的MAPE值均低于2.5%。当聚合来自滚动输送系统的多视图数据时,精度进一步提高,MAPE值低于2%。在检测方面,该模型表现出优异的性能,平均f1得分为98.78%,实例分割IoU为89.25%。为了评估其可扩展性,该系统被部署在NVIDIA Jetson Orin Nano上,实现了80 FPS的实时处理速度。除了胡萝卜,该方法还可以推广到其他农产品,如土豆和红薯,这些农产品的几何特性对分类和分级至关重要。这项工作为自动化农业检验提供了可扩展和可转移的解决方案,为智能农业、工业自动化和食品质量控制的更广泛应用奠定了坚实的基础。
Enhanced geometric properties prediction for carrots in motion using a Multi-task YOLO-based Linked Network (YL-Net)
With a growing shortage of manual labor in agriculture, there is an urgent need for efficient and automated solutions to address the complex challenges of inspecting and classifying agricultural products, such as carrots, which exhibit irregular shapes, occlusions, and high variability in geometric properties. This study introduces a novel Multi-task YOLO-based Linked Network (YL-Net) designed to estimate the geometric properties of carrots in motion, including width, length, volume, and mass. The proposed network integrates RGB-D input with decoupled multi-task learning to simultaneously perform instance segmentation and regression. Building upon our previous work, the enhanced framework presented herein achieves outstanding performance, with MAPE values below 2.5% for all estimated properties. When aggregating multi-view data from a rolling conveyor system, the accuracy further improves, yielding MAPE values below 2%. In terms of detection, the model demonstrates excellent performance, achieving a mean F1-score of 98.78% and an instance segmentation IoU of 89.25%. To evaluate its scalability, the system was deployed on an NVIDIA Jetson Orin Nano, where it achieved a real-time processing speed of 80 FPS. Beyond carrots, the proposed approach can be extended to inspect other agricultural products, such as potatoes and sweet potatoes, where geometric properties are essential for sorting and grading. This work provides a scalable and transferable solution for automated agricultural inspection, laying a robust foundation for broader applications in smart farming, industrial automation, and food quality control.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.