Zijing Huang , Won Suk Lee , Peng Yang , Yiannis Ampatzidis , Agehara Shinsuke , Natalia A. Peres
{"title":"草莓生产中先进的冠层大小估计:基于YOLOv11和SAM的机器学习方法","authors":"Zijing Huang , Won Suk Lee , Peng Yang , Yiannis Ampatzidis , Agehara Shinsuke , Natalia A. Peres","doi":"10.1016/j.compag.2025.110501","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents a novel approach for estimating strawberry canopy size by integrating the Segment Anything Model (SAM) with YOLOv11 detection, enhancing accuracy and efficiency in precision agriculture. Traditional methods of canopy size estimation are labor-intensive and frequently inaccurate, posing considerable limitations in agricultural applications. To overcome these issues, our research introduces an innovative integration of SAM’s zero-shot segmentation capabilities with YOLOv11′s advanced detection accuracy, underpinned by a novel prompt selection algorithm. This algorithm automates prompt optimization by using precise detection outputs from YOLOv11 to guide SAM, eliminating the need for extensively annotated datasets required by conventional and supervised segmentation methods. The prompt selection algorithm is proposed in two innovative variants: vanilla and refined. The vanilla approach employs bounding box detections from YOLOv11 plant detection alongside strategically chosen point prompts from fruit detection outputs to enhance segmentation specificity. The refined approach further advances this concept by introducing a hollow concentric structure algorithm to selectively choose background points from regions overlapping fruit detections and preliminary SAM masks. This refinement reduces segmentation errors by identifying non-canopy points, thus improving segmentation reliability. Experimental validation demonstrated that the vanilla approach achieved an Intersection over Union (IoU) of 0.913, while the refined approach reached an even higher IoU of 0.924. Additionally, we integrated Depth Anything v2 (DAv2) to transition from 2D segmentation to robust 3D canopy volume estimation. This comprehensive framework not only improves upon existing segmentation methods but also provides a practical, scalable solution for precision agriculture, showcasing significant advancements in automated canopy analysis.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"236 ","pages":"Article 110501"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advanced canopy size estimation in strawberry production: a machine learning approach using YOLOv11 and SAM\",\"authors\":\"Zijing Huang , Won Suk Lee , Peng Yang , Yiannis Ampatzidis , Agehara Shinsuke , Natalia A. Peres\",\"doi\":\"10.1016/j.compag.2025.110501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study presents a novel approach for estimating strawberry canopy size by integrating the Segment Anything Model (SAM) with YOLOv11 detection, enhancing accuracy and efficiency in precision agriculture. Traditional methods of canopy size estimation are labor-intensive and frequently inaccurate, posing considerable limitations in agricultural applications. To overcome these issues, our research introduces an innovative integration of SAM’s zero-shot segmentation capabilities with YOLOv11′s advanced detection accuracy, underpinned by a novel prompt selection algorithm. This algorithm automates prompt optimization by using precise detection outputs from YOLOv11 to guide SAM, eliminating the need for extensively annotated datasets required by conventional and supervised segmentation methods. The prompt selection algorithm is proposed in two innovative variants: vanilla and refined. The vanilla approach employs bounding box detections from YOLOv11 plant detection alongside strategically chosen point prompts from fruit detection outputs to enhance segmentation specificity. The refined approach further advances this concept by introducing a hollow concentric structure algorithm to selectively choose background points from regions overlapping fruit detections and preliminary SAM masks. This refinement reduces segmentation errors by identifying non-canopy points, thus improving segmentation reliability. Experimental validation demonstrated that the vanilla approach achieved an Intersection over Union (IoU) of 0.913, while the refined approach reached an even higher IoU of 0.924. Additionally, we integrated Depth Anything v2 (DAv2) to transition from 2D segmentation to robust 3D canopy volume estimation. This comprehensive framework not only improves upon existing segmentation methods but also provides a practical, scalable solution for precision agriculture, showcasing significant advancements in automated canopy analysis.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"236 \",\"pages\":\"Article 110501\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925006076\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925006076","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
本研究提出了一种将分段任意模型(SAM)与YOLOv11检测相结合的草莓冠层大小估算新方法,提高了精准农业的精度和效率。传统的冠层大小估算方法是劳动密集型的,而且往往不准确,在农业应用中存在相当大的局限性。为了克服这些问题,我们的研究引入了一种创新的集成,将SAM的零射击分割能力与YOLOv11的先进检测精度结合起来,并以一种新的提示选择算法为基础。该算法通过使用YOLOv11的精确检测输出来指导SAM,从而自动进行提示优化,从而消除了传统和监督分割方法所需的大量注释数据集的需要。提出了两种创新的提示选择算法:vanilla和refined。香草方法采用来自YOLOv11植物检测的边界框检测以及从水果检测输出中策略性选择的点提示来增强分割特异性。改进后的方法进一步推进了这一概念,引入了一种空心同心结构算法,从重叠的水果检测区域和初步SAM掩模中选择性地选择背景点。这种细化通过识别非冠层点来减少分割误差,从而提高分割的可靠性。实验验证表明,香草方法实现了0.913的交联(Intersection over Union, IoU),而改进方法达到了更高的IoU 0.924。此外,我们集成了Depth Anything v2 (DAv2)来从2D分割过渡到鲁棒的3D冠层体积估计。这个全面的框架不仅改进了现有的分割方法,而且为精准农业提供了一个实用的、可扩展的解决方案,展示了自动化冠层分析的重大进步。
Advanced canopy size estimation in strawberry production: a machine learning approach using YOLOv11 and SAM
This study presents a novel approach for estimating strawberry canopy size by integrating the Segment Anything Model (SAM) with YOLOv11 detection, enhancing accuracy and efficiency in precision agriculture. Traditional methods of canopy size estimation are labor-intensive and frequently inaccurate, posing considerable limitations in agricultural applications. To overcome these issues, our research introduces an innovative integration of SAM’s zero-shot segmentation capabilities with YOLOv11′s advanced detection accuracy, underpinned by a novel prompt selection algorithm. This algorithm automates prompt optimization by using precise detection outputs from YOLOv11 to guide SAM, eliminating the need for extensively annotated datasets required by conventional and supervised segmentation methods. The prompt selection algorithm is proposed in two innovative variants: vanilla and refined. The vanilla approach employs bounding box detections from YOLOv11 plant detection alongside strategically chosen point prompts from fruit detection outputs to enhance segmentation specificity. The refined approach further advances this concept by introducing a hollow concentric structure algorithm to selectively choose background points from regions overlapping fruit detections and preliminary SAM masks. This refinement reduces segmentation errors by identifying non-canopy points, thus improving segmentation reliability. Experimental validation demonstrated that the vanilla approach achieved an Intersection over Union (IoU) of 0.913, while the refined approach reached an even higher IoU of 0.924. Additionally, we integrated Depth Anything v2 (DAv2) to transition from 2D segmentation to robust 3D canopy volume estimation. This comprehensive framework not only improves upon existing segmentation methods but also provides a practical, scalable solution for precision agriculture, showcasing significant advancements in automated canopy analysis.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.