Zijing Huang , Won Suk Lee , Peng Yang , Yiannis Ampatzidis , Agehara Shinsuke , Natalia A. Peres
{"title":"Advanced canopy size estimation in strawberry production: a machine learning approach using YOLOv11 and SAM","authors":"Zijing Huang , Won Suk Lee , Peng Yang , Yiannis Ampatzidis , Agehara Shinsuke , Natalia A. Peres","doi":"10.1016/j.compag.2025.110501","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents a novel approach for estimating strawberry canopy size by integrating the Segment Anything Model (SAM) with YOLOv11 detection, enhancing accuracy and efficiency in precision agriculture. Traditional methods of canopy size estimation are labor-intensive and frequently inaccurate, posing considerable limitations in agricultural applications. To overcome these issues, our research introduces an innovative integration of SAM’s zero-shot segmentation capabilities with YOLOv11′s advanced detection accuracy, underpinned by a novel prompt selection algorithm. This algorithm automates prompt optimization by using precise detection outputs from YOLOv11 to guide SAM, eliminating the need for extensively annotated datasets required by conventional and supervised segmentation methods. The prompt selection algorithm is proposed in two innovative variants: vanilla and refined. The vanilla approach employs bounding box detections from YOLOv11 plant detection alongside strategically chosen point prompts from fruit detection outputs to enhance segmentation specificity. The refined approach further advances this concept by introducing a hollow concentric structure algorithm to selectively choose background points from regions overlapping fruit detections and preliminary SAM masks. This refinement reduces segmentation errors by identifying non-canopy points, thus improving segmentation reliability. Experimental validation demonstrated that the vanilla approach achieved an Intersection over Union (IoU) of 0.913, while the refined approach reached an even higher IoU of 0.924. Additionally, we integrated Depth Anything v2 (DAv2) to transition from 2D segmentation to robust 3D canopy volume estimation. This comprehensive framework not only improves upon existing segmentation methods but also provides a practical, scalable solution for precision agriculture, showcasing significant advancements in automated canopy analysis.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"236 ","pages":"Article 110501"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925006076","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents a novel approach for estimating strawberry canopy size by integrating the Segment Anything Model (SAM) with YOLOv11 detection, enhancing accuracy and efficiency in precision agriculture. Traditional methods of canopy size estimation are labor-intensive and frequently inaccurate, posing considerable limitations in agricultural applications. To overcome these issues, our research introduces an innovative integration of SAM’s zero-shot segmentation capabilities with YOLOv11′s advanced detection accuracy, underpinned by a novel prompt selection algorithm. This algorithm automates prompt optimization by using precise detection outputs from YOLOv11 to guide SAM, eliminating the need for extensively annotated datasets required by conventional and supervised segmentation methods. The prompt selection algorithm is proposed in two innovative variants: vanilla and refined. The vanilla approach employs bounding box detections from YOLOv11 plant detection alongside strategically chosen point prompts from fruit detection outputs to enhance segmentation specificity. The refined approach further advances this concept by introducing a hollow concentric structure algorithm to selectively choose background points from regions overlapping fruit detections and preliminary SAM masks. This refinement reduces segmentation errors by identifying non-canopy points, thus improving segmentation reliability. Experimental validation demonstrated that the vanilla approach achieved an Intersection over Union (IoU) of 0.913, while the refined approach reached an even higher IoU of 0.924. Additionally, we integrated Depth Anything v2 (DAv2) to transition from 2D segmentation to robust 3D canopy volume estimation. This comprehensive framework not only improves upon existing segmentation methods but also provides a practical, scalable solution for precision agriculture, showcasing significant advancements in automated canopy analysis.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.