Keyi Zhu , Jiajia Li , Kaixiang Zhang , Chaaran Arunachalam , Siddhartha Bhattacharya , Renfu Lu , Zhaojian Li
{"title":"基于基础模型的苹果成熟和大小估计的选择性收获","authors":"Keyi Zhu , Jiajia Li , Kaixiang Zhang , Chaaran Arunachalam , Siddhartha Bhattacharya , Renfu Lu , Zhaojian Li","doi":"10.1016/j.compag.2025.110407","DOIUrl":null,"url":null,"abstract":"<div><div>Harvesting is a critical task in the tree fruit industry, demanding extensive manual labor and substantial costs, and exposing workers to potential hazards. Recent advances in automated harvesting offer a promising solution by enabling efficient, cost-effective, and ergonomic fruit picking within tight harvesting windows. However, existing harvesting technologies often indiscriminately harvest all visible and accessible fruits, including those that are unripe or undersized. This study introduces a novel foundation-model-based framework for efficient apple ripeness and size estimation. Specifically, we curated two public RGBD-based Fuji apple image datasets, integrating expanded annotations for ripeness (“Ripe” vs. “Unripe”) based on fruit color and image capture dates. The resulting comprehensive dataset, <em>Fuji-Ripeness-Size Dataset</em>, includes 4,027 images and 16,257 annotated apples with ripeness and size labels. To the best of our knowledge, this is the first published dataset on apples with ripeness and size annotations. Leveraging Grounding-DINO, a foundation-model-based object detector, we achieved robust apple detection and ripeness estimation, with mean Average Precision being 72.8, outperforming other state-of-the-art models in the evaluation on our dataset. Additionally, we developed six size estimation algorithms, made a comprehensive comparison using box-plots, and identified the best algorithm with lowest error and variation. The <em>Fuji-Ripeness-Size Dataset</em> and the apple detection and size estimation algorithms are made publicly available<span><span><sup>1</sup></span></span>, which provides valuable benchmarks for future studies in automated and selective harvesting.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"236 ","pages":"Article 110407"},"PeriodicalIF":7.7000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Foundation model-based apple ripeness and size estimation for selective harvesting\",\"authors\":\"Keyi Zhu , Jiajia Li , Kaixiang Zhang , Chaaran Arunachalam , Siddhartha Bhattacharya , Renfu Lu , Zhaojian Li\",\"doi\":\"10.1016/j.compag.2025.110407\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Harvesting is a critical task in the tree fruit industry, demanding extensive manual labor and substantial costs, and exposing workers to potential hazards. Recent advances in automated harvesting offer a promising solution by enabling efficient, cost-effective, and ergonomic fruit picking within tight harvesting windows. However, existing harvesting technologies often indiscriminately harvest all visible and accessible fruits, including those that are unripe or undersized. This study introduces a novel foundation-model-based framework for efficient apple ripeness and size estimation. Specifically, we curated two public RGBD-based Fuji apple image datasets, integrating expanded annotations for ripeness (“Ripe” vs. “Unripe”) based on fruit color and image capture dates. The resulting comprehensive dataset, <em>Fuji-Ripeness-Size Dataset</em>, includes 4,027 images and 16,257 annotated apples with ripeness and size labels. To the best of our knowledge, this is the first published dataset on apples with ripeness and size annotations. Leveraging Grounding-DINO, a foundation-model-based object detector, we achieved robust apple detection and ripeness estimation, with mean Average Precision being 72.8, outperforming other state-of-the-art models in the evaluation on our dataset. Additionally, we developed six size estimation algorithms, made a comprehensive comparison using box-plots, and identified the best algorithm with lowest error and variation. The <em>Fuji-Ripeness-Size Dataset</em> and the apple detection and size estimation algorithms are made publicly available<span><span><sup>1</sup></span></span>, which provides valuable benchmarks for future studies in automated and selective harvesting.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"236 \",\"pages\":\"Article 110407\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-04-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925005137\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925005137","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
Foundation model-based apple ripeness and size estimation for selective harvesting
Harvesting is a critical task in the tree fruit industry, demanding extensive manual labor and substantial costs, and exposing workers to potential hazards. Recent advances in automated harvesting offer a promising solution by enabling efficient, cost-effective, and ergonomic fruit picking within tight harvesting windows. However, existing harvesting technologies often indiscriminately harvest all visible and accessible fruits, including those that are unripe or undersized. This study introduces a novel foundation-model-based framework for efficient apple ripeness and size estimation. Specifically, we curated two public RGBD-based Fuji apple image datasets, integrating expanded annotations for ripeness (“Ripe” vs. “Unripe”) based on fruit color and image capture dates. The resulting comprehensive dataset, Fuji-Ripeness-Size Dataset, includes 4,027 images and 16,257 annotated apples with ripeness and size labels. To the best of our knowledge, this is the first published dataset on apples with ripeness and size annotations. Leveraging Grounding-DINO, a foundation-model-based object detector, we achieved robust apple detection and ripeness estimation, with mean Average Precision being 72.8, outperforming other state-of-the-art models in the evaluation on our dataset. Additionally, we developed six size estimation algorithms, made a comprehensive comparison using box-plots, and identified the best algorithm with lowest error and variation. The Fuji-Ripeness-Size Dataset and the apple detection and size estimation algorithms are made publicly available1, which provides valuable benchmarks for future studies in automated and selective harvesting.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.