Tantan Jin , Xiongzhe Han , Pingan Wang , Zhao Zhang , Jie Guo , Fan Ding
{"title":"用于复杂果园中苹果检测、定位和计数的增强型深度学习模型,适用于基于机械臂的收获作业","authors":"Tantan Jin , Xiongzhe Han , Pingan Wang , Zhao Zhang , Jie Guo , Fan Ding","doi":"10.1016/j.atech.2025.100784","DOIUrl":null,"url":null,"abstract":"<div><div>The growing demand for automation in the apple-harvesting industry remains challenging due to the complex and dynamic nature of orchard environments. This study presents an enhanced deep learning model designed to improve the accuracy and adaptability of recognition algorithms for robotic arm-based harvesting. Specifically, an optimized You Only Look Once (YOLO) v8n model was developed by integrating a dilation-wise residual–dilated re-parameterization block module, a generalized feature pyramid network, and the Scylla Intersection-over-Union loss function. The enhanced model was trained and evaluated on a comprehensive dataset, achieving precision, recall, F1 score, and mAP50 values of 81.43 %, 68.48 %, 74.40 %, and 81.68 %, respectively. These results indicate improvements of 1.06 %, 1.42 %, 1.28 %, and 1.61 % over the original YOLOv8n, while preserving comparable model parameters, computational efficiency, and detection speed. Furthermore, the enhanced model demonstrated superior overall performance compared to YOLOv5, YOLOv6, and RT-DETR. To validate its adaptability and robustness, the enhanced model was rigorously tested against the original YOLOv8n model diverse conditions, including varying growth stage, lighting environments, field of view, and levels of occlusion. In outdoor field experiments conducted under cloudy, low-light, and artificial lighting conditions, the model achieved localization errors of 2.43 mm (X-axis), 3.70 mm (Y-axis), and 1.28 mm (Z-axis), representing reductions of 19.27 %, 12.67 %, and 23.05 %, respectively. Furthermore, counting accuracy improved to 69.39 %, reflecting a 2.42 % increase over the original model. The results demonstrate the enhanced model's reliable performance and heightened precision for robotic arm-based apple harvesting in complex and challenging orchard environments. The study also provides a comprehensive analysis of the model's strengths, limitations, and avenues for future research. Ultimately, this work contributes to advancing agricultural automation, paving the way for smarter, more efficient, and sustainable farming practices.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"10 ","pages":"Article 100784"},"PeriodicalIF":6.3000,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced deep learning model for apple detection, localization, and counting in complex orchards for robotic arm-based harvesting\",\"authors\":\"Tantan Jin , Xiongzhe Han , Pingan Wang , Zhao Zhang , Jie Guo , Fan Ding\",\"doi\":\"10.1016/j.atech.2025.100784\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The growing demand for automation in the apple-harvesting industry remains challenging due to the complex and dynamic nature of orchard environments. This study presents an enhanced deep learning model designed to improve the accuracy and adaptability of recognition algorithms for robotic arm-based harvesting. Specifically, an optimized You Only Look Once (YOLO) v8n model was developed by integrating a dilation-wise residual–dilated re-parameterization block module, a generalized feature pyramid network, and the Scylla Intersection-over-Union loss function. The enhanced model was trained and evaluated on a comprehensive dataset, achieving precision, recall, F1 score, and mAP50 values of 81.43 %, 68.48 %, 74.40 %, and 81.68 %, respectively. These results indicate improvements of 1.06 %, 1.42 %, 1.28 %, and 1.61 % over the original YOLOv8n, while preserving comparable model parameters, computational efficiency, and detection speed. Furthermore, the enhanced model demonstrated superior overall performance compared to YOLOv5, YOLOv6, and RT-DETR. To validate its adaptability and robustness, the enhanced model was rigorously tested against the original YOLOv8n model diverse conditions, including varying growth stage, lighting environments, field of view, and levels of occlusion. In outdoor field experiments conducted under cloudy, low-light, and artificial lighting conditions, the model achieved localization errors of 2.43 mm (X-axis), 3.70 mm (Y-axis), and 1.28 mm (Z-axis), representing reductions of 19.27 %, 12.67 %, and 23.05 %, respectively. Furthermore, counting accuracy improved to 69.39 %, reflecting a 2.42 % increase over the original model. The results demonstrate the enhanced model's reliable performance and heightened precision for robotic arm-based apple harvesting in complex and challenging orchard environments. The study also provides a comprehensive analysis of the model's strengths, limitations, and avenues for future research. Ultimately, this work contributes to advancing agricultural automation, paving the way for smarter, more efficient, and sustainable farming practices.</div></div>\",\"PeriodicalId\":74813,\"journal\":{\"name\":\"Smart agricultural technology\",\"volume\":\"10 \",\"pages\":\"Article 100784\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-01-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Smart agricultural technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772375525000188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375525000188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
Enhanced deep learning model for apple detection, localization, and counting in complex orchards for robotic arm-based harvesting
The growing demand for automation in the apple-harvesting industry remains challenging due to the complex and dynamic nature of orchard environments. This study presents an enhanced deep learning model designed to improve the accuracy and adaptability of recognition algorithms for robotic arm-based harvesting. Specifically, an optimized You Only Look Once (YOLO) v8n model was developed by integrating a dilation-wise residual–dilated re-parameterization block module, a generalized feature pyramid network, and the Scylla Intersection-over-Union loss function. The enhanced model was trained and evaluated on a comprehensive dataset, achieving precision, recall, F1 score, and mAP50 values of 81.43 %, 68.48 %, 74.40 %, and 81.68 %, respectively. These results indicate improvements of 1.06 %, 1.42 %, 1.28 %, and 1.61 % over the original YOLOv8n, while preserving comparable model parameters, computational efficiency, and detection speed. Furthermore, the enhanced model demonstrated superior overall performance compared to YOLOv5, YOLOv6, and RT-DETR. To validate its adaptability and robustness, the enhanced model was rigorously tested against the original YOLOv8n model diverse conditions, including varying growth stage, lighting environments, field of view, and levels of occlusion. In outdoor field experiments conducted under cloudy, low-light, and artificial lighting conditions, the model achieved localization errors of 2.43 mm (X-axis), 3.70 mm (Y-axis), and 1.28 mm (Z-axis), representing reductions of 19.27 %, 12.67 %, and 23.05 %, respectively. Furthermore, counting accuracy improved to 69.39 %, reflecting a 2.42 % increase over the original model. The results demonstrate the enhanced model's reliable performance and heightened precision for robotic arm-based apple harvesting in complex and challenging orchard environments. The study also provides a comprehensive analysis of the model's strengths, limitations, and avenues for future research. Ultimately, this work contributes to advancing agricultural automation, paving the way for smarter, more efficient, and sustainable farming practices.