CTDA:复杂环境下准确高效的圣女果检测算法。

IF 4.1 2区 生物学 Q1 PLANT SCIENCES
Frontiers in Plant Science Pub Date : 2025-03-13 eCollection Date: 2025-01-01 DOI:10.3389/fpls.2025.1492110
Zhi Liang, Caihong Zhang, Zhonglong Lin, Guoqiang Wang, Xiaojuan Li, Xiangjun Zou
{"title":"CTDA:复杂环境下准确高效的圣女果检测算法。","authors":"Zhi Liang, Caihong Zhang, Zhonglong Lin, Guoqiang Wang, Xiaojuan Li, Xiangjun Zou","doi":"10.3389/fpls.2025.1492110","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>In the natural harvesting conditions of cherry tomatoes, the robotic vision for harvesting faces challenges such as lighting, overlapping, and occlusion among various environmental factors. To ensure accuracy and efficiency in detecting cherry tomatoes in complex environments, the study proposes a precise, realtime, and robust target detection algorithm: the CTDA model, to support robotic harvesting operations in unstructured environments.</p><p><strong>Methods: </strong>The model, based on YOLOv8, introduces a lightweight downsampling method to restructure the backbone network, incorporating adaptive weights and receptive field spatial characteristics to ensure that low-dimensional small target features are not completely lost. By using softpool to replace maxpool in SPPF, a new SPPFS is constructed, achieving efficient feature utilization and richer multi-scale feature fusion. Additionally, by incorporating a dynamic head driven by the attention mechanism, the recognition precision of cherry tomatoes in complex scenarios is enhanced through more effective feature capture across different scales.</p><p><strong>Results: </strong>CTDA demonstrates good adaptability and robustness in complex scenarios. Its detection accuracy reaches 94.3%, with recall and average precision of 91.5% and 95.3%, respectively, while achieving a mAP@0.5:0.95 of 76.5% and an FPS of 154.1 frames per second. Compared to YOLOv8, it improves mAP by 2.9% while maintaining detection speed, with a model size of 6.7M.</p><p><strong>Discussion: </strong>Experimental results validate the effectiveness of the CTDA model in cherry tomato detection under complex environments. While improving detection accuracy, the model also enhances adaptability to lighting variations, occlusion, and dense small target scenarios, and can be deployed on edge devices for rapid detection, providing strong support for automated cherry tomato picking.</p>","PeriodicalId":12632,"journal":{"name":"Frontiers in Plant Science","volume":"16 ","pages":"1492110"},"PeriodicalIF":4.1000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11965914/pdf/","citationCount":"0","resultStr":"{\"title\":\"CTDA: an accurate and efficient cherry tomato detection algorithm in complex environments.\",\"authors\":\"Zhi Liang, Caihong Zhang, Zhonglong Lin, Guoqiang Wang, Xiaojuan Li, Xiangjun Zou\",\"doi\":\"10.3389/fpls.2025.1492110\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>In the natural harvesting conditions of cherry tomatoes, the robotic vision for harvesting faces challenges such as lighting, overlapping, and occlusion among various environmental factors. To ensure accuracy and efficiency in detecting cherry tomatoes in complex environments, the study proposes a precise, realtime, and robust target detection algorithm: the CTDA model, to support robotic harvesting operations in unstructured environments.</p><p><strong>Methods: </strong>The model, based on YOLOv8, introduces a lightweight downsampling method to restructure the backbone network, incorporating adaptive weights and receptive field spatial characteristics to ensure that low-dimensional small target features are not completely lost. By using softpool to replace maxpool in SPPF, a new SPPFS is constructed, achieving efficient feature utilization and richer multi-scale feature fusion. Additionally, by incorporating a dynamic head driven by the attention mechanism, the recognition precision of cherry tomatoes in complex scenarios is enhanced through more effective feature capture across different scales.</p><p><strong>Results: </strong>CTDA demonstrates good adaptability and robustness in complex scenarios. Its detection accuracy reaches 94.3%, with recall and average precision of 91.5% and 95.3%, respectively, while achieving a mAP@0.5:0.95 of 76.5% and an FPS of 154.1 frames per second. Compared to YOLOv8, it improves mAP by 2.9% while maintaining detection speed, with a model size of 6.7M.</p><p><strong>Discussion: </strong>Experimental results validate the effectiveness of the CTDA model in cherry tomato detection under complex environments. While improving detection accuracy, the model also enhances adaptability to lighting variations, occlusion, and dense small target scenarios, and can be deployed on edge devices for rapid detection, providing strong support for automated cherry tomato picking.</p>\",\"PeriodicalId\":12632,\"journal\":{\"name\":\"Frontiers in Plant Science\",\"volume\":\"16 \",\"pages\":\"1492110\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11965914/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Plant Science\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.3389/fpls.2025.1492110\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"PLANT SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Plant Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fpls.2025.1492110","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

在圣女果的自然收获条件下,收获机器人视觉面临着各种环境因素之间的光照、重叠、遮挡等挑战。为了确保在复杂环境中检测圣果的准确性和效率,该研究提出了一种精确、实时和鲁棒的目标检测算法:CTDA模型,以支持非结构化环境中的机器人收获操作。方法:该模型以YOLOv8为基础,引入轻量化下采样方法重构骨干网,结合自适应权值和感受野空间特征,保证低维小目标特征不被完全丢失。利用softpool取代SPPF中的maxpool,构建新的SPPF,实现了高效的特征利用和更丰富的多尺度特征融合。此外,通过引入由注意机制驱动的动态头部,通过在不同尺度上更有效地捕获特征,提高了复杂场景下圣女果的识别精度。结果:CTDA在复杂场景下具有良好的适应性和鲁棒性。检测准确率达到94.3%,召回率91.5%,平均精度95.3%,达到mAP@0.5:0.95的76.5%,帧数154.1帧/秒。与YOLOv8相比,在保持检测速度的同时,mAP提高2.9%,模型尺寸6.7M。讨论:实验结果验证了CTDA模型在复杂环境下圣女果检测中的有效性。在提高检测精度的同时,该模型还增强了对光照变化、遮挡、密集小目标场景的适应性,可部署在边缘设备上进行快速检测,为樱桃番茄的自动化采摘提供有力支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
CTDA: an accurate and efficient cherry tomato detection algorithm in complex environments.

Introduction: In the natural harvesting conditions of cherry tomatoes, the robotic vision for harvesting faces challenges such as lighting, overlapping, and occlusion among various environmental factors. To ensure accuracy and efficiency in detecting cherry tomatoes in complex environments, the study proposes a precise, realtime, and robust target detection algorithm: the CTDA model, to support robotic harvesting operations in unstructured environments.

Methods: The model, based on YOLOv8, introduces a lightweight downsampling method to restructure the backbone network, incorporating adaptive weights and receptive field spatial characteristics to ensure that low-dimensional small target features are not completely lost. By using softpool to replace maxpool in SPPF, a new SPPFS is constructed, achieving efficient feature utilization and richer multi-scale feature fusion. Additionally, by incorporating a dynamic head driven by the attention mechanism, the recognition precision of cherry tomatoes in complex scenarios is enhanced through more effective feature capture across different scales.

Results: CTDA demonstrates good adaptability and robustness in complex scenarios. Its detection accuracy reaches 94.3%, with recall and average precision of 91.5% and 95.3%, respectively, while achieving a mAP@0.5:0.95 of 76.5% and an FPS of 154.1 frames per second. Compared to YOLOv8, it improves mAP by 2.9% while maintaining detection speed, with a model size of 6.7M.

Discussion: Experimental results validate the effectiveness of the CTDA model in cherry tomato detection under complex environments. While improving detection accuracy, the model also enhances adaptability to lighting variations, occlusion, and dense small target scenarios, and can be deployed on edge devices for rapid detection, providing strong support for automated cherry tomato picking.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Frontiers in Plant Science
Frontiers in Plant Science PLANT SCIENCES-
CiteScore
7.30
自引率
14.30%
发文量
4844
审稿时长
14 weeks
期刊介绍: In an ever changing world, plant science is of the utmost importance for securing the future well-being of humankind. Plants provide oxygen, food, feed, fibers, and building materials. In addition, they are a diverse source of industrial and pharmaceutical chemicals. Plants are centrally important to the health of ecosystems, and their understanding is critical for learning how to manage and maintain a sustainable biosphere. Plant science is extremely interdisciplinary, reaching from agricultural science to paleobotany, and molecular physiology to ecology. It uses the latest developments in computer science, optics, molecular biology and genomics to address challenges in model systems, agricultural crops, and ecosystems. Plant science research inquires into the form, function, development, diversity, reproduction, evolution and uses of both higher and lower plants and their interactions with other organisms throughout the biosphere. Frontiers in Plant Science welcomes outstanding contributions in any field of plant science from basic to applied research, from organismal to molecular studies, from single plant analysis to studies of populations and whole ecosystems, and from molecular to biophysical to computational approaches. Frontiers in Plant Science publishes articles on the most outstanding discoveries across a wide research spectrum of Plant Science. The mission of Frontiers in Plant Science is to bring all relevant Plant Science areas together on a single platform.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信