SE-YOLO: A sobel-enhanced framework for high-accuracy, lightweight real-time tomato detection with edge deployment capability

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Xiao Deng , Tianlun Huang , Weijun Wang , Wei Feng
{"title":"SE-YOLO: A sobel-enhanced framework for high-accuracy, lightweight real-time tomato detection with edge deployment capability","authors":"Xiao Deng ,&nbsp;Tianlun Huang ,&nbsp;Weijun Wang ,&nbsp;Wei Feng","doi":"10.1016/j.compag.2025.110973","DOIUrl":null,"url":null,"abstract":"<div><div>Tomato detection is one of the most crucial steps in automated tomato harvesting. However, detection in agricultural settings faces challenges from complex environments and computational constraints. Here, we propose SE-YOLO, a lightweight framework with fundamentally different technical approach compared to existing methods. While current SOTA methods like RT-DETR rely on complex transformer architectures for global attention and Hyper-YOLO employs hypergraph computation for high-order feature relationships, SE-YOLO introduces a more direct and efficient paradigm through explicit edge perception. This approach includes: (i) SPStem’s integration of Sobel operators at network entry to directly extract edge features—a departure from the conventional approach of implicitly learning edges through multiple convolutional layers used in existing frameworks; (ii) ADown’s efficient channel-split architecture; (iii) SEDFF’s residual-based edge enhancement throughout the backbone that preserves both edge and semantic information; and (iv) WFPIoU’s adaptive spatial supervision that dynamically adjusts loss weights across training phases—contrasting with the static penalty terms in conventional IoU losses like CIoU and EIoU. These innovations create a detection framework specifically optimized for agricultural environments, capturing crucial boundary information for occluded fruits while maintaining minimal computational overhead. SE-YOLO achieves 93.6% [email protected] and 67.3% [email protected]:0.95, outperforming state-of-the-art detectors while using only 4.3% of RT-DETR’s computational cost Subsequently, the developed SE-YOLO model is deployed on standard edge hardware in INT8-quantized form. The deployment achieves 18.7 frames per second (FPS), with an 88.8% [email protected] and a 61.6% [email protected]:0.95, showing gains of 3.2% and 2.4%, respectively, over the YOLO11n model. Finally, the generalization performance of SE-YOLO is tested and validated across other datasets, demonstrating its effectiveness in detecting occluded and partially visible tomatoes, as well as in maturity-related detection tasks.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 110973"},"PeriodicalIF":8.9000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925010798","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Tomato detection is one of the most crucial steps in automated tomato harvesting. However, detection in agricultural settings faces challenges from complex environments and computational constraints. Here, we propose SE-YOLO, a lightweight framework with fundamentally different technical approach compared to existing methods. While current SOTA methods like RT-DETR rely on complex transformer architectures for global attention and Hyper-YOLO employs hypergraph computation for high-order feature relationships, SE-YOLO introduces a more direct and efficient paradigm through explicit edge perception. This approach includes: (i) SPStem’s integration of Sobel operators at network entry to directly extract edge features—a departure from the conventional approach of implicitly learning edges through multiple convolutional layers used in existing frameworks; (ii) ADown’s efficient channel-split architecture; (iii) SEDFF’s residual-based edge enhancement throughout the backbone that preserves both edge and semantic information; and (iv) WFPIoU’s adaptive spatial supervision that dynamically adjusts loss weights across training phases—contrasting with the static penalty terms in conventional IoU losses like CIoU and EIoU. These innovations create a detection framework specifically optimized for agricultural environments, capturing crucial boundary information for occluded fruits while maintaining minimal computational overhead. SE-YOLO achieves 93.6% [email protected] and 67.3% [email protected]:0.95, outperforming state-of-the-art detectors while using only 4.3% of RT-DETR’s computational cost Subsequently, the developed SE-YOLO model is deployed on standard edge hardware in INT8-quantized form. The deployment achieves 18.7 frames per second (FPS), with an 88.8% [email protected] and a 61.6% [email protected]:0.95, showing gains of 3.2% and 2.4%, respectively, over the YOLO11n model. Finally, the generalization performance of SE-YOLO is tested and validated across other datasets, demonstrating its effectiveness in detecting occluded and partially visible tomatoes, as well as in maturity-related detection tasks.
SE-YOLO:具有边缘部署能力的高精度、轻量级实时番茄检测的sobel增强框架
番茄检测是番茄自动化收获过程中最关键的步骤之一。然而,农业环境中的检测面临着来自复杂环境和计算限制的挑战。在这里,我们提出了SE-YOLO,这是一个轻量级框架,与现有方法相比,它的技术方法完全不同。当前的SOTA方法(如RT-DETR)依赖于复杂的变压器架构来获得全局关注,Hyper-YOLO使用超图计算来处理高阶特征关系,而SE-YOLO通过显式边缘感知引入了更直接和有效的范式。该方法包括:(i) SPStem在网络入口处集成Sobel算子以直接提取边缘特征,这与现有框架中使用的通过多个卷积层隐式学习边缘的传统方法不同;down的高效渠道分割架构;(iii) SEDFF在整个主干中基于残差的边缘增强,既保留了边缘信息,又保留了语义信息;(iv)与CIoU和EIoU等传统IoU损失中的静态惩罚项相比,WFPIoU的自适应空间监督可以动态调整训练阶段的损失权重。这些创新创造了一个专门针对农业环境优化的检测框架,在保持最小计算开销的同时捕获遮挡水果的关键边界信息。SE-YOLO达到93.6% [email protected]和67.3% [email protected]:0.95,优于最先进的检测器,而仅使用RT-DETR计算成本的4.3%。随后,开发的SE-YOLO模型以int8量化形式部署在标准边缘硬件上。该部署实现了每秒18.7帧(FPS), 88.8% [email protected]和61.6% [email protected]:0.95,分别比YOLO11n模型提高3.2%和2.4%。最后,在其他数据集上对SE-YOLO的泛化性能进行了测试和验证,证明了其在检测遮挡和部分可见西红柿以及成熟度相关检测任务中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信