TBD-Y:基于目标-空间注意协同和全局-局部注意引导特征融合的茶芽自动检测

IF 5.7 Q1 AGRICULTURAL ENGINEERING
Zhongyuan Liu , Li Zhuo , Chunwang Dong , Jiafeng Li , Yang Li
{"title":"TBD-Y:基于目标-空间注意协同和全局-局部注意引导特征融合的茶芽自动检测","authors":"Zhongyuan Liu ,&nbsp;Li Zhuo ,&nbsp;Chunwang Dong ,&nbsp;Jiafeng Li ,&nbsp;Yang Li","doi":"10.1016/j.atech.2025.101066","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic Tea Bud Detection (TBD) is a critical technology in intelligent tea-picking systems. Nevertheless, challenges, such as complex environments and the high visual similarity between tea buds and backgrounds, frequently result in false detection and missed detection, especially for small tea buds. To address these issues, in this paper, an automatic TBD method is proposed, which is built upon the YOLOv11 object detection framework, named TBD-Y. Firstly, a Synergistic Object-Spatial Attention (SOSA) mechanism is proposed, which incorporates the proposed Local Context Attention (LCA) mechanism to enhance the features in both spatial and regional dimensions. It enables the network to focus more on the tea bud regions, and suppress the interference from background noise. Secondly, a Global-local Attention Guided Feature Fusion (GAGFF) strategy is designed. It consists of two branches: one branch enhances low-resolution, high-level features containing rich global semantic information, while the other branch strengthens high-resolution features that preserve low-level visual details. The fusion of these two branches improves the representation capability of the features. The SOSA and GAGFF are integrated into the YOLOv11 framework, constructing three variants of the TBD model with different parameter scales, named TBD-Y-L, TBD-Y-M, and TBD-Y-S. Experimental results on the self-built TBD dataset and the publicly available Global Wheat Head Dataset 2021 (GWHD_2021) demonstrate that the proposed TBD-Y-L outperforms the existing methods, achieving superior detection accuracy. Furthermore, the TBD-Y-S model exhibits improved detection accuracy compared to YOLOv11-L, while maintaining lower model parameters and computational complexity.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"12 ","pages":"Article 101066"},"PeriodicalIF":5.7000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TBD-Y: Automatic tea bud detection with synergistic object-spatial attention and global-local attention guided feature fusion\",\"authors\":\"Zhongyuan Liu ,&nbsp;Li Zhuo ,&nbsp;Chunwang Dong ,&nbsp;Jiafeng Li ,&nbsp;Yang Li\",\"doi\":\"10.1016/j.atech.2025.101066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Automatic Tea Bud Detection (TBD) is a critical technology in intelligent tea-picking systems. Nevertheless, challenges, such as complex environments and the high visual similarity between tea buds and backgrounds, frequently result in false detection and missed detection, especially for small tea buds. To address these issues, in this paper, an automatic TBD method is proposed, which is built upon the YOLOv11 object detection framework, named TBD-Y. Firstly, a Synergistic Object-Spatial Attention (SOSA) mechanism is proposed, which incorporates the proposed Local Context Attention (LCA) mechanism to enhance the features in both spatial and regional dimensions. It enables the network to focus more on the tea bud regions, and suppress the interference from background noise. Secondly, a Global-local Attention Guided Feature Fusion (GAGFF) strategy is designed. It consists of two branches: one branch enhances low-resolution, high-level features containing rich global semantic information, while the other branch strengthens high-resolution features that preserve low-level visual details. The fusion of these two branches improves the representation capability of the features. The SOSA and GAGFF are integrated into the YOLOv11 framework, constructing three variants of the TBD model with different parameter scales, named TBD-Y-L, TBD-Y-M, and TBD-Y-S. Experimental results on the self-built TBD dataset and the publicly available Global Wheat Head Dataset 2021 (GWHD_2021) demonstrate that the proposed TBD-Y-L outperforms the existing methods, achieving superior detection accuracy. Furthermore, the TBD-Y-S model exhibits improved detection accuracy compared to YOLOv11-L, while maintaining lower model parameters and computational complexity.</div></div>\",\"PeriodicalId\":74813,\"journal\":{\"name\":\"Smart agricultural technology\",\"volume\":\"12 \",\"pages\":\"Article 101066\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Smart agricultural technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772375525002990\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375525002990","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

茶芽自动检测是智能采茶系统中的一项关键技术。然而,复杂的环境和茶芽与背景的高度视觉相似性等挑战,经常导致误检和漏检,尤其是对小茶芽。针对这些问题,本文提出了一种基于YOLOv11目标检测框架的自动TBD方法,命名为TBD- y。首先,提出了一种目标-空间注意(SOSA)机制,该机制融合了已提出的局部上下文注意(LCA)机制,增强了空间和区域两个维度的特征;它可以使网络更加集中在茶芽区域,抑制背景噪声的干扰。其次,设计了全局-局部注意引导特征融合(GAGFF)策略。它由两个分支组成:一个分支增强包含丰富全局语义信息的低分辨率高级特征,而另一个分支增强保留低级视觉细节的高分辨率特征。这两个分支的融合提高了特征的表示能力。将SOSA和GAGFF集成到YOLOv11框架中,构建了具有不同参数尺度的TBD模型的三个变体,分别为TBD- y - l、TBD- y - m和TBD- y - s。在自建的TBD数据集和公开的全球麦穗数据集2021 (GWHD_2021)上的实验结果表明,所提出的TBD- y - l方法优于现有方法,具有较高的检测精度。此外,与YOLOv11-L相比,TBD-Y-S模型具有更高的检测精度,同时保持更低的模型参数和计算复杂度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TBD-Y: Automatic tea bud detection with synergistic object-spatial attention and global-local attention guided feature fusion
Automatic Tea Bud Detection (TBD) is a critical technology in intelligent tea-picking systems. Nevertheless, challenges, such as complex environments and the high visual similarity between tea buds and backgrounds, frequently result in false detection and missed detection, especially for small tea buds. To address these issues, in this paper, an automatic TBD method is proposed, which is built upon the YOLOv11 object detection framework, named TBD-Y. Firstly, a Synergistic Object-Spatial Attention (SOSA) mechanism is proposed, which incorporates the proposed Local Context Attention (LCA) mechanism to enhance the features in both spatial and regional dimensions. It enables the network to focus more on the tea bud regions, and suppress the interference from background noise. Secondly, a Global-local Attention Guided Feature Fusion (GAGFF) strategy is designed. It consists of two branches: one branch enhances low-resolution, high-level features containing rich global semantic information, while the other branch strengthens high-resolution features that preserve low-level visual details. The fusion of these two branches improves the representation capability of the features. The SOSA and GAGFF are integrated into the YOLOv11 framework, constructing three variants of the TBD model with different parameter scales, named TBD-Y-L, TBD-Y-M, and TBD-Y-S. Experimental results on the self-built TBD dataset and the publicly available Global Wheat Head Dataset 2021 (GWHD_2021) demonstrate that the proposed TBD-Y-L outperforms the existing methods, achieving superior detection accuracy. Furthermore, the TBD-Y-S model exhibits improved detection accuracy compared to YOLOv11-L, while maintaining lower model parameters and computational complexity.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信