YOLO-TBD: Tea Bud Detection with Triple-Branch Attention Mechanism and Self-Correction Group Convolution
Zhongyuan Liu, Li Zhuo, Chunwang Dong, Jiafeng Li
Industrial Crops and Products, Volume 226, Article 120607. Published 2025-02-01. Impact factor: 5.6 (JCR Q1, Agricultural Engineering).
DOI: 10.1016/j.indcrop.2025.120607
https://www.sciencedirect.com/science/article/pii/S0926669025001530
Citations: 0
Abstract
Automatic Tea Bud Detection (TBD) is one of the core technologies in intelligent tea-picking systems. Because tea buds are small, dense, and heavily overlapped, and their colors are close to the background, accurate detection is challenging. This paper proposes a tea bud detection method, named YOLO-TBD, that adopts YOLOv8 as its basic framework. First, the Path Aggregation Feature Pyramid Network (PAFPN) in YOLOv8 is improved by incorporating the features from the 2nd layer. This modification makes better use of low-level features, such as texture and color information, thereby enhancing the network’s feature representation ability. Second, a Triple-Branch Attention Mechanism (TBAM) is designed and integrated into the output of the backbone network and the C2f module. This attention mechanism strengthens the features of the tea bud objects and suppresses background noise through feature channel interactions, without increasing the number of model parameters. Finally, a Self-Correction Group Convolution (SCGC) is proposed to replace the conventional convolution in the C2f module. This convolution establishes long-range spatial and channel dependencies around each spatial position, enabling a larger receptive field and better capture of contextual information with fewer parameters, thereby reducing false detections and missed detections of tea bud objects. The proposed modules are integrated into the YOLOv8 network architecture to build three detection models with different parameter counts: YOLO-TBD-L, YOLO-TBD-M, and YOLO-TBD-S. Experimental results on our self-built tea bud detection dataset and the publicly available GWHD_2021 dataset show that, compared with current methods, YOLO-TBD-L attains state-of-the-art accuracy, with mAP reaching 87.04% and 94.5%, respectively. In addition, the YOLO-TBD-S model achieves detection accuracy comparable to that of the YOLOv8-L model with far fewer parameters and lower computational complexity.
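The abstract attributes part of the parameter savings to group convolution. The sketch below is not the authors' SCGC module, whose internals the abstract does not describe; it only illustrates the general reason a grouped convolution needs fewer weights than a standard one. The helper name `conv_params` and the channel sizes are assumptions chosen for illustration.

```python
def conv_params(c_in: int, c_out: int, k: int, groups: int = 1, bias: bool = False) -> int:
    """Count the learnable parameters of a 2D convolution with a k x k kernel.

    With `groups` > 1, each output channel only sees c_in / groups input
    channels, so the weight count shrinks by a factor of `groups`.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("c_in and c_out must both be divisible by groups")
    weights = (c_in // groups) * c_out * k * k
    return weights + (c_out if bias else 0)


# Hypothetical layer sizes, not taken from the paper.
standard = conv_params(256, 256, 3)            # ordinary 3x3 convolution
grouped = conv_params(256, 256, 3, groups=4)   # same layer split into 4 groups

print(standard, grouped, standard // grouped)
```

For these illustrative sizes the grouped layer holds one quarter of the weights of the standard layer. Plain grouping also removes interaction between channel groups, which is presumably why the paper pairs it with a mechanism for modelling channel and spatial dependencies.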
About the journal:
Industrial Crops and Products is an international journal publishing academic and industrial research on industrial (defined as non-food/non-feed) crops and products. Papers cover both crop-oriented research and research on bio-based materials derived from crops. They should be of interest to an international audience, be hypothesis-driven, and include statistical analysis wherever comparisons are made.