{"title":"Positive Anchor Area Merge Algorithm: A Knowledge Distillation Algorithm for Fruit Detection Tasks Based on Yolov8","authors":"Xiangqun Shi;Xian Zhang;Yifan Su;Xun Zhang","doi":"10.1109/ACCESS.2025.3544361","DOIUrl":null,"url":null,"abstract":"In the agricultural sector, employing machine vision technology for fruit target detection holds significant research importance and broad application prospects, such as enabling fruit growth monitoring, yield prediction, and fruit sorting. The Yolov8 model, as the latest model in the field of object detection, boasts advantages including high execution efficiency and detection accuracy. However, when it comes to fruit object detection, which means counting and locating target fruits in an image, the performance of the Yolov8 model shows a noticeable decline compared to its performance on the standard COCO dataset. To address this issue, knowledge distillation is a highly versatile method that uses a large teacher model to guide the training of a smaller student model, thereby improving the detection accuracy of the student model. This thesis proposes a Yolov8 knowledge distillation method tailored for fruit recognition tasks, which improves the network through knowledge distillation and implements a knowledge distillation method based on positive anchor area merging to enhance detection accuracy for fruit recognition tasks. On our self-constructed fruit dataset, which contains over 3,000 images for each category, we compared our model with other similar state-of-the-art models in terms of resource consumption and detection accuracy. While maintaining a low resource overhead, our model achieved an mAP(50) of 99.47%, which is higher than other models that range from 99.1% to 99.3%. In the ablation experiments, we also demonstrated the practical significance of dividing the positive sample area. Finally, we deployed the model on an embedded system for real-time detection of on-site images. These experiments illustrate the practicality of our method for recognizing fruits in real-world scenarios.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"34954-34968"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10897963","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10897963/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In the agricultural sector, employing machine vision technology for fruit target detection holds significant research importance and broad application prospects, such as enabling fruit growth monitoring, yield prediction, and fruit sorting. The Yolov8 model, as the latest model in the field of object detection, boasts advantages including high execution efficiency and detection accuracy. However, when it comes to fruit object detection, which means counting and locating target fruits in an image, the performance of the Yolov8 model shows a noticeable decline compared to its performance on the standard COCO dataset. To address this issue, knowledge distillation is a highly versatile method that uses a large teacher model to guide the training of a smaller student model, thereby improving the detection accuracy of the student model. This thesis proposes a Yolov8 knowledge distillation method tailored for fruit recognition tasks, which improves the network through knowledge distillation and implements a knowledge distillation method based on positive anchor area merging to enhance detection accuracy for fruit recognition tasks. On our self-constructed fruit dataset, which contains over 3,000 images for each category, we compared our model with other similar state-of-the-art models in terms of resource consumption and detection accuracy. While maintaining a low resource overhead, our model achieved an mAP(50) of 99.47%, which is higher than other models that range from 99.1% to 99.3%. In the ablation experiments, we also demonstrated the practical significance of dividing the positive sample area. Finally, we deployed the model on an embedded system for real-time detection of on-site images. These experiments illustrate the practicality of our method for recognizing fruits in real-world scenarios.
IEEE AccessCOMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍:
IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest.
IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on:
Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals.
Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering.
Development of new or improved fabrication or manufacturing techniques.
Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.