Haozhuang Liu, Wenjuan Gu, Wenbo Wang, Yang Zou, Hang Yang, Tiangui Li
{"title":"Persimmon fruit detection in complex scenes based on PerD-YOLOv8","authors":"Haozhuang Liu, Wenjuan Gu, Wenbo Wang, Yang Zou, Hang Yang, Tiangui Li","doi":"10.1007/s11694-025-03268-9","DOIUrl":null,"url":null,"abstract":"<div><p>Smart harvesting of persimmon fruits is a critical component in advancing its production chain, with the primary challenge being the real-time and accurate detection of fruit. However, existing computer vision methods still struggle to detect persimmon fruits in complex scenes, such as those with complex backgrounds, small target fruits, leaf occlusion and overlapping fruits. In this paper, an improved YOLOv8n model PerD-YOLOv8 (persimmon fruit detection-YOLOv8n) was developed to exploit the detection accuracy of persimmon fruit. Firstly, FasterNet was selected as the backbone feature extraction network of YOLOv8n to improve the feature extraction ability of the model in complex background situations. Secondly, the P2 detection layer was added and fused with the bidirectional feature pyramid network (BiFPN) to replace the path aggregation network-feature pyramid networks (PAN-FPN) structure of YOLOv8n, to improve detection accuracy of small targets and reduce complexity. Lastly, the Wise-Intersection over Union (WIoU) loss function was introduced to optimise the training process, improving fruit localisation accuracy in case of leaf occlusion and fruit overlap. The experimental results show that the precision (P), recall (R), mAP@0.5, and mAP@0.5:0.95 of PerD-YOLOv8 reached 95.2%, 90.4%, 96.3%, and 84.0%, respectively, which displays noticeable advantage compared with Faster R-CNN, SSD, YOLOv3-Tiny, YOLOv4-Tiny, YOLOv5n, YOLOv6, YOLOv7, YOLOv8n, and RT-DETR. The model performs well in detecting persimmon fruits under complex scenarios, which could provide technical support for the development of persimmon picking robots.</p></div>","PeriodicalId":631,"journal":{"name":"Journal of Food Measurement and Characterization","volume":"19 7","pages":"4543 - 4560"},"PeriodicalIF":3.3000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Measurement and Characterization","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1007/s11694-025-03268-9","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Smart harvesting of persimmon fruits is a critical component in advancing its production chain, with the primary challenge being the real-time and accurate detection of fruit. However, existing computer vision methods still struggle to detect persimmon fruits in complex scenes, such as those with complex backgrounds, small target fruits, leaf occlusion and overlapping fruits. In this paper, an improved YOLOv8n model PerD-YOLOv8 (persimmon fruit detection-YOLOv8n) was developed to exploit the detection accuracy of persimmon fruit. Firstly, FasterNet was selected as the backbone feature extraction network of YOLOv8n to improve the feature extraction ability of the model in complex background situations. Secondly, the P2 detection layer was added and fused with the bidirectional feature pyramid network (BiFPN) to replace the path aggregation network-feature pyramid networks (PAN-FPN) structure of YOLOv8n, to improve detection accuracy of small targets and reduce complexity. Lastly, the Wise-Intersection over Union (WIoU) loss function was introduced to optimise the training process, improving fruit localisation accuracy in case of leaf occlusion and fruit overlap. The experimental results show that the precision (P), recall (R), mAP@0.5, and mAP@0.5:0.95 of PerD-YOLOv8 reached 95.2%, 90.4%, 96.3%, and 84.0%, respectively, which displays noticeable advantage compared with Faster R-CNN, SSD, YOLOv3-Tiny, YOLOv4-Tiny, YOLOv5n, YOLOv6, YOLOv7, YOLOv8n, and RT-DETR. The model performs well in detecting persimmon fruits under complex scenarios, which could provide technical support for the development of persimmon picking robots.
期刊介绍:
This interdisciplinary journal publishes new measurement results, characteristic properties, differentiating patterns, measurement methods and procedures for such purposes as food process innovation, product development, quality control, and safety assurance.
The journal encompasses all topics related to food property measurement and characterization, including all types of measured properties of food and food materials, features and patterns, measurement principles and techniques, development and evaluation of technologies, novel uses and applications, and industrial implementation of systems and procedures.