Haozhuang Liu, Wenjuan Gu, Wenbo Wang, Yang Zou, Hang Yang, Tiangui Li
{"title":"基于PerD-YOLOv8的复杂场景柿子果实检测","authors":"Haozhuang Liu, Wenjuan Gu, Wenbo Wang, Yang Zou, Hang Yang, Tiangui Li","doi":"10.1007/s11694-025-03268-9","DOIUrl":null,"url":null,"abstract":"<div><p>Smart harvesting of persimmon fruits is a critical component in advancing its production chain, with the primary challenge being the real-time and accurate detection of fruit. However, existing computer vision methods still struggle to detect persimmon fruits in complex scenes, such as those with complex backgrounds, small target fruits, leaf occlusion and overlapping fruits. In this paper, an improved YOLOv8n model PerD-YOLOv8 (persimmon fruit detection-YOLOv8n) was developed to exploit the detection accuracy of persimmon fruit. Firstly, FasterNet was selected as the backbone feature extraction network of YOLOv8n to improve the feature extraction ability of the model in complex background situations. Secondly, the P2 detection layer was added and fused with the bidirectional feature pyramid network (BiFPN) to replace the path aggregation network-feature pyramid networks (PAN-FPN) structure of YOLOv8n, to improve detection accuracy of small targets and reduce complexity. Lastly, the Wise-Intersection over Union (WIoU) loss function was introduced to optimise the training process, improving fruit localisation accuracy in case of leaf occlusion and fruit overlap. The experimental results show that the precision (P), recall (R), mAP@0.5, and mAP@0.5:0.95 of PerD-YOLOv8 reached 95.2%, 90.4%, 96.3%, and 84.0%, respectively, which displays noticeable advantage compared with Faster R-CNN, SSD, YOLOv3-Tiny, YOLOv4-Tiny, YOLOv5n, YOLOv6, YOLOv7, YOLOv8n, and RT-DETR. The model performs well in detecting persimmon fruits under complex scenarios, which could provide technical support for the development of persimmon picking robots.</p></div>","PeriodicalId":631,"journal":{"name":"Journal of Food Measurement and Characterization","volume":"19 7","pages":"4543 - 4560"},"PeriodicalIF":3.3000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Persimmon fruit detection in complex scenes based on PerD-YOLOv8\",\"authors\":\"Haozhuang Liu, Wenjuan Gu, Wenbo Wang, Yang Zou, Hang Yang, Tiangui Li\",\"doi\":\"10.1007/s11694-025-03268-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Smart harvesting of persimmon fruits is a critical component in advancing its production chain, with the primary challenge being the real-time and accurate detection of fruit. However, existing computer vision methods still struggle to detect persimmon fruits in complex scenes, such as those with complex backgrounds, small target fruits, leaf occlusion and overlapping fruits. In this paper, an improved YOLOv8n model PerD-YOLOv8 (persimmon fruit detection-YOLOv8n) was developed to exploit the detection accuracy of persimmon fruit. Firstly, FasterNet was selected as the backbone feature extraction network of YOLOv8n to improve the feature extraction ability of the model in complex background situations. Secondly, the P2 detection layer was added and fused with the bidirectional feature pyramid network (BiFPN) to replace the path aggregation network-feature pyramid networks (PAN-FPN) structure of YOLOv8n, to improve detection accuracy of small targets and reduce complexity. Lastly, the Wise-Intersection over Union (WIoU) loss function was introduced to optimise the training process, improving fruit localisation accuracy in case of leaf occlusion and fruit overlap. The experimental results show that the precision (P), recall (R), mAP@0.5, and mAP@0.5:0.95 of PerD-YOLOv8 reached 95.2%, 90.4%, 96.3%, and 84.0%, respectively, which displays noticeable advantage compared with Faster R-CNN, SSD, YOLOv3-Tiny, YOLOv4-Tiny, YOLOv5n, YOLOv6, YOLOv7, YOLOv8n, and RT-DETR. The model performs well in detecting persimmon fruits under complex scenarios, which could provide technical support for the development of persimmon picking robots.</p></div>\",\"PeriodicalId\":631,\"journal\":{\"name\":\"Journal of Food Measurement and Characterization\",\"volume\":\"19 7\",\"pages\":\"4543 - 4560\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Food Measurement and Characterization\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11694-025-03268-9\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"FOOD SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Measurement and Characterization","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1007/s11694-025-03268-9","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
柿子果实的智能收获是推进其生产链的关键组成部分,其主要挑战是实时准确地检测果实。然而,现有的计算机视觉方法仍然难以在复杂的场景中检测到柿子果实,例如背景复杂、目标果实小、叶子遮挡和果实重叠等。为了提高柿子的检测精度,本文开发了一种改进的YOLOv8n模型PerD-YOLOv8(柿子果检测-YOLOv8n)。首先,选择FasterNet作为YOLOv8n的主干特征提取网络,提高模型在复杂背景情况下的特征提取能力;其次,加入P2检测层并与双向特征金字塔网络(BiFPN)融合,取代YOLOv8n的路径聚合网络-特征金字塔网络(PAN-FPN)结构,提高小目标检测精度,降低复杂度;最后,引入智慧交联损失函数(Wise-Intersection over Union, WIoU)对训练过程进行优化,提高了叶片遮挡和果实重叠情况下的果实定位精度。实验结果表明,与Faster R- cnn、SSD、YOLOv3-Tiny、YOLOv4-Tiny、YOLOv5n、YOLOv6、YOLOv7、YOLOv8n和RT-DETR相比,PerD-YOLOv8的准确率(P)、召回率(R)、mAP@0.5和mAP@0.5:0.95分别达到95.2%、90.4%、96.3%和84.0%,具有明显优势。该模型对复杂场景下的柿子果实检测效果良好,可为柿子采摘机器人的开发提供技术支持。
Persimmon fruit detection in complex scenes based on PerD-YOLOv8
Smart harvesting of persimmon fruits is a critical component in advancing its production chain, with the primary challenge being the real-time and accurate detection of fruit. However, existing computer vision methods still struggle to detect persimmon fruits in complex scenes, such as those with complex backgrounds, small target fruits, leaf occlusion and overlapping fruits. In this paper, an improved YOLOv8n model PerD-YOLOv8 (persimmon fruit detection-YOLOv8n) was developed to exploit the detection accuracy of persimmon fruit. Firstly, FasterNet was selected as the backbone feature extraction network of YOLOv8n to improve the feature extraction ability of the model in complex background situations. Secondly, the P2 detection layer was added and fused with the bidirectional feature pyramid network (BiFPN) to replace the path aggregation network-feature pyramid networks (PAN-FPN) structure of YOLOv8n, to improve detection accuracy of small targets and reduce complexity. Lastly, the Wise-Intersection over Union (WIoU) loss function was introduced to optimise the training process, improving fruit localisation accuracy in case of leaf occlusion and fruit overlap. The experimental results show that the precision (P), recall (R), mAP@0.5, and mAP@0.5:0.95 of PerD-YOLOv8 reached 95.2%, 90.4%, 96.3%, and 84.0%, respectively, which displays noticeable advantage compared with Faster R-CNN, SSD, YOLOv3-Tiny, YOLOv4-Tiny, YOLOv5n, YOLOv6, YOLOv7, YOLOv8n, and RT-DETR. The model performs well in detecting persimmon fruits under complex scenarios, which could provide technical support for the development of persimmon picking robots.
期刊介绍:
This interdisciplinary journal publishes new measurement results, characteristic properties, differentiating patterns, measurement methods and procedures for such purposes as food process innovation, product development, quality control, and safety assurance.
The journal encompasses all topics related to food property measurement and characterization, including all types of measured properties of food and food materials, features and patterns, measurement principles and techniques, development and evaluation of technologies, novel uses and applications, and industrial implementation of systems and procedures.