Jiachun Wu , Jinlai Zhang , Jihong Zhu , Fengkun Wang , Binqiang Si , Yi Huang , Jiacheng Zhang , Hui Liu , Yanmei Meng
{"title":"Lightweight peach detection using partial convolution and improved Non-maximum suppression","authors":"Jiachun Wu , Jinlai Zhang , Jihong Zhu , Fengkun Wang , Binqiang Si , Yi Huang , Jiacheng Zhang , Hui Liu , Yanmei Meng","doi":"10.1016/j.jvcir.2025.104495","DOIUrl":null,"url":null,"abstract":"<div><div>The automation of fruit detection within the agricultural sector is an essential component for enhancing efficiency and ensuring high-quality produce. Among various crops, peaches present a unique challenge due to their soft texture, susceptibility to damage, and the common occurrence of overlapping fruits. These characteristics not only complicate manual inspection but also pose significant hurdles for automated detection systems. In this paper, we propose a new lightweight model for scenarios with a large number of overlapping peaches and low-quality samples in the training dataset. We refer to the proposed model as the Lightweight Peach Detector (LP-Det). Specifically, we proposed Inshape-IoU-Soft-Non-maximum suppression (ISIS-NMS) algorithm to refine the model’s precision and accuracy in detecting overlapping peaches, and proposed Partial Convolution Batch-normalization SiLU (PCBS) module to diminish the model’s size and expedite inference. Moreover, we introduced Wise-Intersection over Union v3 (WIoUv3) to mitigate the influence of low-quality samples on model training and enhance localization accuracy. Our method was assessed using a peach dataset acquired from a peach orchard, demonstrating superior performance compared with several state-of-the-art (SOTA) object detection models.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"111 ","pages":"Article 104495"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001099","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The automation of fruit detection within the agricultural sector is an essential component for enhancing efficiency and ensuring high-quality produce. Among various crops, peaches present a unique challenge due to their soft texture, susceptibility to damage, and the common occurrence of overlapping fruits. These characteristics not only complicate manual inspection but also pose significant hurdles for automated detection systems. In this paper, we propose a new lightweight model for scenarios with a large number of overlapping peaches and low-quality samples in the training dataset. We refer to the proposed model as the Lightweight Peach Detector (LP-Det). Specifically, we proposed Inshape-IoU-Soft-Non-maximum suppression (ISIS-NMS) algorithm to refine the model’s precision and accuracy in detecting overlapping peaches, and proposed Partial Convolution Batch-normalization SiLU (PCBS) module to diminish the model’s size and expedite inference. Moreover, we introduced Wise-Intersection over Union v3 (WIoUv3) to mitigate the influence of low-quality samples on model training and enhance localization accuracy. Our method was assessed using a peach dataset acquired from a peach orchard, demonstrating superior performance compared with several state-of-the-art (SOTA) object detection models.
农业部门的水果检测自动化是提高效率和确保高质量产品的重要组成部分。在各种作物中,桃子因其柔软的质地,易受损害,以及常见的重叠果实而呈现出独特的挑战。这些特点不仅使人工检测复杂化,而且对自动检测系统构成了重大障碍。在本文中,我们提出了一种新的轻量级模型,用于训练数据集中具有大量重叠桃子和低质量样本的场景。我们将提出的模型称为轻量级桃子检测器(LP-Det)。具体来说,我们提出了inshape - io - soft - non -maximum suppression (ISIS-NMS)算法来提高模型检测重叠桃的精度和准确性,提出了Partial Convolution Batch-normalization SiLU (PCBS)模块来减小模型的大小和加快推理速度。此外,我们引入了Wise-Intersection over Union v3 (WIoUv3)来减轻低质量样本对模型训练的影响,提高定位精度。使用从桃园获取的桃子数据集对我们的方法进行了评估,与几种最先进的(SOTA)目标检测模型相比,我们的方法表现出优越的性能。
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.