{"title":"Reb-DINO: A Lightweight Pedestrian Detection Model With Structural Re-Parameterization in Apple Orchard","authors":"Ruiyang Li, Ge Song, Shansong Wang, Qingtian Zeng, Guiyuan Yuan, Weijian Ni, Nengfu Xie, Fengjin Xiao","doi":"10.1111/coin.70035","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in <span></span><math>\n <semantics>\n <mrow>\n <mtext>mAP</mtext>\n </mrow>\n <annotation>$$ \\mathrm{mAP} $$</annotation>\n </semantics></math> and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the <span></span><math>\n <semantics>\n <mrow>\n <mtext>mAP</mtext>\n </mrow>\n <annotation>$$ \\mathrm{mAP} $$</annotation>\n </semantics></math> with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.</p>\n </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 2","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.70035","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.
期刊介绍:
This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.