{"title":"Reb-DINO:一种轻量化的结构参数化苹果园行人检测模型","authors":"Ruiyang Li, Ge Song, Shansong Wang, Qingtian Zeng, Guiyuan Yuan, Weijian Ni, Nengfu Xie, Fengjin Xiao","doi":"10.1111/coin.70035","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in <span></span><math>\n <semantics>\n <mrow>\n <mtext>mAP</mtext>\n </mrow>\n <annotation>$$ \\mathrm{mAP} $$</annotation>\n </semantics></math> and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the <span></span><math>\n <semantics>\n <mrow>\n <mtext>mAP</mtext>\n </mrow>\n <annotation>$$ \\mathrm{mAP} $$</annotation>\n </semantics></math> with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.</p>\n </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 2","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reb-DINO: A Lightweight Pedestrian Detection Model With Structural Re-Parameterization in Apple Orchard\",\"authors\":\"Ruiyang Li, Ge Song, Shansong Wang, Qingtian Zeng, Guiyuan Yuan, Weijian Ni, Nengfu Xie, Fengjin Xiao\",\"doi\":\"10.1111/coin.70035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in <span></span><math>\\n <semantics>\\n <mrow>\\n <mtext>mAP</mtext>\\n </mrow>\\n <annotation>$$ \\\\mathrm{mAP} $$</annotation>\\n </semantics></math> and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the <span></span><math>\\n <semantics>\\n <mrow>\\n <mtext>mAP</mtext>\\n </mrow>\\n <annotation>$$ \\\\mathrm{mAP} $$</annotation>\\n </semantics></math> with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.</p>\\n </div>\",\"PeriodicalId\":55228,\"journal\":{\"name\":\"Computational Intelligence\",\"volume\":\"41 2\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2025-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/coin.70035\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.70035","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
在农业环境中,行人检测是保证智能机械安全运行的关键。在果园中,行人表现出不可预测的行为,可能对导航和操作构成重大挑战。这需要可靠的检测技术,以确保安全,同时解决果园环境的独特挑战,如茂密的树叶、不平坦的地形和不同的照明条件。为了解决这个问题,我们提出了一个基于改进DINO的鲁棒准确的果园行人检测模型ReB-DINO。首先,我们利用结构重参数化改进了DINO的特征提取模块,提高了模型在训练和推理解耦过程中的精度和速度。此外,采用渐进式特征融合模块对提取的特征进行融合,提高模型精度。最后,该网络结合了卷积块注意机制和改进的损失函数来提高行人检测率。实验结果表明,该方法的精度为1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in mAP $$ \mathrm{mAP} $$ and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the mAP $$ \mathrm{mAP} $$ with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.
Reb-DINO: A Lightweight Pedestrian Detection Model With Structural Re-Parameterization in Apple Orchard
Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.
期刊介绍:
This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.