Reb-DINO:一种轻量化的结构参数化苹果园行人检测模型

IF 1.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Ruiyang Li, Ge Song, Shansong Wang, Qingtian Zeng, Guiyuan Yuan, Weijian Ni, Nengfu Xie, Fengjin Xiao
{"title":"Reb-DINO:一种轻量化的结构参数化苹果园行人检测模型","authors":"Ruiyang Li,&nbsp;Ge Song,&nbsp;Shansong Wang,&nbsp;Qingtian Zeng,&nbsp;Guiyuan Yuan,&nbsp;Weijian Ni,&nbsp;Nengfu Xie,&nbsp;Fengjin Xiao","doi":"10.1111/coin.70035","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in <span></span><math>\n <semantics>\n <mrow>\n <mtext>mAP</mtext>\n </mrow>\n <annotation>$$ \\mathrm{mAP} $$</annotation>\n </semantics></math> and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the <span></span><math>\n <semantics>\n <mrow>\n <mtext>mAP</mtext>\n </mrow>\n <annotation>$$ \\mathrm{mAP} $$</annotation>\n </semantics></math> with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.</p>\n </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 2","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reb-DINO: A Lightweight Pedestrian Detection Model With Structural Re-Parameterization in Apple Orchard\",\"authors\":\"Ruiyang Li,&nbsp;Ge Song,&nbsp;Shansong Wang,&nbsp;Qingtian Zeng,&nbsp;Guiyuan Yuan,&nbsp;Weijian Ni,&nbsp;Nengfu Xie,&nbsp;Fengjin Xiao\",\"doi\":\"10.1111/coin.70035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in <span></span><math>\\n <semantics>\\n <mrow>\\n <mtext>mAP</mtext>\\n </mrow>\\n <annotation>$$ \\\\mathrm{mAP} $$</annotation>\\n </semantics></math> and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the <span></span><math>\\n <semantics>\\n <mrow>\\n <mtext>mAP</mtext>\\n </mrow>\\n <annotation>$$ \\\\mathrm{mAP} $$</annotation>\\n </semantics></math> with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.</p>\\n </div>\",\"PeriodicalId\":55228,\"journal\":{\"name\":\"Computational Intelligence\",\"volume\":\"41 2\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2025-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/coin.70035\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.70035","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

在农业环境中,行人检测是保证智能机械安全运行的关键。在果园中,行人表现出不可预测的行为,可能对导航和操作构成重大挑战。这需要可靠的检测技术,以确保安全,同时解决果园环境的独特挑战,如茂密的树叶、不平坦的地形和不同的照明条件。为了解决这个问题,我们提出了一个基于改进DINO的鲁棒准确的果园行人检测模型ReB-DINO。首先,我们利用结构重参数化改进了DINO的特征提取模块,提高了模型在训练和推理解耦过程中的精度和速度。此外,采用渐进式特征融合模块对提取的特征进行融合,提高模型精度。最后,该网络结合了卷积块注意机制和改进的损失函数来提高行人检测率。实验结果表明,该方法的精度为1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in mAP $$ \mathrm{mAP} $$ and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the mAP $$ \mathrm{mAP} $$ with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reb-DINO: A Lightweight Pedestrian Detection Model With Structural Re-Parameterization in Apple Orchard

Pedestrian detection is crucial in agricultural environments to ensure the safe operation of intelligent machinery. In orchards, pedestrians exhibit unpredictable behavior and can pose significant challenges to navigation and operation. This demands reliable detection technologies that ensures safety while addressing the unique challenges of orchard environments, such as dense foliage, uneven terrain, and varying lighting conditions. To address this, we propose ReB-DINO, a robust and accurate orchard pedestrian detection model based on an improved DINO. Initially, we improve the feature extraction module of DINO using structural re-parameterization, enhancing accuracy and speed of the model during training and inference decoupling. In addition, a progressive feature fusion module is employed to fuse the extracted features and improve model accuracy. Finally, the network incorporates a convolutional block attention mechanism and an improved loss function to improve pedestrian detection rates. The experimental results demonstrate a 1.6% improvement in Recall on the NREC dataset compared to the baseline. Moreover, the results show a 4.2% improvement in mAP $$ \mathrm{mAP} $$ and the number of parameters decreases by 40.2% compared to the original DINO. In the PiFO dataset, the mAP $$ \mathrm{mAP} $$ with a threshold of 0.5 reaches 99.4%, demonstrating high detection accuracy in realistic scenarios. Therefore, our model enhances both detection accuracy and real-time object detection capabilities in apple orchards, maintaining a lightweight attributes, surpassing mainstream object detection models.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computational Intelligence
Computational Intelligence 工程技术-计算机:人工智能
CiteScore
6.90
自引率
3.60%
发文量
65
审稿时长
>12 weeks
期刊介绍: This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信