Wenbin Tan , Li Zhang , Yiwang Huang , Kaibei Peng , Yanyun Qu
{"title":"M2DETR: A Multi-band Multi-scale Detection Transformer for Pest Detection","authors":"Wenbin Tan , Li Zhang , Yiwang Huang , Kaibei Peng , Yanyun Qu","doi":"10.1016/j.compag.2025.110325","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate pest detection is crucial for ensuring high crop yields. However, existing pest detection models often fail to effectively extract multi-scale features and suppress noise interference from pest images, leading to suboptimal detection performance. To address these limitations, we propose a Multi-band Multi-scale Detection Transformer (M2DETR) model. The core of M2DETR is a novel Multi-band Multi-scale Denoising (MMD) encoder, which integrates Multi-scale Feature Enhancement and Denoising (MFED) modules and Multi-band Multi-scale Downsampling and Denoising (MMDD) modules. Both modules are built upon Cross-scale Convolutional Attention Denoising (CCAD) blocks, designed to filter noise from input feature maps. This enables the MMD encoder to extract high-quality multi-band multi-scale features for the subsequent decoder. Extensive experiments on two public datasets demonstrate the superiority of M2DETR. On the COCO dataset, M2DETR achieves an average precision (AP) of 53.4% and an AP for small objects (AP<span><math><msub><mrow></mrow><mrow><mtext>S</mtext></mrow></msub></math></span>) of 35.6%, surpassing DINO-DETR by 2.5% and 1.0%, respectively. On the IP102 pest dataset, M2DETR outperforms YOLO-Pest by 5.6% in AP50 and exceeds RT-DETR by 1.7% and 1.2% in AP50 and AP, respectively. Moreover, M2DETR exhibits superior noise resistance compared to state-of-the-art models. Our code is available at <span><span>https://github.com/tanwb/M2DETR-master</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"235 ","pages":"Article 110325"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925004314","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate pest detection is crucial for ensuring high crop yields. However, existing pest detection models often fail to effectively extract multi-scale features and suppress noise interference from pest images, leading to suboptimal detection performance. To address these limitations, we propose a Multi-band Multi-scale Detection Transformer (M2DETR) model. The core of M2DETR is a novel Multi-band Multi-scale Denoising (MMD) encoder, which integrates Multi-scale Feature Enhancement and Denoising (MFED) modules and Multi-band Multi-scale Downsampling and Denoising (MMDD) modules. Both modules are built upon Cross-scale Convolutional Attention Denoising (CCAD) blocks, designed to filter noise from input feature maps. This enables the MMD encoder to extract high-quality multi-band multi-scale features for the subsequent decoder. Extensive experiments on two public datasets demonstrate the superiority of M2DETR. On the COCO dataset, M2DETR achieves an average precision (AP) of 53.4% and an AP for small objects (AP) of 35.6%, surpassing DINO-DETR by 2.5% and 1.0%, respectively. On the IP102 pest dataset, M2DETR outperforms YOLO-Pest by 5.6% in AP50 and exceeds RT-DETR by 1.7% and 1.2% in AP50 and AP, respectively. Moreover, M2DETR exhibits superior noise resistance compared to state-of-the-art models. Our code is available at https://github.com/tanwb/M2DETR-master.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.