SP-YOLO: A Real-Time and Efficient Multi-Scale Model for Pest Detection in Sugar Beet Fields
Ke Tang, Yurong Qian, Hualong Dong, Yuning Huang, Yi Lu, Palidan Tuerxun, Qin Li
Insects, vol. 16, no. 1. Published 2025-01-20. Journal Article.
DOI: https://doi.org/10.3390/insects16010102
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11766181/pdf/
Impact factor: 2.7 · JCR: Q1 (Entomology)
Citations: 0
Abstract
Beet crops are highly vulnerable to pest infestations throughout their growth cycle, which significantly affects crop development and yield. Timely and accurate pest identification is crucial for implementing effective control measures. Current pest detection tasks face two primary challenges: first, pests frequently blend into their environment due to similar colors, making it difficult to capture distinguishing features in the field; second, pest images exhibit scale variations under different viewing angles, lighting conditions, and distances, which complicates the detection process. This study constructed the BeetPest dataset, a multi-scale pest dataset for beets in complex backgrounds, and proposed the SP-YOLO model, which is an improved real-time detection model based on YOLO11. The model integrates a CNN and transformer (CAT) into the backbone network to capture global features. The lightweight depthwise separable convolution block (DSCB) module is designed to extract multi-scale features and enlarge the receptive field. The neck utilizes the cross-layer path aggregation network (CLPAN) module, further merging low-level and high-level features. SP-YOLO effectively differentiates between the background and target, excelling in handling scale variations in pest images. In comparison with the original YOLO11 model, SP-YOLO shows a 4.9% improvement in mean average precision (mAP@50), a 9.9% increase in precision, and a 1.3% rise in average recall. Furthermore, SP-YOLO achieves a detection speed of 136 frames per second (FPS), meeting real-time pest detection requirements. The model demonstrates remarkable robustness on other pest datasets while maintaining a manageable parameter size and computational complexity suitable for edge devices.
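The abstract reports precision, recall, and mAP@50, where "@50" means a predicted box counts as correct only if its intersection over union (IoU) with a ground-truth box is at least 0.5. The following is a minimal sketch of that matching rule, not the authors' evaluation code: the function names `iou` and `precision_recall`, the `(x1, y1, x2, y2)` box format, and the greedy one-to-one matching are assumptions made for illustration, and the sketch handles a single class and a single image only.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_recall(
    preds: List[Box], truths: List[Box], thr: float = 0.5
) -> Tuple[float, float]:
    """Greedy one-to-one matching at a fixed IoU threshold.

    `preds` is assumed to be sorted by descending confidence, so
    higher-confidence predictions claim ground-truth boxes first.
    """
    matched = set()  # indices of ground-truth boxes already claimed
    tp = 0
    for p in preds:
        best_j, best_iou = -1, 0.0
        for j, t in enumerate(truths):
            if j in matched:
                continue
            v = iou(p, t)
            if v > best_iou:
                best_j, best_iou = j, v
        if best_j >= 0 and best_iou >= thr:
            matched.add(best_j)
            tp += 1
    fp = len(preds) - tp   # predictions with no matching ground truth
    fn = len(truths) - tp  # ground-truth boxes that were missed
    precision = tp / (tp + fp) if preds else 0.0
    recall = tp / (tp + fn) if truths else 0.0
    return precision, recall


if __name__ == "__main__":
    truths = [(0, 0, 10, 10), (20, 20, 30, 30)]
    preds = [(0, 0, 10, 10), (50, 50, 60, 60)]
    print(precision_recall(preds, truths))
```

In this toy input, one of the two predictions matches a ground-truth box and one of the two ground-truth boxes is found. mAP@50 goes a step further by averaging precision over recall levels as the confidence threshold is swept, and then averaging over classes; that step is omitted here.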
Insects
Subject area: Agricultural and Biological Sciences, Insect Science
CiteScore: 5.10
Self-citation rate: 10.00%
Articles published: 1013
Review time: 21.77 days
About the journal:
Insects (ISSN 2075-4450) is an international, peer-reviewed open access journal of entomology published by MDPI online quarterly. It publishes reviews, research papers and communications related to the biology, physiology and the behavior of insects and arthropods. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced. Electronic files regarding the full details of the experimental procedure, if unable to be published in a normal way, can be deposited as supplementary material.