{"title":"Plant recognition of maize seedling stage in UAV remote sensing images based on H-RT-DETR.","authors":"Yunlong Wu, Shouqi Yuan, Lingdi Tang","doi":"10.1186/s13007-025-01382-9","DOIUrl":null,"url":null,"abstract":"<p><p>The real-time monitoring and counting of maize seed germination at seedling stage is of great significance for seed quality detection, field management and yield estimation. Traditional manual monitoring and counting is very time-consuming, cumbersome and error-prone. In order to quickly and accurately identify and count maize seedlings in a complex field environment, this study proposes an end-to-end maize seedling plant detection model H-RT-DETR (Hierarchical-Real-Time DEtection TRansformer) based on hierarchical feature extraction and RT-DETR (Real-Time DEtection TRansformer). H-RT-DETR uses Hierarchical Feature Representation and Efficient Self-Attention as the backbone network for feature extraction, thereby improving the network's ability to extract features of maize seedling stage in UAV remote sensing images. Through experiments on the UAV remote sensing data set of maize seedling stage, the mean Average Precision mAP0.5-0.95, mAP0.5 and mAP0.75 of the improved H-RT-DETR model reached 51.2%, 94.7% and 48.1%, respectively, and the Average Recall (AR) reached 68.5%. In order to verify the efficiency of the proposed method, H-RT-DETR is compared with the widely used and advanced target recognition methods. The results show that the detection accuracy of H-RT-DETR is better than that of the comparison methods. In terms of detection speed, the H-RT-DETR model does not require Non-Maximum Suppression (NMS) post-processing operations, the Frames Per Second (FPS) on the test dataset reaches 84f/s, which is 19,12,11 and 21 higher than that of YOLOv5, YOLOv7, YOLOv8 and YOLOX, respectively, under the same hardware environment. This model can provide technical support for real-time detection of maize seedlings under UAV remote sensing images in terms of both detection accuracy and speed (see https://github.com/wylSUGAR/H-RT-DETR for model implementation and results).</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"21 1","pages":"60"},"PeriodicalIF":4.4000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12079949/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13007-025-01382-9","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The real-time monitoring and counting of maize seed germination at seedling stage is of great significance for seed quality detection, field management and yield estimation. Traditional manual monitoring and counting is very time-consuming, cumbersome and error-prone. In order to quickly and accurately identify and count maize seedlings in a complex field environment, this study proposes an end-to-end maize seedling plant detection model H-RT-DETR (Hierarchical-Real-Time DEtection TRansformer) based on hierarchical feature extraction and RT-DETR (Real-Time DEtection TRansformer). H-RT-DETR uses Hierarchical Feature Representation and Efficient Self-Attention as the backbone network for feature extraction, thereby improving the network's ability to extract features of maize seedling stage in UAV remote sensing images. Through experiments on the UAV remote sensing data set of maize seedling stage, the mean Average Precision mAP0.5-0.95, mAP0.5 and mAP0.75 of the improved H-RT-DETR model reached 51.2%, 94.7% and 48.1%, respectively, and the Average Recall (AR) reached 68.5%. In order to verify the efficiency of the proposed method, H-RT-DETR is compared with the widely used and advanced target recognition methods. The results show that the detection accuracy of H-RT-DETR is better than that of the comparison methods. In terms of detection speed, the H-RT-DETR model does not require Non-Maximum Suppression (NMS) post-processing operations, the Frames Per Second (FPS) on the test dataset reaches 84f/s, which is 19,12,11 and 21 higher than that of YOLOv5, YOLOv7, YOLOv8 and YOLOX, respectively, under the same hardware environment. This model can provide technical support for real-time detection of maize seedlings under UAV remote sensing images in terms of both detection accuracy and speed (see https://github.com/wylSUGAR/H-RT-DETR for model implementation and results).
期刊介绍:
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence, and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.