Xin Xu, Haiyang Zhang, Jiangchuan Lu, Ziyi Guo, Juanjuan Zhang, Jibo Yue, Hongbo Qiao, Xinming Ma
{"title":"Lightweight deep neural network for contour detection and extraction of wheat spikes in complex field environments.","authors":"Xin Xu, Haiyang Zhang, Jiangchuan Lu, Ziyi Guo, Juanjuan Zhang, Jibo Yue, Hongbo Qiao, Xinming Ma","doi":"10.1186/s13007-025-01433-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Spikelet number, a core phenotypic parameter for wheat yield composition, requires precise estimation through accurate spike contour extraction and differentiation between grain surfaces and spikelet surfaces. However, technical challenges persist in precise spike segmentation under complex field backgrounds and morphological differentiation between grain/spikelet surfaces.</p><p><strong>Method: </strong>Building on two-year multi-angle wheat spike imagery, we propose an enhanced YOLOv9-LDS multi-scale object detection framework. The algorithm innovatively constructs a lightweight depthwise separable network (LDSNet) as backbone, balancing computational efficiency and accuracy through channel re-parameterization strategy; incorporates an Efficient Local Attention (ELA) module to build feature enhancement networks, and employs dual-path feature fusion mechanisms to strengthen edge texture responses, significantly improving discrimination of overlapping spikes and complex backgrounds. Further optimizes the loss function system by replacing traditional IoU with Scylla Intersection over Union (SIoU) metric, enhancing bounding box regression through dynamic focus factors, and adding high-resolution small-object detection layers to mitigate dense spikelet feature loss.</p><p><strong>Results: </strong>Independent test set validation shows the improved model achieves 83.9% contour integrity recognition rate and 92.4% mAP@0.5, exceeding baseline by 3.2 and 5.3% points respectively. Ablation studies confirm LDSNet-ELA integration reduces false positives by 27.6%, while the enhanced loss function system improves small-object recall by 19.4%.</p><p><strong>Conclusions: </strong>The proposed framework demonstrates superior performance in complex field scenarios with dense targets and dynamic illumination. The multi-scale feature synergy enhancement mechanism overcomes traditional models' limitations in detecting overlapping spikes. This method not only enables precise spike phenotyping but also provides robust algorithmic support for intelligent field spikelet counting systems, advancing translational applications in crop phenomics.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"21 1","pages":"116"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12372210/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13007-025-01433-1","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Spikelet number, a core phenotypic parameter for wheat yield composition, requires precise estimation through accurate spike contour extraction and differentiation between grain surfaces and spikelet surfaces. However, technical challenges persist in precise spike segmentation under complex field backgrounds and morphological differentiation between grain/spikelet surfaces.
Method: Building on two-year multi-angle wheat spike imagery, we propose an enhanced YOLOv9-LDS multi-scale object detection framework. The algorithm innovatively constructs a lightweight depthwise separable network (LDSNet) as backbone, balancing computational efficiency and accuracy through channel re-parameterization strategy; incorporates an Efficient Local Attention (ELA) module to build feature enhancement networks, and employs dual-path feature fusion mechanisms to strengthen edge texture responses, significantly improving discrimination of overlapping spikes and complex backgrounds. Further optimizes the loss function system by replacing traditional IoU with Scylla Intersection over Union (SIoU) metric, enhancing bounding box regression through dynamic focus factors, and adding high-resolution small-object detection layers to mitigate dense spikelet feature loss.
Results: Independent test set validation shows the improved model achieves 83.9% contour integrity recognition rate and 92.4% mAP@0.5, exceeding baseline by 3.2 and 5.3% points respectively. Ablation studies confirm LDSNet-ELA integration reduces false positives by 27.6%, while the enhanced loss function system improves small-object recall by 19.4%.
Conclusions: The proposed framework demonstrates superior performance in complex field scenarios with dense targets and dynamic illumination. The multi-scale feature synergy enhancement mechanism overcomes traditional models' limitations in detecting overlapping spikes. This method not only enables precise spike phenotyping but also provides robust algorithmic support for intelligent field spikelet counting systems, advancing translational applications in crop phenomics.
背景:小穗数是小麦产量组成的核心表型参数,需要通过精确的穗形提取和粒面与小穗面区分来精确估算。然而,在复杂的田间背景和籽粒/小穗表面的形态分化下,如何精确地分割穗状花序仍然存在技术上的挑战。方法:基于两年多角度小麦穗图像,提出了一种增强的YOLOv9-LDS多尺度目标检测框架。该算法创新性地构建了一个轻量级的深度可分离网络(LDSNet)作为主干,通过信道重参数化策略平衡计算效率和精度;采用高效局部注意(ELA)模块构建特征增强网络,采用双路径特征融合机制增强边缘纹理响应,显著提高了重叠尖峰和复杂背景的识别能力。进一步优化损失函数系统,用Scylla Intersection over Union (SIoU)度量取代传统的IoU,通过动态焦点因子增强边界盒回归,增加高分辨率小目标检测层以减轻密集小穗特征损失。结果:独立测试集验证表明,改进模型的轮廓完整性识别率达到83.9%,mAP@0.5达到92.4%,分别比基线提高3.2和5.3%。消融研究证实LDSNet-ELA集成减少了27.6%的误报,而增强的损失函数系统将小物体回忆率提高了19.4%。结论:所提出的框架在具有密集目标和动态照明的复杂野外场景中表现出优越的性能。多尺度特征协同增强机制克服了传统模型在检测重叠尖峰时的局限性。该方法不仅可以实现精确的穗型表型,还为智能田间穗型计数系统提供了强大的算法支持,促进了作物表型组学的转化应用。
期刊介绍:
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence, and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.