DLV3-CRSNet在白菜和杂草语义分割中的性能及速度优化

IF 2.5 2区农林科学 Q1 AGRONOMY

Crop Protection Pub Date : 2025-04-24 DOI:10.1016/j.cropro.2025.107236

Xiaomei Gao , Gang Wang , Zihao Zhou , Jie Li , Kexin Song , Jiangtao Qi

{"title":"DLV3-CRSNet在白菜和杂草语义分割中的性能及速度优化","authors":"Xiaomei Gao , Gang Wang , Zihao Zhou , Jie Li , Kexin Song , Jiangtao Qi","doi":"10.1016/j.cropro.2025.107236","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and efficient recognition of crops and weeds in complex agricultural environments is crucial for promoting intelligent weed management and sustainable farming. Despite DeepLabv3+ showing robust semantic segmentation capabilities, its complex architecture and large number of parameters impede training efficiency and pose deployment difficulties, especially in resource-constrained farming environments. Additionally, it has suboptimal accuracy in segmenting small targets, which limits its ability to identify minor crops and weeds. To address these issues, we propose using ConvNeXt as its backbone, integrating the RepVgg structure, and applying the Sigmoid Linear Unit (SiLU) activation function based on the DeepLabv3+ model (DLV3-CRSNet). Experimental results show that DLV3-CRSNet outperforms DeepLabv3+ by achieving improvements of 23% in Mean Intersection over Union (MIoU), 21% in Mean Pixel Accuracy (MPA), 23% in Average Precision (AP), and 15% in Frames per Second (FPS). Additionally, Floating Point Operations per Second (FLOPS) and inference time (IT) are reduced by 13% and 12%. Compared with Pyramid Scene Parsing Network (PSPNet), Expectation-Maximization Attention Networks (EMANet), Fully Convolutional Networks (FCN), UNet, Efficient Neural Network (ENet), and Segmentation Network (SegNet), DLV3-CRSNet enhances MIoU, MPA, AP, and Accuracy by an average of 27%, 18%, 18%, and 3%, while reducing the number of parameters by 26.84 million (M). Field experiments further confirm that DLV3-CRSNet effectively distinguishes Chinese cabbage (<em>Brassica pekinensis</em> Rupr.) and weeds, achieving recognition rates of 95.31% and 93.6%, respectively.</div></div>","PeriodicalId":10785,"journal":{"name":"Crop Protection","volume":"195 ","pages":"Article 107236"},"PeriodicalIF":2.5000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance and speed optimization of DLV3-CRSNet for semantic segmentation of Chinese cabbage (Brassica pekinensis Rupr.) and weeds\",\"authors\":\"Xiaomei Gao , Gang Wang , Zihao Zhou , Jie Li , Kexin Song , Jiangtao Qi\",\"doi\":\"10.1016/j.cropro.2025.107236\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate and efficient recognition of crops and weeds in complex agricultural environments is crucial for promoting intelligent weed management and sustainable farming. Despite DeepLabv3+ showing robust semantic segmentation capabilities, its complex architecture and large number of parameters impede training efficiency and pose deployment difficulties, especially in resource-constrained farming environments. Additionally, it has suboptimal accuracy in segmenting small targets, which limits its ability to identify minor crops and weeds. To address these issues, we propose using ConvNeXt as its backbone, integrating the RepVgg structure, and applying the Sigmoid Linear Unit (SiLU) activation function based on the DeepLabv3+ model (DLV3-CRSNet). Experimental results show that DLV3-CRSNet outperforms DeepLabv3+ by achieving improvements of 23% in Mean Intersection over Union (MIoU), 21% in Mean Pixel Accuracy (MPA), 23% in Average Precision (AP), and 15% in Frames per Second (FPS). Additionally, Floating Point Operations per Second (FLOPS) and inference time (IT) are reduced by 13% and 12%. Compared with Pyramid Scene Parsing Network (PSPNet), Expectation-Maximization Attention Networks (EMANet), Fully Convolutional Networks (FCN), UNet, Efficient Neural Network (ENet), and Segmentation Network (SegNet), DLV3-CRSNet enhances MIoU, MPA, AP, and Accuracy by an average of 27%, 18%, 18%, and 3%, while reducing the number of parameters by 26.84 million (M). Field experiments further confirm that DLV3-CRSNet effectively distinguishes Chinese cabbage (<em>Brassica pekinensis</em> Rupr.) and weeds, achieving recognition rates of 95.31% and 93.6%, respectively.</div></div>\",\"PeriodicalId\":10785,\"journal\":{\"name\":\"Crop Protection\",\"volume\":\"195 \",\"pages\":\"Article 107236\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Crop Protection\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0261219425001280\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crop Protection","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0261219425001280","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}

引用次数: 0

摘要

在复杂的农业环境中准确、高效地识别作物和杂草，对于促进杂草智能管理和可持续农业至关重要。尽管DeepLabv3+具有强大的语义分割能力，但其复杂的架构和大量的参数阻碍了训练效率，并给部署带来困难，特别是在资源受限的农业环境中。此外，它在分割小目标时精度不理想，这限制了它识别小作物和杂草的能力。为了解决这些问题，我们提出以ConvNeXt为骨干，集成RepVgg结构，并应用基于DeepLabv3+模型（DLV3-CRSNet）的Sigmoid Linear Unit （SiLU）激活函数。实验结果表明，DLV3-CRSNet比DeepLabv3+在MIoU （Mean Intersection over Union）、MPA （Mean Pixel Accuracy）、AP （Average Precision）和FPS （FPS）方面分别提高了23%、21%、23%和15%。此外，每秒浮点运算（FLOPS）和推理时间（IT）分别减少了13%和12%。与金字塔场景解析网络（PSPNet）、期望最大化注意力网络（EMANet）、全卷积网络（FCN）、UNet、高效神经网络（ENet）和分割网络（SegNet）相比，DLV3-CRSNet的MIoU、MPA、AP和准确率平均提高了27%、18%、18%和3%。田间实验进一步证实，DLV3-CRSNet能有效识别大白菜（Brassica pekinensis Rupr.）和杂草，识别率分别达到95.31%和93.6%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Performance and speed optimization of DLV3-CRSNet for semantic segmentation of Chinese cabbage (Brassica pekinensis Rupr.) and weeds

查看原文本刊更多论文

Performance and speed optimization of DLV3-CRSNet for semantic segmentation of Chinese cabbage (Brassica pekinensis Rupr.) and weeds

Accurate and efficient recognition of crops and weeds in complex agricultural environments is crucial for promoting intelligent weed management and sustainable farming. Despite DeepLabv3+ showing robust semantic segmentation capabilities, its complex architecture and large number of parameters impede training efficiency and pose deployment difficulties, especially in resource-constrained farming environments. Additionally, it has suboptimal accuracy in segmenting small targets, which limits its ability to identify minor crops and weeds. To address these issues, we propose using ConvNeXt as its backbone, integrating the RepVgg structure, and applying the Sigmoid Linear Unit (SiLU) activation function based on the DeepLabv3+ model (DLV3-CRSNet). Experimental results show that DLV3-CRSNet outperforms DeepLabv3+ by achieving improvements of 23% in Mean Intersection over Union (MIoU), 21% in Mean Pixel Accuracy (MPA), 23% in Average Precision (AP), and 15% in Frames per Second (FPS). Additionally, Floating Point Operations per Second (FLOPS) and inference time (IT) are reduced by 13% and 12%. Compared with Pyramid Scene Parsing Network (PSPNet), Expectation-Maximization Attention Networks (EMANet), Fully Convolutional Networks (FCN), UNet, Efficient Neural Network (ENet), and Segmentation Network (SegNet), DLV3-CRSNet enhances MIoU, MPA, AP, and Accuracy by an average of 27%, 18%, 18%, and 3%, while reducing the number of parameters by 26.84 million (M). Field experiments further confirm that DLV3-CRSNet effectively distinguishes Chinese cabbage (Brassica pekinensis Rupr.) and weeds, achieving recognition rates of 95.31% and 93.6%, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Crop Protection 农林科学-农艺学

CiteScore

6.10

自引率

3.60%

发文量

200

审稿时长

29 days

期刊介绍： The Editors of Crop Protection especially welcome papers describing an interdisciplinary approach showing how different control strategies can be integrated into practical pest management programs, covering high and low input agricultural systems worldwide. Crop Protection particularly emphasizes the practical aspects of control in the field and for protected crops, and includes work which may lead in the near future to more effective control. The journal does not duplicate the many existing excellent biological science journals, which deal mainly with the more fundamental aspects of plant pathology, applied zoology and weed science. Crop Protection covers all practical aspects of pest, disease and weed control, including the following topics: -Abiotic damage- Agronomic control methods- Assessment of pest and disease damage- Molecular methods for the detection and assessment of pests and diseases- Biological control- Biorational pesticides- Control of animal pests of world crops- Control of diseases of crop plants caused by microorganisms- Control of weeds and integrated management- Economic considerations- Effects of plant growth regulators- Environmental benefits of reduced pesticide use- Environmental effects of pesticides- Epidemiology of pests and diseases in relation to control- GM Crops, and genetic engineering applications- Importance and control of postharvest crop losses- Integrated control- Interrelationships and compatibility among different control strategies- Invasive species as they relate to implications for crop protection- Pesticide application methods- Pest management- Phytobiomes for pest and disease control- Resistance management- Sampling and monitoring schemes for diseases, nematodes, pests and weeds.