Lightweight Wheat Spike Detection Method Based on Activation and Loss Function Enhancements for YOLOv5s

Agronomy Pub Date : 2024-09-06 DOI:10.3390/agronomy14092036

Jingsong Li, Feijie Dai, Haiming Qian, Linsheng Huang, Jinling Zhao

{"title":"Lightweight Wheat Spike Detection Method Based on Activation and Loss Function Enhancements for YOLOv5s","authors":"Jingsong Li, Feijie Dai, Haiming Qian, Linsheng Huang, Jinling Zhao","doi":"10.3390/agronomy14092036","DOIUrl":null,"url":null,"abstract":"Wheat spike count is one of the critical indicators for assessing the growth and yield of wheat. However, illumination variations, mutual occlusion, and background interference have greatly affected wheat spike detection. A lightweight detection method was proposed based on the YOLOv5s. Initially, the original YOLOv5s was improved by combing the additional small-scale detection layer and integrating the ECA (Efficient Channel Attention) attention mechanism into all C3 modules (YOLOv5s + 4 + ECAC3). After comparing GhostNet, ShuffleNetV2, and MobileNetV3, the GhostNet architecture was finally selected as the optimal lightweight model framework based on its superior performance in various evaluations. Subsequently, the incorporation of five different activation functions into the network led to the identification of the RReLU (Randomized Leaky ReLU) activation function as the most effective in augmenting the network’s performance. Ultimately, the network’s loss function of CIoU (Complete Intersection over Union) was optimized using the EIoU (Efficient Intersection over Union) loss function. Despite a minor reduction of 2.17% in accuracy for the refined YOLOv5s + 4 + ECAC3 + G + RR + E network when compared to the YOLOv5s + 4 + ECAC3, there was a marginal improvement of 0.77% over the original YOLOv5s. Furthermore, the parameter count was diminished by 32% and 28.2% relative to the YOLOv5s + 4 + ECAC3 and YOLOv5s, respectively. The model size was reduced by 28.0% and 20%, and the Giga Floating-point Operations Per Second (GFLOPs) were lowered by 33.2% and 9.5%, respectively, signifying a substantial improvement in the network’s efficiency without significantly compromising accuracy. This study offers a methodological reference for the rapid and accurate detection of agricultural objects through the enhancement of a deep learning network.","PeriodicalId":7601,"journal":{"name":"Agronomy","volume":"5 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agronomy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/agronomy14092036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Wheat spike count is one of the critical indicators for assessing the growth and yield of wheat. However, illumination variations, mutual occlusion, and background interference have greatly affected wheat spike detection. A lightweight detection method was proposed based on the YOLOv5s. Initially, the original YOLOv5s was improved by combing the additional small-scale detection layer and integrating the ECA (Efficient Channel Attention) attention mechanism into all C3 modules (YOLOv5s + 4 + ECAC3). After comparing GhostNet, ShuffleNetV2, and MobileNetV3, the GhostNet architecture was finally selected as the optimal lightweight model framework based on its superior performance in various evaluations. Subsequently, the incorporation of five different activation functions into the network led to the identification of the RReLU (Randomized Leaky ReLU) activation function as the most effective in augmenting the network’s performance. Ultimately, the network’s loss function of CIoU (Complete Intersection over Union) was optimized using the EIoU (Efficient Intersection over Union) loss function. Despite a minor reduction of 2.17% in accuracy for the refined YOLOv5s + 4 + ECAC3 + G + RR + E network when compared to the YOLOv5s + 4 + ECAC3, there was a marginal improvement of 0.77% over the original YOLOv5s. Furthermore, the parameter count was diminished by 32% and 28.2% relative to the YOLOv5s + 4 + ECAC3 and YOLOv5s, respectively. The model size was reduced by 28.0% and 20%, and the Giga Floating-point Operations Per Second (GFLOPs) were lowered by 33.2% and 9.5%, respectively, signifying a substantial improvement in the network’s efficiency without significantly compromising accuracy. This study offers a methodological reference for the rapid and accurate detection of agricultural objects through the enhancement of a deep learning network.

查看原文本刊更多论文

基于激活和损失函数增强的 YOLOv5s 轻量级麦穗检测方法

小麦穗数是评估小麦生长和产量的重要指标之一。然而，光照变化、相互遮挡和背景干扰极大地影响了小麦穗的检测。基于 YOLOv5s 提出了一种轻量级检测方法。起初，YOLOv5s 在原有基础上进行了改进，增加了小尺度检测层，并在所有 C3 模块（YOLOv5s + 4 + ECAC3）中集成了 ECA（Efficient Channel Attention）注意力机制。在对 GhostNet、ShuffleNetV2 和 MobileNetV3 进行比较后，基于 GhostNet 架构在各种评估中的优异表现，最终将其选为最佳轻量级模型框架。随后，在网络中加入了五种不同的激活函数，最终确定 RReLU（随机泄漏 ReLU）激活函数在增强网络性能方面最为有效。最终，使用 EIoU（高效交叉联合）损失函数对网络的 CIoU（完全交叉联合）损失函数进行了优化。尽管与 YOLOv5s + 4 + ECAC3 相比，改进后的 YOLOv5s + 4 + ECAC3 + G + RR + E 网络的准确率略微降低了 2.17%，但与最初的 YOLOv5s 相比，还是有了 0.77% 的微小改进。此外，与 YOLOv5s + 4 + ECAC3 和 YOLOv5s 相比，参数数量分别减少了 32% 和 28.2%。模型大小分别减少了 28.0% 和 20%，每秒千兆浮点运算次数（GFLOPs）分别降低了 33.2% 和 9.5%，这表明网络的效率有了大幅提高，而精度却没有受到明显影响。这项研究为通过增强深度学习网络快速、准确地检测农业物体提供了方法参考。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Agronomy

自引率

0.00%

发文量