MC-ShuffleNetV2: A lightweight model for maize disease recognition

IF 5 3区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Egyptian Informatics Journal Pub Date : 2024-07-06 DOI:10.1016/j.eij.2024.100503

Shaoqiu Zhu , Haitao Gao

{"title":"MC-ShuffleNetV2: A lightweight model for maize disease recognition","authors":"Shaoqiu Zhu , Haitao Gao","doi":"10.1016/j.eij.2024.100503","DOIUrl":null,"url":null,"abstract":"<div><p>Maize has a long history of cultivation and is renowned for its high yield, superior quality, and adaptability. Currently, maize holds a significant position in grain cultivation and occupies a significant place in the agricultural structure. However, maize is susceptible to various diseases during its growth process, which can have a significant impact on the quality and yield. Traditional machine learning is heavily reliant on feature extraction, whereas deep learning has demonstrated notable success in image recognition for computer vision.The use of bloated models and the resulting wastage of computational resources represent significant challenges. The paper proposes a lightweight model, MC-ShuffleNetV2 (Mish + Convolutional Block Attention Module + ShuffleNetV2), to meet the practical needs of convolutional neural networks in maize disease image recognition. The model has designed with a focus on network lightweighting and accurate feature extraction. The model was constructed upon the foundation of the high-performance ShuffleNetV2 1 × network. The Convolutional Block Attention Module was integrated into the network architecture to enhance the model’s adaptive expressiveness. The depthwise separable convolution kernel of the depth-separable module was modified from a 3 × 3 kernel to a 5 × 5 kernel. This modification was implemented with the objective of expanding the image receptive field and extracting more detailed features of the image. It was necessary to modify the activation function in each stage for Mish. The model was compressed through the application of pruning operations. In the maize disease dataset test, the accuracy of the test set recognition accuracy of the network model constructed in this paper reaches 99.86 %, the model parameters are only 873,936, and the FLOPs (Floating-point Operations) are only 1,751,286. Compared with LeNet, AlexNet, MobileNetV2, and EfficientNetV2 models, the MC-ShufflenetV2 model’s recognition ability and size have obvious advantages, and it is more conducive to the actual deployment of the agricultural mobile terminal.</p></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"27 ","pages":"Article 100503"},"PeriodicalIF":5.0000,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110866524000665/pdfft?md5=b6fc6748f084b0530dace13f462b152c&pid=1-s2.0-S1110866524000665-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866524000665","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Maize has a long history of cultivation and is renowned for its high yield, superior quality, and adaptability. Currently, maize holds a significant position in grain cultivation and occupies a significant place in the agricultural structure. However, maize is susceptible to various diseases during its growth process, which can have a significant impact on the quality and yield. Traditional machine learning is heavily reliant on feature extraction, whereas deep learning has demonstrated notable success in image recognition for computer vision.The use of bloated models and the resulting wastage of computational resources represent significant challenges. The paper proposes a lightweight model, MC-ShuffleNetV2 (Mish + Convolutional Block Attention Module + ShuffleNetV2), to meet the practical needs of convolutional neural networks in maize disease image recognition. The model has designed with a focus on network lightweighting and accurate feature extraction. The model was constructed upon the foundation of the high-performance ShuffleNetV2 1 × network. The Convolutional Block Attention Module was integrated into the network architecture to enhance the model’s adaptive expressiveness. The depthwise separable convolution kernel of the depth-separable module was modified from a 3 × 3 kernel to a 5 × 5 kernel. This modification was implemented with the objective of expanding the image receptive field and extracting more detailed features of the image. It was necessary to modify the activation function in each stage for Mish. The model was compressed through the application of pruning operations. In the maize disease dataset test, the accuracy of the test set recognition accuracy of the network model constructed in this paper reaches 99.86 %, the model parameters are only 873,936, and the FLOPs (Floating-point Operations) are only 1,751,286. Compared with LeNet, AlexNet, MobileNetV2, and EfficientNetV2 models, the MC-ShufflenetV2 model’s recognition ability and size have obvious advantages, and it is more conducive to the actual deployment of the agricultural mobile terminal.

查看原文本刊更多论文

MC-ShuffleNetV2：玉米病害识别的轻量级模型

玉米种植历史悠久，以高产、优质和适应性强而闻名。目前，玉米在粮食种植中占有重要地位，在农业结构中占据重要位置。然而，玉米在生长过程中容易受到各种病害的侵袭，对玉米的品质和产量产生重大影响。传统的机器学习严重依赖于特征提取，而深度学习在计算机视觉的图像识别方面取得了显著的成功。本文提出了一种轻量级模型 MC-ShuffleNetV2（Mish + 卷积块注意力模块 + ShuffleNetV2），以满足卷积神经网络在玉米病害图像识别中的实际需求。该模型的设计侧重于网络轻量化和精确特征提取。该模型是在高性能 ShuffleNetV2 1 × 网络的基础上构建的。网络架构中集成了卷积块注意力模块，以增强模型的自适应表达能力。深度可分模块的深度可分卷积核从 3 × 3 核修改为 5 × 5 核。实施这一修改的目的是扩大图像感受野，提取图像中更多的细节特征。有必要修改 Mish 每个阶段的激活函数。通过剪枝操作对模型进行了压缩。在玉米病害数据集测试中，本文构建的网络模型的测试集识别准确率达到 99.86 %，模型参数仅为 873 936，FLOP（浮点运算）仅为 1 751 286。与 LeNet、AlexNet、MobileNetV2 和 EfficientNetV2 模型相比，MC-ShufflenetV2 模型的识别能力和规模具有明显优势，更有利于农业移动终端的实际部署。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Egyptian Informatics Journal Decision Sciences-Management Science and Operations Research

CiteScore

11.10

自引率

1.90%

发文量

审稿时长

110 days

期刊介绍： The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.