Ensemble Learning based on CNN and Transformer Models for Leaf Diseases Classification

Li-Hua Li, Radius Tanone
{"title":"Ensemble Learning based on CNN and Transformer Models for Leaf Diseases Classification","authors":"Li-Hua Li, Radius Tanone","doi":"10.1109/IMCOM60618.2024.10418393","DOIUrl":null,"url":null,"abstract":"Symptoms on the leaves are often the first indication of a plant disease. In order not to affect the process of crop production, farmers need to identify plant diseases on their leaves as quickly as possible. This problem has long been addressed by a variety of computational techniques, such as deep learning models. Today, many specialized deep learning models are built using Transformer or Convolution Neural Networks (CNN). However, the accuracy and performance of individual deep learning models depends on many factors, such as the number of parameters, training time, and the dataset used. Often a single model is not well suited to solving problems such as image classification of leaf diseases. This study proposes an ensemble learning based on CNN and Transformer models. The models used in this study are MobileNetV3, DenseNet201, ResNext50, Vision Transformer and Swin Transformer. The purpose of ensemble learning with these five models is to achieve accuracy and good performance through weighted voting such as hard voting and soft voting. The experimental findings indicate that the utilization of ensemble learning, employing a combination of five models, yields enhanced accuracy and performance in the classification of three distinct types of datasets: corn leaf diseases, grape leaf diseases, and potato leaf diseases. Our experiment also showed that the Vision Transformer model has higher accuracy compared to other models. To perform a detailed analysis, we use the Grad-CAM technique to visualize how all models use the gradient to create a classification score. The results of this experiment can be a recommendation for the agricultural sector so that they can be implemented as early as possible to address the problem of leaf diseases classification.","PeriodicalId":518057,"journal":{"name":"2024 18th International Conference on Ubiquitous Information Management and Communication (IMCOM)","volume":"64 2","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 18th International Conference on Ubiquitous Information Management and Communication (IMCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCOM60618.2024.10418393","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Symptoms on the leaves are often the first indication of a plant disease. In order not to affect the process of crop production, farmers need to identify plant diseases on their leaves as quickly as possible. This problem has long been addressed by a variety of computational techniques, such as deep learning models. Today, many specialized deep learning models are built using Transformer or Convolution Neural Networks (CNN). However, the accuracy and performance of individual deep learning models depends on many factors, such as the number of parameters, training time, and the dataset used. Often a single model is not well suited to solving problems such as image classification of leaf diseases. This study proposes an ensemble learning based on CNN and Transformer models. The models used in this study are MobileNetV3, DenseNet201, ResNext50, Vision Transformer and Swin Transformer. The purpose of ensemble learning with these five models is to achieve accuracy and good performance through weighted voting such as hard voting and soft voting. The experimental findings indicate that the utilization of ensemble learning, employing a combination of five models, yields enhanced accuracy and performance in the classification of three distinct types of datasets: corn leaf diseases, grape leaf diseases, and potato leaf diseases. Our experiment also showed that the Vision Transformer model has higher accuracy compared to other models. To perform a detailed analysis, we use the Grad-CAM technique to visualize how all models use the gradient to create a classification score. The results of this experiment can be a recommendation for the agricultural sector so that they can be implemented as early as possible to address the problem of leaf diseases classification.
基于 CNN 和变压器模型的集合学习用于叶病分类
叶片上的症状往往是植物病害的最初征兆。为了不影响作物生产进程,农民需要尽快识别叶片上的植物病害。长期以来,深度学习模型等各种计算技术一直在解决这一问题。如今,许多专门的深度学习模型都是利用变换器或卷积神经网络(CNN)构建的。然而,单个深度学习模型的准确性和性能取决于许多因素,如参数数量、训练时间和使用的数据集。通常情况下,单一模型并不能很好地解决叶片疾病图像分类等问题。本研究提出了一种基于 CNN 和 Transformer 模型的集合学习方法。本研究中使用的模型包括 MobileNetV3、DenseNet201、ResNext50、Vision Transformer 和 Swin Transformer。使用这五个模型进行集合学习的目的是通过加权投票(如硬投票和软投票)来实现准确性和良好性能。实验结果表明,在对玉米叶片病害、葡萄叶片病害和马铃薯叶片病害这三种不同类型的数据集进行分类时,采用五种模型组合的集合学习可以提高分类的准确性和性能。我们的实验还表明,与其他模型相比,Vision Transformer 模型具有更高的准确性。为了进行详细分析,我们使用了 Grad-CAM 技术来直观展示所有模型是如何利用梯度来创建分类分数的。本实验的结果可作为农业部门的建议,以便尽早实施,解决叶病分类问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信