Optimizing Soursop leaf disease classification with a lightweight ensemble model and explainable AI

IF 4.5 Q1 PLANT SCIENCES
Sumaya Mustofa, Shahrin Khan, Shahriar Ahmed Shovo, Yousuf Rayhan Emon, Md. Sadekur Rahman
{"title":"Optimizing Soursop leaf disease classification with a lightweight ensemble model and explainable AI","authors":"Sumaya Mustofa,&nbsp;Shahrin Khan,&nbsp;Shahriar Ahmed Shovo,&nbsp;Yousuf Rayhan Emon,&nbsp;Md. Sadekur Rahman","doi":"10.1016/j.cpb.2025.100526","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional deep-learning methods to detect plant leaf disease can be complex and time-consuming if image numbers and size increase. Moreover, complex deep learning networks take longer and require larger memory to produce results. However, feature extraction methods provide some advantages in such a scenario. Using heavy-weighted models to enhance accuracy without considering the long execution time is a drawback of research. A weighted model increases the time and space complexity of an experiment. Considering the mentioned limitations, this study proposes a lightweight model experimenting with six deep feature extraction models, five feature selection models, and four machine learning classifiers. During the experiment, a soft voting ensemble classifier was developed to remove a single classifier's limitations and the unstable performance of the standalone classifiers. After a rigorous experiment, the (ResNet101 – RFE – Ensemble Classifier) together formed the best performer Soursop Ensemble (S-Ensemble) model that obtained a test accuracy of 99.6 % with an execution time of 648.05 s, outperforming other models. The whole experimental analysis was performed on a primary Soursop leaf disease dataset with six classes containing 3838 images. Finally, the Explainable AI (XAI) model Local Interpretable Model-agnostic Explanations (LIME) is used to interpret the reasons behind the best-performer and lowest-performer models' performance. LIME visually highlights which leaf regions influence each prediction, helping users understand model behaviour and enhancing its practical usability in real-world agricultural settings. This research aims to assist farmers with detecting Soursop leaf disease with less execution time and offer researchers an in-depth preview of deep feature-based detection and classification technology to detect and classify diseases within a short training time.</div></div>","PeriodicalId":38090,"journal":{"name":"Current Plant Biology","volume":"43 ","pages":"Article 100526"},"PeriodicalIF":4.5000,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Plant Biology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214662825000945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional deep-learning methods to detect plant leaf disease can be complex and time-consuming if image numbers and size increase. Moreover, complex deep learning networks take longer and require larger memory to produce results. However, feature extraction methods provide some advantages in such a scenario. Using heavy-weighted models to enhance accuracy without considering the long execution time is a drawback of research. A weighted model increases the time and space complexity of an experiment. Considering the mentioned limitations, this study proposes a lightweight model experimenting with six deep feature extraction models, five feature selection models, and four machine learning classifiers. During the experiment, a soft voting ensemble classifier was developed to remove a single classifier's limitations and the unstable performance of the standalone classifiers. After a rigorous experiment, the (ResNet101 – RFE – Ensemble Classifier) together formed the best performer Soursop Ensemble (S-Ensemble) model that obtained a test accuracy of 99.6 % with an execution time of 648.05 s, outperforming other models. The whole experimental analysis was performed on a primary Soursop leaf disease dataset with six classes containing 3838 images. Finally, the Explainable AI (XAI) model Local Interpretable Model-agnostic Explanations (LIME) is used to interpret the reasons behind the best-performer and lowest-performer models' performance. LIME visually highlights which leaf regions influence each prediction, helping users understand model behaviour and enhancing its practical usability in real-world agricultural settings. This research aims to assist farmers with detecting Soursop leaf disease with less execution time and offer researchers an in-depth preview of deep feature-based detection and classification technology to detect and classify diseases within a short training time.
基于轻量级集成模型和可解释人工智能优化刺蒺藜叶病分类
当图像数量和大小增加时,传统的深度学习方法检测植物叶片病害可能会变得复杂且耗时。此外,复杂的深度学习网络需要更长的时间和更大的内存才能产生结果。然而,特征提取方法在这种情况下提供了一些优势。使用重权重模型来提高准确性而不考虑较长的执行时间是研究的一个缺点。加权模型增加了实验的时间和空间复杂性。考虑到上述局限性,本研究提出了一个轻量级模型,实验了6个深度特征提取模型、5个特征选择模型和4个机器学习分类器。在实验中,开发了一种软投票集成分类器,以消除单个分类器的局限性和独立分类器的不稳定性能。经过严格的实验,(ResNet101 - RFE -Ensemble Classifier)共同形成了性能最好的Soursop Ensemble (s -Ensemble)模型,其测试准确率为99.6 %,执行时间为648.05 s,优于其他模型。整个实验分析是在一个包含6个类3838张图像的刺蒺藜叶病初级数据集上进行的。最后,可解释AI (XAI)模型局部可解释模型不可知解释(LIME)用于解释性能最佳和性能最差模型性能背后的原因。LIME在视觉上突出显示了哪些叶片区域影响每个预测,帮助用户理解模型行为,并增强其在现实农业环境中的实际可用性。本研究旨在帮助农民以更少的执行时间检测番荔枝叶病,并为研究人员提供深度预览基于深度特征的检测分类技术,在较短的培训时间内检测和分类疾病。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Current Plant Biology
Current Plant Biology Agricultural and Biological Sciences-Plant Science
CiteScore
10.90
自引率
1.90%
发文量
32
审稿时长
50 days
期刊介绍: Current Plant Biology aims to acknowledge and encourage interdisciplinary research in fundamental plant sciences with scope to address crop improvement, biodiversity, nutrition and human health. It publishes review articles, original research papers, method papers and short articles in plant research fields, such as systems biology, cell biology, genetics, epigenetics, mathematical modeling, signal transduction, plant-microbe interactions, synthetic biology, developmental biology, biochemistry, molecular biology, physiology, biotechnologies, bioinformatics and plant genomic resources.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信