Sumaya Mustofa, Shahrin Khan, Shahriar Ahmed Shovo, Yousuf Rayhan Emon, Md. Sadekur Rahman
{"title":"Optimizing Soursop leaf disease classification with a lightweight ensemble model and explainable AI","authors":"Sumaya Mustofa, Shahrin Khan, Shahriar Ahmed Shovo, Yousuf Rayhan Emon, Md. Sadekur Rahman","doi":"10.1016/j.cpb.2025.100526","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional deep-learning methods to detect plant leaf disease can be complex and time-consuming if image numbers and size increase. Moreover, complex deep learning networks take longer and require larger memory to produce results. However, feature extraction methods provide some advantages in such a scenario. Using heavy-weighted models to enhance accuracy without considering the long execution time is a drawback of research. A weighted model increases the time and space complexity of an experiment. Considering the mentioned limitations, this study proposes a lightweight model experimenting with six deep feature extraction models, five feature selection models, and four machine learning classifiers. During the experiment, a soft voting ensemble classifier was developed to remove a single classifier's limitations and the unstable performance of the standalone classifiers. After a rigorous experiment, the (ResNet101 – RFE – Ensemble Classifier) together formed the best performer Soursop Ensemble (S-Ensemble) model that obtained a test accuracy of 99.6 % with an execution time of 648.05 s, outperforming other models. The whole experimental analysis was performed on a primary Soursop leaf disease dataset with six classes containing 3838 images. Finally, the Explainable AI (XAI) model Local Interpretable Model-agnostic Explanations (LIME) is used to interpret the reasons behind the best-performer and lowest-performer models' performance. LIME visually highlights which leaf regions influence each prediction, helping users understand model behaviour and enhancing its practical usability in real-world agricultural settings. This research aims to assist farmers with detecting Soursop leaf disease with less execution time and offer researchers an in-depth preview of deep feature-based detection and classification technology to detect and classify diseases within a short training time.</div></div>","PeriodicalId":38090,"journal":{"name":"Current Plant Biology","volume":"43 ","pages":"Article 100526"},"PeriodicalIF":4.5000,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Plant Biology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214662825000945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional deep-learning methods to detect plant leaf disease can be complex and time-consuming if image numbers and size increase. Moreover, complex deep learning networks take longer and require larger memory to produce results. However, feature extraction methods provide some advantages in such a scenario. Using heavy-weighted models to enhance accuracy without considering the long execution time is a drawback of research. A weighted model increases the time and space complexity of an experiment. Considering the mentioned limitations, this study proposes a lightweight model experimenting with six deep feature extraction models, five feature selection models, and four machine learning classifiers. During the experiment, a soft voting ensemble classifier was developed to remove a single classifier's limitations and the unstable performance of the standalone classifiers. After a rigorous experiment, the (ResNet101 – RFE – Ensemble Classifier) together formed the best performer Soursop Ensemble (S-Ensemble) model that obtained a test accuracy of 99.6 % with an execution time of 648.05 s, outperforming other models. The whole experimental analysis was performed on a primary Soursop leaf disease dataset with six classes containing 3838 images. Finally, the Explainable AI (XAI) model Local Interpretable Model-agnostic Explanations (LIME) is used to interpret the reasons behind the best-performer and lowest-performer models' performance. LIME visually highlights which leaf regions influence each prediction, helping users understand model behaviour and enhancing its practical usability in real-world agricultural settings. This research aims to assist farmers with detecting Soursop leaf disease with less execution time and offer researchers an in-depth preview of deep feature-based detection and classification technology to detect and classify diseases within a short training time.
期刊介绍:
Current Plant Biology aims to acknowledge and encourage interdisciplinary research in fundamental plant sciences with scope to address crop improvement, biodiversity, nutrition and human health. It publishes review articles, original research papers, method papers and short articles in plant research fields, such as systems biology, cell biology, genetics, epigenetics, mathematical modeling, signal transduction, plant-microbe interactions, synthetic biology, developmental biology, biochemistry, molecular biology, physiology, biotechnologies, bioinformatics and plant genomic resources.