{"title":"DBA-ViNet: an effective deep learning framework for fruit disease detection and classification using explainable AI.","authors":"Saravanan Srinivasan, Lalitha Somasundharam, Sukumar Rajendran, Virendra Pal Singh, Sandeep Kumar Mathivanan, Usha Moorthy","doi":"10.1186/s12870-025-07015-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The primary aim of this research is to develop an effective and robust model for identifying and classifying diseases in general fruits, particularly apples, guavas, mangoes, pomegranates, and oranges, utilizing computer vision techniques.</p><p><strong>Material: </strong>An open-source collection of fruit disease images, comprising both diseased and healthy samples from the first five fruit types, was used in this study. The data was split into 70% training, 15% validation, and 15% testing. A 5-fold cross-validation was used to maintain the generalizability and stability of the model's performance.</p><p><strong>Models: </strong>For performance comparisons of these models on the dataset, we benchmarked state-of-the-art pre-trained convolutional neural network (ConvNet) models, including Swin Transformer (ST), EfficientNetV2, ConvNeXt, YOLOv8, and MobileNetV3. A new model, the Dual-Branch Attention-Guided Vision Network (DBA-ViNet), was introduced. A hybrid with two branches of DBA-ViNet can efficiently integrate global and local features for improved disease identification accuracy. Grad-CAM was used to visualize the regions that contributed to each prediction, helping to interpret the model. These heatmaps verified that DBA-ViNet can correctly direct its attention to disease-specific symptoms, thereby increasing trust and transparency in the classification results.</p><p><strong>Results: </strong>The proposed DBA-ViNet achieved a high testing classification accuracy of 99.51%, specificity of 99.42%, recall of 99.61%, precision of 99.30% and F1 score of 99.45% outperforming baseline models in all evaluation metrics. While the improvements were consistent, statistical significance testing was not performed and will be explored in future work.</p><p><strong>Conclusion: </strong>These results confirm the effectiveness of the proposed DBA-ViNet architecture in fruit disease detection, suggesting that incorporating both global and local feature extraction into the design of the double-branch attention mechanism for classification can achieve high accuracy and reliability. It is potentially practical in smart agriculture and the automated crop health monitoring system.</p>","PeriodicalId":9198,"journal":{"name":"BMC Plant Biology","volume":"25 1","pages":"965"},"PeriodicalIF":4.8000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Plant Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12870-025-07015-6","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: The primary aim of this research is to develop an effective and robust model for identifying and classifying diseases in general fruits, particularly apples, guavas, mangoes, pomegranates, and oranges, utilizing computer vision techniques.
Material: An open-source collection of fruit disease images, comprising both diseased and healthy samples from the first five fruit types, was used in this study. The data was split into 70% training, 15% validation, and 15% testing. A 5-fold cross-validation was used to maintain the generalizability and stability of the model's performance.
Models: For performance comparisons of these models on the dataset, we benchmarked state-of-the-art pre-trained convolutional neural network (ConvNet) models, including Swin Transformer (ST), EfficientNetV2, ConvNeXt, YOLOv8, and MobileNetV3. A new model, the Dual-Branch Attention-Guided Vision Network (DBA-ViNet), was introduced. A hybrid with two branches of DBA-ViNet can efficiently integrate global and local features for improved disease identification accuracy. Grad-CAM was used to visualize the regions that contributed to each prediction, helping to interpret the model. These heatmaps verified that DBA-ViNet can correctly direct its attention to disease-specific symptoms, thereby increasing trust and transparency in the classification results.
Results: The proposed DBA-ViNet achieved a high testing classification accuracy of 99.51%, specificity of 99.42%, recall of 99.61%, precision of 99.30% and F1 score of 99.45% outperforming baseline models in all evaluation metrics. While the improvements were consistent, statistical significance testing was not performed and will be explored in future work.
Conclusion: These results confirm the effectiveness of the proposed DBA-ViNet architecture in fruit disease detection, suggesting that incorporating both global and local feature extraction into the design of the double-branch attention mechanism for classification can achieve high accuracy and reliability. It is potentially practical in smart agriculture and the automated crop health monitoring system.
期刊介绍:
BMC Plant Biology is an open access, peer-reviewed journal that considers articles on all aspects of plant biology, including molecular, cellular, tissue, organ and whole organism research.