Truncated MobileNetV2 Sparse Vision Graph Attention Model for Explainable Monkeypox Disease Classification

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-09-19 DOI:10.1016/j.knosys.2025.114503

Mehdhar S.A.M. Al-Gaashani , Abduljabbar S. Ba Mahel , Ammar Muthanna

{"title":"Truncated MobileNetV2 Sparse Vision Graph Attention Model for Explainable Monkeypox Disease Classification","authors":"Mehdhar S.A.M. Al-Gaashani , Abduljabbar S. Ba Mahel , Ammar Muthanna","doi":"10.1016/j.knosys.2025.114503","DOIUrl":null,"url":null,"abstract":"<div><div>The current outbreak of monkeypox (mpox) presents challenges for timely and accurate diagnosis due to the disease’s diverse and unusual skin lesion patterns. Traditional deep learning models, such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), struggle with these irregular features because they rely on rigid, grid-based methods. To address this, we introduce the Truncated MobileNetV2 Sparse Vision Graph Attention (TMSVGA) model. TMSVGA combines components of MobileNetV2, which focuses on identifying smaller details, with a Sparse Vision Graph Attention block enhanced by a Squeeze-and-Excitation (SE) mechanism to improve channel-wise attention. This approach enhances the understanding of complex and long-distance relationships, emphasizing diagnostically significant regions and improving classification precision. We optimized TMSVGA using the Optuna framework for automated hyperparameter tuning. Additionally, Gradient-weighted Class Activation Mapping (Grad-CAM) and Local Interpretable Model-Agnostic Explanations (LIME) provided interpretable visualizations, highlighting influential regions in decision-making. The TMSVGA model was validated on the Monkeypox Skin Images Dataset (MSID), achieving 96.79 % accuracy, 96.90 % precision, 95.34 % recall, 96.08 % F1-score, and 95.37% Matthews Correlation Coefficient (MCC). These results demonstrate that TMSVGA outperforms existing models, particularly in handling irregular lesion patterns. By achieving high diagnostic accuracy and precision, our study showcases the potential of Vision Graph Neural Networks (ViGNNs) in advancing medical image analysis for diseases with non-uniform spatial patterns. Furthermore, the lightweight architecture of TMSVGA ensures suitability for mobile and resource-constrained diagnostic applications.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114503"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125015424","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The current outbreak of monkeypox (mpox) presents challenges for timely and accurate diagnosis due to the disease’s diverse and unusual skin lesion patterns. Traditional deep learning models, such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), struggle with these irregular features because they rely on rigid, grid-based methods. To address this, we introduce the Truncated MobileNetV2 Sparse Vision Graph Attention (TMSVGA) model. TMSVGA combines components of MobileNetV2, which focuses on identifying smaller details, with a Sparse Vision Graph Attention block enhanced by a Squeeze-and-Excitation (SE) mechanism to improve channel-wise attention. This approach enhances the understanding of complex and long-distance relationships, emphasizing diagnostically significant regions and improving classification precision. We optimized TMSVGA using the Optuna framework for automated hyperparameter tuning. Additionally, Gradient-weighted Class Activation Mapping (Grad-CAM) and Local Interpretable Model-Agnostic Explanations (LIME) provided interpretable visualizations, highlighting influential regions in decision-making. The TMSVGA model was validated on the Monkeypox Skin Images Dataset (MSID), achieving 96.79 % accuracy, 96.90 % precision, 95.34 % recall, 96.08 % F1-score, and 95.37% Matthews Correlation Coefficient (MCC). These results demonstrate that TMSVGA outperforms existing models, particularly in handling irregular lesion patterns. By achieving high diagnostic accuracy and precision, our study showcases the potential of Vision Graph Neural Networks (ViGNNs) in advancing medical image analysis for diseases with non-uniform spatial patterns. Furthermore, the lightweight architecture of TMSVGA ensures suitability for mobile and resource-constrained diagnostic applications.

查看原文本刊更多论文

可解释猴痘疾病分类的截断MobileNetV2稀疏视觉图注意模型

由于猴痘的多种多样和不寻常的皮肤病变模式，目前的猴痘疫情对及时和准确诊断提出了挑战。传统的深度学习模型，如卷积神经网络（cnn）和视觉变换（ViTs），难以处理这些不规则特征，因为它们依赖于严格的、基于网格的方法。为了解决这个问题，我们引入了截断的MobileNetV2稀疏视觉图注意（TMSVGA）模型。TMSVGA结合了MobileNetV2的组件，该组件专注于识别较小的细节，并通过挤压和激励（SE）机制增强了稀疏视觉图注意块，以提高频道的注意力。该方法增强了对复杂和远距离关系的理解，强调了诊断意义区域，提高了分类精度。我们使用Optuna框架对TMSVGA进行了优化，以实现自动超参数调优。此外，梯度加权类激活映射（Grad-CAM）和局部可解释模型不可知论解释（LIME）提供了可解释的可视化，突出了决策中的影响区域。在猴痘皮肤图像数据集（MSID）上验证TMSVGA模型，准确率为96.79%，精密度为96.90%，召回率为95.34%，f1得分为96.08%，马修斯相关系数（MCC）为95.37%。这些结果表明，TMSVGA优于现有模型，特别是在处理不规则病变模式方面。通过实现高诊断准确性和精确度，我们的研究展示了视觉图神经网络（vignn）在推进非均匀空间模式疾病的医学图像分析方面的潜力。此外，TMSVGA的轻量级架构确保了对移动和资源受限的诊断应用程序的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.