Truncated MobileNetV2 Sparse Vision Graph Attention Model for Explainable Monkeypox Disease Classification

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Mehdhar S.A.M. Al-Gaashani , Abduljabbar S. Ba Mahel , Ammar Muthanna
{"title":"Truncated MobileNetV2 Sparse Vision Graph Attention Model for Explainable Monkeypox Disease Classification","authors":"Mehdhar S.A.M. Al-Gaashani ,&nbsp;Abduljabbar S. Ba Mahel ,&nbsp;Ammar Muthanna","doi":"10.1016/j.knosys.2025.114503","DOIUrl":null,"url":null,"abstract":"<div><div>The current outbreak of monkeypox (mpox) presents challenges for timely and accurate diagnosis due to the disease’s diverse and unusual skin lesion patterns. Traditional deep learning models, such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), struggle with these irregular features because they rely on rigid, grid-based methods. To address this, we introduce the Truncated MobileNetV2 Sparse Vision Graph Attention (TMSVGA) model. TMSVGA combines components of MobileNetV2, which focuses on identifying smaller details, with a Sparse Vision Graph Attention block enhanced by a Squeeze-and-Excitation (SE) mechanism to improve channel-wise attention. This approach enhances the understanding of complex and long-distance relationships, emphasizing diagnostically significant regions and improving classification precision. We optimized TMSVGA using the Optuna framework for automated hyperparameter tuning. Additionally, Gradient-weighted Class Activation Mapping (Grad-CAM) and Local Interpretable Model-Agnostic Explanations (LIME) provided interpretable visualizations, highlighting influential regions in decision-making. The TMSVGA model was validated on the Monkeypox Skin Images Dataset (MSID), achieving 96.79 % accuracy, 96.90 % precision, 95.34 % recall, 96.08 % F1-score, and 95.37% Matthews Correlation Coefficient (MCC). These results demonstrate that TMSVGA outperforms existing models, particularly in handling irregular lesion patterns. By achieving high diagnostic accuracy and precision, our study showcases the potential of Vision Graph Neural Networks (ViGNNs) in advancing medical image analysis for diseases with non-uniform spatial patterns. Furthermore, the lightweight architecture of TMSVGA ensures suitability for mobile and resource-constrained diagnostic applications.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114503"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125015424","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The current outbreak of monkeypox (mpox) presents challenges for timely and accurate diagnosis due to the disease’s diverse and unusual skin lesion patterns. Traditional deep learning models, such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), struggle with these irregular features because they rely on rigid, grid-based methods. To address this, we introduce the Truncated MobileNetV2 Sparse Vision Graph Attention (TMSVGA) model. TMSVGA combines components of MobileNetV2, which focuses on identifying smaller details, with a Sparse Vision Graph Attention block enhanced by a Squeeze-and-Excitation (SE) mechanism to improve channel-wise attention. This approach enhances the understanding of complex and long-distance relationships, emphasizing diagnostically significant regions and improving classification precision. We optimized TMSVGA using the Optuna framework for automated hyperparameter tuning. Additionally, Gradient-weighted Class Activation Mapping (Grad-CAM) and Local Interpretable Model-Agnostic Explanations (LIME) provided interpretable visualizations, highlighting influential regions in decision-making. The TMSVGA model was validated on the Monkeypox Skin Images Dataset (MSID), achieving 96.79 % accuracy, 96.90 % precision, 95.34 % recall, 96.08 % F1-score, and 95.37% Matthews Correlation Coefficient (MCC). These results demonstrate that TMSVGA outperforms existing models, particularly in handling irregular lesion patterns. By achieving high diagnostic accuracy and precision, our study showcases the potential of Vision Graph Neural Networks (ViGNNs) in advancing medical image analysis for diseases with non-uniform spatial patterns. Furthermore, the lightweight architecture of TMSVGA ensures suitability for mobile and resource-constrained diagnostic applications.
可解释猴痘疾病分类的截断MobileNetV2稀疏视觉图注意模型
由于猴痘的多种多样和不寻常的皮肤病变模式,目前的猴痘疫情对及时和准确诊断提出了挑战。传统的深度学习模型,如卷积神经网络(cnn)和视觉变换(ViTs),难以处理这些不规则特征,因为它们依赖于严格的、基于网格的方法。为了解决这个问题,我们引入了截断的MobileNetV2稀疏视觉图注意(TMSVGA)模型。TMSVGA结合了MobileNetV2的组件,该组件专注于识别较小的细节,并通过挤压和激励(SE)机制增强了稀疏视觉图注意块,以提高频道的注意力。该方法增强了对复杂和远距离关系的理解,强调了诊断意义区域,提高了分类精度。我们使用Optuna框架对TMSVGA进行了优化,以实现自动超参数调优。此外,梯度加权类激活映射(Grad-CAM)和局部可解释模型不可知论解释(LIME)提供了可解释的可视化,突出了决策中的影响区域。在猴痘皮肤图像数据集(MSID)上验证TMSVGA模型,准确率为96.79%,精密度为96.90%,召回率为95.34%,f1得分为96.08%,马修斯相关系数(MCC)为95.37%。这些结果表明,TMSVGA优于现有模型,特别是在处理不规则病变模式方面。通过实现高诊断准确性和精确度,我们的研究展示了视觉图神经网络(vignn)在推进非均匀空间模式疾病的医学图像分析方面的潜力。此外,TMSVGA的轻量级架构确保了对移动和资源受限的诊断应用程序的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Knowledge-Based Systems
Knowledge-Based Systems 工程技术-计算机:人工智能
CiteScore
14.80
自引率
12.50%
发文量
1245
审稿时长
7.8 months
期刊介绍: Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信