Bihter Das, Huseyin Alperen Dagdogen, Muhammed Onur Kaya, Resul Das
{"title":"A novel hybrid model combining Vision Transformers and Graph Convolutional Networks for monkeypox disease effective diagnosis","authors":"Bihter Das, Huseyin Alperen Dagdogen, Muhammed Onur Kaya, Resul Das","doi":"10.1016/j.inffus.2024.102858","DOIUrl":null,"url":null,"abstract":"Accurate diagnosis of monkeypox is challenging due to the limitations of current diagnostic techniques, which struggle to account for skin lesions’ complex visual and structural characteristics. This study aims to develop a novel hybrid model that combines the strengths of Vision Transformers (ViT), ResNet50, and AlexNet with Graph Convolutional Networks (GCN) to improve monkeypox diagnostic accuracy. Our method captures both the visual features and structural relationships within skin lesions, offering a more comprehensive approach to classification. Rigorous testing on two distinct datasets demonstrated that the ViT+GCN model achieved superior accuracy, particularly excelling in binary classification with 100% accuracy and multi-class classification with a 97% accuracy rate. These findings indicate that integrating visual and structural information enhances diagnostic reliability. While promising, this model requires further development, including larger datasets and optimization for real-time applications. Overall, this approach advances dermatological diagnostics and holds potential for broader applications in diagnosing other skin-related diseases.","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"3 1","pages":""},"PeriodicalIF":14.7000,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.inffus.2024.102858","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate diagnosis of monkeypox is challenging due to the limitations of current diagnostic techniques, which struggle to account for skin lesions’ complex visual and structural characteristics. This study aims to develop a novel hybrid model that combines the strengths of Vision Transformers (ViT), ResNet50, and AlexNet with Graph Convolutional Networks (GCN) to improve monkeypox diagnostic accuracy. Our method captures both the visual features and structural relationships within skin lesions, offering a more comprehensive approach to classification. Rigorous testing on two distinct datasets demonstrated that the ViT+GCN model achieved superior accuracy, particularly excelling in binary classification with 100% accuracy and multi-class classification with a 97% accuracy rate. These findings indicate that integrating visual and structural information enhances diagnostic reliability. While promising, this model requires further development, including larger datasets and optimization for real-time applications. Overall, this approach advances dermatological diagnostics and holds potential for broader applications in diagnosing other skin-related diseases.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.