{"title":"一种新型皮肤病变预测和分类技术:ViT-GradCAM","authors":"Muhammad Shafiq, Kapil Aggarwal, Jagannathan Jayachandran, Gayathri Srinivasan, Rajasekhar Boddu, Adugna Alemayehu","doi":"10.1111/srt.70040","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Skin cancer is one of the highly occurring diseases in human life. Early detection and treatment are the prime and necessary points to reduce the malignancy of infections. Deep learning techniques are supplementary tools to assist clinical experts in detecting and localizing skin lesions. Vision transformers (ViT) based on image segmentation classification using multiple classes provide fairly accurate detection and are gaining more popularity due to legitimate multiclass prediction capabilities.</p><p><strong>Materials and methods: </strong>In this research, we propose a new ViT Gradient-Weighted Class Activation Mapping (GradCAM) based architecture named ViT-GradCAM for detecting and classifying skin lesions by spreading ratio on the lesion's surface area. The proposed system is trained and validated using a HAM 10000 dataset by studying seven skin lesions. The database comprises 10 015 dermatoscopic images of varied sizes. The data preprocessing and data augmentation techniques are applied to overcome the class imbalance issues and improve the model's performance.</p><p><strong>Result: </strong>The proposed algorithm is based on ViT models that classify the dermatoscopic images into seven classes with an accuracy of 97.28%, precision of 98.51, recall of 95.2%, and an F1 score of 94.6, respectively. The proposed ViT-GradCAM obtains better and more accurate detection and classification than other state-of-the-art deep learning-based skin lesion detection models. The architecture of ViT-GradCAM is extensively visualized to highlight the actual pixels in essential regions associated with skin-specific pathologies.</p><p><strong>Conclusion: </strong>This research proposes an alternate solution to overcome the challenges of detecting and classifying skin lesions using ViTs and GradCAM, which play a significant role in detecting and classifying skin lesions accurately rather than relying solely on deep learning models.</p>","PeriodicalId":21746,"journal":{"name":"Skin Research and Technology","volume":"30 9","pages":"e70040"},"PeriodicalIF":2.0000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11367666/pdf/","citationCount":"0","resultStr":"{\"title\":\"A novel Skin lesion prediction and classification technique: ViT-GradCAM.\",\"authors\":\"Muhammad Shafiq, Kapil Aggarwal, Jagannathan Jayachandran, Gayathri Srinivasan, Rajasekhar Boddu, Adugna Alemayehu\",\"doi\":\"10.1111/srt.70040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Skin cancer is one of the highly occurring diseases in human life. Early detection and treatment are the prime and necessary points to reduce the malignancy of infections. Deep learning techniques are supplementary tools to assist clinical experts in detecting and localizing skin lesions. Vision transformers (ViT) based on image segmentation classification using multiple classes provide fairly accurate detection and are gaining more popularity due to legitimate multiclass prediction capabilities.</p><p><strong>Materials and methods: </strong>In this research, we propose a new ViT Gradient-Weighted Class Activation Mapping (GradCAM) based architecture named ViT-GradCAM for detecting and classifying skin lesions by spreading ratio on the lesion's surface area. The proposed system is trained and validated using a HAM 10000 dataset by studying seven skin lesions. The database comprises 10 015 dermatoscopic images of varied sizes. The data preprocessing and data augmentation techniques are applied to overcome the class imbalance issues and improve the model's performance.</p><p><strong>Result: </strong>The proposed algorithm is based on ViT models that classify the dermatoscopic images into seven classes with an accuracy of 97.28%, precision of 98.51, recall of 95.2%, and an F1 score of 94.6, respectively. The proposed ViT-GradCAM obtains better and more accurate detection and classification than other state-of-the-art deep learning-based skin lesion detection models. The architecture of ViT-GradCAM is extensively visualized to highlight the actual pixels in essential regions associated with skin-specific pathologies.</p><p><strong>Conclusion: </strong>This research proposes an alternate solution to overcome the challenges of detecting and classifying skin lesions using ViTs and GradCAM, which play a significant role in detecting and classifying skin lesions accurately rather than relying solely on deep learning models.</p>\",\"PeriodicalId\":21746,\"journal\":{\"name\":\"Skin Research and Technology\",\"volume\":\"30 9\",\"pages\":\"e70040\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11367666/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Skin Research and Technology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/srt.70040\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"DERMATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Skin Research and Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/srt.70040","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"DERMATOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景:皮肤癌是人类高发疾病之一。早期发现和治疗是减少恶性感染的首要和必要条件。深度学习技术是辅助临床专家检测和定位皮肤病变的辅助工具。基于多类图像分割分类的视觉变换器(ViT)可提供相当准确的检测,并因其合法的多类预测能力而越来越受欢迎:在这项研究中,我们提出了一种新的基于梯度加权类激活映射(GradCAM)的 ViT 架构,命名为 ViT-GradCAM,用于通过皮损表面积的扩散比率检测皮损并对其进行分类。通过研究七种皮肤病变,使用 HAM 10000 数据集对所提出的系统进行了训练和验证。该数据库包括 10 015 张不同大小的皮肤镜图像。数据预处理和数据增强技术的应用克服了类不平衡问题,提高了模型的性能:结果:所提出的算法基于 ViT 模型,可将皮肤镜图像分为七类,准确率为 97.28%,精确率为 98.51%,召回率为 95.2%,F1 分数为 94.6。与其他最先进的基于深度学习的皮损检测模型相比,ViT-GradCAM 的检测和分类效果更好、更准确。ViT-GradCAM 的架构被广泛可视化,以突出与皮肤特定病变相关的重要区域的实际像素:本研究提出了另一种解决方案,利用 ViTs 和 GradCAM 克服检测和分类皮肤病变的挑战,它们在准确检测和分类皮肤病变方面发挥了重要作用,而不是仅仅依赖深度学习模型。
A novel Skin lesion prediction and classification technique: ViT-GradCAM.
Background: Skin cancer is one of the highly occurring diseases in human life. Early detection and treatment are the prime and necessary points to reduce the malignancy of infections. Deep learning techniques are supplementary tools to assist clinical experts in detecting and localizing skin lesions. Vision transformers (ViT) based on image segmentation classification using multiple classes provide fairly accurate detection and are gaining more popularity due to legitimate multiclass prediction capabilities.
Materials and methods: In this research, we propose a new ViT Gradient-Weighted Class Activation Mapping (GradCAM) based architecture named ViT-GradCAM for detecting and classifying skin lesions by spreading ratio on the lesion's surface area. The proposed system is trained and validated using a HAM 10000 dataset by studying seven skin lesions. The database comprises 10 015 dermatoscopic images of varied sizes. The data preprocessing and data augmentation techniques are applied to overcome the class imbalance issues and improve the model's performance.
Result: The proposed algorithm is based on ViT models that classify the dermatoscopic images into seven classes with an accuracy of 97.28%, precision of 98.51, recall of 95.2%, and an F1 score of 94.6, respectively. The proposed ViT-GradCAM obtains better and more accurate detection and classification than other state-of-the-art deep learning-based skin lesion detection models. The architecture of ViT-GradCAM is extensively visualized to highlight the actual pixels in essential regions associated with skin-specific pathologies.
Conclusion: This research proposes an alternate solution to overcome the challenges of detecting and classifying skin lesions using ViTs and GradCAM, which play a significant role in detecting and classifying skin lesions accurately rather than relying solely on deep learning models.
期刊介绍:
Skin Research and Technology is a clinically-oriented journal on biophysical methods and imaging techniques and how they are used in dermatology, cosmetology and plastic surgery for noninvasive quantification of skin structure and functions. Papers are invited on the development and validation of methods and their application in the characterization of diseased, abnormal and normal skin.
Topics include blood flow, colorimetry, thermography, evaporimetry, epidermal humidity, desquamation, profilometry, skin mechanics, epiluminiscence microscopy, high-frequency ultrasonography, confocal microscopy, digital imaging, image analysis and computerized evaluation and magnetic resonance. Noninvasive biochemical methods (such as lipids, keratin and tissue water) and the instrumental evaluation of cytological and histological samples are also covered.
The journal has a wide scope and aims to link scientists, clinical researchers and technicians through original articles, communications, editorials and commentaries, letters, reviews, announcements and news. Contributions should be clear, experimentally sound and novel.