Enhancing Melanoma Diagnosis with Advanced Deep Learning Models Focusing on Vision Transformer, Swin Transformer, and ConvNeXt.

IF 1.6 Q3 DERMATOLOGY
Serra Aksoy, Pinar Demircioglu, Ismail Bogrekci
{"title":"Enhancing Melanoma Diagnosis with Advanced Deep Learning Models Focusing on Vision Transformer, Swin Transformer, and ConvNeXt.","authors":"Serra Aksoy, Pinar Demircioglu, Ismail Bogrekci","doi":"10.3390/dermatopathology11030026","DOIUrl":null,"url":null,"abstract":"<p><p>Skin tumors, especially melanoma, which is highly aggressive and progresses quickly to other sites, are an issue in various parts of the world. Nevertheless, the one and only way to save lives is to detect it at its initial stages. This study explores the application of advanced deep learning models for classifying benign and malignant melanoma using dermoscopic images. The aim of the study is to enhance the accuracy and efficiency of melanoma diagnosis with the ConvNeXt, Vision Transformer (ViT) Base-16, and Swin Transformer V2 Small (Swin V2 S) deep learning models. The ConvNeXt model, which integrates principles of both convolutional neural networks and transformers, demonstrated superior performance, with balanced precision and recall metrics. The dataset, sourced from Kaggle, comprises 13,900 uniformly sized images, preprocessed to standardize the inputs for the models. Experimental results revealed that ConvNeXt achieved the highest diagnostic accuracy among the tested models. Experimental results revealed that ConvNeXt achieved an accuracy of 91.5%, with balanced precision and recall rates of 90.45% and 92.8% for benign cases, and 92.61% and 90.2% for malignant cases, respectively. The F1-scores for ConvNeXt were 91.61% for benign cases and 91.39% for malignant cases. This research points out the potential of hybrid deep learning architectures in medical image analysis, particularly for early melanoma detection.</p>","PeriodicalId":42885,"journal":{"name":"Dermatopathology","volume":"11 3","pages":"239-252"},"PeriodicalIF":1.6000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11348198/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dermatopathology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/dermatopathology11030026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"DERMATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Skin tumors, especially melanoma, which is highly aggressive and progresses quickly to other sites, are an issue in various parts of the world. Nevertheless, the one and only way to save lives is to detect it at its initial stages. This study explores the application of advanced deep learning models for classifying benign and malignant melanoma using dermoscopic images. The aim of the study is to enhance the accuracy and efficiency of melanoma diagnosis with the ConvNeXt, Vision Transformer (ViT) Base-16, and Swin Transformer V2 Small (Swin V2 S) deep learning models. The ConvNeXt model, which integrates principles of both convolutional neural networks and transformers, demonstrated superior performance, with balanced precision and recall metrics. The dataset, sourced from Kaggle, comprises 13,900 uniformly sized images, preprocessed to standardize the inputs for the models. Experimental results revealed that ConvNeXt achieved the highest diagnostic accuracy among the tested models. Experimental results revealed that ConvNeXt achieved an accuracy of 91.5%, with balanced precision and recall rates of 90.45% and 92.8% for benign cases, and 92.61% and 90.2% for malignant cases, respectively. The F1-scores for ConvNeXt were 91.61% for benign cases and 91.39% for malignant cases. This research points out the potential of hybrid deep learning architectures in medical image analysis, particularly for early melanoma detection.

以 Vision Transformer、Swin Transformer 和 ConvNeXt 为重点,利用先进的深度学习模型加强黑色素瘤诊断。
皮肤肿瘤,尤其是黑色素瘤,具有很强的侵袭性,并会迅速发展到其他部位,这在世界各地都是一个问题。尽管如此,挽救生命的唯一办法就是在初期阶段发现它。本研究探讨了如何应用先进的深度学习模型,利用皮肤镜图像对良性和恶性黑色素瘤进行分类。研究的目的是利用 ConvNeXt、Vision Transformer (ViT) Base-16 和 Swin Transformer V2 Small (Swin V2 S) 深度学习模型提高黑色素瘤诊断的准确性和效率。ConvNeXt 模型集成了卷积神经网络和变换器的原理,表现出卓越的性能,精确度和召回率指标均衡。该数据集来自 Kaggle,包含 13,900 张大小一致的图像,经过预处理以标准化模型的输入。实验结果表明,在所有测试模型中,ConvNeXt 的诊断准确率最高。实验结果显示,ConvNeXt 的准确率达到 91.5%,良性病例的精确率和召回率分别为 90.45% 和 92.8%,恶性病例的精确率和召回率分别为 92.61% 和 90.2%。ConvNeXt 对良性病例的 F1 分数为 91.61%,对恶性病例的 F1 分数为 91.39%。这项研究指出了混合深度学习架构在医学图像分析,尤其是早期黑色素瘤检测方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Dermatopathology
Dermatopathology DERMATOLOGY-
自引率
5.30%
发文量
39
审稿时长
11 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信