Categorical classification of skin cancer using a weighted ensemble of transfer learning with test time augmentation

Data Science and Management Pub Date : 2025-06-01 DOI:10.1016/j.dsm.2024.10.002

Aliyu Tetengi Ibrahim , Mohammed Abdullahi , Armand Florentin Donfack Kana , Mohammed Tukur Mohammed , Ibrahim Hayatu Hassan

{"title":"Categorical classification of skin cancer using a weighted ensemble of transfer learning with test time augmentation","authors":"Aliyu Tetengi Ibrahim , Mohammed Abdullahi , Armand Florentin Donfack Kana , Mohammed Tukur Mohammed , Ibrahim Hayatu Hassan","doi":"10.1016/j.dsm.2024.10.002","DOIUrl":null,"url":null,"abstract":"<div><div>Skin cancer is the abnormal development of cells on the surface of the skin and is one of the most fatal diseases in humans. It usually appears in locations that are exposed to the sun, but can also appear in areas that are not regularly exposed to the sun. Due to the striking similarities between benign and malignant lesions, skin cancer detection remains a problem, even for expert dermatologists. Considering the inability of dermatologists to diagnose skin cancer accurately, a convolutional neural network (CNN) approach was used for skin cancer diagnosis. However, the CNN model requires a significant number of image datasets for better performance; thus, image augmentation and transfer learning techniques have been used in this study to boost the number of images and the performance of the model, because there are a limited number of medical images. This study proposes an ensemble transfer-learning-based model that can efficiently classify skin lesions into one of seven categories to aid dermatologists in skin cancer detection: (i) actinic keratoses, (ii) basal cell carcinoma, (iii) benign keratosis, (iv) dermatofibroma, (v) melanocytic nevi, (vi) melanoma, and (vii) vascular skin lesions. Five transfer learning models were used as the basis of the ensemble: MobileNet, EfficientNetV2B2, Xception, ResNext101, and DenseNet201. In addition to the stratified 10-fold cross-validation, the results of each individual model were fused to achieve greater classification accuracy. An annealing learning rate scheduler and test time augmentation (TTA) were also used to increase the performance of the model during the training and testing stages. A total of 10,015 publicly available dermoscopy images from the HAM10000 (Human Against Machine) dataset, which contained samples from the seven common skin lesion categories, were used to train and evaluate the models. The proposed technique attained 94.49% accuracy on the dataset. These results suggest that this strategy can be useful for improving the accuracy of skin cancer classification. However, the weighted average of F1-score, recall, and precision were obtained to be 94.68%, 94.49%, and 95.07%, respectively.</div></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":"8 2","pages":"Pages 174-184"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Science and Management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666764924000535","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Skin cancer is the abnormal development of cells on the surface of the skin and is one of the most fatal diseases in humans. It usually appears in locations that are exposed to the sun, but can also appear in areas that are not regularly exposed to the sun. Due to the striking similarities between benign and malignant lesions, skin cancer detection remains a problem, even for expert dermatologists. Considering the inability of dermatologists to diagnose skin cancer accurately, a convolutional neural network (CNN) approach was used for skin cancer diagnosis. However, the CNN model requires a significant number of image datasets for better performance; thus, image augmentation and transfer learning techniques have been used in this study to boost the number of images and the performance of the model, because there are a limited number of medical images. This study proposes an ensemble transfer-learning-based model that can efficiently classify skin lesions into one of seven categories to aid dermatologists in skin cancer detection: (i) actinic keratoses, (ii) basal cell carcinoma, (iii) benign keratosis, (iv) dermatofibroma, (v) melanocytic nevi, (vi) melanoma, and (vii) vascular skin lesions. Five transfer learning models were used as the basis of the ensemble: MobileNet, EfficientNetV2B2, Xception, ResNext101, and DenseNet201. In addition to the stratified 10-fold cross-validation, the results of each individual model were fused to achieve greater classification accuracy. An annealing learning rate scheduler and test time augmentation (TTA) were also used to increase the performance of the model during the training and testing stages. A total of 10,015 publicly available dermoscopy images from the HAM10000 (Human Against Machine) dataset, which contained samples from the seven common skin lesion categories, were used to train and evaluate the models. The proposed technique attained 94.49% accuracy on the dataset. These results suggest that this strategy can be useful for improving the accuracy of skin cancer classification. However, the weighted average of F1-score, recall, and precision were obtained to be 94.68%, 94.49%, and 95.07%, respectively.

查看原文本刊更多论文

使用迁移学习与测试时间增加的加权集合对皮肤癌进行分类

皮肤癌是皮肤表面细胞的异常发育，是人类最致命的疾病之一。它通常出现在暴露在阳光下的地方，但也可能出现在不经常暴露在阳光下的地方。由于良性和恶性病变之间惊人的相似性，皮肤癌的检测仍然是一个问题，即使是专家皮肤科医生。考虑到皮肤科医生无法准确诊断皮肤癌，采用卷积神经网络（CNN）方法进行皮肤癌诊断。然而，CNN模型需要大量的图像数据集才能获得更好的性能；因此，由于医学图像的数量有限，本研究中使用了图像增强和迁移学习技术来提高图像的数量和模型的性能。本研究提出了一个基于集成迁移学习的模型，该模型可以有效地将皮肤病变分为七种类型之一，以帮助皮肤科医生进行皮肤癌检测：(i)光化性角化病，（ii）基底细胞癌，（iii）良性角化病，（iv）皮肤纤维瘤，(v)黑素细胞痣，（vi）黑色素瘤，（vii）血管皮肤病变。五个迁移学习模型被用作集成的基础：MobileNet、EfficientNetV2B2、Xception、ResNext101和DenseNet201。除了分层的10倍交叉验证之外，每个单独模型的结果被融合以获得更高的分类精度。在训练和测试阶段，还使用了退火学习率调度器和测试时间增强（TTA）来提高模型的性能。来自HAM10000（人类对抗机器）数据集的10,015张公开可用的皮肤镜图像，其中包含来自七种常见皮肤病变类别的样本，用于训练和评估模型。该方法在数据集上的准确率达到94.49%。这些结果表明，该策略可用于提高皮肤癌分类的准确性。而f1得分、召回率和准确率的加权平均值分别为94.68%、94.49%和95.07%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Data Science and Management

CiteScore

7.50

自引率

0.00%

发文量