Mohamed Osman, Mohamed Kamel, F. Mohammed, Tarek Hussein
{"title":"SkinFormer: Robust Vision Transformer for Automatic Skin Disease Identification","authors":"Mohamed Osman, Mohamed Kamel, F. Mohammed, Tarek Hussein","doi":"10.1109/JAC-ECC56395.2022.10044005","DOIUrl":null,"url":null,"abstract":"The largest, most visible, and most exposed organ of the human body is the skin. Skin diseases impact about a third of the global population. Furthermore, many serious skin diseases, such as melanoma, may remain misdiagnosed for years since only experienced dermatologists can reliably diagnose them. As a result, some regions and demographic groups may be more negatively impacted than others in terms of access to such medical professionals. Given the range and possible hazards of skin diseases, democratizing access to accurate identification is critical. Fortunately, automated deep-learning systems have made significant advances in picture classification in recent years. In this paper we present SkinFormer, a vision transformer trained using strong augmentations and optimization techniques to ensure robustness and generalization. The dataset we use is a combination of 3 different datasets from previous works, and contains a total of 48,322 images. The choice of datasets and augmentations ensures strong generalization even on consumergrade hardware. We achieve a top-1 accuracy of 84.43% and a top-5 accuracy of 93.89%. We publish pre-trained model weights under an open source license.","PeriodicalId":326002,"journal":{"name":"2022 10th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JAC-ECC56395.2022.10044005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The largest, most visible, and most exposed organ of the human body is the skin. Skin diseases impact about a third of the global population. Furthermore, many serious skin diseases, such as melanoma, may remain misdiagnosed for years since only experienced dermatologists can reliably diagnose them. As a result, some regions and demographic groups may be more negatively impacted than others in terms of access to such medical professionals. Given the range and possible hazards of skin diseases, democratizing access to accurate identification is critical. Fortunately, automated deep-learning systems have made significant advances in picture classification in recent years. In this paper we present SkinFormer, a vision transformer trained using strong augmentations and optimization techniques to ensure robustness and generalization. The dataset we use is a combination of 3 different datasets from previous works, and contains a total of 48,322 images. The choice of datasets and augmentations ensures strong generalization even on consumergrade hardware. We achieve a top-1 accuracy of 84.43% and a top-5 accuracy of 93.89%. We publish pre-trained model weights under an open source license.