Marcello Di Giammarco , Antonella Santone , Mario Cesarelli , Fabio Martinelli , Francesco Mercaldo
{"title":"A method for skin lesion detection and localization by means of Deep Learning and reliable prediction explainability","authors":"Marcello Di Giammarco , Antonella Santone , Mario Cesarelli , Fabio Martinelli , Francesco Mercaldo","doi":"10.1016/j.imavis.2025.105675","DOIUrl":null,"url":null,"abstract":"<div><div>Skin lesions are any abnormal growths or appearances on the skin, ranging from benign (i.e., non-cancerous) to malignant (i.e., cancerous). The identification of a skin lesion is a crucial task that is carried out in short periods of time to initiate an eventual therapeutic treatment. In this paper, we propose a method for automatic skin lesion detection, implementing Convolutional Neural Networks. Moreover, with the aim of providing a rationale behind the model prediction, we also consider explainability by adopting two different Class Activation Mapping algorithms, which highlight regions in skin images that most contribute to the network’s classification decision. We also include the indices of similarity for further quantitative analysis. Several Convolutional Neural Networks are considered, by obtaining the best results with the MobileNet model, achieving an accuracy equal to 0.935 in skin lesion detection. Moreover, in the experimental analysis, we discuss the effectiveness of Class Activation Mapping algorithms exploited for skin lesion localization.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"162 ","pages":"Article 105675"},"PeriodicalIF":4.2000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S026288562500263X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Skin lesions are any abnormal growths or appearances on the skin, ranging from benign (i.e., non-cancerous) to malignant (i.e., cancerous). The identification of a skin lesion is a crucial task that is carried out in short periods of time to initiate an eventual therapeutic treatment. In this paper, we propose a method for automatic skin lesion detection, implementing Convolutional Neural Networks. Moreover, with the aim of providing a rationale behind the model prediction, we also consider explainability by adopting two different Class Activation Mapping algorithms, which highlight regions in skin images that most contribute to the network’s classification decision. We also include the indices of similarity for further quantitative analysis. Several Convolutional Neural Networks are considered, by obtaining the best results with the MobileNet model, achieving an accuracy equal to 0.935 in skin lesion detection. Moreover, in the experimental analysis, we discuss the effectiveness of Class Activation Mapping algorithms exploited for skin lesion localization.
期刊介绍:
Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.