Jing Yang , Yajie Wan , Su Diao , Osama Alfarraj , Fahad Alblehai , Amr Tolba , Zaffar Ahmed Shaikh , Lip Yee Por , Roohallah Alizadehsani , Yudong Zhang
{"title":"Melanoma Detection through Combining Reinforcement Learning, Generative Adversarial Network, and Bayesian Optimization","authors":"Jing Yang , Yajie Wan , Su Diao , Osama Alfarraj , Fahad Alblehai , Amr Tolba , Zaffar Ahmed Shaikh , Lip Yee Por , Roohallah Alizadehsani , Yudong Zhang","doi":"10.1016/j.bspc.2025.108668","DOIUrl":null,"url":null,"abstract":"<div><div>Melanoma, a highly aggressive form of skin cancer, is primarily driven by DNA alterations often linked to environmental factors such as ultraviolet radiation. Addressing the need for improved early detection, this study tackles the key limitations of current methods, which frequently employ convolutional neural networks (CNNs) but struggle with feature selection, class imbalance, hyperparameter tuning, and generalizability. Our strategy leverages dilated convolution (DC) layers trained using reinforcement learning (RL). Unlike other RL-based approaches that handle these challenges in isolation, our method introduces a multi-stage architecture. It integrates RL for feature selection and class balancing. Shapley additive explanations (SHAP) guide feature identification, while augmented rewards for underrepresented classes help mitigate data imbalance. Bayesian optimization hyperband (BOHB) is used for hyperparameter tuning in a unified training process. BOHB combines the predictive strength of Bayesian optimization with the efficiency of hyperband, accelerating model tuning. It also includes an online GAN module for dynamic data augmentation that responds to the evolving output of the RL agent. A novel regularization technique stabilizes GAN training and prevents mode collapse. Importantly, existing RL methods face the challenge of balancing exploration and exploitation. In our RL model, the scope loss function (SLF), integrated with RL, balances exploration and exploitation, thereby ensuring accuracy and generalizability. Collectively, the model jointly tackles four persistent challenges in earlier RL-based approaches: poor exploration–exploitation balance, unstable reward dynamics, static data augmentation, and manual hyperparameter tuning. The model achieved F-measures of 94.3 %, 93.7 %, and 91.5 % on ISIC-2020, HAM10000, and PH2, respectively. This advancement significantly improves early melanoma detection and supports more accurate treatment decisions, contributing valuably to the ongoing effort to combat this lethal cancer.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108668"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425011796","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Melanoma, a highly aggressive form of skin cancer, is primarily driven by DNA alterations often linked to environmental factors such as ultraviolet radiation. Addressing the need for improved early detection, this study tackles the key limitations of current methods, which frequently employ convolutional neural networks (CNNs) but struggle with feature selection, class imbalance, hyperparameter tuning, and generalizability. Our strategy leverages dilated convolution (DC) layers trained using reinforcement learning (RL). Unlike other RL-based approaches that handle these challenges in isolation, our method introduces a multi-stage architecture. It integrates RL for feature selection and class balancing. Shapley additive explanations (SHAP) guide feature identification, while augmented rewards for underrepresented classes help mitigate data imbalance. Bayesian optimization hyperband (BOHB) is used for hyperparameter tuning in a unified training process. BOHB combines the predictive strength of Bayesian optimization with the efficiency of hyperband, accelerating model tuning. It also includes an online GAN module for dynamic data augmentation that responds to the evolving output of the RL agent. A novel regularization technique stabilizes GAN training and prevents mode collapse. Importantly, existing RL methods face the challenge of balancing exploration and exploitation. In our RL model, the scope loss function (SLF), integrated with RL, balances exploration and exploitation, thereby ensuring accuracy and generalizability. Collectively, the model jointly tackles four persistent challenges in earlier RL-based approaches: poor exploration–exploitation balance, unstable reward dynamics, static data augmentation, and manual hyperparameter tuning. The model achieved F-measures of 94.3 %, 93.7 %, and 91.5 % on ISIC-2020, HAM10000, and PH2, respectively. This advancement significantly improves early melanoma detection and supports more accurate treatment decisions, contributing valuably to the ongoing effort to combat this lethal cancer.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.