Zheng Wang , Li Chang , Tong Shi , Hui Hu , Chong Wang , Kaibin Lin , Jianglin Zhang
{"title":"Identifying diagnostic biomarkers for Erythemato-Squamous diseases using explainable machine learning","authors":"Zheng Wang , Li Chang , Tong Shi , Hui Hu , Chong Wang , Kaibin Lin , Jianglin Zhang","doi":"10.1016/j.bspc.2024.107101","DOIUrl":null,"url":null,"abstract":"<div><div>Erythemato-squamous diseases (ESD) are a heterogeneous group encompassing six clinically and histopathologically overlapping subtypes, representing a substantial diagnostic challenge within dermatology. The existing body of research reveals a notable void in detailed examinations that deconvolute the distinct features endemic to each ESD variant. To bridge this knowledge gap, our study applied Explainable Artificial Intelligence (XAI) techniques to systematically elucidate the intricate diagnostic biomarker profiles unique to each ESD category. Methodological rigor was fortified through the employment of stratified cross-validation, bolstering the robustness and generalizability of our diagnostic model. The CatBoost classifier emerged as a preeminent algorithm within our analytical framework, manifesting exemplary classification prowess with an accuracy of 99.07%, precision of 99.12%, recall of 98.89%, and an F1 score of 98.97%. Central to our inquiry was the deployment of Shapley Additive exPlanations (SHAP) values, which afforded granular insight into the contributory weight of individual diagnostic biomarkers for each ESD subtype. Our findings delineated pivotal diagnostic biomarkers including saw-tooth appearance of retes (STAR), melanin incontinence (MI), vacuolisation and damage of basal layer (VDBL), polygonal papules (PP), and band-like infiltrate (BLI) as instrumental in the identification of seborrheic dermatitis, while Psoriasis was characterized by fibrosis of the papillary dermis (FPD), thinning of the suprapapillary epidermis (TSE), elongation of the rete ridges (ERR), clubbing of the rete ridges (CRR), and notable psoriatic spongiosis. This integrative approach, leveraging the analytical acumen of Random Forest coupled with the interpretability afforded by SHAP, signifies a significant advancement in the nuanced diagnostic landscape of ESD.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":null,"pages":null},"PeriodicalIF":4.9000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424011595","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Erythemato-squamous diseases (ESD) are a heterogeneous group encompassing six clinically and histopathologically overlapping subtypes, representing a substantial diagnostic challenge within dermatology. The existing body of research reveals a notable void in detailed examinations that deconvolute the distinct features endemic to each ESD variant. To bridge this knowledge gap, our study applied Explainable Artificial Intelligence (XAI) techniques to systematically elucidate the intricate diagnostic biomarker profiles unique to each ESD category. Methodological rigor was fortified through the employment of stratified cross-validation, bolstering the robustness and generalizability of our diagnostic model. The CatBoost classifier emerged as a preeminent algorithm within our analytical framework, manifesting exemplary classification prowess with an accuracy of 99.07%, precision of 99.12%, recall of 98.89%, and an F1 score of 98.97%. Central to our inquiry was the deployment of Shapley Additive exPlanations (SHAP) values, which afforded granular insight into the contributory weight of individual diagnostic biomarkers for each ESD subtype. Our findings delineated pivotal diagnostic biomarkers including saw-tooth appearance of retes (STAR), melanin incontinence (MI), vacuolisation and damage of basal layer (VDBL), polygonal papules (PP), and band-like infiltrate (BLI) as instrumental in the identification of seborrheic dermatitis, while Psoriasis was characterized by fibrosis of the papillary dermis (FPD), thinning of the suprapapillary epidermis (TSE), elongation of the rete ridges (ERR), clubbing of the rete ridges (CRR), and notable psoriatic spongiosis. This integrative approach, leveraging the analytical acumen of Random Forest coupled with the interpretability afforded by SHAP, signifies a significant advancement in the nuanced diagnostic landscape of ESD.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.