Tao Zhang, Zi-Han Nan, Xiao-Xuan Fan, Jing-Xiao Pang, Cong-Cong Zhao, Yan Xin, Zhen-Jie Hu, Shao-Han Guo
{"title":"Predicting 28-Day Mortality in Critically Ill Patients Receiving Continuous Renal Replacement Therapy: A Novel Interpretable Machine Learning Approach.","authors":"Tao Zhang, Zi-Han Nan, Xiao-Xuan Fan, Jing-Xiao Pang, Cong-Cong Zhao, Yan Xin, Zhen-Jie Hu, Shao-Han Guo","doi":"10.2147/JMDH.S533031","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to develop and validate an interpretable machine learning (ML) model to predict 28-day all-cause mortality in critically ill patients undergoing continuous renal replacement therapy (CRRT), facilitating early risk stratification and clinical decision-making.</p><p><strong>Patients and methods: </strong>Data from 1362 CRRT patients were analyzed, including 1224 from the Medical Information Mart for Intensive Care IV database (training cohort) and 138 from a Chinese hospital (external validation cohort). Feature selection was performed using least absolute shrinkage and selection operator, support vector machine-recursive feature elimination, and Boruta algorithms. Nine machine learning models were constructed and compared, including Gaussian process (GP), ensemble methods (gradient boosting machine and eXtreme gradient boosting), and other classifiers. Model performance was assessed via the area under the receiver operating characteristic curve (AUC), decision curve analysis, and other metrics. The SHapley Additive exPlanation (SHAP) method was used to interpret the ML models.</p><p><strong>Results: </strong>The GP model demonstrated consistent predictive performance across all cohorts, with training (AUC=0.841, accuracy=76.8%, sensitivity=65.5%), internal validation (AUC=0.794, accuracy=73.4%, sensitivity=60.0%), and external validation (AUC=0.780, accuracy=63.8%, sensitivity=39.0%) sets. Key predictors included red cell distribution width, age, lactate, septic shock, and vasoactive drug use. SHAP analysis provided transparent insights into feature contributions.</p><p><strong>Conclusion: </strong>The GP-based model accurately predicts 28-day mortality in CRRT patients and demonstrates strong generalizability. By integrating SHAP explanations, it offers clinicians an interpretable tool to identify high-risk patients early, potentially improving outcomes.</p>","PeriodicalId":16357,"journal":{"name":"Journal of Multidisciplinary Healthcare","volume":"18 ","pages":"5535-5550"},"PeriodicalIF":2.4000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12419209/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multidisciplinary Healthcare","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JMDH.S533031","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: This study aimed to develop and validate an interpretable machine learning (ML) model to predict 28-day all-cause mortality in critically ill patients undergoing continuous renal replacement therapy (CRRT), facilitating early risk stratification and clinical decision-making.
Patients and methods: Data from 1362 CRRT patients were analyzed, including 1224 from the Medical Information Mart for Intensive Care IV database (training cohort) and 138 from a Chinese hospital (external validation cohort). Feature selection was performed using least absolute shrinkage and selection operator, support vector machine-recursive feature elimination, and Boruta algorithms. Nine machine learning models were constructed and compared, including Gaussian process (GP), ensemble methods (gradient boosting machine and eXtreme gradient boosting), and other classifiers. Model performance was assessed via the area under the receiver operating characteristic curve (AUC), decision curve analysis, and other metrics. The SHapley Additive exPlanation (SHAP) method was used to interpret the ML models.
Results: The GP model demonstrated consistent predictive performance across all cohorts, with training (AUC=0.841, accuracy=76.8%, sensitivity=65.5%), internal validation (AUC=0.794, accuracy=73.4%, sensitivity=60.0%), and external validation (AUC=0.780, accuracy=63.8%, sensitivity=39.0%) sets. Key predictors included red cell distribution width, age, lactate, septic shock, and vasoactive drug use. SHAP analysis provided transparent insights into feature contributions.
Conclusion: The GP-based model accurately predicts 28-day mortality in CRRT patients and demonstrates strong generalizability. By integrating SHAP explanations, it offers clinicians an interpretable tool to identify high-risk patients early, potentially improving outcomes.
期刊介绍:
The Journal of Multidisciplinary Healthcare (JMDH) aims to represent and publish research in healthcare areas delivered by practitioners of different disciplines. This includes studies and reviews conducted by multidisciplinary teams as well as research which evaluates or reports the results or conduct of such teams or healthcare processes in general. The journal covers a very wide range of areas and we welcome submissions from practitioners at all levels and from all over the world. Good healthcare is not bounded by person, place or time and the journal aims to reflect this. The JMDH is published as an open-access journal to allow this wide range of practical, patient relevant research to be immediately available to practitioners who can access and use it immediately upon publication.